The present application claims priority from Japanese Patent Application No. JP 2010-172874 filed in the Japanese Patent Office on Jul. 30, 2010, the entire content of which is incorporated herein by reference.
This disclosure relates to a mechanical noise suppression apparatus, a mechanical noise suppression method, a program and an imaging apparatus, and more particularly to a mechanical sound suppression apparatus and so forth for reducing mechanical noise such as motor noise upon optical zooming during video shooting in an imaging apparatus which includes a video shooting function with sound.
In recent years, as an imaging apparatus of a digital camera and so forth, an imaging apparatus has been proposed which includes a video shooting function with sound in addition to a camera function. An imaging apparatus of the type described has a problem in that mechanical noise such as motor noise upon optical zooming during video shooting is mixed into peripheral sound collected by a microphone, resulting in degradation of the recorded sound.
As a technique for removing noise mixed in a sound signal, a spectral subtraction method is known and disclosed, for example, in S. F. Boll, “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Trans. Acoustics, Speech, and Signal Process, Vol. 27, No. 2, pp. 113-120, 1979 (hereinafter referred to as Non-Patent Document 1). According to the spectral subtraction method, a spectrum within a no-sound period is estimated as a noise spectrum, and a signal obtained by multiplying the noise spectrum by a predetermined coefficient, that is, by a subtract coefficient, is subtracted from an input sound spectrum to remove a noise component.
With the method of estimating a spectrum within a no-sound period as a noise spectrum, mechanical noise generated independently of peripheral sound cannot be removed as noise as in such an imaging apparatus having a video shooting function with sound as described above. Therefore, it is proposed in Japanese Patent Laid-Open No. 2006-279185 (hereinafter referred to as Patent Document 1) to retain a frequency spectrum of mechanical noise upon optical zooming during video shooting in advance and subtract, upon zooming operation, the frequency spectrum of the mechanical noise from a spectrum of an input signal to reduce the mechanical noise.
A sound inputting section 51 amplifies a sound signal Sa inputted thereto through a microphone not shown by a predetermined gain and supplies the amplified sound signal Sa to a framing section 52. In this instance, for example, if a zooming operation is carried out during video shooting with sound, then motor noise, that is, zooming noise, which is generated upon the zooming operation, is inputted to the framing section 52 together with the sound signal Sa through the sound inputting section 51. The framing section 52 divides the sound signal Sa inputted thereto from the sound inputting section 51 in a unit of a frame for a predetermined period of time. A Fourier transform section 53 Fourier transforms the sound signal Sa divided in a unit of a frame by the framing section 52 into a input sound spectrum Sb which indicates power for individual frequencies.
In a motor noise spectrum storage section 54, a motor noise spectrum Sc obtained by spectralizing motor noise which is an object of noise removal is stored as a noise spectrum. A subtract section 55 carries out a process of removing noise components based on the input sound spectrum Sb obtained by the Fourier transform section 53 and the motor noise spectrum Sc stored in the motor noise spectrum storage section 54. In particular, the subtract section 55 subtracts a signal obtained by multiplying the motor noise spectrum Sc stored in advance in the motor noise spectrum storage section 54 as a noise spectrum by a predetermined subtract coefficient α from the input sound spectrum Sb.
The spectrum changeover section 56 carries out changeover between the input sound spectrum Sb obtained from the Fourier transform section 53 and a sound spectrum Sd after the noise removal obtained from the subtract section 55 in response to a selection signal outputted from the control section 32 to supply the input sound spectrum Sb or the sound spectrum Sd to an inverse Fourier transform section 57. In particular, the spectrum changeover section 56 supplies, upon driving of the motor 21 such as during a zooming operation, the sound spectrum Sd after the noise removal to the inverse Fourier transform section 57 but supplies, in any other case, the input sound spectrum Sb to the inverse Fourier transform section 57.
The inverse Fourier transform section 57 inverse Fourier transforms the input sound spectrum Sb or the sound spectrum Sd after the noise removal inputted thereto through the spectrum changeover section 56 to obtain an original sound signal Se for each frame unit. A waveform synthesis section 58 synthesizes the sound signals Se for the individual frame units obtained by the inverse Fourier transform section 57 to restore a sound signal Sf which is continuous in a time series. The sound signal Sf is used as a final sound signal for recording and is recorded into a recording medium such as a memory together with video data obtained from the imaging system.
The spectral subtraction method used in Patent Document 1 is described with reference to
Then, a subtraction process of subtracting a power spectrum |N(f, τ)|2 of noise from a power spectrum |X(f, τ)|2 of the input signal x(t) is carried out, and a power spectrum |Y(f, τ)|2 is obtained by the subtraction process. It is to be noted that the noise spectrum N(f, τ) is obtained by estimation using the input signal x(t), assumption of a model of noise in advance or the like. If a result of the subtraction exhibits a negative value, then a suitable value is substituted.
In particular, the subtraction process described is carried out in accordance with the following expression (1):
where α is a fixed coefficient set to a value, for example, between 1 and 2, and also β is a fixed coefficient set to a value, for example, between 0 to 0.1.
After the subtraction, the amplitude spectrum |Y(f, τ)| of a result of the subtraction is multiplied by a deflection angle arg{X(f, τ)} of the frequency spectrum X(f, τ) of the input signal x(t) as represented by the following expression (2) to obtain a frequency spectrum Y(f, τ) as a result of the subtraction:
Y(f,τ)=arg{X(f,τ)}|Y(f,τ)| (2)
Then, the frequency spectrum Y(f, τ) is converted into an output signal y(f) of the time domain by an inverse fast Fourier transform (IFFT).
On the other hand,
In Patent Document 1, the spectral subtraction method is used for suppression of mechanical noise as described hereinabove. However, in this Patent Document 1, an error between a true noise component included in an input signal and mechanical noise measured in advance is not taken into consideration. Therefore, excessive erasure or insufficient erasure of mechanical noise appears in the subtract section 55, and degradation of the sound quality cannot be avoided.
A large number of factors cause an error between a true noise component included in an input signal and mechanical noise measured in advance. The factors may include such as follows:
(a) Difference in machine assembly position or screw tightening force
(b) Abrasion or time-dependent variation of parts by mechanical driving
(c) Temperature variation
(d) Variation in posture (manner or angle of holding of the camera)
(e) Motor for driving camera zooming
In this manner, mechanical noise suppression using the spectral subtraction method cannot sufficiently cope with a dispersion of mechanical noise. Here, for the convenience of description, the expression of the spectral subtraction is transformed. In particular, the foregoing description is directed to subtraction of a spectrum, that is, to a “subtraction method,” a mechanism of a “multiplication type” is introduced newly.
The following expression (3) is a transformed form of the expression (2) given hereinabove wherein the right side is transformed:
From the expression (3), the frequency spectrum Y(f, t) can be represented as the product of the frequency spectrum X(f, τ) of the input signal x(t) and a gain function G(f, τ)=√(1−α)|N(f, τ)|2/|X(f, τ)|2. In other words, the spectrum subtraction of the subtraction type can be represented by that of the multiplication type.
The gain function G(f, τ)=√(1−α|N(f, τ)|2/|X(f, τ)|2) is described. |N(f, τ)|2/|X(f, τ)|2 in the gain function G(f, τ) is a ratio between the power of the noise, that is, the mechanical noise, and the power of the input signal. The value of the gain function G(f, τ) is fluctuated by the power ratio.
Also in Patent Document 1, a countermeasure against a dispersion in mechanical noise, that is, motor noise, is taken. In particular, in the case where the dispersion in mechanical noise is great, the subtract coefficient α for subtraction is set to a higher value. To vary the subtract coefficient α is equivalent to transform of the gain function G(f, τ) if it is considered in the multiplication type represented by the expression (3) given hereinabove.
However, as can be seen apparently also from
Further, in mechanical noise suppression using the spectral subtraction method, for example, when α=1, the gain function G(f, τ) indicates a sudden variation of the gain value where |X(f, τ)|2/|N(f, τ)|2 is 0 dB as indicated by a broken line ellipsis in
On the other hand, in mechanical noise suppression using the spectral subtraction method, for example, when α=1, the gain function G(f, τ) is β where |X(f, τ)|2/|N(f, τ)|2 is lower than 0 dB as indicated by a broken line ellipsis in
Further, in Patent Document 1, the subtract section 55 carries out a process of removing a noise component based on the input sound spectrum Sb obtained by the Fourier transform section 53 and the motor noise spectrum Sc stored in the motor noise spectrum storage section 54. In other words, the same motor noise spectrum Sc is always used by the subtract section 55, and information regarding sound to be recorded during video shooting such as a frequency characteristic or power is not taken into consideration. Therefore, also mechanical noise which cannot actually be perceived is suppressed, and there is a problem that desired sound is degraded inadvertently.
Therefore, it is desirable to provide a mechanical noise suppression apparatus, a mechanical noise suppression method, a program and an imaging apparatus which can implement a fixed reduction effect of mechanical noise independently of a dispersion in mechanical noise among individual apparatus by a simple configuration. Alto it is desirable to provide a mechanical noise suppression apparatus, a mechanical noise suppression method, a program and an imaging apparatus which can reduce mechanical noise while degradation of desired sound by a user is suppressed to the utmost in accordance with the surrounding environment.
According to an embodiment of the present disclosure, there is provided a mechanical noise suppression apparatus including a framing section adapted to divide an input signal into frames of a predetermined time length, a Fourier transform section adapted to transform framed signals obtained by the framing section into a frequency spectrum of a frequency domain, a mechanical noise reduction section adapted to correct the frequency spectrum of the input signal obtained by the Fourier transform section based on frequency spectrum information of mechanical noise to suppress the mechanical noise, an inverse Fourier transform section adapted to return the frequency spectrum corrected by the mechanical noise reduction section into framed signals of a time domain, and a frame synthesis section adapted to carry out frame synthesis of the framed signals of frames obtained by the inverse Fourier transform section to obtain an output signal in which the mechanical noise is suppressed. The mechanical noise reduction section includes a power ratio calculation section adapted to calculate, for each frequency, a power ratio between the frequency spectrum of the input signal and the frequency spectrum of the mechanical noise based on the frequency spectrum of the input signal obtained by the Fourier transform section and the frequency spectrum information of the mechanical noise, a gain readout section adapted to read out, for each frequency, a gain corresponding to the power ratio calculated by the power ratio calculation section from a gain function table in which set values of the gain corresponding to individual values of the power ratio are stored, and a frequency spectrum correction section adapted to multiply, for each frequency, the frequency spectrum of the input signal obtained by the Fourier transform section by the gain read out by the gain readout section to obtain a corrected frequency spectrum.
In the mechanical noise suppression apparatus, an input signal is divided into frames of a predetermined time length by the framing section, and the framed signals are transformed into a frequency spectrum of a frequency domain by the Fourier transform section. Then, the frequency spectrum of the input signal is corrected based on frequency spectrum information of mechanical noise by the mechanical sound reduction section. Then, the frequency spectrum corrected by the mechanical noise reduction section is returned into framed signals of a time domain by the inverse Fourier transform section. Then, frame synthesis of the framed signals of frames obtained by the inverse Fourier transform section is carried out by the frame synthesis section to obtain an output signal in which the mechanical noise is suppressed. For example, the mechanical noise is, in an imaging apparatus having a peripheral sound recording function, for example, mechanical noise such as motor sound which is generated in relation to a specific imaging operation such as, for example, a zooming operation.
In the mechanical noise reduction section, the frequency spectrum of the input signal is corrected based on the frequency spectrum of the mechanical noise by the power ratio calculation section, gain readout section and frequency spectrum correction section. In particular, for each frequency, a power ratio between the frequency spectrum of the input signal and the frequency spectrum of the mechanical noise is calculated based on the frequency spectrum of the input signal obtained by the Fourier transform section and the frequency spectrum information of the mechanical noise by the power ratio calculation section.
Then, for each frequency, a gain corresponding to the power ratio calculated by the power ratio calculation section is read out from the gain function table, in which set values of the gain corresponding to individual values of the power ratio are stored, by the gain readout section. Then, for each frequency, the frequency spectrum of the input signal obtained by the Fourier transform section is multiplied by the gain read out by the gain readout section to obtain a corrected frequency spectrum by the frequency spectrum correction section.
In this manner, in the mechanical noise suppression apparatus of the present disclosure, the frequency spectrum of the input signal is multiplied, for each frequency, by the gain read out from the gain function table, in which the set values of the gain corresponding to the individual values of the power ratio are stored, to correct the frequency spectrum of the input signal to suppress the mechanical noise. In this instance, the shape of the grain function to be set in the gain function table can be set freely in accordance with a dispersion of the mechanical noise. Consequently, a fixed reduction effect of the mechanical noise can be implemented by a simple and easy configuration irrespective of a dispersion of mechanical noise among individual apparatus.
The mechanical noise suppression apparatus may be configured such that each of the set values of the gain stored in the gain function table is low when the power ratio is in the proximity of 0 dB and smoothly increases as the power ratio increases from the proximity of 0 dB such that a gradient thereof does not become discontinuous. In this instance, since the value of the gain does not vary suddenly, such a situation that the output signal is distorted to degrade the sound quality can be prevented.
Further, the mechanical noise suppression apparatus may be configured such that each of the set values of the gain stored in the gain function table smoothly increases as the power ratio decreases from the proximity of 0 dB such that the gradient thereof does not become discontinuous. In this instance, since the gain is increased at a position at which the value of the frequency spectrum of the input signal is low, suppression of a component other than the mechanical noise at this position can be suppressed. Therefore, sound quality degradation by excessive suppression can be prevented.
The mechanical noise suppression apparatus may further include a spectrum information changing section adapted to change the frequency spectrum information of the mechanical noise to be used by the mechanical noise reduction section based on information regarding the input signal, which may be a frequency characteristic, power or the like of the input signal. With the mechanical noise suppression apparatus, mechanical noise can be suppressed while degradation of sound desired by a user is suppressed to the utmost in accordance with a surrounding environment.
According to another embodiment of the present disclosure, there is provided a mechanical noise suppression apparatus including a framing section adapted to divide an input signal into frames of a predetermined time length, a Fourier transform section adapted to transform framed signals obtained by the framing section into a frequency spectrum of a frequency domain, a mechanical noise reduction section adapted to correct the frequency spectrum of the input signal obtained by the Fourier transform section based on frequency spectrum information of mechanical noise to suppress the mechanical noise, a spectrum information changing section adapted to change the frequency spectrum information of the mechanical noise to be used by the mechanical noise reduction section based on information regarding the input signal, an inverse Fourier transform section adapted to return the frequency spectrum corrected by the mechanical noise reduction section into framed signals of a time domain, and a frame synthesis section adapted to carry out frame synthesis of the framed signals of frames obtained by the inverse Fourier transform section to obtain an output signal in which the mechanical noise is suppressed.
In the mechanical noise suppression apparatus, an input signal is divided into frames of a predetermined time length by the framing section, and the framed signals are transformed into a frequency spectrum of a frequency domain by the Fourier transform section. Then, the frequency spectrum of the input signal is corrected based on the frequency spectrum information of mechanical noise by the mechanical noise reduction section. Then, the frequency spectrum corrected in this manner is returned into framed signals of a time domain by the inverse Fourier transform section. Then, frame synthesis of the framed signals of frames obtained by the inverse Fourier transform section is carried out by the frame synthesis section to obtain an output signal in which the mechanical noise is suppressed. For example, the mechanical noise is, in an imaging apparatus having a peripheral sound recording function, for example, mechanical noise such as motor sound which is generated in relation to a specific imaging operation such as, for example, a zooming operation.
In the mechanical noise suppression apparatus, the frequency spectrum information of the mechanical noise to be used by the mechanical noise reduction section is changed based on information regarding the input signal such as a frequency characteristic, power and so forth by the spectrum information changing section. For example, the spectrum information changing section is configured such that it corrects the frequency spectrum information of the mechanical noise stored in a noise table based on the information regarding the input signal to change the frequency spectrum information of the mechanical noise to be used by the mechanical noise reduction section.
The mechanical noise suppression apparatus may be configured such that the spectrum information changing section calculates a parameter representative of a characteristic amount of peripheral sound based on the information regarding the input signal, acquires a correction coefficient based on the calculated parameter, and multiplies the frequency spectrum information of the mechanical noise stored in the noise table by the acquired correction coefficient to correct the frequency spectrum information of the mechanical noise.
In this instance, the mechanical noise suppression apparatus may be configured such that the parameter representative of the characteristic amount is a linear predictive coefficient representative of a spectrum envelope of the frequency spectrum of the input signal, and the spectrum information changing section acquires, based on the linear predictive coefficient representative of the spectrum envelope, a correction coefficient for each frequency such that the value thereof decreases in a corresponding relationship to a mountain portion of the spectrum envelope and multiplies, for each frequency, the frequency spectrum information of the mechanical noise by the acquired correction coefficient to correct the frequency spectrum information of the mechanical noise.
Or, the mechanical noise suppression apparatus may be configured such that the characteristic amount parameter is an average power of the input signal, and the spectrum information changing section acquires, based on the average power of the input signal, a correction coefficient common to different frequencies such that the value thereof is low when the average power is high and multiplies the frequency spectrum information of the mechanical noise for each frequency by the acquired correction coefficient to correct the frequency spectrum information of the mechanical noise.
The mechanical noise suppression apparatus may further include a plurality of noise tables which store the frequency spectrum information of the mechanical noise which is used in a case in which the average power of the input signal is different from each other, and the spectrum information changing section may change over the noise table from which the frequency spectrum information of the mechanical noise is to be read out based on the average power of the input signal to change the frequency spectrum information of the mechanical noise to be used by the mechanical sound reduction section.
In this manner, in the mechanical noise suppression apparatus, the frequency spectrum information of mechanical noise to be used in the mechanical noise reduction section is changed based on the information regarding the input signal such as a frequency characteristic, power and so forth. Therefore, excessive suppression of suppressing also mechanical noise which is not actually perceived can be prevented from being carried out, and degradation of desired sound by excessive suppression can be prevented. In other words, mechanical noise can be reduced while degradation of desired sound of the user is suppressed to the utmost in response to a surrounding environment.
With the mechanical noise suppression apparatus of the present disclosure, a fixed noise reduction effect can be implemented with a simple configuration irrespective of a dispersion of mechanical noise among individual apparatus. Further, with the mechanical noise suppression apparatus, mechanical noise can be suppressed while degradation of sound desired by a user is suppressed to the upmost in accordance with a circumferential environment.
The above and other features and advantages of the present disclosure will become apparent from the following description and the appended claims, taken in conjunction with the accompanying drawings in which like parts or elements denoted by like reference symbols.
In the following, embodiments of the present disclosure are described. The description is given in the following order.
1. First Embodiment
2. Second Embodiment
3. Third Embodiment
4. Modification
<1. First Embodiment>
Sound System of an Imaging Apparatus Including a Video Shooting Function with Sound
Operation of the sound system 100 is controlled by a control section 201 which controls operation of the components of the imaging apparatus. A key inputting section 202 is connected to the control section 201. The key inputting section 202 includes a plurality of keys disposed thereon for allowing a user to carry out various operations of the imaging apparatus. A motor 203 is provided to move a zoom lens in the direction of an optical axis of the latter. A motor driving section 204 is a driving mechanism for driving the motor 203 to rotate. The control section 201 receives an operation signal of a zoom key included in the key inputting section 202 and outputs a motor driving controlling signal to the motor driving section 204. Further, the control section 201 controls, during video shooting with sound, the spectrum changeover section 108 based on a driving timing of the motor 203.
The microphone 101 which is an internal microphone is built in the imaging apparatus and collects peripheral sound or environmental sound to obtain a sound signal. Upon video shooting, a sound signal obtained by the microphone 101 is recorded together with an image signal. The A/D converter 102 converts a sound signal obtained by the microphone 101 from an analog signal into a digital signal. The AGC circuit 103 amplifies the sound signal after conversion into a digital signal by the A/D converter 102 with a gain in response to a level of the same.
The framing section 104 divides a sound signal obtained from the AGC circuit 103 into frames of a predetermined time length, that is, carries out framing of the sound signal, in order to carry out processing for each frame. The Fourier transform section 105 carries out a fast Fourier transform (FFT) process for the framed signals obtained by the framing section 104 to convert the framed signals into a frequency spectrum X(f, τ) of the frequency domain. Here, (f, τ) represents a frequency spectrum of a frame τ of the fth frequency.
The noise table 107 stores therein frequency spectrum information of mechanical noise collected and recorded in advance. The frequency spectrum information of the mechanical noise is that of motor driving sound corresponding to the motor 203. In the present embodiment, the frequency spectrum information is a power spectrum |N(f, τ)|2. However, it may otherwise be an amplitude spectrum |N(f, τ)| or a frequency spectrum N(f, τ). It is to be noted that the driving sound generated by the motor 203 differs between zooming operations in the telephoto direction and the wide-angle direction. Therefore, values of frequency spectrum information of mechanical noise corresponding to zooming operations in the telephoto direction and the wide-angle direction are recorded in the noise table 107.
The mechanical noise reduction section 106 corrects the frequency spectrum X(f, τ) obtained by the Fourier transform section 105 based on the frequency spectrum information |X(f, τ)|2 of mechanical noise stored in the noise table 107 to suppress the mechanical noise. This mechanical noise reduction section 106 acquires the corrected frequency spectrum Y(f, τ) by multiplying the frequency spectrum X(f, τ) by the gain function G(f, τ) as represented by the following expression (4):
Y(f,τ)=X(f,τ)·G(f,τ) (4)
In this instance, the mechanical noise reduction section 106 carries out a mechanical noise reduction process based on zoom controlling information, that is, presence or absence of zooming and the zooming direction, from the control section 201. The mechanical noise reduction section 106 carries out a mechanical noise reduction process upon zooming operation, that is, upon driving of the motor 203. Further, the mechanical noise reduction section 106 reads out, upon zooming operation in the telephoto direction and the wide-angle direction, the frequency spectrum information |N(f, τ)|2 of mechanical noise corresponding to the directions from the noise table 107.
The gain function table 121 stores therein a gain function G(f, τ) set in advance (refer to the expression (4) given hereinabove). In particular, in the gain function table 121, set values of the gain corresponding to different values of the ratio between the power |X(f, τ)|2 of an input signal and the power |N(f, τ)|2 of the mechanical noise are stored.
The gain function G(f, τ) stored in the gain function table 121 is set freely in an arbitrary form so that an output of good sound quality is obtained taking a dispersion of mechanical noise into consideration, different from the gain function G(f, τ) (refer to
The dispersion of mechanical noise has an influence on the magnitude of the frequency spectrum X(f, τ) of the input signal. Therefore, a form of the gain function G(f, τ) is important. Since the dispersion of mechanical noise exhibits various characteristics, by setting a gain function G(f, τ) suitable for each characteristic, an output of high quality can be obtained. Although, with the gain function G(f, τ) represented by the expression (3) given hereinabove, only leftward or rightward shifting can be carried out by change of the subtract coefficient α, the gain function G(f, τ) stored in the gain function table 121 can be set freely in an arbitrary form.
In the example of the gain function G(f, τ) of
A setting method of the gain function G(f, τ) stored in the gain function table 121 is described. For example, the following two methods are available.
(1) In the first setting method, the designer audibly tunes the gain function G(f, τ). With this setting method, although time and labor are required for setting, a gain function G(f, τ) of high quality with a dispersion taken into consideration can be determined.
(2) In the second setting method, mechanical noise is measured with regard to a large number of apparatus in advance, and the gain function G(f, τ) is based on a dispersion in characteristic, that is, based on a variance of a spectrum. In the present setting method, the gain function G(f, τ) based on data can be determined.
In the setting method of (2), for example, a variance of |X(f, τ)|2/|N(f, τ)|2, and a facies of the variance is inversed to obtain the gain function G(f, τ).
In the example of the gain function of
Further, in the example of the gain function of
Referring back to
The frequency spectrum correction block 123 multiplies, for each frequency, the frequency spectrum X(f, τ) of the input signal obtained by the Fourier transform section 105 by the gain G(f, τ) to obtain a corrected frequency spectrum Y(f, τ). It is to be noted that the gain G(f, τ) is read out from the gain function table 121 based on the power ratio |X(f, τ)|2/|N(f, τ)|2 calculated by the power ratio calculation block 122. From this, the mechanical noise reduction section 106 further has a gain readout section though not shown.
The flow chart of
The mechanical noise reduction section 106 starts its processing at step ST1 and then advances the processing to step ST2. At step ST2, the mechanical noise reduction section 106 acquires a frequency spectrum X(f, τ) of the frequency f of the frame τ as an input signal from the Fourier transform section 105. Further, the mechanical noise reduction section 106 acquires a power spectrum |N(f, τ)|2 as mechanical noise spectrum information corresponding to the frequency f from the noise table 107 at step ST3.
Then at step ST4, the power ratio calculation block 122 of the mechanical noise reduction section 106 calculates a power ratio |X(f, τ)|2/|N(f, τ)|2 of the input signal spectrum and the mechanical noise spectrum. Then, the mechanical noise reduction section 106 reads out and acquires, based on the calculated power ratio, a gain G(f, τ) corresponding to the power ratio from the gain function table 121 at step ST5.
Then at step ST6, the frequency spectrum correction block 123 of the mechanical noise reduction section 106 multiplies the frequency spectrum X(f, τ) as the input signal by the gain G(f, τ) to obtain a corrected frequency spectrum Y(f, τ) as an output signal. The mechanical noise reduction section 106 ends its processing at step ST7 after the process at step ST6.
Referring back to
The inverse Fourier transform section 109 carries out, for each frame, an inverse fast Fourier transform (IFFT) process for the frequency spectrum outputted from the spectrum changeover section 108. This inverse fast Fourier transform section 109 carries out inverse processing to that by the Fourier transform section 105 described hereinabove to convert a frequency domain signal into a time domain signal to obtain framed signals.
The waveform synthesis section 110 synthesizes framed signals of frames obtained by the inverse Fourier transform section 109 to restore a sound signal which is continuous in a time series. The waveform synthesis section 110 configures a frame synthesis section. The recording section 111 records the sound signal obtained by the waveform synthesis section 110 on a recording medium such as a disk or a memory, for example, together with an image signal obtained by the image system.
Operation of the sound system 100 of the imaging apparatus including the video shooting function with sound shown in
Framed signals of the frames obtained by the framing section 104 are successively supplied to the Fourier transform section 105. The Fourier transform section 105 carries out a fast Fourier transform (FFT) process for the framed signals to convert them into a frequency spectrum X(f, τ) of the frequency domain. The frequency spectrum X(f, τ) is supplied to the spectrum changeover section 108 and the mechanical noise reduction section 106.
The mechanical noise reduction section 106 carries out, during a zooming operation, a mechanical noise reduction process based on zoom controlling information such as presence or absence of zooming and the zooming direction from the control section 201. In this instance, the mechanical noise reduction section 106 multiplies the frequency spectrum X(f, τ) by the gain function G(f, τ) to produce a frequency spectrum Y(f, τ) corrected so as to suppress mechanical noise, that is, driving sound of the motor 203. This frequency spectrum Y(f, τ) is supplied to the spectrum changeover section 108.
While a zooming operation is not carried out, the spectrum changeover section 108 selects the frequency spectrum X(f, τ) supplied from the Fourier transform section 105. This is because, at this time, the motor 203 is not in a driven state and the frequency spectrum X(f, τ) does not include a component of mechanical noise, that is, the driving sound of the motor 203. On the other hand, when a zooming operation is carried out, the spectrum changeover section 108 selects the frequency spectrum Y(f, τ) corrected so as to suppress mechanical noise, that is, the driving sound of the motor 203, obtained by the mechanical noise reduction section 106.
The frequency spectrum X(f, τ) or the corrected frequency spectrum Y(f, τ) from the spectrum changeover section 108 is supplied to the inverse Fourier transform section 109. The inverse Fourier transform section 109 carries out, for each frame, an inverse fast Fourier transform (IFFT) process for a frequency spectrum from the spectrum changeover section 108 to restore a framed signal of the time domain.
The framed signals are supplied to the waveform synthesis section 110. The waveform synthesis section 110 synthesizes such framed signals of the frames to regenerate a sound signal which is continuous in a time series. The sound signal is supplied to the recording section 111. The recording section 111 records the sound signal supplied from the waveform synthesis section 110 on a recording medium such as a disk or a memory, for example, together with an image signal obtained by the image system.
As described above, in the sound system 100 of the imaging apparatus including a video shooting function with sound shown in
Further, in the sound system 100 shown in
Further, in the sound system 100 shown in
Further, in the sound system 100 shown in
It is to be noted that the foregoing description is directed to an example wherein the gain function G(f, τ) set in the gain function table 121 of the mechanical noise reduction section 106 generally exhibits a curved shape in which the gain drops in the proximity of 0 dB of the power ratio |X(f, τ)|2/|N(f, τ)|2 (refer to
However, the gain function G(f, τ) to be set in the gain function table 121 of the mechanical noise reduction section 106 may possibly indicate some other shape. For example, the gain function G(f, τ) may be set such that the gain indicates a fixed value where the power ratio |X(f, τ) 12/|N(f, τ)|2 is lower than 0 dB, that is, where |X(f, τ)|2<|N(f, τ)|2 is satisfied, similarly as in the technique in the past.
<2. Second Embodiment>
Sound System of an Imaging Apparatus Including a Video Shooting Function with Sound
In particular, referring to
The noise table correction section 112 corrects frequency spectrum information |N(f, τ)|2 of mechanical noise stored in the noise table 107 to change frequency spectrum information of mechanical noise to be used in the mechanical noise reduction section 106. In this instance, the noise table correction section 112 carries out the correction based on a frequency spectrum X(f, τ) of an input signal obtained by the Fourier transform section 105. The noise table correction section 112 configures a spectrum information changing section.
The noise table correction section 112 carries out spectrum correction utilizing a masking characteristic. The noise table correction section 112 calculates a parameter indicative of a characteristic amount of peripheral noise based on the frequency spectrum X(f, τ) of the input signal, acquires a correction coefficient based on the parameter, and multiplies the frequency spectrum information |N(f, τ)|2 of the mechanical noise by the correction coefficient to carry out correction.
In this instance, the noise table correction section 112 carries out a noise table correction process based on the zoom controlling information such as presence or absence of zooming and the zooming direction from the control section 201. Upon a zooming operation, the noise table correction section 112 carries out the noise table correction process when the motor 203 is driven. Further, upon a zooming operation in the telephoto direction and the wide-angle direction, the noise table correction section 112 reads out frequency spectrum information |N(f, τ)|2 of mechanical noise corresponding to the zooming direction from the noise table 107 and corrects the same.
In the case where a correction coefficient for each frequency is to be acquired, the parameter representative of a characteristic amount is, for example, a linear predictive coefficient representative of a spectrum envelope. In this instance, the mathematic operation block 131 determines a linear predictive coefficient representative of a spectrum envelope based on the frequency spectrum X(f, τ) of the input signal and acquires a correction coefficient of each frequency such that the value decreases corresponding to a mountain portion of the spectrum envelope. Details of acquisition of a correction coefficient for each frequency by the mathematic operation block 131 are hereinafter described.
On the other hand, in the case where a correction coefficient common to the frequencies is to be acquired, the parameter representative of a characteristic amount is an average power of the frequency spectrum X(f, τ) of the input signal. In this instance, the mathematic operation block 131 determines an average power based on the frequency spectrum X(f, τ) of the input signal and acquires a correction coefficient common to the frequencies such that the value decreases as the average power increases. Details of acquisition of a correction coefficient common to the frequencies by the mathematic operation block 131 are hereinafter described.
The retaining block 132 retains data necessary for a mathematic operation process by the mathematic operation block 131, a correction coefficient as a result of the mathematic operation and so forth. The correction block 133 corrects the frequency vector information |N(f, τ)|2 of mechanical noise read out from the noise table 107 by multiplying the same by the correction coefficient retained in the retaining block 132. The notification block 134 notifies the mechanical noise reduction section 106 of the frequency vector information |N′(f, τ)|2 of mechanical noise corrected by the correction block 133. While the mechanical noise reduction section 106 of the sound system 100 shown in
A flow chart of
Then at step ST13, the mathematic operation block 131 of the noise table correction section 112 determines a parameter representative of a characteristic amount of peripheral noise from the frequency spectrum X(f, τ) of the input signal for the predetermined period of time acquired at step ST12. This parameter is a linear predictive coefficient representative of a spectrum envelope or an average power as described hereinabove.
Then at step ST14, the noise table correction section 112 acquires a correction coefficient based on the parameter calculated at step ST13. In this instance, in the case where the parameter is a linear predictive coefficient representative of a spectrum envelope, a correction coefficient for each frequency is acquired, but in the case where the parameter is an average power, a correction coefficient common to the frequencies is acquired.
Then at step ST15, the correction block 133 of the noise table correction section 112 reads out frequency spectrum information |N(f, τ)|2 of mechanical noise from the noise table 107 and multiplies the same by the correction coefficient acquired at step ST14 to correct the same. Consequently, the noise table correction section 112 acquires corrected frequency spectrum information |N′(f, τ)|2 of the mechanical noise at step ST15.
Then at step ST16, the notification block 134 of the noise table correction section 112 notifies the mechanical noise reduction section 106 of the corrected frequency spectrum information |N′(f, τ)|2 of the mechanical noise. The noise table correction section 112 returns the processing to the process at step ST12 after the process at step ST16 and then repeats the processing procedure described above. In other words, the corrected frequency spectrum information |N′(f, τ)|2 of the mechanical noise conveyed from the noise table correction section 112 to the mechanical noise reduction section 106 is successively updated based on the frequency spectrum X(f, τ) of the input signal.
In the case where a correction coefficient for each frequency is acquired for correction.
Acquisition and correction of a correction coefficient for each frequency by the mathematic operation block 131 of the noise table correction section 112 is described.
Referring to
Slanting lines shown in
In order to acquire a correction coefficient for each frequency, the mathematic operation block 131 of the noise table correction section 112 calculates an average spectrum for a long period of time, for example, for 1 to 2 seconds, based on a frequency spectrum X(f, τ) of an input signal. Then, the mathematic operation block 131 calculates an average spectrum envelope from the average spectrum and calculates a correction coefficient from the average spectrum envelope. A curve a of
A curve a of
The frequency characteristic of the spectrum envelope, that is, a linear predictive filter F(z) is represented by the following expression (5):
where A(z) is an inverse filter (refer to Sadaoki FURUI, “New Acoustic Sound Engineering,” Kindai Kagakusha Co., Ltd., pp. 126-127).
In the case where a correction coefficient is to be determined from a spectrum envelope, for example, a frequency characteristic of K(z) represented by the following expression (6) which is obtained by modifying the frequency characteristic of F(z) described hereinabove is calculated:
where λ is a value which satisfies 0<λ≦1. As the value of λ approaches 1, the correction coefficient indicates flattened variation.
Then, a frequency characteristic of H(z)=K(z)/F(z) represented by the following expression (7), that is, a frequency characteristic of the correction coefficient, is calculated:
H(z) makes a filter having a valley around a peak frequency of the spectrum envelope.
A curve a of
A flow chart of
Then at step ST23, the noise table correction section 112 decides based on control information from the control section 201 whether or not a zooming operation is being carried out. In the case where a zooming operation is not being carried out, the noise table correction section 112 calculates a correction coefficient based on the frequency spectrum X(f, τ) of the input signal in which driving noise or mechanical noise of the motor 203 is not included. Therefore, when a zooming operation is not being carried out, the noise table correction section 112 advances the processing to step ST24 in order to calculate a correction coefficient.
At step ST24, the noise table correction section 112 decides whether or not a fixed period of time elapses after a correction coefficient is calculated last. When the fixed interval of time does not elapse, the noise table correction section 112 returns the processing immediately to step ST22 without calculating a correction coefficient. On the other hand, if the fixed period of time elapses, then the noise table correction section 112 returns the processing to step ST25.
At step ST25, the noise table correction section 112 decides whether or not a zooming operation has been carried out within the predetermined period of time, that is, within T seconds, in the past. This is because the noise table correction section 112 calculates a correction coefficient based on the frequency spectrum X(f, τ) of the input signal for a predetermined number of frames obtained in the predetermined period of time in the past. The T seconds are, for example, 1 to 2 seconds. When a zooming operation was carried out within the predetermined period of time in the past, the noise table correction section 112 returns the processing to step ST22 immediately without calculating a correction coefficient. On the other hand, if a zooming operation has not been carried out within the predetermined period of time in the past, then the noise table correction section 112 returns the processing to step ST26.
At step ST26, the noise table correction section 112 determines an average spectrum of the frequency spectrum X(f, τ) of the input signal for a predetermined number of frames within the predetermined period of time in the past, and further calculates a linear predictive coefficient of the spectrum envelope (refer to the expression (5) given hereinabove). Then, at step ST27, the noise table correction section 112 calculates a frequency characteristic of H(z)=K(z)/F(z), that is, a frequency characteristic of the correction coefficient (refer to the expression (7) given hereinabove).
Then at step ST28, the noise table correction section 112, calculates a correction coefficient H(k) (k=1, 2, . . . , L) for each frequency from the frequency characteristic of H(z)=K(z)/F(z) calculated at step ST27 and retains the correction coefficients H(k) into the retaining block 132. Here, “k” is an index indicative of a frequency. The noise table correction section 112 returns the processing to step ST22 after the processing at step S28.
When a zooming operation is being carried out, the noise table correction section 112 reads out frequency spectrum information of mechanical noise from the noise table 107 and notifies the mechanical noise reduction section 106 of corrected frequency spectrum information of the mechanical noise. Therefore, when a zooming operation is not being carried out at step ST23, the noise table correction section 112 returns the processing to step ST29.
At step ST29, the noise table correction section 112 reads out frequency spectrum information Ntable(k) (k=1, 2, . . . , L) of each frequency of mechanical noise corresponding to the zooming direction from the noise table 107 based on controlling information from the control section 201. Then, the noise table correction section 112 reads out a correction coefficient H(k) (k=1, 2, . . . , L) for each frequency retained in the retaining block 132 at step ST30.
Then at step ST31, the noise table correction section 112 multiplies, for each frequency, the frequency spectrum information Ntable(k) of the mechanical noise by the correction coefficient H(k) to carry out correction at step ST31. By this correction, corrected frequency spectrum information Ncomp(k)=H(k)·Ntable(k) (k=1, 2, . . . , L) of mechanical noise is obtained. Then at step ST32, the noise table correction section 112 notifies the mechanical noise reduction section 106 of the corrected frequency spectrum information Ncomp(k) (k=1, 2, . . . , L) of the mechanical noise. The noise table correction section 112 returns the processing to step ST22 after the process at step ST32.
If, during a zooming operation, the corrected frequency spectrum information Ncomp(k) (k=1, 2, . . . , L) of the mechanical noise conveyed to the mechanical noise reduction section 106, then since also the output sound is fluctuated similarly, this is not preferable. Therefore, in the processing procedure of the noise table correction section 112 in accordance with the flow chart of
In the case where a correction coefficient common to frequencies is acquired for correction.
Acquisition and correction of a correction coefficient common to frequencies by the mathematic operation block 131 of the noise table correction section 112 is described. The correction process can be applied, for example, to a case in which the recording level is compressed by an AGC circuit so that mechanical noise is observed at a level lower than an actual level.
The role of the AGC circuit resides in keeping a fixed sound volume level as far as possible without depending upon the arrangement of a sound source, a recording target and so forth. To this end, the AGC circuit amplifies a signal inputted thereto such that it can pick up sound also of a low level. On the other hand, in the case where sound of an excessively high level is inputted, the AGC circuit compresses the inputted signal so that the input may not be saturated.
As described above, due to the AGC, zooming noise is sometimes observed in a compressed state (refer to
In this instance, there is a tendency that the level of all frequencies drops. Therefore, not a spectrum shape but a characteristic amount representative of the level is calculated to carry out generally uniform correction. Here, an average power is determined based on the frequency spectrum X(f, τ) of the input signal, and a correction coefficient common to the frequencies is acquired and used for correction so that the value thereof decreases as the average power increases.
The flow chart of
Then at step ST43, the noise table correction section 112 decides based on control information from the control section 201 whether or not a zooming operation is being carried out. In the case where a zooming operation is not being carried out, the noise table correction section 112 calculates a correction coefficient based on the frequency spectrum X(f, τ) of the input signal, which does not include a component of driving noise or mechanical noise of the motor 203. Therefore, when a zooming operation is not being carried out, the noise table correction section 112 advances the processing to step ST44 in order to calculate a correction coefficient.
At step ST44, the noise table correction section 112 decides whether or not a fixed period of time elapses after a correction coefficient is calculated last. If the fixed period of time does not elapse, then the noise table correction section 112 returns the processing to step ST42 immediately without calculating a correction coefficient. On the other hand, if the fixed period of time elapses, then the noise table correction section 112 advances the processing to step ST45.
At step ST45, the noise table correction section 112 decides whether or not a zoom operation is carried out within a predetermined period of time, that is, T seconds, in the past. This is because the noise table correction section 112 calculates a correction coefficient based on the frequency spectrum X(f, τ) of the input signal for a predetermined number of frames obtained in the predetermined period of time in the past. For example, the T seconds are 1 to 2 seconds. If a zooming operation is carried out within the predetermined period of time in the past, then the noise table correction section 112 returns the processing to step ST42 immediately without calculating a correction coefficient. On the other hand, if a zooming operation is not carried out within the predetermined period of time in the past, then the noise table correction section 112 advances the processing to step ST46.
At step ST46, the noise table correction section 112 calculates an average power or average energy P (logarithmic RMS P) of the frequency spectrum X(f, τ) of the input signal within the predetermined period of time in the past in accordance with the following expression (8):
In this instance, for example, only frequency spectra X(f, τ) of frequencies in a frequency region of, for example, 1 to 4 kHz are used.
Then at step ST47, the noise table correction section 112 utilizes the average power P calculated at step ST46 to refer to a table representative of a corresponding relationship between the average power P and the correction coefficient C to determine a correction coefficient C common to the frequencies and retains the correction coefficient C into the retaining block 132.
When a zooming operation is carried out, the noise table correction section 112 reads out frequency spectrum information of mechanical noise from the noise table 107 and notifies the mechanical noise reduction section 106 of the corrected frequency spectrum information of the mechanical noise. Therefore, when a zooming operation is not carried out at step ST43, the noise table correction section 112 advances the processing to step ST48.
At step ST48, the noise table correction section 112 reads out frequency spectrum information Ntable(k) (k=1, 2, . . . , L) of the frequencies of mechanical noise corresponding to the zooming direction from the noise table 107 based on control information from the control section 201. Then, the noise table correction section 112 reads out the correction coefficient C common to the frequencies retained in the retaining block 132 at step ST49.
Then at step ST50, the noise table correction section 112 multiplies, for each frequency, the frequency spectrum information Ntable(k) (k=1, 2, . . . , L) of the frequencies of mechanical noise by the correction coefficient C to carry out correction. By the correction, corrected frequency spectrum information Ncomp(k)=C·Ntable(k) (k=1, 2, . . . , L) of the frequencies of mechanical noise is obtained. Then, the noise table correction section 112 notifies the mechanical noise reduction section 106 of the corrected frequency spectrum information Ncomp(k) (k=1, 2, . . . , L) of the frequencies of mechanical noise at step ST51. The noise table correction section 112 returns the processing to step ST42 after the process at step ST51.
If the corrected frequency spectrum information Ncomp(k) (k=1, 2, . . . , L) of the frequencies of mechanical noise conveyed to the mechanical noise reduction section 106 during a zooming operation fluctuates, then also the output sound similarly fluctuates unfavorably. Therefore, the processing procedure of the noise table correction section 112 in accordance with the flow chart of
Production method of a table indicative of a corresponding relationship of the average power P and the correction coefficient C
Here, an example of a method of producing the table indicative of a corresponding relationship of the average power P and the correction coefficient C (refer to
As seen in
In a region, that is, in a linearly increasing region, in which the AGC carries out amplification at a fixed ratio as indicated by a broken line framework AR1, not only the observed signal of the internal microphone Ma but also the observed signal of the external microphone Mb increase at a fixed ratio. In a region indicated by a broken line framework AR2 in which level compression of the AGC occurs, that is, in a level compression region, although the observed signal of the external microphone Mb increases linearly, the observed signal of the internal microphone Ma is fixed.
The difference D between the observed signal of the internal microphone Ma and the observed signal of the external microphone Mb in the linearly increasing region simply is a characteristic difference between the microphone and the amplifier at the succeeding stage. Therefore, if this portion is corrected, then the level difference in the case where level compression of the AGC is carried out can be seen.
If the difference in power or energy between the observed signal of the internal microphone Ma and the observed signal of the external microphone Mb is represented by a ratio based on
By linearly interpolating discrete data illustrated in
It is to be noted that, in the processing procedure of the noise table correction section 112 in accordance with the flow chart of
However, also it seems a possible idea to otherwise use a sample x(t) of the time domain of the input signal within the predetermined period of time in the past to calculate an average power P, that is, a logarithmic RMS P, in accordance with an expression similar to the expression (8) and utilize the average power P to obtain a correction coefficient C. In this instance, the average power P of the input signal is acquired by signal processing of the time domain.
Referring back to
The mechanical noise reduction section 106 uses the corrected frequency spectrum information |N′(f, τ)|2 of mechanical noise to correct the frequency spectrum X(f, τ) obtained by the Fourier transform section 105 to suppress the mechanical noise. In particular, the mechanical noise reduction section 106 of the sound system 100 shown in
Operation of the sound system 100A of the imaging apparatus which includes a video shooting function with sound shown in
The framed signals of frames obtained by the framing section 104 are successively supplied to the Fourier transform section 105. The Fourier transform section 105 carries out a fast Fourier transform (TFT) process for the framed signals to convert them into a frequency spectrum X(f, τ) of the frequency domain. This frequency spectrum X(f, τ) is supplied to the spectrum changeover section 108, mechanical noise reduction section 106 and noise table correction section 112.
The mechanical noise reduction section 106 carries out, during a zooming operation, a mechanical noise reduction process based on zoom controlling information, that is, presence or absence of zooming and the zooming direction, from the control section 201. In this instance, the mechanical noise reduction section 106 multiplies the frequency spectrum X(f, τ) by the gain function G(f, τ) to obtain a frequency spectrum Y(f, τ) corrected such that mechanical noise, that is, driving noise of the motor 203, is suppressed. This frequency spectrum Y(f, τ) is supplied to the spectrum changeover section 108.
The noise table correction section 112 corrects the frequency spectrum information |N(f, τ)|2 of mechanical noise stored in the noise table 107 based on the frequency spectrum X(f, τ) of the input signal obtained by the Fourier transform section 105. In particular, the frequency spectrum information |N(f, τ)|2 of mechanical noise is corrected with the correction coefficient obtained based on the information regarding the input signal such as a frequency characteristic, power and so forth. The mechanical noise reduction section 106 is notified of and uses the corrected frequency spectrum information |N′(f, τ)|2 of mechanical noise.
When a zooming operation is not being carried out, the spectrum changeover section 108 selects the frequency spectrum X(f, τ) supplied from the Fourier transform section 105. This is because, at this time, the motor 203 is not driven and the frequency spectrum X(f, τ) does not include a component of mechanical noise, that is, driving noise of the motor 203. On the other hand, when a zooming operation is being carried out, the spectrum changeover section 108 selects the frequency spectrum Y(f, τ) obtained by the mechanical noise reduction section 106 and corrected so as to suppress the mechanical noise, that is, driving noise of the motor 203.
The frequency spectrum X(f, τ) or the frequency spectrum Y(f, τ) from the spectrum changeover section 108 is supplied to the inverse Fourier transform section 109. The inverse Fourier transform section 109 carries out an inverse fast Fourier transform (IFFT) process for the frequency spectrum outputted from the spectrum changeover section 108 for each frame to restore framed signals of the time domain.
The framed signals are supplied to the waveform synthesis section 110. The waveform synthesis section 110 synthesizes the framed signals of the frames to restore a sound signal which is continuous in a time series. This sound signal is supplied to the recording section 111. The recording section 111 records the sound signal supplied from the waveform synthesis section 110 into a recording medium such as a disk or a memory, for example, together with an image signal obtained by the image system.
As described hereinabove, in the sound system 100A of the imaging apparatus including a video shooting function with sound shown in
Further, in the sound system 100A shown in
Further, in the sound system 100A shown in
<3. Third Embodiment>
Sound system of an imaging apparatus including a video shooting function with sound
Referring to
The noise tables 107-1 to 107-n have corrected frequency spectrum information |Ni(f, τ)|2 (i=1, 2, . . . , n) of mechanical noise stored therein. The corrected frequency spectrum information |Ni(f, τ)|2 (i=1, 2, . . . , n) is corrected in advance with the values Ci (i=1, 2, . . . , n) of the correction coefficients C of a P (average power P)—C (correction coefficient) table (refer to
It is to be noted that driving noise generated by the motor 203 is different depending upon a zooming operation in the telephoto direction and a zooming operation in the wide-angle direction. Therefore, in the noise tables 107-1 to 107-n, corrected frequency spectrum information of mechanical noise which corresponds to zooming operations in the telephoto direction and the wide-angle direction is stored.
The noise table changeover section 113 determines a noise table to be used by the mechanical noise reduction section 106, that is, a used noise table, to read out corrected frequency spectrum information of mechanical noise from among the noise tables 107-1 to 107-n. The noise table changeover section 113 carries out the determination of a used noise table based on the frequency spectrum X(f, τ) of the input signal obtained by the Fourier transform section 105. Then, the noise table changeover section 113 reads out the corrected frequency spectrum information of mechanical noise from the thus determined used noise table and notifies the mechanical noise reduction section 106 of the read out frequency spectrum information. This noise table changeover section 113 configures a spectrum information changing section.
In this instance, the noise table changeover section 113 carries out a noise table changeover process based on the zoom controlling information from the control section 201 such as presence or absence of zooming and the zooming direction. Upon a zooming operation, the noise table changeover section 113 carries out a noise table changeover process upon driving of the motor 203. On the other hand, upon a zooming operation in the telephoto direction or the wide-angle direction, the noise table changeover section 113 reads out frequency spectrum information corresponding to the direction from the determined used noise table and notifies the mechanical noise reduction section 106 of the frequency spectrum information.
It is to be noted that it seems a possible idea to produce in advance a table indicative of a relationship between the average power P and a used noise table. In this instance, the mathematic operation block 141 can simply determine a used noise table based on the table.
The retaining block 142 retains data necessary for a mathematic operation process by the mathematic operation block 141 or used noise table information as a result of such mathematic operation. The changeover block 143 changes over the noise table from which corrected frequency spectrum information of mechanical noise is to be read out to a noise table indicated by the used noise table information retained in the retaining block 142. The notification block 144 reads out the corrected frequency spectrum information |N′(f, τ)|2 of mechanical noise from the noise table changed over by the changeover block 143 and notifies the mechanical noise reduction section 106 of the frequency spectrum information τ)|2. The mechanical noise reduction section 106 uses the corrected frequency spectrum information |N′(f, τ)|2 of mechanical noise conveyed from the noise table changeover section 113 in this manner.
The flow chart of
Then at step ST63, the noise table changeover section 113 decides based on control information from the control section 201 whether or not a zooming operation is being carried out. If a zooming operation is not being carried out, then the noise table changeover section 113 determines a used noise table based on the frequency spectrum X(f, τ) of the input signal which does not include a component of driving noise or mechanical noise of the motor 203. Therefore, when a zooming operation is not being carried out, the noise table changeover section 113 advances the processing to step ST64 in order to calculate a correction coefficient.
At step ST64, the noise table changeover section 113 decides whether or not a fixed period of time elapses after a used noise table was determined last. If the fixed period of time does not elapse, then the noise table changeover section 113 returns the processing to step ST62 immediately without determining a used noise table. On the other hand, if the fixed period of time elapses, then the noise table changeover section 113 advances the processing to step ST65.
At step ST65, the noise table changeover section 113 decides whether or not a zooming operation has been carried out within a predetermined period of time, that is, within T seconds, in the past. This is because the noise table changeover section 113 determines a used noise table based on the frequency spectrum X(f, τ) of the input signal of a predetermined number of frames obtained within the predetermined period of time in the past. For example, the T seconds are 1 to 2 seconds. If a zooming operation has been carried out within the predetermined period of time in the past, then the noise table changeover section 113 returns the processing to step ST62 immediately without determining a used noise table. On the other hand, if a zooming operation has not been carried out within the predetermined period of time in the past, then the noise table changeover section 113 advances the processing to step ST66.
At step ST66, the noise table changeover section 113 calculates an average power or average energy P (logarithmic RMS P) of the frequency spectrum X(f, τ) of the input signal within the predetermined period of time in the past in accordance with the following expression (9):
In this instance, only the frequency spectrum X(f, τ) of frequencies in the frequency domain of, for example, 1 to 4 kHz is used.
Then at step ST67, the noise table changeover section 113 utilizes the average power P calculated at step ST66 to refer to the table (refer to
When a zooming operation is being carried out, the noise table changeover section 113 reads out corrected frequency spectrum information of mechanical noise from the used noise table from among the noise tables 107-1 to 107-n and notifies the mechanical noise reduction section 106 of the frequency spectrum information. Therefore, when a zooming operation is not being carried out at step ST63, the noise table changeover section 113 advances the processing to step ST68.
At step ST68, the noise table changeover section 113 reads out corrected frequency spectrum information Ntable(k) (k=1, 2, . . . , L) for the frequencies of mechanical noise corresponding to the zooming direction from the used noise table based on controlling information from the control section 201. Then at step ST69, the noise table changeover section 113 notifies the mechanical noise reduction section 106 of the read out corrected frequency spectrum information Ntable(k) (k=1, 2, . . . , L) for the frequencies of mechanical noise. The noise table changeover section 113 returns the processing to step ST62 after the process at step ST69.
If the corrected frequency spectrum information Ntable(k) (k=1, 2, . . . , L) for the frequencies of mechanical noise conveyed to the mechanical noise reduction section 106 fluctuates during a zooming operation, then also the output sound similarly fluctuates unfavorably. Therefore, in the processing procedure of the noise table changeover section 113 in accordance with the flow chart of
It is to be noted that, in the processing procedure of the noise table changeover section 113 in accordance with the flow chart of
However, also it may seem a possible idea to otherwise use samples x(t) in the time domain of the input signal within the predetermined period of time in the past to calculate an average power P (logarithmic RMS P) in accordance with an expression similar to the expression (9) and utilize the average power P to determine a used noise table. In this instance, the average power P of the input signal is acquired by signal processing in the time domain.
Referring back to
The mechanical noise reduction section 106 uses the corrected frequency spectrum information |N′(f, τ)|2 of mechanical noise to correct the frequency spectrum X(f, τ) obtained by the Fourier transform section 105 to suppress the mechanical noise. In particular, the mechanical noise reduction section 106 of the sound system 100A shown in
Operation of the sound system 100B of the imaging apparatus including a video shooting function with sound shown in
The framed signals of frames obtained by the framing section 104 are successively supplied to the Fourier transform section 105. The Fourier transform section 105 carries out a fast Fourier transform (FFT) process for the framed signals to convert them into a frequency spectrum X(f, τ) of the frequency domain. The frequency spectrum X(f, τ) is supplied to the spectrum changeover section 108, mechanical noise reduction section 106 and noise table changeover section 113.
The mechanical noise reduction section 106 carries out, during a zooming operation, a mechanical noise reduction process based on zooming controlling information such as presence or absence of zooming or the zooming direction from the control section 201. In this instance, the mechanical noise reduction section 106 multiplies the frequency spectrum X(f, τ) by the gain function G(f, τ) to obtain a frequency spectrum Y(f, τ) corrected so as to suppress mechanical noise or driving noise of the motor 203. The frequency spectrum Y(f, τ) is supplied to the spectrum changeover section 108.
The noise table changeover section 113 determines a used noise table for reading out corrected frequency spectrum information of mechanical noise to be used by the mechanical noise reduction section 106 from among the noise tables 107-1 to 107-n. This determination is carried out based on the average power P of the input signal obtained by the Fourier transform section 105. The mechanical noise reduction section 106 is notified of and uses the corrected frequency spectrum information |N′(f, τ)|2 of mechanical noise read out from the used noise table from the noise table changeover section 113.
When a zooming operation is not being carried out, the spectrum changeover section 108 selects the frequency spectrum X(f, τ) supplied from the Fourier transform section 105. This is because, at this time, the motor 203 is not in a driven state, the frequency spectrum X(f, τ) does not include a component of mechanical noise, that is, driving noise of the motor 203. On the other hand, when a zooming operation is being carried out, the spectrum changeover section 108 selects the frequency spectrum Y(f, τ) corrected so as to suppress the mechanical noise, that is, driving noise of the motor 203, obtained by the mechanical noise reduction section 106.
The frequency spectrum X(f, τ) from the spectrum changeover section 108 or the corrected frequency spectrum Y(f, τ) is supplied to the inverse Fourier transform section 109. The inverse Fourier transform section 109 carries out, for each frame, an inverse fast Fourier transform (IFFT) process for the frequency spectrum outputted from the spectrum changeover section 108 to restore framed signals of the time domain.
The framed signals are supplied to the waveform synthesis section 110. The waveform synthesis section 110 synthesizes the framed signals of frames to restore a sound signal which is continuous in a time series. The sound signal is supplied to the recording section 111. The recording section 111 records the sound signal supplied thereto from the waveform synthesis section 110 on a recording medium such as a disk or a memory, for example, together with an image signal obtained by the image system.
As described hereinabove, in the sound system 100B of the imaging apparatus including a video shooting function with sound shown in
Further, in the sound system 100B shown in
Further, in the sound system 100B shown in
<4. Modifications>
It is to be noted that, in the embodiments described hereinabove, the spectrum changeover section 108 is provided. When a zooming operation is not being carried out, the spectrum changeover section 108 reads out a frequency spectrum X(f, τ) from the Fourier transform section 105, but when a zooming operation is being carried out, the spectrum changeover section 108 extracts a corrected frequency spectrum Y(f, τ) from the mechanical noise reduction section 106.
However, if the mechanical noise reduction section 106 controls the gain function G(f, τ) for the multiplication with the frequency spectrum X(f, τ) to “1” when a zooming operation is not being carried out, then the output frequency spectrum Y(f, τ) of the mechanical noise reduction section 106 can always be used. In this instance, the output frequency spectrum Y(f, τ) of the mechanical noise reduction section 106 is supplied directly to the inverse Fourier transform section 109, and the spectrum changeover section 108 can be eliminated.
Further, the sound system 100A described hereinabove with reference to
For example, frequency spectrum information of mechanical noise to be supplied to the subtract section may be corrected by and supplied from a correction section similar to the noise table correction section 112 of the sound system 100A shown in
Also the sound system 100B described hereinabove with reference to
For example, corrected frequency spectrum information of mechanical noise may be supplied to the subtract section from a changeover section similar to the noise table changeover section 113 of the sound system 100B shown in
Further, in the embodiments described above, the mechanical noise to be suppressed is driving noise of the motor 203 or zooming noise. However, naturally the mechanical noise to be suppressed is not limited to this. For example, driving noise of a focusing motor, driving noise of motors for panning and tiling and so forth may be suppressed.
Further, the functioning configuration for mechanical noise suppression in the embodiments described hereinabove may be configured by hardware or similar processing may be carried out by software.
The ROM 182 stores therein a processing program of the CPU 181 and necessary data such as frequency spectrum information of mechanical noise collected and recorded in advance. The RAM 183 functions as a working area of the CPU 181. The CPU 181 reads out the processing program stored in the ROM 182 as occasion demands, and the read out processing program is transferred to and developed in the RAM 183. Then, the CPU 181 reads out the developed processing program to execute a mechanical noise suppression process.
In the computer apparatus 50, an input sound signal, that is, an output signal of a microphone, is inputted through the data I/O 184 and accumulated into the RAM 183. A mechanical noise suppression process similar to that in the embodiments described hereinabove is carried out for the input sound signal accumulated in the RAM 183 by the CPU 181. Then, an output sound signal in which mechanical noise is suppressed as a result of the processing is outputted to the outside through the data I/O 184.
The present disclosure can be applied to an imaging apparatus having a mechanical noise generating source which generates mechanical noise in relation to a particular imaging operation such as, for example, a digital camera with a video shooting function with noise.
While preferred embodiments of the present disclosure have been described using specific terms, such description is for illustrative purpose only, and it is to be understood that changes and variations may be made without departing from the spirit or scope of the following claims.
Number | Date | Country | Kind |
---|---|---|---|
P2010-172874 | Jul 2010 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
20060229869 | Nemer | Oct 2006 | A1 |
20080075300 | Isaka | Mar 2008 | A1 |
20080126084 | Lee et al. | May 2008 | A1 |
20100211382 | Sugiyama | Aug 2010 | A1 |
20110035123 | Katrak et al. | Feb 2011 | A1 |
20110305351 | Kimura | Dec 2011 | A1 |
20120245947 | Neuendorf et al. | Sep 2012 | A1 |
Number | Date | Country |
---|---|---|
2006-279185 | Oct 2006 | JP |
Entry |
---|
Steven F. Boll, “Suppression of Acoustic Noise in Speech Using Spectral Subtraction”, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-27, No. 2, Apr. 1979, pp. 113-120. |
Number | Date | Country | |
---|---|---|---|
20120026345 A1 | Feb 2012 | US |