The present invention relates to a microphone unit, a close-talking type speech input device, an information processing system, and a method for manufacturing the microphone unit.
During a telephone conversation, speech recognition, speech recording, and the like, it is preferable to collect only the target speech (the voice of a user). However, depending on the usage environment of a speech input device, sounds other than the target speech, such as background noise, may be present. Therefore, speech input devices having a function of reliably extracting the speech of a user, i.e., of canceling noise, even when used in a noisy environment have been under development.
As technologies for canceling noise in a noisy environment, giving a microphone unit sharp directivity, and identifying the arrival directions of sound waves from differences in their arrival times and canceling the noise by signal processing, are known (for example, refer to JP-A-7-312638, JP-A-9-331377, and JP-A-2001-186241).
Further, in recent years, the downsizing of electronics has been advanced, and the emphasis has been on a technology for downsizing a speech input device.
In order to provide sharp directivity to a microphone unit, it is necessary to array a large number of vibrating membranes, which makes it difficult to downsize the microphone unit.
Further, in order to accurately detect directions of the incoming sound waves by utilizing a difference in times of incoming sound waves, it is necessary to install a plurality of vibrating membranes approximately every several wavelengths of an audible sound wave. Accordingly, it is difficult to downsize a microphone unit.
An object of the present invention is to provide a high-quality microphone unit whose outer shape is small and which is capable of performing thorough noise cancellation, a close-talking type speech input device, an information processing system, and a method for manufacturing the microphone unit.
(1) A microphone unit according to the present invention comprising: a case having an internal space; a partition member which is provided in the case, and at least partially composed of a vibrating membrane, the partition member that splits the internal space into a first space and a second space; and an electrical signal output circuit that outputs an electrical signal on the basis of vibration of the vibrating membrane, in which a first through hole through which the first space and an external space of the case are communicated with each other, and a second through hole through which the second space and the external space of the case are communicated with each other are formed in the case.
In accordance with the present invention, a user speech and a noise are incident to both surfaces of the vibrating membrane. The noise components incident to the two surfaces of the vibrating membrane are substantially equal in sound pressure, and therefore cancel each other at the vibrating membrane. Therefore, the sound pressure vibrating the vibrating membrane may be regarded as sound pressure indicating a user speech, and an electrical signal acquired on the basis of the vibration of the vibrating membrane may be regarded as an electrical signal indicating a user speech whose noise is canceled.
With this, in accordance with the present invention, it is possible to provide a high-quality microphone unit capable of performing thorough noise cancellation with a simple configuration.
(2) In the microphone unit, the partition member may be provided so as not to allow a medium propagating a sound wave to move between the first and second spaces inside the case.
(3) In the microphone unit, an outer shape of the case is a polyhedron, and the first and second through holes may be formed in one surface of the polyhedron.
That is, in the microphone unit, the first and second through holes may be formed in the same surface of the polyhedron. In other words, the first and second through holes may be formed so as to be directed in the same direction. With this, since it is possible to (substantially) equalize sound pressures of noises incident from the first and second through holes into the case, it is possible to accurately cancel the noise.
(4) In the microphone unit, the vibrating membrane may be disposed such that a normal line of the vibrating membrane is parallel to the one surface.
(5) In the microphone unit, the vibrating membrane may be disposed such that a normal line of the vibrating membrane is perpendicular to the one surface.
(6) In the microphone unit, the vibrating membrane may be disposed so as not to overlap with the first or second through hole.
With this, even in the case where foreign matter enters into the internal space via the first and second through holes, it is possible to reduce the possibility that the vibrating membrane is directly damaged by the foreign matter.
(7) In the microphone unit, the vibrating membrane may be disposed beside the first or second through hole.
(8) In the microphone unit, the vibrating membrane may be disposed such that a distance from the first through hole and a distance from the second through hole are not equal.
(9) In the microphone unit, the partition member may be disposed such that volumes of the first and second spaces are equal.
(10) In the microphone unit, a center-to-center distance between the first and second through holes may be 5.2 mm or less.
(11) In the microphone unit, at least a part of the electrical signal output circuit may be formed inside the case.
(12) In the microphone unit, the case may have a shielding structure of electromagnetically shielding the internal space from the external space of the case.
(13) In the microphone unit, the vibrating membrane may be composed of a transducer having an SN ratio of approximately 60 decibels or more.
For example, the vibrating membrane may be composed of a transducer whose SN ratio is 60 decibels or more, or may be composed of a transducer whose SN ratio is 60±α decibels or more.
(14) In the microphone unit, a center-to-center distance between the first and second through holes may be set to a distance within a range in which sound pressure in the case where the vibrating membrane is used as a differential microphone does not exceed sound pressure in the case where the vibrating membrane is used as a single microphone with respect to a sound in a frequency band less than or equal to 10 kHz.
The first and second through holes may be disposed along a traveling direction of a sound (for example, a speech) of a sound source, and a center-to-center distance between the first and second through holes may be set to a distance within a range in which sound pressure in the case where the vibrating membrane is used as a differential microphone does not exceed sound pressure in the case where the vibrating membrane is used as a single microphone with respect to a sound from the traveling direction.
(15) In the microphone unit, a center-to-center distance between the first and second through holes may be set to a distance within a range in which sound pressure in the case where the vibrating membrane is used as a differential microphone does not exceed sound pressure in the case where the vibrating membrane is used as a single microphone in all directions with respect to a sound in an extractive target frequency band.
The extractive target frequency band is a frequency of a sound required to be extracted by the microphone. For example, a center-to-center distance between the first and second through holes may be set with a frequency less than or equal to 7 kHz serving as an extractive target frequency band.
(16) The present invention is a close-talking type speech input device in which the microphone unit according to any one of the above descriptions is mounted.
In accordance with this speech input device, it is possible to acquire an electrical signal indicating a user speech whose noise is accurately canceled. Therefore, in accordance with the present invention, it is possible to provide a speech input device capable of achieving highly accurate speech recognition processing and speech authentication processing, or command generation processing based on an input speech.
(17) In the speech input device according to the present invention, an outer shape of the case is a polyhedron, and the first and second through holes may be formed in one surface of the polyhedron.
(18) In the speech input device according to the present invention, a center-to-center distance between the first and second through holes may be 5.2 mm or less.
(19) In the speech input device according to the present invention, the vibrating membrane may be composed of a transducer having an SN ratio of approximately 60 decibels or more.
(20) In the speech input device according to the present invention, a center-to-center distance between the first and second through holes may be set to a distance within a range in which sound pressure in the case where the vibrating membrane is used as a differential microphone does not exceed sound pressure in the case where the vibrating membrane is used as a single microphone with respect to a sound in a frequency band less than or equal to 10 kHz.
(21) In the speech input device according to the present invention, a center-to-center distance between the first and second through holes may be set to a distance within a range in which sound pressure in the case where the vibrating membrane is used as a differential microphone does not exceed sound pressure in the case where the vibrating membrane is used as a single microphone in all directions with respect to a sound in an extractive target frequency band.
(22) The present invention is an information processing system comprising: the microphone unit according to any one of the above descriptions; and an analysis processing unit that executes analysis processing of a speech incident to the microphone unit on the basis of the electrical signal.
In accordance with this information processing system, it is possible to acquire an electrical signal indicating a user speech whose noise is accurately canceled. Therefore, in accordance with the present invention, it is possible to provide an information processing system capable of achieving highly accurate speech recognition processing and speech authentication processing, or command generation processing based on an input speech.
(23) A method for manufacturing a microphone unit according to the present invention, the microphone unit including: a case having an internal space; a partition member which is provided in the case, and at least partially composed of a vibrating membrane, the partition member that splits the internal space into a first space and a second space; and an electrical signal output circuit that outputs an electrical signal on the basis of vibration of the vibrating membrane, the method comprising: setting a center-to-center distance between the first and second through holes to a distance within a range in which sound pressure in the case where the vibrating membrane is used as a differential microphone does not exceed sound pressure in the case where the vibrating membrane is used as a single microphone with respect to a sound in a frequency band less than or equal to 10 kHz; and forming a first through hole through which the first space and an external space of the case are communicated with each other, and a second through hole through which the second space and the external space of the case are communicated with each other, in the case according to the set center-to-center distance.
The first and second through holes may be disposed along a traveling direction of a sound (for example, a speech) of a sound source, and a center-to-center distance between the first and second through holes may be set to a distance within a range in which sound pressure in the case where the vibrating membrane is used as a differential microphone does not exceed sound pressure in the case where the vibrating membrane is used as a single microphone with respect to a sound from the traveling direction.
(24) A method for manufacturing a microphone unit according to the present invention, the microphone unit including: a case having an internal space; a partition member which is provided in the case, and at least partially composed of a vibrating membrane, the partition member that splits the internal space into a first space and a second space; and an electrical signal output circuit that outputs an electrical signal on the basis of vibration of the vibrating membrane, the method comprising: setting a center-to-center distance between the first and second through holes to a distance within a range in which sound pressure in the case where the vibrating membrane is used as a differential microphone does not exceed sound pressure in the case where the vibrating membrane is used as a single microphone in all directions with respect to a sound in an extractive target frequency band; and forming a first through hole through which the first space and an external space of the case are communicated with each other, and a second through hole through which the second space and the external space of the case are communicated with each other, in the case according to the set center-to-center distance.
The extractive target frequency band is a frequency of a sound required to be extracted by the microphone, which may be, for example, a frequency less than or equal to 7 kHz.
1: microphone unit, 2: speech input device, 3: microphone unit, 4: microphone unit, 5: microphone unit, 6: microphone unit, 7: microphone unit, 8: microphone unit, 9: microphone unit, 10: case, 11: case, 12: first through hole, 13: microphone unit, 14: second through hole, 16: convex curved surface, 17: concave curved surface, 18: spherical surface, 20: partition member, 21: partition member, 30: vibrating membrane, 31: vibrating membrane, 32: holding unit, 40: electrical signal output circuit, 41: vibrating membrane unit, 42: capacitor, 44: signal amplifier circuit, 45: gain adjusting circuit, 46: charge-up circuit, 48: operational amplifier, 50: case, 52: aperture, 54: elastic body, 60: arithmetic processing unit, 70: communication processing unit, 80: vibrating membrane, 100: internal space, 101: internal space, 102: first space, 104: second space, 110: external space, 112: first space, 114: second space, 122: first space, 124: second space, 132: first space, 134: second space, 200: condenser microphone, 202: vibrating membrane, 204: electrode, 300: mobile telephone, 400: microphone, 500: remote controller, 600: information processing system, 602: speech input device, 604: host computer.
Hereinafter, an embodiment to which the present invention is applied will be described with reference to the accompanying drawings. However, the present invention is not limited to the following embodiment. Further, the present invention includes any free combination of the following contents.
First, the configuration of a microphone unit 1 according to the present embodiment will be described.
As shown in
As shown in
As shown in
In the present embodiment, as shown in
As shown in
The partition member 20 may be provided so as not to allow a medium propagating a sound wave to move (to be incapable of moving) between the first space 102 and the second space 104 inside the case 10. For example, the partition member 20 may be an airtight bulkhead, which segregates the internal space 100 (the first space 102 and the second space 104) in an airtight manner inside the case 10.
As shown in
Hereinafter, the configuration of a condenser microphone 200, which is applicable to the microphone unit 1 according to the present embodiment, will be described. In addition,
The condenser microphone 200 has a vibrating membrane 202. In addition, the vibrating membrane 202 corresponds to the vibrating membrane 30 in the microphone unit 1 according to the present embodiment. The vibrating membrane 202 is a membrane (thin membrane) that vibrates on receiving a sound wave; it is electrically conductive and forms one of the electrodes. The condenser microphone 200 further has an electrode 204. The electrode 204 is disposed so as to face the vibrating membrane 202. Accordingly, the vibrating membrane 202 and the electrode 204 form a capacitor. When a sound wave is incident to the condenser microphone 200, the vibrating membrane 202 vibrates, and the interval between the vibrating membrane 202 and the electrode 204 changes, which changes the electrostatic capacitance between the vibrating membrane 202 and the electrode 204. By extracting the change in electrostatic capacitance as, for example, a change in voltage, it is possible to acquire an electrical signal based on vibration of the vibrating membrane 202. That is, it is possible to convert a sound wave incident to the condenser microphone 200 into an electrical signal and output the electrical signal. In addition, in the condenser microphone 200, the electrode 204 may be configured so as not to be affected by a sound wave. For example, the electrode 204 may have a mesh structure.
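The relationship between the membrane-to-electrode interval and the output described above can be illustrated numerically. The following is a minimal sketch, assuming an ideal parallel-plate capacitor and a constant stored charge; neither assumption is stated in this description, and the membrane area, gap, and bias voltage are hypothetical values chosen only for illustration.

```python
EPSILON_0 = 8.8541878128e-12  # vacuum permittivity in F/m


def plate_capacitance(area_m2, gap_m, relative_permittivity=1.0):
    """Ideal parallel-plate capacitance C = eps_r * eps_0 * A / d, in farads."""
    if gap_m <= 0:
        raise ValueError("gap must be positive")
    return relative_permittivity * EPSILON_0 * area_m2 / gap_m


def voltage_at_constant_charge(bias_voltage, rest_gap_m, gap_m):
    """With a fixed charge Q = C0 * V0, the voltage V = Q / C scales with the gap."""
    return bias_voltage * gap_m / rest_gap_m


# Hypothetical values, not taken from the description.
area = 1.0e-6       # 1 mm^2 membrane area
rest_gap = 4.0e-6   # 4 micrometre gap at rest
bias = 10.0         # 10 V bias at rest

for gap in (3.9e-6, 4.0e-6, 4.1e-6):
    c = plate_capacitance(area, gap)
    v = voltage_at_constant_charge(bias, rest_gap, gap)
    print(f"gap={gap * 1e6:.1f} um  C={c * 1e12:.4f} pF  V={v:.3f} V")
```

Under these assumptions, a smaller gap raises the capacitance and lowers the voltage, so the membrane displacement appears directly as a voltage change.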
In addition, the vibrating membrane 30 of the microphone unit 1 according to the present embodiment is not limited to that of the above-described condenser microphone 200, and vibrating membranes for various sorts of microphones, such as electrodynamic (dynamic type), electromagnetic (magnetic type), and piezoelectric (crystal type) microphones may be applied as the vibrating membrane 30.
Alternatively, the vibrating membrane 30 may be a semiconductor film (for example, a silicon film). That is, the vibrating membrane 30 may be a vibrating membrane for a silicon microphone (Si microphone). When a silicon microphone is used, it is possible to downsize the microphone unit 1 and realize the microphone unit 1 with high performance.
The outer shape of the vibrating membrane 30 is not particularly limited. As shown in
In addition, in the present embodiment, as shown in
In the present embodiment, as shown in
The microphone unit 1 according to the present embodiment includes the electrical signal output circuit 40 that outputs an electrical signal on the basis of vibration of the vibrating membrane 30. The electrical signal output circuit 40 may be formed at least partially inside the internal space 100 of the case 10. The electrical signal output circuit 40 may be formed on the inner wall surface of the case 10, for example. That is, in the present embodiment, the case 10 may be utilized as a circuit substrate for an electric circuit.
Meanwhile, in the case where a silicon microphone is applied as the vibrating membrane 30, the electrical signal output circuit 40 may be realized by forming an integrated circuit on a semiconductor substrate provided in the silicon microphone.
Further, the electrical signal output circuit 40 may further include a conversion circuit that converts an analog signal into a digital signal, a compression circuit that compresses (encodes) a digital signal, and the like.
Further, the vibrating membrane 30 may be composed of a transducer whose SN ratio is approximately 60 decibels or more. When a transducer functions as a differential microphone, its SN ratio deteriorates as compared with the case where the transducer functions as a single microphone. Accordingly, provided that the vibrating membrane 30 is composed of a transducer whose SN ratio is excellent (for example, a MEMS transducer whose SN ratio is approximately 60 decibels or more), it is possible to realize a sensitive microphone unit.
For example, when a single microphone is used as a differential microphone with the distance between a speaker and the microphone set to approximately 2.5 cm (a close-talking type microphone unit), its sensitivity deteriorates by approximately ten-odd decibels as compared with the case where the microphone is used as a single microphone. However, since the vibrating membrane 30 of the microphone unit 1 according to the present embodiment is composed of a transducer whose SN ratio is approximately 60 decibels or more, the microphone unit 1 retains the sensitivity level necessary for functioning as a microphone.
As described above, the microphone unit 1 according to the present embodiment has a highly accurate noise-canceling function despite its simple configuration. Hereinafter, the principle of noise-cancellation of the microphone unit 1 will be described.
(1) Configuration of the Microphone Unit 1 and Principle of Vibration of the Vibrating Membrane 30
First, the principle of vibration of the vibrating membrane 30 derived from the configuration of the microphone unit 1 will be described.
In the microphone unit 1 according to the present embodiment, the vibrating membrane 30 receives sound pressures from both sides (the first surface 35 and the second surface 37). Therefore, when sound pressures at the same level are simultaneously exerted onto both sides of the vibrating membrane 30, the two sound pressures cancel each other at the vibrating membrane 30 and produce no force vibrating the vibrating membrane 30. In contrast thereto, when there is a difference between the sound pressures received by the two sides of the vibrating membrane 30, the vibrating membrane 30 is vibrated by the difference between the sound pressures.
Further, the sound pressures of sound waves incident into the first through hole 12 and the second through hole 14 are uniformly transmitted to the inner wall surfaces of the first space 102 and the second space 104 according to Pascal's law. Therefore, the surface (the first surface 35) of the vibrating membrane 30 on the side of the first space 102 receives sound pressure equal to the sound pressure incident into the first through hole 12, and the surface (the second surface 37) of the vibrating membrane 30 on the side of the second space 104 receives sound pressure equal to the sound pressure incident into the second through hole 14.
That is, the sound pressures received by the first surface 35 and the second surface 37 are respectively the sound pressures of the sounds incident into the first through hole 12 and the second through hole 14, and the vibrating membrane 30 vibrates by a difference between the sound pressures of the sound waves incident from the first through hole 12 and the second through hole 14 to reach the first surface 35 and the second surface 37.
(2) Property of Sound Wave
A sound wave is attenuated as it travels in a medium, and its sound pressure (the intensity and amplitude of the sound wave) decreases. Since sound pressure is inversely proportional to the distance from a sound source, sound pressure P may be, in a relationship with a distance R from the sound source, expressed as follows:
$$P = \frac{k}{R} \tag{1}$$
In addition, k in expression (1) is a proportional constant.
In the case where the microphone unit 1 is applied to a close-talking type speech input device, a speech of a user is generated in the vicinity of the first through hole 12 and the second through hole 14 of the microphone unit 1. Therefore, the speech of the user is greatly attenuated between the first through hole 12 and the second through hole 14, which produces a large difference between the sound pressures of the speech of the user incident into the first through hole 12 and the second through hole 14, i.e., the sound pressures of the speech of the user incident to the first surface 35 and the second surface 37.
In contrast thereto, a sound source of a noise component exists at a distant position from the first through hole 12 and the second through hole 14 of the microphone unit 1 as compared with the speech of the user. Therefore, the sound pressures of noises are hardly attenuated between the first through hole 12 and the second through hole 14, which hardly shows a difference between the sound pressures of the noise input into the first through hole 12 and the second through hole 14.
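The contrast between a nearby speech source and a distant noise source can be checked with the inverse-distance relation described above. The sketch below assumes that the source lies on the line through the two through holes, so that the second through hole is farther from the source by exactly the hole spacing; the distances (2.5 cm for speech, 1 m for noise, 5 mm spacing) are hypothetical example values.

```python
def sound_pressure(k, distance_m):
    """Sound pressure under the inverse-distance relation P = k / R."""
    return k / distance_m


def relative_pressure_difference(source_distance_m, hole_spacing_m, k=1.0):
    """Fraction by which pressure drops between the first and second hole.

    Assumes the second hole is hole_spacing_m farther from the source.
    """
    p1 = sound_pressure(k, source_distance_m)
    p2 = sound_pressure(k, source_distance_m + hole_spacing_m)
    return (p1 - p2) / p1


spacing = 0.005                                         # hypothetical 5 mm spacing
speech = relative_pressure_difference(0.025, spacing)   # close-talking source
noise = relative_pressure_difference(1.0, spacing)      # distant source

print(f"speech: {speech:.4f}  noise: {noise:.4f}  ratio: {speech / noise:.1f}")
```

With these example values the speech pressure drops by roughly one sixth between the holes, while the noise pressure drops by about half a percent, which is the asymmetry the membrane exploits.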
(3) Principle of Noise-Cancellation
As described above, the vibrating membrane 30 is vibrated by a difference between sound pressures of sound waves simultaneously incident to the first surface 35 and the second surface 37. Then, since a difference between sound pressures of noises incident to the first surface 35 and the second surface 37 is extremely small, the difference is canceled in the vibrating membrane 30. In contrast thereto, since a difference between sound pressures of a user speech incident to the first surface 35 and the second surface 37 is great, the difference is not canceled in the vibrating membrane 30, which vibrates the vibrating membrane 30.
With this, the vibrating membrane 30 of the microphone unit 1 may be considered to be vibrated by a user speech. Therefore, an electrical signal output from the electrical signal output circuit 40 of the microphone unit 1 may be regarded as a signal indicating the user speech whose noise is canceled.
That is, provided that the microphone unit 1 according to the present embodiment is applied to a speech input device, it is possible to acquire an electrical signal indicating a user speech whose noise is canceled with a simple configuration.
As described above, in accordance with the microphone unit 1, it is possible to acquire an electrical signal indicating a user speech whose noise is canceled. However, the sound waves include their phase components. Therefore, considering a phase difference between the sound waves incident from the first through hole 12 and the second through hole 14 to the first surface 35 and the second surface 37 of the vibrating membrane 30, it is possible to derive the conditions under which it is possible to achieve a higher accuracy noise-canceling function (the design conditions of the microphone unit 1). Hereinafter, the conditions required to be fulfilled by the microphone unit 1 in order to achieve a higher accuracy noise-canceling function, will be described.
In accordance with the microphone unit 1, a noise component included in a sound pressure difference vibrating the vibrating membrane 30 (a difference between sound pressures received by the first surface 35 and the second surface 37: hereinafter called “differential sound pressure”) may be made less than a noise component included in sound pressures incident to the first surface 35 and the second surface 37. To describe in more detail, a noise intensity ratio indicating a ratio of an intensity of the noise component included in the differential sound pressure to an intensity of the noise component included in the sound pressures incident to the first surface 35 or the second surface 37, is made less than a user speech intensity ratio indicating a ratio of an intensity of a user speech component included in the differential sound pressure to an intensity of a user speech component included in sound pressures incident to the first surface 35 or the second surface 37. Thus, since the microphone unit 1 has an excellent noise-canceling function, it is possible to regard a signal output on the basis of a differential sound pressure vibrating the vibrating membrane 30 as a signal indicating a user speech.
Hereinafter, the concrete conditions required to be fulfilled by the microphone unit 1 (the case 10) in order to achieve the noise-canceling function, will be described.
First, the sound pressures of a speech incident to the first surface 35 and the second surface 37 of the vibrating membrane 30 (the first through hole 12 and the second through hole 14) will be considered. Given that the distance from the sound source of a user speech to the first through hole 12 is R, and the center-to-center distance between the first through hole 12 and the second through hole 14 is Δr, when ignoring a phase difference, the sound pressures (intensities) P(S1) and P(S2) of the user speech incident into the first through hole 12 and the second through hole 14 may be expressed, with the proportional constant k of expression (1), as follows:
$$P(S1) = \frac{k}{R} \tag{2}$$
$$P(S2) = \frac{k}{R + \Delta r} \tag{3}$$
Therefore, a user speech intensity ratio ρ(P) indicating the ratio of the intensity of the user speech component included in the differential sound pressure to the intensity of the sound pressure of the user speech incident to the first surface 35 (the first through hole 12) when ignoring a phase difference of the user speech, is expressed as follows:
$$\rho(P) = \frac{P(S1) - P(S2)}{P(S1)} = \frac{\Delta r}{R + \Delta r} \tag{4}$$
Here, in the case where the microphone unit 1 is utilized for a close-talking type speech input device, Δr may be considered to be sufficiently less than R.
Accordingly, the above-described expression (4) may be modified as follows:
$$\rho(P) = \frac{\Delta r}{R} \tag{A}$$
That is, it is shown that the user speech intensity ratio when ignoring a phase difference of the user speech is expressed by expression (A).
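The size of the approximation made in passing from expression (4) to expression (A) can be checked numerically. The following is a minimal sketch, assuming the inverse-distance relation so that P(S1) = k/R and P(S2) = k/(R + Δr), which gives an exact ratio of Δr/(R + Δr); the values R = 2.5 cm and Δr = 5 mm are hypothetical examples, and the function names are illustrative.

```python
import math


def exact_speech_ratio(r_m, spacing_m):
    """(P(S1) - P(S2)) / P(S1) for P = k / R, i.e. dr / (R + dr)."""
    return spacing_m / (r_m + spacing_m)


def approx_speech_ratio(r_m, spacing_m):
    """Approximation dr / R, valid when dr is much smaller than R."""
    return spacing_m / r_m


def to_decibels(ratio):
    """Amplitude ratio expressed in decibels."""
    return 20.0 * math.log10(ratio)


r, dr = 0.025, 0.005  # hypothetical close-talking geometry
exact = exact_speech_ratio(r, dr)
approx = approx_speech_ratio(r, dr)

print(f"exact={exact:.4f} ({to_decibels(exact):.2f} dB)")
print(f"approx={approx:.4f} ({to_decibels(approx):.2f} dB)")
print(f"relative error of approximation={(approx - exact) / exact:.3f}")
```

For this geometry the exact ratio corresponds to a level drop of about 15.6 dB relative to the pressure at the first through hole, which is of the same order as the ten-odd decibel sensitivity loss mentioned earlier for differential use at approximately 2.5 cm. The relative error of the approximation equals Δr/R, so it shrinks as the hole spacing is reduced.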
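Meanwhile, considering a phase difference of the user speech, the sound pressures Q(S1) and Q(S2) of the user speech may be expressed as follows:
$$Q(S1) = \frac{k}{R} \sin \omega t \tag{5}$$
$$Q(S2) = \frac{k}{R + \Delta r} \sin(\omega t - \alpha) \tag{6}$$
In addition, α in the expression is a phase difference.
At this time, a user speech intensity ratio ρ(S) is expressed as follows:
$$\rho(S) = \frac{\left| Q(S1) - Q(S2) \right|_{\max}}{\left| Q(S1) \right|_{\max}} = \frac{\left| \frac{k}{R} \sin \omega t - \frac{k}{R + \Delta r} \sin(\omega t - \alpha) \right|_{\max}}{\left| \frac{k}{R} \sin \omega t \right|_{\max}} \tag{7}$$
Considering expression (7), the level of the user speech intensity ratio ρ(S) may be expressed as follows:
$$\rho(S) = \left| \sin \omega t - \frac{1}{1 + \Delta r / R} \sin(\omega t - \alpha) \right|_{\max} = \frac{1}{1 + \Delta r / R} \left| \sin \omega t - \sin(\omega t - \alpha) + \frac{\Delta r}{R} \sin \omega t \right|_{\max} \tag{8}$$
Meanwhile, in expression (8), the term sin ωt − sin(ωt − α) indicates an intensity ratio of phase components, and the term (Δr/R) sin ωt indicates an intensity ratio of amplitude components. Phase difference components, even when they are user speech components, act as noise with respect to the amplitude components. Therefore, in order to accurately extract a user speech, it is necessary for the intensity ratio of phase components to be sufficiently less than the intensity ratio of amplitude components. That is, it is important that sin ωt − sin(ωt − α) and (Δr/R) sin ωt fulfill the following relationship:
$$\left| \frac{\Delta r}{R} \sin \omega t \right|_{\max} > \left| \sin \omega t - \sin(\omega t - \alpha) \right|_{\max} \tag{B}$$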
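Here, the following expression may be derived:
$$\sin \omega t - \sin(\omega t - \alpha) = 2 \sin\frac{\alpha}{2} \cos\left(\omega t - \frac{\alpha}{2}\right) \tag{9}$$
Therefore, the above-described expression (B) may be expressed as follows:
$$\left| \frac{\Delta r}{R} \sin \omega t \right|_{\max} > \left| 2 \sin\frac{\alpha}{2} \cos\left(\omega t - \frac{\alpha}{2}\right) \right|_{\max} \tag{10}$$
Considering the amplitude components of expression (10), it is shown that it is necessary for the microphone unit 1 according to the present embodiment to fulfill the following expression:
$$\frac{\Delta r}{R} > 2 \sin\frac{\alpha}{2} \tag{C}$$
In addition, as described above, since Δr may be considered to be sufficiently less than R, sin(α/2) may be considered to be sufficiently small, and may be approximated by the following expression:
$$\sin\frac{\alpha}{2} \approx \frac{\alpha}{2} \tag{11}$$
Therefore, expression (C) may be modified as follows:
$$\frac{\Delta r}{R} > \alpha \tag{D}$$
Further, when the relationship between the phase difference α and Δr is expressed, with λ being the wavelength of the sound wave, as follows:
$$\alpha = \frac{2 \pi \Delta r}{\lambda} \tag{12}$$
expression (D) may be modified as follows:
$$\frac{\Delta r}{R} > \frac{2 \pi \Delta r}{\lambda} \tag{E}$$
That is, in the present embodiment, when the microphone unit 1 fulfills the relationship shown by expression (E), it is possible to accurately extract a user speech.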
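The trigonometric step in this derivation and the resulting condition can be checked numerically. The sketch below samples one period to confirm that the peak of |sin ωt − sin(ωt − α)| equals 2 sin(α/2), and then evaluates the condition in the form Δr/R > 2πΔr/λ. That form assumes the standard path-difference relation α = 2πΔr/λ for a wave travelling along the line through the two holes; the speed of sound of 340 m/s and the geometry are hypothetical example values, and the function names are illustrative.

```python
import math


def phase_term_peak(alpha, samples=10000):
    """Peak of |sin(wt) - sin(wt - alpha)| over one period, found by sampling."""
    peak = 0.0
    for i in range(samples):
        wt = 2.0 * math.pi * i / samples
        peak = max(peak, abs(math.sin(wt) - math.sin(wt - alpha)))
    return peak


def phase_difference(spacing_m, wavelength_m):
    """Assumed relation alpha = 2 * pi * dr / lambda (on-axis propagation)."""
    return 2.0 * math.pi * spacing_m / wavelength_m


def satisfies_condition(r_m, spacing_m, wavelength_m):
    """True when dr / R exceeds the phase difference 2 * pi * dr / lambda."""
    return spacing_m / r_m > phase_difference(spacing_m, wavelength_m)


alpha = phase_difference(0.005, 340.0 / 1000.0)  # 5 mm spacing, 1 kHz tone
print(f"alpha={alpha:.5f} rad")
print(f"sampled peak={phase_term_peak(alpha):.6f}")
print(f"2*sin(alpha/2)={2.0 * math.sin(alpha / 2.0):.6f}")
print("condition met at 1 kHz, R = 2.5 cm:", satisfies_condition(0.025, 0.005, 0.34))
```

Because Δr appears on both sides, the condition in this form reduces to λ > 2πR, so under these assumptions it depends on the talking distance and the wavelength rather than on the hole spacing itself.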
Next, sound pressures of noises incident into the first through hole 12 and the second through hole 14 to reach the first surface 35 and the second surface 37 will be considered.
Given that the amplitude of a noise component incident from the first through hole 12 to reach the first surface 35 is A, and the amplitude of a noise component incident from the second through hole 14 to reach the second surface 37 is A′, the sound pressures Q(N1) and Q(N2) of the noise when considering a phase difference component, may be expressed as follows:
$$Q(N1) = A \sin \omega t \tag{13}$$
$$Q(N2) = A' \sin(\omega t - \alpha) \tag{14}$$
A noise intensity ratio ρ(N) indicating the ratio of the intensity of the noise component included in the differential sound pressure to the intensity of the sound pressure of the noise component incident from the first through hole 12 to reach the first surface 35, may be expressed as follows:
$$\rho(N) = \frac{\left| A \sin \omega t - A' \sin(\omega t - \alpha) \right|_{\max}}{\left| A \sin \omega t \right|_{\max}} \tag{15}$$
In addition, as described above, since the amplitude (the intensity) of the noise component incident from the first through hole 12 to reach the first surface 35 and the amplitude (the intensity) of the noise component incident from the second through hole 14 to reach the second surface 37 are substantially the same, those may be handled as A=A′. Accordingly, the above-described expression (15) may be modified as follows:
$$\rho(N) = \frac{\left| \sin \omega t - \sin(\omega t - \alpha) \right|_{\max}}{\left| \sin \omega t \right|_{\max}} \tag{16}$$
Then, the level of the noise intensity ratio may be expressed as follows:
$$\rho(N) = \left| \sin \omega t - \sin(\omega t - \alpha) \right|_{\max} \tag{17}$$
Here, considering the above-described expression (9), the expression (17) may be modified as follows:
$$\rho(N) = \left| 2 \sin\frac{\alpha}{2} \cos\left(\omega t - \frac{\alpha}{2}\right) \right|_{\max} = 2 \sin\frac{\alpha}{2} \tag{18}$$
Then, considering the above-described expression (11), the expression (18) may be modified as follows:
$$\rho(N) = \alpha \tag{19}$$
Here, with reference to expression (D), a level of the noise intensity ratio may be expressed as follows:
Here, Δr/R is the intensity ratio of the amplitude components of a user speech, as shown in expression (A). Expression (F) shows that, in the microphone unit 1, the noise intensity ratio is made less than the intensity ratio Δr/R of the user speech.
As described above, in the microphone unit 1 according to the present embodiment, since the intensity ratio of the phase components of a user speech is made less than the intensity ratio of its amplitude components (refer to expression (B)), the noise intensity ratio is made less than the intensity ratio of the user speech (refer to expression (F)). Accordingly, the microphone unit 1 according to the present embodiment has an excellent noise-canceling function.
Hereinafter, a method for manufacturing the microphone unit 1 according to the present embodiment will be described. The microphone unit 1 according to the present embodiment may be manufactured by utilizing data indicating a correspondence relationship between a value of Δr/λ, which is the ratio of the center-to-center distance Δr between the first through hole 12 and the second through hole 14 to the wavelength λ of a noise, and a noise intensity ratio (an intensity ratio based on phase components of the noise).
An intensity ratio based on phase components of a noise is expressed by the above-described expression (18). Therefore, a decibel value of the intensity ratio based on the phase components of the noise may be expressed as follows:
Then, when respective values are substituted for α in expression (20), it is possible to clarify the correspondence relationship between a phase difference α and an intensity ratio based on phase components of a noise.
In addition, as shown in expression (12), a phase difference α may be expressed by a function of Δr/λ that is a ratio between a distance Δr and a wavelength λ, and the abscissa of
In the present embodiment, the microphone unit 1 is manufactured by utilizing this data.
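The correspondence data can also be tabulated numerically. The following is a minimal sketch, assuming that the phase-based noise intensity ratio has the form 2·sin(α/2) and that the phase difference is α = 2πΔr/λ. These forms are this sketch's reading of expressions (18) and (12), which are not reproduced in this text, and the function names are hypothetical.

```python
import math


def phase_difference(dr_over_lambda):
    """Phase difference alpha in radians (assumed form: alpha = 2*pi*dr/lambda)."""
    return 2.0 * math.pi * dr_over_lambda


def noise_ratio_db(dr_over_lambda):
    """Decibel value of the phase-based noise intensity ratio.

    Assumed form: 20*log10(|2*sin(alpha/2)|). The ratio is zero when
    dr/lambda is an integer, so such inputs are not supported here.
    """
    alpha = phase_difference(dr_over_lambda)
    ratio = abs(2.0 * math.sin(alpha / 2.0))
    return 20.0 * math.log10(ratio)


def tabulate(values):
    """Return (dr/lambda, decibel value) pairs for the given dr/lambda values."""
    return [(x, noise_ratio_db(x)) for x in values]


if __name__ == "__main__":
    for x, db in tabulate([0.005, 0.01, 0.015, 0.05, 0.1, 1.0 / 6.0]):
        print(f"dr/lambda = {x:.4f} -> {db:6.1f} dB")
```

Under these assumptions the decibel value rises monotonically with Δr/λ over the range tabulated, which is why a smaller hole spacing corresponds to a smaller noise intensity ratio.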
First, data (refer to
Next, a target intensity ratio of the noise is set (step S12). In the present embodiment, the intensity ratio must be set so that the noise is reduced. Therefore, in this step, the intensity ratio of the noise is set to 0 decibels or less.
Next, values of Δr/λ corresponding to the intensity ratios of the noise are derived on the basis of the data (step S14).
Then, conditions required to be fulfilled by Δr are derived by substituting a principal noise wavelength for λ (step S16).
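Steps S12 to S16 can be sketched as a short calculation. This is an illustration only, assuming the closed forms 2·sin(α/2) for the noise intensity ratio and α = 2πΔr/λ for the phase difference in place of the tabulated data; the function name `max_hole_spacing` is hypothetical. Values read off a graph may differ slightly from the closed-form result.

```python
import math


def max_hole_spacing(target_db, wavelength_m):
    """Largest center-to-center distance dr (in meters) meeting the target.

    Assumes the noise intensity ratio is 2*sin(pi*dr/lambda), which grows
    monotonically for dr/lambda up to 1/2, so the condition is an upper bound.
    """
    if target_db > 0.0:
        raise ValueError("the target intensity ratio must be 0 dB or less")
    # Step S12: the target intensity ratio, converted from decibels.
    ratio = 10.0 ** (target_db / 20.0)
    # Step S14: the value of dr/lambda that yields this ratio.
    dr_over_lambda = math.asin(ratio / 2.0) / math.pi
    # Step S16: substitute the principal noise wavelength for lambda.
    return dr_over_lambda * wavelength_m


if __name__ == "__main__":
    wavelength = 0.347  # m, the 1 kHz principal noise of the concrete example
    for target in (0.0, -20.0):
        dr_mm = max_hole_spacing(target, wavelength) * 1000.0
        print(f"{target:6.1f} dB -> dr <= {dr_mm:.2f} mm")
```

With λ = 0.347 m, the 5.2 mm spacing quoted in this description lies within the bound that this sketch returns for a target of −20 decibels.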
As a concrete example, consider manufacturing the microphone unit 1 such that the intensity of a noise is reduced by 20 decibels in an environment in which the principal noise is at 1 kHz and has a wavelength of 0.347 m.
First, a condition for an intensity ratio of a noise to be made 0 decibels or less will be considered. With reference to
Next, a condition for reducing the intensity of a 1 kHz noise by 20 decibels will be considered. With reference to
In addition, in the case where the microphone unit 1 according to the present embodiment is utilized for a close-talking type speech input device, an interval between a sound source of a user speech and the microphone unit 1 (the first through hole 12 and the second through hole 14) is usually 5 cm or less. Further, it is possible to set an interval between a sound source of a user speech and the microphone unit 1 (the first through hole 12 and the second through hole 14) by a design of the case in which the microphone unit 1 is housed. Therefore, it is shown that a value of Δr/R which is an intensity ratio of a speech of a user is made greater than 0.1 (an intensity ratio of the noise), thereby achieving a noise-canceling function.
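The comparison in this paragraph can be checked with a few lines of arithmetic. The sketch below assumes Δr = 5.2 mm (the spacing used elsewhere in this description), R = 5 cm (the upper bound stated above), and a noise intensity ratio of 0.1; the function name is hypothetical.

```python
def speech_amplitude_ratio(dr_m, r_m):
    """Intensity ratio dr/R of the amplitude components of a user speech."""
    return dr_m / r_m


if __name__ == "__main__":
    dr = 5.2e-3        # m, assumed center-to-center distance of the through holes
    noise_ratio = 0.1  # noise intensity ratio quoted in the text
    for r in (0.05, 0.03, 0.02):  # m, source-to-microphone intervals of 5 cm or less
        ratio = speech_amplitude_ratio(dr, r)
        print(f"R = {r * 100:.0f} cm: dr/R = {ratio:.3f}, "
              f"exceeds noise ratio: {ratio > noise_ratio}")
```

Because Δr/R grows as R shrinks, R = 5 cm is the least favorable case within the stated range.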
In addition, a noise is usually not limited to a single frequency. However, since a noise at a frequency lower than that of the noise supposed as the principal noise has a wavelength longer than that of the principal noise, its value of Δr/λ is smaller, so such a noise is also canceled by the microphone unit 1. Further, the higher the frequency is, the faster the energy of a sound wave is attenuated. Therefore, since a noise at a frequency higher than that of the noise supposed as the principal noise is attenuated faster than the principal noise, its effect on the microphone unit 1 (the vibrating membrane 30) may be ignored. With this, the microphone unit 1 according to the present embodiment is capable of achieving an excellent noise-canceling function even in an environment in which there is a noise at a frequency different from that of the noise supposed as the principal noise.
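The low-frequency part of this argument can be illustrated numerically. The sketch assumes a sound speed of 347 m/s (implied by the 1 kHz, 0.347 m example above) and the assumed ratio form 2·sin(πΔr/λ); it does not model the attenuation of higher-frequency noise, and the names are hypothetical.

```python
import math

SPEED_OF_SOUND = 347.0  # m/s, implied by a 1 kHz noise having a 0.347 m wavelength


def noise_ratio(dr_m, freq_hz):
    """Phase-based noise intensity ratio at a given frequency (assumed form)."""
    wavelength = SPEED_OF_SOUND / freq_hz
    return abs(2.0 * math.sin(math.pi * dr_m / wavelength))


if __name__ == "__main__":
    dr = 5.2e-3  # m, assumed center-to-center distance
    for f in (100.0, 250.0, 500.0, 1000.0):
        print(f"{f:6.0f} Hz -> ratio {noise_ratio(dr, f):.4f}")
```

Under these assumptions, each frequency below the principal 1 kHz noise gives a smaller ratio than the principal noise itself.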
Further, in the present embodiment, as shown by expression (12), noises incident along the straight line connecting the first through hole 12 and the second through hole 14 are assumed. For such noises, the apparent interval between the first through hole 12 and the second through hole 14 is maximized, and the phase difference between them is the largest that occurs in a real usage environment. That is, the microphone unit 1 according to the present embodiment is configured to be capable of canceling noises between which the phase difference is maximized. Therefore, in accordance with the microphone unit 1 according to the present embodiment, it is possible to cancel noises incident thereto from all directions.
Hereinafter, the effects achieved by the microphone unit 1 will be summarized.
As described above, in accordance with the microphone unit 1, it is possible to acquire an electrical signal indicating a speech whose noise components are canceled by merely acquiring an electrical signal indicating vibration of the vibrating membrane 30 (an electrical signal based on vibration of the vibrating membrane 30). That is, it is possible to achieve a noise-canceling function without performing complex analytic arithmetic processing in the microphone unit 1. Therefore, it is possible to provide a high-quality microphone unit capable of performing thorough noise cancellation with a simple configuration. In particular, by setting the center-to-center distance Δr between the first through hole 12 and the second through hole 14 to 5.2 mm or less, it is possible to provide a microphone unit capable of achieving a more accurate noise-canceling function.
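The worst-case argument can be sketched as follows, assuming a distant noise arriving as a plane wave at an angle θ to the straight line through the two holes, so that the apparent interval is Δr·|cos θ|. This plane-wave model and the function name are assumptions of the sketch, not expressions taken from this description.

```python
import math


def phase_difference(dr_m, wavelength_m, angle_deg):
    """Phase difference for a plane wave arriving at angle_deg to the hole axis."""
    apparent_interval = dr_m * abs(math.cos(math.radians(angle_deg)))
    return 2.0 * math.pi * apparent_interval / wavelength_m


if __name__ == "__main__":
    dr, wavelength = 5.2e-3, 0.347  # m; assumed spacing and the 1 kHz wavelength
    for angle in range(0, 181, 30):
        alpha = phase_difference(dr, wavelength, angle)
        print(f"{angle:3d} deg -> alpha = {alpha:.4f} rad")
```

In this model the phase difference is largest at 0 degrees and 180 degrees, so a spacing chosen for on-axis noise also bounds the phase difference for every other direction.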
Further, a center-to-center distance between the first through hole 12 and the second through hole 14 may be set to a distance within a range in which sound pressure in the case where the vibrating membrane 30 is used as a differential microphone does not exceed sound pressure in the case where the vibrating membrane 30 is used as a single microphone with respect to a sound in a frequency band less than or equal to 10 kHz.
The first through hole 12 and the second through hole 14 may be disposed along a traveling direction of a sound (for example, a speech) from a sound source, and a center-to-center distance between the first and second through holes may be set to a distance within a range in which sound pressure in the case where the vibrating membrane 30 is used as a differential microphone does not exceed sound pressure in the case where the vibrating membrane 30 is used as a single microphone with respect to a sound from the traveling direction.
In
That is, the graphs of
As shown in
Further, as shown in
Further, as shown in
Accordingly, by setting the microphone-to-microphone distance to approximately 5 mm to 6 mm (in more detail, 5.2 mm or less), it is possible to realize a microphone which faithfully extracts a speaker's speech up to a frequency band of 10 kHz, with a strong suppression effect on distant noise.
In the present embodiment, by setting the center-to-center distance between the first through hole 12 and the second through hole 14 to approximately 5 mm to 6 mm (in more detail, 5.2 mm or less), it is possible to realize a microphone which faithfully extracts a speaker's speech up to a frequency band of 10 kHz, with a strong suppression effect on distant noise.
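The relationship between the spacing and the 10 kHz band can be illustrated with a rough estimate. The sketch assumes a distant on-axis noise with equal amplitude at both holes, a sound speed of 347 m/s, and the assumed ratio form 2·sin(πΔr·f/c); the differential output then equals the single-microphone sound pressure when this ratio reaches 1. The graphs referred to in this description may rest on a different model, and the function name is hypothetical.

```python
SPEED_OF_SOUND = 347.0  # m/s, implied by a 1 kHz noise having a 0.347 m wavelength


def crossover_frequency(dr_m):
    """Frequency at which the assumed differential/single ratio reaches 1.

    Solving 2*sin(pi*dr*f/c) = 1 on the first branch gives f = c / (6*dr).
    """
    return SPEED_OF_SOUND / (6.0 * dr_m)


if __name__ == "__main__":
    for dr_mm in (5.2, 10.0, 20.0):
        f = crossover_frequency(dr_mm / 1000.0)
        print(f"dr = {dr_mm:4.1f} mm -> crossover at about {f:7.0f} Hz")
```

Under these assumptions a spacing of 5.2 mm keeps the crossover above 10 kHz, whereas spacings of 10 mm and 20 mm bring it below 10 kHz.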
Further, in the microphone unit 1, it is possible to design the case 10 (the positions of the first through hole 12 and the second through hole 14) so as to be capable of canceling noises incident such that a noise intensity ratio based on its phase difference is maximized. Therefore, in accordance with the microphone unit 1, it is possible to cancel noises incident thereto from all directions. That is, in accordance with the present invention, it is possible to provide a microphone unit capable of canceling noises incident thereto from all directions.
Reference numeral 1110 is a graph indicating the sensitivity (differential sound pressure) of the differential microphone to all directions, and shows the directional characteristics of the differential microphone. Further, reference numeral 1112 is a graph indicating the sensitivity (sound pressure) to all directions when the differential microphone is used as a single microphone, and shows the directional characteristics of the single microphone.
Reference numeral 1114 indicates the direction of the straight line connecting the two microphones in the case where the differential microphone is composed of two microphones, or the direction of the straight line connecting the first through hole and the second through hole through which sound waves are made to reach both surfaces of the microphone in the case where the differential microphone is realized by one microphone (0 degrees to 180 degrees; the two microphones M1 and M2 composing the differential microphone, or the first through hole and the second through hole, are placed on this straight line). The direction of this straight line is 0 degrees and 180 degrees, and the direction perpendicular to this straight line is 90 degrees and 270 degrees.
As shown by reference numerals 1112 and 1122, the single microphone detects sounds uniformly from all directions, and has no directivity. Further, the farther the sound source is, the more the sound pressures to be acquired are attenuated.
As shown by reference numerals 1110 and 1120, the sensitivity of the differential microphone is reduced to a certain extent in the directions of 90 degrees and 270 degrees, but its directivity is substantially uniform in all directions. Further, the sound pressures acquired are attenuated more than those acquired by the single microphone and, as with the single microphone, the farther the sound source is, the more the acquired sound pressures are attenuated.
As shown in
In the case where the microphone-to-microphone distance is 5 mm, as shown in
However, in the case where the microphone-to-microphone distance is 10 mm, as shown in
Further, in the case where the microphone-to-microphone distance is 20 mm, as shown in
Accordingly, by setting the microphone-to-microphone distance of the differential microphone to approximately 5 mm to 6 mm (in more detail, 5.2 mm or less), it can be said that the differential microphone has a stronger suppression effect on distant noise from all directions than the single microphone with respect to sound in a band of 7 kHz or less, independent of the directivity.
In addition, in the case where the differential microphone is realized by one microphone, the same can be said of the distance between the first through hole and the second through hole through which sound waves are made to reach both surfaces of the microphone. Accordingly, in the present embodiment, by setting the center-to-center distance between the first through hole 12 and the second through hole 14 to approximately 5 mm to 6 mm (in more detail, 5.2 mm or less), it is possible to realize a microphone unit capable of suppressing distant noises from all directions, independent of the directivity, with respect to a sound of 7 kHz or less.
In addition, in accordance with the microphone unit 1, it is possible to cancel user speech components that are incident to the vibrating membrane 30 (the first surface 35 and the second surface 37) after being reflected by a wall or the like. Specifically, a user speech reflected by a wall or the like is incident to the microphone unit 1 after propagating a long distance, so it may be regarded as a speech generated from a sound source located farther away than the source of the usual user speech. Moreover, since much of its energy is lost by the reflection, its sound pressure, like that of the noise components, is not greatly attenuated between the first through hole 12 and the second through hole 14. Therefore, in accordance with the microphone unit 1, the user speech components incident after being reflected by a wall or the like are also canceled in the same way as noises (as a type of noise).
Then, by utilizing the microphone unit 1, it is possible to acquire a signal indicating a user speech with no noise contained. Therefore, by utilizing the microphone unit 1, it is possible to achieve highly accurate speech recognition and speech authentication, and command generation processing.
Next, a speech input device 2 having the microphone unit 1 will be described.
(1) Configuration of the Speech Input Device 2
First, the configuration of the speech input device 2 will be described.
The speech input device 2 has a case 50. The case 50 is a member forming the outer shape of the speech input device 2. A basic position may be set for the case 50, which makes it possible to regulate the traveling route of a user speech. Apertures 52 for receiving a speech from a user may be formed in the case 50.
In the speech input device 2, the microphone unit 1 is installed inside the case 50. At this time, the microphone unit 1 may be installed in the case 50 such that the first through hole 12 and the second through hole 14 respectively overlap with the apertures 52. With this, the internal space of the microphone unit 1 communicates with the outside through the first through hole 12, the second through hole 14, and the apertures 52 overlapping these through holes. The microphone unit 1 may be installed in the case 50 via an elastic body 54. With this, vibration of the case 50 of the speech input device 2 is not readily transmitted to the case 10, which makes it possible to operate the microphone unit 1 accurately.
The microphone unit 1 may be installed in the case 50 such that the first through hole 12 and the second through hole 14 are offset from each other along the traveling direction of a user speech. Then, the through hole disposed on the upstream side of the traveling route of the user speech may be set as the first through hole 12, and the through hole disposed on the downstream side thereof may be set as the second through hole 14. Provided that the microphone unit 1 in which the vibrating membrane 30 is disposed beside the second through hole 14 is disposed as described above, it is possible to make a user speech incident simultaneously to both surfaces of the vibrating membrane 30 (the first surface 35 and the second surface 37). Specifically, since the distance from the center of the first through hole 12 to the first surface 35 is substantially equal to the distance from the first through hole 12 to the second through hole 14 in the microphone unit 1, the time required for a user speech that has passed through the first through hole 12 to be incident to the first surface 35 is made substantially equal to the time required for a user speech that has passed over the first through hole 12 to be incident to the second surface 37 via the second through hole 14. That is, the time required for a speech vocalized by a user to be incident to the first surface 35 is made substantially equal to the time required for the speech vocalized by the user to be incident to the second surface 37. Therefore, it is possible to make the user speech incident simultaneously to the first surface 35 and the second surface 37, and it is possible to vibrate the vibrating membrane 30 so as not to generate a noise due to phase shifting. In other words, since α=0 and therefore sin ωt − sin(ωt − α) = 0 in expression (8) described above, only the term (Δr/R) sin ωt (the amplitude components) is extracted.
Therefore, even in the case where a user speech of approximately 7 kHz, which is a high frequency band for human speech, is incident thereto, the effect of phase shifting between the sound pressure incident to the first surface 35 and the sound pressure incident to the second surface 37 is negligible, and it is possible to acquire an electrical signal accurately indicating the user speech.
(2) Functions of the Speech Input Device 2
Next, the functions of the speech input device 2 will be described with reference to
The speech input device 2 has the microphone unit 1. The microphone unit 1 outputs an electrical signal generated on the basis of vibration of the vibrating membrane 30. In addition, an electrical signal output from the microphone unit 1 is an electrical signal indicating a user speech whose noise components are canceled.
The speech input device 2 may have an arithmetic processing unit 60. The arithmetic processing unit 60 executes various kinds of arithmetic processing on the basis of an electrical signal output from the microphone unit 1 (the electrical signal output circuit 40). The arithmetic processing unit 60 may execute analysis processing for an electrical signal. The arithmetic processing unit 60 may execute processing of specifying a person vocalizing a user speech (so-called speech authentication processing) by analyzing an output signal from the microphone unit 1. Alternatively, the arithmetic processing unit 60 may execute processing of specifying the content of a user speech (so-called speech recognition processing) by executing analysis processing for an output signal from the microphone unit 1. The arithmetic processing unit 60 may execute processing of creating various commands on the basis of an output signal from the microphone unit 1. The arithmetic processing unit 60 may execute processing of amplifying an output signal from the microphone unit 1. Further, the arithmetic processing unit 60 may control the operation of a communication processing unit 70 which will be described later. In addition, the arithmetic processing unit 60 may achieve the above-described respective functions by signal processing using CPUs and memories. Alternatively, the arithmetic processing unit 60 may achieve the above-described respective functions by dedicated hardware.
The speech input device 2 may further include the communication processing unit 70. The communication processing unit 70 controls communication between the speech input device 2 and another terminal (a mobile telephone terminal, a host computer, or the like). The communication processing unit 70 may have a function of transmitting a signal (an output signal from the microphone unit 1) to another terminal via a network. The communication processing unit 70 may also have a function of receiving a signal from another terminal via a network. Then, for example, various kinds of information processing, such as speech recognition processing, speech authentication processing, command generation processing, and data storage processing, may be executed by a host computer executing analysis processing for an output signal acquired via the communication processing unit 70. That is, the speech input device 2 may constitute an information processing system in cooperation with another terminal. In other words, the speech input device 2 may be regarded as an information input terminal constituting part of the information processing system. Meanwhile, the speech input device 2 may have a configuration without the communication processing unit 70.
In addition, the arithmetic processing unit 60 and the communication processing unit 70 may be disposed as a packaged semiconductor apparatus (integrated circuit apparatus) inside the case 50. Meanwhile, the present invention is not limited thereto. For example, the arithmetic processing unit 60 may be disposed outside the case 50. In the case where the arithmetic processing unit 60 is disposed outside the case 50, the arithmetic processing unit 60 may acquire a differential signal via the communication processing unit 70.
In addition, the speech input device 2 may further include a display device such as a display panel, or a speech output device such as a loudspeaker. Further, the speech input device 2 may further include operation keys for inputting operational information.
The speech input device 2 may have the above-described configuration. This speech input device 2 utilizes the microphone unit 1. Therefore, the speech input device 2 is capable of acquiring a signal indicating an input speech with no noise contained, which makes it possible to achieve highly accurate speech recognition and speech authentication, and command generation processing.
Further, when the speech input device 2 is applied to a microphone system, a voice of a user output from a loudspeaker as well is canceled as a noise. Therefore, it is possible to provide a microphone system hardly causing acoustic feedback.
In addition, the present invention is not limited to the embodiment described above, and various modifications are possible. The present invention contains configurations substantially the same as the configurations described in the embodiments (for example, configurations which are the same in function, method and result, or configurations which are the same in object and effect). Further, the present invention contains configurations in which unessential portions in the configurations described in the embodiments are replaced. Further, the present invention contains configurations with which it is possible to perform the same actions and effects or configurations with which it is possible to achieve the same object as the configurations described in the embodiments. Further, the present invention contains configurations in which publicly known technologies are added to the configurations described in the embodiments.
Hereinafter, concrete modified examples are shown.
The microphone unit 3 includes a vibrating membrane 80. The vibrating membrane 80 composes a part of a partition member, which splits the internal space 100 of the case 10 into a first space 112 and a second space 114. The vibrating membrane 80 is provided such that its normal is perpendicular to the surface 15 (i.e., so as to be parallel to the surface 15). The vibrating membrane 80 may be provided beside the second through hole 14 so as not to overlap with the first through hole 12 and the second through hole 14 (at a position other than the places under the first through hole 12 and the second through hole 14). Further, the vibrating membrane 80 may be disposed with an interval from the inner wall surface of the case 10.
The microphone unit 4 includes a vibrating membrane 90. The vibrating membrane 90 composes a part of a partition member, which splits the internal space 100 of the case 10 into a first space 122 and a second space 124. The vibrating membrane 90 is provided such that its normal is perpendicular to the surface 15. The vibrating membrane 90 may be provided so as to be flat on the same plane of the inner wall surface (the surface on the opposite side of the surface 15) of the case 10. The vibrating membrane 90 may be provided so as to block the second through hole 14 from the inner side of the case 10 (the side of the internal space 100). That is, in the microphone unit 4, the space on the inner side of the second through hole 14 may be the second space 124, and the space other than the second space 124 in the internal space 100 may be the first space 122. Thereby, it is possible to design the case 10 to be thin.
The microphone unit 5 includes a case 11. An internal space 101 is formed inside the case 11. Then, the internal space 101 of the case 11 is split into a first space 132 and a second space 134 with the partition member 20. In the microphone unit 5, the partition member 20 is disposed beside the second through hole 14. Further, in the microphone unit 5, the partition member 20 splits the internal space 101 such that the volumes of the first space 132 and the second space 134 are equalized.
The microphone unit 6 has a partition member 21 as shown in
In the microphone unit 7, as shown in
In the microphone unit 8, as shown in
In the microphone unit 9, as shown in
In the microphone unit 13, as shown in
With these microphone units, it is also possible to achieve the same effects as described above. Therefore, it is possible to acquire an electrical signal indicating a user speech containing no noise component by acquiring an electrical signal on the basis of vibration of the vibrating membrane.
This application is based on Japanese Patent Application (JP-A-2008-083294), filed on Mar. 27, 2008, the contents of which are incorporated herein by reference.
Number | Date | Country | Kind |
---|---|---|---|
2008-083294 | Mar 2008 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2009/056393 | 3/27/2009 | WO | 00 | 3/31/2011 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2009/119852 | 10/1/2009 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5226076 | Baumhauer, Jr. et al. | Jul 1993 | A |
8180082 | Takano et al. | May 2012 | B2 |
20040178938 | Pedersen | Sep 2004 | A1 |
20050094832 | Song et al. | May 2005 | A1 |
20120201410 | Takano et al. | Aug 2012 | A1 |
Number | Date | Country |
---|---|---|
0827360 | Mar 1998 | EP |
2218303 | Nov 1989 | GB |
48-21519 | Mar 1973 | JP |
63-232798 | Sep 1988 | JP |
1-268398 | Oct 1989 | JP |
4-076795 | Mar 1992 | JP |
4-217199 | Aug 1992 | JP |
5-260580 | Oct 1993 | JP |
6-284494 | Oct 1994 | JP |
7-312638 | Nov 1995 | JP |
08-191496 | Jul 1996 | JP |
9-331377 | Dec 1997 | JP |
2001-186241 | Jul 2001 | JP |
2005-295278 | Oct 2005 | JP |
0038477 | Jun 2000 | WO |
Entry |
---|
Office Action issued in corresponding Chinese Application No. 200980111077.3 dated Nov. 2, 2012, and English translation thereof (18 pages). |
Extended European Search Report issued in corresponding European Application No. 09725960.0 dated Dec. 13, 2012 (7 pages). |
International Preliminary Report on Patentability and Written Opinion issued in corresponding International Application No. PCT/JP2009/056393 dated Nov. 9, 2010 (10 pages). |
Number | Date | Country | |
---|---|---|---|
20110170726 A1 | Jul 2011 | US |