The present disclosure relates to a MEMS (micro-electro-mechanical systems) acoustic transducer device having improved detection features and to a corresponding electronic apparatus.
The increasing use is known, for example in portable electronic apparatuses, such as tablets, smartphones, digital audio players, photo- or video cameras and consoles for videogames, of acoustic transducers (microphones) including micromechanical detection structures made, at least in part, of semiconductor materials and using MEMS technology.
A MEMS acoustic transducer generally comprises: a micromechanical detection structure, designed to transduce the mechanical quantity to be detected (in particular, acoustic-pressure waves) into an electrical quantity (for example, a capacitive variation, in the case of capacitive detection structures); and an electronic reading circuit, usually integrated as an ASIC (Application-Specific Integrated Circuit), designed to carry out suitable processing operations (amongst which operations of amplification and filtering) of the transduced electrical quantity for supplying an electrical output signal, whether analog (for example, a voltage) or digital (for example, a PDM—Pulse-Density Modulation—signal). This electrical signal is then made available for an external electronic apparatus (the so-called “host”) incorporating the acoustic transducer; for example, it is received at input by a microprocessor control unit of the electronic apparatus.
The micromechanical detection structure of a MEMS acoustic transducer of a capacitive type generally comprises a mobile electrode, obtained as a diaphragm or membrane, set facing a substantially fixed electrode. The mobile electrode is generally anchored, by a perimetral portion thereof, to a substrate, whereas a central portion thereof is free to move or deflect in response to acoustic-pressure waves incident on a surface thereof. The mobile electrode and the fixed electrode provide the plates of a detection capacitor and bending of the membrane that constitutes the mobile electrode causes a variation of capacitance of the detection capacitor.
A MEMS acoustic transducer of a known type is, for example, described in U.S. Patent Publication No. US 2010/0158279 A1, filed in the name of the present Applicant.
MEMS acoustic transducers have advantageous characteristics, amongst which extremely compact dimensions, reduced consumption levels and a good electrical performance and may be used, for example, for providing UIs (user interfaces) for portable electronic apparatuses, in particular for providing the possibility of imparting voice commands (via sounds or speech).
In this regard, known solutions envisage the use of an acoustic transducer for detecting audio signals and a software module executed within the microprocessor control unit of the host electronic apparatus, for execution of algorithms dedicated to voice recognition (activity known as “VAD—Voice-Activity Detection” or “speech-activity detection” or simply “speech detection”, or again as “ASR—Automatic Speech Recognition”) and activation of corresponding features within the user interface.
These solutions have, however, some problems, which do not enable full exploitation of the advantageous characteristics thereof.
In particular, due to requirements of energy consumption, which are particularly stringent in the case of portable electronic apparatuses, typically the voice-recognition module is required to be de-activated at the end of a given detection period, or set in an energy-saving or low-power mode.
Consequently, the voice-recognition features may not be operative all the time and typically require pressing of a key (or execution of a similar operation) by the user for their re-activation, i.e., for starting analysis of the sound activity and waking up the voice-recognition module.
Further, the voice-recognition module constitutes only one of the various operating modules that are managed by the microprocessor control unit of the electronic apparatus that houses the acoustic transducer. Consequently, voice recognition may at times be executed with a certain delay, for example in the case where the control circuit itself is occupied with other features and in any case execution of the voice-recognition module may prevent execution of other important operations by the microprocessor control unit and in any case constitutes an additional computational load for the same control unit.
One embodiment of the present disclosure is directed to a MEMS acoustic transducer device that includes a micromechanical detection structure configured to detect acoustic-pressure waves and supply a transduced electrical quantity and an integrated circuit coupled to the micromechanical detection structure. The integrated circuit includes a reading module configured to generate at output an audio signal as a function of the transduced electrical quantity and a recognition module configured to recognize a sound activity event associated with the transduced electrical quantity.
For a better understanding of the present disclosure, preferred embodiments thereof are now described, purely by way of non-limiting example and with reference to the attached drawings, wherein:
With reference to
The acoustic transducer device 1 comprises a micromechanical detection structure 2, of a known type (not described in detail herein), for example of a capacitive type (and for this reason represented as a capacitor with variable capacitance in
The acoustic transducer device 1 further comprises a package 5, which encloses the micromechanical detection structure 2 and ASIC 4, constituting the mechanical and electrical interface thereof with respect to the external environment, for example enabling entry of acoustic-pressure waves for enabling detection by the micromechanical detection structure 2, and the electrical connection of the ASIC 4 towards the outside world.
The micromechanical detection structure 2 transduces the acoustic-pressure waves coming from the external environment into an electrical quantity (in particular, a capacitive variation).
The ASIC 4 comprises a reading module 4a, which receives at input the transduced electrical quantity and processes it (for example, carrying out amplification and filtering operations), for generating and supplying an electrical output signal, in particular an audio signal Sa, indicative of the acoustic-pressure waves detected by the micromechanical detection structure 2.
As will be described in greater detail hereinafter, the reading module 4a comprises: a transducer stage, for example including a pre-amplifier (in the case of an analog implementation), which receives the electrical quantity and supplies a transduced electrical signal; possible appropriate stages for further processing; and an output stage, for example, including a biasing stage (in the case of analog implementation) or an analog-to-digital converter stage (in the case of digital implementation).
The electrical output signal is analog (for example, a voltage), or digital (for example, a PDM—Pulse-Density Modulated—signal), according to whether the ASIC 4 is of an analog type or includes digital components (for example, a microprocessor logic unit, a microcontroller, an FPGA—Field-Programmable Gate Array, a DSP—Digital Signal Processor).
According to an aspect of the present solution, the ASIC 4 further comprises a recognition module 4b, provided in addition to the reading module 4a and co-operating therewith.
In particular, the recognition module 4b is configured for evaluating, in an autonomous and automatic way, the sound activity associated to the electrical quantity transduced by the micromechanical detection structure 2 and thus associated to the state, or level, of sound activity of the external environment, in order to recognize the occurrence of at least one preset sound event, for example the presence of a sound having a preset level of intensity or of the speech of a user.
The recognition module 4b supplies at output a data signal Sd, which carries the information regarding recognition of the sound event, for example the speech of the user.
According to a further aspect of the present solution, the recognition module 4b has an input that receives a control signal Sc, on the basis of which it is possible to configure parameters of recognition of the sound activity (for example, the characteristics of the preset sound event, such as the speech of the user or the sound to be recognized).
In a possible embodiment, illustrated in
In a different embodiment, illustrated in
In yet a different embodiment, illustrated in
Irrespective of the particular embodiment chosen from among the ones listed previously, the recognition module 4b carries out the operations that are now described with reference to
In a first step, designated by 10, the recognition module 4b receives the transduced electrical signal (whether analog or digital), appropriately pre-processed, from the reading module 4a.
Then (step 12), the recognition module 4b carries out suitable evaluation operations on the transduced electrical signal, for example for evaluating the level and intensity of the signal (and thus of the sound detected).
The intensity of the signal may, for example, be evaluated in terms of the RMS (Root Mean Square) value, the peak value, or particular statistics, for example regarding the number of zero crossings. Alternatively, more complex algorithms may be executed, for example, for recognition of the speech of the user (such as VAD algorithms).
In this regard, a VAD algorithm may envisage the following operations:
an operation of noise reduction, for example by a spectral subtraction in the transduced electrical signal;
an operation of calculation of characteristics or quantities of the transduced electrical signal, or of a portion thereof; and
a classification stage, which applies appropriate rules to the characteristics/quantities calculated, for determining the possible presence of speech.
Next (step 14), the recognition module 4b verifies whether the intensity (or other characteristics) of the transduced electrical signal satisfies a given relation with one or more preset values (for example, it is higher than a threshold) and/or whether the speech of the user has been recognized.
If the above verification step yields a positive result (step 16), the recognition module 4b suitably generates the data signal Sd for associating thereto the information on the fact that the preset sound activity (for example, the speech of the user) has been recognized. As will be described in detail hereinafter, in this step 16, the recognition module 14b may alternatively, or in addition in the case of digital implementation, set to a preset value (for example the high value) the data signal Sd, which may in this case represent an interrupt signal such as to indicate immediately to the outside world, for example to the microprocessor control unit of the host electronic apparatus, the fact that the sound event has been recognized.
According to an aspect of the present solution, once again in the case where it is verified that the intensity of the transduced electrical signal satisfies the given relation with the preset value and/or the speech of the user is recognized, the recognition module 4b is further configured to enable (step 18), a complete electrical supply and/or a complete operation of the reading module 4a, in such a way that the same reading module 4a, in addition to continuing to execute the operations of transduction of the quantity detected, will generate at output the audio signal Sa.
Otherwise (step 19), according to a further aspect of the present solution, the recognition module 4b may disable supply of at least part of the reading module 4a or at least part of the features of the same reading module 4a, in such a way that the acoustic transducer device 1 enters a condition of energy saving or low-power mode. For example, the operations of further processing of the transduced electrical signal for generation at output of the audio signal Sa may be disabled, or else detection in a part of the audio band (not relevant for the recognition activity described above) may be disabled.
In any case, from the aforesaid steps 16, 18, 19, the operations return to step 10, for reception of the transduced electrical signal. It should be noted, in fact, that the recognition module 4b operates continuously in time for detecting the desired sound activity in a timely manner.
The electronic apparatus 20 is, for example, a portable electronic apparatus, such as a tablet, a smartphone, a cellphone, a laptop, a photo camera or a video camera, a device for video-surveillance (or the like) and comprises a processor control module 22, which manages general operation thereof.
The electronic apparatus 20 further comprises, a display 23, data-input elements 24 (for example, a keyboard or a touch screen), a radiofrequency module 25, with respective antenna and an audio encoding module 26 (the so-called “codec”).
The processor control module 22 is operatively coupled to the acoustic transducer device 1 for receiving the audio signal Sa (via the audio encoding module 26) and in particular the data signal Sd indicative of the state of the detected sound activity.
The audio signal Sa may be used for imparting voice commands in a user interface that is managed by the processor control module 22. Advantageously, the data signal Sd may be used for reactivating, or waking up, the processor control module 22 and/or the user interface (operating, as an example, as an unlocking feature for the display 23, instead of a manual input on the keyboard or on the touch screen).
If the data signal Sd is an interrupt signal, waking-up or re-activation is extremely fast and does not require any further processing operation by the processor control module 22, consequently reducing the computational load thereof.
The processor control module 22 may thus operate in an energy-saving or low-power mode (or in stand-by mode) and be appropriately activated, or woken up, by the data signal Sd, which is directly supplied by the acoustic transducer device 1 following upon processing operations executed autonomously and in an independent way.
The acoustic transducer device 1 is, in fact, at least in part, active continuously in time in order to evaluate the state of the sound activity of the surrounding environment, operating in the so-called “sniff mode”.
Advantageously, as highlighted previously, the acoustic transducer device 1 is further able to reconfigure itself, as regards energy consumption, assuming an energy-saving mode (with reduced detection features, for example in terms of the acoustic band detected or in terms of the generation of the audio signal Sa at output) in the case where the sound activity detected so requires (for example, in so far as the intensity of the transduced electrical signal is lower than a given threshold, or no speech of the user is detected). Instead, the acoustic transducer device 1 assumes a normal operating mode, with higher energy consumption and with complete detection operation (for example, as regards the acoustic band of the signal detected and generation of the audio signal Sa), when the sound activity detected indicates the presence of a specific speech of the user or of specific audio events of some other nature.
As indicated previously, in an equally advantageous way, the operating parameters of the recognition module 4b of the acoustic transducer device 1 are further totally configurable from the outside (for example, by the processor control module 22) for reconfiguring, even during operation of the electronic apparatus 20, the characteristics of the sound activity to be recognized.
The data-concentrator module 28, typically including a microcontroller (or a similar processing unit, for example implemented by an FPGA—Field-Programmable Logic Array), has the task of acquiring the detection signals from the acoustic transducer device 1 and possibly from further sensors incorporated in the electronic apparatus 20 (such as an accelerometer, a gyroscope, or a pressure sensor, not illustrated herein), typically coupled to a single digital communication bus and of supplying them to the processor control module 22, possibly after having subjected them to appropriate processing operations.
As a whole, the presence of the data-concentrator module 28 relieves the microprocessor control module 22 of the task of monitoring the outputs of the plurality of sensors, by providing a single acquisition interface and further of the computational burden linked to at least part of the signal-processing operations.
In the specific case, the data-concentrator module 28 receives from the acoustic transducer device 1 the audio signal Sa and the data signal Sd, and then supplies it to the processor control module 22 (either directly or via the audio encoding module 26). Further, the data-concentrator module 28 receives the control signal Sc from the processor control module 22 and supplies it to the acoustic transducer device 1.
A more detailed description of possible embodiments of the acoustic transducer device 1 now follows, in particular as regards the recognition module 4b and the corresponding lines for connection and interfacing towards the outside world (L1, and possibly L2 and L3).
As a whole, it is emphasized that these connection lines for connection towards the outside world may be advantageously obtained exploiting the same pads (or pins) for connection towards the outside world, with which acoustic transducers of a known type are provided, thus not requiring any modification as regards the layout of the electrical connections with the host electronic apparatus (the so-called “footprint”).
In particular,
The reading module 4a comprises a transducer stage 30, for example including a pre-amplifier, which receives the electrical quantity (for example, the capacitive variation) from the micromechanical detection structure 2 and supplies a transduced electrical signal St and, in the embodiment illustrated, an output stage 32, which receives the transduced electrical signal St and generates (for example, by appropriate power components) the audio signal Sa, which is supplied on a connection pad Pad1, coupled to the first connection line L1.
The recognition module 4b comprises: an analysis stage 34, which receives the transduced electrical signal St and carries out appropriate estimations and evaluations of parameters and characteristics of the same signal in order to evaluate the level of the sound activity detected (as described previously) and to supply analysis information; and a decision stage 36, coupled to the analysis stage 34 and designed for carrying out appropriate actions according to the analysis information supplied by the analysis stage 34 on the basis of the estimates and evaluations performed.
In particular, the decision stage 36 controls, according to the analysis information, a modulator stage 38, operatively coupled to the output stage 32, for supplying the data signal Sd, in this case together with the audio signal Sa on the first connection line L1.
In particular, according to a possible embodiment, the decision stage 36 controls the modulator stage 38 in such a way that the audio signal Sa will present an offset equal to Vcc/2 (where Vcc is the supply voltage of the acoustic transducer device 1), as shown in
In this way, it is easy for the processor control module 22 of the electronic apparatus 20, for example by filtering the DC component, to reconstruct the data signal Sd and obtain the information on the state of sound activity.
Alternatively, as shown in
Further, the decision stage 36 is configured to control the reading module 4a for activating an energy-saving state, in the absence of sound activity recognized, for example by switching off the output stage 32 and in this way disabling generation of the audio signal Sa at output.
This embodiment differs from the embodiment described with reference to
In this case, the decision stage 36 controls the modulator stage 38 for supplying on the first connection line L1 a differential signal in the presence of sound activity and a signal saturated at +Vcc or −Vcc in the absence of sound activity, as shown in
Alternatively, as shown in
Also in this embodiment, the data signal Sd is supplied jointly with the audio signal Sa on the same connection line L1.
In this case, the acoustic transducer device 1 has a connection pad Pad3, designed to receive the control signal Sc and further an interface stage 40, coupled at input to the connection pad Pad3 and at output to the recognition module 4b.
In a first variant, the control signal Sc is a voltage signal, having a variable value as a function of the desired adjustment of the recognition parameters, and the interface stage 40 is a reference-voltage reading stage, designed to receive the control signal Sc and to read the voltage value thereof. The reference voltage may, for example, be used for adjusting the value of a threshold voltage to be used for recognition of the sound activity.
A further variant, shown in
This variant thus applies in the case of MEMS acoustic transducers 1 of a hybrid type, i.e., of an analog type, but provided with serial communication interface. It is in any case evident that any serial communication interface that differs from the I2C protocol could likewise be used, for example a protocol of the SPI type, UART type, or the like.
As illustrated in
In this case, the data signal Sd is an interrupt signal, i.e., a digital signal having two possible values, high and low, which is generated by an interrupt stage 42, controlled by the decision stage 36, once again on the basis of the recognition information.
In particular, the interrupt signal may assume a high value, upon recognition of a desired sound activity and a low value, otherwise. The interrupt stage 42 may, for example, include a FET driver.
The audio signal Sa is once again supplied at output on the first connection line L1, at the connection pad Pad1.
Advantageously, the data signal Sd (in this case, constituted by the interrupt signal) is supplied at output on the second connection line L2, at the connection pad Pad2, being directly available for the processor control circuit 22 of the electronic apparatus 20 that houses the acoustic transducer device 1.
Altogether similar considerations apply to the embodiment with differential audio output discussed with reference to
In this regard, for example,
With reference to
The transducer stage 30 comprises in this case, as the output stage 32, an analog-to-digital converter, in particular a sigma-delta modulator, which receives the transduced electrical signal St and generates the audio signal Sa, in this case of a digital type, which is supplied on the first connection line L1, at the connection pad Pad1.
The serial-interface stage, once again designated by 40, of the acoustic transducer device 1, in the example of an I2C type, is here used both for supplying the data signal Sd and for receiving the control signal Sc.
In particular, the decision stage 36 generates the interrupt signal, as data signal Sd, on the second connection line L2 coinciding with the data line associated to the 12C serial interface.
On the same data line, the recognition module 4b of the acoustic transducer device 1 may receive the control signal Sc, of a digital type, containing the control and configuration information for the analysis stage 34.
In this regard, it may be noted that the control signal Sc and the data signal Sd may be present on the second connection line L2 at separate and distinct times. For example, the control signal Sc may be received in a first time interval, during which configuration of the recognition module 4b is, for example, carried out and then, in a second time interval subsequent to the first time interval, the data signal Sd, which is the result of the processing operations carried out by the recognition module 4b, may be supplied.
Via the clock line, SCL, of the I2C serial protocol a same clock signal for being used by the interface stage 40 and the reading module 4a of the acoustic transducer device 1 may be received, in the case where the operating frequency allows it (for example in the case where the frequency is lower than 10 kHz).
Also in this case, transmission at output of the data signal Sd and possible reception at input of the control signal Sc are implemented using the pin present in traditional MEMS acoustic transducers of a digital type, thus not requiring modifications in the layout of the electrical connections towards the outside world, in particular towards the processor control circuit 22 of the electronic apparatus 20 that houses the acoustic transducer device 1.
On the basis of the recognition information resulting from the analysis of the transduced signal St made by the analysis stage 34, the decision stage 36 may also in this case modify the operating state of the reading module 4a, for example for activating an energy-saving state in the absence of significant sound activity, for example by disabling the output stage 32.
Even though they are not described herein, it is evident that, also in the case of the acoustic transducer device 1 of a digital type, the variants discussed previously as regards transmission or reception of the audio, data and control signals Sa, Sd, Sc on one or more connection lines, in a joint or distinct way, may be envisaged.
The advantages of the solution described are evident from the foregoing discussion.
In particular, it is once again emphasized that this solution provides a number of advantageous characteristics, amongst which the following may be cited:
Finally, it is clear that modifications and variations may be made to what has been described and illustrated herein without thereby departing from the scope of the present disclosure, as defined in the annexed claims.
In particular, it is evident that the embodiments previously described are only provided by way of non-exhaustive example, for example as regards the possible implementations of the connection lines for separate or joint output of the audio signal Sa and of the data signal Sd and for reception of the possible control signal Sc.
Furthermore, it is evident that the implementation of the various modules and stages discussed previously in the acoustic transducer device 1 may be alternatively of a hardware or software type, according to the specific requirements and general characteristics of the same acoustic transducer device 1.
The various embodiments described above can be combined to provide further embodiments. These and other changes can be made to the embodiments in light of the above-detailed description. In general, in the following claims, the terms used should not be construed to limit the claims to the specific embodiments disclosed in the specification and the claims, but should be construed to include all possible embodiments along with the full scope of equivalents to which such claims are entitled. Accordingly, the claims are not limited by the disclosure.
Number | Date | Country | Kind |
---|---|---|---|
TO2013A000910 | Nov 2013 | IT | national |
Number | Name | Date | Kind |
---|---|---|---|
5051799 | Paul et al. | Sep 1991 | A |
9866972 | Veneri et al. | Jan 2018 | B2 |
10028054 | Furst et al. | Jul 2018 | B2 |
10715929 | Veneri | Jul 2020 | B2 |
20050251386 | Kuris | Nov 2005 | A1 |
20060034472 | Bazarjani et al. | Feb 2006 | A1 |
20070281761 | Kim | Dec 2007 | A1 |
20090180628 | Stephanson | Jul 2009 | A1 |
20100158279 | Conti et al. | Jun 2010 | A1 |
20130339028 | Rosner et al. | Dec 2013 | A1 |
20150043755 | Fürst et al. | Feb 2015 | A1 |
20150049884 | Ye | Feb 2015 | A1 |
20150256914 | Wiesbauer | Sep 2015 | A1 |
20150350774 | Fürst et al. | Dec 2015 | A1 |
20200336841 | Zeleznik | Oct 2020 | A1 |
Number | Date | Country |
---|---|---|
2 007 167 | Dec 2008 | EP |
2007009465 | Jan 2007 | WO |
Entry |
---|
Raychowdhury et al. ,“A 2.3nJ/Frame Voice Activity Detector Based Audio Front-end for Context-Aware System-on-Chip Applications in 32nm CMOS,” IEEE Custom Integrated Circuits Conference (CICC), San Jose, CA, Sep. 9-12, 2012, pp. 1-4. |
Number | Date | Country | |
---|---|---|---|
20200314559 A1 | Oct 2020 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 15840501 | Dec 2017 | US |
Child | 16902875 | US | |
Parent | 14532951 | Nov 2014 | US |
Child | 15840501 | US |