The present invention pertains to the art of acoustics and, more particularly, to a system and method employing acoustics in connection with identifying the firing of gunshots indoors.
The broad concept of detecting gunshots utilizing acoustics is known. More specifically, it is known to provide a gunshot detecting system including an array of acoustic sensors positioned in a pattern which enables signals from the sensors to be employed not only to detect the firing of a gunshot but also to locate the origin of the shot. One main requirement of such a system is the need to accurately distinguish between the sound produced by a gunshot and a host of other ambient sounds. In at least one known arrangement, a microphone is used to detect each sound, which is then amplified and converted to an electrical signal, and the electrical signal is then compared with a threshold value which a gunshot sound is expected to exceed.
Regardless of the known arrangements in this field, there is still seen to exist a need for a system and method for acoustically detecting the firing of gunshots indoors which represents an improvement in terms of at least one of accuracy, dependability and effectiveness, particularly an acoustic gunshot detection system and method which provides a very low rate of false alarms or false positives while, at the same time, providing high detection rates.
The present invention is directed to a system and method for acoustically detecting the firing of gunshots indoors wherein multiple microphones are utilized individually and in combination to detect sounds inside a building or other structure and, upon sensing a loud impulsive sound, processing is performed to determine if the sound is that of a gunshot. The system and method rely on the acoustic signature of the noise as collected, with the acoustic signature being analyzed to arrive at values which are then compared to adjustable levels that signify a gunshot. If it is determined that a gun has been fired, the system can issue alerts, including notifying emergency personnel.
In a particular embodiment, two MEMs microphones (microelectromechanical microphones) having different sensitivity levels are employed for each sensor. The microphones are omnidirectional, with one microphone having a low sensitivity and a high clipping level, while the other microphone is more sensitive. Within the overall sensor, the two microphones are arranged orthogonal to each other. The sensor preferably includes a single board computer which is configured to sample the multiple MEMs microphones, such that the outputs from the microphones can be continuously analyzed in near real time for a gunshot signature. The sensor is electrically powered and networkable, thereby enabling output signals to be transferred remotely, either for additional processing or other purposes such as alerting emergency personnel of a shooting at a specific location in a particular building.
In accordance with a preferred embodiment of operation, the initial gunshot identification is accomplished by analyzing incoming acoustic signals from the lower sensitivity microphone, particularly by searching the incoming acoustic signal for a peak amplitude level large enough to be at least preliminarily identified as a gunshot. Once an indication of a possible gunshot has been triggered utilizing the lower sensitivity microphone, the sensed impulsive sound is processed. In particular, a series of calculations is performed, and the results of these calculations are compared with established threshold values. If the comparisons are positive, a gunshot verification is established. Upon gunshot verification, a threat message is preferably produced which can be sent from the sensor to another computer used to alert emergency personnel. The threshold levels can be selectively adjusted and set based on the acoustics of the building or other structure, as well as the sensor layout employed.
Additional objects, features and advantages of the present invention will become more readily apparent from the following detailed description of preferred embodiments when taken in conjunction with the drawings wherein like reference numerals refer to corresponding parts in the several views.
With initial reference to
In the most preferred form of the invention, each microphone 15, 20 constitutes a MEMs microphone which is omnidirectional. In accordance with the invention, one microphone 15 has a low sensitivity while the other microphone 20 is more sensitive. For purposes of the invention, a low sensitivity is defined as below −40 dBFS, while “more sensitive” means that microphone 20 has a sensitivity which is at least 70% greater than the sensitivity of the “low sensitivity” microphone 15. In an exemplary embodiment, microphone 15 has a low sensitivity of −46 dBFS, but with a high clipping level, specifically greater than 130 dB. On the other hand, microphone 20 has a sensitivity of −26 dBFS. Although various known microphones could be employed in connection with the invention, in one specific embodiment, currently available MEMs microphone models INMP621ACEZ-R7 and MP34DBO1TR, which are digital, 16 bit microphones manufactured by InvenSense, Inc., are utilized for the first and second microphones 15 and 20, respectively.
In general, the system and method operate by initially identifying an incoming acoustic signal which could potentially be from a gunshot. For this purpose, only outputs from microphone 15 are initially and continuously analyzed for a peak amplitude level large enough to be preliminarily identified as a gunshot. Since microphone 15 has a low sensitivity, it is essentially deaf to normal, everyday sounds emanating from within the building or structure, and its output will likely reach the necessary threshold only for the loudest sounds. By way of example, a typical trigger value would be −5 dBFS (corresponding to a digital value of approximately 18000 based on the 16 bit unit). After a possible gunshot is identified in this manner, the system then processes acoustic signals to determine if the sound was actually from a gunshot in the manner detailed below.
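By way of illustration only, the relationship between the exemplary −5 dBFS trigger value and a 16 bit sample magnitude can be sketched as follows. The function names, the use of NumPy, and the assumption that full scale corresponds to a magnitude of 32768 for signed 16 bit samples are not taken from this description and are introduced solely for the example.

```python
import numpy as np

# Assumption: signed 16-bit samples, so full scale is a magnitude of 32768.
FULL_SCALE_16BIT = 32768


def dbfs_to_counts(level_dbfs, full_scale=FULL_SCALE_16BIT):
    """Convert a peak level in dBFS to a linear sample magnitude."""
    return full_scale * 10.0 ** (level_dbfs / 20.0)


def first_trigger_index(samples, trigger_dbfs=-5.0):
    """Return the index of the first sample whose magnitude exceeds the
    trigger level, or None if the buffer never reaches it."""
    threshold = dbfs_to_counts(trigger_dbfs)
    # Widen to int32 first so that abs(-32768) does not overflow int16.
    magnitudes = np.abs(np.asarray(samples).astype(np.int32))
    hits = np.flatnonzero(magnitudes > threshold)
    return int(hits[0]) if hits.size else None
```

Under these assumptions, −5 dBFS maps to a magnitude of roughly 18400, which is consistent with the approximate digital value of 18000 noted above.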
Reference will now be made to
With these nominal threshold values being established, step 80 is entered wherein the maximum amplitude for each of microphones 15 and 20 is determined (Max_1 and Max_2). Next, the time at which the acoustic signal crosses the threshold is determined in step 90. Basically, there is a time lapse between first microphone 15 sensing the sound and outputting the signal which has been identified as a potential gunshot. Here, it is desired to determine time zero (T_Win_1) for the potential shot and use this time for future calculations. Although other formulations could be employed, for purposes of a preferred embodiment of the invention, T_Win_1 is set equal to the time at which the first microphone amplitude exceeds TH_1, less a predetermined time period, preferably 10 ms, wherein T_Win_1 is required to be less than Win_1, i.e., 0.3 seconds, from the point at which the amplitude is greater than Trig_1. This same calculated time zero is also used in connection with second microphone 20 (T_Win_2=T_Win_1).
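Steps 80 and 90 can be sketched as follows, purely as a non-limiting example. The function names, the sample rate parameter, and the clamping of time zero to the start of the buffer are assumptions made for illustration. The value supplied for TH_1 is left to the caller, and the additional requirement relating T_Win_1 to Win_1 and Trig_1 is not modeled.

```python
import numpy as np


def peak_amplitudes(mic1, mic2):
    """Step 80: maximum absolute amplitude of each microphone (Max_1, Max_2)."""
    max_1 = float(np.max(np.abs(np.asarray(mic1, dtype=float))))
    max_2 = float(np.max(np.abs(np.asarray(mic2, dtype=float))))
    return max_1, max_2


def time_zero(mic1, sample_rate_hz, th_1, pre_roll_s=0.010):
    """Step 90: T_Win_1 in seconds, measured from the start of the buffer.

    T_Win_1 is the time of the first sample whose magnitude exceeds th_1,
    less a pre-roll (10 ms by default). Clamping at 0.0 is an assumption
    made so the result never points before the buffer. Returns None if
    th_1 is never exceeded.
    """
    magnitudes = np.abs(np.asarray(mic1, dtype=float))
    crossings = np.flatnonzero(magnitudes > th_1)
    if crossings.size == 0:
        return None
    t_cross = crossings[0] / sample_rate_hz
    return max(t_cross - pre_roll_s, 0.0)
```

In such a sketch, the value returned by time_zero would simply be reused for the second microphone, in keeping with T_Win_2=T_Win_1.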
Next, step 100 is entered wherein an enhanced autocorrelation is calculated. At this point, it should be recognized that enhanced autocorrelation, which is based on harmonics, is a known technique. Here, a known method is employed to filter data by determining pitches based on frequencies. As enhanced autocorrelation methods are known, further details will not be provided here. By way of example, reference is simply made to the article “A Computationally Efficient Multipitch Analysis Model” by Tolonen et al., IEEE Transactions on Speech and Audio Processing, Vol. 8, No. 6, (November 2000), the contents of which are fully incorporated herein by reference. With the invention, the preset operational enhanced autocorrelation window (EnAuto_Win_1) is employed.
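For orientation only, a simplified form of enhanced autocorrelation can be sketched as follows. This sketch is an assumption-laden reduction of the approach described by Tolonen et al.: it operates on a single channel, uses an ordinary autocorrelation, and omits the pre-whitening, two-channel filtering and magnitude compression of the referenced model. It is not represented to be the calculation used in step 100, and the function name is hypothetical.

```python
import numpy as np


def enhanced_autocorrelation(frame):
    """Simplified enhanced autocorrelation of one frame, indexed by lag.

    1. Compute the autocorrelation and clip it to non-negative values.
    2. Stretch the clipped curve in lag by a factor of two.
    3. Subtract the stretched curve and clip to non-negative values again,
       which suppresses the zero-lag peak and peaks at doubled lags.
    """
    x = np.asarray(frame, dtype=float)
    x = x - x.mean()
    n = x.size
    # Autocorrelation via FFT, zero-padded to 2n to avoid circular wrap-around.
    spectrum = np.fft.rfft(x, 2 * n)
    acf = np.fft.irfft(np.abs(spectrum) ** 2)[:n]
    clipped = np.maximum(acf, 0.0)
    lags = np.arange(n)
    # stretched[k] is clipped evaluated at lag k/2 (linear interpolation).
    stretched = np.interp(lags / 2.0, lags, clipped)
    return np.maximum(clipped - stretched, 0.0)
```

For a pure tone with a period of 20 samples, the largest value of the result under this sketch falls at a lag of 20 samples, while the zero-lag peak and the peak at the doubled lag of 40 samples are suppressed.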
In step 110, a maximum value of the enhanced autocorrelation is determined. For this purpose, values in a first frequency range or band between 15 kHz and 25 kHz are relied upon for microphone 15. Here, the process is looking to establish a peak in this frequency range (EA_Max_15_25_1). Next, all amplitudes in a slightly larger, second frequency range, preferably 10 kHz to 25 kHz, are summed in step 120 (EA_10_25_Sum_1). Thereafter, all amplitudes in a third, distinct frequency range, preferably between 2 kHz and 5.5 kHz, are summed in step 130 (EA_2_55_Sum_1). These two summation steps in distinct ranges are performed to help avoid a false positive identification, based on knowing that sounds from a gunshot span a broad frequency range as compared to many other potentially sensed sounds.
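Steps 110 through 130 can be illustrated with the following sketch. It assumes that the enhanced autocorrelation values are already paired with a frequency, in hertz, for each value; how that pairing is obtained is not specified here and is left to the caller. The function name, the inclusive band edges and the return of 0.0 for an empty band are likewise assumptions for illustration.

```python
import numpy as np


def band_features(freqs_hz, ea_values):
    """Peak and band sums of enhanced autocorrelation values.

    freqs_hz and ea_values are equal-length sequences; freqs_hz[i] is the
    frequency (Hz) associated with ea_values[i].
    """
    freqs = np.asarray(freqs_hz, dtype=float)
    values = np.asarray(ea_values, dtype=float)

    def in_band(lo_hz, hi_hz):
        # Band edges are treated as inclusive (an assumption).
        return (freqs >= lo_hz) & (freqs <= hi_hz)

    peak_band = values[in_band(15_000, 25_000)]
    return {
        # Step 110: peak in the 15 kHz to 25 kHz band.
        "EA_Max_15_25_1": float(peak_band.max()) if peak_band.size else 0.0,
        # Step 120: sum over the 10 kHz to 25 kHz band.
        "EA_10_25_Sum_1": float(values[in_band(10_000, 25_000)].sum()),
        # Step 130: sum over the 2 kHz to 5.5 kHz band.
        "EA_2_55_Sum_1": float(values[in_band(2_000, 5_500)].sum()),
    }
```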
With the above calculations complete, the algorithm moves to step 140 wherein a ratio of the summation values determined in steps 130 and 120 is determined, i.e., Ratio_EA_1=EA_2_55_Sum_1/EA_10_25_Sum_1. In this step, the denominator cannot equal zero. Therefore, if EA_10_25_Sum_1 equals zero, Ratio_EA_1 is set to a predetermined value, such as 3.0. Finally, in step 150, the RMS of microphone 20 is calculated. More specifically, the RMS of microphone 20 (RMS_Full_2) is calculated using Win_1 and starting at T_Win_2. Basically, these steps are performed to see how the sound dissipates over a relatively short period of time, say 0.3 seconds, for microphone 20. Here it should be noted that the sound associated with a gunshot takes a fair amount of time to dissipate versus, say, tapping a microphone. Therefore, it can be verified here that the RMS stays high for a requisite period of time. Additionally, it should be recognized that signals from microphone 20 can be used for further verification, e.g., sensing sounds of screaming versus laughter or minor chatter.
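Steps 140 and 150 can be sketched as follows, again only as an example. The function names, the sample rate parameter, the rounding of times to sample indices and the return of 0.0 for an empty window are assumptions; the fallback value of 3.0 and the 0.3 second window follow the exemplary values given above.

```python
import numpy as np


def ratio_ea(ea_2_55_sum, ea_10_25_sum, fallback=3.0):
    """Step 140: Ratio_EA_1, with a predetermined value when the
    denominator EA_10_25_Sum_1 is zero."""
    if ea_10_25_sum == 0:
        return fallback
    return ea_2_55_sum / ea_10_25_sum


def rms_window(mic2, sample_rate_hz, t_win_2, win_1=0.3):
    """Step 150: RMS of the second microphone (RMS_Full_2) over a window of
    win_1 seconds starting at t_win_2 seconds into the buffer."""
    start = int(round(t_win_2 * sample_rate_hz))
    stop = start + int(round(win_1 * sample_rate_hz))
    window = np.asarray(mic2[start:stop], dtype=float)
    if window.size == 0:
        return 0.0  # Assumption: an empty window yields an RMS of zero.
    return float(np.sqrt(np.mean(window ** 2)))
```

A short, sharp sound such as a tap would be expected to yield a lower RMS over the 0.3 second window than a sound which remains loud throughout the window, which is the distinction this step is intended to capture.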
Once the calculations associated with the
Although described with reference to preferred embodiments of the invention, it should be readily understood that various changes and/or modifications can be made to the invention without departing from the spirit thereof. Overall, it has been found that employing two microphones with low and high sensitivities, and making a detection decision based on at least certain threshold, root-mean-square (RMS), time window, and autocorrelation frequency values, provides a very low rate of false alarms or false positives while, at the same time, providing high detection rates. In any event, the invention is only intended to be limited by the scope of the following claims.
This application is a § 371 National Phase Application of International Application No. PCT/US2017/046940, filed on Aug. 15, 2017, now International Publication No. WO 2018/044553, published on Mar. 8, 2018, which International Application claims the benefit under 35 USC 119(e) of U.S. Provisional Application No. 62/380,701, filed on Aug. 29, 2016, both of which are incorporated herein by reference in their entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2017/046940 | 8/15/2017 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2018/044553 | 3/8/2018 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
4091366 | Lavallee | May 1978 | A |
4279027 | Van Sloun | Jul 1981 | A |
5237512 | Davidson | Aug 1993 | A |
5455868 | Sergent | Oct 1995 | A |
5504717 | Sharkey | Apr 1996 | A |
5917775 | Salisbury | Jun 1999 | A |
6185153 | Hynes et al. | Feb 2001 | B1 |
6538570 | Smith | Mar 2003 | B1 |
8421015 | Scott | Apr 2013 | B1 |
9830932 | Gunderson | Nov 2017 | B1 |
10210726 | Carter | Feb 2019 | B2 |
20030021188 | Baranek | Jan 2003 | A1 |
20040161120 | Petersen | Aug 2004 | A1 |
20060256660 | Berger | Nov 2006 | A1 |
20090059724 | Scanlon | Mar 2009 | A1 |
20090180628 | Stephanson | Jul 2009 | A1 |
20110218952 | Mitchell | Sep 2011 | A1 |
20110246402 | Burman | Oct 2011 | A1 |
20120300587 | Azimi-Sadjadi | Nov 2012 | A1 |
20140327543 | Showen | Nov 2014 | A1 |
20140361886 | Cowdry | Dec 2014 | A1 |
20150070166 | Boyden | Mar 2015 | A1 |
20150071450 | Boyden | Mar 2015 | A1 |
20150106095 | Mitchell | Apr 2015 | A1 |
20150131411 | Gudgel et al. | May 2015 | A1 |
20150347079 | Price | Dec 2015 | A1 |
20150364028 | Child | Dec 2015 | A1 |
20160157032 | Kane | Jun 2016 | A1 |
20160209390 | Overcast | Jul 2016 | A1 |
20160232774 | Noland | Aug 2016 | A1 |
20160260307 | Skorpik | Sep 2016 | A1 |
20160334500 | Damarla | Nov 2016 | A1 |
20170103776 | Kim | Apr 2017 | A1 |
20170123038 | Griggs | May 2017 | A1 |
20170169686 | Skorpik | Jun 2017 | A1 |
20180293858 | Carter | Oct 2018 | A1 |
20190347920 | Anderson | Nov 2019 | A1 |
20190371160 | Pace | Dec 2019 | A1 |
Entry |
---|
International Search Report and Written Opinion, dated Nov. 7, 2017, from International Application No. PCT/US2017/046940, filed on Aug. 15, 2017. 15 pages. |
Tolonen, T., et al., “A Computationally Efficient Multipitch Analysis Model,” IEEE Transactions on Speech and Audio Processing, 8(6): 708-716 (2000). |
International Preliminary Report on Patentability, dated Mar. 14, 2019, from International Application No. PCT/US2017/046940, filed on Aug. 15, 2017. 9 pages. |
Number | Date | Country | |
---|---|---|---|
20190180606 A1 | Jun 2019 | US |
Number | Date | Country | |
---|---|---|---|
62380701 | Aug 2016 | US |