Distinguishing driver and passenger phone use is a building block for a variety of applications but its greatest promise arguably lies in helping reduce driver distraction. Cell phone distractions have been a factor in high-profile accidents and are associated with a large number of automobile accidents. For example, a National Highway Traffic Safety Administration (“NHTSA”) study identifies cell phone distraction as a factor in crashes that led to 995 fatalities and 24,000 injuries in 2009. This has led to increasing public attention and the banning of handheld phone use in several US states as well as many countries around the world.
Unfortunately, an increasing amount of research suggests that the safety benefits of handsfree phone operation are marginal at best. The cognitive load of conducting a cell phone conversation seems to increase accident risk, rather than the holding of a phone to the ear. Of course, texting, email, navigation, games and many other apps on smartphones are also increasingly competing with driver attention and pose additional dangers. This has led to a renewed search for technical approaches to the driver distraction problem. Such approaches run the gamut from improved driving mode user interfaces, which allow quicker access to navigation and other functions commonly used while driving, to apps that actively prevent phone calls. In between these extremes lie more subtle approaches: routing incoming calls to voicemail or delaying incoming text notifications.
All of these applications would benefit from and some of them depend on automated mechanisms for determining when a cell phone is used by a driver. Prior research and development has led to a number of techniques that can determine whether a cell phone is in a moving vehicle—for example, based on cell phone handoffs, cell phone signal strength analysis, or speed as measured by a Global Positioning System (“GPS”) receiver. The latter approach appears to be the most common among apps that block incoming or outgoing calls and texts. That is, the apps determine that the cell phone is in a vehicle and activate blocking policies once speed crosses a threshold. Some apps require the installation of specialized equipment in an automobile's steering column, which then allows blocking calls/text to/from a given phone based on car's speedometer readings, or even rely on a radio jammer. None of these solutions, however, can automatically distinguish a driver's cell phone from a passenger's.
While there does not exist any detailed statistics on driver versus passenger cell phone use in vehicles, a federal accident database reveals that about 38% of automobile trips include passengers. Not every passenger carries a phone—still this number suggests that the false positive rate when relying only on vehicle detection would be quite high. It would probably be unacceptably high even for simple interventions such as routing incoming calls to voicemail. Distinguishing drivers and passengers is challenging because car and phone usage patterns can differ substantially. Some might carry a phone in a pocket, while others place it on the vehicle console. Since many vehicles are driven mostly by the same driver, one promising approach might be to place a Bluetooth device into the vehicles, which allows the phone to recognize it through the Bluetooth identifier. Still, this cannot cover cases where one person uses the same vehicle as both driver and passenger, as is frequently the case for family cars. Also, some vehicle occupants might pass their phone to others, to allow them to try out a game, for example.
The present invention concerns systems and methods for determining a location of a device (e.g., a Mobile Communication Device (“MCD”)) in a space (e.g., a confined space of the interior of a vehicle) in which a plurality of external speakers are disposed. The methods involve: optionally communicating the discrete audio signal from the MCD to an external audio unit disposed within the space via a short range communication (e.g., a Bluetooth communication); and causing the discrete audio signal to be output from the external speakers. In some scenarios, the discrete audio signal is sequentially output from the external speakers in the pre-assigned order. Subsequently, the combined audio signal is received by a single microphone of the MCD. The combined audio signal is defined by the discrete audio signal which was output from the external speakers. The discrete audio signal may comprise at least one sound component (e.g., a beep) having a frequency greater than frequencies within an audible frequency range for humans. Thereafter, the MCD analyzes the combined audio signal to detect an arriving time of the sound component of the discrete audio signal output from a first speaker (e.g., a left speaker or a right speaker) and an arriving time of the sound component of the discrete audio signal output from a second speaker (e.g., a left speaker or a right speaker). A first relative time difference is then determined between the discrete audio signals arriving from the first and second speakers based on the arriving times which were previously detected. The location of the MCD within the confined space is determined based on the first relative time difference.
In some scenarios, the first relative time difference is computed using a first number of samples and a sampling frequency. The first number of samples comprises the number of samples between the sound component of the discrete audio signal output from the first speaker (e.g., a front-left speaker) and the sound component of the discrete audio signal output from the second speaker (e.g., a front-right speaker). A first physical distance is then computed between the MCD and two first speakers (i.e., the first and second speakers) using the first relative time difference and speed of sound. Next, the first physical distance is compared to a threshold value. The location of the MCD can be determined based on results of the comparing. For example, the results of the comparing may indicate that the MCD is located within a driver-side portion of the confined space of a vehicle's interior or a passenger-side portion of the confined space of the vehicle's interior. In this case, the MCD may subsequently perform one or more operations to reduce distractions of a driver of the vehicle based on its determined location within the confined space of the vehicle's interior.
In some scenarios, the first relative time difference is computed using the discrete audio signal output from the first speaker (e.g., a front-left speaker) and the sound component of the discrete audio signal output from the second speaker (e.g., a rear-left speaker). Also, a second relative time difference is determined between the discrete audio signals arriving from third and fourth speakers (e.g., the front-right speaker and the rear-right speaker) using a second number of samples and the sampling frequency. The second number of samples comprises the number of samples between the sound component of the discrete audio signal output from the third speaker and the sound component of the discrete audio signal output from the fourth speaker. A second physical distance is then determined between the MCD and two second speakers (i.e., the third and fourth speakers) using the second relative time difference and the speed of sound. An average of the first and second physical distances is then compared to a threshold value. The location of the MCD can then be determined based on results of the comparing. For example, the results of the comparing may indicate that the MCD is located within a front portion of the confined space of a vehicle's interior or a rear portion of the confined space of the vehicle's interior. In this case, the MCD may perform one or more operations to reduce distractions of a driver of the vehicle based on its determined location within the confined space of the vehicle's interior.
Embodiments will be described with reference to the following drawing figures, in which like numerals represent like items throughout the figures, and in which:
It will be readily understood that the components of the embodiments as generally described herein and illustrated in the appended figures could be arranged and designed in a wide variety of different configurations. Thus, the following more detailed description of various embodiments, as represented in the figures, is not intended to limit the scope of the present disclosure, but is merely representative of various embodiments. While the various aspects of the embodiments are presented in drawings, the drawings are not necessarily drawn to scale unless specifically indicated.
The present invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described embodiments are to be considered in all respects as illustrative. The scope of the invention is, therefore, indicated by the appended claims. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.
Reference throughout this specification to features, advantages, or similar language does not imply that all of the features and advantages that may be realized with the present invention should be or are in any single embodiment of the invention. Rather, language referring to the features and advantages is understood to mean that a specific feature, advantage, or characteristic described in connection with an embodiment is included in at least one embodiment of the present invention. Thus, discussions of the features and advantages, and similar language, throughout the specification may, but do not necessarily, refer to the same embodiment.
Furthermore, the described features, advantages and characteristics of the invention may be combined in any suitable manner in one or more embodiments. One skilled in the relevant art will recognize, in light of the description herein, that the invention can be practiced without one or more of the specific features or advantages of a particular embodiment. In other instances, additional features and advantages may be recognized in certain embodiments that may not be present in all embodiments of the invention.
Reference throughout this specification to “one embodiment”, “an embodiment”, or similar language means that a particular feature, structure, or characteristic described in connection with the indicated embodiment is included in at least one embodiment of the present invention. Thus, the phrases “in one embodiment”, “in an embodiment”, and similar language throughout this specification may, but do not necessarily, all refer to the same embodiment.
As used in this document, the singular form “a”, “an”, and “the” include plural references unless the context clearly dictates otherwise. Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art. As used in this document, the term “comprising” means “including, but not limited to”.
Introduction
The present invention generally concerns an Acoustic Relative-Ranging System (“ARRS”) that leverages an existing audio infrastructure of a vehicle, building or room to determine an approximate location of an MCD within a confined space thereof. In some scenarios, the ARRS is used to determine on which car seat an MCD is being used. Accordingly, the ARRS may rely on the assumptions that: (i) the car seat location is one of the most useful decimators for distinguishing driver and passenger cell phone use; and (ii) most cars will allow phone access to the car audio infrastructure. Indeed, an industry report discloses that more than 8 million built-in Bluetooth systems were sold in 2010 and predicts that 90% of new cars will be equipped in 2016. Therefore, in the car scenario, ARRS may leverage this Bluetooth access to the audio infrastructure to avoid the need to deploy additional infrastructure in cars. In all scenarios, the classifier's strategy first uses high frequency sound components (e.g., beeps) sent from an MCD (e.g., a Smartphone) over a short range communication connection (e.g., a Bluetooth connection) through the vehicles, building or room's stereo system. The sound components (e.g., beeps) are recorded by the MCD, and then analyzed to deduce the timing differentials between the left and right speakers (and if possible, front and rear ones). From the timing differentials, the MCD can self-determine which side or quadrant of the vehicle, building or room it is in.
While acoustic localization and ranging have been extensively studied for human speaker localization through microphone arrays, the present invention addresses several unique challenges in the ARRS. First, the ARRS uses only a single microphone and multiple speakers, requiring a solution that minimizes interference between the speakers. Second, the small confined space inside a vehicle, building or room presents a particularly challenging multipath environment. Third, any sounds emitted should be unobtrusive to minimize distraction. Salient features of the present solution that address these challenges are:
By relaxing the problem from full localization to classification of whether the MCD is in a particular area (e.g., a driver or passenger seat area) of a confined space, a first generation system may be enabled through a software application (e.g., a smart-phone application) that is practical today in all cases with built-in short range communication technology (e.g., Bluetooth technology). This is because left-right classification can be achieved with only stereo audio.
Discussion of Exemplary AARS
Embodiments will now be described with respect to
In the vehicle context, embodiments generally relate to ARRSs and methods employing an Acoustic Relative-Ranging (“ARR”) approach for determining which car seat an MCD is being used. Notably, the present systems and methods do not require the addition of dedicated infrastructure to the vehicle. In many vehicles (e.g., cars), the speaker system is already accessible over Short Range Communication (“SRC”) connections (e.g., Bluetooth connections) and such systems can be expected to trickle down to most new vehicles (e.g., cars) over the next few years. This allows software solutions and/or hardware solutions. The ARR approach leads to the following additional challenges: unobtrusiveness; robustness to noise and multipath; and computational feasibility on MCDs (e.g., Smartphones). With regard to the unobtrusiveness challenge, the sounds emitted by the audio system should not be perceptible to the human ear, so that it does not annoy or distract the vehicle occupant. With regard, to the robustness challenge, engine noise, tire and road noise, wind noise, and music or conversations all contribute to a relatively noisy in-vehicle environment. A vehicle is also a relatively small confined space creating a challenging heavy multipath scenario. With regard to the computation feasibility challenge, standard MCD (e.g., Smartphone) platforms should be able to execute signal processing and detection algorithms with sub-second runtimes. The manner in which each of these challenges is addressed by the present invention will become evident as the discussion progresses.
Referring now to
As shown in
During operation of system 100, components 114, 116, 118, 120, 130 are used in conjunction with the MCD 104 to perform ARR. ARR operations can be triggered in various ways. For example, ARR operations can be triggered in response to: the reception of an incoming communication (e.g., a call, a text message or an email) at the MCD 104; a registration of the MCD 104 with the audio unit 130 via a Short Range Communication (“SRC”); the detection of movement of the MCD 104 (e.g., through the use of an accelerometer thereof) and/or vehicle 102; the detection that the MCD 104 is in proximity of the vehicle 102; the detection of a discrete audio signal transmitted from another MCD in proximity to MCD 104 or the audio unit 130 of the vehicle 102; and/or the auto-pairing of the MCD with the SRC equipment of the vehicle. The SRC can include, but is not limited to, a Near Field Communication (“NFC”), InfRared (“IR”) technology, Wireless Fidelity (“Wi-Fi”) technology, Radio Frequency Identification (“RFID”) technology, Bluetooth technology, and/or ZigBee technology.
When the ARR operations are triggered, the MCD 104 generates and transmits an audio signal to the speakers 114, 116, 118, 120 of the vehicle via an SRC (e.g., a Bluetooth communication). In some scenarios, the audio signal is inserted into a music stream being output from the MCD. The audio signal is then output through the speakers 114, 116, 118, 120. The MCD 104 records the sound emitted from the speakers 114, 116, 118, 120. The recorded sound is then processed by the MCD 104 to evaluate propagation delay. Rather than measuring absolute delay (which is affected by unknown processing delays on the MCD 104 and in the audio unit 130), the system 100 measures relative delay between the audio signal output from the left and right speaker(s). This is similar in spirit to time-difference-of-arrival localization and does not require clock synchronization.
In vehicle 102, the speakers 114, 116, 118, 120 are placed so that the plane equidistant to the left and right (front) speaker locations separates the driver-side and passenger-side area. This has two benefits. First, for front seats 106, 108 (the most frequently occupied seats), the system 100 can distinguish the driver seat and the passenger seat by measuring only the relative time difference between the front speakers 114, 118. Second, the system 100 does not require any fingerprinting or calibration since a time difference of zero always indicates that the MCD 104 is located between driver and passenger (on the center console).
The two-channel approach is practical with current hands-free and SRC (e.g., Bluetooth) profiles which provide for stereo audio. The concept can be easily extended to a four-channel approach, which promises better accuracy but requires updated surround sound audio units and SRC profiles of the vehicle 102. The two-channel approach and the four-channel approach will both be described herein.
System 100 differs from typical acoustic human speaker localization, in that a single microphone and multiple sound sources are used for ARR, rather than a microphone array to detect a single sound source. This means that time differences only need to be measured between signals arriving at the same microphone. This time difference can be estimated simply by counting the number of audio samples between the start of two audio signals. Most modern MCDS (e.g., Smartphones) offer an audio sampling frequency of 44.1 kHz, which given the speed of sound theoretically provides an accuracy of about 0.8 cm—the resolution under ideal situation, since the audio signal will be distorted.
The ARR technique of the present invention employs a Time-Division Multiplexing (“TDM”) approach for addressing signal interference and multi-signal differentiation. The TDM approach involves emitting sound from the speakers 114, 116, 118, 120 at different points in time, with a sufficiently large gap such that no interference occurs therebetween. The sound is emitted from the speakers 114, 116, 118, 120 in a pre-assigned order. The pre-assigned order may be pre-stored in the audio unit 130 and/or MCD 104. Additionally or alternatively, the pre-assigned order may be dynamically generated during each iteration of the ARR operations based on one or more parameters by the audio unit 130 and/or MCD 104. The parameters can include, but are not limited to, the manufacturer of the vehicle 102, the model of the vehicle 102, the production year of the vehicle 102, and/or the type of audio unit 130 installed in the vehicle 102.
Referring now to
The hardware architecture of
The controller 210 also provides information to the transmitter circuitry 206 for encoding and modulating information into RF signals. Accordingly, the controller 210 is coupled to the transmitter circuitry 206 via an electrical connection 238. The transmitter circuitry 206 communicates the RF signals to the antenna 202 for transmission to an external device (e.g., a node of a network) via the Rx/Tx switch 204.
An antenna 240 may be coupled to an SRC transceiver 214 for transmitting and receiving SRC signals (e.g., Bluetooth signals). The SRC transceiver 214 may include, but is not limited to, an NFC transceiver or a Bluetooth transceiver. NFC transceivers and Bluetooth transceivers are well known in the art, and therefore will not be described in detail herein. However, it should be understood that the SRC transceiver 214 transmits audio signals to an external audio unit (e.g., audio unit 130 of
The controller 210 may store received and extracted information in memory 212 of the MCD 104. Accordingly, the memory 212 is connected to and accessible by the controller 210 through electrical connection 232. The memory 212 may be a volatile memory and/or a non-volatile memory. For example, the memory 212 can include, but is not limited, a RAM, a DRAM, an SRAM, a ROM and a flash memory. The memory 212 may also comprise unsecure memory and/or secure memory. The memory 212 can be used to store various other types of information therein, such as authentication information, cryptographic information, location information and various service-related information.
As shown in
The controller 210 is also connected to a user interface 230. The user interface 230 comprises input devices 216, output devices 224 and software routines (not shown in
The display 328, keypad 320, directional pad (not shown in
The ARR operations can include performing a calibration process to select values of certain parameters (e.g., threshold values) based on the manufacturer of the vehicle 102, the model of the vehicle 102, the production year of the vehicle 102, and/or the type of audio unit 130 installed in the vehicle 102. The ARR operations can also include selecting a two channel ARR technique or a four channel ARR technique for subsequent use in determining the approximate location of the MCD 104 within a confined space. The type of ARR technique can be selected based on the manufacturer of the vehicle 102, the model of the vehicle 102, the production year of the vehicle 102, and/or the type of audio unit 130 installed in the vehicle 102.
The ARR operations can further involve: determining whether or not a vehicle is moving; receiving an incoming communication (e.g., a call, a text message, or an email); generating an audio signal in response to the reception of the incoming communication; causing an external audio unit to generate the audio signal; cause the audio signal to be transmitted from the MCD 104 to an external audio unit (e.g., audio unit 130 of
Such safety operations can include, but are not limited to: automatically displaying less distracting driver user interfaces; outputting an indicator only for calls and/or text messages received from certain people; directing incoming calls to voicemail when they are being received from select external devices and/or people; causing a driving status to be displayed in friends dialer applications to discourage them from calling; and/or the MCD could be locked to prevent out going communications. The safety operations can also involve integrating with vehicle controls. Perhaps a driver chatting on the phone should increase the responsiveness of a vehicle's braking system, since this driver is more likely to brake late. The level of intrusiveness of lane-departure warning and driver asset systems could also be affected as a result of the safety operations.
Referring now to
After triggering the ARR operations, optional steps 308 and 310 may be performed. Step 308 involves optionally performing a calibration process to select values for certain parameters, such as threshold values for two-channel and/or four-channel ARR processes to determine an approximate location of the MCD within a confined space of the vehicle. The parameters values can be selected based on the manufacturer of the vehicle 102, the model of the vehicle 102, the production year of the vehicle 102, and/or the type of audio unit 130 installed in the vehicle 102. The optional calibration process may not be performed by the MCD in step 308 when the calibration process was previously performed, such as at the factory.
Step 308 also involves transmitting an audio signal from the MCD to an audio unit (e.g., audio unit 130 of
Next, step 312 is performed where the audio signal is output from the vehicle's speaker (e.g., speakers 114-120 of
Subsequent to completing step 312, the audio signals are received by the microphone (e.g., microphone 222 of
A decision is then made in step 320 to determine which speaker is closest to the MCD based on the results of the propagation delay evaluation of step 318. Once the closest speaker is identified, step 322 is performed where one or more select operations are performed by the MCD, such as safety operations to reduce distraction to a driver of the vehicle. The safety operations can include, but are not limited to, re-directing an incoming communication to a mailbox or voice mail without outputting an auditory or tactile indicator indicating that an incoming communication is being received by the MCD. Thereafter, step 324 is performed where method 300 ends or other processing is performed.
Referring now to
Δ(Tij)=Δt′ij−Δtij;i≠j i,j=1,2,3,4 (1)
When the microphone is equidistant from the two speakers i and j, Δ(Tij)=0. If Δ(Tij)<0, then the MCD (e.g., MCD 104) is closer to speaker i. If Δ(Tij)>0, then the MCD (e.g., MCD 104) is closer to speaker j.
In the present system 100, the absolute time the sounds emitted by the speakers (e.g., speakers 114 and 118 of
An exemplary discrete audio signal design will now be described in relation to
Such high frequency sounds are also hard to perceive by the human auditory system. Although the frequency range of human hearing is generally considered to be 20 Hz to 20 kHz, high frequency sounds must be much louder to be noticeable. This is characterized by the Absolute Threshold of Hearing (“ATH”), which refers to the minimum sound pressure that can be perceived in a quiet environment.
Fortunately, the MCD microphone (e.g., microphone 222 of
The length of the sound components (e.g., beeps) impacts the overall detection time as well as the reliability of recording the sound components (e.g., beeps). Too short a sound component (e.g., a beep) is not picked up by the MCD microphone (e.g., microphone 222 of
Referring now to
As shown in
Next in step 606, the filtered audio signal is processed to detect at least a first Arriving Beep Signal (“ABS”) and a second ABS corresponding to signals emitted from a first set of speakers (e.g., the front speakers). Thereafter in step 608, a first sound component (e.g., a first beep) of the first ABS and the first sound component (e.g., a first beep) of the second ABS are identified, and their start times are noted.
Detecting the arrival of an ABS under heavy multipath in-car environments is challenging because the sound components (e.g., beeps) can be distorted due to interference from the multi-path components. In particular, the commonly used correlation technique, which detects the point of maximum correlation between a received signal and a known transmitted signal, is susceptible to such distortion. Furthermore, the use of high frequency sound components (e.g., beeps) can lead to distortions due to the reduced microphone sensitivity in this range.
For these reasons, a novel approach is used with the present invention is some scenarios. The novel approach involves detecting the first strong ABS in a specified frequency band. The signal detection is possible since there is relatively little noise and interference from outside sources in the chosen frequency range (e.g., a 16-18 kHz range or an 18-20 kHz range). This is known as sequential change-point detection in signal processing. The basic idea is to identify the first ABS that deviates from the noise after filtering out background noise. Let {X1, . . . , Xn} be a sequence of recorded audio signal by the MCD over n time points. Initially, without the sound component (e.g., beep), the observed signal comes from noise, which follows a distribution with density function p0. Later on, at an unknown time , the distribution changes to density function p1 due to the transmission of an audio (e.g., beep) signal. The objective is to identify this time , and to declare the presence of a sound component (e.g., a beep) as quickly as possible to maintain the shortest possible detection delay, which corresponds to ranging accuracy.
To identify time , the problem is formulated as sequential change-point detection. In particular, at each time point , a determination is made as to whether or not an audio (e.g., a beep) signal is present and, if so, when the audio (e.g., beep) signal is present. Since the algorithm runs online, the sound component (e.g., beep) may not yet have occurred. Thus based on the observed sequence up to time point t {X1, . . . , Xn}, the following two hypotheses are distinguished and time point is identified.
H0: Xi follows p0, i=1, . . . , t
H1: Xi follows p0, i=1, . . . , −1
Xi follows p1, i=, . . . , t
If Ho is true, the algorithm repeats once more data samples are available. If the observed signal sequence {X1, . . . , Xn} includes one sound component (e.g., a beep) recorded by the microphone, the procedure will reject H0 with the stopping time td, at which the presence of the audio signal is declared. A false alarm is raised whenever the detection is declared before the change occurs, i.e., when td<. If td≧, then (td−) is the detection delay, which represents the ranging accuracy.
Sequential change-point detection requires that the signal distribution for both noise and the sound component (e.g., beep) is known. This is difficult because the distribution of the audio signal frequently changes due to multipath distortions. Thus, rather than trying to estimate this distribution, the cumulative sum of difference to the averaged noise level is used. This allows first arriving signal detection without knowledge knowing the distribution of the first ABS. Suppose the MCD estimates the mean value μ of noise starting at time t0 until ti, which is the time that the MCD starts transmitting the sound component (e.g., beep). It is desirable to detect the first ABS as the signal that significantly deviates from the noise in the absence of the distribution of the first ABS. Therefore, the likelihood that the observed signal is from Xi the sound component (e.g., beep) can be approximated as
l(X1)=(Xi−μ)
given that the recorded audio signal is stronger than the noise. The likelihood l(Xi) shows a negative drift if the observed signal Xi is smaller than the mean value of the noise, and a positive drift after the presence of the sound component (e.g., beep), i.e., Xi stronger than noise. The stopping time for detecting the presence of the sound component (e.g., beep) is given by
t
d
=inf(k|sk>h), satisfy sm>h, m=k, . . . , k+W
where h is the threshold, W is the robust window used to reduce the false alarm, and sk is the metric for the observed signal sequence {X1, . . . , Xk}, which can be calculated recursively:
s
k=max{sk-1+l(Xk),0}
where s0=0.
In some scenarios, the threshold was set as the mean value sk plus three standard deviations sk when k belongs to t0 to t1 (i.e., 99.7% confidence level of noise). The window W (e.g., W=40) is used to filter out outliers in the cumulative sum sequence due to any sudden changes of the noise. At the same time point that the MCD starts to emit a sound component (e.g., a beep sound), the MCD starts to record received audio signals. Once the first ABS is detected, the window W is shifted to the approximate time point of the next sound component (e.g., a next beep) since the fixed interval between two adjacent sound components (e.g., beeps) is known.
Referring again to
In step 610, the number of samples Sij is determined between the first sound component (e.g., beep) of the first ABS and the first sound component (e.g., beep) of the second ABS. Next in step 612, a time difference ΔTij is computed between the two speakers (e.g., a front-left speaker i and a front-right speaker j) using the number of samples Sij and a sampling frequency f. The computation of step 612 can be defined by the following mathematical equation (2).
ΔTij=Sij/f (2)
Thereafter in step 614, a physical distance Δdij is computed between the MCD and the two speakers using the time difference ΔTij and the speed of sound c. The computation performed in step 614 can be defined by the following mathematical equation (3).
Δdij=c·ΔTij (3)
After completing the relative ranging operations of steps 610-614, a determination is made in step 616 as to whether the stereo system of the vehicle is a two channel stereo system. If the stereo system is a two channel stereo system [616:YES], then method 600 continues with steps 618-622 in which location classification operations are performed to determine which one of two speakers (e.g., a front-left speaker or a front-right speaker) is closest to the MCD. In this regard, step 618 involves making a determination as to whether or not the physical distance Δdij is greater than a threshold value THlr. In some scenarios, the value of THlr is selected to be zero. Embodiments of the present invention are not limited in this regard. For example, the value of THlr can alternatively be set to −5 cm since drivers are often likely to place the MCD in a center console of the vehicle. If the physical distance Δdij is greater than the threshold value THlr [618:YES], then it is concluded that the speaker on the left-side (or driver-side) of the vehicle is closest to the MCD. In contrast, if the physical distance Δdij is less than the threshold value THlr, then it is concluded that the speaker on the right-side (or passenger-side) of the vehicle is closest to the MCD.
If the stereo system is not a two channel stereo system [616:NO] (or is a four channel stereo system), then method 600 continues with steps 624-636 of
Thereafter, a decision is made in step 626 as to whether the physical distance (ΔdLS+ΔdRS)/2 is greater than a threshold value THfb, where ΔdLS represents the distance difference from two left speakers and ΔdRS represents the distance difference from two right speaker. If the physical distance (ΔdLS+ΔdRS)/2 is greater than a threshold value THfb [626:YES], then method 600 continues with step 628 where it is concluded that the front speakers are closer to the MCD than the rear speakers. In this case, step 630 is performed to discriminate driver side and passenger side. Accordingly, steps 618-622 are performed in step 630 to determine whether the left or right side front speaker is closest to the MCD. Subsequently, step 636 is performed where method 600 ends or other processing is performed.
If the physical distance (ΔdLs+ΔdRS)/2 is less than a threshold value THfb [626:NO], then method 600 continues with step 632 where it is concluded that the rear speakers are closer to the MCD than the front speakers. In this case, step 634 is performed to discriminate driver side and passenger side. Accordingly, steps 602-622 are repeated using the ABSs corresponding to signals emitted from a fourth set of speakers (e.g., the rear speakers). Subsequently, step 636 is performed where method 600 ends or other processing is performed.
Exemplary implementations of the present invention will be described below in relation two different types of mobile phones. The present invention is not limited by the particularities of the exemplary implementations. The following discussion is simply provided to assist a reader in understanding the present invention, and the advantages of the same.
As noted above, the MCD can include, but is not limited to, a mobile phone such as an ADP2 phone (“phone I”) and/or an iPhone 3G (“phone II”). Each phone I and II has a Bluetooth radio and supports 16-bit 44.1 kHz sampling from a microphone thereof. Phone I is equipped with 192 MB RAM and an 528 MHz MSM7200A processor. Phone II is equipped with a 256 MB RAM and a 600 MHz ARM Cortex A8 processor.
As also noted above, the vehicle can include, but is not limited to, a car such as a Honda Civic (“car I”) and/or an Acura Sedan (“car II”). Cars I and II have two front speakers located at two front door's lower front sides, and two rear speakers in a rear deck. The interior dimensions of car I are about 175 cm (width) by 183 cm (length). The interior dimensions of car II are about 185 cm (width) by 203 cm (length).
Since both cars I and II are equipped with the two channel stereo system, the four channel sound system can be simulated by using a fader system of an audio unit thereof. Specifically, a two channel beep sound can be encoded and emitted first from the front speakers while the rear speakers are muted. Thereafter, the two channel beep sound can be emitted from the rear speakers while the front speakers are muted. The two channel beep sound can be pre-generated and stored in an audio file. The two channel beep sound can be pre-generated by: creating a beep defined by uniformly distributed white noise; bandpass filtering the uniformly distributed white noise to the 16-18 kHz band for phone 1 and 18-20 kHz band for phone II; and replicating the beep four times with a fixed interval of 5,000 samples between each beep so as to avoid interference from two adjacent beeps. The four beep sequence can then be stored first in the left channel of the audio file and after a 10,000 sample gap repeated on the right channel of the audio file.
Experiments were conducted in accordance with three scenarios. The three scenarios are described below.
Scenario 1: Phone I, Car I
In this scenario, phone I is used while car I is stationary. Background noises stem from conversation and an idling engine. As illustrated in
Scenario 2: Phone II, Car II
In this scenario, phone II is used while car II is stationary. Background noise is not present. Three occupy variant cases are studied: only the driver is in the car II; driver and co-driver are in the car; driver, co-driver and a passenger are in the car II. Two positions are tested in the first occupy variant case: driver door handle; and cup holder. Four positions are tested in the second occupy variant case: driver door handle; cup holder; co-driver's left pant pocket; and co-driver's door handle. Six positions are tested in the third occupy variant case: driver door handle; cup holder; co-driver's left pant pocket; co-driver's door handle; passenger's left seat; and passenger's rear left seat door handle.
Scenario 3: Highway Driving
In this scenario, phone I is deployed in car I. Background noise is not present at first, but then becomes present due to both front windows being opened. The car is driving on the highway at the speed of 60 MPH with music playing therein. The four positions are tested in this scenario: driver's left pant pocket; cup holder; co-driver holding the phone; and co-driver's right pant pocket.
For experimentation purposes, certain metrics are defined. Classification Accuracy (“accuracy”) as used herein refers to the percentage of the trials that were correctly classified as driver phone use or correctly classified as passenger phone use. Detection Rate (“DR”) as used herein refers to the percentage of trials within the driver control area that are classified as driver phone use. False Positive Rate (“FPR”) as used herein refers to the percentage of passenger phone use that is classified as driver phone use. Measurement Error (“ME”) as used herein refers to the difference between the measured distance difference (i.e., Δdij) and the true distance difference. The ME metric directly evaluates the performance of relative ranging in the ARR algorithm.
Driver Vs. Passenger Phone Use
Values for DR, FPR and Accuracy are shown in Table 1 when determining driver phone use using the two channel stereo system.
Note that since the two channel system cannot distinguish the driver-side passenger seat from the driver seat, only front phone positions are tested. To test the robustness of the system in relation to two different types of cars, an un-calibrated system (which uses a default threshold THlr) and a calibrated system (which uses a threshold value THlr selected based on the car's dimensions and speaker layout) is distinguished. The threshold value THlr in the un-calibrated system is set to −5 cm for both cars I and II. The threshold value THlr in the calibrated system is set to −7 cm for car I and −2 cm for car II.
Two Channel Stereo System
From TABLE 1, the important observation in scenario 3 is that the present system can achieve close to 100% DR (with a 4% FPR), which results in about 98% accuracy, suggesting that the present system is highly effective in detecting driver phone use while driving. DR for both un-calibrated and calibrated systems is more than 90% while FPR is around 5% except for car II setting. This indicates the effectiveness of the detection operations of the present system. The high FPR of car II setting can be reached through calibration of the threshold THlr. Although DR is reduced when reducing FPR for car II, the overall detection accuracy is improved. These results show that the present system is robust to different types of vehicles and can provide reasonable accuracy without calibration.
Recall that in this experiment, only front phone positions were considered since the two channel stereo system can only distinguish between driver-side and passenger-side positions. With phone positions on the back seat, particularly the driver-side rear passenger seat, detection accuracy will be degraded, although DR remains the same. Real life accuracy will depend on where drivers place their phones in the vehicle and how often passengers use their phone from other seats. Statistics show that the two front seats are the most frequency occupied seats. In particular, according to an FARS 2009 database, 83.5% of vehicles are only occupied by a driver and possibly one front seat passenger, whereas only about 16.5% of trips occur with back seat passengers. More specifically, only 8.7% of the trips include a passenger sitting behind the driver seat—the situation that would increase the FPR.
If the phone locations are weighed by these probabilities, the FPR rate only increases to about 20% even with the two channel system. The overall accuracy of detecting driver phone use remains about 90% for all three scenarios. Accordingly, the present invention successfully produces high detection accuracy even with systems limited to a two channel stereo.
Four Channel Stereo System
The experimental results of using a four channel stereo system employing un-calibrated threshold values and calibrated threshold values are also shown in TABLE 1. The un-calibrated threshold value THfb (i.e., the threshold for the front and back speaker discrimination) is set to 0 cm for cars I and II and the un-calibrated threshold value THlr (i.e., the threshold for the left and right speaker discrimination) is set to −5 cm for cars I and II. For car I, the calibrated threshold value THfb (i.e., the threshold for the front and back speaker discrimination) is set to 15 cm and the un-calibrated threshold value THlr (i.e., the threshold for the left and right speaker discrimination) is set to −5 cm. For car II, the calibrated threshold value THfb (i.e., the threshold for the front and back speaker discrimination) is set to −24 cm and the un-calibrated threshold value THlr (i.e., the threshold for the left and right speaker discrimination) is set to −2 cm. With the calibrated thresholds, DR is above 90% and the accuracy is around 95% for both settings. This shows that the four channel system can improve the detection performance, compared to that of the two-channel stereo system. In addition, the performance under un-calibrated thresholds is similar to that under calibrated thresholds for car I setting. However, it is much worse than that of calibrated thresholds for car II settings. This suggests that calibration is more important for distinguishing the rear area, because the seat locations very more in the front-back dimensions across cars (and due to manual seat adjustments).
The present algorithm accuracy is now evaluates at different positions and seats within the vehicle.
Seat classification results are also derived. TABLE 2 shows the accuracy when determining a phone at each seat under un-calibrated and calibrated thresholds using a four channel stereo system.
As can be seen from TABLE 2, the accuracy of the back seats is higher than that of the front seats. Notably, it is hard to classify the cup holder and co-driver's left position since they are physically close to each other.
Left vs. Right Classification
To compare the stability of the ranging results under the Highway driving scenario to the stationary scenario, a graph was created plotting the standard deviation of the relative ranging results at different positions. This graph is shown in
Front vs. Back Seat Classification
In front and back classification, the detection rate is defined as the percentage of the trials on front seats that are classified as front seats. FPR is defined as the percentage of back seat trials that are classified as front seats.
The ME of a relative ranging mechanism is now presented. Also, the ME is compared to previous work using a chirp signal and correlation signal detection method with a multipath mitigation mechanism.
To be resistant to ambient noise, the correlation method uses the chirp signal as a beep sound. To perform signal detection, this method correlates the chirp sound with the recorded signal using L2-norm cross-correlation, and picks the time point when the correlation value is the maximum as the time signal detected. To mitigate the multipath, instead of using the maximum correlation value, the earliest sharp peak in the correlation values is suggested as the signal detected time. This approach is referred to as the correlation method with mitigation mechanism.
Strategy for Comparison
To investigate the effect of multipath in an enclosed in-vehicle environment and the resistance of beep signals to background noise, experiments were designed by putting phone I in car I at three different positions with Line Of Sight (“LOS”) to two front speakers. At each position, MEs were calculated to obtain a statistical result. To evaluate multipath effects, the TDOA values were measured for the present method and the correlation method with mitigation mechanism. To test the robustness under background noise, music was played in the vehicle at different sound pressure levels, which are 60 dB and 80 dB, representing moderate noise (e.g., people talking in the vehicle) and heavy noise (e.g., traffic on a busy road), respectively. The chirp sound used for the correlation method is a 50 millisecond length of 2-6 kHz linear chirp signal at 80 dB SPL.
Impact of Multipath
Impact of Background Noise
In view of the forgoing, a driver mobile phone use detection system has been provides that requires minimal hardware and/or software medications on MCDs. The present system achieves this by leveraging the existing infrastructure of speakers for ranging via SRCs. The present system detects driver phone use by estimating the range between the phone and speakers. To estimate range, an ARR technique is employed in which the MCD plays and records a specially designed acoustic signal through a vehicle's speakers. The acoustic signal is unobtrusive as well as robust to background noise when driving. The present system achieves high accuracy under heavy multipath in-vehicle environments by using sequential change-point detection to identify the first arriving signal.
All of the apparatus, methods and algorithms disclosed and claimed herein can be made and executed without undue experimentation in light of the present disclosure. While the invention has been described in terms of preferred embodiments, it will be apparent to those of skill in the art that variations may be applied to the apparatus, methods and sequence of steps of the method without departing from the concept, spirit and scope of the invention. More specifically, it will be apparent that certain components may be added to, combined with, or substituted for the components described herein while the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the invention as defined.
This application is a non-provisional application of U.S. Provisional Application Ser. No. 61/657,139 filed on Jun. 8, 2012, which is herein incorporated in its entirety. The inventive arrangements relate to systems and methods for acoustic relative-ranging for determining an approximate location of a mobile device in a confined area. More particularly, the inventive arrangements concern systems and methods leveraging an existing car audio infrastructure to determine on which car seat a phone is being used.
Number | Date | Country | |
---|---|---|---|
61657139 | Jun 2012 | US |