The present disclosure relates to methods, techniques, and systems for ability enhancement and, more particularly, to methods, techniques, and systems for vehicular threat detection based at least in part on analyzing image data obtained from a transportation-related context.
Human abilities such as hearing, vision, memory, foreign or native language comprehension, and the like may be limited for various reasons. For example, as people age, various abilities such as hearing, vision, or memory, may decline or otherwise become compromised. In some countries, as the population in general ages, such declines may become more common and widespread. In addition, young people are increasingly listening to music through headphones, which may also result in hearing loss at earlier ages.
In addition, limits on human abilities may be exposed by factors other than aging, injury, or overuse. As one example, the world population is faced with an ever increasing amount of information to review, remember, and/or integrate. Managing increasing amounts of information becomes increasingly difficult in the face of limited or declining abilities such as hearing, vision, and memory.
These problems may be further exacerbated and even result in serious health risks in a transportation-related context, as distracted and/or ability impaired drivers are more prone to be involved in accidents. For example, many drivers are increasingly distracted from the task of driving by an onslaught of information from cellular phones, smart phones, media players, navigation systems, and the like. In addition, an aging population in some regions may yield an increasing number or share of drivers who are vision and/or hearing impaired.
Current approaches to addressing limits on human abilities may suffer from various drawbacks. For example, there may be a social stigma connected with wearing hearing aids, corrective lenses, or similar devices. In addition, hearing aids typically perform only limited functions, such as amplifying or modulating sounds for a hearer. Furthermore, legal regimes that attempt to prohibit the use of telephones or media devices while driving may not be effective due to enforcement difficulties, declining law enforcement budgets, and the like. Nor do such regimes address a great number of other sources of distraction or impairment, such as other passengers, car radios, blinding sunlight, darkness, or the like.
FIGS. 3.1-3.112 are example flow diagrams of ability enhancement processes performed by example embodiments.
Embodiments described herein provide enhanced computer- and network-based methods and systems for ability enhancement and, more particularly, for enhancing a user's ability to operate or function in a transportation-related context (e.g., as a pedestrian or vehicle operator) by performing vehicular threat detection based at least in part on analyzing image data that represents vehicles and other objects present in a roadway or other context. Example embodiments provide an Ability Enhancement Facilitator System (“AEFS”). Embodiments of the AEFS may augment, enhance, or improve the senses (e.g., hearing), faculties (e.g., memory, language comprehension), and/or other abilities (e.g., driving, riding a bike, walking/running) of a user.
In some embodiments, the AEFS is configured to identify threats (e.g., posed by vehicles to a user of a roadway, posed by a user to vehicles or other users of a roadway), and to provide information about such threats to the user so that the user may take evasive action. Identifying threats may include analyzing information about a vehicle that is present in the roadway in order to determine whether the user and the vehicle may be on a collision course. The analyzed information may include or be represented by image data (e.g., pictures or video of a roadway and its surrounding environment), audio data (e.g., sounds reflected from or emitted by a vehicle), range information (e.g., provided by a sonar or infrared range sensor), conditions information (e.g., weather, temperature, time of day), or the like. The user may be a pedestrian (e.g., a walker, a jogger), an operator of a motorized (e.g., car, motorcycle, moped, scooter) or non-motorized vehicle (e.g., bicycle, pedicab, rickshaw), a vehicle passenger, or the like. In some embodiments, the vehicle may be operating autonomously. In some embodiments, the user wears a wearable device (e.g., a helmet, goggles, eyeglasses, hat) that is configured to at least present determined vehicular threat information to the user.
In some embodiments, the AEFS is configured to receive image data, at least some of which represents an image of a first vehicle. The image data may be obtained from various sources, including a camera of a wearable device of a user, a camera on a vehicle of the user, an in-situ road-side camera, a camera on some other vehicle, or the like. The image data may represent electromagnetic signals of various types or in various ranges, including visual signals (e.g., signals having a wavelength in the range of about 390-750 nm), infrared signals (e.g., signals having a wavelength in the range of about 750 nm-300 micrometers), or the like.
Then, the AEFS determines vehicular threat information based at least in part on the image data. In some embodiments, the AEFS may analyze the received image data in order to identify the first vehicle and/or to determine whether the first vehicle represents a threat to the user, such as because the first vehicle and the user may be on a collision course. The image data may be analyzed in various ways, including by identifying objects (e.g., to recognize that a vehicle or some other object is shown in the image data), determining motion-related information (e.g., position, velocity, acceleration, mass) about objects, or the like.
Next, the AEFS informs the user of the determined vehicular threat information via a wearable device of the user. Typically, the user's wearable device (e.g., a helmet) will include one or more output devices, such as audio speakers, visual display devices (e.g., warning lights, screens, heads-up displays), haptic devices, and the like. The AEFS may present the vehicular threat information via one or more of these output devices. For example, the AEFS may visually display or speak the words “Car on left.” As another example, the AEFS may visually display a leftward pointing arrow on a heads-up screen displayed on a face screen of the user's helmet. Presenting the vehicular threat information may also or instead include presenting a recommended course of action (e.g., to slow down, to speed up, to turn) to mitigate the determined vehicular threat.
The AEFS may use other or additional sources or types of information. For example, in some embodiments, the AEFS is configured to receive data representing an audio signal emitted by a first vehicle. The audio signal is typically obtained in proximity to a user, who may be a pedestrian or traveling in a vehicle as an operator or a passenger. In some embodiments, the audio signal is obtained by one or more microphones coupled to the user's vehicle and/or a wearable device of the user, such as a helmet, goggles, a hat, a media player, or the like. Then, the AEFS may determine vehicular threat information based at least in part on the data representing the audio signal. In some embodiments, the AEFS may analyze the received data in order to determine whether the first vehicle and the user are on a collision course. The audio data may be analyzed in various ways, including by performing audio analysis, frequency analysis (e.g., Doppler analysis), acoustic localization, or the like.
The AEFS may combine information of various types in order to determine vehicular threat information. For example, because image processing may be computationally expensive, rather than always processing all image data obtained from every possible source, the AEFS may use audio analysis to initially determine the approximate location of an oncoming vehicle, such as to the user's left, right, or rear. For example, having determined based on audio data that a vehicle may be approaching from the rear of the user, the AEFS may preferentially process image data from a rear-facing camera to further refine a threat analysis. As another example, the AEFS may incorporate information about the condition of a roadway (e.g., icy or wet) when determining whether a vehicle will be able to stop or maneuver in order to avoid an accident.
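As a non-limiting illustration, the audio-guided selection of image data described above can be sketched as follows. The function name `select_camera`, the four camera labels, and the 90-degree sectors are hypothetical choices made for this sketch and are not taken from any described embodiment; the bearing is assumed to have already been estimated from audio data.

```python
def select_camera(bearing_deg: float) -> str:
    """Pick the camera whose image data should be processed first.

    bearing_deg is an audio-derived bearing measured clockwise from the
    user's direction of travel (0 = ahead, 90 = right, 180 = rear,
    270 = left). The four 90-degree sectors are an assumption.
    """
    b = bearing_deg % 360.0  # normalize, so -90 becomes 270
    if b >= 315.0 or b < 45.0:
        return "front"
    if b < 135.0:
        return "right"
    if b < 225.0:
        return "rear"
    return "left"
```

In such a sketch, a vehicle heard behind the user (a bearing near 180 degrees) would cause image data from a rear-facing camera to be processed before image data from other cameras.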
1. Ability Enhancement Facilitator System Overview
In this example, the moped 110a is driving towards the motorcycle 110b from a side street, at approximately a right angle with respect to the path of travel of the motorcycle 110b. The traffic signal 106 has just turned from red to green for the motorcycle 110b, and the user 104 is beginning to drive the motorcycle 110b into the intersection controlled by the traffic signal 106. The user 104 is assuming that the moped 110a will stop, because cross traffic will have a red light. However, in this example, the moped 110a may not stop in a timely manner, for one or more reasons, such as because the operator of the moped 110a has not seen the red light, because the moped 110a is moving at an excessive speed, because the operator of the moped 110a is impaired, because the surface conditions of the roadway are icy or slick, or the like. As will be discussed further below, the AEFS 100 will determine that the moped 110a and the motorcycle 110b are likely on a collision course, and inform the user 104 of this threat via the helmet 120a, so that the user may take evasive action to avoid a possible collision with the moped 110a.
The moped 110a emits or reflects a signal 101. In some embodiments, the signal 101 is an electromagnetic signal in the visible light spectrum that represents an image of the moped 110a. Other types of electromagnetic signals may be received and processed, including infrared radiation, radio waves, microwaves, or the like. Other types of signals are contemplated, including audio signals, such as an emitted engine noise, a reflected sonar signal, a vocalization (e.g., shout, scream), etc. The signal 101 may be received by a receiving detector/device/sensor, such as a camera or microphone (not shown) on the helmet 120a and/or the motorcycle 110b. In some embodiments, a computing and communication device within the helmet 120a receives and samples the signal 101 and transmits the samples or other representation to the AEFS 100. In other embodiments, other forms of data may be used to represent the signal 101, including frequency coefficients, compressed audio/video, or the like.
The AEFS 100 determines vehicular threat information by analyzing the received data that represents the signal 101. If the signal 101 is a visual signal, then the AEFS 100 may employ various image data processing techniques. For example, the AEFS 100 may perform object recognition to determine that received image data includes an image of a vehicle, such as the moped 110a. The AEFS 100 may also or instead process received image data to determine motion-related information with respect to the moped 110a, including position, velocity, acceleration, or the like. The AEFS 100 may further identify the presence of other objects, including pedestrians, animals, structures, or the like, that may pose a threat to the user 104 or that may be themselves threatened (e.g., by actions of the user 104 and/or the moped 110a). Image processing also may be employed to determine other information, including road conditions (e.g., wet or icy roads), visibility conditions (e.g., glare or darkness), and the like.
If the signal 101 is an audio signal, then the AEFS 100 may use one or more audio analysis techniques to determine the vehicular threat information. In one embodiment, the AEFS 100 performs a Doppler analysis (e.g., by determining whether the frequency of the audio signal is increasing or decreasing) to determine that the object that is emitting the audio signal is approaching (and possibly at what rate) the user 104. In some embodiments, the AEFS 100 may determine the type of vehicle (e.g., a heavy truck, a passenger vehicle, a motorcycle, a moped) by analyzing the received data to identify an audio signature that is correlated with a particular engine type or size. For example, a lower frequency engine sound may be correlated with a larger vehicle size, and a higher frequency engine sound may be correlated with a smaller vehicle size.
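As a non-limiting illustration of the Doppler analysis described above, the following sketch estimates the rate of approach from a frequency shift using the classical relation for a stationary listener. It assumes that the emitted frequency is known or has been estimated (e.g., from an audio signature), which may not hold in practice; the function name `approach_speed` and the 343 m/s speed of sound are assumptions of this sketch.

```python
SPEED_OF_SOUND_M_S = 343.0  # assumed value for dry air at about 20 degrees C


def approach_speed(emitted_hz: float, observed_hz: float,
                   c: float = SPEED_OF_SOUND_M_S) -> float:
    """Radial speed (m/s) of a sound source toward a stationary listener.

    Solves the classical Doppler relation f_obs = f_src * c / (c - v)
    for v. A positive result means the source is approaching; a negative
    result means it is receding.
    """
    if emitted_hz <= 0 or observed_hz <= 0:
        raise ValueError("frequencies must be positive")
    return c * (1.0 - emitted_hz / observed_hz)
```

An observed frequency above the emitted frequency yields a positive speed, consistent with an object that is approaching the user 104.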
In one embodiment, where the signal 101 is an audio signal, the AEFS 100 performs acoustic source localization to determine information about the trajectory of the moped 110a, including one or more of position, direction of travel, speed, acceleration, or the like. Acoustic source localization may include receiving data representing the audio signal 101 as measured by two or more microphones. For example, the helmet 120a may include four microphones (e.g., front, right, rear, and left) that each receive the audio signal 101. These microphones may be directional, such that they can be used to provide directional information (e.g., an angle between the helmet and the audio source). Such directional information may then be used by the AEFS 100 to triangulate the position of the moped 110a. As another example, the AEFS 100 may measure differences between the arrival time of the audio signal 101 at multiple distinct microphones on the helmet 120a or other location. The difference in arrival time, together with information about the distance between the microphones, can be used by the AEFS 100 to determine distances between each of the microphones and the audio source, such as the moped 110a. Distances between the microphones and the audio source can then be used to determine one or more locations at which the audio source may be located.
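As a non-limiting illustration of using a difference in arrival time, the following sketch estimates a bearing to the audio source from a single pair of microphones. It assumes a far-field source (much farther away than the microphone spacing) and a speed of sound of 343 m/s; the function name `bearing_from_tdoa` and its sign convention are hypothetical. A single pair yields only an angle, so locating the source would require additional microphone pairs or other information.

```python
import math


def bearing_from_tdoa(delay_s: float, mic_spacing_m: float,
                      c: float = 343.0) -> float:
    """Far-field bearing (degrees) from a time difference of arrival.

    delay_s is the arrival time at microphone B minus the arrival time
    at microphone A. The result is measured from the broadside direction
    (perpendicular to the line joining the microphones) and is positive
    toward microphone A, which heard the sound first.
    """
    if mic_spacing_m <= 0:
        raise ValueError("microphone spacing must be positive")
    ratio = c * delay_s / mic_spacing_m
    ratio = max(-1.0, min(1.0, ratio))  # clamp against measurement noise
    return math.degrees(math.asin(ratio))
```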
Determining vehicular threat information may also or instead include obtaining information such as the position, trajectory, and speed of the user 104, such as by receiving data representing such information from sensors, devices, and/or systems on board the motorcycle 110b and/or the helmet 120a. Such sources of information may include a speedometer, a geo-location system (e.g., GPS system), an accelerometer, or the like. Once the AEFS 100 has determined and/or obtained information such as the position, trajectory, and speed of the moped 110a and the user 104, the AEFS 100 may determine whether the moped 110a and the user 104 are likely to collide with one another. For example, the AEFS 100 may model the expected trajectories of the moped 110a and user 104 to determine whether they intersect at or about the same point in time.
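As a non-limiting illustration of modeling expected trajectories, the following sketch computes the closest point of approach of two objects that are each assumed to move at constant velocity in a flat, two-dimensional frame. The function names, the 2-meter collision radius, and the 10-second horizon are assumptions made for this sketch only.

```python
import math


def closest_approach(p1, v1, p2, v2):
    """Return (t, d) for two constant-velocity objects.

    p1, p2 are (x, y) positions in meters; v1, v2 are (vx, vy) in m/s.
    t is the time (seconds, never negative) at which the objects are
    closest, and d is their separation in meters at that time.
    """
    rx, ry = p2[0] - p1[0], p2[1] - p1[1]  # relative position
    vx, vy = v2[0] - v1[0], v2[1] - v1[1]  # relative velocity
    speed_sq = vx * vx + vy * vy
    if speed_sq == 0.0:
        t = 0.0  # no relative motion: separation never changes
    else:
        t = max(0.0, -(rx * vx + ry * vy) / speed_sq)
    return t, math.hypot(rx + vx * t, ry + vy * t)


def on_collision_course(p1, v1, p2, v2, radius_m=2.0, horizon_s=10.0):
    """True if the objects pass within radius_m within horizon_s seconds."""
    t, d = closest_approach(p1, v1, p2, v2)
    return t <= horizon_s and d <= radius_m
```

For example, a motorcycle at (0, -20) traveling at (0, 10) and a moped at (-20, 0) traveling at (10, 0) both reach the origin after 2 seconds, so this sketch reports a collision course; if the moped is instead stationary, the closest separation is 20 meters and no collision course is reported.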
The AEFS 100 may then present the determined vehicular threat information (e.g., that the moped 110a represents a hazard) to the user 104 via the helmet 120a. Presenting the vehicular threat information may include transmitting the information to the helmet 120a, where it is received and presented to the user. In one embodiment, the helmet 120a includes audio speakers that may be used to output an audio signal (e.g., an alarm or voice message) warning the user 104. In other embodiments, the helmet 120a includes a visual display, such as a heads-up display presented upon a face screen of the helmet 120a, which can be used to present a text message (e.g., “Look left”) or an icon (e.g., a red arrow pointing left).
The AEFS 100 may also use information received from in-situ sensors and/or devices. For example, the AEFS 100 may use information received from a camera 108 that is mounted on the traffic signal 106 that controls the illustrated intersection. The AEFS 100 may receive image data that represents the moped 110a and/or the motorcycle 110b. The AEFS 100 may perform image recognition to determine the type and/or position of a vehicle that is approaching the intersection. The AEFS 100 may also or instead analyze multiple images (e.g., from a video signal) to determine the velocity of a vehicle. Other types of sensors or devices installed in or about a roadway may also or instead be used, including range sensors, speed sensors (e.g., radar guns), induction coils (e.g., mounted in the roadbed), temperature sensors, weather gauges, or the like.
As noted above, the AEFS 100 may utilize data that represents a signal as detected by one or more detectors/sensors, such as microphones or cameras. In the example of
In an image context, the AEFS 100 may perform image processing on image data obtained from one or more of the camera sensors 124a and 124b. As discussed, the image data may be processed to determine the presence of the moped, its type, its motion-related information (e.g., velocity), and the like. In some embodiments, image data may be processed without making any definite identification of a vehicle. For example, the AEFS 100 may process image data from sensors 124a and 124b to identify the presence of motion (without necessarily identifying any objects). Based on such an analysis, the AEFS 100 may determine that there is something approaching from the left of the motorcycle 110b, but that the right of the motorcycle 110b is relatively clear.
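As a non-limiting illustration of identifying the presence of motion without identifying any objects, the following sketch compares two consecutive grayscale frames and counts changed pixels in the left and right halves of the image. The frame representation (lists of rows of integers from 0 to 255), the function name `motion_by_side`, and the threshold of 25 are assumptions of this sketch; it also assumes that the camera itself is not moving between frames.

```python
def motion_by_side(prev, curr, threshold=25):
    """Count changed pixels in the left and right halves of a frame.

    prev and curr are equally sized grayscale frames, each a list of
    rows of integer intensities (0-255). A pixel counts as changed when
    its intensity differs by more than threshold between the frames.
    Returns (left_count, right_count).
    """
    mid = len(curr[0]) // 2
    left = right = 0
    for row_prev, row_curr in zip(prev, curr):
        for x, (a, b) in enumerate(zip(row_prev, row_curr)):
            if abs(a - b) > threshold:
                if x < mid:
                    left += 1
                else:
                    right += 1
    return left, right
```

A large left count together with a small right count would be consistent with something approaching from the left while the right is relatively clear.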
Differences between data obtained from multiple sensors may be exploited in various ways. In an image context, an image signal may be perceived or captured differently by the two (camera) sensors 124a and 124b. The AEFS 100 may exploit or otherwise analyze such differences to determine the location and/or motion of the moped 110a. For example, knowing the relative position and optical qualities of the two cameras, it is possible to analyze images captured by those cameras to triangulate a position of an object (e.g., the moped 110a) or a distance between the motorcycle 110b and the object.
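As a non-limiting illustration of triangulating a distance from two cameras, the following sketch applies the standard pinhole stereo relation, in which depth equals focal length times baseline divided by disparity. It assumes two identical, parallel, rectified cameras and that the same object has already been matched in both images; the function name `stereo_depth_m` and its parameters are hypothetical.

```python
def stereo_depth_m(focal_length_px: float, baseline_m: float,
                   disparity_px: float) -> float:
    """Distance (meters) to an object seen by two parallel cameras.

    focal_length_px: focal length expressed in pixels.
    baseline_m: distance between the two camera centers, in meters.
    disparity_px: horizontal shift, in pixels, of the object between
        the left and right images (must be positive).
    """
    if disparity_px <= 0:
        raise ValueError("disparity must be positive")
    return focal_length_px * baseline_m / disparity_px
```

Under these assumptions, a focal length of 800 pixels, a baseline of 0.5 meters, and a disparity of 20 pixels correspond to a distance of 20 meters.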
In an audio context, an audio signal may be perceived differently by the two sensors 124a and 124b. For example, if the signal 101 is stronger as measured at microphone 124a than at microphone 124b, the AEFS 100 may infer that the signal 101 is originating from the driver's left of the motorcycle 110b, and thus that a vehicle is approaching from that direction. As another example, as the strength of an audio signal is known to decay with distance, and assuming an initial level (e.g., based on an average signal level of a vehicle engine), the AEFS 100 may determine a distance (or distance interval) between one or more of the microphones and the signal source.
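As a non-limiting illustration of estimating distance from signal strength, the following sketch assumes free-field spreading, in which the sound pressure level falls by about 6 dB for each doubling of distance (20 dB for each tenfold increase). The reference level at a reference distance is an assumed input (e.g., a typical engine level), and the function name `distance_from_level` is hypothetical. Reflections, wind, and obstacles would make any such estimate approximate, which is one reason a distance interval may be preferable to a single value.

```python
def distance_from_level(measured_db: float, reference_db: float,
                        reference_distance_m: float = 1.0) -> float:
    """Estimate source distance (meters) from a measured sound level.

    reference_db is the level the source is assumed to produce at
    reference_distance_m. Free-field spreading is assumed, so the level
    drops by 20 dB for every tenfold increase in distance.
    """
    return reference_distance_m * 10.0 ** ((reference_db - measured_db) / 20.0)
```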
The AEFS 100 may model vehicles and other objects, such as by representing their motion-related information, including position, speed, acceleration, mass and other properties. Such a model may then be used to determine whether objects are likely to collide. Note that the model may be probabilistic. For example, the AEFS 100 may represent an object's position in space as a region that includes multiple positions that each have a corresponding likelihood that the object is at that position. As another example, the AEFS 100 may represent the velocity of an object as a range of likely values, a probability distribution, or the like. Various frames of reference may be employed, including a user-centric frame, an absolute frame, or the like.
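As a non-limiting illustration of a probabilistic position, the following sketch represents an object's position as a set of candidate positions, each with a likelihood, and sums the likelihood that falls within a given radius of a point of interest. The dictionary representation and the function name `probability_within` are assumptions of this sketch; other representations, such as continuous distributions, are equally possible.

```python
def probability_within(cells, center, radius_m):
    """Likelihood that an object lies within radius_m of center.

    cells maps candidate (x, y) positions in meters to likelihoods that
    are assumed to sum to 1. center is an (x, y) position in meters.
    """
    cx, cy = center
    r_sq = radius_m * radius_m
    return sum(p for (x, y), p in cells.items()
               if (x - cx) ** 2 + (y - cy) ** 2 <= r_sq)
```

For instance, with likelihoods of 0.5 at (0, 0), 0.3 at (1, 0), and 0.2 at (5, 5), the likelihood of the object being within 1.5 meters of the origin is about 0.8.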
The AEFS 100 may interact with various types of wearable devices 120, including a motorcycle helmet 120a (
In some embodiments, a wearable device may perform some or all of the functions of the AEFS 100, even though the AEFS 100 is depicted as separate in these examples. Some devices may have minimal processing power and thus perform only some of the functions. For example, the eyeglasses 120b may receive vehicular threat information from a remote AEFS 100, and display it on a heads-up display displayed on the inside of the lenses of the eyeglasses 120b. Other wearable devices may have sufficient processing power to perform more of the functions of the AEFS 100. For example, the personal media device 120e may have considerable processing power and as such be configured to perform acoustic source localization, collision detection analysis, or other more computationally expensive functions.
Note that the wearable devices 120 may act in concert with one another or with other entities to perform functions of the AEFS 100. For example, the eyeglasses 120b may include a display mechanism that receives and displays vehicular threat information determined by the personal media device 120e. As another example, the goggles 120c may include a display mechanism that receives and displays vehicular threat information determined by a computing device in the helmet 120a or 120d. In a further example, one of the wearable devices 120 may receive and process audio data received by microphones mounted on the vehicle 110c.
The AEFS 100 may also or instead interact with vehicles 110 and/or computing devices installed thereon. As noted, a vehicle 110 may have one or more sensors or devices that may operate as (direct or indirect) sources of information for the AEFS 100. The vehicle 110c, for example, may include a speedometer, an accelerometer, one or more microphones, one or more range sensors, or the like. Data obtained by, at, or from such devices of vehicle 110c may be forwarded to the AEFS 100, possibly by a wearable device 120 of an operator of the vehicle 110c.
In some embodiments, the vehicle 110c may itself have or use an AEFS, and be configured to transmit warnings or other vehicular threat information to others. For example, an AEFS of the vehicle 110c may have determined that the moped 110a was driving with excessive speed just prior to the scenario depicted in
The AEFS 100 may also or instead interact with sensors and other devices that are installed on, in, or about roads or in other transportation related contexts, such as parking garages, racetracks, or the like. In this example, the AEFS 100 interacts with the camera 108 to obtain images of vehicles, pedestrians, or other objects present in a roadway. Other types of sensors or devices may include range sensors, infrared sensors, induction coils, radar guns, temperature gauges, precipitation gauges, or the like.
The AEFS 100 may further interact with information systems that are not shown in
In some embodiments, the AEFS 100 may transmit information to law enforcement agencies and/or related computing systems. For example, if the AEFS 100 determines that a vehicle is driving erratically, it may transmit that fact along with information about the vehicle (e.g., make, model, color, license plate number, location) to a police computing system.
Note that in some embodiments, at least some of the described techniques may be performed without the utilization of any wearable devices 120. For example, a vehicle 110 may itself include the necessary computation, input, and output devices to perform functions of the AEFS 100. For example, the AEFS 100 may present vehicular threat information on output devices of a vehicle 110, such as a radio speaker, dashboard warning light, heads-up display, or the like. As another example, a computing device on a vehicle 110 may itself determine the vehicular threat information.
In some embodiments, the AEFS 100 processes the image 140 to perform object identification. Upon processing the image 140, the AEFS 100 may identify the moped 110a, the child 141, the sun 142, and/or the puddle 143. A sequence of images, taken at different times (e.g., one tenth of a second apart) may be used to determine that the moped 110a is moving, how fast the moped 110a is moving, acceleration/deceleration of the moped 110a, or the like. Motion of other objects, such as the child 141 may also be tracked. Based on such motion-related information, the AEFS 100 may model the physics of the identified objects to determine whether a collision is likely.
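As a non-limiting illustration of deriving motion-related information from a sequence of images, the following sketch applies finite differences to the three most recent position samples of a tracked object. It assumes that object positions have already been extracted from the images and converted from pixel coordinates to meters in a common frame; the function name `estimate_motion` and the sample format are hypothetical.

```python
def estimate_motion(track):
    """Estimate velocity and acceleration from the last three samples.

    track is a list of (t_seconds, x_m, y_m) tuples with strictly
    increasing times and at least three entries. Returns
    ((vx, vy), (ax, ay)) in m/s and m/s^2.
    """
    if len(track) < 3:
        raise ValueError("need at least three samples")
    (t0, x0, y0), (t1, x1, y1), (t2, x2, y2) = track[-3:]
    v_old = ((x1 - x0) / (t1 - t0), (y1 - y0) / (t1 - t0))
    v_new = ((x2 - x1) / (t2 - t1), (y2 - y1) / (t2 - t1))
    dt = (t2 - t0) / 2.0  # time between the midpoints of the two intervals
    accel = ((v_new[0] - v_old[0]) / dt, (v_new[1] - v_old[1]) / dt)
    return v_new, accel
```

A negative acceleration component along the direction of travel would indicate that the object is decelerating, as may be the case for a vehicle that is braking for a red light.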
Determining vehicular threat information may also or instead be based on factors related or relevant to objects other than the moped 110a or the user 104. For example, the AEFS 100 may determine that the puddle 143 will likely make it more difficult for the moped 110a to stop. Thus, even if the moped 110a is moving at a reasonable speed, its operator still may be unable to stop prior to entering the intersection due to the presence of the puddle 143. As another example, the AEFS 100 may determine that evasive action by the user 104 and/or the moped 110a may cause injury to the child 141. As a further example, the AEFS 100 may determine that it may be difficult for the user 104 to see the moped 110a and/or the child 141 due to the position of the sun 142. Such information may be incorporated into any models, predictions, or determinations made or maintained by the AEFS 100.
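As a non-limiting illustration of how surface conditions may affect whether a vehicle can stop, the following sketch uses the constant-deceleration relation in which braking distance equals the square of the speed divided by twice the product of the friction coefficient and gravitational acceleration. The friction coefficients used in the example (about 0.7 for dry pavement and about 0.4 for wet pavement) are rough illustrative values assumed for this sketch, and the function names are hypothetical.

```python
G_M_S2 = 9.81  # standard gravitational acceleration


def stopping_distance_m(speed_m_s: float, friction_coeff: float,
                        reaction_time_s: float = 0.0) -> float:
    """Distance (meters) needed to stop from speed_m_s.

    Assumes constant deceleration of friction_coeff * g after the
    vehicle travels at full speed for reaction_time_s seconds.
    """
    if friction_coeff <= 0:
        raise ValueError("friction coefficient must be positive")
    braking = speed_m_s ** 2 / (2.0 * friction_coeff * G_M_S2)
    return speed_m_s * reaction_time_s + braking


def can_stop_within(speed_m_s: float, available_m: float,
                    friction_coeff: float) -> bool:
    """True if the vehicle can stop within available_m meters."""
    return stopping_distance_m(speed_m_s, friction_coeff) <= available_m
```

With these assumed coefficients, a vehicle traveling at 10 m/s needs roughly 7.3 meters to stop on dry pavement but roughly 12.7 meters on wet pavement, so it could stop within 10 meters of the intersection only in the dry case.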
The threat analysis engine 210 includes an audio processor 212, an image processor 214, other sensor data processors 216, and an object tracker 218. In the illustrated example, the audio processor 212 processes audio data received from the wearable device 120. As noted, such data may be received from other sources as well or instead, including directly from a vehicle-mounted microphone, or the like. The audio processor 212 may perform various types of signal processing, including audio level analysis, frequency analysis, acoustic source localization, or the like. Based on such signal processing, the audio processor 212 may determine strength, direction of audio signals, audio source distance, audio source type, or the like. Outputs of the audio processor 212 (e.g., that an object is approaching from a particular angle) may be provided to the object tracker 218 and/or stored in the data store 240.
The image processor 214 receives and processes image data that may be received from sources such as the wearable device 120 and/or information sources 130. For example, the image processor 214 may receive image data from a camera of the wearable device 120, and perform object recognition to determine the type and/or position of a vehicle that is approaching the user 104. As another example, the image processor 214 may receive a video signal (e.g., a sequence or stream of images) and process it to determine the type, position, and/or velocity of a vehicle that is approaching the user 104. Multiple images may be processed to determine the presence or absence of motion, even if no object recognition is performed. Outputs of the image processor 214 (e.g., position and velocity information, vehicle type information) may be provided to the object tracker 218 and/or stored in the data store 240.
The other sensor data processor 216 receives and processes data received from other sensors or sources. For example, the other sensor data processor 216 may receive and/or determine information about the position and/or movements of the user and/or one or more vehicles, such as based on GPS systems, speedometers, accelerometers, or other devices. As another example, the other sensor data processor 216 may receive and process conditions information (e.g., temperature, precipitation) from the information sources 130 and determine that road conditions are currently icy. Outputs of the other sensor data processor 216 (e.g., that the user is moving at 5 miles per hour) may be provided to the object tracker 218 and/or stored in the data store 240.
The object tracker 218 manages a geospatial object model that includes information about objects known to the AEFS 100. The object tracker 218 receives and merges information about object types, positions, velocity, acceleration, direction of travel, and the like, from one or more of the processors 212, 214, 216, and/or other sources. Based on such information, the object tracker 218 may identify the presence of objects as well as their likely positions, paths, and the like. The object tracker 218 may continually update this model as new information becomes available and/or as time passes (e.g., by plotting a likely current position of an object based on its last measured position and trajectory). The object tracker 218 may also maintain confidence levels corresponding to elements of the geospatial model, such as a likelihood that a vehicle is at a particular position or moving at a particular velocity, that a particular object is a vehicle and not a pedestrian, or the like.
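As a non-limiting illustration of updating the model as time passes, the following sketch plots a likely current position from a last measured position and velocity, and lowers the associated confidence level as the measurement ages. The constant-velocity extrapolation follows the example given above; the exponential decay of confidence and its 2-second half-life are assumptions introduced solely for this sketch, as are the function names.

```python
def predict_position(last_pos, velocity, elapsed_s):
    """Extrapolate an (x, y) position assuming constant velocity.

    last_pos is in meters, velocity in m/s, elapsed_s in seconds.
    """
    return (last_pos[0] + velocity[0] * elapsed_s,
            last_pos[1] + velocity[1] * elapsed_s)


def decayed_confidence(confidence, elapsed_s, half_life_s=2.0):
    """Halve the confidence level for every half_life_s seconds elapsed.

    The exponential form and the default half-life are assumptions.
    """
    return confidence * 0.5 ** (elapsed_s / half_life_s)
```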
The agent logic 220 implements the core intelligence of the AEFS 100. The agent logic 220 may include a reasoning engine (e.g., a rules engine, decision trees, Bayesian inference engine) that combines information from multiple sources to determine vehicular threat information. For example, the agent logic 220 may combine information from the object tracker 218, such as that there is a determined likelihood of a collision at an intersection, with information from one of the information sources 130, such as that the intersection is the scene of common red-light violations, and decide that the likelihood of a collision is high enough to transmit a warning to the user 104. As another example, the agent logic 220 may, in the face of multiple distinct threats to the user, determine which threat is the most significant and cause the user to avoid the more significant threat, such as by not directing the user 104 to slam on the brakes when a bicycle is approaching from the side but a truck is approaching from the rear, because being rear-ended by the truck would have more serious consequences than being hit from the side by the bicycle.
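As a non-limiting illustration of selecting the most significant of multiple distinct threats, the following sketch scores each threat by its collision likelihood multiplied by the kinetic energy implied by its mass and closing speed. This scoring rule, the `Threat` class, and the example values are hypothetical and are only one of many ways in which a reasoning engine could weigh consequences.

```python
from dataclasses import dataclass


@dataclass
class Threat:
    label: str
    collision_probability: float  # between 0 and 1
    mass_kg: float
    closing_speed_m_s: float

    def severity(self) -> float:
        """Likelihood-weighted kinetic energy of impact, in joules."""
        energy = 0.5 * self.mass_kg * self.closing_speed_m_s ** 2
        return self.collision_probability * energy


def most_significant(threats):
    """Return the threat with the highest severity score."""
    return max(threats, key=Threat.severity)
```

Under this rule, a truck approaching from the rear (e.g., 10,000 kg at a closing speed of 8 m/s with a likelihood of 0.3) outranks a bicycle approaching from the side (e.g., 90 kg at 5 m/s with a likelihood of 0.6), which is consistent with not directing the user 104 to brake hard in that situation.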
The presentation engine 230 includes a visible output processor 232 and an audible output processor 234. The visible output processor 232 may prepare, format, and/or cause information to be displayed on a display device, such as a display of the wearable device 120 or some other display (e.g., a heads-up display of a vehicle 110 being driven by the user 104). The agent logic 220 may use or invoke the visible output processor 232 to prepare and display information, such as by formatting or otherwise modifying vehicular threat information to fit on a particular type or size of display. The audible output processor 234 may include or use other components for generating audible output, such as tones, sounds, voices, or the like. In some embodiments, the agent logic 220 may use or invoke the audible output processor 234 in order to convert a textual message (e.g., a warning message, a threat identification) into audio output suitable for presentation via the wearable device 120, for example by employing a text-to-speech processor.
Note that one or more of the illustrated components/modules may not be present in some embodiments. For example, in embodiments that do not perform image or video processing, the AEFS 100 may not include an image processor 214. As another example, in embodiments that do not perform audio output, the AEFS 100 may not include an audible output processor 234.
Note also that the AEFS 100 may act in service of multiple users 104. In some embodiments, the AEFS 100 may determine vehicular threat information concurrently for multiple distinct users. Such embodiments may further facilitate the sharing of vehicular threat information. For example, vehicular threat information determined as between two vehicles may be relevant and thus shared with a third vehicle that is in proximity to the other two vehicles.
2. Example Processes
FIGS. 3.1-3.112 are example flow diagrams of ability enhancement processes performed by example embodiments.
At block 3.103, the process performs receiving image data, at least some of which represents an image of a first vehicle. The process may receive and consider image data, such as by performing image processing to identify vehicles or other hazards, to determine whether collisions may occur, determine motion-related information about the first vehicle (and possibly other entities), and the like. The image data may be obtained from various sources, including from a camera attached to the wearable device or a vehicle, a road-side camera, or the like.
At block 3.105, the process performs determining vehicular threat information based at least in part on the image data. Vehicular threat information may include information related to threats posed by the first vehicle (e.g., to the user or to some other entity), by a vehicle occupied by the user (e.g., to the first vehicle or to some other entity), or the like. Note that vehicular threats may be posed by vehicles to non-vehicles, including pedestrians, animals, structures, or the like. Vehicular threats may also include those threats posed by non-vehicles (e.g., structures, pedestrians) to vehicles. Vehicular threat information may be determined in various ways, including by analyzing image data to identify objects, such as vehicles, pedestrians, fixed objects, and the like. In some embodiments, determining the vehicular threat information may also or instead include determining motion-related information about identified objects, including position, velocity, direction of travel, accelerations, or the like. Determining the vehicular threat information may also or instead include predicting whether the path of the user and one or more identified objects may intersect.
At block 3.107, the process performs presenting the vehicular threat information via a wearable device of the user. The determined threat information may be presented in various ways, such as by presenting an audible or visible warning or other indication that the first vehicle is approaching the user. Different types of wearable devices are contemplated, including helmets, eyeglasses, goggles, hats, and the like. In other embodiments, the vehicular threat information may also or instead be presented in other ways, such as via an output device on a vehicle of the user, in-situ output devices (e.g., traffic signs, road-side speakers), or the like.
At block 3.204, the process performs receiving image data from a camera of a vehicle that is occupied by the user. The user's vehicle may include one or more cameras that may capture views to the front, sides, and/or rear of the vehicle, and provide these images to the process for image processing or other analysis.
At block 3.504, the process performs receiving image data from a camera of the wearable device. For example, where the wearable device is a helmet, the helmet may include one or more helmet cameras that may capture views to the front, sides, and/or rear of the helmet.
At block 3.604, the process performs receiving image data from a camera of the first vehicle. In some embodiments, the first vehicle may itself have cameras and broadcast or otherwise transmit image data obtained via that camera.
At block 3.704, the process performs receiving image data from a camera of a vehicle that is not the first vehicle and that is not occupied by the user. In some embodiments, other vehicles in the roadway may have cameras and broadcast or otherwise transmit image data obtained via those cameras. For example, some vehicle traveling between the user and the first vehicle may transmit images of the first vehicle to be received by the process as image data.
At block 3.804, the process performs receiving image data from a road-side camera. In some embodiments, road-side cameras, such as may be mounted on traffic lights, utility poles, buildings, or the like, may transmit image data to the process.
At block 3.904, the process performs receiving video data that includes multiple images of the first vehicle taken at different times. In some embodiments, the image data comprises video data in compressed or raw form. The video data typically includes (or can be reconstructed or decompressed to derive) multiple sequential images taken at distinct times.
At block 3.1004, the process performs receiving a first image of the first vehicle taken at a first time.
At block 3.1005, the process performs receiving a second image of the first vehicle taken at a second time, wherein the first and second times are sufficiently different such that velocity and/or direction of travel of the first vehicle may be determined with respect to positions of the first vehicle shown in the first and second images. Various time intervals between images may be utilized. For example, it may not be necessary to receive video data having a high frame rate (e.g., 30 frames per second or higher), because it may be preferable to determine motion or other properties of the first vehicle based on images that are taken at larger time intervals (e.g., one tenth of a second, one quarter of a second). In some embodiments, transmission bandwidth may be saved by transmitting and receiving reduced frame rate image streams.
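The determination of velocity and direction of travel from two images may be sketched as follows. This is a minimal example that assumes the position of the first vehicle in each image has already been converted to ground-plane coordinates in meters. That conversion depends on camera calibration and is outside the sketch. The function name `estimate_velocity` is hypothetical.

```python
import math

def estimate_velocity(pos1, t1, pos2, t2):
    """Estimate speed (m/s) and heading (degrees) from two timed positions.

    pos1, pos2: (x, y) ground-plane positions in meters (assumed inputs).
    t1, t2: capture times in seconds; t2 must be later than t1.
    Heading is measured counterclockwise from the +x axis, in [0, 360).
    """
    dt = t2 - t1
    if dt <= 0:
        raise ValueError("second image must be taken after the first")
    vx = (pos2[0] - pos1[0]) / dt
    vy = (pos2[1] - pos1[1]) / dt
    speed = math.hypot(vx, vy)
    heading_deg = math.degrees(math.atan2(vy, vx)) % 360.0
    return speed, heading_deg
```

With images taken one quarter of a second apart, a displacement of 5 meters corresponds to a speed of 20 meters per second.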
At block 3.1104, the process performs determining a threat posed by the first vehicle to the user. As noted, the vehicular threat information may indicate a threat posed by the first vehicle to the user, such as that the first vehicle may collide with the user unless evasive action is taken.
At block 3.1204, the process performs determining a threat posed by the first vehicle to some other entity besides the user. As noted, the vehicular threat information may indicate a threat posed by the first vehicle to some other person or thing, such as that the first vehicle may collide with the other entity. The other entity may be a vehicle occupied by the user, a vehicle not occupied by the user, a pedestrian, a structure, or any other object that may come into proximity with the first vehicle.
At block 3.1304, the process performs determining a threat posed by a vehicle occupied by the user to the first vehicle. The vehicular threat information may indicate a threat posed by the user's vehicle (e.g., as a driver or passenger) to the first vehicle, such as because a collision may occur between the two vehicles.
At block 3.1404, the process performs determining a threat posed by a vehicle occupied by the user to some other entity besides the first vehicle. The vehicular threat information may indicate a threat posed by the user's vehicle to some other person or thing, such as due to a potential collision. The other entity may be some other vehicle, a pedestrian, a structure, or any other object that may come into proximity with the user's vehicle.
At block 3.1504, the process performs identifying the first vehicle in the image data. Image processing techniques may be employed to identify the presence of a vehicle, its type (e.g., car or truck), its size, license plate number, color, or other identifying information about the first vehicle.
At block 3.1604, the process performs determining whether the first vehicle is moving towards the user based on multiple images represented by the image data. In some embodiments, a video feed or other sequence of images may be analyzed to determine the relative motion of the first vehicle. For example, if the first vehicle appears to be becoming larger over a sequence of images, then it is likely that the first vehicle is moving towards the user.
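The apparent-size heuristic described above may be sketched as follows. The example assumes that some upstream detector (not shown) supplies the apparent area of the first vehicle in each image. The function name and the 10% growth threshold are hypothetical choices made for illustration.

```python
def is_approaching(box_areas, min_growth=1.10):
    """Return True if the vehicle's apparent area grew over the image sequence.

    box_areas: apparent areas (in pixels) of the vehicle in consecutive images,
               oldest first (assumed to come from an upstream detector).
    min_growth: hypothetical ratio of last to first area that counts as growth.
    """
    if len(box_areas) < 2 or box_areas[0] <= 0:
        # Not enough information to compare sizes.
        return False
    return box_areas[-1] / box_areas[0] >= min_growth
```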
At block 3.1704, the process performs determining motion-related information about the first vehicle, based on one or more images of the first vehicle. Motion-related information may include information about the mechanics (e.g., kinematics, dynamics) of the first vehicle, including position, velocity, direction of travel, acceleration, mass, or the like. Motion-related information may be determined for vehicles that are at rest. Motion-related information may be determined and expressed with respect to various frames of reference, including the user's frame of reference, the frame of reference of the first vehicle, a fixed frame of reference, or the like.
At block 3.1804, the process performs determining the motion-related information with respect to timestamps associated with the one or more images. In some embodiments, the received images include timestamps or other indicators that can be used to determine a time interval between the images. In other cases, the time interval may be known a priori or expressed in other ways, such as in terms of a frame rate associated with an image or video stream.
At block 3.1904, the process performs determining a position of the first vehicle. The position of the first vehicle may be expressed absolutely, such as via a GPS coordinate or similar representation, or relatively, such as with respect to the position of the user (e.g., 20 meters away from the user). In addition, the position of the first vehicle may be represented as a point or collection of points (e.g., a region, arc, or line).
At block 3.2004, the process performs determining a velocity of the first vehicle. The process may determine the velocity of the first vehicle in absolute or relative terms (e.g., with respect to the velocity of the user). The velocity may be expressed or represented as a magnitude (e.g., 10 meters per second), a vector (e.g., having a magnitude and a direction), or the like.
At block 3.2104, the process performs determining the velocity with respect to a fixed frame of reference. In some embodiments, a fixed, global, or absolute frame of reference may be utilized.
At block 3.2204, the process performs determining the velocity with respect to a frame of reference of the user. In some embodiments, velocity is expressed with respect to the user's frame of reference. In such cases, a stationary (e.g., parked) vehicle will appear to be approaching the user if the user is driving towards the first vehicle.
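The effect of the user's frame of reference may be illustrated with a short sketch. It assumes two-dimensional velocities expressed in a common fixed frame. The function names are hypothetical.

```python
import math

def relative_velocity(vehicle_vel, user_vel):
    """Velocity of the vehicle as seen from the user's frame of reference."""
    return (vehicle_vel[0] - user_vel[0], vehicle_vel[1] - user_vel[1])

def closing_speed(rel_pos, rel_vel):
    """Rate at which the distance to the vehicle shrinks, in m/s.

    rel_pos: (x, y) position of the vehicle relative to the user, in meters.
    Positive values mean the vehicle appears to be approaching.
    """
    dist = math.hypot(rel_pos[0], rel_pos[1])
    if dist == 0.0:
        return 0.0
    return -(rel_pos[0] * rel_vel[0] + rel_pos[1] * rel_vel[1]) / dist
```

For a parked vehicle 50 meters directly ahead of a user traveling at 15 meters per second, the relative velocity is (-15, 0) and the closing speed is 15 meters per second, so the parked vehicle appears to approach the user.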
At block 3.2304, the process performs determining a direction of travel of the first vehicle. The process may determine a direction in which the first vehicle is traveling, such as with respect to the user and/or some absolute coordinate system or frame of reference.
At block 3.2404, the process performs determining acceleration of the first vehicle. In some embodiments, acceleration of the first vehicle may be determined, for example by determining a rate of change of the velocity of the first vehicle observed over time.
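A rate of change of velocity may be approximated by finite differences over successive observations, as in the following minimal sketch. The function name is hypothetical, and the speeds are assumed to have been determined already (for example, from timed images).

```python
def estimate_accelerations(speeds, timestamps):
    """Finite-difference acceleration (m/s^2) between consecutive observations.

    speeds: observed speeds in m/s, oldest first.
    timestamps: observation times in seconds, strictly increasing.
    """
    if len(speeds) != len(timestamps):
        raise ValueError("speeds and timestamps must have the same length")
    result = []
    for i in range(len(speeds) - 1):
        dt = timestamps[i + 1] - timestamps[i]
        if dt <= 0:
            raise ValueError("timestamps must be strictly increasing")
        result.append((speeds[i + 1] - speeds[i]) / dt)
    return result
```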
At block 3.2504, the process performs determining mass of the first vehicle. Mass of the first vehicle may be determined in various ways, including by identifying the type of the first vehicle (e.g., car, truck, motorcycle), determining the size of the first vehicle based on its appearance in an image, or the like.
At block 3.2604, the process performs determining that the first vehicle is driving erratically. The first vehicle may be driving erratically for a number of reasons, including due to a medical condition (e.g., a heart attack, bad eyesight, shortness of breath), drug/alcohol impairment, distractions (e.g., text messaging, crying children, loud music), or the like.
At block 3.2704, the process performs determining that the first vehicle is driving with excessive speed. Excessive speed may be determined relatively, such as with respect to the average traffic speed on a road segment, posted speed limit, or the like. For example, a vehicle may be determined to be driving with excessive speed if the vehicle is driving more than 20% over the posted speed limit. Other thresholds (e.g., 10% over, 25% over) and/or baselines (e.g., average observed speed) are contemplated.
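The threshold test described above may be sketched as follows. The baseline may be a posted speed limit or an average observed speed, and the default of 20% mirrors the example threshold given above. The function name is hypothetical.

```python
def is_excessive_speed(speed, baseline, threshold=0.20):
    """True if speed exceeds the baseline by more than the given fraction.

    speed, baseline: speeds in the same units (e.g., km/h or mph).
    threshold: fraction over the baseline, e.g., 0.20 for 20% over.
    """
    return speed > baseline * (1.0 + threshold)
```

For example, with a posted limit of 60, a speed of 75 exceeds the 20% threshold (72), while a speed of 70 does not.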
At block 3.2804, the process performs identifying objects other than the first vehicle in the image data. Image processing techniques may be employed by the process to identify other objects of interest, including road hazards (e.g., utility poles, ditches, drop-offs), pedestrians, other vehicles, or the like.
At block 3.2904, the process performs determining driving conditions based on the image data. Image processing techniques may be employed by the process to determine driving conditions, such as surface conditions (e.g., icy, wet), lighting conditions (e.g., glare, darkness), or the like.
At block 3.3004, the process performs determining vehicular threat information that is not related to the first vehicle. The process may determine vehicular threat information that is not due to the first vehicle, including based on a variety of other factors or information, such as driving conditions, the presence or absence of other vehicles, the presence or absence of pedestrians, or the like.
At block 3.3104, the process performs receiving and processing image data that includes images of objects and/or conditions aside from the first vehicle. At least some of the received image data may include images of things other than the first vehicle, such as other vehicles, pedestrians, driving conditions, and the like.
At block 3.3204, the process performs receiving image data of at least one of a stationary object, a pedestrian, and/or an animal. A stationary object may be a fence, guardrail, utility pole, building, parked vehicle, or the like.
At block 3.3304, the process performs processing the image data to determine the vehicular threat information that is not related to the first vehicle. For example, the process may determine that a difficult lighting condition exists due to glare or overexposure detected in the image data. As another example, the process may identify a pedestrian in the roadway depicted in the image data. As another example, the process may determine that poor road surface conditions exist.
At block 3.3404, the process performs processing data other than the image data to determine the vehicular threat information that is not related to the first vehicle. The process may analyze data other than image data, such as weather data (e.g., temperature, precipitation), time of day, traffic information, position or motion sensor information (e.g., obtained from GPS systems or accelerometers), or the like.
At block 3.3504, the process performs determining that poor driving conditions exist. Poor driving conditions may include or be based on weather information (e.g., snow, rain, ice, temperature), time information (e.g., night or day), lighting information (e.g., a light sensor indicating that the user is traveling towards the setting sun), or the like.
At block 3.3604, the process performs determining that a limited visibility condition exists. Limited visibility may be due to the time of day (e.g., at dusk, dawn, or night), weather (e.g., fog, rain), or the like.
At block 3.3704, the process performs determining that there is slow traffic in proximity to the user. The process may receive and integrate information from traffic information systems (e.g., that report accidents), other vehicles (e.g., that are reporting their speeds), or the like.
At block 3.3804, the process performs receiving information from a traffic information system regarding traffic congestion on a road traveled by the user. Traffic information systems may provide fine-grained traffic information, such as current average speeds measured on road segments in proximity to the user.
At block 3.3904, the process performs determining that one or more vehicles are traveling slower than an average or posted speed for a road traveled by the user. Slow travel may be determined based on the speed of one or more vehicles with respect to various baselines, such as average observed speed (e.g., recorded over time, based on time of day, etc.), posted speed limits, recommended speeds based on conditions, or the like.
At block 3.4004, the process performs determining that poor surface conditions exist on a roadway traveled by the user. Poor surface conditions may be due to weather (e.g., ice, snow, rain), temperature, surface type (e.g., gravel road), foreign materials (e.g., oil), or the like.
At block 3.4104, the process performs determining that there is a pedestrian in proximity to the user. The presence of pedestrians may be determined in various ways. In some embodiments, the process may utilize image processing techniques to recognize pedestrians in received image data. In other embodiments, pedestrians may wear devices that transmit their location and/or presence. In other embodiments, pedestrians may be detected based on their heat signature, such as by an infrared sensor on the wearable device, user vehicle, or the like.
At block 3.4204, the process performs determining that there is an accident in proximity to the user. Accidents may be identified based on traffic information systems that report accidents, vehicle-based systems that transmit when collisions have occurred, or the like.
At block 3.4304, the process performs determining that there is an animal in proximity to the user. The presence of an animal may be determined as discussed with respect to pedestrians, above.
At block 3.4404, the process performs determining the vehicular threat information based on motion-related information that is not based on images of the first vehicle. The process may consider a variety of motion-related information received from various sources, such as the wearable device, a vehicle of the user, the first vehicle, or the like. The motion-related information may include information about the mechanics (e.g., position, velocity, acceleration, mass) of the user and/or the first vehicle.
At block 3.4504, the process performs determining the vehicular threat information based on information about position, velocity, and/or acceleration of the user obtained from sensors in the wearable device. The wearable device may include position sensors (e.g., GPS), accelerometers, or other devices configured to provide motion-related information about the user to the process.
At block 3.4604, the process performs determining the vehicular threat information based on information about position, velocity, and/or acceleration of the user obtained from devices in a vehicle of the user. A vehicle occupied or operated by the user may include position sensors (e.g., GPS), accelerometers, speedometers, or other devices configured to provide motion-related information about the user to the process.
At block 3.4704, the process performs determining the vehicular threat information based on information about position, velocity, and/or acceleration of the first vehicle obtained from devices of the first vehicle. The first vehicle may include position sensors (e.g., GPS), accelerometers, speedometers, or other devices configured to provide motion-related information about the first vehicle to the process. In other embodiments, motion-related information may be obtained from other sources, such as a radar gun deployed at the side of a road, from other vehicles, or the like.
At block 3.4804, the process performs determining the vehicular threat information based on gaze information associated with the user. In some embodiments, the process may consider the direction in which the user is looking when determining the vehicular threat information. For example, the vehicular threat information may depend on whether the user is or is not looking at the first vehicle, as discussed further below.
At block 3.4904, the process performs receiving an indication of a direction in which the user is looking. In some embodiments, an orientation sensor such as a gyroscope or accelerometer may be employed to determine the orientation of the user's head, face, or other body part. In some embodiments, a camera or other image sensing device may track the orientation of the user's eyes.
At block 3.4905, the process performs determining that the user is not looking towards the first vehicle. As noted, the process may track the position of the first vehicle. Given this information, coupled with information about the direction of the user's gaze, the process may determine whether or not the user is (or likely is) looking in the direction of the first vehicle.
At block 3.4906, the process performs, in response to determining that the user is not looking towards the first vehicle, directing the user to look towards the first vehicle. When it is determined that the user is not looking at the first vehicle, the process may warn or otherwise direct the user to look in that direction, such as by saying or otherwise presenting “Look right!”, “Car on your left,” or a similar message.
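The gaze comparison described in the preceding blocks may be sketched as follows. The example assumes that the gaze direction and the bearing to the first vehicle are both available as compass-style angles in degrees, measured clockwise, so that a positive difference means the vehicle lies to the right of the gaze. The function names, the 60-degree field of view, and the “Look left!” message are hypothetical.

```python
def signed_angle_deg(gaze_deg, bearing_deg):
    """Smallest signed angle from the gaze to the bearing, in [-180, 180)."""
    return (bearing_deg - gaze_deg + 180.0) % 360.0 - 180.0

def gaze_prompt(gaze_deg, bearing_deg, field_of_view_deg=60.0):
    """Return a prompt if the vehicle is outside the user's assumed field of view.

    Returns None when the user is (likely) already looking towards the vehicle.
    """
    diff = signed_angle_deg(gaze_deg, bearing_deg)
    if abs(diff) <= field_of_view_deg / 2.0:
        return None
    return "Look right!" if diff > 0 else "Look left!"
```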
At block 3.5004, the process performs identifying multiple threats to the user. The process may in some cases identify multiple potential threats, such as one car approaching the user from behind and another car approaching the user from the left.
At block 3.5005, the process performs identifying a first one of the multiple threats that is more significant than at least one other of the multiple threats. The process may rank, order, or otherwise evaluate the relative significance or risk presented by each of the identified threats. For example, the process may determine that a truck approaching from the right is a bigger risk than a bicycle approaching from behind. On the other hand, if the truck is moving very slowly (thus leaving more time for the truck and/or the user to avoid a collision) compared to the bicycle, the process may instead determine that the bicycle is the bigger risk.
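One way to capture the role of available reaction time in such a ranking is a time-to-collision estimate, sketched below. This is an illustrative ordering criterion only, not necessarily the ranking used by any embodiment. The function names and the example distances and speeds are hypothetical.

```python
import math

def time_to_collision(distance_m, closing_speed_mps):
    """Seconds until contact at the current closing speed (inf if not closing)."""
    if closing_speed_mps <= 0:
        return math.inf
    return distance_m / closing_speed_mps

def most_urgent(threats):
    """threats: dict mapping a threat name to (distance_m, closing_speed_mps)."""
    return min(threats, key=lambda name: time_to_collision(*threats[name]))

# Hypothetical scenario: a slow truck and a faster bicycle.
threats = {"truck": (30.0, 1.0), "bicycle": (20.0, 8.0)}
```

In this hypothetical scenario the truck leaves 30 seconds to react while the bicycle leaves 2.5 seconds, so `most_urgent(threats)` selects the bicycle.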
At block 3.5007, the process performs instructing the user to avoid the first one of the multiple threats. Instructing the user may include outputting a command or suggestion to take (or not take) a particular course of action.
At block 3.5104, the process performs modeling multiple potential accidents that each correspond to one of the multiple threats to determine a collision force associated with each accident. In some embodiments, the process models the physics of various objects to determine potential collisions and possibly their severity and/or likelihood. For example, the process may determine an expected force of a collision based on factors such as object mass, velocity, acceleration, deceleration, or the like.
At block 3.5105, the process performs selecting the first threat based at least in part on which of the multiple accidents has the highest collision force. In some embodiments, the process considers the threat having the highest associated collision force when determining the most significant threat, because that threat will likely result in the greatest injury to the user.
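A collision-force comparison may be sketched with a simple impulse model, in which average force is mass times change in velocity divided by impact duration. This model, the assumed impact duration of 0.1 seconds, and all names and example values below are illustrative assumptions. The described embodiments do not specify a particular physical model.

```python
from dataclasses import dataclass

@dataclass
class Threat:
    name: str
    mass_kg: float
    closing_speed_mps: float

def collision_force(threat, impact_duration_s=0.1):
    """Average force in newtons from an impulse model: F = m * dv / dt.

    Assumes the object's closing speed is fully arrested over impact_duration_s.
    """
    return threat.mass_kg * threat.closing_speed_mps / impact_duration_s

def highest_force_threat(threats):
    """Return the threat whose modeled accident has the highest collision force."""
    return max(threats, key=collision_force)

# Hypothetical threats with illustrative masses and speeds.
example_threats = [
    Threat("bicycle", 100.0, 8.0),
    Threat("car", 1500.0, 20.0),
    Threat("truck", 10000.0, 1.0),
]
```

Under these assumed values, the car yields the highest modeled force, ahead of the slow-moving truck and the bicycle.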
At block 3.5204, the process performs determining a likelihood of an accident associated with each of the multiple threats. In some embodiments, the process associates a likelihood (probability) with each of the multiple threats. Such a probability may be determined with respect to a physical model that represents uncertainty with respect to the mechanics of the various objects that it models.
At block 3.5205, the process performs selecting the first threat based at least in part on which of the multiple threats has the highest associated likelihood. The process may consider the threat having the highest associated likelihood when determining the most significant threat.
At block 3.5304, the process performs determining a mass of an object associated with each of the multiple threats. In some embodiments, the process may consider the mass of threat objects, based on the assumption that those objects having higher mass (e.g., a truck) pose greater threats than those having a low mass (e.g., a pedestrian).
At block 3.5305, the process performs selecting the first threat based at least in part on which of the objects has the highest mass.
At block 3.5404, the process performs selecting the most significant threat from the multiple threats.
At block 3.5504, the process performs determining that an evasive action with respect to the first vehicle poses a threat to some other object. The process may consider whether potential evasive actions pose threats to other objects. For example, the process may analyze whether directing the user to turn right would cause the user to collide with a pedestrian or some fixed object, which may actually result in a worse outcome (e.g., for the user and/or the pedestrian) than colliding with the first vehicle.
At block 3.5505, the process performs instructing the user to take some other evasive action that poses a lesser threat to the some other object. The process may rank or otherwise order evasive actions (e.g., slow down, turn left, turn right) based at least in part on the risks or threats those evasive actions pose to other entities.
At block 3.5604, the process performs identifying multiple threats that each have an associated likelihood and cost. In some embodiments, the process may perform a cost-minimization analysis, in which it considers multiple threats, including threats posed to the user and to others, and selects a course of action that minimizes or reduces expected costs. The process may also consider threats posed by actions taken by the user to avoid other threats.
At block 3.5607, the process performs determining a course of action that minimizes an expected cost with respect to the multiple threats. Expected cost of a threat may be expressed as a product of the likelihood of damage associated with the threat and the cost associated with such damage.
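The expected-cost calculation may be sketched as follows. Each candidate course of action is associated with a list of (likelihood, cost) pairs, one per threat that remains if the action is taken. The function names, the action names, and the numeric values are hypothetical, and costs are in arbitrary units.

```python
def expected_cost(outcomes):
    """Sum of likelihood * cost over the threats associated with one action."""
    return sum(likelihood * cost for likelihood, cost in outcomes)

def best_action(actions):
    """actions: dict mapping an action name to its list of (likelihood, cost) pairs.

    Returns the name of the action with the lowest total expected cost.
    """
    return min(actions, key=lambda name: expected_cost(actions[name]))

# Hypothetical likelihoods and costs for three evasive actions.
candidate_actions = {
    "slow down": [(0.1, 1000.0)],
    "turn left": [(0.5, 500.0)],
    "turn right": [(0.01, 50000.0)],
}
```

Here the expected costs are roughly 100, 250, and 500 units respectively, so the sketch selects “slow down” even though “turn right” has the lowest likelihood of damage.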
At block 3.5804, the process performs identifying multiple threats that are each related to different persons or things. In some embodiments, the process considers risks related to multiple distinct entities, possibly including the user.
At block 3.5904, the process performs identifying multiple threats that are each related to the user. In some embodiments, the process also or only considers risks that are related to the user.
At block 3.6004, the process performs minimizing expected costs to the user posed by the multiple threats. In some embodiments, the process attempts to minimize those costs borne by the user. Note that this may cause the process to recommend a course of action that is not optimal from a societal perspective, such as by directing the user to drive his car over a pedestrian rather than to crash into a car or structure.
At block 3.6104, the process performs minimizing overall expected costs posed by the multiple threats, the overall expected costs being a sum of expected costs borne by the user and other persons/things. In some embodiments, the process attempts to minimize social costs, that is, the costs borne by the various parties to an accident. Note that this may cause the process to recommend a course of action that may have a high cost to the user (e.g., crashing into a wall and damaging the user's car) to spare an even higher cost to another person (e.g., killing a pedestrian).
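The difference between minimizing costs borne by the user and minimizing overall expected costs may be sketched by splitting each cost into a user component and a component borne by others. All names and values below are hypothetical, and costs are in arbitrary units.

```python
def choose_action(actions, include_others=True):
    """Pick the action with the lowest expected cost.

    actions: dict mapping an action name to a list of
             (likelihood, cost_to_user, cost_to_others) tuples.
    include_others: if True, minimize overall (social) expected cost;
                    if False, minimize only the cost borne by the user.
    """
    def total_cost(name):
        total = 0.0
        for likelihood, user_cost, others_cost in actions[name]:
            cost = user_cost + (others_cost if include_others else 0.0)
            total += likelihood * cost
        return total
    return min(actions, key=total_cost)

# Hypothetical scenario with illustrative likelihoods and costs.
scenario = {
    "crash into wall": [(0.9, 20.0, 0.0)],
    "swerve toward pedestrian": [(0.9, 1.0, 1000.0)],
}
```

With these assumed values, `choose_action(scenario, include_others=True)` selects crashing into the wall, while `choose_action(scenario, include_others=False)` selects the swerve, which mirrors the contrast between the two preceding blocks.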
At block 3.6204, the process performs presenting the vehicular threat information via an audio output device of the wearable device. The process may play an alarm, bell, chime, voice message, or the like that warns or otherwise informs the user of the vehicular threat information. The wearable device may include audio speakers operable to output audio signals, including as part of a set of earphones, earbuds, a headset, a helmet, or the like.
At block 3.6304, the process performs presenting the vehicular threat information via a visual display device of the wearable device. In some embodiments, the wearable device includes a display screen or other mechanism for presenting visual information. For example, when the wearable device is a helmet, a face shield of the helmet may be used as a type of heads-up display for presenting the vehicular threat information.
At block 3.6404, the process performs displaying an indicator that instructs the user to look towards the first vehicle. The displayed indicator may be textual (e.g., “Look right!”), iconic (e.g., an arrow), or the like.
At block 3.6504, the process performs displaying an indicator that instructs the user to accelerate, decelerate, and/or turn. An example indicator may be or include the text “Speed up,” “Slow down,” “Turn left,” or similar language.
At block 3.6604, the process performs directing the user to accelerate.
At block 3.6704, the process performs directing the user to decelerate.
At block 3.6804, the process performs directing the user to turn.
At block 3.6904, the process performs transmitting to the first vehicle a warning based on the vehicular threat information. The process may send or otherwise transmit a warning or other message to the first vehicle that instructs the operator of the first vehicle to take evasive action. The instruction to the first vehicle may be complementary to any instructions given to the user, such that if both instructions are followed, the risk of collision decreases. In this manner, the process may help avoid a situation in which the user and the operator of the first vehicle take actions that actually increase the risk of collision, such as may occur when the user and the first vehicle are approaching head-on but do not turn away from one another.
At block 3.7004, the process performs presenting the vehicular threat information via an output device of a vehicle of the user, the output device including a visual display and/or an audio speaker. In some embodiments, the process may use other devices to output the vehicular threat information, such as output devices of a vehicle of the user, including a car stereo, dashboard display, or the like.
At block 3.7404, the process performs presenting the vehicular threat information via goggles worn by the user. The goggles may include a small display, an audio speaker, a haptic output device, or the like.
At block 3.7504, the process performs presenting the vehicular threat information via a helmet worn by the user. The helmet may include an audio speaker or visual output device, such as a display that presents information on the inside of the face screen of the helmet. Other output devices, including haptic devices, are contemplated.
At block 3.7604, the process performs presenting the vehicular threat information via a hat worn by the user. The hat may include an audio speaker or similar output device.
At block 3.7704, the process performs presenting the vehicular threat information via eyeglasses worn by the user. The eyeglasses may include a small display, an audio speaker, a haptic output device, or the like.
At block 3.7804, the process performs presenting the vehicular threat information via audio speakers that are part of at least one of earphones, a headset, earbuds, and/or a hearing aid. The audio speakers may be integrated into the wearable device. In other embodiments, other audio speakers (e.g., of a car stereo) may be employed instead or in addition.
At block 3.7904, the process performs performing the receiving image data, the determining vehicular threat information, and/or the presenting the vehicular threat information on a computing device in the wearable device of the user. In some embodiments, a computing device of or in the wearable device may be responsible for performing one or more of the operations of the process. For example, a computing device situated within a helmet worn by the user may receive and analyze image data to determine and present the vehicular threat information to the user.
At block 3.8004, the process performs performing the receiving image data, the determining vehicular threat information, and/or the presenting the vehicular threat information on a road-side computing system. In some embodiments, an in-situ computing system may be responsible for performing one or more of the operations of the process. For example, a computing system situated at or about a street intersection may receive and analyze image data depicting vehicles that are entering or nearing the intersection. Such an architecture may be beneficial when the wearable device is a “thin” device that does not have sufficient processing power to, for example, determine whether the first vehicle is approaching the user.
At block 3.8005, the process performs transmitting the vehicular threat information from the road-side computing system to the wearable device of the user. For example, when the road-side computing system determines that two vehicles may be on a collision course, the computing system can transmit vehicular threat information to the wearable device so that the user can take evasive action and avoid a possible accident.
At block 3.8104, the process performs performing the receiving image data, the determining vehicular threat information, and/or the presenting the vehicular threat information on a computing system in the first vehicle. In some embodiments, a computing system in the first vehicle performs one or more of the operations of the process. Such an architecture may be beneficial when the wearable device is a “thin” device that does not have sufficient processing power to, for example, determine whether the first vehicle is approaching the user.
At block 3.8105, the process performs transmitting the vehicular threat information from the computing system to the wearable device of the user.
At block 3.8204, the process performs performing the receiving image data, the determining vehicular threat information, and/or the presenting the vehicular threat information on a computing system in a second vehicle, wherein the user is not traveling in the second vehicle. In some embodiments, other vehicles that are not carrying the user and are not the same as the first vehicle may perform one or more of the operations of the process. In general, computing systems/devices situated in or at multiple vehicles, wearable devices, or fixed stations in a roadway may each perform operations related to determining vehicular threat information, which may then be shared with other users and devices to improve traffic flow, avoid collisions, and generally enhance the abilities of users of the roadway.
At block 3.8205, the process performs transmitting the vehicular threat information from the computing system to the wearable device of the user.
At block 3.8304, the process performs receiving data representing an audio signal emitted by the first vehicle. The data representing the audio signal may be raw audio samples, compressed audio data, frequency coefficients, or the like. The data representing the audio signal may represent the sound made by the first vehicle, such as from its engine, a horn, tires, or any other source of sound. The data representing the audio signal may include sounds from other sources, including other vehicles, pedestrians, or the like. The audio signal may be obtained at or about a user who is a pedestrian or who is in a vehicle that is not the first vehicle, either as the operator or a passenger.
At block 3.8306, the process performs determining the vehicular threat information based further on the data representing the audio signal. As discussed further below, determining the vehicular threat information based on audio may include acoustic source localization, frequency analysis, or other techniques that can identify the presence, position, or motion of objects.
At block 3.8404, the process performs receiving data obtained at a microphone array that includes multiple microphones. In some embodiments, a microphone array having two or more microphones is employed to receive audio signals. Differences between the received audio signals may be utilized to perform acoustic source localization or other functions, as discussed further herein.
At block 3.8504, the process performs receiving data obtained at a microphone array, the microphone array coupled to a vehicle of the user. In some embodiments, such as when the user is operating or otherwise traveling in a vehicle of his own (that is not the same as the first vehicle), the microphone array may be coupled or attached to the user's vehicle, such as by having a microphone located at each of the four corners of the user's vehicle.
At block 3.8604, the process performs receiving data obtained at a microphone array, the microphone array coupled to the wearable device. For example, if the wearable device is a helmet, then a first microphone may be located on the left side of the helmet while a second microphone may be located on the right side of the helmet.
At block 3.8704, the process performs acoustic source localization to determine a position of the first vehicle based on multiple audio signals received via multiple microphones. The process may determine a position of the first vehicle by analyzing audio signals received via multiple distinct microphones. For example, engine noise of the first vehicle may have different characteristics (e.g., in volume, in time of arrival, in frequency) as received by different microphones. Differences between the audio signal measured at different microphones may be exploited to determine one or more positions (e.g., points, arcs, lines, regions) at which the first vehicle may be located.
At block 3.8804, the process performs receiving an audio signal via a first one of the multiple microphones, the audio signal representing a sound created by the first vehicle. In one approach, at least two microphones are employed. By measuring differences in the arrival time of an audio signal at the two microphones, the position of the first vehicle may be determined. The determined position may be a point, a line, an area, or the like.
At block 3.8805, the process performs receiving the audio signal via a second one of the multiple microphones.
At block 3.8806, the process performs determining the position of the first vehicle by determining a difference between an arrival time of the audio signal at the first microphone and an arrival time of the audio signal at the second microphone. In some embodiments, given information about the distance between the two microphones and the speed of sound, the process may determine the respective distances between each of the two microphones and the first vehicle. Given these two distances (along with the distance between the microphones), the process can solve for the one or more positions at which the first vehicle may be located.
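As a minimal sketch of one way the arrival-time difference could be used, the following example converts a time difference into a bearing. It relies on a far-field approximation (the source is assumed to be much farther away than the microphone spacing), which is a simplification introduced here and not a technique stated in this disclosure; a single time difference between two microphones otherwise constrains the source only to a hyperbola. The function name and the speed-of-sound constant are assumptions made for the example.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # assumed value for dry air at about 20 degrees C


def bearing_from_tdoa(delta_t_s, mic_spacing_m, c=SPEED_OF_SOUND_M_S):
    """Estimate the bearing of a distant sound source from a time difference.

    delta_t_s is (arrival time at microphone A) minus (arrival time at
    microphone B). The result is in degrees measured from the perpendicular
    to the line joining the microphones; positive values point toward
    microphone B, which heard the sound first.
    """
    if mic_spacing_m <= 0:
        raise ValueError("microphone spacing must be positive")
    path_difference_m = c * delta_t_s
    # Clamp to [-1, 1] so measurement noise cannot push asin out of its domain.
    ratio = max(-1.0, min(1.0, path_difference_m / mic_spacing_m))
    return math.degrees(math.asin(ratio))
```

With microphones 0.2 meters apart, a path difference of 0.1 meters corresponds to a bearing of about 30 degrees toward the microphone that received the signal first.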
At block 3.8904, the process performs triangulating the position of the first vehicle based on a first and second angle, the first angle measured between a first one of the multiple microphones and the first vehicle, the second angle measured between a second one of the multiple microphones and the first vehicle. In some embodiments, the microphones may be directional, in that they may be used to determine the direction from which the sound is coming. Given such information, the process may use triangulation techniques to determine the position of the first vehicle.
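The triangulation described above can be illustrated by intersecting two bearing lines. This sketch assumes that each directional microphone has a known position in a common two-dimensional frame and reports an angle measured counterclockwise from the positive x-axis; those conventions and the function name are hypothetical choices made for the example.

```python
import math


def triangulate(m1, angle1_deg, m2, angle2_deg):
    """Intersect two bearing lines and return the (x, y) crossing point.

    m1 and m2 are microphone positions in meters. Returns None when the
    bearings are parallel and therefore never cross. The sketch intersects
    full lines and does not check that the point lies in front of each
    microphone.
    """
    d1 = (math.cos(math.radians(angle1_deg)), math.sin(math.radians(angle1_deg)))
    d2 = (math.cos(math.radians(angle2_deg)), math.sin(math.radians(angle2_deg)))
    denom = d1[0] * d2[1] - d1[1] * d2[0]  # 2D cross product d1 x d2
    if abs(denom) < 1e-12:
        return None
    rx, ry = m2[0] - m1[0], m2[1] - m1[1]
    # Solve m1 + t*d1 = m2 + s*d2 for t.
    t = (rx * d2[1] - ry * d2[0]) / denom
    return (m1[0] + t * d1[0], m1[1] + t * d1[1])
```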
At block 3.9004, the process performs a Doppler analysis of the data representing the audio signal to determine whether the first vehicle is approaching the user. The process may analyze whether the frequency of the audio signal is shifting in order to determine whether the first vehicle is approaching or departing the position of the user. For example, if the frequency is shifting higher, the first vehicle may be determined to be approaching the user. Note that the determination is typically made from the frame of reference of the user (who may be moving or not). Thus, the first vehicle may be determined to be approaching the user when, as viewed from a fixed frame of reference, the user is approaching the first vehicle (e.g., a moving user traveling towards a stationary vehicle) or the first vehicle is approaching the user (e.g., a moving vehicle approaching a stationary user). In other embodiments, other frames of reference may be employed, such as a fixed frame, a frame associated with the first vehicle, or the like.
At block 3.9104, the process performs determining whether frequency of the audio signal is increasing or decreasing.
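One possible way to decide whether the frequency is increasing or decreasing is to fit a straight line to a short series of dominant-frequency estimates. The sketch below assumes that such estimates (in hertz) have already been produced by an earlier spectral analysis step that is not shown; the function name and the tolerance value are hypothetical.

```python
def frequency_trend(times_s, freqs_hz, tolerance_hz_per_s=0.5):
    """Classify a series of frequency estimates by its least-squares slope.

    Returns "increasing", "decreasing", or "steady". The tolerance is an
    assumed dead band that keeps small fluctuations from being reported.
    """
    n = len(times_s)
    if n < 2 or n != len(freqs_hz):
        raise ValueError("need at least two matching time/frequency samples")
    mean_t = sum(times_s) / n
    mean_f = sum(freqs_hz) / n
    var_t = sum((t - mean_t) ** 2 for t in times_s)
    if var_t == 0.0:
        raise ValueError("timestamps must not all be identical")
    slope = sum((t - mean_t) * (f - mean_f)
                for t, f in zip(times_s, freqs_hz)) / var_t
    if slope > tolerance_hz_per_s:
        return "increasing"
    if slope < -tolerance_hz_per_s:
        return "decreasing"
    return "steady"
```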
At block 3.9204, the process performs a volume analysis of the data representing the audio signal to determine whether the first vehicle is approaching the user. The process may analyze whether the volume (e.g., amplitude) of the audio signal is shifting in order to determine whether the first vehicle is approaching or departing the position of the user. As noted, different embodiments may use different frames of reference when making this determination.
At block 3.9304, the process performs determining whether volume of the audio signal is increasing or decreasing.
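As a non-limiting illustration, the volume comparison could be sketched by computing a root-mean-square (RMS) level for successive frames of raw audio samples and comparing the last frame with the first. The frame representation (lists of floating-point samples), the function names, and the 1.1 ratio threshold are assumptions made for the example.

```python
import math


def rms(frame):
    """Root-mean-square level of one non-empty frame of audio samples."""
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def volume_trend(frames, min_ratio=1.1):
    """Compare the RMS level of the last frame with that of the first.

    Returns "increasing", "decreasing", or "steady". min_ratio is an assumed
    threshold that ignores level changes smaller than about ten percent.
    """
    first, last = rms(frames[0]), rms(frames[-1])
    if first == 0.0:
        return "increasing" if last > 0.0 else "steady"
    ratio = last / first
    if ratio >= min_ratio:
        return "increasing"
    if ratio <= 1.0 / min_ratio:
        return "decreasing"
    return "steady"
```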
At block 3.9404, the process performs receiving data representing the first vehicle obtained at a road-based device. In some embodiments, the process may also consider data received from devices that are located in or about the roadway traveled by the user. Such devices may include cameras, loop coils, motion sensors, and the like.
At block 3.9406, the process performs determining the vehicular threat information based further on the data representing the first vehicle. For example, the process may determine that a car is approaching the user by analyzing an image taken from a camera that is mounted on or near a traffic signal over an intersection. As another example, the process may determine the speed of a vehicle with reference to data obtained from a radar gun/detector.
At block 3.9504, the process performs receiving the data from a sensor deployed at an intersection. Various types of sensors are contemplated, including cameras, range sensors (e.g., sonar, radar, LIDAR, IR-based), magnetic coils, audio sensors, or the like.
At block 3.9604, the process performs receiving an image of the first vehicle from a camera deployed at an intersection. For example, the process may receive images from a camera that is fixed to a traffic light or other signal at an intersection.
At block 3.9704, the process performs receiving ranging data from a range sensor deployed at an intersection, the ranging data representing a distance between the first vehicle and the intersection. For example, the process may receive a distance (e.g., 75 meters) measured between some known point in the intersection (e.g., the position of the range sensor) and an oncoming vehicle.
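To illustrate how such ranging data might be used, the following sketch estimates how soon an oncoming vehicle would reach the reference point from two successive range readings. It assumes a constant closing speed between readings; the function name and parameters are hypothetical.

```python
def time_to_arrival(range_then_m, range_now_m, elapsed_s):
    """Estimate seconds until the vehicle reaches the range sensor's reference point.

    range_then_m and range_now_m are two distance readings taken elapsed_s
    seconds apart. Returns None if the vehicle is not getting closer.
    """
    if elapsed_s <= 0:
        raise ValueError("elapsed time must be positive")
    closing_speed_m_s = (range_then_m - range_now_m) / elapsed_s
    if closing_speed_m_s <= 0:
        return None
    return range_now_m / closing_speed_m_s
```

For example, readings of 75 meters and then 60 meters taken one second apart imply a closing speed of 15 meters per second and an arrival in about four seconds.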
At block 3.9804, the process performs receiving data from an induction loop deployed in a road surface, the induction loop configured to detect the presence and/or velocity of the first vehicle. Induction loops may be embedded in the roadway and configured to detect the presence of vehicles passing over them. Some types of loops and/or processing may be employed to detect other information, including velocity, vehicle size, and the like.
At block 3.9904, the process performs identifying the first vehicle in an image obtained from the road-based sensor. Image processing techniques may be employed to identify the presence of a vehicle, its type (e.g., car or truck), its size, or other information.
At block 3.10004, the process performs determining a trajectory of the first vehicle based on multiple images obtained from the road-based device. In some embodiments, a video feed or other sequence of images may be analyzed to determine the position, speed, and/or direction of travel of the first vehicle.
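A minimal sketch of the trajectory determination follows. It assumes that an earlier image processing step (not shown) has already located the first vehicle in each image and mapped it to ground-plane coordinates in meters, with x pointing east and y pointing north; the function name and the track format are hypothetical.

```python
import math


def estimate_motion(track):
    """Estimate speed and heading from a timestamped position track.

    track is a list of (t_s, x_m, y_m) tuples ordered by time. Returns
    (speed_m_s, heading_deg), where the heading is measured clockwise from
    north. Only the first and last observations are used.
    """
    if len(track) < 2:
        raise ValueError("need at least two observations")
    t0, x0, y0 = track[0]
    t1, x1, y1 = track[-1]
    dt = t1 - t0
    if dt <= 0:
        raise ValueError("observations must span a positive time interval")
    dx, dy = x1 - x0, y1 - y0
    speed = math.hypot(dx, dy) / dt
    heading = math.degrees(math.atan2(dx, dy)) % 360.0
    return speed, heading
```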
At block 3.10104, the process performs receiving data representing vehicular threat information relevant to a second vehicle, the second vehicle not being used for travel by the user. As noted, vehicular threat information may in some embodiments be shared amongst vehicles and entities present in a roadway. For example, a vehicle that is traveling just ahead of the user may determine that it is threatened by the first vehicle. This information may be shared with the user so that the user can also take evasive action, such as by slowing down or changing course.
At block 3.10106, the process performs determining the vehicular threat information based on the data representing vehicular threat information relevant to the second vehicle. Having received vehicular threat information from the second vehicle, the process may determine that it is also relevant to the user, and then accordingly present it to the user.
At block 3.10204, the process performs receiving from the second vehicle an indication of stalled or slow traffic encountered by the second vehicle. Various types of threat information relevant to the second vehicle may be provided to the process, such as that there is stalled or slow traffic ahead of the second vehicle.
At block 3.10304, the process performs receiving from the second vehicle an indication of poor driving conditions experienced by the second vehicle. The second vehicle may share the fact that it is experiencing poor driving conditions, such as an icy or wet roadway.
At block 3.10404, the process performs receiving from the second vehicle an indication that the first vehicle is driving erratically. The second vehicle may share a determination that the first vehicle is driving erratically, such as by swerving, driving with excessive speed, driving too slowly, or the like.
At block 3.10504, the process performs receiving from the second vehicle an image of the first vehicle. The second vehicle may include one or more cameras, and may share images obtained via those cameras with other entities.
At block 3.10604, the process performs transmitting the vehicular threat information to a second vehicle. As noted, vehicular threat information may in some embodiments be shared amongst vehicles and entities present in a roadway. In this example, the vehicular threat information is transmitted to a second vehicle (e.g., one following behind the user), so that the second vehicle may benefit from the determined vehicular threat information as well.
At block 3.10704, the process performs transmitting the vehicular threat information to an intermediary server system for distribution to other vehicles in proximity to the user. In some embodiments, intermediary systems may operate as relays for sharing the vehicular threat information with other vehicles and users of a roadway.
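One way an intermediary system could select vehicles "in proximity to the user" is a great-circle distance filter. The sketch below assumes that the intermediary holds a latitude/longitude pair for each registered vehicle; the function names, the dictionary format, and the 500 meter radius are hypothetical.

```python
import math

EARTH_RADIUS_M = 6371000.0  # mean Earth radius


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two latitude/longitude points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def recipients_in_range(origin, vehicles, radius_m=500.0):
    """Return the sorted ids of vehicles within radius_m of origin.

    origin is (lat, lon); vehicles maps a vehicle id to its (lat, lon).
    """
    return sorted(
        vid for vid, (lat, lon) in vehicles.items()
        if haversine_m(origin[0], origin[1], lat, lon) <= radius_m
    )
```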
At block 3.10804, the process performs transmitting the vehicular threat information to a law enforcement entity. In some embodiments, the process shares the vehicular threat information with law enforcement entities, including computer or other information systems managed or operated by such entities. For example, if the process determines that the first vehicle is driving erratically, the process may transmit that determination and/or information about the first vehicle to the police.
At block 3.10904, the process performs determining a license plate identifier of the first vehicle based on the image data. The process may perform image processing (e.g., optical character recognition) to determine the license number on the license plate of the first vehicle.
At block 3.10905, the process performs transmitting the license plate identifier to the law enforcement entity.
At block 3.11004, the process performs determining a vehicle description of the first vehicle based on the image data. Image processing may be utilized to determine a vehicle description, including one or more of type, make, year, and/or color of the first vehicle.
At block 3.11005, the process performs transmitting the vehicle description to the law enforcement entity.
At block 3.11104, the process performs determining a location associated with the first vehicle. The process may reference a GPS system to determine the current location of the user and/or the first vehicle, and then provide an indication of that location to the police or other agency. The location may be or include a coordinate, a street or intersection name, a name of a municipality, or the like.
At block 3.11105, the process performs transmitting an indication of the location to the law enforcement entity.
At block 3.11204, the process performs determining a direction of travel of the first vehicle. As discussed above, the process may determine direction of travel in various ways, such as by modeling the motion of the first vehicle. Such a direction may then be provided to the police or other agency, such as by reporting that the first vehicle is traveling northbound.
At block 3.11205, the process performs transmitting an indication of the direction of travel to the law enforcement entity.
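The items transmitted to the law enforcement entity in the preceding blocks could, as one hypothetical arrangement, be gathered into a single record. The sketch below also shows how a numeric heading might be turned into a label such as "northbound". The field names, the eight-way compass division, and the function names are assumptions made for the example and do not reflect any defined reporting format.

```python
def cardinal_direction(heading_deg):
    """Map a heading in degrees clockwise from north to a label such as "northbound"."""
    names = [
        "northbound", "northeastbound", "eastbound", "southeastbound",
        "southbound", "southwestbound", "westbound", "northwestbound",
    ]
    # Each label covers a 45 degree sector centered on its compass direction.
    index = int(((heading_deg % 360.0) + 22.5) // 45.0) % 8
    return names[index]


def build_report(plate, description, location, heading_deg):
    """Assemble a hypothetical report record for a law enforcement system."""
    return {
        "license_plate": plate,
        "vehicle_description": description,
        "location": location,
        "direction_of_travel": cardinal_direction(heading_deg),
    }
```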
3. Example Computing System Implementation
Note that one or more general purpose or special purpose computing systems/devices may be used to implement the AEFS 100. In addition, the computing system 400 may comprise one or more distinct computing systems/devices and may span distributed locations. Furthermore, each block shown may represent one or more such blocks as appropriate to a specific embodiment or may be combined with other blocks. Also, the AEFS 100 may be implemented in software, hardware, firmware, or in some combination to achieve the capabilities described herein.
In the embodiment shown, computing system 400 comprises a computer memory (“memory”) 401, a display 402, one or more Central Processing Units (“CPU”) 403, Input/Output devices 404 (e.g., keyboard, mouse, CRT or LCD display, and the like), other computer-readable media 405, and network connections 406. The AEFS 100 is shown residing in memory 401. In other embodiments, some portion of the contents, some or all of the components of the AEFS 100 may be stored on and/or transmitted over the other computer-readable media 405. The components of the AEFS 100 preferably execute on one or more CPUs 403 and implement techniques described herein. Other code or programs 430 (e.g., an administrative interface, a Web server, and the like) and potentially other data repositories, such as data repository 420, also reside in the memory 401, and preferably execute on one or more CPUs 403. Of note, one or more of the components in
The AEFS 100 interacts via the network 450 with wearable devices 120, information sources 130, and third-party systems/applications 455. The network 450 may be any combination of media (e.g., twisted pair, coaxial, fiber optic, radio frequency), hardware (e.g., routers, switches, repeaters, transceivers), and protocols (e.g., TCP/IP, UDP, Ethernet, Wi-Fi, WiMAX) that facilitate communication between remotely situated humans and/or devices. The third-party systems/applications 455 may include any systems that provide data to, or utilize data from, the AEFS 100, including Web browsers, vehicle-based client systems, traffic tracking, monitoring, or prediction systems, and the like.
The AEFS 100 is shown executing in the memory 401 of the computing system 400. Also included in the memory are a user interface manager 415 and an application program interface (“API”) 416. The user interface manager 415 and the API 416 are drawn in dashed lines to indicate that in other embodiments, functions performed by one or more of these components may be performed externally to the AEFS 100.
The UI manager 415 provides a view and a controller that facilitate user interaction with the AEFS 100 and its various components. For example, the UI manager 415 may provide interactive access to the AEFS 100, such that users can configure the operation of the AEFS 100, such as by providing the AEFS 100 with information about common routes traveled, vehicle types used, driving patterns, or the like. The UI manager 415 may also manage and/or implement various output abstractions, such that the AEFS 100 can cause vehicular threat information to be displayed on different media, devices, or systems. In some embodiments, access to the functionality of the UI manager 415 may be provided via a Web server, possibly executing as one of the other programs 430. In such embodiments, a user operating a Web browser executing on one of the third-party systems 455 can interact with the AEFS 100 via the UI manager 415.
The API 416 provides programmatic access to one or more functions of the AEFS 100. For example, the API 416 may provide a programmatic interface to one or more functions of the AEFS 100 that may be invoked by one of the other programs 430 or some other module. In this manner, the API 416 facilitates the development of third-party software, such as user interfaces, plug-ins, adapters (e.g., for integrating functions of the AEFS 100 into vehicle-based client systems or devices), and the like.
In addition, the API 416 may be in at least some embodiments invoked or otherwise accessed via remote entities, such as code executing on one of the wearable devices 120, information sources 130, and/or one of the third-party systems/applications 455, to access various functions of the AEFS 100. For example, an information source 130 such as a radar gun installed at an intersection may push motion-related information (e.g., velocity) about vehicles to the AEFS 100 via the API 416. As another example, a weather information system may push current conditions information (e.g., temperature, precipitation) to the AEFS 100 via the API 416. The API 416 may also be configured to provide management widgets (e.g., code modules) that can be integrated into the third-party applications 455 and that are configured to interact with the AEFS 100 to make at least some of the described functionality available within the context of other applications (e.g., mobile apps).
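By way of example only, information pushed to the API 416 by an information source 130 could be carried as a small JSON message. The message layout below (the "kind", "vehicle_id", "velocity_m_s", "temperature_c", and "precipitation" fields) and the function name are hypothetical and are not defined by this disclosure; the sketch merely shows how a receiving component might validate and normalize such a message.

```python
import json


def parse_push(payload):
    """Parse a pushed JSON message into a normalized dictionary.

    Two hypothetical message kinds are handled: "motion" (e.g., from a radar
    gun) and "conditions" (e.g., from a weather information system).
    """
    message = json.loads(payload)
    kind = message.get("kind")
    if kind == "motion":
        return {
            "kind": kind,
            "vehicle_id": str(message["vehicle_id"]),
            "velocity_m_s": float(message["velocity_m_s"]),
        }
    if kind == "conditions":
        return {
            "kind": kind,
            "temperature_c": float(message["temperature_c"]),
            "precipitation": str(message["precipitation"]),
        }
    raise ValueError(f"unsupported message kind: {kind!r}")
```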
In an example embodiment, components/modules of the AEFS 100 are implemented using standard programming techniques. For example, the AEFS 100 may be implemented as a “native” executable running on the CPU 403, along with one or more static or dynamic libraries. In other embodiments, the AEFS 100 may be implemented as instructions processed by a virtual machine that executes as one of the other programs 430. In general, a range of programming languages known in the art may be employed for implementing such example embodiments, including representative implementations of various programming language paradigms, including but not limited to, object-oriented (e.g., Java, C++, C#, Visual Basic.NET, Smalltalk, and the like), functional (e.g., ML, Lisp, Scheme, and the like), procedural (e.g., C, Pascal, Ada, Modula, and the like), scripting (e.g., Perl, Ruby, Python, JavaScript, VBScript, and the like), and declarative (e.g., SQL, Prolog, and the like).
The embodiments described above may also use either well-known or proprietary synchronous or asynchronous client-server computing techniques. Also, the various components may be implemented using more monolithic programming techniques, for example, as an executable running on a single CPU computer system, or alternatively decomposed using a variety of structuring techniques known in the art, including but not limited to, multiprogramming, multithreading, client-server, or peer-to-peer, running on one or more computer systems each having one or more CPUs. Some embodiments may execute concurrently and asynchronously, and communicate using message passing techniques. Equivalent synchronous embodiments are also supported. Also, other functions could be implemented and/or performed by each component/module, and in different orders, and by different components/modules, yet still achieve the described functions.
In addition, programming interfaces to the data stored as part of the AEFS 100, such as in the data store 420 (or 240), can be available by standard mechanisms such as through C, C++, C#, and Java APIs; libraries for accessing files, databases, or other data repositories; through markup languages such as XML; or through Web servers, FTP servers, or other types of servers providing access to stored data. The data store 420 may be implemented as one or more database systems, file systems, or any other technique for storing such information, or any combination of the above, including implementations using distributed computing techniques.
Different configurations and locations of programs and data are contemplated for use with techniques described herein. A variety of distributed computing techniques are appropriate for implementing the components of the illustrated embodiments in a distributed manner including but not limited to TCP/IP sockets, RPC, RMI, HTTP, Web Services (XML-RPC, JAX-RPC, SOAP, and the like). Other variations are possible. Also, other functionality could be provided by each component/module, or existing functionality could be distributed amongst the components/modules in different ways, yet still achieve the functions described herein.
Furthermore, in some embodiments, some or all of the components of the AEFS 100 may be implemented or provided in other manners, such as at least partially in firmware and/or hardware, including, but not limited to one or more application-specific integrated circuits (“ASICs”), standard integrated circuits, controllers executing appropriate instructions, and including microcontrollers and/or embedded controllers, field-programmable gate arrays (“FPGAs”), complex programmable logic devices (“CPLDs”), and the like. Some or all of the system components and/or data structures may also be stored as contents (e.g., as executable or other machine-readable software instructions or structured data) on a computer-readable medium (e.g., as a hard disk; a memory; a computer network or cellular wireless network or other data transmission medium; or a portable media article to be read by an appropriate drive or via an appropriate connection, such as a DVD or flash memory device) so as to enable or configure the computer-readable medium and/or one or more associated computing systems or devices to execute or otherwise use or provide the contents to perform at least some of the described techniques. Some or all of the components and/or data structures may be stored on tangible, non-transitory storage mediums. Some or all of the system components and data structures may also be stored as data signals (e.g., by being encoded as part of a carrier wave or included as part of an analog or digital propagated signal) on a variety of computer-readable transmission mediums, which are then transmitted, including across wireless-based and wired/cable-based mediums, and may take a variety of forms (e.g., as part of a single or multiplexed analog signal, or as multiple discrete digital packets or frames). Such computer program products may also take other forms in other embodiments. Accordingly, embodiments of this disclosure may be practiced with other computer system configurations.
From the foregoing it will be appreciated that, although specific embodiments have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope of this disclosure. For example, the methods, techniques, and systems for ability enhancement are applicable to other architectures or in other settings. For example, instead of providing vehicular threat information to human users who are vehicle operators or pedestrians, some embodiments may provide such information to control systems that are installed in vehicles and that are configured to automatically take action to avoid collisions in response to such information. Also, the methods, techniques, and systems discussed herein are applicable to differing protocols, communication media (optical, wireless, cable, etc.) and devices (e.g., desktop computers, wireless handsets, electronic organizers, personal digital assistants, tablet computers, portable email machines, game machines, pagers, navigation devices, etc.).
The present application is related to and claims the benefit of the earliest available effective filing date(s) from the following listed application(s) (the “Related Applications”) (e.g., claims earliest available priority dates for other than provisional patent applications or claims benefits under 35 USC §119(e) for provisional patent applications, for any and all parent, grandparent, great-grandparent, etc. applications of the Related Application(s)). All subject matter of the Related Applications and of any and all parent, grandparent, great-grandparent, etc. applications of the Related Applications is incorporated herein by reference to the extent such subject matter is not inconsistent herewith. For purposes of the USPTO extra-statutory requirements, the present application constitutes a continuation-in-part of U.S. patent application Ser. No. 13/309,248, entitled AUDIBLE ASSISTANCE, filed 1 Dec. 2011, which is currently co-pending, or is an application of which a currently co-pending application is entitled to the benefit of the filing date. For purposes of the USPTO extra-statutory requirements, the present application constitutes a continuation-in-part of U.S. patent application Ser. No. 13/324,232, entitled VISUAL PRESENTATION OF SPEAKER-RELATED INFORMATION, filed 13 Dec. 2011, which is currently co-pending, or is an application of which a currently co-pending application is entitled to the benefit of the filing date. For purposes of the USPTO extra-statutory requirements, the present application constitutes a continuation-in-part of U.S. patent application Ser. No. 13/340,143, entitled LANGUAGE TRANSLATION BASED ON SPEAKER-RELATED INFORMATION, filed 29 Dec. 2011, which is currently co-pending, or is an application of which a currently co-pending application is entitled to the benefit of the filing date. For purposes of the USPTO extra-statutory requirements, the present application constitutes a continuation-in-part of U.S. patent application Ser. No. 13/356,419, entitled ENHANCED VOICE CONFERENCING, filed 23 Jan. 2012, which is currently co-pending, or is an application of which a currently co-pending application is entitled to the benefit of the filing date. For purposes of the USPTO extra-statutory requirements, the present application constitutes a continuation-in-part of U.S. patent application Ser. No. 13/362,823, entitled VEHICULAR THREAT DETECTION BASED ON AUDIO SIGNALS, filed 31 Jan. 2012, which is currently co-pending, or is an application of which a currently co-pending application is entitled to the benefit of the filing date. For purposes of the USPTO extra-statutory requirements, the present application constitutes a continuation-in-part of U.S. patent application Ser. No. 13/397,289, entitled ENHANCED VOICE CONFERENCING WITH HISTORY, filed 15 Feb. 2012, which is currently co-pending, or is an application of which a currently co-pending application is entitled to the benefit of the filing date.
Number | Date | Country |
---|---|---|
20130142393 A1 | Jun 2013 | US |
Relation | Number | Date | Country |
---|---|---|---|
Parent | 13309248 | Dec 2011 | US |
Child | 13407570 | | US |
Parent | 13324232 | Dec 2011 | US |
Child | 13309248 | | US |
Parent | 13340143 | Dec 2011 | US |
Child | 13324232 | | US |
Parent | 13356419 | Jan 2012 | US |
Child | 13340143 | | US |
Parent | 13362823 | Jan 2012 | US |
Child | 13356419 | | US |
Parent | 13397289 | Feb 2012 | US |
Child | 13362823 | | US |