This application claims priority to and the benefit of European Patent Application No. EP 13182410.4, filed on Aug. 30, 2013, pending. The entire disclosure of the above application is expressly incorporated by reference herein.
The present disclosure relates to an audio rendering system comprising at least one portable terminal configured to receive geospatial object data from at least one geospatial object data server. The geospatial object data is interrelated to a geographical position. The at least one portable terminal is further configured to render retrieved geospatial object data into an acoustic scene by a rendering algorithm. The acoustic scene is spatially interrelated to the geographical position. The system further comprises at least one audio unit configured to sound rendered acoustic scene information into at least one ear of a user. The audio rendering system is further configured for rendering retrieved geospatial object data into the acoustic scene based on categorised acoustic scene information representing corresponding categorised geospatial object data.
Walking and navigating in a geographical environment is, for most people, not considered challenging, but for a visually impaired person it is a complicated and time-consuming challenge.
Since it is challenging for a visually impaired person to walk and navigate in a geographical environment, many of these people are prevented from having a “normal life”, including having a job, going to school, going out shopping, visiting friends and family, etc. Many visually impaired people suffer from depression and low self-confidence because they are afraid of leaving their home.
Guide dogs and canes have long been the staple assistive devices used by the blind community when navigating city streets. More recently, GPS has broadened the possibilities for autonomous exploration. A visually impaired person may use a GPS for navigating and for planning a route from one place to another. Unfortunately, these systems do not comprise a sufficient amount of detail regarding the geographical environment entangled by the planned route, which makes it uncomfortable for a visually impaired person to navigate in that environment. Furthermore, today's GPS systems guide a person from a start to a finish destination by a voice guide, but do not comprise an audio representation of the geographical environment surrounding the user.
Considerable research has been invested in using spatialised audio to navigate or render waypoints and points of interest (POI) information, but the resulting systems require the use of bulky, expensive or custom hardware and are thus not well-suited for wide deployment. Many research systems also depend on proprietary POI databases that cover only a small area and are therefore not easy to generalise to multiple cities or countries. The confluence of advanced smartphone technology and widely available geospatial databases offers the opportunity for a fundamentally different approach.
The current generation of smartphones is sufficiently powerful to render multiple sounds of spatialised audio, and the quality and physical size of today's GPS antennas, accelerometers and other sensors allow for a complete audio augmented reality system which is useful and enriching to the blind community. Our objective is to create a solution usable by simply installing a piece of software on a widely available device and by using an audio unit able to detect the orientation of the user's head.
US2012053826A discloses a navigation system which helps users navigate through an environment by a plurality of sensors. The sensors include one or both of short and long range sensors that detect objects within the user's environment. Information obtained from the sensors' detection of objects within the user's environment can be used to help the user avoid colliding with objects within the environment and to help navigate the user to a destination. The navigation system may provide the user with audible feedback regarding the objects within the user's environment and/or instructions regarding how to avoid colliding with an object and how to navigate to a destination.
US2012268563A discloses that a person is provided with the ability to auditorily determine the spatial geometry of his current physical environment. A spatial map of the current physical environment of the person is generated. The spatial map is then used to generate a spatialized audio representation of the environment. The spatialized audio representation is then output to a stereo listening device which is being worn by the person.
An objective is achieved by an audio rendering system comprising at least one portable terminal configured to receive geospatial object data from at least one geospatial object data server, the geospatial object data being interrelated to a geographical position. The at least one portable terminal is further configured to render retrieved geospatial object data into an acoustic scene by a rendering algorithm. The acoustic scene is spatially interrelated to the geographical position in such a way that the acoustic scene is perceived as observed from the geographical position. The audio rendering system further comprises at least one audio unit configured to sound rendered acoustic scene information into at least one ear of a user. The audio rendering system is further configured for rendering retrieved geospatial object data into the acoustic scene based on categorised acoustic scene information representing corresponding categorised geospatial object data.
Thereby, what is provided is an audio rendering system that overcomes problems of the prior art by providing a 3D acoustic scene which may be translated in the mind of a user into a picture of a virtual geographical environment representing the real geographical environment surrounding the user. For example, this gives the user, e.g. a visually impaired person, a better impression of the geographical environment surrounding the user, and this would cause the visually impaired person to be more exploring and comfortable in a geographical environment by increasing the amount of insight into the surroundings and reducing the amount of time spent going from one place to another.
The portable terminal may be configured to transmit rendered acoustic scene information to an audio unit, wherein the audio unit may be configured to recreate the rendered acoustic information into a 3D sound and emit the 3D sound. The emitted 3D sound may create a 3D scene for a user.
In one or more embodiments the portable terminal may be a smart phone, a laptop, a tablet, a headset with a built-in processor and wireless connection, or another intelligent electronic processor device. The portable terminal may be configured to comprise rendered acoustic information, wherein rendered acoustic information may include an acoustic scene augmenting a geographical environment. The geographical environment may be a school area, a street, a local park, an inner city, a boat, a building and/or indoor constructions, etc. The portable terminal may at least include 2G, 3G, 4G and/or 5G wireless network connectivity, a GPS unit, an orientation unit, a communication interface and a display unit. The orientation unit may include a gyroscope, an accelerometer and/or an electronic compass. A communication interface may receive and/or transmit acoustic information, an acoustic scene, rendered acoustic scene information and/or recorded acoustic information.
The audio rendering system comprises an audio unit, wherein the audio unit may comprise at least one speaker, a headband or a neckband, a geographical position unit and a geographical orientation unit. Furthermore, the audio unit may comprise at least one microphone.
The geospatial object data may include geographical coordinates of the related first geospatial object. Furthermore, the geospatial object data may include at least a second geographical coordinate of at least a second geospatial object being within a distance range of the first geospatial object.
The geospatial object data may be dynamical data, that is, data representing the coordinates of a moving object, such as a bus, a train or any kind of public transport. Furthermore, a sign, such as a bus sign, a road sign etc., may comprise an in-built GPS transmitter transmitting geographical coordinates, denoted as dynamical data, to a server whenever the sign is moved. This makes it possible to render the sign into an acoustic scene no matter which geographical position the sign has attained.
The acoustic scene may comprise categorised acoustic scene information including a specific sound denoting the interrelated geospatial object. Furthermore, the acoustic scene may comprise at least one categorised acoustic scene background sound. The categorised acoustic scene background sound may be automatically configured by the portable terminal based on the categorised acoustic scene information. A user of the portable terminal may also generate a categorised acoustic scene background sound by recording a sound.
The categorisation of a categorised geospatial object data, categorised acoustic scene information and a categorised acoustic scene background sound may be carried out by a user or by a categorisation algorithm implemented in the audio rendering system.
It is understood that in a 3D acoustic scene, the audio unit may provide directional information about geospatial objects in the universe or the acoustic scene, according to the location of the user.
The audio rendering system comprises categorised geospatial object data and is configured to render categorised acoustic scene information sounding a distinguishing sound representing at least one category.
Thereby, the user of the audio rendering system, receiving at least one piece of rendered acoustic scene information, is able to distinguish between geospatial objects categorised in different categories.
For example, a visually impaired user would be able to distinguish between different categorised geospatial objects placed within both short and long distances from the user by listening to the distinguishing rendered acoustic scene information. Today, a visually impaired person may listen to sonic sounds which are interpreted as a certain object by the person; this is done at short distances using a cane. Listening to the distinguishing rendered acoustic scene information, compared to just listening to sonic sounds, gives the user a longer response time to react to the geospatial object, whether it is public transport, a building, a sign, or any kind of geospatial object having geographical coordinates.
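As an illustration only, such a category-to-distinguishing-sound relation may be realised as a simple lookup from a geospatial object's category to a sound; the category names and sound identifiers in the following sketch are illustrative assumptions and not values taken from the present disclosure:

```python
# Minimal sketch (illustrative only): map a geospatial object's category to a
# distinguishing sound. Category names and file names are assumptions.
CATEGORY_SOUNDS = {
    "signs regulating traffic": "high_pitch_beep.wav",
    "public transport": "bus_chime.wav",
    "church": "church_bell.wav",
    "hole in the ground": "radar_beep.wav",
}

def distinguishing_sound(category: str, default: str = "generic_tone.wav") -> str:
    """Return the sound representing a categorised geospatial object."""
    return CATEGORY_SOUNDS.get(category, default)

print(distinguishing_sound("church"))  # -> church_bell.wav
```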
The audio rendering system including rendered acoustic scene information may comprise at least one 3D sound configured to sound at least one distinguishing acoustic scene, wherein the at least one acoustic scene is spatially interrelated to at least one geographical position.
The audio rendering system including rendered acoustic scene information may comprise at least one 3D sound configured to sound at least three distinguishing acoustic scenes, wherein the at least three acoustic scenes are spatially interrelated to at least one geographical position, respectively.
Thereby, the user may be able to orientate according to the 3D sound and be attracted by at least one piece of rendered acoustic scene information leading the user towards a geospatial object spatially interrelated to the at least one piece of rendered acoustic scene information. This gives the user a better opportunity of orienting according to an audio scene representing a geographical environment.
The audio rendering system may include an audio unit comprising a geographical position unit configured to estimate the geographical position of the audio unit.
A user wearing the portable terminal and the audio unit may experience a 3D acoustic scene comprising a plurality of acoustic scene objects. When the user is moving away from a geospatial object being augmented by an acoustic scene, the user will experience that the sound level of the acoustic scene changes, thereby causing a change in the 3D acoustic scene with respect to the estimated geographical position of the audio unit.
It is understood that in a 3D acoustic scene the audio unit may provide directional information about a geospatial object in the geographical environment according to where the user is.
A person skilled in the art will easily implement a 2D universe also with directional information, and in principle also a 1D universe.
In one or more embodiments the geographical position unit may comprise a global positioning system (GPS) unit for receiving a satellite signal for determining and/or providing the geographical position of the audio unit. Throughout the present disclosure, the term GPS-unit is used to designate a receiver of satellite signals of any satellite navigation system that provides location and time information anywhere on or near the Earth, such as the satellite navigation system maintained by the United States government and freely accessible to anyone with a GPS receiver and typically designated “the GPS-system”, the Russian GLObal NAvigation Satellite System (GLONASS), the European Union Galileo navigation system, the Chinese Compass navigation system, the Indian Regional Navigational Satellite System, etc., and also including augmented GPS, such as StarFire, Omnistar, the Indian GPS Aided Geo Augmented Navigation (GAGAN), the European Geostationary Navigation Overlay Service (EGNOS), the Japanese Multifunctional Satellite Augmentation System (MSAS), etc.
In one or more embodiments the geographical position unit is a WiFi network with different stations or fixed points and means for determining a position by triangulation or similar geometrical functions.
A user moving around in the local environment would experience a spatial interrelation between the audio unit and the plurality of geospatial objects, since when the user is moving towards or away from a geospatial object the user would experience a change of the 3D acoustic scene according to his/her position, e.g. the sound level of the acoustic scene would decrease when the user is moving away from the zone.
Again, the audio unit may provide directional information about the geospatial objects according to where the user is.
The audio rendering system's audio unit comprises a geographical orientation unit for estimating a geographical orientation of a user when the user operates the orientation unit in its intended operational position.
A user wearing a portable terminal and the audio unit would experience an improved spatial interrelation, since the 3D acoustic scene would change according to his/her position and orientation in the local environment, e.g. when the user is moving away from a geospatial object the user would experience that the sound level of the acoustic scene changes. If the user changes his/her orientation, the user would experience a change of the sound levels of the acoustic scene, e.g. when the user changes attention from a first geospatial object to a second geospatial object, the sound level of the second acoustic scene interrelating to the second geospatial object would be higher than the sound level of the first acoustic scene interrelating to the first geospatial object. Thereby, since the 3D acoustic scene depends on both the position and the orientation, the spatial interrelation between a geospatial object and the audio unit is further improved.
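As a minimal sketch of this position- and orientation-dependent behaviour (the gain model, reference distance and angular weighting are illustrative assumptions, not a prescribed implementation), the sound level of an acoustic scene object may be derived from the distance to the object and from the angle between the user's heading and the bearing towards the object:

```python
import math

def bearing_deg(user_lat, user_lon, obj_lat, obj_lon):
    """Approximate bearing from the user towards a geospatial object (degrees, 0 = north)."""
    d_lon = math.radians(obj_lon - user_lon)
    lat1, lat2 = math.radians(user_lat), math.radians(obj_lat)
    x = math.sin(d_lon) * math.cos(lat2)
    y = math.cos(lat1) * math.sin(lat2) - math.sin(lat1) * math.cos(lat2) * math.cos(d_lon)
    return math.degrees(math.atan2(x, y)) % 360.0

def orientation_gain(heading_deg, bearing, distance_m, focus_boost=2.0, ref_distance_m=10.0):
    """Gain that falls off with distance and rises when the object lies in front of the user."""
    off_axis = abs((bearing - heading_deg + 180.0) % 360.0 - 180.0)   # 0..180 degrees
    focus = 1.0 + (focus_boost - 1.0) * (1.0 - off_axis / 180.0)      # 1.0 behind, focus_boost ahead
    attenuation = ref_distance_m / max(distance_m, ref_distance_m)    # simple distance roll-off
    return focus * attenuation

# An object straight ahead sounds louder than the same object behind the user.
print(orientation_gain(0.0, 0.0, 20.0), orientation_gain(0.0, 180.0, 20.0))
```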
In a particular embodiment a geospatial object may start to interact with a user when the audio unit is directed towards the geospatial object. In a particular case this may be when the user faces the geospatial object. It may also be possible that a moveable geospatial object becomes relatable with the audio unit when the user has directed his/her attention towards the moveable geospatial object.
The geographical position unit and the orientation unit enhance the comfort of a visually impaired person moving in a geographical environment, and furthermore, enable the visually impaired person to orient in relation to the audio sounds.
The audio rendering system comprises a rendering algorithm configured to render the retrieved geospatial object data into the acoustic scene based on the geographical position and/or the geographical orientation.
The rendering algorithm may also be configured to render the retrieved geospatial object data into the acoustic scene based on the surroundings, e.g. if the user wearing the audio unit and the portable terminal is in a tunnel, the 3D acoustic scene would be modified by adjusting the volume, the treble, the bass and the echo of the plurality of acoustic objects, to obtain a 3D acoustic scene giving the user the impression of standing in a tunnel.
The audio rendering system including the rendering algorithm is configured to render the retrieved geospatial object data into the acoustic scene based on a field-of-view range. The field-of-view range interrelates to the vision field of the user wearing the audio unit. For example, a visually impaired person would be able to search for and find specific geospatial objects, since the rendering algorithm would create a 3D acoustic scene leaving the impression to the user that he/she is moving in the right direction.
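A simple way of applying such a field-of-view range is sketched below; the two half-angles correspond to the first and second field-of-view angles described later, and the default values are illustrative assumptions only:

```python
def in_field_of_view(heading_deg, bearing_deg, theta1_deg=60.0, theta2_deg=60.0):
    """True if the bearing towards an object lies within the field-of-view range,
    spanning theta1 degrees to the left and theta2 degrees to the right of the heading."""
    offset = (bearing_deg - heading_deg + 180.0) % 360.0 - 180.0  # signed offset, -180..180
    return -theta1_deg <= offset <= theta2_deg

print(in_field_of_view(0.0, 30.0))    # True: within the range
print(in_field_of_view(0.0, 170.0))   # False: behind the user
```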
The audio rendering system comprises a category selection tool configured to select at least one categorised geospatial object data, wherein the selected geospatial object data is rendered into at least one acoustic scene based on at least one category variable.
Thereby, the user may be able to select at least one category of interest, and thereby the rendering algorithm may retrieve and render at least one relevant categorised geospatial object data into at least one acoustic scene. For example, if the user is searching for a specific category, e.g. “shoe shops”, the user selects the category “shoe shop” and/or “clothing”. Thereby, the portable terminal may only retrieve categorised geospatial objects which concern “shoe shops” and/or “clothing” shops selling shoes. This gives the user the possibility of orientating in a geographical environment while listening to a geographical environment background sound and to a plurality of rendered acoustic scene information of interest to the user. The geographical background sound may represent the geographical environment surrounding the user. The geographical background sound may be generated by the portable terminal.
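By way of a non-limiting sketch, the category selection may amount to filtering the retrieved geospatial object data against the user-selected category variables before rendering; the field names and example records are assumptions for illustration:

```python
def select_by_category(geospatial_objects, category_variables):
    """Keep only geospatial object data whose category matches a selected category variable."""
    wanted = {c.lower() for c in category_variables}
    return [obj for obj in geospatial_objects if obj.get("category", "").lower() in wanted]

shops = [
    {"name": "Acme Shoes", "category": "clothing & shoes", "lat": 55.680, "lon": 12.570},
    {"name": "Sweet Corner", "category": "food & delicacies", "lat": 55.681, "lon": 12.571},
]
print(select_by_category(shops, ["clothing & shoes"]))  # only the shoe shop is rendered
```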
Orientating in a geographical environment while listening to a plurality of rendered acoustic scene information of interest to the user makes it easier for a visually impaired person to go out with a certain agenda and follow it, e.g. the agenda is shopping or travelling from an A-position to a Z-position including several public transport changes, i.e. the user is only interested in receiving rendered acoustic scene information about public transport signs.
The audio rendering system comprises a safety tool configured to activate at least one rendered warning sound when a warning object is within a warning zone, and wherein the at least one rendered warning sound is spatially interrelated to a geographical position of the warning object.
The audio rendering system comprises a safety tool configured to mute at least one rendered acoustic scene information and to play at least one rendered warning sound.
Thereby, the user is able to define at least one warning object, such as public transport, which needs the attention of the user. For example, the user is nearing a rail crossing and a train is approaching the rail crossing. When the train has entered the warning zone, the safety tool is able to either mute or lower the sound level of the rendered acoustic scene information and play a rendered warning sound spatially interrelated to the train. This enhances the safety of wearing an audio unit, such as a headset or an earphone.
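A minimal sketch of such a safety tool is given below, assuming that distances to potential warning objects are already available; the muting level and warning radius are illustrative parameters only:

```python
def apply_safety_tool(scene_sounds, warning_objects, warning_radius_m=100.0, duck_gain=0.0):
    """Mute (or lower) the rendered acoustic scene and return the warning sounds to play
    for every warning object that is currently inside the warning zone."""
    active = [w for w in warning_objects if w["distance_m"] <= warning_radius_m]
    if active:
        for sound in scene_sounds:
            sound["gain"] = duck_gain   # 0.0 mutes the scene; a small value merely lowers it
    return active                       # warning sounds spatially interrelated to the objects

scene = [{"name": "shoe shop", "gain": 1.0}]
trains = [{"name": "train", "distance_m": 80.0}]
print(apply_safety_tool(scene, trains), scene)  # the train triggers a warning, the scene is muted
```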
The audio rendering system comprises a routing tool for determining at least one route between at least one start location and/or at least one end location or destination with at least one geographical position. The at least one route includes at least one rendered acoustic scene information being spatially interrelated to the at least one geographical position along the at least one route.
Thereby, the user is able to plan a route or a tracking route in a geographical environment beforehand. Furthermore, the user is able to generate a 3D acoustic scene for the geographical environment entangled by the planned route, including rendered acoustic scene information spatially interrelated to a geographical object and geographical position. Furthermore, the user is able to simulate the planned route or tracking route when the routing tool is in a demo mode. This adapts the user to the geographical environment entangled by the planned route or the tracking route beforehand. The routing tool thereby increases the comfort of a visually impaired person moving in the geographical environment.
The audio rendering system includes the routing tool, wherein the routing tool comprises a marker or a geographical attribute, wherein the marker or geographical attribute enables the possibility of inducing an acoustic marker being spatially interrelated to the geographical position.
Thereby, the routing tool provides the possibility for the user of adding a marker or geographical attribute to a geographical position relating to an obstacle which he/she would like to avoid. When the user is walking the route or the tracking route and the marker or geographical attribute is retrieved by the portable terminal, the audio unit sounds a distinguishing sound representing the geographical marker. This further increases the comfort of a visually impaired person moving around in a geographical environment.
The audio rendering system including the routing tool is able to receive at least one geographical acoustic marker from a marker server.
Thereby, a marker server is configured to share markers or geographical attributes created by a plurality of users. The user of the audio rendering system has the possibility of adding a geographical marker, generated by another user, to the geographical environment entangled by the route or the tracking route. This increases the possibility of marking any kind of obstacle which the user is not aware of, and thereby the comfort of a visually impaired person walking in a geographical environment.
In one aspect, the marker is a tag with properties of a beacon. In one embodiment, a street light may be categorised and used as a marker represented by a distinctive sound such as a beep. Each street light will then represent a marker and be represented as beeps in the acoustic scene. Thus, a user using the audio rendering system will experience an audio universe with beep sounds from positions relative to the geographical position, and the user will be able to hear the shape formed by the street lights and thereby the shape of the border between the pavement and the street.
In a variant, the beeps of such markers will appear sequentially and be perceived as running.
In one aspect such markers are distributed by the user along distinctive geographical positions along a route. Hence, each marker being a distinctive sound may serve as a beacon. The user may then be able to practice a route by means of simple distinctive sounds as beacons in a virtual reality, or use the markers as beacons in the real world to navigate, as sketched below.
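The sketch below illustrates one way such markers could be laid out along a route and sounded sequentially as running beeps; the beep interval and data structure are illustrative assumptions:

```python
import time

def place_markers(route_positions, every_nth=1, sound="beep.wav"):
    """Attach an acoustic marker (a distinctive beep) to selected positions along a route."""
    return [{"position": p, "sound": sound} for p in route_positions[::every_nth]]

def play_running_beeps(markers, interval_s=0.5, play=print):
    """Sound the markers one after another so they are perceived as running along the route."""
    for marker in markers:
        play(f"beep at {marker['position']}")
        time.sleep(interval_s)

route = [(55.6801, 12.5701), (55.6803, 12.5703), (55.6805, 12.5705)]
play_running_beeps(place_markers(route), interval_s=0.1)
```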
In one aspect, a method of sounding rendered acoustic scene information into at least one ear of a user using an audio rendering system may comprise the steps of receiving geospatial object data from at least one geospatial object data server, said geospatial object data being interrelated to a geographical position. The audio rendering system then renders the retrieved geospatial object data into an acoustic scene by a rendering algorithm, wherein the acoustic scene is spatially interrelated to the geographical position. The audio rendering system then sounds the rendered acoustic scene into at least one ear of a user. The audio rendering system then renders the retrieved geospatial object data into the acoustic scene based on a categorised acoustic scene representation corresponding to a categorised geospatial object data.
According to an embodiment, the system may be configured with means for allowing a user to focus on a geospatial object data. When a geospatial object data is focused on and subsequently selected, geospatial object data is retrieved and rendered into the acoustic scene as a narrative.
It is understood that the geospatial object data—such as text or numbers—may be interpreted and made into speech by a speech processor so that the data is made into a sound similar to a spoken language of the user.
Thereby, the user may be able to obtain (further) detailed information about the geographical object. The user may also be able to verify whether the selected geographical object is actually correct or as expected.
According to an embodiment, focus on a geospatial object data is determined as an intersection between a line of sight from the geographical position, for a given orientation, and a geographical position of the geographical object.
In such an embodiment the focusing is performed easily and automatically.
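For illustration, such a line-of-sight test may be approximated by comparing the user's heading with the bearing towards the object and accepting a small angular tolerance; the flat-earth approximation and the tolerance value below are assumptions:

```python
import math

def is_focused(user_lat, user_lon, heading_deg, obj_lat, obj_lon, tolerance_deg=5.0):
    """Treat a geospatial object as focused when the line of sight from the user's position,
    for the given orientation, passes within a small angular tolerance of the object."""
    dx = (obj_lon - user_lon) * math.cos(math.radians(user_lat))  # local east component
    dy = obj_lat - user_lat                                       # local north component
    bearing = math.degrees(math.atan2(dx, dy)) % 360.0
    off_axis = abs((bearing - heading_deg + 180.0) % 360.0 - 180.0)
    return off_axis <= tolerance_deg

print(is_focused(55.6800, 12.5700, 0.0, 55.6810, 12.5700))  # object due north, heading north -> True
```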
According to an embodiment, geospatial object data within a given area is resolved by separating each geospatial object data. Such separation may be performed spatially and may be performed by stacking each geospatial object data on top of each other in the acoustic scene (3D) or with different polar angles. The separation may also be performed temporally by sounding each geospatial object data sequentially and separated in time.
Thereby, the system is capable of separating and distinguishing objects that are clustered together in an area that, from the point of observation, would otherwise be inseparable.
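The spatial and temporal separation may, purely as an illustration, be expressed as assigning either different elevation angles or staggered start times to the clustered objects; the step sizes below are assumptions:

```python
def separate_spatially(cluster, elevation_step_deg=15.0):
    """Stack clustered geospatial object data at different elevation (polar) angles in the 3D scene."""
    return [dict(obj, elevation_deg=i * elevation_step_deg) for i, obj in enumerate(cluster)]

def separate_temporally(cluster, slot_s=1.0):
    """Sound clustered geospatial object data sequentially, separated in time."""
    return [dict(obj, start_time_s=i * slot_s) for i, obj in enumerate(cluster)]

cluster = [{"name": "shoe shop"}, {"name": "confectioner"}, {"name": "cafe"}]
print(separate_spatially(cluster))
print(separate_temporally(cluster))
```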
According to an embodiment, a method of sounding rendered acoustic scene information into at least one ear of a user using an audio rendering system comprises a step of receiving geospatial object data from at least one geospatial object data server, said geospatial object data being interrelated to a geographical position. A step of rendering retrieved geospatial object data into an acoustic scene by a rendering algorithm, which acoustic scene is spatially interrelated to the geographical position, and where the rendering of retrieved geospatial object data into the acoustic scene is based on a categorised acoustic scene representation corresponding to a categorised geospatial object data.
According to an embodiment, the method comprises further steps of providing at least one route with at least one geographical position between at least one start location and at least one end location, wherein the at least one route includes at least one acoustic scene being spatially interrelated to the at least one geographical position along the at least one route, and moving said geographic position between said at least one start location and said at least one end location and continuously sounding rendered acoustic scene information into at least one ear of a user for each geographic position.
According to an embodiment, a method may further comprise one or more steps of providing at least one route with at least one geographical position between at least one start location and at least one end location, wherein the at least one route includes at least one acoustic scene being spatially interrelated to the at least one geographical position along the at least one route (27), and moving said geographic position between said at least one start location and said at least one end location and continuously sounding rendered acoustic scene information into at least one ear of a user.
The audio rendering system may comprise a number of parameters including sound source specification (device, file, and signal generator plug-ins), source gain, source location, source trajectory, listener position, listener HRTF (Head-Related Transfer Function) database, surface location, surface material type, rendered plug-in specification, scripting, and low-level signal processing parameters.
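These parameters may, for illustration, be grouped in a single configuration structure; the field names, types and default values in the following sketch are assumptions and not a prescribed interface:

```python
from dataclasses import dataclass, field

@dataclass
class RenderingParameters:
    """Illustrative grouping of the rendering parameters named above."""
    source_spec: str = "file"                  # device, file or signal-generator plug-in
    source_gain: float = 1.0
    source_location: tuple = (0.0, 0.0, 0.0)   # x, y, z in metres relative to the listener
    source_trajectory: list = field(default_factory=list)
    listener_position: tuple = (0.0, 0.0, 0.0)
    hrtf_database: str = "default_hrtf"
    surface_material: str = "asphalt"
    renderer_plugin: str = "builtin"
    dsp_params: dict = field(default_factory=dict)

params = RenderingParameters(source_gain=0.8, hrtf_database="listener_42")
print(params)
```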
Potential applications include psychoacoustic research, spatial auditory display prototypes, virtual reality for simulation and training, augmented reality for improved situational awareness and enhanced communication systems. For these applications and others, the audio rendering system provides a low-cost system for dynamic synthesis of virtual audio over an audio unit, e.g. a headset, without the need of special purpose signal processing hardware.
Rendered acoustic scene information may be generated by the rendering algorithm running on a computer, providing a flexible, maintainable, and extensible architecture to enable the quick development of an audio based route or tracking route. The rendering algorithm may be provided by an API (Application Programming Interface), for specifying the route and the acoustic scenes as well as an extensible architecture for exploring multiple routing and rendering strategies.
Acoustic scene information may comprise a virtual source generated by the portable terminal. The acoustic scene information may be transferred to a portable terminal or a terminal, and thereby the portable terminal and/or terminal may transfer the acoustic scene information to an audio unit.
The audio rendering system comprises a search tool configured to specifically render at least one categorised geospatial object data into at least one acoustic scene based on at least one category variable and at least one search variable.
Thereby, the user may be able to search more specifically for certain objects, such as brands, types of shoes, clothing etc. This has the advantage of making shopping for certain objects easier for everybody, including the visually impaired.
The audio rendering system comprises a rendering algorithm being able to render a retrievable geospatial object according to interrelated categorised colour data, wherein the categorised colour data may comprise at least one colour representing the retrievable geospatial object and interrelating to a categorised colour sound.
Thereby, the rendering algorithm may be able to enhance the senses of a user being visually impaired. This would not only increase the ease with which a visually impaired person can move around in a geographical environment, but also increase his/her quality of life, since the user is able to distinguish objects by a sound and a colour. Hence, a visually impaired person will be able to share the experience of colours with non-visually impaired persons. In one example, a geospatial object will include data about the colour, say red (bricks), of an object, say a house. Such a red house may be a distinctive building serving as a landmark, and this allows the visually impaired person to navigate relative to the red building, since categorising according to the colour “red” will result in an acoustic scene with a distinctive sound, say an intermittent sound with a specific frequency.
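As a purely illustrative sketch of such colour categorisation, a categorised colour may be mapped to an intermittent tone with a specific frequency; the frequencies and repetition rates below are assumptions, not values from the disclosure:

```python
# Illustrative mapping from a categorised colour to an intermittent tone.
COLOUR_TONES = {
    "red":   {"frequency_hz": 880.0, "repeat_s": 0.5},
    "blue":  {"frequency_hz": 440.0, "repeat_s": 0.5},
    "green": {"frequency_hz": 660.0, "repeat_s": 0.5},
}

def categorised_colour_sound(colour: str):
    """Return the tone parameters representing a geospatial object's categorised colour."""
    return COLOUR_TONES.get(colour.lower())

print(categorised_colour_sound("red"))  # e.g. the red brick house used as a landmark
```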
The audio rendering system comprises a rendering algorithm which is able to render retrievable geospatial objects according to their physical size and shape. E.g. a first building interrelating to a first size/shape sound and a second building being smaller than the first building interrelating to a second size/shape sound. The first building and the second building may be categorised similarly comprising the same articles. The first building is larger than the second building and/or the first building having a different shape than the second building. The first size/shape sound may have a different configuration compared to the second size/shape sound representing the size and/or shape difference between the first and second buildings.
Thereby, the senses of a user are even more strengthened since the user is able to distinguish between different kinds of objects, colours, shapes and sizes. Therefore, the life quality of a visually impaired person would increase.
The audio rendering system comprises a rating feature, wherein the rating feature is able to rate at least one categorised geospatial object data based on a rating variable.
Thereby, the user is able to distinguish between the quality of similar categorised geospatial objects. E.g. a user may be able to distinguish the service quality of a plurality of similar service businesses, such as restaurants, cafes etc.
The audio rendering system may comprise a geospatial object data server including at least one dynamical geospatial data and/or at least one geospatial object data.
The audio rendering system may comprise a marker server and/or a storage device for storing an acoustic marker and a marker geographical marker interrelated to the geographical position of the acoustic marker.
A visually impaired person is a person who has lost his/her vision to such a degree as to qualify as an additional support need due to a significant limitation of visual capability resulting from either disease, trauma, congenital, or degenerative conditions that cannot be corrected by conventional means, such as refractive correction or medication.
An audio rendering system includes: at least one portable terminal configured to receive geospatial object data from at least one geospatial object data server, the geospatial object data being interrelated to a geographical position, the at least one portable terminal being configured to render the retrieved geospatial object data into an acoustic scene using a rendering algorithm, the acoustic scene being spatially interrelated to the geographical position in such a way that the acoustic scene is perceived as observed from the geographical position; and at least one audio unit configured to sound a rendered acoustic scene information into at least one ear of a user; wherein the at least one portable terminal is configured to render the retrieved geospatial object data into the acoustic scene based on categorized acoustic scene information representing corresponding categorized geospatial object data.
Optionally, the categorized acoustic scene information comprises a distinguishing sound representing the corresponding categorized geospatial object data.
Optionally, the audio unit comprises a geographical position unit configured to estimate the geographical position.
Optionally, the at least one audio unit comprises a geographical orientation unit for estimating a geographical orientation of the user, when the geographical orientation unit is placed in its intended operational position.
Optionally, the rendering algorithm is configured to render the retrieved geospatial object data into the acoustic scene based on the geographical position and/or the geographical orientation.
Optionally, the rendering algorithm is configured to render the retrieved geospatial object data into the acoustic scene based on a field-of-view range.
Optionally, the portable terminal comprises a category selection tool configured to select the categorized geospatial object data, wherein the at least one portable terminal is configured to render the geospatial object data into the acoustic scene based on at least one category variable.
Optionally, the at least one portable terminal comprises a safety tool configured to provide at least one warning sound when a warning object is within a warning zone, and wherein the at least one warning sound is spatially interrelated to a geographical position of the warning object.
Optionally, the safety tool is configured to mute at least one rendered acoustic scene information, and to play the at least one warning sound.
Optionally, the audio rendering system further includes a routing tool for providing at least one route between at least one start location and at least one end location, wherein the rendered acoustic scene information is spatially interrelated to the geographical position along the at least one route.
Optionally, the routing tool is configured to handle a geographical marker, and wherein the geographical marker is configured to represent an acoustic marker being spatially interrelated to the geographical position.
Optionally, the routing tool is configured to receive at least one geographical acoustic marker from a marker server.
Optionally, the audio rendering system further includes a user interface for allowing a user to focus on a geospatial object.
Optionally, the user interface is configured to determine the geospatial object based on an intersection between a line of sight from the geographical position for a given orientation and a geographical position of the geographical object.
Optionally, the audio rendering system is configured to resolve multiple geospatial object data within a given area by separating each geospatial object data spatially or temporally.
A method of sounding rendered acoustic scene information into at least one ear of a user using an audio rendering system, includes: receiving geospatial object data from at least one geospatial object data server, wherein the geospatial object data is interrelated to a geographical position; and rendering the retrieved geospatial object data into an acoustic scene using a rendering algorithm, wherein the acoustic scene is spatially interrelated to the geographical position; wherein the act of rendering the retrieved geospatial object data into the acoustic scene is performed based on a categorized acoustic scene representation corresponding to a categorized geospatial object data.
Optionally, the method further includes: providing at least one route with the geographical position between at least one start location and at least one end location; and changing the geographic position to another position located between the at least one start location and the at least one end location, and sounding rendered acoustic scene information into the at least one ear of the user for the other position.
Other and further aspects and features will be evident from reading the following detailed description of the embodiments.
Embodiments will be described in the figures, whereon:
Various embodiments are described hereinafter with reference to the figures. It should be noted that the figures are only intended to facilitate the description of the embodiments. They are not intended as an exhaustive description of the claimed invention or as a limitation on the scope of the claimed invention. In addition, an illustrated embodiment need not have all the aspects or advantages shown. An aspect or an advantage described in conjunction with a particular embodiment is not necessarily limited to that embodiment and can be practiced in any other embodiments even if not so illustrated, or if not so explicitly described.
The at least one terminal may also retrieve at least one categorised geospatial object data and/or at least one categorised acoustic scene information stored in an internal or an external storage device. The geospatial object data and categorised geospatial object data may comprise geographic coordinates, such as GPS coordinates, Universal Transverse Mercator (UTM) coordinates, Universal Polar Stereographic (UPS) coordinates and/or Cartesian coordinates. The categorised acoustic scene information may include a distinguishing sound representing at least one category of the corresponding categorised geospatial object data.
The terminal may be a portable terminal connected wired or wirelessly to an audio unit.
The terminal may also be a stationary terminal connected wirelessly to an audio unit, e.g. the stationary terminal may be a server of any kind or a PC.
In this particular example, the audio rendering system 1 comprises a portable terminal 2, an audio unit 3 and a geospatial object 18. The portable terminal 2 comprises at least acoustic scene information 4 and at least one geospatial object data set 7. A user or an algorithm may categorise the geospatial object data 7 into categorised geospatial object data 16. Furthermore, the user or the algorithm may also categorise the at least one acoustic scene information 4 into categorised acoustic scene information 17.
The portable terminal 2 is configured to receive geospatial object data 7 from the at least one geospatial object data server 8, which geospatial object data 7 is interrelated to a geographical position 6. The portable terminal 2 is further configured to render retrieved geospatial object data 7 into an acoustic scene 5 by a rendering algorithm 9, wherein the acoustic scene 5 comprises at least rendered acoustic scene information 10 spatially interrelated to the geographical position 6, such that the listening point is the geographical position 6 or, equivalently, the point of observing or spatially interrelating the geospatial object 18 is the geographical position 6. The audio unit 3 is configured to sound rendered acoustic scene information 10 into at least one ear of a user.
Thus, the geographical position 6 may be the point of observing or listening.
Furthermore, the portable terminal 2 may be configured for rendering retrieved geospatial object data 7 into the acoustic scene 5 based on categorised acoustic scene information 17 representing a corresponding categorised geospatial object data 16.
Hence, the rendered acoustic scene information 10 may comprise only information representing categorised geospatial object data 16, thus providing a clear, simple audio landscape of only the selected, i.e. according to the categorisation, geospatial objects 18 presented to the user as from the listening point of the geographical position 6.
In a particular, and by no means exclusive, example, a geospatial object 18 type, say an entrance to a subway, is categorised as a “hole in the ground” and represented with a high-pitch single beep that is repeated periodically, just like when a radar scans an area. When the geographical position 6 moves and/or the orientation changes, the spatial interrelation between the listening point and the geospatial object 18 changes, and said change is reflected in the volume and/or the orientation of the high-pitch single beep.
The geographical position 6 may be the actual location of the audio unit 3 and the point of observing or listening of the user 50.
Another embodiment of the audio rendering system 1 that is similar to the one disclosed in
The portable terminal 2 may render the retrieved geospatial object data into the acoustic scene 5 based on the categorised acoustic scene information 17 representing a corresponding categorised geospatial object data 16. The audio unit 3 then sounds the rendered acoustic scene information 10 being spatially interrelated to the geographical position 6. The audio unit 3 may be a headset having a neckband or a headband. The audio unit 3 may comprise at least one speaker and/or a microphone.
The audio unit 3 may include an activation button 32, so that when the user 50 focuses on the acoustic scene 5 and initializes the activation button 32, the corresponding rendered acoustic scene information 10 may be played on top of the categorised acoustic scene background sound 24.
In another embodiment, when the user initializes the activation button 32 the rendered acoustic scene information 10 may be sounded and the categorised acoustic scene background sound 24 may be muted.
In this particular example, the user 50 wears an audio unit 3 and focuses 55 on a first geospatial object 18A being a “STOP sign” 54, and the portable terminal 2 retrieves a first geospatial object data 7A, including first geographical coordinates, and/or the geographical position of the geospatial object, of the “STOP sign” 54. The “STOP sign” 54 is represented by a first acoustic scene object 5A comprising at least one first categorised acoustic scene information 17A and possibly at least a first categorised acoustic scene background sound 24A. The first acoustic scene object 5A may be spatially interrelated to a geographic position 6.
Stop signs 54 may be categorised as “high pitch beeps”, thus resulting in “high pitch beeps” being sounded from the position of the stop sign 54. The “high pitch beeps” may be more frequent since the user 50 focuses 55 on the stop sign 54.
The user 50 may also be in a setting with a second geospatial object 18B, being a church 53, present. Again the portable terminal 2 retrieves a second geospatial object data 7B including second geographical coordinates, and/or the geographical position of the geospatial object, of the church 53. The church is represented by a second acoustic scene object 5B comprising at least one second categorised acoustic scene information 17B and possibly at least a second categorised acoustic scene background sound 24B. The second acoustic scene object 5B is spatially interrelated to the geographic position 6.
Churches 53 may be categorised and assigned a “church bell” sound, thus resulting in “chimes of a bell” being sounded from the relative position of the church 53. The “chimes of the bell” may be less frequent since the user 50 does not focus 55 on the church.
The portable terminal 2 generates at least one rendered acoustic scene information 10 based on a rendering algorithm 9 and on the retrieved first categorised acoustic scene information 17A representing the corresponding first categorised geospatial object data 16A.
Additionally, the portable terminal 2 may render the first categorised acoustic scene background sound 24A and the second categorised acoustic scene background sound 24B.
In an embodiment the user 50 may select the rendered acoustic scene information 10 to be played on top of the first and the second categorised acoustic scene background sounds (24A-24B) into the ears of the user 50.
Thus, the categorised geospatial object 18A being a Stop Sign 54 in the category of “signs regulating traffic” generates a “picture in mind” 52 or makes the user 50 associate a certain class or category of objects.
In this particular example, the user 50 who wants to navigate to the church 53 will get a simplified (yet relevant) representation of the scene to navigate in order to move about, say to get to the church 53.
In an embodiment the activation button 32 may be a simple switch turning the system on and off, and the activation may happen as a result of an intersection of a line of sight of the user wearing the audio unit 3 with a particular geographical position 6.
The geographical locations included in the geospatial object data (7A-7D) are spatially interrelated to respective acoustic scene objects (5A-5D), wherein the respective acoustic scene objects (5A-5D) contain categorised acoustic scene information (17A-17D) and possibly a categorised acoustic scene background sound 24.
The portable terminal 2 retrieves the geospatial object data (7A-7D) and matches the categorised acoustic scene information (17A-17D) based on the categorised geospatial object data (16A-16D). The portable terminal 2 renders the retrieved geospatial object data (7A-7D) into the respective acoustic scene objects (5A-5D) based on the categorised acoustic scene information (17A-17D) and the categorised geospatial object data (16A-16D), forming an acoustic scene 5 that generates rendered acoustic scene information 10 soundable to the user 50.
The audio unit 3 sounds the respective rendered acoustic scene information 10 (10A-10D) into the ears of the user 50, wherein the respective rendered acoustic scene information (10A-10D), spatially interrelated to the respective geographic locations contained in the geospatial object data (7A-7D), may be configured to sound a 3D sound into the ears of the user 50.
In another embodiment, the respective rendered acoustic scene information 10 (10A-10D) may be categorised acoustic scene background sounds 24 (24A-24D).
The portable terminal 2 includes a rendering algorithm 9 configured to render the respective retrieved geospatial object data (7A-7D) into the respective acoustic scene objects (5A-5D), wherein the rendering may depend on the geographical position 6 and the geographical orientation 19 of the user. In this particular example the user 50 is placed at a uniform distance to each of the geospatial objects (18A-18D), having a main focus on the first geospatial object 18A. The rendering of the respective retrieved categorised geospatial object data (16A-16D) is performed differently since the user 50 is oriented differently to each of the respective geospatial objects (18A-18D). The first rendered acoustic scene information 10A of the first geospatial object 18A would be played on top of the remaining geospatial objects (18B-18D), and the rendered acoustic scene information 10A would sound like it comes from in front of the user. The remaining rendered acoustic scene information (10B-10D) would sound lower and have respective acoustic directions coming from the respective geographical locations contained in the geospatial object data (7B-7D).
The field-of-view range 64 is a total angle span including the sum of the first field-of-view angle Θ1 and the second field-of-view angle Θ2. The first field-of-view angle Θ1 and the second field-of-view angle Θ2 may be in the range of 5° to 180°, such as 10° to 170°, such as 20° to 160°, such as 40° to 150°, such as 80° to 140°, and such as around the field of view of a human.
The field-of-view range 64 may be initialized in a field-of-view attribute 15, wherein the user is able to set the first field-of-view angle Θ1 and the second field-of-view angle Θ2.
In this particular example, the user 50 is focusing towards a geospatial object 18 comprising a first geospatial object data 7A and a second geospatial object data 7B relating to a first categorised geospatial object data 16A and a second categorised geospatial object data 16B, respectively. Both geospatial object data (7A, 7B) may have the same geographical location, but be categorised differently. The portable terminal 2 renders the retrieved first geospatial object data 7A and the second geospatial object data 7B into the acoustic scene 5, generating a first rendered acoustic scene information 10A and a second rendered acoustic scene information 10B based on the first categorised geospatial object data 16A and the second categorised geospatial object data 16B and the corresponding first categorised acoustic scene information 17A and second categorised acoustic scene information 17B.
In this situation the first rendered acoustic scene information 10A is about a shoe shop. Furthermore, this categorised audio may further tell the user 50 about the week's discount and new brands for sale. The second rendered acoustic scene information 10B is about a confectioner's shop. This may furthermore tell the user 50 about prices of different sweet delicacies.
In a further embodiment, which will be described later on, the user would be able to filter the rendering of the retrieved categorised geospatial object data (16A, 16B) by a category selection tool 20 based on a category variable 21, e.g. the user 50 has defined “clothing & shoes” as the category variable, and thereby the portable terminal may only render the first retrieved geospatial object data 7A, since the “shoe shop” is categorised as “clothing & shoes” and the second geospatial object data 7B is categorised as “food & delicacies”.
In this particular example the capture zone 59 comprises a plurality of retrievable geospatial objects 60, and a plurality of non-retrievable geospatial objects 61 are configured outside the capture zone 59. The user 50 is centralised in the capture zone 59, retrieving a plurality of geospatial object data (7A-7F) of the retrievable geospatial objects 60. The user 50 does not retrieve any geospatial object data interrelating to the non-retrievable geospatial objects 61.
The capture radius Rcapture may be in the range of 0.1 m to 300 m, such as 1 m to 250 m, such as 1.5 m to 150 m, such as 1.5 m to 100 m and such as 1.5 m to 50 m.
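Purely as an illustration of how such a capture zone could be applied, retrievable geospatial objects may be selected by comparing their great-circle distance to the user with the capture radius Rcapture; the field names and example radius below are assumptions:

```python
import math

def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two WGS84 coordinates."""
    r = 6371000.0
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dp, dl = math.radians(lat2 - lat1), math.radians(lon2 - lon1)
    a = math.sin(dp / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dl / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))

def within_capture_zone(user_lat, user_lon, objects, capture_radius_m=50.0):
    """Keep only geospatial objects inside the capture zone centred on the user."""
    return [o for o in objects
            if haversine_m(user_lat, user_lon, o["lat"], o["lon"]) <= capture_radius_m]

objects = [{"name": "sign", "lat": 55.6802, "lon": 12.5701},
           {"name": "church", "lat": 55.6900, "lon": 12.5900}]
print(within_capture_zone(55.6801, 12.5700, objects))  # only the nearby sign is retrievable
```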
The geographic position 6 of a user 50 changes, whereby the geographic position counter 62A counts one up and the geographic orientation counter 62B scans in an orientation range 25 centralized at the geographical position 6 of the user 50.
When the counting of the geographic orientation 19 has been completed 62B, the rendering algorithm 9 may have retrieved 62C at least one geospatial object data 7. If the rendering algorithm 9 has not found any retrievable geospatial object data 7, the loop stops and the next step is 62A.
The retrieved geospatial object data 7 may be rendered 62D into the acoustic scene 5 based on categorised acoustic scene information 17 representing a corresponding categorised geospatial object data 16. The rendering algorithm 9 repeats 62E until the geographic position counter 62A has finished counting.
The orientation range 25 may be in the range of 10° to 360°, such as 10° to 180° and such as 10° to 120°.
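A compact sketch of this position and orientation scanning loop (62A to 62E) follows; the callables for retrieving and rendering, the step size and the example values are assumptions for illustration:

```python
def tracking_loop(positions, retrieve, render, orientation_range_deg=120, step_deg=10):
    """Illustrative version of the scanning loop 62A-62E: for each geographical position,
    scan the orientation range, retrieve geospatial object data, and render what was found."""
    for position in positions:                               # 62A: position counter advances
        half = orientation_range_deg // 2
        retrieved = []
        for bearing in range(-half, half + 1, step_deg):     # 62B: scan the orientation range 25
            retrieved.extend(retrieve(position, bearing))    # 62C: retrieve geospatial object data 7
        if retrieved:
            render(position, retrieved)                      # 62D: render into the acoustic scene 5
        # 62E: if nothing was retrieved, continue with the next position (back to 62A)

# Example with stub callables: nothing is retrieved, so nothing is rendered.
tracking_loop([(55.68, 12.57)], retrieve=lambda pos, bearing: [], render=print)
```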
In 20B the category variable 20A is used for extracting the corresponding categorised geospatial object data 16, e.g. the categorised geospatial object data 16A corresponding to the category variable 20A may be “sport shop”, as for the geospatial object 18A.
The at least one categorised geospatial object data 16A from the geospatial object 18A, e.g. “sport shops”, is then matched 20C with at least one categorised acoustic scene information 17. If no match is found, the category selection tool 20 ends 20F.
The at least one categorised geospatial object data 16 and the matched categorised acoustic scene information 17 are stored 20E in a local storage device or on a server. After storing the matched categorised geospatial object data 16 and the categorised acoustic scene information 17 the category selection tool 20 ends 20F.
Thus, the methods outlined and exemplified in
The portable terminal 2 may include a safety tool 29 or a safety feature comprising the feature of generating a warning zone 30 and defining at least one warning object 28 which would activate a rendered warning sound 31 interrelating to at least one warning object 28 being within the warning zone 30.
In
In another embodiment, the audio unit 3 sounds the first rendered acoustic scene information 10A spatially interrelated to the geographical location of the first retrievable categorised geospatial object 60A. When a warning object 28 is within the warning zone 30, the safety tool 29 is configured to play the rendered warning sound 31 on top of the plurality of categorised acoustic scene background sounds (24A-24C) interrelating to the retrieved geospatial object data (7A-7C).
The warning zone 30 has a warning radius Rwarning, which may be in the range of 1 m to 1000 m, such as 20 m to 900 m, such as 50 m to 800 m and such as 100 m to 500 m.
The following
In steps 36B and 36C, the user 50 is able to choose between a random mode 36B and a specific mode 36C. If entering the specific mode 36C, the user 50 enters the category selection tool 22, wherein the user 50 is able to initialize at least one category variable representing a categorised geospatial object data 16, and thereby store the matched categorised geospatial object data 16 and the corresponding categorised acoustic scene information 17 in an internal storage device of the portable terminal 2 or on a server 36F.
If entering the random mode 36B, the routing tool 26 generates and stores a plurality of categorised geospatial objects 16 of randomly chosen categories.
In a further embodiment, the random categories may be decided by a category algorithm based on personal interest being logged or tracked by a social networking server, such as Facebook or Google.
In step 36D the user 50 sets the orientation range 25 and the capture radius Rcapture, and in step 36F the user 50 may choose to activate the field-of-view attribute 15, wherein the user 50 initializes the first field-of-view angle Θ1 and the second field-of-view angle Θ2. Afterwards the user 50 may define at least one warning object 28 and the warning radius Rwarning in 36G.
In step 36H, the user starts tracking, and thereby the rendering algorithm 9 is initialized. In step 36I the geographical position 6 of the user 50 is determined (e.g. by measuring GPS coordinates), and when the user 50 moves, a geographic position counter 62A increments. At the specific geographic position 6′ of the user 50, the orientation range 25 and/or the field-of-view range 64 may be scanned in steps 36J and 36K, respectively.
When scanning in 36J and 36K is finished, the portable terminal 2 may retrieve 36L at least one geospatial object data 7 interrelating to a retrievable geospatial object 60. In 36M the rendering algorithm 9 renders the at least one retrieved geospatial object 18 containing geospatial object data 7 into an acoustic scene 5, generating at least one rendered acoustic scene information 10 and/or at least one categorised acoustic scene background sound 24. If the portable terminal 2 does not retrieve any geospatial object data 7, the rendering is not performed.
If the user 50 has reached the final destination defined in step 36A, the rendering algorithm 9 ends 36O.
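The stepwise behaviour of steps 36H to 36O may be summarised in sketch form. The function and attribute names below (get_gps_position, scan_orientation_range, render_acoustic_scene, and so on) are hypothetical stand-ins for the sensor access and the rendering algorithm 9; the sketch is a simplified reading of the steps, not the claimed implementation.

    def tracking_loop(terminal, user_settings, destination):
        # Step 36H: the user starts tracking and the rendering algorithm 9 is initialised.
        position_counter = 0
        while True:
            # Step 36I: determine the geographical position 6 of the user (e.g. via GPS).
            position = terminal.get_gps_position()
            position_counter += 1  # geographic position counter 62A

            # Steps 36J and 36K: scan the orientation range 25 and the field-of-view range 64.
            heading = terminal.get_head_orientation()
            in_orientation = terminal.scan_orientation_range(
                position, heading, user_settings.orientation_range)
            in_field_of_view = terminal.scan_field_of_view(
                position, heading, user_settings.theta1, user_settings.theta2)

            # Step 36L: retrieve geospatial object data 7 for the scanned objects, if any.
            objects = terminal.retrieve_geospatial_data(
                in_orientation + in_field_of_view, user_settings.capture_radius)

            # Step 36M: render retrieved objects into the acoustic scene 5; skip if none found.
            if objects:
                terminal.render_acoustic_scene(objects, position, heading)

            # Step 36O: stop when the user has reached the final destination from step 36A.
            if terminal.has_reached(position, destination):
                break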
The system may be configured so that the user is able to set the field-of-view attribute 15, the random mode 36B and the specific mode 36C in 21E, 21F and 21G, respectively.
By selecting 21G, the user 50 is able to simulate the tracking route 27. By voice recognition 21M, the user may control the automatic routing tool 21 with voice commands, and by the speaker 21L the user may receive guiding instructions from the automatic routing tool 21.
The system may be configured so that the user 50 may activate the rendering algorithm 9 by activating start 21L. The system may further be configured so that the user is able to load 21J a previously saved tracking route 27 and a geographical environment 63. The system may be able to save 21K the generated tracking route 27. Furthermore, the system may be able to simulate the automatically planned route or tracking route in a demo mode 21H.
Furthermore, by selecting load marker 23A, at least one relevant and previously saved geographical marker 45 is loaded from a marker server 49 or a storage device into the geographical environment 63 of the tracking route 27. The at least one geographical marker 45 interrelates to a geographic position 6 and to an acoustic marker 48. The loaded geographical marker 45 may represent an obstacle of any kind which the user 50 or another user has previously experienced when being in the geographical environment 63.
By the set marker feature 23B of the system, the user is able to change the geographic position 6 of the geographical marker 45. Furthermore, the system may be implemented so that the user 50 may apply a new 23C geographical marker 45. The system may further be enabled to cancel 23D a geographical marker 45.
An exit 23H may also be provided.
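As an illustration only (the field names and the JSON storage format are assumptions, not part of the disclosure), a geographical marker 45 could be represented as a small record holding its geographic position 6 and the identifier of its acoustic marker 48, with the load 23A, set 23B, new 23C and cancel 23D operations acting on a simple list:

    from dataclasses import dataclass
    import json

    @dataclass
    class GeoMarker:
        lat: float            # geographic position 6 (latitude)
        lon: float            # geographic position 6 (longitude)
        acoustic_marker: str  # acoustic marker 48, e.g. a sound file identifier
        note: str = ""        # free-text description of the obstacle

    def load_markers(path):                      # 23A: load previously saved markers
        with open(path) as f:
            return [GeoMarker(**m) for m in json.load(f)]

    def set_marker(markers, index, lat, lon):    # 23B: move an existing marker
        markers[index].lat, markers[index].lon = lat, lon

    def new_marker(markers, lat, lon, sound):    # 23C: apply a new marker
        markers.append(GeoMarker(lat, lon, sound))

    def cancel_marker(markers, index):           # 23D: cancel a marker
        del markers[index]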
As an example, markers 45 are placed along the pavement of the route, or created by categorising street lamps. Each marker 45
The audio unit 3 sounds a 3D sound comprising a plurality of categorised acoustic scene background sounds (24A-24D), which are spatially interrelated to the geospatial object data 7 containing the geographical locations (7A-7D) of the retrievable geospatial objects (60A-60D), respectively. The user 50 listens to the generated 3D sound so that the user experiences a 3D audio world or audio scene, which may be translated in the mind of the user into a picture of a virtual geographical environment representing the real geographical environment surrounding the user 50.
In this particular example, the user 50 has activated categorisation according to "street signs", whereby the second retrievable geospatial object 60B is retrieved, and thereby the audio unit 3 sounds into the ears of the user 50 a 3D sound comprising a second rendered acoustic scene information 10B playing on top of the remaining categorised acoustic scene objects (5A, 5C and 5D), which are spatially interrelated to the geographical position 6 according to the respective geographical locations (7A, 7C and 7D). The second rendered acoustic scene information 10B is spatially interrelated to the geographical position 6 according to the location contained in the data of the second retrievable geospatial object 60B.
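A sketch of how each categorised acoustic scene background sound 24 could be placed in the 3D sound relative to the user's geographical position 6 and head orientation, with the selected category rendered on top of the remaining background sounds, is given below. The flat-earth approximation and the function and key names are assumptions made for the sketch only.

    import math

    def azimuth_deg(user_lat, user_lon, obj_lat, obj_lon, head_bearing_deg):
        # Bearing from the user to the object, relative to the user's head orientation.
        # Small-distance flat-earth approximation; adequate within a few hundred metres.
        d_north = (obj_lat - user_lat) * 111320.0
        d_east = (obj_lon - user_lon) * 111320.0 * math.cos(math.radians(user_lat))
        bearing = math.degrees(math.atan2(d_east, d_north)) % 360.0
        return (bearing - head_bearing_deg) % 360.0

    def mix_scene(objects, user_lat, user_lon, head_bearing_deg, selected_category):
        # Each object contributes a spatialised background sound 24; the object matching
        # the selected category additionally contributes rendered acoustic scene
        # information 10 played on top of the background sounds.
        scene = []
        for obj in objects:
            az = azimuth_deg(user_lat, user_lon, obj["lat"], obj["lon"], head_bearing_deg)
            scene.append(("background", obj["sound"], az))
            if obj["category"] == selected_category:
                scene.append(("foreground", obj["info_speech"], az))
        return scene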
In the situation illustrated in
The audio unit 3 sounds a 3D sound comprising a plurality of categorised acoustic scene background sounds (24A-24D) that are spatially interrelated to the geographical locations (7A-7D) of the retrievable geospatial objects (60A-60D), and furthermore, the 3D sound comprises an acoustic marker 48 playing on top of the categorised acoustic scene background sounds (24A-24D). The acoustic marker 48 is spatially interrelated to the geographical position 6 according to the location of the geographical marker 45.
In this particular example, the acoustic marker 48 tells the user 50 that he/she should be careful, e.g. the pavement is in poor condition.
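One possible way, offered here only as an assumption rather than something mandated by the disclosure, to make the acoustic marker 48 grow louder as the user approaches the obstacle and fade once it has been passed (as in the walking example later in this section) is a simple distance-dependent gain clamped between a floor and a ceiling:

    def marker_gain(distance_m, full_gain_distance_m=2.0, audible_distance_m=50.0):
        # Gain of the acoustic marker 48 as a function of distance to the marker:
        # 1.0 when closer than full_gain_distance_m, 0.0 beyond audible_distance_m,
        # and a linear fade in between.
        if distance_m <= full_gain_distance_m:
            return 1.0
        if distance_m >= audible_distance_m:
            return 0.0
        span = audible_distance_m - full_gain_distance_m
        return 1.0 - (distance_m - full_gain_distance_m) / span

    # Example: at 26 m from the obstacle the marker plays at half level.
    assert abs(marker_gain(26.0) - 0.5) < 1e-9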
In another example, the audio rendering system may comprise a tracking route for a visually impaired user wanting to go from a start location to an end location using public transportation and with a minimum of walking. The user is blind.
In the routing tool, the user initializes voice recognition for operating the routing tool. The user defines start and end locations in the routing tool. Furthermore, the user commands the routing tool to use public transportation. The routing tool automatically generates a route. The first proposed route does not satisfy the user, so the user commands the routing tool to redo the route; the user is now satisfied. Furthermore, the user has chosen that he/she is only interested in a single category, "public transportation signs", and thereby the user does not receive rendered acoustic scene information which is not related to the chosen category. Additionally, the user has loaded geographical markers.
The planned route is now initialized and the user starts walking.
The user receives from the audio rendering system a guiding voice, guiding sounds, and background sounds representing the geographical environment entangled by the planned route.
Suddenly, the user hears a categorised acoustic scene background sound representing a retrievable geospatial object being a first public transportation sign. The user is focusing towards the categorised acoustic scene background sound and presses an activation button on the audio unit. The user now receives the rendered acoustic scene information spatially interrelated to the first public transportation sign.
While the user is guided towards the first public transportation sign, the rendered acoustic scene information tells the user that "bus A6 going towards destination X arrives in 5 minutes". The user then knows that he/she has arrived at the correct waypoint, being the first public transportation sign.
While the user is sitting in the bus, he/she continuously retrieves from the audio rendering system information regarding the next stop, e.g. the name of the street where the next bus stop is located. The user has now gotten off bus A6, and the audio rendering system is guiding the user towards the second public transportation sign (i.e. the second waypoint).
While the user is listening to the background sound and the guiding voice, the user suddenly hears an acoustic marker representing an obstacle on his/her route. The user focuses on the obstacle while still walking on the tracking route. The sound level of the acoustic marker increases while he/she is nearing the obstacle. The user knows that the obstacle has been avoided since the sound level of the acoustic marker decreases and comes from behind the user while walking towards the second waypoint.
The user hears a second categorised acoustic scene background sound representing the second public transportation sign (i.e. the second waypoint). The user is guided towards the second waypoint by the second categorised acoustic scene background sound while listening to the second rendered acoustic scene information telling the user that "bus A2 going towards destination B arrives in 2 minutes".
The bus arrives, the user enters the bus, and is driven to the end location.
Although particular embodiments have been shown and described, it will be understood that it is not intended to limit the claimed inventions to the preferred embodiments, and it will be obvious to those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the claimed inventions. The specification and drawings are, accordingly, to be regarded in an illustrative rather than restrictive sense. The claimed inventions are intended to cover alternatives, modifications, and equivalents.