The present invention relates generally to communication methods and systems, and more particularly, to methods and systems that estimate the location of terminals in a wireless network environment.
The growth of wireless networking has generated commercial and research interest in statistical methods to track people and things. Inside stores, hospitals, warehouses, and factories, where Global Positioning System devices generally do not work, Indoor Positioning Systems (IPS) aim to provide location estimates for wireless devices such as laptop computers, handheld devices, and electronic badges. The proliferation of “Wi-Fi” (IEEE 802.11b) wireless internet access in cafes, college campuses, airports, hotels, and homes has generated particular interest in indoor positioning systems that utilize physical attributes of Wi-Fi signals. Typical applications include tracking equipment and personnel in hospitals, providing location-specific information in supermarkets, museums, and libraries, and location-based access control.
In a standard Wi-Fi implementation, one or more access points serve end-users. Wi-Fi location estimation can employ one or more of several physical attributes of the medium, such as received signal strength (RSS) from the access points, the angle of arrival of the signal, and the time difference of arrival. A number of techniques have been proposed or suggested that use RSS for location estimation in wireless networks. See, for example, P. Bahl et al., “RADAR: An In-Building RF-Based User Location and Tracking System,” Proc. of IEEE Infocom 2000, Tel Aviv, Israel (March, 2000); or T. Roos et al., “A Statistical Modeling Approach to Location Estimation,” IEEE Transactions on Mobile Computing, 1, 59-69 (2002).
In a laboratory setting, RSS decays linearly with log distance and a simple triangulation using RSS from three access points can uniquely identify a location in a two-dimensional space. In practice, however, physical characteristics of a building, such as walls, elevators, and furniture, as well as human activity, add significant noise to RSS measurements. Consequently, statistical approaches to location estimation prevail.
Supervised learning techniques are typically employed in statistical approaches to location estimation. The training data comprise vectors of signal strengths, one for each of a collection of known locations. The dimension of each vector equals the number of access points. The corresponding location could be one-dimensional (e.g., location on a long airport corridor), two-dimensional (e.g., location on one floor of a museum), or three-dimensional (e.g., location within a multi-story office building).
Two types of location estimation systems exist. In a client-based deployment, the client measures the signal strengths as seen by it from various access points. The client uses this information to locate itself. The cost to an enterprise for such deployments is the cost of profiling the site, building the model, and maintaining the model. In an infrastructure-based deployment, the administrator deploys so-called sniffing devices that monitor the signal strength from clients. U.S. patent application Ser. No. 10/776,058, filed Feb. 11, 2004 and entitled “Estimating the Location of Inexpensive Wireless Terminals by Using Signal Strength Measurements,” incorporated by reference herein, discloses a system for estimating the location of wireless terminals using such sniffing devices. U.S. patent application Ser. No. 10/776,588, filed Feb. 11, 2004 and entitled “Estimating the Location of Wireless Terminals In A Multistory Environment,” incorporated by reference herein, discloses a system for estimating the location of wireless terminals on multiple floors.
The cost to enterprises in such deployments is the typically modest cost of deploying the necessary hardware and software, and the time and effort to build and maintain the model (if it is not completely automated). Collecting the location data is labor intensive, requiring physical distance measurements with respect to a reference object, such as a wall. Furthermore, even in normal office environments, changing environmental, building, and occupancy conditions can affect signal propagation and require repeated data gathering to maintain predictive accuracy. The model building phase then learns a predictive model that maps signal strength vectors to locations. A number of supervised learning methods have been applied to this problem, including nearest neighbor methods, support vector machines, and assorted probabilistic techniques.
A need therefore exists for improved location estimation techniques that can provide accurate location estimates without location information in the training data. A further need therefore exists for location estimation techniques that do not require profiling.
Generally, methods and apparatus are provided for estimating a location of a plurality of wireless terminals. Signal strength measurements are obtained for at least one packet transmitted by each of the wireless terminals; and a Bayesian algorithm is applied to the signal strength measurements to estimate the location of each wireless terminal. In an infrastructure-based deployment, signal strength measurements are obtained from one or more signal monitors. In a client-based model, signal strength measurements are obtained from a client associated with a respective wireless terminal.
A more complete understanding of the present invention, as well as further features and advantages of the present invention, will be obtained by reference to the following detailed description and drawings.
The present invention provides improved location estimation techniques that provide accurate location estimates without location information in the training data and that do not require profiling. A Bayesian hierarchical model is provided for indoor location estimation in wireless networks. The disclosed terminal position system 100 reduces the requirement for training data as compared with conventional techniques. The present invention uses a hierarchical Bayesian framework to incorporate important prior information and the graphical model framework to facilitate the construction of realistically complex models. While the present invention is illustrated herein using an infrastructure-based deployment, where sniffing devices, referred to herein as signal monitors, monitor the signal strength from clients, the present invention may also be implemented using a client-based model, where a client executing on each wireless device provides signal strength data to a location estimation server.
It is often important to know the location of wireless terminals 101 within wireless network 100. Knowledge of the location of wireless terminals 101 enables services that use end-user location information, such as location-aware content delivery, emergency location, services based on the notion of “closest resource,” and location-based access control.
Signal monitor 202-j, for j=1 to N, measures (i.e., “sniffs”) signals that are present on the wireless medium and transmitted by various signal sources, and determines the received signal strength (RSS) of those signals. Signal sources include wireless terminals 101, 204. Signal monitor 202-j sends the signal strength measurements to location estimation server 203. In addition, in some embodiments signal monitor 202-j receives the identifying information transmitted by wireless terminal 204 and sends the information to location estimation server 203. In some embodiments, signal monitor 202-j provides information (e.g., its coordinates, its identifier, etc.) with which to determine its location—either directly or indirectly—to location estimation server 203. The signal monitor 202-j is described below with respect to
Location estimation server 203 acquires the received signal strength measurements from signal monitors 202-1 through 202-N. Location estimation server processes the received signal strength measurements corresponding to the wireless terminals 101 in accordance with the present invention. The location estimation server 203 is described below with respect to
Wireless terminals 204 are capable of transmitting packets of data over a wireless medium in well-known fashion. The packets of data can comprise information that identifies wireless terminal 204. Wireless terminals 204 comprise a transmitter for the purpose of transmitting the packets of data. Wireless terminals 204 can be, for example, a communications station, a locating device, a handheld computer, a laptop with wireless capability or a telephone. It will be clear to those skilled in the art how to make and use wireless terminals 204.
Wireless terminals 204, in some embodiments, exchange packets with access point 205. Signal monitor 202-j can measure these packets for the purpose of estimating location. In other embodiments, wireless terminals 204 transmit packets specifically for the purpose of estimating the location of the wireless terminals 204.
Access point 205, in some embodiments, exchanges packets of data with wireless terminals 204 in well-known fashion. Access point 205 can be used to coordinate communication in network 200 and to provide wireless terminals 204 with access to networks that are external to network 200, in well-known fashion. In other embodiments, access point 205 is not present. It will be clear to those skilled in the art how to make and use access point 205.
In some embodiments, signal monitor 202-j and access point 205 are collocated. In other embodiments, additional signal monitors, or signal monitors not collocated with access point 205 are placed to ensure that signal monitors 202-1 through 202-N are not collinear (or no three signal monitors are collinear) within the x-y coordinate plane mentioned earlier.
While the present invention is illustrated in
The IEEE 802.11b High-Rate standard uses radio frequencies in the 2.4 GHz band. Wi-Fi adaptors use spread-spectrum technology that spreads the signal over several frequencies. In this way, interference on a single frequency does not entirely block the signal. The signal itself propagates in a complex manner. Reflection, absorption, and diffraction occur when the waves of the signal encounter opaque obstacles resulting in essentially random variations of signal strength. A variety of other factors, such as noise, interference from other sources, and interference between channels, also affect the signal. The resonant frequency of water happens to be 2.4 GHz so people also absorb the radio waves and impact the signal strength. Other common devices using the 2.4 GHz band include microwave ovens, Blue Tooth devices, and 2.4 GHz cordless phones.
Thus, received signal strength varies over time at a single location and varies across different locations. The present invention recognizes, however, that signal profiles corresponding to spatially adjacent locations are similar as the various external variables remain approximately the same over short distances. Furthermore, the local average of the signal strength varies slowly over time and the signal strength decays approximately in proportion to log distance.
As previously indicated, the present invention provides a Bayesian hierarchical models for indoor location estimation in wireless networks. A graphical model is a multivariate statistical model embodying a set of conditional independence relationships. A graph displays the independence relationships. The vertices of the graph correspond to random variables and the edges encode the relationships. To date, most research on graphical models has focused on acyclic digraphs, chordal undirected graphs, and chain graphs that allow both directed and undirected edges, but have no partially directed cycles.
The present invention focuses on acyclic digraphs (ADGs) with both continuous and categorical random variables.
The directed graph 300 of
p(Xα,Xβ,Xγ)=p(Xα)p(Xβ|Xα)p(Xγ|Xβ).
For graphical models where all the variables are discrete, it has been shown how independent Dirichlet prior distributions can be updated locally to form posterior distributions as data arrive; and corresponding closed-form expressions for complete-data likelihoods and posterior model probabilities, and corresponding Bayesian model averaging procedures have been provided. In the Bayesian framework, model parameters are random variables and appear as vertices in the graph.
When some variables are discrete and others continuous, or when some of the variables are latent or have missing values, a closed-form Bayesian analysis generally does not exist. Analysis then requires either analytic approximations of some kind or simulation methods. The present invention considers a Markov chain Monte Carlo (MCMC) simulation method. For an introduction to a particular MCMC algorithm, the univariate Gibbs sampler, for Bayesian graphical models, see D. J. Spiegelhalter, “Bayesian Graphical Modeling: A Case Study in Monitoring Health Outcomes,” Applied Statistics, 47, 115-133 (1988).
Generally, the Gibbs sampler starts with some initial values for each unknown quantity (that is, model parameters, missing values, and latent variables), and then cycles through the graph simulating each variable v in turn from its conditional probability distribution, given all the other quantities, denoted V\v, fixed at their current values (known as the “full conditional”). The simulated v replaces the old value and the simulation shifts to the next quantity. After sufficient iterations of the procedure, it is assumed that the Markov chain has reached its stationary distribution, and then future simulated values for vertices of interest are monitored. Inferences concerning unknown quantities are then based on data analytic summaries of these monitored values, such as empirical medians and 95% intervals. Some delicate issues do arise with the Gibbs sampler, such as assessment of convergence, sampling routines, as described in W. R. Gilks et al., “Markov Chain Monte Carlo in Practice,” Chapman and Hall, London (1996).
The crucial connection between directed graphical models and Gibbs sampling lies in expression (1). The full conditional distribution for any vertex v is equal to:
i.e., a prior term and a set of likelihood terms, one for each child of v. Thus, when sampling from the full conditional for v, only vertices which are parents, children, or parents of children of v need be considered, and local computations can be performed. The BUGS language and software, D. J. Spiegelhalter et al., “WinBUGS Version 1.2 User Manual,” MRC Biostatistics Unit (1999), implements a version of the Gibbs sampler for Bayesian graphical models.
Processor 402 is a general-purpose processor that is capable of performing the tasks described below and with respect to
Network interface 501 is a circuit that is capable of receiving, in well-known fashion, received signal strength measurements and identifier information from one or more of signal monitors 202-1 through 202-N (or from clients in the wireless devices 204 in a client-based model). In some embodiments, network interface 501 receives signal monitor identifier information from one or more of signal monitors 202-1 through 202-N. Network interface 501 is also capable of forwarding the signal strength measurements and identifier information received to processor 502.
Processor 502 may be embodied as a general-purpose processor that is capable of performing the tasks described herein. Memory 503 is capable of storing programs and data used by processor 502.
In some embodiments, the signal strength measurements that represents one or more wireless terminals 204 may be (i) the median of, or (ii) the mean of more than one signal strength measurement made over time on multiple packets transmitted by a respective wireless terminal 204. It will be clear to those skilled in the art how to determine either the median or the mean of more than one signal strength measurement. In some embodiments, a wireless terminal 204 is prompted by another device (e.g., access point 205) to transmit a packet.
The location estimation server 203 receives the signal strength measurements of wireless terminals 204 from at least one of signal monitors 202-1 through 202-N during step 702 and forms signal strength vectors. During step 703, the location estimation server 203 applies the vectors formed in the previous step to a Bayesian algorithm to obtain the location of each terminal, as discussed further below in conjunction with
The present invention provides a model that embodies extant knowledge about Wi-Fi signals as well as physical constraints implied by the target building. The following discussion presents a series of models of increasing complexity.
Non-Hierarchical Bayesian Graphical Model
The vertices X and Y represent location. The vertex Di represents the Euclidean distance between the location specified by X and Y and the i'th signal monitor (where i=1, . . . , 4). Since it is assumed that the locations of the signal monitors are known, the Di's are deterministic functions of X and Y The vertex Si represents the signal strength measured by the signal monitor 202 at (X, Y) with respect to the i'th signal monitor, i=1, . . . , 4. The model assumes that X and Y are marginally independent.
Specification of the model requires a conditional density for each vertex given its parents as follows:
X˜uniform (0, L),
Y˜uniform (0, B),
Si˜N(bi0+bi1 log Di, τi), i=1,2,3,4,
bi0˜N(0, 0.001), i=1, 2, 3, 4,
bi1˜N(0, 0.001), i=1, 2, 3, 4.
Here, L and B denote the length and breadth of the building, respectively. The distributions for X and Y reflect the physical constraints of the building. The model for Si reflects the fact that signal strength decays approximately linearly with log distance. Note that N(μ, τ) is used to denote a Gaussian distribution with mean μ and precision τ so that the prior distributions for bi0 and bi1 have large variance.
In one exemplary implementation, Markov chain Monte Carlo algorithms estimate the parameters of the Bayesian model and produce location estimates. For real-time or for larger-scale applications variational approximations (such as those described in T. Jaakola and M. I. Jordan, “Bayesian Parameter Estimation via Variational Methods,” Statistics and Computing, 10, 25-37 (2000)) may be employed.
Hierarchical Bayesian Graphical Model
The present invention recognizes that the coefficients of the linear regression models corresponding to each of the signal monitors should be similar since the similar physical processes are in play at each signal monitor. Physical differences between locations of the different signal monitors will tend to mitigate the similarity but, nonetheless, borrowing strength across the different regression models might provide some predictive benefits.
X˜uniform (0, L),
Y˜uniform (0, B),
Si˜N(bi0+bi1 log Di, τii), i=1, . . . , d
bi0˜N(b0, τb
bi1˜N(b1, τb
b0˜N(0, 0.001),
b1˜N(0, 0.001),
τb
τb
It can be shown that the hierarchical model performs similarly to its non-hierarchical counterpart, although M2 does provide improvement in average error for the smallest training sample size.
Training Data With No Location Information
Model M2 incorporates two sources of prior knowledge. First, M2 embodies the knowledge that signal strength decays approximately linearly with log distance. Second, the hierarchical portion of M2 reflects prior knowledge that the different signal monitors behave similarly. The present invention recognizes that this prior knowledge provides sufficient constraints to obviate the need to know the actual locations of the training data observations. Specifically, the training data now comprise vectors of signal strengths with unknown locations; X and Y in M1 and M2 become latent variables.
Removal of the location data requirement affords significant practical benefits. As discussed above, the location measurement process is slow and human-intensive. By contrast, gathering signal strengths vectors without the corresponding locations does not require human intervention; in the infrastructure approach, suitably instrumented access points or sniffing devices can solicit signal strength measurements from existing Wi-Fi devices and can do this repeatedly at essentially no cost. It is noted that the existing location estimation algorithms require location information in the training data to produce any estimates.
Incorporating Corridor Effects and Other Prior Knowledge
The disclosed graphical modeling framework coupled with MCMC provides a very flexible tool for multivariate modeling.
A. Corridor Model
It has been observed that when an signal monitor is located in a corridor, the signal strength tends to be substantially stronger along the entire corridor. In many office building floors, corridors are mostly parallel to the walls. Hence, a location that shares either an x-coordinate or a y-coordinate with a signal monitor (at least approximately) tends to be in the same corridor as that signal monitor.
The conditional densities for model M3 are:
X˜uniform (0, L),
Y˜uniform (0, B),
Si˜N(bi0+bi1 log Di+bi2 Ci+bi3 CiDi, τi), i=1, . . . , d,
bij˜N(bj, τb
bj˜N(0, 0.001), j=0,1,2,3,
τb
It is noted that a corridor main effect and a corridor-distance interaction term are included. Such corridor effects can be extended to include more detailed information concerning wall locations as well as locations of potentially interfering objects, such as elevators, kitchens, or printers.
B. Informative Priors for the Regression Co-Efficients
According to another aspect of the invention, mildly informative prior distributions for the regression coefficients are incorporated in the model. Specifically, in one exemplary implementation, a N(10, 0.1) prior was used for b0 and a N(−19.5, 0.1) prior was used for b1 in Model M2. The means of these priors correspond to the average intercept and slope from a maximum likelihood analysis of the combined data over all signal monitors from a number of exemplary locations. The precisions of 0.1 permit considerable posterior variability around these values.
System and Article of Manufacture Details
As is known in the art, the methods and apparatus discussed herein may be distributed as an article of manufacture that itself comprises a computer readable medium having computer readable code means embodied thereon. The computer readable program code means is operable, in conjunction with a computer system, to carry out all or some of the steps to perform the methods or create the apparatuses discussed herein. The computer readable medium may be a recordable medium (e.g., floppy disks, hard drives, compact disks, or memory cards) or may be a transmission medium (e.g., a network comprising fiber-optics, the world-wide web, cables, or a wireless channel using time-division multiple access, code-division multiple access, or other radio-frequency channel). Any medium known or developed that can store information suitable for use with a computer system may be used. The computer-readable code means is any mechanism for allowing a computer to read instructions and data, such as magnetic variations on a magnetic media or height variations on the surface of a compact disk.
The computer systems and servers described herein each contain a memory that will configure associated processors to implement the methods, steps, and functions disclosed herein. The memories could be distributed or local and the processors could be distributed or singular. The memories could be implemented as an electrical, magnetic or optical memory, or any combination of these or other types of storage devices. Moreover, the term “memory” should be construed broadly enough to encompass any information able to be read from or written to an address in the addressable space accessed by an associated processor. With this definition, information on a network is still within a memory because the associated processor can retrieve the information from the network.
It is to be understood that the embodiments and variations shown and described herein are merely illustrative of the principles of this invention and that various modifications may be implemented by those skilled in the art without departing from the scope and spirit of the invention.
The present invention recognizes that the current model can be generalized in a number of ways. For example, piecewise linear or spline-based models can be employed for the core signal strength-log distance relationship, as the data exhibits some evidence of non-linearity, especially at shorter distances. In addition, models that can incorporate approximate location information can be employed. For example, when sensors are attached to wireline telephones, the room location may be available but not the location of the sensor within the room. Further, models that can incorporate angle-of-arrival information for the signals can also be employed.
Number | Name | Date | Kind |
---|---|---|---|
5491644 | Pickering et al. | Feb 1996 | A |
6263208 | Chang et al. | Jul 2001 | B1 |
6564065 | Chang et al. | May 2003 | B1 |
6785254 | Korus et al. | Aug 2004 | B2 |
6839027 | Krumm et al. | Jan 2005 | B2 |
6889053 | Chang et al. | May 2005 | B1 |
6992625 | Krumm et al. | Jan 2006 | B1 |
7053830 | Krumm et al. | May 2006 | B2 |
7116988 | Dietrich et al. | Oct 2006 | B2 |
7149196 | Bims | Dec 2006 | B1 |
7196662 | Misikangas et al. | Mar 2007 | B2 |
7202816 | Krumm et al. | Apr 2007 | B2 |
7250907 | Krumm et al. | Jul 2007 | B2 |
20030043073 | Gray et al. | Mar 2003 | A1 |
20040003042 | Horvitz et al. | Jan 2004 | A1 |
20040072577 | Myllymaki et al. | Apr 2004 | A1 |
20040263388 | Krumm et al. | Dec 2004 | A1 |
20050020277 | Krumm et al. | Jan 2005 | A1 |
20050030929 | Swier et al. | Feb 2005 | A1 |
20050125369 | Buck et al. | Jun 2005 | A1 |
20050136972 | Smith et al. | Jun 2005 | A1 |
20050243936 | Agrawala et al. | Nov 2005 | A1 |
20050251328 | Merwe et al. | Nov 2005 | A1 |
20060041615 | Blank et al. | Feb 2006 | A1 |
20060119516 | Krumm et al. | Jun 2006 | A1 |
Number | Date | Country |
---|---|---|
1 500 949 | Jan 2005 | EP |
WO 2004008795 | Jan 2004 | WO |
WO 2004095868 | Nov 2004 | WO |
Number | Date | Country | |
---|---|---|---|
20060205417 A1 | Sep 2006 | US |