Confidence blanket for randomly-located measurements

CROSS-REFERENCE TO RELATED APPLICATIONS

The present application is related to concurrently filed, co-pending, and commonly assigned U.S. patent application Ser. No. 10/846,301, entitled “ADAPTIVE DATA COLLECTION,” the disclosure of which is hereby incorporated herein by reference.

TECHNICAL FIELD

Embodiments of the present invention are directed towards the reconstruction of measurable phenomena, and more specifically to reconstructions of underlying phenomena that can use randomly distributed inaccurate measurements and do not assume a function type for the phenomenon.

BACKGROUND OF THE INVENTION

The design and management of complex systems often involve the reconstruction of patterned phenomena. In order to better design cellular telephone networks, for example, engineers need to have accurate approximations of transmission tower signals at all points within the network's coverage area. Other examples may include computer packet routing software that requires accurate estimations of routing times to every node within the network, atmospheric prediction models that require accurate estimations of temperatures at all points covered by the models, or cell density estimations for tissue samples. In order to design and implement applications in these and like systems, designers often rely on their knowledge of the nature of the phenomena upon which those applications are based.

Most applications do not involve phenomena that can be perfectly reconstructed, with absolute precision and certainty, at every point within the measurement space, but do involve variables capable of being measured. Many methods have been developed that approximate or estimate the value of a phenomenon from those measurements within a limited region. In traditional methods, designers assume that the phenomenon behaves as a polynomial. Through continued measurements, such a method will continue to refine the approximation, but will retain the assumption that the underlying phenomenon values are polynomial throughout the measurement space.

Although the most common assumption is that phenomena mimic polynomials, an understanding of what generated the phenomena may lead to a trigonometric model, an exponential model, or some other appropriate mathematical prediction of value. Typically, reconstruction using these models will take the form of a measurement system that attempts to collect enough points to satisfy the function type assumption, and to further isolate any unknown parameters of that function. For example, spectrum analysis assumes that a signal can be decomposed into some combination of sine and cosine functions. Using a spectrum analyzer, the phenomenon being reconstructed is then measured and the appropriate coefficients for the sine and cosine functions are determined through continued measurement. When the data is plotted on top of the prediction, the functional form will then approximate the data. Residuals may also be calculated that give an estimate of how well the measured data fits the predicted functional form.

BRIEF SUMMARY OF THE INVENTION

Embodiments of the present invention reconstruct an underlying phenomenon, without the need for predicting a functional form for that signal, by establishing an upper bound and a lower bound for the value of the phenomenon at every point within a measurement space. By acquiring a large enough number of measurements of the phenomenon, a sufficiently high probability that the value of said phenomenon lies between said upper and said lower bound at every point within said measurement space may be established.

Further embodiments of the present invention are systems for reconstructing a phenomenon that include a means for measuring the phenomenon, computer readable memory that stores those measurements, and a processor. These systems run code that calculates an upper bound and a lower bound for the value of the phenomenon at every point within a measurement space, and code that calculates a bound on the probability, dependent upon the number of said measurements, that the value of the phenomenon lies between the upper bound and the lower bound.

The foregoing has outlined rather broadly the features and technical advantages of the present invention in order that the detailed description of the invention that follows may be better understood. Additional features and advantages of the invention will be described hereinafter which form the subject of the claims of the invention. It should be appreciated that the conception and specific embodiment disclosed may be readily utilized as a basis for modifying or designing other structures for carrying out the same purposes of the present invention. It should also be realized that such equivalent constructions do not depart from the invention as set forth in the appended claims. The novel features which are believed to be characteristic of the invention, both as to its organization and method of operation, together with further objects and advantages will be better understood from the following description when considered in connection with the accompanying figures. It is to be expressly understood, however, that each of the figures is provided for the purpose of illustration and description only and is not intended as a definition of the limits of the present invention.

BRIEF DESCRIPTION OF THE DRAWINGS

For a more complete understanding of the present invention, reference is now made to the following descriptions taken in conjunction with the accompanying drawing, in which:

FIG. 1 illustrates a cellular telephone network arranged according to one embodiment of the present invention;

FIG. 2 illustrates a reconstruction of an underlying phenomenon according to an embodiment of the present invention;

FIGS. 3A, 3B, and 3C illustrate an example test signal and an example set of measurements in a one-dimensional measurement space;

FIG. 4 is an illustration of a computer network management system;

FIG. 5 is an illustration of a method for determining the density of cancer cells in accordance with an embodiment of the present invention;

FIG. 6 is an illustration of a system for determining the air temperature a distance from the Earth's surface; and

FIG. 7 illustrates an example computer system 700 adapted according to embodiments of the present invention.

DETAILED DESCRIPTION OF THE INVENTION

Embodiments of the present invention do not have to explicitly assume any functional form for the underlying phenomenon, but rather attempt to bound errors in the measurement of it. If a system designer knows, to a high enough degree of confidence, that the value of a future measurement of the phenomenon will lie between an upper bound and a lower bound of sufficiently narrow separation, then it may become unnecessary to predict the exact form of that phenomenon. For example, in the reconstruction of a signal from a cellular transmission tower, designers can use measurements of the signal produced to generate an upper and lower bound for that signal, rather than a specific function-prediction of that signal's value through the measurement space. If such a reconstruction leads to a 95% confidence that the signal lies within a 10 milliwatt range, then exact reconstruction of the underlying signal can become unnecessary.

FIG. 1 illustrates a cellular telephone network arranged according to one embodiment of the present invention. The service area for network 100 is represented by measurement space 101 bounded by an x and a y axis, with a transmission tower 102 located at the origin. Measurement space 101 may be further divided into 48 sub-regions 103 representing a portion of the area within range of transmission tower 102. In order to properly maintain cellular service, the network provider needs to have an accurate measurement of the signal emitted by tower 102 in every sub-region 103. However, the signal emitted by tower 102 may not be uniformly measurable across the sub-regions. Physical obstacles, such as mountains 111, buildings 112, and trees 113, will interfere with, reflect, and otherwise distort the underlying signal within a sub-region. This volatility leads to increased uncertainty in the measurements and makes reconstruction of the underlying signal more difficult in sub-regions that contain such obstacles.

FIG. 2 illustrates a reconstruction of an underlying phenomenon according to an embodiment of the present invention. In FIG. 2, graph 200 represents reconstruction of signal 201, which is the signal of transmission tower 102 of FIG. 1. Embodiments of the present invention determine an upper bound 202 and a lower bound 203, and calculate a lower bound on the probability that a new measurement of signal 201 will lie between upper bound 202 and lower bound 203. If the separation 204 is appropriately small, and the probability is appropriately high, a function based prediction of signal 201 can be rendered unnecessary. When initially generated, some embodiments of the present invention may create upper bound 202 as a plane located at +∞ and lower bound 203 as a plane located at −∞. As additional measurements are taken, the calculations of upper bound 202 and lower bound 203 lead to surfaces that more precisely mirror underlying signal 201.

Embodiments of the present invention may use a mathematical analysis that places error bounds and confidence bounds on mathematical models built using randomly acquired measurements, such as the measurement of the spatial variation of the radio frequency (RF) fields associated with a cellular phone infrastructure. This analysis uses a probably approximately correct (PAC) model of the underlying data, as discussed by M. Anthony and N. Biggs in Computational Learning Theory, Cambridge University Press, UK, 1997, the disclosure of which is hereby incorporated herein by reference. The following example mathematical method is described using the cellular telephone network as an example.

Confidence Blanket Calculations

An example set of measurements can be treated as a family of mutually independent random variables (measurements) ξ(z), indexed by a space variable z∈D, where D is a domain in ℝ^k, a Euclidean space of dimension k. D can be represented as a union of open balls of a fixed radius r. Let r*>0 be the exact upper bound of values of r for which the representation is possible. Hence, r* is a characteristic of “smoothness” of the boundary of D. The space variable z, as viewed in the distributed measurements environment, is also random with a distribution function F(b)=P(z∈b), where b is a measurable subset of D in the standard probability theory sense. In the context of RF field measurement, k=2. ξ(z) represents the value of an individual observation (i.e. a measurement result) at point z. For the present example, ξ(z) can represent a measurement of the RF power for a specific cellular channel and antenna. Let the mean value Eξ(x)=m(x) be an unknown function of x describing the physical variable being measured. The function m(x) is to be estimated based on spatially distributed observations ξ(z_i) at n points, represented by the set A={z₁, . . . , z_n}, where A is generated by a random process. For the example of cellular telephones as measurement devices, the randomness can arise from the normal usage patterns of cell phone users. This analysis does assume, however, that every subdomain of D (each sub-region of a cellular network) has sufficient coverage by measurement points.

PAC Learning of m(x)

Assumption 1: Measurement Points Distribution Regularity

If d, d⊂D, is a ball of a radius r and F is a distribution function of these points, in this embodiment it has been assumed:

F(d)→0, when r→0 (continuity), and F(d)>γ>0 (1)

where γ=γ(r) depends on the radius of a ball but not on the location of its center.

Assumption 2: Smoothness

It has also been assumed that the unknown function m(z) describing the underlying variable being measured is sufficiently smooth. Excessively volatile functions m(z), that is, functions with unbounded change over a bounded domain, are excluded from further consideration:

|m(x)−m(y)|<Kρ^α(x,y) (2)

for all x,y∈D; where K and α are positive constants, and ρ(x,y) is the Euclidean distance between x and y.

Assumption 3: Bounded Measurement Range.

It has been further assumed that the random variable ξ(z) representing a measurement result at point z has a limited range between some value a and some value b:

b≦ξ(z)≦a (3)

Construct an Estimator for the Measured Variable:

To construct an estimator M(x) for m(x), this embodiment defines M(x)=M(x; ξ(z₁), . . . , ξ(z_n)) for x∈D, and measurements ξ(z_i) such that for any ε>0, and δ>0 there is an n, such that

$P\left\{\sup_{x \in D} \left|M(x; \xi(z_{1}), \dots, \xi(z_{n})) - m(x)\right| < \varepsilon\right\} > 1 - \delta \qquad (4)$

If (4) is satisfied for an estimator M(x), a Confidence Blanket (CB) can be constructed for m(x). The CB at the confidence level 1-δ is defined such that for all x∈D

M(x; ξ(z₁), . . . , ξ(z_n))−ε(lower bound 203)<m(x)<M(x; ξ(z₁), . . . , ξ(z_n))+ε(upper bound 202) and (4) holds. (5)

For every subset s of measurements A, having cardinality |s|, this embodiment defines

$G(\xi, s) = \begin{cases} \frac{1}{|s|} \sum_{w \in s} \xi(w), & \text{if } |s| \neq 0 \\ 0, & \text{if } |s| = 0 \end{cases}$

Let d(x;r) be an open ball of radius r centered at point x, and let s=s(x;r) be defined as A∩d(x;r), i.e. the subset of measurement points within d(x;r). Let S=S(r) be the class of all such subsets. One simple candidate for an estimator of m(x) is the average value of observations made at points inside d(x;r), i.e.,

$M(x) = G(\xi, A \cap d(x; r)) = \frac{1}{k_{n}} \sum_{w \in s} \xi(w), \quad \text{where } k_{n} = k_{n}(x; r) = |s| \qquad (6)$

From equation (6), M(x) is effectively defined as a function on S.

M(x)=M(s(x;r))=M(s) (7)
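By way of illustration only, the local-average estimator of (6) and the blanket of (5) can be sketched in a few lines of Python. The function names, the sample coordinates, and the sample power readings below are hypothetical and are not taken from any embodiment; the sketch merely assumes that measurement locations are given as coordinate tuples.

```python
import math

def local_average(x, points, values, r):
    """Estimate m(x) as the mean of measurements inside the open ball d(x; r).

    points are the measurement locations z_i, values are the readings xi(z_i).
    Returns (M(x), k_n). M(x) is 0 when the ball holds no measurements,
    matching the convention G(xi, s) = 0 for an empty subset s.
    """
    inside = [v for z, v in zip(points, values) if math.dist(x, z) < r]
    k_n = len(inside)
    return (sum(inside) / k_n if k_n else 0.0), k_n

def blanket(x, points, values, r, eps):
    """Return (lower, upper) = (M(x) - eps, M(x) + eps), as in (5)."""
    m_est, _ = local_average(x, points, values, r)
    return m_est - eps, m_est + eps

# Hypothetical RF power readings (dBm) at four locations in the plane.
points = [(0.0, 0.0), (0.5, 0.0), (0.0, 0.5), (3.0, 3.0)]
values = [-60.0, -62.0, -64.0, -90.0]
estimate, count = local_average((0.1, 0.1), points, values, r=1.0)
lower, upper = blanket((0.1, 0.1), points, values, r=1.0, eps=2.0)
print(estimate, count, lower, upper)
```

The distant reading at (3, 3) falls outside the ball and does not influence the estimate, which is the locality that Assumption 2 relies upon.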

Hereafter all expectations and probabilities are conditional upon A={z₁, . . . , z_n}, i.e., E(●) and P{●} stand for E(●/A) and P{●/A} respectively. This embodiment returns to using unconditional probabilities at the end of the derivation for CB bounds based on the Total Expectation/Probability Formula: E(●)=E(E(●/A)), as described in A. N. Shiryaev, Probability, Springer-Verlag, 1996, hereby incorporated by reference, which is transformed into P{●}=E(P{●/A}), when applied to indicators of events.

It follows from condition (1) that k_n→∞ along with n, for any x and r, i.e., measurements visit every subdomain of D infinitely often when n→∞.

If m̂(x) is the mean value of M(x) then (6) yields:

$\hat{m}(x) = E(M(x)) = \begin{cases} \frac{1}{k_{n}} \sum_{w \in A \cap d(x; r)} m(w), & \text{if } k_{n} \neq 0 \\ 0, & \text{if } k_{n} = 0 \end{cases} \qquad (8)$

The natural way to assess

$P\left\{\sup_{x \in D} |M(x) - m(x)| > \varepsilon\right\}$

is to approximate the difference M(x)−m(x) by M(x)−m̂(x) and then employ a variety of tools to deal with the latter, which is a centered sum of independent random variables.

Straightforward manipulations result in:

$\begin{aligned} P\left\{\sup_{x \in D} |M(x) - m(x)| > \varepsilon\right\} &= P\left\{\sup_{x \in D} |M(x) - \hat{m}(x) + \hat{m}(x) - m(x)| > \varepsilon\right\} \\ &\leq P\left\{\sup_{x \in D} |M(x) - \hat{m}(x)| + \sup_{x \in D} |\hat{m}(x) - m(x)| > \varepsilon\right\} \\ &= P\left\{\sup_{x \in D} |M(x) - \hat{m}(x)| > \varepsilon - \sup_{x \in D} |\hat{m}(x) - m(x)|\right\} \end{aligned} \qquad (9)$

It follows from the definition of M(x) and m̂(x), which also can be viewed as functions on S, that

$\sup_{x \in D} |M(x) - \hat{m}(x)| = \sup_{s \in S} |M(s) - \hat{m}(s)| \qquad (10)$

The original definition of “sup” over an infinite number of points x in D can thus be reduced to a definition over the finite number of different subsets in S, obtained by intersecting A with the set B(r)={d(x;r)} of all balls of fixed radius r.

The upper bound on the cardinality |S| of S will play a role in the quality of the estimate of

$P\left\{\sup_{x \in D} |M(x) - m(x)| > \varepsilon\right\}.$

Computing the Bounds on |S|

This embodiment derives an upper bound for |S| using Sauer's Lemma as described in L. Devroye and G. Lugosi, Combinatorial Methods in Density Estimation, Springer-Verlag, 2001, hereby incorporated herein by reference. Let V be the Vapnik-Chervonenkis (VC) dimension, as described in V. Vapnik, Statistical Learning Theory, John Wiley & Sons, 1998, hereby incorporated herein by reference, of the set of balls of the same radius, and let U be the VC dimension of the set of all balls. V≦U=k+1=3 in the case of the Euclidean plane. On the other hand, a set of three points at the vertices of an equilateral triangle can be shattered by a set of circles with a fixed radius r greater than √3/3, if the distance between vertices is 1. Therefore V≧3 and therefore V=3. Sauer's Lemma now implies: |S|≦(ne/3)³<n³ (e=2.7 . . . ). However, it is possible to bound |S| more tightly.

Let the maximal achievable value of |S| over all sets of n points, called the shatter coefficient for the set of all balls B(r) of fixed radius r in ℝ², be denoted as N(n). This value is independent of r. The following can establish upper and lower bounds on N(n).

Lemma 1. n²−n+2≦N(n). Using a simple construction to prove the lower bound, n points can be equidistantly placed on the circumference of a circle of radius h. Then, for any integer 1≦k<n, all n subsets of k consecutive points can be cut off by circles of a fixed radius r>h. Including the empty set and the entire set of n points, the total number of generated subsets is n(n−1)+2.

Lemma 2. N(n)≦n²−n+2. For each point in A, place a circle centered at this point with the same radius (say one) as is used to achieve N(n) distinct subsets. This arrangement of circles partitions the plane into a number of components, usually referred to as faces. The quantity of components can be an upper bound for N(n). Let c be one of the centers of unit circles which generate S. Generally speaking, c∉A. A corresponding subset s(c)⊂A consists of all points whose distance to c is less than 1. By construction these are exactly the points whose corresponding circles bound the component to which c belongs, and every component in the arrangement contains no more than one c. Therefore N(n)≦{maximum number of components}=n²−n+2. The latter can be proven by induction arguments.
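As a non-limiting sketch, the counting argument of Lemma 1 can be checked combinatorially and compared with the looser Sauer bound. The Python below only enumerates runs of consecutive points on a circle; it does not verify the geometric claim that a circle of fixed radius cuts off each run. All function names are hypothetical.

```python
import math

def consecutive_subsets(n):
    """Count the distinct subsets in Lemma 1's construction: every run of k
    consecutive points (1 <= k < n) among n points on a circle, plus the
    empty set and the full set."""
    subsets = {frozenset(), frozenset(range(n))}
    for k in range(1, n):
        for start in range(n):
            subsets.add(frozenset((start + i) % n for i in range(k)))
    return len(subsets)

def shatter_coefficient(n):
    """N(n) = n^2 - n + 2, per Lemmas 1 and 2."""
    return n * n - n + 2

def sauer_bound(n, vc_dim=3):
    """(n e / V)^V, the Sauer-type bound quoted in the text for V = 3."""
    return (n * math.e / vc_dim) ** vc_dim

for n in (5, 10, 50):
    print(n, consecutive_subsets(n), shatter_coefficient(n), round(sauer_bound(n)))
```

The enumeration agrees with n²−n+2, and the printed Sauer values illustrate how much slack the cubic bound carries relative to the quadratic one.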

Derivation of the CB for m(x)

Theorem 1. Under assumptions 1, 2 and 3

P{Ω}≦2(n²−n+2)(exp(−λ²h²(n))+4exp(−nβ²/32)) (11)

where λ=√2/(b−a), and h(n)=(n(γ−β))^(1/2)·(ε−Kr^α).

Proof

Sensible bounds for

$P\left\{\sup_{x \in D} |M(x) - \hat{m}(x)| \geq \varepsilon - \sup_{x \in D} |\hat{m}(x) - m(x)|;\ B_{\beta}\right\}$

require

$\sup_{x \in D} |\hat{m}(x) - m(x)|$

to be limited by ε. Given B_β has occurred,

$\sup_{x \in D} |\hat{m}(x) - m(x)| = \sup_{x \in D} \left|\frac{1}{k_{n}} \sum_{w \in s} (m(w) - m(x))\right|$

can be assessed, due to the definition (8) and Assumption 2 (smoothness). All terms |m(w)−m(x)| in (8) are bounded uniformly, since w and x are inside a certain ball d(x;r) of radius r: |m(w)−m(x)|≦Kρ^α(x,w)≦Kr^α. This leads straight to:

$|\hat{m}(x) - m(x)| = \left|\frac{1}{k_{n}} \sum_{w \in s} (m(w) - m(x))\right| \leq \frac{1}{k_{n}} \sum_{w \in s} |m(w) - m(x)| \leq Kr^{\alpha}$

Therefore, a choice for r such that Kr^α<ε is feasible. Two more observations are needed to bound the probability in (9). First, the reduction in (10) allows for representing probabilities in (9) in terms of a new variable, s. Second, the event

$\left\{\sup_{s \in S} |M(s) - \hat{m}(s)| \geq t;\ B_{\beta}\right\}$

is equivalent to the union

$\bigcup_{s \in S} \left\{|M(s) - \hat{m}(s)| \geq t;\ B_{\beta}\right\}.$

These observations yield the chain of inequalities:

$\begin{aligned} P\left\{\sup_{s \in S} |M(s) - \hat{m}(s)| \geq \varepsilon - \sup_{z \in D} |\hat{m}(z) - m(z)|;\ B_{\beta}\right\} &\leq P\left\{\sup_{s \in S} |M(s) - \hat{m}(s)| \geq \varepsilon - Kr^{\alpha};\ B_{\beta}\right\} \\ &\leq \sum_{s \in S} P\left\{|M(s) - \hat{m}(s)| \geq \varepsilon - Kr^{\alpha};\ B_{\beta}\right\} \end{aligned} \qquad (12)$

The number of summands in (12) is not greater than |S|≦n²−n+2, hence Hoeffding's inequality applied to every term in (12) implies:

$P\left\{\sup_{s \in S} |M(s) - \hat{m}(s)| > \varepsilon - Kr^{\alpha};\ B_{\beta}\right\} \leq 2(n^{2} - n + 2) \max_{s \in S} \left\{\exp\left(-2 \frac{k_{n}}{(b - a)^{2}} (\varepsilon - Kr^{\alpha})^{2}\right)\right\} = 2(n^{2} - n + 2) \exp\left(-\lambda^{2} (\varepsilon - Kr^{\alpha})^{2} k_{n}\right) \qquad (13)$

where k_n is such that

$B_{\beta} = \left\{\sup_{x \in D} |k_{n}(x)/n - F(d(x))| \leq \beta\right\}$

holds. Now the analysis returns to unconditional probabilities via TEF, i.e.

$P\left\{\sup_{s \in S} |M(s) - \hat{m}(s)| \geq \varepsilon - Kr^{\alpha};\ B_{\beta}\right\} \leq 2(n^{2} - n + 2)\, E\left\{\exp\left(-\lambda^{2} (\varepsilon - Kr^{\alpha})^{2} \min_{s \in S} k_{n}\right);\ B_{\beta}\right\}$

where E{X(ω); A} stands for

$\int_{A} X(\omega)\, P(d\omega),$

X(ω) is a random variable, A is a measurable subset of a probability space and P(●) is a probability measure on this space. Remember that the random variable k_n is a function of x. Since

$E\left\{\exp\left(-\lambda^{2} (\varepsilon - Kr^{\alpha})^{2} \min_{s \in S} k_{n}\right);\ B_{\beta}\right\} = E\left\{\exp\left(-\lambda^{2} (\varepsilon - Kr^{\alpha})^{2} \min_{x \in D} k_{n}\right);\ B_{\beta}\right\}$

on B_β the value of

$\frac{k_{n}}{n} \geq γ - β$

and consequently,

$E\left\{\exp\left(-\lambda^{2} (\varepsilon - Kr^{\alpha})^{2} \min_{x \in D} k_{n}\right);\ B_{\beta}\right\} \leq P(B_{\beta}) \exp\left(-n \lambda^{2} (\varepsilon - Kr^{\alpha})^{2} (\gamma - \beta)\right) < \exp(-\lambda^{2} h^{2}(n)).$

Finally (6), (7), and (12) yield the following bound:

P{Ω}≦2(n²−n+2)(exp(−λ²h²(n))+4exp(−nβ²/32)).

Now the confidence probability for CB can be estimated as:

$P\left\{\sup_{x \in D} \left|M(x; \xi(z_{1}), \dots, \xi(z_{n})) - m(x)\right| < \varepsilon\right\} \geq 1 - 2(n^{2} - n + 2) \left(\exp(-\lambda^{2} h^{2}(n)) + 4 \exp(-n \beta^{2} / 32)\right) \qquad (14)$

Remark 1. This bound holds for an arbitrary r and β within the interval 0<β<γ. Recall that γ is a function of r and therefore there are optimal values for r and β that maximize the bound in (14).
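For illustration, the right hand side of (14) can be evaluated numerically as sketched below. The function name and the numeric parameter values are hypothetical. The value of γ(r) and the measurement range (b−a) are assumed to be supplied by the caller, since the analysis above does not prescribe how they are obtained, and the result is clamped at zero because the bound is vacuous for small n.

```python
import math

def uniform_confidence(n, eps, r, beta, gamma, K, alpha, value_range):
    """Lower bound (14) on P{sup |M - m| < eps}, clamped to be non-negative.

    gamma is gamma(r) from Assumption 1; value_range is (b - a).
    """
    slack = eps - K * r ** alpha
    if slack <= 0 or not 0 < beta < gamma:
        raise ValueError("need K*r**alpha < eps and 0 < beta < gamma")
    lam_sq = 2.0 / value_range ** 2          # lambda^2 = 2 / (b - a)^2
    h_sq = n * (gamma - beta) * slack ** 2   # h(n)^2
    failure = 2 * (n * n - n + 2) * (
        math.exp(-lam_sq * h_sq) + 4 * math.exp(-n * beta ** 2 / 32)
    )
    return max(0.0, 1.0 - failure)

# Hypothetical parameter values, chosen only to exercise the formula.
params = dict(eps=0.1, r=0.05, beta=0.05, gamma=0.1, K=1.0, alpha=1.0, value_range=1.0)
low = uniform_confidence(100, **params)
high = uniform_confidence(10**6, **params)
print(low, high)
```

With these assumed values the bound is uninformative at n=100 and approaches 1 at n=10⁶, consistent with the observation that the term exp(−nβ²/32) decays slowly and dominates for moderate n.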

Remark 2. Assumption 3 can be relaxed in favor of any condition on the distribution of ξ(z) that preserves, for example, an exponential decay of the density's tail. The finiteness of E(exp(νξ(z))) for some ν>0, uniformly over z, for instance, is sufficient. It permits using Hoeffding's inequality to estimate the probability in (9).

Integral Metric

Some preliminary preparations have been made that are similar to the previous case. As before, a set A={z₁, . . . , z_n} of measurement points' locations is treated as fixed and all expectations/probabilities are conditional upon A unless stated otherwise. Let us centralize M(x) with m̂(x):

$P\left\{\int_{D} |M(x) - m(x)|\, \Phi(dx) \geq \varepsilon\right\} \leq P\left\{\int_{D} |M(x) - \hat{m}(x)|\, \Phi(dx) + \int_{D} |\hat{m}(x) - m(x)|\, \Phi(dx) \geq \varepsilon\right\} \qquad (15)$

and estimate |m̂(x)−m(x)| as:

$|\hat{m}(x) - m(x)| \leq \begin{cases} Kr^{\alpha}, & \text{if } k_{n} \neq 0 \\ m(x) \leq a, & \text{if } k_{n} = 0 \end{cases} \qquad (16)$

Since

$\int_{D} |\hat{m}(x) - m(x)|\, \Phi(dx) = \int_{\{k_{n}(\cdot) \neq 0\}} |\hat{m}(x) - m(x)|\, \Phi(dx) + \int_{\{k_{n}(\cdot) = 0\}} |\hat{m}(x) - m(x)|\, \Phi(dx),$

the inequality (16) leads to the bound

$\int_{D} |\hat{m}(x) - m(x)|\, \Phi(dx) \leq Kr^{\alpha}\, \Phi(x : k_{n}(x) \neq 0) + (b - a)\, \Phi(x : k_{n}(x) = 0)$

With the last remark, Markov's inequality

$P(Y \geq t) \leq \frac{EY}{t} \quad (t > 0,\ Y \geq 0),$

applied to the random variable

$\int_{D} |M(x) - \hat{m}(x)|\, \Phi(dx) + (b - a)\, \Phi(x : k_{n}(x) = 0)$

implies:

$P\left\{\int_{D} |M(x) - \hat{m}(x)|\, \Phi(dx) + u_{2} \geq \varepsilon - u_{1}\right\} \leq \frac{E \int_{D} |M(x) - \hat{m}(x)|\, \Phi(dx) + u_{2}}{\varepsilon - u_{1}} \qquad (17)$

where u₁=Kr^αΦ(x|k_n(x)≠0) and u₂=(b−a)Φ(x|k_n(x)=0). Naturally, r should be such that Kr^αΦ(x: k_n(x)≠0)<ε.

By Fubini's theorem one can swap ∫ and E in (17). To treat E|M(s)−m̂(s)|, this disclosure uses the fact

$EY = \int_{0}^{\infty} P\{Y \geq t\}\, dt$

which holds for any nonnegative random variable Y if EY exists. That way the assessment of E|M(s)−m̂(s)| is reduced to finding a bound for the corresponding probability. Hoeffding's inequality applied to P{|M(x)−m̂(x)|≧t} bounds the numerator of the right hand side in (17) by

$\int_{D} \left(\int_{0}^{\infty} 2 \exp\left(-2 t^{2} k_{n}(x) / (b - a)^{2}\right) dt\right) \Phi(dx) + u_{2} = (b - a) \sqrt{\frac{\pi}{2}} \int_{\{k_{n}(\cdot) \neq 0\}} \frac{\Phi(dx)}{\sqrt{k_{n}(x)}} + u_{2}$

and, consequently inequality (17) now looks as follows:

$P\left\{\int_{D} |M(x) - \hat{m}(x)|\, \Phi(dx) + u_{2} \geq \varepsilon - u_{1}\right\} \leq \frac{1}{\varepsilon - u_{1}} \left((b - a) \sqrt{\frac{\pi}{2}} \int_{\{k_{n}(\cdot) \neq 0\}} \frac{\Phi(dx)}{\sqrt{k_{n}(x)}} + u_{2}\right) \qquad (18)$
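The inner integral over t that produces the factor (b−a)√(π/2)/√k_n(x) is a standard Gaussian integral. Writing it out, with c used here purely as illustrative shorthand for the coefficient of t²:

$\int_{0}^{\infty} 2 \exp(-c\, t^{2})\, dt = \sqrt{\frac{\pi}{c}}, \qquad c = \frac{2 k_{n}(x)}{(b - a)^{2}} \;\Longrightarrow\; \int_{0}^{\infty} 2 \exp\left(-\frac{2 t^{2} k_{n}(x)}{(b - a)^{2}}\right) dt = (b - a) \sqrt{\frac{\pi}{2}}\, \frac{1}{\sqrt{k_{n}(x)}}$

The identity requires c>0, i.e. k_n(x)≠0, which is why the outer integral is restricted to the set where k_n(●)≠0.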

Remark: This derivation is true only for those x's where k_n(x)≠0. Points x with k_n(x)=0 do not contribute to the value of the integral on the right hand side of (17), due to the definition of M(x) and m̂(x). The latter is reflected in the domain of integration in (17) and (18). In order not to have these reservations in the future, and for convenience in applying Fubini's theorem again, let us define the variable j_n(x), random with respect to the set A={z₁, . . . , z_n}, as:

$j_{n}(x) = \begin{cases} 1 / \sqrt{k_{n}(x)}, & \text{if } k_{n}(x) \neq 0 \\ 0, & \text{if } k_{n}(x) = 0 \end{cases}$

We can now return to actual probabilities via TEF by taking expectations over all possible sets A={z₁, . . . , z_n} from both sides of the inequality

$P\left\{\int_{D} |M(x) - m(x)|\, \Phi(dx) \geq \varepsilon\right\} \leq \frac{(b - a) \sqrt{\frac{\pi}{2}} \int_{D} j_{n}(x)\, \Phi(dx) + u_{2}}{\varepsilon - u_{1}} \qquad (19)$

which follows from (15) and (18). Indeed, if Fubini's theorem is applied to the right hand side of (19), we can evaluate Ej_n(x) first and then estimate the integral over D. Following the definition of j_n, the calculation of Ej_n can be represented as:

Ej_n=E{j_n; k_n≠0}+E{j_n; k_n=0}=E{j_n; k_n≠0}=E{1/√k_n; k_n≠0}

In turn, the set {k_n≠0} can be further split into two subsets

{k_n(x)≠0}={k_n(x)/n−F(d(x))>−β; k_n(x)≠0}∪{k_n(x)/n−F(d(x))≦−β; k_n(x)≠0}

and therefore

$E\left\{1/\sqrt{k_{n}};\ k_{n}(x) \neq 0\right\} = E\left\{1/\sqrt{k_{n}};\ \{k_{n}(x)/n - F(d(x)) > -\beta\} \cap \{k_{n} \neq 0\}\right\} + E\left\{1/\sqrt{k_{n}};\ \{k_{n}(x)/n - F(d(x)) \leq -\beta\} \cap \{k_{n} \neq 0\}\right\}$

This embodiment works with each of the two terms separately. Since {k_n(x)/n−F(d(x))>−β}⊂{k_n(x)≠0}, for the first term

$E\left\{1/\sqrt{k_{n}};\ \{k_{n}(x)/n - F(d(x)) > -\beta\} \cap \{k_{n} \neq 0\}\right\} = E\left\{1/\sqrt{k_{n}};\ k_{n}(x)/n - F(d(x)) > -\beta\right\} < \frac{1}{\sqrt{n(F(d(x)) - \beta)}}\, P\left(k_{n}(x)/n - F(d(x)) > -\beta\right) < \frac{1}{\sqrt{n(F(d(x)) - \beta)}} < \frac{1}{\sqrt{n(\gamma - \beta)}}$

and for the second

E{1/√k_n; {k_n(x)/n−F(d(x))≦−β}∩{k_n≠0}}≦1·P(k_n(x)/n−F(d(x))≦−β)≦exp(−2nβ²)

The last is Hoeffding's bound for the random variable k_n(x), which is a sum of n Bernoulli indicators. Here β<γ is an otherwise arbitrary constant. As the result of this observation:

$Ej_{n} < \frac{1}{\sqrt{n(\gamma - \beta)}} + \exp(-2 n \beta^{2}) \qquad (20)$
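As an illustrative check only, the Hoeffding tail bound exp(−2nβ²) used for the second term can be compared with an exact binomial tail. The sketch assumes that k_n(x) is binomial with n trials and success probability F(d(x)); the numeric values of n, F and β are hypothetical, and the function names are not part of any embodiment.

```python
import math

def binomial_lower_tail(n, p, k_max):
    """Exact P(X <= k_max) for X ~ Binomial(n, p)."""
    return sum(
        math.comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(k_max + 1)
    )

def hoeffding_bound(n, beta):
    """Hoeffding bound on P(k_n / n - F <= -beta)."""
    return math.exp(-2 * n * beta ** 2)

# Hypothetical values: F(d(x)) = 0.5 and beta = 0.25, so the event
# k_n / n - F <= -beta is the event k_n <= 25 when n = 100.
n, F, beta = 100, 0.5, 0.25
k_max = 25
tail = binomial_lower_tail(n, F, k_max)
bound = hoeffding_bound(n, beta)
print(tail, bound)
```

The exact tail lies below the Hoeffding value, as the inequality guarantees; the gap indicates that this step of the derivation is conservative.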

Notice that this bound depends on r, since γ=γ(r), but does not depend on x. The next step is to estimate Eu₁=Kr^αEΦ(x: k_n(x)≠0). Since Φ is a probability measure, Eu₁≦Kr^α. In order to estimate Eu₂, the random indicator function I_[k_n(●)=0](x) is introduced and, by applying Fubini's theorem:

$E\Phi(x : k_{n}(x) = 0) = E \int_{D} I_{[k_{n}(\cdot) = 0]}(x)\, \Phi(dx) = \int_{D} E\left(I_{[k_{n}(\cdot) = 0]}(x)\right) \Phi(dx) \qquad (21)$

By the definition of the indicator function, E(I_[k_n(●)=0](x))=P(k_n(x)=0)=(1−F(d(x)))ⁿ. Finally,

$\int_{D} (1 - F(d(x)))^{n}\, \Phi(dx) \leq (1 - \gamma)^{n}.$

The latter can now be combined with (17), (19) and (20) to enable the bound

$\frac{(b - a) \left(\sqrt{\frac{\pi}{2 n (\gamma - \beta)}} + \exp(-2 n \beta^{2}) + (1 - \gamma)^{n}\right)}{\varepsilon - Kr^{\alpha}} = (\sqrt{\pi} / \lambda)\, h^{-1}(n) + \frac{(b - a) \left(\exp(-2 n \beta^{2}) + (1 - \gamma)^{n}\right)}{\varepsilon - Kr^{\alpha}} \qquad (22)$

This derivation can be summarized as:

Theorem 2. For arbitrary ε>0, r<(ε/K)^(1/α) and β<γ, if Assumptions 1–3 hold, the following bound holds:

$P\left\{\int_{D} |M(x) - m(x)|\, \Phi(dx) < \varepsilon\right\} \geq 1 - (\sqrt{\pi} / \lambda)\, h^{-1}(n) - \frac{(b - a) \left(\exp(-2 n \beta^{2}) + (1 - \gamma)^{n}\right)}{\varepsilon - Kr^{\alpha}} \qquad (23)$
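A minimal numerical sketch of the integral-metric bound (23) follows. As before, the function name and parameter values are hypothetical, γ(r) and the range (b−a) are assumed to be known to the caller, and the result is clamped at zero where the bound is vacuous.

```python
import math

def integral_confidence(n, eps, r, beta, gamma, K, alpha, value_range):
    """Lower bound (23) on P{ integral |M - m| dPhi < eps }, clamped at 0.

    gamma is gamma(r) from Assumption 1; value_range is (b - a).
    """
    slack = eps - K * r ** alpha
    if slack <= 0 or not 0 < beta < gamma:
        raise ValueError("need K*r**alpha < eps and 0 < beta < gamma")
    lam = math.sqrt(2.0) / value_range            # lambda = sqrt(2) / (b - a)
    h = math.sqrt(n * (gamma - beta)) * slack     # h(n)
    failure = (math.sqrt(math.pi) / lam) / h + value_range * (
        math.exp(-2 * n * beta ** 2) + (1 - gamma) ** n
    ) / slack
    return max(0.0, 1.0 - failure)

# Hypothetical parameter values, chosen only to exercise the formula.
params = dict(eps=0.1, r=0.05, beta=0.05, gamma=0.1, K=1.0, alpha=1.0, value_range=1.0)
small = integral_confidence(100, **params)
large = integral_confidence(10**6, **params)
print(small, large)
```

For fixed ε, r and β the dominant term decays only like n^(−1/2), so with these assumed values the confidence remains noticeably below 1 even at n=10⁶.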

Remark 1. To bring the formulation of CB to the form similar to the uniform case, observe the inclusion of events

$E_{1} = \left\{\Phi\{x : |M(x) - m(x)| \geq t\} \geq \varepsilon / t\right\} \subseteq E_{2} = \left\{\int_{D} |M(x) - m(x)|\, \Phi(dx) \geq \varepsilon\right\}$

and consequently P(E₁)≦P(E₂).

In other words, under the conditions of Theorem 2, one is unlikely to observe large (≧t) absolute deviations of the model M(x) from an unknown phenomenon m(x) over a significant (in terms of Φ) portion of the domain D.

Remark 2. The bounds in both the cases of uniform (14) and integral (23) metrics are functions of r, β, ε and n. Since the left hand side of these inequalities depends on ε as well, a value for ε is usually picked beforehand by some practical concerns. It is obvious that only by raising n can the bounds be made arbitrarily close to 1. For the two other variables there exist optimal values r=r(n) and β=β(n) to tighten the bound in (22). On the other hand, it is the “slowest” term

$\sqrt{\frac{\pi}{2 n (\gamma - \beta)}}$

which sets the rate of convergence in (22) and (23) as a function of n. Therefore it makes sense to set β(n) such that exp(−2nβ²)=O(n^(−1/2)), e.g. β=(ln(n)/n)^(1/2).

Rate Comparison for Two Metrics

The rates of convergence have been compared for confidence bounds using the integral metric and uniform metric. To make calculations simpler we set here K=α=1.

With constant values for ε, r, and β, convergence of the confidence bound is faster for the uniform metric case. However, a scenario in which all parameters (ε, r, and β) are functions of n can produce an asymptotic feature with the opposite tendency, i.e. convergence is faster for the case with the integral metric. Because the rate of convergence to zero of a sum is not faster than that of a summand, it is sufficient, for instance, to show that with a proper choice of ε(n), r(n), and β(n) every term in (22) goes to zero faster than, say, the first term, n²exp(−λ²h²(n)), up to the O(●) equivalence, in (14). The choice of h(n) should strike a balance: h(n) must grow fast enough to offset the second degree multiplier n²=exp(2ln(n)), but not too fast, so that all terms in (22) converge to zero faster than exp(2ln(n)−λ²h²(n)). If, for example, r(n) is chosen such that γ(n)=((2ln(n)+lnlnln(n))/n)^(1/3), ε(n) is set as r(n)+√2λ⁻¹γ(n) and β(n)=(½)γ(n), then it can be verified that the bound in (22) is O((ln(n))^(−1/2)), which is o((lnln(n))⁻¹), whereas the asymptotic behavior of the first term in (14) is like O((lnln(n))⁻¹).
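The algebra behind this parameter choice can be sketched numerically. The Python below assumes K=α=1 and treats λ as a free input (defaulting to 1, which is an arbitrary illustrative value); it only checks the leading terms and does not evaluate the remaining terms of (22). The function name is hypothetical.

```python
import math

def rate_terms(n, lam=1.0):
    """Leading terms under the example choice of gamma(n), beta(n), eps(n).

    Returns (uniform_term, integral_term), where
    uniform_term  = n^2 * exp(-lam^2 * h^2(n))   (first term in (14), up to O(.))
    integral_term = (sqrt(pi) / lam) / h(n)      (first term in (22)).
    """
    ln_n = math.log(n)
    gamma = ((2 * ln_n + math.log(math.log(ln_n))) / n) ** (1 / 3)
    beta = gamma / 2
    slack = math.sqrt(2) * gamma / lam        # eps(n) - r(n) with K = alpha = 1
    h_sq = n * (gamma - beta) * slack ** 2    # equals (2 ln n + ln ln ln n) / lam^2
    uniform_term = n ** 2 * math.exp(-lam ** 2 * h_sq)
    integral_term = math.sqrt(math.pi) / (lam * math.sqrt(h_sq))
    return uniform_term, integral_term

for n in (10**3, 10**6, 10**9):
    print(n, rate_terms(n))
```

Under this choice λ²h²(n) reduces to 2ln(n)+lnlnln(n), so the uniform term collapses to 1/lnln(n) while the integral term behaves like (ln(n))^(−1/2).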

Application to a Sample Set of Measurements

The bound (22) of Theorem 2 can be applied to a set of sample measurements of the spatial variation of RF field strength on a single channel from a selected antenna.

Lee Model of Propagation

The Lee propagation model, as described in The Optimization and Application of the W. C. Y Lee Propagation Model in the 1900 MHz Frequency Band, by Greg Evans, Bob Joslin, Lin Vinson, Bill Foose, 1997 IEEE Vehicular Technology Conference, Phoenix, Ariz., hereby incorporated herein by reference is given by:

RSL=P₁−η log {R/R_REF}+15 log {h_b/h_bREF}+10 log {P_tx/P_txREF}+10 log {h_m/h_mREF} (24)

where:

RSL=Received signal level, the mean signal level (in dBm) received at distance R from the transmitter.
P₁=The signal strength expected for reference conditions (R_REF, h_bREF, P_txREF, and h_mREF).
η=The slope, i.e. the rate of decay in signal strength (in dB/decade).
R=The distance (in miles) from the transmitter.
R_REF=The reference distance (in miles).
h_b=The height (in feet) of the base transmit antenna.
h_bREF=The reference height (in feet) of the base transmit antenna.
P_tx=The transmit ERP (in Watts).
P_txREF=The reference transmit ERP (in Watts).
h_m=The height (in feet) of the mobile receive antenna.
h_mREF=The reference height (in feet) of the mobile receive antenna.

For the present calculation it can be assumed that the reference conditions hold for transmit power, and transmit and receiver antenna height. Then (24) becomes:

RSL=P₁−η log {R/R_REF} (25)

Reasonable values for η range from 30 to 40 dB/decade for R_REF=1 mile. This equation typically applies only to regions beyond the reference distance, because within this radius propagation is dominated by factors other than terrain and clutter. However, the conditions described by (25) are at least as demanding as conditions within the reference radius and therefore (25) provides a reasonable basis for worst-case analysis.
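Equations (24) and (25) can be sketched in a few lines of code. The following is a minimal illustration, not a definitive implementation: it assumes base-10 logarithms (implied by the dB/decade units of η), takes the height and power terms as ratios to their reference values, and uses a hypothetical P₁ of −40 dBm purely as an example input.

```python
import math

def lee_rsl(p1_dbm: float, eta: float, r_miles: float, r_ref_miles: float = 1.0,
            hb_ratio: float = 1.0, ptx_ratio: float = 1.0,
            hm_ratio: float = 1.0) -> float:
    """Received signal level per equation (24), in dBm.

    hb_ratio  = h_b / h_bREF, ptx_ratio = P_tx / P_txREF, hm_ratio = h_m / h_mREF.
    With all three ratios left at 1.0 this reduces to equation (25).
    Base-10 logarithms are an assumption drawn from the dB/decade units.
    """
    return (p1_dbm
            - eta * math.log10(r_miles / r_ref_miles)
            + 15.0 * math.log10(hb_ratio)
            + 10.0 * math.log10(ptx_ratio)
            + 10.0 * math.log10(hm_ratio))

if __name__ == "__main__":
    # Hypothetical inputs: P1 = -40 dBm, eta = 34 dB/decade, R_REF = 1 mile.
    for r in (1.0, 2.0, 5.0, 10.0, 15.0):
        print(f"R = {r:5.1f} mi  RSL = {lee_rsl(-40.0, 34.0, r):8.2f} dBm")
```

Each tenfold increase in distance lowers the level by η dB, which is why η is described as a slope in dB/decade.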

Estimation of Smoothness Parameters (Power Measured in dBm)

For the case where the underlying variable is RF power measured in dBm we have from (2) and (25):

|m(x)−m(y)|=|−η log {R_x/R_REF}+η log {R_y/R_REF}|=η log {R_y/R_x} (26)

where R_x and R_y are the values of R at x and y respectively.

In cellular systems the signal loss is a more useful quantity than the actual power. m(x) is the power at x, m(y) is the power at y, and |m(x)−m(y)| measures the difference between them—i.e. the loss. Let:

R_x=x, and R_y=y with R_x>R_REF and y=x+ρ (27)

then (26) becomes:

|m(x)−m(y)|=η log {R_y/R_x}=η log {y/x}=η log {1+ρ/x} (28)

From (1) and (28) we have:

|m(x)−m(y)|=η log {1+ρ/x}<Kρ^α (29)

A lower bound for Kρ^α is desirable; however, the maximum value of the lower bound gives a conservative estimate of the confidence blanket. Because η log {1+ρ/x} decreases as x increases, this maximum value is found at the smallest admissible distance, x=R_REF. The desired constraint on the smoothness parameters (note that ρ must be measured in miles) is therefore:

Kρ^α>η log(1+ρ/R_REF) (30)

Inequality (30) establishes a lower bound on the term Kρ^α in the definition of λ (18).

λ=[2/(a−b)²][ε−η log(1+ρ/R_REF)]² (31)

Since the term Kρ^α is replaced by a lower bound involving only ρ, the number of free parameters in (11) is reduced to ρ, n, γ, ε, and β for the assumption that the underlying RF field is given by the Lee model in (24) and the region of interest is outside of a circle of radius R_REF from the transmitter. Note that (11) was derived using the Hoeffding inequality. The correct use of the Hoeffding inequality requires that values of ε and Kρ^α be restricted such that:

ε>Kρ^α (32)

There is one additional consideration for the case where RF power is measured in dBm. Hoeffding's inequality assumes bounded random variables. In the case of RF power in dBm there is of course no natural lower bound, since the actual power tends to zero at large distances. However, as a practical matter the actual lower limit is non-zero and is set by noise considerations that provide an effective lower bound. This lower limit is likely to be between −100 and −150 dBm. The upper bound is P_txREF expressed in dBm. The difference in dBm is the parameter b−a in equation (11).
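The bound in (30), the expression for λ in (31), and the restriction in (32) can be evaluated together as follows. This is a sketch under stated assumptions: logarithms are taken as base 10, Kρ^α is replaced by its lower bound from (30) when checking (32), and the value ρ=0.1 mile in the usage lines is hypothetical. The values ε=8 dB, η=34 dB/decade, and the −40 dBm and −125 dBm limits are the ones used in the numerical simulation of this description.

```python
import math

def smoothness_lower_bound(eta: float, rho_miles: float,
                           r_ref_miles: float = 1.0) -> float:
    """Right-hand side of (30): eta * log10(1 + rho / R_REF), in dB."""
    return eta * math.log10(1.0 + rho_miles / r_ref_miles)

def lam(eps_db: float, eta: float, rho_miles: float,
        a_dbm: float, b_dbm: float, r_ref_miles: float = 1.0) -> float:
    """Lambda per equation (31).

    Raises ValueError when eps does not exceed the lower bound on K*rho^alpha,
    i.e. when the restriction in (32) cannot be satisfied.
    """
    bound = smoothness_lower_bound(eta, rho_miles, r_ref_miles)
    if eps_db <= bound:
        raise ValueError("epsilon must exceed eta*log10(1 + rho/R_REF), see (32)")
    return 2.0 / (a_dbm - b_dbm) ** 2 * (eps_db - bound) ** 2

if __name__ == "__main__":
    rho = 0.1  # hypothetical averaging radius in miles
    print("lower bound on K*rho^alpha (dB):", smoothness_lower_bound(34.0, rho))
    print("lambda:", lam(8.0, 34.0, rho, -40.0, -125.0))
```

Because (a−b) enters only as a square, the order in which the two power limits are assigned to a and b does not affect λ.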

Numerical Simulation

FIGS. 3A, 3B, and 3C illustrate an example test signal and an example set of measurements in a one-dimensional measurement space. In FIG. 3A, test signal 301 is a sample signal as emitted from a cellular telephone tower. Measurements 302 represent measurements made of signal 301 at various distances from an antenna.

Choice of Parameter Values

The signal strength can be set to range between a minimum and maximum observed experimentally, and the RSL can be set to r^−3.4, a common value for even terrain and moderate clutter. R can be varied from a minimum of 1 mile as described above to a maximum of 15 miles. Measurements 302 made at uniformly-distributed ranges from the signal source are shown in FIG. 3A.
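A test set of this kind can be generated along the following lines. The sketch is illustrative only: the reference level of −40 dBm at 1 mile, the Gaussian measurement noise and its 2 dB standard deviation, and the fixed random seed are assumptions introduced here, not values taken from FIG. 3A. The r^−3.4 power law corresponds to η=34 dB/decade in equation (25).

```python
import math
import random
from typing import List, Tuple

def simulate_measurements(n: int, p1_dbm: float = -40.0, eta: float = 34.0,
                          r_min: float = 1.0, r_max: float = 15.0,
                          noise_db: float = 2.0, floor_dbm: float = -125.0,
                          ceil_dbm: float = -40.0,
                          seed: int = 0) -> List[Tuple[float, float]]:
    """Return n (range in miles, measured level in dBm) pairs.

    Ranges are drawn uniformly from [r_min, r_max]. The noiseless level follows
    equation (25) with R_REF = 1 mile. Gaussian noise is an assumption.
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        r = rng.uniform(r_min, r_max)
        level = p1_dbm - eta * math.log10(r)
        measured = level + rng.gauss(0.0, noise_db)
        # Clamp so the measurements stay bounded, as Hoeffding's inequality requires.
        measured = min(max(measured, floor_dbm), ceil_dbm)
        samples.append((r, measured))
    return samples

if __name__ == "__main__":
    for r, level in simulate_measurements(5):
        print(f"R = {r:6.2f} mi  measured = {level:8.2f} dBm")
```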

In evaluating equation (25) for this example set of measurements 302, ε=8 dB, shown as range 304, is an error bound commonly accepted for drive tests and model optimizations. Parameters a and b are −40 dBm and −125 dBm, respectively, as described above. A choice of η=34 dB/decade is consistent with the choice of RSL=r^−3.4 above. Typically, measurement locations are uniformly distributed; thus, γ=(x_max−x_min)/2r. The value for β is chosen to minimize δ, given all the other parameters. Analysis can then be made of the value of δ for different n, in particular of what value of n is required to give acceptable confidence (e.g., 95%) for realistic values of the other parameters.

A “blanket” may then be created that bounds the interval of ±ε around the function's known value. Equation (25) bounds the probability that the worst error will exceed ε; it does not bound the number of errors. In the present example of FIGS. 3A–3C, there is a uniform sampling of all possible measurement locations, and equation (25) places no restrictions on the locations. Equation (25) gives an analysis of the fraction of trials in which the worst measurement is in error by ε or more. FIG. 3B illustrates an upper bound 305 and a lower bound 306 forming a blanket of thickness ε. The analysis represented by FIGS. 3A and 3B uses n=50. For 50 measurements, equation (25) gives δ=39,398; because δ bounds a probability, a value greater than 1 conveys no information and is not a useful bound. As a result, the reconstructed (or estimated) signal 303 does not reasonably resemble underlying signal 301. A useful confidence therefore requires a larger number of measurements. For δ=0.05 (95% confidence), for example, a system may need n on the order of 530,000 measurements. An analysis of the present example using n=530,000 is shown in FIG. 3C.

The preceding methods are not limited to utilization in cellular telephone networks, but rather may be used in any system requiring the estimation of an underlying phenomenon. FIG. 4 is an illustration of a computer network management system. In network 400, an agent can be launched that piggy-backs itself on a randomly-chosen outbound message, and arrives at whichever node 402 is the destination. Once there, the agent program can measure a designated message queue parameter (average length, longest, shortest, or other parameter). The agent program then reports this information, along with its location, back to host node 401. For this simple example, location may be the number of hops from host node 401. To reconstruct the queue length at an arbitrarily chosen node, a designer may consider an interval, r hops in radius, centered at the chosen node, and average all the queue length measurements lying inside the interval. The resulting Confidence Blanket, calculated by the above method, would bound the probability that the reconstruction differs from the true value by more than ε.
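The averaging step for the network example can be sketched as below. The function name and the representation of agent reports as (hops, queue length) pairs are hypothetical, chosen only for illustration.

```python
from typing import Iterable, Optional, Tuple

def reconstruct_queue_length(reports: Iterable[Tuple[int, float]],
                             node_hops: int, r: int) -> Optional[float]:
    """Average the queue lengths reported within r hops of the chosen node.

    Each report is a (hops from host node, measured queue length) pair.
    Returns None when no report falls inside the interval.
    """
    nearby = [length for hops, length in reports if abs(hops - node_hops) <= r]
    if not nearby:
        return None
    return sum(nearby) / len(nearby)

if __name__ == "__main__":
    # Hypothetical agent reports: (hops, queue length)
    reports = [(1, 4.0), (2, 6.0), (3, 8.0), (7, 20.0)]
    print(reconstruct_queue_length(reports, node_hops=2, r=1))
```

In the usage lines, the reports at 1, 2, and 3 hops fall inside the interval around the node at 2 hops, and the report at 7 hops is excluded.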

FIG. 5 is an illustration of a system for determining the density of cancer cells in accordance with an embodiment of the present invention. Microscope slide 500 contains a cross-section 501 of tissue. To determine how the density of cancer cells varies across the slide (e.g. away from the center of a tumor), the microscope, set to high magnification, is moved to a randomly-chosen location, and cancer cells 502 in the field of view are counted. The resulting number is one measurement. If this procedure is repeated over a number of randomly selected points, the density can be reconstructed. On a uniform grid, a designer may consider a circle of radius r centered at each grid coordinate, and average all the measurements inside the circle. The Confidence Blanket computation would apply as before.

FIG. 6 is an illustration of a system for determining the air temperature at a distance from the Earth's surface. System 600 determines the air temperature of a band 602 that is 5 km above Earth's surface 601. By taking sample measurements from weather balloons, airplanes, or the like, data points can be acquired at specific longitude, latitude, and altitude. To reconstruct the temperature on a uniform grid (or along a flight path or other useful construct), the system can consider a sphere of radius r centered at each position of interest, and average all the measurements falling inside it. The Confidence Blanket computation would apply as before.
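The circle and sphere constructions used in the tissue and atmospheric examples are the same local average in two and three dimensions. A minimal sketch follows, assuming that positions are already expressed in Cartesian coordinates with a common unit (longitude, latitude, and altitude would first need to be converted); the function names and sample data are hypothetical.

```python
import math
from typing import Dict, Iterable, List, Optional, Sequence, Tuple

Point = Tuple[float, ...]

def local_average(samples: Sequence[Tuple[Point, float]],
                  center: Point, r: float) -> Optional[float]:
    """Average the measurements whose position lies within distance r of center.

    Works for any dimension: an interval in 1-D, a circle in 2-D, a sphere in 3-D.
    Returns None when the ball contains no measurement.
    """
    inside = [value for position, value in samples
              if math.dist(position, center) <= r]
    if not inside:
        return None
    return sum(inside) / len(inside)

def reconstruct_on_grid(samples: Sequence[Tuple[Point, float]],
                        grid: Iterable[Point],
                        r: float) -> Dict[Point, Optional[float]]:
    """Apply the local average at every grid position of interest."""
    return {point: local_average(samples, point, r) for point in grid}

if __name__ == "__main__":
    # Hypothetical 2-D cell counts at randomly chosen slide locations.
    counts: List[Tuple[Point, float]] = [((0.0, 0.0), 2.0), ((0.5, 0.0), 4.0),
                                         ((3.0, 3.0), 10.0)]
    print(reconstruct_on_grid(counts, [(0.0, 0.0), (3.0, 3.0), (9.0, 9.0)], r=1.0))
```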

When implemented via computer-executable instructions, various elements of embodiments of the present invention are in essence the software code defining the operations of such various elements. The executable instructions or software code may be obtained from a readable medium (e.g., a hard drive media, optical media, EPROM, EEPROM, tape media, cartridge media, flash memory, ROM, memory stick, and/or the like) or communicated via a data signal from a communication medium (e.g., the Internet). In fact, readable media can include any medium that can store or transfer information.

FIG. 7 illustrates an example computer system 700 adapted according to embodiments of the present invention. That is, computer system 700 comprises an example system on which embodiments of the present invention may be implemented. Central processing unit (CPU) 701 is coupled to system bus 702. CPU 701 may be any general purpose CPU. However, the present invention is not restricted by the architecture of CPU 701 as long as CPU 701 supports the inventive operations as described herein. CPU 701 may execute the various logical instructions according to embodiments of the present invention. For example, CPU 701 may execute machine-level instructions according to the exemplary calculations described in conjunction with the Confidence Blankets.

Computer system 700 also preferably includes random access memory (RAM) 703, which may be SRAM, DRAM, SDRAM, or the like. Computer system 700 preferably includes read-only memory (ROM) 704 which may be PROM, EPROM, EEPROM, or the like. RAM 703 and ROM 704 hold user and system data and programs, as is well known in the art.

Computer system 700 also preferably includes input/output (I/O) adapter 705, communications adapter 711, user interface adapter 708, and display adapter 709. I/O adapter 705, user interface adapter 708, and/or communications adapter 711 may, in certain embodiments, enable a user to interact with computer system 700 in order to input information, such as parameters necessary for the calculation of confidence levels.

I/O adapter 705 preferably connects storage device(s) 706, such as one or more of a hard drive, compact disc (CD) drive, floppy disk drive, tape drive, etc., to computer system 700. The storage devices may be utilized when RAM 703 is insufficient for the memory requirements associated with storing data for the appropriate calculations or the retention of phenomena measurements. Communications adapter 711 is preferably adapted to couple computer system 700 to network 712. User interface adapter 708 couples user input devices, such as keyboard 713, pointing device 707, and microphone 714, and/or output devices, such as speaker(s) 715, to computer system 700. Display adapter 709 is driven by CPU 701 to control the display on display device 710 to, for example, display a user interface of embodiments of the present invention.

It shall be appreciated that the present invention is not limited to the architecture of system 700. For example, any suitable processor-based device may be utilized, including without limitation personal computers, laptop computers, computer workstations, and multi-processor servers. Moreover, embodiments of the present invention may be implemented on application specific integrated circuits (ASICs) or very large scale integrated (VLSI) circuits. In fact, persons of ordinary skill in the art may utilize any number of suitable structures capable of executing logical operations according to the embodiments of the present invention.

Although the present invention and its advantages have been described in detail, it should be understood that various changes, substitutions and alterations can be made herein without departing from the invention as defined by the appended claims. Moreover, the scope of the present application is not intended to be limited to the particular embodiments of the process, machine, manufacture, composition of matter, means, methods and steps described in the specification. As one will readily appreciate from the disclosure, processes, machines, manufacture, compositions of matter, means, methods, or steps, presently existing or later to be developed that perform substantially the same function or achieve substantially the same result as the corresponding embodiments described herein may be utilized. Accordingly, the appended claims are intended to include within their scope such processes, machines, manufacture, compositions of matter, means, methods, or steps.

