The invention relates to methods and algorithms that can be executed by a software-computer system.
Binding assays probing the molecular interactions between receptor-ligand or probe-target pairs often involve non-uniformities in the receptor or probe population. For example, “spotting” produces highly uneven distributions as well as widely varying configurations of receptors and probes that affect diagnostic and analytical uses of “spotted arrays” of proteins and oligonucleotides (as outlined in “Multianalyte Molecular Analysis Using Application-Specific Random Particle Arrays,” U.S. application Ser. No. 10/204,799, filed on Aug. 23, 2002; WO 01/98765). Arrays of oligonucleotides or proteins also can be formed by displaying these capture moieties on chemically encoded microparticles (“beads”) which are then assembled into planar arrays composed of a set of such encoded functionalized carriers, preferably in accordance with the READ™ format (discussed in US Patent Application “Multianalyte Molecular Analysis Using Application-Specific Random Particle Arrays,” supra). Microparticle arrays displaying proteins or oligonucleotides of interest, also referred to herein for convenience collectively as receptors, can be produced by light-controlled electrokinetic assembly near semiconductor surfaces (see U.S. Pat. Nos. 6,468,811, and 6,514,771) or by a direct disposition assembly method (previously described in Provisional Application Ser. No. 60/343,621, filed Dec. 28, 2001 and in U.S. application Ser. No. 10/192,352, filed Jul. 9, 2002).
To perform protein or nucleic acid analysis, such encoded carrier arrays are placed in contact with samples anticipated to contain protein ligands or target polynucleotides of interest. Capture of ligand or target to particular capture agents displayed on carriers of corresponding type as identified by a color code produces an optical signature such as a fluorescence signal, either directly or indirectly by way of subsequent decoration, in accordance with one of several known methods (See U.S. patent application Ser. No. 10/271,602 “Multiplexed Analysis of Polymorphic Loci by Concurrent Interrogation and Enzyme-Mediated Detection,” filed Oct. 15, 2002, and Ser. No. 10/204,799 supra.), and resulting images are analyzed to extract assay signal intensity distributions. The identities of capture agents including protein receptors or oligonucleotide probes generating a positive assay signal can be determined by decoding carriers within the array.
Solid phase carriers including microparticles functionalized with either proteins or oligonucleotides can display chemical or physical heterogeneity, because, once immobilized, these molecules can adopt varying orientations or configurations so that receptor-ligand affinities display corresponding non-uniformities or heterogeneities. Further, the number of (viable or accessible) receptors per unit surface area of carrier or the number of receptors per (entire) carrier (“receptor-coverage”) can be non-uniform across a population of nominally identical carriers, for example as a result of non-uniformities or fluctuations affecting the process of receptor immobilization. These effects can introduce significant variations in assay signal intensities recorded from a population of such nominally identical, but chemically heterogeneous carriers, as described herein in several examples of receptor-ligand binding. The corresponding signal intensity distribution, also referred to herein as an intensity histogram, typically will display a significant variance and/or skew, and limit the level of statistical confidence attained in the assay.
The effect of heterogeneity on signal intensities can be seen, for example, in solid phase screening assay designed to detect allo-antibodies (“Panel-Reactive Antibodies,” or PRA) by using extracts or membrane fragments containing Class I and II antigens immobilized on encoded microparticles (see U.S. Pat. Nos. 5,948,627 and 6,150,122).
One aspect of heterogeneity is that arising from a distribution of affinity constants governing the receptor-ligand interaction of interest, as previously discussed (See J. V. Selinger and S. Y. Rabbany, Anal. Chem 69, 170-174 (1997); D. Lancet, E. Sadovsky, and E. Seidemann, Proc. Natl. Acad. Sci. Vol. 90, pp. 3715-3719, April 1993.) The effect of the participation of multiple receptors in ligand-binding also has been discussed (see P. J. Munson and D. Rodbard, Anal. Biochem. 107, pp. 220-239 (1980); E. M. Melikhova, I. N. Kurochkin, S. V. Zaitsev, and S. D. Varfolomeev, Anal. Biochem 175, pp. 507-515 (1988); A. K. Thakur, M. L. Jaffe, and D. Rodbard, Anal. Biochem 107, 279-295 (1980)).
However, no method has been suggested or developed to characterize a heterogeneous population in terms of a systematic parameterization based on a multiplexed “test” or “reference” assay, designed to probe that heterogeneity using suitable reference reagents such as monoclonal antibody ligands or synthetic DNA targets, and no method of analysis has been disclosed to design and apply filters in order to “sharpen” assay signal distributions, also referred to herein as histograms, to enhance the level of confidence in the assay results. A method is clearly desirable to transform assay signal histograms in a controlled and systematic way so as to improve the confidence associated with estimates of histogram moments, notably mean and variance.
Disclosed is a method for increasing the degree of confidence in receptor-ligand and probe-target interaction assays involving ligand or target capture by a heterogeneous population of receptors or probes displayed on solid phase carriers. The method comprises a process of establishing a weight function reflecting the heterogeneity in the receptor population. The weight function is applied to “filter” the assay signal distribution (“histogram”) produced in an assay involving the interaction of immobilized receptors with ligands in an actual clinical sample in order to “sharpen” the assay signal intensity distribution.
To establish a weight function, a reference assay is performed using a solution of well-defined ligand or target “test” molecules, which are permitted to interact with carrier-displayed receptors. The resulting signal intensity distribution is divided into segments representing subpopulations composed of equal numbers of carriers, and the corresponding intensity distributions are analyzed in terms of a theoretical model in order to characterize heterogeneity in the receptor population manifesting itself in the interaction of the two or more types (or states) of receptors with the test ligand; specifically, the model yields the relative density (“coverage”) of two or more distinct types (or states) of receptors within each carrier subpopulation. For the special case of a two-state model, the fractional coverage of high affinity receptors as a function of segment index represents a “heterogeneity descriptor” of the invention, also referred to herein as an η-plot. Provided that the sets of carriers used in the reference assay and in the sample assay both comprise a representative sample of the same carrier reservoir (“parent population”), the nature of receptor heterogeneity manifesting itself in the reference assay, and more specifically the η-plot, will be identical to that manifesting itself in the sample assay. Further, differences in the η-plots for the reference assay and an assay performed on an actual clinical sample provides a method of the invention of detecting heterogeneity in the ligand population. The η-plot forms the basis for the construction of a weight function by application of a mapping, as described herein.
The receptor-ligand pairs suitable for use with this method can be either protein-protein interaction pairs, including antigen-antibody and enzyme-substrate pairs, or complementary oligonucleotide pairs. The carriers can be microparticles as disclosed in U.S. application Ser. No. 10/204,799, filed on Aug. 23, 2002, and entitled: “Multianalyte Molecular Analysis Using Application-Specific Random Particle Arrays.” The methods of the present invention also can be applied to transform signals produced by other formats of solid phase molecular analysis, including signals from flow cytometry (R. Pei, G. Wang, C. Tarsitani, S. Rojo, T. Chen, S. Takemura, A. Liu, and J. Lee, Human Immun. 59, pp. 313-322 (1998)) as well as signals recorded from “spotted” arrays. The methods also can be applied to the transformation of assay signals other than optical signatures, for example, to radiation counts recorded from radioactively labeled target molecules.
The method of the present invention proceeds in several steps, the entire sequence of which is applied in turn to each carrier type. First, the distribution of mean intensities of a given type is ranked by mean intensity values and then divided into S equal subpopulations. That is, the intensity histogram, {Ip, Np}, p counting histogram bins, is transformed into a representation of the form {k, N(k)}, k ε[1, S], wherein the occupancies N(k) are (initially) identical. Each subpopulation is then analyzed in terms of a two-state model, defined in terms of {KL, RL} and {KH, RH}, as well as a mixing coefficient, η; here, KL and KH denote the affinity constants governing the interaction of ligands or targets with receptors or probes of two types (or receptors in two states), these being present on solid phase carriers at respective numbers per carrier of RL and RH. These parameters are obtained by a two-pass regression analysis of adsorption isotherms for individual subpopulations. First, {KL(k), RL(k)} and {KH(k), RH(k)} are determined for individual subpopulations. Next, average values, KL=<KL(k)> and KH=<KH(k)> are calculated, and final values for the coverage parameters, RL(k) and RH(k) are obtained by again fitting adsorption isotherms for each subpopulation to the two-state model, this time keeping KH and KL fixed. As discussed herein, the total “coverage”, R(k)=RL(k)+RH(k), serves as a subpopulation index. To characterize the heterogeneity within the carrier population, manifesting itself in the presence of at least two types (or two distinct configurations) of receptors, one defines a parameter, η, representing the fraction of “high affinity” receptors, η=RH/R; generally, as described herein, η(k) will vary with R(k). The function η(R) forms the basis for construction of a weight function, {wk, 1≦k≦S} which may assume several functional forms, as disclosed herein. This model produces a systematic parameterization to control the subsequent process of applying the weight function in order to “filter” the signal intensity distribution recorded in the analysis of actual samples.
To initiate the filtering operation, the intensity distribution produced in the sample assay, having been sorted by intensity, is first transformed by segmentation into S equal subpopulations, and these are mapped to corresponding subpopulations of the reference intensity histogram. Once established, this one-to-one correspondence permits the assignment of a unique weight, wk, to the k-th sample histogram segment. That is, the actual filtering operation entails the multiplication of the set of occupancies, {Nk; 1≦k ≦S} by the weight function {wk, 1≦k ≦S}. Corrected estimates of mean and variance associated with the weighted (“filtered”) subpopulations are computed in the manner described herein. To facilitate the direct comparison to the original histogram to assess the qualitative transformation of the shape of the original histogram, the filtered occupancy representation can be transformed back into an altered histogram using the procedure described herein.
The disclosed method also relates to the detection of non-uniformities in the ligand population. In principle, a reference assay would be performed using an ideal receptor to probe the heterogeneity of a representative sample of carriers. While this latter reference assay may not be available or practical in every circumstance, heterogeneity in the ligand population may be detected by analyzing the sample assay in the same way as the reference assay in order to derive a second parameterization in the form ηk=η(Rk), and comparing this to the parameterization derived from the reference assay. Any substantial difference in the parameterization of reference and sample assay manifesting itself in the construction of the respective η-plots is attributed to heterogeneity in the ligand population.
In one embodiment, signal intensity distributions are established for each type of carrier, each such type corresponding to one or more types of receptors or probes, by evaluating the mean intensity of all carriers of a given type. For this embodiment, the method averages over spatial non-uniformities occurring on individual carriers in a manner corresponding, for example, to flow cytometric analysis of microparticle-displayed binding complexes. The Random Encoded Array Detection (READ) format, a preferred embodiment of multianalyte molecular analysis, permits the extension of the methods of the present invention to smaller length scales, as discussed herein.
These methods are described further below with reference to the drawings.
1. Characterizing Heterogeneity in Receptor-Ligand Interactions
Interactions between ligands and their cognate receptors in a solid phase assay format frequently produce signals of considerable variance indicating chemical heterogeneity in the receptor or ligand populations. Thus, it has long been recognized in the art that the interaction between ligand proteins (used herein to include target nucleic acid) in solution and immobilized receptor proteins (used herein to include probe nucleic acids) will be affected by the physical-chemical characteristics of the solid phase (Hermanson, G. T. “Bioconjugate Techniques” Academic Press, San Diego, Calif., 1996; J. E. Butler “Solid Phases in Immunoassay” pp 205-225 in Immunoassay, Eds. Diamandis, E. P., Christopoulos, T. K. Academic Press, San Diego, Calif., 1996). For example, when receptors are immobilized on nylon membranes or other planar substrates, local variations in surface topography or surface chemical characteristics including the lateral density of “spotted” receptors will affect the assay signal intensity. When receptors are immobilized on microparticles (“beads”), the physical-chemical characteristics of a population of such carriers and their effects on the receptor population will affect the assay signal distribution produced by that receptor population. Immobilized receptors, as in the case of proteins may exist in multiple configurations of varying viability in which epitopes are occluded or denatured or, as in the case of nucleic acid probes, may assume a wide variety of configurations determined by random adsorption to the solid phase, limiting the accessibility of cognate subsequences.
This phenomenon is illustrated here by the example of an allo-antibody (also referred to as “Panel-Reactive Antibody”, or PRA) profiling assay performed using extracts of fragments of cellular membranes affixed to encoded microparticles (“beads”), each bead type representing a small number (“pool”) of human leukocyte antigens (HLA) (see
In contrast, as illustrated here (
However, the asymmetric shape and significant variance of the signal intensity histograms, and the fact that both features increase with increasing ligand concentration, is not accounted for by the usual statistical fluctuations of physical-chemical characteristics of the carrier population, pointing instead to heterogeneities in the receptor and/or ligand populations. The READ-PRA assay provides direct evidence of heterogeneity in the form of images recorded from arrays of beads in the saturation regime of the titration curves in which ligands occupy all available receptors. For example, assay images (not shown) of the fluorescence signals produced by decoration (using a Cy5 dye-labeled secondary antibodies) of hAb-II captured to bead-displayed class II antigens reveal spatial non-uniformities in the signal intensity across individual beads.
These observations suggest that a likely explanation for the observed characteristics of the signal intensity distribution lies in the heterogeneity of the population of immobilized receptors.
It will be desirable to compensate for such heterogeneity in a systematic manner, and the present invention discloses, for this purpose, a systematic procedure that identifies populations of low-affinity and high-affinity receptors within carrier subpopulations and places relatively lower weight on the signal contribution from low-affinity receptors (or carriers displaying them), and higher weight on high-affinity receptors (or carriers displaying them), thereby producing a “filtered” distribution. The invention thus provides, first, a method of characterizing the heterogeneity in receptor-ligand (used herein to include nucleic acid probe-target) interactions manifesting itself in characteristic fashion, as illustrated above, in the shape of signal intensity distributions, and further, provides a method of “sharpening” such distributions in order to enhance the reliability of the assay results.
1-1. Characterizing Heterogeneity in Receptor Population
The characterization method of the invention employs a homogeneous population of well-defined ligand “test” molecules to probe a heterogeneous receptor population displayed on microparticles or other solid phase carriers. In this “reference assay,” an adsorption isotherm is obtained by varying the test ligand concentration and recording the effect of this variation on the signal intensity distribution. The adsorption isotherm is analyzed in terms of a two-state model, described herein below, to extract binding parameters, namely, affinity constants and values of receptor coverage, to parameterize the heterogeneity of the receptor population. This process is akin to that of determining the point spread function of an optical instrument, or more generally, the transfer function of a linear system. As with the finite resolution of linear systems, the heterogeneity of a receptor population manifests itself, in the form of a convolution, in any measurement performed with that receptor population.
Specifically, to characterize the heterogeneity underlying the results described in connection with
While this conclusion is confirmed by regression analysis of the adsorption isotherms in terms of the two-state model of the invention which yields the fits shown as solid lines in
1-2 Parameterization and Analysis of Assay Intensity Signal Distributions
Introduced is a method of parametrizing the heterogeneity of a receptor population, manifesting itself in the form of at least two branches in the adsorption isotherms, for example in the Scatchard (or analogous) representation and indicating the existence of receptors of differing affinity and differing values of receptor coverage within subpopulations of nominally identical solid phase carriers such as microparticles. This method of the invention of characterizing the heterogeneity in a receptor population proceeds by dividing the signal intensity histogram for an entire carrier population (of given type), sorted by mean intensity values, into S equal subpopulations, and establishing the form of the dependence of ηk≡RHk/Rk on Rk, 1≦k≦S. This parameterization serves as a quantitative “descriptor” of a carrier population displaying receptor(s) of interest, and also forms the basis for a filtering operation, described in detail in Sect. 2.
1-2-1. Scatchard Analysis: Multiple Receptor Populations
To establish the functional dependence of ηk on the parameter Rk, intensities recorded in the reference assay (
Since adsorption isotherms for the different carrier subpopulations display similar slopes for asymptotes of high-affinity and low-affinity regimes and do not cross, as evident in
In practice, only intensity values are sorted and segmented, and values of receptor coverage are obtained by regression analysis to confirm same ordering. As illustrated in
1-2-2. Theoretical Description
The theoretical description, while presented here for the case of averaging carrier-internal non-uniformities to produce distributions of carrier mean intensities, is readily extended to smaller (“sub-carrier”) length scales, as discussed herein below.
When two receptor types with differing affinity constants participate in the binding interaction, or, when a single receptor type exists in two discrete affinity states, this model is termed herein a “two-receptor model,” or a “two-state model”. The following notations apply to a two-receptor/state model:
If left-hand-side is then normalized to KHηiH+KL(1−ηiH), the effective affinity at infinite dilution, both x- and y- intercepts converge to 1. However, the bulk of the curves differ from each other unless KH and KL are equal, and their curvature and transition depend on the relative magnitude of KH and KL at varying η.
1-2-3. Application of the Theoretical Description: Two-State Model
The theoretical model permits the analysis of adsorption isotherms in terms of any number of populations of receptors. However, the two-state model captures the salient features of the data described herein, thereby providing a firm theoretical basis for the analysis in terms of the straight-line asymptotes of the Scatchard plots constructed from the mAb-I and mAb-II reference assays described in connection with FIGS. 7 and 8. This is demonstrated by performing a regression analysis of the same data in terms of the two-state model, yielding the fits shown as solid lines in those figures.
The theoretical model thus provides a useful parameterization of receptor heterogeneity in terms of affinities and values of receptor coverage. As properties of the carrier population change over time, such changes can be tracked by repeatedly performing a reference assay as a quality control measure providing updated parameters.
Crossover Locus
The transition (or “crossover”) separating high-affinity and low-affinity regimes in each of the subpopulations of increasingly higher coverage, apparent in
In the following discussion, the superscript i is dropped for convenience for variables, RHi, RLi, Ri, and for the auxiliary variables
In terms of variables, z and B, expressions for the asymptotes have the form:
z=zH−{tilde over (K)}HB and z=−{tilde over (K)}L(B-BL),
where
The solution of the interception point, (Bc, zc), as an function of ηi and Clroi, is:
The crossover locus, constructed from these expression using the values of KH, KL, RHi and RLi produced by regression analysis in terms of the two-state model (
Shape of Distribution
Adsorption isotherms of carrier subpopulations derived from a reference assay such as those described in connection with
In the Langmuir regime of the adsorption isotherm, i.e., when the decrease in the initial solution concentration of ligand due to receptor binding is negligible, one has Cl0≦≦Cr0, and Cl≡Cl0i. Then, {ξiCr0i} denotes the intensity distribution, or the “intensity histogram”, at any given ligand concentration, C10. Only if ηi, the ratio of high affinity to low-affinity receptors, is identical for all values of receptor coverage, will ξi not depend on Croi (or i), and will {ξiCr0i}, the intensity histogram, display similar shapes at different ligand concentrations. However, as now shown, if ηi is Cr0i (or i)-dependent, the intensity histogram will change shape as ligand concentration increases.
To determine how the shape of the distribution evolves, one invokes the expressions given above for the occupancy fraction and the crossover locus in the two-state model.
The function η(R) (“η-plot”), representing the heterogeneity characteristics intrinsic to the parent carrier reservoir, relates to receptor coverage only, that is, it neither depends on affinity constants nor on ligand concentration. An evolution of the shape of the intensity distribution with varying ligand concentration in accordance with the predictions of
Choice of S: Carrier Redundancy vs. Segmentation Frequency
To reliably characterize the heterogeneity in a given receptor population by application of the segmentation method of the invention, it generally will be preferable to employ a large number of carriers. According to the statistical theory of sampling distributions, the error in the estimates of the sample mean (and other moments) and the associated confidence interval(s), are related to sample size. The minimal number of carriers required should be sufficiently large so as to ensure that a specified confidence interval placed upon the estimate of the mean contains the actual population mean. As S increases, the number of carriers per subpopulation decreases, and the uncertainty associated with computed subpopulation mean values correspondingly increases: when standard deviations of those estimates exceed the difference in mean values of adjacent segments, the segmentation process is no longer a well defined one.
Related to this condition is the requirement that η-plots of adjacent segments must not intersect, in order to ensure the same ordering of Rk and segment indices, k, and ensuring a one-to-one correspondence between segment indices for different histograms, as discussed above. The effective affinities, {tilde over (K)}L and {tilde over (K)}L, computed using the parameters produced by regression analysis of the data in
Thus, given an initial choice of carriers of a specific type, the value S of segments is thus limited by a certain maximal value, Smax, corresponding to a minimal number of carriers (of given type) in each segment, permitting the evaluation of carrier subpopulation mean values to be within a desirable statistical confidence. In practice, it will be desirable to have a greater total number of carriers than the minimum discussed above. Preferably, in the READ embodiment of the invention, each subpopulation will contain at least 20 carriers of each type, more preferably at least 50 carriers of each type.
The resolution attained by ηk(Rk) as an approximation to the underlying continuous function, η(R), illustrated in
2. Filtering Sample Intensity Histograms
The method of filtering the signal intensity histogram produced in a sample assay by application of the steps is summarized in
Underlying this filtering method is the assumption that all samples of receptor-functionalized carriers taken from the same reservoir (“parent population”) of carriers display the same degree of heterogeneity. Accordingly, the sample assay histogram is assumed to reflect the degree of heterogeneity previously characterized in the reference assay and parameterized in terms of the two-state (or generally n-state) model of the present invention. That is, the same mapping and weight functions apply to the transformation of any distribution of signal intensities produced by subsets of carriers extracted from such a parent population. A generalization of this method applying to the detection of heterogeneity in the ligand population is described below.
2-1 Preprocessing
Reference and sample assays are performed, not necessarily at the same time, and not necessarily in that order. In the preferred READ format, images are recorded from the array showing the pattern of ligand binding to carrier-immobilized receptors as a function of ligand concentration. Preprocessing steps are applied to the images as necessary using conventional methods to remove spatial variations in intensity reflecting, for example, non-uniformities in the illumination intensity profile (“flat-fielding”) and fluctuations reflecting “cross-talk” between adjacent beads and geometric effects reflecting, for example, the spherical shape of beads. Intensities recorded are then sorted in ascending order to form a 1-D array.
An optional anomaly (outlier)-removal step is then applied to this array. First, intensity differences are calculated for each pair of adjacent elements in the presorted intensity array to form an array of differences. The average difference is obtained and a threshold value set by multiplying the average difference by an adjustable factor, say 5. Next, in the array of differences, starting from elements whose positions correspond to preset minimal and maximal percentiles, say 20% and 80%, of the intensity array, one searches for the first element that exceeds the threshold. If found, the position of the element is marked as a new boundary. If not found, the end of array is the new boundary. In the intensity array, all elements falling outside the new boundaries are rejected as outliers.
2-2 Filtering
2-2-1 Histogram Segmentation
To transform the sample assay signal, one first segments the preprocessed intensity distribution into S subpopulations of equal area (
This segmentation step corresponds to a transformation of the assay intensity distribution from an intensity representation (Ib, Nb), the index b counting histogram bins, into a subpopulation representation, (k, Nk), in which the abscissa represents segment indices and the ordinate represents the area (or subtotal of carriers) of each segment. By construction, segment areas are initially identical, that is, Nl=. . . =Nk=. . . =Ns. Each of the S subpopulations in the segmented sample histogram is matched to a corresponding subpopulation in the segmented reference histogram.
2-2-2 Filter Construction: Weight Function
Central to the filtering operation is a weight function that is associated with the heterogeneity of the parent population characterized with the reference assay. Defined herein is a weight function as a set of S coefficients, {wk, 1≦k≦S}, to be multiplied to the subpopulation occupancies in order to emphasize contributions from the “high affinity” portion of the receptor population. The weight function w(k) is derived from w(Rk), which represents a “stretched” version of function ηk=η(Rk). That is, as with histogram “stretching”, familiar in the context of image analysis, the η axis is “stretched” in order to extend the range of the function η(R), generally by employing a monotonic mapping that assigns a value of wk to each segment index, k.
Many functional forms of the mapping η→w may be considered, in a manner analogous to pixel histogram transformation “stretching” familiar in the context of image analysis (Seul, O'Gorman & Sammon, “Practical Algorithms for Image Analysis”, Cambridge University Press, 2000). Examples include:
1. A step function placing full weight to segments, k≧kThreshold, and assigning a weight of zero to other segments;
2. A monotonically increasing function, such as a straight line with an offset, or a polynomial function.
2-3 Transformation and Calculation of Moments
The next step is to apply the weight coefficients to the sample assay signal intensity distribution; the transformed segment occupancy is thus the product of the area of the segment and its corresponding weight. A set of moments of the transformed histograms can be computed using the expressions described below.
Assuming the mean intensity, carrier count, and variance of those S subpopulations are ({overscore (x)}1,n1,σ1), ({overscore (x)}2,n2,σ2) . . . ({overscore (x)}s,ns,σs), the new moments, ({overscore (x)}, n, σ), associated with the above transformed subpopulations, each being assigned a weight {wk1≦k≦S}, are:
Following the completion of this step, the filtered subpopulation representation may be transformed back into the conventional representation showing a histogram of S bins (
2-4 Implementation
2-4-1 Flow Chart
A flow chart of the sequence of processing steps in this heterogeneity-filtering algorithm (“HetFilter”) is shown below.
For all dilution ratios, filtering substantially reduces the width of the original histogram, thereby revealing the “true” intensity and produces corrected estimates of the mean. Values for the mean and variance calculated before and after filtering (Table I) reveal that the transformation greatly improves discrimination without displaying significant sensitivity to the number, S, of segments employed in the parameterization.
The effect of the shape of the mapping η→w on the shape of the filtered histogram is illustrated in
2-5 General Case: Heterogeneity in Ligand Population
In the version of the filtering method described so far, one has assumed that heterogeneity observed in the signal intensity distributions produced in receptor-ligand interaction assays is dominated by heterogeneity in the receptor population. In accordance with this embodiment of the method, sample assay intensity histograms are filtered simply by applying the weight function derived from the parameterization of the corresponding reference assay.
More generally, as described below, the disclosed method also addresses the case in which non-uniformities in the ligand population contribute in significant measure to the overall heterogeneity, manifesting itself in the form of broad and skewed intensity histograms and, in the READ format, in spatial non-uniformities visible in assay images. The effect of heterogeneity in the ligand population on the signal intensity distribution may be determined in accordance with the methods disclosed herein by realizing that heterogeneity in the ligand population will be independent from the heterogeneity of any particular carrier-displayed receptor population. The regression analysis described in Sect. I may be repeated with a larger number of parameters to allow for the existence of more than a single constituent in the ligand population. That is, this method permits detection of heterogeneity in the ligand population. Further, the shape of the intensity distribution reflecting ligand heterogeneity may be estimated once the multiple relevant affinities have been estimated, as in the example of the 2×2 matrix of affinities discussed in greater detail below.
2-5-1 Extension of Theoretical Description to General Case
Assuming the presence of two levels of heterogeneity, namely receptor coverage, indexed by i, and affinity constant, indexed by j, the theoretical description in Sect. 1-2-2 presented expressions for affinity constants in terms of mass action:
yielding, for each carrier of i-th coverage, the number of receptor-ligand complexes in the form:
The general case considers the effect of heterogeneity in the ligand population, associated with different affinity constants reflecting, for example, the presence of a mixture of ligands (as in a serum containing polyclonal antibodies with different prevalent epitopes or the partial deterioration of some ligands. It is important to note that the heterogeneity in the ligand population is independent of that in the receptor population. Indexing the different subpopulations by q, the law of mass action produces a modified expression for the affinity constants in terms of mass action:
Before proceeding to compute the ratios, defined are a separate mixing coefficient ηq, indexed by q:
Here, a subscript is used to distinguish it from ηij.ηq does not involve indices, i and j; note, that because of the independence of the heterogeneity in the receptor population and that in the ligand population:
Hence, the above equation becomes:
The analogy of the special case of the 2-state-model described for receptor heterogeneity involving high-affinity and low-affinity states in the receptor population now involves “high-efficiency” and “low-efficiency” states in the-ligand population. For this 2×2 state, one can express the ratio of total number of occupied sites (or the histogram bin height) and ligand concentration, for the i-th subpopulation, in the form:
where ηH is the high-affinity fraction in terms of receptor coverage as in two-state (receptor) model. ηH is the fraction of “high-efficiency” ligand. Affinity constants thus correspond to the following interactions suggested by the binding and cross-binding activities among four identities, where R represents receptors and L represents ligands. The above equation, mathematically, is equivalent to a reaction between a homogeneous ligand population and a heterogeneous four-receptor population.
The set of affinity constants corresponding to this configuration is equivalent to the co-affinity matrix disclosed in “Multianalyte Molecular Analysis Using Application-Specific Random Particle Arrays,” U.S. Application Ser. No. 10/204,799, filed on Aug. 23, 2002; WO 01/98765, although the present invention relates to the presence of multiple types (or states) of receptors on a single type of carrier. The special case of a two-state model for receptors and ligands produces a 2×2 matrix,
In the special case of a homogeneous receptor population, then KLH=KHH and KLL=KHL. By combining terms with equal K's and dropping the subscripts, the above equation reduces to
This expression has the same form as that which describes the reference experiment in the two-state (receptor) model. One can thus characterize the heterogeneity in the ligand population in terms of a mixing coefficient, ηH, which may be determined in a reference experiment using an “ideal” test receptor to probe the heterogeneity in the ligand population.
Consider a more general case in which the heterogeneity in ligand is assumed to be continuous and is characterized by a density function θ(q), one has:
Note that θ(q) is independent of index i and can be factored out of the summation. In the special case of two distinct receptor types (or states) displayed on carriers, the above equation is thus reduced to
For any given q, if there is ηi,H is independent or monotonically increasing as a function of Croi, or even though ηi,H decreases as Croi increases but the product, (ηi,H Croi) is constant or monotonically increasing, there should exist the same ranking of the quantity, ƒ(i,q),
as that of the value of Croi, given KH>KL, that is ƒ(m,q)≦ƒ(n,q), for V m<n. In this case, since θ(q) is not i-dependent and hence the integrand has the same ranking as that of the value of Cr0i, the result of the integration should have the same ranking as that of the value of Cr0i.
The actual peaks recorded from interaction between human sera and carrier population displaying HLA Class-II antigens, as described in Example 2, are broader than those recorded from assays performed using monoclonal antibodies. Those features suggest an additional contribution to the intensity histogram shape arising from heterogeneity in the ligand population. Considering the fact that, in the reference experiment, ηi shows a monotonically increasing dependence on Ri, one concludes that, in the actual experiment involving hAb-II serum, regardless of the detailed form of heterogeneity present in ligand, as characterized by θ(q), the intensity corresponding to the i-th receptor coverage has the same ordering as that of Cr0i or that of the index, i. Thus, regardless of the additional heterogeneity present in the ligand population, application of the filtering method of the present invention in order to reduce the effect of receptor heterogeneity on the assay signal intensity histogram using the weight function constructed from the reference assay represents a well-defined process.
2-6 Extension: Pixel-Intensity Distribution
The method, as described so far, addresses transformations of the distributions of carrier mean intensities, Ij,k where j denotes the j-th carrier type and k denotes the k-th of S carrier subpopulations. In this form, the method applies to several assay formats, notably to READ™ and to flow cytometric analysis of sets of encoded microparticles
In the preferred embodiment, READ, spatial non-uniformities manifesting themselves on the “pixel” scale, that is, on a length scales smaller than the carrier size; for example, using typical configurations described in the Examples included herein, the image of a 3 μm-diameter bead comprises tens of pixels. In the experiments described herein with reference to Examples 1-3, spatial non-uniformities were in some instances optically resolvable. In such instances, it may be desirable to analyze signal intensities directly by forming signal intensity distributions based on averaging not over entire carriers such as beads but averaging over “pixel” elements, to obtain pixel mean intensities, Ipj,k.So long as our assumptions, especially the 4th assumption—namely, that receptor-ligand binding events are uncorrelated—remain valid, the methodology described herein also can be applied to the transformation of pixel intensity distributions to further improve assay fidelity and reliability. Such finer levels of resolution are not available in flow cytometric detection methods.
Examples of application of the pseudocode and examples that illustrate other features of making and using the methods are set forth herein below.
Reference Assay and Parameterization: Anti-HLA Monoclonal Antibody Titration
Pools of antigens were extracted from cells with known HLA antigens and affixed to color-encoded beads using methods of passive adsorption known in the art. Approximately 500 μg of the protein were incubated with 1 mg of the color-encoded beads in a coupling buffer containing 3 mM sodium chloride, 2 mM sodium phosphate, pH 3.0, and incubated overnight at 37° C. under constant rotation. Bovine serum albumin (BSA) was used as a negative control protein in the coupling reaction. After protein coupling, the particles were washed in phosphate-buffered saline (PBS; 0.1 M sodium phosphate, 0.15 M sodium chloride, pH 7.2) also containing 0.05% Tween-20 (PBST). For array assembly, all of the functionalized beads of interest were combined into a test tube and subjected to Light-controlled Electrokinetic Assembly of Particles near Surfaces (LEAPS) or methods of direct assembly as described (op.cit.). The array was decoded using fluorescence microscopy to detect the signals from the fluorescent coding dyes associated with beads of different types. Unreacted reactive sites were blocked using 1% BSA in PBST prior to conducing the assay.
To perform the reference assay, mouse monoclonal antibody specific for HLA Class I or Class II (also mAb-I and mAb-II) were serially diluted in PBST with 1% BSA, and 1-10% NP-40. Diluted solutions of known concentration of mAb were placed in contact with arrays of HLA-functionalized beads, and incubated in a small (home-built) humidified and temperature-controlled reaction chamber for 1 hour under modest shaking. Unbound antibodies were removed by intensive washing with PBST. The bound antigen-antibody complexes were detected using Cy5-labeled anti-mouse IgG Fab fragments. Titration data are displayed in
Actual (“Sample”) Assay and Application of Filtering: Allo-Antibody Profiling
Arrays of HLA-functionalized, color-coded beads were prepared, and unreacted sites blocked as in Example 1. Instead of mouse monoclonal antibodies, human polyclonal antisera specific for HLA Class I or Class II molecules, serially diluted in PBST with 1% BSA, and 1-10% NP-40, were used to establish titration curves. Increasingly dilute sera were placed in contact with arrays of HLA-functionalized beads, and incubated in a small (home-built) humidified and temperature-controlled reaction chamber for 1 hour under modest shaking. Unbound antibodies were removed by intensive washing with PBST. The bound antigen-antibody complexes were detected using Cy5-labeled goat anti-human fragment. Titration data are displayed in
Profiling of Anti-Jo1 Antibody
A titration curve was established using arrays of color-encoded beads displaying a set of autoantigens and a human serum sample known to contain anti-Jo-1 antibody (as well as possibly other antibodies). A 6-antigen panel was formed by attaching auto-antigens—Jo-1, SSA-60, Sm, Sm/RNP, CENP, and SSB—to core-shell beads by passive adsorption under conditions of low salt and low pH. Following a blocking step with buffer containing 0.1% BSA (w/v), beads were stored individually at 4° C. until the time of array assembly. To form arrays, beads displaying the 6 antigens were pooled and assembled on silicon chips.
A titration curve was obtained using the serially diluted anti-Jo-1 positive serum at dilution ratios of 1:3, 1:9, 1:27, 1:81, 1:243, 1:729, 1:2187 and 1:6561. Diluted sera were incubated with chips for 30 min at room temperature. After removing weakly bound antibodies, an Alexa-labeled goat anti-human IgG was used to visualize captured anti-Jo-1. After a second washing step, decoding and assay images were collected and the assay signals were then extracted and analyzed.
Analysis of Images of “Spotted” Probe Arrays
An image of a spotted array produced, for example, by raster scanning the fluorescence intensity distributions of constituent “spots”, the intensity distribution, I=I(x, y), across each spot, as illustrated in
A direct correspondence to the methods of parameterization and filtering method described herein is established by applying a random sampling step to the function I(x, y). For example, overlay the spot image with an opaque mask containing one or more “holes”permitting the sampling of the underlying spot intensity at the aperture position(s), and compute the mean of the sampled portion of I=I(x, y), also referred to as the aperture mean. This sampling operation preferably is performed in software using standard methods. For example, multiple single samples may be taken at randomly selected positions within the spot, optionally ensuring that sampled positions are separated by a pre-selected minimal distance (in order to avoid correlations between spatially proximal samples); alternatively, an entire set of such samples may be taken, and aperture mean values computed, by sampling in a random set of positions within the spot, as illustrated
It should be understood that the terms, expressions and examples used herein are exemplary only, and not limiting, and that the scope of the invention is defined only in the claims which follow, and includes all equivalents of the subject matter of the claims. Process and method steps in the claims can be carried out in any order, including the order set forth in the claims, unless otherwise specified in the claims.
This application claims priority to U.S. Provisional Application No. 60/515,332, filed Oct. 29, 2003, and to No. 60/544518, filed Feb. 14, 2004.
Number | Date | Country | |
---|---|---|---|
60515332 | Oct 2003 | US | |
60544518 | Feb 2004 | US |