Claims
- 1. A computer implemented method for calculating a normalization factor comprising:
providing a first intensity value (I(1)) of a probe in a first probe array and a second intensity value (I(2)) of said probe in a second probe array; obtaining the geometric mean (x) of said I(1) and said I(2); calculating said normalization factor according to: f(x)=eh(x), wherein said h(x) is derived from referential intensities from said first and second probe arrays.
- 2. The method of claim 1 wherein said h(x) is derived by relating geometric means (xi′) of first referential intensities (RIi(1)) in the first probe array and second referential intensities (RIi(2)) in the second probe array to:
- 3. The method of claim 2 wherein said relating comprising:
sorting (xi, yi) pairs according to xi into a plurality (m number) of bins with no overlapping; computing medians ({overscore (x)}k) of xi's and medians ({overscore (y)}k) of yi's for each bin; and interpolating said medians ({overscore (x)}k, {overscore (y)}k).
- 4. The method of claim 3 wherein said bins are of approximately equal size.
- 5. The method of claim 4 wherein said h(x) is:
- 6. The method of claim 5 wherein said m is 3.
- 7. A computer implemented method for comparing the expression of a gene in a first sample with a second sample comprising:
providing a first plurality of intensity values (Ii(1)), each of which reflects the expression of said gene in said first sample, wherein said intensity values are obtained from a first nucleic acid probe array; providing a second plurality of intensity values (Ii(2)), each of which reflects the expression of said gene in said second sample, wherein said intensity values are obtained from a second nucleic acid probe array; calculating a p-value using one-sided Wilcoxon's signed rank test, wherein the p-value is for a null hypothesis that median(f(x)Ii(2)−Ii(1))=0 and an alternative hypothesis that median((f(x)Ii(1)−Ii(2))>0, wherein said f(x) is a normalization factor; and indicating whether said transcript is present based upon said p-value.
- 8. The method of claim 7 further comprising a step of calculating normalization factor, said step comprising:
obtaining the geometric mean (x) of said Ii(1) and said Ii(2): calculating said normalization factor according to: f(x)=eh(x), wherein said h(x) is derived from referential intensities from said first and second probe arrays.
- 9. The method of claim 8 wherein said h(x) is derived by relating geometric means (xi′) of first referential intensities (RIi(1)) in said first probe array and said second referential intensities (RIi(2)) in said second probe array to:
- 10. The method of claim 9 wherein said relating comprising:
sorting (xi, yi) pairs according to xi into a plurality (m number) of bins with no overlapping; computing medians ({overscore (x)}k) of xi's and medians ({overscore (y)}k) of yi's for each bin; and interpolating said medians ({overscore (x)}k, {overscore (y)}k).
- 11. The method of claim 10 wherein said bins are of approximately equal size.
- 12. The method of claim 11 wherein said h(x) is:
- 13. The method of claim 12 wherein said m is 3.
- 14. A system for calculating a normalization factor comprising:
a processor; and a memory coupled with the processor, the memory storing a plurality of machine instructions that cause the processor to perform a plurality of logical steps when implemented by the processor, the logical steps comprising:
providing a first intensity value (I(1)) of a probe in a first probe array and a second intensity value (I(2)) of said probe in a second probe array; obtaining the geometric mean (x) of said I(1) and said I(2); calculating said normalization factor according to: f(x)=eh(x), wherein said h(x) is derived from referential intensities from said first and second probe arrays.
- 15. The system of claim 14 wherein said h(x) is derived by relating geometric means (xi′) of first referential intensities (RIi(1)) in the first probe array and second referential intensities (RIi(2)) in the second probe array to:
- 16. The system of claim 15 wherein said relating comprising:
sorting (xi, yi) pairs according to xi into a plurality (m number) of bins with no overlapping; computing medians ({overscore (x)}k) of xi's and medians ({overscore (y)}k) of yi's for each bin; and interpolating said medians ({overscore (x)}k, {overscore (y)}k).
- 17. The system of claim 16 wherein said bins are of approximately equal size.
- 18. The system of claim 17 wherein said h(x) is:
- 19. The system of claim 18 wherein said m is 3.
- 20. A system for comparing the expression of a gene in a first sample with a second sample comprising:
a processor; and a memory coupled with the processor, the memory storing a plurality of machine instructions that cause the processor to perform a plurality of logical steps when implemented by the processor, the logical steps comprising:
providing a first plurality of intensity values (Ii(1)), each of which reflects the expression of said gene in said first sample, wherein said intensity values are obtained from a first nucleic acid probe array; providing a second plurality of intensity values (Ii(2)), each of which reflects the expression of said gene in said second sample, wherein said intensity values are obtained from a second nucleic acid probe array; calculating a p-value using one-sided Wilcoxon's signed rank test, wherein the p-value is for a null hypothesis that median(f(x)Ii(2)−Ii(1))=0 and an alternative hypothesis that median((f(x)Ii(1)−Ii(2))>0, wherein said f(x) is a normalization factor; and indicating whether said transcript is present based upon said p-value.
- 21. The system of claim 20 further comprising a step of calculating normalization factor, said step comprising:
obtaining the geometric mean (x) of said Ii(1) and said Ii(2); calculating said normalization factor according to:
f(x)=eh(x), wherein said h(x) is derived from referential intensities from said first and second probe arrays.
- 22. The system of claim 21 wherein said h(x) is derived by relating geometric means (xi′) of first referential intensities (RIi(1)) in said first probe array and said second referential intensities (RIi(2)) in said second probe array to:
- 23. The system of claim 22 wherein said relating comprising:
sorting (xi, yi) pairs according to xi into a plurality (m number) of bins with no overlapping; computing medians ({overscore (x)}k) of xi's and medians ({overscore (y)}k) of yi's for each bin; and interpolating said medians ({overscore (x)}k, {overscore (y)}k).
- 24. The system of claim 23 wherein said bins are of approximately equal size.
- 25. The system of claim 24 wherein said h(x) is:
- 26. The system of claim 25 wherein said m is 3.
- 27. A computer software product for calculating a normalization factor comprising:
computer program code for providing a first intensity value (I(1)) of a probe in a first probe array and a second intensity value (I(2)) of said probe in a second probe array; computer program code for obtaining the geometric mean (x) of said I(1) and said I(2); computer program code for calculating said normalization factor according to:
f(x)=eh(x), wherein said h(x) is derived from referential intensities from said first and second probe arrays; and a computer readable medium for storing said codes.
- 28. The computer software product of claim 27 wherein said h(x) is derived by relating geometric means (xi′) of first referential intensities (RIi(1)) in the first probe array and second referential intensities (RIi(2)) in the second probe array to:
- 29. The computer software product of claim 28 wherein said code for relating comprising:
computer program code for sorting (xi, yi) pairs according to xi into a plurality (m number) of bins with no overlapping; computer program code for computing medians ({overscore (x)}k) of xi's and medians ({overscore (y)}k) of yi's for each bin; and computer program code for interpolating said medians ({overscore (x)}k, {overscore (y)}k).
- 30. The computer software product of claim 29 wherein said bins are of approximately equal size.
- 31. The computer software product of claim 30 wherein said h(x) is:
- 32. The computer software product of claim 31 wherein said m is 3.
- 33. A computer software product for comparing the expression of a gene in a first sample with a second sample comprising:
computer program code for providing a first plurality of intensity values (Ii(1)), each of which reflects the expression of said gene in said first sample, wherein said intensity values are obtained from a first nucleic acid probe array; computer program code for providing a second plurality of intensity values (Ii(2)), each of which reflects the expression of said gene in said second sample, wherein said intensity values are obtained from a second nucleic acid probe array; computer program code for calculating a p-value using one-sided Wilcoxon's signed rank test, wherein the p-value is for a null hypothesis that median(f(x)Ii(2)−Ii(1))=0 and an alternative hypothesis that median((f(x)Ii(1)−Ii(2))>0, wherein said f(x) is a normalization factor; computer program code for indicating whether said transcript is present based upon said p-value; and a computer readable medium for storing said codes.
- 34. The computer program code of claim 33 further comprising computer program code for calculating normalization factor, said code comprising:
code for obtaining the geometric mean (x) of said Ii(1) and said Ii(2); code for calculating said normalization factor according to:
f(x)=eh(x), wherein said h(x) is derived from referential intensities from said first and second probe arrays.
- 35. The computer software product of claim 34 wherein said h(x) is derived by relating geometric means (xi′) of first referential intensities (RIi(1)) in said first probe array and said second referential intensities (RIi(2)) in said second probe array to:
- 36. The computer software product of claim 35 wherein said code for relating comprising:
computer code for sorting (xi, yi) pairs according to xi into a plurality (m number) of bins with no overlapping; computer code for computing medians ({overscore (x)}k) of xi's and medians ({overscore (y)}k) of yi's for each bin; and computer code for interpolating said medians ({overscore (x)}k, {overscore (y)}k).
- 37. The computer software product of claim 36 wherein said bins are of approximately equal size.
- 38. The computer software product of claim 37 wherein said h(x) is:
- 39. The computer software product of claim 38 wherein said m is 3.
RELATED APPLICATIONS
[0001] This application is related to U.S. application Ser. No. ______, filed Dec. 12, 2000, Attorney Docket No. 3298.1, which is incorporated herein by reference in its entirety for all purposes.