The present invention relates to the cancerology field. More particularly, the invention relates to a method for breast cancer diagnosis.
In women, breast cancer is the leading cause of mortality due to cancer in industrialized countries. Age is the most important risk factor. Thus, the risk increases by 0.5% per year of age in countries in the West. Other risk factors are known, such as the number of pregnancies and the age at the time of the first pregnancy, breast-feeding, the age at puberty and at the menopause, estrogenic treatments after the menopause has occurred, stress and nutrition.
Breast cancer diagnosis is generally carried out by mammography. However, it is estimated that the minimum size of a tumor that can be detected by mammography is 1 cm, which means that the tumor has an evolutive past of 8 years, on average, at the time of diagnosis. In addition, the early detection of a tumor is all the more important because small tumors are much less malignant than what can be extrapolated from their sizes: the aggressiveness of large tumors does not come only from their size, but also from their “inherent aggressiveness”, which increases with the age of a tumor (Bucchi et al., Br J Cancer 2005, p. 156-161; Norden T, Eur J Cancer 1997, p. 624-628). The analysis of the expression of a panel of target genes is also relevant in combating breast cancer, and mention may in particular be made of the analysis of a panel of 176 genes, which is expressed differentially between patients expressing the ER receptor and patients not expressing the ER receptor (Bertucci et al., Human Molecular Genetics, 2000; 9: 2981-2991). Mention may also be made of the analysis of a panel of 37 genes which makes it possible to make an early diagnosis of the breast cancer (Sharma et al., Breast cancer Research 2005, 7:R634—R644). However, all the patients had a suspect initial mammogram. This gene panel could be unsuitable for a routine diagnostic test for breast cancer prior to any mammography.
It is therefore very useful to have a tool for making a very early breast cancer diagnosis.
The present invention proposes a novel method for breast cancer diagnosis, which makes it possible to distinguish patients suffering from a breast cancer very early, but also at an advanced stage. The analysis of the expression of these target genes can be carried out directly using a blood sample, which makes it possible to avoid serious or detrimental procedures, and the uncertainties associated with the taking of a sample during these procedures. Since peripheral blood is the most clinically accessible interior compartment, the use of this source in particular allows a routine diagnosis, which is less restricting than mammography, to be carried out.
Before proceeding with the disclosure of the invention, the following definitions, which apply for all the variants of the invention, should be given.
For the purpose of the present invention, the term “biological sample” is intended to mean any sample taken from a patient and liable to contain a biological material as defined hereinafter. This biological sample may in particular be a blood, serum, saliva, tissue, tumor or bone marrow sample or a sample of circulating cells from the patient. This biological sample is obtained by any means for taking a sample known to those skilled in theart.
For the purpose of the present invention, the term “biological material” is intended to mean any material which makes it possible to detect the expression of a target gene. The biological material may in particular comprise proteins, or nucleic acids such as, in particular, deoxyribonucleic acids (DNA) or ribonucleic acids (RNA). The nucleic acid may in particular be an RNA (ribonucleic acid). According to a preferred embodiment of the invention, the biological material comprises nucleic acids, preferably RNA, and even more preferably total RNA. Total RNA comprises transfer RNA, messenger RNA (mRNA), such as the mRNA transcribed from the target gene, but also transcribed from any other gene, and ribosomal RNA. This biological material comprises material specific for a target gene, such as in particular the mRNA transcribed from the target gene or the proteins derived from this mRNA, but may also comprise material not specific for a target gene, such as in particular the mRNA transcribed from a gene other than the target gene, the tRNA, or the rRNA derived from genes other than the target gene.
During step a) of the method according to the invention, the biological material is extracted from the biological sample by any of the protocols for extracting and purifying nucleic acids well known to those skilled in the art. By way of indication, the nucleic acid extraction can be carried out by means of:
The RNA can then be pelleted by centrifugation, washed and redissolved. During step b, and for the purposes of the present invention, the term “specific reagent” is intended to mean a reagent which, when it is brought into contact with biological material as defined above, binds with the material specific for said target gene. By way of indication, when the specific reagent and the biological material are of nucleic origin, bringing the specific reagent into contact with the biological material allows the specific reagent to hybridize with the material specific for the target gene. The term “hybridization” is intended to mean the process during which, under suitable conditions, two nucleotide fragments bind with stable and specific hydrogen bonds so as to form a double-stranded complex. These hydrogen bonds form between the complementary adenine (A) and thymine (T) (or uracil (U)) bases (this is then referred to as an A-T bond) or between the complementary guanine (G) and cytosine (C) bases (this is then referred to as a G-C bond). The hybridization of two nucleotide fragments may be total (reference is then made to complementary nucleotide fragments or sequences), i.e. the double-stranded complex obtained during this hybridization comprises only A-T bonds and C-G bonds. This hybridization may be partial (reference is then made to sufficiently complementary nucleotide fragments or sequences), i.e. the double-stranded complex obtained comprises A-T bonds and C-G bonds allowing the double-stranded complex to form, but also bases not bonded to a complementary base. The hybridization between two nucleotide fragments depends on the working conditions which are used, and in particular on the stringency. The stringency is defined in particular according to the base composition of the two nucleotide fragments, and also by the degree of mismatching between two nucleotide fragments. The stringency may also depend on the reaction parameters, such as the concentration and the type of ionic species present in the hybridization solution, the nature and the concentration of denaturing agents and/or the hybridization temperature. All these data are well known and the appropriate conditions can be determined by those skilled in the art. In general, according to the length of the nucleotide fragments that it is desired to hybridize, the hybridization temperature is between approximately 20 and 70° C., in particular between 35 and 65° C., in a saline solution at a concentration of approximately 0.5 to 1 M. A sequence, or nucleotide fragment, or oligonucleotide, or polynucleotide is a series of nucleotide motifs assembled together by phosphoric ester bonds, characterized by the informational sequence of the natural nucleic acids, capable of hybridizing to a nucleotide fragment, it being possible for the series to contain monomers of different structures and to be obtained from a natural nucleic acid molecule and/or by genetic recombination and/or by chemical synthesis. A motif is derived from a monomer which may be a natural nucleotide of a nucleic acid, the constitutive elements of which are a sugar, a phosphate group and a nitrogenous base; in DNA, the sugar is deoxy-2-ribose, in RNA, the sugar is ribose; depending on whether it is a question of DNA or RNA, the nitrogenous base is chosen from adenine, guanine, uracil, cytosine and thymine; or else the monomer is a nucleotide modified in at least one of the three constitutive elements: by way of example, the modification may occur either at the level of the bases, with modified bases such as inosine, methyl-5-deoxycytidine, deoxyuridine, dimethylamino-5-deoxyuridine, diamino-2,6-purine, bromo-5-deoxyuridine or any other modified base capable of hybridization, or at the level of the sugar, for example the replacement of at least one deoxyribose with a polyamide (P. E. Nielsen et al., Science, 254, 1497-1500 (1991)), or else at the level of the phosphate group, for example replacement thereof with esters chosen in particular from diphosphates, alkyl and aryl phosphonates and phosphorothioates.
For the purpose of the present invention, the specific reagent is an amplification primer. For the purpose of the present invention, the term “amplification primer” is intended to mean a nucleotide fragment comprising from 5 to 100 nucleic motifs, preferably from 15 to 30 nucleic motifs, allowing the initiation of an enzymatic polymerization, such as, in particular, an enzymatic amplification reaction. The term “enzymatic amplification reaction” is intended to mean a process which generates multiple copies of a nucleotide fragment through the action of at least one enzyme. Such amplification reactions are well known to those skilled in the art and mention may in particular be made of the following techniques:
When the enzymatic amplification is a PCR, the specific reagent comprises at least two amplification primers, specific for a target gene, which allow the amplification of the target gene-specific material. The target gene-specific material then preferably comprises a complementary DNA obtained by reverse transcription of messenger RNA derived from the target gene (reference is then made to target gene-specific cDNA) or a complementary RNA obtained by transcription of the cDNAs specific for a target gene (reference is then made to target gene-specific cRNA). When the enzymatic amplification is a PCR carried out after a reverse transcription reaction, this is referred to as RT-PCR.
The specific reagent may also be a hybridization probe. The term “hybridization probe” is intended to mean a nucleotide fragment comprising at least 5 nucleic motifs, preferably from 5 to 100 nucleic motifs, even more preferably from 10 to 35 nucleic motifs, having a hybridization specificity under given conditions so as to form a hybridization complex with the material specific for a target gene. In the present invention, the target gene-specific material may be a nucleotide sequence included in a messenger RNA derived from the target gene (reference is then made to target gene-specific mRNA), a nucleotide sequence included in a complementary DNA obtained by reverse transcription of said messenger RNA (reference is then made to target gene-specific cDNA), or else a nucleotide sequence included in a complementary RNA obtained by transcription of said cDNA as described above (reference will then be made to target gene-specific cRNA). The hybridization probe may comprise a label for its detection. The term “detection” is intended to mean either a direct detection by a physical method, or an indirect detection by a detection method using a label. Many detection methods exist for the detection of nucleic acids [see, for example, Kricka et al., Clinical Chemistry, 1999, no 45(4), p. 453-458 or Keller G. H. et al., DNA Probes, 2nd Ed., Stockton Press, 1993, sections 5 and 6, p. 173-249]. The term “label” is intended to mean a tracer capable of generating a signal that can be detected. A nonlimiting list of these tracers includes enzymes which produce a signal detectable, for example, by colorimetry, fluorescence or luminescence, such as horseradish peroxidase, alkaline phosphatase, beta-galactosidase or glucose-6-phosphate dehydrogenase; chromophores such as fluorescent, luminescent or dye compounds; electron dense groups detectable by electron microscopy or by their electrical properties such as conductivity, by amperometry or voltametry methods, or by impedance measurements; groups that can be detected by optical methods such as diffraction, surface plasmon resonance, contact angle variation or by physical methods such as atomic force spectroscopy, tunnel effect, etc.; radioactive molecules such as 32P, 35S or 125I.
For the purpose of the present invention, the hybridization probe may be a “detection” probe. In this case, the “detection” probe is labeled with a label as defined above. The detection probe may in particular be a “molecular beacon” detection probe as described by Tyagi & Kramer (Nature Biotechnol., 1996, 14:303-308). These “molecular beacons” become fluorescent during hybridization. They have a stem-loop structure and contain a fluorophore and a quencher group. The binding of the specific loop sequence with its complementary target nucleic acid sequence causes the stem to uncoil and a fluorescent signal to be emitted during excitation at the appropriate wavelength.
For the detection of the hybridization reaction, use may be made of target sequences that have been labeled directly (in particular by the incorporation of a label within the target sequence) or indirectly (in particular using a detection probe as defined above) the target sequence. A step for labeling and/or cleaving the target sequence can in particular be carried out before the hybridization step, for example using a labeled deoxyribonucleotide triphosphate during the enzymatic amplification reaction. The cleavage can be carried out in particular through the action of imidazole and manganese chloride. The target sequence can also be labeled after the amplification step, for example by hybridizing a detection probe according to the sandwich hybridization technique described in document WO 91/19812. Another specific preferred method for labeling nucleic acids is described in application FR2780 059.
The hybridization probe may also be a “capture” probe. In this case, the “capture” probe is immobilized or immobilizable on a solid support by any appropriate means, i.e. directly or indirectly, for example by covalence or adsorption. As solid support, use may be made of synthetic materials or natural materials, optionally chemically modified, in particular polysaccharides such as cellulose-based materials, for example paper, cellulose derivatives such as cellulose acetate or nitrocellulose, or dextran, polymers, copolymers, in particular based on styrene-type monomers, natural fibers such as cotton, and synthetic fibers such as nylon; mineral materials such as silica, quartz, glasses, ceramics; latices; magnetic particles; metal derivatives, gels, etc. The solid support may be in the form of a microtitration plate, of a membrane as described in application WO-A-94/12670, or of a particle. Several different capture probes, each being specific for a target gene, can also be immobilized on the support. In particular, a biochip on which a large number of probes can be immobilized may be used as support. The term “biochip” is intended to mean a solid support which is small in size and to which are attached a multitude of capture probes at predetermined positions. The biochip, or DNA chip, concept dates from the beginning of the 1990s. It is based on a multidisciplinary technology which integrates microelectronics, nucleic acid chemistry, image analysis and information technology. The operating principle is based on a foundation of molecular biology: the hybridization phenomenon, i.e. the pairing, by complementarity, of the bases of two DNA and/or RNA sequences. The biochip method is based on the use of capture probes attached to a solid support, on which probes a sample of target nucleotide fragments directly or indirectly labeled with fluorochromes is made to act. The capture probes are positioned specifically on the substrate or chip and each hybridization gives a specific piece of information, in relation to the target nucleotide fragment. The pieces of information obtained are cumulative, and make it possible, for example, to quantify the level of expression of one or more target genes. In order to analyze the expression of a target gene, a biochip carrying a very large number of probes which correspond to all or part of the target gene, which is transcribed to mRNA, can then be prepared. The cDNAs or cRNAs specific for a target gene that it is desired to analyze are then hybridized, for example, on specific capture probes. After hybridization, the support or chip is washed and the labeled cDNA or cRNA/capture probe complexes are revealed by means of a high-affinity ligand bound, for example, to a fluorochrome-type label. The fluorescence is read, for example, with a scanner and the analysis of the fluorescence is processed by information technology. By way of indication, mention may be made of the DNA chips developed by the company Affymetrix (“Accessing Genetic Information with High-Density DNA arrays”, M. Chee et al., Science, 1996, 274, 610-614. “Light-generated oligonucleotide arrays for rapid DNA sequence analysis”, A. Caviani Pease et al., Proc. Natl. Acad. Sci. USA, 1994, 91, 5022-5026), for molecular diagnoses. In this technology, the capture probes are generally small in size, around 25 nucleotides. Other examples of biochips are given in the publications by G. Ramsay, Nature Biotechnology, 1998, no 16, p. 40-44; F. Ginot, Human Mutation, 1997, no 10, p. 1-10; J. Cheng et al., Molecular diagnosis, 1996, no 1(3), p. 183-200; T. Livache et al., Nucleic Acids Research, 1994, no 22(15), p. 2915-2921; J. Cheng et al., Nature Biotechnology, 1998, no 16, p. 541-546 or in U.S. Pat. No. 4,981,783, U.S. Pat. No. 5,700,637, U.S. Pat. No. 5,445,934, U.S. Pat. No. 5,744,305 and U.S. Pat. No. 5,807,522. The main characteristic of the solid support should be to conserve the hybridization characteristics of the capture probes on the target nucleotide fragments while at the same time generating a minimum background noise for the method of detection.
Three main types of fabrication can be distinguished for immobilizing the probes on the support.
First of all, there is a first technique which consists in depositing presynthesized probes. The attachment of the probes is carried out by direct transfer, by means of micropipettes or of microdots or by means of an inkjet device. This technique allows the attachment of probes having a size ranging from a few bases (5 to 10) up to relatively large sizes of 60 bases (printing) to a few hundred bases (microdeposition): Printing is an adaptation of the method used by inkjet printers. It is based on the propulsion of very small spheres of fluid (volume<1 nl) at a rate that may reach 4000 drops/second. The printing does not involve any contact between the system releasing the fluid and the surface on which it is deposited.
Microdeposition consists in attaching long probes of a few tens to several hundred bases to the surface of a glass slide. These probes are generally extracted from databases and are in the form of amplified and purified products. This technique makes it possible to produce chips called microarrays that carry approximately ten thousand spots, called recognition zones, of DNA on a surface area of a little less than 4 cm2. The use of nylon membranes, referred to as “macroarrays”, which carry products that have been amplified, generally by PCR, with a diameter of 0.5 to 1 mm and the maximum density of which is 25 spots/cm2, should not however be forgotten. This very flexible technique is used by many laboratories. In the present invention, the latter technique is considered to be included among biochips. A certain volume of sample can, however, be deposited at the bottom of a microtitration plate, in each well, as in the case in patent applications WO-A-00/71750 and FR 00/14896, or a certain number of drops that are separate from one another can be deposited at the bottom of one and the same Petri dish, according to another patent application, FR00/14691.
The second technique for attaching the probes to the support or chip is called in situ synthesis. This technique results in the production of short probes directly at the surface of the chip. It is based on in situ oligonucleotide synthesis (see, in particular, patent applications WO 89/10977 and WO 90/03382), and is based on the oligonucleotide synthesizer process. It consists in moving a reaction chamber, in which the oligonucleotide extension reaction takes place, along the glass surface.
Finally, the third technique is called photolithography, which is a process that is responsible for the biochips developed by Affymetrix. It is also an in situ synthesis. Photolithography is derived from microprocessor techniques. The surface of the chip is modified by the attachment of photolabile chemical groups that can be light-activated. Once illuminated, these groups are capable of reacting with the 3′ end of an oligonucleotide. By protecting this surface with masks of defined shapes, it is possible to selectively illuminate and therefore activate areas of the chip where it is desired to attach one or other of the four nucleotides. The successive use of various masks makes it possible to alternate cycles of protection/reaction and therefore to produce the oligonucleotide probes on spots of approximately a few tens of square micrometers (μm2). This resolution makes it possible to create up to several hundred thousand spots on a surface area of a few square centimeters (cm2). Photolithography has advantages: in bulk in parallel, it makes it possible to create a chip of N-mers in only 4×N cycles. All these techniques can be used with the present invention.
For the purpose of the present invention, the determination of the expression (step c) of the method according to the invention) of a target gene can be carried out by any of the protocols known to those skilled in the art.
In general, the expression of a target gene can be analyzed by detection of the mRNAs (messenger RNAs) which are transcribed from the target gene at a given instant or by the detection of the proteins derived from these mRNAs.
When the specific reagent is an amplification primer, the expression of a target gene can be determined in the following way:
1) after having extracted, as biological material, the total RNA (comprising the transfer RNAs (tRNAs), the ribosomal RNAs (rRNAs) and the messenger RNAs (mRNAs)) from a biological sample as presented above, a reverse transcription step is carried out in order to obtain the complementary DNAs (or cDNAs) of said mRNAs. By way of indication, this reverse transcription reaction can be carried out using a reverse transcriptase enzyme which makes it possible to obtain, from an RNA fragment, a complementary DNA fragment. The reverse transcriptase enzyme originating from AMV (avian myoblastosis virus) or from MMLV (moloney murine leukemia virus) can in particular be used. When it is more particularly desired to obtain only the cDNAs of the mRNAs, this reverse transcription step is carried out in the presence of nucleotide fragments comprising only thymine bases (polyT), which hybridize by complementarity on the polyA sequence of the mRNAs so as to form a polyT-polyA complex which then serves as a starting point for the reverse transcription reaction carried out by the reverse transcriptase enzyme. cDNAs complementary to the mRNAs derived from a target gene (target gene-specific cDNA) and cDNAs complementary to the mRNAs derived from genes other than the target gene (cDNAs not specific for the target gene) are then obtained.
2) the amplification primer(s) specific for a target gene is (are) brought into contact with the target gene-specific cDNAs and the cDNAs not specific for the target gene. The amplification primer(s) specific for a target gene hybridize(s) with the target gene-specific cDNAs and a predetermined region, of known length, of the cDNAs originating from the mRNAs derived from the target gene is specifically amplified. The cDNAs not specific for the target gene are not amplified, whereas a large amount of target gene-specific cDNAs is then obtained. For the purpose of the present invention, reference is made, without distinction, to “target gene-specific cDNAs” or to “cDNAs originating from the mRNAs derived from the target gene”. This step can be carried out in particular by means of a PCR-type amplification reaction or by any other amplification technique as defined above. By PCR, it is also possible to simultaneously amplify several different cDNAs, each one being specific for various target genes, by using several pairs of different amplification primers, each one being specific for a target gene: reference is then made to multiplex amplification.
3) the expression of the target gene is determined by detecting and quantifying the target gene-specific cDNAs obtained in step 2) above. This detection can be carried out after electrophoretic migration of the target gene-specific cDNAs according to their size. The gel and the migration medium can include ethydium bromide so as to allow direct detection of the target gene-specific cDNAs when the gel is placed, after a given migration period, on a UV (ultraviolet)-ray light table, through the emission of a light signal. The greater the amount of target gene-specific cDNAs, the brighter this light signal. These electrophoresis techniques are well known to those skilled in the art. The target gene-specific cDNAs can also be detected and quantified using a quantification range obtained by means of an amplification reaction carried out until saturation. In order to take into account the variability and enzymatic efficiency that may be observed during the various steps (reverse transcription, PCR, etc.), the expression of a target gene of various groups of patients can be normalized by simultaneously determining the expression of a “housekeeping” gene, the expression of which is similar in the various groups of patients. By realizing a ratio of the expression of the target gene to the expression of the housekeeping gene, i.e. by realizing a ratio of the amount of target gene-specific cDNAs to the amount of housekeeping gene-specific cDNAs, any variability between the various experiments is thus corrected. Those skilled in the art may refer in particular to the following publications: Bustin S A, J Mol Endocrinol, 2002, 29: 23-39; Giulietti A Methods, 2001, 25: 386-401.
When the specific reagent is a hybridization probe, the expression of a target gene can be determined in the following way:
1) after having extracted, as biological material, the total RNA from a biological sample as presented above, a reverse transcription step is carried out as described above in order to obtain cDNAs complementary to the mRNAs derived from a target gene (target gene-specific cDNA) and cDNAs complementary to the mRNAs derived from genes other than the target gene (cDNA not specific for the target gene).
2) all the cDNAs are brought into contact with a support, on which are immobilized capture probes specific for the target gene whose expression it is desired to analyze, in order to carry out a hybridization reaction between the target gene-specific cDNAs and the capture probes; the cDNAs not specific for the target gene do not hybridize to the capture probes. The hybridization reaction can be carried out on a solid support which includes all the materials as indicated above. According to a preferred embodiment, the hybridization probe is immobilized on a support. Preferably, the support is a biochip. The hybridization reaction can be preceded by a step of enzymatic amplification of the target gene-specific cDNAs as described above, so as to obtain a large amount of target gene-specific cDNAs and to increase the probability of a target gene-specific cDNA hybridizing to a capture probe specific for the target gene. The hybridization reaction may also be preceded by a step for labeling and/or cleaving the target gene-specific cDNAs as described above, for example using a labeled deoxyribonucleotide triphosphate for the amplification reaction. The cleavage can be carried out in particular by the action of imidazole and manganese chloride. The target gene-specific cDNA can also be labeled after the amplification step, for example by hybridizing a labeled probe according to the sandwich hybridization technique described in document WO-A-91/19812. Other preferred specific methods for labeling and/or cleaving nucleic acids are described in applications WO 99/65926, WO 01/44507, WO 01/44506, WO 02/090584 and WO 02/090319.
3) a step for detection of the hybridization reaction is subsequently carried out. The detection can be carried out by bringing the support on which the capture probes specific for the target gene are hybridized with the target gene-specific cDNAs into contact with a “detection” probe labeled with a label, and detecting the signal emitted by the label. When the target gene-specific cDNA has been labeled beforehand with a label, the signal emitted by the label is detected directly.
When the specific reagent is a hybridization probe, the expression of a target gene can also be determined in the following way:
1) after having extracted, as biological material, the total RNA from a biological sample as presented above, a reverse transcription step is carried out as described above in order to obtain the cDNAs of the mRNAs of the biological material. The polymerization of the complementary RNA of the cDNA is subsequently carried out using a T7 polymerase enzyme which functions under the control of a promoter and which makes it possible to obtain, from a DNA template, the complementary RNA. The cRNAs of the cDNAs of the mRNAs specific for the target gene (reference is then made to target gene-specific cRNA) and the cRNAs of the cDNAs of the mRNAs not specific for the target gene are then obtained.
2) all the cRNAs are brought into contact with a support on which are immobilized capture probes specific for the target gene whose expression it is desired to analyze, in order to carry out a hybridization reaction between the target gene-specific cRNAs and the capture probes; the cRNAs not specific for the target gene do not hybridize to the capture probes. When it is desired to simultaneously analyze the expression of several target genes, several different capture probes can be immobilized on the support, each one being specific for a target gene. The hybridization reaction can also be preceded by a step for labeling and/or cleaving the target gene-specific cRNAs as described above.
3) a step for detecting the hybridization reaction is subsequently carried out. The detection can be carried out by bringing the support on which the capture probes specific for the target gene are hybridized with the target gene-specific cRNA into contact with a “detection” probe labeled with a label, and detecting the signal emitted by the label. When the target gene-specific cRNA has been labeled beforehand with a label, the signal emitted by the label is detected directly. The use of cRNA is particularly advantageous when a support of biochip type on which a large number of probes are hybridized is used. According to a specific embodiment of the invention, steps B and C are carried out at the same time. This preferred method can in particular be carried out by “real time NASBA”, which groups together, in a single step, the NASBA amplification technique and real-time detection which uses “molecular beacons”. The NASBA reaction takes place in the tube, producing the single-stranded RNA with which the specific “molecular beacons” can simultaneously hybridize to give a fluorescent signal. The formation of the new RNA molecules is measured in real time by continuous verification of the signal in a fluorescent reader. Unlike an RT-PCR amplification, NASBA amplification can take place with contaminating DNA being present in the sample. It is not therefore necessary to verify that the DNA has indeed been completely eliminated during the RNA extraction.
Surprisingly, the inventors have demonstrated that the analysis of the expression of target genes selected from 74 genes, as presented in table 1 below, is highly relevant for the early diagnosis of breast cancer.
Among these genes, it is possible to distinguish genes for which the function is known but which have never been associated with breast cancer (SEQ ID Nos. 3 to 6; 11; 13 to 15; 17; 25; 31 to 34; 39; 42 to 46; 48; 50; 51; 54; 60; 62; 64; 67; 69; 70; 72 to 74) and also genes for which the function is unknown (SEQ ID Nos. 1; 2; 7 to 10; 16; 18 to 23; 26 to 30; 35 to 38; 40; 47; 49; 52; 53; 55 to 57; 61; 63; 65; 66; 68; 71).
All the isoforms of the genes according to the invention are relevant for the present invention. In this respect, it should in particular be noted that several variants exist for the target gene of SEQ ID No. 14; only the first variant is presented in the table above, but the variants having Genbank accession number: NM—207292; NM—207293; NM—207294; NM—207295; NM—207296; NM—207297 are just as relevant for the purpose of the present invention.
Similarly, for SEQ ID No. 17, only the first variant is presented in the table above, but the variant having Genbank accession number NM—183419 is just as relevant for the purpose of the present invention.
Similarly, for SEQ ID No. 31, only the first variant is presented in the table above, but the variant having Genbank accession number NM—177438 is just as relevant for the purpose of the present invention.
Similarly, for SEQ ID No. 34, only the first variant is presented in the table above, but the variant having Genbank accession number NM—000651 is just as relevant for the purpose of the present invention.
Similarly, for SEQ ID No. 41, only the first variant is presented in the table above, but the variant having Genbank accession number NM—015928 is just as relevant for the purpose of the present invention.
For the target gene of SEQ ID No. 46, only the first variant is presented in the table above, but the variants having Genbank accession number: NM—016592; NM—080425; NM—080426 are just as relevant for the purpose of the present invention.
For the target gene of SEQ ID No. 47, only the first variant is presented in the table above, but the variant having Genbank accession number NM—203326 is just as relevant for the purpose of the present invention.
For the target gene of SEQ ID No. 48, only the first variant is presented in the table above, but the variant having Genbank accession number NM—138578 is just as relevant for the purpose of the present invention.
For the target gene of SEQ ID No. 50, only the first variant is presented in the table above, but the variants having Genbank accession number NM—001001485; NM—001001486; NM—001001487; NM—014382 are just as relevant for the purpose of the present invention.
For the target gene of SEQ ID No. 53, only the first variant is presented in the table above, but the variant having Genbank accession number NM—203497 is just as relevant for the purpose of the present invention.
For the target gene of SEQ ID No. 55, only the first variant is presented in the table above, but the variant having Genbank accession number NM—182646 is just as relevant for the purpose of the present invention.
For the target gene of SEQ ID No. 57, only the first variant is presented in the table above, but the variant having Genbank accession number Genbank NM—013986 is just as relevant for the purpose of the present invention.
For the target gene of SEQ ID No. 59, only the first variant is presented in the table above, but the variant having Genbank accession number NM—080491 is just as relevant for the purpose of the present invention.
For the target gene of SEQ ID No. 60, only the first variant is presented in the table above, but the variants having Genbank accession number: NM—002259; NM—002260; NM—007328; NM—213657; NM—213658 are just as relevant for the purpose of the present invention.
For the target gene of SEQ ID No. 64, only the first variant is presented in the table above, but the variant having Genbank accession number NM—133433 is just as relevant for the purpose of the present invention.
For the target gene of SEQ ID No. 74, only the first variant is presented in the table above, but the variant having Genbank accession number NM—212479 is just as relevant for the purpose of the present invention.
Furthermore, the inventors have demonstrated that the analysis of target genes selected from the 95 genes presented in table 2 below are highly relevant for the diagnosis of advanced breast cancer.
Homo sapiens, clone IMAGE: 5267398, mRNA cDNA
Among these genes, it is possible to distinguish genes for which the function is known but which have never been associated with breast cancer (SEQ ID Nos. 3 to 6; 13; 14; 39; 46; 48; 60; 67; 69; 70; 72 to 74; 76; 78; 79; 82; 83; 85; 87; 92 to 94; 97; 99 to 101; 103; 105; 110; 111; 113 to 116; 118; 120; 122; 126; 130 to 132; 134; 136; 137) and also genes for which the function is unknown (SEQ ID Nos. 1; 2; 20; 26; 28; 36; 37; 40; 61; 63; 65; 71; 75; 77; 80; 81; 84; 86; 88 to 91; 95; 96; 102; 104; 106 to 109; 119; 121; 123 to 125; 128; 129; 133; 135; 138; 139).
All the isoforms of the genes according to the invention are relevant for the present invention. In this respect, it should in particular be noted that several variants exist for the target gene of SEQ ID No. 14; only the first variant is presented in the table above, but the variants having Genbank accession number: NM—207292; NM—207293; NM—207294; NM—207295; NM—207296; NM—207297 are just as relevant for the purpose of the present invention.
Similarly, for the target gene of SEQ ID No. 41, only the first variant is presented in the table above, but the variant having Genbank accession number NM—015928 is just as relevant for the purpose of the present invention.
Similarly, for the target gene of SEQ ID No. 74, only the first variant is presented in the table above, but the variant having Genbank accession number NM—212479 is just as relevant for the purpose of the present invention.
Similarly, for the target gene of SEQ ID No. 96, only the first variant is presented in the table above, but the variant having Genbank accession number NM—030758 is just as relevant for the purpose of the present invention.
For the target gene of SEQ ID No. 98, only the first variant is presented in the table above, but the variants having Genbank accession number: NM—178578; NM—178579 are just as relevant for the purpose of the present invention.
For the target gene of SEQ ID No. 99, only the first variant is presented in the table above, but the variants having Genbank accession number: NM—080391; NM—080392 are just as relevant for the purpose of the present invention.
Similarly, for the target gene of SEQ ID No. 100, only the first variant is presented in the table above, but the variant having Genbank accession number NM—016225 is just as relevant for the purpose of the present invention.
For the target gene of SEQ ID No. 112, only the first variant is presented in the table above, but the variants having Genbank accession number: NM—016377; NM—138633 are just as relevant for the purpose of the present invention.
For the target gene of SEQ ID No. 115, only the first variant is presented in the table above, but the variants having Genbank accession number: NM—020475; NM—020476; NM—020477; NM—020478; NM—020479; NM—020480; NM—020481 are just as relevant for the purpose of the present invention.
Similarly, for the target gene of SEQ ID No. 137, only the first variant is presented in the table above, but the variant having Genbank accession number NM—022173 is just as relevant for the present invention.
In this respect, the invention relates to a method for the in vitro diagnosis of breast cancer in a patient who may be suffering from a breast cancer, characterized in that it comprises the following steps:
The analysis of the expression of a target gene chosen from any one of SEQ ID Nos. 1 to 74 then makes it possible to have a tool for the diagnosis of breast cancer, and is very suitable for an early diagnosis. It is, for example, possible to analyze the expression of a target gene in a patient for whom the diagnosis is not known, and to compare with known average expression values for the target gene of normal patients and known average expression values for the target gene of patients suffering from an early-stage breast cancer.
According to a specific embodiment of the invention, in step b), the biological material is brought into contact with at least 5, at least 10, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70 or at least 74 specific reagents chosen from the specific reagents for the target genes with a nucleic sequence having any one of SEQ ID NOs. 1 to 74, and, in step c, the expression of at least 5, at least 10, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70 or at least 74 of said genes is determined.
According to a specific embodiment of the invention, the invention relates to an in vitro method for the diagnosis, preferably early diagnosis, of breast cancer in a patient who may be suffering from a breast cancer, characterized in that it comprises the following steps:
a. biological material is extracted from a biological sample taken from the patient,
b. the biological material is brought into contact with at least 46 specific reagents chosen from the specific reagents for the target genes with a nucleic sequence having any one of SEQ ID Nos. 1 to 46,
c. the expression of said target genes is determined.
According to another preferred embodiment of the invention, in step b), the biological material is brought into contact with at least 23 or with 23 specific reagents chosen from the specific reagents for the target genes with a nucleic sequence having any one of SEQ ID Nos. 1 to 23, and, in step c, the expression of at least 23 or 23 of said target genes is determined.
According to a particularly preferred embodiment of the invention, the invention relates to an in vitro method for the diagnosis, preferably early diagnosis, of breast cancer in a patient who may be suffering from a breast cancer, characterized in that it comprises the following steps:
a. biological material is extracted from a biological sample taken from the patient,
b. the biological material is brought into contact with at least 23 specific reagents chosen from the specific reagents for the target genes with a nucleic sequence having any one of SEQ ID Nos. 1 to 23,
c. the expression of said target genes is determined.
According to another preferred embodiment of the invention, in step b), the biological material is brought into contact with at least 8 or with 8 specific reagents chosen from the specific reagents for the target genes with a nucleic sequence having any one of SEQ ID Nos. 1 to 8, and, in step c, the expression of at least 8 or 8 of said genes is determined. According to a preferred embodiment of the invention, the invention relates to an in vitro method for the diagnosis, preferably early diagnosis, of breast cancer in a patient who may be suffering from a breast cancer, characterized in that it comprises the following steps:
a. biological material is extracted from a biological sample taken from the patient,
b. the biological material is brought into contact with at least 8 specific reagents chosen from the specific reagents for the target genes with a nucleic sequence having any one of SEQ ID Nos. 1 to 8,
c. the expression of said target genes is determined.
The use of a restricted panel of genes is particularly suitable for obtaining a prognostic tool. In fact, the analysis of the expression of about ten genes does not require the custom-made fabrication of DNA chips, and can be carried out directly by PCR or NASBA techniques, or low-density chip techniques, which provides a considerable economic asset and a simplified implementation.
The invention also relates to a method for the in vitro diagnosis of breast cancer in a patient who may be suffering from a breast cancer, characterized in that it comprises the following steps:
The analysis of the expression of a target gene chosen from any one of SEQ ID No. 1 to SEQ ID Nos. 1 to 6; 13; 14; 20; 26; 28; 36 to 41; 46; 48; 59 to 61; 63; 65; 67; 69 to 139 then makes it possible to provide a tool for the diagnosis of breast cancer, which is very suitable for the diagnosis of a late-stage cancer. It is, for example, possible to analyze the expression of a target gene in a patient for whom the diagnosis is not known, and to compare with known average expression values for the target gene of normal patients and known average expression values for the target gene of patients suffering from a late-stage breast cancer. This tool also makes it possible, for example, to monitor a treatment prescribed for a patient suffering from an advanced breast cancer.
According to a specific embodiment of the invention, in step b), the biological material is brought into contact with at least 5, at least 10, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95 or at least 97 specific reagents chosen from the specific reagents for the target genes with a nucleic sequence having any one of SEQ ID Nos. 1 to 6; 13; 14; 20; 26; 28; 36 to 41; 46; 48; 59 to 61; 63; 65; 67; 69 to 139, and, in step c, the expression of at least 5, at least 10, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95 or at least 97 of said target genes is determined.
According to a preferred embodiment of the invention, the invention relates to a method for the in vitro late diagnosis of breast cancer in a patient who may be suffering from a breast cancer, characterized in that it comprises the following steps:
a. biological material is extracted from a biological sample taken from the patient,
b. the biological material is brought into contact with at least 54 or 54 specific reagents chosen from the specific reagents for the target genes with a nucleic sequence having any one of SEQ ID Nos. 1 to 6; 13; 14; 20; 26; 28; 38 to 41; 69; 74 to 110,
c. the expression of said target genes is determined.
According to another preferred embodiment of the invention, in step b), the biological material is brought into contact with at least 29 or with 29 specific reagents chosen from the specific reagents for the target genes with a nucleic sequence having any one of SEQ ID Nos. 1; 2; 4 to 6; 13; 14; 20; 26; 38; 39; 41; 69; 75; 79 to 81; 87; 89; 93; 95 to 96; 101; 103 to 106; 108; 110, and, in step c, the expression of at least 29 or 29 of said target genes is determined.
According to a specific embodiment of the invention, the invention relates to a method for the in vitro early diagnosis of breast cancer in a patient who may be suffering from a breast cancer, characterized in that it comprises the following steps:
a. biological material is extracted from a biological sample taken from the patient,
b. the biological material is brought into contact with at least 29 or 29 specific reagents chosen from the specific reagents for the target genes with a nucleic sequence having any one of SEQ ID Nos. SEQ ID Nos 1; 2; 4 to 6; 13; 14; 20; 26; 38; 39; 41; 69; 75; 79 to 81; 87; 89; 93; 95 to 96; 101; 103 to 106; 108; 110,
c. the expression of said target genes is determined.
According to another preferred embodiment of the invention, in step b), the biological material is brought into contact with at least 10 or with 10 specific reagents chosen from the specific reagents for the target genes with a nucleic sequence having any one of SEQ ID Nos. 1; 2; 4; 6; 13; 14; 26; 69; 81; 105, and, in step c, the expression of at least 10 or 10 of said target genes is determined.
According to a preferred embodiment of the invention, the invention relates to an in vitro method for the diagnosis, preferably late diagnosis, of breast cancer in a patient who may be suffering from a breast cancer, characterized in that it comprises the following steps:
a. biological material is extracted from a biological sample taken from the patient,
b. the biological material is brought into contact with at least 10 or 10 specific reagents chosen from the specific reagents for the target genes with a nucleic sequence having any one of SEQ ID Nos. 1; 2; 4; 6; 13; 14; 26; 69; 81; 105,
c. the expression of said target genes is determined.
The use of a restricted panel of genes is particularly suitable for obtaining a prognostic tool. In fact, the analysis of the expression of about ten genes does not require the custom-made fabrication of DNA chips, and can be carried out directly by PCR or NASBA techniques, or low-density chip techniques, which provides a considerable economic asset and a simplified implementation.
Irrespective of the variant of the method according to the invention, the biological sample taken from the patient is preferably a blood sample. This makes it possible to obtain a method of diagnosis that is easy to implement and relatively painless for the patient. Irrespective of the variant of the method according to the invention, the biological material extracted in step a) preferably comprises nucleic acids, which allows an easy and rapid analysis of the expression of the target gene(s) in step c). In this case, said specific reagents of step b) are preferably hybridization probes. These hybridization probes are preferably immobilized on a support, which is preferably a biochip. This biochip then allows the simultaneous analysis of all the target genes according to the invention.
The invention also relates to a support, as defined above, comprising at least 8 specific hybridization probes for target genes with a nucleic sequence having any one of SEQ ID Nos. 1 to 8. The invention also relates to a support, as defined above, consisting of 8 specific hybridization probes for target genes with a nucleic sequence having any one of SEQ ID Nos. 1 to 8.
The invention also relates to a support, as defined above, comprising at least 23 specific hybridization probes for target genes with a nucleic sequence having any one of SEQ ID Nos. 1 to 23. The invention also relates to a support, as defined above, consisting of 23 specific hybridization probes for target genes with a nucleic sequence having any one of SEQ ID Nos. 1 to 23.
The invention also relates to a support, as defined above, comprising at least 46 specific hybridization probes for target genes with a nucleic sequence having any one of SEQ ID Nos. 1 to 46. This support preferably also comprises at least 28 specific hybridization probes for target genes with a nucleic sequence having any one of SEQ ID Nos. 47 to 74. The invention also relates to a support, as defined above, consisting of 46 specific hybridization probes for target genes with a nucleic sequence having any one of SEQ ID Nos. 1 to 46. This support preferably also consists of 28 specific hybridization probes for target genes with a nucleic sequence having any one of SEQ ID Nos. 47 to 74.
The invention also relates to the use of a support as defined above, for the early diagnosis of a breast cancer. The invention also relates to a kit for the early diagnosis of a breast cancer, comprising a support as defined above.
The invention also relates to a support comprising at least or consisting of 10 specific hybridization probes for target genes with a nucleic sequence having any one of SEQ ID Nos. 1; 2; 4; 6; 13; 14; 26; 69; 81; 105.
The invention also relates to a support comprising at least or consisting of 29 specific hybridization probes for target genes with a nucleic sequence having any one of SEQ ID Nos. 1; 2; 4 to 6; 13; 14; 20; 26; 38; 39; 41; 69; 75; 79 to 81; 87; 89; 93; 95 to 96; 101; 103 to 106; 108; 110.
The invention also relates to a support comprising at least or consisting of 54 specific hybridization probes for target genes with a nucleic sequence having any one of SEQ ID Nos. 1 to 6; 13; 14; 20; 26; 28; 38 to 41; 69; 74 to 111. This support preferably also comprises specific hybridization probes for target genes with a nucleic sequence having any one of SEQ ID Nos. 36; 37; 46; 48; 59 to 61; 63; 65; 67; 70 to 72; 73; 112 to 139. The invention also relates to the use of a support as defined above, for the late diagnosis of a breast cancer.
Finally, the invention relates to a kit for the early diagnosis of a breast cancer, comprising a support as defined above.
The attached figures are given by way of explanatory examples and are in no way limiting in nature. They will make it possible to understand the invention more clearly.
The following examples are given by way of illustration and are in no way limiting in nature. They will make it possible to understand the invention more clearly.
Biological sample characteristics: The example presented hereinafter was carried out using 46 blood samples (5 ml of whole blood, taken in two PaxGene tubes). These samples included 12 blood samples originating from normal control patients (S, obtained from the French Blood Bank) and 24 samples from patients suffering from a phase I/II breast cancer (Ci/ii).
Extraction of the biological material (total RNA) from the biological sample: The blood samples were collected directly in PAXGene™ Blood RNA tubes (PreAnalytix, Frankin Lakes, USA). After the step in which the blood sample was taken, and in order to obtain complete lysis of the cells, the tubes were left at ambient temperature for 4 h and then stored at −20° C. until extraction of the biological material. More specifically, in this protocol, the total RNA was extracted using the PAXGene Blood RNA® kits (PreAnalytix), according to the manufacturer's recommendations. Briefly, the tubes were centrifuged (10 min, 3000 g) in order to obtain a nucleic acid pellet. This pellet was washed and taken up in a buffer containing proteinase K, required for digestion of the proteins (10 min at 55° C.). A further centrifugation (5 min, 19 000 g) was carried out in order to remove the cell debris, and ethanol was added in order to optimize the conditions for binding of the nucleic acids. The total RNA was specifically bound to PAXgene RNA spin columns and, before the elution of said RNA, the contaminating DNA was digested using the RNAse free DNAse set (Qiagen Ltd, Crawley, UK). The quality of the total RNA was analyzed using the AGILENT 2100 bioanalyzer (Agilent Technologies, Waldbronn, Germany). The total RNA comprises the transfer RNAs, the messenger RNAs (mRNAs) and the ribosomal RNAs.
cDNA synthesis, cRNA production and cRNA labeling, and quantification: In order to analyze the expression of the target genes according to the invention, the complementary DNAs (cDNAs) of the mRNAs contained in the total RNA as purified above were obtained from 10 μg of total RNA using 400 units of the SuperScriptII reverse transcription enzyme (Invitrogen) and 100 μmol of poly-T primer containing the T7 promoter (T7-oligo(dT)24-primer, Proligo, Paris, France).
The cDNAs thus obtained were subsequently extracted with phenol/chloroform and precipitated as described previously with ammonium acetate and ethanol, and redissolved in 24 μl of DEPC water. A volume of 20 μl of this purified cDNA solution was subsequently subjected to in vitro transcription using a T7 RNA polymerase which specifically recognizes the T7 polymerase promoter as mentioned above. This transcription makes it possible to obtain the cRNA of the cDNA. This transcription was carried out using an IVT labeling kit (Affymetrix, Santa Clara, Calif.), which makes it possible not only to obtain the cRNA, but also the incorporation of biotinylated pseudouridine bases during the synthesis of the cRNA. The purified cRNAs were subsequently quantified by spectrophotometry, and the cRNA solution was adjusted to a concentration of 1 μg/μl of cRNA. The step consisting of cleavage of these cRNAs was subsequently carried out at 94° C. for 35 min, using a fragmentation buffer (40 mM of tris acetate, pH 8.1, 100 mM of potassium acetate, 30 mM of magnesium acetate) in order to bring about the hydrolysis of the cRNAs and to obtain fragments of 35 to 200 bp. The success of such a fragmentation was verified by means of a 1.5% agarose gel electrophoresis.
Demonstration of an Expression Profile for the Genes which Makes it Possible to Distinguish Between the Control Patients (S) and the Patients Suffering from a Stage I/II Cancer
The expression of approximately 30 000 genes was analyzed and compared between S and C I/II patients. For this, 10 μg of fragmented cRNAs derived from each sample were added to a hybridization buffer (Affymetrix) and 200 μl of this solution were brought into contact for 16 h at 45° C. on an expression chip (Human Genome U133Plus2 GeneChip® (Affymetrix)), which comprises 54 000 groups of probes representing approximately 30 000 genes, according to the Affymetrix protocol.
In order to record the best hybridization and washing performance levels, RNAs described as “control” RNAs that were biotinylated (bioB, bioC, bioD and cre) and oligonucleotides (oligo B2) were also included in the hybridization buffer. After the hybridization step, the biotinylated cRNAs hybridized on the chip were visualized using a solution of streptavidin-phycoerythrin and the signal was amplified using an anti-streptavidin antibody. The hybridization was carried out in a “GeneChip Hybridisation oven” (Affymetrix), and the Euk GE-WS2 protocol of the Affymetrix protocol was followed. The washing and visualization steps were carried out on a “Fluidics Station 450” (Affymetrix). Each U133—Plus—2 chip was subsequently analyzed on an Agilent G3000 GeneArray Scanner at a resolution of 1.5 microns in order to pinpoint the areas hybridized on the chip. This scanner makes it possible to detect the signal emitted by the fluorescent molecules after excitation with an argon laser using the epifluorescence microscope technique. A signal proportional to the amount of cRNAs bound is thus obtained for each position. The signal was subsequently analyzed using the GeneChip Operating Software (GCOS1.2, Affymetrix).
In order to prevent the variations obtained by using various chips, a normalization approach was carried out using the “bioconductor” tool, which makes it possible to harmonize the mean distribution of the raw data for each chip. The results obtained on a chip can then be compared with the results obtained on another chip. The GCOS 1.2 software also made it possible to include a statistical algorithm for deciding whether or not a gene was expressed. Each gene represented on the U133_Plus—2 chip was covered by 11 to 16 pairs of probes of 25 nucleotides. The term “pair of probes” is intended to mean a first probe which hybridizes perfectly (reference is then made to PM or perfect match probes) with one of the cRNAs derived from a target gene, and a second probe, identical to the first probe with the exception of a mismatch (reference is then made to MM or mismatched probe) at the center of the probe. Each MM probe was used to estimate the background noise corresponding to a hybridization between two nucleotide fragments of noncomplementary sequence (Affymetrix technical note “Statistical Algorithms Reference Guide”; Lipshutz, et al., (1999) Nat. Genet. 1 Suppl., 20-24). The remaining 46 samples showed an average of 42.1% of expressed genes.
On the basis of the 54 000 groups of probes, representing approximately 30 000 genes, of the chip, the inventors selected the relevant genes which were correlated with the development of a breast cancer.
The genes which have an expression level that is too low on the majority of the chips and also the genes which do not show any substantial variation between the various chips were excluded (Li et al., 2001, Bioinformatics, 17: 1131-1142). The search for a panel of genes for distinguishing the groups of patients EFS [French Blood Bank] and CI/II was carried out by a Data Mining technique (http://ligarto.org/rdiaz/Papers/jomadas.bioinfo.randomForest.pdf).
This analysis made it possible to reveal a first panel of genes, comprising 46 relevant genes according to the invention (SEQ ID Nos. 1 to 46). A complementary analysis (SAM, significance analysis of microarrays) also made it possible to reveal additional genes which also proved to be very relevant (SEQ ID Nos. 47 to 74).
The increase or decrease in expression of each of these genes, observed in the S patients compared with the C I/II patients, is shown in table 3.
The inventors also studied the simultaneous expression of 74 genes of nucleotide sequence chosen from SEQ ID Nos. 1 to 74 in order to obtain an expression profile. The results are given in
The inventors also studied the simultaneous expression of 46 genes of nucleotide sequence chosen from SEQ ID Nos. 1 to 46 in order to obtain an expression profile. The results are given in
The inventors also studied the simultaneous expression of 23 genes of nucleotide sequence chosen from SEQ ID Nos. 1 to 23 in order to obtain an expression profile. The results are given in
The inventors also studied the simultaneous expression of 8 genes of nucleotide sequence chosen from SEQ ID Nos. 1 to 8 in order to obtain an expression profile. The results are given in
Biological sample characteristics: The example presented hereinafter was carried out using 46 blood samples (5 ml of whole blood, taken in two PaxGene tubes). These samples included 12 blood samples originating from normal control patients (S, obtained from the French Blood Bank) and 24 samples from patients suffering from a phase III/IV breast cancer (CIII/IV), i.e. an advanced stage of breast cancer.
Extraction of the biological material (total RNA) from the biological sample: this step was carried out in a manner comparable to example 1.
cDNA synthesis, cRNA production and cRNA labeling, and quantification: this step was carried out in a manner comparable to example 1.
Demonstration of an Expression Profile for the Genes which Makes it Possible to Distinguish Between the Control Patients (S) and the Patients Suffering from a Stage III/IV Cancer
The expression of approximately 30 000 genes was analyzed and compared between S and C III/IV patients. For this, 10 μg of fragmented cRNAs derived from each sample were added to a hybridization buffer (Affymetrix) and 200 μl of this solution were brought into contact for 16 h at 45° C. on an expression chip (Human Genome U133Plus2 GeneChip® (Affymetrix)), which comprises 54 000 groups of probes representing approximately 30 000 genes, according to the Affymetrix protocol.
In order to record the best hybridization and washing performance levels, RNAs described as “control” RNAs that were biotinylated (bioB, bioC, bioD and cre) and oligonucleotides (oligo B2) were also included in the hybridization buffer. After the hybridization step, the biotinylated cRNAs hybridized on the chip were visualized using a solution of streptavidin-phycoerythrin and the signal was amplified using an anti-streptavidin antibody. The hybridization was carried out in a “GeneChip Hybridisation oven” (Affymetrix), and the Euk GE-WS2 protocol of the Affymetrix protocol was followed. The washing and visualization steps were carried out on a “Fluidics Station 450” (Affymetrix). Each U133—Plus—2 chip was subsequently analyzed on an Agilent G3000 GeneArray Scanner at a resolution of 1.5 microns in order to pinpoint the areas hybridized on the chip. This scanner makes it possible to detect the signal emitted by the fluorescent molecules after excitation with an argon laser using the epifluorescence microscope technique. A signal proportional to the amount of cRNAs bound is thus obtained for each position. The signal was subsequently analyzed using the GeneChip Operating Software (GCOS 1.2, Affymetrix).
In order to prevent the variations obtained by using various chips, a normalization approach was carried out using the “bioconductor” tool, which makes it possible to harmonize the mean distribution of the raw data for each chip. The results obtained on a chip can then be compared with the results obtained on another chip. The GCOS 1.2 software also made it possible to include a statistical algorithm for deciding whether or not a gene was expressed. Each gene represented on the U133_Plus—2 chip was covered by 11 to 16 pairs of probes of 25 nucleotides. The term “pair of probes” is intended to mean a first probe which hybridizes perfectly (reference is then made to PM or perfect match probes) with one of the cRNAs derived from a target gene, and a second probe, identical to the first probe with the exception of a mismatch (reference is then made to MM or mismatched probe) at the center of the probe. Each MM probe was used to estimate the background noise corresponding to a hybridization between two nucleotide fragments of noncomplementary sequence (Affymetrix technical note “Statistical Algorithms Reference Guide”; Lipshutz, et al., (1999) Nat. Genet. 1 Suppl., 20-24). The remaining 46 samples showed an average of 42.1% of expressed genes.
On the basis of the 54 000 groups of probes, representing approximately 30 000 genes, of the chip, the inventors selected the relevant genes which were correlated with the development of a breast cancer.
The genes which have an expression level that is too low on the majority of the chips and also the genes which do not show any substantial variation between the various chips were excluded (Li et al., 2001, Bioinformatics, 17: 1131-1142). The search for a panel of genes for distinguishing the groups of patients EFS [French Blood Bank] and CI/II was carried out by a Data Mining technique (http://ligarto.org/rdiaz/Papers/jornadas.bioinfo.randomForest.pdf).
This analysis made it possible to reveal a first panel of genes, comprising 54 relevant genes according to the invention (SEQ ID Nos. 1 to 6; 13; 14; 20; 26; 28; 38 to 41; 69; 74 to 111). A complementary analysis (SAM, significance analysis of microarrays) also made it possible to reveal additional genes which also proved to be very relevant (SEQ ID Nos. 36; 37; 46; 48; 59 to 61; 63; 65; 67; 70 to 72; 73; 112 to 139).
The increase or decrease in the expression of each of these genes, observed in the S patients compared with the C I/II patients, is given in table 3.
Homo sapiens, clone IMAGE: 5267398, mRNA
The inventors also studied the simultaneous expression of the 96 genes presented above in order to obtain an expression profile. The results are given in
The inventors also studied the simultaneous expression of 54 genes of nucleotide sequence chosen from SEQ ID Nos. 1 to 6; 13; 14; 20; 26; 28; 38 to 41; 69; 74 to 111, in order to obtain an expression profile. The results are given in
The inventors also studied the simultaneous expression of 29 genes of nucleotide sequence chosen from SEQ ID Nos. 1; 2; 4 to 6; 13; 14; 20; 26; 38; 39; 41; 69; 75; 79 to 81; 87; 89; 93; 95 to 96; 101; 103 to 106; 108; 110, in order to obtain an expression profile. The results are given in
This confirms that the analysis of the expression of all or part of the genes of SEQ ID Nos. 1 to 139 is a good tool for distinguishing between patients suffering from a cancer or not suffering from a cancer, and, if the patient is suffering from a cancer, for determining the stage of progression of his or her cancer.
Number | Date | Country | Kind |
---|---|---|---|
0552087 | Jul 2005 | FR | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/FR2006/001593 | 7/5/2006 | WO | 00 | 1/7/2008 |