The instant application contains a Sequence Listing which has been submitted electronically in XML file format and is hereby incorporated by reference in its entirety. Said XML copy, created on Oct. 25, 2024, is named 60448-701_301_SL.xml and is 19,152 bytes in size.
Only a small portion of mammalian genome is transcribed to protein-coding mRNAs, with the rest is transcribed to non-coding RNAs. Among them, long non-coding RNAs (lncRNA) are a type of RNA with usually more than 200 nucleotides that are not translated into protein. LncRNAs have been shown to regulate gene expression networks at different levels via various mechanisms (see, e.g., Yao et al., Nature Cell Biology, 21, pages 542-551, 2019).
Acute respiratory distress syndrome (ARDS) occurs when fluid builds up in the air sacs (alveoli) of lungs. It can occur in any individuals who are critically ill or who have significant injuries, and it is often fatal. It is estimated that ARDS affects more than 190,000 people in the USA alone annually, with a mortality of 27-45% (see, e.g., Burnham et al., Eur Respir J. 43(1): 276-285, 2014). Patients with ARDS have severe shortness of breath and often are unable to breathe on their own without support from a ventilator. Patients with ARDS often have reduced oxygenation, extensive alveolar damage, lung inflammation, and lung fibrosis.
There is no fundamental treatment for ARDS at this time, and treatments in the clinic, such as oxygen and fluid management, focus on supporting the patient while the lungs heal. Therefore, there exists an urgent need to develop related therapeutics to prevent, alleviate, and treat ARDS and fibrosis and inflammation that accompanied ARDS.
In one aspect, to address the need to prevent, alleviate, and treat ARDS and fibrosis and inflammation that accompanied ARDS, provided herein is a modulator of a long noncoding transcript that is associated with initiation, development, or prognosis of ARDS, or pulmonary fibrosis associated with ARDS. In some aspects, the long-noncoding transcript is transcribed from a genomic region located within chr3:45,806,503 to chr3:45,831,916. Also provided herein is a modulator of a long noncoding transcript, which is transcribed from a genomic region located within chr3:45,806,503 to chr3:45,831,916, and the long noncoding transcript is transcribed using Crick Strand as template.
In some instances, the genomic region is LOC107986083 (chr3: 45,817,379-45,827,511). In other instances, the long noncoding transcript comprises at least a portion of XR_001740681.1 (NCBI), locus ENSG00000288720 (Gencode/ENSEMBL), transcript ENST00000682011.1 (Gencode/ENSEMBL), transcript ENST00000684202.1 (Gencode/ENSEMBL), or transcript RP11-852E15.3 (Gencode).
In some instances, the expression of the long noncoding transcript is elevated in a subject affected by pulmonary fibrosis associated with acute respiratory distress syndrome (ARDS). In other instances, the elevated expression of the long-noncoding transcript is associated with severity of an ARDS symptom in the subject. In other instances, the long noncoding transcript comprises a single nucleotide polymorphism associated with the ARDS.
In some instances, the modulator modifies an expression level and/or an activity of the long noncoding transcript. In some instances, the modulator reduces the elevated expression level and/or activity of long noncoding transcript by at least 50%, at least 60%, at least 70%, at least 80%, at least 90% in a cell or a tissue of the subject, wherein the elevated expression level is associated with initiation, development, or prognosis of ARDS or initiation, development, or prognosis of pulmonary fibrosis associated with ARDS. In other instances, the modulator reduces the elevated expression level and/or activity of the long noncoding transcript to a baseline level.
In some instances, the modulator is a nucleic acid editing or modifying moiety. In some instances, the nucleic acid editing or modifying moiety targets the genomic region, the long noncoding transcript, or a premature form thereof. In some instances, the nucleic acid editing or modifying moiety is a CRISPR-based moiety, a meganuclease-based moiety, a zinc finger nuclease (ZFN)-based moiety, or a transcription activator-like effector-based nuclease (TALEN)-based moiety.
In some instances, the modulator is a synthetic or artificial oligonucleotide or polynucleotide. In some instances, the synthetic or artificial oligonucleotide comprises a nucleic acid sequence complementary to at least 10, 11, 12, 13, 14, or 15 nucleotides of the long noncoding transcript. In some instances, the synthetic or artificial oligonucleotide or polynucleotide is a small interfering RNA (siRNA), a microRNA (miRNA), an inhibitory double stranded RNA (dsRNA), a small or short hairpin RNA (shRNA), an antisense oligonucleotide (ASO), a piwi-interacting RNA (piRNA), a heterogeneous nuclear RNA (hnRNA), a small nuclear RNA (snRNA), or an enzymatically-prepared siRNA (esiRNA) or the precursors thereof.
In some instances, the synthetic oligonucleotide is about 10-50 nucleotides long, about 10-30 nucleotides long, or about 14-20 nucleotides long. In some instances, the synthetic oligonucleotide is about 16 nucleotides long.
In some instances, the synthetic oligonucleotide comprises one or more sugar modifications, one or more phosphate modifications, one or more base modifications, one or more pyrimidine modifications, or any combination thereof. In some instances, the one or more sugar modifications are locked nucleic acid (LNA), tricyclo-DNA, 2′-fluoro, 2′-O-methyl, 2′-methoxyethyl (2′-MOE), 2′-cyclic ethyl (cET), UNA, and conformationally restricted nucleoside (CRN), or any combination thereof. In some instances, the one or more phosphate modifications comprise phosphorothioate internucleotide linkage, methylphosphonate internucleotide linkage, guanidinopropyl phosphoramidate internucleotide linkage, or any combination thereof. In some instances, the one or more base modifications comprise a purine modifications selected from a group consisting of 2,6-diaminopurin, 3-deaza-adenine, 7-deaza-guanine, 8-zaido-adenine, or any combination thereof. In some instances, the one or more base modifications comprise a pyrimidine modifications selected from a group consisting of 2-thio-thymidine, 5-carboxamide-uracil, 5-methyl-cytosine, 5-ethynyl-uracil, or any combination thereof. In some instances, the synthetic oligonucleotide comprises a phosphorodiamidate morpholino oligomer (PMO).
In some instances, the synthetic oligonucleotide is an ASO, wherein the ASO comprises a gapmer comprising a central region of consecutive DNA nucleotides flanked by a 5′-wing region and 3′-wing region, wherein at least one of 5′-wing region and 3′-wing region comprises a nucleic acid analogue. In some instances, the nucleic acid analogue comprises an LNA. In some instances, the LNA comprises a beta-D-oxy LNA, an alpha-L-oxy-LNA, a beta-D-amino-LNA, an alpha-L-amino-LNA, a beta-D-thio-LNA, an alpha-L-thio-LNA, a 5′-methyl-LNA, a beta-D-ENA, or an alpha-L-ENA. In some instances, the LNA comprises a beta-D-oxy LNA.
In some instances, the 5′-wing region comprises at least two LNAs. In some instances, the 5′-wing region comprises three consecutive LNAs. In some instances, the 3′-wing region comprises an LNA. In some instances, the 3′-wing region comprises two consecutive LNAs.
In some instances, the synthetic oligonucleotide comprises at least 9, 10, 11, 12, 13, 14 consecutive nucleotides with no more than 1, 2, 3 mismatches from any one of SEQ ID NOs: 1-4. In some instances, the synthetic oligonucleotide comprises a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical to a sequence selected from any one of SEQ ID NOs: 1-4.
In some aspects, provided herein is a pharmaceutical composition comprising the modulator disclosed herein and a pharmaceutically acceptable salt or derivative thereof. In other aspects, provided herein is a kit comprising the modulator disclosed herein or the pharmaceutical composition disclosed herein.
In some aspects, provided herein is a modulator comprising an antisense oligonucleotide (ASO), wherein the ASO comprises at least 9, 10, 11, 12, 13, 14 consecutive nucleotides with no more than 1, 2, 3 mismatches from SEQ ID NO: 1 (5′-GGATAATGGTTGGTCA-3′). In some instances, the ASO comprises a nucleic acid sequence of 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical to SEQ ID NO: 1 (5′-GGATAATGGTTGGTCA-3′).
In some instances, the ASO comprises a gapmer comprising a central region of consecutive DNA nucleotides flanked by a 5′-wing region and 3′-wing region, wherein at least one of 5′-wing region and 3′-wing region comprises a nucleic acid analogue. In some instances, the nucleic acid analogue comprises a locked nucleic acid (LNA). In some instances, the LNA comprises a beta-D-oxy LNA, an alpha-L-oxy-LNA, a beta-D-amino-LNA, an alpha-L-amino-LNA, a beta-D-thio-LNA, an alpha-L-thio-LNA, a 5′-methyl-LNA, a beta-D-ENA, or an alpha-L-ENA.
In some instances, the 5′-wing region comprises at least two LNAs. In some instances, the 5′-wing region comprises three consecutive LNAs. In some instances, the 3′-wing region comprises an LNA. In some instances, the 3′-wing region comprises two consecutive LNAs.
In some aspects, provided herein is a method of modulating a long noncoding transcript in a subject in need thereof, the method comprising administering to the subject the modulator disclosed herein, or a pharmaceutical composition disclosed herein.
In some aspects, provided herein is a method of preventing, alleviating, or treating pulmonary fibrosis in a subject in need thereof, the method comprising administering to the subject an effective amount of the modulator disclosed herein, or a pharmaceutical composition disclosed herein. In some aspects, provided herein is a method of preventing, alleviating, or treating idiopathic pulmonary fibrosis in a subject in need thereof, the method comprising administering to the subject an effective amount of the modulator disclosed herein, or a pharmaceutical composition disclosed herein.
In some instances, the modulator is expressed or encapsulated in a viral or plasmid vector, a liposome, or a nanoparticle. In some instances, the administering is performed intratracheally, orally, nasally, intravenously, intraperitoneally, or intramuscularly. In some instances, the administering is a targeted delivery to a lung tissue of the subject. In some instances, the administering is in a form of aerosol.
All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference. To the extent publications and patents or patent applications incorporated by reference contradict the disclosure contained in the specification, the specification is intended to supersede and/or take precedence over any such contradictory material.
The novel features of the disclosure are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present disclosure will be obtained by reference to the following detailed description that sets forth illustrative aspects, in which the principles of the disclosure are utilized, and the accompanying drawings (also “Figure” and “FIG.” herein), of which:
The present disclosure includes that long noncoding transcripts (e.g., lncRNA) that are associated with onset, development or prognosis of acute respiratory distress syndrome (ARDS) are identified, and that differential expression or transcriptional regulation of the long noncoding transcripts are associated with symptoms of with onset, development or prognosis of ARDS. In some aspects, the long noncoding transcripts are associated with onset or development of pulmonary fibrosis that is associated with or induced by ARDS or a symptom of ARDS. Such long noncoding transcripts could be a druggable target to prevent, alleviate, or treat ARDS or pulmonary fibrosis associated with onset, development or prognosis of, or induced by. As such, in one aspect, provided herein, are modulators of a long noncoding transcript associated with onset, development or prognosis of ARDS, or associated with onset or development of pulmonary fibrosis associated with or induced by ARDS or a symptom of ARDS. Also provided herein includes pharmaceutical compositions comprising the modulator, and kits comprising the modulator. Also provided herein are methods of modulating a long noncoding transcript associated with onset, development or prognosis of ARDS or associated with onset or development of pulmonary fibrosis that is associated with or induced by ARDS or a symptom of ARDS in a subject in need thereof. Further provided herein are methods of preventing, alleviating, or treating ARDS or preventing, alleviating, or treating pulmonary fibrosis or inflammation in a subject affected by ARDS.
Long noncoding transcripts (lncRNAs) Associated with ARDS Induced Pulmonary Fibrosis
ARDS, which occurs when liquid builds up in lungs, is caused by or associated with viral infection, for example, infection of pulmonary disease-associated virus (e.g., COVID-19) or human immunodeficiency virus (HIV), and/or caused by or associated with increased immune response (e.g., autoimmune disease). Particularly, it has been reported that ARDS and lung failure are the main lung diseases in COVID-19 patients, and they are proportional to the severity of COVID-19 (see, e.g., Aslan et al., Pneumonia volume 13, Article number: 14, 2021). The progression of the ARDS is generally shown in three overlapping stages: exudative stage, proliferative stage, and fibrotic stage. Exudative stage is represented by various inflammatory symptoms, including cytokine releases and influx of immune cells (e.g., neutrophils). Proliferative stage is characterized by early fibrotic changes, which often progresses to fibrotic stage. Fibrotic stage, which is an advanced stage of the ARDS, is characterized by intra-alveolar and interstitial fibrosis, increased collagen deposition, a prolonged period of ventilation-perfusion mismatching, and diminished compliance of the lung. (see, e.g., Walkey et al., Clin. Epidemiol., 2012; 4: 159-19). A significant portion of ARDS patients advances to proliferative stage and even to fibrotic stage, and the hallmarks of such stages, intra-alveolar and interstitial fibrosis, have been associated with a poor prognosis with high mortality and/or prolonged ventilator dependence.
Long non-coding transcripts (or long non-coding RNAs (lncRNAs)) are RNA segments that lack protein-coding capacity, yet mediate various regulatory mechanisms in cell cycle or cell metabolism by regulating transcription and/or post-transcriptional modification of various genes. As such dysregulation of certain long non-coding transcripts can be associated with an onset, development, or prognosis of a disease or a symptom of a disease. Alternatively and/or additionally, dysregulation of certain long non-coding transcripts can be a signature or indication of an onset, development, or prognosis of a disease or a symptom of a disease.
Some long non-coding transcripts affect development of certain types of fibrosis by promoting extracellular matrix (ECM) synthesis by affecting fibroblast cells in the tissue(s). In some aspects, as provided herein, certain long non-coding transcript is associated with onset, development, and/or prognosis of ARDS. In some aspects, certain long non-coding transcript is associated with onset, development, and/or prognosis of pulmonary fibrosis associated with ARDS. In some instances, the pulmonary fibrosis is induced by ARDS. In some instances, the pulmonary fibrosis is resulted from one or more symptoms or pathophysiology of ARDS.
In some aspects, the expression of the long noncoding transcripts disclosed herein is elevated in a subject affected by fibrosis (e.g., pulmonary fibrosis) associated with ARDS. In some aspects, the expression of the long noncoding transcripts disclosed herein is elevated in a subject affected by ARDS. In some aspects, the expression of the long noncoding transcripts disclosed herein is elevated in a subject affected by ARDS related to or resulted from COVID-19 infection. In some aspects, the expression of the long noncoding transcripts disclosed herein is elevated in a subject affected by COVID-19 infection. In some aspects, the expression of the long noncoding transcripts disclosed herein is elevated in a subject having an inflammation or increased immune response associated with onset or development of ARDS. In some aspects, the inflammation or increased immune response is represented or shown by increased infiltration of immune cells to the tissue (e.g., lung tissue), activation of immune cells in the lung tissue or to the lung tissue (including lung fibroblast), increased secretion or accumulation of inflammatory cytokines or chemokines in the lung tissue. In some specific aspects, the elevation is at least by 1%, 2%, 3%, 4%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, 300%, 400%, 500%, 600%, 700%, 800%, 900%, or 1000%. In some aspects, the activity of the long noncoding transcripts disclosed herein is elevated in a subject affected by fibrosis associated with ARDS. In some specific aspects, the elevation is at least by 1%, 2%, 3%, 4%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, 300%, 400%, 500%, 600%, 700%, 800%, 900%, or 1000%.
In some aspects, the elevated expression and/or activity of the long-noncoding transcript disclosed herein is associated with severity of an ARDS symptom in the subject. In some specific aspects, the elevated expression is associated with severity of labored and rapid breathing. In some aspects, the elevated expression is associated with severity of shortness of breath. In some aspects, the elevated expression is associated with severity of low blood pressure. In some aspects, the elevated expression is associated with severity of extreme tiredness.
In some aspects, the elevated expression and/or activity of the long-noncoding transcript disclosed herein is detected in the pulmonary myofibroblasts compared to fibroblasts. In some aspects, the elevated expression and/or activity of the long-noncoding transcript disclosed herein is detected in the induced pulmonary myofibroblasts compared to fibroblasts. In some instances, the expression level of the long-noncoding transcript is increased in the pulmonary myofibroblasts or induced pulmonary myofibroblasts at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, 300%, 400%, 500% compared to the fibroblasts.
In some aspects, the long-noncoding transcript disclosed herein is associated with or modulates expression or activity of one or more fibrosis marker genes. In some instances, the fibrosis-related markers comprise smooth muscle a actin (ACTA2), alpha 1 chain of collagen type I (COL1A1), alpha 1 chain of collagen type 3 (COL3A1), fibroblast activation protein (FAP), fibronectin 1 (FN1), periostin (POSTN), or a combination thereof. In some instances, the increased expression of noncoding transcript disclosed herein is proportional to the increased expression of one or more fibrosis marker genes.
As such, in some aspects, inhibition of expression or activity of the long-noncoding transcript modulates expression of one or more fibrosis marker genes (e.g., ACTA2, COL1A1, COL3A1, FAP, FN1, POSTN, etc.). In some instances, inhibition of expression or activity of the long-noncoding transcript decreases expression of one or more fibrosis marker genes (e.g., ACTA2, COL1A1, COL3A1, FAP, FN1, POSTN, etc.). In some instances, inhibition of expression or activity of the long-noncoding transcript decreases expression of one or more fibrosis marker genes (e.g., ACTA2, COL1A1, COL3A1, FAP, FN1, POSTN, etc.) at least 5%, 10%, 15%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% or more. In some instances, inhibition of expression or activity of the long-noncoding transcript decreases expression of one or more fibrosis marker genes (e.g., ACTA2, COL1A1, COL3A1, FAP, FN1, POSTN, etc.) at least 5%, 1, 15%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% or more from the increased expression level of the one or more fibrosis marker genes in the myofibroblast, induced myofibroblast, a cell affected by pulmonary fibrosis associated with ARDS, a cell affected by ARDS, a cell affected by COVID-19 related ARDS, a tissue affected by pulmonary fibrosis associated with ARDS, a tissue affected by ARDS, or a tissue affected by COVID-19 related ARDS. In some instances, inhibition of expression or activity of the long-noncoding transcript modulates a fibrosis status of a tissue affected by pulmonary fibrosis associated with ARDS, a tissue affected by ARDS, or a tissue affected by COVID-19 related ARDS. In some instances, inhibition of expression or activity of the long-noncoding transcript reduces or prevents progress of a fibrosis status of a tissue affected by pulmonary fibrosis associated with ARDS, a tissue affected by ARDS, or a tissue affected by COVID-19 related ARDS. In some instances, inhibition of expression or activity of the long-noncoding transcript reverses progress of a fibrosis status of a tissue affected by pulmonary fibrosis associated with ARDS, a tissue affected by ARDS, or a tissue affected by COVID-19 related ARDS.
In some aspects, the long noncoding transcript disclosed herein is transcribed from a genomic region located at the same chromosome with one or more fibrosis marker genes. In some aspects, the long-noncoding transcript disclosed herein is transcribed from a genomic region where one or more single nucleotide polymorphism (SNP) associated with COVID-19 risk variants are located. In some aspects, the long-noncoding transcript disclosed herein is transcribed from a genomic allele where one or more single nucleotide polymorphism (SNP) associated with COVID-19 risk variants are located. As used herein, COVID-19 risk variant is any genetic or epigenetic variation, mutation, or modification that contributes to COVID-19 susceptibility, severity, and/or mortality. In some embodiments, a certain aspect of COVID-19 susceptibility, severity, and/or mortality relates to respiratory failure. In some embodiments, a certain aspect of COVID-19 susceptibility, severity, and/or mortality relates to an increased risk of respiratory failure. In some embodiments, respiratory failure is defined as a condition of a subject in which a medical treatment comprising the use of oxygen supplementation or mechanical ventilation is administered to the subject as an indicated medical intervention. In some embodiments, a severity of respiratory failure is graded according to a maximum level of respiratory support received by a subject at any point during hospitalization (e.g., in order of increasing severity of respiratory support: supplemental oxygen therapy only, noninvasive ventilatory support, invasive ventilatory support, extracorporeal membrane oxygenation). In some embodiments, a certain aspect of COVID-19 susceptibility, severity, and/or mortality relates to an increased risk of respiratory failure of at least two-fold. In some embodiments, a certain aspect of COVID-19 susceptibility, severity, and/or mortality relates to interstitial pneumonia. In some embodiments, a certain aspect of COVID-19 susceptibility, severity, and/or mortality relates to bilateral interstitial pneumonia. n some embodiments, a certain aspect of COVID-19 susceptibility, severity, and/or mortality relates to an elevated risk of bilateral interstitial pneumonia. In some instances, the COVID-19 risk variant is identified from genome-wide association study (GWAS).
In some aspects, the genomic region in which the long-noncoding transcript (lncRNAs) are transcribed as disclosed herein is located within chromosome 3. In some aspects, the genomic region in which the long-noncoding transcript (lncRNAs) are transcribed as disclosed herein is annotated according to the Human Genome Resources at NCBI. In some embodiments, the genomic region in which the long-noncoding transcript (lncRNAs) are transcribed as disclosed herein is annotated according to the Human Genome Resources at NCBI according to Assembly: GRCh30.p14 (GCF_000001405.40). In some aspects, the genomic region in which the long-noncoding transcript (lncRNAs) are transcribed as disclosed herein is located within chr3:45,806,503 to chr3:45,834,110. In some aspects, the genomic region in which the long-noncoding transcript (lncRNAs) are transcribed as disclosed herein is located within chr3:45,806,503 to chr3:45,831,916. In some aspects, the genomic region in which the long-noncoding transcript (lncRNAs) are transcribed as disclosed herein is located within chr3:45,796,552 to chr3: 45,834,110. In some aspects, the genomic region in which the long-noncoding transcript (lncRNAs) are transcribed as disclosed herein is located within chr3:45,796,552 to chr3: 45,831,916. In some aspects, the genomic region in which the long-noncoding transcript (lncRNAs) are transcribed as disclosed herein is located within chr3:45,796,552 to chr3: 45,829,603. In certain aspects, the genomic region in which the long-noncoding transcript (lncRNAs) are transcribed as disclosed herein is located within LOC107986083 (chr3: 45,817,379-45,827,511). In some aspects, the long-noncoding transcript disclosed herein is transcribed using Crick Strand of the genomic region as a template.
In some aspects, the long noncoding transcript disclosed herein comprises a nucleic acid sequence of at least a portion of XR_001740681.1 (NCBI). In some aspects, the long noncoding transcript disclosed herein comprises a nucleic acid sequence of at least a portion of locus ENSG00000288720 (Gencode/ENSEMBL). In some aspects, the long noncoding transcript disclosed herein comprises a nucleic acid sequence of at least a portion of transcript ENST00000682011.1 (Gencode/ENSEMBL). In other aspects, the long noncoding transcript disclosed herein comprises a nucleic acid sequence of at least a portion of transcript ENST00000684202.1 (Gencode/ENSEMBL). In other aspects, the long noncoding transcript disclosed herein comprises a nucleic acid sequence of at least a portion of transcript RP11-852E15.3 (Gencode).
In some aspects, the lncRNA associated with ARDS induced pulmonary fibrosis is transcribed from the genomic region LOC107986083 (chr3: 45,817,379-45,827,511). In some aspects, the lncRNA associated with ARDS induced pulmonary fibrosis is transcribed from the Crick strand of the genomic region LOC107986083 (chr3: 45,817,379-45,827,511). In some aspects, the lncRNA associated with ARDS induced pulmonary fibrosis comprises at least a portion of XR_001740681.1 (NCBI). In some aspects, the lncRNA associated with ARDS induced pulmonary fibrosis comprises at least a portion of locus ENSG00000288720 (Gencode/ENSEMBL). In some aspects, the lncRNA associated with ARDS induced pulmonary fibrosis comprises at least a portion of transcript ENST00000682011.1 (Gencode/ENSEMBL). In other aspects, the lncRNA associated with ARDS induced pulmonary fibrosis comprises at least a portion of transcript ENST00000684202.1 (Gencode/ENSEMBL). In other aspects, the lncRNA associated with ARDS induced pulmonary fibrosis comprises at least a portion of transcript RP11-852E15.3 (Gencode). In some aspects, the lncRNA associated with ARDS induced pulmonary fibrosis is also associated with leukocyte infiltration accompanying ARDS. In some aspects, the lncRNA associated with ARDS induced pulmonary fibrosis is also associated with leukocyte infiltration preceding onset of ARDS.
In some aspects, the lncRNA associated with ARDS induced pulmonary fibrosis is transcribed from the human genomic region on chromosome 3 (chr3: 45,796,552-chr3: 45,834,110). In some aspects, the lncRNA associated with ARDS induced pulmonary fibrosis is transcribed from the human genomic region on chromosome 3 (chr3: 45,817,792-chr3: 45,834,110). In some aspects, the lncRNA associated with ARDS induced pulmonary fibrosis is transcribed from the genomic region LOC126806670 chr3:45,817,792-chr3: 45,818,991). In some aspects, In some aspects, the lncRNA associated with ARDS induced pulmonary fibrosis is transcribed from the Crick strand of the human genomic region on chromosome 3. In some embodiments, the lncRNA associated with ARDS induced pulmonary fibrosis is transcribed from the human genomic region on chromosome 3 in which the genome location is annotated according to GRCh37/hgl9. In some aspects, the lncRNA associated with ARDS induced pulmonary fibrosis comprises at least a portion of an enhancer contained within LOC126806670.
In some aspects, the lncRNA associated with ARDS induced pulmonary fibrosis is also associated with leukocyte infiltration accompanying ARDS. In some aspects, the lncRNA associated with ARDS induced pulmonary fibrosis is also associated with leukocyte infiltration preceding onset of ARDS.
In some aspects, the long noncoding transcript disclosed herein comprises a single nucleotide polymorphism (SNP) associated with the ARDS. In some aspects, the long noncoding transcript disclosed herein comprises a single nucleotide polymorphism (SNP) associated with the COVID-19 risk. In some aspects, the long noncoding transcript disclosed herein comprises a single nucleotide polymorphism (SNP) that is an COVID-19 risk variant SNP. In some aspects, the long noncoding transcript disclosed herein is transcribed from a genomic region in which one or more single nucleotide polymorphism (SNP) associated with the COVID-19 risk is located. In some aspects, the long noncoding transcript disclosed herein is transcribed from a genomic region in which one or more single nucleotide polymorphism (SNP) that is an COVID-19 risk variant SNP. In some embodiments, the COVID-19 risk variant SNP has been identified in a GWAS study. In some embodiments, the COVID-19 risk variant SNP is rs1040770, rs1886814, rs72711165, rs10774671, rs77534576, rs1819040, rs74956615, rs2109069, rs13050728, or rs17713054. In some embodiments, the COVID-19 risk variant SNP is rs17713054. In some embodiments, the COVID-19 risk variant SNP is described in (Severe Covid-19 GWAS Group et al. Genomewide Association Study of Severe Covid-19 with Respiratory Failure. N Engl J Med. 2020 Oct. 15; 383(16):1522-1534) which is herein incorporated by reference for techniques of identifying and characterizing COVID-19 risk variant SNPs. In some embodiments, the COVID-19 risk variant SNP is described in (Downes D J et al. Identification of LZTFL1as a candidate effector gene at a COVID-19 risk locus. Nat Genet. 2021 November; 53(11):1606-1615) which is herein incorporated by reference for techniques of identifying and characterizing COVID-19 risk variant SNPs. In some embodiments, the COVID-19 risk variant SNP is linked to a causative allele affecting a component of COVID-19 risk in a subject. In some embodiments, the COVID-19 risk variant SNP allele is in linkage disequilibrium to a nearby allele of a genetic variant affecting a component of COVID-19 risk in a subject. In some embodiments, a long noncoding transcript (lncRNA) comprises a COVID-19 risk variant SNP. In some embodiments, a long noncoding transcript (lncRNA) does not comprise a COVID-19 risk variant SNP. In some embodiments, a COVID-19 risk variant SNP is in a genomic region in close proximity to a long noncoding transcript (lncRNA)affecting a component of COVID-19 risk in a subject. In some embodiments, COVID-19 risk includes risk of a subject exhibiting one or more symptoms of COVID-19. In some embodiments, COVID-19 risk includes risk of a subject exhibiting a certain degree of severity of one or more symptoms of COVID-19. In some aspects, the long noncoding transcript disclosed herein comprises rs17713054. In some aspects, the long noncoding transcript disclosed herein does not comprise rs17713054. In some aspects, increased CEBPβ binding at an enhancer of the long non-coding transcript disclosed herein due to rs1773054 risk variant facilitates PU.1 binding and transactivation of enhancer and the expression of the long noncoding transcript disclosed herein.
In some instances, the long noncoding transcript is preferentially or highly expressed in pulmonary tissues in a healthy individual. In some instances, the expression level of the long noncoding transcript is at least 10%, 20%, 30%, 40%, 50% higher in pulmonary tissues than other tissues in a healthy individual. In some instances, the long noncoding transcript is ubiquitously expressed in various body tissues in a healthy individual. In some instances, the expression of the long noncoding transcript is preferentially or specifically increased in the pulmonary tissues than other tissues in an individual affected by pulmonary fibrosis associated with or induced by ARDS associated with COVID-19 infection. In some instances, the expression level of long noncoding transcript is at least 10%, 20%, 30%, 40%, 50% higher in the pulmonary tissues than other tissues in an individual affected by pulmonary fibrosis associated with or induced by ARDS associated with COVID-19 infection. In some instances, the expression of the long noncoding transcript is preferentially or specifically increased in the pulmonary tissues than other tissues in an individual affected by COVID-19 infection. In some instances, the expression level of long noncoding transcript is at least 10%, 20%, 30%, 40%, 50% higher in the pulmonary tissues than other tissues in an individual affected by COVID-19 infection. In some embodiments, the expression level of long noncoding transcript in the pulmonary tissue is in pulmonary mesenchymal cells. In some embodiments, the expression level of long noncoding transcript in the pulmonary tissue is in pulmonary fibroblasts. In some embodiments, the pulmonary fibroblasts are interstitial resident fibroblasts (iReFs). In some embodiments, the expression level of long noncoding transcript in the pulmonary tissue is in myofibroblasts. In some embodiments, the expression level of long noncoding transcript in the pulmonary tissue is in matrix fibroblasts. In some embodiments, the expression level of long noncoding transcript in the pulmonary tissue is in lipofibroblasts. In some embodiments, the expression level of long noncoding transcript in the pulmonary tissue is in fibrocytes. In some embodiments, the expression level of long noncoding transcript in the pulmonary tissue is in alveolar niche cells. In some embodiments, the expression level of long noncoding transcript in the pulmonary tissue is in alveolar niche progenitor cells. In some embodiments, the expression level of long noncoding transcript in the pulmonary tissue is in respiratory epithelial cells. In some embodiments, the expression level of long noncoding transcript in the pulmonary tissue is in ciliated pseudostratified columnar epithelium. In some embodiments, the cells types in ciliated pseudostratified columnar epithelium in which the long noncoding transcript is expressed are ciliated cells, goblet cells, basal cells, brush cells, or neuroendocrine cells, or any combination thereof. In some embodiments, the expression level of long noncoding transcript in the pulmonary tissue is in Type I pneumocytes (alveolar type I epithelial cells), Type II pneumocytes (alveolar type II epithelial cells), both Type I pneumocytes and Type II pneumocytes. In some embodiments, the expression level of long noncoding transcript in the pulmonary tissue is in alveolar macrophages, neutrophils, T cells, or any combination thereof. In some embodiments, the expression level of long noncoding transcript in the pulmonary tissue is in endothelial cells.
In some aspects, the modulator disclosed herein modifies the genomic DNA that is transcribed to the long noncoding transcript disclosed herein. In some instances, the modulator disclosed herein modifies a portion of such genomic DNA so that the genomic DNA is mutated.
In some instances, the modulator disclosed herein modifies a portion of such genomic DNA so that the transcription level is suppressed. Accordingly, in some instances, the modulator reduces the amount of the long noncoding transcript. In some instances, the modulator disclosed herein modifies a portion of such genomic DNA so that the transcription level is activated. In some aspects, the modulator disclosed herein modifies the long noncoding transcript disclosed herein.
In some instances, the modulator disclosed herein modifies a portion of such long noncoding transcript so that the long noncoding transcript is degraded. In some instances, the modulator disclosed herein modifies a portion of such long noncoding transcript so that the long noncoding transcript is retained longer. In some instances, the modulator disclosed herein modifies a portion of such long noncoding transcript so that the activity of the long noncoding transcript on downstream reactions or pathways is suppressed. Accordingly, in some instances, the modulator reduces the activity of the long noncoding transcript. In some instances, the modulator disclosed herein modifies a portion of such long noncoding transcript so that the activity of the long noncoding transcript on downstream reactions or pathways is activated.
In some aspects, a modulator of the long noncoding transcript increases or decreases of the expression or activity of the long noncoding transcript. In some instances, a modulator of the long noncoding transcript prevent increases or decreases of the expression or activity of the long noncoding transcript. In some instances, a modulator of the long noncoding transcript reverses the pathological changes of the expression or activity of the long noncoding transcript. In some instances, a modulator of the long noncoding transcript reverses the pathological changes of the expression or activity of the long noncoding transcript at least ±5%, ±10%, ±15%, ±20%, ±25% of the normal expression or activity of the long noncoding transcript before the onset or development of the pathological symptoms or diseases (e.g., baseline level), or at least ±5%, ±10%, ±15%, ±20%, ±25% of the expression or activity of the long noncoding transcript of a healthy individual or healthy tissue (e.g., baseline level). In some aspects, the modulator disclosed herein reduces the elevated amount of long noncoding transcript by at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90% in a cell or a tissue of the subject, wherein the elevated amount is associated with ARDS. In some aspects, the modulator disclosed herein reduces the elevated activity of long noncoding transcript by at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90% in a cell or a tissue of the subject, wherein the elevated amount is associated with ARDS.
In some aspects, a long noncoding transcript modulates transcription of one or more downstream gene, which expression is a marker for a disease onset, development, or prognosis.
As such, in some instances, a modulator of the long noncoding transcript affects (e.g., increases or decreases) the expression or activity of the downstream gene or the long noncoding transcript.
In some instances, a modulator of the long noncoding transcript increases the expression or activity of one or more downstream genes at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, or higher. In some instances, a modulator of the long noncoding transcript decreases the expression or activity of one or more downstream genes at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, or higher. In some instances, a modulator of the long noncoding transcript reverses the pathological changes of the expression or activity of one or more downstream genes. In some instances, a modulator of the long noncoding transcript reverses the pathological changes of the expression or activity of one or more downstream genes to the level differing no more than ±5%, ±10%, ±15%, ±20%, ±25% from the normal expression or activity of the long noncoding transcript before the onset or development of the pathological symptoms or diseases. In some instances, a modulator of the long noncoding transcript reverses the pathological changes of the expression or activity of one or more downstream genes to the level differing no more than at least +5%, ±10%, ±15%, ±20%, ±25% from the normal expression or activity of the long noncoding transcript of a healthy individual or healthy tissue. In some embodiments, the downstream gene is a marker gene for pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, e.g., ACTA2, COL1A1, COL3A1, FAP, FN1, POSTN, etc. In some embodiments, the downstream gene is a marker gene for pulmonary immune cell infiltration associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection. In some embodiments, the downstream gene is a marker gene associated with leukocyte migration. In some embodiments, the downstream gene is a marker gene associated with leukocyte chemotaxis. In some embodiments, the downstream gene is a marker gene associated with leukocyte proliferation. In some embodiments, the downstream gene is a marker gene associated with leukocyte activation. In some embodiments, the one or more downstream genes are included as part of an IPF gene signature. In some embodiments, the IPF gene signature is a human IPF gene signature constructed from expression data derived from human cells. In some embodiments, the IPF gene signature is a mouse IPF gene signature constructed from expression data derived from mouse cells. In some embodiments, the one or more downstream genes included as part of an IPF gene signature are upregulated in pulmonary fibrosis associated with or induced by ARDS, or in ARDS associated with COVID-19 infection. In some embodiments, the one or more upregulated downstream genes included as part of a human IPF gene signature are listed in Table 1. In some embodiments, the one or more downstream genes included as part of a human or mouse IPF gene signature are downregulated in pulmonary fibrosis associated with or induced by ARDS, or in ARDS associated with COVID-19 infection. In some embodiments, the one or more downregulated downstream genes included as part of a human IPF gene signature are listed in Table 2. In some embodiments, the one or more downstream genes included as part of an IPF gene signature comprise one or more upregulated downstream genes and one or more downregulated downstream genes. In some instances, a modulator of the long noncoding transcript decreases the expression or activity of one or more downstream genes listed in Table 1. In some instances, a modulator of the long noncoding transcript increases the expression or activity of one or more downstream genes listed in Table 2. In some instances, a modulator of the long noncoding transcript decreases the expression or activity of one or more downstream genes listed in Table 1 and increases the expression or activity of one or more downstream genes listed in Table 2. In some embodiments, the IPF gene signature is constructed according to Example 7 herein.
In some aspects, a long noncoding transcript modulates or expression of a long noncoding transcript is associated with cell phenotypes. As such, in some instances, a modulator of the long noncoding transcript affects cell phenotypes. For example, a modulator of the long noncoding transcript inhibits or prevents the morphological or physiological changes of a fibroblast to protomyofibroblast or to myofibroblast in a tissue affected by pulmonary fibrosis associated with or ARDS, e.g., ARDS associated with COVID-19 infection, or in a tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript delays the morphological or physiological changes of a fibroblast to protomyofibroblast or to myofibroblast in a tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in a tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript reverses the morphological or physiological changes from protomyofibroblast or myofibroblast to fibroblast, or from fibroblast to protomyofibroblast in a tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in a tissue affected by COVID-19 infection.
In some aspects, modulation of long noncoding transcript that is associated with onset, development, prognosis of a disease indication can be used to identify the druggable target for treating disease or can be used as a tool for diagnosis of the disease. As such, a modulator of the long noncoding transcript affects onset, development, prognosis of a disease indication. In some instances, a modulator of the long noncoding transcript affects onset, development, prognosis of pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection. In some instances, a modulator of the long noncoding transcript reduces symptoms or progress of pulmonary fibrosis associated with or induced by COVID-19 infection. In some instances, a modulator of the long noncoding transcript reduces intensity or severity of symptoms or progress of pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection. In some instances, a modulator of the long noncoding transcript reverses symptoms or progress of pulmonary fibrosis associated with or induced by ARDS associated with COVID-19 infection. In some instances, a modulator of the long noncoding transcript reduces ECM synthesis in the tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in the tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript increases or facilitate inflammation, inflammation symptoms, release of inflammation related cytokines or chemokines in the tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in the tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript increases or facilitate immune cell activation and/or immune cell infiltration in the tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in the tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript decreases or inhibits immune cell activation and/or immune cell infiltration in the tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in the tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript decreases or inhibits leukocyte infiltration, migration, and/or chemotaxis in the tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in the tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript decreases or inhibits granulocyte infiltration, migration, and/or chemotaxis in the tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in the tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript decreases or inhibits monocyte infiltration, migration, and/or chemotaxis in the tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in the tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript decreases or inhibits lymphocyte infiltration, migration, and/or chemotaxis in the tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in the tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript decreases or inhibits leukocyte activation in the tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in the tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript decreases or inhibits leukocyte cell-cell adhesion in the tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in the tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript decreases or inhibits leukocyte proliferation in the tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in the tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript decreases or inhibits leukocyte antigen processing and presentation in the tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in the tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript decreases or inhibits a leukocyte function sensitive to modulation by INFγ in the tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in the tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript decreases or inhibits T cell activation in the tissue affected by pulmonary fibrosis associated with or induced by ARDS, e.g., ARDS associated with COVID-19 infection, or in the tissue affected by COVID-19 infection. In some instances, a modulator of the long noncoding transcript decreases or inhibits one or more cellular functions listed by one or more GO terms in
In some aspects, modulation of long noncoding transcript that is associated with onset, development, prognosis of a disease indication can regulate a transition in cell type. In some embodiments, the regulated transition in cell type is a transition from fibroblast to myofibroblast. In some embodiments, lung fibroblasts exhibit a higher rate or higher proportion of cells undergoing apoptosis. In some embodiments, lung fibroblasts do not exhibit a significant change in rate or proportion of cells undergoing apoptosis. In some embodiments, an extent of lung myofibroblast quiescence influences or correlates with a rate or a proportion of lung fibroblast cells undergoing apoptosis. In some embodiments, a greater extent of lung myofibroblast quiescence influences or correlates with a greater rate or a greater proportion of lung fibroblast cells undergoing apoptosis compared with lung tissue having a lesser extent of lung myofibroblast quiescence. In some embodiments, Annexin V and DAPI staining can be used to determine a rate or a proportion of lung fibroblast cells undergoing apoptosis. In some embodiments, Caspase 3/7 and LDD staining can be used to determine a rate or a proportion of lung fibroblast cells undergoing apoptosis. In some embodiments, treatment comprising administering an ASO targeting a lncRNA described herein does not significantly increase apoptosis in treated cells. In some embodiments, treatment comprising administering ASO-1 targeting hsCORAL does not significantly increase apoptosis in treated cells. In some embodiments, treatment comprising administering ASO-1 targeting hsCORAL does not significantly increase apoptosis in lung fibroblasts. In some embodiments, treatment comprising administering ASO-1 targeting hsCORAL to lung cells does not significantly increase apoptosis in lung fibroblasts compared with a treatment comprising administering a scrambled ASO to lung cells. In some embodiments, treatment comprising administering ASO-1 targeting hsCORAL to a subject does not significantly increase apoptosis in lung fibroblasts compared with a treatment comprising administering a scrambled ASO to a subject.
Types of Modulators
In some aspects, the modulator disclosed herein is a nucleic acid editing or modifying tool. In some instances, the nucleic acid editing or modifying tool targets the genomic region disclosed herein. In some instances, the nucleic acid editing or modifying tool targets the long noncoding transcript. In some instances, the nucleic acid editing or modifying tool targets a premature form of the long noncoding transcript.
In some aspects, the nucleic acid editing or modifying tool is a programmable nucleic acid sequence specific endonuclease. In some instances, the nucleic acid editing or modifying tool is a nucleic acid guided endonuclease. In some instances, the nucleic acid editing or modifying tool is a CRISPR-based tool. In other instances, the nucleic acid editing or modifying tool is a meganuclease-based tool. In other instances, the nucleic acid editing or modifying tool is a zinc finger nuclease (ZFN)-based tool. In other aspects, the nucleic acid editing or modifying tool is a transcription activator-like effector-based nuclease (TALEN)-based tool. In other instances, the nucleic acid editing or modifying tool is an Argonaute system.
In some aspects, the CRISPR-based tool disclosed herein is a Type I, Type II, Type III, Type IV, Type V, or Type VI CRISPR system. CRISPR/Cas systems may be multi-protein systems or single effector protein systems. Multi-protein, or Class 1, CRISPR systems include Type I, Type III, and Type IV systems. In some instances, Class 2 systems include a single effector molecule and include Type II, Type V, and Type VI. In some aspects, the CRISPR-based tool disclosed herein comprises a single or multiple effector proteins. An effector protein may comprise one or multiple nuclease domains. An effector protein may target DNA or RNA, and the DNA or RNA may be single stranded or double stranded. Effector proteins may generate double strand or single strand breaks. Effector proteins may comprise mutations in a nuclease domain thereby generating a nickase protein. Effector proteins may comprise mutations in one or more nuclease domains, thereby generating a catalytically dead nuclease that is able to bind but not cleave a target sequence.
In some aspects, the CRISPR-based tool disclosed comprises a single or multiple guiding RNAs (gRNAs). In some aspects, the gRNA disclosed herein targets a portion of chr3:45,806,503 to chr3:45,834, 110. In some aspects, the gRNA disclosed herein targets a portion of chr3:45,806,503 to chr3:45,831,916. In some aspects, the gRNA disclosed herein targets a portion of chr3:45,796,552 to chr3: 45,834,110. In some aspects, the gRNA disclosed herein targets a portion of chr3:45,796,552 to chr3: 45,831,916. In some aspects, the gRNA disclosed herein targets a portion of chr3:45,796,552 to chr3: 45,829,603. In some aspects, the gRNA disclosed herein targets a portion of LOC107986083 (chr3: 45,817,379-45,827,511). The gRNA may comprise a crRNA. The gRNA may comprise a chimeric RNA with crRNA and tracrRNA sequences. The gRNA may comprise a separate crRNA and tracrRNA. Target nucleic acid sequences may comprise a protospacer adjacent motif (PAM) or a protospacer flanking site (PFS). The PAM or PFS may be 3′ or 5′ of the target or protospacer site. Cleavage of a target sequence may generate blunt ends, 3′ overhangs, or 5′ overhangs.
The gRNA disclosed herein may comprise a spacer sequence. Spacer sequences may be complementary to target sequences or protospacer sequences. Spacer sequences may be 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, or 36 nucleotides in length. In some aspects, the spacer sequence may be less than 10 or more than 36 nucleotides in length.
The gRNA disclosed herein may comprise a repeat sequence. In some aspects, the repeat sequence is part of a double stranded portion of the gRNA. A repeat sequence may be 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 nucleotides in length. In some aspects, the spacer sequence may be less than 10 or more than 50 nucleotides in length.
The gRNA disclosed herein may comprise one or more synthetic nucleotides, non-naturally occurring nucleotides, nucleotides with a modification, deoxyribonucleotide, or any combination thereof. Additionally and/or alternatively, a gRNA may comprise a hairpin, linker region, single stranded region, double stranded region, or any combination thereof. Additionally or alternatively, a gRNA may comprise a signaling or reporter molecule.
The gRNA disclosed herein may be encoded by genetic or episomal DNA. The gRNA disclosed herein may be provided or delivered concomitantly with a CRISPR nuclease or sequentially. The gRNA disclosed herein may be chemically synthesized, in vitro transcribed or otherwise generated using standard RNA generation techniques known in the art.
The CRISPR-based tool disclosed herein can be a Type II CRISPR system, for example a Cas9 system. The Type II nuclease can comprise a single effector protein, which, In some aspects, comprises a RuvC and HNH nuclease domains. In some aspects, a functional Type II nuclease may comprise two or more polypeptides, each of which comprises a nuclease domain or fragment thereof. The target nucleic acid sequences may comprise a 3′ protospacer adjacent motif (PAM). In some aspects, the PAM may be 5′ of the target nucleic acid. Guide RNAs (gRNA) may comprise a single chimeric gRNA, which contains both crRNA and tracrRNA sequences. In some instances, the gRNA may comprise a set of two RNAs, for example a crRNA and a tracrRNA. The Type II nuclease may generate a double strand break, which in some cases creates two blunt ends. In some aspects, the Type II CRISPR nuclease is engineered to be a nickase such that the nuclease only generates a single strand break. In such cases, two distinct nucleic acid sequences may be targeted by gRNAs such that two single strand breaks are generated by the nickase. In some aspects, the two single strand breaks effectively create a double strand break. In some aspects where a Type II nickase is used to generate two single strand breaks, the resulting nucleic acid free ends may either be blunt, have a 3′ overhang, or a 5′ overhang. In some aspects, a Type II nuclease may be catalytically dead such that it binds to a target sequence, but does not cleave. For example, a Type II nuclease may have mutations in both the RuvC and HNH domains, thereby rendering the both nuclease domains non-functional.
A Type II CRISPR system may be one of three sub-types, namely Type II-A, Type II-B, or Type II-C.
The CRISPR-based tool disclosed herein can be a Type V CRISPR system, for example a Cpf1, C2cl, or C2c3 system. The Type V nuclease may comprise a single effector protein, which comprises a single RuvC nuclease domain. In other cases, a function Type V nuclease comprises a RuvC domain split between two or more polypeptides. In such cases, the target nucleic acid sequences may comprise a 5′ PAM or 3′ PAM. Guide RNAs (gRNA) may comprise a single gRNA or single crRNA, such as may be the case with Cpf1. In some aspects, a tracrRNA is not needed. In other examples, such as when C2cl is used, a gRNA may comprise a single chimeric gRNA, which contains both crRNA and tracrRNA sequences or the gRNA may comprise a set of two RNAs, for example a crRNA and a tracrRNA. The Type V CRISPR nuclease may generate a double strand break, which generates a 5′ overhang. In some aspects, the Type V CRISPR nuclease is engineered to be a nickase such that the nuclease only generates a single strand break. In such cases, two distinct nucleic acid sequences may be targeted by gRNAs such that two single strand breaks are generated by the nickase. In some aspects, the two single strand breaks effectively create a double strand break. In some aspects where a Type V nickase is used to generate two single strand breaks, the resulting nucleic acid free ends may either be blunt, have a 3′ overhang, or a 5′ overhang. In some aspects, a Type V nuclease may be catalytically dead such that it binds to a target sequence, but does not cleave. For example, a Type V nuclease may have mutations a RuvC domain, thereby rendering the nuclease domain non-functional.
The CRISPR-based tool disclosed herein may be a Type VI CRISPR system, for example a C2c2 system. A Type VI nuclease may comprise a HEPN domain. In some aspects, the Type VI nuclease comprises two or more polypeptides, each of which comprises a HEPN nuclease domain or fragment thereof. In such cases, the target nucleic acid sequences may by RNA, such as single stranded RNA. When using Type VI CRISPR system, a target nucleic acid may comprise a protospacer flanking site (PFS). The PFS may be 3′ or 5′ or the target or protospacer sequence. Guide RNAs (gRNA) may comprise a single gRNA or single crRNA. In some aspects, a tracrRNA is not needed. In other examples, a gRNA may comprise a single chimeric gRNA, which contains both crRNA and tracrRNA sequences or the gRNA may comprise a set of two RNAs, for example a crRNA and a tracrRNA. In some aspects, a Type VI nuclease may be catalytically dead such that it binds to a target sequence, but does not cleave. For example, a Type VI nuclease may have mutations in a HEPN domain, thereby rendering the nuclease domains non-functional.
Non-limiting examples of suitable nucleases, including nucleic acid-guided nucleases, for use in the present disclosure include C2cl, C2c2, C2c3, Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csn1 and Csx12), Cas10, Cpf1, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx100, Csx16, CsaX, Csx3, Csxl, Csx15, Csf1, Csf2, Csf3, Csf4, homologues thereof, orthologues thereof, or modified versions thereof.
In some aspects, The CRISPR-based tool disclosed herein is an Argonaute (Ago) system. Ago protein may be derived from a prokaryote, eukaryote, or archaea. The target nucleic acid may be RNA or DNA. A DNA target may be single stranded or double stranded. In some aspects, the target nucleic acid does not require a specific target flanking sequence, such as a sequence equivalent to a protospacer adjacent motif or protospacer flanking sequence. The Ago protein may create a double strand break or single strand break. In some aspects, when an Ago protein forms a single strand break, two Ago proteins may be used in combination to generate a double strand break. In some aspects, an Ago protein comprises one, two, or more nuclease domains. In some aspects, an Ago protein comprises one, two, or more catalytic domains. One or more nuclease or catalytic domains may be mutated in the Ago protein, thereby generating a nickase protein capable of generating single strand breaks. In other aspects, mutations in one or more nuclease or catalytic domains of an Ago protein generates a catalytically dead Ago protein that may bind but not cleave a target nucleic acid.
Ago proteins may be targeted to target nucleic acid sequences by a guiding nucleic acid. In some aspects, the guiding nucleic acid is a guide DNA (gDNA). The gDNA may have a 5′ phosphorylated end. The gDNA may be single stranded or double stranded. Single stranded gDNA may be 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 nucleotides in length. In some aspects, the gDNA may be less than 10 nucleotides in length. In some aspects, the gDNA may be more than 50 nucleotides in length.
Argonaute-mediated cleavage may generate blunt end, 5′ overhangs, or 3′ overhangs. In some aspects, one or more nucleotides are removed from the target site during or following cleavage.
In some aspects, the nucleic acid editing or modifying tool is a repressive dCas9 with the aid of a (single) guide RNA targeting the portion of the genomic region that is transcribed to the long noncoding transcript. In some instances, the nucleic acid editing or modifying tool is dCas9-KRAB-MECP2 with the aid of a (single) guide RNA targeting the portion of the genomic region that is transcribed to the long noncoding transcript. In other specific aspects, the nucleic acid editing or modifying tool is dCas9-KRAB-DNMT1 with the aid of a (single) guide RNA targeting the portion of the genomic region that is transcribed to the long noncoding transcript.
In the above-mentioned aspects, the (single) guide RNA targets 5′ side of an enhancer region the genomic region that is transcribed to the long noncoding transcript. In the certain aspects, the (single) guide RNA targets 5′ side of an enhancer region the genomic region that is transcribed to the long noncoding transcript.
In some aspects, the modulator disclosed herein is a synthetic or artificial oligonucleotide or polynucleotide. In some instances, the oligonucleotide is a single-stranded nucleic acid molecule. In some instances, the single-stranded nucleic acid molecule comprises a nucleic acid sequence at least 9, 10 11, 12, 13, 14, 15 consecutive nucleotides that are complementary to at least a portion of a long noncoding transcript disclosed herein. In some instances, the single-stranded nucleic acid molecule comprises a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95% complementary to at least a portion of a long noncoding transcript disclosed herein. In some instances, the oligonucleotide is a double-stranded nucleic acid molecule comprising a sense strand and an antisense strand. In some instances, the sense strand comprises a nucleic acid sequence at least 9, 10 11, 12, 13, 14, 15 consecutive nucleotides that are identical or at least 80%, at least 85%, at least 90%, at least 95% identical to at least a portion of a long noncoding transcript disclosed herein. In some instances, the antisense strand comprises a nucleic acid sequence at least 9, 10 11, 12, 13, 14, 15 consecutive nucleotides that are fully complementary or at least 80%, at least 85%, at least 90%, at least 95% complementary to at least a portion of a long noncoding transcript disclosed herein. In specific aspects, the synthetic or artificial oligonucleotide or polynucleotide is a small interfering RNA (siRNA), a microRNA (miRNA), an inhibitory double stranded RNA (dsRNA), a small or short hairpin RNA (shRNA), an antisense oligonucleotide (ASO), a phosphorodiamidate morpholino oligomer (PMO), a piwi-interacting RNA (piRNA), a heterogeneous nuclear RNA (hnRNA), a small nuclear RNA (snRNA), an enzymatically-prepared siRNA (esiRNA), or the precursors thereof.
In some aspects, the synthetic oligonucleotide disclosed herein is about 10-50 nucleotides long. In some instances, the synthetic oligonucleotide disclosed herein is about 10-40 nucleotides long. In some instances, the synthetic oligonucleotide disclosed herein is about 10-30, 10-28, 14-28, 14-25, 14-20, 15-25, or 18-25 nucleotides long. In other aspects, the synthetic oligonucleotide disclosed herein is at least 10 nucleotides long, at least 11 nucleotides long, at least 12 nucleotides long, at least 13 nucleotides long, at least 14 nucleotides long, at least 15 nucleotides long, or at least 16 nucleotides long. In other aspects, the synthetic oligonucleotide disclosed herein is at most 50 nucleotides long. In other aspects, the synthetic oligonucleotide disclosed herein is at most 40 nucleotides long. In other aspects, the synthetic oligonucleotide disclosed herein is at most 30 nucleotides long. In other aspects, the synthetic oligonucleotide disclosed herein is at most 20 nucleotides long. In other aspects, the synthetic oligonucleotide disclosed herein is about 15 nucleotides long. In other aspects, the synthetic oligonucleotide disclosed herein is about 16 nucleotides long. In other aspects, the synthetic oligonucleotide disclosed herein is about 17 nucleotides long. In other aspects, the synthetic oligonucleotide disclosed herein is 16 nucleotides long.
In some aspects, the synthetic oligonucleotide comprises one or more sugar modifications, one or more phosphate backbone modifications, one or more purine modifications, one or more pyrimidine modifications, or any combination thereof.
In some aspects, the sugar modifications disclosed herein comprises a modification at a 2′ hydroxyl group of the ribose moiety. In some instances, the modification includes an H, OR, R, halo, SH, SR, NH2, NHR, NR2, or CN, wherein R is an alkyl moiety. In some instances, the modification at the 2′ hydroxyl group is a 2′-O-methyl modification or a 2′-O-methoxyethyl (2′-O-MOE) modification, 2′-halo modification, 2′-fluoro modification, 2′-O-aminopropyl modification, locked or bridged ribose modification (e.g., locked nucleic acid (LNA)), ethylene nucleic acids (ENA), tricyclo-DNA, 2′ cyclic ethyl (cET), unlocked nucleic acid (UNA), and conformationally restricted nucleoside (CRN), or any combination thereof. In some aspects, the LNA disclosed herein comprises a beta-D-oxy LNA, an alpha-L-oxy-LNA, a beta-D-amino-LNA, an alpha-L-amino-LNA, a beta-D-thio-LNA, an alpha-L-thio-LNA, a 5′-methyl-LNA, a beta-D-ENA, or an alpha-L-ENA.
In some aspects, the one or more phosphate backbone modifications comprise phosphorothioate linkage, methylphosphonate linkage, guanidinopropyl phosphoramidate linkage, or any combination thereof.
In some aspects, the one or more purine modifications comprise 2,6-diaminopurin, 3-deaza-adenine, 7-deaza-guanine, 8-zaido-adenine, or any combination thereof.
In some aspects, the one or more pyrimidine modifications comprise 2-thio-thymidine, 5-carboxamide-uracil, 5-methyl-cytosine, 5-ethynyl-uracil, or any combination thereof.
In some aspects, the synthetic oligonucleotide comprises a phosphorodiamidate morpholino oligomer (PMO).
In some aspects, the synthetic oligonucleotide disclosed herein is an ASO. In some instances, the ASO comprises a gapmer comprising a central region of consecutive DNA nucleotides flanked by a 5′-wing region and 3′-wing region, wherein at least one of 5′-wing region and 3′-wing region comprises a nucleic acid analogue. In some instances, 5′-wing region comprises or consists of 2 or 3 nucleotides, RNA mimics, nucleic acid analogues, or combination thereof. In some instances, 3′-wing region comprises or consists of 2 or 3 nucleotides, RNA mimics, nucleic acid analogues, or combination thereof. In some instances, 5′-wing region comprises or consists of 3 nucleotides or nucleic acid analogues, or combination thereof, and 3′-wing region comprises or consists of 2 nucleotides or nucleic acid analogues, or combination thereof. In some instances, 5′-wing region comprises or consists of 2 nucleotides or nucleic acid analogues, or combination thereof, and 3′-wing region comprises or consists of 3 nucleotides or nucleic acid analogues, or combination thereof. In some instances, 5′-wing region comprises or consists of 3 nucleotides or nucleic acid analogues, or combination thereof, and 3′-wing region comprises or consists of 3 nucleotides or nucleic acid analogues, or combination thereof. In some instances, 5′-wing region comprises or consists of 3 nucleic acid analogues, and 3′-wing region comprises or consists of 2 nucleic acid analogues.
In some instance, the nucleic acid analogue comprises an LNA. In some aspects, the 5′-wing region of the gapmer disclosed herein comprises at least one LNA. In some aspects, the 5′-wing region of the gapmer disclosed herein comprises at least two LNAs. In some instances, the 5′-wing region of the gapmer disclosed herein comprises two consecutive LNAs. In some aspects, the 5′-wing region of the gapmer disclosed herein comprises at least three LNAs. In some specific aspects, the 5′-wing region of the gapmer disclosed herein comprises three consecutive LNAs. In some aspects, the 5′-wing region of the gapmer disclosed herein comprises at least four LNAs. In some specific aspects, the 5′-wing region of the gapmer disclosed herein comprises four consecutive LNAs.
In some aspects, the 3′-wing region comprises an LNA. In some aspects, the 3′-wing region comprises at least two LNAs. In specific aspects, the 3′-wing region comprises two consecutive LNAs. In some aspects, the 3′-wing region comprises at least three LNAs. In specific aspects, the 3′-wing region comprises three consecutive LNAs.
In some aspects, the synthetic oligonucleotide disclosed herein comprises at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15 consecutive nucleotides from SEQ ID NO: 1 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 8 consecutive nucleotides with no more than 8 mismatches, no more than 7 mismatches, no more than 6 mismatches, no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 1 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 9 consecutive nucleotides with no more than 7 mismatches, no more than 6 mismatches, no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 1 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 10 consecutive nucleotides with no more than 6 mismatches, no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 1 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 11 consecutive nucleotides with no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 1 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 12 consecutive nucleotides with no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 1 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 13 consecutive nucleotides with no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 1 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 14 consecutive nucleotides with no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 1 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises 15 consecutive nucleotides with 1 mismatch from SEQ ID NO: 1 from Table 3.
In some aspects, the synthetic oligonucleotide disclosed herein comprises at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15 consecutive nucleotides from SEQ ID NO: 2 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 8 consecutive nucleotides with no more than 8 mismatches, no more than 7 mismatches, no more than 6 mismatches, no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 2 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 9 consecutive nucleotides with no more than 7 mismatches, no more than 6 mismatches, no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 2 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 10 consecutive nucleotides with no more than 6 mismatches, no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 2 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 11 consecutive nucleotides with no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 2 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 12 consecutive nucleotides with no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 2 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 13 consecutive nucleotides with no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 2 from Table 3. In other aspects, the synthetic oligonucleotide disclosed herein comprises at least 14 consecutive nucleotides with no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 2 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises 15 consecutive nucleotides with 1 mismatch from SEQ ID NO: 2 from Table 3.
In some instances, the synthetic oligonucleotide disclosed herein comprises at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15 consecutive nucleotides from SEQ ID NO: 3 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 8 consecutive nucleotides with no more than 8 mismatches, no more than 7 mismatches, no more than 6 mismatches, no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 3 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 9 consecutive nucleotides with no more than 7 mismatches, no more than 6 mismatches, no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 3 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 10 consecutive nucleotides with no more than 6 mismatches, no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 3 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 11 consecutive nucleotides with no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 3 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 12 consecutive nucleotides with no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 3 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 13 consecutive nucleotides with no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 3 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 14 consecutive nucleotides with no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 3 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises 15 consecutive nucleotides with 1 mismatch from SEQ ID NO: 3 from Table 3.
In some aspects, the synthetic oligonucleotide disclosed herein comprises at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15 consecutive nucleotides from SEQ ID NO: 4 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 8 consecutive nucleotides with no more than 8 mismatches, no more than 7 mismatches, no more than 6 mismatches, no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 4 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 9 consecutive nucleotides with no more than 7 mismatches, no more than 6 mismatches, no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 4 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 10 consecutive nucleotides with no more than 6 mismatches, no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 4 from Table 3. In v, the synthetic oligonucleotide disclosed herein comprises at least 11 consecutive nucleotides with no more than 5 mismatches, no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 4 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 12 consecutive nucleotides with no more than 4 mismatches, no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 4 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 13 consecutive nucleotides with no more than 3 mismatches, no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 4 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises at least 14 consecutive nucleotides with no more than 2 mismatches or no more than 1 mismatch from SEQ ID NO: 4 from Table 3. In some instances, the synthetic oligonucleotide disclosed herein comprises 15 consecutive nucleotides with 1 mismatch from SEQ ID NO: 4 from Table 3.
In some aspects, the synthetic oligonucleotide disclosed herein comprises a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical to a sequence selected from SEQ ID NO: 1 from Table 3. In some aspects, the synthetic oligonucleotide disclosed herein comprises a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical to a sequence selected from SEQ ID NO: 2 from Table 3. In some aspects, the synthetic oligonucleotide disclosed herein comprises a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical to a sequence selected from SEQ ID NO: 3 from Table 3. In some aspects, the synthetic oligonucleotide disclosed herein comprises a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical to a sequence selected from SEQ ID NO: 4 from Table 3.
Further provided herein are pharmaceutical compositions comprising the modulator disclosed herein and a pharmaceutically acceptable salt, excipient, or derivative thereof.
The suitable pharmaceutically acceptable salts or derivative thereof include but are not limited to (i) salts formed with cations such as sodium, potassium, ammonium, magnesium, calcium, polyamines such as spermine and spermidine, etc.; (ii) acid addition salts formed with inorganic acids, for example hydrochloric acid, hydrobromic acid, sulfuric acid, phosphoric acid, and the like; and (iii) salts formed with organic acids such as, for example, acetic acid, oxalic acid, tartaric acid, succinic acid, maleic acid, fumaric acid, gluconic acid, citric acid, malic acid, ascorbic acid, benzoic acid, tannic acid, palmitic acid, alginic acid, polyglutamic acid, naphthalenesulfonic acid, methanesulfonic acid, p-toluenesulfonic acid, naphthalenedisulfonic acid, polygalacturonic acid, and the like.
A pharmaceutical composition described herein can be prepared to include the modulator disclosed herein, into a form suitable for administration to a subject using carriers, excipients, and vehicles. In some instances, excipients include magnesium carbonate, titanium dioxide, lactose, mannitol and other sugars, talc, milk protein, gelatin, starch, vitamins, cellulose and its derivatives, animal and vegetable oils, polyethylene glycols and solvents, such as sterile water, alcohols, glycerol, and polyhydric alcohols. Intravenous vehicles include fluid and nutrient replenishers. Preservatives include antimicrobial, anti-oxidants, chelating agents, and inert gases.
Other pharmaceutically acceptable vehicles include aqueous solutions, non-toxic excipients, including salts, preservatives, buffers and the like, as described, for instance, in Remington: The Science and Practice of Pharmacy, 21st Ed., Gennaro, Ed., Lippencott Williams & Wilkins (2005), and The United States Pharmacopeia: The National Formulary (USP 36 NF31), published in 2013. The pH and exact concentration of the various components of the pharmaceutical composition are adjusted according to routine skills in the art. See Goodman and Gilman's, The Pharmacological Basis for Therapeutics.
The pharmaceutical compositions described herein may be administered locally or systemically. The therapeutically effective amounts will vary according to factors, such as the degree of infection in a subject, the age, sex, health conditions, and weight of the individual.
Dosage regimes can be adjusted to provide the optimum therapeutic response. For example, several divided doses can be administered daily or the dose can be proportionally reduced as indicated by the exigencies of the therapeutic situation.
The pharmaceutical composition can be administered in a convenient manner, such as by injection (e.g., subcutaneous, intravenous, intraorbital, and the like), oral administration, ophthalmic application, inhalation, topical application, or rectal administration. Depending on the route of administration, the pharmaceutical composition can be coated with a material to protect the pharmaceutical composition from the action of enzymes, acids, and other natural conditions that may inactivate the pharmaceutical composition. The pharmaceutical composition can also be administered parenterally or intraperitoneally. Dispersions can also be prepared in glycerol, liquid polyethylene glycols, and mixtures thereof, and in oils. Under ordinary conditions of storage and use, these preparations may contain a preservative to prevent the growth of microorganisms.
Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions (where water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersions. The composition can be sterile and fluid to the extent that easy syringability exists. The composition can be stable under the conditions of manufacture and storage and preserved against the contaminating action of microorganisms, such as bacteria and fungi. The vehicle can be a solvent or dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol, and the like), suitable mixtures thereof, and vegetable oils. The proper fluidity can be maintained, for example, by the use of a coating, such as lecithin, by the maintenance of certain particle size, in the case of dispersion, and by the use of surfactants. Prevention of the action of microorganisms can be achieved by various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, isotonic agents, for example, sugars, polyalcohols, such as mannitol, sorbitol, or sodium chloride are used in the composition. Prolonged absorption of the injectable compositions can be brought about by including in the composition an agent that delays absorption, for example, aluminum monostearate and gelatin.
Sterile injectable solutions can be prepared by incorporating the pharmaceutical composition in an appropriate solvent with one or a combination of ingredients enumerated above followed by filtered sterilization. Generally, dispersions are prepared by incorporating the pharmaceutical composition into a sterile vehicle that contains a basic dispersion medium and the other ingredients from those enumerated above.
It is advantageous to formulate parenteral compositions in dosage unit form for ease of administration and uniformity of dosage. Dosage unit form as used herein, refers to physically discrete units suited as unitary dosages for the subject to be treated; each unit containing a predetermined quantity of pharmaceutical composition is calculated to produce the desired therapeutic effect in association with the pharmaceutical vehicle. The specification for the dosage unit forms are related to the characteristics of the pharmaceutical composition and the particular therapeutic effect to be achieve. The principal pharmaceutical composition is compounded for convenient and effective administration in effective amounts with a suitable pharmaceutically acceptable vehicle in an acceptable dosage unit. In the case of compositions containing supplementary active ingredients, the dosages are determined by reference to the usual dose and manner of administration of the ingredients.
The pharmaceutical composition can be orally administered, for example, in a carrier, e.g., in an enteric-coated unit dosage form. The pharmaceutical composition and other ingredients can also be enclosed in a hard or soft-shell gelatin capsule or compressed into tablets.
For oral therapeutic administration, the pharmaceutical composition can be incorporated with excipients and used in the form of ingestible tablets, troches, capsules, pills, wafers, and the like.
Such compositions and preparations may contain at least 1% by weight of active compound. The percentage of the compositions and preparations can, of course, be varied and can conveniently be between about 5% to about 80% of the weight of the unit. The tablets, troches, pills, capsules, and the like can also contain the following: a binder, such as gum tragacanth, acacia, corn starch, or gelatin; excipients such as dicalcium phosphate; a disintegrating agent, such as corn starch, potato starch, alginic acid, and the like; a lubricant, such as magnesium stearate; and a sweetening agent, such as sucrose, lactose or saccharin, or a flavoring agent such as peppermint, oil of wintergreen, or cherry flavoring. When the dosage unit form is a capsule, it can contain, in addition to materials of the above type, a liquid carrier. Various other materials can be present as coatings or to otherwise modify the physical form of the dosage unit. For instance, tablets, pills, or capsules can be coated with shellac, sugar, or both. A syrup or elixir can contain the agent, sucrose as a sweetening agent, methyl and propylparabens as preservatives, a dye, and flavoring, such as cherry or orange flavor. Any material used in preparing any dosage unit form can be of pharmaceutically acceptable purity and substantially non-toxic in the amounts employed. In addition, the pharmaceutical composition can be incorporated into sustained-release preparations and formulations.
The pharmaceutical composition described herein may comprise one or more permeation enhancer that facilitates bioavailability of the modulator described herein. WO 2000/67798, Muranishi, 1990, Crit. Rev. Ther. Drug Carrier Systems, 7, 1, Lee et al., 1991, Crit. Rev. Ther. Drug Carrier Systems, 8, 91 are herein incorporated by reference in its entirety. In some aspects, the permeation enhancer is intestinal. In some aspects, the permeation enhancer is transdermal. In some aspects, the permeation enhancer is to facilitate crossing the brain-blood barrier. In some aspects, the permeation enhancer improves the permeability in the oral, nasal, buccal, pulmonary, vaginal, or corneal delivery model. In some aspects, the permeation enhancer is a fatty acid or a derivative thereof. In some aspects, the permeation enhancer is a surfactant or a derivative thereof. In some aspects, the permeation enhancer is a bile salt or a derivative thereof. In some aspects, the permeation enhancer is a chelating agent or a derivative thereof. In some aspects, the permeation enhancer is a non-chelating non-surfactant or a derivative thereof. In some aspects, the permeation enhancer is an ester or a derivative thereof. In some aspects, the permeation enhancer is an ether or a derivative thereof. In some specific aspects, the permeation enhancer is arachidonic acid, undecanoic acid, oleic acid, lauric acid, caprylic acid, capric acid, myristic acid, palmitic acid, stearic acid, linoleic acid, linolenic acid, dicaprate, tricaprate, monoolein, dilaurin, glyceryl 1-monocaprate, 1-dodecylazacycloheptan-2-one, an acylcarnitine, an acylcholine, or a monoglyceride, a diglyceride or a pharmaceutically acceptable salt thereof. In one aspect, the permeation enhancer is sodium caprate (C10). In some instances, the permeation enhancer is chenodeoxycholic acid (CDCA), ursodeoxychenodeoxycholic acid (UDCA), cholic acid, dehydrocholic acid, deoxycholic acid, glucholic acid, glycholic acid, glycodeoxycholic acid, taurocholic acid taurodeoxycholic acid, sodium tauro-24,25-dihydro-fusidate or sodium glycodihydrofusidate. In some instances, the permeation enhancer is polyoxyethylene-9-lauryl ether, or polyoxyethylene-20-cetyl ether.
Further provided herein are kits comprising the modulator disclosed herein. Further provided herein are kits comprising the pharmaceutical composition disclosed herein. In some aspects, the kit comprises suitable instructions in order to perform the methods of the kit. The instructions may provide information of performing any of the methods disclosed herein, whether or not the methods may be performed using only the reagents provided in the kit.
For use in the therapeutic applications described herein, kits and articles of manufacture are also described herein. In some aspects, such kits include a carrier, package, or container that is compartmentalized to receive one or more containers such as vials, tubes, and the like, each of the container(s) including one of the separate elements to be used in a method described herein.
Suitable containers include, for example, bottles, vials, syringes, and test tubes. The containers can be formed from a variety of materials such as glass or plastic. The articles of manufacture provided herein contain packaging materials. Examples of pharmaceutical packaging materials include, but are not limited to, blister packs, bottles, tubes, inhalers, pumps, bags, vials, containers, syringes, bottles, and any packaging material suitable for a selected formulation and intended mode of administration and treatment. The container(s) optionally have a sterile access port (for example the container is an intravenous solution bag or a vial having a stopper pierceable by a hypodermic injection needle). Such kits optionally comprise a composition with an identifying description or label or instructions relating to its use in the methods described herein.
A kit may include one or more additional containers, each with one or more of various materials (such as reagents, optionally in concentrated form, and/or devices) desirable from a commercial and user standpoint for use of the modulator described herein. Non-limiting examples of such materials include, but not limited to, buffers, diluents, filters, needles, syringes, carrier, package, container, vial and/or tube labels listing contents and/or instructions for use, and package inserts with instructions for use. A set of instructions will also typically be included.
In some aspects, a label is on or associated with the container. A label can be on a container when letters, numbers or other characters forming the label are attached, molded or etched into the container itself; a label can be associated with a container when it is present within a receptacle or carrier that also holds the container, e.g., as a package insert. A label can be used to indicate that the contents are to be used for a specific therapeutic application. The label can also indicate directions for use of the contents, such as in the methods described herein.
In certain aspects, a pharmaceutical composition comprising the modulators provided herein and optional additional active agent is presented in a pack or dispenser device which can contain one or more unit dosage forms. The pack can for example contain metal or plastic foil, such as a blister pack. The pack or dispenser device can be accompanied by instructions for administration. The pack or dispenser can also be accompanied with a notice associated with the container in form prescribed by a governmental agency regulating the manufacture, use, or sale of pharmaceuticals, which notice is reflective of approval by the agency of the form of the drug for human or veterinary administration. Such notice, for example, can be the labeling approved by the U.S. Food and Drug Administration for prescription drugs, or the approved product insert.
Compositions containing the modulators described herein formulated in a compatible pharmaceutical carrier can also be prepared, placed in an appropriate container, and labeled for treatment of an indicated condition.
Provided herein comprises methods of modulating the expression or activity of long noncoding transcript disclosed herein by contacting a cell comprising the long noncoding transcript to the modulator disclosed herein, or a pharmaceutical composition disclosed herein. In some instances, the cell is affected by ARDS. In some instances, the ARDS is associated with or resulted from COVID-19 infection. Also provided herein comprises methods of modulating the expression or activity of long noncoding transcript disclosed herein in a subject in need thereof by administering to the subject the modulator disclosed herein, or a pharmaceutical composition disclosed herein. In some instances, the subject has been developing ARDS, affected by ARDS, suffering from one or more symptoms of ARDS, and/or has been infected by viruses (e.g., COVID-19) inducing or causing the onset or development ARDS.
Specifically provided herein are methods of reducing expression or activity of the long noncoding transcript disclosed herein in a subject in need thereof by administering to the subject the modulator disclosed herein, or a pharmaceutical composition disclosed herein. In some instances, the subject has been developing ARDS, affected by ARDS, suffering from one or more symptoms of ARDS, and/or has been infected by viruses (e.g., COVID-19) inducing or causing the onset or development ARDS.
Further provided herein are methods of preventing, alleviating, or treating pulmonary fibrosis or a symptom associated with ARDS in a subject in need thereof, the method comprising administering to the subject an effective amount of the modulator disclosed herein or the pharmaceutical composition provided herein. In some instances, ARDS is associated, induced, caused, or resulted from COVID-19 infection.
Further provided herein are methods of preventing, alleviating, or treating pulmonary fibrosis or a symptom associated with pulmonary fibrosis in a subject in need thereof, the method comprising administering to the subject an effective amount of the modulator disclosed herein or the pharmaceutical composition provided herein. In some instances, the pulmonary fibrosis is associated with ARDS. In some instances, the pulmonary fibrosis is idiopathic pulmonary fibrosis. In some instances, the pulmonary fibrosis is associated with or induced by ARDS that is associated, induced, caused, or resulted from COVID-19 infection.
Further provided herein are methods of preventing, alleviating, or treating inflammation in a subject in need thereof, the method comprising administering to the subject an effective amount of the modulator disclosed herein or the pharmaceutical composition provided herein. In some instances, the inflammation is associated with onset or development of pulmonary fibrosis.
In some instances, the inflammation is associated with onset or development of idiopathic pulmonary fibrosis. In some instances, the inflammation is associated with or induced by onset of ARDS that is associated or resulted from COVID-19 infection.
Further provided herein are methods of preventing, alleviating, or treating myocardial fibrosis, pancreatic fibrosis, adipose tissue fibrosis, intestinal fibrosis, and/or uterine fibrosis or a symptom associated with such fibrosis in a subject in need thereof, the method comprising administering to the subject an effective amount of the modulator disclosed herein of a lncRNA transcript derived from the human genomic location LOC107986083 (chr3: 45,817,379-45,827,511) or the pharmaceutical composition provided herein that modulates a lncRNA transcript derived from the human genomic location LOC107986083 (chr3: 45,817,379-45,827,511). Genotype-Tissue Expression (GTEx) analysis indicated that CORAL transcripts containing sequence derived from the human genomic location LOC107986083 (chr3: 45,817,379-45,827,511) is expressed at a moderate level in some tissues of mesenchymal origin such as lung and instestine. Expression of these hsCORAL transcripts was also detected in pancreas. Detectable, but low levels of expression of these hsCORAL transcripts was found in muscle, uterus, and skin. Expression of these hsCORAL transcripts was very low-to-undetectable in other tissues assayed.
In some aspects, the subject in need of treatment has or is suspected to have ARDS. In some instances, the subject has or is suspected to have ARDS that is caused by or associated with viral infection. In some specific aspects, the subject has or is suspected to have ARDS that is caused by or associated with viral pulmonary disease. In further specific aspects, the subject has or is suspected to have ARDS that is caused by or associated with COVID-19 infection. In some instances, the subject has or is suspected to have ARDS that is caused by or associated with HIV. In some instances, the subject has or is suspected to have ARDS that is caused by or associated with an increased immune response. In some instances, the subject has or is suspected to have ARDS that is caused by or associated with an autoimmune disease.
In some aspects, the effective amount of a modulator as disclosed herein can decrease the amount, expression level, or activity of the long noncoding transcript disclosed herein at least 1%, 2%, 3%, 4%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% or more in the cell contacted with the modulator. In some aspects, the effective amount of a modulator as disclosed herein can decrease the amount, expression level, or activity of the long noncoding transcript disclosed herein at least 1%, 2%, 3%, 4%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% or more in the cells or tissues of the subject affected by pulmonary fibrosis, e.g., pulmonary fibrosis associated with ARDS, affected by ARDS, and/or affected by COVID-19 infection.
In some aspects, the effective amount of a modulator as disclosed herein can modulate the amount, expression level, or activity of transcripts or mRNAs of genes associated with or markers of pulmonary fibrosis, e.g., pulmonary fibrosis associated with or induced by ARDS in a at least 1%, 2%, 3%, 4%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% or more in the cells or tissues of the subject affected by pulmonary fibrosis, e.g., pulmonary fibrosis associated with ARDS, affected by ARDS, and/or affected by COVID-19 infection. In some aspects, the effective amount of a modulator as disclosed herein can increase the amount, expression level, or activity of transcripts or mRNAs of at least one or more genes associated with or markers of pulmonary fibrosis, e.g., pulmonary fibrosis associated with or induced by ARDS in a at least 1%, 2%, 3%, 4%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% or more in the cells or tissues of the subject affected by pulmonary fibrosis, e.g., pulmonary fibrosis associated with ARDS, affected by ARDS, and/or affected by COVID-19 infection. In some aspects, the effective amount of a modulator as disclosed herein can decrease the amount, expression level, or activity of transcripts or mRNAs of at least one or more genes associated with or markers of pulmonary fibrosis, e.g., pulmonary fibrosis associated with or induced by ARDS in a at least 1%, 2%, 3%, 4%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% or more in the cells or tissues of the subject affected by pulmonary fibrosis, e.g., pulmonary fibrosis associated with ARDS, affected by ARDS, and/or affected by COVID-19 infection.
In some aspects, the effective amount of a modulator as disclosed herein can alleviate or reduce the severity or frequencies of symptoms of pulmonary fibrosis associated with ARDS or severity or frequencies of symptoms of ARDS. In some aspects, the effective amount of a modulator as disclosed herein can reverse severity or frequencies symptoms of pulmonary fibrosis associated with ARDS or severity or symptoms of ARDS, or progress of pulmonary fibrosis associated with ARDS or ARDS.
In some aspects, the modulator is expressed in a viral vector. In other aspects, the modulator is expressed in a plasmid vector. In some instances, the modulator is encapsulated in a liposome. In some instances, the modulator is encapsulated in a nanoparticle. In some instances, the modulator is encapsulated in an extracellular vesicle.
For delivery to the target cell, the modulator described herein can non-covalently bind an excipient to form a complex. The excipient can be used to alter biodistribution after delivery, to enhance uptake, to increase half-life or stability of the strands in the modulator described herein (e.g., improve nuclease resistance), and/or to increase targeting to a particular cell or tissue type.
Exemplary excipients include but are not limited to a condensing agent (e.g., an agent capable of attracting or binding a nucleic acid through ionic or electrostatic interactions); a fusogenic agent (e.g., an agent capable of fusing and/or being transported through a cell membrane); a protein to target a particular cell or tissue type (e.g., thyrotropin, melanotropin, lectin, glycoprotein, surfactant protein A, or any other protein); a lipid; a lipopolysaccharide; a lipid micelle or a liposome (e.g., formed from phospholipids, such as phosphotidylcholine, fatty acids, glycolipids, ceramides, glycerides, cholesterols, or any combination thereof); a nanoparticle (e.g., silica, lipid, carbohydrate, or other pharmaceutically-acceptable polymer nanoparticle); a polyplex formed from cationic polymers and an anionic agent (e.g., a CRO), where exemplary cationic polymers include but are not limited to polyamines (e.g., polylysine, polyarginine, poly amidoamine, and polyethylene imine); cholesterol; a dendrimer (e.g., a polyamidoamine (PAMAM) dendrimer); a serum protein (e.g., human serum albumin (HSA) or low-density lipoprotein (LDL)); a carbohydrate (e.g., dextran, pullulan, chitin, chitosan, inulin, cyclodextrin, or hyaluronic acid); a lipid; a synthetic polymer, (e.g., polylysine (PLL), polyethylenimine, poly-L-aspartic acid, poly-L-glutamic acid, styrene-maleic acid anhydride copolymer, poly(L-lactide-co-glycolic) copolymer, divinyl ether-maleic anhydride copolymer, N-(2-hydroxypropyl)methacrylamide copolymer (HMPA), polyethylene glycol (PEG), polyvinyl alcohol (PVA), polyurethane, poly(2-ethylacrylic acid), N-isopropylacrylamide polymer, pseudopeptide-polyamine, peptidomimetic polyamine, or polyamine); a cationic moiety (e.g., cationic lipid, cationic porphyrin, quaternary salt of a polyamine, or alpha helical peptide); a multivalent sugar (e.g., multivalent lactose, multivalent galactose, N-acetyl-galactosamine, N-acetyl-glucosamine, multivalent mannose, or multivalent fucose); a vitamin (e.g., vitamin A, vitamin E, vitamin K, vitamin B, folic acid, vitamin B12, riboflavin, biotin, or pyridoxal); a cofactor; or a drug to disrupt cellular cytoskeleton to increase uptake (e.g., taxol, vincristine, vinblastine, cytochalasin, nocodazole, japlakinolide, latrunculin A, phalloidin, swinholide A, indanocine, or myoservin).
In some aspects, the administering is performed intratracheally, orally, nasally, intravenously, intraperitoneally, or intramuscularly. In some aspects, the administering is performed intratracheally.
In some aspects, the administering is a targeted delivery to a lung tissue of the subject. In some instances, the targeted delivery is via a local application. In some instances, the targeted delivery is via one or more specific binding moieties that target the lung tissue.
In some aspects, the administering is in a form of aerosol. In some instances, the aerodynamic diameter of particles of the modulator disclosed herein is less than 10 μm. In some instances, the aerodynamic diameter of particles of the modulator disclosed herein is less than 5 μm. In some instances, the aerodynamic diameter of particles of the modulator disclosed herein is less than 3 μm.
Further provided herein are methods of diagnosing or monitoring pulmonary fibrosis in a subject, the method comprising: (a) obtaining a sample from the subject; (b) detecting an amount and/or an activity of a long noncoding transcript, wherein the long noncoding transcript is transcribed from a genomic region LOC107986083; and (c) diagnosing the subject with pulmonary fibrosis or to have a high/higher chance to contract pulmonary fibrosis if the amount and/or the activity is at least 50%, at least 60%, at least 70%, at least 80%, at least 90% higher when compared to a control. In some aspects, the detecting comprises using Si nuclease protection assay, microarray analysis, polymerase chain reaction (PCR), hybridization technologies, reverse transcriptase polymerase chain reaction (RT-PCR), Northern blot, serial analysis of gene expression (SAGE), immunoassay, and/or mass spectrometry.
Further provided herein are methods of diagnosing or monitoring idiopathic pulmonary fibrosis in a subject, the method comprising: (a) obtaining a sample from the subject; (b) detecting an amount and/or an activity of a long noncoding transcript, wherein the long noncoding transcript is transcribed from a genomic region LOC107986083; and (c) diagnosing the subject with idiopathic pulmonary fibrosis or to have a high/higher chance to contract idiopathic pulmonary fibrosis if the amount and/or the activity is at least 50%, at least 60%, at least 70%, at least 80%, at least 90% higher when compared to a control. In some aspects, the detecting comprises using S1 nuclease protection assay, microarray analysis, polymerase chain reaction (PCR), hybridization technologies, reverse transcriptase polymerase chain reaction (RT-PCR), Northern blot, serial analysis of gene expression (SAGE), immunoassay, and/or mass spectrometry.
In some aspects, the method of treating pulmonary fibrosis in a subject comprises: (a) obtaining a sample from the subject; (b) detecting an amount and/or an activity of a long noncoding transcript, wherein the long noncoding transcript is transcribed from a genomic region LOC107986083; (c) diagnosing the subject with pulmonary fibrosis if the amount and/or the activity is at least 50%, at least 60%, at least 70%, at least 80%, at least 90% higher when compared to a control; and (d) treating the subject with an effective amount of the modulators disclosed herein. In some aspects, the detecting comprises using Si nuclease protection assay, microarray analysis, polymerase chain reaction (PCR), hybridization technologies, reverse transcriptase polymerase chain reaction (RT-PCR), Northern blot, serial analysis of gene expression (SAGE), immunoassay, and/or mass spectrometry.
In some aspects, the method of treating idiopathic pulmonary fibrosis in a subject comprises: (a) obtaining a sample from the subject; (b) detecting an amount and/or an activity of a long noncoding transcript, wherein the long noncoding transcript is transcribed from a genomic region LOC107986083; (c) diagnosing the subject with idiopathic pulmonary fibrosis if the amount and/or the activity is at least 50%, at least 60%, at least 70%, at least 80%, at least 90% higher when compared to a control; and (d) treating the subject with an effective amount of the modulators disclosed herein. In some aspects, the detecting comprises using Si nuclease protection assay, microarray analysis, polymerase chain reaction (PCR), hybridization technologies, reverse transcriptase polymerase chain reaction (RT-PCR), Northern blot, serial analysis of gene expression (SAGE), immunoassay, and/or mass spectrometry.
Further provided herein are methods of monitoring or predicting the clinical development of a SARS-CoV2 infection in a subject having or suspected of having the infection, the method comprising: (a) obtaining a sample from the subject; (b) detecting an amount and/or an activity of a long noncoding transcript, wherein the long noncoding transcript is transcribed from a genomic region LOC107986083; and (c) diagnosing the subject with the infection or determining the subject having a high/higher chance to develop clinically substantive symptoms from the SARS-CoV2 infection if the amount and/or the activity is at least 50%, at least 60%, at least 70%, at least 80%, at least 90% higher when compared to a control. In some aspects, the detecting comprises using S1 nuclease protection assay, microarray analysis, polymerase chain reaction (PCR), hybridization technologies, reverse transcriptase polymerase chain reaction (RT-PCR), Northern blot, serial analysis of gene expression (SAGE), immunoassay, and/or mass spectrometry.
In some aspects, methods of managing a SARS-CoV2 infection in a subject having or suspected of having the infection comprises (a) obtaining a sample from the subject; (b) detecting an amount and/or an activity of a long noncoding transcript, wherein the long noncoding transcript is transcribed from a genomic region LOC107986083; (c) diagnosing the subject with the infection or determining the subject having a high/higher chance to develop clinically substantive symptoms from the SARS-CoV2 infection if the amount and/or the activity is at least 50%, at least 60%, at least 70%, at least 80%, at least 90% higher when compared to a control; and (d) treating the subject with an advanced procedure and/or therapeutics preemptively. In some instances, step (d) comprises use of albumin for resuscitation. In some instances, step (d) comprises use of norepinephrine as a vasopressor. In some specific aspects, step (d) comprises titrating vasoactive agents to target a mean arterial pressure (MAP) of 60 to 65 mm Hg. In some instances, step (d) comprises adding either vasopressin (up to 0.03 units/min) or epinephrine to norepinephrine to raise MAP. In some instances, step (d) comprises adding vasopressin (up to 0.03 units/min) to decrease norepinephrine dosage. In some instances, step (d) comprises a low-dose dopamine for renal protection. In some instances, step (d) comprises dobutamine. In some instances, step (d) comprises corticosteroids. In some instances, step (d) comprises high-flow nasal cannula (HFNC) oxygen. In some instances, step (d) comprises a closely monitored trial of noninvasive ventilation. In some instances, step (d) comprises a trial of awake prone positioning.
In some instances, step (d) comprises intubation. In some instances, step (d) comprises low tidal volume (VT) ventilation (VT 4-8 mL/kg of predicted body weight). In some instances, step (d) comprises using a higher positive end-expiratory pressure. In some instances, step (d) comprises prone ventilation for 12 to 16 hours per day. In some instances, step (d) comprises using intermittent boluses of neuromuscular blocking agents. In some instances, step (d) comprises recruitment maneuvers. In some instances, step (d) comprises using an inhaled pulmonary vasodilator as a rescue therapy. In some instances, step (d) comprises a continuous renal replacement therapy. In some instances, step (d) comprises a use of empiric broad-spectrum antimicrobial therapy. In some instances, step (d) comprises a use of extracorporeal membrane oxygenation.
In some aspects, the clinical development comprises acute respiratory distress syndrome (ARDS). In other aspects, the clinical development comprises asthma, chronic obstructive pulmonary diseases (COPD), and/or Chronic mucus hypersecretion (CMH) pathogenesis. In some aspects, the clinical development comprises idiopathic pulmonary fibrosis (IPF).
In some aspects, the detecting comprises using Si nuclease protection assay, microarray analysis, polymerase chain reaction (PCR), hybridization technologies, reverse transcriptase polymerase chain reaction (RT-PCR), Northern blot, serial analysis of gene expression (SAGE), immunoassay, and/or mass spectrometry.
The term “noncoding RNA” as used herein, refers to RNA species that are not translated into protein.
The term “CORAL” as used herein, refers to a long noncoding transcript that is associated with onset, development, or prognosis of ARDS and/or pulmonary fibrosis associated with, induced by, or resulted from ARDS. CORAL includes, but not limited to, a long noncoding transcript that is associated with onset, development, or prognosis of ARDS developed after, directly or indirectly, COVID-19 infection, and/or pulmonary fibrosis associated with, induced by, or resulted from COVID-19 infection. As such, CORAL includes, but not limited to, a long noncoding transcript transcribed from a genomic region LOC107986083 as described herein.
CORAL includes, but not limited to, a long noncoding transcript transcribed from a genomic region LOC126806670. CORAL includes, but not limited to, a long noncoding transcript transcribed from a genomic region that is identified by using the methods described in the flowchart listed in
The term “baseline level” of a transcript as used herein, refers to the level of the transcript in healthy cells of the subject or the cells of the healthy subject. In some aspects, it refers to the level of the long noncoding transcripts in healthy cells of a healthy individual. In some aspects, it refers to the level of the long noncoding transcripts in cells of the same subject but before ARDS.
The term “nucleic acid editing or modifying moiety,” as used herein, refers to a moiety that edits or cleaves the target nucleic acid. It can also refer to a moiety that suppresses the transcription of the target nucleic acid.
The term “SARS-CoV2” and “COVID 19” are used sometimes interchangeably, either refer to the virus or the infection caused by the virus.
The term “nucleic acid analogue,” as used herein, refers to compounds which are analogous (structurally similar) to naturally occurring nucleic acid (see, e.g., Freier & Altmann; Nucl. Acid. Res., 1997, 25, 4429-4443 and Uhlmann; Curr. Opinion in Drug Development, 2000, 3(2), 293-213), and examples of suitable nucleic acid analogues are provided by WO2007031091, which are hereby incorporated by reference.
The term “gapmer” is a chimeric nucleic acid that contains a central sequence of DNA nucleotides (“DNA gap”) flanked by sequences of modified RNA residues at either end to protect the DNA gap from nuclease degradation, whereas the central DNA gap region allows RNase-H-mediated cleavage of the target RNA. Gapmer has an internal region having a plurality of nucleosides which is capable of recruiting RNase H activity, such as RNaseH, which region is positioned between external wings at either end, having one or more nucleosides, wherein the nucleosides comprising the internal region are chemically distinct from the nucleoside or nucleosides comprising the external wings.
A “locked nucleic acid” or “LNA” is often referred to as inaccessible RNA, and is a modified RNA nucleobase. The ribose moiety of an LNA nucleobase is modified with an extra bridge connecting the 2′ oxygen and 4′ carbon. An LNA oligonucleotide offers substantially increased affinity for its complementary strand, compared to traditional DNA or RNA oligonucleotides.
The terms “microRNA,” “miRNA,” and MiR” are interchangeable and refer to endogenous or artificial non-coding RNAs that are capable of regulating gene expression. It is believed that miRNAs function via RNA interference. The terms “siRNA” and “short interfering RNA” are interchangeable and refer to single-stranded or double-stranded RNA molecules that are capable of inducing RNA interference. In some aspects, siRNA molecules typically have a duplex region that is between 18 and 30 base pairs in length.
The terms “piRNA” and “Piwi-interacting RNA” are interchangeable and refer to a class of small RNAs involved in gene silencing. piRNA molecules typically are between 26 and 31 nucleotides in length.
The terms “snRNA” and “small nuclear RNA” are interchangeable and refer to a class of small RNAs involved in a variety of processes including RNA splicing and regulation of transcription factors. The subclass of small nucleolar RNAs (snoRNAs) is also included. The term is also intended to include artificial snRNAs, such as antisense derivatives of snRNAs comprising antisense sequences directed against one or more lncRNAs.
The terms “polynucleotide” “oligonucleotide” “polynucleic acid”, “nucleic acid”, and “nucleic acid molecule” are used herein to include a polymeric form of nucleotides of any length, either ribonucleotides, deoxyribonucleotides, modified forms thereof, or hybrid or chimeric forms thereof. This term refers to the primary structure of the molecule. Thus, the term includes triple-, double- and single-stranded DNA, as well as triple-, double- and single-stranded RNA. In some aspects, it also includes modifications, such as by methylation and/or by capping, and unmodified forms of the polynucleotide. More particularly, the terms “polynucleotide” “oligonucleotide” “polynucleic acid”, “nucleic acid”, and “nucleic acid molecule” include polydeoxyribonucleotides (containing 2-deoxy-D-ribose), polyribonucleotides (containing D-ribose), any other type of polynucleotide which is an N- or C-glycoside of a purine or pyrimidine base, and other polymers containing non-nucleotidic backbones, for example, polyamide (e.g., peptide nucleic acids (PNAs)) and polymorpholino (commercially available from the Anti-Vials, Inc., Corvallis, Oreg., as Neugene) polymers, and other synthetic sequence-specific nucleic acid polymers providing that the polymers contain nucleobases in a configuration which allows for base pairing and base stacking, such as is found in DNA and RNA. There is no intended distinction in length between the terms “polynucleotide” “oligonucleotide” “polynucleic acid”, “nucleic acid”, and “nucleic acid molecule” and these terms will be used interchangeably. Thus, these terms include, for example, RNA, double- and single-stranded DNA, as well as double- and single-stranded RNA, microRNA, DNA:RNA hybrids, and hybrids between PNAs and DNA or RNA, and also include known types of modifications, for example, labels which are known in the art, methylation, “caps,” substitution of one or more of the naturally occurring nucleotides with an analog (e.g., 2-aminoadenosine, 2′-thiothymidine, inosine, pyrrolo-pyrimidine, 3′-methyl adenosine, C5-propynylcytidine, C5-propynyluridine, C5-bromouridine, C5-fluorouridine, C5-iodouridine, C5-methylcytidine, 7-deazaadenosine, 7-deazaguanosine, 8-oxoadenosine, 8-oxoguanosine, 0(6)-methylguanine, and 2′-thiocytidine), internucleotide modifications such as, for example, those with uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoramidates, carbamates, etc.), with negatively charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), and with positively charged linkages (e.g., aminoalklyphosphoramidates, aminoalkylphosphotriesters), those containing pendant moieties, such as, for example, proteins (including nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.), those with intercalators (e.g., acridine, psoralen, etc.), those containing chelators (e.g., metals, radioactive metals, boron, oxidative metals, etc.), those containing alkylators, those with modified linkages (e.g., alpha anomeric nucleic acids, etc.), as well as unmodified forms of the polynucleotide or oligonucleotide. The term also includes locked nucleic acids (e.g., comprising a ribonucleotide that has a methylene bridge between the 2′-oxygen atom and the 4′-carbon atom). See, for example, Kurreck et al. (2002) Nucleic Acids Res. 30: 1911-1918.
The term “complementary” and “complementarity” are interchangeable and refer to the ability of polynucleotides to form base pairs with one another. Base pairs are typically formed by hydrogen bonds between nucleotide units in antiparallel polynucleotide strands or regions.
Complementary polynucleotide strands or regions can base pair in the Watson-Crick manner (e.g., A to T, A to U, C to G). 100% complementary refers to the situation in which each nucleotide unit of one polynucleotide strand or region can hydrogen bond with each nucleotide unit of a second polynucleotide strand or region. Less than perfect complementarity refers to the situation in which some, but not all, nucleotide units of two strands or two regions can hydrogen bond with each other and can be expressed as a percentage.
The term “administering”, as it applies in the present disclosure, refers to contact of an effective amount of a modulator of one or more lncRNAs of the disclosure, to the subject.
Administering a nucleic acid, such as a microRNA, siRNA, piRNA, snRNA, or antisense nucleic acid, to a cell comprises transducing, transfecting, electroporating, translocating, fusing, phagocytosing, shooting or ballistic methods, or any means by which a nucleic acid can be transported across a cell membrane.
The term “pharmaceutically acceptable excipient or carrier” refers to an excipient that may optionally be included in the compositions of the disclosure and that causes no significant adverse toxicological effects to the patient.
The term “pharmaceutically acceptable salt” includes, but is not limited to, amino acid salts, salts prepared with inorganic acids, such as chloride, sulfate, phosphate, diphosphate, bromide, and nitrate salts, or salts prepared from the corresponding inorganic acid form of any of the preceding, e.g., hydrochloride, etc., or salts prepared with an organic acid, such as malate, maleate, fumarate, tartrate, succinate, ethylsuccinate, citrate, acetate, lactate, methanesulfonate, benzoate, ascorbate, para-toluenesulfonate, palmoate, salicylate and stearate, as well as estolate, gluceptate and lactobionate salts. Similarly salts containing pharmaceutically acceptable cations include, but are not limited to, sodium, potassium, calcium, aluminum, lithium, and ammonium (including substituted ammonium).
An “effective amount” of modulator of one or more lncRNAs of the disclosure (e.g., microRNA, siRNA, piRNA, snRNA, antisense nucleic acid, ribozyme, or small molecule inhibitor, CRISPRs etc.) is an amount sufficient to effect any beneficial or desired results, such as an amount that inhibits the activity of a lncRNA, at any level, for example by interfering with transcription. An effective amount can be administered in one or more administrations, applications, or dosages.
By “therapeutically effective dose or amount” of a modulator of one or more lncRNAs of the disclosure is intended an amount that, when administered as described herein, brings about a positive therapeutic response. The exact amount required will vary from subject to subject, depending on the species, age, and general condition of the subject, the severity of the condition being treated, the particular drug or drugs employed, mode of administration, and the like. An appropriate “effective” amount in any individual case may be determined by one of ordinary skill in the art using routine experimentation, based upon the information provided herein.
“Homology” refers to the percent identity between two polynucleotides or two polypeptide moieties. Two nucleic acid sequences, or two polypeptide sequences are “substantially homologous” to each other when the sequences exhibit at least about 50% sequence identity, at least about 75% sequence identity, at least about 80%-85% sequence identity, at least about 90% sequence identity, or about 95%-98% sequence identity over a defined length of the molecules. As used herein, substantially homologous sequences also refer to sequences showing complete identity to the specified sequence.
In general, “identity” refers to an exact nucleotide to nucleotide or amino acid to amino acid correspondence of two polynucleotides or polypeptide sequences, respectively. Percent identity can be determined by a direct comparison of the sequence information between two molecules by aligning the sequences, counting the exact number of matches between the two aligned sequences, dividing by the length of the shorter sequence, and multiplying the result by 100. Alternatively, homology can be determined by readily available computer programs or by hybridization of polynucleotides under conditions which form stable duplexes between homologous regions, followed by digestion with single stranded specific nuclease(s), and size determination of the digested fragments. DNA sequences that are substantially homologous can be identified in a Southern hybridization experiment under, for example, stringent conditions, as defined for that particular system. Defining appropriate hybridization conditions is within the skill of the art.
As used herein, a “sample” refers to a sample of tissue or fluid isolated or obtained from a subject, including but not limited to, for example, urine, blood, plasma, serum, fecal matter, bone marrow, bile, spinal fluid, lymph fluid, samples of the skin, external secretions of the skin, respiratory, intestinal, and genitourinary tracts, tears, saliva, milk, blood cells, organs, biopsies, and also samples containing cells or tissues derived from the subject and grown in culture, and in vitro cell culture constituents, including but not limited to, conditioned media resulting from the growth of cells and tissues in culture, recombinant cells, stem cells, and cell components.
The terms “quantity,” “amount,” and “level” are used interchangeably herein and may refer to an absolute quantification of a molecule or an analyte in a sample, or to a relative quantification of a molecule or analyte in a sample, i.e., relative to another value such as relative to a reference value as taught herein, or to a range of values for the biomarker. These values or ranges can be obtained from a single patient or from a group of patients.
“Diagnosis” as used herein generally includes determination as to whether a subject is likely affected by a given disease, disorder or dysfunction of the disclosure. The skilled artisan often makes a diagnosis on the basis of one or more diagnostic indicators, i.e., a biomarker, the presence, absence, or amount of which is indicative of the presence or absence of the disease, disorder or dysfunction.
“Prognosis” as used herein generally refers to a prediction of the probable course and outcome of a clinical condition or disease. A prognosis of a patient is usually made by evaluating factors or symptoms of a disease that are indicative of a favorable or unfavorable course or outcome of the disease. It is understood that the term “prognosis” does not necessarily refer to the ability to predict the course or outcome of a condition with 100% accuracy. Instead, the skilled artisan will understand that the term “prognosis” refers to an increased probability that a certain course or outcome will occur; that is, that a course or outcome is more likely to occur in a patient exhibiting a given condition, when compared to those individuals not exhibiting the condition.
“IMR-90” refers to a commonly used immortalized human lung fibroblast cell line available at ATCC.
Whenever the term “at least,” “more than,” or “less than” precedes the first numerical value in a series of two or more numerical values, the term “at least,” “greater than” or “greater than or equal to” applies to each of the numerical values in that series of numerical values. For example, greater than or equal to 1, 2, or 3 is equivalent to greater than or equal to 1, greater than or equal to 2, or greater than or equal to 3.
The term “a,” “an” and “the” include plural referents unless the content clearly dictates otherwise. Thus, for example, reference to “a lncRNA” includes a mixture of two or more lncRNAs, and the like.
The term “about,” particularly in reference to a given quantity, is meant to encompass deviations of plus or minus ten percent.
The term “pulmonary fibrosis” as used herein, refers to a set of lung diseases that affect the respiratory system. In some aspects, pulmonary fibrosis refers to thickening or scarring of the lung tissue or a portion thereof. In some aspects, pulmonary fibrosis is idiopathic pulmonary fibrosis.
The following is a description of various methods and materials used in the studies, and are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how to make and use the present disclosure, and are not intended to limit the scope of what the inventors regard as their disclosure nor are they intended to represent that the experiments below were performed and are all of the experiments that may be performed. Efforts have been made to ensure accuracy with respect to numbers used (e.g., amounts, percentages, etc.), but some experimental errors and deviations should be accounted for.
Genetic characterization of LOC107986083 was carried out on different levels as depicted in the flow chart of
First, to gain an understanding of chromosomal regulation of LOC107986083, ATAC-seq, DNAse-seq, and ChIP-seq with H3K27ac were conducted and analyzed. ATAC-seq was performed on cells from an immortalized human lung fibroblast (HLF) cell line. “HLF” stands for normal primary human lung fibroblasts isolated from healthy donors. DNase-Seq was performed on primary HLF cells. ChIP-Seq H3K27ac was performed on primary HLF cells. snATAC-Seq was performed on primary HLF cells. RNA-Seq was performed on cells from in vivo biopsies taken from healthy individuals, or subjects exhibiting one or more symptoms of COVID-19, or subjects having exhibited one or more symptoms of COVID-19. Briefly, processed bam files for the conditions shown in
The chr3:45796893-45863928 (hg38) region was tiled with custom 1× dsDNA probes, subtracting all known exons (GENCODE) from protein-coding gene LZTFL1. Probes were synthetized with Twist Bioscience (ref. 101001).
High Quality Total RNA (RIN>9) was extracted from Human Lung Fibroblasts (FB and MyoFB conditions) using High Pure RNA Isolation Kit (Roche, Cat. N°11828665001) according to the manufacturer protocol. Libraries, enrichment, and sequencing were performed by Lausanne Genomic Technologies Facility (GTF). Briefly, PacBio Isoform Sequence (Iso-Seq) libraries were prepared per manufacturer's instructions. A target enrichment step using Twist reagents, probes panel and protocol was done between the cDNA amplification and End repair steps of the Iso-seq library. A change was applied to target enrichment Twist protocol: Universal Blockers were replaced by custom PacBio compatible blocker. Libraries were sequenced on a Sequel II instrument.
Samples were processed following the standard PacBio isoseq3 (1) workflow: PacBio Circular Consensus Sequencing (CCS) reads were demultiplexed and refined (isoseq3 refine) to remove polyA tails and artificial concatemers. Full-length non-concatemer (flnc) reads from replicate runs of the same condition were combined into one file-of-filename (fofn). Fofn reads were then clustered (isoseq3 cluster), mapped with pbmm2 (—preset ISOSEQ—sort) (2) to Gencode's human GRCh38 genome, and collapsed (isoseq3 collapse) into unique isoforms. Collapsed isoforms of the target of interest were extracted from each condition sample and merged into one non-redundant set of isoforms using tama-merge (-a 200-z 200) (3). This set was curated to remove mono-exonic as well as isoforms representing less than 2.5% of the captured locus reads, yielding the final robust set of 8 human CORAL isoforms.
In addition, tissue-specific expression of LOC107986083 was studied, and the results are shown in
A comprehensive characterization of the transcription of LOC107986083 and its regulation mechanisms revealed that LOC107986083 is involved in pulmonary fibrosis. Equally important, LOC107986083 exhibited an elevated transcription level in response to fibrosis associated with acute respiratory distress syndrome (ARDS). Furthermore, rs17713054, a causal variant for COVID-19 respiratory failure, was found to be located in the enhancer region of LOC107986083 (see
Various isoforms of hsCORAL transcripts comprise rs17713054. Other distinct isoforms of hsCORAL transcripts do not comprise rs17713054.
Further validation and characterization of LOC107986083 are carried out.
LOC107986083 expression in hLung myofibroblasts of different genotypes and changes of its expression in hLung myofibroblasts harboring risk variants are investigated. Specifically, NHLF cell lines from different donors are purchased from Lonza. gDNA is extracted with the GeneJET Genomic DNA Purification Kit (K0721) and Sanger sequencing is performed on the LOC107986083 locus to assess the presence of the single nucleotide polymorphism (SNP) of interest. All cell lines are subsequently treated with serum starvation and TGFβ as described herein, and qPCR or RNA-Seq is performed to evaluate expression levels of the LOC107986083 locus. For HLF datasets or other public data without gDNA information or SNP calling available, a variant calling pipeline based on the software GATK is employed. The RNA-Seq workflow starts from raw sequencing reads that are first mapped to the reference using STAR aligner (basic 2-pass method) to produce a bam file. Duplicates are marked and removed with Picard's MarkDuplicate. Downstream analysis steps involve a base recalibration performed with BQSR and variant calling performed with HaplotypeCaller. The output comprises a VCF file (one per sample) with the list of variants called.
Enhancer activity and CEBPO binding of LOC107986083 in lung myofibroblasts harboring risk variants are studied. PRO-seq is performed as described herein. Changes observed in nascent RNA after analysis is performed to demonstrate changes in enhancer activity at the LOC107986083 locus. Separately, CEBPO and H3K27Ac CUT&RUN and PRO-Seq in genotyped lung fibroblasts-myofibroblasts with or without serum starvation and TGFβ treatment are carried out as shown in
Furthermore, LOC107986083 expression in GTEX datasets from lung of different genotypes are examined, and its expression is expected to be increased in GTEX samples harboring risk variants. Raw and processed data were obtained from GTEx after obtaining dbGaP access to repository phs000424. Gene expression data used in downstream analyses are preprocessed using the same procedure as described above. Briefly, FASTQ files are mapped to the reference genome, and expression levels of loci of interest across different samples are quantified. Variants are preselected based on genomic regions of interest (e.g., LOC107986083 genomic locus). The expression of lncRNA loci is quantified using the Subread software package, as described herein. The association between expression levels and SNP presence is established. Simple multivariate linear or logistic regression models are employed to perform a statistical analysis since only the association between relatively few SNPs and gene expression is of interest. Multiple covariates can be introduced in the model, e.g., to account for sample features such as tissue of origin or donor sex/ethnicity. Samples are thus stratified by tissue type and presence or absence of SNPs of interest.
An in-vitro model was established and was validated to be a faithful model for pulmonary fibrosis.
As shown in
For 6 well plates, 150,000 cells were seeded per well and were left to recover overnight before starting experiments. No cells past passage 6 were used for experiments.
After starvation and treatment of TGFβ for 24-48 hours, myofibroblasts were simulated, considering that the morphologies (see
lncRNA transcript expression from the LOC107986083 genomic locus (CORAL) was examined in differentiating fibroblasts in
Lung FB that have been induced to differentiate into lung MyoFB demonstrated a significant increase in expression of CORAL transcripts. In contrast, human cardiac fibroblasts (HCF), human cardiac myofibroblasts, human dermal fibroblasts (HDF), and human dermal myofibroblast all exhibited very weak/undetectable levels of expression with no evidence of increases in expression level when differentiating from fibroblast cell state to myofibroblast cell state. These results demonstrate cell-type specificity and tissue-type specificity for CORAL. The increase in CORAL expression during lung MyoFB differentiation may indicate a role in the initiation, progression, and/or severity of the fibrosis phenotype induced in this model system.
For qPCR results testing expression of selected fibrosis markers in
From 24 through 96 hr, aSMA protein levels remained elevated in MyoFB from the basal level found in FB, and aSMA protein levels continued to rise at each time point evaluated. The elevated and increasing expression of this fibrosis marker over a period of differentiation into MyoFB further validates this model of pulmonary fibrosis.
In addition, the spliced annotated lncRNA of LOC107986083 (hsCORAL) was detectable by qPCR in primary human lung fibroblasts (HLF). Specifically, RNA was isolated with the RNeasy RNA extraction kit, and was reverse transcribed using the Quantitect reverse transcription kit. hsCORAL qPCR primers were as follows: Forward: 5′-CACGTGAGCATACTGGGC-3′ Reverse: 5′-GCAGAGTCATCAAAGGGTCG-3′. qPCR was performed with the TB Green Premix Ex Taq (Takara Cat. #RR420W) master mix according to manufacturer's protocol. Cycles and temperature settings from this manual were used on a QuantStudio 6 Pro Thermo Fisher Real-Time PCR system. Data were then exported and analyzed using the RQ method on the Design and Analysis 2 software provided by the manufacturer. As seen in
Subcellular localization of the transcript of LOC107986083 was further investigated. Cell culture and starvation and treatment steps were performed as described above and shown in
To further validate the in-vitro model of pulmonary fibrosis, the myofibroblasts generated in vitro were compared to myofibroblasts from public annotated datasets containing samples from patients with pulmonary fibrosis.
First, a single cell atlas was constituted for the myofibroblasts generated in vitro. Pirfenidone, a first-line medicine for pulmonary fibrosis and labeled as “drug” in relevant figures, was used. Specifically, single-cell RNA-Seq with 10× Genomics kit and multiplexing hashtags antibodies were carried out for cells with and without pirfenidone, with and without serum starvation and TGFβ treatment, for different durations (see
Furthermore, based on their different gene signatures, cell subpopulations relevant for idiopathic pulmonary fibrosis (IPF) were identified in this dataset (see
Idiopathic Pulmonary Fibrosis (IPF) Cell Atlas is developed as a multi-institutional collaboration to continuously publish datasets presenting in-vivo human disease. Categorical label transfer was performed from Banovich et al. and Kaminski et al. datasets using the TransferData tool in Seurat. To transfer the categorical labels comprising cell identities known from the in vivo scRNA-Seq datasets derived from patient biopsy samples the following method was used. Label transfer between In Vivo patient datasets (reference) and In Vitro model (query) was performed separately for each reference dataset. Expression from each query and reference dataset was preprocessed and normalized in the R environment using Seurat functions. The MAGIC R package was used for data imputation to enhance the signal and counteract the dropout effect of scRNASeq profiling. Label transfer was performed using a proper sequence of functions from the Seurat R package. Anchors between reference and query were obtained using Seurat function FindTransferAnchors using 30 dimensions. Next, data were transferred between datasets using function TransferData (using 30 dimensions), and query cells were annotated with an identity from the reference dataset using Seurat function AddMetaData. As shown in
Taken together, the myofibroblasts generated in vitro recapitulated the in vivo human pulmonary fibrosis, and they are a useful tool to test reagents treating lung diseases such as pulmonary fibrosis and pathology related to the development and progression of ARDS.
First, various antisense oligonucleotides (ASOs) were generated in silico targeting the RNA transcript of LOC107986083. Then development candidates were selected in silico for screening and synthesis. The selection criteria were: (1) no off-target hybridization; (2) absence of questionable motifs; (3) target accessibility; (4) hybridization free energy (AG); and (5) secondary structures (AG). Several ASO candidates shown in Table 3 were selected for further validation.
GGATAATGGTTGGTCA
CAAGTAAGCGTGTAGC
GTCTGGTAACTATGAG
AGATTCATGCAGATGG
Primary NHLFs (normal human lung fibroblasts) from patient biopsies (Lonza, ref. LZ-CC-2512) were cultured with FGM-2 Fibroblast Growth Medium-2 BulletKit (Lonza, ref. CC-3132). The ASOs were delivered to the cells by transfection (transfection reagent X-tremeGENE, Roche, ref. 6366244001). Briefly, for primary NHLFs grew in 6-well plates with a density of 150,000 cells/well, the final concentration of the ASOs used was 50 nM, and the amount of the ASOs per well was 100pmol. ASOs were diluted in TE buffer with a working dilution of 20 μM.
For each triplicate of 3 wells and accounting for pipetting errors, 16 μL of such working dilution of the ASO, and 6 μL of X-tremeGENE HP DNA Transfection Reagent was mixed respectively into 298 μL of OptiMEM medium gently, which resulted in a total volume of 320 μL with a concentration of 1 pmol/μL. The mixture was incubated at room temperature (15-25° C.) for a minimum of 15 minutes. After incubation, the mixture was added to each well dropwise gently (100 μL of the mixture/well).
The fibrotic phenotype was then induced 24 hours after transfection through serum starvation and TGFβ addition, the cells were collected and evaluated another 24 hours later (see
In the fibrotic phenotype-induced cells, transfection of an ASO (SEQ ID NO: 1) targeting the lncRNA polyA transcript knocked down the expression level of the lncRNA polyA transcript (see
Expression of one or more transcripts of LOC107986083 in ASO-1-treated fibrotic phenotype induced cells was knocked down to a level of expression similar to that in primary NHLFs not having undergone serum starvation and TGFβ treatment (Control). Such results indicate that expression level of the lncRNA polyA transcript (hsCORAL) was substantially increased upon induction of fibrotic phenotype in the fibroblast (e.g., upon transitioning to myofibroblast), and such increase of the lncRNA polyA transcript (CORAL) could be reduced or even reversed to the pre-induction state (e.g., fibroblast state) by treating with ASO-1. In this experiment, qPCR primers used were as follows: Forward: 5′-CACGTGAGCATACTGGGC-3′ Reverse: 5′-GCAGAGTCATCAAAGGGTCG-3′. Cycles and temperature settings were adopted from manufacturers' protocols.
Expression of fibrotic marker genes were further profiled upon modulation of expression of one or more transcripts of LOC107986083. In the fibrotic phenotype-induced cells, expression of several profiled fibrotic marker genes could be reduced upon transfection of ASO-1 (SEQ ID NO: 1), in statistically significant levels of decrease (see
Taken together, these results demonstrate that ASO-1 treatment targeting hsCORAL could reduce expression of the target lncRNA as well as several pro-fibrotic genes, which are potential downstream genes modulated by the target lncRNA. Consequently, the ASO-1 treatment targeting hsCORAL could have an anti-fibrotic impact on a lung cell model of pathological fibrosis.
Ontology analysis was performed for genes that were up-regulated or down-regulated in response to treatment of the ASO (SEQ ID NO: 1) (see
In the primary NHLF cells in which the fibrotic phenotype has been induced, treatment with ASO-1 (SEQ ID NO: 1) resulted in a knockdown of lncRNA polyA transcript (see
ASO-1 was tested for dose-response in its effect on hsCORAL expression and expression of select markers of fibrosis (
ASO-1 was tested for effect on modulating various molecular and cellular pathways in the in-vitro model for pulmonary fibrosis used in this example (see
Effects of ASO-1 treatment on gene expression in primary NHLF cells in which the fibrotic phenotype has been induced was tested using RNA-Seq for a comparative analysis of quantitative measures of expression changes. ASO treatment at either 50 nM or 20 nM concentrations was used for analysis. Analysis was performed on genes with average(FPM)>10 (N=10179) and Euclidean distance of log scaled_FPM was used as a metric of measurement. Pairwise distance between each gene were calculated for hierarchical clustering within a latent space with a cluster choice of optimization of average silhouette width. ASO-1 treatment was found to downregulate several clusters of genes as shown in
To refine the analysis further, a cluster-based target engagement panel (TEP) was derived. In this TEP, the most downregulated cluster for ASO-1 treatment on knockdown of CORAL expression was selected. CORAL ASO-1 (Cluster 6): n=287 was selected. As controls, the most upregulated and the most downregulated clusters following ASO-Scr treatment were selected.
Within cluster 6, ASO-1-sensitive genes were enriched for GO terms relating to ECM and actin cytoskeleton terms.
GSEA was performed on data derived from 50 nM treatment of ASO-1 vs. 50 nM treatment of ASO-Scr on primary NHLF cells in which the fibrotic phenotype has been induced. Results from GSEA are shown in
As shown in
As shown in
An ISG gene set has been derived for FB and MyoFB in which the cell types were treated with IFNγ. As this gene set has been analyzed and compared to existing gene sets, genes shared between IFNγ treatment and TGFβ treatment are enriched for GO terms relating to leukocyte migration (e.g., cell chemotaxis, leukocyte migration, leukocyte chemotaxis, monocyte chemotaxis). Further tested indicated that ASO-1 treatment in fibroblasts downregulated a subset of ISGs and was not found to upregulate any gene member of this subset. ISGs downregulated by ASO-1 treated were found to be enriched for leukocyte activation—and proliferation-related GO terms.
PRO-seq of primary human lung fibroblasts treated with ASOs of interest is performed. PRO-seq experimental approach and its bioinformatic analysis is as described herein. After bioinformatic analysis is performed, dysregulation of nascent RNA on key fibrosis marker genes (e.g., ACTA2, POSTN, FAP, COL1A1, COL3A1, and FN1) and dysregulation of nascent RNA at the LOC107986083 locus are observed. Immortalized human lung fibroblasts are included as well.
Enhancer status of LOC107986083 locus upon ASO treatment is evaluated. Cleavage Under Targets & Release Using Nuclease (Cut&Run) was performed with CTCF (negative control), H3K4me3 (using the CUTANA™ Cut&Run kit and antibodies recommended by the manufacturer). Single-nucleus ATAC-Seq (snATAC-Seq) was also performed to enable quantification of chromatin accessibility in single nuclei. H3K27ac ChIP/C&R analysis is performed.
FACS profiling of extracellular markers (e.g., α-SMA, FAP) with or without treating of the ASOs described herein are performed. Proteins of ECM markers such as ACTA2 and FAP are quantified on the surface of treated pulmonary cells by staining and flow cytometry are planned to validate the effect of the ASOs described herein (e.g., SEQ ID NO: 1) Specifically, HLFs transfected with target ASOs or Scramble, treated with serum starvation and TGFβ or not, are collected. The anti-alpha smooth muscle Actin antibody [1A4](Abcam ab7817) is used for the detection of the alphα-SMA protein encoded by the ACTA2 human gene, at a concentration of 1.137 μg/mL as suggested by the manufacturer. Anti-Fibroblast activation protein, alpha antibody (Abcam ab28244) is used for the detection of protein encoded by the FAP human gene.
FACS is performed according to the manufacturer's protocol (https://www.abcam.com/protocols/indirect-flow-cytometry-protocol) on a Beckman Coulter Gallios flow cytometer. Results are read and analyzed with the Kaluza acquisition software. A change in extracellular levels of alphα-SMA (ACTA2) and FAP is quantified upon knocking down LOC107986083.
CUT&RUN of H3K27ac with the Abcam antibody ab4729 is performed according to the protocol in the “mammalian cells” rubric in Skene et al., eLife 2017;6:e21856 DOI: 10.7554/eLife.21856. It is expected that a dysregulation of H3K27ac marks on and around the LOC107986083 locus is observed. A difference in H3K27 levels at the regulatory regions of key fibrotic genes is also observed.
First, a mouse homolog of LOC107986083 was identified. Specifically, whole-lung RNA-Seq with about 200 million reads/sample using 1-year old mouse lung tissues was performed. Mapping was performed as described above with the mm39 mouse genome as reference. A transcribed region at the 3′ end of mouse Lztf11 transcript was observed (see
Two candidate ASOs shown in Table 4 were designed and selected for further testing.
ATGATGTGCAACAGGC
CCCTGTATTACCCTGC
Immortalized MLg mouse lung fibroblast cell line was cultured in DMEM supplemented with 10% FBS. Transfection was performed with polymer reagent (X-tremeGENE, Roche, ref. 6366244001). Specifically, RNA was isolated with the RNeasy RNA extraction kit from Qiagen (ref. 74106), and was reverse transcribed using the Quantitect reverse transcription kit from Qiagen (ref. 205311). Shown in
After investigating the ASOs with the immortalized MLg mouse lung fibroblast cell line, primary mouse lung fibroblasts were used to further validate the selected ASOs. Specifically, primary mouse lung fibroblasts were isolated directly from 5 month old C57BL/6 mice cultured in DMEM supplemented with 10% FBS and 1% P/S. Transfection was performed with polymer reagent (X-tremeGENE™, Roche, ref. 6366244001). The following TaqMan® probes are purchased from Thermo Fisher and used with the TaqMan® method described above: Acta2 Mm00725412_s1 Collal Mm00801666_gI Col3al Mm00802296_gl Postn Mm01284913_gl Fn1 Mm01256744_ml Fap Mm01329177_ml Gapdh Mm99999915_gl mmCoral primers are as follows and are used with the TB Green SYBR method mentioned above: mmCoral_fwdl: 5′-AGAACTTGAAGCTGTCAGGG-3′, mmCoral_revl: 5′-TGCATGTTGAAGACAGCACT-3′. As shown in
In addition, primary mouse lung fibroblasts isolated directly from 24 month old C57BL/6 mice were tested. The fibroblasts were cultured in DMEM supplemented with 10% FBS and 1% P/S. Transfection was performed with polymer reagent (X-tremeGENE™, Roche, ref. 6366244001). Specifically, RNA was isolated with the RNeasy RNA extraction kit from Qiagen (ref. 74106), and was reverse transcribed using the Quantitect reverse transcription kit from Qiagen (ref. 205311). As shown in
The results demonstrate a reduction in expression of fibrosis-related genes in young (5 month old) primary mouse lung fibroblasts. The results also demonstrate a more pronounced reduction in expression of fibrosis-related genes in old (24 month old) primary mouse lung fibroblasts. As shown in
RNA-Seq was performed on primary mouse lung fibroblasts treated with SEQ ID NO: 6. RNA-Seq was performed on a NovaSeq 6000 with ribodepleted Illumina TruSeq® library preparation 150 PE. Samples were sequenced at an expected depth of up to 40M reads/sample. Related bioinformatics analysis pipeline for bulk RNA-Seq was as described above.
A pulmonary fibrosis (e.g., idiopathic pulmonary fibrosis) mouse model is established by an application of bleomycin, as shown in
Four test groups of mice were assays in this example. Group 1 mice received no bleomycin treatment and served as a negative control. Group 2 mice received bleomycin treatment and aerosol administration of vehicle and serve as a negative control of induced lung inflammation and lung fibrosis. Group 3 mice received bleomycin treatment and administration of a therapeutic ASO (e.g., mmASO-2). Group 4 mice received bleomycin treatment and administration of a known therapeutic serving as a positive control (e.g., pirfenidone). A sufficient number of animals were tested for each group for reproducibility of results and statistical analyses of the data. During the study and at the endpoint, animals were assayed for survival, body weight, analysis of bronchoalveolar lavage fluid (BALF) including ELISA assays of select cytokines. At the endpoint of the study, surviving animals were assayed with various tests of functional genomics and scoring of immune system function and pulmonary fibrosis.
A therapeutic ASO directed against the mouse homolog of LOC107986083 (e.g., mmASO-2) was administered according to the study protocol. Group 3 mice receiving administration of a therapeutic ASO received two administrations that were spaced apart by 3 days. 100 μg of ASO/mouse/administration was given by intratracheal microspray in an aerosol formulation on Day -5 and on Day -2. This study protocol was designed to assay a prophylactic response of ASO treatment in mice to bleomycin-induced lung inflammation and lung fibrosis. Bleomycin administration was given on Day 0.
Pirfenidone was administered to Group 4 mice starting at Day -1 and was continued twice daily until the endpoint of the study. Pirfendone was administered via oral galvage at a dosage of 100 mg/kg of body weight (100 μL/dose/BID).
Brochoalveolar lavage (BAL) was used to assess immune cell numbers present in mice of the various test groups.
RNA-Seq analysis was performed using bulk lung tissue of each test group at the study endpoint. Selected markers of fibrosis (e.g., Acta2, Collal, Col3al, Fap, Fn1, and Postn) were assayed. GSEA analysis of RNA-Seq data from bulk mouse lung tissue of mmASO-2-treated mice compared to ASO-Scr-treated mice (
An anti-ARDS proof-of-concept signal was determined by the use of CORAL ASO targeting.
Various cytokines were assayed in bulk lung tissue of each test group at the study endpoint using ELISA. Cytokines tested were IFNγ, IL-1b, IL-2, IL-4, IL-5, IL-6, KC/GRO, IL-10, IL-12p70, TNFα, IL-9, IL-15, MIP-2, IL27p28/IL-30, IP-10, IL-33, MIP-la, MCP-1 and IL-17A/F. As shown in
Typical plasma biochemistry markers (e.g., blood urea nitrogen, creatine, phosphorous, calcium, total protein, albumin, globulin, ALT, AST, ALP) were assayed for all test groups and were found to reside in the normal range with no toxicity observed. Normal ranges plasma biochemistry markers in mice for comparison to test results were obtained from Charles River C57/BL/6 mice datasheet information (Charles River C57BL/6NCtr Data —Clinical Chemistry).
As shown in
In this example, administration of oropharyngeal bleomycin induced histopathologic lesions typical for this model, including pulmonary fibrosis with mixed inflammatory cell infiltration/aggregate formation. Prophylactic treatment with an ASO targeting one or more lncRNA polyA transcripts of the mouse homolog of LOC107986083 exhibited efficacy in the reduction of inflammatory cell infiltrate severity but did not significantly reduce detected levels of fibrosis as compared to vehicle treatment. Treatment with the reference positive control compound pirfenidone produced slight, non-significant reductions in the severity of pulmonary fibrosis in this study.
Excessive organ infiltration of inflammatory monocytes and macrophages is a hallmark pathological feature of severe ARDS and severe COVID-19/ARDS. ASO-treated mice in this bleomycin study presented with a decrease of immune cell infiltration in the lungs. This was observed through i) histopathological scoring (infiltrates not visually observed in ASO-treated mice), ii) BAL differential counts (leukocyte numbers in the fluid were not increased), iii) cytokine ELISA from bulk lung tissue (IL-1b, MIP-la, IP-10, MCP-1, and MIP-2 are all significantly increased, and iv) RNA-Seq analysis (GO terms related to leukocyte adhesion and migration were downregulated). These results demonstrate that the CORAL locus is a potential regulator of leukocyte adhesion and/or infiltration in ARDS.
A human IPF gene signature was developed from in vitro and in vivo datasets described herein. This gene signature conveys statistical information regarding how the genes that were assayed which are stably differentially expressed across various in vitro and in vivo conditions behave in other datasets and how the developed gene signature constitutes a core signature of IPF. Development of the human IPF gene signatures used a meta-analysis methodology.
The IPF gene signature comprises select genes that are upregulated in the presence of IPF pathology. The IPF gene signature also comprises select genes that are downregulated in the presence of IPF pathology.
It shall be understood that different aspects of the disclosure can be appreciated individually, collectively, or in combination with each other. Various aspects of the disclosure described herein may be applied to any of the particular applications disclosed herein. The compositions of matter disclosed herein in the composition section of the present disclosure may be utilized in the method section including methods of use and production disclosed herein, or vice versa.
While preferred aspects of the present disclosure have been shown and described herein, it will be obvious to those skilled in the art that such aspects are provided by way of example only. It is not intended that the disclosure be limited by the specific examples provided within the specification. While the disclosure has been described with reference to the aforementioned specification, the descriptions and illustrations of the aspects herein are not meant to be construed in a limiting sense. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the disclosure. Furthermore, it shall be understood that all aspects of the disclosure are not limited to the specific depictions, configurations or relative proportions set forth herein which depend upon a variety of conditions and variables. It should be understood that various alternatives to the aspects of the disclosure described herein may be employed in practicing the disclosure. It is therefore contemplated that the disclosure shall also cover any such alternatives, modifications, variations or equivalents. It is intended that the following claims define the scope of the disclosure and that methods and structures within the scope of these claims and their equivalents be covered thereby.
This application is a continuation of International Application No. PCT/IB2023/000233, filed Apr. 28, 2023, which claims the benefit of U.S. Provisional Application No. 63/336,648 filed Apr. 29, 2022, which are incorporated herein by reference in their entireties.
| Number | Date | Country | |
|---|---|---|---|
| 63336648 | Apr 2022 | US |
| Number | Date | Country | |
|---|---|---|---|
| Parent | PCT/IB2023/000233 | Apr 2023 | WO |
| Child | 18927263 | US |