METHODS AND COMPOSITIONS FOR MODULATING SUPPRESSION OF LYMPHOCYTE ACTIVITY

Information

  • Patent Application
  • 20210130776
  • Publication Number
    20210130776
  • Date Filed
    October 01, 2018
    6 years ago
  • Date Published
    May 06, 2021
    3 years ago
Abstract
The subject matter disclosed herein is generally directed to a novel CD8+ T cell subtype associated with suppressive or regulatory T cell functions. Moreover, the subject matter disclosed herein is generally directed to methods and compositions for use of the subtype. Also, disclosed herein are gene signatures and markers associated with the subtype and use of said signatures and markers. Further disclosed are therapeutic methods of using said gene signatures and immune cell subtype. Further disclosed are pharmaceutical compositions comprising populations of CD8+ TILs depleted for a specific subtype. Further disclosed are interactions with other T cell subtypes.
Description
REFERENCE TO AN ELECTRONIC SEQUENCE LISTING

The contents of the electronic sequence listing (BROD_2167WP_ST25.txt”; Size is 8 Kilobytes and it was created on Jul. 1, 2020) is herein incorporated by reference in its entirety.


TECHNICAL FIELD

The subject matter disclosed herein is generally directed to CD4+ and CD8+T lymphocyte subtypes and their interactions associated with immune responses in cancer. Moreover, the subject matter disclosed herein is generally directed to detecting, isolating and modulating said subtypes.


BACKGROUND

Characterizing different T cell subpopulations and their underlying driving mechanisms contributes to our understanding of protective immunity in successful pathogen clearance, T cell regulation during uncontrolled tumor growth and chronic infections, and T cell regulation during autoimmunity. Recent advances on this front have enabled the development of improved vaccines and novel immune-based therapies for various cancers. Applicants have previously shown that a CD8 T cell dysfunction gene signature can be decoupled from activation gene signature and have shown that the signatures for each CD8 T cell state is present in distinct single cell populations (see, e.g., WO2017075451A1, WO2017075478A2, WO2017075465A1 and U.S. provisional application No. 62/384,557, filed Sep. 7, 2016). Previous studies have characterized subsets of regulatory T cells (Treg) that selectively suppress development of autoantibody formation by inhibiting function of follicular T-helper cells (see, e.g., US20130302276A1; and WO2016196912A1). It is believed that the breadth of the functional potential of CD4+ and CD8+ T cells is far from understood, and that gaining a deeper understanding will lead to further advancements.


Consequently, there exists a continuous need to provide additional and preferably improved markers, products and methods allowing to determine the functional state of immune cells. Likewise, there exists a continuous need to provide additional and preferably improved molecular targets involved in immune responses, as well as therapeutically useful substances and compositions impinging on such molecular targets to modulate immune responses.


Citation or identification of any document in this application is not an admission that such document is available as prior art to the present invention.


SUMMARY

It is an objective of the present invention to identify CD8+ TIL subtypes present in tumor infiltrating lymphocytes (TIL) during tumor growth. It is another objective of the present invention to detect gene signatures and biomarkers specific to the CD8+ TIL subtypes, whereby cells may be detected and isolated. It is another objective of the present invention to provide for adoptive cell transfer methods for treatment of a cancer by transferring more functional CD8+ TIL populations. It is another objective of the present invention to provide for treatment of a cancer by modulating CD8+ T cell populations to be more functional. It is another objective of the present invention to improve immunotherapy treatment.


In one aspect, the present invention provides for an isolated T cell characterized in that the T cell comprises expression of one or more genes selected from the group consisting of: TNFRSF9, PRF1, BHLHE40 (DEC1), IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, IKZF2 (Helios), MT1, KIT, SERPINE2, CCRL2, C5F1, EPAS1, RUNX2, SPRY2 and XCR1; or genes in Table 6. In another example embodiment, the isolated T cell is characterized in that the T cell does not comprise expression of HMMR and comprises expression of one or more genes selected from TNFRSF9, PRF1, BHLHE40 (DEC1), IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, IKZF2, MT1, KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2, SPRY2 and XCR1. In another example embodiment, the isolated T cell is characterized by expression of one or more CD8, TIM3, PD1, MT1, and IKZF2, as well as expression of one or more genes selected from the group consisting of TNFRSF9, PRF1, BHLHE40 (DEC1), IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2, SPRY2 and XCR1. In another example embodiment, the isolated T cell may be characterized by expression of one or more CD8, TIM3, PD1, MT1, and IKZF2, as well as expression of one or more genes selected from the group consisting of TNFRSF9, PRF1, BHLHE40 (DEC1), IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2, SPRY2 and XCR1, and does not comprise expression of HMMR. The isolated T cell may be further characterized in that the T cell comprises upregulation of one or more genes selected from the group consisting of TNFRSF9, PRF1, BHLHE40, IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, IKZF2, KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2, SPRY2 and XCR1 as compared to all CD8+TIM3+PD1+ T cells. The isolated T cell may be further characterized in that the T cell comprises downregulation of a cell cycle signature as compared to all CD8+TIM3+PD1+ T cells. The T cell may be further characterized in that the T cell suppresses T cell proliferation. The isolated T cell may be further characterized by a gene signature comprising one or more genes or polypeptides selected from Tables 1 to 6, preferably, Table 6. Tables 1 to 6 list the genes in ranked order (i.e., most specific to the cells described herein). In certain embodiments, the signature may comprise the top 10, 20, 50, 100, 200, 300, 400, or 500 top genes. In preferred embodiments, the signature comprises genes selected from the top 100, 50, 20, or top 10 genes in each ranked list. In other preferred embodiments, T cells are detected, isolated or targeted using cell surface or cytokines (e.g., Table 3). The T cell may be a human cell. The T cell may be autologous for a subject suffering from cancer.


In another aspect, the present invention provides for a method for detecting or quantifying T cells in a biological sample of a subject, the method comprising detecting or quantifying in a biological sample of the subject T cells as defined in any embodiment herein. The T cells may be detected or quantified using a set of markers comprising: a) TIM3, SERPINE2 and HMI or b) SERPINE2 and HMI or c) TIM3, KIT and HMI or d) TIM3, TNFRSF4 and HMMR; or e) any of (a), (b), (c) or (d) and one or more of CD8, CD45 and PD1; or any of (a), (b), (c), (d) or (e) and one or more of TNFRSF9, PRF1, BHLHE40, IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, IKZF2, KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2, SPRY2 and XCR1. The T cells may be detected or quantified using a technique selected from the group consisting of RT-PCR, RNA-seq, single cell RNA-seq, flow cytometry, mass cytometry, fluorescence activated cell sorting, fluorescence microscopy, affinity separation, magnetic cell separation, microfluidic separation, and combinations thereof.


In one embodiment, intact T cells may be detected or quantified using a set of surface markers comprising: a) TIM3, SERPINE2 and HMMR; or b) SERPINE2 and HMMR; or c) TIM3, KIT and HMMR; or d) TIM3, TNFRSF4 and HMMR; or e) any of (a), (b), (c) or (d) and one or more of CD8, CD45 and PD1; or any of (a), (b), (c), (d) or (e) and one or more of TNFRSF9, IL1R2, SLC2A3, TNFRSF4, KLRC1, IL18R1, TNFRSF18, LAT2, ADAM8, KIT, SERPINE2 and XCR1. The intact T cells may be detected or quantified using a technique selected from the group consisting of flow cytometry, fluorescence activated cell sorting, affinity separation, magnetic cell separation, microfluidic separation, and combinations thereof.


In another aspect, the present invention provides for a method for isolating T cells from a biological sample of a subject, the method comprising isolating from the biological sample T cells as defined in any embodiment herein. The T cells may be isolated using a set of surface markers comprising: a) TIM3, SERPINE2 and HMMR; or b) SERPINE2 and HMMR; or c) TIM3, KIT and HMMR; or d) TIM3, TNFRSF4 and HMMR; or e) any of (a), (b), (c) or (d) and one or more of CD8, CD45 and PD1; or any of (a), (b), (c), (d) or (e) and one or more of TNFRSF9, IL1R2, SLC2A3, TNFRSF4, KLRC1, IL18R1, TNFRSF18, LAT2, ADAM8, KIT, SERPINE2 and XCR1. The T cells may be isolated, using a technique selected from the group consisting of flow cytometry, fluorescence activated cell sorting, affinity separation, magnetic cell separation, microfluidic separation, and combinations thereof.


In certain embodiments, the technique for detecting, quantitating, or isolating T cells according to any embodiment herein may employ one or more agents capable of specifically binding to one or more gene products expressed or not expressed by the T cells, preferably on the cell surface of the T cells. The one or more agents may be one or more antibodies.


In certain embodiments, the biological sample may be a tumor sample obtained from a subject in need thereof. In certain embodiments, the biological sample may be a sample obtained from a subject suffering from an autoimmune disease. In certain embodiments, the biological sample may be a sample obtained from a subject suffering from a chronic infection. Not being bound by a theory detecting suppressive T cells in a biological sample may provide information as to the immune state of a subject (e.g., for prognosis, treatment selection). In certain embodiments, the biological sample may comprise ex vivo or in vitro T cells. Not being bound by a theory, it may be advantageous to detect or quantitate the presence of suppressive T cells in an ex vivo sample of T cells. For example, after the ex vivo T cells are treated with a differentiating agent or immunomodulatory. Not being bound by a theory, it may be advantageous to deplete suppressive T cells from an ex vivo population of T cells.


In another aspect, the present invention provides for a population of T cells comprising T cells as defined in any embodiment herein. The population of T cells may be depleted for T cells as defined in any embodiment herein by a method of isolation according to any embodiment herein. The population of T cells may comprise chimeric antigen receptor (CAR) T cells or T cells expressing an exogenous T-cell receptor (TCR). The population of T cells may comprise T cells autologous for a subject suffering from cancer. The population of T cells may comprise T cells displaying tumor specificity. Not being bound by a theory, the population of T cells may comprise a heterogeneous population of cells including effector and suppressor T cells. In certain embodiments, it is advantageous to remove the suppressive T cells (e.g., when an enhanced immune response is desired). The population of T cells may be expanded.


In certain embodiments, the population of T cells may comprise activated T cells. The population of T cells may comprise T cells activated with tumor specific antigens. The tumor specific antigens may be subject specific antigens.


In another aspect, the present invention provides for a pharmaceutical composition comprising the depleted T cell population as defined in any embodiment herein.


In another aspect, the present invention provides for a method of treating cancer comprising administering to a subject in need thereof the pharmaceutical composition according to any embodiment herein.


In another aspect, the present invention provides for a method of treating cancer in a subject in need thereof comprising: depleting T cells as defined in any embodiment herein from a population of T cells obtained from the subject; in vitro expanding the population of T cells; and administering the in vitro expanded population of T cells to the subject. The T cell population may be administered after ablation therapy or lymphodepletion therapy. Not being bound by a theory, ablation therapy or lymphodepletion therapy will eliminate any endogenous suppressive cells in a subject, whereby the subject and the cells administered may be depleted for suppressive T cells, thus the adoptive cell therapy may result in an enhanced anti-tumor response.


In another aspect, the present invention provides for a method of treating cancer or chronic infection in a subject in need thereof comprising administering to the subject a therapeutically effective amount of an agent: capable of reducing the activity of a T cell as defined in any embodiment herein; or capable of reducing the activity or expression of one or more genes or polypeptides selected from the group consisting of TNFRSF9, PRF1, BHLHE40, IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, IKZF2, KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2, SPRY2 and XCR1; or capable of targeting or binding to one or more cell surface exposed genes or polypeptides on a T cell as defined in any embodiment herein; or capable of targeting or binding to one or more receptors or ligands specific for a cell surface exposed gene or polypeptide on a T cell as defined in any embodiment herein; or capable of targeting or binding to one or more genes or polypeptides secreted from a T cell as defined in any embodiment herein; or capable of targeting or binding to one or more receptors specific for a gene or polypeptide secreted from a T cell as defined in any embodiment herein. The agent may comprise a therapeutic antibody, antibody fragment, antibody-like protein scaffold, aptamer, protein, CRISPR system or small molecule. The therapeutic antibody may be an antibody drug conjugate. The agent capable of targeting or binding to a cell surface exposed gene or polypeptide may comprise a CAR T cell capable of targeting or binding to the cell surface exposed gene or polypeptide.


In another aspect, the present invention provides for a method of treating an autoimmune disease in a subject in need thereof comprising administering to the subject a therapeutically effective amount of an agent capable of inducing the activity of a T cell as defined in any embodiment herein.


In another aspect, the present invention provides for a method of treating an autoimmune disease comprising administering T cells as defined in any embodiment herein to a subject in need thereof. Not being bound by a theory, administering suppressive T cells may reduce an autoimmune response in a subject.


In another aspect, the present invention provides for a method for identifying an immunomodulant capable of modulating one or more phenotypic aspects of the T cell as defined in any embodiment herein, comprising: applying a candidate immunomodulant to the T cell or T cell population; and detecting modulation of one or more phenotypic aspects of the T cell or T cell population by the candidate immunomodulant, thereby identifying the immunomodulant. The immunomodulant may be capable of modulating suppression of T cell proliferation by the T cell. Thus, in certain embodiments, detecting modulation of one or more phenotypic aspects comprises detecting modulation of a suppressive phenotype. The immunomodulant may comprise a therapeutic antibody, antibody fragment, antibody-like protein scaffold, aptamer, protein or small molecule.


In another aspect, the present invention provides for a pharmaceutical composition comprising the immunomodulant as defined in any embodiment herein.


In another aspect, the present invention provides for a method for determining the T cell status of a subject, or for diagnosing, prognosing or monitoring a disease comprising an immune component in a subject, the method comprising detecting or quantifying in a biological sample of the subject T cells as defined in any embodiment herein, wherein an increase as compared to a reference level indicates a suppressed immune response. The disease may be cancer, an autoimmune disease, or chronic infection.


In another aspect, the present invention provides for a method of preparing cells for use in adoptive cell transfer comprising: obtaining a population of T cells; and depleting suppressive T cells as defined in any embodiment herein from the population of T cells. The method may further comprise expanding the depleted cells. The method may further comprise activating the depleted cells. The population of T cells may comprise CAR T cells. The population of T cells may comprise autologous TILs.


In another aspect, the present invention provides for a method of screening for genes required for suppression of effector T cells by suppressive CD8+ T cells comprising: introducing a library of sgRNAs specific to a set of target genes to a population of T cells expressing a CRISPR system; culturing the cells in proliferating conditions in the presence of suppressive CD8 T cells according to any embodiment herein; determining sgRNAs that are enriched in proliferating T cells.


In another aspect, the present invention provides for a method of treating cancer or chronic infection in a subject in need thereof comprising administering to the subject CD8+ T cells modified to be resistant to suppressive CD8+ T cells, wherein the modified CD8+ T cells may be specific for the cancer or chronic infection. In certain embodiments, the CD8+ T cells modified to be resistant to suppressive CD8+ T cells comprise an inducible suicide gene. Not being bound by a theory, the cells may be killed to prevent a pathogenic autoimmune response.


In another aspect, the present invention provides for a method of treating cancer or chronic infection in a subject in need thereof comprising administering to the subject a therapeutically effective amount of an agent capable of blocking glucocorticoid signaling. The agent may be an antagonist of NR3C1. The antagonist may be a blocking antibody.


In another aspect, the present invention provides for a kit comprising reagents to detect at least one gene or polypeptide as defined in any embodiment herein.


In another aspect, the present invention provides for a method of treating cancer or chronic infection comprising reducing or eliminating the presence of an immune cell or changing a phenotype of the immune cell, at least at a disease or infection loci, wherein the immune cell is characterized by expression of CD8, TIM3, PD1, MT1, and IKZF2, and comprises expression of one or more genes selected from the group consisting of: TNFRSF9, PRF1, BHLHE40 (DEC1), IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2, SPRY2 and XCR1; or genes in Table 6. In certain embodiments, the immune cell does not comprise expression of HMMR. In certain embodiments, the presence of the immune cell is reduced or eliminated, or wherein a phenotype of the immune cell is changed by modulating expression of MT1 and/or MT2. The method of claim 58, wherein the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression of HELIOS (IKZF2). In certain embodiments, the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression or function of KIT. In certain embodiments, the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression or function of SERPINE2. In certain embodiments, the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression or function of TNFRSF4. In certain embodiments, the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression or function of ILR2. In certain embodiments, the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression or function of CSF1. In certain embodiments, the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression or function of CCRL2. In certain embodiments, the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression or function of IRF8. In certain embodiments, the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed, by modulating expression or function of RBPJ. In certain embodiments, the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed, by modulating expression or function of EPAS1. In certain embodiments, the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed, by modulating expression or function of RUNX2. In certain embodiments, the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed, by modulating expression or function of SPRY2. In certain embodiments, the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed, by modulating expression or function of STAT3. In certain embodiments, the presence of the immune cell is reduced or eliminated, or wherein a phenotype of the immune cell is changed by modulating expression of XCR1. In certain embodiments, the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed, by reducing a sensitivity of the immune cell to glucocorticoid signaling. In certain embodiments, modulating expression or function comprises inhibiting expression or function.


In another aspect, the present invention provides for a kit comprising reagents to detect at least one gene or polypeptide as defined in any of embodiment herein.


In another aspect, the present invention provides for an isolated T cell or population of T cells according to any of the preceding claims for use in the manufacture of a medicament for treating cancer, an autoimmune disease or chronic infection.


In another aspect, the present invention provides for a use of a T cell or population of T cells according to any of the preceding claims for treating cancer, an autoimmune disease or chronic infection.


In another aspect, the present invention provides for a method of treating cancer or chronic infection in a subject in need thereof comprising administering to the subject a therapeutically effective amount of an agent capable of inhibiting the interaction between XCL1 and XCR1.


An aspect of the invention provides the immune cell or immune cell population as taught herein for use in immunotherapy, such as adoptive immunotherapy, such as adoptive cell transfer. Also provided is a method of treating a subject in need thereof, particularly in need of immunotherapy, such as adoptive immunotherapy, such as adoptive cell transfer, comprising administering to said subject the immune cell or immune cell population as taught herein. Further provided is use of the immune cell or immune cell population as taught herein for the manufacture of a medicament for immunotherapy, such as adoptive immunotherapy, such as adoptive cell transfer. In certain embodiments, the immune cell is a T-cell, such as a CD8+ T-cell. In certain embodiments, the immunotherapy, adoptive immunotherapy or adoptive cell transfer may be for treating a proliferative disease, such as tumor or cancer, or a chronic infection, such as chronic viral infection.


In certain embodiments, an immune cell suitable for immunotherapy, such as a CD8+ T-cell, displays tumor specificity, more particularly displays specificity to a tumor antigen. In certain embodiments, an immune cell suitable for immunotherapy, such as a CD8+ T-cell, displays specificity to an antigen of an infectious agent, for example displays viral antigen specificity. In certain embodiments, an immune cell suitable for immunotherapy, such as a CD8+ T-cell, has been isolated from a tumor of a subject, preferably the cell is a tumor infiltrating lymphocyte (TIL). In certain embodiments, an immune cell suitable for immunotherapy, such as a CD8+ T-cell, comprises a chimeric antigen receptor (CAR). Such cell can also be suitably denoted as having been engineered to comprise or to express the CAR. In certain embodiments, the CAR comprises an extracellular antigen-binding element (or portion or domain) configured to specifically bind to a target antigen, a transmembrane domain, and an intracellular signaling domain. In certain embodiments, the intracellular signaling domain comprises a primary signaling domain and/or a costimulatory signaling domain. In certain embodiments, the CAR comprises the antigen-binding element, costimulatory signaling domain and primary signaling domain (such as CD3 zeta portion) in that order. In certain embodiments, the antigen-binding element comprises, consists of or is derived from an antibody, for example, the antigen-binding element is an antibody fragment. In certain embodiments, the antigen-binding element is derived from, for example is a fragment of, a monoclonal antibody, such as a human monoclonal antibody or a humanized monoclonal antibody. In certain embodiments, the antigen-binding element is a single-chain variable fragment (scFv). In certain preferred embodiments, the target antigen is selected from a group consisting of: CD19, BCMA, CLL-1, MAGE A3, MAGE A6, HPV E6, HPV E7, WT1, CD22, CD171, ROR1, MUC16, and SSX2. In certain preferred embodiments, the target antigen is CD19. In certain embodiments, the transmembrane domain is derived from the most membrane proximal component of the endodomain. In certain embodiments, the transmembrane domain is not CD3 zeta transmembrane domain. In certain embodiments, the transmembrane domain is a CD8α transmembrane domain or a CD28 transmembrane domain, preferably CD28 transmembrane domain. In certain embodiments, the primary signaling domain comprises a functional signaling domain of a protein selected from the group consisting of CD3 zeta, CD3 gamma, CD3 delta, CD3 epsilon, common FcR gamma (FCERIG), FcR beta (Fc Epsilon Rib), CD79a, CD79b, Fc gamma RIM, DAP10, and DAP12. In certain preferred embodiments, the primary signaling domain comprises a functional signaling domain of CD3ζ or FcRγ. In certain preferred embodiments, the primary signaling domain comprises a functional signaling domain of CD3ζ. In certain embodiments, the one or more costimulatory signaling domains comprise a functional signaling domain of a protein selected, each independently, from the group consisting of: CD27, CD28, 4-1BB (CD137), OX40, CD30, CD40, ICOS, lymphocyte function-associated antigen-1 (LFA-1), CD2, CD7, LIGHT, NKG2C, B7-H3, a ligand that specifically binds with CD83, CDS, ICAM-1, GITR, BAFFR, HVEM (LIGHTR), SLAMF7, NKp80 (KLRF1), CD160, CD19, CD4, CD8 alpha, CD8 beta, IL2R beta, IL2R gamma, IL7R alpha, ITGA4, VLA1, CD49a, ITGA4, IA4, CD49D, ITGA6, VLA-6, CD49f, ITGAD, CD11d, ITGAE, CD103, ITGAL, CD11a, LFA-1, ITGAM, CD11b, ITGAX, CD11c, ITGB1, CD29, ITGB2, CD18, ITGB7, TNFR2, TRANCE/RANKL, DNAM1 (CD226), SLAMF4 (CD244, 2B4), CD84, CD96 (Tactile), CEACAM1, CRTAM, Ly9 (CD229), CD160 (BY55), PSGL1, CD100 (SEMA4D), CD69, SLAMF6 (NTB-A, Ly108), SLAM (SLAMF1, CD150, IPO-3), BLAME (SLAMF8), SELPLG (CD162), LTBR, LAT, GADS, SLP-76, PAG/Cbp, NKp44, NKp30, NKp46, and NKG2D. In certain preferred embodiments, the one or more costimulatory signaling domains comprise a functional signaling domain of a protein selected, each independently, from the group consisting of: 4-1BB, CD27, and CD28. In certain preferred embodiments, the costimulatory signaling domain comprises a functional signaling domain of CD28. In certain embodiments, the CAR comprises an anti-CD19 scFv, an intracellular domain of a CD3ζ chain, and a signaling domain of CD28. In certain preferred embodiments, the CD28 sequence is as set forth in Genbank identifier NM 006139 (sequence version 1, 2 or 3) starting with the amino acid sequence IEVMYPPPY and continuing all the way to the carboxy-terminus of the protein. In certain preferred embodiments, the CAR is as included in KTE-C19 (axicabtagene ciloleucel) anti-CD19 CAR-T therapy product in development by Kite Pharma, Inc. In certain embodiments, an immune cell suitable for immunotherapy, such as a CD8+ T-cell, comprises an exogenous T-cell receptor (TCR). Such cell can also be suitably denoted as having been engineered to comprise or to express the TCR.


In certain embodiments, an immune cell suitable for immunotherapy, such as a CD8+ T-cell, may be further genetically modified, such as gene edited, i.e., a target locus of interest in the cell may be modified by a suitable gene editing tool or technique, such as without limitation CRISPR, TALEN or ZFN. An aspect relates to an immune cell obtainable by or obtained by said gene editing method, or progeny thereof, wherein the cell comprises a modification of the target locus not present in a cell not subjected to the method. Another aspect relates to a cell product from said cell or progeny thereof, wherein the product is modified in nature or quantity with respect to a cell product from a cell not subjected to the gene editing method. A further aspect provides an immune cell comprising a gene editing system, such as a CRISPR-Cas system, configured to carry out the modification of the target locus.


In certain preferred embodiments, the cell may be edited using any CRISPR system and method of use thereof as described herein. In certain preferred embodiments, cells are edited ex vivo and transferred to a subject in need thereof.


Further genetically modifying, such as gene editing, of the cell may be performed for example (1) to insert or knock-in an exogenous gene, such as an exogenous gene encoding a CAR or a TCR, at a preselected locus in the cell; (2) to knock-out or knock-down expression of an endogenous TCR in the cell; (3) to disrupt the target of a chemotherapeutic agent in the cell; (4) to knock-out or knock-down expression of an immune checkpoint protein or receptor in the cell; (5) to knock-out or knock-down expression of other gene or genes in the cell, the reduced expression or lack of expression of which can enhance the efficacy of adoptive therapies using the cell; (6) to knock-out or knock-down expression of an endogenous gene in a cell, said endogenous gene encoding an antigen targeted by an exogenous CAR or TCR; (7) to knock-out or knock-down expression of one or more MHC constituent proteins in the cell; (8) to activate a T cell, and/or increase the differentiation and/or proliferation of functionally exhausted or dysfunctional CD8+ T cells; and/or (9) to modulate CD8+ T cells, such that CD8+ T cells have increased resistance to exhaustion or dysfunction. In certain preferred embodiments, the cell may be edited to produce any one of the following combinations of the modifications set forth above: (1) and (2); (1) and (4); (2) and (4); (1), (2) and (4); (1) and (7); (2) and (7); (4) and (7); (1), (2) and (7); (1), (4) and (7); (1), (2), (4) and (7); optionally adding modification (8) or (9) to any one of the preceding combinations. In certain preferred embodiments, the targeted immune checkpoint protein or receptor is PD-1, PD-L1 and/or CTLA-4. In certain preferred embodiments, the targeted endogenous TCR gene or sequence may be TRBC1, TRBC2 and/or TRAC. In certain preferred embodiments, the targeted MEW constituent protein may be HLA-A, B and/or C, and/or B2M. In certain embodiments, the cell may thus be multiply edited (multiplex genome editing) to (1) knock-out or knock-down expression of an endogenous TCR (for example, TRBC1, TRBC2 and/or TRAC), (2) knock-out or knock-down expression of an immune checkpoint protein or receptor (for example PD1, PD-L1 and/or CTLA4); and (3) knock-out or knock-down expression of one or more MEW constituent proteins (for example, HLA-A, B and/or C, and/or B2M, preferably B2M).


These and other aspects, objects, features, and advantages of the example embodiments will become apparent to those having ordinary skill in the art upon consideration of the following detailed description of illustrated example embodiments.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1—illustrates the study design. Cells were sampled at the indicated time points from a mouse B16 melanoma model. Cells were sorted based on the following markers: 1) CD8+CD45+; 2) CD4+CD45+(Effector and Regulatory); 3) CD4-CD8-CD45+(NK cells, dendritic, macrophages); and 4) CD45− (fibroblasts, tumor cells). The cells were sequenced using plate based single cell sequencing.



FIG. 2—illustrates the number of TILs isolated from the B16 melanoma mouse model, the time points and the number of mice at each time point.



FIG. 3—illustrates clustering of CD8 T cells (left) and CD4 T cells (right).



FIG. 4—illustrates tSNE of CD8+ cells. 2592 cells were sequenced and 2017 passed extensive quality control. tSNE and clustering was performed on principal components (PC) 4-9 (PC1—transcription, PC2, PC3 strongly associated with sequencing batches).



FIG. 5—illustrates tSNE plots for each of the fifteen CD8+ clusters.



FIG. 6—illustrates further characterization of clusters 7, 8, 9 and 10 for expression of TIM-3 and PD-1.



FIG. 7—illustrates a plot showing decoupled dysfunction and activation signatures based on signatures disclosed in Singer et al. 2016.



FIG. 8—illustrates that clusters 7 and 9 are distinguished by the decoupling of dysfunction and activation signatures.



FIG. 9—illustrates that cluster 7 is Tim3+PD1+ and high for a CD8 Treg signature and cluster 9 does not express a CD8 Treg signature.



FIG. 10—illustrates that clusters 7 and 8 express a CD8 Treg signature.



FIG. 11—illustrates that cluster 7 expresses MT1 and Helios (IKZF2).



FIG. 12—illustrates that MT+PD-1+TIM3+ double positive (DP) cells are more suppressive than MT−/−DP.



FIG. 13—illustrates that cluster 9 and 10 express different signatures from cluster 7. Cluster 7 is low for a cell cycle and CD8 activation signature.



FIG. 14—illustrates transmembrane receptors that can be used to sort cluster 7 cells.



FIG. 15—illustrates cytokines/chemokines expressed by cluster 7.



FIG. 16—illustrates transcription factors significantly upregulated in cluster 7 as compared to clusters 10 and 9.



FIG. 17—illustrates FACS sorting of CD8 T cells for the markers PD1, TIM3, Helios and Ki-67.



FIG. 18—illustrates FACS sorting of CD8 T cells for the markers PD1, TIM3, cKIT and Helios and Ki-67.



FIG. 19—illustrates tSNE of CD4+ cells. 2496 cells were sequenced (26 plates) and 1478 passed extensive quality control. Shown is Foxp3 expression (marker for CD4+ Tregs).



FIG. 20—illustrates tSNE plots for each of the fourteen CD4+ clusters.



FIG. 21—illustrates major CD4 Treg populations.



FIG. 22—illustrates Tim+ expressing Treg populations.



FIG. 23—illustrates that clusters 4 and 7 express a Th1 signature and cytokine secretion signature.



FIG. 24—illustrates that there are positive and negative correlations across the CD8 and CD4 clusters.



FIG. 25—illustrates significant correlations between the CD8 and CD4 clusters. Red indicates a negative correlation and blue indicates a positive correlation.



FIG. 26—illustrates a heatmap indicating significant correlations between the CD8 and CD4 clusters.



FIG. 27—illustrates cell-cell interactions based on expression of receptors and ligands on the CD4 and CD8 clusters



FIG. 28—illustrates analysis of single cell TILs.



FIG. 29—illustrates the study design. Cells were sampled at 5 time points from 12 B16 melanoma mice. Cells were sorted based on the following markers: CD8+, CD4+ and CD45+. The cells were sequenced using plate based single cell sequencing.



FIG. 30—illustrates tSNE clustering of 2,017 CD8 T cells (left) and 1,478 CD4 T cells (right).



FIG. 31—illustrates that single-cell RNA-seq identifies activation-like and dysfunction-like populations by clustering CD8 T cells.



FIG. 32—illustrates that clusters high for a dysfunction signature are high for a CD8+ T regulatory signature.



FIG. 33—illustrates that a suppressive CD8+ population exists in tumors and is weakened by MT KO.



FIG. 34—illustrates the identification of CD8 cluster 7 markers by FACS.



FIG. 35 illustrates the expression in tSNE plots of CD8 cluster 7 markers.



FIG. 36—illustrates that the relative frequency of dysfunctional CD8+ T cells in a tumor is correlated with CD4+ Treg frequency.



FIG. 37—illustrates CD4/CD8 cell connections.



FIG. 38—illustrates expression in tSNE plots of XCL1 in cluster 8 and XCR1 in cluster 7.



FIG. 39—illustrates expression in tSNE plots of CCL1 in cluster 8 and CCR8 in clusters 7 and 8 and in Treg+Tim3+CD4 cells.



FIG. 40—illustrates analysis of single cells from the mouse model time points using the 10× genomics platform. Cell counts taken for cells sorted by day (left) and sorted by size (right) are shown.



FIG. 41—illustrates the first step in the 10× analysis. CD3+ cells are selected. The count of all cells and CD3 cells were taken, as well as the percentage of CD3. Shown is time point 11.



FIG. 42—illustrates the general statistics for all time points taken.



FIG. 43—illustrates CD8/CD4 partitioning of the clusters. Shown is time point 9.



FIG. 44—illustrates the fourth step in the 10× analysis. CD8/CD4 cells are selected and batch corrected across time points.



FIG. 45—illustrates strict selection of CD8 cells and plots based on mouse, time point/batch, and by clustering.



FIG. 46—illustrates tSNE plots of CD8 cell cluster specific reference genes.



FIG. 47—illustrates tSNE plots of CD8 cell cluster specific reference genes.



FIG. 48—illustrates that the same populations of cells are observed in the plate based and 10× single cell sequencing.





DETAILED DESCRIPTION

Unless defined otherwise, technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure pertains. Definitions of common terms and techniques in molecular biology may be found in Molecular Cloning: A Laboratory Manual, 2nd edition (1989) (Sambrook, Fritsch, and Maniatis); Molecular Cloning: A Laboratory Manual, 4th edition (2012) (Green and Sambrook); Current Protocols in Molecular Biology (1987) (F. M. Ausubel et al. eds.); the series Methods in Enzymology (Academic Press, Inc.): PCR 2: A Practical Approach (1995) (M. J. MacPherson, B. D. Hames, and G. R. Taylor eds.): Antibodies, A Laboraotry Manual (1988) (Harlow and Lane, eds.): Antibodies A Laboraotry Manual, 2nd edition 2013 (E. A. Greenfield ed.); Animal Cell Culture (1987) (R. I. Freshney, ed.); Benjamin Lewin, Genes IX, published by Jones and Bartlet, 2008 (ISBN 0763752223); Kendrew et al. (eds.), The Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (ISBN 0632021829); Robert A. Meyers (ed.), Molecular Biology and Biotechnology: a Comprehensive Desk Reference, published by VCH Publishers, Inc., 1995 (ISBN 9780471185710); Singleton et al., Dictionary of Microbiology and Molecular Biology 2nd ed., J. Wiley & Sons (New York, N.Y. 1994), March, Advanced Organic Chemistry Reactions, Mechanisms and Structure 4th ed., John Wiley & Sons (New York, N.Y. 1992); and Marten H. Hofker and Jan van Deursen, Transgenic Mouse Methods and Protocols, 2nd edition (2011).


As used herein, the singular forms “a”, “an”, and “the” include both singular and plural referents unless the context clearly dictates otherwise.


The term “optional” or “optionally” means that the subsequent described event, circumstance or substituent may or may not occur, and that the description includes instances where the event or circumstance occurs and instances where it does not.


The recitation of numerical ranges by endpoints includes all numbers and fractions subsumed within the respective ranges, as well as the recited endpoints.


The terms “about” or “approximately” as used herein when referring to a measurable value such as a parameter, an amount, a temporal duration, and the like, are meant to encompass variations of and from the specified value, such as variations of +/−10% or less, preferably +/−5% or less, more preferably +/−1% or less, and still more preferably +/−0.1% or less of and from the specified value, insofar such variations are appropriate to perform in the disclosed invention. It is to be understood that the value to which the modifier “about” or “approximately” refers is itself also specifically, and preferably, disclosed.


Reference throughout this specification to “one embodiment”, “an embodiment” means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, appearances of the phrases “in one embodiment” or “in an embodiment” in various places throughout this specification are not necessarily all referring to the same embodiment, but may. Furthermore, the particular features, structures or characteristics may be combined in any suitable manner, as would be apparent to a person skilled in the art from this disclosure, in one or more embodiments. Furthermore, while some embodiments described herein include some but not other features included in other embodiments, combinations of features of different embodiments are meant to be within the scope of the invention, and form different embodiments, as would be understood by those in the art. For example, in the appended claims, any of the claimed embodiments can be used in any combination.


The terms “subject”, “individual” or “patient” are used interchangeably throughout this specification, and typically and preferably denote humans, but may also encompass reference to non-human animals, preferably warm-blooded animals, even more preferably mammals, such as, e.g., non-human primates, rodents, canines, felines, equines, ovines, porcines, and the like. The term “non-human animals” includes all vertebrates, e.g., mammals, such as non-human primates, (particularly higher primates), sheep, dog, rodent (e.g. mouse or rat), guinea pig, goat, pig, cat, rabbits, cows, and non-mammals such as chickens, amphibians, reptiles etc. In one embodiment, the subject is a non-human mammal. In another embodiment, the subject is human. In another embodiment, the subject is an experimental animal or animal substitute as a disease model. The term does not denote a particular age or sex. Thus, adult and newborn subjects, as well as fetuses, whether male or female, are intended to be covered. Examples of subjects include humans, dogs, cats, cows, goats, and mice. The term subject is further intended to include transgenic species.


The terms “subtype”, “subset” or “subpopulation” are used interchangeably throughout this specification.


All publications, published patent documents, and patent applications cited herein are hereby incorporated by reference to the same extent as though each individual publication, published patent document, or patent application was specifically and individually indicated as being incorporated by reference.


Overview

Embodiments disclosed herein relate to cell products, substances, compositions, markers, marker signatures, molecular targets, kits of parts and methods useful in characterizing, evaluating and modulating the immune system and immune responses. The CD8+ T cells of the present invention were discovered by analysis of single immune cells obtained at several time points from a mouse tumor model (B16). The transcriptomes of the CD8+ T cells were analyzed. The T cells of the present invention were characterized as a suppressive CD8+ T cell population required to dampen excessive immune responses and prevent autoimmunity (e.g., a subtype of CD8 Tregs). Applicants identified markers expressed by the CD8+ T cells that can be used to detect and/or quantitate the T cells or specifically target the T cells therapeutically. Furthermore, the surface cell markers can be used to detect, quantitate and isolate the T cells. The identified markers can also be used to distinguish between PD1+Tim3+CD8+ T cell subtypes. Moreover, Applicants can confirm the presence of the CD8+ T cells in human samples. In certain embodiments, the T cell is characterized by expression of CD8, TIM3, PD1, MT1, and IKZF2, and low or no expression of HMMR. The T cell is further characterized by expression of one or more of TNFRSF9, PRF1, BHLHE40 (DEC1), IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2, SPRY2 and XCR1, or one or more genes in Table 6, preferably upregulated as compared to all CD8+TIM3+PD1+ T cells in a population of cells. The T cell may be further characterized by expression of a gene signature comprising any gene or combination of genes selected from Tables 1 to 6.


In certain embodiments, depletion of the suppressive T cells may be used in adoptive cell transfer (e.g., TIL therapy, CAR T therapy). In certain embodiments, T cells (e.g., tumor infiltrating lymphocytes or TILs) may obtained from a subject and depleted for the T cells described herein ex vivo. The term “ex vivo” is encompassed by the term “in vitro.” The term “in vitro” generally denotes outside, or external to, a body, e.g., an animal or human body. Not being bound by a theory removing CD8+ Tregs from a population of T cells used in adoptive cell transfer would allow an enhanced immune response. Not being bound by a theory, CD8 Tregs normally prevent the immune system from targeting self-antigens, but in the case of cancer Tregs may prevent immune cells from targeting cancer cells through suppression of effector cells. In certain embodiments, a population of T cells enriched for suppressive T cells may be used in treating an autoimmune disease. In certain embodiments, suppressive T cells may be isolated from a subject suffering from an autoimmunity disease, expanded and transferred back to the subject.


Particular advantageous uses include methods for identifying agents capable of modulating the T cells based on their gene signatures, protein signature, and/or other genetic or epigenetic signature as defined herein. In certain example embodiments, detection or quantifying the T cells may be used to determine responsiveness to various therapeutics (e.g., a decrease in the T cells may indicate an immunotherapy is effective). Not being bound by a theory, checkpoint blockade therapy may specifically target the T cells of the present invention. In certain embodiments, cytokines or differentiating agents may be used to shift the balance of T cells to be less or more suppressive.


In one aspect, the invention relates to a signature or set of biomarkers that distinguish between CD8+ T cells. The signature may be a gene signature, protein signature, and/or other genetic or epigenetic signature of particular tumor cell subpopulations, as defined herein. In certain embodiments, CD8+ T cell subtypes may be detected and isolated by subtype specific signature biomarkers or combinations thereof.


In certain embodiments, pharmaceutical compositions comprising populations of T cells wherein the T cells of the present invention are depleted may be used in treating cancer (e.g., adoptive cell transfer). In certain embodiments, populations of cells depleted for the T cells of the present invention are used in combination with other therapies (e.g., checkpoint blockade therapy, CAR T cell therapy). In certain embodiments, pharmaceutical compositions comprising populations of T cells wherein the T cells of the present invention are enriched may be used in treating an autoimmune disease.


The invention further relates to agents capable of inducing or suppressing particular immune cell populations based on the gene signatures, protein signature, and/or other genetic or epigenetic signature as defined herein, as well as their use for modulating, such as inducing or repressing, a particular gene signature, protein signature, and/or other genetic or epigenetic signature. In one embodiment, genes in one population of cells may be activated or suppressed in order to affect the cells of another population (e.g., suppressive T cells may be activated or inactivated to enhance or repress activity of effector T cells). Not being bound by a theory, the CD8+ T cells described herein are affected by other immune cells in the tumor microenvironment (e.g., antigen presenting cells). In related aspects, modulating, such as inducing or repressing, a particular gene signature, protein signature, and/or other genetic or epigenetic signature may modify overall immune cell composition, such as immune cell composition, such as immune cell subpopulation composition or distribution, or functionality.


In further aspects, the invention relates to a signature or set of biomarkers that may be detected in combination. The signature detected in combination may be a gene signature, protein signature, and/or other genetic or epigenetic signature of a particular tumor cell (sub)population (e.g., tumor cells capable of immune evasion, tumor cells having specific mutations). The invention hereto also further relates to particular tumor cell subpopulations, which may be identified based on the methods according to the invention as discussed herein; as well as methods to target such cell subpopulations, such as in therapeutics (e.g., adoptive cell therapy, CART cells, agents capable of modulating T cells as defined herein); and screening methods to identify agents capable of inducing or suppressing particular tumor cell (sub)populations.


The term “immune cell” as used throughout this specification generally encompasses any cell derived from a hematopoietic stem cell that plays a role in the immune response. The term is intended to encompass immune cells both of the innate or adaptive immune system. The immune cell as referred to herein may be a leukocyte, at any stage of differentiation (e.g., a stem cell, a progenitor cell, a mature cell) or any activation stage. Immune cells include lymphocytes (such as natural killer cells, T-cells (including, e.g., thymocytes, Th or Tc; Th1, Th2, Th17, Thαβ, CD4+, CD8+, effector Th, memory Th, regulatory Th, CD4+/CD8+ thymocytes, CD4−/CD8−thymocytes, γδ T cells, etc.) or B-cells (including, e.g., pro-B cells, early pro-B cells, late pro-B cells, pre-B cells, large pre-B cells, small pre-B cells, immature or mature B-cells, producing antibodies of any isotype, T1 B-cells, T2, B-cells, naïve B-cells, GC B-cells, plasmablasts, memory B-cells, plasma cells, follicular B-cells, marginal zone B-cells, B-1 cells, B-2 cells, regulatory B cells, etc.), such as for instance, monocytes (including, e.g., classical, non-classical, or intermediate monocytes), (segmented or banded) neutrophils, eosinophils, basophils, mast cells, histiocytes, microglia, including various subtypes, maturation, differentiation, or activation stages, such as for instance hematopoietic stem cells, myeloid progenitors, lymphoid progenitors, myeloblasts, promyelocytes, myelocytes, metamyelocytes, monoblasts, promonocytes, lymphoblasts, prolymphocytes, small lymphocytes, macrophages (including, e.g., Kupffer cells, stellate macrophages, M1 or M2 macrophages), (myeloid or lymphoid) dendritic cells (including, e.g., Langerhans cells, conventional or myeloid dendritic cells, plasmacytoid dendritic cells, mDC-1, mDC-2, Mo-DC, HP-DC, veiled cells), granulocytes, polymorphonuclear cells, antigen-presenting cells (APC), etc.


As used throughout this specification, “immune response” refers to a response by a cell of the immune system, such as a B cell, T cell (CD4+ or CD8+), regulatory T cell, antigen-presenting cell, dendritic cell, monocyte, macrophage, NKT cell, NK cell, basophil, eosinophil, or neutrophil, to a stimulus. In some embodiments, the response is specific for a particular antigen (an “antigen-specific response”), and refers to a response by a CD4 T cell, CD8 T cell, or B cell via their antigen-specific receptor. In some embodiments, an immune response is a T cell response, such as a CD4+ response or a CD8+ response. Such responses by these cells can include, for example, cytotoxicity, proliferation, cytokine or chemokine production, trafficking, or phagocytosis, and can be dependent on the nature of the immune cell undergoing the response.


T cell response refers more specifically to an immune response in which T cells directly or indirectly mediate or otherwise contribute to an immune response in a subject. T cell-mediated response may be associated with cell mediated effects, cytokine mediated effects, and even effects associated with B cells if the B cells are stimulated, for example, by cytokines secreted by T cells. By means of an example but without limitation, effector functions of MEW class I restricted Cytotoxic T lymphocytes (CTLs), may include cytokine and/or cytolytic capabilities, such as lysis of target cells presenting an antigen peptide recognized by the T cell receptor (naturally-occurring TCR or genetically engineered TCR, e.g., chimeric antigen receptor, CAR), secretion of cytokines, preferably IFN gamma, TNF alpha and/or or more immunostimulatory cytokines, such as IL-2, and/or antigen peptide-induced secretion of cytotoxic effector molecules, such as granzymes, perforins or granulysin. By means of example but without limitation, for MEW class II restricted T helper (Th) cells, effector functions may be antigen peptide-induced secretion of cytokines, preferably, IFN gamma, TNF alpha, IL-4, IL5, IL-10, and/or IL-2. By means of example but without limitation, for T regulatory (Treg) cells, effector functions may be antigen peptide-induced secretion of cytokines, preferably, IL-10, IL-35, and/or TGF-beta. B cell response refers more specifically to an immune response in which B cells directly or indirectly mediate or otherwise contribute to an immune response in a subject. Effector functions of B cells may include in particular production and secretion of antigen-specific antibodies by B cells (e.g., polyclonal B cell response to a plurality of the epitopes of an antigen (antigen-specific antibody response)), antigen presentation, and/or cytokine secretion.


The term “antigen” as used throughout this specification refers to a molecule or a portion of a molecule capable of being bound by an antibody, or by a T cell receptor (TCR) when presented by MHC molecules. At the molecular level, an antigen is characterized by its ability to be bound at the antigen-binding site of an antibody. The specific binding denotes that the antigen will be bound in a highly selective manner by its cognate antibody and not by the multitude of other antibodies which may be evoked by other antigens. An antigen is additionally capable of being recognized by the immune system. In some instances, an antigen is capable of eliciting a humoral immune response in a subject. In some instances, an antigen is capable of eliciting a cellular immune response in a subject, leading to the activation of B- and/or T-lymphocytes. In some instances, an antigen is capable of eliciting a humoral and cellular immune response in a subject. Hence, an antigen may be preferably antigenic and immunogenic. Alternatively, an antigen may be antigenic and not immunogenic. Typically, an antigen may be a peptide, polypeptide, protein, nucleic acid, an oligo- or polysaccharide, or a lipid, or any combination thereof, a glycoprotein, proteoglycan, glycolipid, etc. In certain embodiments, an antigen may be a peptide, polypeptide, or protein. An antigen may have one or more than one epitope. The terms “antigenic determinant” or “epitope” generally refer to the region or part of an antigen that specifically reacts with or is recognized by the immune system, specifically by antibodies, B cells, or T cells.


An antigen as contemplated throughout this specification may be obtained by any means available to a skilled person, e.g., may be isolated from a naturally-occurring material comprising the antigen, or may be produced recombinantly by a suitable host or host cell expression system and optionally isolated therefrom (e.g., a suitable bacterial, yeast, fungal, plant or animal host or host cell expression system), or may be produced recombinantly by cell-free transcription or translation, or non-biological nucleic acid or peptide synthesis.


The term “tumor antigen” as used throughout this specification refers to an antigen that is uniquely or differentially expressed by a tumor cell, whether intracellular or on the tumor cell surface (preferably on the tumor cell surface), compared to a normal or non-neoplastic cell. By means of example, a tumor antigen may be present in or on a tumor cell and not typically in or on normal cells or non-neoplastic cells (e.g., only expressed by a restricted number of normal tissues, such as testis and/or placenta), or a tumor antigen may be present in or on a tumor cell in greater amounts than in or on normal or non-neoplastic cells, or a tumor antigen may be present in or on tumor cells in a different form than that found in or on normal or non-neoplastic cells. The term thus includes tumor-specific antigens (TSA), including tumor-specific membrane antigens, tumor-associated antigens (TAA), including tumor-associated membrane antigens, embryonic antigens on tumors, growth factor receptors, growth factor ligands, etc. The term further includes cancer/testis (CT) antigens. Examples of tumor antigens include, without limitation, β-human chorionic gonadotropin (βHCG), glycoprotein 100 (gp100/Pme117), carcinoembryonic antigen (CEA), tyrosinase, tyrosinase-related protein 1 (gp75/TRP1), tyrosinase-related protein 2 (TRP-2), NY-BR-1, NY-CO-58, NY-ESO-1, MN/gp250, idiotypes, telomerase, synovial sarcoma X breakpoint 2 (SSX2), mucin 1 (MUC-1), antigens of the melanoma-associated antigen (MAGE) family, high molecular weight-melanoma associated antigen (HMW-MAA), melanoma antigen recognized by T cells 1 (MART1), Wilms' tumor gene 1 (WT1), HER2/neu, mesothelin (MSLN), alphafetoprotein (AFP), cancer antigen 125 (CA-125), and abnormal forms of ras or p53 (see also, WO2016187508A2). Tumor antigens may also be subject specific (e.g., subject specific neoantigens; see, e.g., U.S. Pat. No. 9,115,402; and international patent application publication numbers WO2016100977A1, WO2014168874A2, WO2015085233A1, and WO2015095811A2).


Biomarkers and Signatures

The invention further relates to various biomarkers for detecting CD8+ T cell populations. As used herein “marker” and “biomarker” are used interchangeably. In certain example embodiments, suppressive CD8+ T cell populations are present in a population of tumor infiltrating lymphocytes (TIL). The suppressive T cell populations may be detected by detecting one or more biomarkers in a sample. The set of markers may comprise one or more genes or polypeptides, e.g., TIM3, SERPINE2, HMMR, KIT, TNFRSF4, CD8, CD45, PD1, TNFRSF9, PRF1, BHLHE40, IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, IKZF2, KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2 and SPRY2. In certain embodiments, the markers include the following combinations: a) TIM3, SERPINE2 and HMMR; or b) SERPINE2 and HMMR; or c) TIM3, KIT and HMMR; or d) TIM3, TNFRSF4 and HMMR; or e) any of (a), (b), (c) or (d) and one or more of CD8, CD45 and PD1; or any of (a), (b), (c), (d) or (e) and one or more of TNFRSF9, PRF1, BHLHE40, IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, IKZF2, KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2 and SPRY2.


The term “biomarker” is widespread in the art and commonly broadly denotes a biological molecule, more particularly an endogenous biological molecule, and/or a detectable portion thereof, whose qualitative and/or quantitative evaluation in a tested object (e.g., in or on a cell, cell population, tissue, organ, or organism, e.g., in a biological sample of a subject) is predictive or informative with respect to one or more aspects of the tested object's phenotype and/or genotype. The terms “marker” and “biomarker” may be used interchangeably throughout this specification. Biomarkers as intended herein may be nucleic acid-based or peptide-, polypeptide- and/or protein-based. For example, a marker may be comprised of peptide(s), polypeptide(s) and/or protein(s) encoded by a given gene, or of detectable portions thereof. Further, whereas the term “nucleic acid” generally encompasses DNA, RNA and DNA/RNA hybrid molecules, in the context of markers the term may typically refer to heterogeneous nuclear RNA (hnRNA), pre-mRNA, messenger RNA (mRNA), or complementary DNA (cDNA), or detectable portions thereof. Such nucleic acid species are particularly useful as markers, since they contain qualitative and/or quantitative information about the expression of the gene. Particularly preferably, a nucleic acid-based marker may encompass mRNA of a given gene, or cDNA made of the mRNA, or detectable portions thereof. Any such nucleic acid(s), peptide(s), polypeptide(s) and/or protein(s) encoded by or produced from a given gene are encompassed by the term “gene product(s)”.


Preferably, markers as intended herein may be extracellular or cell surface markers, as methods to measure extracellular or cell surface marker(s) need not disturb the integrity of the cell membrane and may not require fixation/permeabilization of the cells.


Unless otherwise apparent from the context, reference herein to any marker, such as a peptide, polypeptide, protein, or nucleic acid, may generally also encompass modified forms of said marker, such as bearing post-expression modifications including, for example, phosphorylation, glycosylation, lipidation, methylation, cysteinylation, sulphonation, glutathionylation, acetylation, oxidation of methionine to methionine sulphoxide or methionine sulphone, and the like.


The term “peptide” as used throughout this specification preferably refers to a polypeptide as used herein consisting essentially of 50 amino acids or less, e.g., 45 amino acids or less, preferably 40 amino acids or less, e.g., 35 amino acids or less, more preferably 30 amino acids or less, e.g., 25 or less, 20 or less, 15 or less, 10 or less or 5 or less amino acids.


The term “polypeptide” as used throughout this specification generally encompasses polymeric chains of amino acid residues linked by peptide bonds. Hence, insofar a protein is only composed of a single polypeptide chain, the terms “protein” and “polypeptide” may be used interchangeably herein to denote such a protein. The term is not limited to any minimum length of the polypeptide chain. The term may encompass naturally, recombinantly, semi-synthetically or synthetically produced polypeptides. The term also encompasses polypeptides that carry one or more co- or post-expression-type modifications of the polypeptide chain, such as, without limitation, glycosylation, acetylation, phosphorylation, sulfonation, methylation, ubiquitination, signal peptide removal, N-terminal Met removal, conversion of pro-enzymes or pre-hormones into active forms, etc. The term further also includes polypeptide variants or mutants which carry amino acid sequence variations vis-à-vis a corresponding native polypeptide, such as, e.g., amino acid deletions, additions and/or substitutions. The term contemplates both full-length polypeptides and polypeptide parts or fragments, e.g., naturally-occurring polypeptide parts that ensue from processing of such full-length polypeptides.


The term “protein” as used throughout this specification generally encompasses macromolecules comprising one or more polypeptide chains, i.e., polymeric chains of amino acid residues linked by peptide bonds. The term may encompass naturally, recombinantly, semi-synthetically or synthetically produced proteins. The term also encompasses proteins that carry one or more co- or post-expression-type modifications of the polypeptide chain(s), such as, without limitation, glycosylation, acetylation, phosphorylation, sulfonation, methylation, ubiquitination, signal peptide removal, N-terminal Met removal, conversion of pro-enzymes or pre-hormones into active forms, etc. The term further also includes protein variants or mutants which carry amino acid sequence variations vis-à-vis a corresponding native protein, such as, e.g., amino acid deletions, additions and/or substitutions. The term contemplates both full-length proteins and protein parts or fragments, e.g., naturally-occurring protein parts that ensue from processing of such full-length proteins.


The reference to any marker, including any peptide, polypeptide, protein, or nucleic acid, corresponds to the marker commonly known under the respective designations in the art. The terms encompass such markers of any organism where found, and particularly of animals, preferably warm-blooded animals, more preferably vertebrates, yet more preferably mammals, including humans and non-human mammals, still more preferably of humans.


The terms particularly encompass such markers, including any peptides, polypeptides, proteins, or nucleic acids, with a native sequence, i.e., ones of which the primary sequence is the same as that of the markers found in or derived from nature. A skilled person understands that native sequences may differ between different species due to genetic divergence between such species. Moreover, native sequences may differ between or within different individuals of the same species due to normal genetic diversity (variation) within a given species. Also, native sequences may differ between or even within different individuals of the same species due to somatic mutations, or post-transcriptional or post-translational modifications. Any such variants or isoforms of markers are intended herein. Accordingly, all sequences of markers found in or derived from nature are considered “native”. The terms encompass the markers when forming a part of a living organism, organ, tissue or cell, when forming a part of a biological sample, as well as when at least partly isolated from such sources. The terms also encompass markers when produced by recombinant or synthetic means.


In certain embodiments, markers, including any peptides, polypeptides, proteins, or nucleic acids, may be human, i.e., their primary sequence may be the same as a corresponding primary sequence of or present in a naturally occurring human markers. Hence, the qualifier “human” in this connection relates to the primary sequence of the respective markers, rather than to their origin or source. For example, such markers may be present in or isolated from samples of human subjects or may be obtained by other means (e.g., by recombinant expression, cell-free transcription or translation, or non-biological nucleic acid or peptide synthesis).


The reference herein to any marker, including any peptide, polypeptide, protein, or nucleic acid, also encompasses fragments thereof. Hence, the reference herein to measuring (or measuring the quantity of) any one marker may encompass measuring the marker and/or measuring one or more fragments thereof.


For example, any marker and/or one or more fragments thereof may be measured collectively, such that the measured quantity corresponds to the sum amounts of the collectively measured species. In another example, any marker and/or one or more fragments thereof may be measured each individually. The terms encompass fragments arising by any mechanism, in vivo and/or in vitro, such as, without limitation, by alternative transcription or translation, exo- and/or endo-proteolysis, exo- and/or endo-nucleolysis, or degradation of the peptide, polypeptide, protein, or nucleic acid, such as, for example, by physical, chemical and/or enzymatic proteolysis or nucleolysis.


The term “fragment” as used throughout this specification with reference to a peptide, polypeptide, or protein generally denotes a portion of the peptide, polypeptide, or protein, such as typically an N- and/or C-terminally truncated form of the peptide, polypeptide, or protein. Preferably, a fragment may comprise at least about 30%, e.g., at least about 50% or at least about 70%, preferably at least about 80%, e.g., at least about 85%, more preferably at least about 90%, and yet more preferably at least about 95% or even about 99% of the amino acid sequence length of said peptide, polypeptide, or protein. For example, insofar not exceeding the length of the full-length peptide, polypeptide, or protein, a fragment may include a sequence of ≥5 consecutive amino acids, or ≥10 consecutive amino acids, or ≥20 consecutive amino acids, or ≥30 consecutive amino acids, e.g., ≥40 consecutive amino acids, such as for example ≥50 consecutive amino acids, e.g., ≥60, ≥70, ≥80, ≥90, ≥100, ≥200, ≥300, ≥400, ≥500 or ≥600 consecutive amino acids of the corresponding full-length peptide, polypeptide, or protein.


The term “fragment” as used throughout this specification with reference to a nucleic acid (polynucleotide) generally denotes a 5′- and/or 3′-truncated form of a nucleic acid. Preferably, a fragment may comprise at least about 30%, e.g., at least about 50% or at least about 70%, preferably at least about 80%, e.g., at least about 85%, more preferably at least about 90%, and yet more preferably at least about 95% or even about 99% of the nucleic acid sequence length of said nucleic acid. For example, insofar not exceeding the length of the full-length nucleic acid, a fragment may include a sequence of ≥5 consecutive nucleotides, or ≥10 consecutive nucleotides, or ≥20 consecutive nucleotides, or ≥30 consecutive nucleotides, e.g., ≥40 consecutive nucleotides, such as for example ≥50 consecutive nucleotides, e.g., ≥60, ≥70, ≥80, ≥90, ≥100, ≥200, ≥300, ≥400, ≥500 or ≥600 consecutive nucleotides of the corresponding full-length nucleic acid.


Cells such as immune cells as disclosed herein may in the context of the present specification be said to “comprise the expression” or conversely to “not express” one or more markers, such as one or more genes or gene products; or be described as “positive” or conversely as “negative” for one or more markers, such as one or more genes or gene products; or be said to “comprise” a defined “gene or gene product signature”.


Such terms are commonplace and well-understood by the skilled person when characterizing cell phenotypes. By means of additional guidance, when a cell is said to be positive for or to express or comprise expression of a given marker, such as a given gene or gene product, a skilled person would conclude the presence or evidence of a distinct signal for the marker when carrying out a measurement capable of detecting or quantifying the marker in or on the cell. Suitably, the presence or evidence of the distinct signal for the marker would be concluded based on a comparison of the measurement result obtained for the cell to a result of the same measurement carried out for a negative control (for example, a cell known to not express the marker) and/or a positive control (for example, a cell known to express the marker). Where the measurement method allows for a quantitative assessment of the marker, a positive cell may generate a signal for the marker that is at least 1.5-fold higher than a signal generated for the marker by a negative control cell or than an average signal generated for the marker by a population of negative control cells, e.g., at least 2-fold, at least 4-fold, at least 10-fold, at least 20-fold, at least 30-fold, at least 40-fold, at least 50-fold higher or even higher. Further, a positive cell may generate a signal for the marker that is 3.0 or more standard deviations, e.g., 3.5 or more, 4.0 or more, 4.5 or more, or 5.0 or more standard deviations, higher than an average signal generated for the marker by a population of negative control cells.


The present invention is also directed to signatures and uses thereof. As used herein a “signature” may encompass any gene or genes, protein or proteins, or epigenetic element(s) whose expression profile or whose occurrence is associated with a specific cell type, subtype, or cell state of a specific cell type or subtype within a population of cells (e.g., tumor infiltrating lymphocytes). In certain embodiments, the expression of the signatures (e.g., T cell signature) are dependent on epigenetic modification of the genes or regulatory elements associated with the genes. Thus, in certain embodiments, use of signature genes includes epigenetic modifications that may be detected or modulated. For ease of discussion, when discussing gene expression, any gene or genes, protein or proteins, or epigenetic element(s) may be substituted. Reference to a gene name throughout the specification encompasses the human gene, mouse gene and all other orthologues as known in the art in other organisms. As used herein, the terms “signature”, “expression profile”, or “expression program” may be used interchangeably. It is to be understood that also when referring to proteins (e.g. differentially expressed proteins), such may fall within the definition of “gene” signature. Levels of expression or activity or prevalence may be compared between different cells in order to characterize or identify for instance signatures specific for cell (sub)populations. Increased or decreased expression or activity of signature genes may be compared between different cells in order to characterize or identify for instance specific cell (sub)populations. The detection of a signature in single cells may be used to identify and quantitate for instance specific cell (sub)populations. A signature may include a gene or genes, protein or proteins, or epigenetic element(s) whose expression or occurrence is specific to a cell (sub)population, such that expression or occurrence is exclusive to the cell (sub)population. A gene signature as used herein, may thus refer to any set of up- and down-regulated genes that are representative of a cell type or subtype. A gene signature as used herein, may also refer to any set of up- and down-regulated genes between different cells or cell (sub)populations derived from a gene-expression profile. For example, a gene signature may comprise a list of genes differentially expressed in a distinction of interest (e.g., a pattern of gene expression).


The signature as defined herein (being it a gene signature, protein signature or other genetic or epigenetic signature) can be used to indicate the presence of a cell type, a subtype of the cell type, the state of the microenvironment of a population of cells, a particular cell type population or subpopulation, and/or the overall status of the entire cell (sub)population. Furthermore, the signature may be indicative of cells within a population of cells in vivo. The signature may also be used to suggest for instance particular therapies, or to follow up treatment, or to suggest ways to modulate immune systems. The signatures of the present invention may be discovered by analysis of expression profiles of single-cells within a population of cells from isolated samples (e.g. tumor samples), thus allowing the discovery of novel cell subtypes or cell states that were previously invisible or unrecognized. The presence of subtypes or cell states may be determined by subtype specific or cell state specific signatures. The presence of these specific cell (sub)types or cell states may be determined by applying the signature genes to bulk sequencing data in a sample. Not being bound by a theory the signatures of the present invention may be microenvironment specific, such as their expression in a particular spatio-temporal context. Not being bound by a theory, signatures as discussed herein are specific to a particular pathological context. Not being bound by a theory, a combination of cell subtypes having a particular signature may indicate an outcome. Not being bound by a theory, the signatures can be used to deconvolute the network of cells present in a particular pathological condition. Not being bound by a theory the presence of specific cells and cell subtypes are indicative of a particular response to treatment, such as including increased or decreased susceptibility to treatment. The signature may indicate the presence of one particular cell type.


The signature according to certain embodiments of the present invention may comprise or consist of one or more genes, proteins and/or epigenetic elements, such as for instance 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more. In certain embodiments, the signature may comprise or consist of two or more genes, proteins and/or epigenetic elements, such as for instance 2, 3, 4, 5, 6, 7, 8, 9, 10 or more. In certain embodiments, the signature may comprise or consist of three or more genes, proteins and/or epigenetic elements, such as for instance 3, 4, 5, 6, 7, 8, 9, 10 or more. In certain embodiments, the signature may comprise or consist of four or more genes, proteins and/or epigenetic elements, such as for instance 4, 5, 6, 7, 8, 9, 10 or more. In certain embodiments, the signature may comprise or consist of five or more genes, proteins and/or epigenetic elements, such as for instance 5, 6, 7, 8, 9, 10 or more. In certain embodiments, the signature may comprise or consist of six or more genes, proteins and/or epigenetic elements, such as for instance 6, 7, 8, 9, 10 or more. In certain embodiments, the signature may comprise or consist of seven or more genes, proteins and/or epigenetic elements, such as for instance 7, 8, 9, 10 or more. In certain embodiments, the signature may comprise or consist of eight or more genes, proteins and/or epigenetic elements, such as for instance 8, 9, 10 or more. In certain embodiments, the signature may comprise or consist of nine or more genes, proteins and/or epigenetic elements, such as for instance 9, 10 or more. In certain embodiments, the signature may comprise or consist of ten or more genes, proteins and/or epigenetic elements, such as for instance 10, 11, 12, 13, 14, 15, or more. It is to be understood that a signature according to the invention may for instance also include genes or proteins as well as epigenetic elements combined.


In certain embodiments, a signature is characterized as being specific for a particular immune cell or immune cell (sub)population if it is upregulated or only present, detected or detectable in that particular immune cell or immune cell (sub)population, or alternatively is downregulated or only absent, or undetectable in that particular immune cell or immune cell (sub)population. In this context, a signature consists of one or more differentially expressed genes/proteins or differential epigenetic elements when comparing different cells or cell (sub)populations, including comparing different immune cell or immune cell (sub)populations, as well as comparing immune cell or immune cell (sub)populations with non-immune cell or non-immune cell (sub)populations. It is to be understood that “differentially expressed” genes/proteins include genes/proteins which are up- or down-regulated as well as genes/proteins which are turned on or off. When referring to up- or down-regulation, in certain embodiments, such up- or down-regulation is preferably at least two-fold, such as two-fold, three-fold, four-fold, five-fold, or more, such as for instance at least ten-fold, at least 20-fold, at least 30-fold, at least 40-fold, at least 50-fold, or more. Alternatively, or in addition, differential expression may be determined based on common statistical tests, as is known in the art.


As discussed herein, differentially expressed genes/proteins, or differential epigenetic elements may be differentially expressed on a single cell level, or may be differentially expressed on a cell population level. Preferably, the differentially expressed genes/proteins or epigenetic elements as discussed herein, such as constituting the gene signatures as discussed herein, when as to the cell population or subpopulation level, refer to genes that are differentially expressed in all or substantially all cells of the population or subpopulation (such as at least 80%, preferably at least 90%, such as at least 95% of the individual cells). This allows one to define a particular subpopulation of immune cells. As referred to herein, a “subpopulation” of cells preferably refers to a particular subset of cells of a particular cell type which can be distinguished or are uniquely identifiable and set apart from other cells of this cell type. The cell subpopulation may be phenotypically characterized, and is preferably characterized by the signature as discussed herein. A cell (sub)population as referred to herein may constitute of a (sub)population of cells of a particular cell type characterized by a specific cell state.


When referring to induction, or alternatively suppression of a particular signature, preferable is meant induction or alternatively suppression (or upregulation or downregulation) of at least one gene/protein and/or epigenetic element of the signature, such as for instance at least two, at least three, at least four, at least five, at least six, or all genes/proteins and/or epigenetic elements of the signature.


Various aspects and embodiments of the invention may involve analyzing gene signatures, protein signature, and/or other genetic or epigenetic signature based on single cell analyses (e.g. single cell RNA sequencing) or alternatively based on cell population analyses, as is defined herein elsewhere.


In certain example embodiments, the signature genes may be used to deconvolute the network of cells present in a tumor based on comparing them to data from bulk analysis of a tumor sample. In certain example embodiments, the presence of specific immune cells and immune cell subtypes may be indicative of tumor growth, invasiveness and/or resistance to treatment. In one example embodiment, detection of one or more signature genes may indicate the presence of a particular cell type or cell types. In certain example embodiments, the presence of immune cell types within a tumor may indicate that the tumor will be sensitive to a treatment (e.g., checkpoint blockade therapy). In one embodiment, the signature genes of the present invention are applied to bulk sequencing data from a tumor sample obtained from a subject, such that information relating to disease outcome and personalized treatments is determined. In certain embodiments, the presence of suppressive T cells in a tumor may be determined by deconvolution of bulk tumor sequencing data and the ratio of suppressive T cells compared to clinical outcomes. Not being bound by a theory, a prognosis may be determined based on the immune cell status of a tumor.


Detection, Quantification and Isolation of Suppressive CD8+ T Cells

In one embodiment, the present invention provides for a method comprising detecting or quantifying CD8+ T cells in a biological sample. In preferred embodiments, one or more PD1+CD8+ T cells are detected or quantified in the biological sample. The CD8+ T cells may be detected or quantified using a set of markers comprising: one or more genes or polypeptides selected from the group consisting of TIM3, SERPINE2, HMMR, KIT, TNFRSF4, CD8, CD45, PD1, TNFRSF9, PRF1, BHLHE40 (DEC1), IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, IKZF2 (HELIOS), KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2 and SPRY2. In certain embodiments, the markers include the following combinations: a) TIM3, SERPINE2 and HMMR; or b) SERPINE2 and HMMR; or c) TIM3, KIT and HMMR; or d) TIM3, TNFRSF4 and HMMR; or e) any of (a), (b), (c) or (d) and one or more of CD8, CD45 and PD1; or any of (a), (b), (c), (d) or (e) and one or more of TNFRSF9, PRF1, BHLHE40, IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, IKZF2, KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2 and SPRY2. The T cells may be detected in intact cells by detecting surface markers (e.g., TNFRSF9, IL1R2, SLC2A3, TNFRSF4, KLRC1, IL18R1, TNFRSF18, LAT2, ADAM8, KIT and SERPINE2). The presence of the cells in a sample may also be detected after cells are broken (e.g., lysed, destroyed) or fixed and permeabilized. In an exemplary embodiment, cells are analyzed by single cell RNA sequencing (e.g., scRNA-seq) and the cells are sorted in silico based on gene expression attributed to each single cell. In another exemplary embodiment, fixed and permeabilized cells are analyzed by microscopy (e.g., fluorescent microscopy). In other embodiments, fixed and permeabilized cells may be detected and quantified using FACS. Thus, the specific cells may be detected in a biological sample and cell types quantitated even though the cells have been destroyed. In other embodiments, cells are detected or quantified from a sample without killing the cells, such as by using cell sorting with an affinity reagent specific to a cell surface marker.


In one embodiment, the method comprises isolating CD8+ T cells from a biological sample. In preferred embodiments, one or more PD1+CD8+ T cells are isolated from the biological sample. In certain embodiments, isolating CD8+ T cells from a biological sample results in depletion of the T cells from the biological sample. The CD8+ T cells may be isolated using a set of markers comprising: one or more surface genes or polypeptides selected from the group consisting of TNFRSF9, IL1R2, SLC2A3, TNFRSF4, KLRC1, IL18R1, TNFRSF18, LAT2, ADAM8, KIT and SERPINE2. In certain embodiments, the markers include the following combinations: a) TIM3, SERPINE2 and HMI or b) SERPINE2 and HMI or c) TIM3, KIT and HMMR; or d) TIM3, TNFRSF4 and HMMR; or e) any of (a), (b), (c) or (d) and one or more of CD8, CD45 and PD1; or any of (a), (b), (c), (d) or (e) and one or more of TNFRSF9, IL1R2, SLC2A3, TNFRSF4, KLRC1, IL18R1, TNFRSF18, LAT2, ADAM8, KIT and SERPINE2. In certain embodiments, cells are isolated or depleted from a sample by using an affinity reagent specific to a cell surface marker.


The genes or polypeptides in the group consisting of TNFRSF9, PRF1, BHLHE40, IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, and PGLYRP1 were upregulated in cluster 7 relative to clusters 9 and 10. Thus, the genes may be used to further distinguish between each subtype. Moreover, the overall signatures or subset of the signature genes characteristic of each identified cluster (i.e., CD8+ T cell subtype) may be used to identify each subtype. In further embodiments, surface markers selected from the group of genes may be used to isolate each subtype.


A marker, for example a gene or gene product, for example a peptide, polypeptide, protein, or nucleic acid, or a group of two or more markers, is “detected” or “measured” in a tested object (e.g., in or on a cell, cell population, tissue, organ, or organism, e.g., in a biological sample of a subject) when the presence or absence and/or quantity of said marker or said group of markers is detected or determined in the tested object, preferably substantially to the exclusion of other molecules and analytes, e.g., other genes or gene products.


The terms “increased” or “increase” or “upregulated” or “upregulate” as used herein generally mean an increase by a statically significant amount. For avoidance of doubt, “increased” means a statistically significant increase of at least 10% as compared to a reference level, including an increase of at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 100% or more, including, for example at least 2-fold, at least 3-fold, at least 4-fold, at least 5-fold, at least 10-fold increase or greater as compared to a reference level, as that term is defined herein.


The term “reduced” or “reduce” or “decrease” or “decreased” or “downregulate” or “downregulated” as used herein generally means a decrease by a statistically significant amount relative to a reference. For avoidance of doubt, “reduced” means statistically significant decrease of at least 10% as compared to a reference level, for example a decrease by at least 20%, at least 30%, at least 40%, at least 50%, or at least 60%, or at least 70%, or at least 80%, at least 90% or more, up to and including a 100% decrease (i.e., absent level as compared to a reference sample), or any decrease between 10-100% as compared to a reference level, as that.


In certain embodiments, the biological sample may be a tumor sample obtained from a subject in need thereof and the CD8+ T cells may be CD8+ tumor infiltrating lymphocytes (TIL). Not being bound by a theory, the TILs may comprise suppressive T cells. In certain embodiments, the biological sample may be a sample obtained from a subject suffering from an autoimmune disease. In certain embodiments, T cells may be isolated from the biological sample.


In certain embodiments, the biological sample may comprise ex vivo or in vitro CD8+ T cells. The ex vivo or in vitro biological sample may be treated with an antigen. The ex vivo or in vitro biological sample may be treated with a differentiation agent. The differentiating agent may be a cytokine. The ex vivo or in vitro biological sample may be treated with a test agent. Not being bound by a theory, the ex vivo or in vitro biological sample may be differentiated to comprise certain types of T cells (e.g., suppressive or effector T cells). The test agent may be any agent predicted to affect the function or gene expression of any of the cells described herein. The agent may affect the ratio of cells in a population of cells (i.e., in the ex vivo or in vitro biological sample). For example, T cells may be differentiated to the T cells of the present invention. Not being bound by a theory, suppressive T cells differentiated ex vivo or in vitro may be used to treat a subject suffering from an autoimmune disease. Not being bound by a theory, suppressive T cells differentiated into effector T cells ex vivo or in vitro may be used to treat a subject suffering from cancer. The test agent may be a drug candidate. The drug candidate may be used to differentiate or modulate T cell balance in vivo. In certain embodiments, the biological sample is assayed to determine the quantity or changes in composition of T cells in the sample after treatment.


The terms “sample” or “biological sample” as used throughout this specification include any biological specimen obtained from a subject. Particularly useful samples are those known to comprise, or expected or predicted to comprise immune cells as taught herein. Preferably, a sample may be readily obtainable by minimally invasive methods, such as blood collection or tissue biopsy, allowing the removal/isolation/provision of the sample from the subject. Examples of particularly useful samples include without limitation whole blood or a cell-containing fraction of whole blood, such as serum, white blood cells, or peripheral blood mononuclear cells (PBMC), lymph, lymphatic tissue, inflammation fluid, tissue specimens, or tissue biopsies. The term “tissue” as used throughout this specification refers to any animal tissue types including, but not limited to, bone, bone marrow, neural tissue, fibrous connective tissue, cartilage, muscle, vasculature, skin, adipose tissue, blood and glandular tissue or other non-bone tissue. The tissue may be healthy or affected by pathological alterations, e.g., tumor tissue or tissue affected by a disease comprising an immune component. The tissue may be from a living subject or may be cadaveric tissue. The tissue may be autologous tissue or syngeneic tissue or may be allograft or xenograft tissue. A biological sample may also include cells grown in tissue culture, such as cells used for screening drugs or primary cells grown in culture for expansion.


The terms “quantity”, “amount” and “level” are synonymous and generally well-understood in the art. The terms as used throughout this specification may particularly refer to an absolute quantification of a marker in a tested object (e.g., in or on a cell, cell population, tissue, organ, or organism, e.g., in a biological sample of a subject), or to a relative quantification of a marker in a tested object, i.e., relative to another value such as relative to a reference value, or to a range of values indicating a base-line of the marker. Such values or ranges may be obtained as conventionally known.


An absolute quantity of a marker may be advantageously expressed as weight or as molar amount, or more commonly as a concentration, e.g., weight per volume or mol per volume. A relative quantity of a marker may be advantageously expressed as an increase or decrease or as a fold-increase or fold-decrease relative to said another value, such as relative to a reference value. Performing a relative comparison between first and second variables (e.g., first and second quantities) may but need not require determining first the absolute values of said first and second variables. For example, a measurement method may produce quantifiable readouts (such as, e.g., signal intensities) for said first and second variables, wherein said readouts are a function of the value of said variables, and wherein said readouts may be directly compared to produce a relative value for the first variable vs. the second variable, without the actual need to first convert the readouts to absolute values of the respective variables.


Reference values may be established according to known procedures previously employed for other cell populations, biomarkers and gene or gene product signatures. For example, a reference value may be established in an individual or a population of individuals characterized by a particular diagnosis, prediction and/or prognosis of said disease or condition (i.e., for whom said diagnosis, prediction and/or prognosis of the disease or condition holds true). Such population may comprise without limitation 2 or more, 10 or more, 100 or more, or even several hundred or more individuals.


A “deviation” of a first value from a second value may generally encompass any direction (e.g., increase: first value >second value; or decrease: first value <second value) and any extent of alteration.


For example, a deviation may encompass a decrease in a first value by, without limitation, at least about 10% (about 0.9-fold or less), or by at least about 20% (about 0.8-fold or less), or by at least about 30% (about 0.7-fold or less), or by at least about 40% (about 0.6-fold or less), or by at least about 50% (about 0.5-fold or less), or by at least about 60% (about 0.4-fold or less), or by at least about 70% (about 0.3-fold or less), or by at least about 80% (about 0.2-fold or less), or by at least about 90% (about 0.1-fold or less), relative to a second value with which a comparison is being made.


For example, a deviation may encompass an increase of a first value by, without limitation, at least about 10% (about 1.1-fold or more), or by at least about 20% (about 1.2-fold or more), or by at least about 30% (about 1.3-fold or more), or by at least about 40% (about 1.4-fold or more), or by at least about 50% (about 1.5-fold or more), or by at least about 60% (about 1.6-fold or more), or by at least about 70% (about 1.7-fold or more), or by at least about 80% (about 1.8-fold or more), or by at least about 90% (about 1.9-fold or more), or by at least about 100% (about 2-fold or more), or by at least about 150% (about 2.5-fold or more), or by at least about 200% (about 3-fold or more), or by at least about 500% (about 6-fold or more), or by at least about 700% (about 8-fold or more), or like, relative to a second value with which a comparison is being made.


Preferably, a deviation may refer to a statistically significant observed alteration. For example, a deviation may refer to an observed alteration which falls outside of error margins of reference values in a given population (as expressed, for example, by standard deviation or standard error, or by a predetermined multiple thereof, e.g., ±1×SD or ±2×SD or ±3×SD, or ±1×SE or ±2×SE or ±3×SE). Deviation may also refer to a value falling outside of a reference range defined by values in a given population (for example, outside of a range which comprises ≥40%, ≥50%, ≥60%, ≥70%, ≥75% or ≥80% or ≥85% or ≥90% or ≥95% or even ≥100% of values in said population).


In a further embodiment, a deviation may be concluded if an observed alteration is beyond a given threshold or cut-off. Such threshold or cut-off may be selected as generally known in the art to provide for a chosen sensitivity and/or specificity of the prediction methods, e.g., sensitivity and/or specificity of at least 50%, or at least 60%, or at least 70%, or at least 80%, or at least 85%, or at least 90%, or at least 95%.


For example, receiver-operating characteristic (ROC) curve analysis can be used to select an optimal cut-off value of the quantity of a given immune cell population, biomarker or gene or gene product signatures, for clinical use of the present diagnostic tests, based on acceptable sensitivity and specificity, or related performance measures which are well-known per se, such as positive predictive value (PPV), negative predictive value (NPV), positive likelihood ratio (LR+), negative likelihood ratio (LR−), Youden index, or similar.


The terms “isolating” or “purifying” as used throughout this specification with reference to a particular component of a composition or mixture (e.g., the tested object such as the biological sample) encompass processes or techniques whereby such component is separated from one or more or (substantially) all other components of the composition or mixture (e.g., the tested object such as the biological sample). The terms do not require absolute purity. Instead, isolating or purifying the component will produce a discrete environment in which the abundance of the component relative to one or more or all other components is greater than in the starting composition or mixture (e.g., the tested object such as the biological sample). A discrete environment may denote a single medium, such as for example a single solution, dispersion, gel, precipitate, etc. Isolating or purifying the specified immune cells from the tested object such as the biological sample may increase the abundance of the specified immune cells relative to all other cells comprised in the tested object such as the biological sample, or relative to other cells of a select subset of the cells comprised in the tested object such as the biological sample, e.g., relative to other white blood cells, peripheral blood mononuclear cells, immune cells, antigen presenting cells, or dendritic cells comprised in the tested object such as the biological sample. By means of example, isolating or purifying the specified immune cells from the tested object such as the biological sample may yield a cell population, in which the specified immune cells constitute at least 40% (by number) of all cells of said cell population, for example, at least 45%, preferably at least 50%, at least 55%, more preferably at least 60%, at least 65%, still more preferably at least 70%, at least 75%, even more preferably at least 80%, at least 85%, and yet more preferably at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or even 100% of all cells of said cell population.


Any existing, available or conventional separation, detection and/or quantification methods may be used to measure the presence or absence (e.g., readout being present vs. absent; or detectable amount vs. undetectable amount) and/or quantity (e.g., readout being an absolute or relative quantity) of the specified immune cells in, or to isolate the specified immune cells from, a tested object (e.g., a cell population, tissue, organ, organism, or a biological sample of a subject). Such methods allow to detect, quantify or isolate the specified immune cells in or from the tested object (e.g., a cell population, tissue, organ, organism, or a biological sample of a subject) substantially to the exclusion of other cells comprised in the tested object. Such methods may allow to detect, quantify or isolate the specified immune cells with sensitivity of at least 50%, at least 55%, at least 60%, at least 65%, preferably at least 70%, at least 75%, more preferably at least 80%, at least 85%, even more preferably at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or even 100%, and/or with specificity of at least 50%, at least 55%, at least 60%, at least 65%, preferably at least 70%, at least 75%, more preferably at least 80%, at least 85%, even more preferably at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or even 100%. By means of example, at least 40% (by number), for example at least 45%, preferably at least 50%, at least 55%, more preferably at least 60%, at least 65%, still more preferably at least 70%, at least 75%, even more preferably at least 80%, at least 85%, and yet more preferably at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or even 100% of all cells detected, quantified or isolated by such methods may correspond to the specified immune cells.


Isolated Cells

In another aspect, the present invention provides for isolated CD8+ T cells as described herein. The isolated CD8+ T cell subtypes may be isolated using any of the markers described herein. The isolated CD8+ T cell subtypes may be isolated from a human subject. The isolated CD8+ T cell may be isolated from an ex vivo sample (e.g., CAR T cell, autologous T cells or allogenic T cells grown in culture). In preferred embodiments, the isolated CD8+ T cell may be obtained from a subject suffering from a disease (e.g., cancer, an autoimmune disease, or chronic infection).


In one aspect, the invention is directed to isolated cell populations (e.g., T cells) comprising the T cells described herein and/or as identified by the signatures defined herein. Accordingly, methods for detecting, quantifying or isolating the specified immune cells may be marker-based or gene or gene product signature-based, i.e., may involve isolation of cells expressing or not expressing marker(s) or combination(s) of markers the expression or lack of expression of which is taught herein as typifying or characterizing the specified immune cells, or may involve detection, quantification or isolation of cells comprising gene or gene product signature(s) taught herein as typifying or characterizing the specified immune cells.


In another aspect, the present invention provides for a population of CD8+ T cells comprising CD8+ T cells as defined in any embodiment herein or isolated according to a method of any embodiment herein. The isolated population may comprise greater than 30%, 40%, 50%, 60%, 70%, 80%, 90% or 95% of a CD8+ T cell as defined in any embodiment herein. In certain embodiments, the population of cells is less than 30% of any one cell type, such as when cells are directly isolated from a patient. Not being bound by a theory, a population of cells isolated from a patient will include a heterogeneous population of cells, such that specific cell subtypes make up less than a majority of the total cells (e.g., less than 30%, 20%, 10%, 5%, or 1%). In certain embodiments, a subtype of cells is expanded or enriched ex vivo to obtain a non-naturally occurring cell population enriched for certain cell types. In certain embodiments, T cells according to the present invention are depleted from a population of cells. The isolated population may comprise less than 5%, 1%, 0.1%, 0.01%, or 0.001%, or comprise 0% of a suppressive CD8+ T cell as defined in any embodiment herein. The population of cells depleted for the T cells may be further expanded. Not being bound by a theory suppressive T cells may be depleted from a population of T cells and upon expanding a population enriched for effector T cells may be obtained. Not being bound by a theory an expanded population of T cells may be obtained that does not include suppressive T cells. In certain embodiments, the population of T cells may express a chimeric antigen receptor targeting tumor cell antigens. Not being bound by a theory suppressive T cells may be depleted from a population of CAR T cells.


The isolated immune cells or immune cell populations as disclosed throughout this specification may be suitably cultured or cultivated in vitro. The terms “culturing” or “cell culture” are common in the art and broadly refer to maintenance of cells and potentially expansion (proliferation, propagation) of cells in vitro. Typically, animal cells, such as mammalian cells, such as human cells, are cultured by exposing them to (i.e., contacting them with) a suitable cell culture medium in a vessel or container adequate for the purpose (e.g., a 96-, 24-, or 6-well plate, a T-25, T-75, T-150 or T-225 flask, or a cell factory), at art-known conditions conducive to in vitro cell culture, such as temperature of 37° C., 5% v/v CO2 and >95% humidity.


The term “medium” as used herein broadly encompasses any cell culture medium conducive to maintenance of cells, preferably conducive to proliferation of cells. Typically, the medium will be a liquid culture medium, which facilitates easy manipulation (e.g., decantation, pipetting, centrifugation, filtration, and such) thereof.


Typically, the medium will comprise a basal medium formulation as known in the art. Many basal media formulations (available, e.g., from the American Type Culture Collection, ATCC; or from Invitrogen, Carlsbad, Calif.) can be used, including but not limited to Eagle's Minimum Essential Medium (MEM), Dulbecco's Modified Eagle's Medium (DMEM), alpha modified Minimum Essential Medium (alpha-MEM), Basal Medium Essential (BME), Iscove's Modified Dulbecco's Medium (IMDM), BGJb medium, F-12 Nutrient Mixture (Ham), Liebovitz L-15, DMEM/F-12, Essential Modified Eagle's Medium (EMEM), RPMI-1640, Medium 199, Waymouth's MB 752/1 or Williams Medium E, and modifications and/or combinations thereof. Compositions of basal media are generally known in the art and it is within the skill of one in the art to modify or modulate concentrations of media and/or media supplements as necessary for the cells cultured.


Such basal media formulations contain ingredients necessary for mammalian cell development, which are known per se. By means of illustration and not limitation, these ingredients may include inorganic salts (in particular salts containing Na, K, Mg, Ca, Cl, P and possibly Cu, Fe, Se and Zn), physiological buffers (e.g., HEPES, bicarbonate), nucleotides, nucleosides and/or nucleic acid bases, ribose, deoxyribose, amino acids, vitamins, antioxidants (e.g., glutathione) and sources of carbon (e.g., glucose, sodium pyruvate, sodium acetate), etc.


For use in culture, basal media can be supplied with one or more further components. For example, additional supplements can be used to supply the cells with the necessary trace elements and substances for optimal growth and expansion. Furthermore, antioxidant supplements may be added, e.g., β-mercaptoethanol. While many basal media already contain amino acids, some amino acids may be supplemented later, e.g., L-glutamine, which is known to be less stable when in solution. A medium may be further supplied with antibiotic and/or antimycotic compounds, such as, typically, mixtures of penicillin and streptomycin, and/or other compounds, exemplified but not limited to, amphotericin, ampicillin, gentamicin, bleomycin, hygromycin, kanamycin, mitomycin, mycophenolic acid, nalidixic acid, neomycin, nystatin, paromomycin, polymyxin, puromycin, rifampicin, spectinomycin, tetracycline, tylosin, and zeocin.


Lipids and lipid carriers can also be used to supplement cell culture media. Such lipids and carriers can include, but are not limited to cyclodextrin, cholesterol, linoleic acid conjugated to albumin, linoleic acid and oleic acid conjugated to albumin, unconjugated linoleic acid, linoleic-oleic-arachidonic acid conjugated to albumin, oleic acid unconjugated and conjugated to albumin, among others. Albumin can similarly be used in fatty-acid free formulations.


Also contemplated is supplementation of cell culture media with mammalian plasma or sera. Plasma or sera often contain cellular factors and components that facilitate cell viability and expansion. Optionally, plasma or serum may be heat inactivated. Heat inactivation is used in the art mainly to remove the complement. Heat inactivation typically involves incubating the plasma or serum at 56° C. for 30 to 60 min, e.g., 30 min, with steady mixing, after which the plasma or serum is allowed to gradually cool to ambient temperature. A skilled person will be aware of any common modifications and requirements of the above procedure. Optionally, plasma or serum may be sterilized prior to storage or use. Usual means of sterilization may involve, e.g., filtration through one or more filters with pore size smaller than 1 μm, preferably smaller than 0.5 μm, e.g., smaller than 0.45 μm, 0.40 μm, 0.35 μm, 0.30 μm or 0.25 μm, more preferably 0.2 μm or smaller, e.g., 0.15 μm or smaller, 0.10 μm or smaller. Suitable sera or plasmas for use in media as taught herein may include human serum or plasma, or serum or plasma from non-human animals, preferably non-human mammals, such as, e.g., non-human primates (e.g., lemurs, monkeys, apes), fetal or adult bovine, horse, porcine, lamb, goat, dog, rabbit, mouse or rat serum or plasma, etc., or any combination of such. In certain preferred embodiments, a medium as taught herein may comprise bovine serum or plasma, preferably fetal bovine (calf) serum or plasma, more preferably fetal bovine (calf) serum (FCS or FBS). When culturing human cells, media may preferably comprise human serum or plasma, such as autologous or allogeneic human serum or plasma, preferably human serum, such as autologous or allogeneic human serum, more preferably autologous human serum or plasma, even more preferably autologous human serum.


In certain preferred embodiments, serum or plasma can be substituted in media by serum replacements, such as to provide for serum-free media (i.e., chemically defined media). The provision of serum-free media may be advantageous particularly with view to administration of the media or fraction(s) thereof to subjects, especially to human subjects (e.g., improved bio-safety). By the term “serum replacement” it is broadly meant any a composition that may be used to replace the functions (e.g., cell maintenance and growth supportive function) of animal serum in a cell culture medium. A conventional serum replacement may typically comprise vitamins, albumin, lipids, amino acids, transferrin, antioxidants, insulin and trace elements. Many commercialized serum replacement additives, such as KnockOut Serum Replacement (KOSR), N2, B27, Insulin-Transferrin-Selenium Supplement (ITS), and G5 are well known and are readily available to those skilled in the art.


Plasma or serum or serum replacement may be comprised in media as taught herein at a proportion (volume of plasma or serum or serum replacement/volume of medium) between about 0.5% v/v and about 40.0% v/v, preferably between about 5.0% v/v and about 20.0% v/v, e.g., between about 5.0% v/v and about 15.0% v/v, more preferably between about 8.0% v/v and about 12.0% v/v, e.g., about 10.0% v/v.


Methods of Detection and Isolation of Suppressive CD8+ T Cells Using Biomarkers

In certain embodiments, the CD8+ T cell subtypes may be detected, quantified or isolated using a technique selected from the group consisting of flow cytometry, mass cytometry, fluorescence activated cell sorting (FACS), fluorescence microscopy, affinity separation, magnetic cell separation, microfluidic separation, RNA-seq (e.g., bulk or single cell), quantitative PCR, MERFISH (multiplex (in situ) RNA FISH) and combinations thereof. The technique may employ one or more agents capable of specifically binding to one or more gene products expressed or not expressed by the CD8+ T cells, preferably on the cell surface of the CD8+ T cells. The one or more agents may be one or more antibodies. Other methods including absorbance assays and colorimetric assays are known in the art and may be used herein.


Depending on factors that can be evaluated and decided on by a skilled person, such as, inter alia, the type of a marker (e.g., peptide, polypeptide, protein, or nucleic acid), the type of the tested object (e.g., a cell, cell population, tissue, organ, or organism, e.g., the type of biological sample of a subject, e.g., whole blood, plasma, serum, tissue biopsy), the expected abundance of the marker in the tested object, the type, robustness, sensitivity and/or specificity of the detection method used to detect the marker, etc., the marker may be measured directly in the tested object, or the tested object may be subjected to one or more processing steps aimed at achieving an adequate measurement of the marker.


In other example embodiments, detection of a marker may include immunological assay methods, wherein the ability of an assay to separate, detect and/or quantify a marker (such as, preferably, peptide, polypeptide, or protein) is conferred by specific binding between a separable, detectable and/or quantifiable immunological binding agent (antibody) and the marker. Immunological assay methods include without limitation immunohistochemistry, immunocytochemistry, flow cytometry, mass cytometry, fluorescence activated cell sorting (FACS), fluorescence microscopy, fluorescence based cell sorting using microfluidic systems, immunoaffinity adsorption based techniques such as affinity chromatography, magnetic particle separation, magnetic activated cell sorting or bead based cell sorting using microfluidic systems, enzyme-linked immunosorbent assay (ELISA) and ELISPOT based techniques, radioimmunoassay (MA), Western blot, etc.


In certain example embodiments, detection of a marker or signature may include biochemical assay methods, including inter alia assays of enzymatic activity, membrane channel activity, substance-binding activity, gene regulatory activity, or cell signaling activity of a marker, e.g., peptide, polypeptide, protein, or nucleic acid.


In other example embodiments, detection of a marker may include mass spectrometry analysis methods. Generally, any mass spectrometric (MS) techniques that are capable of obtaining precise information on the mass of peptides, and preferably also on fragmentation and/or (partial) amino acid sequence of selected peptides (e.g., in tandem mass spectrometry, MS/MS; or in post source decay, TOF MS), may be useful herein for separation, detection and/or quantification of markers (such as, preferably, peptides, polypeptides, or proteins). Suitable peptide MS and MS/MS techniques and systems are well-known per se (see, e.g., Methods in Molecular Biology, vol. 146: “Mass Spectrometry of Proteins and Peptides”, by Chapman, ed., Humana Press 2000, ISBN 089603609x; Biemann 1990. Methods Enzymol 193: 455-79; or Methods in Enzymology, vol. 402: “Biological Mass Spectrometry”, by Burlingame, ed., Academic Press 2005, ISBN 9780121828073) and may be used herein. MS arrangements, instruments and systems suitable for biomarker peptide analysis may include, without limitation, matrix-assisted laser desorption/ionisation time-of-flight (MALDI-TOF) MS; MALDI-TOF post-source-decay (PSD); MALDI-TOF/TOF; surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF) MS; electrospray ionization mass spectrometry (ESI-MS); ESI-MS/MS; ESI-MS/(MS)n (n is an integer greater than zero); ESI 3D or linear (2D) ion trap MS; ESI triple quadrupole MS; ESI quadrupole orthogonal TOF (Q-TOF); ESI Fourier transform MS systems; desorption/ionization on silicon (DIOS); secondary ion mass spectrometry (SIMS); atmospheric pressure chemical ionization mass spectrometry (APCI-MS); APCI-MS/MS; APCI-(MS)n; atmospheric pressure photoionization mass spectrometry (APPI-MS); APPI-MS/MS; and APPI-(MS)n. Peptide ion fragmentation in tandem MS (MS/MS) arrangements may be achieved using manners established in the art, such as, e.g., collision induced dissociation (CID). Detection and quantification of markers by mass spectrometry may involve multiple reaction monitoring (MRM), such as described among others by Kuhn et al. 2004 (Proteomics 4: 1175-86). MS peptide analysis methods may be advantageously combined with upstream peptide or protein separation or fractionation methods, such as for example with the chromatographic and other methods.


In other example embodiments, detection of a marker may include chromatography methods. In a one example embodiment, chromatography refers to a process in which a mixture of substances (analytes) carried by a moving stream of liquid or gas (“mobile phase”) is separated into components as a result of differential distribution of the analytes, as they flow around or over a stationary liquid or solid phase (“stationary phase”), between said mobile phase and said stationary phase. The stationary phase may be usually a finely divided solid, a sheet of filter material, or a thin film of a liquid on the surface of a solid, or the like. Chromatography may be columnar. While particulars of chromatography are well known in the art, for further guidance see, e.g., Meyer M., 1998, ISBN: 047198373X, and “Practical HPLC Methodology and Applications”, Bidlingmeyer, B. A., John Wiley & Sons Inc., 1993. Exemplary types of chromatography include, without limitation, high-performance liquid chromatography (HPLC), normal phase HPLC (NP-HPLC), reversed phase HPLC (RP-HPLC), ion exchange chromatography (IEC), such as cation or anion exchange chromatography, hydrophilic interaction chromatography (HILIC), hydrophobic interaction chromatography (HIC), size exclusion chromatography (SEC) including gel filtration chromatography or gel permeation chromatography, chromatofocusing, affinity chromatography such as immunoaffinity, immobilized metal affinity chromatography, and the like.


In certain embodiments, further techniques for separating, detecting and/or quantifying markers may be used in conjunction with any of the above described detection methods. Such methods include, without limitation, chemical extraction partitioning, isoelectric focusing (IEF) including capillary isoelectric focusing (CLEF), capillary isotachophoresis (CITP), capillary electrochromatography (CEC), and the like, one-dimensional polyacrylamide gel electrophoresis (PAGE), two-dimensional polyacrylamide gel electrophoresis (2D-PAGE), capillary gel electrophoresis (CGE), capillary zone electrophoresis (CZE), micellar electrokinetic chromatography (MEKC), free flow electrophoresis (FFE), etc.


In certain examples, such methods may include separating, detecting and/or quantifying markers at the nucleic acid level, more particularly RNA level, e.g., at the level of hnRNA, pre-mRNA, mRNA, or cDNA. Standard quantitative RNA or cDNA measurement tools known in the art may be used. Non-limiting examples include hybridization-based analysis, microarray expression analysis, digital gene expression profiling (DGE), RNA-in-situ hybridization (RISH), Northern-blot analysis and the like; PCR, RT-PCR, RT-qPCR, end-point PCR, digital PCR or the like; supported oligonucleotide detection, pyrosequencing, polony cyclic sequencing by synthesis, simultaneous bi-directional sequencing, single-molecule sequencing, single molecule real time sequencing, true single molecule sequencing, hybridization-assisted nanopore sequencing, sequencing by synthesis, single-cell RNA sequencing (sc-RNA seq), or the like.


In certain embodiments, the invention involves single cell RNA sequencing (see, e.g., Kalisky, T., Blainey, P. & Quake, S. R. Genomic Analysis at the Single-Cell Level. Annual review of genetics 45, 431-445, (2011); Kalisky, T. & Quake, S. R. Single-cell genomics. Nature Methods 8, 311-314 (2011); Islam, S. et al. Characterization of the single-cell transcriptional landscape by highly multiplex RNA-seq. Genome Research, (2011); Tang, F. et al. RNA-Seq analysis to capture the transcriptome landscape of a single cell. Nature Protocols 5, 516-535, (2010); Tang, F. et al. mRNA-Seq whole-transcriptome analysis of a single cell. Nature Methods 6, 377-382, (2009); Ramskold, D. et al. Full-length mRNA-Seq from single-cell levels of RNA and individual circulating tumor cells. Nature Biotechnology 30, 777-782, (2012); and Hashimshony, T., Wagner, F., Sher, N. & Yanai, I. CEL-Seq: Single-Cell RNA-Seq by Multiplexed Linear Amplification. Cell Reports, Cell Reports, Volume 2, Issue 3, p 666-673, 2012).


In certain embodiments, the invention involves plate based single cell RNA sequencing (see, e.g., Picelli, S. et al., 2014, “Full-length RNA-seq from single cells using Smart-seq2” Nature protocols 9, 171-181, doi:10.1038/nprot.2014.006).


In certain embodiments, the invention involves high-throughput single-cell RNA-seq. In this regard reference is made to Macosko et al., 2015, “Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets” Cell 161, 1202-1214; International patent application number PCT/US2015/049178, published as WO2016/040476 on Mar. 17, 2016; Klein et al., 2015, “Droplet Barcoding for Single-Cell Transcriptomics Applied to Embryonic Stem Cells” Cell 161, 1187-1201; International patent application number PCT/US2016/027734, published as WO2016168584A1 on Oct. 20, 2016; Zheng, et al., 2016, “Haplotyping germline and cancer genomes with high-throughput linked-read sequencing” Nature Biotechnology 34, 303-311; Zheng, et al., 2017, “Massively parallel digital transcriptional profiling of single cells” Nat. Commun. 8, 14049 doi: 10.1038/ncomms14049; International patent publication number WO2014210353A2; Zilionis, et al., 2017, “Single-cell barcoding and sequencing using droplet microfluidics” Nat Protoc. January; 12(1):44-73; Cao et al., 2017, “Comprehensive single cell transcriptional profiling of a multicellular organism by combinatorial indexing” bioRxiv preprint first posted online Feb. 2, 2017, doi: dx.doi.org/10.1101/104844; Rosenberg et al., 2017, “Scaling single cell transcriptomics through split pool barcoding” bioRxiv preprint first posted online Feb. 2, 2017, doi: dx.doi.org/10.1101/105163; Vitak, et al., “Sequencing thousands of single-cell genomes with combinatorial indexing” Nature Methods, 14(3):302-308, 2017; Cao, et al., Comprehensive single-cell transcriptional profiling of a multicellular organism. Science, 357(6352):661-667, 2017; and Gierahn et al., “Seq-Well: portable, low-cost RNA sequencing of single cells at high throughput” Nature Methods 14, 395-398 (2017), all the contents and disclosure of each of which are herein incorporated by reference in their entirety.


In certain embodiments, the invention involves single nucleus RNA sequencing. In this regard reference is made to Swiech et al., 2014, “In vivo interrogation of gene function in the mammalian brain using CRISPR-Cas9” Nature Biotechnology Vol. 33, pp. 102-106; Habib et al., 2016, “Div-Seq: Single-nucleus RNA-Seq reveals dynamics of rare adult newborn neurons” Science, Vol. 353, Issue 6302, pp. 925-928; Habib et al., 2017, “Massively parallel single-nucleus RNA-seq with DroNc-seq” Nat Methods. 2017 October; 14(10):955-958; and International patent application number PCT/US2016/059239, published as WO2017164936 on Sep. 28, 2017, which are herein incorporated by reference in their entirety.


In one embodiment, immune cells are stained for immune cell subtype specific signature genes. In one embodiment, the cells are fixed. In another embodiment, the cells are formalin fixed and paraffin embedded. In another example embodiment, the immune cell subtypes may be quantitated in a section of a tumor.


The method may allow to detect or conclude the presence or absence of the specified immune cells in a tested object (e.g., in a cell population, tissue, organ, organism, or in a biological sample of a subject). The method may also allow to quantify the specified immune cells in a tested object (e.g., in a cell population, tissue, organ, organism, or in a biological sample of a subject). The quantity of the specified immune cells in the tested object such as the biological sample may be suitably expressed for example as the number (count) of the specified immune cells per standard unit of volume (e.g., ml, μl or nl) or weight (e.g., g or mg or ng) of the tested object such as the biological sample. The quantity of the specified immune cells in the tested object such as the biological sample may also be suitably expressed as a percentage or fraction (by number) of all cells comprised in the tested object such as the biological sample, or as a percentage or fraction (by number) of a select subset of the cells comprised in the tested object such as the biological sample, e.g., as a percentage or fraction (by number) of white blood cells, peripheral blood mononuclear cells, immune cells, antigen presenting cells, or dendritic cells comprised in the tested object such as the biological sample. The quantity of the specified immune cells in the tested object such as the biological sample may also be suitably represented by an absolute or relative quantity of a suitable surrogate analyte, such as a peptide, polypeptide, protein, or nucleic acid expressed or comprised by the specified immune cells.


Where a marker is detected in or on a cell, the cell may be conventionally denoted as positive (+) or negative (−) for the marker. Semi-quantitative denotations of marker expression in cells are also commonplace in the art, such as particularly in flow cytometry quantifications, for example, “dim” vs. “bright”, or “low” vs. “medium”/“intermediate” vs. “high”, or “−” vs. “+” vs. “++”, commonly controlled in flow cytometry quantifications by setting of the gates. Where a marker is quantified in or on a cell, absolute quantity of the marker may also be expressed for example as the number of molecules of the marker comprised by the cell.


Where a marker is detected and/or quantified on a single cell level in a cell population, the quantity of the marker may also be expressed as a percentage or fraction (by number) of cells comprised in said population that are positive for said marker, or as percentages or fractions (by number) of cells comprised in said population that are “dim” or “bright”, or that are “low” or “medium”/“intermediate” or “high”, or that are “−” or “+” or “++”. By means of an example, a sizeable proportion of the tested cells of the cell population may be positive for the marker, e.g., at least about 20%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 95%, or up to 100%.


In certain embodiments, methods for detecting, quantifying or isolating the specified immune cells may be single-cell-based, i.e., may allow to discretely detect, quantify or isolate the specified immune cells as individual cells. In other embodiments, methods for detecting, quantifying or isolating the specified immune cells may be cell population-based, i.e., may only allow to detect, quantify or isolate the specified immune cells as a group or collection of cells, without providing information on or allowing to isolate individual cells.


Methods for detecting, quantifying or isolating the specified immune cells may employ any of the above-described techniques for measuring markers, insofar the separation or the qualitative and/or quantitative measurement of the marker(s) can be correlated with or translated into detection, quantification or isolation of the specified immune cells. For example, any of the above-described biochemical assay methods, immunological assay methods, mass spectrometry analysis methods, chromatography methods, or nucleic acid analysis method, or combinations thereof for measuring markers, may be employed for detecting, quantifying or isolating the specified immune cells.


In certain embodiments, the cells are detected, quantified or isolated using a technique selected from the group consisting of flow cytometry, fluorescence activated cell sorting, mass cytometry, fluorescence microscopy, affinity separation, magnetic cell separation, microfluidic separation, and combinations thereof.


Flow cytometry encompasses methods by which individual cells of a cell population are analyzed by their optical properties (e.g., light absorbance, light scattering and fluorescence properties, etc.) as they pass in a narrow stream in single file through a laser beam. Flow cytometry methods include fluorescence activated cell sorting (FACS) methods by which a population of cells having particular optical properties are separated from other cells.


Elemental mass spectrometry-based flow cytometry, or mass cytometry, offers an approach to analyze cells by replacing fluorochrome-labelled binding reagents with mass tagged binding reagents, i.e., tagged with an element or isotope having a defined mass. In these methods, labeled particles are introduced into a mass cytometer, where they are individually atomized and ionized. The individual particles are then subjected to elemental analysis, which identifies and measures the abundance of the mass tags used. The identities and the amounts of the isotopic elements associated with each particle are then stored and analyzed. Due to the resolution of elemental analysis and the number of elemental isotopes that can be used, it is possible to simultaneously measure up to 100 or more parameters on a single particle.


Fluorescence microscopy broadly encompasses methods by which individual cells of a cell population are microscopically analyzed by their fluorescence properties. Fluorescence microscopy approaches may be manual or preferably automated.


Affinity separation also referred to as affinity chromatography broadly encompasses techniques involving specific interactions of cells present in a mobile phase, such as a suitable liquid phase (e.g., cell population in an aqueous suspension) with, and thereby adsorption of the cells to, a stationary phase, such as a suitable solid phase; followed by separation of the stationary phase from the remainder of the mobile phase; and recovery (e.g., elution) of the adsorbed cells from the stationary phase. Affinity separation may be columnar, or alternatively, may entail batch treatment, wherein the stationary phase is collected/separated from the liquid phases by suitable techniques, such as centrifugation or application of magnetic field (e.g., where the stationary phase comprises magnetic substrate, such as magnetic particles or beads). Accordingly, magnetic cell separation is also envisaged herein.


Microfluidic systems allow for accurate and high throughput cell detection, quantification and/or sorting, exploiting a variety of physical principles. Cell sorting on microchips provides numerous advantages by reducing the size of necessary equipment, eliminating potentially biohazardous aerosols, and simplifying the complex protocols commonly associated with cell sorting. The term “microfluidic system” as used throughout this specification broadly refers to systems having one or more fluid microchannels. Microchannels denote fluid channels having cross-sectional dimensions the largest of which are typically less than 1 mm, preferably less than 500 μm, more preferably less than 400 μm, more preferably less than 300 μm, more preferably less than 200 μm, e.g., 100 μm or smaller. Such microfluidic systems can be used for manipulating fluid and/or objects such as droplets, bubbles, capsules, particles, cells and the like. Microfluidic systems may allow for example for fluorescent label-based (e.g., employing fluorophore-conjugated binding agent(s), such as fluorophore-conjugated antibody(ies)), bead-based (e.g., bead-conjugated binding agent(s), such as bead-conjugated antibody(ies)), or label-free cell sorting (reviewed in Shields et al., Lab Chip. 2015, vol. 15: 1230-1249).


Use of Specific Binding Agents

In certain embodiments, the aforementioned methods and techniques may employ agent(s) capable of specifically binding to one or more gene products, e.g., peptides, polypeptides, proteins, or nucleic acids, expressed or not expressed by the immune cells as taught herein. In certain preferred embodiments, such one or more gene products, e.g., peptides, polypeptides, or proteins, may be expressed on the cell surface of the immune cells (i.e., cell surface markers, e.g., transmembrane peptides, polypeptides or proteins, or secreted peptides, polypeptides or proteins which remain associated with the cell surface). Hence, further disclosed are binding agents capable of specifically binding to markers, such as genes or gene products, e.g., peptides, polypeptides, proteins, or nucleic acids as taught herein. Binding agents as intended throughout this specification may include inter alia antibodies, aptamers, spiegelmers (L-aptamers), photoaptamers, protein, peptides, peptidomimetics, nucleic acids such as oligonucleotides (e.g., hybridization probes or amplification or sequencing primers and primer pairs), small molecules, or combinations thereof.


The term “aptamer” refers to single-stranded or double-stranded oligo-DNA, oligo-RNA or oligo-DNA/RNA or any analogue thereof that specifically binds to a target molecule such as a peptide. Advantageously, aptamers display fairly high specificity and affinity (e.g., KA in the order 1×109 M−1) for their targets. Aptamer production is described inter alia in U.S. Pat. No. 5,270,163; Ellington & Szostak 1990 (Nature 346: 818-822); Tuerk & Gold 1990 (Science 249: 505-510); or “The Aptamer Handbook: Functional Oligonucleotides and Their Applications”, by Klussmann, ed., Wiley-VCH 2006, ISBN 3527310592, incorporated by reference herein. The term “photoaptamer” refers to an aptamer that contains one or more photoreactive functional groups that can covalently bind to or crosslink with a target molecule. The term “spiegelmer” refers to an aptamer which includes L-DNA, L-RNA, or other left-handed nucleotide derivatives or nucleotide-like molecules. Aptamers containing left-handed nucleotides are resistant to degradation by naturally occurring enzymes, which normally act on substrates containing right-handed nucleotides. The term “peptidomimetic” refers to a non-peptide agent that is a topological analogue of a corresponding peptide. Methods of rationally designing peptidomimetics of peptides are known in the art. For example, the rational design of three peptidomimetics based on the sulphated 8-mer peptide CCK26-33, and of two peptidomimetics based on the 11-mer peptide Substance P, and related peptidomimetic design principles, are described in Horwell 1995 (Trends Biotechnol 13: 132-134).


Binding agents may be in various forms, e.g., lyophilised, free in solution, or immobilised on a solid phase. They may be, e.g., provided in a multi-well plate or as an array or microarray, or they may be packaged separately, individually, or in combination.


The term “specifically bind” as used throughout this specification means that an agent (denoted herein also as “specific-binding agent”) binds to one or more desired molecules or analytes (e.g., peptides, polypeptides, proteins, or nucleic acids) substantially to the exclusion of other molecules which are random or unrelated, and optionally substantially to the exclusion of other molecules that are structurally related. The term “specifically bind” does not necessarily require that an agent binds exclusively to its intended target(s). For example, an agent may be said to specifically bind to target(s) of interest if its affinity for such intended target(s) under the conditions of binding is at least about 2-fold greater, preferably at least about 5-fold greater, more preferably at least about 10-fold greater, yet more preferably at least about 25-fold greater, still more preferably at least about 50-fold greater, and even more preferably at least about 100-fold, or at least about 1000-fold, or at least about 104-fold, or at least about 105-fold, or at least about 106-fold or more greater, than its affinity for a non-target molecule, such as for a suitable control molecule (e.g., bovine serum albumin, casein).


Preferably, the specific binding agent may bind to its intended target(s) with affinity constant (KA) of such binding KA≥1×106 M−1, more preferably KA≥1×107 M−1, yet more preferably KA≥1×108 M−1, even more preferably KA≥1×109 M−1, and still more preferably KA≥1×1010 M−1 or KA≥1×1011 M−1 or KA≥1×1012 M−1, wherein KA=[SBA_T]/[SBA][T], SBA denotes the specific-binding agent, T denotes the intended target. Determination of KA can be carried out by methods known in the art, such as for example, using equilibrium dialysis and Scatchard plot analysis.


In certain embodiments, the one or more binding agents may be one or more antibodies. As used herein, the term “antibody” is used in its broadest sense and generally refers to any immunologic binding agent. The term specifically encompasses intact monoclonal antibodies, polyclonal antibodies, multivalent (e.g., 2-, 3- or more-valent) and/or multi-specific antibodies (e.g., bi- or more-specific antibodies) formed from at least two intact antibodies, and antibody fragments insofar they exhibit the desired biological activity (particularly, ability to specifically bind an antigen of interest, i.e., antigen-binding fragments), as well as multivalent and/or multi-specific composites of such fragments. The term “antibody” is not only inclusive of antibodies generated by methods comprising immunization, but also includes any polypeptide, e.g., a recombinantly expressed polypeptide, which is made to encompass at least one complementarity-determining region (CDR) capable of specifically binding to an epitope on an antigen of interest. Hence, the term applies to such molecules regardless whether they are produced in vitro or in vivo. Antibodies also encompasses chimeric, humanized and fully humanized antibodies.


An antibody may be any of IgA, IgD, IgE, IgG and IgM classes, and preferably IgG class antibody. An antibody may be a polyclonal antibody, e.g., an antiserum or immunoglobulins purified there from (e.g., affinity-purified). An antibody may be a monoclonal antibody or a mixture of monoclonal antibodies. Monoclonal antibodies can target a particular antigen or a particular epitope within an antigen with greater selectivity and reproducibility. By means of example and not limitation, monoclonal antibodies may be made by the hybridoma method first described by Kohler et al. 1975 (Nature 256: 495), or may be made by recombinant DNA methods (e.g., as in U.S. Pat. No. 4,816,567). Monoclonal antibodies may also be isolated from phage antibody libraries using techniques as described by Clackson et al. 1991 (Nature 352: 624-628) and Marks et al. 1991 (J Mol Biol 222: 581-597), for example.


Antibody binding agents may be antibody fragments. “Antibody fragments” comprise a portion of an intact antibody, comprising the antigen-binding or variable region thereof. Examples of antibody fragments include Fab, Fab′, F(ab′)2, Fv and scFv fragments, single domain (sd) Fv, such as VH domains, VL domains and VHH domains; diabodies; linear antibodies; single-chain antibody molecules, in particular heavy-chain antibodies; and multivalent and/or multispecific antibodies formed from antibody fragment(s), e.g., dibodies, tribodies, and multibodies. The above designations Fab, Fab′, F(ab′)2, Fv, scFv etc. are intended to have their art-established meaning.


The term antibody includes antibodies originating from or comprising one or more portions derived from any animal species, preferably vertebrate species, including, e.g., birds and mammals. Without limitation, the antibodies may be chicken, turkey, goose, duck, guinea fowl, quail or pheasant. Also without limitation, the antibodies may be human, murine (e.g., mouse, rat, etc.), donkey, rabbit, goat, sheep, guinea pig, camel (e.g., Camelus bactrianus and Camelus dromaderius), llama (e.g., Lama paccos, Lama glama or Lama vicugna) or horse.


A skilled person will understand that an antibody can include one or more amino acid deletions, additions and/or substitutions (e.g., conservative substitutions), insofar such alterations preserve its binding of the respective antigen. An antibody may also include one or more native or artificial modifications of its constituent amino acid residues (e.g., glycosylation, etc.).


Methods of producing polyclonal and monoclonal antibodies as well as fragments thereof are well known in the art, as are methods to produce recombinant antibodies or fragments thereof (see for example, Harlow and Lane, “Antibodies: A Laboratory Manual”, Cold Spring Harbour Laboratory, New York, 1988; Harlow and Lane, “Using Antibodies: A Laboratory Manual”, Cold Spring Harbour Laboratory, New York, 1999, ISBN 0879695447; “Monoclonal Antibodies: A Manual of Techniques”, by Zola, ed., CRC Press 1987, ISBN 0849364760; “Monoclonal Antibodies: A Practical Approach”, by Dean & Shepherd, eds., Oxford University Press 2000, ISBN 0199637229; Methods in Molecular Biology, vol. 248: “Antibody Engineering: Methods and Protocols”, Lo, ed., Humana Press 2004, ISBN 1588290921).


As used herein, a “blocking” antibody or an antibody “antagonist” is one which inhibits or reduces biological activity of the antigen(s) it binds. In certain embodiments, the blocking antibodies or antagonist antibodies or portions thereof described herein completely inhibit the biological activity of the antigen(s).


Antibodies may act as agonists or antagonists of the recognized polypeptides. For example, the present invention includes antibodies which disrupt receptor/ligand interactions either partially or fully. The invention features both receptor-specific antibodies and ligand-specific antibodies. The invention also features receptor-specific antibodies which do not prevent ligand binding but prevent receptor activation. Receptor activation (i.e., signaling) may be determined by techniques described herein or otherwise known in the art. For example, receptor activation can be determined by detecting the phosphorylation (e.g., tyrosine or serine/threonine) of the receptor or of one of its down-stream substrates by immunoprecipitation followed by western blot analysis. In specific embodiments, antibodies are provided that inhibit ligand activity or receptor activity by at least 95%, at least 90%, at least 85%, at least 80%, at least 75%, at least 70%, at least 60%, or at least 50% of the activity in absence of the antibody.


The invention also features receptor-specific antibodies which both prevent ligand binding and receptor activation as well as antibodies that recognize the receptor-ligand complex. Likewise, encompassed by the invention are neutralizing antibodies which bind the ligand and prevent binding of the ligand to the receptor, as well as antibodies which bind the ligand, thereby preventing receptor activation, but do not prevent the ligand from binding the receptor. Further included in the invention are antibodies which activate the receptor. These antibodies may act as receptor agonists, i.e., potentiate or activate either all or a subset of the biological activities of the ligand-mediated receptor activation, for example, by inducing dimerization of the receptor. The antibodies may be specified as agonists, antagonists or inverse agonists for biological activities comprising the specific biological activities of the peptides disclosed herein. The antibody agonists and antagonists can be made using methods known in the art. See, e.g., PCT publication WO 96/40281; U.S. Pat. No. 5,811,097; Deng et al., Blood 92(6):1981-1988 (1998); Chen et al., Cancer Res. 58(16):3668-3678 (1998); Harrop et al., J. Immunol. 161(4):1786-1794 (1998); Zhu et al., Cancer Res. 58(15):3209-3214 (1998); Yoon et al., J. Immunol. 160(7):3170-3179 (1998); Prat et al., J. Cell. Sci. III (Pt2):237-247 (1998); Pitard et al., J. Immunol. Methods 205(2):177-190 (1997); Liautard et al., Cytokine 9(4):233-241 (1997); Carlson et al., J. Biol. Chem. 272(17):11295-11301 (1997); Taryman et al., Neuron 14(4):755-762 (1995); Muller et al., Structure 6(9):1153-1167 (1998); Bartunek et al., Cytokine 8(1):14-20 (1996).


The antibodies as defined for the present invention include derivatives that are modified, i.e., by the covalent attachment of any type of molecule to the antibody such that covalent attachment does not prevent the antibody from generating an anti-idiotypic response. For example, but not by way of limitation, the antibody derivatives include antibodies that have been modified, e.g., by glycosylation, acetylation, pegylation, phosphylation, amidation, derivatization by known protecting/blocking groups, proteolytic cleavage, linkage to a cellular ligand or other protein, etc. Any of numerous chemical modifications may be carried out by known techniques, including, but not limited to specific chemical cleavage, acetylation, formylation, metabolic synthesis of tunicamycin, etc. Additionally, the derivative may contain one or more non-classical amino acids.


Simple binding assays can be used to screen for or detect agents that bind to a target protein, or disrupt the interaction between proteins (e.g., a receptor and a ligand). Because certain targets of the present invention are transmembrane proteins, assays that use the soluble forms of these proteins rather than full-length protein can be used, in some embodiments. Soluble forms include, for example, those lacking the transmembrane domain and/or those comprising the IgV domain or fragments thereof which retain their ability to bind their cognate binding partners. Further, agents that inhibit or enhance protein interactions for use in the compositions and methods described herein, can include recombinant peptido-mimetics.


Detection methods useful in screening assays include antibody-based methods, detection of a reporter moiety, detection of cytokines as described herein, and detection of a gene signature as described herein.


Another variation of assays to determine binding of a receptor protein to a ligand protein is through the use of affinity biosensor methods. Such methods may be based on the piezoelectric effect, electrochemistry, or optical methods, such as ellipsometry, optical wave guidance, and surface plasmon resonance (SPR).


The term “antibody-like protein scaffolds” or “engineered protein scaffolds” broadly encompasses proteinaceous non-immunoglobulin specific-binding agents, typically obtained by combinatorial engineering (such as site-directed random mutagenesis in combination with phage display or other molecular selection techniques). Usually, such scaffolds are derived from robust and small soluble monomeric proteins (such as Kunitz inhibitors or lipocalins) or from a stably folded extra-membrane domain of a cell surface receptor (such as protein A, fibronectin or the ankyrin repeat).


Such scaffolds have been extensively reviewed in Binz et al. (Engineering novel binding proteins from nonimmunoglobulin domains. Nat Biotechnol 2005, 23:1257-1268), Gebauer and Skerra (Engineered protein scaffolds as next-generation antibody therapeutics. Curr Opin Chem Biol. 2009, 13:245-55), Gill and Damle (Biopharmaceutical drug discovery using novel protein scaffolds. Curr Opin Biotechnol 2006, 17:653-658), Skerra (Engineered protein scaffolds for molecular recognition. J Mol Recognit 2000, 13:167-187), and Skerra (Alternative non-antibody scaffolds for molecular recognition. Curr Opin Biotechnol 2007, 18:295-304), and include without limitation affibodies, based on the Z-domain of staphylococcal protein A, a three-helix bundle of 58 residues providing an interface on two of its alpha-helices (Nygren, Alternative binding proteins: Affibody binding proteins developed from a small three-helix bundle scaffold. FEBS J 2008, 275:2668-2676); engineered Kunitz domains based on a small (ca. 58 residues) and robust, disulphide-crosslinked serine protease inhibitor, typically of human origin (e.g. LACI-D1), which can be engineered for different protease specificities (Nixon and Wood, Engineered protein inhibitors of proteases. Curr Opin Drug Discov Dev 2006, 9:261-268); monobodies or adnectins based on the 10th extracellular domain of human fibronectin III (10Fn3), which adopts an Ig-like beta-sandwich fold (94 residues) with 2-3 exposed loops, but lacks the central disulphide bridge (Koide and Koide, Monobodies: antibody mimics based on the scaffold of the fibronectin type III domain. Methods Mol Biol 2007, 352:95-109); anticalins derived from the lipocalins, a diverse family of eight-stranded beta-barrel proteins (ca. 180 residues) that naturally form binding sites for small ligands by means of four structurally variable loops at the open end, which are abundant in humans, insects, and many other organisms (Skerra, Alternative binding proteins: Anticalins-harnessing the structural plasticity of the lipocalin ligand pocket to engineer novel binding activities. FEBS J 2008, 275:2677-2683); DARPins, designed ankyrin repeat domains (166 residues), which provide a rigid interface arising from typically three repeated beta-turns (Stumpp et al., DARPins: a new generation of protein therapeutics. Drug Discov Today 2008, 13:695-701); avimers (multimerized LDLR-A module) (Silverman et al., Multivalent avimer proteins evolved by exon shuffling of a family of human receptor domains. Nat Biotechnol 2005, 23:1556-1561); and cysteine-rich knottin peptides (Kolmar, Alternative binding proteins: biological activity and therapeutic potential of cystine-knot miniproteins. FEBS J 2008, 275:2684-2690).


Nucleic acid binding agents, such as oligonucleotide binding agents, are typically at least partly antisense to a target nucleic acid of interest. The term “antisense” generally refers to an agent (e.g., an oligonucleotide) configured to specifically anneal with (hybridize to) a given sequence in a target nucleic acid, such as for example in a target DNA, hnRNA, pre-mRNA or mRNA, and typically comprises, consist essentially of or consist of a nucleic acid sequence that is complementary or substantially complementary to said target nucleic acid sequence. Antisense agents suitable for use herein, such as hybridisation probes or amplification or sequencing primers and primer pairs) may typically be capable of annealing with (hybridizing to) the respective target nucleic acid sequences at high stringency conditions, and capable of hybridizing specifically to the target under physiological conditions. The terms “complementary” or “complementarity” as used throughout this specification with reference to nucleic acids, refer to the normal binding of single-stranded nucleic acids under permissive salt (ionic strength) and temperature conditions by base pairing, preferably Watson-Crick base pairing. By means of example, complementary Watson-Crick base pairing occurs between the bases A and T, A and U or G and C. For example, the sequence 5′-A-G-U-3′ is complementary to sequence 5′-A-C-U-3′.


The reference to oligonucleotides may in particular but without limitation include hybridization probes and/or amplification primers and/or sequencing primers, etc., as commonly used in nucleic acid detection technologies.


Binding agents as discussed herein may suitably comprise a detectable label. The term “label” refers to any atom, molecule, moiety or biomolecule that may be used to provide a detectable and preferably quantifiable read-out or property, and that may be attached to or made part of an entity of interest, such as a binding agent. Labels may be suitably detectable by for example mass spectrometric, spectroscopic, optical, colourimetric, magnetic, photochemical, biochemical, immunochemical or chemical means. Labels include without limitation dyes; radiolabels such as 32P, 33P, 35S, 125I, 131I; electron-dense reagents; enzymes (e.g., horse-radish peroxidase or alkaline phosphatase as commonly used in immunoassays); binding moieties such as biotin-streptavidin; haptens such as digoxigenin; luminogenic, phosphorescent or fluorogenic moieties; mass tags; and fluorescent dyes alone or in combination with moieties that may suppress or shift emission spectra by fluorescence resonance energy transfer (FRET).


In some embodiments, binding agents may be provided with a tag that permits detection with another agent (e.g., with a probe binding partner). Such tags may be, for example, biotin, streptavidin, his-tag, myc tag, maltose, maltose binding protein or any other kind of tag known in the art that has a binding partner. Example of associations which may be utilised in the probe:binding partner arrangement may be any, and includes, for example biotin: streptavidin, his-tag:metal ion (e.g., Ni2+), maltose:maltose binding protein, etc.


The marker-binding agent conjugate may be associated with or attached to a detection agent to facilitate detection. Examples of detection agents include, but are not limited to, luminescent labels; colourimetric labels, such as dyes; fluorescent labels; or chemical labels, such as electroactive agents (e.g., ferrocyanide); enzymes; radioactive labels; or radiofrequency labels. The detection agent may be a particle. Examples of such particles include, but are not limited to, colloidal gold particles; colloidal sulphur particles; colloidal selenium particles; colloidal barium sulfate particles; colloidal iron sulfate particles; metal iodate particles; silver halide particles; silica particles; colloidal metal (hydrous) oxide particles; colloidal metal sulfide particles; colloidal lead selenide particles; colloidal cadmium selenide particles; colloidal metal phosphate particles; colloidal metal ferrite particles; any of the above-mentioned colloidal particles coated with organic or inorganic layers; protein or peptide molecules; liposomes; or organic polymer latex particles, such as polystyrene latex beads. Preferable particles may be colloidal gold particles.


In certain embodiments, the one or more binding agents are configured for use in a technique selected from the group consisting of flow cytometry, fluorescence activated cell sorting, mass cytometry, fluorescence microscopy, affinity separation, magnetic cell separation, microfluidic separation, and combinations thereof.


Pharmaceutical Compositions Using Isolated Cells

In another aspect, the present invention provides for a pharmaceutical composition comprising the CD8+ T cell or the CD8+ T cell population as defined in any embodiment herein. In certain embodiments, the CD8+ T cell or the CD8+ T cell population may be formulated into a pharmaceutical composition.


In certain embodiments, the immune cell or immune cell population is autologous to said subject, i.e., the immune cell or immune cell population is isolated from the same subject as the subject to which/whom the immune cell or immune cell population is to be administered. In certain further embodiments, the immune cell or immune cell population is syngeneic to said subject, i.e., the immune cell or immune cell population is isolated from an identical twin of the subject to which/whom the immune cell or immune cell population is to be administered. In certain further embodiments, the immune cell or immune cell population is allogeneic to said subject, i.e., the immune cell or immune cell population is isolated from a different subject of the same species as the subject to which/whom the immune cell or immune cell population is to be administered. In certain embodiments, the immune cell or immune cell population may even be xenogeneic to said subject, i.e., the immune cell or immune cell population may be isolated from a subject of a different species than the subject to which/whom the immune cell or immune cell population is to be administered.


Preferably, non-autologous, such as allogeneic cells may be selected such as to maximize the tissue compatibility between the subject and the administered cells, thereby reducing the chance of rejection of the administered cells by patient's immune system or graft-vs.-host reaction. For example, advantageously the cells may be typically selected which have either identical HLA haplotypes (including one or preferably more HLA-A, HLA-B, HLA-C, HLA-D, HLA-DR, HLA-DP and HLA-DQ) to the subject, or which have the most HLA antigen alleles common to the subject and none or the least of HLA antigens to which the subject contains pre existing anti-HLA antibodies. In certain embodiments, allogenic T cells may be modified to prevent rejection from an allogenic healthy donor (described further herein).


A “pharmaceutical composition” refers to a composition that usually contains an excipient, such as a pharmaceutically acceptable carrier that is conventional in the art and that is suitable for administration to cells or to a subject.


The term “pharmaceutically acceptable” as used throughout this specification is consistent with the art and means compatible with the other ingredients of a pharmaceutical composition and not deleterious to the recipient thereof.


As used herein, “carrier” or “excipient” includes any and all solvents, diluents, buffers (such as, e.g., neutral buffered saline or phosphate buffered saline), solubilizers, colloids, dispersion media, vehicles, fillers, chelating agents (such as, e.g., EDTA or glutathione), amino acids (such as, e.g., glycine), proteins, disintegrants, binders, lubricants, wetting agents, emulsifiers, sweeteners, colorants, flavorings, aromatizers, thickeners, agents for achieving a depot effect, coatings, antifungal agents, preservatives, stabilizers, antioxidants, tonicity controlling agents, absorption delaying agents, and the like. The use of such media and agents for pharmaceutical active components is well known in the art. Such materials should be non-toxic and should not interfere with the activity of the cells or active components.


The precise nature of the carrier or excipient or other material will depend on the route of administration. For example, the composition may be in the form of a parenterally acceptable aqueous solution, which is pyrogen-free and has suitable pH, isotonicity and stability. For general principles in medicinal formulation, the reader is referred to Cell Therapy: Stem Cell Transplantation, Gene Therapy, and Cellular Immunotherapy, by G. Morstyn & W. Sheridan eds., Cambridge University Press, 1996; and Hematopoietic Stem Cell Therapy, E. D. Ball, J. Lister & P. Law, Churchill Livingstone, 2000.


The pharmaceutical composition can be applied parenterally, rectally, orally or topically. Preferably, the pharmaceutical composition may be used for intravenous, intramuscular, subcutaneous, peritoneal, peridural, rectal, nasal, pulmonary, mucosal, or oral application. In a preferred embodiment, the pharmaceutical composition according to the invention is intended to be used as an infusion. The skilled person will understand that compositions which are to be administered orally or topically will usually not comprise cells, although it may be envisioned for oral compositions to also comprise cells, for example when gastro-intestinal tract indications are treated. Each of the cells or active components (e.g., immunomodulants) as discussed herein may be administered by the same route or may be administered by a different route. By means of example, and without limitation, cells may be administered parenterally and other active components may be administered orally.


Liquid pharmaceutical compositions may generally include a liquid carrier such as water or a pharmaceutically acceptable aqueous solution. For example, physiological saline solution, tissue or cell culture media, dextrose or other saccharide solution or glycols such as ethylene glycol, propylene glycol or polyethylene glycol may be included.


The composition may include one or more cell protective molecules, cell regenerative molecules, growth factors, anti-apoptotic factors or factors that regulate gene expression in the cells. Such substances may render the cells independent of their environment.


Such pharmaceutical compositions may contain further components ensuring the viability of the cells therein. For example, the compositions may comprise a suitable buffer system (e.g., phosphate or carbonate buffer system) to achieve desirable pH, more usually near neutral pH, and may comprise sufficient salt to ensure isoosmotic conditions for the cells to prevent osmotic stress. For example, suitable solution for these purposes may be phosphate-buffered saline (PBS), sodium chloride solution, Ringer's Injection or Lactated Ringer's Injection, as known in the art. Further, the composition may comprise a carrier protein, e.g., albumin (e.g., bovine or human albumin), which may increase the viability of the cells.


Further suitably pharmaceutically acceptable carriers or additives are well known to those skilled in the art and for instance may be selected from proteins such as collagen or gelatine, carbohydrates such as starch, polysaccharides, sugars (dextrose, glucose and sucrose), cellulose derivatives like sodium or calcium carboxymethylcellulose, hydroxypropyl cellulose or hydroxypropylmethyl cellulose, pregeletanized starches, pectin agar, carrageenan, clays, hydrophilic gums (acacia gum, guar gum, arabic gum and xanthan gum), alginic acid, alginates, hyaluronic acid, polyglycolic and polylactic acid, dextran, pectins, synthetic polymers such as water-soluble acrylic polymer or polyvinylpyrrolidone, proteoglycans, calcium phosphate and the like.


In certain embodiments, a pharmaceutical cell preparation as taught herein may be administered in a form of liquid composition. In embodiments, the cells or pharmaceutical composition comprising such can be administered systemically, topically, within an organ or at a site of organ dysfunction or lesion.


Preferably, the pharmaceutical compositions may comprise a therapeutically effective amount of the specified immune cells and/or other active components (e.g., immunomodulants). The term “therapeutically effective amount” refers to an amount which can elicit a biological or medicinal response in a tissue, system, animal or human that is being sought by a researcher, veterinarian, medical doctor or other clinician, and in particular can prevent or alleviate one or more of the local or systemic symptoms or features of a disease or condition being treated.


Activated T Cell Compositions

A further aspect of the invention relates to a method for preparing a composition comprising activated T cells, the method comprising depleting suppressive T cells from a biological sample of a subject and contacting the remaining T cells in vitro with an immune cell or immune cell population, wherein the immune cell or immune cell population has been loaded with an antigen.


“Activation” generally refers to the state of a cell, such as preferably T cell, following sufficient cell surface moiety ligation (e.g., interaction between the T cell receptor on the surface of a T cell (such as naturally-occurring TCR or genetically engineered TCR, e.g., chimeric antigen receptor, CAR) and MHC-bound antigen peptide presented on the surface of an antigen presenting cell (e.g., dendritic cell) to induce a noticeable biochemical or morphological change of the cell, such as preferably T cell. In particular, “activation” may refer to the state of a T cell that has been sufficiently stimulated to induce detectable cellular proliferation of the T cell. Activation can also encompass induced cytokine production, and detectable T cell effector functions, e.g., regulatory or cytolytic effector functions. The T cells and antigen presenting cells may be suitably contacted by admixing the T cells and antigen presenting cells in an aqueous composition, e.g., in a culture medium, in sufficient numbers and for a sufficient duration of time to produce the desired T cell activation.


A further aspect of the invention relates to a method for adoptive immunotherapy in a subject in need thereof comprising administering to said subject a composition comprising activated T cells prepared with the method as taught above.


In certain embodiments, said T cells are CD8+ T cells, i.e., T cells expressing the CD8+ cell surface marker. More preferably, said T cells may be CD8+ T cells and said subject is suffering from proliferative disease.


In certain embodiments, the T cell, preferably a CD8+ T cell, may display specificity to a desired antigen, such as specificity to a tumor antigen (tumor antigen specificity). By means of an example, the T cell, preferably a CD8+ T cell, may have been isolated from a tumor of a subject. More preferably, the immune cell may be a tumor infiltrating lymphocyte (TIL). Generally, “tumor infiltrating lymphocytes” or “TILs” refer to white blood cells that have left the bloodstream and migrated into a tumor. Such T cells typically endogenously express a T cell receptor having specificity to an antigen expressed by the tumor cells (tumor antigen specificity).


In alternative embodiments, a T cell, preferably a CD8+ T cell, may be engineered to express a T cell receptor having specificity to a desired antigen, such as specificity to a tumor antigen (tumor antigen specificity). For example, the T cell, preferably a CD8+ T cell, may comprise a chimeric antigen receptor (CAR) having specificity to a desired antigen, such as a tumor-specific chimeric antigen receptor (CAR).


Adoptive Cell Therapy or Transfer

In certain embodiments, cells as described herein and below may be used for adoptive cell transfer (ACT). As used herein, “ACT”, “adoptive cell therapy” and “adoptive cell transfer” may be used interchangeably. In certain embodiments, the interaction of immune cells is advantageously used, such as modulating and/or transferring one immune cell subtype to cause an effect in another immune cell subtype. The transferred cells may include and be modulated by immune cells or immune cell populations as taught herein. In certain embodiments, the suppressive T cells of the present invention are depleted from cells used in ACT and the depleted cells may be transferred to a subject suffering from a disease (e.g., cancer). In certain embodiments, the cells of the present invention may be transferred to a subject suffering from a disease characteristic of an over reactive immune response (e.g., autoimmune disease). In certain embodiments, adoptive cell transfer may comprise: isolating from a biological sample of the subject a CD4+ and/or CD8+ T cell or CD4+ and/or CD8+ T cell population as described herein; in vitro expanding the T cell or T cell population; and administering the in vitro expanded T cell or T cell population to the subject. The method may further comprise enriching the expanded T cells for one subtype. In certain embodiments, the method may further comprise formulating the in vitro expanded immune cell or immune cell population into a pharmaceutical composition.


In certain embodiments, the present invention comprises adoptive cell therapy. In certain embodiments, Adoptive cell therapy (ACT) can refer to the transfer of cells to a patient with the goal of transferring the functionality and characteristics into the new host by engraftment of the cells (see, e.g., Mettananda et al., Editing an α-globin enhancer in primary human hematopoietic stem cells as a treatment for β-thalassemia, Nat Commun. 2017 Sep. 4; 8(1):424). As used herein, the term “engraft” or “engraftment” refers to the process of cell incorporation into a tissue of interest in vivo through contact with existing cells of the tissue. Adoptive cell therapy (ACT) can refer to the transfer of cells, most commonly immune-derived cells, back into the same patient or into a new recipient host with the goal of transferring the immunologic functionality and characteristics into the new host. If possible, use of autologous cells helps the recipient by minimizing GVHD issues. The adoptive transfer of autologous tumor infiltrating lymphocytes (TIL) (Besser et al., (2010) Clin. Cancer Res 16 (9) 2646-55; Dudley et al., (2002) Science 298 (5594): 850-4; and Dudley et al., (2005) Journal of Clinical Oncology 23 (10): 2346-57.) or genetically re-directed peripheral blood mononuclear cells (Johnson et al., (2009) Blood 114 (3): 535-46; and Morgan et al., (2006) Science 314(5796) 126-9) has been used to successfully treat patients with advanced solid tumors, including melanoma and colorectal carcinoma, as well as patients with CD19-expressing hematologic malignancies (Kalos et al., (2011) Science Translational Medicine 3 (95): 95ra73). In certain embodiments, allogenic cells immune cells are transferred (see, e.g., Ren et al., (2017) Clin Cancer Res 23 (9) 2255-2266). As described further herein, allogenic cells can be edited to reduce alloreactivity and prevent graft-versus-host disease. Thus, use of allogenic cells allows for cells to be obtained from healthy donors and prepared for use in patients as opposed to preparing autologous cells from a patient after diagnosis.


Aspects of the invention involve the adoptive transfer of immune system cells, such as T cells, specific for selected antigens, such as tumor associated antigens or tumor specific neoantigens (see, e.g., Maus et al., 2014, Adoptive Immunotherapy for Cancer or Viruses, Annual Review of Immunology, Vol. 32: 189-225; Rosenberg and Restifo, 2015, Adoptive cell transfer as personalized immunotherapy for human cancer, Science Vol. 348 no. 6230 pp. 62-68; Restifo et al., 2015, Adoptive immunotherapy for cancer: harnessing the T cell response. Nat. Rev. Immunol. 12(4): 269-281; and Jenson and Riddell, 2014, Design and implementation of adoptive therapy with chimeric antigen receptor-modified T cells. Immunol Rev. 257(1): 127-144; and Rajasagi et al., 2014, Systematic identification of personal tumor-specific neoantigens in chronic lymphocytic leukemia. Blood. 2014 Jul. 17; 124(3):453-62).


In certain embodiments, an antigen (such as a tumor antigen) to be targeted in adoptive cell therapy (such as particularly CAR or TCR T-cell therapy) of a disease (such as particularly of tumor or cancer) may be selected from a group consisting of: B cell maturation antigen (BCMA) (see, e.g., Friedman et al., Effective Targeting of Multiple BCMA-Expressing Hematological Malignancies by Anti-BCMA CAR T Cells, Hum Gene Ther. 2018 Mar. 8; Berdeja J G, et al. Durable clinical responses in heavily pretreated patients with relapsed/refractory multiple myeloma: updated results from a multicenter study of bb2121 anti-Bcma CAR T cell therapy. Blood. 2017; 130:740; and Mouhieddine and Ghobrial, Immunotherapy in Multiple Myeloma: The Era of CAR T Cell Therapy, Hematologist, May-June 2018, Volume 15, issue 3); PSA (prostate-specific antigen); prostate-specific membrane antigen (PSMA); PSCA (Prostate stem cell antigen); Tyrosine-protein kinase transmembrane receptor ROR1; fibroblast activation protein (FAP); Tumor-associated glycoprotein 72 (TAG72); Carcinoembryonic antigen (CEA); Epithelial cell adhesion molecule (EPCAM); Mesothelin; Human Epidermal growth factor Receptor 2 (ERBB2 (Her2/neu)); Prostase; Prostatic acid phosphatase (PAP); elongation factor 2 mutant (ELF2M); Insulin-like growth factor 1 receptor (IGF-1R); gplOO; BCR-ABL (breakpoint cluster region-Abelson); tyrosinase; New York esophageal squamous cell carcinoma 1 (NY-ESO-1); K-light chain, LAGE (L antigen); MAGE (melanoma antigen); Melanoma-associated antigen 1 (MAGE-A1); MAGE A3; MAGE A6; legumain; Human papillomavirus (HPV) E6; HPV E7; prostein; survivin; PCTA1 (Galectin 8); Melan-A/MART-1; Ras mutant; TRP-1 (tyrosinase related protein 1, or gp75); Tyrosinase-related Protein 2 (TRP2); TRP-2/INT2 (TRP-2/intron 2); RAGE (renal antigen); receptor for advanced glycation end products 1 (RAGE1); Renal ubiquitous 1, 2 (RU1, RU2); intestinal carboxyl esterase (iCE); Heat shock protein 70-2 (HSP70-2) mutant; thyroid stimulating hormone receptor (TSHR); CD123; CD171; CD19; CD20; CD22; CD26; CD30; CD33; CD44v7/8 (cluster of differentiation 44, exons 7/8); CD53; CD92; CD100; CD148; CD150; CD200; CD261; CD262; CD362; CS-1 (CD2 subset 1, CRACC, SLAMF7, CD319, and 19A24); C-type lectin-like molecule-1 (CLL-1); ganglioside GD3 (aNeu5Ac(2-8)aNeu5Ac(2-3)bDGalp(1-4)bDGlcp(1-1)Cer); Tn antigen (Tn Ag); Fms-Like Tyrosine Kinase 3 (FLT3); CD38; CD138; CD44v6; B7H3 (CD276); KIT (CD117); Interleukin-13 receptor subunit alpha-2 (IL-13Ra2); Interleukin 11 receptor alpha (IL-11Ra); prostate stem cell antigen (PSCA); Protease Serine 21 (PRSS21); vascular endothelial growth factor receptor 2 (VEGFR2); Lewis(Y) antigen; CD24; Platelet-derived growth factor receptor beta (PDGFR-beta); stage-specific embryonic antigen-4 (SSEA-4); Mucin 1, cell surface associated (MUC1); mucin 16 (MUC16); epidermal growth factor receptor (EGFR); epidermal growth factor receptor variant III (EGFRvIII); neural cell adhesion molecule (NCAM); carbonic anhydrase IX (CAIX); Proteasome (Prosome, Macropain) Subunit, Beta Type, 9 (LMP2); ephrin type-A receptor 2 (EphA2); Ephrin B2; Fucosyl GM1; sialyl Lewis adhesion molecule (sLe); ganglioside GM3 (aNeu5Ac(2-3)bDGalp(1-4)bDGlcp(1-1)Cer); TGS5; high molecular weight-melanoma-associated antigen (HMWMAA); o-acetyl-GD2 ganglioside (OAcGD2); Folate receptor alpha; Folate receptor beta; tumor endothelial marker 1 (TEM1/CD248); tumor endothelial marker 7-related (TEM7R); claudin 6 (CLDN6); G protein-coupled receptor class C group 5, member D (GPRC5D); chromosome X open reading frame 61 (CXORF61); CD97; CD179a; anaplastic lymphoma kinase (ALK); Polysialic acid; placenta-specific 1 (PLAC1); hexasaccharide portion of globoH glycoceramide (GloboH); mammary gland differentiation antigen (NY-BR-1); uroplakin 2 (UPK2); Hepatitis A virus cellular receptor 1 (HAVCR1); adrenoceptor beta 3 (ADRB3); pannexin 3 (PANX3); G protein-coupled receptor 20 (GPR20); lymphocyte antigen 6 complex, locus K 9 (LY6K); Olfactory receptor 51E2 (OR51E2); TCR Gamma Alternate Reading Frame Protein (TARP); Wilms tumor protein (WT1); ETS translocation-variant gene 6, located on chromosome 12p (ETV6-AML); sperm protein 17 (SPA17); X Antigen Family, Member 1A (XAGE1); angiopoietin-binding cell surface receptor 2 (Tie 2); CT (cancer/testis (antigen)); melanoma cancer testis antigen-1 (MAD-CT-1); melanoma cancer testis antigen-2 (MAD-CT-2); Fos-related antigen 1; p53; p53 mutant; human Telomerase reverse transcriptase (hTERT); sarcoma translocation breakpoints; melanoma inhibitor of apoptosis (ML-IAP); ERG (transmembrane protease, serine 2 (TMPRSS2) ETS fusion gene); N-Acetyl glucosaminyl-transferase V (NA17); paired box protein Pax-3 (PAX3); Androgen receptor; Cyclin B1; Cyclin D1; v-myc avian myelocytomatosis viral oncogene neuroblastoma derived homolog (MYCN); Ras Homolog Family Member C (RhoC); Cytochrome P450 1B1 (CYP1B1); CCCTC-Binding Factor (Zinc Finger Protein)-Like (BORIS); Squamous Cell Carcinoma Antigen Recognized By T Cells-1 or 3 (SART1, SART3); Paired box protein Pax-5 (PAX5); proacrosin binding protein sp32 (OY-TES 1); lymphocyte-specific protein tyrosine kinase (LCK); A kinase anchor protein 4 (AKAP-4); synovial sarcoma, X breakpoint-1, -2, -3 or -4 (SSX1, SSX2, SSX3, SSX4); CD79a; CD79b; CD72; Leukocyte-associated immunoglobulin-like receptor 1 (LAIR1); Fc fragment of IgA receptor (FCAR); Leukocyte immunoglobulin-like receptor subfamily A member 2 (LILRA2); CD300 molecule-like family member f (CD300LF); C-type lectin domain family 12 member A (CLEC12A); bone marrow stromal cell antigen 2 (BST2); EGF-like module-containing mucin-like hormone receptor-like 2 (EMR2); lymphocyte antigen 75 (LY75); Glypican-3 (GPC3); Fc receptor-like 5 (FCRL5); mouse double minute 2 homolog (MDM2); livin; alphafetoprotein (AFP); transmembrane activator and CAML Interactor (TACI); B-cell activating factor receptor (BAFF-R); V-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog (KRAS); immunoglobulin lambda-like polypeptide 1 (IGLL1); 707-AP (707 alanine proline); ART-4 (adenocarcinoma antigen recognized by T4 cells); BAGE (B antigen; b-catenin/m, b-catenin/mutated); CAMEL (CTL-recognized antigen on melanoma); CAP1 (carcinoembryonic antigen peptide 1); CASP-8 (caspase-8); CDCl27m (cell-division cycle 27 mutated); CDK4/m (cycline-dependent kinase 4 mutated); Cyp-B (cyclophilin B); DAM (differentiation antigen melanoma); EGP-2 (epithelial glycoprotein 2); EGP-40 (epithelial glycoprotein 40); Erbb2, 3, 4 (erythroblastic leukemia viral oncogene homolog-2, -3, 4); FBP (folate binding protein); fAchR (Fetal acetylcholine receptor); G250 (glycoprotein 250); GAGE (G antigen); GnT-V (N-acetylglucosaminyltransferase V); HAGE (helicose antigen); ULA-A (human leukocyte antigen-A); HST2 (human signet ring tumor 2); KIAA0205; KDR (kinase insert domain receptor); LDLR/FUT (low density lipid receptor/GDP L-fucose: b-D-galactosidase 2-a-L fucosyltransferase); L1CAM (L1 cell adhesion molecule); MC1R (melanocortin 1 receptor); Myosin/m (myosin mutated); MUM-1, -2, -3 (melanoma ubiquitous mutated 1, 2, 3); NA88-A (NA cDNA clone of patient M88); KG2D (Natural killer group 2, member D) ligands; oncofetal antigen (h5T4); p190 minor bcr-abl (protein of 190 KD bcr-abl); Pml/RARa (promyelocytic leukaemia/retinoic acid receptor a); PRAME (preferentially expressed antigen of melanoma); SAGE (sarcoma antigen); TEL/AME1 (translocation Ets-family leukemia/acute myeloid leukemia 1); TPI/m (triosephosphate isomerase mutated); CD70; and any combination thereof.


In certain embodiments, an antigen to be targeted in adoptive cell therapy (such as particularly CAR or TCR T-cell therapy) of a disease (such as particularly of tumor or cancer) is a tumor-specific antigen (TSA).


In certain embodiments, an antigen to be targeted in adoptive cell therapy (such as particularly CAR or TCR T-cell therapy) of a disease (such as particularly of tumor or cancer) is a neoantigen.


In certain embodiments, an antigen to be targeted in adoptive cell therapy (such as particularly CAR or TCR T-cell therapy) of a disease (such as particularly of tumor or cancer) is a tumor-associated antigen (TAA).


In certain embodiments, an antigen to be targeted in adoptive cell therapy (such as particularly CAR or TCR T-cell therapy) of a disease (such as particularly of tumor or cancer) is a universal tumor antigen. In certain preferred embodiments, the universal tumor antigen is selected from the group consisting of: a human telomerase reverse transcriptase (hTERT), survivin, mouse double minute 2 homolog (MDM2), cytochrome P450 1B 1 (CYP1B), HER2/neu, Wilms' tumor gene 1 (WT1), livin, alphafetoprotein (AFP), carcinoembryonic antigen (CEA), mucin 16 (MUC16), MUC1, prostate-specific membrane antigen (PSMA), p53, cyclin (Dl), and any combinations thereof.


In certain embodiments, an antigen (such as a tumor antigen) to be targeted in adoptive cell therapy (such as particularly CAR or TCR T-cell therapy) of a disease (such as particularly of tumor or cancer) may be selected from a group consisting of: CD19, BCMA, CD70, CLL-1, MAGE A3, MAGE A6, HPV E6, HPV E7, WT1, CD22, CD171, ROR1, MUC16, and SSX2. In certain preferred embodiments, the antigen may be CD19. For example, CD19 may be targeted in hematologic malignancies, such as in lymphomas, more particularly in B-cell lymphomas, such as without limitation in diffuse large B-cell lymphoma, primary mediastinal b-cell lymphoma, transformed follicular lymphoma, marginal zone lymphoma, mantle cell lymphoma, acute lymphoblastic leukemia including adult and pediatric ALL, non-Hodgkin lymphoma, indolent non-Hodgkin lymphoma, or chronic lymphocytic leukemia. For example, BCMA may be targeted in multiple myeloma or plasma cell leukemia (see, e.g., 2018 American Association for Cancer Research (AACR) Annual meeting Poster: Allogeneic Chimeric Antigen Receptor T Cells Targeting B Cell Maturation Antigen). For example, CLL1 may be targeted in acute myeloid leukemia. For example, MAGE A3, MAGE A6, SSX2, and/or KRAS may be targeted in solid tumors. For example, HPV E6 and/or HPV E7 may be targeted in cervical cancer or head and neck cancer. For example, WT1 may be targeted in acute myeloid leukemia (AML), myelodysplastic syndromes (MDS), chronic myeloid leukemia (CML), non-small cell lung cancer, breast, pancreatic, ovarian or colorectal cancers, or mesothelioma. For example, CD22 may be targeted in B cell malignancies, including non-Hodgkin lymphoma, diffuse large B-cell lymphoma, or acute lymphoblastic leukemia. For example, CD171 may be targeted in neuroblastoma, glioblastoma, or lung, pancreatic, or ovarian cancers. For example, ROR1 may be targeted in ROR1+ malignancies, including non-small cell lung cancer, triple negative breast cancer, pancreatic cancer, prostate cancer, ALL, chronic lymphocytic leukemia, or mantle cell lymphoma. For example, MUC16 may be targeted in MUC16ecto+ epithelial ovarian, fallopian tube or primary peritoneal cancer. For example, CD70 may be targeted in both hematologic malignancies as well as in solid cancers such as renal cell carcinoma (RCC), gliomas (e.g., GBM), and head and neck cancers (HNSCC). CD70 is expressed in both hematologic malignancies as well as in solid cancers, while its expression in normal tissues is restricted to a subset of lymphoid cell types (see, e.g., 2018 American Association for Cancer Research (AACR) Annual meeting Poster: Allogeneic CRISPR Engineered Anti-CD70 CAR-T Cells Demonstrate Potent Preclinical Activity Against Both Solid and Hematological Cancer Cells).


Various strategies may for example be employed to genetically modify T cells by altering the specificity of the T cell receptor (TCR) for example by introducing new TCR α and β chains with selected peptide specificity (see U.S. Pat. No. 8,697,854; PCT Patent Publications: WO2003020763, WO2004033685, WO2004044004, WO2005114215, WO2006000830, WO2008038002, WO2008039818, WO2004074322, WO2005113595, WO2006125962, WO2013166321, WO2013039889, WO2014018863, WO2014083173; U.S. Pat. No. 8,088,379).


As an alternative to, or addition to, TCR modifications, chimeric antigen receptors (CARs) may be used in order to generate immunoresponsive cells, such as T cells, specific for selected targets, such as malignant cells, with a wide variety of receptor chimera constructs having been described (see U.S. Pat. Nos. 5,843,728; 5,851,828; 5,912,170; 6,004,811; 6,284,240; 6,392,013; 6,410,014; 6,753,162; 8,211,422; and, PCT Publication WO9215322).


In general, CARs are comprised of an extracellular domain, a transmembrane domain, and an intracellular domain, wherein the extracellular domain comprises an antigen-binding domain that is specific for a predetermined target. While the antigen-binding domain of a CAR is often an antibody or antibody fragment (e.g., a single chain variable fragment, scFv), the binding domain is not particularly limited so long as it results in specific recognition of a target. For example, in some embodiments, the antigen-binding domain may comprise a receptor, such that the CAR is capable of binding to the ligand of the receptor. Alternatively, the antigen-binding domain may comprise a ligand, such that the CAR is capable of binding the endogenous receptor of that ligand.


The antigen-binding domain of a CAR is generally separated from the transmembrane domain by a hinge or spacer. The spacer is also not particularly limited, and it is designed to provide the CAR with flexibility. For example, a spacer domain may comprise a portion of a human Fc domain, including a portion of the CH3 domain, or the hinge region of any immunoglobulin, such as IgA, IgD, IgE, IgG, or IgM, or variants thereof. Furthermore, the hinge region may be modified so as to prevent off-target binding by FcRs or other potential interfering objects. For example, the hinge may comprise an IgG4 Fc domain with or without a S228P, L235E, and/or N297Q mutation (according to Kabat numbering) in order to decrease binding to FcRs. Additional spacers/hinges include, but are not limited to, CD4, CD8, and CD28 hinge regions.


The transmembrane domain of a CAR may be derived either from a natural or from a synthetic source. Where the source is natural, the domain may be derived from any membrane bound or transmembrane protein. Transmembrane regions of particular use in this disclosure may be derived from CD8, CD28, CD3, CD45, CD4, CD5, CDS, CD9, CD 16, CD22, CD33, CD37, CD64, CD80, CD86, CD 134, CD137, CD 154, TCR. Alternatively, the transmembrane domain may be synthetic, in which case it will comprise predominantly hydrophobic residues such as leucine and valine. Preferably a triplet of phenylalanine, tryptophan and valine will be found at each end of a synthetic transmembrane domain. Optionally, a short oligo- or polypeptide linker, preferably between 2 and 10 amino acids in length may form the linkage between the transmembrane domain and the cytoplasmic signaling domain of the CAR. A glycine-serine doublet provides a particularly suitable linker.


Alternative CAR constructs may be characterized as belonging to successive generations. First-generation CARs typically consist of a single-chain variable fragment of an antibody specific for an antigen, for example comprising a VL linked to a VH of a specific antibody, linked by a flexible linker, for example by a CD8α hinge domain and a CD8α transmembrane domain, to the transmembrane and intracellular signaling domains of either CD3ζ or FcRγ (scFv-CD3ζ or scFv-FcRγ; see U.S. Pat. Nos. 7,741,465; 5,912,172; 5,906,936). Second-generation CARs incorporate the intracellular domains of one or more costimulatory molecules, such as CD28, OX40 (CD134), or 4-1BB (CD137) within the endodomain (for example scFv-CD28/OX40/4-1BB-CD3; see U.S. Pat. Nos. 8,911,993; 8,916,381; 8,975,071; 9,101,584; 9,102,760; 9,102,761). Third-generation CARs include a combination of costimulatory endodomains, such a CD3-chain, CD97, GDI la-CD18, CD2, ICOS, CD27, CD154, CDS, OX40, 4-1BB, CD2, CD7, LIGHT, LFA-1, NKG2C, B7-H3, CD30, CD40, PD-1, or CD28 signaling domains (for example scFv-CD28-4-1BB-CD3ζ or scFv-CD28-OX40-CD3ζ; see U.S. Pat. Nos. 8,906,682; 8,399,645; 5,686,281; PCT Publication No. WO2014134165; PCT Publication No. WO2012079000). In certain embodiments, the primary signaling domain comprises a functional signaling domain of a protein selected from the group consisting of CD3 zeta, CD3 gamma, CD3 delta, CD3 epsilon, common FcR gamma (FCERIG), FcR beta (Fc Epsilon Rib), CD79a, CD79b, Fc gamma RIIa, DAP10, and DAP12. In certain preferred embodiments, the primary signaling domain comprises a functional signaling domain of CD3ζ or FcRγ. In certain embodiments, the one or more costimulatory signaling domains comprise a functional signaling domain of a protein selected, each independently, from the group consisting of: CD27, CD28, 4-1BB (CD137), OX40, CD30, CD40, PD-1, ICOS, lymphocyte function-associated antigen-1 (LFA-1), CD2, CD7, LIGHT, NKG2C, B7-H3, a ligand that specifically binds with CD83, CDS, ICAM-1, GITR, BAFFR, HVEM (LIGHTR), SLAMF7, NKp80 (KLRF1), CD160, CD19, CD4, CD8 alpha, CD8 beta, IL2R beta, IL2R gamma, IL7R alpha, ITGA4, VLA1, CD49a, ITGA4, IA4, CD49D, ITGA6, VLA-6, CD49f, ITGAD, CD11d, ITGAE, CD103, ITGAL, CD11a, LFA-1, ITGAM, CD11b, ITGAX, CD11c, ITGB1, CD29, ITGB2, CD18, ITGB7, TNFR2, TRANCE/RANKL, DNAM1 (CD226), SLAMF4 (CD244, 2B4), CD84, CD96 (Tactile), CEACAM1, CRTAM, Ly9 (CD229), CD160 (BY55), PSGL1, CD100 (SEMA4D), CD69, SLAMF6 (NTB-A, Ly108), SLAM (SLAMF1, CD150, IPO-3), BLAME (SLAMF8), SELPLG (CD162), LTBR, LAT, GADS, SLP-76, PAG/Cbp, NKp44, NKp30, NKp46, and NKG2D. In certain embodiments, the one or more costimulatory signaling domains comprise a functional signaling domain of a protein selected, each independently, from the group consisting of: 4-1BB, CD27, and CD28. In certain embodiments, a chimeric antigen receptor may have the design as described in U.S. Pat. No. 7,446,190, comprising an intracellular domain of CD3ζ chain (such as amino acid residues 52-163 of the human CD3 zeta chain, as shown in SEQ ID NO: 14 of U.S. Pat. No. 7,446,190), a signaling region from CD28 and an antigen-binding element (or portion or domain; such as scFv). The CD28 portion, when between the zeta chain portion and the antigen-binding element, may suitably include the transmembrane and signaling domains of CD28 (such as amino acid residues 114-220 of SEQ ID NO: 10, full sequence shown in SEQ ID NO: 6 of U.S. Pat. No. 7,446,190; these can include the following portion of CD28 as set forth in Genbank identifier NM 006139 (sequence version 1, 2 or 3):









(SEQ ID NO: 1)


IEVMYPPPYLDNEKSNGTIIHVKGKHLCPSPLFPGPSKPFWVLVVVGGVL





ACYSLLVTVAFIIFWVRSKRSRLLHSDYMNMTPRRPGPTRKHYQPYAPPR





DFAAYRS)).







Alternatively, when the zeta sequence lies between the CD28 sequence and the antigen-binding element, intracellular domain of CD28 can be used alone (such as amino sequence set forth in SEQ ID NO: 9 of U.S. Pat. No. 7,446,190). Hence, certain embodiments employ a CAR comprising (a) a zeta chain portion comprising the intracellular domain of human CD3t chain, (b) a costimulatory signaling region, and (c) an antigen-binding element (or portion or domain), wherein the costimulatory signaling region comprises the amino acid sequence encoded by SEQ ID NO: 6 of U.S. Pat. No. 7,446,190.


Alternatively, costimulation may be orchestrated by expressing CARs in antigen-specific T cells, chosen so as to be activated and expanded following engagement of their native αβTCR, for example by antigen on professional antigen-presenting cells, with attendant costimulation. In addition, additional engineered receptors may be provided on the immunoresponsive cells, for example to improve targeting of a T-cell attack and/or minimize side effects


By means of an example and without limitation, Kochenderfer et al., (2009) J Immunother. 32 (7): 689-702 described anti-CD19 chimeric antigen receptors (CAR). FMC63-28Z CAR contained a single chain variable region moiety (scFv) recognizing CD19 derived from the FMC63 mouse hybridoma (described in Nicholson et al., (1997) Molecular Immunology 34: 1157-1165), a portion of the human CD28 molecule, and the intracellular component of the human TCR-ζ molecule. FMC63-CD828BBZ CAR contained the FMC63 scFv, the hinge and transmembrane regions of the CD8 molecule, the cytoplasmic portions of CD28 and 4-1BB, and the cytoplasmic component of the TCR-ζ molecule. The exact sequence of the CD28 molecule included in the FMC63-28Z CAR corresponded to Genbank identifier NM_006139; the sequence included all amino acids starting with the amino acid sequence IEVMYPPPY and continuing all the way to the carboxy-terminus of the protein. To encode the anti-CD19 scFv component of the vector, the authors designed a DNA sequence which was based on a portion of a previously published CAR (Cooper et al., (2003) Blood 101: 1637-1644). This sequence encoded the following components in frame from the 5′ end to the 3′ end: an XhoI site, the human granulocyte-macrophage colony-stimulating factor (GM-CSF) receptor α-chain signal sequence, the FMC63 light chain variable region (as in Nicholson et al., supra), a linker peptide (as in Cooper et al., supra), the FMC63 heavy chain variable region (as in Nicholson et al., supra), and a NotI site. A plasmid encoding this sequence was digested with XhoI and NotI. To form the MSGV-FMC63-28Z retroviral vector, the XhoI and NotI-digested fragment encoding the FMC63 scFv was ligated into a second XhoI and NotI-digested fragment that encoded the MSGV retroviral backbone (as in Hughes et al., (2005) Human Gene Therapy 16: 457-472) as well as part of the extracellular portion of human CD28, the entire transmembrane and cytoplasmic portion of human CD28, and the cytoplasmic portion of the human TCR-ζ molecule (as in Maher et al., 2002) Nature Biotechnology 20: 70-75). The FMC63-28Z CAR is included in the KTE-C19 (axicabtagene ciloleucel) anti-CD19 CAR-T therapy product in development by Kite Pharma, Inc. for the treatment of inter alia patients with relapsed/refractory aggressive B-cell non-Hodgkin lymphoma (NHL). Accordingly, in certain embodiments, cells intended for adoptive cell therapies, more particularly immunoresponsive cells such as T cells, may express the FMC63-28Z CAR as described by Kochenderfer et al. (supra). Hence, in certain embodiments, cells intended for adoptive cell therapies, more particularly immunoresponsive cells such as T cells, may comprise a CAR comprising an extracellular antigen-binding element (or portion or domain; such as scFv) that specifically binds to an antigen, an intracellular signaling domain comprising an intracellular domain of a CD3t chain, and a costimulatory signaling region comprising a signaling domain of CD28. Preferably, the CD28 amino acid sequence is as set forth in Genbank identifier NM_006139 (sequence version 1, 2 or 3) starting with the amino acid sequence IEVMYPPPY (SEQ ID NO. 2) and continuing all the way to the carboxy-terminus of the protein. The sequence is reproduced herein: IEVMYPPPYLDNEKSNGTIIHVKGKHLCPSPLFPGPSKPFWVLVVVGGVLACYSLLVTVA FIIFWVRSKRSRLLHSDYMNMTPRRPGPTRKHYQPYAPPRDFAAYRS. Preferably, the antigen is CD19, more preferably the antigen-binding element is an anti-CD19 scFv, even more preferably the anti-CD19 scFv as described by Kochenderfer et al. (supra).


Additional anti-CD19 CARs are further described in WO2015187528. More particularly Example 1 and Table 1 of WO2015187528, incorporated by reference herein, demonstrate the generation of anti-CD19 CARs based on a fully human anti-CD19 monoclonal antibody (47G4, as described in US20100104509) and murine anti-CD19 monoclonal antibody (as described in Nicholson et al. and explained above). Various combinations of a signal sequence (human CD8-alpha or GM-CSF receptor), extracellular and transmembrane regions (human CD8-alpha) and intracellular T-cell signalling domains (CD28-CD3ζ; 4-1BB-CD3ζ; CD27-CD3ζ; CD28-CD27-CD3ζ, 4-1BB-CD27-CD3ζ; CD27-4-1BB-CD3ζ; CD28-CD27-FcεRI gamma chain; or CD28-FcεRI gamma chain) were disclosed. Hence, in certain embodiments, cells intended for adoptive cell therapies, more particularly immunoresponsive cells such as T cells, may comprise a CAR comprising an extracellular antigen-binding element that specifically binds to an antigen, an extracellular and transmembrane region as set forth in Table 1 of WO2015187528 and an intracellular T-cell signalling domain as set forth in Table 1 of WO2015187528. Preferably, the antigen is CD19, more preferably the antigen-binding element is an anti-CD19 scFv, even more preferably the mouse or human anti-CD19 scFv as described in Example 1 of WO2015187528. In certain embodiments, the CAR comprises, consists essentially of or consists of an amino acid sequence of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, or SEQ ID NO: 13 as set forth in Table 1 of WO2015187528.


By means of an example and without limitation, chimeric antigen receptor that recognizes the CD70 antigen is described in WO2012058460A2 (see also, Park et al., CD70 as a target for chimeric antigen receptor T cells in head and neck squamous cell carcinoma, Oral Oncol. 2018 March; 78:145-150; and Jin et al., CD70, a novel target of CAR T-cell therapy for gliomas, Neuro Oncol. 2018 Jan. 10; 20(1):55-65). CD70 is expressed by diffuse large B-cell and follicular lymphoma and also by the malignant cells of Hodgkins lymphoma, Waldenstrom's macroglobulinemia and multiple myeloma, and by HTLV-1- and EBV-associated malignancies. (Agathanggelou et al. Am. J. Pathol. 1995; 147: 1152-1160; Hunter et al., Blood 2004; 104:4881. 26; Lens et al., J Immunol. 2005; 174:6212-6219; Baba et al., J Virol. 2008; 82:3843-3852.) In addition, CD70 is expressed by non-hematological malignancies such as renal cell carcinoma and glioblastoma. (Junker et al., J Urol. 2005; 173:2150-2153; Chahlavi et al., Cancer Res 2005; 65:5428-5438) Physiologically, CD70 expression is transient and restricted to a subset of highly activated T, B, and dendritic cells.


By means of an example and without limitation, chimeric antigen receptor that recognizes BCMA has been described (see, e.g., US20160046724A1; WO2016014789A2; WO2017211900A1; WO2015158671A1; US20180085444A1; WO2018028647A1; US20170283504A1; and WO2013154760A1).


In certain embodiments, the immune cell may, in addition to a CAR or exogenous TCR as described herein, further comprise a chimeric inhibitory receptor (inhibitory CAR) that specifically binds to a second target antigen and is capable of inducing an inhibitory or immunosuppressive or repressive signal to the cell upon recognition of the second target antigen. In certain embodiments, the chimeric inhibitory receptor comprises an extracellular antigen-binding element (or portion or domain) configured to specifically bind to a target antigen, a transmembrane domain, and an intracellular immunosuppressive or repressive signaling domain. In certain embodiments, the second target antigen is an antigen that is not expressed on the surface of a cancer cell or infected cell or the expression of which is downregulated on a cancer cell or an infected cell. In certain embodiments, the second target antigen is an MHC-class I molecule. In certain embodiments, the intracellular signaling domain comprises a functional signaling portion of an immune checkpoint molecule, such as for example PD-1 or CTLA4. Advantageously, the inclusion of such inhibitory CAR reduces the chance of the engineered immune cells attacking non-target (e.g., non-cancer) tissues.


Alternatively, T-cells expressing CARs may be further modified to reduce or eliminate expression of endogenous TCRs in order to reduce off-target effects. Reduction or elimination of endogenous TCRs can reduce off-target effects and increase the effectiveness of the T cells (U.S. Pat. No. 9,181,527). T cells stably lacking expression of a functional TCR may be produced using a variety of approaches. T cells internalize, sort, and degrade the entire T cell receptor as a complex, with a half-life of about 10 hours in resting T cells and 3 hours in stimulated T cells (von Essen, M. et al. 2004. J. Immunol. 173:384-393). Proper functioning of the TCR complex requires the proper stoichiometric ratio of the proteins that compose the TCR complex. TCR function also requires two functioning TCR zeta proteins with ITAM motifs. The activation of the TCR upon engagement of its WIC-peptide ligand requires the engagement of several TCRs on the same T cell, which all must signal properly. Thus, if a TCR complex is destabilized with proteins that do not associate properly or cannot signal optimally, the T cell will not become activated sufficiently to begin a cellular response.


Accordingly, in some embodiments, TCR expression may eliminated using RNA interference (e.g., shRNA, siRNA, miRNA, etc.), CRISPR, or other methods that target the nucleic acids encoding specific TCRs (e.g., TCR-α and TCR-β) and/or CD3 chains in primary T cells. By blocking expression of one or more of these proteins, the T cell will no longer produce one or more of the key components of the TCR complex, thereby destabilizing the TCR complex and preventing cell surface expression of a functional TCR.


In some instances, CAR may also comprise a switch mechanism for controlling expression and/or activation of the CAR. For example, a CAR may comprise an extracellular, transmembrane, and intracellular domain, in which the extracellular domain comprises a target-specific binding element that comprises a label, binding domain, or tag that is specific for a molecule other than the target antigen that is expressed on or by a target cell. In such embodiments, the specificity of the CAR is provided by a second construct that comprises a target antigen binding domain (e.g., an scFv or a bispecific antibody that is specific for both the target antigen and the label or tag on the CAR) and a domain that is recognized by or binds to the label, binding domain, or tag on the CAR. See, e.g., WO 2013/044225, WO 2016/000304, WO 2015/057834, WO 2015/057852, WO 2016/070061, U.S. Pat. No. 9,233,125, US 2016/0129109. In this way, a T-cell that expresses the CAR can be administered to a subject, but the CAR cannot bind its target antigen until the second composition comprising an antigen-specific binding domain is administered.


Alternative switch mechanisms include CARs that require multimerization in order to activate their signaling function (see, e.g., US 2015/0368342, US 2016/0175359, US 2015/0368360) and/or an exogenous signal, such as a small molecule drug (US 2016/0166613, Yung et al., Science, 2015), in order to elicit a T-cell response. Some CARs may also comprise a “suicide switch” to induce cell death of the CAR T-cells following treatment (Buddee et al., PLoS One, 2013) or to downregulate expression of the CAR following binding to the target antigen (WO 2016/011210).


Alternative techniques may be used to transform target immunoresponsive cells, such as protoplast fusion, lipofection, transfection or electroporation. A wide variety of vectors may be used, such as retroviral vectors, lentiviral vectors, adenoviral vectors, adeno-associated viral vectors, plasmids or transposons, such as a Sleeping Beauty transposon (see U.S. Pat. Nos. 6,489,458; 7,148,203; 7,160,682; 7,985,739; 8,227,432), may be used to introduce CARs, for example using 2nd generation antigen-specific CARs signaling through CD3t and either CD28 or CD137. Viral vectors may for example include vectors based on HIV, SV40, EBV, HSV or BPV.


Cells that are targeted for transformation may for example include T cells, Natural Killer (NK) cells, cytotoxic T lymphocytes (CTL), regulatory T cells, human embryonic stem cells, tumor-infiltrating lymphocytes (TIL) or a pluripotent stem cell from which lymphoid cells may be differentiated. T cells expressing a desired CAR may for example be selected through co-culture with γ-irradiated activating and propagating cells (AaPC), which co-express the cancer antigen and co-stimulatory molecules. The engineered CAR T-cells may be expanded, for example by co-culture on AaPC in presence of soluble factors, such as IL-2 and IL-21. This expansion may for example be carried out so as to provide memory CAR+ T cells (which may for example be assayed by non-enzymatic digital array and/or multi-panel flow cytometry). In this way, CAR T cells may be provided that have specific cytotoxic activity against antigen-bearing tumors (optionally in conjunction with production of desired chemokines such as interferon-γ). CART cells of this kind may for example be used in animal models, for example to treat tumor xenografts.


In certain embodiments, ACT includes co-transferring CD4+Th1 cells and CD8+ CTLs to induce a synergistic antitumour response (see, e.g., Li et al., Adoptive cell therapy with CD4+T helper 1 cells and CD8+ cytotoxic T cells enhances complete rejection of an established tumour, leading to generation of endogenous memory responses to non-targeted tumour epitopes. Clin Transl Immunology. 2017 October; 6(10): e160).


In certain embodiments, Th17 cells are transferred to a subject in need thereof. Th17 cells have been reported to directly eradicate melanoma tumors in mice to a greater extent than Th1 cells (Muranski P, et al., Tumor-specific Th17-polarized cells eradicate large established melanoma. Blood. 2008 Jul. 15; 112(2):362-73; and Martin-Orozco N, et al., T helper 17 cells promote cytotoxic T cell activation in tumor immunity. Immunity. 2009 Nov. 20; 31(5):787-98). Those studies involved an adoptive T cell transfer (ACT) therapy approach, which takes advantage of CD4+ T cells that express a TCR recognizing tyrosinase tumor antigen. Exploitation of the TCR leads to rapid expansion of Th17 populations to large numbers ex vivo for reinfusion into the autologous tumor-bearing hosts.


In certain embodiments, ACT may include autologous iPSC-based vaccines, such as irradiated iPSCs in autologous anti-tumor vaccines (see e.g., Kooreman, Nigel G. et al., Autologous iPSC-Based Vaccines Elicit Anti-tumor Responses In Vivo, Cell Stem Cell 22, 1-13, 2018, doi.org/10.1016/j.stem.2018.01.016).


Unlike T-cell receptors (TCRs) that are MHC restricted, CARs can potentially bind any cell surface-expressed antigen and can thus be more universally used to treat patients (see Irving et al., Engineering Chimeric Antigen Receptor T-Cells for Racing in Solid Tumors: Don't Forget the Fuel, Front. Immunol., 3 Apr. 2017, doi.org/10.3389/fimmu.2017.00267). In certain embodiments, in the absence of endogenous T-cell infiltrate (e.g., due to aberrant antigen processing and presentation), which precludes the use of TIL therapy and immune checkpoint blockade, the transfer of CAR T-cells may be used to treat patients (see, e.g., Hinrichs C S, Rosenberg S A. Exploiting the curative potential of adoptive T-cell therapy for cancer. Immunol Rev (2014) 257(1):56-71. doi:10.1111/imr.12132).


Approaches such as the foregoing may be adapted to provide methods of treating and/or increasing survival of a subject having a disease, such as a neoplasia, for example by administering an effective amount of an immunoresponsive cell comprising an antigen recognizing receptor that binds a selected antigen, wherein the binding activates the immunoresponsive cell, thereby treating or preventing the disease (such as a neoplasia, a pathogen infection, an autoimmune disorder, or an allogeneic transplant reaction).


In certain embodiments, the treatment can be administered after lymphodepleting pretreatment in the form of chemotherapy (typically a combination of cyclophosphamide and fludarabine) or radiation therapy. Initial studies in ACT had short lived responses and the transferred cells did not persist in vivo for very long (Houot et al., T-cell-based immunotherapy: adoptive cell transfer and checkpoint inhibition. Cancer Immunol Res (2015) 3(10):1115-22; and Kamta et al., Advancing Cancer Therapy with Present and Emerging Immuno-Oncology Approaches. Front. Oncol. (2017) 7:64). Immune suppressor cells like Tregs and MDSCs may attenuate the activity of transferred cells by outcompeting them for the necessary cytokines. Not being bound by a theory lymphodepleting pretreatment may eliminate the suppressor cells allowing the TILs to persist.


In one embodiment, the treatment can be administrated into patients undergoing an immunosuppressive treatment (e.g., glucocorticoid treatment). The cells or population of cells, may be made resistant to at least one immunosuppressive agent due to the inactivation of a gene encoding a receptor for such immunosuppressive agent. In certain embodiments, the immunosuppressive treatment provides for the selection and expansion of the immunoresponsive T cells within the patient.


In certain embodiments, the treatment can be administered before primary treatment (e.g., surgery or radiation therapy) to shrink a tumor before the primary treatment. In another embodiment, the treatment can be administered after primary treatment to remove any remaining cancer cells.


In certain embodiments, immunometabolic barriers can be targeted therapeutically prior to and/or during ACT to enhance responses to ACT or CAR T-cell therapy and to support endogenous immunity (see, e.g., Irving et al., Engineering Chimeric Antigen Receptor T-Cells for Racing in Solid Tumors: Don't Forget the Fuel, Front. Immunol., 3 Apr. 2017, doi.org/10.3389/fimmu.2017.00267).


The administration of cells or population of cells, such as immune system cells or cell populations, such as more particularly immunoresponsive cells or cell populations, as disclosed herein may be carried out in any convenient manner, including by aerosol inhalation, injection, ingestion, transfusion, implantation or transplantation. The cells or population of cells may be administered to a patient subcutaneously, intradermally, intratumorally, intranodally, intramedullary, intramuscularly, intrathecally, by intravenous or intralymphatic injection, or intraperitoneally. In some embodiments, the disclosed CARs may be delivered or administered into a cavity formed by the resection of tumor tissue (i.e. intracavity delivery) or directly into a tumor prior to resection (i.e. intratumoral delivery). In one embodiment, the cell compositions of the present invention are preferably administered by intravenous injection.


The administration of the cells or population of cells can consist of the administration of 104-109 cells per kg body weight, preferably 105 to 106 cells/kg body weight including all integer values of cell numbers within those ranges. Dosing in CAR T cell therapies may for example involve administration of from 106 to 109 cells/kg, with or without a course of lymphodepletion, for example with cyclophosphamide. The cells or population of cells can be administrated in one or more doses. In another embodiment, the effective amount of cells are administrated as a single dose. In another embodiment, the effective amount of cells are administrated as more than one dose over a period time. Timing of administration is within the judgment of managing physician and depends on the clinical condition of the patient. The cells or population of cells may be obtained from any source, such as a blood bank or a donor. While individual needs vary, determination of optimal ranges of effective amounts of a given cell type for a particular disease or conditions are within the skill of one in the art. An effective amount means an amount which provides a therapeutic or prophylactic benefit. The dosage administrated will be dependent upon the age, health and weight of the recipient, kind of concurrent treatment, if any, frequency of treatment and the nature of the effect desired.


In another embodiment, the effective amount of cells or composition comprising those cells are administrated parenterally. The administration can be an intravenous administration. The administration can be directly done by injection within a tumor.


To guard against possible adverse reactions, engineered immunoresponsive cells may be equipped with a transgenic safety switch, in the form of a transgene that renders the cells vulnerable to exposure to a specific signal. For example, the herpes simplex viral thymidine kinase (TK) gene may be used in this way, for example by introduction into allogeneic T lymphocytes used as donor lymphocyte infusions following stem cell transplantation (Greco, et al., Improving the safety of cell therapy with the TK-suicide gene. Front. Pharmacol. 2015; 6: 95). In such cells, administration of a nucleoside prodrug such as ganciclovir or acyclovir causes cell death. Alternative safety switch constructs include inducible caspase 9, for example triggered by administration of a small-molecule dimerizer that brings together two nonfunctional icasp9 molecules to form the active enzyme. A wide variety of alternative approaches to implementing cellular proliferation controls have been described (see U.S. Patent Publication No. 20130071414; PCT Patent Publication WO2011146862; PCT Patent Publication WO2014011987; PCT Patent Publication WO2013040371; Zhou et al. BLOOD, 2014, 123/25:3895-3905; Di Stasi et al., The New England Journal of Medicine 2011; 365:1673-1683; Sadelain M, The New England Journal of Medicine 2011; 365:1735-173; Ramos et al., Stem Cells 28(6):1107-15 (2010)).


In a further refinement of adoptive therapies, genome editing may be used to tailor immunoresponsive cells to alternative implementations, for example providing edited CAR T cells (see Poirot et al., 2015, Multiplex genome edited T-cell manufacturing platform for “off-the-shelf” adoptive T-cell immunotherapies, Cancer Res 75 (18): 3853; Ren et al., 2017, Multiplex genome editing to generate universal CAR T cells resistant to PD1 inhibition, Clin Cancer Res. 2017 May 1; 23(9):2255-2266. doi: 10.1158/1078-0432.CCR-16-1300. Epub 2016 Nov. 4; Qasim et al., 2017, Molecular remission of infant B-ALL after infusion of universal TALEN gene-edited CART cells, Sci Transl Med. 2017 Jan. 25; 9(374); Legut, et al., 2018, CRISPR-mediated TCR replacement generates superior anticancer transgenic T cells. Blood, 131(3), 311-322; and Georgiadis et al., Long Terminal Repeat CRISPR-CAR-Coupled “Universal” T Cells Mediate Potent Anti-leukemic Effects, Molecular Therapy, In Press, Corrected Proof, Available online 6 Mar. 2018). Cells may be edited using any CRISPR system and method of use thereof as described herein. CRISPR systems may be delivered to an immune cell by any method described herein. In preferred embodiments, cells are edited ex vivo and transferred to a subject in need thereof. Immunoresponsive cells, CAR T cells or any cells used for adoptive cell transfer may be edited. Editing may be performed for example to insert or knock-in an exogenous gene, such as an exogenous gene encoding a CAR or a TCR, at a preselected locus in a cell (e.g. TRAC locus); to eliminate potential alloreactive T-cell receptors (TCR) or to prevent inappropriate pairing between endogenous and exogenous TCR chains, such as to knock-out or knock-down expression of an endogenous TCR in a cell; to disrupt the target of a chemotherapeutic agent in a cell; to block an immune checkpoint, such as to knock-out or knock-down expression of an immune checkpoint protein or receptor in a cell; to knock-out or knock-down expression of other gene or genes in a cell, the reduced expression or lack of expression of which can enhance the efficacy of adoptive therapies using the cell; to knock-out or knock-down expression of an endogenous gene in a cell, said endogenous gene encoding an antigen targeted by an exogenous CAR or TCR; to knock-out or knock-down expression of one or more MHC constituent proteins in a cell; to activate a T cell; to modulate cells such that the cells are resistant to exhaustion or dysfunction; and/or increase the differentiation and/or proliferation of functionally exhausted or dysfunctional CD8+ T-cells (see PCT Patent Publications: WO2013176915, WO2014059173, WO2014172606, WO2014184744, and WO2014191128).


In certain embodiments, editing may result in inactivation of a gene. By inactivating a gene, it is intended that the gene of interest is not expressed in a functional protein form. In a particular embodiment, the CRISPR system specifically catalyzes cleavage in one targeted gene thereby inactivating said targeted gene. The nucleic acid strand breaks caused are commonly repaired through the distinct mechanisms of homologous recombination or non-homologous end joining (NHEJ). However, NHEJ is an imperfect repair process that often results in changes to the DNA sequence at the site of the cleavage. Repair via non-homologous end joining (NHEJ) often results in small insertions or deletions (Indel) and can be used for the creation of specific gene knockouts. Cells in which a cleavage induced mutagenesis event has occurred can be identified and/or selected by well-known methods in the art. In certain embodiments, homology directed repair (HDR) is used to concurrently inactivate a gene (e.g., TRAC) and insert an endogenous TCR or CAR into the inactivated locus.


Hence, in certain embodiments, editing of cells (such as by CRISPR/Cas), particularly cells intended for adoptive cell therapies, more particularly immunoresponsive cells such as T cells, may be performed to insert or knock-in an exogenous gene, such as an exogenous gene encoding a CAR or a TCR, at a preselected locus in a cell. Conventionally, nucleic acid molecules encoding CARs or TCRs are transfected or transduced to cells using randomly integrating vectors, which, depending on the site of integration, may lead to clonal expansion, oncogenic transformation, variegated transgene expression and/or transcriptional silencing of the transgene. Directing of transgene(s) to a specific locus in a cell can minimize or avoid such risks and advantageously provide for uniform expression of the transgene(s) by the cells. Without limitation, suitable ‘safe harbor’ loci for directed transgene integration include CCR5 or AAVS1. Homology-directed repair (HDR) strategies are known and described elsewhere in this specification allowing to insert transgenes into desired loci (e.g., TRAC locus).


Further suitable loci for insertion of transgenes, in particular CAR or exogenous TCR transgenes, include without limitation loci comprising genes coding for constituents of endogenous T-cell receptor, such as T-cell receptor alpha locus (TRA) or T-cell receptor beta locus (TRB), for example T-cell receptor alpha constant (TRAC) locus, T-cell receptor beta constant 1 (TRBC1) locus or T-cell receptor beta constant 2 (TRBC1) locus. Advantageously, insertion of a transgene into such locus can simultaneously achieve expression of the transgene, potentially controlled by the endogenous promoter, and knock-out expression of the endogenous TCR. This approach has been exemplified in Eyquem et al., (2017) Nature 543: 113-117, wherein the authors used CRISPR/Cas9 gene editing to knock-in a DNA molecule encoding a CD19-specific CAR into the TRAC locus downstream of the endogenous promoter; the CAR-T cells obtained by CRISPR were significantly superior in terms of reduced tonic CAR signaling and exhaustion.


T cell receptors (TCR) are cell surface receptors that participate in the activation of T cells in response to the presentation of antigen. The TCR is generally made from two chains, α and β, which assemble to form a heterodimer and associates with the CD3-transducing subunits to form the T cell receptor complex present on the cell surface. Each α and β chain of the TCR consists of an immunoglobulin-like N-terminal variable (V) and constant (C) region, a hydrophobic transmembrane domain, and a short cytoplasmic region. As for immunoglobulin molecules, the variable region of the α and β chains are generated by V(D)J recombination, creating a large diversity of antigen specificities within the population of T cells. However, in contrast to immunoglobulins that recognize intact antigen, T cells are activated by processed peptide fragments in association with an MHC molecule, introducing an extra dimension to antigen recognition by T cells, known as MHC restriction. Recognition of MHC disparities between the donor and recipient through the T cell receptor leads to T cell proliferation and the potential development of graft versus host disease (GVHD). The inactivation of TCRα or TCRβ can result in the elimination of the TCR from the surface of T cells preventing recognition of alloantigen and thus GVHD. However, TCR disruption generally results in the elimination of the CD3 signaling component and alters the means of further T cell expansion.


Hence, in certain embodiments, editing of cells (such as by CRISPR/Cas), particularly cells intended for adoptive cell therapies, more particularly immunoresponsive cells such as T cells, may be performed to knock-out or knock-down expression of an endogenous TCR in a cell. For example, NHEJ-based or HDR-based gene editing approaches can be employed to disrupt the endogenous TCR alpha and/or beta chain genes. For example, gene editing system or systems, such as CRISPR/Cas system or systems, can be designed to target a sequence found within the TCR beta chain conserved between the beta 1 and beta 2 constant region genes (TRBC1 and TRBC2) and/or to target the constant region of the TCR alpha chain (TRAC) gene.


Allogeneic cells are rapidly rejected by the host immune system. It has been demonstrated that, allogeneic leukocytes present in non-irradiated blood products will persist for no more than 5 to 6 days (Boni, Muranski et al. 2008 Blood 1; 112(12):4746-54). Thus, to prevent rejection of allogeneic cells, the host's immune system usually has to be suppressed to some extent. However, in the case of adoptive cell transfer the use of immunosuppressive drugs also have a detrimental effect on the introduced therapeutic T cells. Therefore, to effectively use an adoptive immunotherapy approach in these conditions, the introduced cells would need to be resistant to the immunosuppressive treatment. Thus, in a particular embodiment, the present invention further comprises a step of modifying T cells to make them resistant to an immunosuppressive agent, preferably by inactivating at least one gene encoding a target for an immunosuppressive agent. An immunosuppressive agent is an agent that suppresses immune function by one of several mechanisms of action. An immunosuppressive agent can be, but is not limited to a calcineurin inhibitor, a target of rapamycin, an interleukin-2 receptor α-chain blocker, an inhibitor of inosine monophosphate dehydrogenase, an inhibitor of dihydrofolic acid reductase, a corticosteroid or an immunosuppressive antimetabolite. The present invention allows conferring immunosuppressive resistance to T cells for immunotherapy by inactivating the target of the immunosuppressive agent in T cells. As non-limiting examples, targets for an immunosuppressive agent can be a receptor for an immunosuppressive agent such as: CD52, glucocorticoid receptor (GR), a FKBP family gene member and a cyclophilin family gene member.


In certain embodiments, editing of cells (such as by CRISPR/Cas), particularly cells intended for adoptive cell therapies, more particularly immunoresponsive cells such as T cells, may be performed to block an immune checkpoint, such as to knock-out or knock-down expression of an immune checkpoint protein or receptor in a cell. Immune checkpoints are inhibitory pathways that slow down or stop immune reactions and prevent excessive tissue damage from uncontrolled activity of immune cells. In certain embodiments, the immune checkpoint targeted is the programmed death-1 (PD-1 or CD279) gene (PDCD1). In other embodiments, the immune checkpoint targeted is cytotoxic T-lymphocyte-associated antigen (CTLA-4). In additional embodiments, the immune checkpoint targeted is another member of the CD28 and CTLA4 Ig superfamily such as BTLA, LAG3, ICOS, PDL1 or KIR. In further additional embodiments, the immune checkpoint targeted is a member of the TNFR superfamily such as CD40, OX40, CD137, GITR, CD27 or TIM-3.


Additional immune checkpoints include Src homology 2 domain-containing protein tyrosine phosphatase 1 (SHP-1) (Watson H A, et al., SHP-1: the next checkpoint target for cancer immunotherapy! Biochem Soc Trans. 2016 Apr. 15; 44(2):356-62). SHP-1 is a widely expressed inhibitory protein tyrosine phosphatase (PTP). In T-cells, it is a negative regulator of antigen-dependent activation and proliferation. It is a cytosolic protein, and therefore not amenable to antibody-mediated therapies, but its role in activation and proliferation makes it an attractive target for genetic manipulation in adoptive transfer strategies, such as chimeric antigen receptor (CAR) T cells. Immune checkpoints may also include T cell immunoreceptor with Ig and ITIM domains (TIGIT/Vstm3/WUCAM/VSIG9) and VISTA (Le Mercier I, et al., (2015) Beyond CTLA-4 and PD-1, the generation Z of negative checkpoint regulators. Front. Immunol. 6:418).


WO2014172606 relates to the use of MT1 and/or MT2 inhibitors to increase proliferation and/or activity of exhausted CD8+ T-cells and to decrease CD8+ T-cell exhaustion (e.g., decrease functionally exhausted or unresponsive CD8+ immune cells). In certain embodiments, metallothioneins are targeted by gene editing in adoptively transferred T cells.


In certain embodiments, targets of gene editing may be at least one targeted locus involved in the expression of an immune checkpoint protein. Such targets may include, but are not limited to CTLA4, PPP2CA, PPP2CB, PTPN6, PTPN22, PDCD1, ICOS (CD278), PDL1, KIR, LAG3, HAVCR2, BTLA, CD160, TIGIT, CD96, CRTAM, LAIR1, SIGLEC7, SIGLEC9, CD244 (2B4), TNFRSF10B, TNFRSF10A, CASP8, CASP10, CASP3, CASP6, CASP7, FADD, FAS, TGFBRII, TGFRBRI, SMAD2, SMAD3, SMAD4, SMAD10, SKI, SKIL, TGIF1, IL10RA, IL10RB, HMOX2, IL6R, IL6ST, EIF2AK4, CSK, PAG1, SIT1, FOXP3, PRDM1, BATF, VISTA, GUCY1A2, GUCY1A3, GUCY1B2, GUCY1B3, MT1, MT2, CD40, OX40, CD137, GITR, CD27, SHP-1, TIM-3, CEACAM-1, CEACAM-3, or CEACAM-5. In preferred embodiments, the gene locus involved in the expression of PD-1 or CTLA-4 genes is targeted. In other preferred embodiments, combinations of genes are targeted, such as but not limited to PD-1 and TIGIT.


By means of an example and without limitation, WO2016196388 concerns an engineered T cell comprising (a) a genetically engineered antigen receptor that specifically binds to an antigen, which receptor may be a CAR; and (b) a disrupted gene encoding a PD-L1, an agent for disruption of a gene encoding a PD-L1, and/or disruption of a gene encoding PD-L1, wherein the disruption of the gene may be mediated by a gene editing nuclease, a zinc finger nuclease (ZFN), CRISPR/Cas9 and/or TALEN. WO2015142675 relates to immune effector cells comprising a CAR in combination with an agent (such as CRISPR, TALEN or ZFN) that increases the efficacy of the immune effector cells in the treatment of cancer, wherein the agent may inhibit an immune inhibitory molecule, such as PD1, PD-L1, CTLA-4, TIM-3, LAG-3, VISTA, BTLA, TIGIT, LAIR1, CD160, 2B4, TGFR beta, CEACAM-1, CEACAM-3, or CEACAM-5. Ren et al., (2017) Clin Cancer Res 23 (9) 2255-2266 performed lentiviral delivery of CAR and electro-transfer of Cas9 mRNA and gRNAs targeting endogenous TCR, β-2 microglobulin (B2M) and PD1 simultaneously, to generate gene-disrupted allogeneic CAR T cells deficient of TCR, HLA class I molecule and PD1.


In certain embodiments, cells may be engineered to express a CAR, wherein expression and/or function of methylcytosine dioxygenase genes (TET1, TET2 and/or TET3) in the cells has been reduced or eliminated, such as by CRISPR, ZNF or TALEN (for example, as described in WO201704916).


In certain embodiments, editing of cells (such as by CRISPR/Cas), particularly cells intended for adoptive cell therapies, more particularly immunoresponsive cells such as T cells, may be performed to knock-out or knock-down expression of an endogenous gene in a cell, said endogenous gene encoding an antigen targeted by an exogenous CAR or TCR, thereby reducing the likelihood of targeting of the engineered cells. In certain embodiments, the targeted antigen may be one or more antigen selected from the group consisting of CD38, CD138, CS-1, CD33, CD26, CD30, CD53, CD92, CD100, CD148, CD150, CD200, CD261, CD262, CD362, human telomerase reverse transcriptase (hTERT), survivin, mouse double minute 2 homolog (MDM2), cytochrome P450 1B1 (CYP1B), HER2/neu, Wilms' tumor gene 1 (WT1), livin, alphafetoprotein (AFP), carcinoembryonic antigen (CEA), mucin 16 (MUC16), MUC1, prostate-specific membrane antigen (PSMA), p53, cyclin (D1), B cell maturation antigen (BCMA), transmembrane activator and CAML Interactor (TACI), and B-cell activating factor receptor (BAFF-R) (for example, as described in WO2016011210 and WO2017011804).


In certain embodiments, editing of cells (such as by CRISPR/Cas), particularly cells intended for adoptive cell therapies, more particularly immunoresponsive cells such as T cells, may be performed to knock-out or knock-down expression of one or more MHC constituent proteins, such as one or more HLA proteins and/or beta-2 microglobulin (B2M), in a cell, whereby rejection of non-autologous (e.g., allogeneic) cells by the recipient's immune system can be reduced or avoided. In preferred embodiments, one or more HLA class I proteins, such as HLA-A, B and/or C, and/or B2M may be knocked-out or knocked-down. Preferably, B2M may be knocked-out or knocked-down. By means of an example, Ren et al., (2017) Clin Cancer Res 23 (9) 2255-2266 performed lentiviral delivery of CAR and electro-transfer of Cas9 mRNA and gRNAs targeting endogenous TCR, β-2 microglobulin (B2M) and PD1 simultaneously, to generate gene-disrupted allogeneic CAR T cells deficient of TCR, HLA class I molecule and PD1.


In other embodiments, at least two genes are edited. Pairs of genes may include, but are not limited to PD1 and TCRα, PD1 and TCRβ, CTLA-4 and TCRα, CTLA-4 and TCRβ, LAG3 and TCRα, LAG3 and TCRβ, Tim3 and TCRα, Tim3 and TCRβ, BTLA and TCRα, BTLA and TCRβ, BY55 and TCRα, BY55 and TCRβ, TIGIT and TCRα, TIGIT and TCRβ, B7H5 and TCRα, B7H5 and TCRβ, LAIR1 and TCRα, LAIR1 and TCRβ, SIGLEC10 and TCRα, SIGLEC10 and TCRβ, 2B4 and TCRα, 2B4 and TCRβ, B2M and TCRα, B2M and TCRβ.


In certain embodiments, a cell may be multiply edited (multiplex genome editing) as taught herein to (1) knock-out or knock-down expression of an endogenous TCR (for example, TRBC1, TRBC2 and/or TRAC), (2) knock-out or knock-down expression of an immune checkpoint protein or receptor (for example PD1, PD-L1 and/or CTLA4); and (3) knock-out or knock-down expression of one or more MHC constituent proteins (for example, HLA-A, B and/or C, and/or B2M, preferably B2M).


Whether prior to or after genetic modification of the T cells, the T cells can be activated and expanded generally using methods as described, for example, in U.S. Pat. Nos. 6,352,694; 6,534,055; 6,905,680; 5,858,358; 6,887,466; 6,905,681; 7,144,575; 7,232,566; 7,175,843; 5,883,223; 6,905,874; 6,797,514; 6,867,041; and 7,572,631. T cells can be expanded in vitro or in vivo.


Immune cells may be obtained using any method known in the art. In one embodiment, allogenic T cells may be obtained from healthy subjects. In one embodiment T cells that have infiltrated a tumor are isolated. T cells may be removed during surgery. T cells may be isolated after removal of tumor tissue by biopsy. T cells may be isolated by any means known in the art. In one embodiment, T cells are obtained by apheresis. In one embodiment, the method may comprise obtaining a bulk population of T cells from a tumor sample by any suitable method known in the art. For example, a bulk population of T cells can be obtained from a tumor sample by dissociating the tumor sample into a cell suspension from which specific cell populations can be selected. Suitable methods of obtaining a bulk population of T cells may include, but are not limited to, any one or more of mechanically dissociating (e.g., mincing) the tumor, enzymatically dissociating (e.g., digesting) the tumor, and aspiration (e.g., as with a needle).


The bulk population of T cells obtained from a tumor sample may comprise any suitable type of T cell. Preferably, the bulk population of T cells obtained from a tumor sample comprises tumor infiltrating lymphocytes (TILs).


The tumor sample may be obtained from any mammal. Unless stated otherwise, as used herein, the term “mammal” refers to any mammal including, but not limited to, mammals of the order Logomorpha, such as rabbits; the order Carnivora, including Felines (cats) and Canines (dogs); the order Artiodactyla, including Bovines (cows) and Swines (pigs); or of the order Perssodactyla, including Equines (horses). The mammals may be non-human primates, e.g., of the order Primates, Ceboids, or Simoids (monkeys) or of the order Anthropoids (humans and apes). In some embodiments, the mammal may be a mammal of the order Rodentia, such as mice and hamsters. Preferably, the mammal is a non-human primate or a human. An especially preferred mammal is the human.


T cells can be obtained from a number of sources, including peripheral blood mononuclear cells, bone marrow, lymph node tissue, spleen tissue, and tumors. In certain embodiments of the present invention, T cells can be obtained from a unit of blood collected from a subject using any number of techniques known to the skilled artisan, such as Ficoll separation. In one preferred embodiment, cells from the circulating blood of an individual are obtained by apheresis or leukapheresis. The apheresis product typically contains lymphocytes, including T cells, monocytes, granulocytes, B cells, other nucleated white blood cells, red blood cells, and platelets. In one embodiment, the cells collected by apheresis may be washed to remove the plasma fraction and to place the cells in an appropriate buffer or media for subsequent processing steps. In one embodiment of the invention, the cells are washed with phosphate buffered saline (PBS). In an alternative embodiment, the wash solution lacks calcium and may lack magnesium or may lack many if not all divalent cations. Initial activation steps in the absence of calcium lead to magnified activation. As those of ordinary skill in the art would readily appreciate a washing step may be accomplished by methods known to those in the art, such as by using a semi-automated “flow-through” centrifuge (for example, the Cobe 2991 cell processor) according to the manufacturer's instructions. After washing, the cells may be resuspended in a variety of biocompatible buffers, such as, for example, Ca-free, Mg-free PBS. Alternatively, the undesirable components of the apheresis sample may be removed and the cells directly resuspended in culture media.


In another embodiment, T cells are isolated from peripheral blood lymphocytes by lysing the red blood cells and depleting the monocytes, for example, by centrifugation through a PERCOLL™ gradient. A specific subpopulation of T cells, such as CD28+, CD4+, CDC, CD45RA+, and CD45RO+ T cells, can be further isolated by positive or negative selection techniques. For example, in one preferred embodiment, T cells are isolated by incubation with anti-CD3/anti-CD28 (i.e., 3×28)-conjugated beads, such as DYNABEADS® M-450 CD3/CD28 T, or XCYTE DYNABEADS™ for a time period sufficient for positive selection of the desired T cells. In one embodiment, the time period is about 30 minutes. In a further embodiment, the time period ranges from 30 minutes to 36 hours or longer and all integer values there between. In a further embodiment, the time period is at least 1, 2, 3, 4, 5, or 6 hours. In yet another preferred embodiment, the time period is 10 to 24 hours. In one preferred embodiment, the incubation time period is 24 hours. For isolation of T cells from patients with leukemia, use of longer incubation times, such as 24 hours, can increase cell yield. Longer incubation times may be used to isolate T cells in any situation where there are few T cells as compared to other cell types, such in isolating tumor infiltrating lymphocytes (TIL) from tumor tissue or from immunocompromised individuals. Further, use of longer incubation times can increase the efficiency of capture of CD8+ T cells.


Enrichment of a T cell population by negative selection can be accomplished with a combination of antibodies directed to surface markers unique to the negatively selected cells. A preferred method is cell sorting and/or selection via negative magnetic immunoadherence or flow cytometry that uses a cocktail of monoclonal antibodies directed to cell surface markers present on the cells negatively selected. For example, to enrich for CD4+ cells by negative selection, a monoclonal antibody cocktail typically includes antibodies to CD14, CD20, CD11b, CD16, HLA-DR, and CD8.


Further, monocyte populations (i.e., CD14+ cells) may be depleted from blood preparations by a variety of methodologies, including anti-CD14 coated beads or columns, or utilization of the phagocytotic activity of these cells to facilitate removal. Accordingly, in one embodiment, the invention uses paramagnetic particles of a size sufficient to be engulfed by phagocytotic monocytes. In certain embodiments, the paramagnetic particles are commercially available beads, for example, those produced by Life Technologies under the trade name Dynabeads™. In one embodiment, other non-specific cells are removed by coating the paramagnetic particles with “irrelevant” proteins (e.g., serum proteins or antibodies). Irrelevant proteins and antibodies include those proteins and antibodies or fragments thereof that do not specifically target the T cells to be isolated. In certain embodiments, the irrelevant beads include beads coated with sheep anti-mouse antibodies, goat anti-mouse antibodies, and human serum albumin.


In brief, such depletion of monocytes is performed by preincubating T cells isolated from whole blood, apheresed peripheral blood, or tumors with one or more varieties of irrelevant or non-antibody coupled paramagnetic particles at any amount that allows for removal of monocytes (approximately a 20:1 bead:cell ratio) for about 30 minutes to 2 hours at 22 to 37 degrees C., followed by magnetic removal of cells which have attached to or engulfed the paramagnetic particles. Such separation can be performed using standard methods available in the art. For example, any magnetic separation methodology may be used including a variety of which are commercially available, (e.g., DYNAL® Magnetic Particle Concentrator (DYNAL MPC®)). Assurance of requisite depletion can be monitored by a variety of methodologies known to those of ordinary skill in the art, including flow cytometric analysis of CD14 positive cells, before and after depletion.


For isolation of a desired population of cells by positive or negative selection, the concentration of cells and surface (e.g., particles such as beads) can be varied. In certain embodiments, it may be desirable to significantly decrease the volume in which beads and cells are mixed together (i.e., increase the concentration of cells), to ensure maximum contact of cells and beads. For example, in one embodiment, a concentration of 2 billion cells/ml is used. In one embodiment, a concentration of 1 billion cells/ml is used. In a further embodiment, greater than 100 million cells/ml is used. In a further embodiment, a concentration of cells of 10, 15, 20, 25, 30, 35, 40, 45, or 50 million cells/ml is used. In yet another embodiment, a concentration of cells from 75, 80, 85, 90, 95, or 100 million cells/ml is used. In further embodiments, concentrations of 125 or 150 million cells/ml can be used. Using high concentrations can result in increased cell yield, cell activation, and cell expansion. Further, use of high cell concentrations allows more efficient capture of cells that may weakly express target antigens of interest, such as CD28-negative T cells, or from samples where there are many tumor cells present (i.e., leukemic blood, tumor tissue, etc). Such populations of cells may have therapeutic value and would be desirable to obtain. For example, using high concentration of cells allows more efficient selection of CD8+ T cells that normally have weaker CD28 expression.


In a related embodiment, it may be desirable to use lower concentrations of cells. By significantly diluting the mixture of T cells and surface (e.g., particles such as beads), interactions between the particles and cells is minimized. This selects for cells that express high amounts of desired antigens to be bound to the particles. For example, CD4+ T cells express higher levels of CD28 and are more efficiently captured than CD8+ T cells in dilute concentrations. In one embodiment, the concentration of cells used is 5×106/ml. In other embodiments, the concentration used can be from about 1×105/ml to 1×106/ml, and any integer value in between.


T cells can also be frozen. Wishing not to be bound by theory, the freeze and subsequent thaw step provides a more uniform product by removing granulocytes and to some extent monocytes in the cell population. After a washing step to remove plasma and platelets, the cells may be suspended in a freezing solution. While many freezing solutions and parameters are known in the art and will be useful in this context, one method involves using PBS containing 20% DMSO and 8% human serum albumin, or other suitable cell freezing media, the cells then are frozen to −80° C. at a rate of 1° per minute and stored in the vapor phase of a liquid nitrogen storage tank. Other methods of controlled freezing may be used as well as uncontrolled freezing immediately at −20° C. or in liquid nitrogen.


T cells for use in the present invention may also be antigen-specific T cells. For example, tumor-specific T cells can be used. In certain embodiments, antigen-specific T cells can be isolated from a patient of interest, such as a patient afflicted with a cancer or an infectious disease. In one embodiment, neoepitopes are determined for a subject and T cells specific to these antigens are isolated. Antigen-specific cells for use in expansion may also be generated in vitro using any number of methods known in the art, for example, as described in U.S. Patent Publication No. US 20040224402 entitled, Generation and Isolation of Antigen-Specific T Cells, or in U.S. Pat. No. 6,040,177. Antigen-specific cells for use in the present invention may also be generated using any number of methods known in the art, for example, as described in Current Protocols in Immunology, or Current Protocols in Cell Biology, both published by John Wiley & Sons, Inc., Boston, Mass.


In a related embodiment, it may be desirable to sort or otherwise positively select (e.g. via magnetic selection) the antigen specific cells prior to or following one or two rounds of expansion. Sorting or positively selecting antigen-specific cells can be carried out using peptide-MHC tetramers (Altman, et al., Science. 1996 Oct. 4; 274(5284):94-6). In another embodiment, the adaptable tetramer technology approach is used (Andersen et al., 2012 Nat Protoc. 7:891-902). Tetramers are limited by the need to utilize predicted binding peptides based on prior hypotheses, and the restriction to specific HLAs. Peptide-WIC tetramers can be generated using techniques known in the art and can be made with any MHC molecule of interest and any antigen of interest as described herein. Specific epitopes to be used in this context can be identified using numerous assays known in the art. For example, the ability of a polypeptide to bind to MHC class I may be evaluated indirectly by monitoring the ability to promote incorporation of 125I labeled β2-microglobulin (β2m) into MHC class I/β2m/peptide heterotrimeric complexes (see Parker et al., J. Immunol. 152:163, 1994).


In one embodiment cells are directly labeled with an epitope-specific reagent for isolation by flow cytometry followed by characterization of phenotype and TCRs. In one embodiment, T cells are isolated by contacting with T cell specific antibodies. Sorting of antigen-specific T cells, or generally any cells of the present invention, can be carried out using any of a variety of commercially available cell sorters, including, but not limited to, MoFlo sorter (DakoCytomation, Fort Collins, Colo.), FACSAria™, FACSArray™, FACSVantage™ BD™ LSR II, and FACSCalibur™ (BD Biosciences, San Jose, Calif.).


In a preferred embodiment, the method comprises selecting cells that also express CD3. The method may comprise specifically selecting the cells in any suitable manner. Preferably, the selecting is carried out using flow cytometry. The flow cytometry may be carried out using any suitable method known in the art. The flow cytometry may employ any suitable antibodies and stains. Preferably, the antibody is chosen such that it specifically recognizes and binds to the particular biomarker being selected. For example, the specific selection of CD3, CD8, TIM-3, LAG-3, 4-1BB, or PD-1 may be carried out using anti-CD3, anti-CD8, anti-TIM-3, anti-LAG-3, anti-4-1BB, or anti-PD-1 antibodies, respectively. The antibody or antibodies may be conjugated to a bead (e.g., a magnetic bead) or to a fluorochrome. Preferably, the flow cytometry is fluorescence-activated cell sorting (FACS). TCRs expressed on T cells can be selected based on reactivity to autologous tumors. Additionally, T cells that are reactive to tumors can be selected for based on markers using the methods described in patent publication Nos. WO2014133567 and WO2014133568, herein incorporated by reference in their entirety. Additionally, activated T cells can be selected for based on surface expression of CD107a.


In one embodiment of the invention, the method further comprises expanding the numbers of T cells in the enriched cell population. Such methods are described in U.S. Pat. No. 8,637,307 and is herein incorporated by reference in its entirety. The numbers of T cells may be increased at least about 3-fold (or 4-, 5-, 6-, 7-, 8-, or 9-fold), more preferably at least about 10-fold (or 20-, 30-, 40-, 50-, 60-, 70-, 80-, or 90-fold), more preferably at least about 100-fold, more preferably at least about 1,000 fold, or most preferably at least about 100,000-fold. The numbers of T cells may be expanded using any suitable method known in the art. Exemplary methods of expanding the numbers of cells are described in patent publication No. WO 2003057171, U.S. Pat. No. 8,034,334, and U.S. Patent Application Publication No. 2012/0244133, each of which is incorporated herein by reference.


In one embodiment, ex vivo T cell expansion can be performed by isolation of T cells and subsequent stimulation or activation followed by further expansion. In one embodiment of the invention, the T cells may be stimulated or activated by a single agent. In another embodiment, T cells are stimulated or activated with two agents, one that induces a primary signal and a second that is a co-stimulatory signal. Ligands useful for stimulating a single signal or stimulating a primary signal and an accessory molecule that stimulates a second signal may be used in soluble form. Ligands may be attached to the surface of a cell, to an Engineered Multivalent Signaling Platform (EMSP), or immobilized on a surface. In a preferred embodiment both primary and secondary agents are co-immobilized on a surface, for example a bead or a cell. In one embodiment, the molecule providing the primary activation signal may be a CD3 ligand, and the co-stimulatory molecule may be a CD28 ligand or 4-1BB ligand.


In certain embodiments, T cells comprising a CAR or an exogenous TCR, may be manufactured as described in WO2015120096, by a method comprising: enriching a population of lymphocytes obtained from a donor subject; stimulating the population of lymphocytes with one or more T-cell stimulating agents to produce a population of activated T cells, wherein the stimulation is performed in a closed system using serum-free culture medium; transducing the population of activated T cells with a viral vector comprising a nucleic acid molecule which encodes the CAR or TCR, using a single cycle transduction to produce a population of transduced T cells, wherein the transduction is performed in a closed system using serum-free culture medium; and expanding the population of transduced T cells for a predetermined time to produce a population of engineered T cells, wherein the expansion is performed in a closed system using serum-free culture medium. In certain embodiments, T cells comprising a CAR or an exogenous TCR, may be manufactured as described in WO2015120096, by a method comprising: obtaining a population of lymphocytes; stimulating the population of lymphocytes with one or more stimulating agents to produce a population of activated T cells, wherein the stimulation is performed in a closed system using serum-free culture medium; transducing the population of activated T cells with a viral vector comprising a nucleic acid molecule which encodes the CAR or TCR, using at least one cycle transduction to produce a population of transduced T cells, wherein the transduction is performed in a closed system using serum-free culture medium; and expanding the population of transduced T cells to produce a population of engineered T cells, wherein the expansion is performed in a closed system using serum-free culture medium. The predetermined time for expanding the population of transduced T cells may be 3 days. The time from enriching the population of lymphocytes to producing the engineered T cells may be 6 days. The closed system may be a closed bag system. Further provided is population of T cells comprising a CAR or an exogenous TCR obtainable or obtained by said method, and a pharmaceutical composition comprising such cells.


In certain embodiments, T cell maturation or differentiation in vitro may be delayed or inhibited by the method as described in WO2017070395, comprising contacting one or more T cells from a subject in need of a T cell therapy with an AKT inhibitor (such as, e.g., one or a combination of two or more AKT inhibitors disclosed in claim 8 of WO2017070395) and at least one of exogenous Interleukin-7 (IL-7) and exogenous Interleukin-15 (IL-15), wherein the resulting T cells exhibit delayed maturation or differentiation, and/or wherein the resulting T cells exhibit improved T cell function (such as, e.g., increased T cell proliferation; increased cytokine production; and/or increased cytolytic activity) relative to a T cell function of a T cell cultured in the absence of an AKT inhibitor.


In certain embodiments, a patient in need of a T cell therapy may be conditioned by a method as described in WO2016191756 comprising administering to the patient a dose of cyclophosphamide between 200 mg/m2/day and 2000 mg/m2/day and a dose of fludarabine between 20 mg/m2/day and 900 mg/m2/day.


In one embodiment, adoptive cell transfer may comprise: depleting T cells as defined herein from a population of T cells obtained from the subject; in vitro expanding the T cell population; and administering the in vitro expanded T cell population to the subject. In certain embodiments, the method may further comprise formulating the in vitro expanded immune cell or immune cell population into a pharmaceutical composition.


In certain embodiments, suppressive CD8+ T cells are administered in combination with an autoimmune drug. Non-limiting examples of such drugs include methotrexate, cyclophosphamide, Imuran (azathioprine), cyclosporin, and steroid compounds such as prednisone and methylprednisolone.


Cancer

In certain example embodiments, the pharmaceutical compositions and adoptive cell transfer strategies may be used to treat various forms of cancer. Examples of cancer include but are not limited to carcinoma, lymphoma, blastoma, sarcoma, and leukemia or lymphoid malignancies. More particular examples of such cancers include without limitation: squamous cell cancer (e.g., epithelial squamous cell cancer), lung cancer including small-cell lung cancer, non-small cell lung cancer, adenocarcinoma of the lung, squamous carcinoma of the lung and large cell carcinoma of the lung, cancer of the peritoneum, hepatocellular cancer, gastric or stomach cancer including gastrointestinal cancer, pancreatic cancer, glioma, glioblastoma, cervical cancer, ovarian cancer, liver cancer, bladder cancer, hepatoma, breast cancer, colon cancer, rectal cancer, colorectal cancer, endometrial cancer or uterine carcinoma, salivary gland carcinoma, kidney or renal cancer, prostate cancer, vulval cancer, thyroid cancer, hepatic carcinoma, anal carcinoma, penile carcinoma, as well as CNS cancer, melanoma, head and neck cancer, bone cancer, bone marrow cancer, duodenum cancer, oesophageal cancer, thyroid cancer, or hematological cancer.


Other non-limiting examples of cancers or malignancies include, but are not limited to: Acute Childhood Lymphoblastic Leukemia, Acute Lymphoblastic Leukemia, Acute Lymphocytic Leukemia, Acute Myeloid Leukemia, Adrenocortical Carcinoma, Adult (Primary) Hepatocellular Cancer, Adult (Primary) Liver Cancer, Adult Acute Lymphocytic Leukemia, Adult Acute Myeloid Leukemia, Adult Hodgkin's Disease, Adult Hodgkin's Lymphoma, Adult Lymphocytic Leukemia, Adult Non-Hodgkin's Lymphoma, Adult Primary Liver Cancer, Adult Soft Tissue Sarcoma, AIDS-Related Lymphoma, AIDS-Related Malignancies, Anal Cancer, Astrocytoma, Bile Duct Cancer, Bladder Cancer, Bone Cancer, Brain Stem Glioma, Brain Tumours, Breast Cancer, Cancer of the Renal Pelvis and Urethra, Central Nervous System (Primary) Lymphoma, Central Nervous System Lymphoma, Cerebellar Astrocytoma, Cerebral Astrocytoma, Cervical Cancer, Childhood (Primary) Hepatocellular Cancer, Childhood (Primary) Liver Cancer, Childhood Acute Lymphoblastic Leukemia, Childhood Acute Myeloid Leukemia, Childhood Brain Stem Glioma, Glioblastoma, Childhood Cerebellar Astrocytoma, Childhood Cerebral Astrocytoma, Childhood Extracranial Germ Cell Tumours, Childhood Hodgkin's Disease, Childhood Hodgkin's Lymphoma, Childhood Hypothalamic and Visual Pathway Glioma, Childhood Lymphoblastic Leukemia, Childhood Medulloblastoma, Childhood Non-Hodgkin's Lymphoma, Childhood Pineal and Supratentorial Primitive Neuroectodermal Tumours, Childhood Primary Liver Cancer, Childhood Rhabdomyosarcoma, Childhood Soft Tissue Sarcoma, Childhood Visual Pathway and Hypothalamic Glioma, Chronic Lymphocytic Leukemia, Chronic Myelogenous Leukemia, Colon Cancer, Cutaneous T-Cell Lymphoma, Endocrine Pancreas Islet Cell Carcinoma, Endometrial Cancer, Ependymoma, Epithelial Cancer, Esophageal Cancer, Ewing's Sarcoma and Related Tumours, Exocrine Pancreatic Cancer, Extracranial Germ Cell Tumour, Extragonadal Germ Cell Tumour, Extrahepatic Bile Duct Cancer, Eye Cancer, Female Breast Cancer, Gaucher's Disease, Gallbladder Cancer, Gastric Cancer, Gastrointestinal Carcinoid Tumour, Gastrointestinal Tumours, Germ Cell Tumours, Gestational Trophoblastic Tumour, Hairy Cell Leukemia, Head and Neck Cancer, Hepatocellular Cancer, Hodgkin's Disease, Hodgkin's Lymphoma, Hypergammaglobulinemia, Hypopharyngeal Cancer, Intestinal Cancers, Intraocular Melanoma, Islet Cell Carcinoma, Islet Cell Pancreatic Cancer, Kaposi's Sarcoma, Kidney Cancer, Laryngeal Cancer, Lip and Oral Cavity Cancer, Liver Cancer, Lung Cancer, Lymphoproliferative Disorders, Macroglobulinemia, Male Breast Cancer, Malignant Mesothelioma, Malignant Thymoma, Medulloblastoma, Melanoma, Mesothelioma, Metastatic Occult Primary Squamous Neck Cancer, Metastatic Primary Squamous Neck Cancer, Metastatic Squamous Neck Cancer, Multiple Myeloma, Multiple Myeloma/Plasma Cell Neoplasm, Myelodysplastic Syndrome, Myelogenous Leukemia, Myeloid Leukemia, Myeloproliferative Disorders, Nasal Cavity and Paranasal Sinus Cancer, Nasopharyngeal Cancer, Neuroblastoma, Non-Hodgkin's Lymphoma During Pregnancy, Nonmelanoma Skin Cancer, Non-Small Cell Lung Cancer, Occult Primary Metastatic Squamous Neck Cancer, Oropharyngeal Cancer, Osteo-/Malignant Fibrous Sarcoma, Osteosarcoma/Malignant Fibrous Histiocytoma, Osteosarcoma/Malignant Fibrous Histiocytoma of Bone, Ovarian Epithelial Cancer, Ovarian Germ Cell Tumour, Ovarian Low Malignant Potential Tumour, Pancreatic Cancer, Paraproteinemias, Purpura, Parathyroid Cancer, Penile Cancer, Pheochromocytoma, Pituitary Tumour, Plasma Cell Neoplasm/Multiple Myeloma, Primary Central Nervous System Lymphoma, Primary Liver Cancer, Prostate Cancer, Rectal Cancer, Renal Cell Cancer, Renal Pelvis and Urethra Cancer, Retinoblastoma, Rhabdomyosarcoma, Salivary Gland Cancer, Sarcoidosis Sarcomas, Sezary Syndrome, Skin Cancer, Small Cell Lung Cancer, Small Intestine Cancer, Soft Tissue Sarcoma, Squamous Neck Cancer, Stomach Cancer, Supratentorial Primitive Neuroectodermal and Pineal Tumours, T-Cell Lymphoma, Testicular Cancer, Thymoma, Thyroid Cancer, Transitional Cell Cancer of the Renal Pelvis and Urethra, Transitional Renal Pelvis and Urethra Cancer, Trophoblastic Tumours, Urethra and Renal Pelvis Cell Cancer, Urethral Cancer, Uterine Cancer, Uterine Sarcoma, Vaginal Cancer, Visual Pathway and Hypothalamic Glioma, Vulvar Cancer, Waldenstrom's Macroglobulinemia, or Wilms' Tumour.


In further examples, any combinations of methods such as discussed herein may be employed.


Autoimmune Diseases

In certain example embodiments, the pharmaceutical compositions and adoptive cell transfer strategies may be used to treat various autoimmune diseases. As used throughout the present specification, the terms “autoimmune disease” or “autoimmune disorder” used interchangeably refer to a diseases or disorders caused by an immune response against a self-tissue or tissue component (self-antigen) and include a self-antibody response and/or cell-mediated response. The terms encompass organ-specific autoimmune diseases, in which an autoimmune response is directed against a single tissue, as well as non-organ specific autoimmune diseases, in which an autoimmune response is directed against a component present in two or more, several or many organs throughout the body.


Non-limiting examples of autoimmune diseases include but are not limited to acute disseminated encephalomyelitis (ADEM); Addison's disease; ankylosing spondylitis; antiphospholipid antibody syndrome (APS); aplastic anemia; autoimmune gastritis; autoimmune hepatitis; autoimmune thrombocytopenia; Behçet's disease; coeliac disease; dermatomyositis; diabetes mellitus type I; Goodpasture's syndrome; Graves' disease; Guillain-Barre syndrome (GBS); Hashimoto's disease; idiopathic thrombocytopenic purpura; inflammatory bowel disease (IBD) including Crohn's disease and ulcerative colitis; mixed connective tissue disease; multiple sclerosis (MS); myasthenia gravis; opsoclonus myoclonus syndrome (OMS); optic neuritis; Ord's thyroiditis; pemphigus; pernicious anaemia; polyarteritis nodosa; polymyositis; primary biliary cirrhosis; primary myoxedema; psoriasis; rheumatic fever; rheumatoid arthritis; Reiter's syndrome; scleroderma; Sjögren's syndrome; systemic lupus erythematosus; Takayasu's arteritis; temporal arteritis; vitiligo; warm autoimmune hemolytic anemia; or Wegener's granulomatosis.


Identifying Immunomodulators

A further aspect of the invention relates to a method for identifying an immunomodulant capable of modulating one or more phenotypic aspects of an immune cell or immune cell population as disclosed herein, comprising: a) applying a candidate immunomodulant to the immune cell or immune cell population; b) detecting modulation of one or more phenotypic aspects of the immune cell or immune cell population by the candidate immunomodulant, thereby identifying the immunomodulant.


The term “modulate” broadly denotes a qualitative and/or quantitative alteration, change or variation in that which is being modulated. Where modulation can be assessed quantitatively—for example, where modulation comprises or consists of a change in a quantifiable variable such as a quantifiable property of a cell or where a quantifiable variable provides a suitable surrogate for the modulation—modulation specifically encompasses both increase (e.g., activation) or decrease (e.g., inhibition) in the measured variable. The term encompasses any extent of such modulation, e.g., any extent of such increase or decrease, and may more particularly refer to statistically significant increase or decrease in the measured variable. By means of example, modulation may encompass an increase in the value of the measured variable by at least about 10%, e.g., by at least about 20%, preferably by at least about 30%, e.g., by at least about 40%, more preferably by at least about 50%, e.g., by at least about 75%, even more preferably by at least about 100%, e.g., by at least about 150%, 200%, 250%, 300%, 400% or by at least about 500%, compared to a reference situation without said modulation; or modulation may encompass a decrease or reduction in the value of the measured variable by at least about 10%, e.g., by at least about 20%, by at least about 30%, e.g., by at least about 40%, by at least about 50%, e.g., by at least about 60%, by at least about 70%, e.g., by at least about 80%, by at least about 90%, e.g., by at least about 95%, such as by at least about 96%, 97%, 98%, 99% or even by 100%, compared to a reference situation without said modulation. Preferably, modulation may be specific or selective, hence, one or more desired phenotypic aspects of an immune cell or immune cell population may be modulated without substantially altering other (unintended, undesired) phenotypic aspect(s).


The term “immunomodulant” broadly encompasses any condition, substance or agent capable of modulating one or more phenotypic aspects of an immune cell or immune cell population as disclosed herein. Such conditions, substances or agents may be of physical, chemical, biochemical and/or biological nature. The term “candidate immunomodulant” refers to any condition, substance or agent that is being examined for the ability to modulate one or more phenotypic aspects of an immune cell or immune cell population as disclosed herein in a method comprising applying the candidate immunomodulant to the immune cell or immune cell population (e.g., exposing the immune cell or immune cell population to the candidate immunomodulant or contacting the immune cell or immune cell population with the candidate immunomodulant) and observing whether the desired modulation takes place.


Immunomodulants may include any potential class of biologically active conditions, substances or agents, such as for instance antibodies, proteins, peptides, nucleic acids, oligonucleotides, small molecules, or combinations thereof.


By means of example but without limitation, immunomodulants can include low molecular weight compounds, but may also be larger compounds, or any organic or inorganic molecule effective in the given situation, including modified and unmodified nucleic acids such as antisense nucleic acids, RNAi, such as siRNA or shRNA, CRISPR/Cas systems, peptides, peptidomimetics, receptors, ligands, and antibodies, aptamers, polypeptides, nucleic acid analogues or variants thereof. Examples include an oligomer of nucleic acids, amino acids, or carbohydrates including without limitation proteins, oligonucleotides, ribozymes, DNAzymes, glycoproteins, siRNAs, lipoproteins, aptamers, and modifications and combinations thereof. Agents can be selected from a group comprising: chemicals; small molecules; nucleic acid sequences; nucleic acid analogues; proteins; peptides; aptamers; antibodies; or fragments thereof. A nucleic acid sequence can be RNA or DNA, and can be single or double stranded, and can be selected from a group comprising; nucleic acid encoding a protein of interest, oligonucleotides, nucleic acid analogues, for example peptide-nucleic acid (PNA), pseudo-complementary PNA (pc-PNA), locked nucleic acid (LNA), modified RNA (mod-RNA), single guide RNA etc. Such nucleic acid sequences include, for example, but are not limited to, nucleic acid sequence encoding proteins, for example that act as transcriptional repressors, antisense molecules, ribozymes, small inhibitory nucleic acid sequences, for example but are not limited to RNAi, shRNAi, siRNA, micro RNAi (mRNAi), antisense oligonucleotides, CRISPR guide RNA, for example that target a CRISPR enzyme to a specific DNA target sequence etc. A protein and/or peptide or fragment thereof can be any protein of interest, for example, but are not limited to: mutated proteins; therapeutic proteins and truncated proteins, wherein the protein is normally absent or expressed at lower levels in the cell. Proteins can also be selected from a group comprising; mutated proteins, genetically engineered proteins, peptides, synthetic peptides, recombinant proteins, chimeric proteins, antibodies, midibodies, minibodies, triabodies, humanized proteins, humanized antibodies, chimeric antibodies, modified proteins and fragments thereof. Alternatively, the agent can be intracellular within the cell as a result of introduction of a nucleic acid sequence into the cell and its transcription resulting in the production of the nucleic acid and/or protein modulator of a gene within the cell. In some embodiments, the agent is any chemical, entity or moiety, including without limitation synthetic and naturally-occurring non-proteinaceous entities. In certain embodiments, the agent is a small molecule having a chemical moiety. Agents can be known to have a desired activity and/or property, or can be selected from a library of diverse compounds.


In certain embodiments, an immunomodulant may be a hormone, a cytokine, a lymphokine, a growth factor, a chemokine, a cell surface receptor ligand such as a cell surface receptor agonist or antagonist, or a mitogen.


Non-limiting examples of hormones include growth hormone (GH), adrenocorticotropic hormone (ACTH), dehydroepiandrosterone (DHEA), cortisol, epinephrine, thyroid hormone, estrogen, progesterone, testosterone, or combinations thereof.


Non-limiting examples of cytokines include lymphokines (e.g., interferon-γ, IL-2, IL-3, IL-4, IL-6, granulocyte-macrophage colony-stimulating factor (GM-CSF), interferon-γ, leukocyte migration inhibitory factors (T-LIF, B-LIF), lymphotoxin-alpha, macrophage-activating factor (MAF), macrophage migration-inhibitory factor (MIF), neuroleukin, immunologic suppressor factors, transfer factors, or combinations thereof), monokines (e.g., IL-1, TNF-alpha, interferon-α, interferon-β, colony stimulating factors, e.g., CSF2, CSF3, macrophage CSF or GM-CSF, or combinations thereof), chemokines (e.g., beta-thromboglobulin, C chemokines, CC chemokines, CXC chemokines, CX3C chemokines, macrophage inflammatory protein (MIP), or combinations thereof), interleukins (e.g., IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, IL-14, IL-15, IL-17, IL-18, IL-19, IL-20, IL-21, IL-22, IL-23, IL-24, IL-25, IL-26, IL-27, IL-28, IL-29, IL-30, IL-31, IL-32, IL-33, IL-34, IL-35, IL-36, or combinations thereof), and several related signalling molecules, such as tumour necrosis factor (TNF) and interferons (e.g., interferon-α, interferon-β, interferon-γ, interferon-k, or combinations thereof).


Non-limiting examples of growth factors include those of fibroblast growth factor (FGF) family, bone morphogenic protein (BMP) family, platelet derived growth factor (PDGF) family, transforming growth factor beta (TGFbeta) family, nerve growth factor (NGF) family, epidermal growth factor (EGF) family, insulin related growth factor (IGF) family, hepatocyte growth factor (HGF) family, hematopoietic growth factors (HeGFs), platelet-derived endothelial cell growth factor (PD-ECGF), angiopoietin, vascular endothelial growth factor (VEGF) family, glucocorticoids, or combinations thereof.


Non-limiting examples of mitogens include phytohaemagglutinin (PHA), concanavalin A (conA), lipopolysaccharide (LPS), pokeweed mitogen (PWM), phorbol ester such as phorbol myristate acetate (PMA) with or without ionomycin, or combinations thereof.


Non-limiting examples of cell surface receptors the ligands of which may act as immunomodulants include Toll-like receptors (TLRs) (e.g., TLR1, TLR2, TLR3, TLR4, TLR5, TLR6, TLR7, TLR8, TLR9, TLR10, TLR11, TLR12 or TLR13), CD80, CD86, CD40, CCR7, or C-type lectin receptors.


Altering Expression Using Immunomodulants

In certain embodiments, an immunomodulant may alter expression and/or activity of one or more endogenous genes of the suppressive CD8+ T cells. The term “altered expression” denotes that the modification of the immune cell alters, i.e., changes or modulates, the expression of the recited gene(s) or polypeptides(s). The term “altered expression” encompasses any direction and any extent of said alteration. Hence, “altered expression” may reflect qualitative and/or quantitative change(s) of expression, and specifically encompasses both increase (e.g., activation or stimulation) or decrease (e.g., inhibition) of expression.


In certain embodiments, the present invention provides for gene signature screening. The concept of signature screening was introduced by Stegmaier et al. (Gene expression-based high-throughput screening (GE-HTS) and application to leukemia differentiation. Nature Genet. 36, 257-263 (2004)), who realized that if a gene-expression signature was the proxy for a phenotype of interest, it could be used to find small molecules that effect that phenotype without knowledge of a validated drug target. The signatures of the present may be used to screen for drugs that induce or reduce the signature in immune cells as described herein. The signature may be used for GE-HTS. In certain embodiments, pharmacological screens may be used to identify drugs that selectively reduce or increase activity of suppressive immune cells. In certain embodiments, drugs that selectively activate or repress suppressive T cells are used for treatment of a cancer patient or a patient suffering from an autoimmune disease.


The Connectivity Map (cmap) is a collection of genome-wide transcriptional expression data from cultured human cells treated with bioactive small molecules and simple pattern-matching algorithms that together enable the discovery of functional connections between drugs, genes and diseases through the transitory feature of common gene-expression changes (see, Lamb et al., The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease. Science 29 Sep. 2006: Vol. 313, Issue 5795, pp. 1929-1935, DOI: 10.1126/science. 1132939; and Lamb, J., The Connectivity Map: a new tool for biomedical research. Nature Reviews Cancer January 2007: Vol. 7, pp. 54-60). In certain embodiments, Cmap can be used to screen for small molecules capable of modulating a signature of the present invention in silico.


Any one or more of the several successive molecular mechanisms involved in the expression of a given gene or polypeptide may be targeted by the immune cell modification as intended herein. Without limitation, these may include targeting the gene sequence (e.g., targeting the polypeptide-encoding, non-coding and/or regulatory portions of the gene sequence), the transcription of the gene into RNA, the polyadenylation and where applicable splicing and/or other post-transcriptional modifications of the RNA into mRNA, the localization of the mRNA into cell cytoplasm, where applicable other post-transcriptional modifications of the mRNA, the translation of the mRNA into a polypeptide chain, where applicable post-translational modifications of the polypeptide, and/or folding of the polypeptide chain into the mature conformation of the polypeptide. For compartmentalized polypeptides, such as secreted polypeptides and transmembrane polypeptides, this may further include targeting trafficking of the polypeptides, i.e., the cellular mechanism by which polypeptides are transported to the appropriate sub-cellular compartment or organelle, membrane, e.g. the plasma membrane, or outside the cell.


Hence, “altered expression” may particularly denote altered production of the recited gene products by the modified immune cell. As used herein, the term “gene product(s)” includes RNA transcribed from a gene (e.g., mRNA), or a polypeptide encoded by a gene or translated from RNA.


Also, “altered expression” as intended herein may encompass modulating the activity of one or more endogenous gene products. Accordingly, “altered expression”, “altering expression”, “modulating expression”, or “detecting expression” or similar may be used interchangeably with respectively “altered expression or activity”, “altering expression or activity”, “modulating expression or activity”, or “detecting expression or activity” or similar. As used herein, “modulating” or “to modulate” generally means either reducing or inhibiting the activity of a target or antigen, or alternatively increasing the activity of the target or antigen, as measured using a suitable in vitro, cellular or in vivo assay. In particular, “modulating” or “to modulate” can mean either reducing or inhibiting the (relevant or intended) activity of, or alternatively increasing the (relevant or intended) biological activity of the target or antigen, as measured using a suitable in vitro, cellular or in vivo assay (which will usually depend on the target or antigen involved), by at least 5%, at least 10%, at least 25%, at least 50%, at least 60%, at least 70%, at least 80%, or 90% or more, compared to activity of the target or antigen in the same assay under the same conditions but without the presence of the inhibitor/antagonist agents or activator/agonist agents described herein.


As will be clear to the skilled person, “modulating” can also involve effecting a change (which can either be an increase or a decrease) in affinity, avidity, specificity and/or selectivity of a target or antigen, for one or more of its targets compared to the same conditions but without the presence of a modulating agent. Again, this can be determined in any suitable manner and/or using any suitable assay known per se, depending on the target. In particular, an action as an inhibitor/antagonist or activator/agonist can be such that an intended biological or physiological activity is increased or decreased, respectively, by at least 5%, at least 10%, at least 25%, at least 50%, at least 60%, at least 70%, at least 80%, or 90% or more, compared to the biological or physiological activity in the same assay under the same conditions but without the presence of the inhibitor/antagonist agent or activator/agonist agent. Modulating can also involve activating the target or antigen or the mechanism or pathway in which it is involved.


In certain embodiments, an immunomodulant may be or may result in a genetic modification (e.g., mutation, editing, transgenesis, or combinations thereof) of an immune cell, for example, a genetic perturbation, such as a knock-out (i.e., resulting in a complete absence of expression and/or activity) of one or more endogenous genes/gene products, or a knock-down (i.e., resulting in a partial absence of expression and/or activity) of one or more endogenous genes/gene products, or another type of genetic modification modulating the expression and/or activity of one or more endogenous genes/gene products, or for example, introduction of one or more transgenes, such as one or more transgenes encoding one or more gene products. Such transgene may be suitably operably linked to suitable regulatory sequences, e.g., may be comprised in an expression cassette or an expression vector comprising suitable regulatory sequences, or may be configured to become operably linked to suitable regulatory sequences once inserted into the genetic material (e.g., genome) of the immune cell.


Any types of mutations achieving the intended effects are contemplated herein. For example, suitable mutations may include deletions, insertions, and/or substitutions. The term “deletion” refers to a mutation wherein one or more nucleotides, typically consecutive nucleotides, of a nucleic acid are removed, i.e., deleted, from the nucleic acid. The term “insertion” refers to a mutation wherein one or more nucleotides, typically consecutive nucleotides, are added, i.e., inserted, into a nucleic acid. The term “substitution” refers to a mutation wherein one or more nucleotides of a nucleic acid are each independently replaced, i.e., substituted, by another nucleotide.


In certain embodiments, a mutation may introduce a premature in-frame stop codon into the open reading frame (ORF) encoding a gene product. Such premature stop codon may lead to production of a C-terminally truncated form of said polypeptide (this may preferably affect, such as diminish or abolish, some or all biological function(s) of the polypeptide) or, especially when the stop codon is introduced close to (e.g., about 20 or less, or about 10 or less amino acids downstream of) the translation initiation codon of the ORF, the stop codon may effectively abolish the production of the polypeptide. Various ways of introducing a premature in-frame stop codon are apparent to a skilled person. For example but without limitation, a suitable insertion, deletion or substitution of one or more nucleotides in the ORF may introduce the premature in-frame stop codon.


In other embodiments, a mutation may introduce a frame shift (e.g., +1 or +2 frame shift) in the ORF encoding a gene product. Typically, such frame shift may lead to a previously out-of-frame stop codon downstream of the mutation becoming an in-frame stop codon. Hence, such frame shift may lead to production of a form of the polypeptide having an alternative C-terminal portion and/or a C-terminally truncated form of said polypeptide (this may preferably affect, such as diminish or abolish, some or all biological function(s) of the polypeptide) or, especially when the mutation is introduced close to (e.g., about 20 or less, or about 10 or less amino acids downstream of) the translation initiation codon of the ORF, the frame shift may effectively abolish the production of the polypeptide. Various ways of introducing a frame shift are apparent to a skilled person. For example but without limitation, a suitable insertion or deletion of one or more (not multiple of 3) nucleotides in the ORF may lead to a frame shift.


In further embodiments, a mutation may delete at least a portion of the ORF encoding a gene product. Such deletion may lead to production of an N-terminally truncated form, a C-terminally truncated form and/or an internally deleted form of said polypeptide (this may preferably affect, such as diminish or abolish, some or all biological function(s) of the polypeptide). Preferably, the deletion may remove about 20% or more, or about 50% or more of the ORF's nucleotides. Especially when the deletion removes a sizeable portion of the ORF (e.g., about 50% or more, preferably about 60% or more, more preferably about 70% or more, even more preferably about 80% or more, still more preferably about 90% or more of the ORF's nucleotides) or when the deletion removes the entire ORF, the deletion may effectively abolish the production of the polypeptide. The skilled person can readily introduce such deletions.


In further embodiments, a mutation may delete at least a portion of a gene promoter, leading to impaired transcription of the gene product.


In certain other embodiments, a mutation may be a substitution of one or more nucleotides in the ORF encoding a gene product resulting in substitution of one or more amino acids of the polypeptide. Such mutation may typically preserve the production of the polypeptide, and may preferably affect, such as diminish or abolish, some or all biological function(s) of the polypeptide. The skilled person can readily introduce such substitutions.


In certain preferred embodiments, a mutation may abolish native splicing of a pre-mRNA encoding a gene product. In the absence of native splicing, the pre-mRNA may be degraded, or the pre-mRNA may be alternatively spliced, or the pre-mRNA may be spliced improperly employing latent splice site(s) if available. Hence, such mutation may typically effectively abolish the production of the polypeptide's mRNA and thus the production of the polypeptide. Various ways of interfering with proper splicing are available to a skilled person, such as for example but without limitation, mutations which alter the sequence of one or more sequence elements required for splicing to render them inoperable, or mutations which comprise or consist of a deletion of one or more sequence elements required for splicing. The terms “splicing”, “splicing of a gene”, “splicing of a pre-mRNA” and similar as used herein are synonymous and have their art-established meaning. By means of additional explanation, splicing denotes the process and means of removing intervening sequences (introns) from pre-mRNA in the process of producing mature mRNA. The reference to splicing particularly aims at native splicing such as occurs under normal physiological conditions. The terms “pre-mRNA” and “transcript” are used herein to denote RNA species that precede mature mRNA, such as in particular a primary RNA transcript and any partially processed forms thereof. Sequence elements required for splicing refer particularly to cis elements in the sequence of pre-mRNA which direct the cellular splicing machinery (spliceosome) towards correct and precise removal of introns from the pre-mRNA. Sequence elements involved in splicing are generally known per se and can be further determined by known techniques including inter alia mutation or deletion analysis. By means of further explanation, “splice donor site” or “5′ splice site” generally refer to a conserved sequence immediately adjacent to an exon-intron boundary at the 5′ end of an intron. Commonly, a splice donor site may contain a dinucleotide GU, and may involve a consensus sequence of about 8 bases at about positions +2 to −6. “Splice acceptor site” or “3′ splice site” generally refers to a conserved sequence immediately adjacent to an intron-exon boundary at the 3′ end of an intron. Commonly, a splice acceptor site may contain a dinucleotide AG, and may involve a consensus sequence of about 16 bases at about positions −14 to +2.


In certain embodiments, the one or more modulating agents may be a small molecule. The term “small molecule” refers to compounds, preferably organic compounds, with a size comparable to those organic molecules generally used in pharmaceuticals. The term excludes biological macromolecules (e.g., proteins, peptides, nucleic acids, etc.). Preferred small organic molecules range in size up to about 5000 Da, e.g., up to about 4000, preferably up to 3000 Da, more preferably up to 2000 Da, even more preferably up to about 1000 Da, e.g., up to about 900, 800, 700, 600 or up to about 500 Da. In certain embodiments, the small molecule may act as an antagonist or agonist (e.g., blocking an enzyme active site or activating a receptor by binding to a ligand binding site).


One type of small molecule applicable to the present invention is a degrader molecule. Proteolysis Targeting Chimera (PROTAC) technology is a rapidly emerging alternative therapeutic strategy with the potential to address many of the challenges currently faced in modern drug development programs. PROTAC technology employs small molecules that recruit target proteins for ubiquitination and removal by the proteasome (see, e.g., Bondeson and Crews, Targeted Protein Degradation by Small Molecules, Annu Rev Pharmacol Toxicol. 2017 Jan. 6; 57: 107-123; and Lai et al., Modular PROTAC Design for the Degradation of Oncogenic BCR-ABL Angew Chem Int Ed Engl. 2016 Jan. 11; 55(2): 807-810).


Genetic Modifying Agents

In certain embodiments, the one or more modulating agents may be a genetic modifying agent. The genetic modifying agent may comprise a CRISPR system, a zinc finger nuclease system, a TALEN, a meganuclease or RNAi system.


In general, a CRISPR-Cas or CRISPR system as used in herein and in documents, such as WO 2014/093622 (PCT/US2013/074667), refers collectively to transcripts and other elements involved in the expression of or directing the activity of CRISPR-associated (“Cas”) genes, including sequences encoding a Cas gene, a tracr (trans-activating CRISPR) sequence (e.g. tracrRNA or an active partial tracrRNA), a tracr-mate sequence (encompassing a “direct repeat” and a tracrRNA-processed partial direct repeat in the context of an endogenous CRISPR system), a guide sequence (also referred to as a “spacer” in the context of an endogenous CRISPR system), or “RNA(s)” as that term is herein used (e.g., RNA(s) to guide Cas, such as Cas9, e.g. CRISPR RNA and transactivating (tracr) RNA or a single guide RNA (sgRNA) (chimeric RNA)) or other sequences and transcripts from a CRISPR locus. In general, a CRISPR system is characterized by elements that promote the formation of a CRISPR complex at the site of a target sequence (also referred to as a protospacer in the context of an endogenous CRISPR system). See, e.g, Shmakov et al. (2015) “Discovery and Functional Characterization of Diverse Class 2 CRISPR-Cas Systems”, Molecular Cell, DOI: dx.doi.org/10.1016/j.molcel.2015.10.008.


In certain embodiments, a protospacer adjacent motif (PAM) or PAM-like motif directs binding of the effector protein complex as disclosed herein to the target locus of interest. In some embodiments, the PAM may be a 5′ PAM (i.e., located upstream of the 5′ end of the protospacer). In other embodiments, the PAM may be a 3′ PAM (i.e., located downstream of the 5′ end of the protospacer). The term “PAM” may be used interchangeably with the term “PFS” or “protospacer flanking site” or “protospacer flanking sequence”.


In a preferred embodiment, the CRISPR effector protein may recognize a 3′ PAM. In certain embodiments, the CRISPR effector protein may recognize a 3′ PAM which is 5′H, wherein H is A, C or U.


In the context of formation of a CRISPR complex, “target sequence” refers to a sequence to which a guide sequence is designed to have complementarity, where hybridization between a target sequence and a guide sequence promotes the formation of a CRISPR complex. A target sequence may comprise RNA polynucleotides. The term “target RNA” refers to a RNA polynucleotide being or comprising the target sequence. In other words, the target RNA may be a RNA polynucleotide or a part of a RNA polynucleotide to which a part of the gRNA, i.e. the guide sequence, is designed to have complementarity and to which the effector function mediated by the complex comprising CRISPR effector protein and a gRNA is to be directed. In some embodiments, a target sequence is located in the nucleus or cytoplasm of a cell.


In certain example embodiments, the CRISPR effector protein may be delivered using a nucleic acid molecule encoding the CRISPR effector protein. The nucleic acid molecule encoding a CRISPR effector protein, may advantageously be a codon optimized CRISPR effector protein. An example of a codon optimized sequence, is in this instance a sequence optimized for expression in eukaryote, e.g., humans (i.e. being optimized for expression in humans), or for another eukaryote, animal or mammal as herein discussed; see, e.g., SaCas9 human codon optimized sequence in WO 2014/093622 (PCT/US2013/074667). Whilst this is preferred, it will be appreciated that other examples are possible and codon optimization for a host species other than human, or for codon optimization for specific organs is known. In some embodiments, an enzyme coding sequence encoding a CRISPR effector protein is a codon optimized for expression in particular cells, such as eukaryotic cells. The eukaryotic cells may be those of or derived from a particular organism, such as a plant or a mammal, including but not limited to human, or non human eukaryote or animal or mammal as herein discussed, e.g., mouse, rat, rabbit, dog, livestock, or non-human mammal or primate. In some embodiments, processes for modifying the germ line genetic identity of human beings and/or processes for modifying the genetic identity of animals which are likely to cause them suffering without any substantial medical benefit to man or animal, and also animals resulting from such processes, may be excluded. In general, codon optimization refers to a process of modifying a nucleic acid sequence for enhanced expression in the host cells of interest by replacing at least one codon (e.g. about or more than about 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more codons) of the native sequence with codons that are more frequently or most frequently used in the genes of that host cell while maintaining the native amino acid sequence. Various species exhibit particular bias for certain codons of a particular amino acid. Codon bias (differences in codon usage between organisms) often correlates with the efficiency of translation of messenger RNA (mRNA), which is in turn believed to be dependent on, among other things, the properties of the codons being translated and the availability of particular transfer RNA (tRNA) molecules. The predominance of selected tRNAs in a cell is generally a reflection of the codons used most frequently in peptide synthesis. Accordingly, genes can be tailored for optimal gene expression in a given organism based on codon optimization. Codon usage tables are readily available, for example, at the “Codon Usage Database” available at kazusa.orjp/codon/ and these tables can be adapted in a number of ways. See Nakamura, Y., et al. “Codon usage tabulated from the international DNA sequence databases: status for the year 2000” Nucl. Acids Res. 28:292 (2000). Computer algorithms for codon optimizing a particular sequence for expression in a particular host cell are also available, such as Gene Forge (Aptagen; Jacobus, P A), are also available. In some embodiments, one or more codons (e.g. 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more, or all codons) in a sequence encoding a Cas correspond to the most frequently used codon for a particular amino acid.


In certain embodiments, the methods as described herein may comprise providing a Cas transgenic cell in which one or more nucleic acids encoding one or more guide RNAs are provided or introduced operably connected in the cell with a regulatory element comprising a promoter of one or more gene of interest. As used herein, the term “Cas transgenic cell” refers to a cell, such as a eukaryotic cell, in which a Cas gene has been genomically integrated. The nature, type, or origin of the cell are not particularly limiting according to the present invention. Also the way the Cas transgene is introduced in the cell may vary and can be any method as is known in the art. In certain embodiments, the Cas transgenic cell is obtained by introducing the Cas transgene in an isolated cell. In certain other embodiments, the Cas transgenic cell is obtained by isolating cells from a Cas transgenic organism. By means of example, and without limitation, the Cas transgenic cell as referred to herein may be derived from a Cas transgenic eukaryote, such as a Cas knock-in eukaryote. Reference is made to WO 2014/093622 (PCT/US13/74667), incorporated herein by reference. Methods of US Patent Publication Nos. 20120017290 and 20110265198 assigned to Sangamo BioSciences, Inc. directed to targeting the Rosa locus may be modified to utilize the CRISPR Cas system of the present invention. Methods of US Patent Publication No. 20130236946 assigned to Cellectis directed to targeting the Rosa locus may also be modified to utilize the CRISPR Cas system of the present invention. By means of further example reference is made to Platt et. al. (Cell; 159(2):440-455 (2014)), describing a Cas9 knock-in mouse, which is incorporated herein by reference. The Cas transgene can further comprise a Lox-Stop-polyA-Lox(LSL) cassette thereby rendering Cas expression inducible by Cre recombinase. Alternatively, the Cas transgenic cell may be obtained by introducing the Cas transgene in an isolated cell. Delivery systems for transgenes are well known in the art. By means of example, the Cas transgene may be delivered in for instance eukaryotic cell by means of vector (e.g., AAV, adenovirus, lentivirus) and/or particle and/or nanoparticle delivery, as also described herein elsewhere.


It will be understood by the skilled person that the cell, such as the Cas transgenic cell, as referred to herein may comprise further genomic alterations besides having an integrated Cas gene or the mutations arising from the sequence specific action of Cas when complexed with RNA capable of guiding Cas to a target locus.


In certain aspects the invention involves vectors, e.g. for delivering or introducing in a cell Cas and/or RNA capable of guiding Cas to a target locus (i.e. guide RNA), but also for propagating these components (e.g. in prokaryotic cells). A used herein, a “vector” is a tool that allows or facilitates the transfer of an entity from one environment to another. It is a replicon, such as a plasmid, phage, or cosmid, into which another DNA segment may be inserted so as to bring about the replication of the inserted segment. Generally, a vector is capable of replication when associated with the proper control elements. In general, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. Vectors include, but are not limited to, nucleic acid molecules that are single-stranded, double-stranded, or partially double-stranded; nucleic acid molecules that comprise one or more free ends, no free ends (e.g. circular); nucleic acid molecules that comprise DNA, RNA, or both; and other varieties of polynucleotides known in the art. One type of vector is a “plasmid,” which refers to a circular double stranded DNA loop into which additional DNA segments can be inserted, such as by standard molecular cloning techniques. Another type of vector is a viral vector, wherein virally-derived DNA or RNA sequences are present in the vector for packaging into a virus (e.g. retroviruses, replication defective retroviruses, adenoviruses, replication defective adenoviruses, and adeno-associated viruses (AAVs)). Viral vectors also include polynucleotides carried by a virus for transfection into a host cell. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g. bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively-linked. Such vectors are referred to herein as “expression vectors.” Common expression vectors of utility in recombinant DNA techniques are often in the form of plasmids.


Recombinant expression vectors can comprise a nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory elements, which may be selected on the basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, “operably linked” is intended to mean that the nucleotide sequence of interest is linked to the regulatory element(s) in a manner that allows for expression of the nucleotide sequence (e.g. in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell). With regards to recombination and cloning methods, mention is made of U.S. patent application Ser. No. 10/815,730, published Sep. 2, 2004 as US 2004-0171156 A1, the contents of which are herein incorporated by reference in their entirety. Thus, the embodiments disclosed herein may also comprise transgenic cells comprising the CRISPR effector system. In certain example embodiments, the transgenic cell may function as an individual discrete volume. In other words samples comprising a masking construct may be delivered to a cell, for example in a suitable delivery vesicle and if the target is present in the delivery vesicle the CRISPR effector is activated and a detectable signal generated.


The vector(s) can include the regulatory element(s), e.g., promoter(s). The vector(s) can comprise Cas encoding sequences, and/or a single, but possibly also can comprise at least 3 or 8 or 16 or 32 or 48 or 50 guide RNA(s) (e.g., sgRNAs) encoding sequences, such as 1-2, 1-3, 1-4 1-5, 3-6, 3-7, 3-8, 3-9, 3-10, 3-8, 3-16, 3-30, 3-32, 3-48, 3-50 RNA(s) (e.g., sgRNAs). In a single vector there can be a promoter for each RNA (e.g., sgRNA), advantageously when there are up to about 16 RNA(s); and, when a single vector provides for more than 16 RNA(s), one or more promoter(s) can drive expression of more than one of the RNA(s), e.g., when there are 32 RNA(s), each promoter can drive expression of two RNA(s), and when there are 48 RNA(s), each promoter can drive expression of three RNA(s). By simple arithmetic and well established cloning protocols and the teachings in this disclosure one skilled in the art can readily practice the invention as to the RNA(s) for a suitable exemplary vector such as AAV, and a suitable promoter such as the U6 promoter. For example, the packaging limit of AAV is ˜4.7 kb. The length of a single U6-gRNA (plus restriction sites for cloning) is 361 bp. Therefore, the skilled person can readily fit about 12-16, e.g., 13 U6-gRNA cassettes in a single vector. This can be assembled by any suitable means, such as a golden gate strategy used for TALE assembly (genome-engineering.org/taleffectors/). The skilled person can also use a tandem guide strategy to increase the number of U6-gRNAs by approximately 1.5 times, e.g., to increase from 12-16, e.g., 13 to approximately 18-24, e.g., about 19 U6-gRNAs. Therefore, one skilled in the art can readily reach approximately 18-24, e.g., about 19 promoter-RNAs, e.g., U6-gRNAs in a single vector, e.g., an AAV vector. A further means for increasing the number of promoters and RNAs in a vector is to use a single promoter (e.g., U6) to express an array of RNAs separated by cleavable sequences. And an even further means for increasing the number of promoter-RNAs in a vector, is to express an array of promoter-RNAs separated by cleavable sequences in the intron of a coding sequence or gene; and, in this instance it is advantageous to use a polymerase II promoter, which can have increased expression and enable the transcription of long RNA in a tissue specific manner. (see, e.g., nar.oxfordjournals.org/content/34/7/e53.short and nature.com/mt/journal/v16/n9/abs/mt2008144a.html). In an advantageous embodiment, AAV may package U6 tandem gRNA targeting up to about 50 genes. Accordingly, from the knowledge in the art and the teachings in this disclosure the skilled person can readily make and use vector(s), e.g., a single vector, expressing multiple RNAs or guides under the control or operatively or functionally linked to one or more promoters-especially as to the numbers of RNAs or guides discussed herein, without any undue experimentation.


The guide RNA(s) encoding sequences and/or Cas encoding sequences, can be functionally or operatively linked to regulatory element(s) and hence the regulatory element(s) drive expression. The promoter(s) can be constitutive promoter(s) and/or conditional promoter(s) and/or inducible promoter(s) and/or tissue specific promoter(s). The promoter can be selected from the group consisting of RNA polymerases, pol I, pol II, pol III, T7, U6, H1, retroviral Rous sarcoma virus (RSV) LTR promoter, the cytomegalovirus (CMV) promoter, the SV40 promoter, the dihydrofolate reductase promoter, the β-actin promoter, the phosphoglycerol kinase (PGK) promoter, and the EF1α promoter. An advantageous promoter is the promoter is U6.


Additional effectors for use according to the invention can be identified by their proximity to cas1 genes, for example, though not limited to, within the region 20 kb from the start of the cas1 gene and 20 kb from the end of the cas1 gene. In certain embodiments, the effector protein comprises at least one HEPN domain and at least 500 amino acids, and wherein the C2c2 effector protein is naturally present in a prokaryotic genome within 20 kb upstream or downstream of a Cas gene or a CRISPR array. Non-limiting examples of Cas proteins include Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cash, Cas7, Cas8, Cas9 (also known as Csn1 and Csx12), Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csx1, Csx15, Csf1, Csf2, Csf3, Csf4, homologues thereof, or modified versions thereof. In certain example embodiments, the C2c2 effector protein is naturally present in a prokaryotic genome within 20 kb upstream or downstream of a Cas 1 gene. The terms “orthologue” (also referred to as “ortholog” herein) and “homologue” (also referred to as “homolog” herein) are well known in the art. By means of further guidance, a “homologue” of a protein as used herein is a protein of the same species which performs the same or a similar function as the protein it is a homologue of. Homologous proteins may but need not be structurally related, or are only partially structurally related. An “orthologue” of a protein as used herein is a protein of a different species which performs the same or a similar function as the protein it is an orthologue of Orthologous proteins may but need not be structurally related, or are only partially structurally related.


Guide Molecules

The methods described herein may be used to screen inhibition of CRISPR systems employing different types of guide molecules. As used herein, the term “guide sequence” and “guide molecule” in the context of a CRISPR-Cas system, comprises any polynucleotide sequence having sufficient complementarity with a target nucleic acid sequence to hybridize with the target nucleic acid sequence and direct sequence-specific binding of a nucleic acid-targeting complex to the target nucleic acid sequence. The guide sequences made using the methods disclosed herein may be a full-length guide sequence, a truncated guide sequence, a full-length sgRNA sequence, a truncated sgRNA sequence, or an E+F sgRNA sequence. In some embodiments, the degree of complementarity of the guide sequence to a given target sequence, when optimally aligned using a suitable alignment algorithm, is about or more than about 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, or more. In certain example embodiments, the guide molecule comprises a guide sequence that may be designed to have at least one mismatch with the target sequence, such that a RNA duplex formed between the guide sequence and the target sequence. Accordingly, the degree of complementarity is preferably less than 99%. For instance, where the guide sequence consists of 24 nucleotides, the degree of complementarity is more particularly about 96% or less. In particular embodiments, the guide sequence is designed to have a stretch of two or more adjacent mismatching nucleotides, such that the degree of complementarity over the entire guide sequence is further reduced. For instance, where the guide sequence consists of 24 nucleotides, the degree of complementarity is more particularly about 96% or less, more particularly, about 92% or less, more particularly about 88% or less, more particularly about 84% or less, more particularly about 80% or less, more particularly about 76% or less, more particularly about 72% or less, depending on whether the stretch of two or more mismatching nucleotides encompasses 2, 3, 4, 5, 6 or 7 nucleotides, etc. In some embodiments, aside from the stretch of one or more mismatching nucleotides, the degree of complementarity, when optimally aligned using a suitable alignment algorithm, is about or more than about 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, or more. Optimal alignment may be determined with the use of any suitable algorithm for aligning sequences, non-limiting example of which include the Smith-Waterman algorithm, the Needleman-Wunsch algorithm, algorithms based on the Burrows-Wheeler Transform (e.g., the Burrows Wheeler Aligner), ClustalW, Clustal X, BLAT, Novoalign (Novocraft Technologies; available at www.novocraft.com), ELAND (Illumina, San Diego, Calif.), SOAP (available at soap.genomics.org.cn), and Maq (available at maq.sourceforge.net). The ability of a guide sequence (within a nucleic acid-targeting guide RNA) to direct sequence-specific binding of a nucleic acid-targeting complex to a target nucleic acid sequence may be assessed by any suitable assay. For example, the components of a nucleic acid-targeting CRISPR system sufficient to form a nucleic acid-targeting complex, including the guide sequence to be tested, may be provided to a host cell having the corresponding target nucleic acid sequence, such as by transfection with vectors encoding the components of the nucleic acid-targeting complex, followed by an assessment of preferential targeting (e.g., cleavage) within the target nucleic acid sequence, such as by Surveyor assay as described herein. Similarly, cleavage of a target nucleic acid sequence (or a sequence in the vicinity thereof) may be evaluated in a test tube by providing the target nucleic acid sequence, components of a nucleic acid-targeting complex, including the guide sequence to be tested and a control guide sequence different from the test guide sequence, and comparing binding or rate of cleavage at or in the vicinity of the target sequence between the test and control guide sequence reactions. Other assays are possible, and will occur to those skilled in the art. A guide sequence, and hence a nucleic acid-targeting guide RNA may be selected to target any target nucleic acid sequence.


In certain embodiments, the guide sequence or spacer length of the guide molecules is from 15 to 50 nt. In certain embodiments, the spacer length of the guide RNA is at least 15 nucleotides. In certain embodiments, the spacer length is from 15 to 17 nt, e.g., 15, 16, or 17 nt, from 17 to 20 nt, e.g., 17, 18, 19, or 20 nt, from 20 to 24 nt, e.g., 20, 21, 22, 23, or 24 nt, from 23 to 25 nt, e.g., 23, 24, or 25 nt, from 24 to 27 nt, e.g., 24, 25, 26, or 27 nt, from 27-30 nt, e.g., 27, 28, 29, or 30 nt, from 30-35 nt, e.g., 30, 31, 32, 33, 34, or 35 nt, or 35 nt or longer. In certain example embodiment, the guide sequence is 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 39 40, 41, 42, 43, 44, 45, 46, 47 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100 nt.


In some embodiments, the guide sequence is an RNA sequence of between 10 to 50 nt in length, but more particularly of about 20-30 nt advantageously about 20 nt, 23-25 nt or 24 nt. The guide sequence is selected so as to ensure that it hybridizes to the target sequence. This is described more in detail below. Selection can encompass further steps which increase efficacy and specificity.


In some embodiments, the guide sequence has a canonical length (e.g., about 15-30 nt) is used to hybridize with the target RNA or DNA. In some embodiments, a guide molecule is longer than the canonical length (e.g., >30 nt) is used to hybridize with the target RNA or DNA, such that a region of the guide sequence hybridizes with a region of the RNA or DNA strand outside of the Cas-guide target complex. This can be of interest where additional modifications, such deamination of nucleotides is of interest. In alternative embodiments, it is of interest to maintain the limitation of the canonical guide sequence length.


In some embodiments, the sequence of the guide molecule (direct repeat and/or spacer) is selected to reduce the degree secondary structure within the guide molecule. In some embodiments, about or less than about 75%, 50%, 40%, 30%, 25%, 20%, 15%, 10%, 5%, 1%, or fewer of the nucleotides of the nucleic acid-targeting guide RNA participate in self-complementary base pairing when optimally folded. Optimal folding may be determined by any suitable polynucleotide folding algorithm. Some programs are based on calculating the minimal Gibbs free energy. An example of one such algorithm is mFold, as described by Zuker and Stiegler (Nucleic Acids Res. 9 (1981), 133-148). Another example folding algorithm is the online webserver RNAfold, developed at Institute for Theoretical Chemistry at the University of Vienna, using the centroid structure prediction algorithm (see e.g., A. R. Gruber et al., 2008, Cell 106(1): 23-24; and P A Carr and G M Church, 2009, Nature Biotechnology 27(12): 1151-62).


In some embodiments, it is of interest to reduce the susceptibility of the guide molecule to RNA cleavage, such as to cleavage by Cas13. Accordingly, in particular embodiments, the guide molecule is adjusted to avoide cleavage by Cas13 or other RNA-cleaving enzymes.


In certain embodiments, the guide molecule comprises non-naturally occurring nucleic acids and/or non-naturally occurring nucleotides and/or nucleotide analogs, and/or chemically modifications. Preferably, these non-naturally occurring nucleic acids and non-naturally occurring nucleotides are located outside the guide sequence. Non-naturally occurring nucleic acids can include, for example, mixtures of naturally and non-naturally occurring nucleotides. Non-naturally occurring nucleotides and/or nucleotide analogs may be modified at the ribose, phosphate, and/or base moiety. In an embodiment of the invention, a guide nucleic acid comprises ribonucleotides and non-ribonucleotides. In one such embodiment, a guide comprises one or more ribonucleotides and one or more deoxyribonucleotides. In an embodiment of the invention, the guide comprises one or more non-naturally occurring nucleotide or nucleotide analog such as a nucleotide with phosphorothioate linkage, a locked nucleic acid (LNA) nucleotides comprising a methylene bridge between the 2′ and 4′ carbons of the ribose ring, or bridged nucleic acids (BNA). Other examples of modified nucleotides include 2′-O-methyl analogs, 2′-deoxy analogs, or 2′-fluoro analogs. Further examples of modified bases include, but are not limited to, 2-aminopurine, 5-bromo-uridine, pseudouridine, inosine, 7-methylguanosine. Examples of guide RNA chemical modifications include, without limitation, incorporation of 2′-O-methyl (M), 2′-O-methyl 3′ phosphorothioate (MS), S-constrained ethyl(cEt), or 2′-O-methyl 3′ thioPACE (MSP) at one or more terminal nucleotides. Such chemically modified guides can comprise increased stability and increased activity as compared to unmodified guides, though on-target vs. off-target specificity is not predictable. (See, Hendel, 2015, Nat Biotechnol. 33(9):985-9, doi: 10.1038/nbt.3290, published online 29 Jun. 2015 Ragdarm et al., 0215, PNAS, E7110-E7111; Allerson et al., J. Med. Chem. 2005, 48:901-904; Bramsen et al., Front. Genet., 2012, 3:154; Deng et al., PNAS, 2015, 112:11870-11875; Sharma et al., MedChemComm., 2014, 5:1454-1471; Hendel et al., Nat. Biotechnol. (2015) 33(9): 985-989; Li et al., Nature Biomedical Engineering, 2017, 1, 0066 DOI:10.1038/s41551-017-0066). In some embodiments, the 5′ and/or 3′ end of a guide RNA is modified by a variety of functional moieties including fluorescent dyes, polyethylene glycol, cholesterol, proteins, or detection tags. (See Kelly et al., 2016, J. Biotech. 233:74-83). In certain embodiments, a guide comprises ribonucleotides in a region that binds to a target RNA and one or more deoxyribonucletides and/or nucleotide analogs in a region that binds to Cas13. In an embodiment of the invention, deoxyribonucleotides and/or nucleotide analogs are incorporated in engineered guide structures, such as, without limitation, stem-loop regions, and the seed region. For Cas13 guide, in certain embodiments, the modification is not in the 5′-handle of the stem-loop regions. Chemical modification in the 5′-handle of the stem-loop region of a guide may abolish its function (see Li, et al., Nature Biomedical Engineering, 2017, 1:0066). In certain embodiments, at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, or 75 nucleotides of a guide is chemically modified. In some embodiments, 3-5 nucleotides at either the 3′ or the 5′ end of a guide is chemically modified. In some embodiments, only minor modifications are introduced in the seed region, such as 2′-F modifications. In some embodiments, 2′-F modification is introduced at the 3′ end of a guide. In certain embodiments, three to five nucleotides at the 5′ and/or the 3′ end of the guide are chemicially modified with 2′-O-methyl (M), 2′-O-methyl 3′ phosphorothioate (MS), S-constrained ethyl(cEt), or 2′-O-methyl 3′ thioPACE (MSP). Such modification can enhance genome editing efficiency (see Hendel et al., Nat. Biotechnol. (2015) 33(9): 985-989). In certain embodiments, all of the phosphodiester bonds of a guide are substituted with phosphorothioates (PS) for enhancing levels of gene disruption. In certain embodiments, more than five nucleotides at the 5′ and/or the 3′ end of the guide are chemicially modified with 2′-O-Me, 2′-F or S-constrained ethyl(cEt). Such chemically modified guide can mediate enhanced levels of gene disruption (see Ragdarm et al., 0215, PNAS, E7110-E7111). In an embodiment of the invention, a guide is modified to comprise a chemical moiety at its 3′ and/or 5′ end. Such moieties include, but are not limited to amine, azide, alkyne, thio, dibenzocyclooctyne (DBCO), or Rhodamine. In certain embodiment, the chemical moiety is conjugated to the guide by a linker, such as an alkyl chain. In certain embodiments, the chemical moiety of the modified guide can be used to attach the guide to another molecule, such as DNA, RNA, protein, or nanoparticles. Such chemically modified guide can be used to identify or enrich cells generically edited by a CRISPR system (see Lee et al., eLife, 2017, 6:e25312, DOI:10.7554).


In some embodiments, the modification to the guide is a chemical modification, an insertion, a deletion or a split. In some embodiments, the chemical modification includes, but is not limited to, incorporation of 2′-O-methyl (M) analogs, 2′-deoxy analogs, 2-thiouridine analogs, N6-methyladenosine analogs, 2′-fluoro analogs, 2-aminopurine, 5-bromo-uridine, pseudouridine (P), N1-methylpseudouridine (me1Ψ), 5-methoxyuridine(5moU), inosine, 7-methylguanosine, 2′ O-methyl 3′phosphorothioate (MS), S-constrained ethyl(cEt), phosphorothioate (PS), or 2′-O-methyl 3′thioPACE (MSP). In some embodiments, the guide comprises one or more of phosphorothioate modifications. In certain embodiments, at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, or 25 nucleotides of the guide are chemically modified. In certain embodiments, one or more nucleotides in the seed region are chemically modified. In certain embodiments, one or more nucleotides in the 3′-terminus are chemically modified. In certain embodiments, none of the nucleotides in the 5′-handle is chemically modified. In some embodiments, the chemical modification in the seed region is a minor modification, such as incorporation of a 2′-fluoro analog. In a specific embodiment, one nucleotide of the seed region is replaced with a 2′-fluoro analog. In some embodiments, 5 to 10 nucleotides in the 3′-terminus are chemically modified. Such chemical modifications at the 3′-terminus of the Cas13 CrRNA may improve Cas13 activity. In a specific embodiment, 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 nucleotides in the 3′-terminus are replaced with 2′-fluoro analogues. In a specific embodiment, 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 nucleotides in the 3′-terminus are replaced with 2′-O-methyl (M) analogs.


In some embodiments, the loop of the 5′-handle of the guide is modified. In some embodiments, the loop of the 5′-handle of the guide is modified to have a deletion, an insertion, a split, or chemical modifications. In certain embodiments, the modified loop comprises 3, 4, or 5 nucleotides. In certain embodiments, the loop comprises the sequence of UCUU, UUUU, UAUU, or UGUU.


In some embodiments, the guide molecule forms a stemloop with a separate non-covalently linked sequence, which can be DNA or RNA. In particular embodiments, the sequences forming the guide are first synthesized using the standard phosphoramidite synthetic protocol (Herdewijn, P., ed., Methods in Molecular Biology Col 288, Oligonucleotide Synthesis: Methods and Applications, Humana Press, New Jersey (2012)). In some embodiments, these sequences can be functionalized to contain an appropriate functional group for ligation using the standard protocol known in the art (Hermanson, G. T., Bioconjugate Techniques, Academic Press (2013)). Examples of functional groups include, but are not limited to, hydroxyl, amine, carboxylic acid, carboxylic acid halide, carboxylic acid active ester, aldehyde, carbonyl, chlorocarbonyl, imidazolylcarbonyl, hydrozide, semicarbazide, thio semicarbazide, thiol, maleimide, haloalkyl, sufonyl, ally, propargyl, diene, alkyne, and azide. Once this sequence is functionalized, a covalent chemical bond or linkage can be formed between this sequence and the direct repeat sequence. Examples of chemical bonds include, but are not limited to, those based on carbamates, ethers, esters, amides, imines, amidines, aminotrizines, hydrozone, disulfides, thioethers, thioesters, phosphorothioates, phosphorodithioates, sulfonamides, sulfonates, fulfones, sulfoxides, ureas, thioureas, hydrazide, oxime, triazole, photolabile linkages, C—C bond forming groups such as Diels-Alder cyclo-addition pairs or ring-closing metathesis pairs, and Michael reaction pairs.


In some embodiments, these stem-loop forming sequences can be chemically synthesized. In some embodiments, the chemical synthesis uses automated, solid-phase oligonucleotide synthesis machines with 2′-acetoxyethyl orthoester (2′-ACE) (Scaringe et al., J. Am. Chem. Soc. (1998) 120: 11820-11821; Scaringe, Methods Enzymol. (2000) 317: 3-18) or 2′-thionocarbamate (2′-TC) chemistry (Dellinger et al., J. Am. Chem. Soc. (2011) 133: 11540-11546; Hendel et al., Nat. Biotechnol. (2015) 33:985-989).


In certain embodiments, the guide molecule comprises (1) a guide sequence capable of hybridizing to a target locus and (2) a tracr mate or direct repeat sequence whereby the direct repeat sequence is located upstream (i.e., 5′) from the guide sequence. In a particular embodiment the seed sequence (i.e. the sequence essential critical for recognition and/or hybridization to the sequence at the target locus) of th guide sequence is approximately within the first 10 nucleotides of the guide sequence.


In a particular embodiment the guide molecule comprises a guide sequence linked to a direct repeat sequence, wherein the direct repeat sequence comprises one or more stem loops or optimized secondary structures. In particular embodiments, the direct repeat has a minimum length of 16 nts and a single stem loop. In further embodiments the direct repeat has a length longer than 16 nts, preferably more than 17 nts, and has more than one stem loops or optimized secondary structures. In particular embodiments the guide molecule comprises or consists of the guide sequence linked to all or part of the natural direct repeat sequence. A typical Type V or Type VI CRISPR-cas guide molecule comprises (in 3′ to 5′ direction or in 5′ to 3′ direction): a guide sequence a first complimentary stretch (the “repeat”), a loop (which is typically 4 or 5 nucleotides long), a second complimentary stretch (the “anti-repeat” being complimentary to the repeat), and a poly A (often poly U in RNA) tail (terminator). In certain embodiments, the direct repeat sequence retains its natural architecture and forms a single stem loop. In particular embodiments, certain aspects of the guide architecture can be modified, for example by addition, subtraction, or substitution of features, whereas certain other aspects of guide architecture are maintained. Preferred locations for engineered guide molecule modifications, including but not limited to insertions, deletions, and substitutions include guide termini and regions of the guide molecule that are exposed when complexed with the CRISPR-Cas protein and/or target, for example the stemloop of the direct repeat sequence.


In particular embodiments, the stem comprises at least about 4 bp comprising complementary X and Y sequences, although stems of more, e.g., 5, 6, 7, 8, 9, 10, 11 or 12 or fewer, e.g., 3, 2, base pairs are also contemplated. Thus, for example X2-10 and Y2-10 (wherein X and Y represent any complementary set of nucleotides) may be contemplated. In one aspect, the stem made of the X and Y nucleotides, together with the loop will form a complete hairpin in the overall secondary structure; and, this may be advantageous and the amount of base pairs can be any amount that forms a complete hairpin. In one aspect, any complementary X:Y basepairing sequence (e.g., as to length) is tolerated, so long as the secondary structure of the entire guide molecule is preserved. In one aspect, the loop that connects the stem made of X:Y basepairs can be any sequence of the same length (e.g., 4 or 5 nucleotides) or longer that does not interrupt the overall secondary structure of the guide molecule. In one aspect, the stemloop can further comprise, e.g. an MS2 aptamer. In one aspect, the stem comprises about 5-7 bp comprising complementary X and Y sequences, although stems of more or fewer basepairs are also contemplated. In one aspect, non-Watson Crick basepairing is contemplated, where such pairing otherwise generally preserves the architecture of the stemloop at that position.


In particular embodiments the natural hairpin or stemloop structure of the guide molecule is extended or replaced by an extended stemloop. It has been demonstrated that extension of the stem can enhance the assembly of the guide molecule with the CRISPR-Cas proten (Chen et al. Cell. (2013); 155(7): 1479-1491). In particular embodiments the stem of the stemloop is extended by at least 1, 2, 3, 4, 5 or more complementary basepairs (i.e. corresponding to the addition of 2, 4, 6, 8, 10 or more nucleotides in the guide molecule). In particular embodiments these are located at the end of the stem, adjacent to the loop of the stemloop.


In particular embodiments, the susceptibility of the guide molecule to RNAses or to decreased expression can be reduced by slight modifications of the sequence of the guide molecule which do not affect its function. For instance, in particular embodiments, premature termination of transcription, such as premature transcription of U6 Pol-III, can be removed by modifying a putative Pol-III terminator (4 consecutive U's) in the guide molecules sequence. Where such sequence modification is required in the stemloop of the guide molecule, it is preferably ensured by a basepair flip.


In a particular embodiment, the direct repeat may be modified to comprise one or more protein-binding RNA aptamers. In a particular embodiment, one or more aptamers may be included such as part of optimized secondary structure. Such aptamers may be capable of binding a bacteriophage coat protein as detailed further herein.


In some embodiments, the guide molecule forms a duplex with a target RNA comprising at least one target cytosine residue to be edited. Upon hybridization of the guide RNA molecule to the target RNA, the cytidine deaminase binds to the single strand RNA in the duplex made accessible by the mismatch in the guide sequence and catalyzes deamination of one or more target cytosine residues comprised within the stretch of mismatching nucleotides.


A guide sequence, and hence a nucleic acid-targeting guide RNA may be selected to target any target nucleic acid sequence. The target sequence may be mRNA.


In certain embodiments, the target sequence should be associated with a PAM (protospacer adjacent motif) or PFS (protospacer flanking sequence or site); that is, a short sequence recognized by the CRISPR complex. Depending on the nature of the CRISPR-Cas protein, the target sequence should be selected such that its complementary sequence in the DNA duplex (also referred to herein as the non-target sequence) is upstream or downstream of the PAM. In the embodiments of the present invention where the CRISPR-Cas protein is a Cas13 protein, the compelementary sequence of the target sequence is downstream or 3′ of the PAM or upstream or 5′ of the PAM. The precise sequence and length requirements for the PAM differ depending on the Cas13 protein used, but PAMs are typically 2-5 base pair sequences adjacent the protospacer (that is, the target sequence). Examples of the natural PAM sequences for different Cas13 orthologues are provided herein below and the skilled person will be able to identify further PAM sequences for use with a given Cas13 protein.


Further, engineering of the PAM Interacting (PI) domain may allow programing of PAM specificity, improve target site recognition fidelity, and increase the versatility of the CRISPR-Cas protein, for example as described for Cas9 in Kleinstiver B P et al. Engineered CRISPR-Cas9 nucleases with altered PAM specificities. Nature. 2015 Jul. 23; 523(7561):481-5. doi: 10.1038/nature14592. As further detailed herein, the skilled person will understand that Cas13 proteins may be modified analogously.


In particular embodiment, the guide is an escorted guide. By “escorted” is meant that the CRISPR-Cas system or complex or guide is delivered to a selected time or place within a cell, so that activity of the CRISPR-Cas system or complex or guide is spatially or temporally controlled. For example, the activity and destination of the 3 CRISPR-Cas system or complex or guide may be controlled by an escort RNA aptamer sequence that has binding affinity for an aptamer ligand, such as a cell surface protein or other localized cellular component. Alternatively, the escort aptamer may for example be responsive to an aptamer effector on or in the cell, such as a transient effector, such as an external energy source that is applied to the cell at a particular time.


The escorted CRISPR-Cas systems or complexes have a guide molecule with a functional structure designed to improve guide molecule structure, architecture, stability, genetic expression, or any combination thereof. Such a structure can include an aptamer.


Aptamers are biomolecules that can be designed or selected to bind tightly to other ligands, for example using a technique called systematic evolution of ligands by exponential enrichment (SELEX; Tuerk C, Gold L: “Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase.” Science 1990, 249:505-510). Nucleic acid aptamers can for example be selected from pools of random-sequence oligonucleotides, with high binding affinities and specificities for a wide range of biomedically relevant targets, suggesting a wide range of therapeutic utilities for aptamers (Keefe, Anthony D., Supriya Pai, and Andrew Ellington. “Aptamers as therapeutics.” Nature Reviews Drug Discovery 9.7 (2010): 537-550). These characteristics also suggest a wide range of uses for aptamers as drug delivery vehicles (Levy-Nissenbaum, Etgar, et al. “Nanotechnology and aptamers: applications in drug delivery.” Trends in biotechnology 26.8 (2008): 442-449; and, Hicke B J, Stephens A W. “Escort aptamers: a delivery service for diagnosis and therapy.” J Clin Invest 2000, 106:923-928.). Aptamers may also be constructed that function as molecular switches, responding to a que by changing properties, such as RNA aptamers that bind fluorophores to mimic the activity of green flourescent protein (Paige, Jeremy S., Karen Y. Wu, and Samie R. Jaffrey. “RNA mimics of green fluorescent protein.” Science 333.6042 (2011): 642-646). It has also been suggested that aptamers may be used as components of targeted siRNA therapeutic delivery systems, for example targeting cell surface proteins (Zhou, Jiehua, and John J. Rossi. “Aptamer-targeted cell-specific RNA interference.” Silence 1.1 (2010): 4).


Accordingly, in particular embodiments, the guide molecule is modified, e.g., by one or more aptamer(s) designed to improve guide molecule delivery, including delivery across the cellular membrane, to intracellular compartments, or into the nucleus. Such a structure can include, either in addition to the one or more aptamer(s) or without such one or more aptamer(s), moiety(ies) so as to render the guide molecule deliverable, inducible or responsive to a selected effector. The invention accordingly comprehends an guide molecule that responds to normal or pathological physiological conditions, including without limitation pH, hypoxia, O2 concentration, temperature, protein concentration, enzymatic concentration, lipid structure, light exposure, mechanical disruption (e.g. ultrasound waves), magnetic fields, electric fields, or electromagnetic radiation.


Light responsiveness of an inducible system may be achieved via the activation and binding of cryptochrome-2 and CIB 1. Blue light stimulation induces an activating conformational change in cryptochrome-2, resulting in recruitment of its binding partner CIB 1. This binding is fast and reversible, achieving saturation in <15 sec following pulsed stimulation and returning to baseline <15 min after the end of stimulation. These rapid binding kinetics result in a system temporally bound only by the speed of transcription/translation and transcript/protein degradation, rather than uptake and clearance of inducing agents. Crytochrome-2 activation is also highly sensitive, allowing for the use of low light intensity stimulation and mitigating the risks of phototoxicity. Further, in a context such as the intact mammalian brain, variable light intensity may be used to control the size of a stimulated region, allowing for greater precision than vector delivery alone may offer.


The invention contemplates energy sources such as electromagnetic radiation, sound energy or thermal energy to induce the guide. Advantageously, the electromagnetic radiation is a component of visible light. In a preferred embodiment, the light is a blue light with a wavelength of about 450 to about 495 nm. In an especially preferred embodiment, the wavelength is about 488 nm. In another preferred embodiment, the light stimulation is via pulses. The light power may range from about 0-9 mW/cm2. In a preferred embodiment, a stimulation paradigm of as low as 0.25 sec every 15 sec should result in maximal activation.


The chemical or energy sensitive guide may undergo a conformational change upon induction by the binding of a chemical source or by the energy allowing it act as a guide and have the Cas13 CRISPR-Cas system or complex function. The invention can involve applying the chemical source or energy so as to have the guide function and the Cas13 CRISPR-Cas system or complex function; and optionally further determining that the expression of the genomic locus is altered.


There are several different designs of this chemical inducible system: 1. ABI-PYL based system inducible by Abscisic Acid (ABA) (see, e.g., stke.sciencemag.org/cgi/content/abstract/sigtrans; 4/164/rs2), 2. FKBP-FRB based system inducible by rapamycin (or related chemicals based on rapamycin) (see, e.g., www.nature.com/nmeth/journal/v2/n6/full/nmeth763.html), 3. GID1-GAI based system inducible by Gibberellin (GA) (see, e.g., www.nature.com/nchembio/journal/v8/n5/full/nchembio.922.html).


A chemical inducible system can be an estrogen receptor (ER) based system inducible by 4-hydroxytamoxifen (4OHT) (see, e.g., www.pnas.org/content/104/3/1027.abstract). A mutated ligand-binding domain of the estrogen receptor called ERT2 translocates into the nucleus of cells upon binding of 4-hydroxytamoxifen. In further embodiments of the invention any naturally occurring or engineered derivative of any nuclear receptor, thyroid hormone receptor, retinoic acid receptor, estrogren receptor, estrogen-related receptor, glucocorticoid receptor, progesterone receptor, androgen receptor may be used in inducible systems analogous to the ER based inducible system.


Another inducible system is based on the design using Transient receptor potential (TRP) ion channel based system inducible by energy, heat or radio-wave (see, e.g., www.sciencemag.org/content/336/6081/604). These TRP family proteins respond to different stimuli, including light and heat. When this protein is activated by light or heat, the ion channel will open and allow the entering of ions such as calcium into the plasma membrane. This influx of ions will bind to intracellular ion interacting partners linked to a polypeptide including the guide and the other components of the Cas13 CRISPR-Cas complex or system, and the binding will induce the change of sub-cellular localization of the polypeptide, leading to the entire polypeptide entering the nucleus of cells. Once inside the nucleus, the guide protein and the other components of the Cas13 CRISPR-Cas complex will be active and modulating target gene expression in cells.


While light activation may be an advantageous embodiment, sometimes it may be disadvantageous especially for in vivo applications in which the light may not penetrate the skin or other organs. In this instance, other methods of energy activation are contemplated, in particular, electric field energy and/or ultrasound which have a similar effect.


Electric field energy is preferably administered substantially as described in the art, using one or more electric pulses of from about 1 Volt/cm to about 10 kVolts/cm under in vivo conditions. Instead of or in addition to the pulses, the electric field may be delivered in a continuous manner. The electric pulse may be applied for between 1 μs and 500 milliseconds, preferably between 1 μs and 100 milliseconds. The electric field may be applied continuously or in a pulsed manner for 5 about minutes.


As used herein, ‘electric field energy’ is the electrical energy to which a cell is exposed. Preferably the electric field has a strength of from about 1 Volt/cm to about 10 kVolts/cm or more under in vivo conditions (see WO97/49450).


As used herein, the term “electric field” includes one or more pulses at variable capacitance and voltage and including exponential and/or square wave and/or modulated wave and/or modulated square wave forms. References to electric fields and electricity should be taken to include reference the presence of an electric potential difference in the environment of a cell. Such an environment may be set up by way of static electricity, alternating current (AC), direct current (DC), etc, as known in the art. The electric field may be uniform, non-uniform or otherwise, and may vary in strength and/or direction in a time dependent manner.


Single or multiple applications of electric field, as well as single or multiple applications of ultrasound are also possible, in any order and in any combination. The ultrasound and/or the electric field may be delivered as single or multiple continuous applications, or as pulses (pulsatile delivery).


Electroporation has been used in both in vitro and in vivo procedures to introduce foreign material into living cells. With in vitro applications, a sample of live cells is first mixed with the agent of interest and placed between electrodes such as parallel plates. Then, the electrodes apply an electrical field to the cell/implant mixture. Examples of systems that perform in vitro electroporation include the Electro Cell Manipulator ECM600 product, and the Electro Square Porator T820, both made by the BTX Division of Genetronics, Inc (see U.S. Pat. No. 5,869,326).


The known electroporation techniques (both in vitro and in vivo) function by applying a brief high voltage pulse to electrodes positioned around the treatment region. The electric field generated between the electrodes causes the cell membranes to temporarily become porous, whereupon molecules of the agent of interest enter the cells. In known electroporation applications, this electric field comprises a single square wave pulse on the order of 1000 V/cm, of about 100 .mu.s duration. Such a pulse may be generated, for example, in known applications of the Electro Square Porator T820.


Preferably, the electric field has a strength of from about 1 V/cm to about 10 kV/cm under in vitro conditions. Thus, the electric field may have a strength of 1 V/cm, 2 V/cm, 3 V/cm, 4 V/cm, 5 V/cm, 6 V/cm, 7 V/cm, 8 V/cm, 9 V/cm, 10 V/cm, 20 V/cm, 50 V/cm, 100 V/cm, 200 V/cm, 300 V/cm, 400 V/cm, 500 V/cm, 600 V/cm, 700 V/cm, 800 V/cm, 900 V/cm, 1 kV/cm, 2 kV/cm, 5 kV/cm, 10 kV/cm, 20 kV/cm, 50 kV/cm or more. More preferably from about 0.5 kV/cm to about 4.0 kV/cm under in vitro conditions. Preferably the electric field has a strength of from about 1 V/cm to about 10 kV/cm under in vivo conditions. However, the electric field strengths may be lowered where the number of pulses delivered to the target site are increased. Thus, pulsatile delivery of electric fields at lower field strengths is envisaged.


Preferably the application of the electric field is in the form of multiple pulses such as double pulses of the same strength and capacitance or sequential pulses of varying strength and/or capacitance. As used herein, the term “pulse” includes one or more electric pulses at variable capacitance and voltage and including exponential and/or square wave and/or modulated wave/square wave forms.


Preferably the electric pulse is delivered as a waveform selected from an exponential wave form, a square wave form, a modulated wave form and a modulated square wave form.


A preferred embodiment employs direct current at low voltage. Thus, Applicants disclose the use of an electric field which is applied to the cell, tissue or tissue mass at a field strength of between 1V/cm and 20V/cm, for a period of 100 milliseconds or more, preferably 15 minutes or more.


Ultrasound is advantageously administered at a power level of from about 0.05 W/cm2 to about 100 W/cm2. Diagnostic or therapeutic ultrasound may be used, or combinations thereof.


As used herein, the term “ultrasound” refers to a form of energy which consists of mechanical vibrations the frequencies of which are so high they are above the range of human hearing. Lower frequency limit of the ultrasonic spectrum may generally be taken as about 20 kHz. Most diagnostic applications of ultrasound employ frequencies in the range 1 and 15 MHz′ (From Ultrasonics in Clinical Diagnosis, P. N. T. Wells, ed., 2nd. Edition, Publ. Churchill Livingstone [Edinburgh, London & NY, 1977]).


Ultrasound has been used in both diagnostic and therapeutic applications. When used as a diagnostic tool (“diagnostic ultrasound”), ultrasound is typically used in an energy density range of up to about 100 mW/cm2 (FDA recommendation), although energy densities of up to 750 mW/cm2 have been used. In physiotherapy, ultrasound is typically used as an energy source in a range up to about 3 to 4 W/cm2 (WHO recommendation). In other therapeutic applications, higher intensities of ultrasound may be employed, for example, HIFU at 100 W/cm up to 1 kW/cm2 (or even higher) for short periods of time. The term “ultrasound” as used in this specification is intended to encompass diagnostic, therapeutic and focused ultrasound.


Focused ultrasound (FUS) allows thermal energy to be delivered without an invasive probe (see Morocz et al 1998 Journal of Magnetic Resonance Imaging Vol. 8, No. 1, pp. 136-142. Another form of focused ultrasound is high intensity focused ultrasound (HIFU) which is reviewed by Moussatov et al in Ultrasonics (1998) Vol. 36, No. 8, pp. 893-900 and TranHuuHue et al in Acustica (1997) Vol. 83, No. 6, pp. 1103-1106.


Preferably, a combination of diagnostic ultrasound and a therapeutic ultrasound is employed. This combination is not intended to be limiting, however, and the skilled reader will appreciate that any variety of combinations of ultrasound may be used. Additionally, the energy density, frequency of ultrasound, and period of exposure may be varied.


Preferably the exposure to an ultrasound energy source is at a power density of from about 0.05 to about 100 Wcm-2. Even more preferably, the exposure to an ultrasound energy source is at a power density of from about 1 to about 15 Wcm-2.


Preferably the exposure to an ultrasound energy source is at a frequency of from about 0.015 to about 10.0 MHz. More preferably the exposure to an ultrasound energy source is at a frequency of from about 0.02 to about 5.0 MHz or about 6.0 MHz. Most preferably, the ultrasound is applied at a frequency of 3 MHz.


Preferably the exposure is for periods of from about 10 milliseconds to about 60 minutes. Preferably the exposure is for periods of from about 1 second to about 5 minutes. More preferably, the ultrasound is applied for about 2 minutes. Depending on the particular target cell to be disrupted, however, the exposure may be for a longer duration, for example, for 15 minutes.


Advantageously, the target tissue is exposed to an ultrasound energy source at an acoustic power density of from about 0.05 Wcm-2 to about 10 Wcm-2 with a frequency ranging from about 0.015 to about 10 MHz (see WO 98/52609). However, alternatives are also possible, for example, exposure to an ultrasound energy source at an acoustic power density of above 100 Wcm-2, but for reduced periods of time, for example, 1000 Wcm-2 for periods in the millisecond range or less.


Preferably the application of the ultrasound is in the form of multiple pulses; thus, both continuous wave and pulsed wave (pulsatile delivery of ultrasound) may be employed in any combination. For example, continuous wave ultrasound may be applied, followed by pulsed wave ultrasound, or vice versa. This may be repeated any number of times, in any order and combination. The pulsed wave ultrasound may be applied against a background of continuous wave ultrasound, and any number of pulses may be used in any number of groups.


Preferably, the ultrasound may comprise pulsed wave ultrasound. In a highly preferred embodiment, the ultrasound is applied at a power density of 0.7 Wcm-2 or 1.25 Wcm-2 as a continuous wave. Higher power densities may be employed if pulsed wave ultrasound is used.


Use of ultrasound is advantageous as, like light, it may be focused accurately on a target. Moreover, ultrasound is advantageous as it may be focused more deeply into tissues unlike light. It is therefore better suited to whole-tissue penetration (such as but not limited to a lobe of the liver) or whole organ (such as but not limited to the entire liver or an entire muscle, such as the heart) therapy. Another important advantage is that ultrasound is a non-invasive stimulus which is used in a wide variety of diagnostic and therapeutic applications. By way of example, ultrasound is well known in medical imaging techniques and, additionally, in orthopedic therapy. Furthermore, instruments suitable for the application of ultrasound to a subject vertebrate are widely available and their use is well known in the art.


In particular embodiments, the guide molecule is modified by a secondary structure to increase the specificity of the CRISPR-Cas system and the secondary structure can protect against exonuclease activity and allow for 5′ additions to the guide sequence also referred to herein as a protected guide molecule.


In one aspect, the invention provides for hybridizing a “protector RNA” to a sequence of the guide molecule, wherein the “protector RNA” is an RNA strand complementary to the 3′ end of the guide molecule to thereby generate a partially double-stranded guide RNA. In an embodiment of the invention, protecting mismatched bases (i.e. the bases of the guide molecule which do not form part of the guide sequence) with a perfectly complementary protector sequence decreases the likelihood of target RNA binding to the mismatched basepairs at the 3′ end. In particular embodiments of the invention, additional sequences comprising an extented length may also be present within the guide molecule such that the guide comprises a protector sequence within the guide molecule. This “protector sequence” ensures that the guide molecule comprises a “protected sequence” in addition to an “exposed sequence” (comprising the part of the guide sequence hybridizing to the target sequence). In particular embodiments, the guide molecule is modified by the presence of the protector guide to comprise a secondary structure such as a hairpin. Advantageously there are three or four to thirty or more, e.g., about 10 or more, contiguous base pairs having complementarity to the protected sequence, the guide sequence or both. It is advantageous that the protected portion does not impede thermodynamics of the CRISPR-Cas system interacting with its target. By providing such an extension including a partially double stranded guide molecule, the guide molecule is considered protected and results in improved specific binding of the CRISPR-Cas complex, while maintaining specific activity.


In particular embodiments, use is made of a truncated guide (tru-guide), i.e. a guide molecule which comprises a guide sequence which is truncated in length with respect to the canonical guide sequence length. As described by Nowak et al. (Nucleic Acids Res (2016) 44 (20): 9555-9564), such guides may allow catalytically active CRISPR-Cas enzyme to bind its target without cleaving the target RNA. In particular embodiments, a truncated guide is used which allows the binding of the target but retains only nickase activity of the CRISPR-Cas enzyme.


CRISPR RNA-Targeting Effector Proteins

In one example embodiment, the CRISPR system effector protein is an RNA-targeting effector protein. In certain embodiments, the CRISPR system effector protein is a Type VI CRISPR system targeting RNA (e.g., Cas13a, Cas13b, Cas13c or Cas13d). Example RNA-targeting effector proteins include Cas13b and C2c2 (now known as Cas13a). It will be understood that the term “C2c2” herein is used interchangeably with “Cas13a”. “C2c2” is now referred to as “Cas13a”, and the terms are used interchangeably herein unless indicated otherwise. As used herein, the term “Cas13” refers to any Type VI CRISPR system targeting RNA (e.g., Cas13a, Cas13b, Cas13c or Cas13d). When the CRISPR protein is a C2c2 protein, a tracrRNA is not required. C2c2 has been described in Abudayyeh et al. (2016) “C2c2 is a single-component programmable RNA-guided RNA-targeting CRISPR effector”; Science; DOI: 10.1126/science.aaf5573; and Shmakov et al. (2015) “Discovery and Functional Characterization of Diverse Class 2 CRISPR-Cas Systems”, Molecular Cell, DOI: dx.doi.org/10.1016/j.molcel.2015.10.008; which are incorporated herein in their entirety by reference. Cas13b has been described in Smargon et al. (2017) “Cas13b Is a Type VI-B CRISPR-Associated RNA-Guided RNases Differentially Regulated by Accessory Proteins Csx27 and Csx28,” Molecular Cell. 65, 1-13; dx.doi.org/10.1016/j.molcel.2016.12.023., which is incorporated herein in its entirety by reference.


In some embodiments, one or more elements of a nucleic acid-targeting system is derived from a particular organism comprising an endogenous CRISPR RNA-targeting system. In certain example embodiments, the effector protein CRISPR RNA-targeting system comprises at least one HEPN domain, including but not limited to the HEPN domains described herein, HEPN domains known in the art, and domains recognized to be HEPN domains by comparison to consensus sequence motifs. Several such domains are provided herein. In one non-limiting example, a consensus sequence can be derived from the sequences of C2c2 or Cas13b orthologs provided herein. In certain example embodiments, the effector protein comprises a single HEPN domain. In certain other example embodiments, the effector protein comprises two HEPN domains.


In one example embodiment, the effector protein comprise one or more HEPN domains comprising a RxxxxH motif sequence. The RxxxxH motif sequence can be, without limitation, from a HEPN domain described herein or a HEPN domain known in the art. RxxxxH motif sequences further include motif sequences created by combining portions of two or more HEPN domains. As noted, consensus sequences can be derived from the sequences of the orthologs disclosed in U.S. Provisional Patent Application 62/432,240 entitled “Novel CRISPR Enzymes and Systems,” U.S. Provisional Patent Application 62/471,710 entitled “Novel Type VI CRISPR Orthologs and Systems” filed on Mar. 15, 2017, and U.S. Provisional Patent Application entitled “Novel Type VI CRISPR Orthologs and Systems,” labeled as attorney docket number 47627-05-2133 and filed on Apr. 12, 2017.


In certain other example embodiments, the CRISPR system effector protein is a C2c2 nuclease. The activity of C2c2 may depend on the presence of two HEPN domains. These have been shown to be RNase domains, i.e. nuclease (in particular an endonuclease) cutting RNA. C2c2 HEPN may also target DNA, or potentially DNA and/or RNA. On the basis that the HEPN domains of C2c2 are at least capable of binding to and, in their wild-type form, cutting RNA, then it is preferred that the C2c2 effector protein has RNase function. Regarding C2c2 CRISPR systems, reference is made to U.S. Provisional 62/351,662 filed on Jun. 17, 2016 and U.S. Provisional 62/376,377 filed on Aug. 17, 2016. Reference is also made to U.S. Provisional 62/351,803 filed on Jun. 17, 2016. Reference is also made to U.S. Provisional entitled “Novel Crispr Enzymes and Systems” filed Dec. 8, 2016 bearing Broad Institute No. 10035.PA4 and Attorney Docket No. 47627.03.2133. Reference is further made to East-Seletsky et al. “Two distinct RNase activities of CRISPR-C2c2 enable guide-RNA processing and RNA detection” Nature doi:10/1038/nature19802 and Abudayyeh et al. “C2c2 is a single-component programmable RNA-guided RNA targeting CRISPR effector” bioRxiv doi:10.1101/054742.


In certain embodiments, the C2c2 effector protein is from an organism of a genus selected from the group consisting of: Leptotrichia, Listeria, Corynebacter, Sutterella, Legionella, Treponema, Filifactor, Eubacterium, Streptococcus, Lactobacillus, Mycoplasma, Bacteroides, Flaviivola, Flavobacterium, Sphaerochaeta, Azospirillum, Gluconacetobacter, Neisseria, Roseburia, Parvibaculum, Staphylococcus, Nitratifractor, Mycoplasma, Campylobacter, and Lachnospira, or the C2c2 effector protein is an organism selected from the group consisting of: Leptotrichia shahii, Leptotrichia. wadei, Listeria seeligeri, Clostridium aminophilum, Carnobacterium gallinarum, Paludibacter propionicigenes, Listeria weihenstephanensis, or the C2c2 effector protein is a L. wadei F0279 or L. wadei F0279 (Lw2) C2C2 effector protein. In another embodiment, the one or more guide RNAs are designed to detect a single nucleotide polymorphism, splice variant of a transcript, or a frameshift mutation in a target RNA or DNA.


In certain example embodiments, the RNA-targeting effector protein is a Type VI-B effector protein, such as Cas13b and Group 29 or Group 30 proteins. In certain example embodiments, the RNA-targeting effector protein comprises one or more HEPN domains. In certain example embodiments, the RNA-targeting effector protein comprises a C-terminal HEPN domain, a N-terminal HEPN domain, or both. Regarding example Type VI-B effector proteins that may be used in the context of this invention, reference is made to U.S. application Ser. No. 15/331,792 entitled “Novel CRISPR Enzymes and Systems” and filed Oct. 21, 2016, International Patent Application No. PCT/US2016/058302 entitled “Novel CRISPR Enzymes and Systems”, and filed Oct. 21, 2016, and Smargon et al. “Cas13b is a Type VI-B CRISPR-associated RNA-Guided RNase differentially regulated by accessory proteins Csx27 and Csx28” Molecular Cell, 65, 1-13 (2017); dx.doi.org/10.1016/j.molcel.2016.12.023, and U.S. Provisional application No. to be assigned, entitled “Novel Cas13b Orthologues CRISPR Enzymes and System” filed Mar. 15, 2017. In particular embodiments, the Cas13b enzyme is derived from Bergeyella zoohelcum.


In certain example embodiments, the RNA-targeting effector protein is a Cas13c effector protein as disclosed in U.S. Provisional Patent Application No. 62/525,165 filed Jun. 26, 2017, and PCT Application No. US 2017/047193 filed Aug. 16, 2017.


In some embodiments, one or more elements of a nucleic acid-targeting system is derived from a particular organism comprising an endogenous CRISPR RNA-targeting system. In certain embodiments, the CRISPR RNA-targeting system is found in Eubacterium and Ruminococcus. In certain embodiments, the effector protein comprises targeted and collateral ssRNA cleavage activity. In certain embodiments, the effector protein comprises dual HEPN domains. In certain embodiments, the effector protein lacks a counterpart to the Helical-1 domain of Cas13a. In certain embodiments, the effector protein is smaller than previously characterized class 2 CRISPR effectors, with a median size of 928 aa. This median size is 190 aa (17%) less than that of Cas13c, more than 200 aa (18%) less than that of Cas13b, and more than 300 aa (26%) less than that of Cas13a. In certain embodiments, the effector protein has no requirement for a flanking sequence (e.g., PFS, PAM).


In certain embodiments, the effector protein locus structures include a WYL domain containing accessory protein (so denoted after three amino acids that were conserved in the originally identified group of these domains; see, e.g., WYL domain IPR026881). In certain embodiments, the WYL domain accessory protein comprises at least one helix-turn-helix (HTH) or ribbon-helix-helix (RHH) DNA-binding domain. In certain embodiments, the WYL domain containing accessory protein increases both the targeted and the collateral ssRNA cleavage activity of the RNA-targeting effector protein. In certain embodiments, the WYL domain containing accessory protein comprises an N-terminal RHH domain, as well as a pattern of primarily hydrophobic conserved residues, including an invariant tyrosine-leucine doublet corresponding to the original WYL motif. In certain embodiments, the WYL domain containing accessory protein is WYL1. WYL1 is a single WYL-domain protein associated primarily with Ruminococcus.


In other example embodiments, the Type VI RNA-targeting Cas enzyme is Cas13d. In certain embodiments, Cas13d is Eubacterium siraeum DSM 15702 (EsCas13d) or Ruminococcus sp. N15.MGS-57 (RspCas13d) (see, e.g., Yan et al., Cas13d Is a Compact RNA-Targeting Type VI CRISPR Effector Positively Modulated by a WYL-Domain-Containing Accessory Protein, Molecular Cell (2018), doi.org/10.1016/j.molcel.2018.02.028). RspCas13d and EsCas13d have no flanking sequence requirements (e.g., PFS, PAM).


Cas13 RNA Editing

In one aspect, the invention provides a method of modifying or editing a target transcript in a eukaryotic cell. In some embodiments, the method comprises allowing a CRISPR-Cas effector module complex to bind to the target polynucleotide to effect RNA base editing, wherein the CRISPR-Cas effector module complex comprises a Cas effector module complexed with a guide sequence hybridized to a target sequence within said target polynucleotide, wherein said guide sequence is linked to a direct repeat sequence. In some embodiments, the Cas effector module comprises a catalytically inactive CRISPR-Cas protein. In some embodiments, the guide sequence is designed to introduce one or more mismatches to the RNA/RNA duplex formed between the target sequence and the guide sequence. In particular embodiments, the mismatch is an A-C mismatch. In some embodiments, the Cas effector may associate with one or more functional domains (e.g. via fusion protein or suitable linkers). In some embodiments, the effector domain comprises one or more cytindine or adenosine deaminases that mediate endogenous editing of via hydrolytic deamination. In particular embodiments, the effector domain comprises the adenosine deaminase acting on RNA (ADAR) family of enzymes. In particular embodiments, the adenosine deaminase protein or catalytic domain thereof capable of deaminating adenosine or cytidine in RNA or is an RNA specific adenosine deaminase and/or is a bacterial, human, cephalopod, or Drosophila adenosine deaminase protein or catalytic domain thereof, preferably TadA, more preferably ADAR, optionally huADAR, optionally (hu)ADAR1 or (hu)ADAR2, preferably huADAR2 or catalytic domain thereof.


The present application relates to modifying a target RNA sequence of interest (see, e.g, Cox et al., Science. 2017 Nov. 24; 358(6366):1019-1027). Using RNA-targeting rather than DNA targeting offers several advantages relevant for therapeutic development. First, there are substantial safety benefits to targeting RNA: there will be fewer off-target events because the available sequence space in the transcriptome is significantly smaller than the genome, and if an off-target event does occur, it will be transient and less likely to induce negative side effects. Second, RNA-targeting therapeutics will be more efficient because they are cell-type independent and not have to enter the nucleus, making them easier to deliver.


A further aspect of the invention relates to the method and composition as envisaged herein for use in prophylactic or therapeutic treatment, preferably wherein said target locus of interest is within a human or animal and to methods of modifying an Adenine or Cytidine in a target RNA sequence of interest, comprising delivering to said target RNA, the composition as described herein. In particular embodiments, the CRISPR system and the adenonsine deaminase, or catalytic domain thereof, are delivered as one or more polynucleotide molecules, as a ribonucleoprotein complex, optionally via particles, vesicles, or one or more viral vectors. In particular embodiments, the invention thus comprises compositions for use in therapy. This implies that the methods can be performed in vivo, ex vivo or in vitro. In particular embodiments, when the target is a human or animal target, the method is carried out ex vivo or in vitro.


A further aspect of the invention relates to the method as envisaged herein for use in prophylactic or therapeutic treatment, preferably wherein said target of interest is within a human or animal and to methods of modifying an Adenine or Cytidine in a target RNA sequence of interest, comprising delivering to said target RNA, the composition as described herein. In particular embodiments, the CRISPR system and the adenonsine deaminase, or catalytic domain thereof, are delivered as one or more polynucleotide molecules, as a ribonucleoprotein complex, optionally via particles, vesicles, or one or more viral vectors.


In one aspect, the invention provides a method of generating a eukaryotic cell comprising a modified or edited gene. In some embodiments, the method comprises (a) introducing one or more vectors into a eukaryotic cell, wherein the one or more vectors drive expression of one or more of: Cas effector module, and a guide sequence linked to a direct repeat sequence, wherein the Cas effector module associate one or more effector domains that mediate base editing, and (b) allowing a CRISPR-Cas effector module complex to bind to a target polynucleotide to effect base editing of the target polynucleotide within said disease gene, wherein the CRISPR-Cas effector module complex comprises a Cas effector module complexed with the guide sequence that is hybridized to the target sequence within the target polynucleotide, wherein the guide sequence may be designed to introduce one or more mismatches between the RNA/RNA duplex formed between the guide sequence and the target sequence. In particular embodiments, the mismatch is an A-C mismatch. In some embodiments, the Cas effector may associate with one or more functional domains (e.g. via fusion protein or suitable linkers). In some embodiments, the effector domain comprises one or more cytidine or adenosine deaminases that mediate endogenous editing of via hydrolytic deamination. In particular embodiments, the effector domain comprises the adenosine deaminase acting on RNA (ADAR) family of enzymes. In particular embodiments, the adenosine deaminase protein or catalytic domain thereof capable of deaminating adenosine or cytidine in RNA or is an RNA specific adenosine deaminase and/or is a bacterial, human, cephalopod, or Drosophila adenosine deaminase protein or catalytic domain thereof, preferably TadA, more preferably ADAR, optionally huADAR, optionally (hu)ADAR1 or (hu)ADAR2, preferably huADAR2 or catalytic domain thereof.


A further aspect relates to an isolated cell obtained or obtainable from the methods described herein comprising the composition described herein or progeny of said modified cell, preferably wherein said cell comprises a hypoxanthine or a guanine in replace of said Adenine in said target RNA of interest compared to a corresponding cell not subjected to the method. In particular embodiments, the cell is a eukaryotic cell, preferably a human or non-human animal cell, optionally a therapeutic T cell or an antibody-producing B-cell.


In some embodiments, the modified cell is a therapeutic T cell, such as a T cell suitable for adoptive cell transfer therapies (e.g., CAR-T therapies). The modification may result in one or more desirable traits in the therapeutic T cell, as described further herein.


The invention further relates to a method for cell therapy, comprising administering to a patient in need thereof the modified cell described herein, wherein the presence of the modified cell remedies a disease in the patient.


The present invention may be further illustrated and extended based on aspects of CRISPR-Cas development and use as set forth in the following articles and particularly as relates to delivery of a CRISPR protein complex and uses of an RNA guided endonuclease in cells and organisms:

  • Multiplex genome engineering using CRISPR-Cas systems. Cong, L., Ran, F. A., Cox, D., Lin, S., Barretto, R., Habib, N., Hsu, P. D., Wu, X., Jiang, W., Marraffini, L. A., & Zhang, F. Science February 15; 339(6121):819-23 (2013);
  • RNA-guided editing of bacterial genomes using CRISPR-Cas systems. Jiang W., Bikard D., Cox D., Zhang F, Marraffini L A. Nat Biotechnol March; 31(3):233-9 (2013);
  • One-Step Generation of Mice Carrying Mutations in Multiple Genes by CRISPR-Cas-Mediated Genome Engineering. Wang H., Yang H., Shivalila C S., Dawlaty M M., Cheng A W., Zhang F., Jaenisch R. Cell May 9; 153(4):910-8 (2013);
  • Optical control of mammalian endogenous transcription and epigenetic states. Konermann S, Brigham M D, Trevino A E, Hsu P D, Heidenreich M, Cong L, Platt R J, Scott D A, Church G M, Zhang F. Nature. August 22; 500(7463):472-6. doi: 10.1038/Nature12466. Epub 2013 Aug. 23 (2013);
  • Double Nicking by RNA-Guided CRISPR Cas9 for Enhanced Genome Editing Specificity. Ran, F A., Hsu, P D., Lin, C Y., Gootenberg, J S., Konermann, S., Trevino, A E., Scott, D A., Inoue, A., Matoba, S., Zhang, Y., & Zhang, F. Cell August 28. pii: 50092-8674(13)01015-5 (2013-A);
  • DNA targeting specificity of RNA-guided Cas9 nucleases. Hsu, P., Scott, D., Weinstein, J., Ran, F A., Konermann, S., Agarwala, V., Li, Y., Fine, E., Wu, X., Shalem, O., Cradick, T J., Marraffini, L A., Bao, G., & Zhang, F. Nat Biotechnol doi:10.1038/nbt.2647 (2013);
  • Genome engineering using the CRISPR-Cas9 system. Ran, F A., Hsu, P D., Wright, J., Agarwala, V., Scott, D A., Zhang, F. Nature Protocols November; 8(11):2281-308 (2013-B);
  • Genome-Scale CRISPR-Cas9 Knockout Screening in Human Cells. Shalem, O., Sanjana, N E., Hartenian, E., Shi, X., Scott, D A., Mikkelson, T., Heckl, D., Ebert, B L., Root, D E., Doench, J G., Zhang, F. Science December 12. (2013);
  • Crystal structure of cas9 in complex with guide RNA and target DNA. Nishimasu, H., Ran, F A., Hsu, P D., Konermann, S., Shehata, S I., Dohmae, N., Ishitani, R., Zhang, F., Nureki, O. Cell February 27, 156(5):935-49 (2014);
  • Genome-wide binding of the CRISPR endonuclease Cas9 in mammalian cells. Wu X., Scott D A., Kriz A J., Chiu A C., Hsu P D., Dadon D B., Cheng A W., Trevino A E., Konermann S., Chen S., Jaenisch R., Zhang F., Sharp P A. Nat Biotechnol. April 20. doi: 10.1038/nbt.2889 (2014);
  • CRISPR-Cas9 Knockin Mice for Genome Editing and Cancer Modeling. Platt R J, Chen S, Zhou Y, Yim M J, Swiech L, Kempton H R, Dahlman J E, Parnas O, Eisenhaure T M, Jovanovic M, Graham D B, Jhunjhunwala S, Heidenreich M, Xavier R J, Langer R, Anderson D G, Hacohen N, Regev A, Feng G, Sharp P A, Zhang F. Cell 159(2): 440-455 DOI: 10.1016/j.cell.2014.09.014(2014);
  • Development and Applications of CRISPR-Cas9 for Genome Engineering, Hsu P D, Lander E S, Zhang F., Cell. June 5; 157(6):1262-78 (2014).
  • Genetic screens in human cells using the CRISPR-Cas9 system, Wang T, Wei J J, Sabatini D M, Lander E S., Science. January 3; 343(6166): 80-84. doi:10.1126/science.1246981 (2014);
  • Rational design of highly active sgRNAs for CRISPR-Cas9-mediated gene inactivation, Doench J G, Hartenian E, Graham D B, Tothova Z, Hegde M, Smith I, Sullender M, Ebert B L, Xavier R J, Root D E., (published online 3 Sep. 2014) Nat Biotechnol. December; 32(12):1262-7 (2014);
  • In vivo interrogation of gene function in the mammalian brain using CRISPR-Cas9, Swiech L, Heidenreich M, Banerjee A, Habib N, Li Y, Trombetta J, Sur M, Zhang F., (published online 19 Oct. 2014) Nat Biotechnol. January; 33(1):102-6 (2015);
  • Genome-scale transcriptional activation by an engineered CRISPR-Cas9 complex, Konermann S, Brigham M D, Trevino A E, Joung J, Abudayyeh O O, Barcena C, Hsu P D, Habib N, Gootenberg J S, Nishimasu H, Nureki O, Zhang F., Nature. January 29; 517(7536):583-8 (2015).
  • A split-Cas9 architecture for inducible genome editing and transcription modulation, Zetsche B, Volz S E, Zhang F., (published online 2 Feb. 2015) Nat Biotechnol. February; 33(2):139-42 (2015);
  • Genome-wide CRISPR Screen in a Mouse Model of Tumor Growth and Metastasis, Chen S, Sanjana N E, Zheng K, Shalem O, Lee K, Shi X, Scott D A, Song J, Pan J Q, Weissleder R, Lee H, Zhang F, Sharp P A. Cell 160, 1246-1260, Mar. 12, 2015 (multiplex screen in mouse), and
  • In vivo genome editing using Staphylococcus aureus Cas9, Ran F A, Cong L, Yan W X, Scott D A, Gootenberg J S, Kriz A J, Zetsche B, Shalem O, Wu X, Makarova K S, Koonin E V, Sharp P A, Zhang F., (published online 1 Apr. 2015), Nature. April 9; 520(7546):186-91 (2015).
  • Shalem et al., “High-throughput functional genomics using CRISPR-Cas9,” Nature Reviews Genetics 16, 299-311 (May 2015).
  • Xu et al., “Sequence determinants of improved CRISPR sgRNA design,” Genome Research 25, 1147-1157 (August 2015).
  • Parnas et al., “A Genome-wide CRISPR Screen in Primary Immune Cells to Dissect Regulatory Networks,” Cell 162, 675-686 (Jul. 30, 2015).
  • Ramanan et al., CRISPR-Cas9 cleavage of viral DNA efficiently suppresses hepatitis B virus,” Scientific Reports 5:10833. doi: 10.1038/srep10833 (Jun. 2, 2015)
  • Nishimasu et al., Crystal Structure of Staphylococcus aureus Cas9,” Cell 162, 1113-1126 (Aug. 27, 2015)
  • BCL11A enhancer dissection by Cas9-mediated in situ saturating mutagenesis, Canver et al., Nature 527(7577):192-7 (Nov. 12, 2015) doi: 10.1038/nature15521. Epub 2015 Sep. 16.
  • Cpf1 Is a Single RNA-Guided Endonuclease of a Class 2 CRISPR-Cas System, Zetsche et al., Cell 163, 759-71 (Sep. 25, 2015).
  • Discovery and Functional Characterization of Diverse Class 2 CRISPR-Cas Systems, Shmakov et al., Molecular Cell, 60(3), 385-397 doi: 10.1016/j.molcel.2015.10.008 Epub Oct. 22, 2015.
  • Rationally engineered Cas9 nucleases with improved specificity, Slaymaker et al., Science 2016 Jan. 1 351(6268): 84-88 doi: 10.1126/science.aad5227. Epub 2015 Dec. 1.
  • Gao et al, “Engineered Cpf1 Enzymes with Altered PAM Specificities,” bioRxiv 091611; doi: http://dx.doi.org/10.1101/091611 (Dec. 4, 2016).
  • Cox et al., “RNA editing with CRISPR-Cas13,” Science. 2017 Nov. 24; 358(6366):1019 1027. doi: 10.1126/science.aag0180. Epub 2017 Oct. 25.


each of which is incorporated herein by reference, may be considered in the practice of the instant invention, and discussed briefly below:

    • Cong et al. engineered type II CRISPR-Cas systems for use in eukaryotic cells based on both Streptococcus thermophilus Cas9 and also Streptococcus pyogenes Cas9 and demonstrated that Cas9 nucleases can be directed by short RNAs to induce precise cleavage of DNA in human and mouse cells. Their study further showed that Cas9 as converted into a nicking enzyme can be used to facilitate homology-directed repair in eukaryotic cells with minimal mutagenic activity. Additionally, their study demonstrated that multiple guide sequences can be encoded into a single CRISPR array to enable simultaneous editing of several at endogenous genomic loci sites within the mammalian genome, demonstrating easy programmability and wide applicability of the RNA-guided nuclease technology. This ability to use RNA to program sequence specific DNA cleavage in cells defined a new class of genome engineering tools. These studies further showed that other CRISPR loci are likely to be transplantable into mammalian cells and can also mediate mammalian genome cleavage. Importantly, it can be envisaged that several aspects of the CRISPR-Cas system can be further improved to increase its efficiency and versatility.
    • Jiang et al. used the clustered, regularly interspaced, short palindromic repeats (CRISPR)-associated Cas9 endonuclease complexed with dual-RNAs to introduce precise mutations in the genomes of Streptococcus pneumoniae and Escherichia coli. The approach relied on dual-RNA:Cas9-directed cleavage at the targeted genomic site to kill unmutated cells and circumvents the need for selectable markers or counter-selection systems. The study reported reprogramming dual-RNA:Cas9 specificity by changing the sequence of short CRISPR RNA (crRNA) to make single- and multinucleotide changes carried on editing templates. The study showed that simultaneous use of two crRNAs enabled multiplex mutagenesis. Furthermore, when the approach was used in combination with recombineering, in S. pneumoniae, nearly 100% of cells that were recovered using the described approach contained the desired mutation, and in E. coli, 65% that were recovered contained the mutation.
    • Wang et al. (2013) used the CRISPR-Cas system for the one-step generation of mice carrying mutations in multiple genes which were traditionally generated in multiple steps by sequential recombination in embryonic stem cells and/or time-consuming intercrossing of mice with a single mutation. The CRISPR-Cas system will greatly accelerate the in vivo study of functionally redundant genes and of epistatic gene interactions.
    • Konermann et al. (2013) addressed the need in the art for versatile and robust technologies that enable optical and chemical modulation of DNA-binding domains based CRISPR Cas9 enzyme and also Transcriptional Activator Like Effectors
    • Ran et al. (2013-A) described an approach that combined a Cas9 nickase mutant with paired guide RNAs to introduce targeted double-strand breaks. This addresses the issue of the Cas9 nuclease from the microbial CRISPR-Cas system being targeted to specific genomic loci by a guide sequence, which can tolerate certain mismatches to the DNA target and thereby promote undesired off-target mutagenesis. Because individual nicks in the genome are repaired with high fidelity, simultaneous nicking via appropriately offset guide RNAs is required for double-stranded breaks and extends the number of specifically recognized bases for target cleavage. The authors demonstrated that using paired nicking can reduce off-target activity by 50- to 1,500-fold in cell lines and to facilitate gene knockout in mouse zygotes without sacrificing on-target cleavage efficiency. This versatile strategy enables a wide variety of genome editing applications that require high specificity.
    • Hsu et al. (2013) characterized SpCas9 targeting specificity in human cells to inform the selection of target sites and avoid off-target effects. The study evaluated >700 guide RNA variants and SpCas9-induced indel mutation levels at >100 predicted genomic off-target loci in 293T and 293FT cells. The authors that SpCas9 tolerates mismatches between guide RNA and target DNA at different positions in a sequence-dependent manner, sensitive to the number, position and distribution of mismatches. The authors further showed that SpCas9-mediated cleavage is unaffected by DNA methylation and that the dosage of SpCas9 and guide RNA can be titrated to minimize off-target modification. Additionally, to facilitate mammalian genome engineering applications, the authors reported providing a web-based software tool to guide the selection and validation of target sequences as well as off-target analyses.
    • Ran et al. (2013-B) described a set of tools for Cas9-mediated genome editing via non-homologous end joining (NHEJ) or homology-directed repair (HDR) in mammalian cells, as well as generation of modified cell lines for downstream functional studies. To minimize off-target cleavage, the authors further described a double-nicking strategy using the Cas9 nickase mutant with paired guide RNAs. The protocol provided by the authors experimentally derived guidelines for the selection of target sites, evaluation of cleavage efficiency and analysis of off-target activity. The studies showed that beginning with target design, gene modifications can be achieved within as little as 1-2 weeks, and modified clonal cell lines can be derived within 2-3 weeks.
    • Shalem et al. described a new way to interrogate gene function on a genome-wide scale. Their studies showed that delivery of a genome-scale CRISPR-Cas9 knockout (GeCKO) library targeted 18,080 genes with 64,751 unique guide sequences enabled both negative and positive selection screening in human cells. First, the authors showed use of the GeCKO library to identify genes essential for cell viability in cancer and pluripotent stem cells. Next, in a melanoma model, the authors screened for genes whose loss is involved in resistance to vemurafenib, a therapeutic that inhibits mutant protein kinase BRAF. Their studies showed that the highest-ranking candidates included previously validated genes NF 1 and MED 12 as well as novel hits NF2, CUL3, TADA2B, and TADA 1. The authors observed a high level of consistency between independent guide RNAs targeting the same gene and a high rate of hit confirmation, and thus demonstrated the promise of genome-scale screening with Cas9.
    • Nishimasu et al. reported the crystal structure of Streptococcus pyogenes Cas9 in complex with sgRNA and its target DNA at 2.5 A° resolution. The structure revealed a bibbed architecture composed of target recognition and nuclease lobes, accommodating the sgRNA:DNA heteroduplex in a positively charged groove at their interface. Whereas the recognition lobe is essential for binding sgRNA and DNA, the nuclease lobe contains the HNH and RuvC nuclease domains, which are properly positioned for cleavage of the complementary and non-complementary strands of the target DNA, respectively. The nuclease lobe also contains a carboxyl-terminal domain responsible for the interaction with the protospacer adjacent motif (PAM). This high-resolution structure and accompanying functional analyses have revealed the molecular mechanism of RNA-guided DNA targeting by Cas9, thus paving the way for the rational design of new, versatile genome-editing technologies.
    • Wu et al. mapped genome-wide binding sites of a catalytically inactive Cas9 (dCas9) from Streptococcus pyogenes loaded with single guide RNAs (sgRNAs) in mouse embryonic stem cells (mESCs). The authors showed that each of the four sgRNAs tested targets dCas9 to between tens and thousands of genomic sites, frequently characterized by a 5-nucleotide seed region in the sgRNA and an NGG protospacer adjacent motif (PAM). Chromatin inaccessibility decreases dCas9 binding to other sites with matching seed sequences; thus 70% of off-target sites are associated with genes. The authors showed that targeted sequencing of 295 dCas9 binding sites in mESCs transfected with catalytically active Cas9 identified only one site mutated above background levels. The authors proposed a two-state model for Cas9 binding and cleavage, in which a seed match triggers binding but extensive pairing with target DNA is required for cleavage.
    • Platt et al. established a Cre-dependent Cas9 knockin mouse. The authors demonstrated in vivo as well as ex vivo genome editing using adeno-associated virus (AAV)-, lentivirus-, or particle-mediated delivery of guide RNA in neurons, immune cells, and endothelial cells.
    • Hsu et al. (2014) is a review article that discusses generally CRISPR-Cas9 history from yogurt to genome editing, including genetic screening of cells.
    • Wang et al. (2014) relates to a pooled, loss-of-function genetic screening approach suitable for both positive and negative selection that uses a genome-scale lentiviral single guide RNA (sgRNA) library.
    • Doench et al. created a pool of sgRNAs, tiling across all possible target sites of a panel of six endogenous mouse and three endogenous human genes and quantitatively assessed their ability to produce null alleles of their target gene by antibody staining and flow cytometry. The authors showed that optimization of the PAM improved activity and also provided an on-line tool for designing sgRNAs.
    • Swiech et al. demonstrate that AAV-mediated SpCas9 genome editing can enable reverse genetic studies of gene function in the brain.
    • Konermann et al. (2015) discusses the ability to attach multiple effector domains, e.g., transcriptional activator, functional and epigenomic regulators at appropriate positions on the guide such as stem or tetraloop with and without linkers.
    • Zetsche et al. demonstrates that the Cas9 enzyme can be split into two and hence the assembly of Cas9 for activation can be controlled.
    • Chen et al. relates to multiplex screening by demonstrating that a genome-wide in vivo CRISPR-Cas9 screen in mice reveals genes regulating lung metastasis.
    • Ran et al. (2015) relates to SaCas9 and its ability to edit genomes and demonstrates that one cannot extrapolate from biochemical assays.
    • Shalem et al. (2015) described ways in which catalytically inactive Cas9 (dCas9) fusions are used to synthetically repress (CRISPRi) or activate (CRISPRa) expression, showing. advances using Cas9 for genome-scale screens, including arrayed and pooled screens, knockout approaches that inactivate genomic loci and strategies that modulate transcriptional activity.
    • Xu et al. (2015) assessed the DNA sequence features that contribute to single guide RNA (sgRNA) efficiency in CRISPR-based screens. The authors explored efficiency of CRISPR-Cas9 knockout and nucleotide preference at the cleavage site. The authors also found that the sequence preference for CRISPRi/a is substantially different from that for CRISPR-Cas9 knockout.
    • Parnas et al. (2015) introduced genome-wide pooled CRISPR-Cas9 libraries into dendritic cells (DCs) to identify genes that control the induction of tumor necrosis factor (Tnf) by bacterial lipopolysaccharide (LPS). Known regulators of Tlr4 signaling and previously unknown candidates were identified and classified into three functional modules with distinct effects on the canonical responses to LPS.
    • Ramanan et al (2015) demonstrated cleavage of viral episomal DNA (cccDNA) in infected cells. The HBV genome exists in the nuclei of infected hepatocytes as a 3.2 kb double-stranded episomal DNA species called covalently closed circular DNA (cccDNA), which is a key component in the HBV life cycle whose replication is not inhibited by current therapies. The authors showed that sgRNAs specifically targeting highly conserved regions of HBV robustly suppresses viral replication and depleted cccDNA.
    • Nishimasu et al. (2015) reported the crystal structures of SaCas9 in complex with a single guide RNA (sgRNA) and its double-stranded DNA targets, containing the 5′-TTGAAT-3′ PAM and the 5′-TTGGGT-3′ PAM. A structural comparison of SaCas9 with SpCas9 highlighted both structural conservation and divergence, explaining their distinct PAM specificities and orthologous sgRNA recognition.
    • Canver et al. (2015) demonstrated a CRISPR-Cas9-based functional investigation of non-coding genomic elements. The authors we developed pooled CRISPR-Cas9 guide RNA libraries to perform in situ saturating mutagenesis of the human and mouse BCL11A enhancers which revealed critical features of the enhancers.
    • Zetsche et al. (2015) reported characterization of Cpf1, a class 2 CRISPR nuclease from Francisella novicida U112 having features distinct from Cas9. Cpf1 is a single RNA-guided endonuclease lacking tracrRNA, utilizes a T-rich protospacer-adjacent motif, and cleaves DNA via a staggered DNA double-stranded break.
    • Shmakov et al. (2015) reported three distinct Class 2 CRISPR-Cas systems. Two system CRISPR enzymes (C2c1 and C2c3) contain RuvC-like endonuclease domains distantly related to Cpf1. Unlike Cpf1, C2c1 depends on both crRNA and tracrRNA for DNA cleavage. The third enzyme (C2c2) contains two predicted HEPN RNase domains and is tracrRNA independent.
    • Slaymaker et al (2016) reported the use of structure-guided protein engineering to improve the specificity of Streptococcus pyogenes Cas9 (SpCas9). The authors developed “enhanced specificity” SpCas9 (eSpCas9) variants which maintained robust on-target cleavage with reduced off-target effects.
    • Cox et al., (2017) reported the use of catalytically inactive Cas13 (dCas13) to direct adenosine-to-inosine deaminase activity by ADAR2 (adenosine deaminase acting on RNA type 2) to transcripts in mammalian cells. The system, referred to as RNA Editing for Programmable A to I Replacement (REPAIR), has no strict sequence constraints and can be used to edit full-length transcripts. The authors further engineered the system to create a high-specificity variant and minimized the system to facilitate viral delivery.


The methods and tools provided herein are may be designed for use with or Cas13, a type II nuclease that does not make use of tracrRNA. Orthologs of Cas13 have been identified in different bacterial species as described herein. Further type II nucleases with similar properties can be identified using methods described in the art (Shmakov et al. 2015, 60:385-397; Abudayeh et al. 2016, Science, 5; 353(6299)). In particular embodiments, such methods for identifying novel CRISPR effector proteins may comprise the steps of selecting sequences from the database encoding a seed which identifies the presence of a CRISPR Cas locus, identifying loci located within 10 kb of the seed comprising Open Reading Frames (ORFs) in the selected sequences, selecting therefrom loci comprising ORFs of which only a single ORF encodes a novel CRISPR effector having greater than 700 amino acids and no more than 90% homology to a known CRISPR effector. In particular embodiments, the seed is a protein that is common to the CRISPR-Cas system, such as Cas1. In further embodiments, the CRISPR array is used as a seed to identify new effector proteins.


Also, “Dimeric CRISPR RNA-guided FokI nucleases for highly specific genome editing”, Shengdar Q. Tsai, Nicolas Wyvekens, Cyd Khayter, Jennifer A. Foden, Vishal Thapar, Deepak Reyon, Mathew J. Goodwin, Martin J. Aryee, J. Keith Joung Nature Biotechnology 32(6): 569-77 (2014), relates to dimeric RNA-guided Fold Nucleases that recognize extended sequences and can edit endogenous genes with high efficiencies in human cells.


With respect to general information on CRISPR/Cas Systems, components thereof, and delivery of such components, including methods, materials, delivery vehicles, vectors, particles, and making and using thereof, including as to amounts and formulations, as well as CRISPR-Cas-expressing eukaryotic cells, CRISPR-Cas expressing eukaryotes, such as a mouse, reference is made to: U.S. Pat. Nos. 8,999,641, 8,993,233, 8,697,359, 8,771,945, 8,795,965, 8,865,406, 8,871,445, 8,889,356, 8,889,418, 8,895,308, 8,906,616, 8,932,814, and 8,945,839; US Patent Publications US 2014-0310830 (U.S. application Ser. No. 14/105,031), US 2014-0287938 A1 (U.S. application Ser. No. 14/213,991), US 2014-0273234 A1 (U.S. application Ser. No. 14/293,674), US2014-0273232 A1 (U.S. application Ser. No. 14/290,575), US 2014-0273231 (U.S. application Ser. No. 14/259,420), US 2014-0256046 A1 (U.S. application Ser. No. 14/226,274), US 2014-0248702 A1 (U.S. application Ser. No. 14/258,458), US 2014-0242700 A1 (U.S. application Ser. No. 14/222,930), US 2014-0242699 A1 (U.S. application Ser. No. 14/183,512), US 2014-0242664 A1 (U.S. application Ser. No. 14/104,990), US 2014-0234972 A1 (U.S. application Ser. No. 14/183,471), US 2014-0227787 A1 (U.S. application Ser. No. 14/256,912), US 2014-0189896 A1 (U.S. application Ser. No. 14/105,035), US 2014-0186958 (U.S. application Ser. No. 14/105,017), US 2014-0186919 A1 (U.S. application Ser. No. 14/104,977), US 2014-0186843 A1 (U.S. application Ser. No. 14/104,900), US 2014-0179770 A1 (U.S. application Ser. No. 14/104,837) and US 2014-0179006 A1 (U.S. application Ser. No. 14/183,486), US 2014-0170753 (U.S. application Ser. No. 14/183,429); US 2015-0184139 (U.S. application Ser. No. 14/324,960); Ser. No. 14/054,414 European Patent Applications EP 2 771 468 (EP13818570.7), EP 2 764 103 (EP13824232.6), and EP 2 784 162 (EP14170383.5); and PCT Patent Publications WO2014/093661 (PCT/US2013/074743), WO2014/093694 (PCT/US2013/074790), WO2014/093595 (PCT/US2013/074611), WO2014/093718 (PCT/US2013/074825), WO2014/093709 (PCT/US2013/074812), WO2014/093622 (PCT/US2013/074667), WO2014/093635 (PCT/US2013/074691), WO2014/093655 (PCT/US2013/074736), WO2014/093712 (PCT/US2013/074819), WO2014/093701 (PCT/US2013/074800), WO2014/018423 (PCT/US2013/051418), WO2014/204723 (PCT/US2014/041790), WO2014/204724 (PCT/US2014/041800), WO2014/204725 (PCT/US2014/041803), WO2014/204726 (PCT/US2014/041804), WO2014/204727 (PCT/US2014/041806), WO2014/204728 (PCT/US2014/041808), WO2014/204729 (PCT/US2014/041809), WO2015/089351 (PCT/US2014/069897), WO2015/089354 (PCT/US2014/069902), WO2015/089364 (PCT/US2014/069925), WO2015/089427 (PCT/US2014/070068), WO2015/089462 (PCT/US2014/070127), WO2015/089419 (PCT/US2014/070057), WO2015/089465 (PCT/US2014/070135), WO2015/089486 (PCT/US2014/070175), WO2015/058052 (PCT/US2014/061077), WO2015/070083 (PCT/US2014/064663), WO2015/089354 (PCT/US2014/069902), WO2015/089351 (PCT/US2014/069897), WO2015/089364 (PCT/US2014/069925), WO2015/089427 (PCT/US2014/070068), WO2015/089473 (PCT/US2014/070152), WO2015/089486 (PCT/US2014/070175), WO2016/049258 (PCT/US2015/051830), WO2016/094867 (PCT/US2015/065385), WO2016/094872 (PCT/US2015/065393), WO2016/094874 (PCT/US2015/065396), WO2016/106244 (PCT/US2015/067177).


Mention is also made of U.S. application 62/180,709, 17 Jun. 2015, PROTECTED GUIDE RNAS (PGRNAS); U.S. application 62/091,455, filed, 12 Dec. 2014, PROTECTED GUIDE RNAS (PGRNAS); U.S. application 62/096,708, 24 Dec. 2014, PROTECTED GUIDE RNAS (PGRNAS); U.S. applications 62/091,462, 12 Dec. 2014, 62/096,324, 23 Dec. 2014, 62/180,681, 17 Jun. 2015, and 62/237,496, 5 Oct. 2015, DEAD GUIDES FOR CRISPR TRANSCRIPTION FACTORS; U.S. application 62/091,456, 12 Dec. 2014 and 62/180,692, 17 Jun. 2015, ESCORTED AND FUNCTIONALIZED GUIDES FOR CRISPR-CAS SYSTEMS; U.S. application 62/091,461, 12 Dec. 2014, DELIVERY, USE AND THERAPEUTIC APPLICATIONS OF THE CRISPR-CAS SYSTEMS AND COMPOSITIONS FOR GENOME EDITING AS TO HEMATOPOETIC STEM CELLS (HSCs); U.S. application 62/094,903, 19 Dec. 2014, UNBIASED IDENTIFICATION OF DOUBLE-STRAND BREAKS AND GENOMIC REARRANGEMENT BY GENOME-WISE INSERT CAPTURE SEQUENCING; U.S. application 62/096,761, 24 Dec. 2014, ENGINEERING OF SYSTEMS, METHODS AND OPTIMIZED ENZYME AND GUIDE SCAFFOLDS FOR SEQUENCE MANIPULATION; U.S. application 62/098,059, 30 Dec. 2014, 62/181,641, 18 Jun. 2015, and 62/181,667, 18 Jun. 2015, RNA-TARGETING SYSTEM; U.S. application 62/096,656, 24 Dec. 2014 and 62/181,151, 17 Jun. 2015, CRISPR HAVING OR ASSOCIATED WITH DESTABILIZATION DOMAINS; U.S. application 62/096,697, 24 Dec. 2014, CRISPR HAVING OR ASSOCIATED WITH AAV; U.S. application 62/098,158, 30 Dec. 2014, ENGINEERED CRISPR COMPLEX INSERTIONAL TARGETING SYSTEMS; U.S. application 62/151,052, 22 Apr. 2015, CELLULAR TARGETING FOR EXTRACELLULAR EXOSOMAL REPORTING; U.S. application 62/054,490, 24 Sep. 2014, DELIVERY, USE AND THERAPEUTIC APPLICATIONS OF THE CRISPR-CAS SYSTEMS AND COMPOSITIONS FOR TARGETING DISORDERS AND DISEASES USING PARTICLE DELIVERY COMPONENTS; U.S. application 61/939,154, 12 Feb. 2014, SYSTEMS, METHODS AND COMPOSITIONS FOR SEQUENCE MANIPULATION WITH OPTIMIZED FUNCTIONAL CRISPR-CAS SYSTEMS; U.S. application 62/055,484, 25 Sep. 2014, SYSTEMS, METHODS AND COMPOSITIONS FOR SEQUENCE MANIPULATION WITH OPTIMIZED FUNCTIONAL CRISPR-CAS SYSTEMS; U.S. application 62/087,537, 4 Dec. 2014, SYSTEMS, METHODS AND COMPOSITIONS FOR SEQUENCE MANIPULATION WITH OPTIMIZED FUNCTIONAL CRISPR-CAS SYSTEMS; U.S. application 62/054,651, 24 Sep. 2014, DELIVERY, USE AND THERAPEUTIC APPLICATIONS OF THE CRISPR-CAS SYSTEMS AND COMPOSITIONS FOR MODELING COMPETITION OF MULTIPLE CANCER MUTATIONS IN VIVO; U.S. application 62/067,886, 23 Oct. 2014, DELIVERY, USE AND THERAPEUTIC APPLICATIONS OF THE CRISPR-CAS SYSTEMS AND COMPOSITIONS FOR MODELING COMPETITION OF MULTIPLE CANCER MUTATIONS IN VIVO; U.S. applications 62/054,675, 24 Sep. 2014 and 62/181,002, 17 Jun. 2015, DELIVERY, USE AND THERAPEUTIC APPLICATIONS OF THE CRISPR-CAS SYSTEMS AND COMPOSITIONS IN NEURONAL CELLS/TISSUES; U.S. application 62/054,528, 24 Sep. 2014, DELIVERY, USE AND THERAPEUTIC APPLICATIONS OF THE CRISPR-CAS SYSTEMS AND COMPOSITIONS IN IMMUNE DISEASES OR DISORDERS; U.S. application 62/055,454, 25 Sep. 2014, DELIVERY, USE AND THERAPEUTIC APPLICATIONS OF THE CRISPR-CAS SYSTEMS AND COMPOSITIONS FOR TARGETING DISORDERS AND DISEASES USING CELL PENETRATION PEPTIDES (CPP); U.S. application 62/055,460, 25 Sep. 2014, MULTIFUNCTIONAL-CRISPR COMPLEXES AND/OR OPTIMIZED ENZYME LINKED FUNCTIONAL-CRISPR COMPLEXES; U.S. application 62/087,475, 4 Dec. 2014 and 62/181,690, 18 Jun. 2015, FUNCTIONAL SCREENING WITH OPTIMIZED FUNCTIONAL CRISPR-CAS SYSTEMS; U.S. application 62/055,487, 25 Sep. 2014, FUNCTIONAL SCREENING WITH OPTIMIZED FUNCTIONAL CRISPR-CAS SYSTEMS; U.S. application 62/087,546, 4 Dec. 2014 and 62/181,687, 18 Jun. 2015, MULTIFUNCTIONAL CRISPR COMPLEXES AND/OR OPTIMIZED ENZYME LINKED FUNCTIONAL-CRISPR COMPLEXES; and U.S. application 62/098,285, 30 Dec. 2014, CRISPR MEDIATED IN VIVO MODELING AND GENETIC SCREENING OF TUMOR GROWTH AND METASTASIS.


Mention is made of U.S. applications 62/181,659, 18 Jun. 2015 and 62/207,318, 19 Aug. 2015, ENGINEERING AND OPTIMIZATION OF SYSTEMS, METHODS, ENZYME AND GUIDE SCAFFOLDS OF CAS9 ORTHOLOGS AND VARIANTS FOR SEQUENCE MANIPULATION. Mention is made of U.S. applications 62/181,663, 18 Jun. 2015 and 62/245,264, 22 Oct. 2015, NOVEL CRISPR ENZYMES AND SYSTEMS, U.S. applications 62/181,675, 18 Jun. 2015, 62/285,349, 22 Oct. 2015, 62/296,522, 17 Feb. 2016, and 62/320,231, 8 Apr. 2016, NOVEL CRISPR ENZYMES AND SYSTEMS, U.S. application 62/232,067, 24 Sep. 2015, U.S. application Ser. No. 14/975,085, 18 Dec. 2015, European application No. 16150428.7, U.S. application 62/205,733, 16 Aug. 2015, U.S. application 62/201,542, 5 Aug. 2015, U.S. application 62/193,507, 16 Jul. 2015, and U.S. application 62/181,739, 18 Jun. 2015, each entitled NOVEL CRISPR ENZYMES AND SYSTEMS and of U.S. application 62/245,270, 22 Oct. 2015, NOVEL CRISPR ENZYMES AND SYSTEMS. Mention is also made of U.S. application 61/939,256, 12 Feb. 2014, and WO 2015/089473 (PCT/US2014/070152), 12 Dec. 2014, each entitled ENGINEERING OF SYSTEMS, METHODS AND OPTIMIZED GUIDE COMPOSITIONS WITH NEW ARCHITECTURES FOR SEQUENCE MANIPULATION. Mention is also made of PCT/US2015/045504, 15 Aug. 2015, U.S. application 62/180,699, 17 Jun. 2015, and U.S. application 62/038,358, 17 Aug. 2014, each entitled GENOME EDITING USING CAS9 NICKASES.


Each of these patents, patent publications, and applications, and all documents cited therein or during their prosecution (“appln cited documents”) and all documents cited or referenced in the appln cited documents, together with any instructions, descriptions, product specifications, and product sheets for any products mentioned therein or in any document therein and incorporated by reference herein, are hereby incorporated herein by reference, and may be employed in the practice of the invention. All documents (e.g., these patents, patent publications and applications and the appln cited documents) are incorporated herein by reference to the same extent as if each individual document was specifically and individually indicated to be incorporated by reference.


In particular embodiments, pre-complexed guide RNA and CRISPR effector protein, (optionally, adenosine deaminase fused to a CRISPR protein or an adaptor) are delivered as a ribonucleoprotein (RNP). RNPs have the advantage that they lead to rapid editing effects even more so than the RNA method because this process avoids the need for transcription. An important advantage is that both RNP delivery is transient, reducing off-target effects and toxicity issues. Efficient genome editing in different cell types has been observed by Kim et al. (2014, Genome Res. 24(6):1012-9), Paix et al. (2015, Genetics 204(1):47-54), Chu et al. (2016, BMC Biotechnol. 16:4), and Wang et al. (2013, Cell. 9; 153(4):910-8).


In particular embodiments, the ribonucleoprotein is delivered by way of a polypeptide-based shuttle agent as described in WO2016161516. WO2016161516 describes efficient transduction of polypeptide cargos using synthetic peptides comprising an endosome leakage domain (ELD) operably linked to a cell penetrating domain (CPD), to a histidine-rich domain and a CPD. Similarly these polypeptides can be used for the delivery of CRISPR-effector based RNPs in eukaryotic cells.


Tale Systems

As disclosed herein editing can be made by way of the transcription activator-like effector nucleases (TALENs) system. Transcription activator-like effectors (TALEs) can be engineered to bind practically any desired DNA sequence. Exemplary methods of genome editing using the TALEN system can be found for example in Cermak T. Doyle E L. Christian M. Wang L. Zhang Y. Schmidt C, et al. Efficient design and assembly of custom TALEN and other TAL effector-based constructs for DNA targeting. Nucleic Acids Res. 2011; 39:e82; Zhang F. Cong L. Lodato S. Kosuri S. Church G M. Arlotta P Efficient construction of sequence-specific TAL effectors for modulating mammalian transcription. Nat Biotechnol. 2011; 29:149-153 and U.S. Pat. Nos. 8,450,471, 8,440,431 and 8,440,432, all of which are specifically incorporated by reference.


In advantageous embodiments of the invention, the methods provided herein use isolated, non-naturally occurring, recombinant or engineered DNA binding proteins that comprise TALE monomers as a part of their organizational structure that enable the targeting of nucleic acid sequences with improved efficiency and expanded specificity.


Naturally occurring TALEs or “wild type TALEs” are nucleic acid binding proteins secreted by numerous species of proteobacteria. TALE polypeptides contain a nucleic acid binding domain composed of tandem repeats of highly conserved monomer polypeptides that are predominantly 33, 34 or 35 amino acids in length and that differ from each other mainly in amino acid positions 12 and 13. In advantageous embodiments the nucleic acid is DNA. As used herein, the term “polypeptide monomers”, or “TALE monomers” will be used to refer to the highly conserved repetitive polypeptide sequences within the TALE nucleic acid binding domain and the term “repeat variable di-residues” or “RVD” will be used to refer to the highly variable amino acids at positions 12 and 13 of the polypeptide monomers. As provided throughout the disclosure, the amino acid residues of the RVD are depicted using the IUPAC single letter code for amino acids. A general representation of a TALE monomer which is comprised within the DNA binding domain is X1-11-(X12X13)-X14-33 or 34 or 35, where the subscript indicates the amino acid position and X represents any amino acid. X12X13 indicate the RVDs. In some polypeptide monomers, the variable amino acid at position 13 is missing or absent and in such polypeptide monomers, the RVD consists of a single amino acid. In such cases the RVD may be alternatively represented as X*, where X represents X12 and (*) indicates that X13 is absent. The DNA binding domain comprises several repeats of TALE monomers and this may be represented as (X1-11-(X12X13)-X14-33 or 34 or 35)z, where in an advantageous embodiment, z is at least 5 to 40. In a further advantageous embodiment, z is at least 10 to 26.


The TALE monomers have a nucleotide binding affinity that is determined by the identity of the amino acids in its RVD. For example, polypeptide monomers with an RVD of NI preferentially bind to adenine (A), polypeptide monomers with an RVD of NG preferentially bind to thymine (T), polypeptide monomers with an RVD of HD preferentially bind to cytosine (C) and polypeptide monomers with an RVD of NN preferentially bind to both adenine (A) and guanine (G). In yet another embodiment of the invention, polypeptide monomers with an RVD of IG preferentially bind to T. Thus, the number and order of the polypeptide monomer repeats in the nucleic acid binding domain of a TALE determines its nucleic acid target specificity. In still further embodiments of the invention, polypeptide monomers with an RVD of NS recognize all four base pairs and may bind to A, T, G or C. The structure and function of TALEs is further described in, for example, Moscou et al., Science 326:1501 (2009); Boch et al., Science 326:1509-1512 (2009); and Zhang et al., Nature Biotechnology 29:149-153 (2011), each of which is incorporated by reference in its entirety.


The TALE polypeptides used in methods of the invention are isolated, non-naturally occurring, recombinant or engineered nucleic acid-binding proteins that have nucleic acid or DNA binding regions containing polypeptide monomer repeats that are designed to target specific nucleic acid sequences.


As described herein, polypeptide monomers having an RVD of HN or NH preferentially bind to guanine and thereby allow the generation of TALE polypeptides with high binding specificity for guanine containing target nucleic acid sequences. In a preferred embodiment of the invention, polypeptide monomers having RVDs RN, NN, NK, SN, NH, KN, HN, NQ, HH, RG, KH, RH and SS preferentially bind to guanine. In a much more advantageous embodiment of the invention, polypeptide monomers having RVDs RN, NK, NQ, HH, KH, RH, SS and SN preferentially bind to guanine and thereby allow the generation of TALE polypeptides with high binding specificity for guanine containing target nucleic acid sequences. In an even more advantageous embodiment of the invention, polypeptide monomers having RVDs HH, KH, NH, NK, NQ, RH, RN and SS preferentially bind to guanine and thereby allow the generation of TALE polypeptides with high binding specificity for guanine containing target nucleic acid sequences. In a further advantageous embodiment, the RVDs that have high binding specificity for guanine are RN, NH RH and KH. Furthermore, polypeptide monomers having an RVD of NV preferentially bind to adenine and guanine. In more preferred embodiments of the invention, polypeptide monomers having RVDs of H*, HA, KA, N*, NA, NC, NS, RA, and S* bind to adenine, guanine, cytosine and thymine with comparable affinity.


The predetermined N-terminal to C-terminal order of the one or more polypeptide monomers of the nucleic acid or DNA binding domain determines the corresponding predetermined target nucleic acid sequence to which the TALE polypeptides will bind. As used herein the polypeptide monomers and at least one or more half polypeptide monomers are “specifically ordered to target” the genomic locus or gene of interest. In plant genomes, the natural TALE-binding sites always begin with a thymine (T), which may be specified by a cryptic signal within the non-repetitive N-terminus of the TALE polypeptide; in some cases this region may be referred to as repeat 0. In animal genomes, TALE binding sites do not necessarily have to begin with a thymine (T) and TALE polypeptides may target DNA sequences that begin with T, A, G or C. The tandem repeat of TALE monomers always ends with a half-length repeat or a stretch of sequence that may share identity with only the first 20 amino acids of a repetitive full length TALE monomer and this half repeat may be referred to as a half-monomer (FIG. 8), which is included in the term “TALE monomer”. Therefore, it follows that the length of the nucleic acid or DNA being targeted is equal to the number of full polypeptide monomers plus two.


As described in Zhang et al., Nature Biotechnology 29:149-153 (2011), TALE polypeptide binding efficiency may be increased by including amino acid sequences from the “capping regions” that are directly N-terminal or C-terminal of the DNA binding region of naturally occurring TALEs into the engineered TALEs at positions N-terminal or C-terminal of the engineered TALE DNA binding region. Thus, in certain embodiments, the TALE polypeptides described herein further comprise an N-terminal capping region and/or a C-terminal capping region.


An exemplary amino acid sequence of a N-terminal capping region is:









(SEQ ID No. 3)


M D P I R S R T P S P A R E L L S G P Q P D G V Q





P T A D R G V S P P A G G P L D G L P A R R T M S





R T R L P S P P A P S P A F S A D S F S D L L R Q





F D P S L F N T S L F D S L P P F G A H H T E A A





T G E W D E V Q S G L R A A D A P P P T M R V A V





T A A R P P R A K P A P R R R A A Q P S D A S P A





A Q V D L R T L G Y S Q Q Q Q E K I K P K V R S T





V A Q H H E A L V G H G F T H A H I V A L S Q H P





A A L G T V A V K Y Q D M I A A L P E A T H E A I





V G V G K Q W S G A R A L E A L L T V A G E L R G





P P L Q L D T G Q L L K I A K R G G V T A V E A V





H A W R N A L T G A P L N 







An exemplary amino acid sequence of a C-terminal capping region is:









(SEQ ID No. 4)


R P A L E S I V A Q L S R P D P A L A A L T N D H





L V A L A C L G G R P A L D A V K K G L P H A P A





L I K R T N R R I P E R T S H R V A D H A Q V V R





V L G F F Q C H S H P A Q A F D D A M T Q F G M S





R H G L L Q L F R R V G V T E L E A R S G T L P P





A S Q R W D R I L Q A S G M K R A K P S P T S T Q





T P D Q A S L H A F A D S L E R D L D A P S P M H





E G D Q T R A S 






As used herein the predetermined “N-terminus” to “C terminus” orientation of the N-terminal capping region, the DNA binding domain comprising the repeat TALE monomers and the C-terminal capping region provide structural basis for the organization of different domains in the d-TALEs or polypeptides of the invention.


The entire N-terminal and/or C-terminal capping regions are not necessary to enhance the binding activity of the DNA binding region. Therefore, in certain embodiments, fragments of the N-terminal and/or C-terminal capping regions are included in the TALE polypeptides described herein.


In certain embodiments, the TALE polypeptides described herein contain a N-terminal capping region fragment that included at least 10, 20, 30, 40, 50, 54, 60, 70, 80, 87, 90, 94, 100, 102, 110, 117, 120, 130, 140, 147, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260 or 270 amino acids of an N-terminal capping region. In certain embodiments, the N-terminal capping region fragment amino acids are of the C-terminus (the DNA-binding region proximal end) of an N-terminal capping region. As described in Zhang et al., Nature Biotechnology 29:149-153 (2011), N-terminal capping region fragments that include the C-terminal 240 amino acids enhance binding activity equal to the full length capping region, while fragments that include the C-terminal 147 amino acids retain greater than 80% of the efficacy of the full length capping region, and fragments that include the C-terminal 117 amino acids retain greater than 50% of the activity of the full-length capping region.


In some embodiments, the TALE polypeptides described herein contain a C-terminal capping region fragment that included at least 6, 10, 20, 30, 37, 40, 50, 60, 68, 70, 80, 90, 100, 110, 120, 127, 130, 140, 150, 155, 160, 170, 180 amino acids of a C-terminal capping region. In certain embodiments, the C-terminal capping region fragment amino acids are of the N-terminus (the DNA-binding region proximal end) of a C-terminal capping region. As described in Zhang et al., Nature Biotechnology 29:149-153 (2011), C-terminal capping region fragments that include the C-terminal 68 amino acids enhance binding activity equal to the full length capping region, while fragments that include the C-terminal 20 amino acids retain greater than 50% of the efficacy of the full length capping region.


In certain embodiments, the capping regions of the TALE polypeptides described herein do not need to have identical sequences to the capping region sequences provided herein. Thus, in some embodiments, the capping region of the TALE polypeptides described herein have sequences that are at least 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identical or share identity to the capping region amino acid sequences provided herein. Sequence identity is related to sequence homology. Homology comparisons may be conducted by eye, or more usually, with the aid of readily available sequence comparison programs. These commercially available computer programs may calculate percent (%) homology between two or more sequences and may also calculate the sequence identity shared by two or more amino acid or nucleic acid sequences. In some preferred embodiments, the capping region of the TALE polypeptides described herein have sequences that are at least 95% identical or share identity to the capping region amino acid sequences provided herein.


Sequence homologies may be generated by any of a number of computer programs known in the art, which include but are not limited to BLAST or FASTA. Suitable computer program for carrying out alignments like the GCG Wisconsin Bestfit package may also be used. Once the software has produced an optimal alignment, it is possible to calculate % homology, preferably % sequence identity. The software typically does this as part of the sequence comparison and generates a numerical result.


In advantageous embodiments described herein, the TALE polypeptides of the invention include a nucleic acid binding domain linked to the one or more effector domains. The terms “effector domain” or “regulatory and functional domain” refer to a polypeptide sequence that has an activity other than binding to the nucleic acid sequence recognized by the nucleic acid binding domain. By combining a nucleic acid binding domain with one or more effector domains, the polypeptides of the invention may be used to target the one or more functions or activities mediated by the effector domain to a particular target DNA sequence to which the nucleic acid binding domain specifically binds.


In some embodiments of the TALE polypeptides described herein, the activity mediated by the effector domain is a biological activity. For example, in some embodiments the effector domain is a transcriptional inhibitor (i.e., a repressor domain), such as an mSin interaction domain (SID). SID4X domain or a Kruppel-associated box (KRAB) or fragments of the KRAB domain. In some embodiments the effector domain is an enhancer of transcription (i.e. an activation domain), such as the VP16, VP64 or p65 activation domain. In some embodiments, the nucleic acid binding is linked, for example, with an effector domain that includes but is not limited to a transposase, integrase, recombinase, resolvase, invertase, protease, DNA methyltransferase, DNA demethylase, histone acetylase, histone deacetylase, nuclease, transcriptional repressor, transcriptional activator, transcription factor recruiting, protein nuclear-localization signal or cellular uptake signal.


In some embodiments, the effector domain is a protein domain which exhibits activities which include but are not limited to transposase activity, integrase activity, recombinase activity, resolvase activity, invertase activity, protease activity, DNA methyltransferase activity, DNA demethylase activity, histone acetylase activity, histone deacetylase activity, nuclease activity, nuclear-localization signaling activity, transcriptional repressor activity, transcriptional activator activity, transcription factor recruiting activity, or cellular uptake signaling activity. Other preferred embodiments of the invention may include any combination the activities described herein.


ZN-Finger Nucleases

Other preferred tools for genome editing for use in the context of this invention include zinc finger systems. One type of programmable DNA-binding domain is provided by artificial zinc-finger (ZF) technology, which involves arrays of ZF modules to target new DNA-binding sites in the genome. Each finger module in a ZF array targets three DNA bases. A customized array of individual zinc finger domains is assembled into a ZF protein (ZFP).


ZFPs can comprise a functional domain. The first synthetic zinc finger nucleases (ZFNs) were developed by fusing a ZF protein to the catalytic domain of the Type IIS restriction enzyme FokI. (Kim, Y. G. et al., 1994, Chimeric restriction endonuclease, Proc. Natl. Acad. Sci. U.S.A. 91, 883-887; Kim, Y. G. et al., 1996, Hybrid restriction enzymes: zinc finger fusions to Fok I cleavage domain. Proc. Natl. Acad. Sci. U.S.A. 93, 1156-1160). Increased cleavage specificity can be attained with decreased off target activity by use of paired ZFN heterodimers, each targeting different nucleotide sequences separated by a short spacer. (Doyon, Y. et al., 2011, Enhancing zinc-finger-nuclease activity with improved obligate heterodimeric architectures. Nat. Methods 8, 74-79). ZFPs can also be designed as transcription activators and repressors and have been used to target many genes in a wide variety of organisms. Exemplary methods of genome editing using ZFNs can be found for example in U.S. Pat. Nos. 6,534,261, 6,607,882, 6,746,838, 6,794,136, 6,824,978, 6,866,997, 6,933,113, 6,979,539, 7,013,219, 7,030,215, 7,220,719, 7,241,573, 7,241,574, 7,585,849, 7,595,376, 6,903,185, and 6,479,626, all of which are specifically incorporated by reference.


Meganucleases

As disclosed herein editing can be made by way of meganucleases, which are endodeoxyribonucleases characterized by a large recognition site (double-stranded DNA sequences of 12 to 40 base pairs). Exemplary method for using meganucleases can be found in U.S. Pat. Nos. 8,163,514; 8,133,697; 8,021,867; 8,119,361; 8,119,381; 8,124,369; and 8,129,134, which are specifically incorporated by reference.


RNAi

In certain embodiments, the genetic modifying agent is RNAi (e.g., shRNA). As used herein, “gene silencing” or “gene silenced” in reference to an activity of an RNAi molecule, for example a siRNA or miRNA refers to a decrease in the mRNA level in a cell for a target gene by at least about 5%, about 10%, about 20%, about 30%, about 40%, about 50%, about 60%, about 70%, about 80%, about 90%, about 95%, about 99%, about 100% of the mRNA level found in the cell without the presence of the miRNA or RNA interference molecule. In one preferred embodiment, the mRNA levels are decreased by at least about 70%, about 80%, about 90%, about 95%, about 99%, about 100%.


As used herein, the term “RNAi” refers to any type of interfering RNA, including but not limited to, siRNAi, shRNAi, endogenous microRNA and artificial microRNA. For instance, it includes sequences previously identified as siRNA, regardless of the mechanism of down-stream processing of the RNA (i.e. although siRNAs are believed to have a specific method of in vivo processing resulting in the cleavage of mRNA, such sequences can be incorporated into the vectors in the context of the flanking sequences described herein). The term “RNAi” can include both gene silencing RNAi molecules, and also RNAi effector molecules which activate the expression of a gene.


As used herein, a “siRNA” refers to a nucleic acid that forms a double stranded RNA, which double stranded RNA has the ability to reduce or inhibit expression of a gene or target gene when the siRNA is present or expressed in the same cell as the target gene. The double stranded RNA siRNA can be formed by the complementary strands. In one embodiment, a siRNA refers to a nucleic acid that can form a double stranded siRNA. The sequence of the siRNA can correspond to the full-length target gene, or a subsequence thereof. Typically, the siRNA is at least about 15-50 nucleotides in length (e.g., each complementary sequence of the double stranded siRNA is about 15-50 nucleotides in length, and the double stranded siRNA is about 15-50 base pairs in length, preferably about 19-30 base nucleotides, preferably about 20-25 nucleotides in length, e.g., 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides in length).


As used herein “shRNA” or “small hairpin RNA” (also called stem loop) is a type of siRNA. In one embodiment, these shRNAs are composed of a short, e.g. about 19 to about 25 nucleotide, antisense strand, followed by a nucleotide loop of about 5 to about 9 nucleotides, and the analogous sense strand. Alternatively, the sense strand can precede the nucleotide loop structure and the antisense strand can follow.


The terms “microRNA” or “miRNA” are used interchangeably herein are endogenous RNAs, some of which are known to regulate the expression of protein-coding genes at the posttranscriptional level. Endogenous microRNAs are small RNAs naturally present in the genome that are capable of modulating the productive utilization of mRNA. The term artificial microRNA includes any type of RNA sequence, other than endogenous microRNA, which is capable of modulating the productive utilization of mRNA. MicroRNA sequences have been described in publications such as Lim, et al., Genes & Development, 17, p. 991-1008 (2003), Lim et al Science 299, 1540 (2003), Lee and Ambros Science, 294, 862 (2001), Lau et al., Science 294, 858-861 (2001), Lagos-Quintana et al, Current Biology, 12, 735-739 (2002), Lagos Quintana et al, Science 294, 853-857 (2001), and Lagos-Quintana et al, RNA, 9, 175-179 (2003), which are incorporated by reference. Multiple microRNAs can also be incorporated into a precursor molecule. Furthermore, miRNA-like stem-loops can be expressed in cells as a vehicle to deliver artificial miRNAs and short interfering RNAs (siRNAs) for the purpose of modulating the expression of endogenous genes through the miRNA and or RNAi pathways.


As used herein, “double stranded RNA” or “dsRNA” refers to RNA molecules that are comprised of two strands. Double-stranded molecules include those comprised of a single RNA molecule that doubles back on itself to form a two-stranded structure. For example, the stem loop structure of the progenitor molecules from which the single-stranded miRNA is derived, called the pre-miRNA (Bartel et al. 2004. Cell 1 16:281-297), comprises a dsRNA molecule.


Transcriptional Activation/Repression

In certain embodiments, an immunomodulant may comprise (i) a DNA-binding portion configured to specifically bind to the endogenous gene and (ii) an effector domain mediating a biological activity.


In certain embodiments, the DNA-binding portion may comprise a zinc finger protein or DNA-binding domain thereof, a transcription activator-like effector (TALE) protein or DNA-binding domain thereof, or an RNA-guided protein or DNA-binding domain thereof.


In certain embodiments, the DNA-binding portion may comprise (i) Cas9 or Cpf1 or any Cas protein described herein modified to eliminate its nuclease activity, or (ii) DNA-binding domain of Cas9 or Cpf1 or any Cas protein described herein.


In some embodiments, the effector domain may be a transcriptional inhibitor (i.e., a repressor domain), such as an mSin interaction domain (SID). SID4X domain or a Kruppel-associated box (KRAB) or fragments of the KRAB domain. In some embodiments, the effector domain may be an enhancer of transcription (i.e. an activation domain), such as the VP16, VP64 or p65 activation domain. In some embodiments, the nucleic acid binding portion may be linked, for example, with an effector domain that includes but is not limited to a transposase, integrase, recombinase, resolvase, invertase, protease, DNA methyltransferase, DNA demethylase, histone acetylase, histone deacetylase, nuclease, transcriptional repressor, transcriptional activator, transcription factor recruiting, protein nuclear-localization signal or cellular uptake signal. In some embodiments, the effector domain may be a protein domain which exhibits activities which include but are not limited to transposase activity, integrase activity, recombinase activity, resolvase activity, invertase activity, protease activity, DNA methyltransferase activity, DNA demethylase activity, histone acetylase activity, histone deacetylase activity, nuclease activity, nuclear-localization signaling activity, transcriptional repressor activity, transcriptional activator activity, transcription factor recruiting activity, or cellular uptake signaling activity. Other preferred embodiments of the invention may include any combination the activities described herein.


Antibody Drug Conjugate

In certain embodiments, the agent capable of specifically binding to a gene product expressed on the cell surface of the immune cell is an antibody.


By means of an example, an agent, such as an antibody, capable of specifically binding to a gene product expressed on the cell surface of the immune cells may be conjugated with a therapeutic or effector agent for targeted delivery of the therapeutic or effector agent to the immune cells.


Examples of such therapeutic or effector agents include immunomodulatory classes as discussed herein, such as without limitation a toxin, drug, radionuclide, cytokine, lymphokine, chemokine, growth factor, tumor necrosis factor, hormone, hormone antagonist, enzyme, oligonucleotide, siRNA, RNAi, photoactive therapeutic agent, anti-angiogenic agent and pro-apoptotic agent.


Example toxins include ricin, abrin, alpha toxin, saporin, ribonuclease (RNase), DNase I, Staphylococcal enterotoxin-A, pokeweed antiviral protein, gelonin, diphtheria toxin, Pseudomonas exotoxin, or Pseudomonas endotoxin.


Example radionuclides include 103mRh, 103Ru, 105Rh, 105Ru, 107Hg, 109Pd, 109Pt, 111Ag, 111In, 113mIn 119Sb, 11C, 121mTe, 122mTe, 125I, 125mTe, 126I, 131I, 133I, 13N, 142Pr, 143Pr, 149Pm, 152Dy, 153Sm, 15O, 161Ho, 161Tb, 165Tm, 166Dy, 166Ho, 167Tm, 168Tm, 169Er, 169Yb, 177Lu, 186Re, 188Re, 189mOs, 189Re, 192Ir, 194Ir, 197Pt, 198Au, 199Au, 201Tl, 203Hg, 211At, 211Bi, 211Pb, 212Bi, 212Pb, 213Bi, 215Po, 217At, 219Rn, 221Fr, 223Ra, 224Ac, 225Ac, 225Fm, 32P, 33P, 47Sc, 51Cr, 57Co, 58Co, 59Fe, 62Cu, 67Cu, 67Ga, 75Br, 75Se, 76Br, 77As, 77Br, 80mBr, 89Sr, 90Y, 95Ru, 97Ru, 99Mo or 99mTc. Preferably, the radionuclide may be an alpha-particle-emitting radionuclide.


Example enzymes include malate dehydrogenase, staphylococcal nuclease, delta-V-steroid isomerase, yeast alcohol dehydrogenase, alpha-glycerophosphate dehydrogenase, triose phosphate isomerase, horseradish peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase or acetylcholinesterase. Such enzymes may be used, for example, in combination with prodrugs that are administered in relatively non-toxic form and converted at the target site by the enzyme into a cytotoxic agent. In other alternatives, a drug may be converted into less toxic form by endogenous enzymes in the subject but may be reconverted into a cytotoxic form by the therapeutic enzyme.


By means of an example, an agent, such as a bi-specific antibody, capable of specifically binding to a gene product expressed on the cell surface of suppressive immune cells and another cell may be used for targeting suppressive immune cells away from TILs and/or a tumor.


Targeting Suppressive T cells


In another aspect, detecting or quantifying CD8+ T cells may be used to select a treatment for a subject in need thereof. In certain embodiments, subjects comprising suppressive T cells as described herein are treated with an immunotherapy (e.g., checkpoint blockade therapy). Not being bound by a theory, the suppressive T cells express coinhibitory receptors (e.g., checkpoint proteins) that can be specifically targeted. The checkpoint blockade therapy may be an inhibitor of any check point protein described herein. The checkpoint blockade therapy may comprise anti-TIM3, anti-CTLA4, anti-PD-L1, anti-PD1, anti-TIGIT, anti-LAG3, or combinations thereof. Specific check point inhibitors include, but are not limited to anti-CTLA4 antibodies (e.g., Ipilimumab), anti-PD-1 antibodies (e.g., Nivolumab, Pembrolizumab), and anti-PD-L1 antibodies (e.g., Atezolizumab).


The treatment may involve transferring CAR T cells to a patient. The CAR T cells may be modified such that they are resistant to suppression by the CD8+ T cells of the present invention.


Bhlhe40 is also known as BHLHB2, Clast5, DEC1, HLHB2, SHARP-2, SHARP2, STRA13 and Stra14. As used herein Bhlhe40 refers to the human gene, mouse gene and all other orthologues. Bhlhe40 may refer to the gene identified by accession number NM_003670.2. DEC1 is a basic helix-loop-helix transcription factor that is known to be highly induced in a CD28-dependent manner upon T cell activation (Martinez-Llordella et al. “CD28-inducible transcription factor DEC1 is required for efficient autoreactive CD4+ T cell response.” J Exp Med. 2013 Jul. 29; 210(8):1603-19. doi: 10.1084/jem.20122387. Epub 2013 Jul. 22). DEC1 is required for the development of experimental autoimmune encephalomyelitis and plays a critical role in the production of the proinflammatory cytokines GM-CSF, IFNγ, and IL-2 (Martinez-Llordella, 2013). Applicants previously demonstrated that DEC1 has a role in promoting pathogenic Th17 differentiation (see, WO2015130968A2). Applicants have discovered that Bhlhe40 is upregulated in suppressive T cells and may therefore be targeted for downregulation in order to enhance an immune response.


IKZF2 is also known as ANF1A2, HELIOS, ZNF1A2, ZNFN1A2. As used herein Helios refers to the human gene, mouse gene and all other orthologues. Helios may refer to the gene identified by accession numbers NM_016260.2, NM_001079526.1 and NM_011770.4. Helios is a T cell-specific zinc finger transcription factor that is encoded by the Ikzf2 gene. It belongs to the Ikaros family of zinc finger proteins, which also includes Ikaros (Ikzf1), Aiolos (Ikzf3), loos (Ikzf4), and Pegasus (Ikzf5), Helios, along with other Ikaros proteins, regulate lymphocyte development and differentiation. Helios has been shown to have specific roles in Tregs (Nakagawa et al., Instability of Helios-deficient Tregs is associated with conversion to a T-effector phenotype and enhanced antitumor immunity, Proc Natl Acad Sci USA. 2016 May 31; 113(22).6248-53; and Kim et al., Stable inhibitory activity of regulatory T cells requires the transcription factor Helios, Science. 2015 Oct. 16; 350(6258):334-9). Applicants have shown a role for Helios in a specific suppressive T cell population (i.e., cluster 7). Not being bound by a theory, targeting Helios in specific T cells can enhance treatment and avoid unwanted side effects caused by targeting all Helios expressing T cells.


Diagnosis and Treatment Selection

In a further embodiment, the present invention provides for a method for determining the CD8+ T cell status of a subject, or for diagnosing, prognosing or monitoring a disease comprising an immune component in a subject by detecting or quantifying CD8+ T cells as defined in any embodiment herein in a biological sample of the subject. The CD8+ T cell status of the subject may be determined before and after therapy, whereby the efficacy of the therapy is determined or monitored. The therapy may be an immunotherapy (e.g., checkpoint blockade therapy). In certain embodiments, an immunotherapy is effective if after treatment the suppressive CD8+ T cells decrease. In certain embodiments, a subject suffering from cancer having less suppressive CD8+ T cells has a better prognosis than a subject having more suppressive CD8+ T cells.


The terms “diagnosis” and “monitoring” are commonplace and well-understood in medical practice. By means of further explanation and without limitation the term “diagnosis” generally refers to the process or act of recognizing, deciding on or concluding on a disease or condition in a subject on the basis of symptoms and signs and/or from results of various diagnostic procedures (such as, for example, from knowing the presence, absence and/or quantity of one or more biomarkers characteristic of the diagnosed disease or condition).


The term “monitoring” generally refers to the follow-up of a disease or a condition in a subject for any changes which may occur over time.


The terms “prognosing” or “prognosis” generally refer to an anticipation on the progression of a disease or condition and the prospect (e.g., the probability, duration, and/or extent) of recovery. A good prognosis of the diseases or conditions taught herein may generally encompass anticipation of a satisfactory partial or complete recovery from the diseases or conditions, preferably within an acceptable time period. A good prognosis of such may more commonly encompass anticipation of not further worsening or aggravating of such, preferably within a given time period. A poor prognosis of the diseases or conditions as taught herein may generally encompass anticipation of a substandard recovery and/or unsatisfactorily slow recovery, or to substantially no recovery or even further worsening of such.


The terms also encompass prediction of a disease. The terms “predicting” or “prediction” generally refer to an advance declaration, indication or foretelling of a disease or condition in a subject not (yet) having said disease or condition. For example, a prediction of a disease or condition in a subject may indicate a probability, chance or risk that the subject will develop said disease or condition, for example within a certain time period or by a certain age. Said probability, chance or risk may be indicated inter alia as an absolute value, range or statistics, or may be indicated relative to a suitable control subject or subject population (such as, e.g., relative to a general, normal or healthy subject or subject population). Hence, the probability, chance or risk that a subject will develop a disease or condition may be advantageously indicated as increased or decreased, or as fold-increased or fold-decreased relative to a suitable control subject or subject population. As used herein, the term “prediction” of the conditions or diseases as taught herein in a subject may also particularly mean that the subject has a ‘positive’ prediction of such, i.e., that the subject is at risk of having such (e.g., the risk is significantly increased vis-à-vis a control subject or subject population). The term “prediction of no” diseases or conditions as taught herein as described herein in a subject may particularly mean that the subject has a ‘negative’ prediction of such, i.e., that the subject's risk of having such is not significantly increased vis-à-vis a control subject or subject population.


Kits

In another aspect, the invention is directed to kit and kit of parts. The terms “kit of parts” and “kit” as used throughout this specification refer to a product containing components necessary for carrying out the specified methods (e.g., methods for detecting, quantifying or isolating immune cells as taught herein), packed so as to allow their transport and storage. Materials suitable for packing the components comprised in a kit include crystal, plastic (e.g., polyethylene, polypropylene, polycarbonate), bottles, flasks, vials, ampules, paper, envelopes, or other types of containers, carriers or supports. Where a kit comprises a plurality of components, at least a subset of the components (e.g., two or more of the plurality of components) or all of the components may be physically separated, e.g., comprised in or on separate containers, carriers or supports. The components comprised in a kit may be sufficient or may not be sufficient for carrying out the specified methods, such that external reagents or substances may not be necessary or may be necessary for performing the methods, respectively. Typically, kits are employed in conjunction with standard laboratory equipment, such as liquid handling equipment, environment (e.g., temperature) controlling equipment, analytical instruments, etc. In addition to the recited binding agents(s) as taught herein, such as for example, antibodies, hybridization probes, amplification and/or sequencing primers, optionally provided on arrays or microarrays, the present kits may also include some or all of solvents, buffers (such as for example but without limitation histidine-buffers, citrate-buffers, succinate-buffers, acetate-buffers, phosphate-buffers, formate buffers, benzoate buffers, TRIS (Tris(hydroxymethyl)-aminomethan) buffers or maleate buffers, or mixtures thereof), enzymes (such as for example but without limitation thermostable DNA polymerase), detectable labels, detection reagents, and control formulations (positive and/or negative), useful in the specified methods. Typically, the kits may also include instructions for use thereof, such as on a printed insert or on a computer readable medium. The terms may be used interchangeably with the term “article of manufacture”, which broadly encompasses any man made tangible structural product, when used in the present context.


The invention is further described in the following examples, which do not limit the scope of the invention described in the claims.


EXAMPLES
Example 1—Identification of Novel Tumor Infiltrating CD8+ T Cells Populations

Applicants identified novel CD8 and CD4 populations using the B16 melanoma mouse model. For single-cell RNA-Seq experiments, TILs from B16 melanomas were collected in 96-well plates. Applicants performed SMART-seq2 following the published protocol (Picelli et al., 2013 Nat Methods 10, 1096-1098) with minor modifications. Standard Illumina sequencing was performed. Cells in tumors tend to be high for inhibitory receptors (e.g., PD1, Tim3, TIGIT, LAG3). Therapies that block these receptors work in tumor therapy. Therefore, Applicants studied populations of TILs to elucidate the complexity of subpopulations that express these co-inhibitory receptors. Further, Applicants studied how these cell types interact with other cell types in the tumor.



FIGS. 1 and 2 illustrate the study design. Cells were sampled at each time point indicated after tumor cell implantation. Tumor size was measured in two dimensions by caliper and is expressed as the product of two perpendicular diameters. Cells were sorted based on cell markers. CD8 T cells were obtained by sorting for CD8+CD45+ cells. CD4 T cells (both Effector and Regulatory) were obtained by sorting for CD4+CD45+ cells. NK cells, dendritic cells, and macrophages were obtained by sorting for CD4CD8CD45+ cells. CD45 cells included fibroblasts and tumor cells. FIG. 3 illustrates clustering of the CD8 and CD4 T cells. DP refers to double positive for TIM3 and PD-1. DN refers to double negative for TIM3 and PD-1. SP refers to single positive for TIM3 and PD-1.



FIG. 4 illustrates dimension reduction analysis of the cells sequenced for CD8 T cells. Applicants sequenced 2592 cells (27 plates). 2313 cells passed the basic QC (89%) and 2017 cells passed the extensive QC (78%). Principal component (PC) analysis was performed using gene expression measured in the single cells. PC1 was associated with transcription and PC2 and PC3 were strongly associated with sequencing batches. tSNE and clustering was performed on PCs 4-9 (FIG. 4). All of the CD8 cells were pooled together on a normalized tSNE. The CD8 cells clustered into 15 clusters. FIG. 5 illustrates each cluster individually. FIG. 6 illustrates 4 populations that stand out based on expression of the co-inhibitory receptors PD1 and TIM3. Clusters 9, 10 and 7 are PD1+Tim3+(C9, C10, C7). Cluster 8 is PD1+Tim3− (C8). Applicants determined that the clusters are transcriptionally different. Not being bound by a theory the clusters are functionally different. Applicants provide data herein suggesting that the cells are functionally different.



FIG. 7 illustrates decoupled dysfunction and activation scores based on previous work by Applicants (Singer et al., 2016). FIG. 8 illustrates that Clusters 7 and 9 are distinguished by the decoupling of dysfunction and activation scores. FIGS. 9 and 10 illustrate that cluster 7 is high for a CD8 Treg signature (Kim et al., 2015 Science 350(6258):334-339) despite also expressing the co-inhibitory receptors PD-1 and TIM-3. The CD8 Treg signature of Kim et al. includes 343 genes that are upregulated and 153 genes that are downregulated. Cluster 7 signature genes that overlap (p<10−4) with the Treg upregulated gene signature are ADAM8, CCL3, HAVCR2, IRF8, LAT2, MUO10 and SLC37A2. None of the downregulated Treg genes are expressed in cluster 7. Thus, Cluster 7 is enriched for genes upregulated in CD8 Tregs. Not being bound by a theory the cluster 7 CD8 T cells represent a novel suppressive CD8 T cell with gene expression signatures similar to CD8 Tregs. Additionally, cluster 8 overlaps with (p<0.01) with the Treg upregulated gene signature. The overlapping cluster 8 genes are CD74, CD81, CD83, KLRK1, SDC4 and SPRY2. None of the cluster 9 or cluster 10 CD8 signatures overlap with the CD8 Treg signature. Cluster 8, 9 and 10 express genes downregulated in the Treg downregulated gene signature. Cluster 8 includes expression of LRIG1, NRN1, NRP1 and PTPRK (p=0.012). Thus, cluster 8 is enriched for genes either up or down in CD8 Tregs. Cluster 9 expresses ASPM, BUB1, CCNA2, CCNB2, CDCA8, CDKN3, CENPE, HMMR, KIF11, KIF4, MELK, NEK2, SPAG5 and TPX2 (p<10−13). Thus, cluster 9 may express a signature anti-correlated to the Treg signature. Cluster 10 expresses POLA1 and RRM2.



FIG. 11 illustrates that cluster 7 is high for MT1 (left). MT1 is significantly upregulated in cluster 7. FIG. 11 also illustrates that clusters 7 and 8 are marked by expression of the transcription factor Helios (IKZF2) (Kim et al., 2015) (right). Helios expression was found to be significantly upregulated in cluster 7 as compared to cluster 9 and cluster 10. Thus, Applicants have identified for the first time at least two Helios expressing subpopulations of CD8 T cells expressing PD1 and distinguished by at least expression of TIM3.



FIG. 12 illustrates that PD1+TIM3+(DP) MT (WT) expressing cells are the most suppressive in a CFSE assay for T cell proliferation. Greater suppression leads to a few cells with a higher concentration of CFSE (proliferating cells divide and CFSE is diluted among daughter cells). Upon knockout of MT the PD1+TIM3+(DP) MT−/− cells are less suppressive. Cluster 7 represents the population of CD8 cells that are both PD1+TIM3+ and have high MT1 expression. Thus, Applicants have shown for the first time that the cluster 7 population of cells may be suppressive to T cell proliferation. Based on cell type specific markers, cluster 7 T cells may be specifically targeted for therapeutic purposes (e.g., cancer, autoimmune diseases, chronic infection).



FIG. 13 illustrates further characterization of CD8 cell populations. Cluster 9 is high for cell cycle genes and a CD8 activation (effector) signature. Cluster 7 is low for both signatures. Cluster 9 is also high for an exhaustion signature (Wherry and Kurachi, 2015, Nature reviews Immunology 15, 486-499). Clusters 9, 10, 7 and 8 express a decoupled dysfunction signature determined by bulk expression data from populations of T cells (Singer et al., 2016).



FIG. 14 illustrates transmembrane receptors expressed or not expressed by the cluster 7 population. Sorting CD8 cell populations can use these markers. For example, cluster 7 can be sorted out using a combination of SERPINE2+HMMR−; KIT+Tim3+HMMR−; or TNFRSF4+Tim3+HMMR−. FIGS. 17 and 18 show sorting of CD8 T cells. PD1+TIM3+ and PD1+TIM3− are further sorted by HMMR, cKIT and Helios, as well as the proliferation marker Ki-67.



FIG. 15 illustrates cytokines/chemokines expressed by the cluster 7 cell population. IL1 is a proinflammatory cytokine and IL1R2 is a decoy receptor that dampens the proinflammatory response by removing IL1 from the system. Cluster 7 cells express IL1R2. Not being bound by a theory blocking IL1R2 or modulating its expression either through drug or genetic mechanisms (e.g., CRISPR) on cluster 7 cells can inhibit cluster 7 suppressive function. FIG. 16 illustrates transcription factors expressed by the cluster 7 cell population. All of these transcription factors are significantly upregulated in cluster 7 as compared to clusters 9 and 10. IKZF2 (Helios) may be involved in the regulation of Tregs and the STAT5 pathway. EPAS1 regulates VEGF. Further, EPAS1 is specific to cluster 7. RUNX2 is involved in CD8 memory differentiation. RBPJ is involved in Notch signaling. Thus, these transcription factors may be targeted to inhibit the suppressive function of cluster 7 cells.


Applicants hypothesized that cluster 7 is sensitive to steroid signaling. Specifically, cluster 7 may be sensitive to glucocorticoid signaling (see, e.g., Oakley and Cidlowsk J Allergy Clin Immunol. 2013 November; 132(5): 1033-1044). Glucocorticoid signaling turns on expression of MT's and correlates to TIM3/PD1 expression. NR3C1 is the glucocorticoid receptor and is differentially expressed on cluster 7 cells (see, e.g., Tables). Targeting glucocorticoid sensing may be a target for inhibiting the suppressive function of cluster 7 cells. Glucocorticoid inhibiting drugs have previously been described (see, e.g., Clark, Curr Top Med Chem. 2008; 8(9):813-38) and may be used in combination with checkpoint blockade therapy as described herein.


Applicants analyzed additional single cells to further validate the CD8 and CD4 T cell sub-populations (FIGS. 29 and 30). Applicants collected single cells at 5 time points from 12 mice injected with B16F10 melanoma. Cells were sorted based on CD8+, CD4+ and CD45+. Cells were analyzed using plate-based single-cell RNA-seq as described herein. Applicants analyzed 2,017 CD8+ T cells and 1,478 CD4+ T cells passing quality control. FIG. 30 are tSNE plots showing the general cell distribution after normalization. FIG. 31 shows that single cell RNA-seq identifies two dysfunction-like populations and two activation-like populations. The dysfunction and activation signatures were described previously (Singer et al., 2016). Clusters 9 and 10 correlated to the activation signature. Clusters 7 and 8 correlated to the dysfunction signature. Further analysis showed that the coinhibitory molecules PD1 and TIM3 are differentially expressed in the CD8 clusters. PD1 expression is seen in clusters 6-10. Tim3 is expressed highest in cluster 7 and is not expressed in cluster 8. Barplots of the p-values for each cluster are shown. PD1+TIM3+(DP) have previously been described as the most dysfunctional T cells (Singer et al., 2016). FIG. 32 shows that cluster 7 is a subtype of CD8 T cells that expresses a dysfunction signature and a CD8+ Treg signature. Cluster 7 also has high expression for the master transcription factor Helios (IKZF2) described herein. FIG. 33 illustrates that PD1+TIM3+(DP) MT (WT) expressing cells are the most suppressive in a CFSE assay for T cell proliferation. Upon knockout of MT the PD1+TIM3+(DP) MT−/− cells are less suppressive. The DP T cells include all of clusters 6, 7, 9 and 10. Cluster 7 represents the population of CD8 cells that are both PD1+TIM3+ and have high MT1 expression. Thus, knockout of MT in the DP population targets the cluster 7 subpopulation and cluster 7 may be responsible for the suppression of proliferation observed. Thus, the present invention advantageously provides for therapeutic targeting of a specific CD8 T cell subtype.


Cluster 7 (CD8) can be further characterized by expression of genes markers. FIGS. 34 and 35 show analysis by FACS and tSNE of cluster 7 specific markers. Lymphocytes were sorted. The lymphocytes were further sorted by CD8. The CD8 cells were further sorted into Helios positive and negative populations. The Helios+/− populations were then analyzed for the indicated markers. For example, Helios positive CD8 T cells are mostly PD1 positive and include both TIM3+(cluster 7) and TIM3− (cluster 8) subpopulations. FIG. 38 shows that Cluster 7 expresses XCR1, the receptor for XCL1. In certain embodiments, XRC1 is present in a cluster 7 gene signature. Tables 1 to 6 list in ranked order, genes differentially expressed in cluster 7. Table 1 lists the top 500 ranked genes. Tables 2 and 3 list transcription factors and cell surface/cytokines. Table 4 lists genes differentially expressed in cluster as compared to all 15 CD8 T cell clusters. Table 5 lists a cluster 7 signature. Table 6 lists a ranked cluster 7 signature determined using modified statistical analysis. In certain embodiments, Table 6 was determined using a more statistically accurate analysis of the CD8 clusters. In certain embodiments, Table 6 represents a gene signature based on analyzing more single cells.









TABLE 1







Ranked top 500 genes differentially expressed in cluster 7



















thresh_
hyper_
hyper_
gen_
rank_hy-
rank_gen_
mean_


Gene
TP
TN
mhg
pval
qval
qval
per_qval
qval
rank



















GLDC
0.719457014
0.847995546
0.705
1.75E−67
3.63E−64
−85.741
2
1
1.5


TNFRSF9
0.873303167
0.744988864
8.537
2.48E−73
1.03E−69
−74.603
1
2
1.5


PRF1
0.936651584
0.628619154
7.253
8.69E−64
1.20E−60
−70.08
3
3
3


IRF8
0.886877828
0.663697105
1.501
4.18E−58
3.46E−55
−60.58
5
4
4.5


CCRL2
0.7239819
0.791759465
1.05
1.60E−52
9.50E−50
−57.892
7
5
6


PCYT1A
0.656108597
0.83518931
0.575
4.90E−51
2.54E−48
−57.84
8
7
7.5


HAVCR2
0.873303167
0.683184855
2.154
5.96E−59
6.17E−56
−51.383
4
11
7.5


LAT2
0.647058824
0.83518931
0.903
2.19E−49
6.50E−47
−57.892
14
6
10


2900026A02RIK
0.687782805
0.807906459
0.926
9.38E−50
3.24E−47
−53.112
12
10
11


CSF1
0.556561086
0.885300668
0.911
1.02E−47
2.63E−45
−56.864
16
8
12


ADAM8
0.787330317
0.726057906
0.864
7.94E−50
2.99E−47
−50.88
10
14
12


ITGAV
0.85520362
0.657572383
0.084
7.25E−50
2.99E−47
−50.642
11
15
13


TMPRSS6
0.466063348
0.91481069
0.546
1.18E−41
1.87E−39
−53.736
26
9
17.5


ADAMTS14
0.619909502
0.834632517
0.506
1.83E−44
4.00E−42
−49.686
19
16
17.5


C1QTNF6
0.538461538
0.877505568
0.379
3.24E−42
5.59E−40
−51.184
24
12
18


RGS16
0.977375566
0.510022272
2.506
1.88E−54
1.30E−51
−39.659
6
31
18.5


SERPINE2
0.542986425
0.873051225
3.895
1.05E−41
1.73E−39
−51.019
25
13
19


LITAF
0.936651584
0.551224944
5.893
1.35E−49
4.31E−47
−41.548
13
25
19


RBPJ
0.873303167
0.632516704
2.763
4.56E−49
1.26E−46
−42.315
15
24
19.5


TNFRSF4
0.773755656
0.723830735
3.144
8.39E−47
2.05E−44
−42.493
17
23
20


GPR56
0.647058824
0.806792873
0.696
2.30E−42
4.15E−40
−44.618
23
18
20.5


PGLYRP1
0.923076923
0.53674833
4.954
5.99E−44
1.18E−41
−43.178
21
21
21


HILPDA
0.764705882
0.711024499
5.185
1.14E−42
2.16E−40
−41.529
22
26
24


ANXA2
0.923076923
0.520601336
10.29
1.88E−41
2.78E−39
−43.087
28
22
25


PLEK
0.914027149
0.566815145
1.395
1.02E−46
2.36E−44
−39.004
18
32
25


LAG3
0.963800905
0.513919822
4.793
7.81E−51
3.60E−48
−36.69
9
42
25.5


RGS8
0.538461538
0.865812918
0.275
5.40E−39
5.60E−37
−47.201
40
17
28.5


NABP1
0.828054299
0.63363029
2.705
2.53E−40
3.09E−38
−38.264
34
35
34.5


GPD2
0.714932127
0.732182628
1.007
3.13E−38
3.09E−36
−40.178
42
29
35.5


SLC37A2
0.502262443
0.878062361
0.546
1.80E−36
1.43E−34
−43.31
52
20
36


IKZF2
0.719457014
0.737193764
0.084
6.49E−40
7.47E−38
−37.446
36
37
36.5


AA467197
0.466063348
0.896993318
2.233
5.25E−36
3.96E−34
−43.648
55
19
37


UBASH3B
0.660633484
0.787861915
5.56
1.95E−40
2.58E−38
−35.97
32
44
38


EPAS1
0.529411765
0.863585746
0.986
5.63E−37
4.77E−35
−38.289
49
34
41.5


SERPINB9
0.850678733
0.593541203
3.333
4.94E−38
4.65E−36
−36.781
44
40
42


GAPDH
0.895927602
0.54844098
12.436
7.42E−40
8.31E−38
−35.534
37
47
42


CCNG1
0.873303167
0.587973274
2.284
1.79E−41
2.75E−39
−32.322
27
58
42.5


ACOT7
0.895927602
0.555122494
3.285
6.97E−41
9.63E−39
−32.602
30
56
43


BHLHE40
0.977375566
0.422048998
6.796
6.69E−41
9.57E−39
−32.471
29
57
43


TPI1
0.954751131
0.462138085
5.874
2.28E−40
2.86E−38
−32.629
33
55
44


RGS2
0.900452489
0.541202673
1.411
1.09E−39
1.19E−37
−34.658
38
51
44.5


CDK6
0.696832579
0.73830735
1.978
1.79E−36
1.43E−34
−36.931
51
39
45


CXCR6
0.986425339
0.423719376
5.506
4.32E−44
8.96E−42
−29.649
20
71
45.5


MNDA
0.470588235
0.889755011
4.02
1.12E−34
7.15E−33
−40.982
65
27
46


GEM
0.719457014
0.716592428
5.361
3.84E−36
2.95E−34
−36.705
54
41
47.5


GM5177
0.909502262
0.517817372
3.82
4.31E−38
4.16E−36
−33.888
43
53
48


CST7
0.963800905
0.445991091
7.082
1.99E−40
2.58E−38
−30.448
31
68
49.5


SLC2A3
0.656108597
0.768374165
5.663
8.80E−36
6.40E−34
−36.69
57
43
50


KIT
0.466063348
0.88752784
0.516
2.23E−33
1.27E−31
−40.679
73
28
50.5


GZMB
0.850678733
0.579064588
2.384
7.84E−36
5.80E−34
−35.205
56
50
53


S100A11
0.968325792
0.418708241
8.456
8.11E−38
7.31E−36
−32.158
46
60
53


IL1R2
0.407239819
0.915924276
3.396
2.28E−32
1.20E−30
−40.178
79
30
54.5


DSCAM
0.479638009
0.876948775
0.189
1.06E−32
5.69E−31
−38.384
77
33
55


CCL3
0.642533937
0.773942094
3.904
8.83E−35
5.72E−33
−35.708
64
46
55


FAM3C
0.832579186
0.605790646
0.287
1.12E−36
9.25E−35
−31.563
50
63
56.5


CASP3
0.895927602
0.549554566
3.644
5.01E−40
5.94E−38
−28.856
35
80
57.5


NR4A2
0.914027149
0.498886414
0.595
2.62E−36
2.05E−34
−31.351
53
64
58.5


CD244
0.466063348
0.883073497
3.545
3.23E−32
1.67E−30
−37.285
80
38
59


SLC16A11
0.429864253
0.901447661
1.029
1.21E−31
6.02E−30
−38.005
83
36
59.5


DUSP4
0.755656109
0.677616927
0.444
2.87E−35
2.02E−33
−31.906
59
61
60


CAPG
0.864253394
0.558463252
4.057
3.05E−35
2.11E−33
−31.567
60
62
61


SAMSN1
0.941176471
0.473830735
1.029
1.01E−38
1.02E−36
−27.927
41
87
64


FAM110A
0.683257919
0.7344098
0.731
1.08E−33
6.37E−32
−32.314
70
59
64.5


CIAPIN1
0.859728507
0.581291759
1.669
8.11E−38
7.31E−36
−27.949
45
86
65.5


NRGN
0.484162896
0.865812918
0.604
1.10E−30
5.06E−29
−35.762
90
45
67.5


PLAC8
0.43438914
0.896436526
10.661
5.77E−31
2.75E−29
−35.511
87
48
67.5


IMPA2
0.714932127
0.707683742
0.832
6.52E−34
3.92E−32
−30.742
69
66
67.5


SRGAP3
0.529411765
0.840757238
0.39
1.24E−31
6.14E−30
−34.118
84
52
68


FOXRED2
0.425339367
0.900890869
1.731
8.24E−31
3.88E−29
−35.48
88
49
68.5


NRP1
0.751131222
0.670935412
0.163
1.76E−33
1.03E−31
−30.537
71
67
69


ARL14EP
0.7239819
0.703786192
2.084
1.20E−34
7.55E−33
−29.195
66
75
70.5


EHD1
0.832579186
0.594654788
3.266
5.47E−35
3.60E−33
−28.975
63
78
70.5


LGALS1
0.923076923
0.508351893
10.112
1.29E−39
1.37E−37
−25.523
39
105
72


MT1
0.556561086
0.814587973
2.173
3.92E−30
1.78E−28
−33.823
91
54
72.5


ERGIC1
0.71040724
0.699888641
0.333
5.94E−32
3.00E−30
−29.048
82
76
79


OSBPL3
0.800904977
0.615256125
0.176
8.51E−33
4.64E−31
−28.093
76
85
80.5


SMIM3
0.497737557
0.857461024
6.263
9.72E−31
4.53E−29
−29.22
89
74
81.5


SERPINA3G
0.877828054
0.540089087
4.066
4.39E−35
2.94E−33
−25.966
62
101
81.5


TOX
0.904977376
0.520044543
3.455
1.76E−37
1.55E−35
−23.972
47
122
84.5


PKM
0.805429864
0.583518931
9.882
5.15E−29
2.09E−27
−29.958
102
69
85.5


CX3CR1
0.511312217
0.843541203
1.646
1.21E−29
5.27E−28
−28.883
95
79
87


ID2
0.972850679
0.341314031
4.705
6.02E−29
2.42E−27
−29
103
77
90


PEX16
0.624434389
0.759465479
2.725
1.51E−29
6.38E−28
−28.699
98
82
90


GPR65
0.760180995
0.652561247
2.585
5.08E−32
2.60E−30
−26.057
81
100
90.5


SEPT11
0.837104072
0.582405345
0.88
5.90E−34
3.60E−32
−24.768
68
117
92.5


NFKB2
0.846153846
0.561804009
2.359
1.52E−32
8.07E−31
−25.366
78
108
93


FDX1
0.574660633
0.79454343
1.77
7.17E−29
2.80E−27
−28.187
106
84
95


ENTPD1
0.701357466
0.692093541
0.202
2.00E−29
8.39E−28
−26.13
99
99
99


BCL2A1D
0.959276018
0.403674833
2.198
1.91E−33
1.10E−31
−23.195
72
127
99.5


DNMT3A
0.660633484
0.729398664
2.63
1.40E−29
6.04E−28
−25.499
96
106
101


ZMIZ1
0.751131222
0.655345212
0.214
4.47E−31
2.16E−29
−24.896
86
116
101


NRN1
0.538461538
0.815144766
3.643
9.38E−28
3.14E−26
−28.795
124
81
102.5


STAT3
0.909502262
0.43596882
6.143
3.70E−27
1.13E−25
−29.78
136
70
103


CLIC4
0.619909502
0.758351893
1.202
9.73E−29
3.74E−27
−26.3
108
98
103


GDPD5
0.438914027
0.878062361
2.606
4.27E−27
1.28E−25
−29.436
138
72
105


CCR8
0.443438914
0.874164811
5.401
7.64E−27
2.23E−25
−29.292
142
73
107.5


NEDD9
0.665158371
0.714922049
5.851
6.40E−28
2.17E−26
−26.928
122
93
107.5


GSTO1
0.624434389
0.751670379
5.507
3.04E−28
1.08E−26
−25.935
117
102
109.5


PGK1
0.936651584
0.402561247
7.541
2.92E−28
1.04E−26
−25.564
116
104
110


PDCD1
0.968325792
0.415367483
5.101
2.34E−37
2.02E−35
−19.612
48
172
110


UHRF2
0.542986425
0.809576837
0.971
2.48E−27
7.85E−26
−27.41
131
91
111


PLSCR1
0.696832579
0.688752784
2.785
2.73E−28
1.00E−26
−25.165
113
111
112


TIGIT
0.981900452
0.354120267
4.895
4.50E−33
2.49E−31
−21.13
75
151
113


ALDOA
0.954751131
0.360801782
9.477
5.16E−27
1.53E−25
−27.783
140
88
114


LILRB4
0.597285068
0.767817372
5.621
2.81E−27
8.68E−26
−26.812
134
95
114.5


KLRC1
0.841628959
0.546213808
4.198
1.16E−29
5.11E−28
−22.062
94
135
114.5


TFF1
0.384615385
0.904788419
5.97
6.36E−26
1.60E−24
−30.862
165
65
115


HNRNPA1
0.78280543
0.589643653
9.38
1.65E−26
4.52E−25
−28.44
151
83
117


PTPRS
0.606334842
0.763919822
1.989
7.79E−28
2.62E−26
−25.035
123
113
118


1700017B05RIK
0.71040724
0.675946548
1.618
2.83E−28
1.03E−26
−23.832
114
124
119


PTPLAD1
0.805429864
0.580734967
0.748
1.22E−28
4.65E−27
−22.786
109
129
119


VAMP8
0.923076923
0.462138085
2.807
4.06E−33
2.27E−31
−20.364
74
164
119


ESD
0.886877828
0.528396437
4.135
4.08E−35
2.77E−33
−19.013
61
183
122


GM14295
0.755656109
0.631959911
3.009
2.37E−28
8.78E−27
−22.272
112
134
123


NUCB1
0.895927602
0.478841871
0.444
4.44E−30
2.00E−28
−21.027
92
154
123


TUBB6
0.429864253
0.879175947
4.374
4.18E−26
1.06E−24
−27.679
163
89
126


SH2D2A
0.837104072
0.541759465
8.666
2.34E−28
8.76E−27
−21.518
111
142
126.5


RCN1
0.574660633
0.778953229
0.949
3.65E−26
9.34E−25
−27.183
162
92
127


TRPS1
0.511312217
0.829064588
0.986
9.50E−27
2.75E−25
−25.165
143
112
127.5


RPS27L
0.742081448
0.646993318
4.99
1.54E−28
5.82E−27
−21.356
110
147
128.5


SH3BGRL
0.828054299
0.559020045
0.546
3.27E−29
1.34E−27
−20.969
101
156
128.5


FKBP1A
0.972850679
0.393652561
4.911
1.17E−35
8.36E−34
−18.268
58
200
129


AFG3L2
0.674208145
0.697104677
0.411
1.82E−26
4.89E−25
−24.111
154
119
136.5


KDELR2
0.805429864
0.573496659
2.934
1.12E−27
3.68E−26
−21.276
126
148
137


IL2RB
0.561085973
0.782293987
9.964
5.64E−25
1.26E−23
−27.616
185
90
137.5


SLC25A4
0.950226244
0.405902004
7.096
1.34E−31
6.55E−30
−18.569
85
190
137.5


LYRM4
0.574660633
0.773942094
0.39
2.42E−25
5.71E−24
−25.643
175
103
139


BCL2L11
0.660633484
0.708240535
1.748
2.60E−26
6.81E−25
−24.04
158
120
139


DUT
0.814479638
0.609131403
2.151
4.25E−34
2.63E−32
−17.989
67
211
139


SERPINB6A
0.656108597
0.712694878
3.637
2.23E−26
5.93E−25
−23.705
156
125
140.5


RFK
0.714932127
0.660356347
0.595
1.16E−26
3.28E−25
−22.062
146
136
141


EEA1
0.547511312
0.795100223
0.189
2.24E−25
5.36E−24
−25.035
173
114
143.5


GALK1
0.56561086
0.787861915
3.819
1.74E−26
4.71E−25
−21.626
153
140
146.5


KLRC2
0.778280543
0.597995546
1.766
5.80E−27
1.71E−25
−20.64
141
161
151


TMBIM4
0.597285068
0.757238307
1.876
1.44E−25
3.55E−24
−21.991
168
137
152.5


PKP4
0.633484163
0.723273942
0.31
5.06E−25
1.15E−23
−22.686
183
130
156.5


RPS26
0.950226244
0.370824053
11.127
3.10E−27
9.52E−26
−19.14
135
181
158


LRRK1
0.50678733
0.819042316
0.322
2.49E−24
5.09E−23
−24.904
203
115
159


GLIPR1
0.755656109
0.613585746
5.081
7.31E−26
1.83E−24
−21.055
166
153
159.5


STK39
0.502262443
0.820155902
0.263
5.90E−24
1.15E−22
−25.214
211
109
160


SERPINA3H
0.714932127
0.646993318
0.251
7.63E−25
1.65E−23
−22.871
192
128
160


SLC52A3
0.325791855
0.927616927
3.605
1.15E−23
2.12E−22
−26.78
225
96
160.5


GM5069
0.728506787
0.623608018
2.694
1.45E−23
2.63E−22
−26.822
228
94
161


CCDC50
0.65158371
0.708240535
0.367
3.85E−25
8.92E−24
−21.472
179
144
161.5


ACTG1
0.945701357
0.349665924
11.98
7.96E−24
1.51E−22
−25.203
218
110
164


SLA2
0.823529412
0.550668151
0.39
2.03E−27
6.46E−26
−18.315
130
199
164.5


IL10RA
0.837104072
0.538975501
0.214
5.38E−28
1.86E−26
−18.023
120
209
164.5


CENPA
0.773755656
0.61247216
2.791
2.92E−28
1.04E−26
−17.723
115
217
166


RUNX2
0.78280543
0.582962138
1.546
1.20E−25
2.97E−24
−20.044
167
167
167


NEK6
0.339366516
0.918708241
1.144
3.51E−23
6.06E−22
−26.34
240
97
168.5


TXN1
0.959276018
0.367483296
3.623
7.60E−29
2.95E−27
−17.362
107
231
169


RPN1
0.891402715
0.457126949
0.356
1.45E−26
4.07E−25
−18.422
148
195
171.5


STARD3NL
0.647058824
0.720489978
2.227
2.34E−26
6.17E−25
−18.589
157
189
173


KDM2B
0.678733032
0.677616927
0.88
2.60E−24
5.29E−23
−21.433
204
145
174.5


MPHOSPH6
0.601809955
0.745545657
2.59
2.42E−24
4.96E−23
−21.241
202
150
176


IL18RAP
0.733031674
0.626948775
0.66
1.40E−24
2.93E−23
−21.027
199
155
177


CLTC
0.828054299
0.525612472
0.176
5.84E−25
1.30E−23
−19.944
187
169
178


DEGS1
0.769230769
0.605790646
5.83
1.03E−26
2.96E−25
−17.807
144
214
179


0610007P14RIK
0.63800905
0.718262806
3.406
7.29E−25
1.58E−23
−19.986
191
168
179.5


TNFRSF18
0.895927602
0.459910913
4.331
1.10E−27
3.65E−26
−16.98
125
240
182.5


TIPRL
0.773755656
0.600222717
3.18
1.29E−26
3.64E−25
−17.608
147
220
183.5


ATXN10
0.778280543
0.595768374
1.131
1.14E−26
3.26E−25
−17.466
145
227
186


SERPINB6B
0.787330317
0.561247216
3.908
1.41E−23
2.57E−22
−21.418
227
146
186.5


ISY1
0.886877828
0.482182628
1.975
6.73E−29
2.66E−27
−16.471
105
268
186.5


CMTM7
0.864253394
0.512249443
1.761
6.60E−29
2.63E−27
−16.368
104
273
188.5


SLC16A3
0.479638009
0.827394209
0.496
2.07E−22
3.16E−21
−25.499
271
107
189


ARSB
0.79638009
0.585746102
0.251
5.85E−28
2.00E−26
−16.597
121
258
189.5


DDIT4
0.601809955
0.735523385
0.356
7.08E−23
1.16E−21
−22.58
253
131
192


PRELID1
0.977375566
0.298997773
7.483
4.65E−25
1.07E−23
−18.071
181
206
193.5


RBL2
0.832579186
0.511135857
0.239
6.99E−24
1.35E−22
−19.557
215
173
194


HSP90B1
0.823529412
0.521714922
7.442
7.64E−24
1.46E−22
−19.492
217
174
195.5


HMGCR
0.65158371
0.694877506
2.791
2.84E−23
4.97E−22
−20.939
237
158
197.5


CETN2
0.705882353
0.658129176
0.669
3.71E−25
8.65E−24
−17.656
178
219
198.5


TWSG1
0.466063348
0.835746102
0.367
3.34E−22
4.90E−21
−23.987
282
121
201.5


COPS4
0.764705882
0.609131403
3.651
1.59E−26
4.40E−25
−16.618
150
256
203


TMEM123
0.891402715
0.464922049
4.878
1.61E−27
5.23E−26
−16.349
128
278
203


PREP
0.56561086
0.767817372
2.782
2.97E−23
5.18E−22
−19.715
238
170
204


VPS52
0.642533937
0.700445434
0.782
6.35E−23
1.05E−21
−20.519
251
163
207


NCOR2
0.656108597
0.688752784
0.239
5.32E−23
8.90E−22
−19.689
247
171
209


S100A4
0.769230769
0.596325167
6.264
1.76E−25
4.32E−24
−16.811
169
250
209.5


CALR
0.923076923
0.384187082
2.844
1.64E−23
2.98E−22
−18.422
229
196
212.5


RABGAP1L
0.787330317
0.551781737
2.223
1.87E−22
2.90E−21
−20.528
268
162
215


UAP1
0.461538462
0.83908686
2.275
3.08E−22
4.54E−21
−21.07
281
152
216.5


PGAM1
0.918552036
0.421492205
5.865
4.67E−27
1.39E−25
−15.989
139
295
217


SERPINA3I
0.687782805
0.659242762
0.918
5.24E−23
8.84E−22
−18.496
246
191
218.5


PTGER2
0.479638009
0.824053452
0.227
7.64E−22
1.03E−20
−22.388
308
132
220


COX17
0.868778281
0.493875278
5.638
2.54E−27
7.99E−26
−15.776
132
308
220


BCL2A1B
0.954751131
0.369153675
1.937
5.07E−28
1.77E−26
−15.472
119
322
220.5


NAP1L1
0.968325792
0.359131403
4.392
5.38E−30
2.40E−28
−14.871
93
348
220.5


PIGS
0.78280543
0.587416481
4.183
3.21E−26
8.25E−25
−16.223
161
284
222.5


SIK1
0.864253394
0.478285078
1.257
1.04E−24
2.21E−23
−16.799
195
252
223.5


FLNB
0.515837104
0.79844098
0.163
5.22E−22
7.35E−21
−20.969
294
157
225.5


SEMA6D
0.380090498
0.888084633
0.367
1.74E−21
2.16E−20
−24.639
334
118
226


MRPS21
0.701357466
0.658129176
3.124
1.43E−24
2.97E−23
−16.703
200
255
227.5


MAP2K3
0.742081448
0.623608018
4.48
2.42E−25
5.71E−24
−16.349
176
279
227.5


ENO3
0.470588235
0.829064588
3.527
1.34E−21
1.69E−20
−22.303
328
133
230.5


SMARCB1
0.65158371
0.698218263
4.752
9.93E−24
1.84E−22
−17.095
224
238
231


ATXN1
0.841628959
0.489420935
0.138
1.17E−22
1.85E−21
−18.207
263
202
232.5


CDV3
0.787330317
0.580178174
0.585
6.30E−26
1.59E−24
−15.902
164
301
232.5


SMPDL3B
0.470588235
0.828507795
0.832
1.66E−21
2.07E−20
−21.964
332
138
235


AI662270
0.809954751
0.550111359
1.195
2.36E−25
5.63E−24
−15.982
174
296
235


SERPINA3F
0.42081448
0.86247216
1.384
1.67E−21
2.08E−20
−21.93
333
139
236


PNKD
0.606334842
0.727728285
0.623
2.55E−22
3.83E−21
−18.402
276
197
236.5


CISD1
0.592760181
0.744432071
3.873
4.57E−23
7.80E−22
−17.332
243
232
237.5


NCF4
0.805429864
0.546213808
1.791
3.08E−24
6.21E−23
−16.455
206
270
238


PTPN7
0.764705882
0.599109131
3.651
3.21E−25
7.52E−24
−15.868
177
304
240.5


IL12RB2
0.447963801
0.841870824
0.66
4.40E−21
5.09E−20
−23.202
359
126
242.5


PADI2
0.647058824
0.702672606
2.709
8.75E−24
1.64E−22
−16.48
221
266
243.5


ETFB
0.828054299
0.542873051
3.503
4.17E−27
1.26E−25
−14.849
137
350
243.5


MED11
0.597285068
0.737750557
2.676
1.19E−22
1.87E−21
−17.51
264
224
244


RAB27A
0.769230769
0.602449889
2.16
2.83E−26
7.33E−25
−15.278
160
331
245.5


TYK2
0.683257919
0.66091314
0.604
1.15E−22
1.82E−21
−17.137
262
237
249.5


GABARAPL1
0.597285068
0.733853007
0.595
4.23E−22
6.05E−21
−17.973
290
212
251


CTSC
0.764705882
0.587416481
2.689
9.44E−24
1.75E−22
−16.298
223
281
252


AW112010
0.923076923
0.382516704
10.134
2.54E−23
4.48E−22
−16.372
234
272
253


ARL1
0.737556561
0.616926503
3.406
6.93E−24
1.34E−22
−16.038
214
293
253.5


PRDX2
0.864253394
0.480512249
4.836
5.65E−25
1.26E−23
−15.473
186
321
253.5


GNPNAT1
0.552036199
0.767260579
1.651
1.50E−21
1.89E−20
−18.958
330
184
257


SLC39A1
0.819004525
0.509465479
0.546
8.38E−22
1.12E−20
−18.183
311
204
257.5


GM14440
0.665158371
0.673719376
0.401
4.02E−22
5.79E−21
−17.437
288
228
258


CYB5B
0.674208145
0.669821826
2.29
1.03E−22
1.65E−21
−16.529
259
262
260.5


ERO1L
0.497737557
0.806792873
1.373
3.47E−21
4.11E−20
−19.368
350
175
262.5


NDFIP2
0.610859729
0.721046771
0.84
6.18E−22
8.42E−21
−17.516
302
223
262.5


PGLS
0.846153846
0.528396437
4.531
4.70E−28
1.65E−26
−13.831
118
407
262.5


ACSL4
0.755656109
0.577394209
1.541
2.15E−21
2.62E−20
−18.853
340
186
263


FUCA2
0.50678733
0.800111359
1.803
3.35E−21
3.98E−20
−18.882
349
185
267


CD200
0.34841629
0.901447661
1.614
2.61E−20
2.71E−19
−21.601
400
141
270.5


XPNPEP1
0.647058824
0.689309577
0.345
5.53E−22
7.72E−21
−16.861
297
245
271


PLP2
0.832579186
0.54064588
1.064
1.64E−27
5.28E−26
−13.715
129
418
273.5


MT2
0.325791855
0.913140312
1.646
5.06E−20
4.93E−19
−23.836
426
123
274.5


LPIN2
0.393665158
0.873051225
1.245
3.48E−20
3.53E−19
−21.488
408
143
275.5


3830406C13RIK
0.520361991
0.787861915
2.733
6.52E−21
7.28E−20
−19.078
371
182
276.5


SSR2
0.859728507
0.498886414
3.64
1.72E−26
4.70E−25
−13.864
152
403
277.5


NDUFS2
0.819004525
0.557349666
4.721
1.34E−27
4.37E−26
−13.502
127
431
279


2700060E02RIK
0.923076923
0.432628062
5.072
2.72E−29
1.13E−27
−13.17
100
459
279.5


MTHFD1L
0.597285068
0.729955457
0.918
1.47E−21
1.85E−20
−17.233
329
235
282


HIP1
0.647058824
0.682628062
0.163
4.02E−21
4.67E−20
−18.054
357
208
282.5


DYNLT3
0.429864253
0.849665924
0.774
2.78E−20
2.87E−19
−20.33
402
165
283.5


EFHD2
0.79638009
0.548997773
0.263
2.49E−23
4.43E−22
−15.078
233
339
286


TNFSF4
0.384615385
0.878619154
3.874
3.78E−20
3.79E−19
−20.871
414
159
286.5


FARS2
0.556561086
0.755011136
0.604
2.51E−20
2.61E−19
−19.261
398
177
287.5


CST3
0.791855204
0.551781737
1.7
4.71E−23
8.01E−22
−15.102
244
337
290.5


NOL7
0.809954751
0.532293987
0.832
3.34E−23
5.80E−22
−14.987
239
343
291


OXSR1
0.524886878
0.779510022
0.595
3.35E−20
3.41E−19
−19.212
407
179
293


DUSP6
0.57918552
0.740534521
1.475
6.50E−21
7.28E−20
−17.756
370
216
293


SEPT2
0.954751131
0.329064588
0.536
2.54E−23
4.48E−22
−14.651
235
357
296


UTF1
0.330316742
0.909242762
4.976
1.01E−19
9.36E−19
−21.255
447
149
298


ENO1
0.895927602
0.393652561
9.733
4.25E−20
4.22E−19
−19.242
418
178
298


MTMR1
0.56561086
0.756681514
0.202
1.32E−21
1.68E−20
−16.458
327
269
298


DCTN5
0.714932127
0.631959911
2.932
6.66E−23
1.10E−21
−14.969
252
344
298


PDCL3
0.647058824
0.687639198
3.007
9.13E−22
1.20E−20
−16.187
315
286
300.5


DDB1
0.764705882
0.58908686
4.051
5.87E−24
1.15E−22
−13.99
210
392
301


HDAC1
0.837104072
0.505011136
2.214
8.21E−24
1.55E−22
−14.043
220
386
303


SREBF2
0.733031674
0.597438753
0.138
5.92E−21
6.69E−20
−16.921
367
241
304


COMMD3
0.733031674
0.60467706
3.134
8.29E−22
1.11E−20
−15.961
310
298
304


GM9855
0.619909502
0.703786192
0.202
1.07E−20
1.15E−19
−17.276
385
233
309


CTSB
0.959276018
0.346325167
2.31
2.72E−26
7.09E−25
−13.17
159
460
309.5


SIVA1
0.57918552
0.739977728
2.032
7.74E−21
8.61E−20
−16.842
373
248
310.5


COX7B
0.877828054
0.466035635
3.203
2.15E−25
5.18E−24
−13.303
172
449
310.5


BEND4
0.705882353
0.630846325
0.356
1.18E−21
1.52E−20
−15.651
321
312
316.5


CBLB
0.981900452
0.267260579
1.609
1.33E−22
2.08E−21
−14.327
265
368
316.5


ANKRD39
0.466063348
0.821269488
2.245
8.50E−20
7.96E−19
−18.44
443
193
318


KARS
0.873303167
0.465478842
0.299
1.31E−24
2.74E−23
−13.425
198
438
318


LXN
0.50678733
0.791202673
1.373
7.43E−20
7.05E−19
−18.233
437
201
319


D16ERTD472E
0.841628959
0.496659243
0.926
1.73E−23
3.11E−22
−13.828
230
409
319.5


SPCS3
0.714932127
0.61636971
3.074
5.30E−21
6.05E−20
−16.349
363
280
321.5


TPM4
0.941176471
0.361358575
4.167
2.84E−24
5.75E−23
−13.413
205
440
322.5


CHST12
0.619909502
0.703229399
1.599
1.26E−20
1.35E−19
−16.596
387
259
323


ACOT9
0.692307692
0.64532294
1.305
8.52E−22
1.13E−20
−15.211
312
335
323.5


METAP2
0.832579186
0.508908686
2.057
1.28E−23
2.34E−22
−13.618
226
424
325


LAP3
0.429864253
0.845211581
0.88
1.60E−19
1.43E−18
−18.741
465
187
326


FUBP1
0.733031674
0.605233853
0.31
7.11E−22
9.61E−21
−14.855
307
349
328


TANK
0.647058824
0.674276169
1.35
4.42E−20
4.38E−19
−17.063
419
239
329


MNF1
0.610859729
0.707683742
2.362
3.61E−20
3.64E−19
−16.808
411
251
331


GM12669
0.769230769
0.569599109
2.278
3.39E−22
4.96E−21
−14.119
283
380
331.5


ST14
0.438914027
0.837416481
0.556
2.83E−19
2.40E−18
−19.188
489
180
334.5


IPO7
0.547511312
0.755567929
1.131
2.17E−19
1.88E−18
−18.429
478
194
336


TARS
0.63800905
0.691536748
0.465
3.30E−21
3.93E−20
−15.468
348
324
336


SLC25A17
0.597285068
0.722717149
3.434
1.38E−20
1.47E−19
−16.218
389
285
337


PFKL
0.615384615
0.71325167
2.284
2.05E−21
2.50E−20
−15.05
339
340
339.5


TMBIM1
0.330316742
0.90701559
2.462
3.10E−19
2.61E−18
−18.339
492
198
345


CCT3
0.891402715
0.46325167
0.848
2.59E−27
8.08E−26
−12.015
133
559
346


OS9
0.823529412
0.514476615
0.411
5.39E−23
8.97E−22
−13.34
249
444
346.5


CALM3
0.85520362
0.491648107
1.163
6.53E−25
1.43E−23
−12.592
189
504
346.5


DAPK2
0.43438914
0.841314031
2.31
2.15E−19
1.87E−18
−17.678
477
218
347.5


SIL1
0.407239819
0.856904232
0.807
6.91E−19
5.45E−18
−19.348
526
176
351


GTF2E2
0.533936652
0.768930958
1.546
1.02E−19
9.46E−19
−16.707
448
254
351


CANX
0.950226244
0.325167038
0.401
5.66E−22
7.87E−21
−13.839
298
406
352


NDUFA11
0.733031674
0.60857461
0.422
2.82E−22
4.21E−21
−13.536
277
427
352


UBE2N
0.850678733
0.472160356
0.31
5.39E−22
7.56E−21
−13.808
296
410
353


BAX
0.846153846
0.489420935
4.082
2.63E−23
4.63E−22
−13.004
236
473
354.5


IFRD1
0.764705882
0.565701559
0.669
3.58E−21
4.21E−20
−14.579
352
361
356.5


SDCBP2
0.352941176
0.890311804
0.971
1.38E−18
1.03E−17
−20.683
555
160
357.5


BIRC2
0.57918552
0.730512249
0.444
1.42E−19
1.29E−18
−16.549
456
260
358


MARC2
0.696832579
0.654231626
3.455
1.75E−23
3.15E−22
−12.853
231
485
358


RABGGTB
0.615384615
0.703786192
1.111
3.50E−20
3.54E−19
−15.695
410
311
360.5


QDPR
0.606334842
0.717706013
2.618
5.86E−21
6.64E−20
−14.657
366
356
361


LAMTOR4
0.656108597
0.675389755
2.776
2.91E−21
3.50E−20
−14.154
345
378
361.5


USMG5
0.868778281
0.466592428
2.884
4.90E−24
9.76E−23
−12.516
208
515
361.5


CUEDC2
0.57918552
0.732739421
4.581
7.26E−20
6.91E−19
−16.101
435
289
362


TSSC1
0.588235294
0.725501114
0.848
6.27E−20
6.03E−19
−16.02
431
294
362.5


GNB1
0.963800905
0.29844098
1.521
8.26E−22
1.11E−20
−13.733
309
416
362.5


TMEM254B
0.719457014
0.621380846
0.807
3.74E−22
5.45E−21
−13.326
284
445
364.5


CTLA4
0.936651584
0.413140312
2.685
1.42E−29
6.07E−28
−11.36
97
632
364.5


RILPL2
0.760180995
0.576837416
2.284
6.77E−22
9.20E−21
−13.577
305
426
365.5


WDR61
0.592760181
0.718819599
0.816
1.43E−19
1.30E−18
−16.362
458
274
366


SPRY2
0.533936652
0.762806236
0.872
7.08E−19
5.56E−18
−18.137
528
205
366.5


XPOT
0.466063348
0.815701559
0.411
6.17E−19
4.92E−18
−17.777
520
215
367.5


INF2
0.443438914
0.830734967
0.189
1.01E−18
7.72E−18
−18.451
544
192
368


GLUD1
0.814479638
0.511135857
0.379
2.19E−21
2.65E−20
−13.988
342
394
368


HCCS
0.461538462
0.819599109
1.753
5.04E−19
4.08E−18
−17.495
511
226
368.5


ACTR10
0.719457014
0.635300668
2.895
6.71E−24
1.31E−22
−12.403
213
524
368.5


ITGB1BP1
0.606334842
0.714922049
0.526
1.36E−20
1.46E−19
−14.84
388
351
369.5


BSG
0.932126697
0.357461024
0.575
3.86E−22
5.57E−21
−13.243
286
453
369.5


LIMS1
0.751131222
0.589643653
0.401
2.86E−22
4.25E−21
−13.109
278
465
371.5


BCAP29
0.615384615
0.69766147
3.131
2.06E−19
1.80E−18
−16.406
473
271
372


FARP1
0.488687783
0.798997773
0.299
5.70E−19
4.55E−18
−17.401
519
230
374.5


DGAT1
0.755656109
0.565144766
0.345
5.29E−20
5.13E−19
−15.472
427
323
375


MMD
0.687782805
0.635300668
0.485
4.70E−20
4.62E−19
−15.295
422
330
376


SSR3
0.764705882
0.556792873
4.001
3.62E−20
3.64E−19
−15.008
412
342
377


RHOF
0.705882353
0.630846325
2.272
1.18E−21
1.52E−20
−13.472
323
433
378


ZBTB32
0.43438914
0.83518931
1.084
2.09E−18
1.52E−17
−18.611
570
188
379


VDAC3
0.733031674
0.605790646
1.774
6.10E−22
8.42E−21
−13.233
304
454
379


SMS
0.733031674
0.595768374
1.333
9.26E−21
1.01E−19
−14.095
380
381
380.5


AKR1A1
0.918552036
0.384187082
7.662
1.02E−22
1.63E−21
−12.587
258
505
381.5


ACTN1
0.574660633
0.731069042
0.506
3.79E−19
3.16E−18
−16.472
498
267
382.5


ATP6V0B
0.628959276
0.693207127
3.939
2.22E−20
2.32E−19
−14.244
397
371
384


PTK2B
0.809954751
0.505567929
1.438
3.48E−20
3.53E−19
−14.567
409
362
385.5


REEP5
0.904977376
0.436525612
1.753
2.03E−26
5.42E−25
−11.423
155
618
386.5


CREM
0.678733032
0.658129176
2.531
9.14E−22
1.20E−20
−13.17
314
461
387.5


HK1
0.687782805
0.628062361
0.322
3.24E−19
2.72E−18
−16.255
495
283
389


EIF1AX
0.674208145
0.654788419
2.68
8.11E−21
8.92E−20
−13.883
377
401
389


RAP1A
0.769230769
0.56013363
1.864
4.22E−21
4.89E−20
−13.689
358
420
389


SEC61G
0.850678733
0.482182628
2.353
3.95E−23
6.77E−22
−12.302
242
536
389


SAR1B
0.674208145
0.64922049
1.705
3.81E−20
3.81E−19
−14.438
415
365
390


RNH1
0.751131222
0.58518931
1.077
9.65E−22
1.27E−20
−13.101
316
467
391.5


BMYC
0.43438914
0.834632517
1.098
2.55E−18
1.82E−17
−18.196
581
203
392


TMEM256
0.601809955
0.70935412
2.104
2.27E−19
1.95E−18
−15.827
481
306
393.5


NHP2
0.692307692
0.644766147
6.673
9.98E−22
1.30E−20
−13.062
318
471
394.5


TMEM135
0.57918552
0.723830735
0.251
1.02E−18
7.74E−18
−16.86
545
246
395.5


OTUB1
0.823529412
0.506124722
3.671
4.88E−22
6.90E−21
−12.63
293
499
396


MEA1
0.683257919
0.651447661
5.398
1.80E−21
2.22E−20
−13.176
336
458
397


SSBP1
0.755656109
0.570155902
1.281
1.45E−20
1.54E−19
−13.843
390
405
397.5


CYP51
0.538461538
0.758351893
0.687
9.01E−19
6.93E−18
−16.602
539
257
398


DCTN2
0.79638009
0.528953229
0.687
5.10E−21
5.84E−20
−13.445
362
436
399


TXNDC17
0.714932127
0.624164811
1.669
6.13E−22
8.42E−21
−12.67
303
496
399.5


PHB
0.787330317
0.542873051
2.43
1.99E−21
2.44E−20
−13.17
338
462
400


CISD3
0.538461538
0.760022272
2.973
5.38E−19
4.33E−18
−16.105
515
288
401.5


SDF4
0.923076923
0.371937639
4.852
3.85E−22
5.57E−21
−12.513
287
516
401.5


ETOHI1
0.78280543
0.520044543
0.202
2.19E−18
1.59E−17
−17.157
572
236
404


LDHA
0.63800905
0.662026726
11.313
1.19E−17
7.69E−17
−20.115
643
166
404.5


UQCR11
0.656108597
0.664253898
2.007
6.89E−20
6.60E−19
−14.167
433
376
404.5


MVP
0.452488688
0.824053452
1.993
1.07E−18
8.11E−18
−16.541
549
261
405


DENND4A
0.886877828
0.396436526
0.124
4.72E−19
3.85E−18
−15.873
509
303
406


DNAJC1
0.592760181
0.716592428
0.766
2.76E−19
2.35E−18
−15.39
487
325
406


RAB8B
0.819004525
0.501113586
1.05
7.14E−21
7.96E−20
−13.365
372
442
407


ABHD4
0.343891403
0.894766147
5.112
2.25E−18
1.62E−17
−16.901
575
242
408.5


PLK2
0.380090498
0.870267261
3.374
4.47E−18
3.09E−17
−17.608
600
221
410.5


MIF
0.941176471
0.315701559
6.412
2.87E−19
2.43E−18
−15.258
490
332
411


FBXW11
0.615384615
0.691536748
0.151
1.15E−18
8.63E−18
−16.351
552
277
414.5


SLC25A3
0.764705882
0.546213808
10.008
5.17E−19
4.17E−18
−15.605
514
315
414.5


XDH
0.533936652
0.759465479
0.411
1.98E−18
1.45E−17
−16.526
567
263
415


MLF2
0.719457014
0.610244989
3.862
8.04E−21
8.87E−20
−13.226
375
456
415.5


MRPL40
0.65158371
0.682628062
3.936
1.18E−21
1.52E−20
−12.54
322
514
418


PDIA4
0.63800905
0.678173719
2.128
1.54E−19
1.38E−18
−14.183
464
375
419.5


CD200R1
0.384615385
0.865812918
1.454
8.03E−18
5.31E−17
−17.97
627
213
420


GUK1
0.583710407
0.724387528
2.563
2.78E−19
2.36E−18
−14.688
488
354
421


OSTF1
0.755656109
0.547884187
9.286
3.86E−18
2.69E−17
−16.818
594
249
421.5


9530068E07RIK
0.737556561
0.594654788
0.496
3.54E−21
4.18E−20
−12.556
351
507
429


RAC1
0.918552036
0.378619154
0.872
4.21E−22
6.05E−21
−11.928
289
570
429.5


PLEKHB2
0.719457014
0.628062361
4.253
5.57E−23
9.24E−22
−11.492
250
609
429.5


DNAJB4
0.330316742
0.899777283
2.183
9.69E−18
6.30E−17
−17.561
638
222
430


IL21R
0.841628959
0.457126949
0.993
3.88E−19
3.22E−18
−14.583
500
360
430


SSR4
0.859728507
0.499443207
5.195
1.47E−26
4.09E−25
−10.771
149
712
430.5


SMYD5
0.352941176
0.884743875
1.157
1.54E−17
9.81E−17
−18.019
652
210
431


IQSEC1
0.764705882
0.542316258
0.422
1.34E−18
1.01E−17
−15.713
554
310
432


STX11
0.515837104
0.770044543
0.556
6.80E−18
4.55E−17
−16.858
620
247
433.5


GGH
0.443438914
0.827951002
2.856
2.74E−18
1.95E−17
−16.107
583
287
435


ATPIF1
0.678733032
0.652561247
4.168
4.49E−21
5.17E−20
−12.554
360
510
435


IDI1
0.475113122
0.79844098
0.367
2.03E−17
1.27E−16
−18.065
665
207
436


CNIH
0.660633484
0.653674833
3.426
3.86E−19
3.20E−18
−14.217
499
373
436


NAPSA
0.389140271
0.861358575
0.949
1.40E−17
8.99E−17
−17.416
646
229
437.5


BCL2A1C
0.642533937
0.669265033
1.036
5.67E−19
4.54E−18
−14.621
518
359
438.5


COMT
0.529411765
0.79064588
3.849
2.09E−22
3.18E−21
−11.528
272
605
438.5


APRT
0.787330317
0.547327394
1.189
6.16E−22
8.42E−21
−11.831
300
578
439


ATP10A
0.502262443
0.780066815
0.322
8.62E−18
5.68E−17
−16.799
629
253
441


ECH1
0.886877828
0.436525612
2.348
2.12E−23
3.79E−22
−11.212
232
650
441


SEPHS2
0.610859729
0.694877506
2.293
1.41E−18
1.05E−17
−15.315
556
328
442


GLRX3
0.755656109
0.571826281
4.684
9.36E−21
1.02E−19
−12.555
381
508
444.5


FDPS
0.570135747
0.731625835
3.992
1.00E−18
7.64E−18
−14.722
543
353
448


RNPEP
0.606334842
0.703786192
0.816
3.57E−19
2.98E−18
−13.896
496
400
448


TFG
0.601809955
0.706570156
0.74
5.04E−19
4.08E−18
−14.035
512
387
449.5


TNFRSF1B
0.877828054
0.453786192
4.715
5.90E−24
1.15E−22
−10.995
212
687
449.5


YWHAE
0.968325792
0.285634744
1.07
2.19E−21
2.65E−20
−12.026
343
558
450.5


GNG2
0.719457014
0.597438753
0.287
2.35E−19
2.02E−18
−13.659
482
422
452


TRPV2
0.701357466
0.615256125
4.401
2.71E−19
2.31E−18
−13.709
486
419
452.5


HSP90AB1
0.936651584
0.31013363
9.834
6.45E−18
4.33E−17
−16.101
618
290
454


XBP1
0.619909502
0.691536748
0.401
3.69E−19
3.08E−18
−13.808
497
411
454


MRPS36
0.619909502
0.690979955
1.454
4.31E−19
3.54E−18
−13.853
505
404
454.5


STRAP
0.79638009
0.526726058
1.674
9.02E−21
9.87E−20
−12.345
379
530
454.5


EIF4E
0.696832579
0.624721604
4.205
7.35E−20
6.99E−19
−12.983
436
474
455


MXI1
0.371040724
0.871937639
0.669
2.46E−17
1.50E−16
−17.254
679
234
456.5


ECE1
0.429864253
0.832962138
2.101
1.46E−17
9.30E−17
−16.498
648
265
456.5


GM17745
0.42081448
0.837416481
1.305
2.95E−17
1.77E−16
−17.498
689
225
457


CHCHD10
0.556561086
0.739977728
4.486
2.50E−18
1.79E−17
−15.187
580
336
458


BIN1
0.78280543
0.552895323
3.067
5.40E−22
7.56E−21
−11.382
295
623
459


TDG
0.411764706
0.845768374
3.288
1.45E−17
9.30E−17
−16.362
649
275
462


TMEM30A
0.701357466
0.60467706
0.356
4.04E−18
2.81E−17
−15.306
596
329
462.5


ACP1
0.633484163
0.680400891
4.013
2.64E−19
2.25E−18
−13.396
485
441
463


PEDN1
0.619909502
0.694877506
3.848
1.43E−19
1.30E−18
−13.078
457
469
463


SEPT9
0.85520362
0.458797327
4.236
3.69E−21
4.31E−20
−11.881
355
575
465


PSMB4
0.868778281
0.474944321
6.441
5.11E−25
1.15E−23
−10.579
184
746
465


PA2G4
0.787330317
0.526726058
2.995
1.22E−19
1.12E−18
−12.943
454
477
465.5


P4HB
0.936651584
0.365256125
0.526
7.55E−24
1.45E−22
−10.729
216
718
467


CCDC6
0.642533937
0.665367483
0.124
1.62E−18
1.20E−17
−14.167
561
377
469


BC004004
0.660633484
0.658685969
0.888
9.88E−20
9.19E−19
−12.722
446
493
469.5


COPS6
0.850678733
0.475501114
1.47
2.27E−22
3.44E−21
−11.122
274
665
469.5


SLC25A11
0.7239819
0.60857461
4.59
3.62E−21
4.26E−20
−11.666
353
588
470.5


IGBP1
0.742081448
0.594654788
4.527
9.84E−22
1.29E−20
−11.378
317
626
471.5


SSR1
0.895927602
0.404231626
1.007
3.13E−21
3.75E−20
−11.611
346
598
472


TRAF4
0.429864253
0.832962138
2.223
1.46E−17
9.30E−17
−15.968
650
297
473.5


HSD17B12
0.552036199
0.739420935
0.444
8.88E−18
5.84E−17
−15.594
631
317
474


COX6C
0.936651584
0.374164811
6.362
7.14E−25
1.56E−23
−10.522
190
758
474


RIOK1
0.619909502
0.685412027
1.17
2.01E−18
1.46E−17
−14.086
569
383
476


RNASEK
0.805429864
0.530066815
1.379
2.49E−22
3.75E−21
−11.025
275
680
477.5


SYTL2
0.502262443
0.776726058
0.345
2.41E−17
1.48E−16
−16.283
674
282
478


ORMDL1
0.606334842
0.69766147
0.444
2.00E−18
1.46E−17
−14.009
568
388
478


FXR1
0.56561086
0.729955457
2.401
4.96E−18
3.40E−17
−14.748
605
352
478.5


UGP2
0.497737557
0.781737194
1.642
1.55E−17
9.85E−17
−15.839
653
305
479


PTMS
0.524886878
0.763919822
1.208
4.77E−18
3.28E−17
−14.634
602
358
480


LAMP2
0.547511312
0.744432071
1.982
6.23E−18
4.19E−17
−14.908
617
347
482


HSPD1
0.846153846
0.458797327
1.585
6.50E−20
6.23E−19
−12.229
432
538
485


HSBP1
0.746606335
0.582962138
0.856
6.34E−21
7.12E−20
−11.562
368
602
485


PDIA3
0.945701357
0.331291759
6.479
9.01E−22
1.19E−20
−11.136
313
660
486.5


ITFG1
0.556561086
0.736636971
1.807
6.60E−18
4.42E−17
−14.66
619
355
487


PERP
0.371040724
0.869153675
1.195
7.32E−17
4.13E−16
−16.896
735
244
489.5


EXT2
0.542986425
0.751113586
1.844
2.62E−18
1.87E−17
−13.946
582
397
489.5


TRAF1
0.859728507
0.444877506
4.098
2.73E−20
2.82E−19
−11.745
401
581
491


NDUFB8
0.805429864
0.527839644
1.922
4.48E−22
6.36E−21
−10.949
292
691
491.5


TCP1
0.900452489
0.426503341
4.705
1.85E−24
3.82E−23
−10.352
201
782
491.5


NFKBIE
0.470588235
0.804008909
0.911
1.02E−17
6.64E−17
−14.917
639
346
492.5


HIF1A
0.800904977
0.521158129
4.03
9.86E−21
1.06E−19
−11.55
384
603
493.5


ATP5G1
0.837104072
0.488864143
4.903
5.86E−22
8.13E−21
−10.995
299
688
493.5


PLEKHO2
0.57918552
0.712694878
0.367
2.34E−17
1.45E−16
−15.604
672
316
494


SIN3B
0.814479638
0.518930958
1.422
2.87E−22
4.25E−21
−10.781
280
710
495


EDF1
0.936651584
0.36636971
6.946
5.63E−24
1.12E−22
−10.298
209
787
498


PRDX1
0.977375566
0.29064588
5.303
4.29E−24
8.60E−23
−10.269
207
790
498.5


VPS35
0.615384615
0.68986637
0.705
1.82E−18
1.34E−17
−13.455
565
434
499.5


RFC2
0.733031674
0.593541203
0.766
1.67E−20
1.75E−19
−11.507
395
606
500.5


EIF2S3X
0.819004525
0.494988864
0.367
3.32E−20
3.39E−19
−11.607
405
599
502


MRPS33
0.669683258
0.650334076
4.597
9.21E−20
8.60E−19
−11.993
444
562
503


CNOT3
0.583710407
0.708797327
0.39
2.30E−17
1.43E−16
−15.029
669
341
505


ELK3
0.592760181
0.706013363
0.263
5.55E−18
3.76E−17
−13.924
612
398
505


JAK3
0.914027149
0.376948775
0.651
3.66E−21
4.28E−20
−11.151
354
657
505.5


IRF5
0.34841629
0.883073497
3.694
1.01E−16
5.61E−16
−16.505
750
264
507


NFS1
0.470588235
0.800111359
0.864
3.60E−17
2.14E−16
−15.516
698
320
509


STK24
0.656108597
0.652004454
1.803
1.89E−18
1.38E−17
−13.25
566
452
509


AGPAT4
0.56561086
0.722717149
0.263
3.80E−17
2.25E−16
−15.531
701
319
510


FAM162A
0.63800905
0.671492205
2.198
9.69E−19
7.44E−18
−12.924
540
480
510


NDUFA9
0.71040724
0.621380846
5.242
4.67E−21
5.36E−20
−11.14
361
659
510


NDUFB7
0.868778281
0.478285078
2.186
2.04E−25
4.95E−24
−10.011
171
854
512.5


BRI3BP
0.520361991
0.777839644
2.924
1.87E−19
1.65E−18
−12.044
472
556
514


DYNLT1C
0.619909502
0.681514477
3.331
5.76E−18
3.90E−17
−13.725
613
417
515


TMEM173
0.692307692
0.625835189
6.206
1.80E−19
1.59E−18
−11.951
469
566
517.5


DPYSL2
0.923076923
0.35467706
0.526
2.84E−20
2.92E−19
−11.329
403
635
519


LCP1
0.932126697
0.328507795
9.251
4.84E−19
3.93E−18
−12.345
510
531
520.5


TIMM8B
0.466063348
0.804565702
3.967
2.60E−17
1.58E−16
−14.459
682
364
523


ARHGAP18
0.416289593
0.836859688
0.496
1.10E−16
6.03E−16
−16.043
756
292
524


KLRK1
0.760180995
0.561247216
1.064
4.09E−20
4.07E−19
−11.367
417
631
524


RAB14
0.728506787
0.58908686
1.824
1.82E−19
1.60E−18
−11.771
470
579
524.5


PIGP
0.479638009
0.793986637
1.257
2.77E−17
1.67E−16
−14.466
687
363
525


SERINC1
0.701357466
0.606904232
2.578
2.31E−18
1.66E−17
−13.057
578
472
525


UQCRC2
0.79638009
0.543429844
6.406
1.13E−22
1.79E−21
−10.287
261
789
525


CYFIP1
0.393665158
0.853006682
1.036
1.00E−16
5.55E−16
−15.898
749
302
525.5


MFSD1
0.597285068
0.70935412
1.029
7.10E−19
5.56E−18
−12.419
529
522
525.5


PPIP5K2
0.475113122
0.793429844
0.111
9.81E−17
5.45E−16
−15.815
747
307
527


GATAD1
0.547511312
0.739977728
0.485
2.25E−17
1.40E−16
−13.999
667
390
528.5


ILK
0.800904977
0.53674833
5.389
1.69E−22
2.63E−21
−10.267
266
791
528.5


SNRPA
0.714932127
0.59298441
1.506
2.36E−18
1.69E−17
−12.937
579
479
529


PHPT1
0.470588235
0.79844098
1.709
6.10E−17
3.47E−16
−15.224
728
334
531


GLRX
0.755656109
0.588530067
3.046
1.04E−22
1.65E−21
−10.201
260
803
531.5


CSTB
0.65158371
0.660356347
1.628
6.39E−19
5.08E−18
−12.185
522
542
532


MRPS2
0.307692308
0.905345212
4.43
3.63E−16
1.82E−15
−16.9
824
243
533.5


RSL24D1
0.71040724
0.596325167
0.986
3.27E−18
2.30E−17
−12.943
589
478
533.5


ADAM17
0.533936652
0.752783964
2.124
1.46E−17
9.30E−17
−13.689
647
421
534


ODC1
0.755656109
0.548997773
1.93
2.95E−18
2.09E−17
−12.707
584
494
539


DUSP14
0.330316742
0.892538976
5.323
2.30E−16
1.20E−15
−16.063
791
291
541


MTDH
0.65158371
0.659242762
3.239
8.62E−19
6.64E−18
−12.105
538
550
544


MRPL33
0.751131222
0.574610245
2.738
1.61E−20
1.70E−19
−10.843
392
701
546.5


PSMA4
0.828054299
0.523385301
3.329
1.09E−24
2.30E−23
−9.744
196
897
546.5


GPI1
0.954751131
0.295657016
7.555
1.15E−19
1.05E−18
−11.256
451
645
548


NSUN2
0.556561086
0.731625835
1.876
2.75E−17
1.66E−16
−13.79
686
412
549


SEC11C
0.900452489
0.408685969
5.759
1.95E−22
3.00E−21
−10.105
270
830
550


YWHAH
0.886877828
0.451002227
4.974
4.45E−25
1.02E−23
−9.634
180
921
550.5


REXO2
0.628959276
0.677616927
2.642
1.77E−18
1.30E−17
−12.204
562
541
551.5


LMAN1
0.411764706
0.839643653
1.22
1.28E−16
6.98E−16
−14.937
761
345
553


LAMTOR5
0.683257919
0.629732739
4.233
6.71E−19
5.30E−18
−11.701
525
584
554.5


ANXA4
0.393665158
0.850222717
0.546
2.72E−16
1.41E−15
−15.638
797
313
555


MRPL17
0.542986425
0.748329621
1.709
5.99E−18
4.04E−17
−12.621
615
500
557.5


CERS6
0.334841629
0.888641425
1.287
3.55E−16
1.79E−15
−15.916
822
300
561
















TABLE 2







Ranked top transcription factors differentially expressed in cluster 7
















Gene
TP
TN
thresh_mhg
hyper _pval
hyper _qval
gen_qval
rank_hyper_qval
rank_gen_qval
mean_rank



















IRF8
0.886877828
0.663697105
1.501
4.18E−58
1.63E−55
−60.58
1
1
1


RBPJ
0.873303167
0.632516704
2.763
4.56E−49
5.93E−47
−42.315
3
2
2.5


LITAF
0.936651584
0.551224944
5.893
1.35E−49
2.64E−47
−41.548
2
3
2.5


IKZF2
0.719457014
0.737193764
0.084
6.49E−40
5.06E−38
−37.446
5
6
5.5


BHLHE40
0.977375566
0.422048998
6.796
6.69E−41
6.53E−39
−32.471
4
7
5.5


EPAS1
0.529411765
0.863585746
0.986
5.63E−37
3.14E−35
−38.289
7
5
6


MNDA
0.470588235
0.889755011
4.02
1.12E−34
4.86E−33
−40.982
9
4
6.5


NR4A2
0.914027149
0.498886414
0.595
2.62E−36
1.28E−34
−31.351
8
8
8


TOX
0.904977376
0.520044543
3.455
1.76E−37
1.14E−35
−23.972
6
15
10.5


ID2
0.972850679
0.341314031
4.705
6.02E−29
1.96E−27
−29
12
10
11


NFKB2
0.846153846
0.561804009
2.359
1.52E−32
5.92E−31
−25.366
10
12
11


STAT3
0.909502262
0.43596882
6.143
3.70E−27
1.03E−25
−29.78
14
9
11.5


UHRF2
0.542986425
0.809576837
0.971
2.48E−27
7.44E−26
−27.41
13
11
12


ZMIZ1
0.751131222
0.655345212
0.214
4.47E−31
1.58E−29
−24.896
11
14
12.5


TRPS1
0.511312217
0.829064588
0.986
9.50E−27
2.47E−25
−25.165
15
13
14


KDM2B
0.678733032
0.677616927
0.88
2.60E−24
5.97E−23
−21.433
17
16
16.5


RUNX2
0.78280543
0.582962138
1.546
1.20E−25
2.92E−24
−20.044
16
18
17


RBL2
0.832579186
0.511135857
0.239
6.99E−24
1.43E−22
−19.557
19
20
19.5


NCOR2
0.656108597
0.688752784
0.239
5.32E−23
9.02E−22
−19.689
23
19
21


CALR
0.923076923
0.384187082
2.844
1.64E−23
2.91E−22
−18.422
22
22
22


SMARCB1
0.65158371
0.698218263
4.752
9.93E−24
1.84E−22
−17.095
21
26
23.5


UTF1
0.330316742
0.909242762
4.976
1.01E−19
1.17E−18
−21.255
34
17
25.5


SREBF2
0.733031674
0.597438753
0.138
5.92E−21
7.97E−20
−16.921
29
27
28


COMMD3
0.733031674
0.60467706
3.134
8.29E−22
1.24E−20
−15.961
26
30
28


HDAC1
0.837104072
0.505011136
2.214
8.21E−24
1.60E−22
−14.043
20
36
28


FUBP1
0.733031674
0.605233853
0.31
7.11E−22
1.11E−20
−14.855
25
35
30


GTF2E2
0.533936652
0.768930958
1.546
1.02E−19
1.17E−18
−16.707
33
28
30.5


SPRY2
0.533936652
0.762806236
0.872
7.08E−19
6.91E−18
−18.137
40
23
31.5


ZBTB32
0.43438914
0.83518931
1.084
2.09E−18
1.89E−17
−18.611
43
21
32


DENND4A
0.886877828
0.396436526
0.124
4.72E−19
4.72E−18
−15.873
39
31
35


CREM
0.678733032
0.658129176
2.531
9.14E−22
1.32E−20
−13.17
27
44
35.5


SMYD5
0.352941176
0.884743875
1.157
1.54E−17
1.25E−16
−18.019
48
24
36


PHB
0.787330317
0.542873051
2.43
1.99E−21
2.77E−20
−13.17
28
45
36.5


MXI1
0.371040724
0.871937639
0.669
2.46E−17
1.88E−16
−17.254
51
25
38


XBP1
0.619909502
0.691536748
0.401
3.69E−19
3.78E−18
−13.808
38
39
38.5


NFKBIE
0.470588235
0.804008909
0.911
1.02E−17
8.49E−17
−14.917
47
34
40.5


CNOT3
0.583710407
0.708797327
0.39
2.30E−17
1.79E−16
−15.029
50
33
41.5


PFDN1
0.619909502
0.694877506
3.848
1.43E−19
1.55E−18
−13.078
36
47
41.5


PA2G4
0.787330317
0.526726058
2.995
1.22E−19
1.36E−18
−12.943
35
48
41.5


ELK3
0.592760181
0.706013363
0.263
5.55E−18
4.71E−17
−13.924
46
38
42


IRF5
0.34841629
0.883073497
3.694
1.01E−16
7.06E−16
−16.505
56
29
42.5


GATAD1
0.547511312
0.739977728
0.485
2.25E−17
1.79E−16
−13.999
49
37
43


EDF1
0.936651584
0.36636971
6.946
5.63E−24
1.22E−22
−10.298
18
71
44.5


HSBP1
0.746606335
0.582962138
0.856
6.34E−21
8.24E−20
−11.562
30
60
45


HIF1A
0.800904977
0.521158129
4.03
9.86E−21
1.24E−19
−11.55
31
61
46


HIVEP1
0.561085973
0.723830735
0.151
8.15E−17
5.78E−16
−13.609
54
41
47.5


SMYD2
0.393665158
0.847995546
0.623
5.95E−16
3.57E−15
−15.744
65
32
48.5


HTATIP2
0.533936652
0.743875278
0.748
1.86E−16
1.25E−15
−13.747
58
40
49


MORF4L2
0.656108597
0.648106904
2.157
5.22E−18
4.53E−17
−12.013
45
57
51


TBX21
0.628959276
0.66481069
5.5
5.13E−17
3.85E−16
−12.289
52
52
52


MED7
0.457013575
0.802895323
1.536
3.88E−16
2.40E−15
−13.313
63
43
53


C1D
0.601809955
0.688195991
0.516
7.59E−17
5.58E−16
−12.182
53
54
53.5


TMF1
0.701357466
0.592427617
0.202
8.04E−17
5.78E−16
−12.152
55
55
55


CSDA
0.570135747
0.707126949
0.926
8.49E−16
4.94E−15
−12.863
67
49
58


MED14
0.50678733
0.753340757
0.151
6.71E−15
3.44E−14
−13.341
76
42
59


NFAT5
0.592760181
0.683184855
0.124
2.17E−15
1.17E−14
−12.415
72
51
61.5


IKZF3
0.846153846
0.448218263
0.287
8.29E−19
7.89E−18
−8.956
41
84
62.5


GNPTAB
0.479638009
0.772828508
0.275
1.39E−14
6.79E−14
−13.104
80
46
63


NFIL3
0.398190045
0.83908686
3.536
4.07E−15
2.17E−14
−12.098
73
56
64.5


MYEF2
0.547511312
0.718262806
0.454
7.72E−15
3.91E−14
−12.214
77
53
65


PHB2
0.819004525
0.486636971
2.284
2.58E−19
2.72E−18
−8.162
37
93
65


ZRANB2
0.497737557
0.761135857
1.227
6.12E−15
3.18E−14
−11.704
75
58
66.5


NT5C
0.819004525
0.494988864
4.353
3.32E−20
4.05E−19
−7.857
32
101
66.5


ZMAT2
0.647058824
0.640868597
3.444
2.83E−16
1.84E−15
−9.591
60
79
69.5


RNF14
0.502262443
0.754454343
0.465
1.35E−14
6.68E−14
−11.308
79
65
72


ZC3H15
0.755656109
0.528953229
1.208
3.20E−16
2.05E−15
−9.211
61
83
72


COPS2
0.50678733
0.748329621
0.918
2.52E−14
1.20E−13
−11.369
82
64
73


NFKBIB
0.7239819
0.559576837
1
7.08E−16
4.19E−15
−9.252
66
82
74


PPIE
0.552036199
0.722717149
4.715
8.90E−16
5.10E−15
−9.543
68
81
74.5


TFDP1
0.502262443
0.751113586
1.401
3.25E−14
1.51E−13
−11.08
84
67
75.5


NR4A3
0.583710407
0.683184855
0.678
1.59E−14
7.67E−14
−10.379
81
70
75.5


YAF2
0.334841629
0.871937639
4.538
1.85E−13
7.68E−13
−11.498
94
62
78


FUBP3
0.398190045
0.820712695
0.604
1.08E−12
3.94E−12
−12.6
107
50
78.5


AEBP2
0.56561086
0.695434298
0.111
4.19E−14
1.83E−13
−10.681
89
68
78.5


TSG101
0.642533937
0.625835189
0.766
2.63E−14
1.24E−13
−10.099
83
74
78.5


GTF2E1
0.334841629
0.868596882
2.88
5.70E−13
2.18E−12
−11.665
102
59
80.5


PHF15
0.475113122
0.765033408
0.444
2.90E−13
1.17E−12
−11.133
97
66
81.5


SND1
0.597285068
0.667037862
3.206
3.82E−14
1.73E−13
−9.617
86
78
82


RBX1
0.805429864
0.496659243
4.931
1.13E−18
1.05E−17
−6.842
42
122
82


TCERG1
0.583710407
0.668151448
0.287
5.07E−13
1.98E−12
−10.298
100
72
86


MYBBP1A
0.63800905
0.623051225
0.791
1.28E−13
5.44E−13
−9.591
92
80
86


FOSL2
0.647058824
0.620267261
3.109
3.42E−14
1.57E−13
−8.551
85
88
86.5


SKIL
0.864253394
0.391982183
0.202
1.37E−15
7.65E−15
−7.736
70
104
87


VAMP7
0.411764706
0.807349666
1.531
2.45E−12
8.37E−12
−11.4
114
63
88.5


CAND1
0.475113122
0.757238307
0.189
2.03E−12
7.00E−12
−10.627
113
69
91


NDUFA13
0.972850679
0.288975501
4.019
8.55E−23
1.39E−21
−5.73
24
160
92


SSRP1
0.737556561
0.526726058
1.632
3.85E−14
1.73E−13
−7.942
87
99
93


STAT4
0.823529412
0.45545657
4.008
1.04E−16
7.09E−16
−6.4
57
132
94.5


SARNP
0.773755656
0.506124722
3.952
5.80E−16
3.53E−15
−6.629
64
126
95


KEAP1
0.457013575
0.771158129
0.356
2.58E−12
8.68E−12
−9.681
116
77
96.5


FLI1
0.791855204
0.448775056
0.275
1.15E−12
4.15E−12
−8.951
108
85
96.5


BTF3
0.941176471
0.279510022
9.346
1.26E−15
7.10E−15
−6.717
69
124
96.5


GTF2H5
0.619909502
0.635300668
1.459
3.95E−13
1.56E−12
−8.07
99
96
97.5


RUVBL2
0.497737557
0.739420935
2.441
1.53E−12
5.42E−12
−8.425
110
91
100.5


PRDM1
0.34841629
0.849109131
2.993
1.22E−11
3.74E−11
−10.067
127
75
101


DDX54
0.610859729
0.63752784
0.379
1.55E−12
5.44E−12
−8.125
111
94
102.5


RUNX3
0.791855204
0.457126949
0.926
2.21E−13
8.99E−13
−7.477
96
109
102.5


CCNT2
0.470588235
0.756124722
1.903
6.54E−12
2.13E−11
−8.809
120
87
103.5


FLII
0.669683258
0.58518931
2.915
5.12E−13
1.98E−12
−7.567
101
107
104


SMARCC2
0.642533937
0.609131403
0.214
9.69E−13
3.56E−12
−7.738
106
103
104.5


HCLS1
0.904977376
0.36247216
4.471
3.12E−18
2.76E−17
−5.668
44
165
104.5


ARNT
0.447963801
0.766703786
0.31
4.45E−11
1.27E−10
−9.974
136
76
106


MLX
0.50678733
0.724387528
0.722
8.18E−12
2.57E−11
−8.452
124
89
106.5


MKI67IP
0.466063348
0.755011136
0.566
2.05E−11
6.20E−11
−8.911
129
86
107.5


ERH
0.873303167
0.388641425
5.282
2.03E−16
1.34E−15
−5.735
59
157
108


ZBTB1
0.402714932
0.799554566
0.367
1.09E−10
2.94E−10
−10.1
144
73
108.5


MED28
0.778280543
0.486636971
0.333
1.29E−14
6.45E−14
−6.088
78
139
108.5


HMGB3
0.457013575
0.760022272
0.444
3.68E−11
1.09E−10
−8.432
132
90
111


RARA
0.438914027
0.776169265
0.566
2.76E−11
8.27E−11
−8.247
130
92
111


RUVBL1
0.520361991
0.704899777
0.485
4.26E−11
1.23E−10
−8.11
135
95
115


KDM5C
0.574660633
0.65701559
0.444
3.14E−11
9.34E−11
−7.773
131
102
116.5


IRF2
0.678733032
0.573496659
0.356
8.58E−13
3.19E−12
−6.591
105
128
116.5


YBX1
0.760180995
0.463808463
0.029
6.17E−11
1.72E−10
−7.488
140
108
124


MAX
0.515837104
0.706570156
1.138
6.96E−11
1.93E−10
−7.405
141
110
125.5


POLR1E
0.2760181
0.889198218
5.21
3.11E−10
7.78E−10
−8.022
156
97
126.5


BOLA2
0.520361991
0.703786192
2.444
5.38E−11
1.52E−10
−7.054
138
116
127


AES
0.683257919
0.545100223
0.422
8.39E−11
2.29E−10
−7.207
143
114
128.5


GTF3A
0.592760181
0.646993318
0.848
7.65E−12
2.43E−11
−6.216
122
135
128.5


GTF2F1
0.642533937
0.599665924
3.216
6.63E−12
2.14E−11
−6.206
121
136
128.5


LRRF1P1
0.895927602
0.330734967
1.632
5.98E−14
2.59E−13
−5.435
90
170
130


RPL7L1
0.529411765
0.704899777
3.505
7.60E−12
2.43E−11
−5.987
123
140
131.5


RNF166
0.601809955
0.630289532
3.454
3.90E−11
1.14E−10
−6.417
133
131
132


CBX3
0.986425339
0.185412027
1.07
5.34E−15
2.81E−14
−4.911
74
191
132.5


GTF2B
0.588235294
0.642538976
1.86
4.44E−11
1.27E−10
−6.586
137
129
133


ZBTB17
0.343891403
0.841314031
4.305
2.64E−10
6.70E−10
−7.265
154
113
133.5


PLAGL2
0.380090498
0.80623608
0.506
1.54E−09
3.56E−09
−7.878
169
100
134.5


MED17
0.384615385
0.808463252
2.128
3.96E−10
9.78E−10
−7.307
158
111
134.5


TBPL1
0.511312217
0.706013363
0.401
1.79E−10
4.71E−10
−6.725
148
123
135.5


REL
0.574660633
0.645879733
0.506
2.77E−10
6.97E−10
−7.026
155
117
136


MORF4L1
0.850678733
0.393652561
3.668
4.01E−14
1.78E−13
−5.141
88
184
136


ARID5B
0.484162896
0.718819599
0.367
1.67E−09
3.83E−09
−7.671
170
106
138


DR1
0.461538462
0.742761693
0.864
6.95E−10
1.64E−09
−7.307
165
112
138.5


HCFC1
0.683257919
0.563474388
0.731
2.53E−12
8.59E−12
−5.699
115
163
139


RNPS1
0.778280543
0.471046771
0.748
3.16E−13
1.26E−12
−5.239
98
180
139


MED24
0.411764706
0.776726058
0.31
3.60E−09
7.83E−09
−7.686
179
105
142


TOX4
0.574660633
0.659799555
2.004
1.78E−11
5.44E−11
−5.76
128
156
142


FUS
0.737556561
0.503340757
1.556
4.21E−12
1.39E−11
−5.641
118
166
142


SMARCA5
0.42081448
0.772828508
0.444
1.68E−09
3.83E−09
−7.181
171
115
143


CNOT8
0.561085973
0.662583519
0.322
1.27E−10
3.38E−10
−5.879
147
145
146


DEK
0.823529412
0.420935412
0.422
1.58E−13
6.64E−13
−4.677
93
202
147.5


LZTR1
0.371040724
0.801224944
0.496
2.27E−08
4.39E−08
−7.993
202
98
150


BAZ1A
0.497737557
0.702672606
0.379
3.69E−09
7.97E−09
−6.864
180
120
150


VGLL4
0.633484163
0.589643653
1.454
2.47E−10
6.34E−10
−5.836
152
149
150.5


PURB
0.547511312
0.654231626
0.176
6.32E−09
1.33E−08
−6.933
186
119
152.5


SUB1
0.972850679
0.198775056
4.258
2.15E−13
8.83E−13
−4.325
95
210
152.5


SQSTM1
0.932126697
0.275055679
7.008
8.09E−14
3.47E−13
−4.191
91
218
154.5


FOXN3
0.484162896
0.714365256
0.367
3.95E−09
8.46E−09
−6.397
182
133
157.5


PTTG1
0.669683258
0.580734967
4.849
1.28E−12
4.56E−12
−4.408
109
209
159


DEDD
0.416289593
0.763919822
0.356
2.24E−08
4.35E−08
−7.023
201
118
159.5


TARDBP
0.63800905
0.594654788
0.556
4.20E−11
1.22E−10
−5.107
134
185
159.5


CDC5L
0.520361991
0.684855234
3.622
2.28E−09
5.09E−09
−5.844
175
147
161


SMARCE1
0.719457014
0.5077951
0.832
7.11E−11
1.95E−10
−5.182
142
182
162


NOTCH2
0.714932127
0.500556793
0.202
6.02E−10
1.44E−09
−5.706
163
162
162.5


UBTF
0.57918552
0.638084633
1.526
5.36E−10
1.30E−09
−5.679
161
164
162.5


CNOT7
0.479638009
0.719933185
1.891
2.91E−09
6.45E−09
−5.83
176
150
163


RELB
0.497737557
0.699888641
0.444
6.19E−09
1.31E−08
−5.947
185
142
163.5


RLIM
0.696832579
0.522828508
0.731
3.57E−10
8.86E−10
−5.428
157
171
164


GTF3C1
0.533936652
0.669821826
0.098
3.70E−09
7.97E−09
−5.839
181
148
164.5


DTX3
0.416289593
0.761135857
0.345
3.83E−08
7.29E−08
−6.681
205
125
165


PSMC3
0.841628959
0.427616927
1.655
3.41E−16
2.14E−15
−3.255
62
268
165


CCNH
0.50678733
0.69766147
1.151
2.06E−09
4.61E−09
−5.732
174
158
166


STAT5A
0.606334842
0.60467706
0.014
1.99E−09
4.49E−09
−5.721
173
161
167


GTF2F2
0.461538462
0.719376392
0.993
6.16E−08
1.14E−07
−6.629
210
127
168.5


HMG20B
0.479638009
0.703786192
0.687
5.63E−08
1.05E−07
−6.421
208
130
169


BCLAF1
0.678733032
0.535634744
0.379
1.03E−09
2.42E−09
−5.391
166
173
169.5


CNOT1
0.624434389
0.599665924
0.151
2.03E−10
5.30E−10
−4.943
149
190
169.5


NFATC1
0.85520362
0.373051225
0.475
7.80E−13
2.93E−12
−3.762
104
235
169.5


VPS72
0.511312217
0.687082405
1.269
6.87E−09
1.42E−08
−5.822
189
151
170


MTA2
0.841628959
0.386414254
0.651
1.69E−12
5.89E−12
−3.939
112
229
170.5


ECD
0.488687783
0.703786192
0.151
1.34E−08
2.63E−08
−5.915
198
144
171


MBD1
0.352941176
0.804565702
0.367
2.25E−07
3.96E−07
−6.864
222
121
171.5


NMI
0.65158371
0.575723831
0.864
1.15E−10
3.10E−10
−4.687
145
201
173


KAT5
0.375565611
0.790089087
0.379
1.02E−07
1.87E−07
−6.288
213
134
173.5


E2F4
0.429864253
0.747772829
0.465
5.35E−08
1.01E−07
−5.982
207
141
174


COPS5
0.502262443
0.702115813
3.004
1.90E−09
4.31E−09
−5.288
172
178
175


CBX4
0.407239819
0.764476615
0.138
8.67E−08
1.60E−07
−5.935
212
143
177.5


TCEA1
0.787330317
0.424832962
0.623
2.51E−10
6.41E−10
−4.522
153
206
179.5


TSC22D4
0.647058824
0.576280624
2.632
2.39E−10
6.17E−10
−4.423
151
208
179.5


TARBP2
0.357466063
0.803452116
0.941
1.36E−07
2.47E−07
−5.877
215
146
180.5


NFKBIA
0.923076923
0.30623608
5.781
2.11E−15
1.16E−14
−2.701
71
292
181.5


RNF4
0.705882353
0.516146993
3.663
2.14E−10
5.57E−10
−4.246
150
215
182.5


GTF2H2
0.343891403
0.81013363
0.848
3.19E−07
5.46E−07
−6.141
228
138
183


RBBP4
0.79638009
0.432071269
0.202
1.01E−11
3.14E−11
−3.694
125
241
183


UBXN4
0.56561086
0.649777283
3.402
6.58E−10
1.57E−09
−4.587
164
204
184


DPF2
0.583710407
0.619710468
0.782
6.55E−09
1.37E−08
−5.145
187
183
185


EGR1
0.597285068
0.609131403
1.485
4.23E−09
9.02E−09
−4.95
183
189
186


ZFPL1
0.321266968
0.830734967
1.043
1.92E−07
3.41E−07
−5.793
219
154
186.5


SERTAD2
0.470588235
0.704899777
0.202
1.85E−07
3.31E−07
−5.773
218
155
186.5


UBE2K
0.601809955
0.605790646
0.566
3.53E−09
7.73E−09
−4.757
178
198
188


HDAC3
0.524886878
0.662026726
0.475
6.10E−08
1.14E−07
−5.502
209
168
188.5


ATF6B
0.443438914
0.727728285
0.299
2.30E−07
4.03E−07
−5.731
223
159
191


ANAPC11
0.556561086
0.644766147
1.795
7.50E−09
1.54E−08
−4.786
190
195
192.5


EYA3
0.343891403
0.807906459
0.275
4.91E−07
8.21E−07
−5.802
233
153
193


UTP6
0.34841629
0.797884187
0.202
1.57E−06
2.44E−06
−6.145
251
137
194


ZHX1
0.343891403
0.80623608
0.322
6.73E−07
1.10E−06
−5.822
239
152
195.5


SCAP
0.357466063
0.799554566
0.151
2.91E−07
5.01E−07
−5.584
226
167
196.5


MED27
0.393665158
0.772271715
0.757
1.71E−07
3.08E−07
−5.315
217
177
197


TCF25
0.850678733
0.343541203
0.39
5.43E−10
1.31E−09
−3.774
162
233
197.5


CCNT1
0.457013575
0.712138085
0.556
4.17E−07
7.07E−07
−5.456
230
169
199.5


MED15
0.570135747
0.628619154
2.667
1.31E−08
2.60E−08
−4.651
197
203
200


NPM1
0.932126697
0.232182628
9.342
4.25E−10
1.04E−09
−3.692
160
242
201


NR1H2
0.642533937
0.560690423
1.475
7.76E−09
1.59E−08
−4.264
191
212
201.5


PHRF1
0.488687783
0.68596882
0.263
2.81E−07
4.87E−07
−5.248
225
179
202


EIF3H
0.986425339
0.152004454
7.886
1.07E−11
3.30E−11
−2.924
126
280
203


TFAM
0.384615385
0.774498886
0.227
4.58E−07
7.70E−07
−5.33
232
176
204


AIP
0.674208145
0.545657016
1.501
4.08E−10
1.00E−09
−3.528
159
250
204.5


ATF1
0.429864253
0.739420935
0.39
2.38E−07
4.15E−07
−5.067
224
187
205.5


GLRX2
0.402714932
0.778953229
3.191
1.07E−08
2.14E−08
−4.243
195
216
205.5


NR3C1
0.452488688
0.719933185
0.138
2.23E−07
3.93E−07
−4.898
221
192
206.5


CHD4
0.746606335
0.450445434
0.202
7.90E−09
1.61E−08
−4.171
192
222
207


KAT2A
0.321266968
0.824053452
0.506
7.40E−07
1.20E−06
−5.388
241
174
207.5


TBC1D2B
0.380090498
0.776726058
0.239
6.08E−07
1.00E−06
−5.218
237
181
209


TBL1XR1
0.461538462
0.720489978
1.373
5.06E−08
9.57E−08
−4.261
206
213
209.5


SMARCA4
0.65158371
0.55233853
0.202
6.78E−09
1.41E−08
−3.872
188
231
209.5


RELA
0.592760181
0.609131403
4.227
8.94E−09
1.81E−08
−3.943
193
228
210.5


TWISTNB
0.429864253
0.734966592
0.465
5.08E−07
8.47E−07
−5.026
234
188
211


KDM6A
0.407239819
0.747772829
0.227
1.60E−06
2.47E−06
−5.406
252
172
212


PHF5A
0.556561086
0.643095768
4.788
1.00E−08
2.01E−08
−3.898
194
230
212


YEATS4
0.466063348
0.706013363
0.895
3.01E−07
5.18E−07
−4.713
227
199
213


NONO
0.923076923
0.270044543
1.417
4.07E−12
1.36E−11
−2.279
117
310
213.5


GABPA
0.42081448
0.740534521
0.227
7.51E−07
1.21E−06
−5.096
242
186
214


SNW1
0.63800905
0.557906459
0.687
2.55E−08
4.89E−08
−4.059
203
225
214


PBX1P1
0.683257919
0.530066815
0.189
1.20E−09
2.79E−09
−3.361
167
261
214


CREB3
0.343891403
0.800111359
0.39
2.06E−06
3.12E−06
−5.382
257
175
216


RNF125
0.570135747
0.618596882
1.911
6.79E−08
1.26E−07
−4.176
211
221
216


ZBTB7A
0.50678733
0.661469933
0.163
9.46E−07
1.51E−06
−4.85
244
193
218.5


HES6
0.307692308
0.832405345
0.66
1.20E−06
1.91E−06
−4.822
245
194
219.5


SBDS
0.561085973
0.635300668
1.084
1.82E−08
3.55E−08
−3.682
200
244
222


HMGB1
0.977375566
0.175946548
3.997
4.24E−12
1.39E−11
−1.798
119
327
223


WHSC1
0.452488688
0.70935412
0.214
1.24E−06
1.96E−06
−4.706
247
200
223.5


BLOC1S1
0.452488688
0.714365256
0.941
5.60E−07
9.25E−07
−4.185
236
219
227.5


BAZ2A
0.416289593
0.737750557
0.151
2.24E−06
3.36E−06
−4.773
260
197
228.5


RNF19A
0.49321267
0.670935412
0.239
1.51E−06
2.36E−06
−4.511
250
207
228.5


PFDN5
0.932126697
0.2655902
4.42
5.83E−13
2.21E−12
−0.818
103
356
229.5


XAB2
0.475113122
0.694320713
0.696
5.32E−07
8.83E−07
−4.025
235
226
230.5


PQBP1
0.520361991
0.655345212
0.941
3.49E−07
5.95E−07
−3.726
229
237
233


LIMD1
0.488687783
0.673162584
0.176
2.02E−06
3.07E−06
−4.297
256
211
233.5


GTF2A2
0.466063348
0.703786192
0.669
4.33E−07
7.31E−07
−3.707
231
240
235.5


RBM38
0.7239819
0.485523385
3.483
1.47E−09
3.42E−09
−2.402
168
303
235.5


ILF3
0.714932127
0.481069042
0.411
1.50E−08
2.93E−08
−3.092
199
274
236.5


MAZ
0.447963801
0.70155902
0.401
7.33E−06
1.03E−05
−4.783
278
196
237


SMAD2
0.371040724
0.771158129
0.176
5.70E−06
8.14E−06
−4.524
273
205
239


CNBP
1
0.107461024
6.679
5.57E−11
1.56E−10
−1.214
139
341
240


PHF20L1
0.497737557
0.660356347
0.151
3.80E−06
5.55E−06
−4.183
267
220
243.5


GABPB1
0.407239819
0.742761693
0.848
3.55E−06
5.23E−06
−4.109
265
224
244.5


CDK7
0.65158371
0.505567929
0.014
6.19E−06
8.74E−06
−4.248
276
214
245


HNRNPD
0.547511312
0.625835189
0.138
6.25E−07
1.02E−06
−3.476
238
254
246


DNMT1
0.606334842
0.577951002
0.275
1.54E−07
2.79E−07
−2.951
216
277
246.5


BAZ1B
0.479638009
0.673719376
0.163
6.18E−06
8.74E−06
−4.166
275
223
249


DTX3L
0.438914027
0.721603563
3.113
1.22E−06
1.93E−06
−3.521
246
252
249


NACA
0.950226244
0.185412027
9.038
1.25E−08
2.49E−08
−2.421
196
302
249


KDM5A
0.642533937
0.525055679
0.239
1.62E−06
2.50E−06
−3.528
253
251
252


ATF4
0.864253394
0.334632517
1.239
1.19E−10
3.17E−10
−0.639
146
360
253


ATF2
0.42081448
0.71714922
0.263
2.57E−05
3.43E−05
−4.226
292
217
254.5


UHRF1
0.375565611
0.770044543
0.379
3.65E−06
5.35E−06
−3.683
266
243
254.5


CIZ1
0.43438914
0.718819599
0.239
3.50E−06
5.17E−06
−3.681
264
245
254.5


THRAP3
0.71040724
0.473273942
2.441
1.04E−07
1.89E−07
−2.599
214
295
254.5


UIMC1
0.447963801
0.704899777
0.516
4.52E−06
6.50E−06
−3.709
271
239
255


EED
0.375565611
0.763919822
0.585
9.63E−06
1.32E−05
−3.997
284
227
255.5


TRIM27
0.325791855
0.816815145
2.077
1.47E−06
2.30E−06
−3.29
249
267
258


CCNL1
0.719457014
0.470489978
2.428
3.62E−08
6.93E−08
−2.261
204
312
258


MED8
0.366515837
0.771158129
0.774
1.05E−05
1.43E−05
−3.77
286
234
260


RNF44
0.656108597
0.510579065
0.227
1.74E−06
2.67E−06
−3.238
254
270
262


RNF5
0.398190045
0.744988864
1.091
8.53E−06
1.19E−05
−3.661
280
246
263


CHURC1
0.43438914
0.717706013
6.17
4.14E−06
6.00E−06
−3.374
269
259
264


MED12
0.547511312
0.614142539
0.124
3.31E−06
4.91E−06
−3.326
263
265
264


PWP1
0.375565611
0.757238307
0.251
2.61E−05
3.47E−05
−3.737
293
236
264.5


MAF1
0.800904977
0.393652561
2.403
3.34E−09
7.36E−09
−0.873
177
352
264.5


EOMES
0.352941176
0.773942094
0.176
4.02E−05
5.24E−05
−3.778
299
232
265.5


PREB
0.520361991
0.638084633
0.214
4.36E−06
6.30E−06
−3.347
270
262
266


MED1
0.57918552
0.566258352
0.227
2.91E−05
3.86E−05
−3.711
295
238
266.5


MYSM1
0.457013575
0.69766147
2.856
3.92E−06
5.70E−06
−3.307
268
266
267


TBP
0.34841629
0.783407572
0.642
1.74E−05
2.35E−05
−3.606
289
247
268


TGIF1
0.633484163
0.533407572
0.585
1.83E−06
2.79E−06
−2.887
255
282
268.5


TRIP12
0.56561086
0.58908686
0.084
8.95E−06
1.24E−05
−3.337
282
264
273


MMS19
0.34841629
0.777839644
0.356
4.03E−05
5.24E−05
−3.557
300
248
274


BUD31
0.588235294
0.577394209
1.449
2.20E−06
3.32E−06
−2.816
259
290
274.5


MLLT6
0.511312217
0.64142539
0.098
8.87E−06
1.23E−05
−3.208
281
271
276


SMYD3
0.398190045
0.736080178
0.163
3.09E−05
4.07E−05
−3.385
296
258
277


SREBF1
0.447963801
0.687639198
0.227
4.82E−05
6.24E−05
−3.42
301
256
278.5


BATF
0.601809955
0.571269488
1.098
7.94E−07
1.27E−06
−2.155
243
315
279


IKZF1
0.828054299
0.334632517
0.39
2.13E−07
3.77E−07
−1.485
220
338
279


SCAND1
0.334841629
0.786191537
1.036
6.54E−05
8.31E−05
−3.512
307
253
280


CTNNB1
0.321266968
0.795100223
0.422
9.70E−05
0.000120418
−3.534
313
249
281


MKL1
0.561085973
0.59688196
0.176
5.76E−06
8.19E−06
−2.851
274
288
281


HMGB2
0.868778281
0.30623608
1.967
5.95E−09
1.26E−08
0
184
379
281.5


PER1
0.669683258
0.497772829
0.824
1.46E−06
2.29E−06
−2.116
248
316
282


AATF
0.343891403
0.777839644
0.687
7.07E−05
8.96E−05
−3.391
308
257
282.5


TCF20
0.674208145
0.482739421
0.084
5.65E−06
8.10E−06
−2.571
272
298
285


E4F1
0.330316742
0.793429844
0.496
3.98E−05
5.21E−05
−3.193
298
273
285.5


ING3
0.42081448
0.70935412
0.322
7.21E−05
9.10E−05
−3.347
309
263
286


CXXC1
0.479638009
0.669265033
0.506
1.14E−05
1.55E−05
−2.876
288
285
286.5


CNOT2
0.529411765
0.623051225
0.651
1.03E−05
1.41E−05
−2.822
285
289
287


PNN
0.574660633
0.56403118
0.287
6.43E−05
8.20E−05
−3.239
306
269
287.5


GTF2A1
0.407239819
0.71714922
0.176
0.000127921
0.000156392
−3.367
319
260
289.5


REXO4
0.371040724
0.756124722
0.614
5.36E−05
6.90E−05
−2.964
303
276
289.5


SF1
0.800904977
0.349665924
0.422
2.43E−06
3.62E−06
−2.069
262
317
289.5


MLX1P
0.529411765
0.600222717
0.202
0.00016273
0.000195276
−3.441
325
255
290


ATRX
0.547511312
0.59298441
0.07
4.94E−05
6.38E−05
−2.951
302
278
290


ABT1
0.791855204
0.340757238
0.111
2.92E−05
3.86E−05
−2.876
294
286
290


CDCA4
0.466063348
0.673162584
0.422
3.60E−05
4.73E−05
−2.718
297
291
294


SP3
0.411764706
0.71325167
0.263
0.000124537
0.000152734
−3.195
318
272
295


MTF2
0.34841629
0.770601336
0.322
0.000111835
0.000138024
−3.09
316
275
295.5


STAT6
0.633484163
0.519487751
0.163
1.11E−05
1.51E−05
−2.377
287
304
295.5


PNRC1
0.597285068
0.559576837
0.526
7.11E−06
1.00E−05
−2.217
277
314
295.5


ING4
0.452488688
0.682071269
0.731
5.74E−05
7.37E−05
−2.615
304
294
299


RORA
0.366515837
0.755011136
0.74
0.000107207
0.000132733
−2.881
315
284
299.5


TRIM28
0.375565611
0.747772829
0.31
9.67E−05
0.000120418
−2.853
314
287
300.5


SP110
0.814479638
0.342427617
0.214
7.28E−07
1.18E−06
−0.59
240
361
300.5


NFYC
0.461538462
0.673162584
0.433
6.12E−05
7.83E−05
−2.468
305
301
303


RNF114
0.701357466
0.461024499
0.299
2.27E−06
3.39E−06
−1.109
261
345
303


PNRC2
0.520361991
0.614142539
0.496
9.06E−05
0.000113305
−2.581
312
296
304


IFI35
0.529411765
0.624721604
0.766
8.28E−06
1.16E−05
−1.773
279
329
304


CIR1
0.380090498
0.736080178
0.322
0.000255489
0.000301029
−2.932
331
279
305


CAMTA2
0.36199095
0.75389755
0.239
0.000208539
0.000247959
−2.887
328
283
305.5


NCOA4
0.43438914
0.690423163
0.275
0.000160385
0.000193056
−2.656
324
293
308.5


JARID2
0.380090498
0.729955457
0.138
0.000527461
0.000606814
−2.894
339
281
310


MILL5
0.742081448
0.388084633
0.163
7.78E−05
9.75E−05
−2.262
311
311
311


HSF1
0.425339367
0.70155902
0.536
0.000114139
0.000140423
−2.376
317
306
311.5


DNM2
0.701357466
0.444877506
0.275
1.76E−05
2.36E−05
−1.605
290
335
312.5


RPL7
0.914027149
0.208797327
11.198
2.12E−06
3.20E−06
−0.328
258
369
313.5


PMF1
0.402714932
0.718262806
1.007
0.000185052
0.000220704
−2.377
327
305
316


PLRG1
0.321266968
0.784521158
0.848
0.000406362
0.00047167
−2.58
336
297
316.5


CEBPZ
0.389140271
0.723830735
0.401
0.00041407
0.000479191
−2.495
337
299
318


TLE3
0.511312217
0.615256125
0.176
0.000214079
0.000253771
−2.313
329
309
319


BRD8
0.610859729
0.511135857
0.07
0.000385821
0.000449165
−2.37
335
307
321


PTMA
0.995475113
0.066258352
7.631
9.55E−06
1.32E−05
−0.646
283
359
321


MED30
0.497737557
0.632516704
1.064
0.000133075
0.00016168
−1.936
321
323
322


PHF6
0.398190045
0.70545657
0.214
0.001251094
0.001402088
−2.481
348
300
324


ZNRD1
0.398190045
0.722717149
1.406
0.000177338
0.000212153
−1.93
326
324
325


TAF1B
0.36199095
0.741648107
0.632
0.000898856
0.001016098
−2.338
345
308
326.5


SMAD7
0.479638009
0.643095768
0.239
0.000281171
0.000330291
−1.953
332
322
327


ILF2
0.447963801
0.668708241
0.299
0.000450103
0.000519349
−2.044
338
319
328.5


MYC
0.429864253
0.683184855
0.333
0.000587319
0.000671713
−2.057
341
318
329.5


SPOP
0.552036199
0.578507795
0.379
0.000155797
0.000188114
−1.526
323
336
329.5


CREBBP
0.479638009
0.642538976
0.163
0.000298881
0.000348993
−1.873
334
326
330


STAT1
0.837104072
0.290089087
0.124
2.21E−05
2.96E−05
−0.239
291
371
331


NFX1
0.470588235
0.631959911
0.176
0.00208768
0.002299986
−2.258
354
313
333.5


NCOR1
0.746606335
0.378619154
0.832
0.000128543
0.000156662
−1.081
320
347
333.5


NFYB
0.425339367
0.683184855
0.485
0.000918983
0.001035848
−1.885
346
325
335.5


THOC2
0.43438914
0.665924276
0.138
0.002194353
0.002410698
−1.958
355
321
338


VAV1
0.805429864
0.29844098
0.07
0.000602489
0.000687049
−1.636
342
334
338


MBNL1
0.904977376
0.195991091
5.051
7.55E−05
9.50E−05
−0.362
310
366
338


GABPB2
0.57918552
0.517260579
0.029
0.004168801
0.004554152
−1.987
357
320
338.5


GTF3C2
0.470588235
0.634187082
0.239
0.00169389
0.0018821
−1.775
351
328
339.5


GON4L
0.42081448
0.684298441
0.057
0.001265881
0.001412898
−1.767
349
330
339.5


NCOA2
0.447963801
0.661469933
0.084
0.000963269
0.001082637
−1.68
347
333
340


HDAC7
0.628959276
0.484966592
0.263
0.000823589
0.00093372
−1.495
344
337
340.5


MIER1
0.615384615
0.5
0.287
0.000736355
0.000837254
−1.158
343
343
343


STAT5B
0.502262443
0.599665924
0.07
0.002431195
0.002663388
−1.733
356
331
343.5


RNF7
0.678733032
0.44376392
1.05
0.000292294
0.000342326
−0.696
333
357
345


PML
0.36199095
0.726057906
0.163
0.004461983
0.00484728
−1.725
359
332
345.5


HBP1
0.479638009
0.615256125
0.227
0.004253197
0.004633371
−1.436
358
339
348.5


RPL6
0.995475113
0.052895323
10.212
0.000146579
0.000177533
−0.022
322
377
349.5


NFKB1
0.411764706
0.674832962
0.151
0.006755416
0.007257885
−1.283
363
340
351.5


ELF1
0.787330317
0.319599109
0.872
0.000559335
0.00064159
−0.41
340
365
352.5


IRF3
0.43438914
0.650890869
0.696
0.008241059
0.008829706
−1.122
364
344
354


MXD1
0.479638009
0.610801782
0.189
0.006176842
0.006673043
−1.081
361
348
354.5


JUNB
0.963800905
0.106347439
8.485
0.000222969
0.000263509
0
330
380
355


SMARCA2
0.366515837
0.704899777
0.275
0.01882142
0.019785321
−1.209
371
342
356.5


GATA3
0.511312217
0.582962138
0.379
0.004795813
0.005195464
−0.857
360
354
357


NRF1
0.380090498
0.696547884
0.322
0.0132967
0.014053423
−1.107
369
346
357.5


DAXX
0.375565611
0.704342984
0.546
0.00996954
0.01062328
−1.033
366
349
357.5


CCNL2
0.687782805
0.415367483
0.496
0.001805368
0.001994599
−0.426
353
364
358.5


NCOA3
0.619909502
0.461581292
0.176
0.012714663
0.013474779
−1.007
368
350
359


NFATC3
0.511312217
0.574053452
0.111
0.009750684
0.010418539
−0.868
365
353
359


NSD1
0.597285068
0.493318486
0.084
0.006586043
0.007095461
−0.678
362
358
360


KLF6
0.800904977
0.296213808
0.333
0.001267985
0.001412898
−0.093
350
375
362.5


PIAS1
0.343891403
0.716035635
0.322
0.039478525
0.041057666
−0.945
375
351
363


ELF4
0.366515837
0.697104677
0.251
0.033149145
0.034659964
−0.857
373
355
364


KLF13
0.923076923
0.147550111
0.903
0.001765897
0.001956534
0
352
381
366.5


ARID1A
0.696832579
0.375835189
0.124
0.019757865
0.02071389
−0.589
372
362
367


NR4A1
0.665158371
0.413697105
0.275
0.014002034
0.014758901
−0.341
370
368
369


JUN
0.371040724
0.668708241
0.367
0.134780588
0.138327445
−0.494
380
363
371.5


BTG2
0.71040724
0.370824053
0.214
0.010060004
0.010690468
−0.02
367
378
372.5


ATF71P
0.552036199
0.508351893
0.275
0.052223876
0.054024699
−0.239
377
372
374.5


MAML2
0.416289593
0.617483296
0.098
0.183886522
0.187247372
−0.343
383
367
375


RNF138
0.610859729
0.454342984
0.401
0.038087039
0.039716431
−0.06
374
376
375


LDB1
0.398190045
0.634743875
0.516
0.18815501
0.191094932
−0.24
384
370
377


MTA3
0.429864253
0.60467706
0.444
0.179506517
0.183265816
−0.22
382
373
377.5


NOTCH1
0.524886878
0.512249443
0.07
0.165838562
0.169756009
−0.165
381
374
377.5


SP100
0.814479638
0.238864143
0.475
0.043616752
0.04524078
0
376
382
379


GTF2I
0.751131222
0.299554566
0.263
0.067640587
0.069787907
0
378
383
380.5


WHSC1L1
0.746606335
0.298997773
0.111
0.091267343
0.093916263
0
379
384
381.5


ARID5A
0.57918552
0.437082405
0.111
0.349663504
0.354204588
0
385
385
385


ZFP36L1
0.574660633
0.415367483
0.333
0.640096896
0.646730025
0
386
386
386


IRF1
0.56561086
0.415367483
0.642
0.730789354
0.736454387
0
387
387
387


PYHIN1
0.357466063
0.595211581
0.333
0.924007685
0.928770611
0
388
388
388


ZFP36L2
0.628959276
0.323496659
0.189
0.931648747
0.934043731
0
389
389
389


FOS
0.461538462
0.423162584
0.31
0.999549404
0.999549404
0
390
390
390
















TABLE 3





Ranked top surface cytokines differentially expressed in cluster 7




















Gene
TP
TN
thresh_mhg







TNFRSF9
0.873303167
0.744988864
8.537



CCRL2
0.7239819
0.791759465
1.05



HAVCR2
0.873303167
0.683184855
2.154



CSF1
0.556561086
0.885300668
0.911



ADAM8
0.787330317
0.726057906
0.864



ITGAV
0.85520362
0.657572383
0.084



SERPINE2
0.542986425
0.873051225
3.895



TNFRSF4
0.773755656
0.723830735
3.144



LAG3
0.963800905
0.513919822
4.793



GPR56
0.647058824
0.806792873
0.696



PGLYRP1
0.923076923
0.53674833
4.954



CXCR6
0.986425339
0.423719376
5.506



KIT
0.466063348
0.88752784
0.516



CCL3
0.642533937
0.773942094
3.904



NR4A2
0.914027149
0.498886414
0.595



IL1R2
0.407239819
0.915924276
3.396



CD244
0.466063348
0.883073497
3.545



NRP1
0.751131222
0.670935412
0.163



LGALS1
0.923076923
0.508351893
10.112



CX3CR1
0.511312217
0.843541203
1.646



GPR65
0.760180995
0.652561247
2.585



ENTPD1
0.701357466
0.692093541
0.202



TIGIT
0.981900452
0.354120267
4.895



PDCD1
0.968325792
0.415367483
5.101



CLIC4
0.619909502
0.758351893
1.202



TFF1
0.384615385
0.904788419
5.97



CCR8
0.443438914
0.874164811
5.401



KLRC1
0.841628959
0.546213808
4.198



LILRB4
0.597285068
0.767817372
5.621



IL2RB
0.561085973
0.782293987
9.964



KLRC2
0.778280543
0.597995546
1.766



IL10RA
0.837104072
0.538975501
0.214



IL18RAP
0.733031674
0.626948775
0.66



TNFRSF18
0.895927602
0.459910913
4.331



CMTM7
0.864253394
0.512249443
1.761



PTGER2
0.479638009
0.824053452
0.227



IL12RB2
0.447963801
0.841870824
0.66



NCOR2
0.656108597
0.688752784
0.239



CALR
0.923076923
0.384187082
2.844



TMEM123
0.891402715
0.464922049
4.878



CD200
0.34841629
0.901447661
1.614



GABARAPL1
0.597285068
0.733853007
0.595



TNFSF4
0.384615385
0.878619154
3.874



SEPT2
0.954751131
0.329064588
0.536



SIVA1
0.57918552
0.739977728
2.032



CTSB
0.959276018
0.346325167
2.31



LAP3
0.429864253
0.845211581
0.88



CTLA4
0.936651584
0.413140312
2.685



BSG
0.932126697
0.357461024
0.575



XPOT
0.466063348
0.815701559
0.411



CD200R1
0.384615385
0.865812918
1.454



MIF
0.941176471
0.315701559
6.412



RAC1
0.918552036
0.378619154
0.872



PDIA4
0.63800905
0.678173719
2.128



ATPIF1
0.678733032
0.652561247
4.168



IL21R
0.841628959
0.457126949
0.993



HSP90AB1
0.936651584
0.31013363
9.834



TRPV2
0.701357466
0.615256125
4.401



LAMP2
0.547511312
0.744432071
1.982



ECE1
0.429864253
0.832962138
2.101



P4HB
0.936651584
0.365256125
0.526



HSPD1
0.846153846
0.458797327
1.585



PDIA3
0.945701357
0.331291759
6.479



KLRK1
0.760180995
0.561247216
1.064



ADAM17
0.533936652
0.752783964
2.124



GPI1
0.954751131
0.295657016
7.555



CD82
0.941176471
0.334632517
6.884



CTSD
0.850678733
0.418151448
9.5



KLRE1
0.330316742
0.886414254
2.583



TFRC
0.49321267
0.771158129
2.356



CCL4
0.619909502
0.667594655
5.124



M6PR
0.891402715
0.40701559
4.087



IRAK2
0.542986425
0.737750557
1.536



KLRD1
0.78280543
0.493318486
5.966



IL2RA
0.303167421
0.902561247
2.452



AIMP1
0.683257919
0.618040089
0.986



CD44
0.805429864
0.478285078
0.367



HSPA9
0.701357466
0.594097996
3.417



CD8A
0.945701357
0.273942094
8.096



ERP44
0.787330317
0.513363029
1.899



ITGB3
0.389140271
0.83518931
0.124



TMX3
0.502262443
0.75
0.227



USP14
0.497737557
0.761135857
2.091



CD27
0.904977376
0.378062361
4.648



C1QBP
0.696832579
0.599665924
4.777



FERMT3
0.932126697
0.316258352
3.5



PEBP1
0.85520362
0.430957684
1.637



GPR160
0.357466063
0.85467706
0.766



IL18R1
0.597285068
0.668708241
3.714



ANXA5
0.696832579
0.58518931
3.898



IDE
0.737556561
0.537861915
0.651



LYST
0.624434389
0.645879733
0.669



CD2BP2
0.678733032
0.606347439
0.345



SCARB2
0.371040724
0.84298441
3.57



LY75
0.457013575
0.777282851
0.39



IFNG
0.488687783
0.750556793
4.6



SEMA4D
0.895927602
0.341314031
2.077



ITGB2
0.959276018
0.253340757
6.382



FLOT2
0.443438914
0.786191537
0.888



CD96
0.692307692
0.579621381
3.331



GRN
0.339366516
0.855790646
3.126



H13
0.909502262
0.353563474
4.489



ATP5B
0.990950226
0.146993318
6.427



PDLIM2
0.520361991
0.724944321
4.378



HNRNPU
0.787330317
0.485523385
0.516



NCKAP1L
0.764705882
0.492761693
0.856



PGRMC1
0.561085973
0.679844098
0.614



CD226
0.687782805
0.572383073
1.05



LY6A
0.683257919
0.596325167
5.895



GDI2
0.959276018
0.263919822
4.652



SMPD1
0.398190045
0.810690423
5.212



AAMP
0.760180995
0.513363029
3.018



CD9
0.43438914
0.782293987
5.837



TNIP1
0.619909502
0.620824053
1.111



ADAM10
0.737556561
0.492761693
0.214



CD38
0.429864253
0.777282851
2.091



CD74
0.312217195
0.86247216
1.683



FASL
0.656108597
0.60467706
3.863



PSTPIP1
0.778280543
0.479398664
3.501



CD3E
0.900452489
0.287861915
6.221



F2R
0.520361991
0.706013363
2.744



ATP6AP2
0.547511312
0.685412027
2.956



LSM1
0.497737557
0.723273942
0.546



TLN1
0.936651584
0.287861915
0.926



PTPRCAP
0.968325792
0.2422049
7.465



ERP29
0.592760181
0.630846325
0.956



CAP1
0.737556561
0.479398664
0.824



CCR5
0.511312217
0.703229399
3.733



CR1L
0.606334842
0.618596882
2.31



CCL5
0.977375566
0.233853007
3.234



H2-M3
0.529411765
0.688752784
2.766



IL27RA
0.665158371
0.56013363
1.227



SLC3A2
0.846153846
0.368596882
4.681



CD48
0.864253394
0.364699332
5.154



CAST
0.705882353
0.518930958
1.064



TNFSF10
0.343891403
0.814587973
2.926



EZR
0.895927602
0.319599109
0.632



NOTCH2
0.714932127
0.500556793
0.202



ITGAL
0.932126697
0.28285078
4.104



THY1
0.868778281
0.33518931
5.619



CLPTM1
0.402714932
0.770044543
0.299



IGF2R
0.552036199
0.629732739
0.111



CD160
0.488687783
0.714365256
0.895



CD47
0.959276018
0.216035635
6.062



LRPAP1
0.443438914
0.7344098
0.333



CD164
0.918552036
0.282293987
4.285



HMGB1
0.977375566
0.175946548
3.997



CD55
0.325791855
0.821269488
0.299



TRAF3
0.371040724
0.785634744
0.163



CMTM6
0.574660633
0.634187082
3.522



CD3G
0.981900452
0.140311804
8.624



CD6
0.823529412
0.380289532
2.757



ITGA4
0.886877828
0.299554566
0.556



NR3C1
0.452488688
0.719933185
0.138



SBDS
0.561085973
0.635300668
1.084



TGFBR2
0.769230769
0.438752784
1.646



RPS6KB1
0.502262443
0.670378619
0.251



IL12RB1
0.321266968
0.820155902
0.465



RALA
0.371040724
0.781737194
0.401



TSPAN32
0.321266968
0.816258352
0.824



SPN
0.737556561
0.444320713
0.251



HSPA5
0.923076923
0.222160356
6.319



HSP90AA1
0.823529412
0.378062361
2.521



CD52
0.941176471
0.224387528
9.42



CD5
0.57918552
0.59688196
5.913



ROCK1
0.56561086
0.601336303
0.111



PEAR1
0.371040724
0.7655902
0.287



CD37
0.882352941
0.304008909
3.294



IL2RG
1
0.08908686
5.342



LTB
0.891402715
0.283964365
6.104



BST2
0.502262443
0.663697105
1.345



ICAM1
0.515837104
0.631403118
0.251



STX4A
0.316742081
0.800111359
0.816



CD97
0.79638009
0.375278396
0.546



SLAMF1
0.325791855
0.79064588
0.444



IFNAR1
0.714932127
0.439309577
3.279



B4GALT1
0.882352941
0.253340757
2.046



CORO1A
0.914027149
0.214922049
10.504



GPR174
0.334841629
0.760022272
0.043



FLT3L
0.389140271
0.715478842
0.918



ICOS
0.678733032
0.449331849
0.39



SYNJ2BP
0.859728507
0.248886414
0.585



CCND2
0.737556561
0.405902004
0.111



B2M
0.995475113
0.061247216
11.727



PSEN1
0.452488688
0.647550111
0.401



CD53
0.950226244
0.162583519
5.988



NUP85
0.36199095
0.716035635
0.526



STK10
0.742081448
0.384187082
0.239



CD3D
0.986425339
0.087416481
6.219



HCST
0.819004525
0.301781737
2.546



MSN
0.950226244
0.140868597
5.287



PTPRC
0.986425339
0.081848552
3.674



ITGB1
0.656108597
0.44376392
0.239



HSPA8
0.986425339
0.063474388
10.474



CD8B1
0.968325792
0.106347439
8.815



MYO9B
0.389140271
0.652004454
0.163



CD28
0.687782805
0.39142539
0.251



LY6E
0.954751131
0.126391982
6.449



IL4RA
0.488687783
0.571269488
0.287



RPS19
0.963800905
0.110801782
9.967



NOTCH1
0.524886878
0.512249443
0.07



CNP
0.592760181
0.472717149
0.496



SELPLG
0.995475113
0.052895323
0.678



CD247
0.832579186
0.246659243
0.766



DPP4
0.325791855
0.691536748
0.239



PDE4B
0.475113122
0.546770601
0.31



CD84
0.520361991
0.513363029
0.275



CD2
0.868778281
0.188195991
0.888



IL16
0.457013575
0.549554566
0.287



IL17RA
0.411764706
0.572383073
0.163



CCR7
0.325791855
0.587416481
0.322



CD69
0.484162896
0.400334076
0.516
















Gene
hyper_pval
hyper_qval
gen_qval







TNFRSF9
2.48E−73
5.27E−71
−74.603



CCRL2
1.60E−52
1.13E−50
−57.892



HAVCR2
5.96E−59
6.31E−57
−51.383



CSF1
1.02E−47
3.08E−46
−56.864



ADAM8
7.94E−50
2.80E−48
−50.88



ITGAV
7.25E−50
2.80E−48
−50.642



SERPINE2
1.05E−41
1.85E−40
−51.019



TNFRSF4
8.39E−47
2.22E−45
−42.493



LAG3
7.81E−51
4.14E−49
−36.69



GPR56
2.30E−42
4.44E−41
−44.618



PGLYRP1
5.99E−44
1.27E−42
−43.178



CXCR6
4.32E−44
1.02E−42
−29.649



KIT
2.23E−33
2.63E−32
−40.679



CCL3
8.83E−35
1.17E−33
−35.708



NR4A2
2.62E−36
3.71E−35
−31.351



IL1R2
2.28E−32
2.42E−31
−40.178



CD244
3.23E−32
3.26E−31
−37.285



NRP1
1.76E−33
2.20E−32
−30.537



LGALS1
1.29E−39
2.10E−38
−25.523



CX3CR1
1.21E−29
1.07E−28
−28.883



GPR65
5.08E−32
4.89E−31
−26.057



ENTPD1
2.00E−29
1.63E−28
−26.13



TIGIT
4.50E−33
5.02E−32
−21.13



PDCD1
2.34E−37
3.54E−36
−19.612



CLIC4
9.73E−29
7.37E−28
−26.3



TFF1
6.36E−26
3.74E−25
−30.862



CCR8
7.64E−27
4.76E−26
−29.292



KLRC1
1.16E−29
1.07E−28
−22.062



LILRB4
2.81E−27
1.86E−26
−26.812



IL2RB
5.64E−25
3.23E−24
−27.616



KLRC2
5.80E−27
3.73E−26
−20.64



IL10RA
5.38E−28
3.93E−27
−18.023



IL18RAP
1.40E−24
7.83E−24
−21.027



TNFRSF18
1.10E−27
7.78E−27
−16.98



CMTM7
6.60E−29
5.18E−28
−16.368



PTGER2
7.64E−22
3.52E−21
−22.388



IL12RB2
4.40E−21
1.90E−20
−23.202



NCOR2
5.32E−23
2.68E−22
−19.689



CALR
1.64E−23
8.71E−23
−18.422



TMEM123
1.61E−27
1.10E−26
−16.349



CD200
2.61E−20
1.04E−19
−21.601



GABARAPL1
4.23E−22
1.99E−21
−17.973



TNFSF4
3.78E−20
1.49E−19
−20.871



SEPT2
2.54E−23
1.31E−22
−14.651



SIVA1
7.74E−21
3.17E−20
−16.842



CTSB
2.72E−26
1.65E−25
−13.17



LAP3
1.60E−19
5.67E−19
−18.741



CTLA4
1.42E−29
1.20E−28
−11.36



BSG
3.86E−22
1.90E−21
−13.243



XPOT
6.17E−19
2.04E−18
−17.777



CD200R1
8.03E−18
2.40E−17
−17.97



MIF
2.87E−19
9.80E−19
−15.258



RAC1
4.21E−22
1.99E−21
−11.928



PDIA4
1.54E−19
5.54E−19
−14.183



ATPIF1
4.49E−21
1.90E−20
−12.554



IL21R
3.88E−19
1.30E−18
−14.583



HSP90AB1
6.45E−18
1.98E−17
−16.101



TRPV2
2.71E−19
9.43E−19
−13.709



LAMP2
6.23E−18
1.94E−17
−14.908



ECE1
1.46E−17
4.12E−17
−16.498



P4HB
7.55E−24
4.10E−23
−10.729



HSPD1
6.50E−20
2.46E−19
−12.229



PDIA3
9.01E−22
4.07E−21
−11.136



KLRK1
4.09E−20
1.58E−19
−11.367



ADAM17
1.46E−17
4.12E−17
−13.689



GPI1
1.15E−19
4.21E−19
−11.256



CD82
2.75E−21
1.21E−20
−9.818



CTSD
2.16E−16
5.52E−16
−14.202



KLRE1
2.75E−15
6.28E−15
−16.362



TFRC
1.05E−15
2.49E−15
−13.761



CCL4
2.13E−16
5.52E−16
−12.771



M6PR
7.77E−21
3.17E−20
−8.074



IRAK2
1.23E−16
3.22E−16
−11.616



KLRD1
1.01E−15
2.46E−15
−11.474



IL2RA
3.98E−15
8.78E−15
−12.37



AIMP1
1.32E−17
3.84E−17
−9.402



CD44
8.14E−17
2.18E−16
−10.21



HSPA9
5.39E−17
1.47E−16
−10.168



CD8A
7.90E−16
1.95E−15
−10.988



ERP44
3.16E−18
1.01E−17
−7.981



ITGB3
1.12E−13
2.23E−13
−13.483



TMX3
4.34E−14
8.93E−14
−12.816



USP14
6.12E−15
1.34E−14
−11.619



CD27
7.70E−20
2.86E−19
−7.169



C1QBP
4.27E−17
1.17E−16
−9.227



FERMT3
8.68E−18
2.56E−17
−8.337



PEBP1
3.13E−18
1.01E−17
−7.934



GPR160
3.12E−13
6.06E−13
−12.901



IL18R1
2.59E−14
5.38E−14
−11.684



ANXA5
1.28E−15
3.01E−15
−10.769



IDE
3.54E−15
7.90E−15
−9.306



LYST
1.43E−14
3.03E−14
−10.055



CD2BP2
6.58E−16
1.64E−15
−7.848



SCARB2
5.76E−13
1.07E−12
−11.036



LY75
5.47E−13
1.04E−12
−10.847



IFNG
6.54E−13
1.21E−12
−10.788



SEMA4D
6.49E−15
1.40E−14
−8.169



ITGB2
3.58E−16
9.05E−16
−7.109



FLOT2
9.14E−13
1.67E−12
−10.567



CD96
1.25E−14
2.68E−14
−7.454



GRN
1.17E−11
2.01E−11
−9.864



H13
5.21E−18
1.65E−17
−4.232



ATP5B
2.81E−12
5.01E−12
−9.158



PDLIM2
4.78E−13
9.13E−13
−7.959



HNRNPU
1.77E−15
4.13E−15
−6.105



NCKAP1L
8.83E−14
1.78E−13
−7.248



PGRMC1
3.65E−12
6.44E−12
−8.598



CD226
1.60E−13
3.14E−13
−7.374



LY6A
2.33E−15
5.37E−15
−5.969



GDI2
3.10E−17
8.64E−17
−4.262



SMPD1
1.62E−11
2.73E−11
−8.841



AAMP
3.33E−15
7.52E−15
−5.882



CD9
1.53E−11
2.61E−11
−7.854



TNIP1
8.05E−12
1.40E−11
−7.367



ADAM10
3.07E−11
5.08E−11
−7.667



CD38
1.20E−10
1.91E−10
−7.94



CD74
4.65E−10
6.81E−10
−8.642



FASL
1.47E−13
2.92E−13
−5.108



PSTPIP1
5.82E−14
1.19E−13
−4.954



CD3E
7.79E−11
1.25E−10
−6.868



F2R
3.37E−11
5.53E−11
−6.63



ATP6AP2
1.54E−11
2.61E−11
−6.121



LSM1
5.85E−11
9.47E−11
−6.477



TLN1
1.03E−15
2.49E−15
−2.194



PTPRCAP
8.60E−17
2.28E−16
−1.214



ERP29
1.85E−10
2.84E−10
−6.503



CAP1
3.34E−10
4.95E−10
−6.815



CCR5
3.14E−10
4.73E−10
−6.744



CR1L
1.59E−10
2.48E−10
−6.103



CCL5
7.29E−18
2.21E−17
−0.345



H2-M3
2.22E−10
3.38E−10
−5.642



IL27RA
1.61E−10
2.49E−10
−5.354



SLC3A2
1.68E−11
2.80E−11
−4.424



CD48
3.94E−13
7.59E−13
−2.79



CAST
1.30E−10
2.05E−10
−4.945



TNFSF10
1.31E−07
1.75E−07
−6.763



EZR
5.78E−13
1.07E−12
−2.638



NOTCH2
6.02E−10
8.69E−10
−5.706



ITGAL
1.54E−14
3.22E−14
−1.055



THY1
3.62E−11
5.90E−11
−3.591



CLPTM1
6.24E−08
8.47E−08
−6.014



IGF2R
1.80E−07
2.37E−07
−6.348



CD160
1.83E−09
2.57E−09
−5.379



CD47
1.34E−12
2.42E−12
−2.122



LRPAP1
7.17E−08
9.68E−08
−5.951



CD164
1.38E−12
2.48E−12
−1.546



HMGB1
4.24E−12
7.43E−12
−1.798



CD55
6.28E−07
8.02E−07
−5.938



TRAF3
4.80E−07
6.21E−07
−5.108



CMTM6
2.37E−09
3.31E−09
−3.97



CD3G
1.13E−09
1.60E−09
−3.555



CD6
3.24E−10
4.84E−10
−3.009



ITGA4
2.88E−10
4.37E−10
−2.818



NR3C1
2.23E−07
2.92E−07
−4.898



SBDS
1.82E−08
2.50E−08
−3.682



TGFBR2
8.22E−10
1.18E−09
−3.089



RPS6KB1
4.61E−07
5.99E−07
−4.584



IL12RB1
1.56E−06
1.92E−06
−5.039



RALA
9.65E−07
1.21E−06
−4.625



TSPAN32
3.20E−06
3.89E−06
−4.996



SPN
9.53E−08
1.28E−07
−3.122



HSPA5
2.99E−08
4.08E−08
−2.998



HSP90AA1
4.77E−10
6.93E−10
−2.037



CD52
1.24E−10
1.97E−10
−0.714



CD5
4.91E−07
6.31E−07
−3.346



ROCK1
1.72E−06
2.11E−06
−4.092



PEAR1
1.36E−05
1.63E−05
−4.013



CD37
3.90E−10
5.74E−10
−0.561



IL2RG
3.79E−09
5.25E−09
−1.145



LTB
1.54E−09
2.17E−09
−0.806



BST2
1.27E−06
1.59E−06
−2.813



ICAM1
1.88E−05
2.22E−05
−3.054



STX4A
8.20E−05
9.40E−05
−3.347



CD97
1.36E−07
1.80E−07
−0.861



SLAMF1
0.00010523 
0.000119298
−3.3



IFNAR1
5.74E−06
6.95E−06
−1.854



B4GALT1
1.52E−06
1.88E−06
−0.618



CORO1A
8.20E−07
1.04E−06
−0.466



GPR174
0.001761276
0.001914823
−2.295



FLT3L
0.001048641
0.001145938
−2.047



ICOS
0.000162874
0.00017984 
−1.965



SYNJ2BP
0.000123973
0.000138328
−1.673



CCND2
1.72E−05
2.05E−05
−0.541



B2M
2.69E−05
3.13E−05
−0.551



PSEN1
0.00245109 
0.002651179
−1.501



CD53
7.25E−07
9.20E−07
0



NUP85
0.01085838 
0.011567721
−1.259



STK10
0.000120222
0.000134852
−0.54



CD3D
7.12E−06
8.57E−06
−0.029



HCST
7.28E−05
8.39E−05
−0.043



MSN
2.53E−05
2.97E−05
−0.028



PTPRC
2.06E−05
2.42E−05
0



ITGB1
0.002730536
0.002938445
−0.385



HSPA8
0.000590012
0.000648096
−0.119



CD8B1
7.23E−05
8.38E−05
0



MYO9B
0.128803746
0.133854873
−0.495



CD28
0.012756924
0.013522339
−0.298



LY6E
8.36E−05
9.53E−05
0



IL4RA
0.052465723
0.054791789
−0.253



RPS19
0.000111218
0.000125416
0



NOTCH1
0.165838562
0.171501343
−0.165



CNP
0.038042717
0.03992602 
−0.048



SELPLG
0.000146579
0.000162694
0



CD247
0.004665044
0.004994896
−0.019



DPP4
0.324787096
0.331033002
−0.171



PDE4B
0.292572864
0.299639841
−0.081



CD84
0.190762572
0.196318763
−0.038



CD2
0.021107237
0.022262359
0



IL16
0.454103629
0.460621863
0



IL17RA
0.698347268
0.704998194
0



CCR7
0.995063869
0.999581424
0



CD69
0.999581424
0.999581424
0














Gene
rank_hyper_qval
rank_gen_qval
mean_rank





TNFRSF9
1
1
1


CCRL2
3
2
2.5


HAVCR2
2
4
3


CSF1
7
3
5


ADAM8
5
6
5.5


ITGAV
6
7
6.5


SERPINE2
12
5
8.5


TNFRSF4
8
10
9


LAG3
4
14
9


GPR56
11
8
9.5


PGLYRP1
10
9
9.5


CXCR6
9
19
14


KIT
18
11
14.5


CCL3
16
15
15.5


NR4A2
15
16
15.5


IL1R2
20
12
16


CD244
21
13
17


NRP1
17
18
17.5


LGALS1
13
27
20


CX3CR1
23
21
22


GPR65
22
26
24


ENTPD1
26
25
25.5


TIGIT
19
32
25.5


PDCD1
14
37
25.5


CLIC4
28
24
26


TFF1
36
17
26.5


CCR8
34
20
27


KLRC1
24
30
27


LILRB4
32
23
27.5


IL2RB
37
22
29.5


KLRC2
33
35
34


IL10RA
29
40
34.5


IL18RAP
38
33
35.5


TNFRSF18
30
44
37


CMTM7
27
47
37


PTGER2
46
29
37.5


IL12RB2
50
28
39


NCOR2
42
36
39


CALR
40
39
39.5


TMEM123
31
49
40


CD200
53
31
42


GABARAPL1
44
41
42.5


TNFSF4
54
34
44


SEPT2
41
53
47


SIVA1
52
45
48.5


CTSB
35
62
48.5


LAP3
60
38
49


CTLA4
25
75
50


BSG
43
61
52


XPOT
64
43
53.5


CD200R1
71
42
56.5


MIF
62
51
56.5


RAC1
45
69
57


PDIA4
59
56
57.5


ATPIF1
49
66
57.5


IL21R
63
54
58.5


HSP90AB1
69
50
59.5


TRPV2
61
58
59.5


LAMP2
68
52
60


ECE1
75
46
60.5


P4HB
39
83
61


HSPD1
56
68
62


PDIA3
47
77
62


KLRK1
55
74
64.5


ADAM17
74
59
66.5


GPI1
58
76
67


CD82
48
89
68.5


CTSD
83
55
69


KLRE1
93
48
70.5


TFRC
88
57
72.5


CCL4
82
65
73.5


M6PR
51
99
75


IRAK2
81
72
76.5


KLRD1
87
73
80


IL2RA
96
67
81.5


AIMP1
73
90
81.5


CD44
79
85
82


HSPA9
78
86
82


CD8A
86
79
82.5


ERP44
65
100
82.5


ITGB3
106
60
83


TMX3
103
64
83.5


USP14
97
71
84


CD27
57
111
84


C1QBP
77
92
84.5


FERMT3
72
97
84.5


PEBP1
66
103
84.5


GPR160
109
63
86


IL18R1
102
70
86


ANXA5
90
82
86


IDE
95
91
93


LYST
100
87
93.5


CD2BP2
85
105
95


SCARB2
114
78
96


LY75
112
80
96


IFNG
115
81
98


SEMA4D
98
98
98


ITGB2
84
112
98


FLOT2
116
84
100


CD96
99
107
103


GRN
123
88
105.5


H13
67
144
105.5


ATP5B
119
93
106


PDLIM2
111
101
106


HNRNPU
91
122
106.5


NCKAP1L
105
110
107.5


PGRMC1
120
96
108


CD226
108
108
108


LY6A
92
125
108.5


GDI2
76
143
109.5


SMPD1
126
94
110


AAMP
94
128
111


CD9
124
104
114


TNIP1
122
109
115.5


ADAM10
128
106
117


CD38
133
102
117.5


CD74
145
95
120


FASL
107
133
120


PSTPIP1
104
137
120.5


CD3E
132
113
122.5


F2R
129
117
123


ATP6AP2
125
121
123


LSM1
131
119
125


TLN1
89
164
126.5


PTPRCAP
80
175
127.5


ERP29
138
118
128


CAP1
143
114
128.5


CCR5
141
116
128.5


CR1L
136
123
129.5


CCL5
70
189
129.5


H2-M3
139
130
134.5


IL27RA
137
132
134.5


SLC3A2
127
142
134.5


CD48
110
161
135.5


CAST
135
138
136.5


TNFSF10
159
115
137


EZR
113
162
137.5


NOTCH2
147
129
138


ITGAL
101
177
139


THY1
130
149
139.5


CLPTM1
156
124
140


IGF2R
161
120
140.5


CD160
151
131
141


CD47
117
165
141


LRPAP1
157
126
141.5


CD164
118
172
145


HMGB1
121
170
145.5


CD55
166
127
146.5


TRAF3
164
134
149


CMTM6
152
147
149.5


CD3G
149
150
149.5


CD6
142
157
149.5


ITGA4
140
159
149.5


NR3C1
162
139
150.5


SBDS
154
148
151


TGFBR2
148
155
151.5


RPS6KB1
163
141
152


IL12RB1
172
135
153.5


RALA
169
140
154.5


TSPAN32
174
136
155


SPN
158
154
156


HSPA5
155
158
156.5


HSP90AA1
146
167
156.5


CD52
134
180
157


CD5
165
152
158.5


ROCK1
173
145
159


PEAR1
177
146
161.5


CD37
144
182
163


IL2RG
153
176
164.5


LTB
150
179
164.5


BST2
170
160
165


ICAM1
179
156
167.5


STX4A
185
151
168


CD97
160
178
169


SLAMF1
187
153
170


IFNAR1
175
169
172


B4GALT1
171
181
176


CORO1A
168
187
177.5


GPR174
195
163
179


FLT3L
194
166
180


ICOS
192
168
180


SYNJ2BP
190
171
180.5


CCND2
178
184
181


B2M
182
183
182.5


PSEN1
196
173
184.5


CD53
167
202
184.5


NUP85
199
174
186.5


STK10
189
185
187


CD3D
176
199
187.5


HCST
184
197
190.5


MSN
181
200
190.5


PTPRC
180
203
191.5


ITGB1
197
188
192.5


HSPA8
193
194
193.5


CD8B1
183
204
193.5


MYO9B
204
186
195


CD28
200
190
195


LY6E
186
205
195.5


IL4RA
203
191
197


RPS19
188
206
197


NOTCH1
205
193
199


CNP
202
196
199


SELPLG
191
207
199


CD247
198
201
199.5


DPP4
208
192
200


PDE4B
207
195
201


CD84
206
198
202


CD2
201
208
204.5


IL16
209
209
209


IL17RA
210
210
210


CCR7
211
211
211


CD69
212
212
212
















TABLE 4





Ranked top 100 differentially expressed genes in cluster 7 as compared to all 15 CD8 T cell clusters





















adj.pval.cluster 1
adj.pval.cluster 2
adj.pval.cluster 3
adj.pval.cluster 4
adj.pval.cluster 5





GLDC
0
0
0
0
0


TNFRSF9
0
0
0
0
0


PRF1
0
0
0
0
0


IRF8
0
0
0
0
0


CCRL2
0
0
0
0
0


LAT2
0
0
0
0
0


PCYT1A
0
0
0
0
0


CSF1
0
0
0
0
0


MYO10
0
0
0
0
0


TMPRSS6
0
0
0
0
0


2900026A02RIK
0
0
0
0
0


HAVCR2
0
0
0
0
0


C1QTNF6
0
0
0
0
0


SERPINE2
0
0
0
0
0


ADAM8
0
0
0
0
0


ITGAV
0
0
0
0
0


ADAMTS14
0
0
0
0
0


RGS8
0
0
0
0
0


GPR56
0
0
0
0
0


AA467197
0
0
0
0
0


SLC37A2
0
0
0
0
0


PGLYRP1
0
0
0
0
0


ANXA2
0
0
0
0
0


TNFRSF4
0
0
0
0
0


RBPJ
0
0
0
0
0


LITAF
0
0
0
0
0


HILPDA
0
0
0
−2.032
0


MNDA
0
0
0
0
0


KIT
0
0
0
0
0


GPD2
0
0
0
0
0


IL1R2
0
0
0
0
0


RGS16
0
0
0
0
0


PLEK
0
0
0
0
0


DSCAM
0
0
0
0
0


EPAS1
0
0
0
0
0


NABP1
0
0
0
0
0


SLC16A11
0
0
0
0
0


GZMF
0
0
0
0
0


IKZF2
0
0
0
0
0


CD244
0
0
0
0
0


GZMC
0
0
0
0
0


CDK6
0
0
0
0
0


SERPINB9
0
0
0
0
0


GEM
0
0
0
0
0


LAG3
0
0
0
0
0


SLC2A3
0
0
0
0
0


UBASH3B
0
0
0
0
0


NRGN
0
0
0
0
0


CCL3
0
0
0
0
0


GAPDH
0
0
0
0
0


PLAC8
0
0
0
0
0


FOXRED2
0
0
0
0
0


GZMB
0
0
0
0
0


FILIP1
0
0
0
0
0


RGS2
0
0
0
−0.462
0


EXPH5
0
0
0
0
0


SRGAP3
0
0
0
0
0


GM5177
0
0
0
0
0


MT1
0
0
0
0
0


TPI1
0
0
0
0
0


ACOT7
0
0
0
0
0


BHLHE40
0
0
0
0
0


CCNG1
0
0
0
0
0


FAM110A
0
0
0
0
0


S100A11
0
0
0
0
0


DUSP4
0
0
0
0
0


CAPG
0
0
0
0
0


FAM3C
0
0
0
0
0


NR4A2
0
0
0
0
0


TFF1
0
0
0
0
0


IMPA2
0
0
0
0
0


NRP1
0
0
0
0
0


CST7
0
0
0
0
0


PLXND1
0
0
0
0
0


PKM
0
0
0
0
0


STAT3
0
0
0
0
0


CXCR6
0
0
0
−0.53
0


GDPD5
0
0
0
0
0


CCR8
0
0
0
0
0


SMIM3
0
0
0
0
0


ARL14EP
0
0
0
0
0


ERGIC1
0
0
0
0
0


ID2
0
0
0
0
0


EHD1
0
0
0
0
0


CX3CR1
0
0
0
0
0


CASP3
0
0
0
0
0


NRN1
0
0
0
0
0


PEX16
0
0
0
0
0


HNRNPA1
0
0
0
0
0


FDX1
0
0
0
0
0


OSBPL3
0
0
0
0
0


GZME
0
0
0
0
0


CIAPIN1
0
0
0
0
0


SAMSN1
0
0
0
0
0


ALDOA
0
0
0
0
0


TUBB6
0
0
0
0
0


IL2RB
0
0
0
0
0


GZMD
0
0
0
0
0


UHRF2
0
0
0
0
0






adj.pval.cluster 6
adj.pval.cluster 7
adj.pval.cluster 8
adj.pval.cluster 9
adj.pval.cluster 10





GLDC
0
−85.741
−0.349
−8.74
−5.177


TNFRSF9
0
−74.603
−5.868
−9.018
−6.921


PRF1
−1.31
−70.08
0
−8.936
−6.49


IRF8
0
−60.58
−8.638
−11.165
−5.51


CCRL2
0
−57.892
−0.224
−14.77
−4.152


LAT2
0
−57.892
−0.572
−10.136
−2.469


PCYT1A
0
−57.84
−0.662
−11.248
−2.084


CSF1
0
−56.864
−5.975
−6.433
−2.708


MYO10
0
−54.348
−1.064
−1.523
−1.588


TMPRSS6
0
−53.736
0
−3.033
−4.619


2900026A02RIK
−0.094
−53.112
−0.364
−7.417
−8.528


HAVCR2
−5.269
−51.383
0
−16.568
−11.845


C1QTNF6
0
−51.184
−0.046
−5.849
−8.317


SERPINE2
0
−51.019
−1.886
−10.282
−1.85


ADAM8
−0.568
−50.88
0
−13.585
−3.536


ITGAV
0
−50.642
−5.945
−8.788
−6.619


ADAMTS14
0
−49.686
−0.611
−5.926
−9.936


RGS8
0
−47.201
−0.241
−5.434
−8.407


GPR56
−11.808
−44.618
0
−7.91
−8.024


AA467197
0
−43.648
−0.129
−3.303
−2.284


SLC37A2
0
−43.31
−0.479
−1.185
−5.219


PGLYRP1
0
−43.178
−5.804
−13.116
−7.444


ANXA2
−4.007
−43.087
−7.039
−17.945
−7.336


TNFRSF4
0
−42.493
−37.64
−6.475
−1.862


RBPJ
0
−42.315
−9.234
−5.337
−3.259


LITAF
−3.899
−41.548
−3.66
−13.046
−7.349


HILPDA
0
−41.529
−0.028
−4.275
−6.589


MNDA
0
−40.982
−10.506
−8.14
−0.138


KIT
0
−40.679
−22.015
−0.725
−3.094


GPD2
0
−40.178
−0.519
−13.917
−7.064


IL1R2
0
−40.178
−9.473
−0.35
0


RGS16
−9.843
−39.659
−3.736
−22.439
−12.941


PLEK
−1.487
−39.004
−2.892
−10.591
−8.925


DSCAM
0
−38.384
−2.139
−8.24
−5.086


EPAS1
0
−38.289
−0.12
−4.994
−2.887


NABP1
0
−38.264
−0.256
−4.082
−3.117


SLC16A11
0
−38.005
−14.163
−4.89
−1.423


GZMF
0
−37.709
0
−1.555
−0.551


IKZF2
0
−37.446
−10.474
−4.472
−3.435


CD244
0
−37.285
0
−10.792
−12.777


GZMC
0
−37.022
−0.047
−3.473
−0.97


CDK6
−0.679
−36.931
0
−3.291
−11.204


SERPINB9
0
−36.781
−0.198
−6.539
−2.276


GEM
0
−36.705
−2.772
−7.375
−1.815


LAG3
−10.838
−36.69
−24.857
−4.285
−9.648


SLC2A3
0
−36.69
−0.88
−5.126
−0.392


UBASH3B
−1.011
−35.97
−1.123
−9.111
−2.621


NRGN
0
−35.762
−16.934
−6.835
−1.884


CCL3
−1.341
−35.708
0
−7.774
−2.31


GAPDH
−3.878
−35.534
−1.651
−28.733
−14.377


PLAC8
−0.15
−35.511
0
−12.622
−3.435


FOXRED2
0
−35.48
−0.706
−4.364
−10.171


GZMB
−17.027
−35.205
0
−17.517
−9.369


FILIP1
0
−34.687
0
−5.945
−2.659


RGS2
0
−34.658
−4.62
−6.204
−2.36


EXPH5
0
−34.452
−3.942
−1.893
−0.23


SRGAP3
0
−34.118
−0.864
−6.231
−5.259


GM5177
−1.85
−33.888
−2.021
−26.362
−13.092


MT1
0
−33.823
0
−12.657
−10.364


TPI1
0
−32.629
−2.816
−21.536
−12.198


ACOT7
0
−32.602
−9.357
−6.331
−9.956


BHLHE40
−0.301
−32.471
−34.275
−4.262
−0.569


CCNG1
0
−32.322
−0.599
−10.943
−5.354


FAM110A
0
−32.314
−3.177
−12.491
−6.369


S100A11
−2.849
−32.158
−2.028
−10.953
−5.371


DUSP4
0
−31.906
−22.596
−2.854
−4.287


CAPG
0
−31.567
−16.812
−3.977
−1.984


FAM3C
0
−31.563
−2.316
−9.625
−8.321


NR4A2
−1.781
−31.351
−10.637
−8.09
−7.402


TFF1
0
−30.862
−4.387
−2.848
−4.229


IMPA2
0
−30.742
−2.661
−20.435
−11.402


NRP1
0
−30.537
−30.902
−2.005
−4.848


CST7
0
−30.448
−8.982
−1.397
−2.186


PLXND1
0
−30.229
−0.273
−5.416
−2.422


PKM
0
−29.958
−0.613
−12.041
−9.718


STAT3
0
−29.78
−4.825
−1.639
−1.529


CXCR6
−17.027
−29.649
0
−2.242
−10.395


GDPD5
0
−29.436
−4.136
−4.718
−1.394


CCR8
0
−29.292
−36.056
−1.86
−0.433


SMIM3
0
−29.22
−6.398
−14.507
−0.831


ARL14EP
0
−29.195
−9.693
−12.507
−5.873


ERGIC1
0
−29.048
−4.429
−5.476
−8.829


ID2
−3.382
−29
0
−4.316
−3.54


EHD1
−0.727
−28.975
−1.406
−6.07
−2.538


CX3CR1
0
−28.883
−4.262
−11.074
−1.089


CASP3
−8.408
−28.856
−0.755
−13.285
−15.872


NRN1
0
−28.795
−39.128
−4.33
−1.816


PEX16
0
−28.699
−2.777
−2.117
−1.075


HNRNPA1
0
−28.44
−1.621
−11.938
−8.143


FDX1
0
−28.187
−1.805
−14.381
−4.935


OSBPL3
−5.169
−28.093
−0.769
−16.399
−10.473


GZME
0
−28.084
0
−1.257
−0.075


CIAPIN1
−0.55
−27.949
−3.79
−6.224
−8.771


SAMSN1
−0.812
−27.927
−5.485
−9.544
−4.631


ALDOA
−5.221
−27.783
−0.04
−3.877
−2.09


TUBB6
0
−27.679
0
−27.279
−5.756


IL2RB
−5.313
−27.616
−0.082
−2.376
−5.386


GZMD
0
−27.52
0
−1.044
−0.241


UHRF2
0
−27.41
−1.698
−7.039
−3.537






adj.pval.cluster 11
adj.pval.cluster 12
adj.pval.cluster 13
adj.pval.cluster 14
adj.pval.cluster 15





GLDC
−0.001
−0.109
0
0
−1.259


TNFRSF9
−0.27
−15.96
0
0
−0.619


PRF1
−0.001
−1.811
0
0
−0.032


IRF8
−0.001
−13.999
0
0
−0.83


CCRL2
−0.001
0
0
0
−0.091


LAT2
−0.001
0
0
0
−0.049


PCYT1A
−0.001
−0.683
0
0
−0.336


CSF1
−0.001
0
0
0
−0.198


MYO10
−0.001
0
0
0
−0.063


TMPRSS6
−0.001
0
0
0
−0.008


2900026A02RIK
−0.001
0
0
0
−0.051


HAVCR2
−0.001
−3.738
0
0
0


C1QTNF6
−0.001
0
0
0
−0.048


SERPINE2
−0.001
0
0
0
−1.4


ADAM8
−0.001
−0.079
0
−1.071
−0.132


ITGAV
−0.001
−5.295
0
0
−0.93


ADAMTS14
−0.001
−0.273
−0.141
0
−0.294


RGS8
−0.001
0
−0.141
0
−0.032


GPR56
−0.001
−0.171
0
0
−0.011


AA467197
−0.001
0
0
−0.152
−0.006


SLC37A2
−0.001
0
0
0
−0.225


PGLYRP1
−0.001
0
0
0
−0.597


ANXA2
−0.252
−0.485
0
−0.065
−1.032


TNFRSF4
−0.093
−5.661
0
0
−4.988


RBPJ
−0.001
−5.328
0
0
−1.932


LITAF
−0.001
−0.137
0
0
−1.067


HILPDA
−0.001
−0.247
0
0
−0.493


MNDA
−0.001
−0.022
0
0
−1.517


KIT
−0.001
−0.022
0
0
−0.121


GPD2
−0.001
−1.482
0
0
−0.483


IL1R2
−0.001
−0.517
0
−0.697
−5.067


RGS16
−0.095
−2.418
−0.595
0
−0.986


PLEK
−0.001
−7.355
0
0
−0.548


DSCAM
−0.001
0
−0.438
0
−0.461


EPAS1
−0.001
−0.023
−0.184
0
−0.233


NABP1
−0.001
−0.877
0
0
−0.04


SLC16A11
−0.001
0
0
0
−1.047


GZMF
−0.001
−0.002
0
0
−0.052


IKZF2
−0.001
−0.15
0
0
−1.383


CD244
−0.001
0
0
0
−0.012


GZMC
−0.001
0
0
−0.326
−0.088


CDK6
−0.001
−3.568
0
0
−0.434


SERPINB9
−0.001
−1.839
0
0
−0.411


GEM
−0.001
0
0
0
−0.647


LAG3
−0.108
−4.982
0
0
−1.917


SLC2A3
−0.001
−0.46
0
0
−0.668


UBASH3B
−0.001
−3.389
0
0
−0.022


NRGN
−0.001
0
−0.069
0
−3.422


CCL3
−0.001
−10.724
0
0
−0.093


GAPDH
−0.417
−4.935
0
0
−1.463


PLAC8
−0.001
0
0
0
−0


FOXRED2
−0.001
−0.4
0
0
−0.252


GZMB
−0.001
−0.426
0
0
0


FILIP1
−0.001
−0.039
0
0
−0.068


RGS2
−0.001
0
0
0
−0.672


EXPH5
−0.001
−0.184
−0.485
0
−1.176


SRGAP3
−0.001
−0.701
0
0
−0.018


GM5177
−0.432
−4.98
0
0
−1.502


MT1
−0.001
−0.183
0
0
−0.232


TPI1
−0.09
−3.506
0
0
−2.568


ACOT7
−0.001
0.436
0
−2.29
−2.183


BHLHE40
−0.125
−4.679
0
0
−1.518


CCNG1
−0.001
0
0
0
−0.919


FAM110A
−0.001
−2.034
0
0
−1.327


S100A11
−0.001
0
0
0
−0.729


DUSP4
−0.001
−9.348
0
0
−2.344


CAPG
−0.001
0
0
−0.408
−2.146


FAM3C
−0.001
−0.472
0
0
−0.726


NR4A2
−0.001
−1.067
−0.279
0
−0.99


TFF1
−0.001
0
0
−0.185
−0.85


IMPA2
−0.001
0
0
−0.066
−1.337


NRP1
0
−2.099
0
−3.231
−3.686


CST7
−0.001
−1.396
0
0
−0.093


PLXND1
−0.001
0
0
0
−0.068


PKM
−0.001
−6.583
0
0
−1.77


STAT3
−0.496
−10.55
−1.172
0
−0.553


CXCR6
−0.001
−0.018
0
0
0


GDPD5
−0.001
−0.882
0
−2.974
−2.711


CCR8
−0.001
−1.64
0
0
−2.288


SMIM3
−0.001
−0.51
0
−1.908
−1.071


ARL14EP
−0.001
−0.244
0
0
−3.948


ERGIC1
−0.022
−7.328
0
0
−1.126


ID2
−0.001
−0.416
0
0
−0.061


EHD1
−0.001
−4.721
0
0
−0.262


CX3CR1
−0.001
−0.17
0
0
−0.165


CASP3
−0.001
−1.659
0
0
−2.174


NRN1
−0.001
−0.014
0
0
−5.958


PEX16
−0.001
0
0
−1.498
−0.78


HNRNPA1
−0.055
−7.899
0
0
−2.267


FDX1
−0.001
0
0
0
−1.121


OSBPL3
−0.001
−0.386
0
0
−0.401


GZME
−0.001
0
0
−0.472
−0.068


CIAPIN1
−0.001
−1.959
0
0
−0.719


SAMSN1
−0.001
−2.181
0
−0.061
−0.308


ALDOA
−0.001
−2.078
0
0
−0.055


TUBB6
−0.001
0
0
−9.47
−4.64


IL2RB
−0.001
0
0
0
−0.001


GZMD
−0.001
0
0
0
−0.072


UHRF2
−0.001
−0.063
0
0
−1.752
















TABLE 5







Cluster 7 Specific Gene Signature

















0
8
9
10
rank_0
rank_8
rank_9
rank_10
mean_rank




















TNFRSF9
−91.791
−14.331
−14.793
−13.779
2
6
1
1
2.5


PRF1
−79.24
−29.216
−13.275
−11.21
6
1
2
2
2.75


GLDC
−113.856
−14.208
−7.744
−8.202
1
7
5
5
4.5


IRF8
−83.708
−4.672
−7.942
−9.831
4
44
4
3
13.75


ADAM8
−63.45
−16.254
−3.094
−6.172
18
4
36
9
16.75


SERPINB9
−43.259
−11.006
−4.253
−6.666
36
15
18
6
18.75


LAT2
−78.786
−8.281
−3.519
−5.725
8
24
30
13
18.75


CCRL2
−79.074
−12.487
−2.528
−6.172
7
12
48
11
19.5


NABP1
−45.027
−9.903
−6.648
−5.353
34
19
7
19
19.75


HILPDA
−50.028
−12.93
−6.973
−3.258
27
10
6
38
20.25


SLC2A3
−42.969
−7.103
−4.913
−8.887
38
28
13
4
20.75


PCYT1A
−79.651
−8.38
−2.581
−5.38
5
23
45
18
22.75


2900026A02RIK
−71.856
−10.685
−3.96
−2.875
11
16
23
41
22.75


TMPRSS6
−67.607
−10.618
−4.739
−2.384
15
17
15
55
25.5


MYO10
−70.953
−5.129
−4.707
−3.772
14
40
16
33
25.75


ID2
−32.463
−12.505
−5.358
−3.609
47
11
12
36
26.5


RBPJ
−57.177
−1.82
−5.411
−6.536
21
67
11
8
26.75


ITGAV
−71.527
−5.709
−3.643
−3.728
13
35
26
34
27


STAT3
−34.747
−2.777
−8.191
−6.615
45
55
3
7
27.5


LITAF
−55.012
−6.153
−2.705
−6.172
26
33
44
10
28.25


SERPINE2
−73.535
−5.105
−2.535
−5.436
9
42
47
16
28.5


PGLYRP1
−59.204
−4.039
−3.643
−5.014
19
47
27
21
28.5


ALDOA
−31.109
−12.007
−3.074
−5.422
48
13
37
17
28.75


CSF1
−87.97
−2.774
−3.364
−4.438
3
56
32
25
29


GEM
−47.97
−4.672
−3.388
−5.711
28
45
31
14
29.5


HAVCR2
−64.475
−28.612
−2.425
−2.598
17
2
50
49
29.5


IL2RB
−30.885
−11.694
−5.935
−2.605
49
14
9
47
29.75


EPAS1
−47.895
−9.44
−3.336
−3.315
29
21
33
37
30


AA467197
−55.604
−7.01
−3.64
−3
24
29
28
40
30.25


RGS2
−45.144
−3.008
−4.105
−5.039
33
53
19
20
31.25


LILRB4
−30.192
−8.596
−5.464
−2.766
50
22
10
43
31.25


SLC37A2
−55.738
−6.098
−5.942
−2.076
23
34
8
61
31.5


UBASH3B
−46.981
−6.695
−2.92
−4.125
30
31
41
29
32.75


SH2D2A
−22.905
−13.497
−3.954
−3.078
62
9
24
39
33.5


CCL3
−42.757
−13.952
−2.096
−3.705
40
8
54
35
34.25


PLAC8
−42.778
−15.592
−1.435
−4.693
39
5
72
22
34.5


ANXA2
−57.193
−5.113
−1.633
−6.172
20
41
66
12
34.75


GPR56
−55.013
−20.844
−2.995
−1.571
25
3
40
73
35.25


ADAMTS14
−71.564
−7.445
−3.954
−1.436
12
27
25
78
35.5


BCL2L11
−27.381
−4.72
−4.051
−4.63
56
43
21
24
36


PEX16
−36.099
−2.394
−4.791
−4.42
44
60
14
26
36


C1QTNF6
−71.908
−10.129
−2.852
−1.567
10
18
42
74
36


EHD1
−36.304
−5.416
−3.159
−3.847
43
36
35
31
36.25


RGS8
−67.32
−7.49
−3.585
−1.429
16
25
29
79
37.25


S100A11
−40.349
−6.891
−2.113
−3.933
42
30
53
30
38.75


GPD2
−56.588
−9.512
−1.734
−1.998
22
20
62
62
41.5


GZMC
−46.719
−5.383
−1.956
−2.723
32
37
56
44
42.25


DENND4A
−17.686
−2.592
−4.01
−4.635
70
57
22
23
43


EXPH5
−46.88
−1.602
−2.757
−4.261
31
71
43
28
43.25


CBLB
−15.239
−6.307
−4.266
−2.107
72
32
17
59
45


GZMF
−42.15
−5.285
−2.132
−2.474
41
38
52
51
45.5


CCNG1
−43.191
−7.49
−1.353
−2.419
37
26
79
53
48.75


SERPINB6B
−27.486
−1.621
−1.918
−5.66
55
70
57
15
49.25


IL12RB2
−29.107
−2.489
−1.779
−4.264
52
58
61
27
49.5


GDPD5
−43.403
−1.575
−1.85
−2.683
35
72
59
45
52.75


RPS2
−11.518
−2.415
−2.998
−2.774
75
59
39
42
53.75


SLC25A4
−22.912
−2.934
−4.07
−1.377
61
54
20
80
53.75


GIPC2
−34.711
−1.384
−2.169
−2.605
46
75
51
48
55


LPIN2
−26.773
−3.391
−1.581
−2.621
57
50
69
46
55.5


GZME
−28.851
−3.604
−1.388
−2.41
54
48
76
54
58


MAP3K1
−11.979
−2.207
−3.043
−2.217
74
63
38
58
58.25


DGAT1
−18.408
−4.261
−1.786
−1.994
69
46
60
63
59.5


RARA
−7.991
−2.277
−3.295
−1.733
78
61
34
68
60.25


RIOK1
−15.921
−5.188
−1.96
−1.523
71
39
55
77
60.5


AI662270
−19.804
−3.455
−1.376
−2.526
67
49
77
50
60.75


NPNT
−28.907
−1.748
−1.677
−2.32
53
68
65
57
60.75


GZMD
−29.386
−3.315
−1.399
−1.823
51
52
73
67
60.75


ADCK3
−21.125
−1.311
−2.528
−2.436
64
80
49
52
61.25


SDCBP2
−26.342
−3.357
−1.395
−1.836
58
51
74
66
62.25


MVP
−20.821
−1.34
−1.393
−3.779
65
78
75
32
62.5


TRAF4
−19.908
−1.884
−1.454
−2.337
66
65
70
56
64.25


CALR
−24.033
−2.21
−1.376
−2.091
60
62
78
60
65


SKIL
−8.274
−2.069
−2.539
−1.532
77
64
46
76
65.75


FUZ
−21.768
−1.404
−1.86
−1.588
63
74
58
70
66.25


OSR2
−18.773
−1.73
−1.697
−1.588
68
69
64
71
68


ZC3H12C
−26.294
−1.322
−1.436
−1.855
59
79
71
65
68.5


FNDC3A
−9.517
−1.378
−1.701
−1.957
76
77
63
64
70


ZFP296
−12.662
−1.822
−1.62
−1.545
73
66
67
75
70.25


FXYD5
−3.349
−1.494
−1.614
−1.58
80
73
68
72
73.25


CD3E
−7.741
−1.383
−1.336
−1.615
79
76
80
69
76
















TABLE 6







Cluster 7 Specific Gene Signature


CD8_cluster7











Genes 1-25
Genes 26-50
Genes 51-75
Genes 76-100
Genes 101-124





PRF1
ADAMTS14
GZMD
SLC16A3
STK24


GLDC
RGS8
SERPINE2
GPR65
FKBP1A


LAT2
CCNG1
SLC25A4
FCRL6
DSCAM


ADAM8
CDK6
PADI2
GM14295
STK39


TNFRSF9
GPR56
PPP1R3B
ITGB1BP1
ISY1


HILPDA
GPD2
MYO10
SRGAP3
MRC2


TMPRSS6
PLAC8
SLC52A3
FOXRED2
NUDT18


CCRL2
HAVCR2
ASB2
NAGPA
SIL1


ID2
GZMF
LRRK1
RCN1
ENO1


NABP1
CBLB
AFG3L2
GBP10
LPIN2


LILRB4
EHD1
PTK2B
SLA2
GP49A


2900026A02RIK
FILIP1
ACOXL
RHOC
ACOT7


AA467197
SLC2A3
NEK6
STAT3
RGS1


SERPINB9
GZME
NEDD9
SYNGR3
SLC27A1


UBASH3B
BCL2L11
ANXA2
PLXND1
CST7


CXCR6
INSRR
DGAT1
SLC24A1
TIPRL


PCYT1A
PLEK
IRAK2
ERO1L
CTSD


IL2RB
RGS2
MT2
RLN3
CIAPIN1


CCL3
GZMC
THEMIS2
EPDR1
PTPN5


LITAF
GEM
RGS16
IRF8
IL10RA


EPAS1
GZMB
PGLYRP1
MT1
GSTO1


C1QTNF6
SDCBP2
FXYD5
TMEM135
GBP6


ALDOA
ITGAV
SLC35D3
SLCO2A1
SERINC1


S100A11
PKM
CCL4
TOMM40L
RAB19


SLC37A2
SH2D2A
DGKH
D16ERTD472E









Example 2—Identification of Novel Tumor Infiltrating CD4+ T Cells Populations

CD4 cells were analyzed from the mouse tumor model at time points as discussed in Example 1 herein. CD4 T cells (both Effector and Regulatory) were obtained by sorting for CD4+CD45+ cells. NK cells, dendritic cells, and macrophages were obtained by sorting for CD4CD8CD45+ cells. CD45 cells included fibroblasts and tumor cells.



FIG. 19 illustrates dimension reduction analysis of the cells sequenced for CD4 T cells. Applicants sequenced 2496 cells (26 plates). 2114 cells passed the basic QC (85%) and 1478 cells passed the extensive QC (59%). Principal component (PC) analysis was performed using gene expression measured in the single cells. PC1 was associated with transcription and PC2 and PC3 were strongly associated with sequencing batches. tSNE and clustering was performed on PCs 4-6. All of the CD4 cells were pooled together on a normalized tSNE. The CD4 cells clustered into 14 clusters. FIG. 20 illustrates each cluster individually. FIG. 21 illustrates 4 populations that stand out based on expression of the CD4 Treg marker Foxp3 and a Treg signature. FIG. 22 illustrates 5 populations that stand out based on expression of the coinhibitory receptor Tim3. Tim3+CD4 Tregs are the most repressive in the tumor environment (Sakuishi et al., TIM3+FOXP3+ regulatory T cells are tissue-specific promoters of T-cell dysfunction in cancer. Oncoimmunology. 2013 Apr. 1; 2(4):e23849). The clusters also express Tbet which has been described in the context of Tregs that suppress Th1 responses (Levine et al., Stability and function of regulatory T cells expressing the transcription factor T-bet. Nature. 2017 Jun. 15; 546(7658):421-425). FIG. 23 illustrates that CD4 clusters 4 and 7 have high of a Th1 and cytokine secretion signature. The Th1 signature includes Tbet, IFNy, II2, TNFa, STAT4, CXCR6, CCR5 AND CXCR3. The cytokine signature includes GZMB, GZMK, PRF1, GZMA, GZMF, GZMC, GZMM, IFNG, TNF, GZMD, GZME and IL2.


Example 3—Identification of Cell-Cell Interactions in CD8+ and CD4+ T Cell Populations


FIGS. 24-26 and 36 illustrate that the different CD4 and CD8 T cell subtypes identified positively and negatively correlate with each other. Positive correlation relates to the situation wherein high expression of one cell subtype correlates to high expression of another cell type. Negative correlation relates to the situation wherein high expression of one cell subtype correlates to low expression of another cell type. For example, CD4 clusters 8 and 10 expression correlates to expression of CD8 cluster 7 (FIG. 26). Thus, DP suppressive and dysfunctional CD8 T cells (cluster 7) correlate with CD4 Tim3+ Tregs and CD4 HeliosloiTregs. FIG. 36 shows that the relative frequency of dysfunctional CD8+ T cells in a tumor is correlated with CD4+ Treg frequency.



FIG. 27 shows a heatmap plotting a signature for ligands and a signature for receptors. Thus, clusters of T cells can be analyzed for expression of receptor/ligand pairs. The clusters expressing the receptor/ligand pairs may functionally interact. For example, CD8 cluster 7 expresses a receptor for the ligand expressed by CD4 cluster 1.



FIG. 28 shows that cells analyzed by single cell RNA-seq provide results consistent with bulk sequencing.



FIG. 37 shows interactions between CD8 PD1+ populations and CD4 populations (Th1-like and Treg). Connections were specifically made based on chemokine/chemokine receptor pairs (with CD45+). Applicants analyzed the interactions between Cluster 8 and cluster 7 (i.e., dysfunctional clusters).



FIG. 38 shows that XCL1 is expressed strongly in cluster 8 and XCR1 is expressed in cluster 7. Previous studies described the role of XCL1, a chemokine associated with immune suppression and allergy, on CD4(+)CD25(high)CD127(low/−) regulatory T cell (Treg) function in allergic asthma (Nguyen et al., J Immunol. 2008 Oct. 15; 181(8):5386-95). Several studies suggest that during early tumor response NK cells secrete XCL-1 thereby recruiting XCR1+cDC1, which have been shown to be crucial to cross-present antigens to CD8 T cells (e.g PMID: 22566900 and 29429633). This cross-presentation is crucial to differentiate cytotoxic CD8 T cells. Thus, a widespread blockade of this molecule during early tumor responses could potentially hinder rather than enhance immune response. Nevertheless, some reports show that XCR-1 is expressed in Tregs and that its ligand XCL-1 increases suppressive activity in models of allergic asthma (PMID: 18832695). Moreover, it is now better recognized that chemokine-chemoreceptors axis can be exploited by Tregs to directly suppress T conventional cells cytotoxic activity (PMID: 26854929). Applicants hypothesize that C7 CD8 cells are recruited to the tumor via this axis and/or Tregs are recruited through secretion of C8 CD8-derived XCL-1. Applicants hypothesize that the XCL1+CD8+: XCR1+CD8+ axis may enhance regulatory activity of CD8+ TILs. Thus, targeting XCL1 and/or XCR1 in CD8 clusters 8 and 7 may be used to enhance or inhibit T cell suppression.



FIG. 39 shows that CCL1 is expressed in cluster 8 and CCR8 is expressed in cluster 8 and cluster 7. CCR8 is also expressed in Treg+Tim3+CD4 cells. Applicants hypothesize that CCL1+CD8+ cells (cluster 8) have a regulatory interaction with dysfunctional CD8 (cluster 7) and CD4+ Tregs. Previous studies data confirm the importance of this axis in recruiting Tregs to lymphoid tissues and inflammatory sites and sustaining their inflammatory phenotype (PMID: 11560999, 23798714). Additionally, blockade of CCL1 has been suggested to enhance tumor immunity (Hoelzinger et al., Blockade of CCL1 Inhibits T Regulatory Cell Suppressive Function Enhancing Tumor Immunity without Affecting T Effector Responses, J Immunol. 2010 Jun. 15; 184(12):6833-42).


Example 4—Identification of CD8+ T Cells Populations Using 10× Genomics Platform

Applicants performed single cell RNA sequencing on the B16 mouse model as described in Example 1 using the 10× genomics platform (10× Genomics, Inc., Pleasanton, Calif., www.10xgenomics.com/solutions/single-cell/). Applicants validated the previous results in that the 10× time course data revealed the same populations as in the CD8 plates, including cluster 7. FIG. 40 shows cell counts taken for cells sorted by day (left) and sorted by size (right). The first step performed was to select for CD3+ cells (FIG. 41). FIG. 42 shows the general statistics for all time points taken. FIG. 43 shows CD8/CD4 partitioning of the clusters. The portioned cells could be classified as strict CD4, Strict CD8, weak CD8. Cluster 3 and cluster 0 are CD8 clusters. Clusters 2 and 5 are CD4 clusters. FIG. 44 shows the number of cells after selecting for CD8/CD4 cells and batch correction across the time points. FIG. 45 shows plots of strict CD8 cells based on mouse, time point/batch, and by clustering. The single cells did not cluster by mouse or by time point, but clustered according to cell type (e.g., gene expression, tSNE). Applicants measured cluster specific gene expression in the strict CD8 cells and the results were consistent with signatures for CD8 clusters (FIGS. 46-47). FIG. 48 shows a comparison of the plate based clusters and the 10× clusters. For example, a cluster corresponding to clusters 7, 8, 9 and 10 in the plate based analysis were also present in single cells analyzed by the 10× platform.


Various modifications and variations of the described methods, pharmaceutical compositions, and kits of the invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention. Although the invention has been described in connection with specific embodiments, it will be understood that it is capable of further modifications and that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the invention that are obvious to those skilled in the art are intended to be within the scope of the invention. This application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure come within known customary practice within the art to which the invention pertains and may be applied to the essential features herein before set forth.

Claims
  • 1. An isolated T cell characterized in that the T cell comprises expression of CD8, TIM3, PD1, MT1, and IKZF2, and comprises expression of one or more genes selected from the group consisting of: a. TNFRSF9, PRF1, BHLHE40 (DEC1), IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2, SPRY2 and XCR1; orb. Table 6, preferably,wherein the T cell does not express HMMR; and/orwherein the T cell comprises upregulation of one or more genes selected from the group consisting of TNFRSF9, PRF1, BHLHE40, IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, IKZF2, KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2, SPRY2 and XCR1 as compared to all CD8+TIM3+PD1+ T cells; and/orwherein the T cell comprises downregulation of a cell cycle signature as compared to all CD8+TIM3+PD1+ T cells.
  • 2-4. (canceled)
  • 5. The isolated T cell according to claim 1, wherein the T cell suppresses T cell proliferation; and/or wherein the T cell is further characterized by a gene signature comprising one or more genes or polypeptides selected from Tables 1 to 6; and/orwherein the T cell is a human cell; and/orwherein the T cell is autologous for a subject suffering from cancer.
  • 6-8. (canceled)
  • 9. A method for detecting or quantifying T cells in a biological sample of a subject, the method comprising detecting or quantifying in a biological sample of the subject, T cells as defined in claim 1, preferably, wherein T cells are detected or quantified using a set of markers comprising:a) TIM3, SERPINE2 and HMMR; orb) SERPINE2 and HMMR; orc) TIM3, KIT and HMMR; ord) TIM3, TNFRSF4 and HMMR; ore) (a), (b), (c) or (d) and one or more genes or polypeptides selected from the group consisting of CD8, CD45 and PD1; orf) (a), (b), (c), (d) or (e) and one or more genes or polypeptides selected from the group consisting of TNFRSF9, PRF1, BHLHE40, IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, IKZF2, KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2, SPRY2 and XCR1, more preferably,wherein intact T cells are detected or quantified using a set of surface markers comprising:a) TIM3, SERPINE2 and HMMR; orb) SERPINE2 and HMMR; orc) TIM3, KIT and HMMR; ord) TIM3, TNFRSF4 and HMMR; ore) (a), (b), (c) or (d) and one or more genes or polypeptides selected from the group consisting of CD8, CD45 and PD1; orf) (a), (b), (c), (d) or (e) and one or more genes or polypeptides selected from the group consisting of TNFRSF9, IL1R2, SLC2A3, TNFRSF4, KLRC1, IL18R1, TNFRSF18, LAT2, ADAM8, KIT, SERPINE2 and XCR1.
  • 10. (canceled)
  • 11. The method according to claim 9, wherein the T cells are detected or quantified using a technique selected from the group consisting of RT-PCR, RNA-seq, single cell RNA-seq, flow cytometry, mass cytometry, fluorescence activated cell sorting, fluorescence microscopy, affinity separation, magnetic cell separation, microfluidic separation, and combinations thereof; or wherein the intact T cells are detected or quantified using a technique selected from the group consisting of flow cytometry, fluorescence activated cell sorting, affinity separation, magnetic cell separation, microfluidic separation, and combinations thereof.
  • 12-13. (canceled)
  • 14. A method for isolating T cells from a biological sample of a subject, the method comprising isolating from the biological sample, T cells as defined in claim 1, preferably, wherein T cells are isolated using a set of surface markers comprising:a) TIM3, SERPINE2 and HMMR; orb) SERPINE2 and HMMR; orc) TIM3, KIT and HMMR; ord) TIM3, TNFRSF4 and HMMR; ore) (a), (b), (c) or (d) and one or more genes or polypeptides selected from the group consisting of CD8, CD45 and PD1; orf) (a), (b), (c), (d) or (e) and one or more genes or polypeptides selected from the group consisting of TNFRSF9, IL1R2, SLC2A3, TNFRSF4, KLRC1, IL18R1, TNFRSF18, LAT2, ADAM8, KIT, SERPINE2 and XCR1, more preferably,wherein the T cells are isolated, using a technique selected from the group consisting of flow cytometry, fluorescence activated cell sorting, affinity separation, magnetic cell separation, microfluidic separation, and combinations thereof.
  • 15-16. (canceled)
  • 17. The method according to claim 9, wherein the technique employs one or more agents capable of specifically binding to one or more gene products expressed or not expressed by the T cells, preferably on the cell surface of the T cells, more preferably, wherein the one or more agents are one or more antibodies.
  • 18. (canceled)
  • 19. The method according to claim 9, wherein the biological sample is a tumor sample obtained from a subject in need thereof; or wherein the biological sample is a sample obtained from a subject suffering from an autoimmune disease; orwherein the biological sample is a sample obtained from a subject suffering from a chronic infection; orwherein the biological sample comprises ex vivo or in vitro T cells.
  • 20-22. (canceled)
  • 23. A population of T cells comprising T cells as defined in claim 1; or depleted for T cells as defined in claim 1.
  • 24. (canceled)
  • 25. The population of T cells according to claim 23, wherein the population of T cells comprise chimeric antigen receptor (CAR) T cells or T cells expressing an exogenous T-cell receptor (TCR); and/or wherein the population of T cells comprise T cells autologous for a subject suffering from cancer; and/orwherein the population of T cells comprise T cells displaying tumor specificity; and/orwherein the population of T cells are expanded; and/orwherein the population of T cells comprise activated T cells, preferably, wherein the population of T cells comprises T cells activated with tumor specific antigens, more preferably, wherein the tumor specific antigens are subject specific antigens.
  • 26-31. (canceled)
  • 32. A pharmaceutical composition comprising the depleted T cell population as defined in claim 23.
  • 33. A method of treating cancer comprising administering to a subject in need thereof the pharmaceutical composition according to claim 32.
  • 34. A method of treating cancer in a subject in need thereof comprising: a) depleting T cells as defined in claim 1 from a population of T cells obtained from the subject;b) in vitro expanding the population of T cells; andc) administering the in vitro expanded population of T cells to the subject.
  • 35. The method according to claim 33, wherein the T cell population is administered after ablation therapy or lymphodepletion therapy.
  • 36. A method of treating cancer or chronic infection in a subject in need thereof comprising administering to the subject a therapeutically effective amount of an agent: a) capable of reducing the activity of a T cell as defined in claim 1; orb) capable of reducing the activity or expression of one or more genes or polypeptides selected from the group consisting of TNFRSF9, PRF1, BHLHE40, IRF8, GLDC, STAT3, CST7, IL1R2, EEF2, SLC2A3, SQSTM1, RBPJ, NABP1, ACTN1, TNFRSF4, SERPINB9, FOSL2, CAPG, KLRC1, IL18R1, JUNB, EEF1A1, TNFRSF18, RGS2, NFKB2, RPL5, PEX16, LAT2, KDM5B, HILPDA, GEM, DENND4A, BCL2L11, ADAM8, PGLYRP1, IKZF2, KIT, SERPINE2, CCRL2, CSF1, EPAS1, RUNX2, SPRY2 and XCR1; orc) capable of targeting or binding to one or more cell surface exposed genes or polypeptides on a T cell as defined in claim 1; ord) capable of targeting or binding to one or more receptors or ligands specific for a cell surface exposed gene or polypeptide on a T cell as defined in claim 1; ore) capable of targeting or binding to one or more genes or polypeptides secreted from a T cell as defined in claim 1; orf) capable of targeting or binding to one or more receptors specific for a gene or polypeptide secreted from a T cell as defined in claim 1, preferably,wherein said agent comprises a therapeutic antibody, antibody fragment, antibody-like protein scaffold, aptamer, protein, genetic modifying agent or small molecule, more preferably, wherein the therapeutic antibody is an antibody drug conjugate; orwherein said agent capable of targeting or binding to a cell surface exposed gene or polypeptide comprises a CAR T cell capable of targeting or binding to the cell surface exposed gene or polypeptide.
  • 37-39. (canceled)
  • 40. A method of treating an autoimmune disease in a subject in need thereof comprising administering to the subject a therapeutically effective amount of an agent capable of inducing the activity of a T cell as defined in claim 1; or administering to the subject T cells as defined in claim 1.
  • 41. (canceled)
  • 42. A method for identifying an immunomodulant capable of modulating one or more phenotypic aspects of the T cell as defined in claim 1, comprising: a) applying a candidate immunomodulant to the T cell or T cell population; andb) detecting modulation of one or more phenotypic aspects of the T cell or T cell population by the candidate immunomodulant, thereby identifying the immunomodulant, preferably,wherein the immunomodulant is capable of modulating suppression of T cell proliferation by the T cell; and/orwherein the immunomodulant comprises a therapeutic antibody, antibody fragment, antibody-like protein scaffold, aptamer, protein or small molecule.
  • 43-44. (canceled)
  • 45. A pharmaceutical composition comprising the immunomodulant as defined in 42.
  • 46. A method for determining the T cell status of a subject, or for diagnosing, prognosing or monitoring a disease comprising an immune component in a subject, the method comprising detecting or quantifying in a biological sample of the subject T cells as defined in claim 1, wherein an increase as compared to a reference level indicates a suppressed immune response, preferably, wherein the disease is cancer, an autoimmune disease, or chronic infection.
  • 47. (canceled)
  • 48. A method of preparing cells for use in adoptive cell transfer comprising: a) obtaining a population of T cells; andb) depleting T cells as defined in claim 1 from the population of T cells, preferably,further comprising expanding the depleted cells; and/orfurther comprising activating the depleted cells; and/orwherein the population of T cells comprise CAR T cells; and/orwherein the population of T cells comprise autologous TILs.
  • 49-52. (canceled)
  • 53. A method of screening for genes required for suppression of effector T cells by suppressive CD8+ T cells comprising: a) introducing a library of sgRNAs specific to a set of target genes to a population of T cells expressing a CRISPR system;b) culturing the cells in proliferating conditions in the presence of CD8 T cells according to claim 1;c) determining sgRNAs that are enriched in proliferating T cells.
  • 54. A method of treating cancer or chronic infection in a subject in need thereof comprising: administering to the subject CD8+ T cells modified to be resistant to suppressive CD8+ T cells, wherein the modified CD8+ T cells are specific for the cancer or chronic infection, and/oradministering to the subject a therapeutically effective amount of an agent capable of blocking glucocorticoid signaling, preferably, wherein the agent is an antagonist of NR3C1, more preferably, wherein the antagonist is a blocking antibody; and/orreducing or eliminating the presence of an immune cell or changing a phenotype of the immune cell, at least at a disease or infection loci, wherein the immune cell is the immune cell of claim 1.
  • 55-59. (canceled)
  • 60. The method of claim 54, wherein the presence of the immune cell is reduced or eliminated, or wherein a phenotype of the immune cell is changed by modulating expression of MT1 and/or MT2; or wherein the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression of HELIOS (IKZF2); orwherein the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression or function of KIT; orwherein the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression or function of SERPINE2; orwherein the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression or function of TNFRSF4; orwherein the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression or function of ILR2; orwherein the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression or function of CSF1; orwherein the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression or function of CCRL2; orwherein the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed by modulating expression or function of IRF8; orwherein the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed, by modulating expression or function of RBPJ; orwherein the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed, by modulating expression or function of EPAS1; orwherein the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed, by modulating expression or function of RUNX2; orwherein the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed, by modulating expression or function of SPRY2; orwherein the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed, by modulating expression or function of STAT3; orwherein the presence of the immune cell is reduced or eliminated, or where a phenotype of the immune cell is changed, by reducing a sensitivity of the immune cell to glucocoritcoid signaling; orwherein the presence of the immune cell is reduced or eliminated, or wherein a phenotype of the immune cell is changed by modulating expression of XCR1, preferably,wherein modulating expression or function comprises inhibiting expression or function.
  • 61-76. (canceled)
  • 77. A kit comprising reagents to detect at least one gene or polypeptide as defined in claim 1.
  • 78. (canceled)
  • 79. (canceled)
  • 80. A method of treating cancer or chronic infection in a subject in need thereof comprising administering to the subject a therapeutically effective amount of an agent capable of inhibiting the interaction between XCL1 and XCR1.
CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Application Nos. 62/565,862, filed Sep. 29, 2017 and 62/588,101, filed Nov. 17, 2017. The entire contents of the above-identified applications are hereby fully incorporated herein by reference.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH

This invention was made with government support under Grant Nos. MH105960, CA187975, AI073748 and NS045937 awarded by the National Institutes of Health. The government has certain rights in the invention.

PCT Information
Filing Document Filing Date Country Kind
PCT/US18/53791 10/1/2018 WO 00
Provisional Applications (2)
Number Date Country
62565862 Sep 2017 US
62588101 Nov 2017 US