Photoreactive ligands and uses thereof

Information

  • Patent Grant
  • 11535597
  • Patent Number
    11,535,597
  • Date Filed
    Wednesday, January 17, 2018
    6 years ago
  • Date Issued
    Tuesday, December 27, 2022
    a year ago
Abstract
Disclosed herein are methods for identifying proteins as targets for interaction with a small molecule ligand. Also disclosed herein are small molecule ligands and compositions for use in profiling druggable proteins.
Description
SEQUENCE LISTING

The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Mar. 14, 2018, is named 48054-706_831_SL.txt and is 199,103 bytes in size.


BACKGROUND OF THE DISCLOSURE

Protein function assignment has been benefited from genetic methods, such as target gene disruption, RNA interference, and genome editing technologies, which selectively disrupt the expression of proteins in native biological systems. Chemical probes offer a complementary way to perturb proteins that have the advantages of producing graded (dose-dependent) gain- (agonism) or loss- (antagonism) of-function effects that are introduced acutely and reversibly in cells and organisms. Small molecules present an alternative method to selectively modulate proteins and to serve as leads for the development of novel therapeutics.


SUMMARY OF THE DISCLOSURE

Disclosed herein, in certain embodiments, is a method of identifying a protein capable of interacting with a small molecule ligand, comprising: (a) providing a cell sample; (b) exposing the cell sample to at least one potential small molecule ligand having a structure comprising at least a photoreactive diazirine group and a terminal alkyne group; (c) irradiating the cell sample with UV light; (d) performing lysis on the cell sample; (e) subjecting proteins in the post lysis material to fluorophore tagging; and (f) isolating at least one fluorophore-tagged protein.


Disclosed herein, in certain embodiments, is a method of identifying a protein capable of interacting with a small molecule ligand, comprising: (a) providing a cell sample; (b) exposing the cell sample to the small molecule ligand having a structure comprising at least a photoreactive diazirine group, and a terminal alkyne group; (c) irradiating the cell sample with UV light; (d) performing lysis on the cell sample; (e) subjecting the proteins in the post lysis material to tagging; and (f) isolating the tagged proteins for analysis to identify a protein capable of interacting with the small molecule ligand.


Disclosed herein, in certain embodiments, is a method of identifying a small molecule ligand binding site on an isolated protein, comprising: (a) providing an isolated protein; (b) exposing the protein to at least one of potential small molecule ligands having a structure comprising at least a photoreactive diazirine group and a terminal alkyne group; (c) irradiating the protein with UV light; (d) tagging the protein with biotin; (e) binding the biotin-tagged protein to solid phase beads; (f) digesting the protein to provide protein fragments; and (g) analyzing the protein fragments to determine the small molecule ligand binding site.


Disclosed herein, in certain embodiments, is a method of identifying a small molecule ligand capable of interacting with a cellular protein, comprising: (a) providing a cell which expresses the cellular protein; (b) exposing the cell to a first-small molecule ligand of predetermined affinity for the cellular protein and a second small molecule ligand, wherein the small molecule ligand of predetermined affinity has a structure comprising at least a photoreactive diazirine group and a terminal alkyne group; (c) irradiating the cell with UV light; (d) performing lysis on the cell; (e) subjecting proteins in the post lysis material to tagging of the first small molecule ligand; and (f) determining the level of tagging in the presence of the second small molecule ligand compared to the level of tagging in the absence of the second small molecule ligand.


Disclosed herein, in certain embodiments, is a small molecule ligand which is capable of binding to a binding site on a protein, in which the protein is selected from Tables 1-4. In some cases, the binding site is disclosed in Tables 1-3.





BRIEF DESCRIPTION OF THE DRAWINGS

Various aspects of the disclosure are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present disclosure will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the disclosure are utilized, and the accompanying drawings of which:



FIG. 1A-FIG. 1H exemplify a chemical proteomic strategy for mapping of fragment-protein interactions in cells. FIG. 1A represents schematic depiction of fully functionalized fragment (FFF) probes and experimental workflow to identify FFF-protein interactions in cells by quantitative MS-based proteomics. Isotopically heavy and light amino acid-labeled cells are treated with distinct FFF probes for 30 min, followed by UV light exposure, lysis, conjugation to biotin azide by CuAAC, streptavidin enrichment of labeled proteins, tryptic digestion and subsequent analysis of tryptic peptides. FIG. 1B exemplifies structures of FFF probes. Shown are the ‘constant’ (containing the diazirine photoreactive group and clickable alkyne handle) and ‘variable’ (consisting of small-molecule fragments; enclosed in box) regions of probes. FIG. 1C exemplifies FFF probe-protein interactions in cells. HEK293T cells were treated with probes (20 μM) for 30 min, followed by photocrosslinking and analysis as described in FIG. 1D. Asterisk mark representative distinct probe-protein interactions. FIG. 1E exemplifies additional profiles of FFF probe-protein interactions. FIG. 1D exemplifies experimental workflow to visualize FFF probe-protein interactions in cells by SDS-PAGE coupled with in-gel fluorescence scanning. Cells are treated with indicated FFF probe for 30 min, followed by photocrosslinking, lysis, CuAAC conjugation to a rhodamine (TAMRA)-azide tag, separation by SDS-PAGE, and visualization by in-gel fluorescence scanning. FIG. 1E exemplifies FFF probe-protein interactions in cells. HEK293T cells were treated with FFF probes (20 μM) for 30 min in situ, followed by photocrosslinking, separation of soluble and membrane fractions and analysis. (FIG. 1F, FIG. 1G) Fragment probes show concentration-dependent labeling of proteins in HEK293T cells (FIG. 1F), with little to no further change in protein labeling when incubated in cells for 5 to 30 min prior to photocrosslinking (FIG. 1G). FIG. 1H exemplifies HEK293T cells were treated with FFF probes (20 μM) for 30 min, and the cells were then washed 1-2× with DPBS prior to photocrosslinking. Asterisks mark proteins that show similar extents of probe labeling before and after cell washing.



FIG. 2A-FIG. 2T exemplify quantitative MS-based proteomic analysis of fragment-protein interactions in cells. FIG. 2A exemplifies heatmap showing relative protein enrichment values of FFF probes (200 μM) versus control 1 in HEK293T cells. FIG. 2B is a representative SILAC ratio plot of proteins differentially enriched in probe-vs-probe (13 versus 3) experiments in HEK293T cells. Proteins preferentially enriched (>3-fold by either probe, depicted with dashed lines) in 13-vs-3 experiments that were also preferentially enriched (>2-fold) by 13 or 3 in probe-vs-control 1 experiments are depicted. FIG. 2B also exemplifies proteins that were strongly enriched by both probes in probe-vs-control 1 experiments and proteins not enriched by either probe. FIG. 2C exemplifies that most proteins demonstrating preferential enrichment (>3-fold) in probe-vs-probe experiments show corresponding preferential enrichment by the same probe in probe-vs-1 experiments. Light gray portions of bars mark fractions of proteins that were strongly enriched by both probes in probe-vs-control 1 experiments. (FIG. 2D-FIG. 2F) Heatmaps (FIG. 2D, FIG. 2E) and extracted MS1 chromatograms of representative tryptic peptides (FIG. 2F) for four example proteins showing strong preferential enrichment by one FFF probe over control 1 (FIG. 2D) and the corresponding results for these proteins in probe-vs-probe experiments (FIG. 2E). FIG. 2G exemplifies that the majority of proteins that are strongly enriched (SILAC ratio>10) by most FFF probes (≥8 of 11) in probe-vs-control 1 experiments show preferential enrichment by one FFF probe in probe-vs-probe experiments. (FIG. 2H-FIG. 2J) Heatmaps (FIG. 2H, FIG. 2I) and extracted MS1 chromatograms of representative tryptic peptides (FIG. 2J) for three example proteins showing enrichment by the majority of FFF probes over control 1 (FIG. 2H) and preferential enrichment by FFF probe 3 in probe-vs-probe experiments (FIG. 2I). FIG. 2K exemplifies that FFF probes show minimal toxicity in HEK293T cells when tested under conditions that mirror those used for mapping probe-protein interactions in cells (200 μM FFF probe, 45 min incubation). Viability was assessed by CellTiter-Glo luminescent assay. Data represent average values±SD. n=3 per group. FIG. 2L exemplifies SILAC ratio plots for representative FFF probes in which isotopically heavy and light amino acid-labeled HEK293T cells were treated with control 1 or the indicated FFF probe (200 μM each). Dashed line indicates required threshold enrichment ratio (>5-fold) for designation of FFF targets. FIG. 2M exemplifies representative SILAC ratio plots for control experiments in which isotopically heavy and light amino acid-labeled HEK293T cells were treated with the same FFF probe (200 μM). FIG. 2N exemplifies that fraction of targets for representative FFF probes that exhibit UV-dependent enrichment. Briefly, ‘light’ cells were treated with 200 μM of the corresponding probe and UV-irradiated while ‘heavy’ cells were treated with the same probe and not exposed to UV light. Proteins were considered to be labeled in a UV-dependent fashion if >3-fold enrichment in light cells was observed. For each probe, >97% of protein targets exhibited UV-dependent enrichment. FIG. 2O exemplifies the number of protein targets enriched by corresponding FFF probes tested at 20 and 200 μM. (FIG. 2P) Heatmap of enriched proteins in FFF probe-versus-control 1 experiments using 20 μM FFF in HEK293T cells. FIG. 2Q exemplifies histogram of HEK293T cell-derived iBAQ values as estimates of the abundance distribution for protein targets of FFF probes. FIG. 2R exemplifies box-and-whisker plot of iBAQ values for FFF protein targets plotted versus the number of FFF probes that enriched each protein (□=Spearman's correlation coefficient). FIG. 2S exemplifies histogram showing the number of FFF probe hits per protein target; a median value of three probes were found per protein. FIG. 2T exemplifies confirmation of FFF probe interaction profiles for representative protein targets. Proteins were recombinantly expressed as FLAG-tagged forms in HEK293T cells, followed by treatment with the indicated FFF probes (20 μM), photocrosslinking and lysis, SDS-PAGE, and in-gel fluorescence scanning. FIG. 2U exemplifies that for proteins shown in FIG. 2T, extracted MS1 chromatograms and corresponding SILAC ratios of representative tryptic peptides quantified in the indicated probe-versus-probe experiments.



FIG. 3A-FIG. 3P exemplify types of proteins and sites on these proteins targeted by FFF probes. (FIG. 3A, FIG. 3B) Categorization of FFF probe targets based on presence or absence in DrugBank (FIG. 3A) and protein class distribution (FIG. 3B). FIG. 3C exemplifies the number of FFF probe-modified peptides per protein target. FIG. 3D represents the distribution of probe-modified peptides that overlap (or do not overlap) with residues in predicted binding pockets of proteins with structures available in the PDB (as determined by fpocket analysis). (FIG. 3E-FIG. 3G) Examples of probe labeling sites mapped onto protein structures. Tryptic peptides containing probe-labeled sites are shown in green, and residues that overlap with predicted binding pockets are shown in beige. FIG. 3E exemplifies that FFF 13-modified peptide (aa 197-215) in human YWHAE (gray, PDB 3UBW) overlaps with the binding cleft that interacts with myeloid leukemia factor 1 (MLF1-derived peptide shown in yellow). This pocket is also the target of fragment (3S)-pyrrolindin-3-ol shown in purple. FIG. 3E discloses SEQ ID NO: 918. FIG. 3F exemplifies that FFF 13-modified peptide (aa 66-79) in human BAX (gray, PDB 4ZIE) complexed with BH3 peptide of BIM (cyan). FIG. 3F discloses SEQ ID NO: 919. FIG. 3G exemplifies the ribbon structure of human CTSB (gray, PDB 1GMY) highlighting FFF 9-modified peptide (aa 315-332) that is competed when HEK293T cells are co-treated with 9 (200 μM) and CTSB inhibitor Z-FA-FMK. Represented in yellow is the catalytic cysteine C108 (red) bound to Z-FA-FMK. FIG. 3G discloses SEQ ID NO: 920. FIG. 3H exemplies that fraction of FFF probe targets with (membrane) or without (soluble) known/predicted transmembrane domains. FIG. 3I exemplifies the breakdown of soluble and membrane proteins, and corresponding probe-modified peptides from these proteins, with available crystal structures. FIG. 3J exemplifies the distribution of peptides labeled by one or more FFF probes. FIG. 3K exemplifies the distribution of probe-modified peptides based on overlap of their amino acid sequence with predicted binding pocket residues determined by fpocket analysis. FIG. 3L exemplifies the fraction of proteins with multiple probe-modified peptides that correspond to shared or distinct binding pockets. FIG. 3M exemplifies for proteins with annotated functional sites, distances of functional sites from probe-modified peptides. Functional sites include annotated enzyme catalytic residues (active sites), substrate binding sites, and metal-binding sites. FIG. 3N exemplifies the functional class distribution for proteins with FFF-modified peptides and subdivided based on availability of crystal structures for these proteins. FIG. 3O exemplifies FFF 9-modified peptides (green/tan, where tan further designates residues that overlap with those predicted to be part of binding pockets as determined by fpocket) in the structure of human GLA (gray, PDB 3S5Z). Peptides aa 50-68 and aa 241-253 are found near the active site (purple, with substrate alpha D-galactose depicted in yellow) and a secondary ligand binding site (with the beta D-galactose ligand depicted in yellow), respectively. FIG. 3O discloses SEQ ID NOS 921-922, respectively, in order of appearance. FIG. 3P exemplifies overlap of protein targets of FFF probes with protein targets of cysteine-reactive fragments.



FIG. 4A-FIG. 4M exemplify ligand discovery by competitive profiling of elaborated fragment-based compounds. FIG. 4A exemplifies a schematic for competitive profiling experiments. Isotopically heavy and light amino acid-labeled cells are treated with DMSO or elaborated fragment competitor, respectively, and the corresponding FFF probe for 30 min, followed by UV light exposure, cell lysis, CuAAC conjugation to biotin azide, streptavidin enrichment of probe-labeled proteins, tryptic digestion, and quantitative MS analysis of tryptic peptides. Competed targets are defined as those showing >3-fold reductions in FFF probe labeling in the presence of competitor compound. FIG. 4B exemplifies structure of fragment cores (upper) with representative elaborated competitors (lower, where core fragments are depicted). (FIG. 4C, FIG. 4D) Heatmap of (FIG. 4C) and number of competitor compounds per (FIG. 4D) competed protein targets in experiments using 20 μM FFF and 160 μM competitor. FIG. 4E exemplifies categorization of competed targets based on presence or absence in DrugBank for experiments using either 20 μM FFF probes (+160 μM competitors) or 200 μM FFF probes (+200 μM competitors). Targets competed in both 20 and 200 μM data sets were excluded from the 200 μM groups for the pie chart analysis. FIG. 4F exemplifies the protein functional class distribution for competed targets compared to all FFF probe targets. (FIG. 4G, FIG. 4H) Representative SILAC ratio plots for competitive profiling experiments with FFF probes 8 (FIG. 4G) and 3 (FIG. 4H) (20 μM) and 8× competitors 20 and 21, respectively. PTGR2 (FIG. 4G) and SLC25A20 (FIG. 4H) were identified as the top competed targets for 20 and 21, respectively. Dotted lines indicate a three-fold ratio change threshold for designating competed targets. (FIG. 4I-FIG. 4K) Structures of elaborated fragment competitors with corresponding FFF probe used in competitive profiling experiments. Core fragment structure within each competitor compound is highlighted. FIG. 4L exemplifies the number of competed protein targets per competitor tested in HEK293T cells at 160 μM with 20 μM FFF probe. FIG. 4M exemplifies the total number of competed protein targets for five representative competitors (160-200 μM) evaluated in experiments with high (200 μM) or low (20 μM) concentrations of FFF probes.



FIG. 5A-FIG. 5S exemplify fragment-derived ligands disrupt function of PTGR2 and SLC25A20 in human cells. FIG. 5A exemplifies structure of hPTGR2 (PDB 2ZB4, gray) highlighting FFF 8-modified tryptic peptides (aa 55-66, green; and aa 261-278, pink) near the active site (15-keto-PGE2 in yellow, NADP+ in blue) of PTGR2. Probe labeling (200 μM) of both tryptic peptides was blocked by 20 (200 μM), as shown with representative MS1 plots for each peptide. FIG. 5A discloses SEQ ID NOS 923-924, respectively, in order of appearance. FIG. 5B exemplifies PTGR2 ligands 22 and 20 but not inactive control 23, inhibited 15-keto prostaglandin E2 (15-keto-PGE2) reductase activity of recombinant PTGR2. Data represent average values±SD; n=3 per group. FIG. 5C exemplifies structures (top) and activities (bottom gels) of initial PTGR2 ligand 20, optimized ligand 22, and inactive analog 23. Gels show concentration-dependent competitor blockade of FFF 8 labeling of recombinantly expressed FLAG-tagged PTGR2 in HEK29T cells. FIG. 5D exemplifies compound 22, but not inactive control 23, increased 15-keto-PGE2-dependent PPARγ transcriptional activity in PTGR2-transfected HEK293T cells. Data represent average values±SD; ####p<0.0001 for 15k-PGE2-treated PTGR2-transfected cells versus empty vector group, ****p<0.0001 for compound- versus DMSO-treated groups; n=3 per group. FIG. 5E exemplifies structures (top) and activities (bottom gels) of SLC25A20 ligand 21 and inactive analog 24. Gel shows concentration-dependent competitor blockade of FFF 3 labeling (20 μM) of recombinantly expressed FLAG-tagged SLC25A20 in HEK29T cells. (FIG. 5F, FIG. 5G) Compound 21, but not 24, increases long-chain (>C14) acylcarnitine content (FIG. 5F) and reduces maximal exogenous fatty acid oxidation (FIG. 5G) of HSC-5 cells. Data represent average values±SD; **p<0.01 and ****p<0.0001 for compound- versus DMSO-treated groups; n=3-5 per group. FIG. 5H exemplifies expanded screen of competitor compounds by monitoring reductions in FFF probe labeling of recombinantly expressed, FLAG-tagged human PTGR2 and SLC25A20 in HEK293T cells. FIG. 5I exemplifies competition gel profiles for competitor compounds corresponding to fragment elements from FFF probes 8 (competitor 49 for PTGR2) and 3 (competitor 50 for SLC25A20). FIG. 5J exemplifies optimization of PTGR2 inhibitors. Upper images show structures of analogs of lead inhibitor 20 that were synthesized and tested. Lower image shows competition gel profiles for these analogs with human PTGR2 expressed in HEK293T cells. FIG. 5K exemplifies extracted MS1 chromatograms and corresponding SILAC ratios for representative tryptic peptides of PTGR2 from competition experiments with the indicated compounds, in which isotopically light and heavy amino acid-labeled HEK293T cells were treated with FFF probe 8 (20 μM) and, respectively, DMSO (red) or competitor compound (blue) at the indicated concentrations. (FIG. 5L, FIG. 5M) Competition SILAC plots for optimized PTGR2 inhibitor 22 (60 μM, FIG. 5L) and inactive control 23 (160 μM, FIG. 5M) tested with FFF probe 8 (20 μM). FIG. 5N exemplifies PTGR2 ligands 20 and 22 do not directly induce PPARγ transcriptional activity in HEK293T cells co-transfected with a GAL4-PPARγ luciferase reporter and an empty control vector. FIG. 5O exemplifies fitted full dose-response of data exemplified in FIG. 5D. FIG. 5P exemplify fitted IC50 curve for the concentration-dependent blockade of 3 (20 μM) labeling of SLC25A20 expressed in HEK293T cells by 21 with representative competition gel shown below. Data represent average values±SD; n=3 per group. FIG. 5Q exemplify extracted MS1 chromatograms and corresponding SILAC ratios for representative tryptic peptides of SLC25A20 from competition experiments with the indicated compounds at the indicated concentrations. FIG. 5R exemplify competition SILAC plots for inactive control 24 (160 μM) tested with FFF probe 3 (20 μM). FIG. 5S exemplify oxygen consumption rate (OCR) of HSC5 cells pre-treated for 40 min with 21 or 24 and then provided with exogenous palmitate. A concentration-dependent inhibition of basal and maximal respiration was observed for 21, but not 24. Data represent average values±SD; n=5 per group. Oligomycin is an inhibitor of ATP synthase; FCCP=carbonyl cyanide-4-(trifluoromethoxy)phenylhydrazone is an ionophore uncoupling reagent that collapses mitochondrial membrane potential, allowing maximal respiration; RAA=rotenone and antimycin A are complex I and complex III inhibitors that block mitochondrial respiration, enabling the calculation of non-mitochondrial respiration.



FIG. 6A-J illustrates additional small molecule ligands substituents disclosed herein.





DETAILED DESCRIPTION OF THE DISCLOSURE

Chemical probes can be discovered through multiple routes that can involve, for example, high-throughput screening (HTS) of individual proteins (target-based) or more complex cell and organismal systems (e.g., phenotype-based systems). In some instances, high-throughput screening, whether it is target- or phenotype-based, uses large chemical libraries (˜106) composed of relatively high MW (300-500 Da) and structurally diverse compounds. In some cases, hit compounds from these libraries prove difficult to optimize due to their size, structural complexity, and suboptimal ligand efficiency. Target-based screens are furthermore generally performed with purified proteins and therefore do not provide direct information about the activity of ligands in more complex biological systems (e.g., cells), where factors that regulate protein structure and function, such as subcellular localization, post-translational modification, and protein-protein interactions can affect ligand-protein interactions. Alternatively, phenotype-based screening, for example, faces the challenge of identifying the molecular target(s) of active compounds, in particular, in cases where the screening hits display moderate-low potency.


Fragment-based ligand and drug discovery (FBLD) is an approach that utilizes smaller numbers (˜103) of low molecular weight compounds (<300 Da), and typically screened at high concentrations (>100 M). In some instances, FBLD emphasizes the identification of structurally simple hit compounds that are then optimized into more potent ligands. In some cases, a tenet of FBLD is that, by limiting molecular size, a relatively small number of fragments can represent a large fraction of accessible chemical space.


In some embodiments, described herein is another method of identifying small molecule ligands for interaction with target proteins of interest. In some instances, this method allows for mapping of small molecule ligands for interaction with a target protein under native conditions, thereby allowing for accurate mapping of interaction with potential small molecule ligands. In some instances, the method allows for identification of novel proteins as druggable targets as the method eliminates the need of recombinant expression and purification.


In additional embodiments, described herein include small molecule ligands, compositions, cells and assays related to the method of identifying small molecule ligands for interaction with target proteins of interest.


Small Molecule Ligands


In some embodiments, disclosed herein are small molecule ligands in which each of the small molecule ligand comprises a photoreactive diazirine group and an alkyne group. In some instances, the alkyne group is a terminal alkyne group. In some instances, the small molecule ligand further comprises a small molecule fragment. In some embodiments, the small molecule fragments described herein comprise non-naturally occurring molecules. In some instances, the non-naturally occurring molecules do not include natural and/or non-natural peptide fragments, or small molecules that are produced naturally within the body of a mammal.


In some embodiments, a small molecule fragment described herein comprises a molecule weight of about 100 Dalton or higher. In some embodiments, the small molecule fragment comprises a molecule weight of about 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000 Dalton, or higher. In some instances, the molecule weight of the small molecule fragment is between about 150 and about 500, about 150 and about 450, about 150 and about 440, about 150 and about 430, about 150 and about 400, about 150 and about 350, about 150 and about 300, about 150 and about 250, about 170 and about 500, about 180 and about 450, about 190 and about 400, about 200 and about 350, about 130 and about 300, or about 120 and about 250 Dalton.


In some embodiments, the molecule weight of a small molecule fragment described herein is calculated based on the molecule weight of carbon and hydrogen atoms and optionally further based on nitrogen, oxygen and/or sulfur atoms of the small molecule fragment. In some cases, the molecule weight of the small molecule fragment is calculated without the molecular weight of one or more elements selected from a halogen, a nonmetal, a transition metal, or a combination thereof.


In some embodiments, a small molecule fragment described herein comprises micromolar or millimolar binding affinity. In some instances, the small molecule fragment comprises a binding affinity of about 100 nM, 200 nM, 300 nM, 400 nM, 500 nM, 1 μM, 10 μM, 100 μM, 500 μM, 1 mM, 10 mM, or higher.


In some embodiments, a small molecule fragment described herein has a high ligand efficiency (LE). Ligand efficiency is the measurement of the binding energy per atom of a ligand to its binding partner. In some instances, the ligand efficiency is defined as the ratio of the Gibbs free energy (ΔG) to the number of non-hydrogen atoms of the compound (N):

LE=(ΔG)/N.


In some cases, LE is also arranged as:

LE=1.4(−log IC50)/N.


In some instances, the LE score is about 0.3 kcal mol−1HA−1, about 0.35 kcal mol−1HA−1, about 0.4 kcal mol−1HA−1, or higher.


In some embodiments, a small molecule fragment described herein is designed based on the Rule of 3. In some embodiments, the Rule of 3 comprises a non-polar solvent-polar solvent (e.g. octanol-water) partition coefficient log P of about 3 or less, a molecular mass of about 300 Daltons or less, about 3 hydrogen bond donors or less, about 3 hydrogen bond acceptors or less, and about 3 rotatable bonds or less.


In some embodiments, a small molecule fragment described herein comprises three cyclic rings or less.


In some embodiments, a small molecule fragment described herein binds to a binding site of a protein in which the protein is about 20 amino acid residues in length or more. In some instances, the small molecule fragment described herein binds to a binding site of a protein in which the protein is about 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 150, 200, 250, 300, 350, 400, 450, 500, 600, 700, 800, 900, 1000 amino acid residues in length or more.


In some embodiments, a small molecule fragment described herein is obtained from a compound library. In some cases, the compound library comprises ChemBridge fragment library, Pyramid Platform Fragment-Based Drug Discovery, Maybridge fragment library, FRGx from AnalytiCon, TCI-Frag from AnCoreX, Bio Building Blocks from ASINEX, BioFocus 3D from Charles River, Fragments of Life (FOL) from Emerald Bio, Enamine Fragment Library, IOTA Diverse 1500, BIONET fragments library, Life Chemicals Fragments Collection, OTAVA fragment library, Prestwick fragment library, Selcia fragment library, TimTec fragment-based library, Allium from Vitas-M Laboratory, or Zenobia fragment library.


In some embodiments, a small molecule fragment comprises a structure illustrated in FIG. 1B, in which each fragment nomenclature (or probe nomenclature) is illustrated by a numerical number. For example, the small molecule fragment




embedded image



is assigned as probe 1.


In some embodiments, a small molecule ligand described herein has a structure represented by Formula (I):




embedded image



wherein R is selected from the groups provided below:




embedded image


embedded image



Protein Targets


In some embodiments, a protein target described herein is a soluble protein or a membrane protein. In some cases, a protein target described herein is involved in one or more of a biological process such as protein transport, lipid metabolism, apoptosis, transcription, electron transport, mRNA processing, or host-virus interaction. In some instances, the protein target is associated with one or more of diseases such as cancer or one or more disorders or conditions such as immune, metabolic, developmental, reproductive, neurological, psychiatric, renal, cardiovascular, or hematological disorders or conditions.


In some embodiments, the protein target comprises one or more functions of an enzyme, a transporter, a receptor, a channel protein, an adaptor protein, a chaperone, a signaling protein, a plasma protein, transcription related protein, translation related protein, mitochondrial protein, or cytoskeleton related protein. In some embodiments, the protein target is an enzyme, a transporter, a receptor, a channel protein, an adaptor protein, a chaperone, a signaling protein, a plasma protein, transcription related protein, translation related protein, mitochondrial protein, or cytoskeleton related protein. In some instances, the protein target has an uncategorized function.


In some embodiments, the protein target is an enzyme. An enzyme is a protein molecule that accelerates or catalyzes chemical reaction. In some embodiments, non-limiting examples of enzymes include kinases, proteases, or deubiquitinating enzymes.


In some instances, exemplary kinases include tyrosine kinases such as the TEC family of kinases such as Tec, Bruton's tyrosine kinase (Btk), interleukin-2-indicible T-cell kinase (Itk) (or Emt/Tsk), Bmx, and Txk/Rlk; spleen tyrosine kinase (Syk) family such as SYK and Zeta-chain-associated protein kinase 70 (ZAP-70); Src kinases such as Src, Yes, Fyn, Fgr, Lck, Hck, Blk, Lyn, and Frk; JAK kinases such as Janus kinase 1 (JAK1), Janus kinase 2 (JAK2), Janus kinase 3 (JAK3), and Tyrosine kinase 2 (TYK2); or ErbB family of kinases such as Her1 (EGFR, ErbB1), Her2 (Neu, ErbB2), Her3 (ErbB3), and Her4 (ErbB4).


In some embodiments, the protein target is a protease. In some embodiments, the protease is a caspase. In some instances, the caspase is an initiator (apical) caspase. In some instances, the caspase is an effector (executioner) caspase. Exemplary caspase includes CASP2, CASP8, CASP9, CASP10, CASP3, CASP6, CASP7, CASP4, and CASP5. In some instances, the cysteine protease is a cathepsin. Exemplary cathepsin includes Cathepsin B, Cathepsin C, Cathepsin F, Cathepsin H, Cathepsin K, Cathepsin L1, Cathepsin L2, Cathepsin O, Cathepsin S, Cathepsin W, or Cathepsin Z.


In some embodiments, the protein target is a deubiquitinating enzyme (DUB). In some embodiments, exemplary deubiquitinating enzymes include cysteine proteases DUBs or metalloproteases. Exemplary cysteine protease DUBs include ubiquitin-specific protease (USP/UBP) such as USP1, USP2, USP3, USP4, USP5, USP6, USP7, USP8, USP9X, USP9Y, USP10, USP11, USP12, USP13, USP14, USP15, USP16, USP17, USP17L2, USP17L3, USP17L4, USP17L5, USP17L7, USP17L8, USP18, USP19, USP20, USP21, USP22, USP23, USP24, USP25, USP26, USP27X, USP28, USP29, USP30, USP31, USP32, USP33, USP34, USP35, USP36, USP37, USP38, USP39, USP40, USP41, USP42, USP43, USP44, USP45, or USP46; ovarian tumor (OTU) proteases such as OTUB1 and OTUB2; Machado-Josephin domain (MJD) proteases such as ATXN3 and ATXN3L; and ubiquitin C-terminal hydrolase (UCH) proteases such as BAP1, UCHL1, UCHL3, and UCHL5. Exemplary metalloproteases include the Jab1/Mov34/Mpr1 Pad1 N-terminal+ (MPN+) (JAMM) domain proteases.


In some embodiments, exemplary proteins as enzymes include, but are not limited to, abhydrolase domain-containing protein 10, mitochondrial (ABHD10); aconitate hydratase, mitochondrial (ACO2); low molecular weight phosphotyrosine protein phosphatase (ACP1); chaperone activity of beI complex-like, mitochondrial (ADCK3); adenosine kinase (ADK); adenylosuccinate synthetase isozyme 2 (ADSS); acylglycerol kinase, mitochondrial (AGK); alkyldihydroxyacetonephosphate synthase, peroxisomal (AGPS); apoptosis-inducing factor 1, mitochondrial (AIFM1); Delta-1-pyrroline-5-carboxylate synthase (ALDH18A1); mitochondrial 10-formyltetrahydrofolate dehydrogen (ALDHIL2); alpha-aminoadipic semialdehyde dehydrogenase (ALDH7A1); ATPase ASNAI (ASNAI); ATPase family AAA domain-containing protein 3A (ATAD3A); bifunctional purine biosynthesis protein PURH (ATIC); bleomycin hydrolase (BLMH); calpain-1 catalytic subunit (CAPN1); creatine kinase B-type (CKB); caseinolytic peptidase B protein homolog (CLPB); putative ATP-dependent Clp protease proteolytic subunit (CLPP); carnitine O-palmitoyltransferase 2, mitochondrial (CPT2); probable serine carboxypeptidase CPVL (CPVL); cathepsin B (CTSB); cathepsin D (CTSD); NADH-cytochrome b5 reductase 3 (CYB5R3); cytochrome P450 20A1 (CYP20A1); 2,4-dienoyl-CoA reductase, mitochondrial (DECR1); delta(24)-sterol reductase (DHCR24); dihydrolipoyl dehydrogenase, mitochondrial (DLD); deoxyribonuclease-2-alpha (DNASE2); endothelin-converting enzyme 1 (ECEI); Delta(3,5)-Delta(2,4)-dienoyl-CoA isomerase, mitochondrial (ECH1); eukaryotic translation initiation factor 3 subunit (EIF3F); elongation of very long chain fatty acids protein (ELOVL2); exonuclease 1 (EXO1); phenylalanine—tRNA ligase beta subunit (FARSB); fatty acid synthase (FASN); squalene synthase (FDFT1); ferrochelatase, mitochondrial (FECH); alpha-galactosidase A (GLA); beta-galactosidase (GLB1); lactoylglutathione lyase (GLO1); glutamate dehydrogenase 1, mitochondrial (GLUD1); hydroxyacyl-coenzyme A dehydrogenase, mitochondrial (HADH); trifunctional enzyme subunit alpha, mitochondrial (HADHA); histidine—tRNA ligase, cytoplasmic (HARS); minor histocompatibility antigen H13 (HM13); heme oxygenase 2 (HMOX2); estradiol 17-beta-dehydrogenase 12 (HSD17B12); peroxisomal multifunctional enzyme type 2 (HSD17B4); insulin-degrading enzyme (IDE); isocitrate dehydrogenase (IDH2); gamma-interferon-inducible lysosomal thiol reductase (IFI30); inosine-5-monophosphate dehydrogenase 2 (IMPDH2); leucine—tRNA ligase, cytoplasmic (LARS); L-lactate dehydrogenase A chain (LDHA); L-lactate dehydrogenase B chain (LDHB); legumain (LGMN); lysosomal acid lipase/cholesteryl ester hydrolase (LIPA); methyltransferase-like protein 7A (METTL7A); NADH-ubiquinone oxidoreductase chain 2 (MT-ND2); monofunctional C1-tetrahydrofolate synthase, mitochondrial (MTHFDIL); alpha-N-acetylglucosaminidase (NAGLU); peroxisomal NADH pyrophosphatase NUDT12 (NUDT12); nucleoside diphosphate-linked moiety X motif 19, mitochondrial (NUDT19); ornithine aminotransferase, mitochondrial (OAT); phosphoenolpyruvate carboxykinase (PCK2); protein-L-isoaspartate(D-aspartate)O-methyltransferase (PCMT1); prenylcysteine oxidase 1 (PCYOX1); presequence protease, mitochondrial (PITRM1); pyruvate kinase isozymes M1/M2 (PKM); peroxiredoxin-2 (PRDX2); DNA-dependent protein kinase catalytic subunit (PRKDC); proteasome subunit alpha type-2 (PSMA2); dolichyl-diphosphooligosaccharide—protein glycosyltransferase subnit 1 (RPN1); RuvB-like 1 (RUVBL1); thimet oligopeptidase (THOP1); or tripeptidyl-peptidase 1 (TPP1).


In some embodiments, the protein target is a transcription factor or regulator. Exemplary protein targets as transcription factors and regulators include, but are not limited to, actin-like protein 6A (ACTL6A); putative adenosylhomocysteinase 2 (AHCYL1); acidic leucine-rich nuclear phosphoprotein 32 family member A (ANP32A); complement component 1 Q subcomponent-binding protein (C1QBP); probable ATP-dependent RNA helicase DDX17 (DDX17); probable ATP-dependent RNA helicase DHX36 (DHX36); elongation factor 1-alpha 1 (EEF1A1); eukaryotic initiation factor 4A-I (EIF4A1); electron transfer flavoprotein subunit beta (ETFB); far upstream element-binding protein 1 (FUBP1); histone H1.2 (HIST1H1C); heterogeneous nuclear ribonucleoprotein K (HNRNPK); interleukin enhancer-binding factor 2 (ILF2); DNA replication licensing factor MCM2 (MCM2); DNA replication licensing factor MCM4 (MCM4); N-alpha-acetyltransferase 15, NatA auxiliary subunit (NAA15); non-POU domain-containing octamer-binding protein (NONO); nucleobindin-1 (UCB1); polyadenylate-binding protein 1 (PABPC1); paraspeckle component 1 (PSPC1); RNA-binding protein 14 (RBM14); putative RNA-binding protein 3 (RBM3); RNA-binding motif protein, X chromosome (RBMX); 40S ribosomal protein S3 (RPS3); X-ray repair cross-complementing protein 6 (XRCC6); nuclease-sensitive element-binding protein 1 (YBX1); prostaglandin reductase 2 (PTGR2); zinc binding alcohol dehydrogenase domain containing 2 (ZADH2); or lysophosphatidylcholine acetyltransferase 3 (LPCAT3).


In some embodiments, the protein target is a channel, transporter or receptor. Exemplary protein targets as channels, transporters, or receptors include, but are not limited to, alpha-actinin-4 (ACTN4); AP-1 complex subunit beta-1 (AP1B1); ADP-ribosylation factor 1 (ARF1); ADP-ribosylation factor 3 (ARF3); ADP-ribosylation factor 4 (ARF4); ADP-ribosylation factor 5 (ARF5); sodium/potassium-transporting ATPase subunit alpha (ATP1A1); sarcoplasmic/endoplasmic reticulum calcium ATPase (ATP2A2); plasma membrane calcium-transporting ATPase 1 (ATP2B1); plasma membrane calcium-transporting ATPase 4 (ATP2B4); ATP synthase subunit alpha, mitochondrial (ATP5A1); coatomer subunit beta (COPB1); exportin-2 (CSE1L); Electron transfer flavoprotein subunit beta (ETFB); heterogeneous nuclear ribonucleoprotein A1 (HNRNPA1); heterogeneous nuclear ribonucleoprotein A1-like 2 (HNRNPA1L2); importin-4 (IPO4); cytochrome c oxidase subunit 2 (MT-C02); nuclear autoantigenic sperm protein (NASP); nucleoporin Nup37 (NUP37); nuclear pore complex protein Nup93 (NUP93); nuclear transport factor 2 (NUTF2); membrane-associated progesterone receptor component (PGRMC2); prohibitin-2 (PHB2); protein quaking (QKI); sideroflexin-1 (SFXN1); ADP/ATP translocase 3 (SLC25A6); mitochondrial carnitine/acylcarnitine carrier protein (SLC25A20) or voltage-dependent anion-selective channel protein (VDAC3).


In some embodiments, the protein target is a chaperone. Exemplary protein targets as chaperones include, but are not limited to, acidic leucine-rich nuclear phosphoprotein 32 family member B (ANP32B); large proline-rich protein BAG6 (BAG6); T-complex protein 1 subunit beta (CCT2); peptidyl-prolyl cis-trans isomerase FKBP4 (FKBP4); heat shock protein HSP 90-beta (HSP90AB1); endoplasmin (HSP90B1); LDLR chaperone MESD (MESDC2); nucleophosmin (NPM1); or protein SET (SET).


In some embodiments, the protein target is an adapter, scaffolding or modulator protein. Exemplary protein targets as adapter, scaffolding, or modulator proteins include, but are not limited to, actin, alpha skeletal muscle (ACTA1); actin, cytoplasmic 1 (ACTB); cytoskeleton-associated protein 4 (CKAP4); cytochrome c oxidase subunit 5A, mitochondrial (COX5A); catenin beta-1 (CTNNB1); FGFR1 oncogene partner (FGFR1OP); HAUS augmin-like complex subunit 2 (HAUS2); hemoglobin subunit alpha (HBA2); kinesin-like protein KIF11 (KIF11); myosin-10 (MYH10); myosin-9 (MYH9); phosphatidylinositol transfer protein beta isoform (PITPNB); proactivator polypeptide (PSAP); endophilin-B1 (SH3GLB1); stomatin-like protein 2 (STOML2); tubulin beta-4B chain (TUBB4B); or tubulin beta-6 chain (TUBB6).


In some embodiments, a protein target comprises a protein illustrated in Tables 1-4. In some instances, a protein target comprises a protein illustrated in Table 1. In some embodiments, the protein target comprises a binding site denoted in Table 1. In some instances, a protein target comprises a protein illustrated in Table 2. In some embodiments, the protein target comprises a binding site denoted in Table 2. In some instances, a protein target comprises a protein illustrated in Table 3. In some embodiments, the protein target comprises a binding site denoted in Table 3. In some instances, a protein target comprises a protein illustrated in Table 4.


Methods of Use


In some embodiments, disclosed herein include a method of identifying a protein that is capable of interacting with a small molecule ligand. In some instances, the method comprises (a) providing a cell sample; (b) exposing the cell sample to a plurality of potential small molecule ligands having a structure comprising at least a photoreactive diazirine group and a terminal alkyne group; (c) irradiating the cell sample with UV light; (d) performing lysis on the cell sample; (e) subjecting proteins in the post lysis material to fluorophore tagging (e.g., rhodamine, fluorescein, and the like); and (f) isolating at least one fluorophore-tagged protein. In other instances, the method comprises (a) providing a cell sample; (b) exposing the cell sample to the small molecule ligand having a structure comprising at least a photoreactive diazirine group, and a terminal alkyne group; (c) irradiating the cell sample with UV light; (d) performing lysis on the cell sample; (e) subjecting the proteins in the post lysis material to tagging; and (f) isolating the tagged proteins for analysis to identify a protein capable of interacting with the small molecule ligand.


In some cases, the small molecule ligand has a structure represented by Formula (I):




embedded image



wherein R is selected from the groups provided below:




embedded image


embedded image


In some cases, the small molecule ligand has a structure represented by Formula (Ib):




embedded image



wherein R is an amide substituent bonded to the NH group of the amines provided in FIGS. 6A-J.


In some cases, the small molecule ligand has a structure represented by Formula (II):




embedded image



wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring.


In some cases, the small molecule ligand has a structure represented by Formula (III):




embedded image



wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring.


In some cases, the small molecule ligand has a structure represented by Formula (III):




embedded image



wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring.


In some cases, the small molecule ligand has a structure represented by Formula (IV):




embedded image



wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring.


In some cases, the small molecule ligand has a structure represented by Formula (V):




embedded image



wherein R1 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl.


In some cases, the small molecule ligand has a structure represented by Formula (VI):




embedded image



wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring.


In some cases, the tagging further comprises i) attaching the small molecule ligand-protein complex to a biotin moiety and ii) interacting the biotin moiety with a streptavidin-coupled bead.


In some instances, the analysis comprises a proteomic analysis.


In some instances, a cell from the cell sample is a mammalian cell. In some cases, a cell from the cell sample is obtained from HEK293T, K562, or HSC-5 cell lines. In some cases, a cell from the cell sample is a tumor cell.


In some cases, the method is an in situ method. In other cases, the method is an in vitro method.


In some embodiments, also disclosed herein include a method of identifying a small molecule ligand binding site on an isolated protein. In some cases, the method comprises (a) providing an isolated protein; (b) exposing the protein to a plurality of potential small molecule ligands having a structure comprising at least a photoreactive diazirine group and a terminal alkyne group; (c) irradiating the protein with UV light; (d) tagging the protein with biotin; (e) binding the biotin-tagged protein to solid phase beads; (f) digesting the protein to provide protein fragments; and (g) analyzing the protein fragments to determine the small molecule ligand binding site.


In some instances, the isolated protein is selected from Tables 1-3. In some cases, the isolated protein is selected from Table 1. In some cases, the isolated protein is selected from Table 2. In some cases, the isolated protein is selected from Table 3. In some cases, the isolated protein is a recombinant protein.


In some cases, the small molecule ligand has a structure represented by Formula (I):




embedded image



wherein R is selected from the groups provided below:




embedded image


embedded image


In some cases, the small molecule ligand has a structure represented by Formula (Ib):




embedded image



wherein R is an amide substituent bonded to the NH group of the amines provided in FIGS. 6A-J.


In some cases, the small molecule ligand has a structure represented by Formula (II):




embedded image



wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring.


In some cases, the small molecule ligand has a structure represented by Formula (III):




embedded image



wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring.


In some cases, the small molecule ligand has a structure represented by Formula (III):




embedded image



wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring.


In some cases, the small molecule ligand has a structure represented by Formula (IV):




embedded image



wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring.


In some cases, the small molecule ligand has a structure represented by Formula (V):




embedded image



wherein R1 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl.


In some cases, the small molecule ligand has a structure represented by Formula (VI):




embedded image



wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring.


In some instances, the analyzing comprises a proteomic analysis.


In some embodiments, tagging comprises labeling the protein with a labeling group for use in further analysis of the protein. In some instances, the labeling group comprises a fluorophore. In some instances, a fluorophore comprises rhodamine, rhodol, fluorescein, thiofluorescein, aminofluorescein, carboxyfluorescein, chlorofluorescein, methylfluorescein, sulfofluorescein, aminorhodol, carboxyrhodol, chlororhodol, methylrhodol, sulforhodol, aminorhodamine, carboxyrhodamine, chlororhodamine, methylrhodamine, sulforhodamine, thiorhodamine, cyanine, indocarbocyanine, oxacarbocyanine, thiacarbocyanine, merocyanine, cyanine 2, cyanine 3, cyanine 3.5, cyanine 5, cyanine 5.5, cyanine 7, oxadiazole derivatives, pyridyloxazole, nitrobenzoxadiazole, benzoxadiazole, pyren derivatives, cascade blue, oxazine derivatives, Nile red, Nile blue, cresyl violet, oxazine 170, acridine derivatives, proflavin, acridine orange, acridine yellow, arylmethine derivatives, auramine, crystal violet, malachite green, tetrapyrrole derivatives, porphin, phtalocyanine, bilirubin 1-dimethylaminonaphthyl-5-sulfonate, 1-anilino-8-naphthalene sulfonate, 2-p-touidinyl-6-naphthalene sulfonate, 3-phenyl-7-isocyanatocoumarin, N-(p-(2-benzoxazolyl)phenyl)maleimide, stilbenes, pyrenes, 6-FAM (Fluorescein), 6-FAM (NHS Ester), 5(6)-FAM, 5-FAM, Fluorescein dT, 5-TAMRA-cadavarine, 2-aminoacridone, HEX, JOE (NHS Ester), MAX, TET, ROX, TAMRA, TARMA™ (NHS Ester), TEX 615, ATTO™ 488, ATTO™ 532, ATTO™ 550, ATTO™ 565, ATTO™ Rho101, ATTO™ 590, ATTO™ 633, ATTO™ 647N, TYE™ 563, TYE™ 665, or TYE™ 705.


In some embodiments, the labeling group comprises a biotin, a streptavidin, bead, resin, a solid support, or a combination thereof. As used herein, a biotin described herein comprises biotin and biotin derivatives. Exemplary biotin derivatives include, but are not limited by, desthiobiotin, biotin alkyne or biotin azide. In some instances, a biotin described herein is desthiobiotin. In some cases, a biotin described herein is d-Desthiobiotin.


In some instances, the labeling group comprising biotin further comprises a linker. In some cases, the linker is about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more residues in length. In some instances, the linker further comprises a cleavage site, such as a protease cleavage site (e.g., TEV cleavage site). In some cases, the biotin-linker moiety is further isotopically-labeled, for example, isotopically labeled with 13C and 15N atoms at one or more amino acid residue positions. In some cases, the biotin-linker moiety is a isotopically-labeled TEV-tag as described in Weerapana, et al., “Quantitative reactivity profiling predicts functional cysteines in proteomes,” Nature 468(7325): 790-795.


In some cases, the labeling group comprising biotin further interacts with a streptavidin moiety. In some instances, the labeling group comprising biotin is further attached to a bead, such as a streptavidin-coupled bead. In some instances, the labeling group comprising biotin is further attached to a resin or a solid support, such as a streptavidin-coupled resin or a streptavidin-coupled solid support. In some instances, the solid support is a plate, a platform, a cover slide, a microfluidic channel, and the like.


In some cases, the method is a high-throughput method.


In some embodiments, disclosed herein also include proteins and their respective binding sites identified for interaction with one or more small molecule ligands. In some instances, the binding sites are disclosed in Tables 1-3. In some cases, the binding sites are disclosed in Table 3.


In some embodiments, disclosed herein is a small molecule ligand which binds to the ACP1 protein, wherein the small molecule ligand binds to one or more of the following residues: VDSAATSGYEIGNPPDYR (SEQ ID NO: 1) of the ACP1 protein having the UniProtKB accession number P24666. In some instances, also disclosed herein is a small molecule ligand which binds to the ACP1 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: VDSAATSGYEIGNPPDYR (SEQ ID NO: 1) of the ACP1 protein having the UniProtKB accession number P24666. In some instances, the small molecule ligand is probe 13.


In some embodiments, disclosed herein is a small molecule ligand which binds to the ADCK3 protein, wherein the small molecule ligand binds to one or more of the following residues: LGQMLSIQDDAFINPHLAK (SEQ ID NO: 2) of the ADCK3 protein having the UniProtKB accession number Q8NI60. In some embodiments, also disclosed herein is a small molecule ligand which binds to the ADCK3 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: LGQMLSIQDDAFINPHLAK (SEQ ID NO: 2) of the ADCK3 protein having the UniProtKB accession number Q8NI60. In some instances, the small molecule ligand is probe 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the ADK protein, wherein the small molecule ligand binds to one or more of the following residues: IFTLNLSAPFISQFYK (SEQ ID NO: 3) of the ADK protein having the UniProtKB accession number P55263. In some embodiments, also disclosed herein is a small molecule ligand which binds to the ADK protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: IFTLNLSAPFISQFYK (SEQ ID NO: 3) of the ADK protein having the UniProtKB accession number P55263. In some instances, the small molecule ligand is probe 2.


In some embodiments, disclosed herein is a small molecule ligand which binds to the ADSS protein, wherein the small molecule ligand binds to one or more of the following residues: FIEDELQIPVK (SEQ ID NO: 4) of the ADSS protein having the UniProtKB accession number P30520. In some embodiments, also disclosed herein is a small molecule ligand which binds to the ADSS protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: FIEDELQIPVK (SEQ ID NO: 4) of the ADSS protein having the UniProtKB accession number P30520. In some instances, the small molecule ligand is probe 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the AIFM1 protein, wherein the small molecule ligand binds to one or more of the following residues: PYWHQSMFWSDLGPDVGYEAIGLVDSSLPTVGVFAK (SEQ ID NO: 5) of the AIFM1 protein having the UniProtKB accession number 095831. In some embodiments, also disclosed herein is a small molecule ligand which binds to the AIFM1 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: PYWHQSMFWSDLGPDVGYEAIGLVDSSLPTVGVFAK (SEQ ID NO: 5) of the AIFM1 protein having the UniProtKB accession number 095831. In some instances, the small molecule ligand is probe 2, 3, 4 or 6.


In some embodiments, disclosed herein is a small molecule ligand which binds to the ALDH7A1 protein, wherein the small molecule ligand binds to one or more of the following residues: ILVEGVGEVQEYVDICDYAVGLSR (SEQ ID NO: 6) of the ALDH7A1 protein having the UniProtKB accession number P49419. In some embodiments, also disclosed herein is a small molecule ligand which binds to the ALDH7A1 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: ILVEGVGEVQEYVDICDYAVGLSR (SEQ ID NO: 6) of the ALDH7A1 protein having the UniProtKB accession number P49419. In some instances, the small molecule ligand is probe 8 or 13.


In some embodiments, disclosed herein is a small molecule ligand which binds to a protein selected from ARF4 or ARF5, wherein the small molecule ligand binds to one or more of the following residues: LGEIVTTIPTIGFNVETVEYK (SEQ ID NO: 7), corresponding to LGEIVTTIPTIGFNVETVEYK (SEQ ID NO: 7) of the ARF4 protein having the UniProtKB accession number P18085. In some embodiments, also disclosed herein is a small molecule ligand which binds to a protein selected from ARF4 or ARF5, wherein the small molecule ligand binds a ligand binding site defined by the following residues: LGEIVTTIPTIGFNVETVEYK (SEQ ID NO: 7), corresponding to LGEIVTTIPTIGFNVETVEYK (SEQ ID NO: 7) of the ARF4 protein having the UniProtKB accession number P18085. In some instances, the small molecule ligand is probe 2, 3, 4, 8 or 13.


In some embodiments, disclosed herein is a small molecule ligand which binds to the ARL1 protein, wherein the small molecule ligand binds to one or more residues of a ligand binding site selected from: GTGLDEAMEWLVETLK (SEQ ID NO: 9) and LQVGEVVTTIPTIGFNVETVTYK (SEQ ID NO: 10) of the ARL1 protein having the UniProtKB accession number P40616. In some embodiments, also disclosed herein is a small molecule ligand which binds to the ARL1 protein, wherein the small molecule ligand binds a ligand binding site defined by: GTGLDEAMEWLVETLK (SEQ ID NO: 9) or LQVGEVVTTIPTIGFNVETVTYK (SEQ ID NO: 10) of the ARL1 protein having the UniProtKB accession number P40616. In some instances, the small molecule ligand is probe 13 or 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the ATIC protein, wherein the small molecule ligand binds to one or more of the following residues: AFTHTAQYDEAISDYFR (SEQ ID NO: 11) of the ATIC protein having the UniProtKB accession number P31939. In some embodiments, also disclosed herein is a small molecule ligand which binds to the ATIC protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: AFTHTAQYDEAISDYFR (SEQ ID NO: 11) of the ATIC protein having the UniProtKB accession number P31939. In some instances, the small molecule ligand is probe 13.


In some embodiments, disclosed herein is a small molecule ligand which binds to the BLMH protein, wherein the small molecule ligand binds to one or more residues of a ligand binding site selected from: CYFFLSAFVDTAQR (SEQ ID NO: 12) and GEISATQDVMMEEIFR (SEQ ID NO: 13) of the BLMH protein having the UniProtKB accession number Q13867. In some embodiments, also disclosed herein is a small molecule ligand which binds to the BLMH protein, wherein the small molecule ligand binds a ligand binding site defined by: CYFFLSAFVDTAQR (SEQ ID NO: 12) or GEISATQDVMMEEIFR (SEQ ID NO: 13) of the BLMH protein having the UniProtKB accession number Q13867. In some instances, the small molecule ligand is probe 13 or 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the CALR protein, wherein the small molecule ligand binds to one or more residues of a ligand binding site selected from: SGTIFDNFLITNDEAYAEEFGNETWGVTK (SEQ ID NO: 14) and HEQNIDCGGGYVK (SEQ ID NO: 15) of the CALR protein having the UniProtKB accession number P27797. In some embodiments, also disclosed herein is a small molecule ligand which binds to the CALR protein, wherein the small molecule ligand binds a ligand binding site defined by: SGTIFDNFLITNDEAYAEEFGNETWGVTK (SEQ ID NO: 14) or HEQNIDCGGGYVK (SEQ ID NO: 15) of the CALR protein having the UniProtKB accession number P27797. In some instances, the small molecule ligand is probe 6, 9, or 13.


In some embodiments, disclosed herein is a small molecule ligand which binds to the CAPN1 protein, wherein the small molecule ligand binds to one or more of the following residues: LVFVHSAEGNEFWSALLEK (SEQ ID NO: 16) of the CAPN1 protein having the UniProtKB accession number P07384. In some embodiments, also disclosed herein is a small molecule ligand which binds to the CAPN1 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: LVFVHSAEGNEFWSALLEK (SEQ ID NO: 16) of the CAPN1 protein having the UniProtKB accession number P07384. In some instances, the small molecule ligand is probe 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the CKB protein, wherein the small molecule ligand binds to one or more residues of a ligand binding site selected from: FPAEDEFPDLSAHNNHMAK (SEQ ID NO: 17), LAVEALSSLDGDLAGR (SEQ ID NO: 18), TFLVWVNEEDHLR (SEQ ID NO: 19), FCTGLTQIETLFK (SEQ ID NO: 20), LGFSEVELVQMVVDGVK (SEQ ID NO: 21) and LEQGQAIDDLMPAQK (SEQ ID NO: 22) of the CKB protein having the UniProtKB accession number P12277. In some embodiments, also disclosed herein is a small molecule ligand which binds to the CKB protein, wherein the small molecule ligand binds a ligand binding site defined by: FPAEDEFPDLSAHNNHMAK (SEQ ID NO: 17), LAVEALSSLDGDLAGR (SEQ ID NO: 18), TFLVWVNEEDHLR (SEQ ID NO: 19), FCTGLTQIETLFK (SEQ ID NO: 20), LGFSEVELVQMVVDGVK (SEQ ID NO: 21) or LEQGQAIDDLMPAQK (SEQ ID NO: 22) of the CKB protein having the UniProtKB accession number P12277. In some instances, the small molecule ligand is probe 3 or 13.


In some embodiments, disclosed herein is a small molecule ligand which binds to the CKMT1B protein, wherein the small molecule ligand binds to one or more of the following residues: SFLIWVNEEDHTR (SEQ ID NO: 23) of the CKMT1B protein having the UniProtKB accession number P12532. In some embodiments, disclosed herein is a small molecule ligand which binds to the CKMT1B protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: SFLIWVNEEDHTR (SEQ ID NO: 23) of the CKMT1B protein having the UniProtKB accession number P12532. In some instances, the small molecule ligand is probe 3.


In some embodiments, disclosed herein is a small molecule ligand which binds to the CLPP protein, wherein the small molecule ligand binds to one or more of the following residues: QSLQVIESAMER (SEQ ID NO: 24) of the CLPP protein having the UniProtKB accession number Q16740. In some embodiments, also disclosed herein is a small molecule ligand which binds to the CLPP protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: QSLQVIESAMER (SEQ ID NO: 24) of the CLPP protein having the UniProtKB accession number Q16740. In some instances, the small molecule ligand is probe 6.


In some embodiments, disclosed herein is a small molecule ligand which binds to the CSNK1A1 protein, wherein the small molecule ligand binds to one or more of the following residues: DYNVLVMDLLGPSLEDLFNFCSR (SEQ ID NO: 25) of the CSNK1A1 protein having the UniProtKB accession number P48729. In some embodiments, also disclosed herein is a small molecule ligand which binds to the CSNK1A1 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: DYNVLVMDLLGPSLEDLFNFCSR (SEQ ID NO: 25) of the CSNK1A1 protein having the UniProtKB accession number P48729. In some instances, the small molecule ligand is probe 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the CSNK2B protein, wherein the small molecule ligand binds to one or more of the following residues: VYCENQPMLPIGLSDIPGEAMVK (SEQ ID NO: 26) of the CSNK2B protein having the UniProtKB accession number P67870. In some embodiments, also disclosed herein is a small molecule ligand which binds to the CSNK2B protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: VYCENQPMLPIGLSDIPGEAMVK (SEQ ID NO: 26) of the CSNK2B protein having the UniProtKB accession number P67870. In some instances, the small molecule ligand is probe 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the CTSB protein, wherein the small molecule ligand binds to one or more of the following residues: GQDHCGIESEVVAGIPR (SEQ ID NO: 27) of the CTSB protein having the UniProtKB accession number P07858. In some embodiments, also disclosed herein is a small molecule ligand which binds to the CTSB protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: GQDHCGIESEVVAGIPR (SEQ ID NO: 27) of the CTSB protein having the UniProtKB accession number P07858. In some cases, the small molecule ligand is probe 2, 4, 9 or 13.


In some embodiments, disclosed herein is a small molecule ligand which binds to the CTSD protein, wherein the small molecule ligand binds to one or more residues of a ligand binding site selected from: DPDAQPGGELMLGGTDSK (SEQ ID NO: 28), EGCEAIVDTGTSLMVGPVDEVR (SEQ ID NO: 29) and AIGAVPLIQGEYMIPCEK (SEQ ID NO: 30) of the CTSD protein having the UniProtKB accession number P07339. In some embodiments, also disclosed herein is a small molecule ligand which binds to the CTSD protein, wherein the small molecule ligand binds a ligand binding site defined by: DPDAQPGGELMLGGTDSK (SEQ ID NO: 28), EGCEAIVDTGTSLMVGPVDEVR (SEQ ID NO: 29) or AIGAVPLIQGEYMIPCEK (SEQ ID NO: 30) of the CTSD protein having the UniProtKB accession number P07339. In some cases, the small molecule ligand is probe 2, 3, 4, 6, 8, 9, 13, 14 or 15.


In some embodiments, disclosed herein is a small molecule ligand which binds to the CYB5R3 protein, wherein the small molecule ligand binds to one or more of the following residues: LWYTLDR (SEQ ID NO: 31) of the CYB5R3 protein having the UniProtKB accession number P00387. In some embodiments, also disclosed herein is a small molecule ligand which binds to the CYB5R3 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: LWYTLDR (SEQ ID NO: 31) of the CYB5R3 protein having the UniProtKB accession number P00387. In some cases, the small molecule ligand is probe 3.


In some embodiments, disclosed herein is a small molecule ligand which binds to the DECR1 protein, wherein the small molecule ligand binds to one or more of the following residues: FDGGEEVLISGEFNDLR (SEQ ID NO: 32) of the DECR1 protein having the UniProtKB accession number Q16698. In some embodiments, also disclosed herein is a small molecule ligand which binds to the DECR1 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: FDGGEEVLISGEFNDLR (SEQ ID NO: 32) of the DECR1 protein having the UniProtKB accession number Q16698. In some cases, the small molecule ligand is probe 6.


In some embodiments, disclosed herein is a small molecule ligand which binds to the DHX9 protein, wherein the small molecule ligand binds to one or more of the following residues: ISAVSVAER (SEQ ID NO: 33) of the DHX9 protein having the UniProtKB accession number Q08211. In some embodiments, also disclosed herein is a small molecule ligand which binds to the DHX9 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: ISAVSVAER (SEQ ID NO: 33) of the DHX9 protein having the UniProtKB accession number Q08211. In some cases, the small molecule ligand is probe 3.


In some embodiments, disclosed herein is a small molecule ligand which binds to the DLD protein, wherein the small molecule ligand binds to one or more of the following residues: VLGAHILGPGAGEMVNEAALALEYGASCEDIAR (SEQ ID NO: 34) of the DLD protein having the UniProtKB accession number P09622. In some embodiments, also disclosed herein is a small molecule ligand which binds to the DLD protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: VLGAHILGPGAGEMVNEAALALEYGASCEDIAR (SEQ ID NO: 34) of the DLD protein having the UniProtKB accession number P09622. In some cases, the small molecule ligand is probe 4, 13 or 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the ECH1 protein, wherein the small molecule ligand binds to one or more residues of a ligand binding site selected from: MFTAGIDLMDMASDILQPK (SEQ ID NO: 35), YQETFNVIER (SEQ ID NO: 36) and EVDVGLAADVGTLQR (SEQ ID NO: 37) of the ECH1 protein having the UniProtKB accession number Q13011. In some embodiments, also disclosed herein is a small molecule ligand which binds to the ECH1 protein, wherein the small molecule ligand binds a ligand binding site defined by: MFTAGIDLMDMASDILQPK (SEQ ID NO: 35), YQETFNVIER (SEQ ID NO: 36) or EVDVGLAADVGTLQR (SEQ ID NO: 37) of the ECH1 protein having the UniProtKB accession number Q13011. In some cases, the small molecule ligand is probe 3, 4, 6, 8, 13, 14 or 15.


In some embodiments, disclosed herein is a small molecule ligand which binds to the EIF4A1 protein, wherein the small molecule ligand binds to one or more residues of a ligand binding site selected from: MFVLDEADEMLSR (SEQ ID NO: 38) and GYDVIAQAQSGTGK (SEQ ID NO: 39) of the EIF4A1 protein having the UniProtKB accession number P60842. In some embodiments, also disclosed herein is a small molecule ligand which binds to the EIF4A1 protein, wherein the small molecule ligand binds a ligand binding site defined by: MFVLDEADEMLSR (SEQ ID NO: 38) or GYDVIAQAQSGTGK (SEQ ID NO: 39) of the EIF4A1 protein having the UniProtKB accession number P60842. In some cases, the small molecule ligand is probe 9, 13 or 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the EIF4A2 protein, wherein the small molecule ligand binds to one or more of the following residues: GYDVIAQAQSGTGK (SEQ ID NO: 40) of the EIF4A2 protein having the UniProtKB accession number Q14240. In some embodiments, also disclosed herein is a small molecule ligand which binds to the EIF4A2 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: GYDVIAQAQSGTGK (SEQ ID NO: 40) of the EIF4A2 protein having the UniProtKB accession number Q14240. In some instances, the small molecule ligand is probe 13.


In some embodiments, disclosed herein is a small molecule ligand which binds to the ETFB protein, wherein the small molecule ligand binds to one or more of the following residues: HSMNPFCEIAVEEAVR (SEQ ID NO: 41) of the ETFB protein having the UniProtKB accession number P38117. In some embodiments, also disclosed herein is a small molecule ligand which binds to the ETFB protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: HSMNPFCEIAVEEAVR (SEQ ID NO: 41) of the ETFB protein having the UniProtKB accession number P38117. In some cases, the small molecule ligand is probe 3.


In some embodiments, disclosed herein is a small molecule ligand which binds to the FECH protein, wherein the small molecule ligand binds to one or more of the following residues: SEVVILFSAHSLPMSVVNR (SEQ ID NO: 42) of the FECH protein having the UniProtKB accession number P22830. In some embodiments, also disclosed herein is a small molecule ligand which binds to the FECH protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: SEVVILFSAHSLPMSVVNR (SEQ ID NO: 42) of the FECH protein having the UniProtKB accession number P22830. In some cases, the small molecule ligand is probe 4.


In some embodiments, disclosed herein is a small molecule ligand which binds to the GLA protein, wherein the small molecule ligand binds to one or more residues of a ligand binding site selected from: SILDWTSFNQER (SEQ ID NO: 43), FMCNLDCQEEPDSCISEK (SEQ ID NO: 44) and LFMEMAELMVSEGWK (SEQ ID NO: 45) of the GLA protein having the UniProtKB accession number P06280. In some embodiments, also disclosed herein is a small molecule ligand which binds to the GLA protein, wherein the small molecule ligand binds a ligand binding site defined by: SILDWTSFNQER (SEQ ID NO: 43), FMCNLDCQEEPDSCISEK (SEQ ID NO: 44) or LFMEMAELMVSEGWK (SEQ ID NO: 45) of the GLA protein having the UniProtKB accession number P06280. In some cases, the small molecule ligand is probe 4 or 9.


In some embodiments, disclosed herein is a small molecule ligand which binds to the GLB1 protein, wherein the small molecule ligand binds to one or more of the following residues: TEAVASSLYDILAR (SEQ ID NO: 46) of the GLB1 protein having the UniProtKB accession number P16278. In some embodiments, also disclosed herein is a small molecule ligand which binds to the GLB1 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: TEAVASSLYDILAR (SEQ ID NO: 46) of the GLB1 protein having the UniProtKB accession number P16278. In some instances, the small molecule ligand is probe 9.


In some embodiments, disclosed herein is a small molecule ligand which binds to the GLO1 protein, wherein the small molecule ligand binds to one or more of the following residues: GLAFIQDPDGYWIEILNPNK (SEQ ID NO: 47) of the GLO1 protein having the UniProtKB accession number Q04760. In some embodiments, also disclosed herein is a small molecule ligand which binds to the GLO1 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: GLAFIQDPDGYWIEILNPNK (SEQ ID NO: 47) of the GLO1 protein having the UniProtKB accession number Q04760. In some instances, the small molecule ligand is probe 3 or 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the GLUD1 protein, wherein the small molecule ligand binds to one or more residues of a ligand binding site selected from: YSTDVSVDEVK (SEQ ID NO: 48) and HGGTIPIVPTAEFQDR (SEQ ID NO: 49) of the GLUD1 protein having the UniProtKB accession number P00367. In some embodiments, also disclosed herein is a small molecule ligand which binds to the GLUD1 protein, wherein the small molecule ligand binds a ligand binding site defined by: YSTDVSVDEVK (SEQ ID NO: 48) or HGGTIPIVPTAEFQDR (SEQ ID NO: 49) of the GLUD1 protein having the UniProtKB accession number P00367. In some instances, the small molecule ligand is probe 6.


In some embodiments, disclosed herein is a small molecule ligand which binds to the GOLPH3 protein, wherein the small molecule ligand binds to one or more of the following residues: EGYTSFWNDCISSGLR (SEQ ID NO: 50) of the GOLPH3 protein having the UniProtKB accession number Q9H4A6. In some embodiments, also disclosed herein is a small molecule ligand which binds to the GOLPH3 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: EGYTSFWNDCISSGLR (SEQ ID NO: 50) of the GOLPH3 protein having the UniProtKB accession number Q9H4A6. In some instances, the small molecule ligand is probe 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the GSTP1 protein, wherein the small molecule ligand binds to one or more of the following residues: FQDGDLTLYQSNTILR (SEQ ID NO: 51) of the GSTP1 protein having the UniProtKB accession number P09211. In some embodiments, also disclosed herein is a small molecule ligand which binds to the GSTP1 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: FQDGDLTLYQSNTILR (SEQ ID NO: 51) of the GSTP1 protein having the UniProtKB accession number P09211. In some instances, the small molecule ligand is probe 2.


In some embodiments, disclosed herein is a small molecule ligand which binds to the HBA2 protein, wherein the small molecule ligand binds to one or more residues of a ligand binding site selected from: VGAHAGEYGAEALER (SEQ ID NO: 52) and VDPVNFK (SEQ ID NO: 53) of the HBA2 protein having the UniProtKB accession number P69905. In some embodiments, also disclosed herein is a small molecule ligand which binds to the HBA2 protein, wherein the small molecule ligand binds a ligand binding site defined by: VGAHAGEYGAEALER (SEQ ID NO: 52) or VDPVNFK (SEQ ID NO: 53) of the HBA2 protein having the UniProtKB accession number P69905. In some instances, the small molecule ligand is probe 4.


In some embodiments, disclosed herein is a small molecule ligand which binds to the HEXA protein, wherein the small molecule ligand binds to one or more of the following residues: LTSDLTFAYER (SEQ ID NO: 54) of the HEXA protein having the UniProtKB accession number P06865. In some embodiments, also disclosed herein is a small molecule ligand which binds to the HEXA protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: LTSDLTFAYER (SEQ ID NO: 54) of the HEXA protein having the UniProtKB accession number P06865. In some instances, the small molecule ligand is probe 9.


In some embodiments, disclosed herein is a small molecule ligand which binds to the HMOX2 protein, wherein the small molecule ligand binds to one or more of the following residues: AENTQFVK (SEQ ID NO: 55) and LATTALYFTYSALEEEMER (SEQ ID NO: 56) of the HMOX2 protein having the UniProtKB accession number P30519. In some embodiments, also disclosed herein is a small molecule ligand which binds to the HMOX2 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: AENTQFVK (SEQ ID NO: 55) or LATTALYFTYSALEEEMER (SEQ ID NO: 56) of the HMOX2 protein having the UniProtKB accession number P30519. In some instances, the small molecule ligand is probe 2, 3, 4, 6, 8, 14 or 15.


In some embodiments, disclosed herein is a small molecule ligand which binds to the HSD17B4 protein, wherein the small molecule ligand binds to one or more of the following residues: LGLLGLANSLAIEGR (SEQ ID NO: 57) of the HSD17B4 protein having the UniProtKB accession number P51659. In some embodiments, also disclosed herein is a small molecule ligand which binds to the HSD17B4 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: LGLLGLANSLAIEGR (SEQ ID NO: 57) of the HSD17B4 protein having the UniProtKB accession number P51659. In some instances, the small molecule ligand is probe 3.


In some embodiments, disclosed herein is a small molecule ligand which binds to the HSP90AB1 protein, wherein the small molecule ligand binds to one or more residues of a ligand binding site selected from: VFIMDSCDELIPEYLNFIR (SEQ ID NO: 58) and GFEVVYMTEPIDEYCVQQLK (SEQ ID NO: 59) of the HSP90AB1 protein having the UniProtKB accession number P08238. In some embodiments, also disclosed herein is a small molecule ligand which binds to the HSP90AB1 protein, wherein the small molecule ligand binds a ligand binding site defined by: VFIMDSCDELIPEYLNFIR (SEQ ID NO: 58) or GFEVVYMTEPIDEYCVQQLK (SEQ ID NO: 59) of the HSP90AB1 protein having the UniProtKB accession number P08238. In some instances, the small molecule ligand is probe 13 or 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the HSP90B1 protein, wherein the small molecule ligand binds to one or more residues of a ligand binding site selected from: LISLTDENALSGNEELTVK (SEQ ID NO: 60) and YSQFINFPIYVWSSK (SEQ ID NO: 61) of the HSP90B1 protein having the UniProtKB accession number P14625. In some embodiments, also disclosed herein is a small molecule ligand which binds to the HSP90B1 protein, wherein the small molecule ligand binds a ligand binding site defined by: LISLTDENALSGNEELTVK (SEQ ID NO: 60) or YSQFINFPIYVWSSK (SEQ ID NO: 61) of the HSP90B1 protein having the UniProtKB accession number P14625. In some instances, the small molecule ligand is probe 6 or 9.


In some embodiments, disclosed herein is a small molecule ligand which binds to the HSPA8 protein, wherein the small molecule ligand binds to one or more of the following residues: SFYPEEVSSMVLTK (SEQ ID NO: 62) of the HSPA8 protein having the UniProtKB accession number P11142. In some embodiments, also disclosed herein is a small molecule ligand which binds to the HSPA8 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: SFYPEEVSSMVLTK (SEQ ID NO: 62) of the HSPA8 protein having the UniProtKB accession number P11142. In some instances, the small molecule ligand is probe 13 or 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the IMPDH2 protein, wherein the small molecule ligand binds to one or more of the following residues: YEQGFITDPVVLSPK (SEQ ID NO: 63) of the IMPDH2 protein having the UniProtKB accession number P12268. In some embodiments, also disclosed herein is a small molecule ligand which binds to the IMPDH2 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: YEQGFITDPVVLSPK (SEQ ID NO: 63) of the IMPDH2 protein having the UniProtKB accession number P12268. In some instances, the small molecule ligand is probe 13.


In some embodiments, disclosed herein is a small molecule ligand which binds to the LDHA protein, wherein the small molecule ligand binds to one or more of the following residues: DLADELALVDVIEDK (SEQ ID NO: 64) of the LDHA protein having the UniProtKB accession number P00338. In some embodiments, also disclosed herein is a small molecule ligand which binds to the LDHA protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: DLADELALVDVIEDK (SEQ ID NO: 64) of the LDHA protein having the UniProtKB accession number P00338. In some instances, the small molecule ligand is probe 9.


In some embodiments, disclosed herein is a small molecule ligand which binds to the LDHB protein, wherein the small molecule ligand binds to one or more of the following residues: MVVESAYEVIK (SEQ ID NO: 65) of the LDHB protein having the UniProtKB accession number P07195. In some embodiments, also disclosed herein is a small molecule ligand which binds to the LDHB protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: MVVESAYEVIK (SEQ ID NO: 65) of the LDHB protein having the UniProtKB accession number P07195. In some instances, the small molecule ligand is probe 4.


In some embodiments, disclosed herein is a small molecule ligand which binds to the LGMN protein, wherein the small molecule ligand binds to one or more of the following residues: DYTGEDVTPQNFLAVLR (SEQ ID NO: 66) of the LGMN protein having the UniProtKB accession number Q99538. In some embodiments, also disclosed herein is a small molecule ligand which binds to the LGMN protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: DYTGEDVTPQNFLAVLR (SEQ ID NO: 66) of the LGMN protein having the UniProtKB accession number Q99538. In some instances, the small molecule ligand is probe 9.


In some embodiments, disclosed herein is a small molecule ligand which binds to the LTA4H protein, wherein the small molecule ligand binds to one or more of the following residues: LVVDLTDIDPDVAYSSVPYEK (SEQ ID NO: 67) of the LTA4H protein having the UniProtKB accession number P09960. In some embodiments, also disclosed herein is a small molecule ligand which binds to the LTA4H protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: LVVDLTDIDPDVAYSSVPYEK (SEQ ID NO: 67) of the LTA4H protein having the UniProtKB accession number P09960. In some cases, the small molecule ligand is probe 4, 8 or 13.


In some embodiments, disclosed herein is a small molecule ligand which binds to the NAMPT protein, wherein the small molecule ligand binds to one or more of the following residues: YLLETSGNLDGLEYK (SEQ ID NO: 68) of the NAMPT protein having the UniProtKB accession number P43490. In some embodiments, also disclosed herein is a small molecule ligand which binds to the NAMPT protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: YLLETSGNLDGLEYK (SEQ ID NO: 68) of the NAMPT protein having the UniProtKB accession number P43490. In some cases, the small molecule ligand is probe 3, 6, 8, 13, 14 or 15.


In some embodiments, disclosed herein is a small molecule ligand which binds to the NPM1 protein, wherein the small molecule ligand binds to one or more residues of a ligand binding site selected from: DELHIVEAEAMNYEGSPIK (SEQ ID NO: 69) and MSVQPTVSLGGFEITPPVVLR (SEQ ID NO: 70) of the NPM1 protein having the UniProtKB accession number P06748. In some embodiments, also disclosed herein is a small molecule ligand which binds to the NPM1 protein, wherein the small molecule ligand binds a ligand binding site defined by: DELHIVEAEAMNYEGSPIK (SEQ ID NO: 69) or MSVQPTVSLGGFEITPPVVLR (SEQ ID NO: 70) of the NPM1 protein having the UniProtKB accession number P06748. In some cases, the small molecule ligand is probe 13.


In some embodiments, disclosed herein is a small molecule ligand which binds to the PCMT1 protein, wherein the small molecule ligand binds to one or more of the following residues: LILPVGPAGGNQMLEQYDK (SEQ ID NO: 71) of the PCMT1 protein having the UniProtKB accession number P22061. In some embodiments, also disclosed herein is a small molecule ligand which binds to the PCMT1 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: LILPVGPAGGNQMLEQYDK (SEQ ID NO: 71) of the PCMT1 protein having the UniProtKB accession number P22061. In some instances, the small molecule ligand is probe 2, 3 or 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the PDHB protein, wherein the small molecule ligand binds to one or more of the following residues: VFLLGEEVAQYDGAYK (SEQ ID NO: 72) of the PDHB protein having the UniProtKB accession number P11177. In some embodiments, also disclosed herein is a small molecule ligand which binds to the PDHB protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: VFLLGEEVAQYDGAYK (SEQ ID NO: 72) of the PDHB protein having the UniProtKB accession number P11177. In some instances, the small molecule ligand is probe 2, 3, 13 or 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the PGK1 protein, wherein the small molecule ligand binds to one or more of the following residues: QIVWNGPVGVFEWEAFAR (SEQ ID NO: 73) of the PGK1 protein having the UniProtKB accession number P00558. In some embodiments, also disclosed herein is a small molecule ligand which binds to the PGK1 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: QIVWNGPVGVFEWEAFAR (SEQ ID NO: 73) of the PGK1 protein having the UniProtKB accession number P00558. In some instances, the small molecule ligand is probe 3.


In some embodiments, disclosed herein is a small molecule ligand which binds to the PKM protein, wherein the small molecule ligand binds to one or more of the following residues: IYVDDGLISLQVK (SEQ ID NO: 74) and LAPITSDPTEATAVGAVEASFK (SEQ ID NO: 75) of the PKM protein having the UniProtKB accession number P14618. In some embodiments, also disclosed herein is a small molecule ligand which binds to the PKM protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: IYVDDGLISLQVK (SEQ ID NO: 74) or LAPITSDPTEATAVGAVEASFK (SEQ ID NO: 75) of the PKM protein having the UniProtKB accession number P14618. In some instances, the small molecule ligand is probe 2 or 9.


In some embodiments, disclosed herein is a small molecule ligand which binds to the POR protein, wherein the small molecule ligand binds to one or more of the following residues: TALTYYLDITNPPR (SEQ ID NO: 76) of the POR protein having the UniProtKB accession number P16435. In some embodiments, also disclosed herein is a small molecule ligand which binds to the POR protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: TALTYYLDITNPPR (SEQ ID NO: 76) of the POR protein having the UniProtKB accession number P16435. In some instances, the small molecule ligand is probe 13 or 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to a protein selected from PPP1CA and PPP1CC, wherein the small molecule ligand binds to one or more of the following residues: IYGFYDECK (SEQ ID NO: 77), which corresponds to IYGFYDECK (SEQ ID NO: 78) of the PPP1CC protein having the UniProtKB accession number P36873. In some embodiments, also disclosed herein is a small molecule ligand which binds to a protein selected from PPP1CA and PPP1CC, wherein the small molecule ligand binds a ligand binding site defined by the following residues: IYGFYDECK (SEQ ID NO: 77), which corresponds to IYGFYDECK (SEQ ID NO: 78) of the PPP1CC protein having the UniProtKB accession number P36873. In some instances, the small molecule ligand is probe 2.


In some embodiments, disclosed herein is a small molecule ligand which binds to the PPP1CC protein, wherein the small molecule ligand binds to one or more of the following residues: EIFLSQPILLELEAPLK (SEQ ID NO: 79) of the PPP1CC protein having the UniProtKB accession number P36873. In some embodiments, also disclosed herein is a small molecule ligand which binds to the PPP1CC protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: EIFLSQPILLELEAPLK (SEQ ID NO: 79) of the PPP1CC protein having the UniProtKB accession number P36873. In some instances, the small molecule ligand is probe 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the PPT1 protein, wherein the small molecule ligand binds to one or more of the following residues: TLMEDVENSFFLNVNSQVTTVCQALAK (SEQ ID NO: 80) of the PPT1 protein having the UniProtKB accession number P50897. In some embodiments, also disclosed herein is a small molecule ligand which binds to the PPT1 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: TLMEDVENSFFLNVNSQVTTVCQALAK (SEQ ID NO: 80) of the PPT1 protein having the UniProtKB accession number P50897. In some cases, the small molecule ligand is probe 2, 4, 8, 9, 13, 14 or 15.


In some embodiments, disclosed herein is a small molecule ligand which binds to the PRDX2 protein, wherein the small molecule ligand binds to one or more of the following residues: TDEGIAYR (SEQ ID NO: 81) of the PRDX2 protein having the UniProtKB accession number P32119. In some embodiments, also disclosed herein is a small molecule ligand which binds to the PRDX2 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: TDEGIAYR (SEQ ID NO: 81) of the PRDX2 protein having the UniProtKB accession number P32119. In some cases, the small molecule ligand is probe 13.


In some embodiments, disclosed herein is a small molecule ligand which binds to the PSMB4 protein, wherein the small molecule ligand binds to one or more of the following residues: FEGGVVIAADMLGSYGSLAR (SEQ ID NO: 82) of the PSMB4 protein having the UniProtKB accession number P28070. In some embodiments, also disclosed herein is a small molecule ligand which binds to the PSMB4 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: FEGGVVIAADMLGSYGSLAR (SEQ ID NO: 82) of the PSMB4 protein having the UniProtKB accession number P28070. In some cases, the small molecule ligand is probe 6.


In some embodiments, disclosed herein is a small molecule ligand which binds to the PSMB5 protein, wherein the small molecule ligand binds to one or more residues of a ligand binding site selected from: LLANMVYQYK (SEQ ID NO: 83) and DAYSGGAVNLYHVR (SEQ ID NO: 84) of the PSMB5 protein having the UniProtKB accession number P28074. In some embodiments, also disclosed herein is a small molecule ligand which binds to the PSMB5 protein, wherein the small molecule ligand binds a ligand binding site defined by: LLANMVYQYK (SEQ ID NO: 83) or DAYSGGAVNLYHVR (SEQ ID NO: 84) of the PSMB5 protein having the UniProtKB accession number P28074. In some instances, the small molecule ligand is probe 3, 4 or 6.


In some embodiments, disclosed herein is a small molecule ligand which binds to the PSMB6 protein, wherein the small molecule ligand binds to one or more of the following residues: SGSAADTQAVADAVTYQLGFHSIELNEPPLVHTAASLFK (SEQ ID NO: 85) of the PSMB6 protein having the UniProtKB accession number P28072. In some embodiments, also disclosed herein is a small molecule ligand which binds to the PSMB6 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: SGSAADTQAVADAVTYQLGFHSIELNEPPLVHTAASLFK (SEQ ID NO: 85) of the PSMB6 protein having the UniProtKB accession number P28072. In some instances, the small molecule ligand is probe 3, 6 or 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the RAB7A protein, wherein the small molecule ligand binds to one or more of the following residues: DEFLIQASPR (SEQ ID NO: 86) of the RAB7A protein having the UniProtKB accession number P51149. In some embodiments, also disclosed herein is a small molecule ligand which binds to the RAB7A protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: DEFLIQASPR (SEQ ID NO: 86) of the RAB7A protein having the UniProtKB accession number P51149. In some instances, the small molecule ligand is probe 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the RUVBL2 protein, wherein the small molecule ligand binds to one or more of the following residues: ALESDMAPVLIMATNR (SEQ ID NO: 87) of the RUVBL2 protein having the UniProtKB accession number Q9Y230. In some embodiments, also disclosed herein is a small molecule ligand which binds to the RUVBL2 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: ALESDMAPVLIMATNR (SEQ ID NO: 87) of the RUVBL2 protein having the UniProtKB accession number Q9Y230. In some instances, the small molecule ligand is probe 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the SMYD3 protein, wherein the small molecule ligand binds to one or more of the following residues: DQYCFECDCFR (SEQ ID NO: 88) of the SMYD3 protein having the UniProtKB accession number Q9H7B4. In some embodiments, also disclosed herein is a small molecule ligand which binds to the SMYD3 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: DQYCFECDCFR (SEQ ID NO: 88) of the SMYD3 protein having the UniProtKB accession number Q9H7B4. In some cases, the small molecule ligand is probe 9.


In some embodiments, disclosed herein is a small molecule ligand which binds to the TPP1 protein, wherein the small molecule ligand binds to one or more of the following residues: GCHESCLDEEVEGQGFCSGPGWDPVTGWGTPNFPALLK (SEQ ID NO: 89) of the TPP1 protein having the UniProtKB accession number 014773. In some embodiments, also disclosed herein is a small molecule ligand which binds to the TPP1 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: GCHESCLDEEVEGQGFCSGPGWDPVTGWGTPNFPALLK (SEQ ID NO: 89) of the TPP1 protein having the UniProtKB accession number 014773. In some instances, the small molecule ligand is probe 4, 9, 13, 14 or 15.


In some embodiments, disclosed herein is a small molecule ligand which binds to the TXNDC17 protein, wherein the small molecule ligand binds to one or more of the following residues: YEEVSVSGFEEFHR (SEQ ID NO: 90) of the TXNDC17 protein having the UniProtKB accession number Q9BRA2. In some embodiments, also disclosed herein is a small molecule ligand which binds to the TXNDC17 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: YEEVSVSGFEEFHR (SEQ ID NO: 90) of the TXNDC17 protein having the UniProtKB accession number Q9BRA2. In some instances, the small molecule ligand is probe 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the YWHAE protein, wherein the small molecule ligand binds to one or more residues of a ligand binding site selected from: EAAENSLVAYK (SEQ ID NO: 91) and AAFDDAIAELDTLSEESYK (SEQ ID NO: 92) of the YWHAE protein having the UniProtKB accession number P62258. In some embodiments, also disclosed herein is a small molecule ligand which binds to the YWHAE protein, wherein the small molecule ligand binds a ligand binding site defined by: EAAENSLVAYK (SEQ ID NO: 91) or AAFDDAIAELDTLSEESYK (SEQ ID NO: 92) of the YWHAE protein having the UniProtKB accession number P62258. In some cases, the small molecule ligand is probe 13.


In some embodiments, disclosed herein is a small molecule ligand which binds to the YWHAQ protein, wherein the small molecule ligand binds to one or more of the following residues: TAFDEAIAELDTLNEDSYK (SEQ ID NO: 93) of the YWHAQ protein having the UniProtKB accession number P27348. In some embodiments, also disclosed herein is a small molecule ligand which binds to the YWHAQ protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: TAFDEAIAELDTLNEDSYK (SEQ ID NO: 93) of the YWHAQ protein having the UniProtKB accession number P27348. In some cases, the small molecule ligand is probe 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the YWHAZ protein, wherein the small molecule ligand binds to one or more of the following residues: TAFDEAIAELDTLSEESYK (SEQ ID NO: 94) of the YWHAZ protein having the UniProtKB accession number P63104. In some embodiments, also disclosed herein is a small molecule ligand which binds to the YWHAZ protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: TAFDEAIAELDTLSEESYK (SEQ ID NO: 94) of the YWHAZ protein having the UniProtKB accession number P63104. In some instances, the small molecule ligand is probe 13 or 14.


In some embodiments, disclosed herein is a small molecule ligand which binds to the EXO1 protein, wherein the small molecule ligand binds to one or more of the following residues: SQGVDCLVAPYEADAQLAYLNK (SEQ ID NO: 95) of the EXO1 protein having the UniProtKB accession number Q9UQ84. In some embodiments, also disclosed herein is a small molecule ligand which binds to the EXO1 protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: SQGVDCLVAPYEADAQLAYLNK (SEQ ID NO: 95) of the EXO1 protein having the UniProtKB accession number Q9UQ84. In some instances, the small molecule ligand is probe 2, 6, 8, 9 or 13.


In some embodiments, disclosed herein is a small molecule ligand which binds to the LMNA protein, wherein the small molecule ligand binds to one or more of the following residues: MQQQLDEYQELLDIK (SEQ ID NO: 96) of the LMNA protein having the UniProtKB accession number P02545. In some embodiments, also disclosed herein is a small molecule ligand which binds to the LMNA protein, wherein the small molecule ligand binds a ligand binding site defined by the following residues: MQQQLDEYQELLDIK (SEQ ID NO: 96) of the LMNA protein having the UniProtKB accession number P02545. In some instances, the small molecule ligand is probe 6 or 13.


In some cases, the small molecule ligand which binds to a protein has a structure represented by Formula (Ia):




embedded image



wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring; and R3 is an optionally substituted C2-C6 alkyl.


In some cases, the small molecule ligand which binds to a protein has a structure represented by Formula (IIa):




embedded image



wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring; and R3 is an optionally substituted C2-C6 alkyl.


In some cases, the small molecule ligand which binds to a protein has a structure represented by Formula (IIIa):




embedded image



wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring; and R3 is an optionally substituted C2-C6 alkyl.


In some cases, the small molecule ligand which binds to a protein has a structure represented by Formula (IVa):




embedded image



wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring; and R3 is an optionally substituted C2-C6 alkyl.


In some cases, the small molecule ligand which binds to a protein has a structure represented by Formula (Va):




embedded image



wherein R1 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; and R2 is an optionally substituted C2-C6 alkyl.


In some cases, the small molecule ligand which binds to a protein has a structure represented by Formula (VIa):




embedded image



wherein R1 is hydrogen and R2 is selected from substituted alkyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted cycloalkyl, optionally substituted heterocyclyl, optionally substituted heteroaryl, optionally substituted aralkyl, optionally substituted heteroarylalkyl, or optionally substituted heterocyclylalkyl; or R1 and R2 together with the nitrogen to which they are attached form an optionally substituted heterocyclyl ring; and R3 is an optionally substituted C2-C6 alkyl.


Cells, Analytical Techniques, and Instrumentation


In certain embodiments, one or more of the methods disclosed herein comprise a cell sample. In some embodiments, the cell sample for use with the methods described herein is obtained from cells of an animal. In some instances, the animal cell includes a cell from a marine invertebrate, fish, insects, amphibian, reptile, or mammal. In some instances, the mammalian cell is a primate, ape, equine, bovine, porcine, canine, feline, or rodent. In some instances, the mammal is a primate, ape, dog, cat, rabbit, ferret, or the like. In some cases, the rodent is a mouse, rat, hamster, gerbil, hamster, chinchilla, or guinea pig. In some embodiments, the bird cell is from a canary, parakeet or parrots. In some embodiments, the reptile cell is from a turtles, lizard or snake. In some cases, the fish cell is from a tropical fish. In some cases, the fish cell is from a zebrafish (e.g. Danino rerio). In some cases, the worm cell is from a nematode (e.g. C. elegans). In some cases, the amphibian cell is from a frog. In some embodiments, the arthropod cell is from a tarantula or hermit crab.


In some embodiments, the cell sample for use with the methods described herein is obtained from a mammalian cell. In some instances, the mammalian cell is an epithelial cell, connective tissue cell, hormone secreting cell, a nerve cell, a skeletal muscle cell, a blood cell, or an immune system cell.


Exemplary mammalian cells include, but are not limited to, 293A cell line, 293FT cell line, 293F cells, 293 H cells, HEK 293 cells, CHO DG44 cells, CHO-S cells, CHO-KI cells, Expi293F™ cells, Flp-In™ T-REx™ 293 cell line, Flp-In™-293 cell line, Flp-In™-3T3 cell line, Flp-In™-BHK cell line, Flp-In™-CHO cell line, Flp-In™-CV-1 cell line, Flp-In™-Jurkat cell line, FreeStyle™ 293-F cells, FreeStyle™ CHO-S cells, GripTite™ 293 MSR cell line, GS-CHO cell line, HepaRG™ cells, T-REx™ Jurkat cell line, Per.C6 cells, T-REx™-293 cell line, T-REx™-CHO cell line, T-REx™-HeLa cell line, NC-HIMT cell line, and PC12 cell line.


In some instances, the cell sample for use with the methods described herein is obtained from cells of a tumor cell line. In some instances, the sample is obtained from cells of a solid tumor cell line. In some instances, the solid tumor cell line is a sarcoma cell line. In some instances, the solid tumor cell line is a carcinoma cell line. In some embodiments, the sarcoma cell line is obtained from a cell line of alveolar rhabdomyosarcoma, alveolar soft part sarcoma, ameloblastoma, angiosarcoma, chondrosarcoma, chordoma, clear cell sarcoma of soft tissue, dedifferentiated liposarcoma, desmoid, desmoplastic small round cell tumor, embryonal rhabdomyosarcoma, epithelioid fibrosarcoma, epithelioid hemangioendothelioma, epithelioid sarcoma, esthesioneuroblastoma, Ewing sarcoma, extrarenal rhabdoid tumor, extraskeletal myxoid chondrosarcoma, extraskeletal osteosarcoma, fibrosarcoma, giant cell tumor, hemangiopericytoma, infantile fibrosarcoma, inflammatory myofibroblastic tumor, Kaposi sarcoma, leiomyosarcoma of bone, liposarcoma, liposarcoma of bone, malignant fibrous histiocytoma (MFH), malignant fibrous histiocytoma (MFH) of bone, malignant mesenchymoma, malignant peripheral nerve sheath tumor, mesenchymal chondrosarcoma, myxofibrosarcoma, myxoid liposarcoma, myxoinflammatory fibroblastic sarcoma, neoplasms with perivascular epitheioid cell differentiation, osteosarcoma, parosteal osteosarcoma, neoplasm with perivascular epitheioid cell differentiation, periosteal osteosarcoma, pleomorphic liposarcoma, pleomorphic rhabdomyosarcoma, PNET/extraskeletal Ewing tumor, rhabdomyosarcoma, round cell liposarcoma, small cell osteosarcoma, solitary fibrous tumor, synovial sarcoma, telangiectatic osteosarcoma.


In some embodiments, the carcinoma cell line is obtained from a cell line of adenocarcinoma, squamous cell carcinoma, adenosquamous carcinoma, anaplastic carcinoma, large cell carcinoma, small cell carcinoma, anal cancer, appendix cancer, bile duct cancer (i.e., cholangiocarcinoma), bladder cancer, brain tumor, breast cancer, cervical cancer, colon cancer, cancer of Unknown Primary (CUP), esophageal cancer, eye cancer, fallopian tube cancer, gastroenterological cancer, kidney cancer, liver cancer, lung cancer, medulloblastoma, melanoma, oral cancer, ovarian cancer, pancreatic cancer, parathyroid disease, penile cancer, pituitary tumor, prostate cancer, rectal cancer, skin cancer, stomach cancer, testicular cancer, throat cancer, thyroid cancer, uterine cancer, vaginal cancer, or vulvar cancer.


In some instances, the cell sample is obtained from cells of a hematologic malignant cell line. In some instances, the hematologic malignant cell line is a T-cell cell line. In some instances, B-cell cell line. In some instances, the hematologic malignant cell line is obtained from a T-cell cell line of: peripheral T-cell lymphoma not otherwise specified (PTCL-NOS), anaplastic large cell lymphoma, angioimmunoblastic lymphoma, cutaneous T-cell lymphoma, adult T-cell leukemia/lymphoma (ATLL), blastic NK-cell lymphoma, enteropathy-type T-cell lymphoma, hematosplenic gamma-delta T-cell lymphoma, lymphoblastic lymphoma, nasal NK/T-cell lymphomas, or treatment-related T-cell lymphomas.


In some instances, the hematologic malignant cell line is obtained from a B-cell cell line of: acute lymphoblastic leukemia (ALL), acute myelogenous leukemia (AML), chronic myelogenous leukemia (CML), acute monocytic leukemia (AMoL), chronic lymphocytic leukemia (CLL), high-risk chronic lymphocytic leukemia (CLL), small lymphocytic lymphoma (SLL), high-risk small lymphocytic lymphoma (SLL), follicular lymphoma (FL), mantle cell lymphoma (MCL), Waldenstrom's macroglobulinemia, multiple myeloma, extranodal marginal zone B cell lymphoma, nodal marginal zone B cell lymphoma, Burkitt's lymphoma, non-Burkitt high grade B cell lymphoma, primary mediastinal B-cell lymphoma (PMBL), immunoblastic large cell lymphoma, precursor B-lymphoblastic lymphoma, B cell prolymphocytic leukemia, lymphoplasmacytic lymphoma, splenic marginal zone lymphoma, plasma cell myeloma, plasmacytoma, mediastinal (thymic) large B cell lymphoma, intravascular large B cell lymphoma, primary effusion lymphoma, or lymphomatoid granulomatosis.


In some embodiments, the cell sample for use with the methods described herein is obtained from a tumor cell line. Exemplary tumor cell line includes, but is not limited to, 600MPE, AU565, BT-20, BT-474, BT-483, BT-549, Evsa-T, Hs578T, MCF-7, MDA-MB-231, SkBr3, T-47D, HeLa, DU145, PC3, LNCaP, A549, H1299, NCI-H460, A2780, SKOV-3/Luc, Neuro2a, RKO, RKO-AS45-1, HT-29, SW1417, SW948, DLD-1, SW480, Capan-1, MC/9, B72.3, B25.2, B6.2, B38.1, DMS 153, SU.86.86, SNU-182, SNU-423, SNU-449, SNU-475, SNU-387, Hs817.T, LMH, LMH/2A, SNU-398, PLHC-1, HepG2/SF, OCI-Ly1, OCI-Ly2, OCI-Ly3, OCI-Ly4, OCI-Ly6, OCI-Ly7, OCI-Ly10, OCI-Ly18, OCI-Ly19, U2932, DB, HBL-1, RIVA, SUDHL2, TMD8, MEC1, MEC2, 8E5, CCRF-CEM, MOLT-3, TALL-104, AML-193, THP-1, BDCM, HL-60, Jurkat, RPMI 8226, MOLT-4, RS4, K-562, KASUMI-1, Daudi, GA-10, Raji, JeKo-1, NK-92, and Mino.


In some embodiments, the cell sample for use in the methods is from any tissue or fluid from an individual. Samples include, but are not limited to, tissue (e.g. connective tissue, muscle tissue, nervous tissue, or epithelial tissue), whole blood, dissociated bone marrow, bone marrow aspirate, pleural fluid, peritoneal fluid, central spinal fluid, abdominal fluid, pancreatic fluid, cerebrospinal fluid, brain fluid, ascites, pericardial fluid, urine, saliva, bronchial lavage, sweat, tears, ear flow, sputum, hydrocele fluid, semen, vaginal flow, milk, amniotic fluid, and secretions of respiratory, intestinal or genitourinary tract. In some embodiments, the sample is a tissue sample, such as a sample obtained from a biopsy or a tumor tissue sample. In some embodiments, the sample is a blood serum sample. In some embodiments, the sample is a blood cell sample containing one or more peripheral blood mononuclear cells (PBMCs). In some embodiments, the sample contains one or more circulating tumor cells (CTCs). In some embodiments, the sample contains one or more disseminated tumor cells (DTC, e.g., in a bone marrow aspirate sample).


In some embodiments, the cell samples are obtained from the individual by any suitable means of obtaining the sample using well-known and routine clinical methods. Procedures for obtaining tissue samples from an individual are well known. For example, procedures for drawing and processing tissue sample such as from a needle aspiration biopsy is well-known and is employed to obtain a sample for use in the methods provided. Typically, for collection of such a tissue sample, a thin hollow needle is inserted into a mass such as a tumor mass for sampling of cells that, after being stained, will be examined under a microscope.


Sample Preparation and Analysis


In some embodiments, the sample is a sample solution. In some instances, the sample solution comprises a solution such as a buffer (e.g. phosphate buffered saline) or a media. In some embodiments, the media is an isotopically labeled media. In some instances, the sample solution is a cell solution.


In some embodiments, the sample (e.g., cells or a cell solution) is incubated with one or more probes for analysis of protein-probe interactions. In some instances, the sample (e.g., cells or a cell solution) is further incubated in the presence of an additional probe prior to addition of the one or more probes. In other instances, the sample (e.g., cells or a cell solution) is further incubated with a non-probe small molecule ligand, in which the non-probe small molecule ligand does not contain a photoreactive moiety and/or an alkyne group. In such instances, the sample is incubated with a probe and non-probe small molecule ligand for competitive protein profiling analysis.


In some cases, the sample is compared with a control. In some cases, a difference is observed between a set of probe protein interactions between the sample and the control. In some instances, the difference correlates to the interaction between the small molecule fragment and the proteins.


In some embodiments, one or more methods are utilized for labeling a sample (e.g. cells or a cell solution) for analysis of probe protein interactions. In some instances, a method comprises labeling the sample (e.g. cells or a cell solution) with an enriched media. In some cases, the sample (e.g. cells or a cell solution) is labeled with isotope-labeled amino acids, such as 13C or 15N-labeled amino acids. In some cases, the labeled sample is further compared with a non-labeled sample to detect differences in probe protein interactions between the two samples. In some instances, this difference is a difference of a target protein and its interaction with a small molecule ligand in the labeled sample versus the non-labeled sample. In some instances, the difference is an increase, decrease or a lack of protein-probe interaction in the two samples. In some instances, the isotope-labeled method is termed SILAC, stable isotope labeling using amino acids in cell culture.


In some instances, the sample is divided into a first cell solution and a second cell solution. In some cases, the first cell solution is incubated with a first probe for an extended period of time to generate a first group of probe-protein complexes. In some instances, the extended period of time is about 5, 10, 15, 20, 30, 60, 90, 120 minutes or longer. In some instances, the second cell solution comprises a second probe to generate a second group of probe-protein complexes. In some instances, the first probe and the second probe are different. In some embodiments, cells from the second cell solution are treated with a buffer, such as a control buffer, in which the buffer does not contain a small molecule fragment probe. In some embodiments, the control buffer comprises dimethyl sulfoxide (DMSO).


In some embodiments, a method comprises incubating a sample (e.g. cells or a cell solution) or a processed sample (e.g., a cell lysate) with a labeling group (e.g., an isotopically labeled labeling group) to tag one or more proteins of interest for further analysis. In such cases, the labeling group comprises a biotin, a streptavidin, bead, resin, a solid support, or a combination thereof, and further comprises a linker that is optionally isotopically labeled. As described above, the linker can be about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more residues in length and can further comprise a cleavage site, such as a protease cleavage site (e.g., TEV cleavage site). In some cases, the labeling group is a biotin-linker moiety, which is optionally isotopically labeled with 13C and 15N atoms at one or more amino acid residue positions within the linker. In some cases, the biotin-linker moiety is a isotopically-labeled TEV-tag as described in Weerapana, et al., “Quantitative reactivity profiling predicts functional cysteines in proteomes,” Nature 468(7325): 790-795.


In some embodiments, an isotopic reductive dimethylation (ReDi) method is utilized for processing a sample. In some cases, the ReDi labeling method involves reacting peptides with formaldehyde to form a Schiff base, which is then reduced by cyanoborohydride. This reaction dimethylates free amino groups on N-termini and lysine side chains and monomethylates N-terminal prolines. In some cases, the ReDi labeling method comprises methylating peptides from a first processed sample with a “light” label using reagents with hydrogen atoms in their natural isotopic distribution and peptides from a second processed sample with a “heavy” label using deuterated formaldehyde and cyanoborohydride. Subsequent proteomic analysis (e.g., mass spectrometry analysis) based on a relative peptide abundance between the heavy and light peptide version can be used for analysis of probe-protein interactions.


In some embodiments, isobaric tags for relative and absolute quantitation (iTRAQ) method is utilized for processing a sample. In some cases, the iTRAQ method is based on the covalent labeling of the N-terminus and side chain amines of peptides from a processed sample. In some cases, reagent such as 4-plex or 8-plex is used for labeling the peptides.


In some embodiments, the probe-protein complex is further conjugated to a chromophore, such as a fluorophore. In some instances, the probe-protein complex is separated and visualized utilizing an electrophoresis system, such as through a gel electrophoresis, or a capillary electrophoresis. Exemplary gel electrophoresis includes agarose based gels, polyacrylamide based gels, or starch based gels. In some instances, the probe-protein is subjected to a native electrophoresis condition. In some instances, the probe-protein is subjected to a denaturing electrophoresis condition.


In some instances, the probe-protein after harvesting is further fragmentized to generate protein fragments. In some instances, fragmentation is generated through mechanical stress, pressure, or chemical means. In some instances, the protein from the probe-protein complexes is fragmented by a chemical means. In some embodiments, the chemical means is a protease. Exemplary proteases include, but are not limited to, serine proteases such as chymotrypsin A, penicillin G acylase precursor, dipeptidase E, DmpA aminopeptidase, subtilisin, prolyl oligopeptidase, D-Ala-D-Ala peptidase C, signal peptidase I, cytomegalovirus assemblin, Lon-A peptidase, peptidase Clp, Escherichia coli phage KlF endosialidase CIMCD self-cleaving protein, nucleoporin 145, lactoferrin, murein tetrapeptidase LD-carboxypeptidase, or rhomboid-1; threonine proteases such as ornithine acetyltransferase; cysteine proteases such as TEV protease, amidophosphoribosyltransferase precursor, gamma-glutamyl hydrolase (Rattus norvegicus), hedgehog protein, DmpA aminopeptidase, papain, bromelain, cathepsin K, calpain, caspase-1, separase, adenain, pyroglutamyl-peptidase I, sortase A, hepatitis C virus peptidase 2, sindbis virus-type nsP2 peptidase, dipeptidyl-peptidase VI, or DeSI-1 peptidase; aspartate proteases such as beta-secretase 1 (BACE1), beta-secretase 2 (BACE2), cathepsin D, cathepsin E, chymosin, napsin-A, nepenthesin, pepsin, plasmepsin, presenilin, or renin; glutamic acid proteases such as AfuGprA; and metalloproteases such as peptidase_M48.


In some instances, the fragmentation is a random fragmentation. In some instances, the fragmentation generates specific lengths of protein fragments, or the shearing occurs at particular sequence of amino acid regions.


In some instances, the protein fragments are further analyzed by a proteomic method such as by liquid chromatography (LC) (e.g. high performance liquid chromatography), liquid chromatography-mass spectrometry (LC-MS), matrix-assisted laser desorption/ionization (MALDI-TOF), gas chromatography-mass spectrometry (GC-MS), capillary electrophoresis-mass spectrometry (CE-MS), or nuclear magnetic resonance imaging (NMR).


In some embodiments, the LC method is any suitable LC methods well known in the art, for separation of a sample into its individual parts. This separation occurs based on the interaction of the sample with the mobile and stationary phases. Since there are many stationary/mobile phase combinations that are employed when separating a mixture, there are several different types of chromatography that are classified based on the physical states of those phases. In some embodiments, the LC is further classified as normal-phase chromatography, reverse-phase chromatography, size-exclusion chromatography, ion-exchange chromatography, affinity chromatography, displacement chromatography, partition chromatography, flash chromatography, chiral chromatography, and aqueous normal-phase chromatography.


In some embodiments, the LC method is a high performance liquid chromatography (HPLC) method. In some embodiments, the HPLC method is further categorized as normal-phase chromatography, reverse-phase chromatography, size-exclusion chromatography, ion-exchange chromatography, affinity chromatography, displacement chromatography, partition chromatography, chiral chromatography, and aqueous normal-phase chromatography.


In some embodiments, the HPLC method of the present disclosure is performed by any standard techniques well known in the art. Exemplary HPLC methods include hydrophilic interaction liquid chromatography (HILIC), electrostatic repulsion-hydrophilic interaction liquid chromatography (ERLIC) and reverse phase liquid chromatography (RPLC).


In some embodiments, the LC is coupled to a mass spectroscopy as a LC-MS method. In some embodiments, the LC-MS method includes ultra-performance liquid chromatography-electrospray ionization quadrupole time-of-flight mass spectrometry (UPLC-ESI-QTOF-MS), ultra-performance liquid chromatography-electrospray ionization tandem mass spectrometry (UPLC-ESI-MS/MS), reverse phase liquid chromatography-mass spectrometry (RPLC-MS), hydrophilic interaction liquid chromatography-mass spectrometry (HILIC-MS), hydrophilic interaction liquid chromatography-triple quadrupole tandem mass spectrometry (HILIC-QQQ), electrostatic repulsion-hydrophilic interaction liquid chromatography-mass spectrometry (ERLIC-MS), liquid chromatography time-of-flight mass spectrometry (LC-QTOF-MS), liquid chromatography-tandem mass spectrometry (LC-MS/MS), multidimensional liquid chromatography coupled with tandem mass spectrometry (LC/LC-MS/MS). In some instances, the LC-MS method is LC/LC-MS/MS. In some embodiments, the LC-MS methods of the present disclosure are performed by standard techniques well known in the art.


In some embodiments, the GC is coupled to a mass spectroscopy as a GC-MS method. In some embodiments, the GC-MS method includes two-dimensional gas chromatography time-of-flight mass spectrometry (GC*GC-TOFMS), gas chromatography time-of-flight mass spectrometry (GC-QTOF-MS) and gas chromatography-tandem mass spectrometry (GC-MS/MS).


In some embodiments, CE is coupled to a mass spectroscopy as a CE-MS method. In some embodiments, the CE-MS method includes capillary electrophoresis-negative electrospray ionization-mass spectrometry (CE-ESI-MS), capillary electrophoresis-negative electrospray ionization-quadrupole time of flight-mass spectrometry (CE-ESI-QTOF-MS) and capillary electrophoresis-quadrupole time of flight-mass spectrometry (CE-QTOF-MS).


In some embodiments, the nuclear magnetic resonance (NMR) method is any suitable method well known in the art for the detection of one or more cysteine binding proteins or protein fragments disclosed herein. In some embodiments, the NMR method includes one dimensional (1D) NMR methods, two dimensional (2D) NMR methods, solid state NMR methods and NMR chromatography. Exemplary 1D NMR methods include 1Hydrogen, 13Carbon, 15Nitrogen, 17Oxygen, 19Fluorine, 31Phosphorus, 39Potassium, 2Sodium, 33Sulfur, 87Strontium, 27Aluminium, 43Calcium, 35Chlorine, 37Chlorine, 63Copper, 65Copper, 57Iron, 25Magnesium, 199Mercury or 67Zinc NMR method, distortionless enhancement by polarization transfer (DEPT) method, attached proton test (APT) method and 1D-incredible natural abundance double quantum transition experiment (INADEQUATE) method. Exemplary 2D NMR methods include correlation spectroscopy (COSY), total correlation spectroscopy (TOCSY), 2D-INADEQUATE, 2D-adequate double quantum transfer experiment (ADEQUATE), nuclear overhauser effect spectroscopy (NOSEY), rotating-frame NOE spectroscopy (ROESY), heteronuclear multiple-quantum correlation spectroscopy (HMQC), heteronuclear single quantum coherence spectroscopy (HSQC), short range coupling and long range coupling methods. Exemplary solid state NMR method include solid state 13Carbon NMR, high resolution magic angle spinning (HR-MAS) and cross polarization magic angle spinning (CP-MAS) NMR methods. Exemplary NMR techniques include diffusion ordered spectroscopy (DOSY), DOSY-TOCSY and DOSY-HSQC.


In some embodiments, the protein fragments are analyzed by method as described in Weerapana et al., “Quantitative reactivity profiling predicts functional cysteines in proteomes,” Nature, 468:790-795 (2010).


In some embodiments, the results from the mass spectroscopy method are analyzed by an algorithm for protein identification. In some embodiments, the algorithm combines the results from the mass spectroscopy method with a protein sequence database for protein identification. In some embodiments, the algorithm comprises ProLuCID algorithm, Probity, Scaffold, SEQUEST, or Mascot.


In some embodiments, a value is assigned to each of the protein from the probe-protein complex. In some embodiments, the value assigned to each of the protein from the probe-protein complex is obtained from the mass spectroscopy analysis. In some instances, the value is the area-under-the curve from a plot of signal intensity as a function of mass-to-charge ratio. In some embodiments, a first value is assigned to the protein obtained from the first cell solution and a second value is assigned to the same protein obtained from the second cell solution. In some instances, a ratio is calculated between the two values. In some instances, a ratio of greater than 2 indicates that the protein is a candidate for interacting with a drug. In some instances, the ratio is greater than 2.5, 3, 3.5, 4, 4.5, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20. In some cases, the ratio is at most 20.


In some instances, the ratio is calculated based on averaged values. In some instances, the averaged value is an average of at least two, three, or four values of the protein from each cell solution, or that the protein is observed at least two, three, or four times in each cell solution and a value is assigned to each observed time. In some instances, the ratio further has a standard deviation of less than 12, 10, or 8.


In some instances, a value is not an averaged value. In some instances, the ratio is calculated based on value of a protein observed only once in a cell population. In some instances, the ratio is assigned with a value of 20.


Kits/Article of Manufacture


Disclosed herein, in certain embodiments, are kits and articles of manufacture for use with one or more methods described herein. In some embodiments, described herein is a kit for generating a protein comprising a photoreactive ligand. In some embodiments, such kit includes photoreactive small molecule ligands described herein, small molecule fragments or libraries and/or controls, and reagents suitable for carrying out one or more of the methods described herein. In some instances, the kit further comprises samples, such as a cell sample, and suitable solutions such as buffers or media. In some embodiments, the kit further comprises recombinant proteins for use in one or more of the methods described herein. In some embodiments, additional components of the kit comprises a carrier, package, or container that is compartmentalized to receive one or more containers such as vials, tubes, and the like, each of the container(s) comprising one of the separate elements to be used in a method described herein. Suitable containers include, for example, bottles, vials, plates, syringes, and test tubes. In one embodiment, the containers are formed from a variety of materials such as glass or plastic.


The articles of manufacture provided herein contain packaging materials. Examples of pharmaceutical packaging materials include, but are not limited to, bottles, tubes, bags, containers, and any packaging material suitable for a selected formulation and intended mode of use.


For example, the container(s) include probes, test compounds, and one or more reagents for use in a method disclosed herein. Such kits optionally include an identifying description or label or instructions relating to its use in the methods described herein.


A kit typically includes labels listing contents and/or instructions for use, and package inserts with instructions for use. A set of instructions will also typically be included.


In one embodiment, a label is on or associated with the container. In one embodiment, a label is on a container when letters, numbers or other characters forming the label are attached, molded or etched into the container itself; a label is associated with a container when it is present within a receptacle or carrier that also holds the container, e.g., as a package insert. In one embodiment, a label is used to indicate that the contents are to be used for a specific therapeutic application. The label also indicates directions for use of the contents, such as in the methods described herein.


Certain Terminology


Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is commonly understood by one of skill in the art to which the claimed subject matter belongs. It is to be understood that the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of any subject matter claimed. In this application, the use of the singular includes the plural unless specifically stated otherwise. It must be noted that, as used in the specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise. In this application, the use of “or” means “and/or” unless stated otherwise. Furthermore, use of the term “including” as well as other forms, such as “include”, “includes,” and “included,” is not limiting.


As used herein, ranges and amounts can be expressed as “about” a particular value or range. About also includes the exact amount. Hence “about 5 μL” means “about 5 μL” and also “5 μL.” Generally, the term “about” includes an amount that would be expected to be within experimental error.


The section headings used herein are for organizational purposes only and are not to be construed as limiting the subject matter described.


The term “protein”, as used herein, refers to any polymeric chain of amino acids. The term “protein” encompasses native or modified protein, protein fragments, or polypeptide analogs comprising non-native amino acid residues. In some instances, a protein is monomeric. In other instances, a protein is polymeric. In some instances, a protein described herein is also referred to as an “isolated polypeptide”, or a polypeptide that by virtue of its origin or source of derivation is not associated with naturally associated components that accompany it in its native state; is substantially free of other proteins from the same species; is expressed by a cell from a different species; or does not occur in nature.


In some embodiments, the term “bind(s)” or “binding” encompass a covalent interaction between a small molecule ligand and a protein binding site described herein. In other embodiments, the term “bind(s)” or “binding” encompass a non-covalent interaction between a small molecule ligand and a protein binding site described herein. In additional embodiments, the term “bind(s)” or “binding” encompass an interaction between a small molecule ligand and a region of a protein of interest in which the region on the protein is about 1 Å, 2 Å, 3 Å, 4 Å, 5 Å, 6 Å, 7 Å, 8 Å, 9 Å or 10 Å away from a binding site on the protein of interest. In some cases, the binding site is a functional or active site on the protein. In some cases, the binding site on the protein is not a functional or active site. In additional cases, the binding site on the protein is distal from a functional or active site. In the context of a competition interaction with two or more different small molecule ligands, the term “bind(s)” or “binding” can encompass blocking or displacement of small molecule ligands from interacting with a region or binding site on a protein of interest.


As used herein, the term “functional site” or “active site” are used interchangeably and refer to a region of a protein that has a specific biological activity. For example, the functional site can be a site that binds a substrate or other binding partner and optionally contributes the amino acid residues that directly participate in the making and breaking of chemical bonds. In some instances, a functional site or active site encompass, e.g., catalytic sites of enzymes, ligand binding domains of receptors, binding domains of regulators, or receptor binding domains of secreted proteins. In some cases, the functional or active site also encompass transactivation, protein-protein interaction, or DNA binding domains of transcription factors and regulators.


EXAMPLES

These examples are provided for illustrative purposes only and not to limit the scope of the claims provided herein.


Example 1—Cell Lines

HEK293T cells were maintained in high-glucose DMEM (Gibco) supplemented with 10% (v/v) fetal bovine serum (FBS), penicillin (100 U/mL), streptomycin (100 μg/mL) and L-glutamine (2 mM). K562 and HSC-5 cells were maintained in high-glucose IMDM (Gibco) supplemented with 10% (v/v) fetal bovine serum (FBS), penicillin (100 U/mL) and streptomycin (100 μg/mL). All cell lines were grown at 37° C. in a humidified 5% CO2 atmosphere. For SILAC experiments, each cell line was passaged at least six times in either SILAC DMEM or SILAC IMDM, (Thermo), which lack L-lysine and L-arginine, and supplemented with 10% (v/v) dialyzed FBS (Gemini), PSQ (as above), and either [13C6, 15N2]-L-lysine and [13C6, 15N4]-L-arginine (100 μg/mL each) or L-lysine.HCl and L-arginine.HCl (100 μg/mL each). Heavy and light cells were maintained in parallel and cell aliquots were frozen after six passages in SILAC media and stored in liquid N2 until needed. Whenever thawed, cells were passaged at least three times before being used in experiments.


3T3-L1 preadipocytes were maintained in DMEM supplemented with 10% bovine calf serum. 10T1/2 cells were maintained in DMEM with 10% fetal bovine serum (FBS). To induce differentiation, confluent cells were cultured in DMEM with 10% FBS and exposed to dexamethasone (1 μM), 3-isobutyl-1-methylxanthine (IBMX; 0.5 mM), and insulin (1 μg/ml) for 2 days, followed by culture with insulin alone (1 μg/ml).


Example 2—In Situ Labeling of Live Cells with “Fully Functionalized” Fragment (FFF) Probes

For gel-based experiments, cells were grown in 6-well plates to ˜90% confluence at the time of treatment. Cells were carefully washed with Dulbecco's phosphate buffered saline (DPBS) and replenished with fresh serum-free media containing indicated FFF probe, and, if applicable, competitors or DMSO vehicle (1 mL). Following incubation at 37° C. for 30 min, cells were directly exposed to 365 nm light for 10 min. For no UV experiments, cells were incubated at 4° C. for 10 min under ambient light. For MS-based experiments, cell labeling was performed in a similar manner as described above. Modifications to this protocol included using isotopically ‘light’ and ‘heavy’ SILAC cells that were grown to near complete confluence prior to treatment in 10 cm plates. In probe-versus-control probe and probe-versus-probe experiments, isotopically light cells were treated with indicated fragment probe, while the heavy cells were treated with control probe (1), or additional FFF probe to be compared, at indicated concentrations. In competition type experiments, heavy and light cells were co-treated with the indicated FFF probe and competitor or DMSO, respectively. Following treatments and photocrosslinking, cells were harvested in cold DPBS by scraping, centrifuged (1,400 g, 3 min, 4° C.), and pellets washed with cold DPBS (2×) and then aspirated. Pellets were either directly processed or kept frozen at −80° C. until use.


Example 3—Preparation of Probe-Labeled Proteome for Gel- and MS-Based Protein Analyses

Cells pellets were lysed in cold DPBS (100-500 □L) using a Branson Sonifier probe sonicator (10 pulses, 30% duty cycle, output setting=4). For experiments requiring cell fractionation into membrane and soluble proteomes, cell lysates were then centrifuged (100,000×g, 45 min) to provide soluble (supernatant) and membrane (pellet) fractions. Membrane pellets were resuspended in cold DPBS after separation by sonication. Protein concentration was determined using the DC Protein Assay (Bio-Rad) and absorbance read using a Tecan, Infinite F500 plate reader following manufacturer's instructions. For SILAC experiments, isotopically heavy and light whole cell lysates were adjusted to 1.5 mg/mL, and were then mixed in equal proportions (500 □L each) in cold DPBS.


Example 4—Gel-Based Analysis of Crosslinked Proteins in Cells

Proteomes from treated cells were diluted to 1 mg/mL. To each sample (50 □L), 6 □L of a freshly prepared “click” reagent mixture containing 0.1 mM tris(benzyltriazolylmethyl)amine (TBTA) (3 □L/sample, 1.7 mM in 1:4 DMSO:t-ButOH), 1 mM CuSO4 (1 □L/sample, 50 mM in H2O), 25 □M tetramethylrhodamine (TAMRA) azide (1 □L/sample, 1.25 mM in DMSO), and freshly prepared 1 mM tris(2-carboxyethyl)phosphine HCl (TCEP) (1 □L/sample, 50 mM in PBS or H2O) was added to conjugate the fluorophore to probe-labeled proteins. Upon addition of the click mixture, each reaction was immediately mixed by vortexing and then allowed to react at ambient temperature for 1 hr before quenching the reactions with SDS loading buffer (4× stock, 17 □L). Proteins (25 □g total protein loaded per gel lane) were resolved using SDS-PAGE (10% acrylamide) and visualized by in-gel fluorescence on a Hitachi FMBIO-II or a Bio-Rad ChemiDoc™ MP flatbed fluorescence scanner.


Example 5—Preparation of Labeled Proteome for MS-Based Analysis

Profiling experiments were adapted methods previously reported. To the combined mixture of heavy and light soluble proteomes (1.5 mg) in 1 mL DPBS, a mixture of TBTA (60 μL/sample, 1.7 mM in 1:4 DMSO:t-BuOH), CuSO4 (20 μL/sample, 50 mM in H2O), TCEP (20 μL/sample, 50 mM in DPBS) and Biotin-N3 (10 μL/sample, 10 mM in DMSO) was added and each sample was rotated at room temperature. After 1 hr, the mixture was transferred to a 15 mL falcon tube and a cold 4:1 mixture (2.5 mL) of methanol (MeOH)/chloroform (CHCl3) was added followed by cold PBS (1 mL) on ice. The resulting cloudy mixture was centrifuged (5,000×g, 10 min, 4° C.) to fractionate the protein interphase from the organic and aqueous solvent layers. After washing the protein disc carefully with cold 1:1 MeOH:CHCl3 (3×1 mL) followed by sonication in cold 4:1 MeOH:CHCl3 (3 mL) to ensure click reagents were efficiently removed, the remaining precipitate was pelleted by centrifugation (5,000×g, 10 min, 4° C.). The pellet was aspirated and resuspended in a freshly-prepared solution of proteomics-grade urea (500 μL, 6 M in DPBS) containing 10 μL of 10% SDS and then dissolved by sonication. Disulfides were reduced by adding 50 μL of a 1:1 mixture containing TCEP (200 mM in DPBS) pre-neutralized with potassium carbonate (600 mM DPBS) for 30 min at 37° C. Reduced thiols were then alkylated by addition of iodoacetamide (70 μL of 400 mM in DPBS) for 30 min at ambient temperature protected from light. To each solution, 130 μL of 10% SDS (in DPBS) was added and then diluted to ˜0.2% SDS with DPBS (5.5 mL) and incubated with pre-equilibrated streptavidin agarose resin (100 μL 1:1 slurry, Pierce) for 1.5 hr at ambient temperature on a rotator. The streptavidin beads were collected by centrifugation (1,400 g, 1-2 min) and sequentially washed with 0.2% SDS in DPBS (1×5 mL), detergent-free DPBS (2×5 mL), and H2O (2×5 mL) to remove unbound protein, excess detergent, and small molecules. The resin was transferred to a Protein LoBind tube (Eppendorf) and bound proteins were digested on-bead overnight at 37° C. in ˜200 μL total volume containing sequencing grade porcine trypsin (2 μg, Promega) in the presence of urea (2 M in DPBS) and CaCl2 (1 mM). The proteolyzed supernatant was transferred to a fresh Protein LoBind tube, acidified with formic acid (5% final) and stored at −20° C. until analyzed.


Example 6—Multidimensional Liquid Chromatography-Tandem Mass Spectrometry (LC/LC-MS/MS) Analysis of Tryptic Digests

Peptides from tryptic digests were pressure loaded onto a 250 μm (inner diameter) fused silica capillary column packed with C18 resin (4 cm, Aqua 5 μm, Phenomenex). Samples were analyzed using an LTQ-Orbitrap Velos mass spectrometer (Thermo Scientific) coupled to an Agilent 1200 series quaternary pump. Peptides were eluted by two-dimensional separation on a column with a 5 μm tip [100 μm fused silica, packed with C18 (10 cm) and strong cation exchange (SCX) resin (4 cm, Phenomenex)] using a five-step ‘MudPIT’ protocol that involves 0%, 25%, 50%, 80% and 100% salt bumps of ammonium acetate (NH4OAc; 500 mM) to elute peptides stepwise from the SCX to the C18 resin followed by an increasing gradient of acetonitrile in each step (5%-100% buffer B in buffer A; buffer A: 95% H2O, 5% acetonitrile, 0.1% formic acid; buffer B: 5% H2O, 95% acetonitrile, 0.1% formic acid). The flow rate through the column was 0.25 μl/min and the voltage applied to the nano-LC electrospray ionization source was 2.5 kV. Spectra were collected in a data-dependent acquisition mode such that each scan cycle involved a single high-resolution full MS spectrum of parent ions (MS1 scan from 400-1800 m/z) collected in the orbitrap coupled to 30 CID-induced fragmentation (MS2) scans in the ion trap of the 30 most abundant parent ions from the MS1 scan. Dynamic exclusion (repeat count of 1, exclusion duration of 20 s). Parent ions with unassigned or +1 charge states by the instrument were excluded for fragmentation. All other parameters were left at default values.


Example 7—Peptide and Protein Identification and Quantification

From each of the five .raw files (one for each salt ‘bump’) generated by the instrument (Xcalibur software), the MS2 spectra for all fragmented parent ions (.ms2 file) were extracted using RAW Xtract (version 1.9.9.2; 2004 release). Each .ms2 file was searched using the ProLuCID algorithm against a reverse-concatenated, nonredundant (gene-centric) database of the human proteome (Uniprot release—Nov. 5, 2012) or mouse proteome (Nov. 5, 2012) and filtered using DTASelect 2.0 within the Integrated Proteomics Pipeline (IP2) software. All cysteine residues were specified with a static modification for carbamidomethylation (+57.0215 Da) and one oxidized methionine residue per peptide (if found) was allowed as a variable oxidation (+15.9949 Da). In addition, peptides were required to have at least one tryptic terminus. Each dataset was simultaneously searched for both light and heavy isotopologues of the same peptide by specifying the mass shift of heavy residues as static modifications on lysine (+8.0142 Da) and arginine (+10.0082 Da) in a coupled ‘heavy’ search. The precursor ion mass tolerance for a minimum envelope of three isotopic peaks was set to 50 ppm, the minimum peptide length was six residues, the false-positive rate was set at 1% or lower and at least 2 peptides of a protein must be detected in order to be advanced to the next step of analysis.


Heavy and light parent ion chromatograms associated with successfully identified peptides were extracted and compared using in-house software (CIMAGE). Briefly, extracted MS1 ion chromatograms (+10 ppm error tolerance of predicted m/z) from both ‘light’ and ‘heavy’ target peptide masses (m/z) were generated using a retention time window (+10 min) centered on the time when the peptide ion was selected for MS/MS fragmentation (minimum 3 MS1's per peak), and subsequently identified. Next, the ratio of the peak areas under the light and heavy signals (signal-to-noise ratio>2.5) was calculated. Computational filters used to ensure that the correct peak-pair was used for quantification include a co-elution correlation score filter (R2≥0.8), removing target peptides with bad co-elution profile, and an ‘envelope correlation score’ filter (R2>0.8) that eliminates target peptides whose predicted pattern of the isotopic envelope distribution does not match the experimentally observed high-resolution MS1 spectrum. In addition, peptides detected as ‘singletons,’ where only the heavy ion of a peptide pair was identified, but that cleared all other filtering parameters, are given a default assigned ratio of ‘20,’ which is defined as any measured ratio that is ≥20 and is the maximum ratio reported here.


Example 8—Proteomic Analysis of Probe-Labeled Proteins by Mass Spectrometry

Median SILAC ratios were filtered to ensure that each protein ratio was resultant from three or more unique and quantified peptides and that the combined peptide ratios possessed a standard deviation of less than 60% of the median; if greater, the combined ratio was assigned the lowest quantified peptide value. SILAC ratios meeting these criteria were then combined with replicate data sets from the same probe, cell line and experimental conditions. Identification of probe targets enriched in fragment probe versus control probe experiments in HEK293T cells represent averaged data from at least two biological replicate experiments and K562 data in single replicate experiments. Identification of probe targets from comparison of probe versus probe experiments and from fragment probe competition experiments represent averaged values of at least two biological replicate experiments.


In order to be classified as a probe target, proteins must (1) comply with the above criteria and (2) be enriched greater than 5-fold over control probe 1 (SILAC>5) in at least two different probe data sets (200 μM). If protein is enriched 5-fold or more by only one probe, then it had to be quantified in three or more independent experiments. In order to be included in probe-versus-probe comparisons, protein must abide by the above criteria and also be a target for at least one of the two probes, as designated above. For competition experiments, proteins (1) must be designated probe targets for the probe being used, as described above, (2) competed greater than 3-fold (competition SILAC ratio>3) unless otherwise noted, and (3) must have SILAC ratios derived from three or more quantified peptides.


Example 9—Fragment Probe Target Meta-Analysis

Custom python scripts were used to compile functional annotations of final probe targets available in the UniProtKB/Swiss-Prot Protein Knowledge database. Probe targets were queried against the DrugBank database (Version 4.2) and fractionated into DrugBank and non-DrugBank proteins. Functional keywords assigned at the protein level were collected from the Uniprot database and the two DrugBank and non-DrugBank categories were further classified into protein functional classes. Membrane proteins were defined as proteins possessing known or predicted transmembrane domains (UniProt analysis), and the remaining targets were considered soluble. Heatmaps were generated using RStudio software.


Example 10—Cell Treatments and Preparation for MS-Based Analyses of Probe-Modified Peptides

Preparation and analysis was adapted from methods previously reported. In brief, for global mapping of fragment probe-modified peptides, separate 10 cm dishes of cells were treated with probes (200-250 μM) in 3.0 mL of DMEM (serum-free) and (if applicable) competitor ligands, proteomes harvested and subjected to click chemistry conditions with either light or heavy isotopically labeled biotin-TEV-azide (10 μL of 5 mM stocks in DMSO, final concentration=100 μM), TCEP, ligand and CuSO4 as detailed above. The samples were allowed to react for 1 h at which point the samples were centrifuged (16,000 g, 5 min, 4° C.). The resulting pellets were sonicated in ice-cold methanol (500 μL) and the resuspended light- and heavy-labeled samples were then combined and centrifuged (16,000 g, 5 min, 4° C.). The pellets were then solubilized in PBS containing 1.2% SDS (1 mL) with sonication and heating (5 min, 95° C.). Samples were transferred to falcon tubes containing DPBS (5 mL), to which a 100 μL of streptavidin-agarose beads slurry was added. After incubation, the beads (3 hr) were pelleted by centrifugation (1,400 g, 3 min) and were washed (2×10 mL PBS and 2×10 mL water). The beads were transferred to eppendorf tubes with 1 mL DPBS, centrifuged (1,400 g, 3 min), and resuspended in PBS containing 6 M urea. To this was added 10 mM DTT (25 μL of a 200 mM stock in water) and the beads were incubated at 65° C. for 15 mins. 20 mM iodoacetamide (25 μL of a 400 mM stock in water) was then added and allowed to react at 37° C. for 30 mins with shaking. The bead mixture was diluted with 900 μL PBS, pelleted by centrifugation (1,400 g, 3 min), and resuspended in 200 μL 2M urea (DPBS) containing trypsin and CaCl2 as described above. The beads were separated from the digest by centrifugation (1,000 g, 1 min), washed (2×1 mL PBS and 2×1 mL water) and then transferred to fresh eppendorfs with 1 mL water. The washed beads were washed once further in 150 μL TEV buffer (50 mM Tris, pH 8, 0.5 mM EDTA, 1 mM DTT) by centrifugation (1,400 g, 3 min) and the resuspended in 150 μL TEV buffer. 5 μL TEV protease (80 μM) was added and the reactions were rotated overnight at 29° C. The TEV digest was separated from the beads by centrifugation (1,400 g, 3 min) and the beads were washed once with water (100 μL). The samples were then acidified to a final concentration of 5% (v/v) formic acid and stored at −80° C. prior to analysis.


The resulting probe-modified peptides were collected for MS analysis, which was performed as described above with differences in the salt bumps applied in the chromatographic gradients which in this case were 0%, 30%, 60%, 90% and 100% NH4OAc (500 μM). The protein identification searches of the MS data were performed with the following changes applied to identify the peptides modified with the corresponding fragment probe and the cleaved TEV tag. All amino acids were considered as possible residues for modification. To facilitate the computational searches, sets of up to 3 amino acids were searched using ProLuCID and filtered with DTASelect as described above. The mass of the modification used to search for probe-modified peptides was +665.4013 m/z for 8, +667.3264 m/z for 4, +665.3285 m/z for 3, +678.3602 m/z for 6, +680.4122 m/z for 9, +679.4179 m/z for 13, +755.3867 m/z for 2, +655.4170 m/z for 14, and +669.3598 m/z for 15, which are the masses for the corresponding probe plus the light TEV-tag and an additional +6.0138 m/z for the heavy counterpart. The isoTOP ratios for probe labeled peptides were quantified using the in-house software CIMAGE.


Example 11—Analysis of Probe Labeled Peptides

For protein mapping experiments, fragment probe-modified peptides were expected to show a ratio of heavy and light signals of ˜1.0 (0.5<ratio<2.0) and were required to have been designated an enriched target by the corresponding probe in whole-protein capture experiments. For each protein in the site-of-labeling dataset, the UniProtKB accession number was used to map and collect relevant structures from the RCSB Protein Data Bank (PDB) fulfilling the following criteria: structures determined by X-ray crystallography, wild-type protein, Homo sapiens as the sole source organism. For proteins with multiple available structures, custom R scripts were used to further filter the PDB files, privileging higher sequence coverage for isoTOP peptides (see Tables 1-3 for selected PDB accessions). Fpocket 2.0 was used to detect potential binding pockets for the resultant structures with all parameters set at recommended default. Pockets with volume less than 500 Å3 were removed from output prior to further analysis. Residues surrounding fpocket predicted binding pockets for each protein were collected to determine the number of residues overlapping with isoTOP peptides. For structures with multiple chains, the average number of overlapping residues for all chains possessing isoTOP peptide was used. Custom Python scripts were used to compile functional site annotations using the UniProtKB/Swiss-Prot Protein Knowledge database (release-2016_06). Relevant UniProt entries were searched for available functional residues, specifically for annotations regarding enzyme catalytic residues (active sites), substrate binding sites, and metal-binding sites. At the isoTOP peptide level, the distances between all possible atom pairs, consisting of one atom from isoTOP peptide and the other atom from a functional site, were calculated and the minimum distance was designated as the spatial distance between isoTOP peptide and functional sites. Annotated FFF-labeled peptides and corresponding analyses shown in Table 1-3.


Example 12—PPARγ Luciferase Reporter Assay

HEK293T cells were transiently co-transfected using Polyethylenimine (Sigma) with a UAS-Luciferase reporter and a vector expressing the heterologous GAL4 DNA binding domain (DBD) or a GAL4 DNA binding domain::PPARγ ligand binding domain (LBD) chimeric protein, and full-length PTGR2. 24 hr after transfection, cells were treated either with vehicle (DMSO), 15k-PGE2 (20 μM), or fragment compounds. Rosiglitazone (2 μM), a synthetic PPARγ ligand, was used as control. 16 hr after incubation, cells were lysed in Cell Culture Lysis Reagent (Promega) and luciferase activity measured using the Luciferase Assay System (Promega).


Example 13—Oxygen Consumption Rate Measurements

Palmitate-BSA oxidation measurements were performed using the Seahorse XFe96 Extracellular Flux Analyzer. Briefly, HSC5 cells were plated at 4.0×104 cells/well and incubated for 24 hr in a 37° C., 5% CO2 incubator. One hour prior to the XF assay, media was changed to 1× Krebs-Henseleit buffer (111 mM NaCl, 4.7 mM KCl, 2 mM MgSO4, 1.2 mM Na2HPO4, pH 7.4) with 2.5 mM glucose, 0.5 mM carnitine, and 5 mM HEPES. 20 min after media exchange, cells were treated with either vehicle (DMSO), 24 (100 μM) or 21 (100, 50, 20 and 5 μM respectively). After 40 min, cells were given palmitate:BSA (667 μM and 167 μM respectively) or BSA alone and the XF assay was started. Perturbation compounds (oligomycin 4 μM, FCCP 4 μM, RAA 2 μM) were prepared in 1× KH buffer and injected from the reagent ports automatically onto wells.


Example 14—Adipocyte Phenotypic Screen

3T3-L1 preadipocytes were induced to differentiate in the presence of 50 μM of each fragment probe. Rosiglitazone (2 μM) was used as a positive control. Media was replaced every two days and compounds refreshed. On day 8 of differentiation, cells were fixed with 4% PFA and stained with the fluorescent lipid stain Nile red (AdipoRed) and Hoechst for nuclei counterstain. Cells were imaged using a Celigo S Cell Imaging Cytometer (Nexcelom Bioscience) and compounds promoting increased lipid accumulation (i.e. fluorescence) identified. Hits were validated at two concentrations (10 μM and 50 μM) in 12-well plate format. To prepare primary brown preadipocytes, interscapular fat depots of neonatal mice were digested for 40 min at 37° C. with 1.5 mg/mL collagenase type I in 61.5 mM NaCl, 2.5 mM KCl, 0.65 mM CaCl2, 2.5 mM glucose, 50 mM Hepes, 50 μg/mL penicillin-streptomycin and 2% (wt/vol) BSA. Cells were next filtered through a 100 μm cell strainer, plated in DMEM supplemented with 20 mM Hepes, 20% FBS, and penicillin/streptomycin, and grown to confluency. Cells were induced to differentiate in DMEM with 10% FBS, dexamethasone (1 μM), IBMX (0.5 mM), insulin (1 μg/ml), triiodothyronine (1 nM), and either DMSO (0.1%), 25 (10 μM), or rosiglitazone (2 μM). Two days later, media was switched and differentiating cells were maintained in DMEM, 10% FBS, insulin, triiodothyronine, and experimental compounds. Media was refreshed every 2 days. Human mesenchymal stem cells were maintained in DMEM supplemented with 10% FBS and grown to confluence. Two days after confluence, cells were induced to differentiate in media containing DMEM supplemented with 10% FBS, dexamethasone (1 μM), IBMX (0.5 mM), insulin (1 μg/ml), indomethacin (125 μM), and either DMSO (0.1%), 25 (10 μM), or rosiglitazone (2 μM) for 2 days. Media and compounds were refreshed every 2 days, alternating complete differentiation media with maintenance media (DMEM 10% FBS supplemented only with insulin) for 18 days.


Example 15—RNAseq Analysis

For RNA-seq, 0.6-1×106 cells were collected in Trizol (Invitrogen) and total RNA was extracted using Direct-Zol RNA extraction kit (Zymo Research). PolyA+ RNA was fragmented and prepared into strand-specific libraries using the Illumina True-seq stranded RNA kit (Illumina) and analyzed on an Illumina HiSeq 2500 sequencer. Libraries were sequenced using single-end 50 bp reads at a depth of 10-15 million reads per library. Single-end sequencing reads were mapped to the mouse reference genome (mm9, NCBI37) using STAR (version 2.3.0.c, default parameters). Only reads that aligned uniquely to a single genomic location were used for downstream analysis (MAPQ>10). Gene expression values were calculated for read counts on exons of annotated RefSeq genes using HOMER. Differentially expressed genes between GFP- and PGRMC2-overexpressing cells were calculated from three replicates per condition using EdgeR and a threshold of adjusted p-value<0.05 was used to call differentially expressed genes. Gene expression values are shown as read counts normalized to 107 mapped reads. Differentially expressed genes were used for pathway analysis. Gene ontology functional enrichment analysis was performed using Ingenuity Pathway Analysis (Qiagen). Heatmaps were generated using RStudio software (package ‘gplots’). RNA-seq data have been deposited in the GEO repository under accession number GSE90731.


Example 16—Cell Viability Assay

Cells were seeded in white-opaque 96-well plates in full growth media at a density of 6,000 cells/well (100 μL) and were allowed to grow for 14 hrs at 37° C. in a humidified 5% CO2 atmosphere. The cells were then treated with compounds or DMSO (1% DMSO final for all wells) in triplicate and incubated at 37° C. in a humidified 5% CO2 atmosphere for 45 min. Note, all photoaffinity probe incubations for MS- and gel-based experiments were performed for 30 min. Cell viability was determined using the luciferase-based CellTiter-Glo Luminescent Cell Viability Assay (Promega).


Example 17—Cloning and Transient Overexpression of Proteins in HEK293T Cells

Full-length genes encoding proteins of interest were PCR amplified from a cDNA library derived from low-passage HEK293T cells. Gene products were cloned into the pRK5 vector with a C-terminal FLAG tag using SalI (N-terminal) and NotI (C-terminal) restriction sites. All clone sequences were verified. To recombinantly overexpress proteins used in in situ treatments, HEK293T cells were grown to 40-60% confluency under standard growth conditions in 6-well (for gel-based experiments) or 10 cm tissue culture plates (for MS-based experiments) and transiently transfected with 1-3 μg of desired construct (6-well plates) or 5 μg (10 cm plates) using polyethyleneimine ‘MAX’ (MW 40,000, PEI; Polysciences, Inc.). ‘Mock’ transfected cells were transfected with a vector containing METAP2 for 48 hr. Human SLC25A20 in a pCMV6-Entry vector with a C-terminal DDK tag was purchased from Origene. Empty pCMV-Entry vector was used as ‘mock’ control for experiments with SLC25A20. The pRK5 vector was a gift from David Sabatini (MIT).


Example 18—Lentiviral Infection

3T3-L1 preadipocytes were infected overnight at 70% confluency in 10 cm Petri dishes with lentiviruses expressing a non-targeting scramble shRNA or two different shRNAs against mouse PGRMC2. Two days after infection, cells were re-plated into 12-well plates and grown to confluence. Two days after confluence, cells were induced to differentiate in presence of dexamethasone (1 μM), IBMX (0.5 mM), insulin (1 μg/ml) and either DMSO (0.1%), test compound (10 μM), or Rosiglitazone (2 μM). Cells were stained at day 7 of differentiation with Nile Red and Hoechst, imaged and harvested for RNA and protein extraction. For rescue experiments, scramble and PGRMC2 knockdown cells were co-infected with lentiviruses over-expressing human V5-tagged PGRMC2. 3T3-L1 preadipocytes stably overexpressing GFP or hPGRMC2 were selected with blasticidin (20 μg/ml) for 10 days and maintained in culture in 10% BCS.


Example 19—Confocal Imaging of PGRMC2

For immunostaining, cells were grown on gelatin-coated cover glasses, fixed in 4% PFA, permeabilized in 0.5% Triton-PBS and blocked with 5% FBS-PBS solution. Rabbit anti-PGRMC2 (Bethyl Labs) and mouse KDEL monoclonal antibody (SEQ ID NO: 925) (clone 10C3, Enzo Life Sciences) were diluted at 0.4 μg/ml and 1 μg/ml using blocking buffer and samples were incubated overnight at 4° C. in a humidified chamber. Alexafluor-488 anti-rabbit and alexafluor-568 anti-mouse secondary antibodies were diluted to 1:500 dilution in blocking buffer and samples incubated for 1 hour at RT. Nuclei and actin filaments were stained by Hoechst and Acti-stain 670 phalloidin dyes, respectively. Cells were washed 3 times with PBS for 10 minutes after each incubation. Images were acquired with a Zeiss LSM 710 laser scanning confocal microscope and analyzed with IMARIS (Bitplane Inc.) and Adobe Photoshop CS3 (Adobe Systems Incorporated) software.


Example 20—Western Blot Analysis

After scanning for fluorescence, proteins were transferred to a nitrocellulose membrane in Towbin buffer, the membrane was blocked for ˜1 hr at ambient temperature with 5% nonfat dry milk (w/v) or 5% BSA in Tris-buffered saline with Tween 20 (TBST) and incubated with primary antibodies in the same solution overnight at 4° C. The blots were washed (3×5 min, TBST), incubated with secondary antibodies (IRDye 800CW or HRP-conjugated anti-mouse and anti-rabbit) in milk or BSA for 1 hr at ambient temperature, washed (3×5 min, TBST), rinsed in water and visualized on a LICOR Odyssey Scanner or resolved by film exposure.


Example 21—Gene Expression Analysis

Total RNA was isolated from cells using Direct-zol™ RNA MiniPrep Plus (Zymo Research). Taqman-based quantitative real-time PCR was performed using the SuperScript III Platinum One-Step qRT-PCR reagent (Thermo Fisher Scientific). Samples were run in triplicate as multiplexed reactions and normalized to an internal control (36B4; acidic ribosomal phosphoprotein P0 mRNA).


Example 22—In Vitro LCMS-Based Activity Assay for PTGR2

Aliquots (1 μL) of test compounds dissolved in DMSO were transferred to 1.5 mL eppendorf tubes followed by addition of recombinant human PTGR2 (44 μL, 200 nM final concentration) in freshly prepared reaction buffer (Tris Buffer, 1 mM EDTA, 50 μM TCEP, 300 μM NADPH). The resulting mixture was vortexed and then incubated at 37° C. for 20 min. Next, a 5 μL solution of 15-keto-PGE2 substrate (20 μM final concentration) in reaction buffer was added and the reaction was allowed to proceed for 30 min at 37° C. Reactions were quenched by the addition of 0.5% AcOH in ethyl acetate (800 μL), water (300 μL) and 100 μL of internal standard PGE2-d4 (30 pmol/sample) dissolved in 1:1 methanol/water. Phases were separated by centrifugation and the organic layer was collected and dried under a stream of N2, then stored at −80° C. until analysis. Directly prior to analysis, samples were reconstituted in 100 μL of MeCN:H2O (1:1, v/v) and analyzed by LC/MS/MS. All conditions were performed in triplicate and repeated at least three independent times.












LCMS Conditions for prostaglandin measurements
















Instrument
Agilent 6460 Triple Quadrupole LC/MS system


Column
Kinetex 5 μm C18 100 A, 50 × 4.6 mm column


Injection
15 μL


Gas temperature
350° C.


Gas flow
9 L/min


nebulizer
35 psi


capillary
4000 V positive/4000 V negative


MRM scan type
300 delta EMV (+)


Mobile Phase A
70:30:0.1 H2O/Acetonitrile/Formic acid


Mobile Phase B
50:50:0.1 Isopropyl Alcohol/Acetonitrile/Formic Acid









The following MS parameters were used to measure the indicated metabolites by MRM (precursor ion, product ion, collision energy, polarity): PGE2-d4 (355, 275, 18), 13,14-dihydro-15-keto-PGE2 (351, 333, 18) and 15-keto-PGE2 (349, 161, 20). 15-keto-PGE2 and 13,14-dihydro-15-keto-PGE2 levels were quantified by determining peak areas in relation to internal standard PGE2-d4. Non-deuterated 15-keto-PGE2 and 13,14-dihydro-15-keto-PGE2 standards were used to confirm retention time and fragmentation.


Chromatography Method














Time (min)
B (%)
Flow rate (mL/min)

















0.0
0
0.6


1.0
0
0.6


2.0
20
0.6


4.0
20
0.6


7.0
75
0.6


7.2
100
0.6


11.0
100
0.6


11.1
0
0.6


13.0
0
0.6











    • To minimize carryover, LC solvents were cycled between 100% Mobile Phase A and 100% Mobile Phase B over 5 min after each run.





Example 23—LCMS Analysis of Acylcarnitines in HSC-5 Cells

HSC-5 cells were seeded in 10 cm plates and grown to ˜90% confluency. Media was aspirated, cells were washed carefully with DPBS (3 mL) and resuspended in freshly-prepared serum-free IMDM media containing test compound(s) or vehicle. After incubation at 37° C. for 3 hr, the media was removed and cells were washed with cold DPBS (2×3 mL). Cells were scraped in 4 mL cold DPBS, transferred to a falcon tube and centrifuged at 2000 rpm for 8 min, and resuspended in 1 mL cold DPBS. Cells were lysed using a probe sonicator, and 1 mL of lysates normalized to 1.5 mg/mL were transferred to 2-dram glass vials. MeCN (3 mL) containing acyl carnitine internal standard mix (Cambridge Isotope Laboratories) was added to lysates and vigorously vortexed. Internal standards include 2H9-carnitine (2.28 nmol); 2H3-acetyl carnitine (C2, 570 pmol); 2H3 propionyl carnitine (C3, 120 pmol); 2H3 butryl carnitine (C4, 120 pmol); 2H9 isovaleryl carnitine (C5, 120 pmol); 2H3 octanoyl carnitine (C8, 120 pmol); 2H9 myristoyl carnitine (C14, 120 pmol); 2H3 palmitoyl carnitine (C16, 240 pmol). Samples were centrifuged at 1000 rpm for 5 min to pellet insoluble precipitate, and the remaining eluent carefully transferred to fresh 2-dram vials to avoid disturbing the precipitate. The eluent was concentrated under a stream of N2, and samples were stored at −80° C. until analysis. Directly prior to analysis, samples were reconstituted in 500 uL of MeCN:H2O (1:1, v/v) and analyzed by LC/MS/MS. The indicated acyl carnitines were quantified by measuring the area under the peak relative to an internal standard (2H3 palmitoyl carnitine for C16, C18 and C18:1; 2H9 myristoyl carnitine for C12 and C14; 2H3 octanoyl carnitine for C5DC/C10-OH and C4DC; 2H9 isovaleryl carnitine for C5 and C7).












LCMS Conditions for acyl carnitine measurements
















Instrument
Agilent 6460 Triple Quadrupole LC/MS system


Column
Kinetex 5 μm C18 100 A, 50 × 4.6 mm column


Injection
15 μL


Gas temperature
350° C.


Gas flow
9 L/min


nebulizer
35 psi


capillary
4000 V positive/4000 V negative


MRM scan type
300 delta EMV (+)


Mobile Phase A
95:5:0.1 H2O/Methanol/Formic Acid


Mobile Phase B
60:35:5:0.1 Isopropyl Alcohol/Methanol/H2O/Formic



Acid










Chromatography Method:














Time (min)
% B
Flow (mL/min)

















0
0
0.1


5
0
0.1


5.01
0
0.4


7
0
0.4


30
100
0.4


30.01
100
0.5


38
100
0.5


38.01
0
0.5


42
0
0.5


46
100
0.5


50
100
0.5


54
0
0.5


57
0
0.5


57.01
0
0.4


59
0
0.1











    • To minimize carryover, LC solvents were cycled between 100% Mobile Phase A and 100% Mobile Phase B over 5 min after each run.


      Transition Table:















Transition Table:










Acyl Carnitine
Precursor → product ion







C12
344.2 → 85.1



C14
372.3 → 85.1



C16
400.3 → 85.1



C18:1
426.3 → 85.1



C18
428.3 → 85.1



C4DC
318.2 → 85.1



C5
246.1 → 85.1



C10-OH
332.2 → 85.1



C7
274.1 → 85.1



D3 acetyl
207.1 → 85.1



D3 butyryl
235.1 → 85.1



D3 octanoyl
291.2 → 85.1



D3 palmitoyl
403.3 → 85.1



D3 propionyl
221.1 → 85.1



D9 isovaleryl
255.1 → 85.1



D9 myristoyl
381.3 → 85.1










Example 24—Quantification and Statistical Analysis

All data fitting and statistical analysis performed using GraphPad Prism version 6.00 for Windows, GraphPad Software, La Jolla Calif. USA, www.graphpad.com. Statistical values including the exact n and statistical significance are also reported. Probe binding blockade and PTGR2 inhibition curves are plotted as mean±SD (n=3 or 4 per group) for a representative biological replicate using a variable slope (four parameter) non-linear fit. Gene expression data are presented as mean±SD (n=3 per group). HSC5 metabolite data are shown as mean±SD (n=3 per group). Statistical significance was defined as P<0.05 and determined by 2-tailed Student t tests, or two-way ANOVA with Bonferroni's post-tests.


Example 25—Data and Software Availability

Data Resources: The RNA-seq data reported has been deposited in the NCBI under the ID code GEO: GSE90731.


Software: All custom scripts used have been deposited to GitHub (https://github.com/Chymichead/FBDDinCell).


Example 26—Profiling Small-Molecule Fragment-Protein Interactions in Human Cells

A small library of 14 “fully functionalized” fragment (FFF) probes were synthesized as described in Example 30 with each member possessing a variable small-molecule fragment conjugated to a constant tag bearing an alkyne and photoactivatable diazirine group (FIG. 1A). The variable fragment groups had an average molecular weight of 176 Da and were selected because they represent structural motifs found in many biologically active natural products and clinically approved drugs (FIG. 1B). The FFF probes were initially assessed using gel-based profiling (FIG. 1D) by treating HEK293T cells with each fragment probe (20 μM, 30 min), followed by exposure to UV light (10 min, 4° C.), cell lysis, coupling to a rhodamine (Rh)-azide tag using copper-catalyzed azide alkyne cycloaddition (CuAAC) chemistry, and separation and visualization of fragment-modified proteins by SDS-PAGE coupled with in-gel fluorescence scanning. Despite the structural simplicity and small size of the variable fragment groups, each probe produced marked and differential concentration-dependent protein labeling in HEK293T cells (FIG. 1C, FIG. 1E, and FIG. 1F). Negligible protein labeling was observed in the absence of UV light (FIG. 1C and FIG. 1E), exemplifying that the fragment-protein interactions correspond to reversible binding events that were converted to covalent adducts by photoreactivity. Exposure of cells to UV light from 5-60 min produced equivalent protein labeling (FIG. 1G), while washing cells prior to UV exposure substantially decrease FFF probe labeling for most, but not all proteins (FIG. 1H). Finally, a “fragment-less” probe bearing a methyl group (1) produced much less protein labeling, exemplifying that the variable group of FFF probes is critical for protein binding and further that 1 serves as a useful control probe for the chemical proteomic mapping of fragment-protein interactions in cells.


Example 27—a Global Analysis of Fragment-Protein Interactions in Cells

Fragment-binding proteins in human cells were globally mapped by quantitative chemical proteomics following the general protocol shown in FIG. 1A. Each FFF probe was initially compared to control probe 1 in pairwise experiments using isotopically light and heavy amino acid-labeled HEK293T cells, where proteins strongly enriched by the test FFF probe over 1 (light:heavy ratios>5) were designated as test probe targets. Adhering to the general principles of FBLD, where a relatively small number of fragments are screened at high concentrations against proteins, 11 FFF probes (2-4, 6, 8-9, 11-15) were analyzed at 200 μM each (30 min incubation; n=2-3 per probe) in HEK293T cells, with a subset of probes also being evaluated in K562 cells. Under these conditions, FFF probes displayed little to no cytotoxicity (FIG. 2K) and interacted with an extensive array of proteins. To minimize false-positives, proteins were only designated as fragment targets if they were detected with at least three unique, quantifiable peptides and enriched (>five-fold over 1, FIG. 2L) by more than one FFF probe, or, if enriched by only one probe, then required to be quantified in at least three independent experiments. Control experiments were also conducted with representative probes to confirm that targets were enriched in a UV-dependent manner and showed SILAC ratios of ˜1.0 in experiments where heavy and light cells were treated with equal concentrations of the same FFF probe (FIG. 2M, FIG. 2N).


In aggregate, more than 2000 protein targets were identified for the FFF probes, which individually displayed a broad range of protein enrichments (FIG. 2A, FIG. 2O). When tested at lower concentrations (20 μM), FFF probes enriched fewer protein targets (FIG. 2O, FIG. 2P), confirming that the extent of proteome engagement depends on probe concentration. A review of expression-based proteomics data generated in HEK293T cells revealed that the protein targets of FFF probes spanned more than five orders of magnitude in abundance and this range bracketed the median protein abundance value in HEK293T cells (FIG. 2Q), exemplifying, along with other analyses (FIG. 2R, FIG. 2S), that FFF probes enriched proteins across a broad range of expression.


To more quantitatively assess the structure-activity relationships (SARs) emerging from the initial FFF probe experiments, additional studies were performed comparing the relative protein interaction profiles of FFF probes, wherein isotopically light and heavy cells were treated with two different probes (probe-vs-probe comparisons) and processed as shown in FIG. 1A. These experiments exemplified that proteins preferentially enriched by one FFF probe relative to another in probe-vs-probe comparisons were also often preferentially enriched by the same probe in original comparisons to control 1 (FIG. 2B-FIG. 2F). The probe-vs-probe comparisons also revealed that most of the proteins showing broad interaction potential across the fragment library in probe-vs-control 1 experiments (light gray sub-bars, FIG. 2C) still exhibited preferential interactions with one or a subset of FFF probes (FIG. 2G-FIG. 2J).


The fragment interactions profiles were verified for representative proteins by recombinant expression in HEK293T cells. It was found that the fragment interaction profile for each recombinant protein, as measured by gel-based profiling (FIG. 1D), matched that of its endogenous form as determined by quantitative MS-based proteomics, with each target showing a strong preference for a distinct fragment probe (FIG. 2T, FIG. 2U).


Example 28—Types of Proteins and Protein Sites Targeted by Fragments

The fragment probes targeted both membrane and soluble proteins (FIG. 3H), and only a small fraction (17%) of these proteins had known ligands as estimated by their presence in the DrugBank database (FIG. 3A). This subset of previously liganded proteins was mainly enzymes (FIG. 3B). In contrast, the much larger subset of fragment probe targets (83%) not represented in DrugBank showed a broader functional distribution, with a reduced fractional representation of enzymes counterbalanced by expanded coverage of channels/transporters/receptors, transcription factors/regulators, and uncategorized proteins (FIG. 3B). A greater percentage of targets enriched by low (20 μM, 24%) versus high (200 μM, 12%) concentrations of fragments were found in DrugBank (FIG. 3A), exemplifying that the capacity to screen higher concentrations of fragment probes expanded the scope of newly discovered ligandable proteins in human cells.


Considering that the chemical proteomic results provided the first evidence of ligandability for many protein targets, the fragment binding sites on these proteins were aimed to be identified next. Determining the sites of photoreactive probe binding to proteins is technically challenging, but the simple structures of FFF probes, along with the implementation of advanced chemical proteomic methods for isotopically labeling small-molecule probe-modified peptides is advantageous. Using these methods, over 800 unique peptides modified by one or more FFF probes were identified that collectively derived from 443 proteins (FIG. 3I and Tables 1-3) in HEK293T cells. Fragment-modified peptides were found in both membrane and soluble proteins (FIG. 3I), and, while many proteins were targeted by multiple FFF probes at the same site (FIG. 3J), in the substantial majority of cases, only a single fragment-modified peptide was identified per protein (FIG. 3C).


Using the pocket-detection algorithm fpocket, for the 186 proteins harboring fragment-modified peptides for which crystal structures were also available (FIG. 3I), it was found that the vast majority of fragment-modified peptides (˜80%) overlapped directly and substantially with predicted ligand-binding pocket residues (FIG. 3D and FIG. 3K and Tables 1-3). For proteins possessing multiple distinct fragment-modified peptides, it was found that these peptides often mapped to a shared predicted pocket (FIG. 3L). For proteins with annotated functional residues (e.g., active site residues; 77 total proteins), approximately 60% of the probe-modified peptides were within 6 angstroms of a functional residue (FIG. 3M).


Many of the proteins with mapped fragment-binding sites and crystal structures corresponded to enzymes (FIG. 3N), but non-enzymes of note included: i) the 14-3-3 adapter protein YWHAE, which was modified by probe 13 on a peptide (aa 197-215) that lines the primary interaction cleft for binding the oncoprotein myeloid leukaemia factor 1 (MLF1) (FIG. 3E); and ii) the proapoptotic effector protein BAX, which was also modified by probe 13 on a peptide (aa 66-79) within a groove that binds the BH3-domain containing activators Bim and Bid (FIG. 3F). Among the enzymes with mapped fragment-binding sites, the cysteine protease cathepsin B (CTSB) was targeted by probe 9 at an active-site proximal peptide (aa 315-332), and this interaction was blocked by the CTSB inhibitor Z-FA-FMK (FIG. 3G). Fragment-modified peptides at allosteric or secondary ligand-binding sites were also identified, including, for instance, a pocket on α-galactosidase (GLA) proposed to constitute a site for pharmacological chaperoning (FIG. 3O). Lastly, little overlap (<15%) was found between FFF targets and proteins liganded by cysteine-reactive electrophilic fragments (FIG. 3P). Even if this analysis was restricted to proteins that contained IA-reactive, the overlap between FFF targets and electrophilic fragments targets remained modest (˜28%) (FIG. 3P). These results exemplify that reversible and irreversible fragments interact with largely distinct subsets of the human proteome.


Example 29—Functional Characterization of Fragment-Protein Interactions

FBLD typically identifies low-affinity (high μM to mM) hit compounds that often require substantial, structure-guided medicinal chemistry optimization to improve potency and selectivity. As an alternative and complementary approach to structure-based ligand development, the proteome-wide, cell-based fragment screens are adapted to identify higher potency ligand-protein interactions. This goal is accomplished by screening focused libraries of small molecules containing representative fragment cores elaborated with additional “binding” substituents for competitive blockade of FFF probe-protein interactions in cells (FIG. 4A). Elaborated competitor molecules were purchased or synthesized for three FFF probes—3, 6, and 8 (FIG. 4B and FIG. 4I-FIG. 4K)—and treated cells with these competitors (17 total, each screened versus DMSO as a control) in eight-fold excess over the corresponding FFF probe (160 μM competitor, 20 μM FFF probe), after which FFF-modified proteins enriched and identified as shown in FIG. 4A. A total of 100 competed targets—defined as proteins that displayed substantial reductions (>3-fold) in signal in small-molecule competitor (heavy) versus DMSO (light) treated cells—were identified (FIG. 4C-FIG. 4F, FIG. 4L). Competed proteins showed widely varied SARs that ranged from broad interactions with several (>5) competitors to preferential binding to a single competitor (FIG. 4D).


Another 215 competed targets were mapped in experiments where a subset of the competitors (five total) was tested against higher concentrations of the corresponding FFF probes (200 μM) (FIG. 4M). A greater representation of DrugBank proteins was noted for competed targets identified with low (20 μM) versus high (200 μM) concentrations of FFF probes (43% and 20%, respectively) (FIG. 4E). These results exemplify that performing small-molecule competition studies with higher concentrations of FFF probes, where a much greater proportion of probe targets are enriched and quantified (FIG. 2O), increases not only the total number of identified competed protein targets, but also the fraction of these targets that represent heretofore unliganded proteins. Finally, the competed protein targets exemplified a broad functional class distribution generally matching that found for the greater collection of FFF targets (FIG. 4F), exemplifying that high-occupancy small-molecule interactions were not biased toward a specific category of protein in human cells.


For determining if the discovered small-molecule ligands affected protein functions, one enzyme (PTGR2) and one transporter (SLC25A20) were selected for which distinct high-occupancy ligands were identified in competitor profiling experiments (FIG. 4G, FIG. 4H). These proteins also have important roles in human metabolism, but lack selective, cell-active inhibitors. Gel-based competitor profiling of recombinant PTGR2 and SLC25A20 (FIG. 5H) exemplified the preferential binding of ligands determined by MS-based proteomics (20 for PTGR2 and 21 for SLC25A20; FIG. 4G, FIG. 4H). Competitor molecules containing only the fragment head groups of FFF probes did not appreciably block probe labeling of PTGR2 and SLC25A20 (FIG. 5I). These results exemplify that chemical proteomics discover weak fragment-protein interactions in cells and, through competitive profiling of structurally elaborated fragment analogues, efficiently identify compounds that display superior protein binding.


PTGR2, or prostaglandin reductase 2, catalyzes the NADPH-dependent reduction of 15-keto-PGE2 to 13,14-dihydro-15-keto-PGE2 and regulates adipogenesis through restricting 15-keto-PGE2 activity as a natural ligand for the nuclear receptor PPARγ. The only reported inhibitor of PTGR2 is the NSAID drug indomethacin, which exhibits a very weak in vitro IC50 value of ˜200 μM. Probe 8 modified two active site-proximal peptides in PTGR2, and these reactions were sensitive to competition by 20 (FIG. 5A), which also inhibited PTGR2-mediated reduction of 15-keto-PGE2 with an IC50 value of 79 μM (FIG. 5B). A screen of structural analogues of 20 exemplified that substitution of the lactam ring with a phenyl group and conversion of the piperidine core to a piperazine furnished 22 (FIG. 5C and FIG. 5J), which showed substantially increased potency (>20-fold) in assays measuring either competition of 8-labeling (FIG. 5C) or 15-keto-PGE2 reductase activity (IC50=0.6 μM; FIG. 5B) of recombinant PTGR2. An inactive analogue 23 was also identified, which did not affect labeling of PTGR2 by 8 (FIG. 5C and FIG. 5J) or PTGR2 catalytic activity (FIG. 5B).


Compound 22, but not 23, blocked FFF 8 labeling of endogenous PTGR2 in HEK293T cells with good potency (complete inhibition at 5 μM and ˜80% inhibition at 500 nM) and excellent selectivity (FIG. 5K-FIG. 5M). 22 did not cross-react with ZADH2 (FIG. 5L), a sequence-related homologue of PTGR2 that was a principal off-target of 20 (FIG. 4G). Addition of 22 also produced a concentration-dependent rescue of 15-keto-PGE2-dependent PPARγ transcriptional activity in cells recombinantly expressing PTGR2 (FIG. 5D); in contrast, the inactive control compound 23 showed no effect (FIG. 5D). Neither 22 nor 23 directly modulated PPARγ (FIG. 5N). The IC50 value displayed by 22 for inhibition of PTGR2 in cells was ˜0.7 μM (FIG. 5O), which meets the criterion for in situ activity of chemical probes put forth by the Structural Genomics Consortium.


SLC25A20 is a multi-pass transmembrane protein that transports long-chain acylcarnitines into the mitochondrial matrix, where these lipids provide fatty acid substrates for β-oxidation. There are no selective small-molecule probes to study SLC25A20 function in human cells. The quantitative MS experiments exemplified SLC25A20 as a primary target of the elaborated coumarin-based competitor 21 (FIG. 4H), and this interaction was confirmed for recombinant SLC25A20 in HEK293T cells, where 21 blocked FFF probe 3 labeling of SLC25A20 with an IC50 of ˜10 μM (FIG. 5E). The coumarin-based compound 24 was identified as an inactive control (FIG. 5E, FIG. 5P, and FIG. 5Q).


Compound 21 (0.2-100 μM, 3 h), but not the inactive control 24 (100 μM), produced a strong, concentration-dependent increase in long-chain (C16, C18, C18:1) acylcarnitines in human squamous cell carcinoma (HSC5) cells, with significant effects being observed for 21 at concentrations (20-50 μM; FIG. 5F), where 21 also substantially blocked probe 3 labeling of SLC25A20 in cells as measured by quantitative MS-based proteomics (FIG. 5Q, FIG. 5R). No changes were found in short- or medium-chain acylcarnitines (<C16), which are thought to cross the mitochondrial membranes without conversion to acylcarnitine esters. HSC5 cells treated with 21, but not 24 showed impaired capacity to oxidize palmitate (FIG. 5G and FIG. 5S). These results exemplify that 21 acts as a selective, cell-active inhibitor of SLC25A20, leading to disruption of mitochondrial long-chain acylcarnitine transport and FAO.


Example 30—Chemical Synthesis

Materials


Purchased starting materials were used as received unless otherwise noted. All moisture sensitive reactions were performed in an inert, dry atmosphere of nitrogen in flame dried glassware. Reagent grade solvents were used for extractions and flash chromatography. All amines used in probe library synthesis are available from commercial vendors. All fragment-based competitors were synthesized or purchased through Sigma Aldrich Market Select vendors. Reaction progress was checked by analytical thin-layer chromatography (TLC, Merck silica gel 60 F-254 plates). The plates were monitored either with UV illumination, or by charring with anisaldehyde (2.5% p-anisaldehyde, 1% AcOH, 3.5% H2SO4 (conc.) in 95% EtOH) or ninhydrin (0.3% ninhydrin (w/v), 97:3 EtOH-AcOH) stains. Flash column chromatography was performed using silica gel (F60, 40-63 um, 60 A). Preparative thin layer chromotography (PTLC) was carried out using glass backed PTLC plates 1000-2000 μm thickness (Analtech). The solvent compositions reported for all chromatographic separations are on a volume/volume (v/v) basis. 1H-NMR spectra were recorded at either 400, 500 or 600 MHz and are reported in parts per million (ppm) on the 6 scale relative to CDCl3 (δ 7.26) as an internal standard. Data are reported as follows: chemical shift, multiplicity (s=singlet, d=doublet, t=triplet, q=quartet, br=broad, m=multiplet), coupling constants (Hz), and integration. 13C-NMR spectra were recorded at either 100 or 125 MHz and are reported in parts per million (ppm) on the δ scale relative to CDCl3 (δ 77.00). Mass spectrometry data were collected on a HP1100 single-quadrupole instrument (ESI; low resolution) or an Agilent ESI-TOF instrument (HRMS).


Synthesis of 3-(3-(but-3-yn-1-yl)-3H-diazirin-3-yl)propanoic acid (30-3)



embedded image




embedded image


Ethyl 4-oxooct-7-ynoate (30-1) was synthesized following similar procedures previously reported. A solution of crude pent-4-ynal (17.2 g, 210 mmol) and ethyl acrylate (45.5 mL, 420 mmol, 2 equiv) in dioxane (250 mL) was added dropwise over a period of 4 h to a suspension of thiazolium salt catalyst (7.88 g, 29.2 mmol, 0.14 equiv), triethylamine (20.4 mL, 147 mmol, 0.7 equiv) and ethyl acrylate (45.5 mL) in dioxane (300 mL) at 80° C. under an atmosphere of nitrogen. The mixture was stirred and heated at 80° C. for 54 h and then volatiles removed by rotary evaporation. The residue was resuspended in methylene chloride (600 mL) and washed with aqueous 10% H2SO4 (150 mL), saturated aqueous NaHCO3 (250 mL) and brine (250 mL), then dried over anhydrous Na2SO4 and volatiles removed by rotary evaporation. Crude 30-1 was purified by flash column chromatography (100% hexanes→5%→10%→15%→20% ethyl acetate in hexanes), resulting in 30-1 as a light brown oil (10.7 g, 28%). 1H NMR (400 MHz, CDCl3) δ 4.20 (q, J=7.1, 2H), 2.86-2.76 (m, 4H), 2.68 (t, J=6.5, 2H), 2.54 (td, J=2.6, 7.3, 2H), 2.04 (t, J=2.7, 1H), 1.33 (td, J=2.2, 7.2, 4H). MS (ESI) calc'd for [M+H]+ C10H15O3+ 183.1, found 183.1.




embedded image


4-Oxooct-7-ynoic acid (30-2). To a solution of 30-1 (9.46 g, 52 mmol) in methanol (400 mL), added LiOH (6.2 g, 260 mmol, 5 equiv) and water (4.8 mL, 267 mmol, 5.1 equiv) and let resulting solution stir at room temperature for 15 h when TLC (3:1 hexanes/ethyl acetate) indicated the complete consumption of starting material. The solution was carefully acidified with aqueous HCl (6 M) until a pH of ˜3 was achieved. The resulting solution was then extracted with methylene chloride and the combined organic layers were dried over anhydrous Na2SO4 and volatiles were removed by rotary evaporation, resulting in 30-2 as a brown solid (7.6 g, 95%), which was used without further purification. 1H NMR (400 MHz, CDCl3) δ 2.90-2.57 (m, 6H), 2.48 (td, J=2.5, 7.3, 2H), 1.98 (t, J=2.5, 1H). MS (ESI) calc'd for [M−H]− C3H9O3 153.0, found 153.0.




embedded image


3-(3-(But-3-yn-1-yl)-3H-diazirin-3-yl)propanoic acid (30-3). A dried round bottom flask containing 30-2 (3.1 g, 20 mmol) cooled to 0° C. was charged with 7N NH3 in methanol (195 mL) and resulting solution was stirred at 0° C. under an atmosphere of nitrogen for 3 h. At this time, a solution of hydroxylamine-O-sulfonic acid (3.2 g, 28.2 mmol, 1.4 equiv) in anhydrous methanol (25 mL) was added dropwise via addition funnel at 0° C. The resulting solution was stirred at 0° C. for an additional 1 h and then allowed to warm to room temperature over 14 h. Resulting suspension was evaporated to dryness and resuspended in methanol (30 mL) and solid was filtered and washed several times with methanol. The combined filtrate was evaporated and resuspended in anhydrous methanol (180 mL), then cooled to 0° C. (protected from light). Diisopropylethylamine (7.8 mL) was added, followed by iodine (portion-wise), until a dark brown color persisted for more than 30 min, indicating total oxidation of diaziridine. The solution was then diluted with ethyl acetate (200 mL) and washed with aq. 1N HCl (200 mL), saturated aqueous Na2S2O3 (3×200 mL or until organic phase clarified) and brine. Combined aqueous phases were washed once with ethyl acetate and all organic layers were combined, then dried over anhydrous Na2SO4 and volatiles removed by rotary evaporation. Crude 30-3 was purified by flash column chromatography (100% hexanes→2%→5%→10%→20% ethyl acetate in hexanes), resulting in 30-3 as a colorless oil (889 mg, 27%). 1H NMR (400 MHz, CDCl3) δ 2.18 (t, J=7.7, 2H), 2.06-1.98 (m, 3H), 1.81 (t, J=7.7, 2H), 1.66 (t, J=7.4, 2H). 13C NMR (101 MHz, CDCl3) δ 178.63, 82.56, 69.37, 32.16, 28.21, 27.72, 27.46, 13.21. MS (ESI) calc'd for [M−H]− C8H9N2O2 165.1, found 165.1. Characterization matches that previously reported by Li et al Angew Chem Int Ed. (2013) 52, 8551-6.




embedded image


General Procedure 1: Coupling Procedure for the Synthesis of Simple Fragment-Based Probes


To a 4 mL vial containing 3-(3-(but-3-yn-1-yl)-3H-diazirin-3-yl)propanoic acid (30-3, 1 eq.) in DCM, commercially available amine (1.1 eq.), DIPEA (3.0 eq.) EDC-HCl (1.5 eq.), and HOBt (1.5 eq.) were added. Reaction mixtures were stirred at room temperature for 4 h to overnight when TLC indicated reaction completed. The crude samples were diluted with DCM and washed first with saturated aqueous NH4Cl (10 mL) and saturated aqueous NaHCO3 (10 mL), then dried over anhydrous Na2SO4 and volatiles removed by rotary evaporation. Crude products were purified by PTLC or flash column chromatography.


General Procedure 2: Coupling Procedure for the Synthesis of Photoaffinity Probe Library Used in Phenotypic Screening


A 4 mL vial was charged with 3-(3-(but-3-yn-1-yl)-3H-diazirin-3-yl)propanoic acid (10 mg, 0.060 mmol) or propionic acid (0.060 mmol), commercially available amine (0.060 mmol, 1 eq.), DIPEA (0.032 mL, 0.181 mmol, 3.0 eq.), HATU (34.3 mg, 0.090 mmol, 1.5 eq.) and DMF (1 mL). Reaction mixtures were stirred at room temperature for 4 h. The crude samples were diluted with methanol to a total volume of 1.6 mL then purified by reverse phase HPLC using following conditions:












LC/MS conditions for Library Characterization
















Column
Xbridge Prep C18 19 × 150 mm, 10 μm


Flow Rate
15 ml/min


Mobile Phase A
10 mM ammonium acetate in water


Mobile Phase B
Acetonitrile


Gradient
10% B to 100% B over 20 min followed by a 3 min



wash at 100% B and 2 min equilibration at 10% B.











embedded image


3-(3-(But-3-yn-1-yl)-3H-diazirin-3-yl)-N-methylpropanamide (1) General Procedure 1. Purified by SiO2 flash chromatography (Hexane/EtOAc, 7:3→1:1) to afford 1 as a colorless sticky solid (6 mg, 93%). 1H NMR (400 MHz, CDCl3) δ 5.56 (brs, 1H), 2.82 (d, J=2.2 Hz, 2H), 2.08-1.98 (m, 3H), 1.94 (m, 2H), 1.90-1.83 (m, 2H), 1.66 (t, J=7.4 Hz, 2H). 13C NMR (126 MHz, CDCl3) δ 172.12, 83.09, 69.57, 32.79, 30.58, 28.83, 28.25, 26.80, 13.68. HRMS (ESI-TOF) calcd for C9H14N3O 180.1131 (M+H+), found 180.1131.




embedded image


3-(3-(But-3-yn-1-yl)-3H-diazirin-3-yl)-N-(2-oxo-5-phenyl-2,3-dihydro-1H-benzo[e][1,4]diazepin-3-yl)propanamide (2) General Procedure 1. Purified by SiO2 flash chromatography (Hexane/EtOAc, 3:1) to afford 2 as a white sticky solid (22 mg, 76%). 1H NMR (400 MHz, CDCl3) δ 9.18 (s, 1H), 7.56-7.30 (m, 8H), 7.22-7.10 (m, 2H), 5.53 (d, J=7.9 Hz, 1H), 2.29-2.13 (m, 2H), 2.07-1.97 (m, 3H), 1.87 (t, J=7.4 Hz, 2H), 1.68 (t, J=7.4 Hz, 2H). 13C NMR (101 MHz, CDCl3) δ 171.34, 168.74, 138.54, 137.36, 132.21, 131.45, 130.69, 129.87, 128.25, 127.61, 124.18, 121.46, 82.76, 69.26, 67.13, 32.30, 30.37, 28.30, 27.87, 13.33. HRMS (ESI-TOF) calcd for C23H22N5O2 400.1768 (M+H+), found 400.1768.




embedded image


3-(3-(But-3-yn-1-yl)-3H-diazirin-3-yl)-N-(2-oxo-2H-chromen-6-yl)propanamide (3) General Procedure 1. Purified by SiO2 flash chromatography (Hexane/EtOAc, 3:2) to afford 3 as a yellow sticky solid (12.8 mg, 57%). 1H NMR (400 MHz, CDCl3) δ 8.01 (d, J=2.2 Hz, 1H), 7.69 (d, J=9.6 Hz, 1H), 7.62 (br s, 1H), 7.42 (dd, J=8.9, 2.5 Hz, 1H), 7.29 (d, 7.7 Hz, 1H), 6.44 (d, J=9.6 Hz, 1H), 2.16 (t, J=7.5 Hz, 2H), 2.04 (td, J=7.4, 2.6 Hz, 2H), 2.01-1.92 (m, 3H), 1.75 δ 1.62 (m, 2H). 13C NMR (101 MHz, CDCl3) δ 169.69, 160.82, 150.48, 143.49, 134.28, 123.57, 119.04, 118.58, 117.20, 82.67, 69.33, 32.44, 31.16, 28.09, 27.80, 13.29. HRMS (ESI-TOF) calcd for C17H16N3O3 310.1186 (M+H+), found 310.1186.




embedded image


N-(Benzo[b]thiophen-5-ylmethyl)-3-(3-(but-3-yn-1-yl)-3H-diazirin-3-yl)propanamide (4) General Procedure 1. Purified by SiO2 flash chromatography (Hexane/EtOAc, 3:1) to afford 4 as a off-white sticky solid (12.3 mg, 44%). 1H NMR (500 MHz, CDCl3) δ 7.84 (d, J=8.3 Hz, 1H), 7.73 (s, 1H), 7.46 (d, J=5.4 Hz, 1H), 7.30 (d, J=5.4 Hz, 1H), 7.26 (d, J=8.0 Hz, 1H), 5.80 (br s, 1H), 4.54 (d, J=5.7 Hz, 2H), 2.03-1.95 (m, 5H), 1.91 δ 1.86 (m, 2H), 1.64 (t, J=7.5 Hz, 2H). 13C NMR (126 MHz, CDCl3) δ 171.27, 140.32, 139.41, 134.65, 127.61, 124.71, 124.06, 123.22, 83.10, 69.62, 44.23, 32.82, 30.73, 28.75, 13.70. HRMS (ESI-TOF) calcd for C17H18N3OS 312.1165 (M+H+), found 312.1167




embedded image


N-(Benzofuran-5-ylmethyl)-3-(3-(but-3-yn-1-yl)-3H-diazirin-3-yl)propanamide (5) General Procedure 1. Purified by PTLC (Hexane/EtOAc, 3:1) to afford 5 as a off-white sticky solid (10.8 mg, 76%). 1H NMR (400 MHz, CDCl3) δ 7.63 (d, J=2.2 Hz, 1H), 7.54-7.49 (m, 1H), 7.46 (d, J=8.5 Hz, 1H), 7.21 (dd, J=8.5, 1.8 Hz, 1H), 6.74 (dd, J=2.2, 1.0 Hz, 1H), 5.75 (brs, 1H), 4.51 (d, J=5.7 Hz, 2H), 2.06-1.83 (m, 7H), 1.65 (t, J=7.4 Hz, 2H). HRMS (ESI-TOF) calcd for C17H18N3O2 296.1393 (M+H+), found 296.1392




embedded image


3-(3-(But-3-yn-1-yl)-3H-diazirin-3-yl)-N-(1-methyl-2-oxo-1,2,3,4-tetrahydroquinolin-7-yl)propanamide (6) General Procedure 1. Purified by SiO2 flash chromatography (Hexane/EtOAc, 3:1) to afford 6 as a light brown sticky solid (33 mg, 56%). 1H NMR (500 MHz, CDCl3) δ 7.43 (d, 2.4 Hz, 1H), 7.35 (brs, 1H), 7.29 (dd, J=8.7, 2.5 Hz, 1H), 6.91 (d, J=8.7 Hz, 1H), 3.33 (s, 3H), 2.99-2.89 (m, 2H), 2.76-2.65 (m, 2H), 2.19 (t, J=7.5, 6.7 Hz, 2H), 2.12 (td, J=7.4, 2.6 Hz, 2H), 2.07 (t, J=2.6 Hz, 1H), 2.02 (t, J=7.5 Hz, 2H), 1.76 (t, J=7.5 Hz, 2H). 13C NMR (126 MHz, CDCl3) δ 170.59, 169.79, 137.62, 133.17, 127.38, 120.28, 119.32, 115.38, 83.09, 69.69, 32.87, 31.99, 31.58, 29.98, 28.61, 28.23, 25.88, 13.71. HRMS (ESI-TOF) calcd for C18H21N4O2 325.1659 (M+H+), found 325.1658




embedded image


N-((1H-Indol-5-yl)methyl)-3-(3-(but-3-yn-1-yl)-3H-diazirin-3-yl)propanamide (7) General Procedure 1. Purified by PTLC (Hexane/EtOAc, 3:1) to afford 7 as an off-white sticky solid (12.2 mg, 57%). 1H NMR (500 MHz, CDCl3) δ 8.31 (brs, 1H), 7.57-7.50 (m, 1H), 7.36 (d, J=8.3 Hz, 1H), 7.22 (dd, J=3.2, 2.4 Hz, 1H), 7.11 (dd, J=8.3, 1.7 Hz, 1H), 6.53-6.51 (m, 1H), 5.71 (brs, 1H), 4.50 (d, J=5.4 Hz, 2H), 2.00 (td, J=7.4, 2.6 Hz, 2H), 1.98-1.92 (m, 3H), 1.89-1.84 (m, 2H), 1.64 (t, J=7.4 Hz, 2H). 13C NMR (126 MHz, CDCl3) δ 171.11, 135.68, 129.70, 128.47, 125.34, 122.74, 120.65, 111.79, 102.96, 83.14, 69.61, 44.83, 32.78, 30.79, 28.86, 13.70. HRMS (ESI-TOF) calcd for C17H19N4O 295.1553 (M+H+), found 295.1555.




embedded image


3-(3-(But-3-yn-1-yl)-3H-diazirin-3-yl)-1-(4-phenylpiperidin-1-yl)propan-1-one (8) General Procedure 1. Purified by SiO2 flash chromatography (Hexane/EtOAc, 3:1) to afford 8 as an off-white sticky solid (19.7 mg, 88%). 1H NMR (400 MHz, CDCl3) δ 7.31 (t, J=7.5 Hz, 2H), 7.25-7.16 (m, 3H), 4.85-4.69 (m, 1H), 3.92-3.83 (m, 1H), 3.10 (apparent td, J=13.3, 2.7 Hz, 1H), 2.73 (apparent tt, J=12.2, 3.7 Hz, 1H), 3.62 (apparent td, J=13.3, 2.8 Hz, 1H), 2.13-2.08 (m, 2H), 2.05 (td, J=7.5, 2.7 Hz, 2H), 1.98 (t, J=2.6 Hz, 1H), 1.92-1.84 (m, 2H), 1.69 (t, J=7.5 Hz, 2H) (rotomeric isomers present). 13C NMR (101 MHz, CDCl3) δ 169.33, 145.08, 128.59, 126.70, 126.54, 82.80, 69.12, 46.09, 42.75, 42.55, 33.81, 32.80, 32.57, 28.08, 26.99, 13.34. HRMS (ESI-TOF) calcd for C19H23N3O 310.1914 (M+H+), found 310.1916.




embedded image


3-(3-(But-3-yn-1-yl)-3H-diazirin-3-yl)-N-(4-(piperidin-4-yl)phenyl)propanamide (9) Followed General Procedure 1 for amide bond coupling. Crude 9 was then re-dissolved in DCM (1 mL) and TFA (0.3 mL) was carefully added. The resulting mixture was evaporated and crude 9 was purified by PTLC (DCM/MeOH, 6:1) yielding 9 as a white solid (22 mg, 67%, 2 steps). 1H NMR (500 MHz, CDCl3) δ 7.44 (d, J=8.1 Hz, 2H), 7.18 (d, J=8.2 Hz, 2H), 7.13 (s, 1H), 3.45 (d, J=12.7 Hz, 2H), 3.00-2.89 (m, 2H), 2.76-2.65 (m, 3H), 2.12 (t, J=7.5 Hz, 2H), 2.04 (td, J=7.5, 2.6 Hz, 2H), 2.02-1.91 (m, 3H), 1.68 (t, J=7.4 Hz, 2H). HRMS (ESI-TOF) calcd for C19H25N4O 325.2023 (M+H+), found 325.2023.




embedded image


N-([1,1′-Biphenyl]-4-ylmethyl)-3-(3-(but-3-yn-1-yl)-3H-diazirin-3-yl)propanamide (10) General Procedure 1. Purified by PTLC (Hexane/EtOAc, 4:1) to afford 10 as a white sticky solid (18.5 mg, 78%). 1H NMR (400 MHz, CDCl3) δ 7.61-7.52 (m, 4H), 7.44 (t, J=7.5 Hz, 2H), 7.38-7.33 (m, 4H), 5.77 (br s, 1H), 4.47 (d, J=5.7 Hz, 2H), 2.09-1.94 (m, 5H), 1.94-1.85 (m, 2H), 1.66 (t, J=7.4 Hz, 2H). 13C NMR (101 MHz, CDCl3) δ 170.92, 140.63, 137.05, 128.80, 128.32, 127.48, 127.39, 127.06, 82.70, 69.22, 43.47, 32.42, 30.32, 28.34, 27.86, 13.31. HRMS (ESI-TOF) calcd for C21H22N3O 332.1757 (M+H+), found 332.1755.




embedded image


1-(4-Benzhydrylpiperazin-1-yl)-3-(3-(but-3-yn-1-yl)-3H-diazirin-3-yl)propan-1-one (11) General Procedure 1. Purified by PTLC (DCM/MeOH, 20:1) to afford 11 as an off-white sticky residue (12 mg, 75%). 1H NMR (500 MHz, CDCl3) δ 7.43-7.38 (m, 4H), 7.31-7.24 (m, 4H), 7.22-7.16 (m, 2H), 4.23 (s, 1H), 3.66-3.54 (m, 2H), 3.48-3.34 (m, 2H), 2.36 (apparent t, J=5.0 Hz, 4H), 2.06-1.98 (m, 4H), 1.96 (t, J=2.7 Hz, 1H), 1.85-1.80 (m, 2H), 1.65 (t, J=7.4 Hz, 2H). 13C NMR (126 MHz, CDCl3) δ 169.84, 142.47, 129.01, 128.25, 127.58, 69.52, 52.34, 51.93, 45.96, 42.33, 32.93, 28.41, 27.22, 13.71. HRMS (ESI-TOF) calcd for C25H29N4O 401.2336 (M+H+), found 401.2335.




embedded image


3-(3-(But-3-yn-1-yl)-3H-diazirin-3-yl)-N-(4-((4-methylpiperazin-1-yl)methyl)phenyl)propanamide (12) General Procedure 1. Purified by PTLC (DCM/MeOH, 9:1) to afford 12 as an off-white sticky solid (16 mg, 76%). 1H NMR (500 MHz, CDCl3) δ 7.51 (s, 1H), 7.45 (d, J=8.1 Hz, 2H), 7.25 (d, J=8.3 Hz, 2H), 3.47 (s, 2H), 2.36 (s, 3H), 2.12 (t, J=7.5 Hz, 2H), 2.02 (td, J=7.4, 2.7 Hz, 2H), 1.98 (t, J=2.6 Hz, 1H), 1.92 (t, J=7.5 Hz, 2H), 1.67 (t, J=7.4 Hz, 2H). 13C NMR (126 MHz, CDCl3) δ 169.83, 137.24, 130.20, 120.29, 83.11, 62.59, 55.21, 52.68, 45.93, 32.84, 31.64, 28.63, 28.26, 13.71. HRMS (ESI-TOF) calcd for C20H28N5O 354.2288 (M+H+), found 354.2289.




embedded image


1-(2-Benzylpiperidin-1-yl)-3-(3-(but-3-yn-1-yl)-3H-diazirin-3-yl)propan-1-one (13) 1H NMR (500 MHz, CDCl3) General Procedure 1. Purified by PTLC (Hexane/EtOAc, 1:1) to afford 13 as an off-white sticky solid (9 mg, 77%). δ 7.35-7.15 (m, 3H), 7.11 (apparent d, J=7.4 Hz, 2H), 5.14-4.95 (m, 0.5H), 4.68-4.57 (m, 0.5H), 4.13-3.97 (m, 0.5H), 3.63-3.50 (m, 0.5H), 3.21-3.02 (m, 1H), 2.89-2.69 (m, 2H), 2.09-1.87 (m, 4H), 1.83-1.24 (m, 11H). 13C NMR (126 MHz, CDCl3) δ 169.99, 139.08, 139.01, 129.61, 129.46, 129.19, 128.73, 127.17, 126.63, 83.19, 69.49, 69.42, 55.55, 50.01, 41.70, 37.16, 37.04, 36.10, 32.88, 32.70, 29.92, 28.49, 28.46, 28.18, 27.78, 26.86, 26.47, 26.45, 25.89, 19.67, 19.27, 13.72, 13.70. Note: rotomeric isomers observed. HRMS (ESI-TOF) calcd for C20H26N3O 324.2070 (M+H+), found 324.2068.




embedded image


N-((3s,5s,7s)-Adamantan-1-yl)-3-(3-(but-3-yn-1-yl)-3H-diazirin-3-yl)propanamide (14) General Procedure 1. Purified by SiO2 flash chromatography (Hexane/EtOAc, 10:1→6:1→3:1) to afford 14 as a colorless sticky solid (14.7 mg, 68%). 1H NMR (500 MHz, CDCl3) δ 5.08 (brs, 1H), 2.15 (m, 3H), 2.04-1.95 (m, 9H), 1.88-1.75 (m, 4H), 1.72-1.59 (m, 8H). 13C NMR (126 MHz, CDCl3) δ 170.46, 83.17, 69.52, 52.41, 42.02, 36.74, 32.89, 31.69, 29.86, 29.84, 28.73, 13.71. HRMS (ESI-TOF) calcd for C28H26N3O 300.2070 (M+H+), found 300.2067.




embedded image


N-(2-(Benzo[d][1,3]dioxol-5-yl)ethyl)-3-(3-(but-3-yn-1-yl)-3H-diazirin-3-yl)propanamide (15) General Procedure 1. Purified by SiO2 flash chromatography (Hexane/EtOAc, 3:1) to afford 15 as a white solid (20.2 mg, 71%). 1H NMR (500 MHz, CDCl3) δ 6.74 (d, J=7.9 Hz, 1H), 6.67 (d, J=1.7 Hz, 1H), 6.62 (dd, J=7.9, 1.7 Hz, 1H), 5.93 (s, 2H), 5.43 (d, J=7.4 Hz, 1H), 3.45 (td, J=6.9, 5.8 Hz, 2H), 2.72 (t, J=6.9 Hz, 2H), 2.01 (td, J=7.4, 2.7 Hz, 2H), 1.96 (t, J=2.6 Hz, 1H), 1.90 δ 1.78 (m, 4H), 1.62 (t, J=7.4 Hz, 2H). 13C NMR (126 MHz, CDCl3) δ 171.37, 148.27, 146.65, 132.85, 122.01, 109.43, 108.79, 101.34, 83.10, 69.59, 41.21, 35.71, 32.81, 30.74, 28.72, 13.69. HRMS (ESI-TOF) calcd for C17H20N3O3 314.1499 (M+H+), found 314.1500.




embedded image


(S)-2-(3-(3-(but-3-yn-1-yl)-3H-diazirin-3-yl)propanamido)-4-methyl-N-(naphthalen-2-yl)pentanamide (25) General Procedure 1. Purified by SiO2 flash chromatography (Hexane/EtOAc, 3:1) to afford 25 as a white solid (27 mg, 53%). 1H NMR (500 MHz, CDCl3) δ 9.39 (s, 1H), 8.20 (d, J=2.2 Hz, 1H), 7.70-7.63 (m, 1H), 7.63-7.54 (m, 2H), 7.41 (dd, J=8.8, 2.1 Hz, 1H), 7.37-7.30 (m, 2H), 6.94 (d, J=7.9 Hz, 1H), 4.80 (td, J=8.3, 5.6 Hz, 1H), 2.09-1.94 (m, 2H), 1.93 (t, J=2.6 Hz, 1H), 1.91-1.70 (m, 7H), 1.51 (t, J=7.4 Hz, 2H), 1.00 (dd, J=12.9, 6.1 Hz, 6H). 13C NMR (125 MHz, CDCl3) δ 172.64, 171.84, 135.71, 134.08, 131.04, 129.03, 128.02, 126.74, 125.37, 120.43, 117.39, 83.02, 69.65, 53.48, 41.31, 32.56, 30.42, 28.65, 28.13, 25.35, 23.40, 22.59, 13.59. HRMS (ESI-TOF) calcd for C24H29N4O2 405.2285 (M+H+), found 405.2285




embedded image


(S)-3-(3-(but-3-yn-1-yl)-3H-diazirin-3-yl)-N-(1-((4-methoxynaphthalen-2-yl)amino)-1-oxopropan-2-yl)propanamide (26) General Procedure 1. Purified by SiO2 flash chromatography (Hexane/EtOAc, 9:1→4:1→2:1) to afford 26 as a white solid (147 mg, 73%). 1H NMR (500 MHz, CDCl3) δ 8.86 (s, 1H), 8.29-8.14 (m, 1H), 7.79-7.63 (m, 2H), 7.50 (ddd, J=8.2, 6.7, 1.4 Hz, 1H), 7.44 (ddd, J=8.2, 6.8, 1.3 Hz, 1H), 7.17 (d, J=1.8 Hz, 1H), 6.39 (d, J=7.5 Hz, 1H), 4.83 (p, J=7.1 Hz, 1H), 4.02 (s, 3H), 2.21-2.03 (m, 5H), 2.02-1.93 (m, 2H), 1.71 (t, J=7.2 Hz, 2H), 1.61 (d, J=7.0 Hz, 3H). 13C NMR (125 MHz, CDCl3) δ 172.31, 171.22, 156.39, 136.00, 134.77, 127.54, 124.74, 123.51, 122.22, 109.59, 99.18, 82.97, 69.77, 55.92, 50.34, 32.66, 30.70, 28.76, 28.18, 18.39, 13.62. HRMS (ESI-TOF) calcd for C22H25N4O3 393.1921 (M+H+), found 393.1923


General Procedure 3:




embedded image


To a solution of N-butanoyl-L-leucine (Effenberger et al., 2015) (1 equiv) in DCM (0.06M relative to acid), added commercially available amine (1.1 equiv), DIPEA (2.2 equiv) EDC-HCl (1.2 equiv) and HOBt (1.2 equiv) were added. Reaction mixtures were stirred at room temperature for 4 h to overnight when TLC indicated reaction completed. The crude samples were diluted with DCM and washed first with saturated aqueous NH4Cl and saturated aqueous NaHCO3, then dried over anhydrous Na2SO4 and volatiles removed by rotary evaporation. Crude products were purified by PTLC or flash column chromatography.




embedded image


(S)-2-butyramido-4-methyl-N—((S)-1,2,3,4-tetrahydronaphthalen-1-yl)pentanamide (29) General Procedure 3. Purified by PTLC (Hexane/EtOAc, 1:1) to afford 29 as an off-white solid (24 mg, 73%). 1H NMR (400 MHz, CDCl3) δ 7.23-7.04 (m, 4H), 6.39 (d, J=8.8 Hz, 1H), 5.99 (d, J=8.3 Hz, 1H), 5.16-5.08 (m, 1H), 4.44 (td, J=8.4, 5.4 Hz, 1H), 2.77 (qd, J=16.9, 8.7 Hz, 2H), 2.16 (td, J=7.3, 1.4 Hz, 2H), 2.08-1.93 (m, 1H), 1.91-1.39 (m, 8H), 1.03-0.81 (m, 9H). 13C NMR (125 MHz, CDCl3) δ 173.37, 171.79, 137.83, 136.59, 129.52, 128.83, 127.71, 126.68, 52.03, 48.02, 41.91, 38.87, 30.49, 29.59, 25.28, 23.27, 22.76, 20.46, 19.48, 14.09. HRMS (ESI-TOF) calcd for C20H31N2O2 331.2380 (M+H+), found 331.2383




embedded image


(S)—N-(2-(benzo[d][1,3]dioxol-5-yl)ethyl)-2-butyramido-4-methylpentanamide (30) General Procedure 3. Purified by SiO2 flash chromatography (Hexane/EtOAc, 3:2) to afford 30 as a white solid (181 mg, 75%). 1H NMR (500 MHz, CDCl3) δ 6.73 (d, J=7.9 Hz, 1H), 6.69-6.64 (m, 1H), 6.62 (dd, J=7.9, 1.7 Hz, 1H), 6.45-6.34 (m, 1H), 6.06 (t, J=7.9 Hz, 1H), 5.92 (s, 2H), 4.39 (td, J=8.3, 6.1 Hz, 1H), 3.49 (dq, J=13.5, 6.9 Hz, 1H), 3.38 (dq, J=13.3, 6.8 Hz, 1H), 2.71 (t, J=7.1 Hz, 2H), 2.15 (t, J=7.5 Hz, 2H), 1.70-1.41 (m, 5H), 0.97-0.85 (m, 9H). 13C NMR (125 MHz, CDCl3) δ 173.43, 172.46, 148.18, 146.60, 132.80, 122.02, 109.46, 108.72, 101.29, 51.89, 41.55, 41.20, 38.82, 35.70, 25.18, 23.17, 22.69, 19.44, 14.08. HRMS (ESI-TOF) calcd for C19H29N2O4 349.2122 (M+H+), found 349.2124


General Procedure 4:




embedded image


To commercially available amine (1.0 equiv) in DCM (0.1 M), added DIPEA (1.1 equiv) followed by the slow addition of butanoyl chloride (1.0 equiv). Resulting mixture was allowed to stir at room temperature until amine was fully consumed, as indicated by TLC. The crude mixture was diluted with DCM, washed first with saturated aqueous NH4Cl and saturated aqueous NaHCO3, then dried over anhydrous Na2SO4 and volatiles removed by rotary evaporation. Crude products were purified by PTLC.




embedded image


(S)-2-butyramido-4-methyl-N-(naphthalen-2-yl)pentanamide (27) General Procedure 4. Purified by PTLC (DCM/MeOH, 20:1) to afford 27 as a white solid (15 mg, 58%). 1H NMR (400 MHz, CDCl3) δ 9.41 (s, 1H), 8.26-8.09 (m, 1H), 7.69-7.54 (m, 3H), 7.42 (dd, J=8.8, 2.1 Hz, 1H), 7.38-7.29 (m, J=7.1, 3.5 Hz, 2H), 6.62 (d, J=8.0 Hz, 1H), 4.83 (td, J=8.3, 5.9 Hz, 1H), 2.22 (apparent td, J=7.3, 2.9 Hz, 2H), 1.92-1.57 (m, 5H), 0.99 (dd, J=12.4, 6.1 Hz, 6H), 0.90 (t, J=7.4 Hz, 3H). 13C NMR (125 MHz, CDCl3) δ 174.40, 171.36, 135.78, 134.13, 131.00, 128.96, 128.00, 127.85, 126.69, 125.26, 120.40, 117.15, 53.08, 40.96, 38.78, 25.33, 23.34, 22.67, 19.53, 14.04. HRMS (ESI-TOF) calcd for C20H26N2O2 327.2067 (M+H+), found 327.2069




embedded image


(S)—N-(1-((4-methoxynaphthalen-2-yl)amino)-1-oxopropan-2-yl)butyramide (28) General Procedure 4. Purified by PTLC (DCM/MeOH, 9:1) to afford 28 as a colorless solid (22.7 mg, 68%). 1H NMR (500 MHz, CDCl3) δ 9.36 (s, 1H), 8.12 (dd, J=8.2, 1.4 Hz, 1H), 7.69-7.64 (m, 1H), 7.62 (d, J=8.1 Hz, 1H), 7.40 (ddd, J=8.2, 6.7, 1.4 Hz, 1H), 7.34 (ddd, J=8.2, 6.8, 1.3 Hz, 1H), 7.10 (d, J=1.8 Hz, 1H), 6.56 (d, J=7.5 Hz, 1H), 4.91 (p, J=7.1 Hz, 1H), 3.91 (s, 3H), 2.27 (apparent td, J=7.4, 3.1 Hz, 2H), 1.78-1.68 (m, 2H), 1.55 (d, J=6.9 Hz, 3H), 0.96 (t, J=7.4 Hz, 3H). 13C NMR (125 MHz, CDCl3) δ 173.23, 170.49, 155.47, 135.40, 133.97, 126.67, 123.75, 122.56, 121.34, 108.54, 98.25, 55.04, 49.29, 38.06, 18.74, 17.78, 13.23. HRMS (ESI-TOF) calcd for C18H23N2O3 315.1703 (M+H+), found 315.1703




embedded image


1-(4-phenylpiperidin-1-yl)butan-1-one (49) General Procedure 4. Purified by SiO2 flash chromatography (Hexanes/EtOAc, 10:1→3:1) to afford 49 as a white solid (110 mg, 77%). 1H NMR (500 MHz, CDCl3) δ 7.31 (t, J=7.6 Hz, 2H), 7.24-7.16 (m, 3H), 4.81 (ddd, J=13.5, 4.2, 2.2 Hz, 1H), 3.99 (ddt, J=13.8, 4.2, 2.2 Hz, 1H), 3.12 (td, J=13.1, 2.6 Hz, 1H), 2.73 (tt, J=12.2, 3.7 Hz, 1H), 2.68-2.56 (m, 1H), 2.44-2.25 (m, 2H), 2.00-1.83 (m, 2H), 1.75-1.52 (m, 4H), 0.99 (t, J=7.4 Hz, 3H). HRMS (ESI-TOF) calcd for C13H14NO3 232.0968 [M+H+], found 232.0967




embedded image


N-(2-oxo-2H-chromen-6-yl)butyramide (50) General Procedure 4. Purified by SiO2 flash chromatography (Hexanes/EtOAc, 10:1→3:1) to afford 50 as a light yellow solid (116 mg, 81%). 1H NMR (400 MHz, CDCl3) δ 8.07 (d, J=2.5 Hz, 1H), 7.69 (d, J=9.5 Hz, 1H), 7.52 (brs, 1H), 7.42 (dd, J=8.9, 2.6 Hz, 1H), 7.28 (d, J=2.4 Hz, 1H), 6.44 (d, J=9.6 Hz, 1H), 2.39 (t, J=7.4 Hz, 2H), 1.79 (h, J=7.4 Hz, 2H), 1.03 (t, J=7.4 Hz, 3H). HRMS (ESI-TOF) calcd for C15H22NO 232.1696 [M+H+], found 232.1696




embedded image


1-(4-(2-Methoxyphenyl)piperazin-1-yl)-2-phenylethan-1-one (22). To a mixture of 1-(2-methoxyphenyl)piperazine (30 mg, 0.156 mmol) in anhydrous CH2Cl2 (1.5 mL) and pyridine (0.5 mL) was added phenylacetylchloride (23 mg, 0.172 mmol, 1.1 equiv). The reaction mixture was stirred at room temperature for 12 h before removing the solvent under reduced pressure. The remaining residue was purified by PTLC (Hexanes/EtOAc, 2/1) providing the title compound 22 as a colorless oil (46 mg, 96%). 1H NMR (600 MHz, CDCl3) δ 7.33 (t, J=7.5 Hz, 2H), 7.29-7.22 (m, 3H), 7.02 (td, J=7.7, 1.5 Hz, 1H), 6.93-6.81 (m, 3H), 3.85-3.83 (m, 5H), 3.79 (s, 2H), 3.64-3.59 (m, 2H), 3.00 (t, J=5.1 Hz, 2H), 2.85 (t, J=5.0 Hz, 2H). 13C NMR (151 MHz, CDCl3) δ 40.66, 41.58, 46.00, 50.02, 50.37, 54.99, 110.86, 117.95, 120.58, 123.08, 126.39, 128.16, 128.33, 134.67, 140.20, 151.78, 169.08. HRMS (ESI-TOF) calcd for C19H23N2O2 311.1754 [M+H+], found 311.1753




embedded image


1-(Benzylsulfonyl)-4-(2-methoxyphenyl)piperidine (23). To a mixture of 4-(2-methoxyphenyl)piperidine (50 mg, 0.26 mmol) and N,N-diisopropylethylamine (DIPEA, 0.100 mL, 0.58 mmol) in anhydrous THF (3.0 mL) was added benzylsulfonyl chloride (55 mg, 0.28 mmol, 1.1 equiv.) under N2. The reaction mixture was stirred at 50° C. for 12 h. The reaction mixture was poured into a separatory funnel with brine (10 mL) and extracted with EtOAc (2×10 mL). The combined organic layers were then dried over anhydrous Na2SO4 and concentrated under reduced pressure. The remaining residue was purified by SiO2 flash chromatography (Hexanes/EtOAc, 5/1) providing the title compound 23 as a slightly beige powder (50 mg, 56%). 1H NMR (600 MHz, CDCl3) δ 7.46-7.35 (m, 5H), 7.19 (ddd, J=8.3, 7.4, 1.7 Hz, 1H), 7.10 (dd, J=7.6, 1.7 Hz, 1H), 6.93 (td, J=7.5, 1.1 Hz, 1H), 6.85 (dd, J=8.2, 1.1 Hz, 1H), 4.24 (s, 2H), 3.83-3.75 (m, 5H), 2.96 (tt, J=12.1, 3.5 Hz, 1H), 2.72 (td, J=12.4, 2.5 Hz, 2H), 1.80-1.73 (m, 2H), 1.64 (qd, J=12.6, 4.2 Hz, 2H). 13C NMR (151 MHz, CDCl3) δ 169.48, 152.18, 140.60, 135.07, 128.73, 128.56, 126.79, 123.48, 120.98, 118.35, 111.26, 55.39, 50.77, 50.42, 46.40, 41.98, 41.06. HRMS (ESI-TOF) calcd for C19H24NO3S 346.1471 (M+H+), found 346.1472.




embedded image


N-(2-(4-(2-methoxyphenyl)piperidin-1-yl)-2-oxoethyl)acetamide (51). 4-(2-methoxyphenyl)piperidine (50 mg, 0.26 mmol), acetylglycine (46 mg, 0.39 mmol, 1.5 equiv.) and N,N-diisopropylethylamine (DIPEA, 0.137 mL, 0.58 mmol, 3.0 equiv.) in anhydrous DMF (1.0 mL) were added EDC (75 mg, 0.39 mmol, 1.5 equiv.) and HOAt (53 mg, 0.39 mmol, 1.5 equiv.). The reaction mixture was stirred at room temperature for ˜12 h before removing the solvent under reduced pressure. The remaining residue was purified by PTLC (CH2Cl2/MeOH, 9/1) providing the title compound 51 as a colorless oil (40 mg, 53%). 1H NMR (600 MHz, CDCl3) δ 7.21 (ddd, J=8.2, 7.4, 1.7 Hz, 1H), 7.10 (dd, J=7.6, 1.8 Hz, 1H), 6.93 (td, J=7.5, 1.1 Hz, 1H), 6.87 (dd, J=8.2, 1.1 Hz, 1H), 6.67 (brs, 1H), 4.77-4.71 (m, 1H), 4.16-4.09 (m, 1H), 4.05 (dd, J=17.3, 3.8 Hz, 1H), 3.83-3.81 (m, 4H), 3.24-3.12 (m, 2H), 2.75 (td, J=12.9, 2.8 Hz, 1H), 2.05 (s, 3H), 1.94-1.85 (m, 2H), 1.68-1.52 (m, 2H). 13C NMR (151 MHz, CDCl3) δ 23.07, 31.26, 32.12, 35.43, 41.40, 43.13, 45.30, 55.28, 110.42, 120.70, 126.38, 127.40, 132.74, 156.66, 166.03, 170.09. HRMS (ESI-TOF) calcd for C16H23N2O3 291.1703 (M+H+), found 291.1704.




embedded image


1-(4-(2-Methoxyphenyl)piperidin-1-yl)-2-(piperidin-1-yl)ethan-1-one (52). To a mixture of 4-(2-methoxyphenyl)piperidine (350 mg, 1.83 mmol) and triethylamine (0.643 mL, 4.57 mmol, 2.5 equiv.) in anhydrous CH2Cl2 (3.5 mL) was slowly added chloroacetyl chloride (0.175 mL, 2.20 mmol, 1.2 equiv.) under N2 at 0° C. The reaction mixture was stirred at room temperature for 1 h and diluted with EtOAc (10 mL). The mixture was washed with 1N aqueous HCl (1×10 mL) and brine. The organic layer was then dried over anhydrous Na2SO4 and concentrated under reduced pressure to afford a crude compound as a dark brown oil which was used to next reaction without further purification.


To a mixture of the oil (100 mg, 0.37 mmol) and triethylamine (0.156 mL, 1.12 mmol, 3.0 equiv.) in CH3CN (1 mL) was added piperidine (0.110 mL, 1.12 mmol, 3.0 equiv.) under N2. The reaction mixture was stirred at room temperature for 1 h and then quenched with H2O (1 mL). The product was extracted with EtOAc (2×10 mL). The combined organic layers were then dried over anhydrous Na2SO4 and concentrated under reduced pressure. The remaining residue was purified by SiO2 flash chromatography (Hexanes/EtOAc, 3/1, 3% Et3N) providing the title compound 52 as a pale yellow oil (84 mg, 71% in 2 steps). 1H NMR (600 MHz, CDCl3) δ 7.20 (ddd, J=8.2, 7.4, 1.7 Hz, 1H), 7.12 (dd, J=7.6, 1.7 Hz, 1H), 6.93 (td, J=7.5, 1.2 Hz, 1H), 6.87 (dd, J=8.2, 1.1 Hz, 1H), 4.77-4.70 (m, 1H), 4.32-4.25 (m, 1H), 3.83 (s, 3H), 3.25 (d, J=13.3 Hz, 1H), 3.22-3.14 (m, 1H), 3.12-3.04 (m, 3H), 2.65 (td, J=12.9, 2.7 Hz, 1H), 2.47-2.41 (m, 4H), 1.87-1.83 (m, 1H), 1.66 (qd, J=12.6, 4.1 Hz, 1H), 1.61-1.53 (m, 5H), 1.45-1.41 (m, 2H). 13C NMR (151 MHz, CDCl3) δ 24.01, 24.04, 26.03, 31.72, 32.59, 35.61, 42.84, 46.71, 54.32, 54.42, 55.26, 62.61, 109.95, 110.38, 120.66, 126.47, 126.49, 127.15, 133.53, 156.74, 168.41. HRMS (ESI-TOF) calcd for C19H29N2O2 317.2223 (M+H+), found 317.2226.




embedded image


1-(4-(2-Methoxyphenyl)piperidin-1-yl)-2-morpholinoethan-1-one (53). 4-(2-methoxyphenyl)piperidine (30 mg, 0.16 mmol), morpholin-4-ylacetic acid (27 mg, 0.19 mmol, 1.2 equiv.) and DIPEA (0.084 mL, 0.48 mmol, 3.0 equiv.) in anhydrous DMF (1.0 mL) were added EDC (45 mg, 0.23 mmol, 1.5 equiv.) and HOAt (32 mg, 0.23 mmol, 1.5 equiv.). The reaction mixture was stirred at room temperature for 2 days. H2O (1 mL) was added to the reaction mixture and product was extracted with EtOAc (2×1 mL). The combined organic layers were concentrated under reduced pressure. The remaining residue was purified by PTLC (EtOAc/MeOH, 5/1) providing the title compound 53 as a colorless oil (35 mg, 70%). 1H NMR (400 MHz, CDCl3) δ 7.21 (td, J=7.8, 1.7 Hz, 1H), 7.11 (dd, J=7.6, 1.7 Hz, 1H), 6.98-6.84 (m, 2H), 4.74 (d, J=12.9 Hz, 1H), 4.18 (d, J=13.4 Hz, 1H), 3.83 (s, 3H), 3.74 (t, J=4.7 Hz, 4H), 3.28 (d, J=13.5 Hz, 1H), 3.24-3.07 (m, 3H), 2.72-2.61 (m, 1H), 2.60-2.47 (m, 4H), 1.88 (t, J=14.4 Hz, 2H), 1.69-1.59 (m, 2H). HRMS (ESI-TOF) calcd for C18H27N2O3 319.2016 (M+H+), found 319.2017.




embedded image


1-(2-(4-(2-Methoxyphenyl)piperidin-1-yl)-2-oxoethyl)pyridin-2(1H)-one (54). 4-(2-methoxyphenyl)piperidine (50 mg, 0.26 mmol), (2-oxo-2H-pyridin-1-yl)-acetic acid (48 mg, 0.31 mmol, 1.2 equiv.) and triethylamine (0.054 mL, 0.39 mmol, 1.5 equiv.) in anhydrous DMF (1.0 mL) were added EDC (76 mg, 0.39 mmol, 1.5 equiv.) and HOAt (53 mg, 0.39 mmol, 1.5 equiv.). The reaction mixture was stirred at room temperature for ˜12 h before removing the solvent under reduced pressure. The remaining residue was purified by PTLC (EtOAc/MeOH, 6/1) providing the title compound 54 as a colorless oil (39 mg, 46%). 1H NMR (600 MHz, CDCl3) δ 7.39-7.30 (m, 2H), 7.20 (ddd, J=8.2, 7.4, 1.7 Hz, 1H), 7.12 (dd, J=7.5, 1.7 Hz, 1H), 6.93 (td, J=7.5, 1.1 Hz, 1H), 6.87 (dd, J=8.2, 1.1 Hz, 1H), 6.58 (ddd, J=9.2, 1.4, 0.7 Hz, 1H), 6.21 (td, J=6.7, 1.4 Hz, 1H), 4.86 (d, J=15.2 Hz, 1H), 4.80-4.69 (m, 2H), 4.15-4.04 (m, 1H), 3.83 (s, 3H), 3.31-3.16 (m, 2H), 2.75 (td, J=13.0, 2.9 Hz, 1H), 1.97-1.90 (m, 1H), 1.90-1.83 (m, 1H), 1.72-1.58 (m, 2H). 13C NMR (151 MHz, CDCl3) δ 30.90, 31.84, 34.98, 42.98, 45.82, 48.40, 54.87, 105.52, 109.56, 109.96, 120.22, 120.29, 126.06, 126.91, 132.51, 138.06, 139.59, 156.27, 161.96, 164.46. HRMS (ESI-TOF) calcd for C19H23N2O3 327.1703 (M+H+), found 327.1705.




embedded image


1-(4-(2-Methoxyphenyl)piperidin-1-yl)-2-phenylethan-1-on (55). To a mixture of 4-(2-methoxyphenyl)piperidine (30 mg, 0.16 mmol) and triethylamine (0.073 mL, 0.24 mmol, 1.5 equiv.) in anhydrous CH2Cl2 (1.0 mL) was added phenylacetyl chloride (26 mg, 0.17 mmol, 1.1 equiv.) under N2 at 0° C. The reaction mixture was stirred at room temperature for 1 h before removing the solvent under reduced pressure. The remaining residue was purified by PTLC (Hexanes/EtOAc, 2/1) providing the title compound 55 as a white solid (15 mg, 31%). 1H NMR (500 MHz, CDCl3) δ 7.36-7.27 (m, 3H), 7.27-7.14 (m, 3H), 7.03 (dd, J=7.5, 1.7 Hz, 1H), 6.94-6.82 (m, 2H), 4.81 (d, J=13.1 Hz, 1H), 3.97 (d, J=13.4 Hz, 1H), 3.80 (s, 3H), 3.78 (s, 2H), 3.17-3.04 (m, 2H), 2.67 (td, J=12.9, 2.8 Hz, 1H), 1.83 (d, J=13.5 Hz, 1H), 1.73 (d, J=13.3 Hz, 1H), 1.59 (td, J=12.7, 4.3 Hz, 1H), 1.31 (qd, J=12.6, 4.1 Hz, 1H). HRMS (ESI-TOF) calcd for C20H24NO2 310.1801 (M+H+), found 310.1801.




embedded image


4-(2-Methoxyphenyl)-N-phenylpiperidine-1-carboxamide (56). To a solution of 4-(2-methoxyphenyl)piperidine (50 mg, 0.26 mmol) in anhydrous DMF (1.0 mL) was added sodium hydride (in 60% oil, 12.5 mg, 0.31 mmol, 1.2 equiv.) under N2 at 0° C. The mixture was stirred at 0° C. for 15 min. Phenylisocyanate (37 mg, 0.31 mmol, 1.2 equiv.) in anhydrous DMF (0.5 mL) was added to the mixture. The reaction was then allowed to warm to room temperature. After stirring at room temperature for 1 h, the reaction was quenched with saturated aqueous NH4Cl and the product was extracted with EtOAc (2×10 mL). The combined organic layers were then dried over anhydrous Na2SO4 and concentrated under reduced pressure. The remaining residue was purified by PTLC (Hexanes/EtOAc, 1/1) providing the title compound 56 as an off-white powder (71 mg, 89%). 1H NMR (600 MHz, CDCl3) δ 7.41-7.36 (m, 2H), 7.36-7.25 (m, 2H), 7.24-7.13 (m, 2H), 7.03 (tt, J=7.4, 1.2 Hz, 1H), 6.94 (td, J=7.5, 1.1 Hz, 1H), 6.88 (dd, J=8.1, 1.1 Hz, 1H), 6.39 (brs, 1H), 4.24-4.18 (m, 2H), 3.84 (s, 3H), 3.17 (tt, J=12.1, 3.5 Hz, 1H), 3.03 (td, J=13.0, 2.6 Hz, 2H), 1.92-1.86 (m, 2H), 1.76-1.66 (m, 2H). 13C NMR (151 MHz, CDCl3) δ 31.26, 34.92, 44.81, 54.85, 76.31, 76.81, 76.91, 76.99, 109.94, 119.33, 119.36, 120.24, 120.25, 122.45, 122.49, 126.06, 126.79, 128.40, 128.43, 154.45, 156.27. HRMS (ESI-TOF) calcd for C19H23N2O2 311.1754 (M+H+), found 311.1753.




embedded image


2-Phenyl-1-(4-(2-(trifluoromethyl)phenyl)piperidin-1-yl)ethan-1-one (57). To a mixture of 4-(2-(trifluoromethyl)phenyl)piperidine hydrochloride (40 mg, 0.15 mmol) in anhydrous CH2Cl2 (1.5 mL) and pyridine (0.5 mL) was added phenylacetylchloride (26 mg, 0.17 mmol, 1.1 equiv.) under N2 at 0° C. The reaction mixture was stirred at room temperature for 12 h before removing the solvent under reduced pressure. The remaining residue was purified by PTLC (Hexanes/EtOAc, 2/1) providing the title compound 57 as a colorless oil (40 mg, 77%). 1H NMR (600 MHz, CDCl3) δ 7.61 (dd, J=7.9, 1.2 Hz, 1H), 7.51-7.45 (m, 1H), 7.38-7.22 (m, 7H), 4.88-4.81 (m, 1H), 4.02-3.96 (m, 1H), 3.84-3.75 (m, 2H), 3.15-3.04 (m, 2H), 2.65 (td, J=13.0, 2.8 Hz, 1H), 1.82 (d, J=13.3 Hz, 1H), 1.69 (d, J=13.2 Hz, 1H), 1.63 (qd, J=12.6, 4.2 Hz, 1H), 1.31 (qd, J=12.6, 4.1 Hz, 1H). 13C NMR (151 MHz, CDCl3) δ 32.46, 33.25, 37.85, 40.91, 42.24, 46.50, 125.05, 125.42, 125.46, 125.91, 126.40, 127.37, 127.55, 128.21, 128.34, 131.65, 134.85, 143.64, 168.99. HRMS (ESI-TOF) calcd for C20H21F3NO 348.1570 (M+H+), found 348.1572.




embedded image


1-(4-(3-Methoxyphenyl)piperidin-1-yl)-2-phenylethan-1-one (58). To a mixture of 4-(2-(trifluoromethyl)phenyl)piperidine hydrochloride (40 mg, 0.15 mmol) in anhydrous CH2Cl2 (1.5 mL) and pyridine (0.5 mL) was added phenylacetylchloride (26 mg, 0.17 mmol, 1.1 equiv.) under N2 at 0° C. The reaction mixture was stirred at room temperature for 12 h before removing the solvent under reduced pressure. The remaining residue was purified by PTLC (Hexanes/EtOAc, 2/1) providing the title compound 58 as a colorless oil (40 mg, 44%). 1H NMR (500 MHz, CDCl3) δ 7.37-7.28 (m, 3H), 7.28-7.17 (m, 3H), 6.78-6.69 (m, 2H), 6.67-6.65 (m, 1H), 4.81 (d, J=13.3 Hz, 1H), 3.98 (d, J=13.7 Hz, 1H), 3.83-3.73 (m, 4H), 3.10-3.01 (m, 1H), 2.70-2.59 (m, 2H), 1.87 (d, J=13.5 Hz, 1H), 1.74 (d, J=14.7 Hz, 1H), 1.65-1.56 (m, 1H), 1.38-1.23 (m, 2H). HRMS (ESI-TOF) calcd for C20H24NO2 310.1801 (M+H+), found 310.1801.




embedded image



General Procedure 5:


To a mixture of 1-phenylacetyl-piperazin hydrochloride (30 mg, 0.13 mmol), phenylboronic acid (2.0 equiv.) and triethylamine (0.092 mL, 0.66 mmol, 5.0 equiv.) in C1CH2CH2C1 (1.0 mL) was added Cu(OAc)2 (48 mg, 0.17 mmol, 2.0 equiv.). The reaction mixture was stirred at 50° C. for 12 h before removing the solvent under reduced pressure. The remaining residue was purified by PTLC (Hexanes/EtOAc, 1/1) providing the title compound.




embedded image


2-phenyl-1-(4-phenylpiperazin-1-yl)ethan-1-one (59). (10 mg, colorless oil, 27%): 1H NMR (500 MHz, CDCl3) δ 7.36-7.30 (m, 3H), 7.30-7.21 (m, 4H), 6.92-6.85 (m, 3H), 3.84-3.77 (m, 4H), 3.63-3.57 (m, 2H), 3.17-3.11 (m, 2H), 2.99-2.95 (m, 2H). HRMS (ESI-TOF) calcd for C18H21N2O 281.1648 (M+H+), found 281.1649.




embedded image


1-(4-(4-methoxyphenyl)piperazin-1-yl)-2-phenylethan-1-one (60). (7.2 mg, colorless oil, 18%): 1H NMR (500 MHz, CDCl3) δ 7.36-7.29 (m, 3H), 7.29-7.25 (m, 4H), 6.88-6.79 (m, 2H), 3.83-3.74 (m, 7H), 3.62-3.56 (m, 2H), 3.01 (t, J=5.2 Hz, 2H), 2.87-2.83 (m, 2H). HRMS (ESI-TOF) calcd for C19H23N2O2 311.1754 (M+H+), found 311.1755.




embedded image


1-(4-(4-methoxyphenyl)piperazin-1-yl)-2-phenylethan-1-one (61). (1.6 mg, white solid, 3.0%): 1H NMR (500 MHz, CDCl3) δ 7.38-7.24 (m, 6H), 7.21-7.17 (m, 2H), 3.86-3.78 (m, 4H), 3.63 (t, J=5.2 Hz, 2H), 3.26 (t, J=5.3 Hz, 2H), 3.08 (t, J=5.1 Hz, 2H). HRMS (ESI-TOF) calcd for C20H19F6N2O 417.1396 (M+H+), found 417.1397




embedded image


1-(4-(2-phenoxyphenyl)piperazin-1-yl)-2-phenylethan-1-one (62). (3.3 mg, colorless oil, 6.8%): 1H NMR (500 MHz, CDCl3) δ 7.34-7.20 (m, 6H), 7.13-6.85 (m, 8H), 3.72 (s, 2H), 3.60 (t, J=5.1 Hz, 2H), 3.40-3.34 (m, 2H) 3.02 (t, J=5.1 Hz, 2H), 2.87 (t, J=5.0 Hz, 2H). HRMS (ESI) calcd for C24H25N2O2 373.191 (M+H+), found 373.1909.


Tables 1-3 illustrate proteins and binding sites described herein.














TABLE 1





Accession

Labeled





#
Protein Name
Peptide
Peptide Sequence
Probes
Family







Q9NUJ1
ABHD10
285-300
ADIQLLVYTIDDLIDK (SEQ ID
3
Enzymes



Abhydrolase

NO: 97)





domain-







containing







protein 10,







mitochondrial






Q9NUJ1
ABHD10
209-223
YSEEGVYNVQYSFIK (SEQ ID NO:
13 14
Enzymes



Abhydrolase

98)
15 3 4
















domain-


8






containing









protein 10,









mitochondrial








Q99798
ACO2 Aconitate
32-50
VAMSHFETNEYIHYDLLEK (SEQ
6
Enzymes





hydratase,

ID NO: 99)







mitochondrial








P24666
ACP1 Low
42-59
VDSAATSGYEIGNPPDYR
13
Enzymes





molecular

(SEQ ID NO: 1)







weight









phosphotyrosine









protein









phosphatase








P68133
ACTA1 Actin,
241-256
SYELPDGQVITIGNER (SEQ ID
13 3 9
Adapter,





alpha skeletal

NO: 100)

Scaffolding,





muscle



Modulator









Proteins




P68133
ACTA1 Actin,
71-86
YPIEHGIITNWDDMEK (SEQ ID
13
Adapter,





alpha skeletal

NO: 101)

Scaffolding,





muscle



Modulator









Proteins




P62736
ACTA2 Actin,
241-256
SYELPDGQVITIGNER (SEQ ID
13 9
Adapter,





aortic smooth

NO: 102)

Scaffolding,





muscle



Modulator









Proteins




P62736
ACTA2 Actin,
71-86
YPIEHGIITNWDDMEK
13
Adapter,





aortic smooth

NO: 103)

Scaffolding,





muscle



Modulator









Proteins

















P60709
ACTB Actin,
148-177
TTGIVMDSGDGVTHTVPIYEGYA
14 13
Adapter,






cytoplasmic 1

LPHAILR (SEQ ID NO: 104)

Scaffolding,










Modulator










Proteins





P60709
ACTB Actin,
197-206
GYSFTTTAER (SEQ ID NO: 105)
3
Adapter,






cytoplasmic 1



Scaffolding,










Modulator










Proteins





P60709
ACTB Actin,

LCYVALDFEQEMATAASSSSLEK
13 14
Adapter,






cytoplasmic 1
216-238
(SEQ ID NO: 106)
3 9 8
Scaffolding,










Modulator










Proteins



















P60709
ACTB Actin,
239-254
SYELPDGQVITIGNER (SEQ ID
13 14
Adapter,







cytoplasmic 1

NO: 107)
3 9 8
Scaffolding,











Modulator











Proteins






P60709
ACTB Actin,
96-113
VAPEEHPVLLTEAPLNPK(SEQ ID
14 3
Adapter,







cytoplasmic 1

NO: 108)
13
Scaffolding,











Modulator











Proteins






Q562R1
ACTBL2 Beta-
240-255
SYELPDGQVITIGNER (SEQ ID
13
Adapter,







actin-like

NO: 109)

Scaffolding,







protein 2



Modulator











Proteins






Q562R1
ACTBL2 Beta-
97-114
VAPDEHPILLTEAPLNPK (SEQ ID
13
Adapter,







actin-like

NO: 110)

Scaffolding,







protein 2



Modulator











Proteins






O96019
ACTL6A Actin-
25-34
AGYAGEDCPK (SEQ ID NO: 111)
3
Transcription







like protein 6A



factors,











Regulators






P12814
ACTN1 Alpha-
237-254
AIMTYVSSFYHAFSGAQK (SEQ ID
13
Adapter,







actinin-1

NO: 112)

Scaffolding,











Modulator











Proteins






P12814
ACTN1 Alpha-
377-387
GYEEWLLNEIR (SEQ ID NO: 113)
13
Adapter,







actinin-1



Scaffolding,











Modulator











Proteins






O43707
ACTN4 Alpha-
256-273
AIMTYVSSFYHAFSGAQK (SEQ ID
13
Channels,







actinin-4

NO: 114)

Transporters,











Receptors






O43707
ACTN4 Alpha-
396-406
GYEEWLLNEIR (SEQ ID NO: 115)
13
Channels,







actinin-4



Transporters,











Receptors






O43707
ACTN4 Alpha-
470-494
VEQIAAIAQELNELDYYDSHNVN
14
Channels,







actinin-4

TR (SEQ ID NO: 116)

Transporters,











Receptors






O43707
ACTN4 Alpha-
792-805
ACLISLGYDVENDR (SEQ ID:
14
Channels,







actinin-4

117)

Transporters,











Receptors






Q8NI60
ADCK3
277-295
LGQMLSIQDDAFINPHLAK
14
Enzymes







Chaperone

(SEQ ID NO: 2)









activity of bc1











complex-like,


14








mitochondrial










P55263
ADK Adenosine
209-224
IFTLNLSAPFISQFYK
2
Enzymes







kinase

(SEQ ID NO: 3)








P30520
ADSS
431-441
FIEDELQIPVK
14
Enzymes







Adenylosuccinate

(SEQ ID NO: 4)









synthetase











isozyme 2










Q53H12
AGK
283-304
LASYWAQPQDALSQEVSPEVWK
14
Enzymes







Acylglycerol

(SEQ ID NO: 118)









kinase,











mitochondrial











AGPS










O00116
Alkyldihydroxya
587-603
GISDPLTVFEQTEAAAR (SEQ ID
13 14
Enzymes







cetonephosphate

NO: 119)









synthase,











peroxisomal










O43865
AHCYL1
250-261
GIVEESVTGVHR 
6
Transcription







Putative

(SEQ ID NO: 120)

factors,







adenosylhomocy



Regulators







steinase 2










Q96HN2
AHCYL2











Putative
331-342
GIVEESVTGVHR 
6
Enzymes







adenosylhomocy

(SEQ ID NO: 121)









steinase 3










O95433
AHSA1
225-246
VFTTQELVQAFTHAPATLEADR
4
Chaperones







Activator of 90

(SEQ ID NO: 122)









kDa heat shock











protein ATPase











homolog 1










O95433
AHSA1
322-328
YYFEGIK (SEQ ID NO: 123)
4
Chaperones







Activator of 90











kDa heat shock











protein ATPase











homolog 1










O95831
AIFM1

PYWHQSMIEWSDLGPDVGYEAIG
3 2 4
Enzymes







Apoptosis-
475-510
LVDSSLPTVGVFAK
6








inducing factor

(SEQ ID NO: 5)









1, mitochondrial










P54886
ALDH18A1
650-662
FASYLTFSPSEVK (SEQ ID NO:
14
Enzymes







Delta-1-

(12)









pyrroline-5-











carboxylate











synthase










Q3SY69
ALDHIL2
152-172
AGFSVFWADDGLDTGPILLQR
6
Enzymes







Mitochondrial

(SEQ ID NO: 125)









10-











formyltetraydro











folate











dehydrogen










P49419
ALDH7A1
139-162
ILVEGVGEVQEYVDICDYAVGLSR
13 8
Enzymes







Alpha-

(SEQ ID NO: 6)









aminoadipic











semialdehyde











dehydrogenase










Q9UJX3
ANAPC7
407-424
LDCYEGLIECYLASNSIR (SEQ ID
3
Uncategorized







Anaphase-

NO: 126)









promoting











complex subunit











7











ANP32A Acidic
117-132
SLDLFNCEVTNLNDYR (SEQ ID
13
Transcription






Q10567
leucine-rich

NO: 127)

factors,







nuclear



Regulators







phosphoprotein











32 family











member A










Q92688
ANP32B Acidic
117-132
SLDLFNCEVTNLNDYR (SEQ ID
13
Chaperones







leucine-rich

NO: 128)









nuclear











phosphoprotein











32 family











member B










Q10567
AP1B1 AP-1
902-913
LTNGIWVLAELR (SEQ ID NO:
13
Channels,







complex subunit

129)

Transporters,







beta-1



Receptors






Q9BZZ5
API5 Apoptosis
182-196
VLEDVTFEEFVLFMK (SEQ ID
4
Uncategorized







inhibitor 5

NO: 130)








Q9BZZ5
API5 Apoptosis
131-148
GTLGGLFSQILQGEDIVR (SEQ ID
4
Uncategorized







inhibitor 5

NO: 131)








Q9BZZ5
API5 Apoptosis
211-237
QQLVELVAEQADLEQTFNPSDPD
4
Uncategorized







inhibitor 5

CVDR (SEQ ID NO: 132)








Q9BUR5
APOO
173-182
GYIVIEDLWK (SEQ ID NO: 133)
14 4 2
Channels,







Apolipoprotein



Transporters,







O



Receptors






P84077
ARF1 ADP-
39-59
LGEIVTTIPTIGFNVETVEYK
13 3 2
Channels,







ribosylation

(SEQ ID NO: 7)
8
Transporters,







factor 1



Receptors






P61204
ARF3 ADP-
39-59
LGEIVTTIPTIGFNVETVEYK
13 3 2
Channels,







ribosylation

(SEQ ID NO: 7)
8
Transporters,







factor 3



Receptors






P18085
ARF4 ADP-
39-59
LGEIVTTIPTIGFNVETVEYK
13 3 2
Channels,







ribosylation

(SEQ ID NO: 7)
8
Transporters,







factor 4



Receptors






P84085
ARF5 ADP-
39-59
LGEIVTTIPTIGFNVETVEYK
13 3 2
Channels,







ribosylation

(SEQ ID NO: 7)
8
Transporters,







factor 5



Receptors






P40616
ARL1 ADP-
163-178
GTGLDEAMEQLVETLK
 14 13
Transcription







ribosylation

(SEQ ID NO: 9)

factors







factor-like



Regulators







protein 1










P40616
ARL1 ADP-
37-59
LQVGEVVTTIPTIGFNVETVTY
13
Transcription







ribosylation

(SEQ ID NO: 10)

factors







factor-like



Regulators







protein 1










O43681
ASNA1 ATPase
131-153
MMQEAMSAFPGIDEAMSYAEVM
14
Enzymes







ASNA1

R (SEQ ID NO: 134)








Q9NV17
ATAD3A
287-294
AFVTDWDK (SEQ ID NO: 135)
4 6
Enzymes







ATPase family











AAA domain-











containing











protein 3A










P31939
ATIC
178-194
AFTHTAQYDEAISDYFR
13 
Enzymes







Bifunctional

(SEQ ID NO: 11)









purine











biosynthesis











protein PURH










P05023
ATP1A1
360-377
NLEAVETLGSTSTICSDK (SEQ ID
13 14
Channels,







Sodium/potassium-

NO: 136)

Transporters,







transporting



Receptors







ATPase subunit











apha










P05023
ATP1A1
894-911
WINDVEDSYGQQWTYEQR (SEQ
9
Channels,







Sodium/potassium-

ID NO: 137)

Transporters,







transporting



Receptors







ATPase subunit











apha










P16615
ATP2A2
335-352
SLPSVETLGCTSVICSD (SEQ ID
14
Channels,







Sarcoplasic/

NO: 138)

Transporters,







endoplasmic



Receptors







reticulum











calcium ATPase










P20020
ATP2B1 Plasma
824-840
EASDIILTDDNFTSIVK (SEQ ID
14
Channels,







membrane

NO: 139)

Transporters,







calcium-



Receptors







transporting











ATPase 1










P23634
ATP2B4 Plasma
812-828
EASDILTDDFTSIV (SEQ ID
14
Channels,







membrane

NO: 140)

Transporters,







calcium-



Receptors







transporting











ATPase 4










P25705
ATP5A1 ATP
104-123
GMSLNLEPDNVGVVVFGNDK
14 3 
Channels,







synthase subunit

(SEQ ID NO: 141)
13
Transporters,







alpha,



Receptors







mitochondrial










P25705
ATP5A1 ATP
442-463
EVAAFAQFGSDLDAATQQLLSR
13 14
Channels,







synthase subunit

(SEQ ID NO: 142)
3 2 9
Transporters,







alpha,


8
Receptors







mitochondrial










P06576
ATP5B ATP
144-155
IMNVGEPIDER (SEQ ID NO: 143)
2 6
Channels,







synthase subunit



Transporters,







beta,



Receptors







mitochondrial










P06576
ATP5B ATP
226-239
AHGGYSVFAGCGER (SEQ ID NO:
6
Channels,







synthase subunit

144)

Transporters,







beta,



Receptors







mitochondrial










P06576
ATP5B ATP
242-259
EGNDLYHEMIESGVINLK (SEQ 
9 6
Channels,







synthase subunit

NO: 145)

Transporters,







beta,



Receptors







mitochondrial










P06576
ATP5B ATP
295-310
DQEGQDVLLFIDNIFR (SEQ ID
6
Channels,







synthase subunit

NO: 146)

Transporters,







beta,



Receptors







mitochondrial










P06576
ATP5B ATP
352-387
GSITSVQAIYVPADDLTDPAPATT
14 9 6
Channels,







synthase subunit

FAHLDATTVLSR (SEQ ID NO:

Transporters,







beta,

147)

Receptors







mitochondrial










P06576
ATP5B ATP
388-406
AIAELGIYPAVDPLDSTSR (SEQ ID
13 14
Channels,







synthase subunit

NO: 148)
3 2 6
Transporters,







beta,


8
Receptors







mitochondrial










P06576
ATP5B ATP
407-422
IMDPNIVGSEHYDVAR (SEQ ID
14  
Channels,







synthase subunit

NO: 149)

Transporters,







beta,



Receptors







mitochondrial










P06576
ATP5B ATP
433-451
SLQDIIAILGMDELSEEDK (SEQ ID
14 6
Channels,







synthase subunit

NO: 150)

Transporters,







beta,



Receptors







mitochondrial










P06576
ATP5B ATP
463-480
FLSQPFQVAEVFTGHMGK (SEQ
6
Channels,







synthase subunit

ID NO: 151)

Transporters,







beta,



Receptors







mitochondrial










P06576
ATP5B ATP
95-109
LVLEVAQHLGESTVR (SEQ ID
13 6
Channels,







synthase subunit

NO: 152

Transporters,







beta,



Receptors







mitochondrial










P36542
ATP5C1 ATP
116-126
SEVATLTAAGK (SEQ ID NO: 153)
6
Channels,







synthase subunit



Transporters,







gamma,



Receptors







mitochondrial










P24539
ATP5F1 ATP
116-126
YGPFVADFADK (SEQ ID NO: 154)
14
Channels,







synthase subunit



Transporters,







b, mitochondrial



Receptors






P24539
ATP5F1 ATP
56-70
YGLIPEEFFQFLYPK (SEQ ID NO:
14 4 2
Channels,







synthase subunit

155)
13
Transporters,







b, mitochondrial



Receptors






P24539
ATP5F1 ATP
71-90
TGVTGPYVLGTGLILYALSK (SEQ
13
Channels,







synthase subunit

ID NO: 156)

Transporters,







b, mitochondrial



Receptors







ATP5F1 ATP










P21281
ATP6V1B2 V-
437-457
AVVGEEALTSDDLLYLEFLQK
14
Channels,







type proton

(SEQ ID NO: 157)

Transporters,







ATPase subunit



Receptors







B, brain isoform










P21281
ATP6V1B2 V-
83-93
SGQVLEVSGSK (SEQ ID NO: 158)
13
Channels,







type proton



Transporters,







ATPase subunit



Receptors







B, brain isoform










P36543
ATP6V1BE1 V-
200-212
LDLIAQQMMPEVR (SEQ ID NO:
13
Channels,







type proton

159)

Transporters,







ATPase subunit



Receptors







E 1










P46379
BAG6 Large
332-344
LLGNTFVALSDLR (SEQ ID NO:
8
Chaperones







proline-rich

160)









protein BAG6










Q07812
BAX Apoptosis
66-78
IGDELDSNMELQR (SEQ ID NO:
13
Uncategorized







regulator BAX

161)








O75934
BCAS2 Pre-
137-151
VYNENLVHMIEHAQK (SEQ ID
4
Uncategorized







mRNA-splicing

NO: 162)









factor SPF27










Q13867
BLMH 
203-218
GEISATQDVMMEEIFR
13
Enzymes







Bleomycin

(SEQ ID NO: 13)









hydrolase










Q13867
BLMH 
111-124
CYFFLSAFVDTAQR
14
Enzymes







Bleomycin

(SEQ ID NO: 12)









hydrolase










P35613
BSG Basigin
283-300
SELHIENLNMEADPGQYR (SEQ ID
13 14
Uncategorized









NO: 163)
4







P35613
BSG Basigin
228-243
SSEHINEGETAMLVCK (SEQ ID
2
Uncategorized









NO: 164)








Q4ZIN3
C19orf6
254-271
LLLDEFLGYDDILMSSVK (SEQ ID 
9
Uncategorized







Membralin

NO: 165)








Q07021
C1QBP
247-276
GVDNTFADELVELSTALEHQEYIT
13 14
Transcription







Complement

FLEDLK (SEQ ID N: 166)
3 9
factors,







component 1 Q



Regulators







subcomponent-











binding protein










Q07021
C1QBP
155-174
VEEQEPELTSTPNFVVEVIK(SEQ
13 14
Transcription







Complement

ID NO: 167)
3 9
factors,







component 1 Q



Regulators







subcomponent-











binding protein










Q07021
C1QBP
105-119
MSGGWELELNGTEAK (SEQ ID
9
Transcription







Complement

NO: 168)

factors,







component 1 Q



Regulators







subcomponent-











binding protein










Q07021
C1QBP
181-207
ALVLDCHYPEDEVGQEDEAESDIF
13 9
Transcription







Complement

SIR (SEQ ID NO: 169)

factors,







component 1 Q



Regulators







subcomponent-











binding protein










Q07021
C1QBP
81-91
AFVDFLSDEIK (SEQ ID NO: 170)
9
Transcription







Complement



factors,







component 1 Q



Regulators







subcomponent-











binding protein










Q07021
C1QBP
129-154
ITVTFNINNSIPPTFDGEEEPSQGQ
9
Transcription







Complement

K (SEQ ID NO: 171)

factors,







component 1 Q



Regulators







subcomponent-











binding protein










Q07021
C1QBP
208-220
EVSFQSTGESEWK (SEQ ID NO:
3 9
Transcription







Complement

172)

factors,







component 1 Q



Regulators







subcomponent-











binding protein










P62158
CALM3
128-149
EADIDGDGQVNYEEFVQMMTAK
13
Adapter,







Calmodulin

(SEQ ID NO: 173)

Scaffolding,











Modulator











Proteins






P62158
CALM3
39-75
SLGQNPTEAELQDMINEVDADGN
14
Adapter,







Calmodulin

GTIDFPEFLTMMAR (SEQ ID NO:

Scaffolding,









174)

Modulator











Proteins






P27797
CALR
323-351
SGTIFDNFLITNDEAYAEEFGNET
13 9
Chaperones







Calreticulin

WGVTK
6










(SEQ ID NO: 14)








P27797
CALR
99-111
HEQNIDCGGGYVK
6
Chaperones







Calreticulin

(SEQ ID NO: 15)








P27824
CANX Calnexin
235-274
THLYTLILNPDNSFEILVDQSVVN
6
Chaperones









SGNLLNDMTPPVNPSR (SEQ ID











NO: 175)








P07384
CAPN1
175-193
LVFVHSAEGNEFWSALLEK
14
Enzymes







Calpain-1

(SEQ ID NO: 16)









catalytic subunit










Q96A33
CCDCC47
197-212
LNQENEHIYNLWCSGR (SEQ ID
4 2
Uncategorized







Coiled-coil

NO: 177)









domain-











containing











protein 47










Q96A33
CCDCC47
375-392
DMEALLPLMNMVIYSIDK (SEQ
6
Uncategorized







Coiled-coil

ID NO: 178)









domain-











containing











protein 47










Q96ER9
CCDCC51
86-96
YEEFVGLNEVR (SEQ ID NO: 179)
14
Uncategorized







Coiled-coil











domain-











containing











protein 51










P78371
CCT2 T-
294-322
QLIYNYPEQLFGAAGVMAIEHAD
14
Chaperones







complex protein

FAGVER (SEQ ID NO: 180)









1 subunit beta










P78371
CCT2 T-
502-516
QVLLSAAEAAEVILR (SEQ ID NO: 
14 3
Chaperones







complex protein

181)









1 subunit beta










P78371
CCT2 T-
90-111
VQDDEVGDGTTSVTVLAAELLR
14
Chaperones







complex protein

(SEQ ID NO: 182)









1 subunit beta










P49368
CCT3 T-
439-449
AVAQALEVIPR (SEQ ID NO: 183)
14
Chaperones







complex protein











1 subunit











gamma










P49368
CCT3 T-
86-127
TQDEEVGDGTTSVIILAGEMLSVA
14
Chaperones







complex protein

EHFLEQQMHPTVVISAYR (SEQ ID









1 subunit

NO: 184)









gamma










P50991
CCT4 T-
175-193
VVSQYSSLLSPMSVNAVMK (SEQ
2
Chaperones







complex protein

ID NO: 185)









1 subunit delta










P50991
CCT4 T-
453-481
AFADAMEVIPSTLAENAGLNPIST
14 4 2
Chaperones







complex protein

VTELR (SEQ ID NO: 186)









1 subunit delta










P48643
CCT5 T-
294-323
ETGANLAICQWGFDDEANHLLLQ
6
Chaperones







complex protein

NLPAVR (SEQ ID NO: 187)









1 subunit epsilon










P48643
CCT5 T-
324-340
WVGGPEIELIAIATGGR (SEQ ID
14 3
Chaperones







complex protein

NO: 188)
13 6








1 subunit epsilon










P48643
CCT5 T-
450-478
AFADALEVIPMALSENSGMNPIQT
14 6
Chaperones







complex protein

MTEVR (SEQ ID NO: 189)









1 subunit epsilon










P48643
CCT5 T-
97-126
SQDDEIGDGTTGVVVLAGALLE
13 14
Chaperones







complex protein

AEQLLDR (SEQID NO: 190)
6 9








1 subunit epsilon










P40227
CCT6A T-
400-424
NAIDDGCVVPGAGAVEVAMAEA
9
Chaperones







complex protein

LIK (SEQ ID NO: 191)









1 subunit zeta










Q99832
CCT7 T-
85-106
SQDAEVGDGTTSVTLLAAEFLK
13
Chaperones







complex protein

(SEQ ID NO: 192)









1 subunit eta










P50990
CCT8 T-
441-450
FAEAFEAIPR (SEQ ID NO: 193)
8
Chaperones







complex protein











1 subunit theta










Q16543
CDC37 Hsp90
287-307
LGPGGLDPVEVYESLPEELQK
8
Chaperones







co-chaperone

(SEQ ID NO: 194)









Cdc37










Q96JB5
CDK5RAP3
351-367
NQFLDELMELEIFLAQR (SEQ ID
3
Adapter,







CDK5

NO: 195)

Scaffolding,







regulatory



Modulator







subuni-



Proteins







associated











protein 3










Q07065
CKAP4
312-326
STLQTMESDIYTEVR (SEQ ID NO:
13 14
Adapter,







Cytoskeleton-

196)
9 8
Scaffolding,







associated 



Modulator







protein 4



Proteins






P12277
CKB Creatine
224-236
TFLVWVNEEDHLR
3
Enzymes







kinase B-type

(SEQ ID NO: 19)








P12277
CKB Creatine
342-358
LGFSEVELVQMVVDGVK
3 13
Enzymes







kinase B-type

(SEQ ID NO: 21)








P12277
CKB Creatine
367-381
LEQGQAIDDLMPAQK
13
Enzymes







kinase B-type

(SEQ ID NO: 22)








P12277
CKB Creatine
14-32
FPAEDEFPDLSAHNNHMAK
3
Enzymes







kinase B-type

(SEQ ID NO: 17)








P12277
CKB Creatine
157-172
LAVEALSSLDGDLAGR
13
Enzymes







kinase B-type

(SEQ ID NO: 18)








P12277
CKB Creatine
253-265
FCTGLTQIETLFK
13
Enzymes







kinase B-type

(SEQ ID NO: 20)








P12532
CKMT1B
257-269
SFLIWVNEEDHTR
3
Enzymes







Creatine kinase

(SEQ ID NO: 23)









U-type,











motochondrial










O75503
CLN5 Ceroid-
74-96
YTFCPTGSPIPVMEGDDDIEVFR
9
Uncategorized







lipofuscinosis

(SEQ ID NO: 197)









neuronal protein











5










Q9H078
CLPB
630-650
VVNQLAAAYEQDLLPGGCTLR
14
Enzymes







Caseinolytic

(SEQ ID NO: 198)









peptidase B











protein homolog










Q16740
CLPP Putative
215-226
QSLQVIESAMER
6
Enzymes







ATP-dependent

(SEQ ID NO: 24)









Clp protease











proteolytic











subunit










O96005
CLPTM1 Cleft
325-346
SPWNFLGDELYEQSDEEQDSVK
13 14
Uncategorized







lip and palate

(SEQ ID NO: 199)
2 6








transmembrane











protein 1










O96005
CLPTM1 Cleft
548-562
ALNTFIDDLFAFVIK (SEQ ID NO:
2
Uncategorized







lip and palate

200)









transmembrane











protein 1










P53618
COPB1
262-279
YEAAGTLVTLSSAPTAIK (SEQ ID
13
Channels,







Coatomer

NO: 201)

Transporters,







subunit beta



Receptors






Q9BT78
COPS4 COP9
154-170
LYLEDDDPVQAEAYINR (SEQ ID
13 15
Uncategorized







signalosome

NO: 202)









complex subunit











4










Q5HYK3
COQ5 2-
258-279
LYDLYSFQVIPVLGEVIAGDWK
14 2
Enzymes







methoxy-6-

(SEQ ID NO: 203)









polyprenyl-1,4-











benzoquinol











methylase,










Q7KZN9
COX15
296-313
MGESWIPEDLFTFSPILR (SEQ ID
14
Uncategorized







Cytochrome

NO: 204)









coxidase











assembly protein











COX15 homolo










P20674
COX5A
73-87
GINTLVTYDMVPEPK (SEQ ID NO:
13 14
Adapter,







Cytochrome

205)
3 2 9
Scaffolding,







coxidase subunit



Modulator







5A,



Proteins







mitochondrial










P23786
CPT2 Carnitine
363-382
DGSTAVHFEHSWGDGVAVLR
15 13
Enzymes







O-

(SEQ ID NO: 206)









palmitoyltransfe











rase 2,











mitochondrial










P23786
CPT2 Carnitine
478-495
QYGQTVATYESCSTAAFK (SEQ
4
Enzymes







O-

ID NO: 207)









palmitoyltransfe











rase 2,











mitochondrial










Q9H3G5
CPVL Probable
281-292
QNWFEAFEILDK (SEQ ID NO: 
4 9
Enzymes







serine

208)









carboxypeptidase











CPVL










Q9H3G5
CPVL Probable
320-331
CTEPEDQLYYVK (SEQ ID NO:
13 9
Enzymes







serine

209)









carboxypeptidase











CPVL










Q9H3G5
CPVL Probable
195-208
NNDFYVTGESYAGK (SEQ ID NO:
9
Enzymes







serine

210)









carboxypeptidase











CPVL










P55060
CSEIL
32-52
FLESVEGNQNYPLLLLTLLEK
14 3
Channels,







Exportin-2

(SEQ ID NO: 211)

Transporters,











Receptors






P55060
CSEIL
396-418
FFEGPVTGIFSGYVNSMLQEYAK
14
Channels,







Exportin-2

(SEQ ID NO: 212)

Transporters,











Receptors






P48729
CSNK1A1
84-106
DYNVLVMDLLGPSLEDLFNFCSR
14
Enzymes







Casein kinase I

(SEQ ID NO: 25)









isoform alpha


14
Uncategorized






P67870
CSNK2B Casein
112-134
VYCENQPMLPIGLSDIPGEAMVK









kinase II subunit

(SEQ ID NO: 26)









beta










Q12996
CSTF3
440-464
YGDIPEYVLAYIDYLSHLNEDNNT
13
Uncategorized







Cleavage

R (SEQ ID NO: 213)









stimulation











factor subunit 3










Q12996
CSTF3
319-330
LFSDEAANIYER 
13 14
Uncategorized







Cleavage

(SEQ ID NO: 214)









stimulation











factor subunit 3










P35222
CTNNB1
648-661
NEGVATYAAAVLFR (SEQ ID NO: 
14 13
Adapter







Catenin beta-1

215)

Scaffolding,











Modulator











Proteins






P07858
CTSB
315-331
GQDHCGIESEVVAGIPR
13 4 2
Enzymes







Cathepsin B

(SEQ ID NO: 27)
9







P07339
CTSD Cathespin
236-253
DPDAQPGGELMLGGTDSK
9
Enzymes







D

(SEQ ID NO: 28)








P07339
CTSD Cathespin
288-309
EGCEAIVDTGTSLMVGPVDEVR
13 14
Enzymes







D

(SEQ ID NO: 29)
15 4 6











9 8







P07339
CTSD Cathespin
314-331
AIGAVPLIQGEYMIPCEK
14 15
Enzymes







D

(SEQ ID NO 30)
3 2 4











13 6 9











8







O43169
CYB5B
138-144
YYTSESK (SEQ ID NO: 216)
4 2
Adapter







Cytochrome b5



Scaffolding,







type B



Modulator











Proteins






P00387
CYB5R3
235-241
LWYTLDR
3
Enzymes







NADH-

(SEQ ID NO: 31)









cytochrome b5











reductase 3










Q6UW02
CYP20A1
397-413
TFSSLGFSGTQCPELR (SEQ ID
14 4 3
Enzymes







Cytochrome

NO: 217)









P450 20A1










P61962
DCAF7 DDB1-
82-96
GVYPDLLATSGDYLR (SEQ ID
14
Uncategorized







and CUL4-

NO: 218)









associated factor











7










Q13561
DCTN2
380-395
ENLATVEGNFASIDER (SEQ ID 
13 6
Adapter







Dynactin

NO: 219)

Scaffolding,







subunit 2



Modulator











Proteins






Q9H773
DCTPP1 dCTP
90-109
AALQEELSDVLIYLVALAAR (SEQ
14 4
Enzymes







pyrophosphatase

ID NO: 220)









1










Q92841
DDX17
406-417
LIQLMEEIMAEK 
13 14
Transcription







Propable ATP-

(SEQ ID NO: 221)
2 9
factors,







dependent RNA



Regulators







helicase DDX17










Q92841
DDX17
536-547
VLEEANQAINPK 
3
Transcription







Propable ATP-

(SEQ ID NO: 222)

factors,







dependent RNA



Regulators







helicase DDX17










Q16698
DECR1 2, 4-
299-315
FDGGEEVLISGEFNDLR
6
Enzymes







dienoyl-CoA

(SEQ ID NO: 32)









reductase,











mitochondrial










Q15392
DHCR24
334-352
SIFWELQDIIPFGNNPIFR (SEQ ID
3 15 2
Enzymes







Delta(24)-sterol

NO: 223)









reductase










Q15392
DHCR24
428-444
GNEAELYIDIGAYGEPR (SEQ ID
13 14
Enzymes







Delta(24)-sterol

NO: 224)
8








reductase










Q9H2U1
DHX36
754-770
SDHLTVVNAFEGWEEAR (SEQ ID
6
Transcription







Probable ATP-

NO: 225)

factors,







dependent RNA



Regulators







helicase A










Q08211
DHX9 ATP-
448-456
ISAVSVAER
3
Transcription







dependent RNA

(SEQ ID NO: 33)

factors,







helicase A



Regulators






Q08211
DHX9 ATP-
121-141
AENNSEVGASGYGVPGPTWDR
8
Transcription







dependent RNA

(SEQ ID NO: 226)

factors,







helicase A



Regulators






Q9NR28
DIABLO Diablo 
124-140
MNSEEEDEVWQVIIGAR (SEQ ID
13
Uncategorized







homolog,

NO: 227)









mitochondrial










P09622
DLD
450-482
VLGAHILGPGAGEMVEAALALE
14 4
Enzymes







Dihydrolipoyl

YGASCEDIAR
13








dehydrogenase,

(SEQ ID NO: 34)









mitochondrial










Q9NVH1
DNAJC11 DnaJ
207-226
GWGELEFGAGDLQGPLFGLK
14 6
Chaperones







homolog

(SEQ ID NO: 228)









subfamily C











member 11










O00115
DNASE2
173-202
QLTYTYPWVYNYQLEGIFAQEFP
9
Enzymes







Deoxyribonucle

DLENVVK (SEQ ID NO: 229)









ase-2-alpha










P42892
ECE1
434-453
FCVSDTENNLGFALGPMFVK
14 13
Enzymes







Endothelin-

(SEQ ID NO: 230)









converting











enzyme 1










Q13011
ECH1
197-211
EVDVGLAADVGTLQR
13 14
Enzymes







Delta(3,5)-

(SEQ ID NO: 37)
15 3 4








Delta(2,4)-


6 8








dienoyl-CoA











isomerase,











mitochondrial










Q13011
ECH1
149-158
YQETFNVIER
6
Enzymes







Delta(3,5)-

(SEQ ID NO: 36)









Delta(2,4)-











dienoyl-CoA











isomerase,











mitochondrial










Q13011
ECH1
113-131
MFTAGIDLMDMASDILQPK
6
Enzymes







Delta(3,5)-

(SEQ ID NO: 35)









Delta(2,4)-











dienoyl-CoA











isomerase,











mitochondrial










Q9NTX5
ECHDC1
272-283
ELYLEEALQNER (SEQ ID NO:
9
Enzymes







Ethylmalonyl-

231)









CoA











decarboxylase










P68104
EEF1A1
135-146
EHALLAYTLGVK (SEQ ID NO:
13
Transcription







Elongation

232)

factors,







factor 1-alpha 1



Regulators






P26641
EEF1G
379-400
GQELAFPLSPDWQVDYESYTWR
13
Uncategorized







Elongation

(SEQ ID NO: 233)









factor 1-gamma










P26641
EEF1G
58-85
VPAFEGDDGFCVFESNAIAYYVS
3
Uncategorized







Elongation

NEELR (SEQ ID NO: 234)









factor 1-gamma










P13639
EEF2
457-481
YVEPIEDVPCGNIVGLVGVDQFLV
3
Transcription







Elongation

K (SEQ ID NO: 235)

factors,







factor 2



Regulators






P13639
EEF2
740-765
LMEPIYLVEIQCPEQVVGGIYGVL
3
Transcription







Elongation

NR (SEQ ID NO: 236)

factors,







factor 2



Regulators






P13639
EEF2
768-785
HVFEESQVAGTPMFVVK (SEQ
3
Transcription







Elongation

ID NO: 237)

factors,







factor 2



Regulators






P60228
EIF3E
173-191
LASEILMQNWDAAMEDLTR (SEQ
2
Uncategorized







Eukaryotic

ID NO: 238)









translation











initiation factor











3 submit










O00303
EIF3F
193-210
EAPNPIHLTVDTSLQNGR (SEQ ID
3 6
Enzymes







Eukaryotic

NO: 239)









translation











initiation factor











3 submit










O00303
EIF3F
279-297
IQDALSTVLQYAEDVLSGK (SEQ
3 9
Enzymes







Eukaryotic

ID NO: 240)









translation











initiation factor











3 submit










O15372
EIF3H
207-220
NSHLINVLMWELEK (SEQ ID NO:
2
Uncategorized







Eukaryotic

241)









translation











initiation factor











3 submit










Q9Y262
EIF3L
404-419
GDPQVYEELFSYSCPK (SEQ ID
13
Uncategorized







Eukaryotic

NO: 242)









translation











initiation factor











3 submit










Q9Y262
EIF3L
243-262
QLEVYTSGGDPESVAGEYGR
13 14
Uncategorized







Eukaryotic

(SEQ ID NO: 243)









translation











initiation factor











3 submit










P60842
EIF4A1
69-82
GYDVIAQAQSGTGK
14 13
Transcription







Eukaryotic

(SEQ ID NO: 39)
9
factors,







initiation factor



Regulators







4A-I



Transcription






P60842
EIF4A1
178-190
MFVLDEADEMLSR
13
factors,







Eukaryotic

(SEQ ID NO: 38)

Regulators







initiation factor



Transcription







4A-I



factors,






Q14240
EIF4A2
70-83
GYDVIAQAQSGTGK
13
Regulators







Eukaryotic

(SEQ ID NO: 40)

Transcription







initiation factor



factors,







4A-II



Regulators






Q15056
EIF4H
97-109
EALTYDGALLGDR (SEQ ID NO:
9
Transcription







Eukaryotic

244)

factors,







translation



Regulators







initiation factor



Transcription







4H










P55010
EIF5 Eukaryotic
274-288
AMGPLVLTEVLFNEK (SEQ ID
14 3 2
Transcription







translation

NO: 245)
4 13 6
factors,







initiation factor



Regulators







5










Q15717
ELAVL1
20-37
TNLIVNYLPQNMTQDELR (SEQ
13 4 2
Transcription







ELAV-like

ID NO: 246)

factors,







protein 1



Regulators






Q9NXB9
ELOVL2
42543
AFDDEINAFLDNMFGPR (SEQ ID
14 9
Enzymes







Elongation of 

NO: 247)









very long chain 











fatty acids











protein










P50402
EMD Emerin
212-221
APGAGLGQDR (SEQ ID NO: 248)
4
Adapter,











Scaffolding,











Modulator











Proteins






P50402
EMD Emerin
89-103
GYNDDYYEESYFTTR (SEQ ID
6 8
Adapter,









NO: 249)

Scaffolding,











Modulator











Proteins






P07099
EPHX1 Epoxide
329-338
FSTWTNTEFR (SEQ ID NO: 250)
3 6
Enzymes







hydolase 1










P84090
ERH Enhancer
18-34
TYADYESVNECMEGVCK(SEQ ID
13
Uncategorized







of rudimentary

NO: 251)









homolog










P38117
ETFB Electron
36-51
HSMNPFCEIAVEEAVR
3
Channels,







transfer

(SEQ ID NO: 41)

Transporters,







flavoprotein



Receptors







subunit beta










Q01844
EQSR1 RNA-
269-292
QDHPSSMGVYGQESGGFSGPGEN
2
Transcription







binding protein

R (SEQ ID NO: 252)

factors,







EWS



Regulators






Q9UQ84
EXO1
139-160
SQGVDCLVAPYEADAQLAYLNK
13 2
Enzymes







Exonuclease 1

(SEQ ID NO: 95)
6 9 8







Q96CS3
FAF2 FAS-
249-277
LEGLIQPDDLINQLTFIMDANQTY
6
Uncategorized







associated factor

LVSER (SEQ ID NO: 253)









2

WEYVPLGPFLGK (SEQ ID NO: 
14
Enzymes






P16930
FAH
242-253
254)









Fumarylacetoace











tase










Q9NRY5
FAM114A2
184-196
TMDVIAEGDPGFK (SEQ ID NO:
14
Uncategorized







Protein

255)









FAM114A2










Q9NSD9
FARSB
72-82
YDLLCLEGLVR (SEQ ID NO: 256)
9
Enzymes







Phenylalanini--











tRNA ligase











beta subunit










Q9NSD9
FARSB
518-530
IMQLLDVPPGEDK (SEQ ID NO: 
2
Enzymes







Phenylalanini--

257)









tRNA ligase











beta subunit










P49327
FASN Fatty
1350-1383
GHPLGDIVAFLTSTEPQYGQGILS
14 13
Enzymes







acid synthase

QDAWESLFSR (SEQ ID NO: 258)








P37268
FDFT1
78-92
ALDTLEDDMTISVEK (SEQ ID NO:
15
Enzymes







Squalene

259)









synthase










P22830
FECH
254-272
SEVVILFSAHSLPMSVVNR
4
Enzymes







Ferrochelatase,

(SEQ ID NOW: 42)









mitochondrial










O95684
FGFR1OP
39-50
AAVFLALEEQEK (SEQ ID NO:
14 13
Adapter,







FGFR1

260)
8
Scaffolding,







oncogene



Modulator







partner



Proteins






Q96AY3
FKBP10
198-212
GGTYDTYVGSGWLIK (SEQ ID
13
Enzymes







Peptidyl-prolyl

NO: 261)









cis-trans











isomerase











FKBP10










Q02790
FKBP4
190-206
FEIGEGENLDLPYGLER (SEQ ID
13
Chaperones







Peptidyl-prolyl

NO: 262)









cis-trans











isomerase











FKBP4










Q96AE4
FUBP1 Far
593-620
MGQAVPAPTGAPPGGQPDYSAA
14
Transcription







upstream

WAEYYR (SEQ ID NO: 263)

factors,







element-binding



Regulators







protein 1










Q96AE4
FUBP1 Far
272-284
IGGNEGIDVPIPR (SEQ ID NO: 
6
Transcription







upstream

264)

factors,







element-binding



Regulators







protein 1










P35637
FUS RNA-
335-348
GEATVSFDDPPSAK (SEQ ID NO: 
2
Transcription







binding protein 

265)

factors,







FUS



Regulators






P10253
GAA Lysosomal
855-870
GELFWDDGESLEVLER (SEQ ID
9
Enzymes







alpha-

NO: 266)









glucosidase










P50395
GDI2 Rab GDP
119-137
VPSTEAEALASSLMGLFEK (SEQ
13 14
Uncategorized







dissociation

ID NO: 267)









inhibitor beta










P50395
GDI2 Rab GDP
222-240
SPYLYPLYGLGELPQGFAR (SEQ
3 13
Uncategorized







dissociation

IN NO: 268)









inhibitor beta










Q9H3K2
GHITM Growth
218-240
AAWYTAGIVGGLSTVAMCAPSEK
14
Uncategorized







hormone-

(SEQ ID NO: 269)









inducible











transmembrane











protein










P06280
GLA Alpha-
241-252
SILDWTSFNQER
9
Enzymes







galactosidase A

(SEQ ID NO: 43)








P06280
GLA Alpha-
68-82
LFMEMAELMVSEGWK
4
Enzymes







galactosidase A

(SEQ ID NO: 45)








P06280
GLA Alpha-
50-67
FMCNLDCQEEPDSCISEK
9
Enzymes







galactosidase A

(SEQ ID NO: 44)








P16278
GLB1 Beta-
286-299
TEAVASSLYDILAR
9
Enzymes







galactosidase

(SEQ ID NO: 46)








Q04760
GLO1
160-179
GLAFIQDPDGYWIEILNPNK
14 3
Enzymes







Lactoylglutathio

(SEQ ID NO: 47)









ne lyase










Q9HC38
GLOD4
71-96
TMVGFGPEDDHFVAELTYNYGV
4 13
Uncategorized







Glyoxalase

GDYK (SEQ ID NO: 270)









domain-











containing











protein 4










P00367
GLUD1
481-496
HGGTIPIVPTAEFQDR
6
Enzymes







Glutamate

(SEQ ID NO: 49)









dehydrogenase











1, mitochondrial










P00367
GLUD1
152-162
YSTDVSVDEVK
6
Enzymes







Glutamate

(SEQ ID NO: 48)









dehydrogenase











1, mitochondrial










P49448
GLUD2
152-162
YSTDVSVDEVK
6
Enzymes







Glutamate

(SEQ ID NO: 48)









dehydrogenase











2, mitochondrial










Q9H4A6
GOLPH3 Golgi
75-90
EGYTSFWNDCISSGLR
14
Adapter,







phophoprotein

(SEQ ID NO: 50)

Scaffolding,







3



Modulator











Proteins






Q9BQ67
GRWD1
183-198
LLQVVEEPQALAAFLR (SEQ ID
3
Uncategorized







Glutamate-rich

NO: 271)









WD repeat-











containing











protein 1










Q9BQ67
GRWD1
263-287
SVEDLQWSPTENTVFASCSADASI
13
Uncategorized







Glutamate-rich

R (SEQ ID NO: 272)









WD repeat-











containing











protein 1










P09211
GSTP1
56-71
FQDGDLTLYQSNTILR
2
Enzymes







Glutathione S-

(SEQ ID NO: 51) 









transferase P










P0C0S5
H2AFZ Histone
47-75
VGATAAVYSAAILEYLTAEVLEL
3
Transcription







H2A.Z

AGNASK (SEQ ID NO: 273)

factors,











Regulators






Q16836
HADH
250-271
LGAGYPMGPFELLDYVGLDTTK
13 2
Enzymes







Hydroxyacyl-

(SEQ ID NO: 274)









coenzyme A











dehydrogenase,











mitochondrial










P40939
HADHA
112-125
TLQEVTQLSQEAQR (SEQ ID NO:
4 8
Enzymes







Trifunctional

275)









enzyme subunit











alpha,











mitochondrial










P12081
HARS
170-193
EFYQCDFDIAGNFDPMIPDAECLK
15 14
Enzymes







Histidine--tRNA

(SEQ ID NO: 276)
4








ligase,











cytoplasmic










Q96CS2
HAUS1 HAUS
94-108
YLNALVDSAVALETK (SEQ ID
14
Adapter,







augmin-like

NO: 277)

Scaffolding,







complex subunit



Modulator







1



Proteins






Q9NVX0
HAUS2 HAUS
173-189
MDILVTETEELAENILK (SEQ ID
14
Adapter,







augmin-like

NO: 278)

Scaffolding,







complex subunit



Modulator







2



Proteins






P69905
HBA2
18-32
VGAHAGEYGAEALER
4
Adapter,







Hemoglobin

(SEQ ID NO: 52)

Scaffolding,







subunit alpha



Modulator











Proteins






P69905
HBA2
94-100
VDPVNFK
4
Adapter,







Hemoglobin

(SEQ ID NO: 53)

Scaffolding,







subunit alpha



Modulator











Proteins






P53701
HCCS
200-210
SWMGYELPFDE (SEQ ID NO: 279)
4
Enzymes







Cytochrome c-











type heme lyase










Q7Z4Q2
HEATR3 HEAT
224-250
SFSATALNMLESALLSPVSSMESL
4 2
Uncategorized







repeat-

LLK (SEQ ID NO: 280)









containing











protein 3










P06865
HEXA Beta-
489-499
LTSDLTFAYER 
9
Enzymes







hexosaminidase

(SEQ ID NO: 54)









subuit alpha










Q6NVY1
HIBACH 3-
238-252
ENIASVLENYHTESK (SEQ ID NO:
6
Enzymes







hydroxyisobutyr

281)









yl-CoA











hydrolase,











mitochondrial










P16403
HIST1H1C
65-75
ALAAAGYDVEK (SEQ ID NO: 282)
8
Transcription







Histone H1.2



factors,











Regulators






P01892
HLA-A HLAC
46-59
FIAVGYVDDTQFVR (SEQ ID NO:
14
Uncategorized







class I

283)









histocompatibility











antigen, A-2











alpha










Q8TCT9
HM13 Minor
62-73
NASDMPETITSR (SEQID NO: 284)
13 14
Enzymes







histocompatibility


2 4 8








antigen H13










P30519
HMOX2 Heme
48-55
AENTQFVK
15 14
Enzymes







oxygenase 2

(SEQ ID NO: 55)
3 4 2











6 8







P03519
HMOX2 Heme
69-87
LATTALYFTYSALEEEMER
14
Enzymes







oxygenase 2

(SEQ ID NO: 56)








P09651
HNRNPA1
353-370
NQGGYGGSSSSSSYGSGR (SEQ ID
13 14
Channels,







Heterogeneous

NO: 285)
3 2 9
Transporters,







nuclear



Receptors







ribonucleoprotein











A1










P09651
HNRNPA1
337-350
SSGPYGGGGQYGAK (SEQ ID NO:
3 2
Channels,







Heterogeneous

286)

Transporters,







nuclear



Receptors







ribonucleoprotein











A1










P09651
HNRNPA1
56-75
GFGFVTYATVEEVDAAMNAR
3
Channels,







Heterogeneous

(SEQ ID NO: 287)

Transporters,







nuclear



Receptors







ribonucleoprotein











A1










P09651
HNRNPA1
233-265
GGGGYGGSGDGYNGFGNDGGYG
8
Channels,







Heterogeneous

GGGPGYSGGSR (SEQ ID NO: 288)

Transporters,







nuclear



Receptors







ribonucleoprotein











A1










P09651
HNRNPA1
16-31
LFIGGLSFETTDESLR (SEQ ID NO:
14 3 2
Channels,







Heterogeneous

289)

Transporters,







nuclear



Receptors







ribonucleoprotein











A1










P09651
HNRNPA1
131-140
IEVIEIMTDR (SEQ ID NO: 290)
3 9 8
Channels,







Heterogeneous



Transporters,







nuclear



Receptors







ribonucleoprotein











A1










Q32P51
HNRNPA1L2
285-298
SSGPYGGGGQYFAK (SEQ ID NO:
3 2 4
Channels,







Heterogeneous

291)

Transporters,







nuclear



Receptors







ribonucleoprotein











A1-like 2




















Q32P51
HNRNPA1L2
131-140
IEVIEIMTDR (SEQ ID NO: 292)
3 4 9
Channels,

















Heterogeneous



Transporters,







nuclear



Receptors







ribonucleoprotein











A1-like 2










Q32P51
HNRNPA1L2
16-31
LFIGGLSFETTDESLR (SEQ ID NO:
14 3 4
Channels,







Heterogeneous

293)
2
Transporters,







nuclear



Receptors







ribonucleoprotein











A1-like 2










P22626
HNRNPA2L1
130-137
DYFEEYGK (SEQ ID NO: 294)
6
Channels,







Heterogeneous



Transporters,







nuclear



Receptors







ribonucleoproteins











A2/B1










P22626
HNRNPA2L1
138-147
IDTIEIITDR (SEQ ID NO: 295)
13
Channels,







Heterogeneous



Transporters,







nuclear



Receptors







ribonucleoproteins











A2/B1










P22626
HNRNPA2L1
191-200
QEMQEVQSSR (SEQ ID NO: 296)
6
Channels,







Heterogeneous



Transporters,







nuclear



Receptors







ribonucleoproteins











A2/B1










P22626
HNRNPA2L1
229-238
GGSDGYGSGR (SEQ ID NO: 297)
3 6
Channels,







Heterogeneous



Transporters,







nuclear



Receptors







ribonucleoproteins











A2/B1










P22626
HNRNPA2L1
239-266
GFGDGYNGYGGGPGGGNFGGSP
13 14
Channels,







Heterogeneous

GYGGGR (SEQ ID NO: 298)
3 2 6
Transporters,







nuclear


8
Receptors







ribonucleoproteins











A2/B1










P22626
HNRNPA2L1
23-38
LFIGGLSFETTEESLR (SEQ ID NO:
13 3 2
Channels,







Heterogeneous

299)
6 9
Transporters,







nuclear



Receptors







ribonucleoproteins











A2/B1










P22626
HNRNPA2L1
267-317
GGYGGGGPGYGNQGGGYGGGY
13 2
Channels,







Heterogeneous

DNYGGGNYGSGNYNDFGNYNQQ
9 6
Transporters,







nuclear

PSNYPGMK (SEQ ID NO: 300)

Receptors







ribonucleoproteins











A2/B1










P22626
HNRNPA2L1
326-350
NMGGPYGGGNYGPGGSGGSGGY
14 3 2
Channels,







Heterogeneous

GGR (SEQ ID NO: 301)
13 8 6
Transporters,







nuclear



Receptors







ribonucleoproteins











A2/B1










P22626
HNRNPA2L1
42472
TLETVPLER (SEQ ID NO: 302)
6
Channels,







Heterogeneous



Transporters,







nuclear



Receptors







ribonucleoproteins











A2/B1










P51991
HNRNPA3
152-161
IETIEVMEDR (SEQ ID NO: 303)
9 6
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











A3

SSGSPYGGGYGSGGGSGGYGSR
13 14
Transcription






P51991
HNRNPA3
355-376
(SEQ ID NO: 304)
3 2 4
factors,







Heterogeneous


6
Regulators







nuclear











ribonucleoprotein











A3










P51991
HNRNPA3
37-52
LFIGGLSFETTDDSLR (SEQ ID NO:
4
Transcription







Heterogeneous

305)

factors,







nuclear



Regulators







ribonucleoprotein











A3










P07910
HNRNP C
100-130
SAAEMYGSVTEHPSPSPLLSSSFD
13 4
Transcription







Heterogeneous

LDYDFQR (SEQ ID NO: 306)

factors,







nuclear



Regulators







ribonucleoproteins











C1/C2










P07910
HNRNPC
136-142
MYSYPAR (SEQ ID NO: 307)
4 3
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoproteins











C1/C2










P07910
HNRNP C
51-61
GFAFVQVYVNER 
2 13
Transcription







Heterogeneous

(SEQ ID NO: 308)

factors,







nuclear



Regulators







ribonucleoproteins











C1/C2










P07910
HNRNPC
65-73
AAVAGEDGR (SEQ ID NO: 309)
4
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoproteins











C1/C2










P07910
HNRNP C
74-89
MIAGQVLDINLAAEPK (SEQ ID
4 3 2
Transcription







Heterogeneous

NO: 310)
13
factors,







nuclear



Regulators







ribonucleoproteins











C1/C2










Q14103
HNRNP D
184-197
IFVGGLSPDTPEEK (SEQ ID NO:
13 6
Transcription







Heterogeneous

311)

factors,







nuclear



Regulators







ribonucleoprotein











D0










P52597
HNRNPF
151-167
ITGEAFVQFASQELAEK (SEQ ID
4 2 13
Transcription







Heterogeneous

NO: 312)
9
factors,







nuclear



Regulators







ribonucleoprotein











F










P52597
HNRNPF
53-68
QSGEAFVELGSEDDVK (SEQ ID
6
Transcription







Heterogeneous

NO: 313)

factors,







nuclear



Regulators







ribonucleoprotein











F










P52597
HNRNP F
99-114
HSGPNSADSANDGFVR (SEQ ID
6
Transcription







Heterogeneous

NO: 314)

factors,







nuclear



Regulators







ribonucleoprotein











F










P52597
HNRNPF
125-150
EEIVQFFSGLEIVPNGITLPVDPEG
3 6
Transcription







Heterogeneous

K (SEQ ID NO: 315)

factors,







nuclear



Regulators







ribonucleoprotein











F










P52597
HNRNP F
300-316
ATENDIYNFFSPLNPVR (SEQ ID
13 3 2
Transcription







Heterogeneous

NO: 316)
4 6
factors,







nuclear



Regulators







ribonucleoprotein











F










P52597
HNRNPF
317-326
VHIEIGPDGR (SEQ ID NO: 317)
6
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











F










P31943
HNRNPH1
125-150
EEIVQFFSGLEIVPNGITLPVDFQG
2 6
Transcription







Heterogeneous

R (SEQ ID NO: 318)

factors,







nuclear



Regulators







ribonucleoprotein











H










P31943
HNRNPH1
151-167
STGEAFVQFASQEIAEK (SEQ ID
13 14
Transcription







Heterogeneous

NO: 319)
3 6 8
factors,







nuclear



Regulators







ribonucleoprotein











H










P31943
HNRNPH1
234-259
GAYGGGYGGYDDYNGYNDGYG
3 2 13
Transcription







Heterogeneous

FGSDR (SEQ ID NO: 320)
6 8
factors,







nuclear



Regulators







ribonucleoprotein











H










P31943
HNRNPH1
263-275
DLNYCFSGMSDHR (SEQ ID NO:
6
Transcription







Heterogeneous

321)

factors,







nuclear



Regulators







ribonucleoprotein











H










P31943
HNRNPH1
276-294
YGDGGSTFQSTTGHCVHMR (SEQ
6
Transcription







Heterogeneous

ID NO: 322)

factors,







nuclear



Regulators







ribonucleoprotein











H










P31943
HNRNPH1
300-316
ATENDIYNFFSPLNPVR (SEQ ID
13 14
Transcription







Heterogeneous

NO: 323)
3 2 6
factors,







nuclear



Regulators







ribonucleoprotein











H










P31943
HNRNPH1
317-326
VHIEIGPDGR (SEQ ID NO: 324)
3 6
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











H










P31943
HNRNPH1
356-375
YVELFLNSTAGASGGAYEHR
3 6
Transcription







Heterogeneous

(SEQ ID NO: 325)

factors,







nuclear



Regulators







ribonucleoprotein











H










P31943
HNRNPH1
99-114
HTGPNSPDTANDGFVR (SEQ ID
6
Transcription







Heterogeneous

NO: 326)

factors,







nuclear



Regulators







ribonucleoprotein











H










P55795
HNRNPH2
151-167
STGEAFVQFASQEIAEK (SEQ ID
13 14
Transcription







Heterogeneous

NO: 327)
3 8 6
factors,







nuclear



Regulators







ribonucleoprotein











H2










P55795
HNRNPH2
234-259
GAYGGGYGGYDDYGGYNDGYG
8 6
Transcription







Heterogeneous

FGSDR (SEQ ID NO: 328)

factors,







nuclear



Regulators







ribonucleoprotein











H2










P55795
HNRNPH2
263-275
DLNYCFSGMSDHR (SEQ ID NO:
6
Transcription







Heterogeneous

329)

factors,







nuclear



Regulators







ribonucleoprotein











H2










P55795
HNRNPH2
300-316
ATENDIYNFFSPLNPMR (SEQ ID
6
Transcription







Heterogeneous

NO: 330)

factors,







nuclear



Regulators







ribonucleoprotein











H2










P55795
HNRNPH2
317-326
VHIEIGPDGR (SEQ ID NO: 331)
6
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











H2










P55795
HNRNPH2
99-114
HTGPNSPDTANDGFVR (SEQ ID 
6
Transcription







Heterogeneous

NO: 332)

factors,







nuclear



Regulators







ribonucleoprotein











H2










P31942
HNRNPH3
139-169
GGDGYDGGYGGFDDYGGYNNY
6
Transcription







Heterogeneous

GYGNDGFDDR (SEQ ID NO: 333)

factors,







nuclear



Regulators







ribonucleoprotein











H3










P31942
HNRNPH3
206-222
ATENDIANFFSPLNPIR (SEQ ID
2 4 6
Transcription







Heterogeneous

NO: 334)

factors,







nuclear



Regulators







ribonucleoprotein











H3










P31942
HNRNPH3
262-287
YIELFLNSTPGGGSGMGGSGMGG
14 4 2
Transcription







Heterogeneous

YGR (SEQ ID NO: 335)
6
factors,







nuclear



Regulators







ribonucleoprotein











H3










P31942
HNRNPH3
288-301
DGMDNQGGYGSVGR (SEQ ID
8 6
Transcription







Heterogeneous

NO: 336)

factors,







nuclear



Regulators







ribonucleoprotein











H3










P31942
HNRNPH3
324-343
GGGGSGGYYGQGGMSGGGWR
2
Transcription







Heterogeneous

(SEQ ID NO: 337)

factors,







nuclear



Regulators







ribonucleoprotein











H3










P31942
HNRNPH3
56-67
STGEAFVQFASK 
6
Transcription







Heterogeneous

(SEQ ID NO: 338)

factors,







nuclear



Regulators







ribonucleoprotein











H3










P61978
HNRNP K
423-433
IDEPLEGSEDR (SEQ ID NO: 339)
4
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











K










P61978
HNRNPK
397-405
DLAGSIIGK (SEQ ID NO: 340)
4 3
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











K










P61978
HNRNPK
415-422
HESGASIK (SEQ ID NO: 341)
4 3 13
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











K










P61978
HNRNP K
434-456
IITITGTQDQIQNAQYLLQNSVK
13 14
Transcription







Heterogeneous

(SEQ ID NO: 342)
3 2 4
factors,







nuclear


9 8
Regulators







ribonucleoprotein











K










P61978
HNRNPK
70-86
TDYNASVSVPDSSGPER (SEQ ID
8 4
Transcription







Heterogeneous

NO: 343)

factors,







nuclear



Regulators







ribonucleoprotein











K










P61978
HNRNP K
87-102
ILSISADIETIGEILK (SEQ ID NO:
4
Transcription







Heterogeneous

344)

factors,







nuclear



Regulators







ribonucleoprotein











K










P61978
HNRNPK
104-139
IIPTLEEGLQLPSPTATSQLPLESDA
13 14
Transcription







Heterogeneous

VECLNYQHYK (SEQ ID NO: 345)
3 4 2
factors,







nuclear



Regulators







ribonucleoprotein











K










P61978
HNRNP K
180-191
LFQECCPHSTDR 
13
Transcription







Heterogeneous

(SEQ ID NO: 346)

factors,







nuclear



Regulators







ribonucleoprotein











K










P61978
HNRNPK
208-219
IILDLISESPIK 
14 4 2
Transcription







Heterogeneous

(SEQ ID NO: 347)
13 9
factors,







nuclear



Regulators







ribonucleoprotein











K










P61978
HNRNP K
222-246
AQPYDPNFYDETYDYGGFTMMF
4
Transcription







Heterogeneous

DDR (SEQ ID NO: 348)

factors,







nuclear



Regulators







ribonucleoprotein











K










P61978
HNRNPK
279-286
DYDDMSPR (SEQ ID NO: 349)
4
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











K










P61978
HNRNP K
317-325
GGDLMAYDR (SEQ ID NO: 350)
2
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











K










P61978
HNRNPK
378-396
GSYGDLGGPIITTQVTIPK (SEQ ID
14 3 2
Transcription







Heterogeneous

NO: 351)
4 13 9
factors,







nuclear


8
Regulators







ribonucleoprotein











K










P14866
HNRNP L
108-136
GLIDGVVEADLVEALQEFGPISYV
14 3 9
Transcription







Heterogeneous

VVMPK (SEQ ID NO: 352)

factors,







nuclear



Regulators







ribonucleoprotein











L










P14866
HNRNPL
399-411
VFNVFCLYGNVEK (SEQ ID NO:
2
Transcription







Heterogeneous

353)

factors,







nuclear



Regulators







ribonucleoprotein











L










P14866
HNRNPL
47-56
YYGGGSEGGR (SEQ ID NO: 354)
3
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











L










P52272
HNRNPM
346-362
MGGMEGPFGGGMENMGR (SEQ
14 2 6
Transcription







Heterogeneous

ID NO: 355)

factors,







nuclear



Regulators







ribonucleoprotein











M










P52272
HNRNPM
532-543
MVPAGMGAGLER (SEQ ID NO: 
6
Transcription







Heterogeneous

356)

factors,







nuclear



Regulators







ribonucleoprotein











M










P52272
HNRNPM
202-214
LGSTVFVANLDYK (SEQ ID NO:
6
Transcription







Heterogeneous

357)

factors,







nuclear



Regulators







ribonucleoprotein











M










P52272
HNRNPM
323-345
GIGMGNIGPAGMGMEGIGFGINK
3 2 6
Transcription







Heterogeneous

(SEQ ID NO: 358)

factors,







nuclear



Regulators







ribonucleoprotein











M










P52272
HNRNPM
437-443
MGLVMDR(SEQ ID NO: 359)
6
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











M










P52272
HNRNPM
457-471
MGPLGLDHMASSIER (SEQ ID 
3 6
Transcription







Heterogeneous

NO: 360)

factors,







nuclear



Regulators







ribonucleoprotein











M










P52272
HNRNPM
544-550
MGPVMDR (SEQ ID NO: 361)
6
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











M










P52272
HNRNPM
551-557
MATGLER (SEQ ID NO: 362)
6
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











M










P52272
HNRNPM
571-578
MGANSLER (SEQ ID NO: 363)
6
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











M










P52272
HNRNPM
592-606
MGPAMGPALGAGIER (SEQ ID
2 6
Transcription







Heterogeneous

NO: 364)

factors,







nuclear



Regulators







ribonucleoprotein











M










P52272
HNRNPM
699-707
FESPEVAER (SEQ ID NO: 365)
6
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











M










P52272
HNRNPM
95-110
VGEVTYVELLMDAEGK (SEQ ID
13 14
Transcription







Heterogeneous

NO: 366)
3 2 6
factors,







nuclear


9
Regulators







ribonucleoprotein











M










P52272
HNRNPM
113-120
GCAVVEFK (SEQ ID NO: 367)
6
Transcription







Heterogeneous



factors,







nuclear



Regulators







ribonucleoprotein











M










P52272
HNRNPM
486-496
MGAGMGFGLER (SEQ ID NO: 
6
Transcription







Heterogeneous

368)

factors,







nuclear



Regulators






O43390
HNRNPR
147-171
YGGPPPDSVYSGVQPGIGTEVFVG
6
Transcription







Heterogeneous

K (SEQ ID NO: 369)

factors,







nuclear



Regulators







ribonucleoprotein











R










O43390
HNRNPR
316-339
VWGNVVTVEWADPVEEPDPEVM
6
Transcription







Heterogeneous

AK (SEQ ID NO: 370)

factors,







nuclear



Regulators







ribonucleoprotein











R










O43390
HNRNPR
347-359
NLATTVTEEILKE (SEQ ID NO: 
6
Transcription







Heterogeneous

371)

factors,







nuclear



Regulators







ribonucleoprotein











R










O43390
HNRNPR
428-441
STAYEDYYYHPPPR (SEQ ID NO:
2 6
Transcription







Heterogeneous

372)

factors,







nuclear



Regulators







ribonucleoprotein











R










Q99714
HSD17B10 3-
42672
GLVAVITGGASGLGLATAER
3 2
Enzymes







hydroxyacyl-

(SEQ ID NO: 373)









CoA











dehydrogenase











type-2










Q53GQ0
HSD17B12
182-206
GAILNISSGSGMLPVPLLTIYSATK
14 2
Enzymes







Estradio 17-

(SEQ ID NO: 374)









beta-











dehydrogenase











12










Q53GQ0
HSD17B12
36-64
VWGVGNEAGVGPGLGEQAVVT
14 4 2
Enzymes







Estradio 17-

GSTDGIGK (SEQ ID NO: 375)









beta-











dehydrogenase











12










P51659
HSD17B4
169-183
LGLLGLANSLAIEGR
3
Enzymes







Peroxisomal

(SEQ ID NO: 57)









multifunctional











enzyme type 2










P51659
HSD17B4
385-403
SMMGGGLAEIPGLSINFAK (SEQ
14
Enzymes







Peroxisomal

ID NO: 376)









multifunctional











enzyme type 2










P51659
HSD17B4
622-633
LQSTFVFEEIGR 
14
Enzymes







Peroxisomal

(SEQ ID NO: 377)









multifunctional











enzyme type 2










P07900
HSP90AA1
368-386
VFIMDNCEELIPEYLNFIR (SEQ ID
13
Chaperones







Heat shock

NO: 378)









protein HSP 90-











alpha










P07900
HSP90AA1
300-314
NPDDITNEEYGEFYK (SEQ ID NO: 
13
Chaperones







Heat shock

379)









protein HSP 90-











alpha










P07900
HSP90AA1
514-534
HGLEVIYMIERPIDEYCVQQLK
13
Chaperones







Heat shock

(SEQ ID NO: 380)









protein HSP 90-











alpha










P08238
HSP90AB1
360-378
VFIMDSCDELIPEYLNFIR
14 13
Chaperones







Heat shock

(SEQ ID NO: 58)









protein HSP 90-











beta










P08238
HSP90AB1
507-526
GFEVVYMTEPIDEYCVQQLK
13 14
Chaperones







Heat shock

(SEQ ID NO: 59)









protein HSP 90-











beta










P08238
HSP90AB1
686-719
LGLGIDEDEVAAEEPNAAVPDEIP
13 14
Chaperones







Heat shock

PLEGDEDASR (SEQ ID NO: 381)









protein HSP 90-











beta










P14625
HSP90B1
664-671
AQAYQTGK (SEQ ID NO: 382)
13 8
Chaperones







Endoplasmin










P14625
HSP90B1
117-135
LISLTDENALSGNEELTVK
9
Chaperones







Endoplasmin

(SEQ ID NO: 60)








P14625
HSP90B1
271-285
YSQFINFPIYVWSSK
6
Chaperones







Endoplasmin

(SEQ ID NO: 61)








P14625
HSP90B1
495-503
LGVIEDHSNR (SEQ ID NO: 383)
6
Chaperones







Endoplasmin










P14625
HSP90B1
52-67
EEEAIQLDGLNASQIR (SEQ ID
6
Chaperones







Endoplasmin

NO: 384)








P08107
HSPA1B Heat
113-126
AFYPEEISSMVLTK (SEQ ID NO:
13
Chaperones







shock 70 kDa

385)









protein 1A/1B










P08107
HSPA1B Heat
176-187
IINEPTAAAIAYGLDR (SEQ ID NO:
13
Chaperones







shock 70 kDa

386)









protein 1A/1B










P08107
HSPA1B Heat
362-384
SINPDEAVAYGAAVQAAILMGDK
13
Chaperones







shock 70 kDa

(SEQ ID NO: 387)









protein 1A/1B










P08107
HSPA1B Heat
424-447
QTQIFTTYSDNQPGVLIQVYEGER
13 3
Chaperones







shock 70 kDa

(SEQ ID NO: 388)









protein 1A/1B










P08107
HSPA1B Heat
598-628
ELEQVCNPIISGLYQGAGGPGPGG
13
Chaperones







shock 70 kDa

FGAQGPK (SEQ ID NO: 389)









protein 1A/1B










P11021
HSPA5 78 kDa
602-617
IEWLESHQDADIEDFK (SEQ ID
6
Chaperones







glucose-

NO: 390)









regulated protein










P11021
HSPA5 78 kDa
82-96
NQLTSNPENTVFDAK (SEQ ID
9 6
Chaperones







glucose-

NO: 391)









regulated protein










P11021
HSPA5 78 kDa
475-492
DNHLLGTFDLTGIPPAPR (SEQ ID
6
Chaperones







glucose-

NO: 392)









regulated protein










P11021
HSPA5 78 kDa
61-74
ITPSYVAFTPEGER (SEQ ID NO:
6
Chaperones







glucose-

393)









regulated protein










P11142
HSPA8 Heat
424-447
QTQTFTTYSDNQPGVLIQVYEGER
13 14
Chaperones







shock cognate

(SEQ ID NO: 394)









71 kDa protein










P11142
HSPA8 Heat
113-126
SFYPEEVSSMVLTK
13 14
Chaperones







shock cognate

(SEQ ID NO: 62)









71 kDa protein










P38646
HSPA9 Stress-
266-284
STNGDTFLGGEDFDQALLR (SEQ
13 8
Chaperones







70 protein,

ID NO: 395)









mitochondrial










P10809
HSPD1 60 kDA
345-352
VGEVIVTK (SEQ ID NO: 396)
3
Chaperones







heat shock











protein, 











mitochondrial










P10809
HSPD1 60 kDA
206-218
TLNDELEIIEGMK (SEQ ID NO:
13 3
Chaperones







heat shock

397)









protein, 











mitochondrial










P10809
HSPD1 60 kDA
222-233
GYISPYGINTSK 
13
Chaperones







heat shock

(SEQ ID NO: 398)









protein, 











mitochondrial










P10809
HSPD1 60 kDA
251-268
ISSIQSIVPALEIANAHR (SEQ ID
3 13
Chaperones







heat shock

NO: 399)









protein, 











mitochondrial










P10809
HSPD1 60 kDA
371-387
IQEIIEQLDVTTSEYEK (SEQ ID
13
Chaperones







heat shock

NO: 400)









protein, 











mitochondrial










P10809
HSPD1 60 kDA
494-516
IMQSSSEGYDAMAGDFVNMVE
13 8
Chaperones







heat shock

K (SEZ ID NO: 401)









protein, 











mitochondrial










P10809
HSPD1 60 kDA
97-121
LVQDVANNTNEEAGDGTTTATVL
13 8
Chaperones







heat shock

AR (SEQ ID NO: 402)









protein, 











mitochondrial











HSPD1 60 kDA










Q9NSE4
IARS2
818-832
SCQTALVEILDVIVR (SEQ ID NO:
14 3 2
Enzymes







Isoleucine--

403)
13 6








tRNA ligase,











mitochondrial










Q9NSE4
IARS2
793-803
ELSNFYFSIIK (SEQ ID NO: 404)
2 6
Enzymes







Isoleucine--











tRNA ligase,











mitochondrial










P14735
IDE Insulin-
312-324
NLYVTFPIPDLQK (SEQ ID NO: 
4
Enzymes







degrading

405)









enzyme










P48735
IDH2 Isocitrate
244-251
WPLYMSTK (SEQ ID NO: 406)
3 6
Enzymes







dehydrogenase










P13284
IFI30 Gamma-
129-157
VEACVLDELDMELAFLTIVCMEE
9
Enzymes







interferon-

FEDMER (SEQ ID NO: 407)









inducible











lysosomal thiol











reductase










Q9NZI8
IGF2BP1
509-525
TVNELQNLTAAEVVVPR (SEQ ID
3 13
Channels,







Insulin-like

NO: 408)

Transporters,







growth factor 2



Receptors







mRNA-binding











protein










Q12905
ILF2 Interleukin
329-356
ILGQEGDASYLASEISTWDGVIVT
4
Transcription







enhancer-

PSE (SEQ ID NO: 409)

factors,







binding factor 2



Regulators






Q12905
ILF2 Interleukin
81-103
INNVIDNLIVAPGTFEVQIEEVR
13 4
Transcription







enhancer-

(SEQ ID NO: 410)

factors,







binding factor 2



Regulators






AIL0T0
ILVBL
557-577
EQVPSLGSNVACGLAYTDYHK
13
Enzymes







Acetolactate

(SEQ ID NO: 411)









synthase-like











protein










Q16891
IMMT
345-353
VQAAQSEAK (SEQ ID NO: 412)
4 3 6
Uncategorized







Mitochondrial











inner membrane











protein










Q16891
IMMT
527-545
LSQEQVDNFTLDINTAYAR (SEQ
13 14
Uncategorized







Mitochondrial

ID NO: 413)
6 8








inner membrane











protein










Q16891
IMMT
548-564
GIEQAVAQSHAVAEEEAR (SEQ ID
13 4 6
Uncategorized







Mitochondrial

NO: 414)









inner membrane











protein










P12268
IMPDH2
110-124
YEQGFITDPVVLSPK
13
Enzymes







Inosine-5-

(SEQ ID NO: 63)









monophosphate











dehydrogenase 2










Q8TEX9
IPO4 Importin-4
163-182
LLNETLGEVGSPGLLFYSLR (SEQ
4
Channels,









ID NO: 415)

Transporters,











Receptors






O00410
IPO5 Importin-5
721-735
VAAAESMPLLLECAR (SEQ ID
14 2
Channels,









NO: 416)
13 8
Transporters,











Receptors






O95373
IPO7 Importin-7
441-427
TMGFCYQILTEPNADRP (SEQ ID
13
Channels,









NO: 417)

Transporters,











Receptors






Q96P70
IPO9 Importin-9
49-74
VLEVTEEFGVHLAELTVDPQGAL
14
Channels,









AIR (SEQ ID NO: 418)

Transporters,











Receptors






O14654
IRS4 Insulin
256-267
LCLTDEEVVFVR (SEQ ID NO: 
14
Uncategorized







receptor

419)









substrate 4










Q92945
KHSRP Far
629-646
IGQQPQQPGAPPQQDYTK (SEQ ID
2 6
Transcription







upstream

NO: 420)

factors,







element-binding



Regulators







dehydrogenase 2










P52732
KIF11 Kinesin-
158-181
VSLLEIYNEELFDLLNPSSDVSER
6
Adapter,







like protein

(SEQ ID NO: 421)

Scaffolding,







KIF11



Modulator











Proteins






P52292
KPNA2
203-227
YGAVDPLLALLAVPDMSSLACGY
14 13
Channels,







Importin subunit

LR (SEQ ID NO: 422)

Transporters,







alpha-2



Receptors






P52292
KPNA2
301-315
LLGASELPIVTPALR (SEQ ID NO: 
13
Channels,







Importin subunit

423)

Transporters,







alpha-2



Receptors






Q14974
KPNB1
317-332
GALQYLVPILTQTLTK (SEQ ID
14 13
Channels,







Importin subunit

NO: 424)

Transporters,







beta-1



Receptors






Q14974
KPNB1
28-42
AAVENLPTFLVELSR (SEQ ID NO:
14 13
Channels,







Importin subunit

425)

Transporters,







beta-1



Receptors






Q14974
KPNB1
526-537
SSAYESLMEIVK 
13 14
Channels,







Importin subunit

(SEQ ID NO: 426)

Transporters,







beta-1



Receptors






P13473
LAMP2
133-144
GILTVDELLAIR 
14 9
Uncategorized







Lysosome-

(SEQ ID NO: 427)









associated











membrane











glycoprotein 2










Q9P2J5
LARS Leucine--
1007-1017
ILDLQLEFDEK (SEQ ID NO: 428)
13
Enzymes







tRNA ligase,











cytoplasmic










P00338
LDHA L-lactate
43-57
DLADELALVDVIEDK
9
Enzymes







dehydrogenase

(SEQ ID NO: 64)









A chain










P07195
LDHB L-lactate
234-244
MVVESAYEVIK
4
Enzymes







dehydrogenase

(SEQ ID NO: 65)









B chain










O95202
LETM1 LETM1
452-463
VAEVEGEQVDNK (SEQ ID NO:
13 14
Uncategorized







and EF-hand

429)
4 3 8








domain-











containing











protein 1, mit










Q08380
LGALS3BP
522-541
ALMLCEGLFVADVTDFEGWK
9
Uncategorized







Galectin-3-

(SEQ ID NO: 430)









binding protein










Q99538
LGMN
102-118
DYTGEDVTPQNFLAVLR
9
Enzymes







Legumain

(SEQ ID NO: 66)








P38571
LIPA
255-270
ELCGNLCFLLCGFNER (SEQ ID
14
Enzymes







Lysosomal acid

NO: 431)









lipase/cholestery











1 ester hydrolase










P02545
LMNA
63-72
ITESEEVVSR (SEQ ID NO: 432)
6
Uncategorized







Prelamin-A/C










P02545
LMNA
172-180
LEAALGEAK (SEQ ID NO: 433)
3
Uncategorized







Prelamin-A/C










P02545
LMNA
209-216
NIYSEELR (SEQ ID NO: 434)
6
Uncategorized







Prelamin-A/C










P02545
LMNA
281-296
NSNLVGAAHEELQQSR (SEQ ID
6
Uncategorized







Prelamin-A/C

NO: 435)








P02545
LMNA
352-366
MQQQLDEYQELLDIK
13 6
Uncategorized







Prelamin-A/C

(SEQ ID NO: 96)








P20700
LMNB1 Lamin-
321-330
IQELEDLLAK (SEQ ID NO: 436)
6
Uncategorized







B1










P20700
LMNB1 Lamin-
80-90
ALYETELADAR (SEQ ID NO: 437)
13
Uncategorized







B1










P20700
LMNB1 Lamin-
351-367
DQMQQQLNDYEQLLDVK (SEQ
14 8
Uncategorized







B1

ID NO: 438)








P20700
LMNB1 Lamin-
210-220
SMYEEEINETR (SEQ ID NO: 439)
13
Uncategorized







B1










P20700
LMNB1 Lamin-
52-67
SLETENSALQLQVTER (SEQ ID
13 14
Uncategorized







B1

NO: 440)
6 8







Q03252
LMNB2 Lamin-
106-113
AELDEVNK (SEQ ID NO: 441)
6
Uncategorized







B2










Q03252
LMNB2 Lamin-
74-84
ALYESELADAR (SEQ ID NO: 442)
13
Uncategorized







B2










Q03252
LMNB2 Lamin-
139-150
SEVELAAALSDK (SEQ ID NO:
13
Uncategorized







B2

443)








P36776
LONP1 Lon
598-632
GYQGDPSSALLELLDPEQNANFL
13 6
Channels,







protease

DHYLDVPVDLSK (SEQ ID NO:

Transporters,







homolog,

444)

Receptors







mitochondrial










Q96AG4
LRRC59
268-292
VTELQQQPLCTSVNTIYDNAVQG
13 14
Uncategorized







Leucine-rich

LR (SEQ ID NO: 445)
2 8








repeat-











containing











protein 59










P09960
LTA4H
366-386
LVVDLTDIDPDVAYSSVPYEK
13 4 8
Enzymes







Leukotriene A-4

(SEQ ID NO: 67)









hydrolase










O00754
MAN2B1
291-305
ELVDYFLNVATAQGR (SEQ ID 
14
Enzymes







Lysosomal

NO: 446)









alpha-











mannosidase










O00754
MAN2B1
614-638
ATFDPDTGLLMEIMNMNQQLLLP
9
Enzymes







Lysosomal

VR (SEQ ID NO: 447)









alpha-











mannosidase










Q9Y2E5
MAN2B2
642-664
AAVPAWEAVEMEIVAGQLVTEIR
9
Enzymes







Epididymis

(SEQ ID NO: 448)









specific alpha-











mannosidase










Q15691
MAPRE1
223-241
NIELICQENEGENDPVLQR (SEQ
13
Adapter,







Microtubule-

ID NO: 449)

Scaffolding,







associated



Modulator







protein RP/EB



Proteins







family member










Q8NI22
MCFD2
103-126
EEGSEQAPLMSEDELINIIDGVLR
14
Channels,







Multiple

(SEQ ID NO: 450)

Transporters,







coagulation



Receptors







factor deficiency











protein 2










P49736
MCM2 DNA
797-807
VMLESFIDTQK (SEQ ID NO: 451)
13
Transcription







replication



factors,







licensing factor



Regulators







MCM2










P33991
MCM4 DNA
502-516
AEINILLCGDPGTSK (SEQ ID NO:
15
Transcription







replication

452)

factors,







licensing factor



Regulators







MCM4










P33991
MCM4 DNA
517-529
SQLLQYVYNLVPR (SEQ ID NO:
6
Transcription







replication

453)

factors,







licensing factor



Regulators







MCM4










Q14566
MCM6 DNA
59-85
NTLVVSFVDLEQFNQQLSTTIQEE
14 15
Transcription







replication

FYR (SEQ ID NO: 454)
3 6
factors,







licensing factor



Regulators







MCM6










Q14696
MESDC2 LDLR
113-127
TLMMFVTVSGSPTEK (SEQ ID
2
Chaperones







chaperone

NO: 455)









MESD










Q9H8H3
METTL7A
94-105
VTCIDPNPNFEK 
13
Enzymes







Methyltransferase-

(SEQ ID NO: 456)









like protein 7A










P46013
MKI67 Anitgen
648-659
SGASEANLIVAK 
8
Transcription







KI-67

(SEQ ID NO: 457)

factors,











Regulators






Q7Z7F7
MRPL55 39S
59-67
QDGSTIHIR (SEQ ID NO: 458)
6
Uncategorized







ribosomal











protein L55,











mitochondrial










P43246
MSH2 DNA
848-871
ALELEEFQYIGESQGYDIMEPAAK
14
Transcription







mismathc repair

(SEQ ID NO: 459)

factors,







protein Msh2



Regulators






P00403
MT-CO2
142-151
VVLPIEAPIR (SEQ ID NO: 460)
6
Channels,







Cytochrome c



Transporters,







oxidase subunit



Receptors







2










P03891
MT-ND2
264-272
WAIIEEFTK (SEQ ID NO: 461)
14
Enzymes







NADH-











ubiquinone











oxidoreductase











chain 2










Q9NZJ7
MTCH1
65-103
MDGGSGGLGSGDNAPTTEALFVA
14 2
Channels,







Mitochondrial

LGAGVTALSHPLLYVK (SEQ ID

Transporters,







carrier homolog

NO: 462)

Receptors







1










Q86UE4
MTDH Protein
42510
SWQDELAQQAEEGSAR (SEQ ID
14 4 2
Uncategorized







LYRIC

NO: 463)
13 8







Q86UE4
MTDH Protein
34-45
TELGLDGLEPK (SEQ ID NO: 464)
13 14
Uncategorized







LYRIC


4 2 9











8







Q9UDX5
MTFP1
21-33
YLGYANEVGEAFR (SEQ ID NO: 
13 14
Uncategorized







Mitochondrial

465)
2 9








fission process











protein 1










Q9UDX5
MTFP1
103-116
VCAASLYVLGTATR (SEQ ID NO:
14
Uncategorized







Mitochondrial

466)









fission process











protein 1










Q6UB35
MTHFD1L
307-326
IHFGGLIEEDDVILLAAALR (SEQ
6
Enzymes







Monofunctional

ID NO: 467)









C1-











tetrahydrofolate











synthase,











mitochondrial










Q13505
MTX1 Metaxin-
238-252
QGADTLAFMSLLEEK (SEQ ID
14 3 4
Channels,







1

NO: 468)
2 6 8
Transporters,











Receptors






P35580
MYH10
1546-1562
TQLEELEDELQATEDAK (SEQ ID
13 6
Adapter,







Myosin-10

NO: 469)
9
Scaffolding,











Modulator











Proteins






P35580
MYH10
1684-1701
SLEAEILQLQEELASSER (SEQ ID
14 13
Adapter,







Myosin-10

NO: 470)
6
Scaffolding,











Modulator











Proteins






P35580
MYH10
1738-1758
IAQLEEELEEEQSNMELLNDR
6
Adapter,







Myosin-10

(SEQ ID NO: 471)

Scaffolding,











Modulator











Proteins






P35580
MYH10
1814-1822
ATISALEAK (SEQ ID NO: 472)
6
Adapter,







Myosin-10



Scaffolding,











Modulator











Proteins






P35580
MYH10
248-268
INFDVTGYIVGANIETYLLEK
6 9
Adapter,







Myosin-10

(SEQ ID NO: 473)

Scaffolding,











Modulator











Proteins






P35580
MYH10
890-910
NILAEQLQAETELFAEAEEMR
13 14
Adapter,







Myosin-10

(SEQ ID NO: 474)
6 9
Scaffolding,











Modulator











Proteins






P35579
MYH9 Myosin-
1539-1555
TQLEELEDELQATEDAK (SEQ ID
13
Adapter,







9

NO: 475)

Scaffolding,











Modulator











Proteins






P35579
MYH9 Myosin-
1677-1694
SMEAEMIQLQEELAAAER (SEQ
13
Adapter,







9

ID NO: 476)

Scaffolding,











Modulator











Proteins






Q9BXJ9
NAA15 N-
798-818
NLQTCMEVLEALYDGSLGDCK
14 4 2
Transcription







alpha-

(SEQ ID NO: 477)

factors,







acetyltransferase



Regulators







15, NatA











auxiliary subunit
566-580
QAVQELVSLYYEEAR (SEQ ID
9
Enzymes






P54802
NAGLU Alpha-

NO: 478)









N-











acetylglucosami











nidase










P54802
NAGLU Alpha-
594-615
AGGVLAYELLPALDEVLASDSR
13 15
Enzymes







N-

(SEQ ID NO: 479)









acetylglucosami











nidase










P43490
NAMPT
175-189
YLLETSGNLDGLEYK
13 14
Enzymes







Nicotinamide

(SEQ ID NO: 68)
15 3 6








phosphoribosyltr


8








ansferase










P55209
NAP1L1
95-104
FYEEVHDLER (SEQ ID NO: 480)
3 4 13
Uncategorized







Nucleosome


6 9 8








assembly protein











1-like 1










P55209
NAP1L1
177-194
NVDLLSDMVQEHDEPILK (SEQ
6
Uncategorized







Nucleosome

ID NO: 481)









assembly protein











1-like 1










P55209
NAP1L1
56-72
LDGLVETPTGYIESLPR (SEQ ID
14 4 2
Uncategorized







Nucleosome

NO: 482)
9








assembly protein











1-like 1










Q99733
NAP1L14
84-93
FYEEVHDLER (SEQ ID NO: 483)
3 4 13
Uncategorized







Nucleosome


6 9 8








assembly protein











1-like 4










P49321
NASP Nuclear
503-526
SLQENEEEEIGNLELAWDMLDLA
13 14
Channels,







autoantigenic

K (SEQ ID NO: 484)
8
Transporters,







sperm protein



Receptors






P49321
NASP Nuclear
77-93
YGETANECGEAFFFYGK (SEQ ID 
13
Channels,







autoantigenic

MO: 485)

Transporters,







sperm protein



Receptors






Q9H0A0
NAT10 N-
600-625
ASGDLIPWTVSEQFQDPDFGGLSG
13 3
Enzymes







acetyltransferase

GR (SEQ ID NO: 486)









10










Q08161
NCBP1 Nuclear
42-65
SACSLESNLEGLAGVLEADLPNY
13 14
Channels,







cap-binding

K (SEQ ID NO: 487)
3 2
Transporters,







protein subunit 1



Receptors






P28331
NDUFS1
312-325
GLLTYTSWEDALSR (SEQ ID NO: 
14
Enzymes







NADH-

488)









ubiquinone











oxidoreductase











75 kDa subunit,











mit










Q9UMX5
NENF Neudesin
85-94
GAPYNALTGK (SEQ ID NO: 489)
6
Adapter,











Scaffolding,











Modulator











Proteins






P55769
NHP2L1 NHP2-
114-125
QQIQSIQQSIER 
3 2 6
Transcription







like protein 1

(SEQ ID NO: 490)

factors,











Regulators






Q9BPW8
NIPSNAP1
255-268
GWDENVYYTVPLVR (SEQ ID NO:
4 6
Uncategorized







Protein NipSnap

491)









homolog 1










Q9Y3T9
NOC2L
591-606
VSFGVSEQQAVEAWEK (SEQ ID
2
Transcription







Nucleolar

NO: 492)

factors,







complex protein



Regulators







2 homolog










Q15233
NONO Non-
127-135
VELDNMPLR (SEQ ID NO: 493)
3
Transcription







POU domain-



factors,







containing



Regulators







octamer-binding











protein










Q15233
NONO Non-
257-270
FAQPGSFEYEYAMR (SEQ ID NO:
6
Transcription







POU domain-

494)

factors,







containing



Regulators







octamer-binding











protein










Q15233
NONO Non-
296-304
LEMEMEAAR (SEQ ID NO: 495)
6
Transcription







POU domain-



factors,







containing



Regulators







octamer-binding











protein










Q15233
NONO Non-
154-176
NLPQYVSNELLEEAFSVFGQVER
13 14
Transcription







POU domain-

(SEQ ID NO: 496)
3 2 6
factors,







containing


9
Regulators







octamer-binding











protein










Q15233
NONO Non-
177-184
AVVIVDDR (SEQ ID NO: 497)
6
Transcription







POU domain-



factors,







containing



Regulators







octamer-binding











protein










Q15233
NONO Non-
326-336
MEELHNQEVQK (SEQ ID NO: 498)
13
Transcription







POU domain-



factors,







containing



Regulators







octamer-binding











protein










Q15233
NONO Non-
435-456
FGQAATMEGIGAIGGTPPAFNR
6
Transcription







POU domain-

(SEQ ID NO: 499)

factors,







containing



Regulators







octamer-binding











protein










P06748
NPM1 
278-291
MTDQEAIQDLWQWR (SEQ ID
13
Chaperones







Nucleophosmin

NO: 500)








P06748
NPM1 
33-45
VDNDENEHQLSLR (SEQ ID
13
Chaperones







Nucleophosmin

NO: 501)








P06748
NPM1 
55-73
DELHIVEAEAMNYEGSPIK
13
Chaperones







Nucleophosmin

(SEQ ID NO: 69)








P06748
NPM1 
81-101
MSVQPTVSLGGFEITPPVVLR
13
Chaperones







Nucleophosmin

(SEQ ID NO: 70)








Q08J23
NSUN2 tRNA
603-618
LAQEGIYTLYPFINSR
3 6
Transcription







(cytosine(34)-

(SEQ ID NO: 502)

factors,







C(5))-



Regulators







methyltransferase










Q9BV86
NTMT1 N-
167-185
DNMAQEGVILDDVDSSVCR 
13
Enzymes







terminal Xaa-

(SEQ ID NO: 503)









Pro-Lys N-











methyltransferase











1










Q02818
NUCB1 
54-69
YLQEVIDVLETDGHFR 
13 14
Transcription







Nucleobindin-1

(SEQ ID NO: 504)
2 4 3
factors,










6
Regulators






P80303
NUCB2
60-69
QVIDVLETDK
4 13 6
Transcription







Nucleobindin-1

(SEQ ID NO: 505)

factors,











Regulators






Q9BQG2
NUDT12











Peroxisomal
143-166
ESHPATVFILFSDLNPLVTLGGNK
15
Enzymes







NADH

(SEQ ID NO: 506)









pyrophosphatase











NUDT12










A8MXV4
NUDT19
223-252
EPPPVYPDLAEVVGYQWSSPSEA
6
Enzymes







Nucleoside

TESFLSK (SEQ ID NO: 507)









diphosphate-











linked moiety X











motif 19,











mitochodrial










O75694
NUP155
952-968
HGEPEEDIVGLQAFQER 
13
Channels,







Nuclear pore

(SEQ ID NO: 508)

Transporters,







complex protein



Receptors







Nup155










Q12769
NUP160
638-661
AAEQILEDMITIDVENVMEDICSK
14
Channels,







Nuclear pore

(SEQ ID NO: 509)

Transporters,







complex protein



Receptors







Nup160










Q92621
NUP205
1235-1252
VLVAEVNALQGMAAIGQR 
14
Channels,







Nuclear pore

(SEQ ID NO: 510

Transporters,







complex protein



Receptors







Nup205










P35658
NUP214
770-783
TTLLEGFAGVEEAR 
14
Channels,







Nuclear pore

(SEQ ID NO: 511)

Transporters,







complex protein



Receptors







Nup214










Q8NFH4
NUP37
136-150
EGQEIASVSDDHTCR 
13 9
Channels,







Nucleoporin

(SEQ ID NO: 512)

Transporters,







Nup37



Receptors






Q8N1F7
NUP93 Nuclear
539-545
FESTDPR 
4
Channels,







pore complex

(SEQ ID NO: 513)

Transporters,







protein Nup93



Receptors






P61970
NUTF2 Nuclear
91-106
ADEDPIMGFHQMFLLK 
14
Channels,







transport factor 

(SEQ ID NO: 514)

Transporters,







2



Receptors






Q6DKJ4
NXN
384-403
DYTNLPEAAPLLTILDMSAR 
14
Enzymes







Nucleoredoxin

(SEQ ID NO: 515)








P04181
OAT Ornithine
33-46
TVQGPPTSDDIFER 
14 13
Enzymes







aminotransferase,

(SEQ ID NO: 516)









mitochondrial










P04181
OAT Ornithine
332-351
VAIAALEVLEEENLAENADK
14 13
Enzymes







aminotransferase,

(SEQ ID NO: 517)









mitochondrial










Q9NX40
OCIAD1 OCIA
34-46
VFAECNDESFWFR 
13 2
Uncategorized







domain-

(SEQ ID NO: 518)









containing











protein 1










O60313
OPA1 Dynamin-
801-818
CNEEHPAYLASDEITTVR 
13
Enzymes







like 120 kDa

(SEQ ID NO: 519









protein,











mitochondrial










P07237
P4HB Protein
133-162
TGPAATTLPDGAAAESLVESSEVA
14 3 9
Chaperones







disulfide-

VIGFFK (SEQ ID NO: 520)









isomerase










P07237
P4HB Protein
171-195
QFLQAAEAIDDIPFGITSNSDVFSK
9
Chaperones







disulfide-

(SEQ ID NO: 521)









isomerase










P07237
P4HB Protein
231-247
HNQLPLVIEFTEQTAPK 
14 2
Chaperones







disulfide-

(SEQ ID NO: 522)
13








isomerase










P11940
PABPC1
114-129
ALYDTFSAFGNILSCK
14
Transcription







Polyadenylate-

(SEQ ID NO: 523)

factors,







binding protein



Regulators







1










P11940
PABPC1
51-67
SLGYAYVNFQQPADAER 
14 3
Transcription







Polyadenylate-

(SEQ ID NO: 524)

factors,







binding protein



Regulators







1










P11940
PABPC1
581-604
ITGMLLEIDNSELLHMLESPESLR
14
Transcription







Polyadenylate-

(SEQ ID NO: 525)

factors,







binding protein



Regulators







1










Q11310
PABPC4
51-67
SLGYAYVNFQQPADAER
14 3
Transcription







Polyadenylate-

(SEQ ID NO: 526)

factors,







binding protein



Regulators







4










Q13310
PABPC4
590-613
ITGMLLEIDNSELLHMLESPESLR
14
Transcription







Polyadenylate-

(SEQ ID NO: 527)

factors,







binding protein



Regulators







4










Q13310
PABPC4
114-129
ALYDTFSAFGNILSCK
14
Transcription







Polyadenylate-

(SEQ ID NO: 528)

factors,







binding protein



Regulators







4

VEMLDNLLDIEVAYSLLR
3 6
Transcription






P09874
PARP1 Poly
762-779
(SEQ ID NO: 529)

factors,











Regulators






P09874
PARP1 Poly
954-1000
TTPDPSANISLDGVDVPLGTGISSG
3
Transcription









VNDTSLLYNEYIVYDIAQVNLK

factors,









(SEQ ID NO: 530)

Regulators






Q16822
PCK2
245-261
EIISFGSGYGGNSLLGK
14 15
Enzymes







Phosphoenolypr

(SEQ ID NO: 531)
13








uvate











carboxykinase










P22061
PCMT1 Protein-
179-197
LILPVGPAGGNQMLEQYDK
14 3 2
Enzymes







L-

(SEQ ID NO: 71)









isoaspartate(D-











aspartate) O-











methyltransferase










P12004
PCNA
118-138
LMDLDVEQLGIPEQEYSCVVK
14
Transcription







Proliferating cell

(SEQ ID NO: 532)

factors,







nuclear antigen



Regulators






Q9UHG3
PCYOX1
267-280
SNLISGSVMYIEEK
14 9
Enzymes







Prenylcysteine

(SEQ ID NO: 533)









oxidase 1










Q9UHG3
PCYOX1
292-304
MYEVVYQIGTETR
9
Enzymes







Prenylcysteine

(SEQ ID NO: 534)









oxidase 1










Q9UHG3
PCYOX1
152-162
MHMWVEDVLDK 
4 13
Enzymes







Prenylcysteine

(SEQ ID NO: 535)









oxidase 1










Q9UHG3
PCYOX1
37-54
IAIIGAGIGGTSAAYYLR
14
Enzymes







Prenylcysteine

(SEQ ID NO: 536)









oxidase 1










Q53EL6
PDCD4
246-256
DLPELALDTPR (SEQ ID NO: 537)
13
Transcription







Programmed 



factors,







cell death



Regulators







protein 4










P11177
PDHB Pyruvate
53-68
VFLLGEEVAQYDGAYK
13 14
Enzymes







dehydrogenase

(SEQ ID NO: 72)
3 2








E1 component











subunit beta,










P13667
PDIA4 Protein
486-499
FAMEPEEFDSDTLR
9
Enzymes







disulfide-

(SEQ ID NO: 538)









isomerase A4










Q29RF7
PDS5A Sister
638-657
SIEGTADDEEEGVSPDTAIR
13
Uncategorized







chromatid

(SEQ ID NO: 539)









cohesion protein











PDS5 homolog











A










Q99471
PFDN5
20-37
NQLDQEVEFLSTSIAQLK
2
Chaperones







Prefoldin

(SEQ ID NO: 540)









subunit 5










P07737
PFN1 Profilin-1
39-54
TFVNITPAEVGVLVGK
13
Adapter,









(SEQ ID NO: 541)

Scaffolding,











Modulator











Proteins






P07737
PFN1 Profilin-1
76-89
DSLLQDGEFSMDLR 
13 8
Adapter,









(SEQ ID NO: 542)

Scaffolding,











Modulator











Proteins






P00558
PGK1
333-350
QIVWNGPVGVFEWEAFAR
3
Enzymes







Phosphoglycerate

(SEQ ID NO: 73)









kinase 1










O00264
PGRMC1
106-119
FYGPEGPYGVFAGR
14 2 4
Channels,







Membrane-

(SEQ ID NO: 543)
3 13
Transporters,







associated



Receptors







progesterone











receptor











component










O00264
PGRMC1
48-67
GDQPAASGDSDDDEPPPLPR
13 14
Channels,







Membrane-

(SEQ ID NO: 544)
2 4 8
Transporters,







associated



Receptors







progesterone











receptor











component










O15173
PGRMC2
136-149
FYGPAGPYGIFAGR
4
Channels,







Membrane-

(SEQ ID NO: 545)

Transporters,







associated



Receptors







progesterone











receptor











component










P35232
PHB Prohibitin
220-239
AAELIANSLATAGDGLIELR
9
Uncategorized









(SEQ ID NO: 546)








P35232
PHB Prohibitin
241-253
LEAAEDIAYQLSR 
14
Uncategorized









(SEQ ID NO: 547)








P35232
PHB Prohibitin
42501
VFESIGK (SEQ ID NO: 548)
13
Uncategorized






Q99623
PHB2
38-48
ESVFTVEGGHR (SEQ ID NO: 549)
2 6
Channels,







Prohibitin-2



Transporters,











Receptors






Q99623
PHB2
55-71
IGGVQQDTILAEGLHFR
3 4 2
Channels,







Prohibitin-2

(SEQ ID NO: 550)
6
Transporters,











Receptors






Q99623
PHB2
225-236
IVQAEGEAEAAK
6
Channels,







Prohibitin-2

(SEQ ID NO: 551)

Transporters,











Receptors






O43175
PHGDH D-3-
295-308
CGEEIAVQFVDMVK
13
Enzymes







phosphoglycerate

(SEQ ID NO: 552)









dehydrogenase










P48739
PITPNB
32-44
NETGGGEGIEVLK
14 3
Adapter,







Phosphatidylino

(SEQ ID NO: 553)

Scaffolding,







sitol transfer



Modulator







protein beta



Proteins







isoform










Q5JRX3
PITRM1
364-385
ALIESGLGTDFSPDVGYNGYTR
14 2
Enzymes







Presequence

TQNGR (SEQ ID NO: 554)
13 8 6








protease,











mitochondrial










P14618
PKM Pyruvate
174-186
IYVDDGLISLQVK
2 9
Enzymes







kinase isozymes

(SEQ ID NO: 74)









M1/M2










P14618
PKM Pyruvate
401-422
LAPITSDPTEATAVGAVEASFK
2 9
Enzymes







kinase isozymes

(SEQ ID NO: 75)









M1/M2










Q8IV08
PLD3
425-453
ATYIGTSNWSGNYFTETAGTSLLV
3 6 9
Enzymes







Phospholipase

TQNGR (SEQ ID NO: 555)









D3










P13797
PLS3 Plastin-3
72-85
ISFDEFVYIFQEVK
14
Uncategorized









(SEQ ID NO: 556)








Q10713
PMPCA
443-451
PVIFEDVGR (SEQ ID NO: 557)
14 8 6
Enzymes







Mitochondrial-











processing 











peptidase











subunit alpha










O75439
PMPCB
406-424
TNMLLQLDGSTPICEDIGR
13
Enzymes







Mitochondrial-

(SEQ ID NO: 558)









processing 











peptidase











subunit beta










Q9Y2S7
POLDIP2
166-199
ALYAIPGLDYVSHEDILPYTSTDQ
6
Uncategorized







Polymerase

VPIQHELFER (SEQ ID NO: 559)









delta-interacting











protein 2










O00411
POLRMT DNA-
482-502
MLLQVLQALPAQGESFTTLAR
14 3 2
Enzymes







directed RNA

(SEQ ID NO: 560)
6








polymerase,











mitochondrial










P16435
POR NADPH--
369-382
TALTYYLDITNPPR
13 14
Enzymes







cytochrome

(SEQ ID NO: 76)









P450 reductase










P62136
PPP1CA
133-141
IYGFYDECK
2
Enzymes







Serine/threonine-

(SEQ ID NO: 77)









protein











phosphate











PP1-alpha cat










P62140
PPP1CB
132-140
IYGFYDECK
2
Enzymes







Serine/threonine-

(SEQ ID NO: 78)









protein











phosphate











PP1-alpha cat










P62140
PPP1CB
43-59
IEFLSQPILLELEAPLK
14
Enzymes







Serine/threonine-

(SEQ ID NO: 79)









protein











phosphate











PP1-alpha cat










P36873
PPP1CC
44-60
EIFLSQPILLELEAPLK
14
Enzymes







Serine/threonine-

(SEQ ID NO: 79)









protein











phosphate











PP1-gamma cat










P36873
PPP1CC
133-141
IYGFYDECK
2
Enzymes







Serine/threonine-

(SEQ ID NO: 78)









protein











phosphate











PP1-gamma cat










P50897
PPT1 Palmitoyl-
75-101
TLMEDVENSFFLNVNSQVTTVCQ
13 14
Enzymes







protein

ALKA
15 4 2








thioesterase 1

(SEQ ID NO: 80)
9 8







P32119
PRDX2
120-127
TDEGIAYR
13
Enzymes







Peroxiredoxin-2

(SEQ ID NO: 81)








P78527
PRKDC DNA-
3030-3046
IWSEPFYQETLPYMIR
14
Enzymes







dependent

(SEQ ID NO: 561)









protein kinase











catalytic subunit










P78527
PRKDC DNA-
758-782
LGLSYTPLAEVGLNALEEWSIYID
14
Enzymes







dependent

R (SEQ ID NO: 562)









protein kinase











catalytic subunit










Q99873
PRMT1 Protein
380-391
DVDFMYVELIQR
13
Enzymes







argine N-

(SEQ ID NO: 563)









methyltransferase











1










Q9UMS4
PRPF19 Pre-
186-196
ATLYVTAIEDR (SEQ ID NO: 564)
14
Enzymes







mRNA-











processing 











factor 19










P07602
PSAP 
77-93
ALQDEWDAVMLHSFTLR
4
Adapter,







Proactivator

(SEQ ID NO: 565)

Scaffolding,







polypeptide



Modulator






P07602
PSAP 



Proteins







Proactivator
108-122
EIVDSYLPVILDIIK
13 14
Adapter,







polypeptide

(SEQ ID NO: 566)
3 2 4
Scaffolding,










15 6 9
Modulator










8
Proteins






P07602
PSAP 
263-275
EICALVGFCDEVK
14
Adapter,







Proactivator

(SEQ ID NO: 567)

Scaffolding,







polypeptide



Modulator











Proteins






P07602
PSAP 
311-323
SDVYCEVCEFLVK
13 4 9
Adapter,







Proactivator

(SEQ ID NO: 568)
8
Scaffolding,







polypeptide



Modulator











Proteins






P07602
PSAP 
430-438
QEILAALEK (SEQ ID NO: 569)
2 6
Adapter,







Proactivator



Scaffolding,







polypeptide



Modulator











Proteins






P07602
PSAP 
439-449
GCSFLPDPYQK (SEQ ID NO: 570)
14 9
Adapter,







Proactivator



Scaffolding,







polypeptide



Modulator











Proteins






P07602
PSAP 
450-478
QCDQFVAEYEPVLIEILVEVMDPS
14 4 9
Adapter,







Proactivator

FVCLK (SEQ ID NO: 571)

Scaffolding,







polypeptide



Modulator











Proteins






P07602
PSAP 
68-78
DVVTAAGDMLK
14 4 9
Adapter,







Proactivator

(SEQ ID NO: 572)

Scaffolding,







polypeptide



Modulator











Proteins






P25787
PSMA2
144-159
PYLFQSDPSGAYFAWK
2
Enzymes







Proteasome

(SEQ ID NO: 573)









subunit alpha











type-2










P25787
PSMA2
19-39
LVQIEYALAAVAGGAPSVGIK
3
Enzymes







Proteasome

(SEQ ID NO: 574)









subunit alpha











type-2










P25789
PSMA4
68-91
LNEDMACSVAGITSDANVLTNEL
13 14
Enzymes







Proteasome

R (SEQ ID NO: 575)
3 6 8








subunit alpha











type-4










P20618
PSMB1
129-146
FFPYYVYNIIGGLDEEGK
13 14
Enzymes







Proteasome

(SEQ ID NO: 576)
2 15








subunit beta











type-1










P49721
PSMB2
96-126
TPYHVNLLLAGYDEHEGPALYY
2 6
Enzymes







Proteasome

MDYLAALAK (SEQ ID NO: 577)









subunit beta











type-2










P49721
PSMB2
42-62
ILLLCVGEAGDTVQFAEYIQ
6
Enzymes







Proteasome

(SEQ ID NO: 578)









subunit beta











type-2










P49720
PSMB3
100-115
FGPYYTEPVIAGLDPK
13 14
Enzymes







Proteasome

(SEQ ID NO: 579)
15 3 6








subunit beta











type-3










P28070
PSMB4
61-80
FEGGVVIAADMLGSYGSLAR
6
Enzymes







Proteasome

(SEQ ID NO: 82)









subunit beta











type-4










P28074
PSMB5
141-150
LLANMVYQYK
4 3 6
Enzymes







Proteasome

(SEQ ID NO: 83)









subunit beta











type-5










P28074
PSMA5
226-239
DAYSGGAVNLYHVR
6
Enzymes







Proteasome

(SEQ ID NO: 84)









subunit beta











type-5










P28072
PSMB6
80-118
SGSAADTQAVADAVTYQLGFHSI
14 3 6
Enzymes







Proteasome

ELNEPPLVHTAASLFK









subunit beta

(SEQ ID NO: 85)









type-6










O00231
PSMD11 26S
164-175
ALLVEVQLLESK
2
Uncategorized







proteasome non-

(SEQ ID NO: 580)









ATPase











regulatory











subunit 11










O00231
PSMD11 26S
227-246
TAYSYFYEAFEGYDSIDSPK
2 4
Uncategorized







proteasome non-

(SEQ ID NO: 581)









ATPase











regulatory











subunit 11










O00231
PSMD11 26S
298-304
SLADFEX (SEQ ID NO: 582)
4
Uncategorized







proteasome non-











ATPase











regulatory











subunit 11
242-256
HDADGQATLLNLLLR
14 4







O43242
PSMD3 26S

(SEQ ID NO: 583)

Uncategorized







proteasome non-











ATPase











regulatory











subunit 3










O43242
PSMD3 26S
426-440
LQLDSPEDAEFIVAK
14
Uncategorized







proteasome non-

(SEQ ID NO: 584)









ATPase











regulatory











subunit 3










Q9UL46
PSME2
132-145
IEDGNDFGVAIQEK
6
Uncategorized







Proteasome

(SEQ ID NO: 585)









activator











complex subunit











2










P61289
PSME3
147-166
IEDGNNFGVSIQEETVAELR
14 4
Uncategorized







Proteasome

(SEQ ID NO: 586)
13 8








activator











complex subunit











3










P61289
PSME3
167-181
TVESEAASYLDQISR
13 4 8
Uncategorized







Proteasome

(SEQ ID NO: 587)









activator











complex subunit











3










P61289
PSME3
22-36
ITSEAEDLVANFFPK
4
Uncategorized







Proteasome

(SEQ ID NO: 588)









activator











complex subunit











3










Q8WXF1
PSPC1
229-247
PVIVEPMEQFDDEDGLPEK
14 6
Transcription







Paraspeckle

(SEQ ID NO: 589)

factors,







component 1



Regulators






P26599
PTBP1
219-238
NNQFQALLQYADPVSAQHA
14
Transcription







Polypyrimidine

(SEQ ID NO: 590)

factors,







tract-binding 



Regulators







protein 1










Q96EY7
PTCD3
119-126
FIINSYPK (SEQ ID NO: 591)
2
Transcription







Pentatricopeptide



factors,







repeat-



Regulators







containing











protein 3, mit










Q8N8N7
PTGR2
93-106
GDFVTSFYWPWQTK
14
Enzymes







Prostaglandin

(SEQ ID NO: 592)









reductase 2










Q8N8N7
PTGR2
262-278
DVPYPPPLSPAIEAIQK
14 3 2
Enzymes







Prostaglandin

(SEQ ID NO: 593)









reductase 2










Q9P035
PTPLAD1 3-
133-146
LESEGSPETLTNLR
13
Enzymes







hydroxyacyl-

(SEQ ID NO: 594)









CoA











dehydratase 3










Q9UHX1
PUF60 Poly(U)-
474-489
DIDDDLEGEVTEECGK
13 15
Transcription







binding-splicing

(SEQ ID NO: 595)
14 4 8
factors,







factor PUF60



Regulators






Q5XLP0
QIL1 Protein
15-36
GSVAGGAVYLVYDQELLGPSDK
14
Uncategorized







QIL1

(SEQ ID NO: 596)








Q96PU8
QKI Protein 
192-205
MQLMELAILNGTYR
2
Channels,







quaking

(SEQ ID NO: 597)

Transporters,











Receptors






P51149
RAB7A Ras-
104-113
DEFLIQASPR
14
Adapter,







related protein

(SEQ ID NO: 86)

Scaffolding,







Rab-7a



Modulator











Proteins






Q7Z6M1
RABEPK Rab9
87-100
YEHASFIPSCTPDR
14
Uncategorized







effector protein

(SEQ ID NO: 598)









with kelch











motifs










P11233
RALA Ras-
28-47
SALTLQFMYDEFVEDYEPTK
9
Transcription







related protein

(SEQ ID NO: 599)

factors,







Ral-A



Regulators






P54136
RARS Arginine--
528-540
GNTAAYLLYAFTR
14
Enzymes







tRNA ligase,

(SEQ ID NO: 600)









cytoplasmic










Q96PK6
RBM14 RNA-
224-238
ASYVAPTAQPATYR
6
Transcription







binding protein

(SEQ ID NO: 601)

factors,







14



Regulators






Q96PK6
RBM14 RNA-
65-72
ALVVEMSR (SEQ ID NO: 602)
6
Transcription







binding protein



factors,







14



Regulators






P98179
RMB3 Putative
8-39
LFVGGLNFNTDEQALEDHFSSFGP
13 3 2
Transcription







RNA-binding

ISEVVVVK (SEQ ID NO: 603)
9
factors,







protein 3



Regulators






P38159
RMBX RNA-
126-144
GGHMDDGGYSMNFNMSSSR
6
Transcription







binding motif

(SEQ ID NO: 604)

factors,







protein, X



Regulators







chromosome










P38159
RMBX RNA-
23-30
ALEAVFGK (SEQ ID NO: 605)
3 13 6
Transcription







binding motif



factors,







protein, X



Regulators







chromosome










P38159
RMBX RNA-
245-252
DYGHSSSR (SEQ ID NO: 606)
3
Transcription







binding motif



factors,







protein, X



Regulators







chromosome










P38159
RMBX RNA-
283-292
DSYESYGNSR (SEQ ID NO: 607)
6
Transcription







binding motif



factors,







protein, X



Regulators







chromosome










P38159
RMBX RNA-
299-309
GPPPSYGGSSR (SEQ ID NO: 608
6
Transcription







binding motif



factors,







protein, X



Regulators







chromosome










P38159
RMBX RNA-
332-339
SDLYSSGR (SEQ ID NO: 609)
6
Transcription







binding motif



factors,







protein, X



Regulators







chromosome










P38159
RMBX RNA-
50-63
GFAFVTFESPADAK
6
Transcription







binding motif

(SEQ ID NO: 610)

factors,







protein, X



Regulators







chromosome










Q96E39
RBMXL1 RNA
299-309
GPPPSYGGSSR (SEQ ID NO: 611)
6
Transcription







binding motif



factors,







protein, X-



Regulators







linked-like-1










Q96E39
RBMXL1 RNA
50-63
GFAFVTFESPADAK
6
Transcription







binding motif

(SEQ ID NO: 612)

factors,







protein, X-



Regulators







linked-like-1










Q96E39
RBMXL1 RNA
245-252
DYGHSSSR (SEQ ID NO: 613)
3
Transcription







binding motif



factors,







protein, X-



Regulators







linked-like-1










Q96E39
RBMXL1 RNA
283-292
DSYESYGNSR (SEQ ID NO: 614)
6
Transcription







binding motif



factors,







protein, X-



Regulators







linked-like-1










Q96E39
RBMXL1 RNA
126-144
GGHMDDGGYSMNFNMSSR
6
Transcription







binding motif

(SEQ ID NO: 615)

factors,







protein, X-



Regulators







linked-like-1










Q15293
RCN1
91-105
IDNDGDGFVTTEELK
13
Uncategorized







Reticulocalbin-1

(SEQ ID NO: 616)








Q14257
RCN2
283-305
LSEEEILENPDLFLTSEATDYGR
14 3
Uncategorized







Reticulocalbin-2

(SEQ ID NO: 617)
15 6 9











8







Q14257
RCN2
130-148
VIDFDENTALDDAEEESFR
13 9
Uncategorized







Reticulocalbin-2

(SEQ ID NO: 618)
6







Q14257
RCN2
217-232
WDPTANEDPEWILVE
14 4 6
Uncategorized







Reticulocalbin-2

(SEQ ID NO: 619)








Q14257
RCN2
96-103
HYAMQEAK (SEQ ID NO: 620)
6
Uncategorized







Reticulocalbin-2










Q14257
RCN2
161-200
ANQDSGPGLSLEEFIAFEHPEEVD
9
Uncategorized







Reticulocalbin-2

YMTEFVIQEALEEHDK











(SEQ ID NO: 621)








P35250
RBC2
211-230
VPYTDDGLEAIIFTAQGDMR
13
Transcription







Replication

(SEQ ID NO: 622)

factors,







factor C subunit 



Regulators







2










P62888
RPL30 60S
58-68
SEIEYYAMLAK (SEQ ID NO: 623)
13
Uncategorized







ribosomal











protein L30










P62917
RPL8 60S
129-144
ASGNYATVISHNPETK
2
Transcription







ribosomal

(SEQ ID NO: 624)

factors,







protein L8



Regulators






P05387
RPLP2 60S
50-61
NIEDVIAQGIGK 
14
Uncategorized







acidic ribosomal

(SEQ ID NO: 625)









protein P2










P04843
RPN1 Dolichyl-
152-169
QFVVFEGNHYFYSPYPTK
6
Enzymes







diphosphooligos

(SEQ ID NO: 626)









accharide--











protein











glycosyltransfer











ase subunit 1










P04843
RPN1 Dolichyl-
328-352
THYIVGYNLPSYEYLYNLGDQYA
6
Enzymes







diphosphooligos

LK (SEQ ID NO: 627)









accharide--











protein











glycosyltransfer











ase subunit 1










P04843
RPN1 Dolichyl-
525-536
ALTSEIALLQSR 
13 6
Enzymes







diphosphooligos

(SEQ ID NO: 628)









accharide--











protein











glycosyltransfer











ase subunit 1










P04844
RPN2 Dolichyl-
155-178
EETVLATVQALQTASHLSQQADL
6
Enzymes







diphosphooligos

(SEQ ID NO: 629)









accharide--











protein











glycosyltransfer











ase subunit 2










P04844
RPN2 Dolichyl-
179-190
SIVEEIEDLVAR 
14 3
Enzymes







diphosphooligos

(SEQ ID NO: 630)
13 6 9








accharide--











protein











glycosyltransfer











ase subunit 2










P04844
RPN2 Dolichyl-
443-456
TGQEVVFVAEPDNK
9
Enzymes







diphosphooligos

(SEQ ID NO: 631)









accharide--











protein











glycosyltransfer











ase subunit 2










P46783
RPS10 40S
81-95
DYLHLPPEIVPATR
3 13
Uncategorized







ribosomal

(SEQ ID NO: 632)









protein S10










P23396
RPS3 40S
152-173
FVDGLMIHSGDPVNYYVDTAVR
6
Transcription







ribosomal

(SEQ ID NO: 633)

factors,







protein S3



Regulators






P23396
RPS3 40S
28-40
ELAEDGYSGVEVR
13 6
Transcription







ribosomal

(SEQ ID NO: 634)

factors,







protein S3



Regulators






P23396
RPS3 40S
46-54
TEIIILATR (SEQ ID NO: 635)
6
Transcription







ribosomal



factors,







protein S3



Regulators






P23396
RPS3 40S
77-90
FGFPEGSVELYAEK
2 6
Transcription







ribosomal

(SEQ ID NO: 636)

factors,







protein S3



Regulators






P62241
RPS8 40S
158-170
ISSLLEEQFQGK
13
Uncategorized







ribosomal

(SEQ ID NO: 637)









protein S8










Q9NQC3
RTN4
1075-1090
AYLESEVAISEELVQK
13 14
Uncategorized







Reticulon-4

(SEQ ID NO: 638)








Q9Y265
RUVBL1 RuvB-
318-333
ALESSIAPIVIFASNR
2
Enzymes







like 1

(SEQ ID NO: 639)








Q9Y265
RUVBL1 RuvB-
91-107
VPFCPMVGSEVYSTEIK
2
Enzymes







like 1

(SEQ ID NO: 640)








Q9Y230
RUVBL2 RuvB-
315-330
ALESDMAPVLIMATNR
14
Transcription







like 2

(SEQ ID NO: 87)

factors,











Regulators






Q9Y512
SAMM50
128-148
LTGSYNTMVGNNEGSMVLGLK
14 4
Uncategorized







Sorting and

(SEQ ID NO: 641)









assembly











machinery











component 50











homolo










Q8NBX0
SCCPDH
145-167
GVYIIGSSGFDSIPADLGVIYTR
14
Enzymes







Saccharopine

(SEQ ID NO: 642)









dehydrogenase-











like











oxidoreductase










Q9HB40
SCPEP1
256-275
AEMIIEQNTDGVNFYNILTK
13 14
Enzymes







Retinoid-

(SEQ ID NO: 643)
15 3 2








inducible serine


4 6 9








carboxypeptidase


8







Q01105
SET Protein
91-122
IPNFWVTTFVNHPQVSALLGEEDE
3
Chaperones







SET

EALHYLTR (SEQ ID NO: 644)








P23246
SFPQ Splicing
377-399
NLSPYVSNELLEEAFSQFGPIER
13 14
Transcription







factor, proline-

(SEQ ID NO: 645)
3 2 4
factors,







and glutamine-


9
Regulators







rich










P23246
SFPQ Splicing
444-462
PVIVEPLEQLDDEDGLPEK
14 4 2
Transcription







factor, proline-

(SEQ ID NO: 646)

factors,







and glutamine-



Regulators







rich










Q9H9B4
SFXN1
36-48
NILLTNEQLESAR
14
Channels







Sideroflexin-1

(SEQ ID NO: 647)

Transporters,











Receptors






Q9H9B4
SFXN1
137-170
SGDAPLTVNELGTAYVSATTGAV
14
Channels







Sideroflexin-1

ATALGLNALTK (SEQ ID NO: 648)

Transporters,











Receptors






Q9H9B4
SFXN1
56-70
QGIVPPGLTENELWR
14
Channels







Sideroflexin-1

(SEQ ID NO: 649)

Transporters,











Receptors






Q9H9B4
SFXN1
93-112
MSAQVPMNMTITGCMMTFYR
6
Channels







Sideroflexin-1

(SEQ ID NO: 650)

Transporters,











Receptors






Q9H9B4
SFXN1
234-253
ILMAAPGMAIPPFIMNTLEK
6
Channels







Sideroflexin-1

(SEQ ID NO: 651)

Transporters,











Receptors






Q6P4A7
SFXN4
43-66
FLQWTELLDPTNVFISVESIENSR
14 2
Channels







Sideroflexin-4

(SEQ ID NO: 652)

Transporters,











Receptors






O95470
SGPL1
42699
AFEPYLEILEVYSTK
14
Enzymes







Sphingosine-1-

(SEQ ID NO: 653)









phophate lyase











1










Q9Y371
SH3GLB1
22-29
AVQFTEEK (SEQ ID NO: 654)
4 8
Adapter,







Endophilin-B1



Scaffolding,











Modulator











Proteins






P34897
SHMT2 Serine
105-121
YYGGAEVVDEIELLCQR
13 14
Enzymes







hydroxymethyltr

(SEQ ID NO: 655)
15 3 2








ansferase,


8








mitochondrial










Q9UBX3
SLC25A10
171-186
GALVTVGQLSCYDQAK
14
Channels







Mitochondrial

(SEQ ID NO: 656)

Transporters,







dicarboxylate



Receptors







carrier










O75746
SCLC25A12
260-283
YGQVTPLEIDILYQLADLYNASGR
14 4
Channels







Calcium-binding

(SEQ ID NO: 657)

Transporters,







mitochondrial



Receptors







carrier protein











Aral










O75746
SCLC25A12
641-652
LATATFAGIENK 
14 4
Channels







Calcium-binding

(SEQ ID NO: 658)

Transporters,







mitochondrial



Receptors







carrier protein











Aral










Q9UJS0
SCLC25A13
293-310
IAPLEEGTLPFNLAEAQR
4 6
Channels







Calcium-binding

(SEQ ID NO: 659)

Transporters,







mitochondrial



Receptors







carrier protein











Aral










Q9UJS0
SCLC25A13
261-282
FGQVTPMEVDILFQLADLYEPR
14 15
Channels







Calcium-binding

(SEQ ID NO: 660)
3 4 2
Transporters,







mitochondrial


6
Receptors







carrier protein











Aral










Q9UJS0
SCLC25A13
642-653
LAVATFAGIENK 
14 4 3
Channels







Calcium-binding

(SEQ ID NO: 661)
15 6 8
Transporters,







mitochondrial



Receptors







carrier protein











Aral










Q6NUK1
SCLC25A24
454-469
VLPAVGISYVVYENMK
2
Channels







Calcium-binding

(SEQ ID NO: 662)

Transporters,







mitochondrial



Receptors







carrier protein











SCaM










Q00325
SLC25A3
146-161
VLYSNMLGEENTYLWR
4
Channels







Phophate

(SEQ ID NO: 663)

Transporters,







carrier protein,



Receptors







mitochondrial










Q00325
SLC25A3
162-187
TSLYLAASASAEFFADIALAPMEA
4
Channels







Phophate

AK (SEQ ID NO: 664)

Transporters,







carrier protein,



Receptors







mitochondrial










Q9H2D1
SLC25A32
118-145
LEATEYLVSAAEAGAMTLCITNPL
14
Channels







Mitochondrial

WVTK (SEQ ID NO: 665)

Transporters,







folate



Receptors







transporter/carrier










P12235
SLC25A4
189-199
AAYFGVYDTAK (SEQ ID NO: 666)
14 2 8
Channels







ADP/ATP


4
Transporters,







translocase 1



Receptors






Q8TBP6
SLC25A40
136-152
LGENETCIPIVAGIVAR
14
Channels







Solute carrier

(SEQ ID NO: 667)

Transporters,







family 25



Receptors







member 40










P05141
SLC25A5
42697
DFLAGGVAAAISK
14 2 4
Channels







ADP/ATP

(SEQ ID NO: 668)

Transporters,







translocase 2



Receptors






P05141
SLC25A5
189-199
AAYFGIYDTAK (SEQ ID NO: 669)
14 2 4
Channels







ADP/ATP


3 9 8
Transporters,







translocase 2



Receptors






P12236
SLC25A6
42697
DFLAGGIAAAISK
14 4 2
Channels







ADP/ATP

(SEQ ID NO: 670)
13
Transporters,







translocase 3



Receptors






P12236
SLC25A6
189-199
AAYFGVYDTAK (SEQ ID NO: 671)
14 2 8
Channels







ADP/ATP


4
Transporters,







translocase 3



Receptors






Q8IXU6
SLC35F2 Solute
188-221
EDNSGSDVLIGDILVLLGASLYAIS
9
Channels







carrier family 35

NVCEEYIVK (SEQ ID NO: 672)

Transporters,







member F2



Receptors






Q9H2G2
SLK STE20-like
27-47
DLNPEDFWEIIGELGDGAFGK
6
Enzymes







serine/threonine-

(SEQ ID NO: 673)









protein kinase










Q92922
SMARCC1
894-905
SLVALLVETQMK
13
Transcription







SWI/SNF

(SEQ ID NO: 674)

factors,







complex subunit



Regulators







SMARCC1










Q14683
SMC1A
1070-1086
FNACFESVATNIDEIYK
8
Adapter,







Structureal

(SEQ ID NO: 675)

Scaffolding,







maintenance of



Modulator







chromosomes



Proteins







protein 1A










Q9H7B4
SMYD3 SET
255-265
DQYCFECDCFR
9
Enzymes







and MYND

(SEQ ID NO: 88)









domain-











containing











protein 3










Q96DI7
SNRNP40 U5
233-260
GHADSVTGLSLSSEGSYLLSNAM
13
Uncategorized







small nuclear

DNTVR (SEQ ID NO: 676)









ribonucleoprotein











40 kDa protein










P62314
SNRNP40 U5
67-86
YFILPDSLPLDTLLVDVEPK
13
Uncategorized







small nuclear

(SEQ ID NO: 677)









ribonucleoprotein











Sm D1










Q13813
SPTAN1
2354-2382
SLGYDLPMVEEGEPDPEFEAILDT
13 14
Adapter,







Spectrin alpha

VDPNR (SEQ ID NO: 678)

Scaffolding,







chain, non-



Modulator







erythrocytic 1



Proteins






Q01082
SPTAN1
1706-1717
EVDDLEQWIAER
13
Adapter,







Spectrin beta

(SEQ ID NO: 679)

Scaffolding,







chain, non-



Modulator







erythrocytic 1



Proteins






Q9UHB9
SRP68 Signal
312-333
IFLLGLADNEAAIVQAESEETK
14 13
Transcription







recognition

(SEQ ID NO: 680)

factors,







particle 68 kDa



Regulators







protein










Q04837
SSBP1 Single-
67-81
SGDSEVYQLGDVSQK
13 8
Transcription







stranded DNA-

(SEQ ID NO: 681)

factors,







binding protein,



Regulators







mitochondrial










Q8N3U4
STAG2 Cohesin
273-290
ELQENQDEIENMMNAIFK
13
Uncategorized







subunit SA-2

(SEQ ID NO: 682)








P31948
STIP1 Stress-
416-429
DCEECIQLEPTFIK
14
Uncategorized







induced-

(SEQ ID NO: 683)









phosphoprotein











1










Q9UJZ1
STOML2
58-72
ILEPGLNILIPVLDR
6
Adapter,







Stomatin-like

(SEQ ID NO: 684)

Scaffolding,







protein 2



Modulator











Proteins






Q9UJZ1
STOML2
35-51
NTVVLFVPQQEAWVVER
6
Adapter,







Stomatin-like

(SEQ ID NO: 685)

Scaffolding,







protein 2



Modulator











Proteins






Q9UJZ1
STOML2
115-135
ASYGVEDPEYAVTQLAQTTMR
13 8
Adapter,







Stomatin-like

(SEQ ID NO: 686)

Scaffolding,







protein 2



Modulator











Proteins






P46977
STT3A
330-340
FYSLLDPSYAK
14
Enzymes







Dolichyl-

(SEQ ID NO: 687)









diphosphooligos











accharide--











protein glycosy










P46977
STT3A
59-67
FLAEEGFYK (SEQ ID NO: 688)
6
Enzymes







Dolichyl-











diphosphooligos











accharide--











protein glycosy










P46977
STT3A
672-690
DFELDVLEEAYTTEHWLVR
6
Enzymes







Dolichyl-

(SEQ ID NO: 689)









diphosphooligos











accharide--











protein glycosy










P46977
STT3A
572-595
ELDVSYVLVIFGGLTGYSSDDINK
9
Enzymes







Dolichyl-

(SEQ ID NO: 690)









diphosphooligos











accharide--











protein glycosy










Q8TCJ2
STT3B
692-703
ESDYFTPQGEFR 
14
Enzymes







Dolichyl-

(SEQ ID NO: 691)









diphosphooligos











accharide--











protein glycosy










Q8TCJ2
STT3B
651-674
TLDVDYVLVIFGGVIGYSGDDINK
9
Enzymes







Dolichyl-

(SEQ ID NO: 692)









diphosphooligos











accharide--











protein glycosy










Q96I99
SUCLG2
151-160
ETYLAILMDR (SEQ ID NO: 693)
3
Enzymes







Succinyl-CoA











ligase










O15260
SURF4 Surfeit
31-43
LCLISTFLEDGIR 
13 14
Uncategorized







locus protein 4

(SEQ ID NO: 694)








O60506
SYNCRIP
334-356
NLANTVTEEILEK
9 6
Transcription







Heterogeneous

(SEQ ID NO: 695)

factors,







nuclear



Regulators







ribonucleoprotein











Q










Q92804
TAF15 TATA-
284-297
GEATVSFDDPPSAK
2
Transcription







binding protein-

(SEQ ID NO: 696)

factors,







associated factor



Regulators







2N










Q92804
TAF15 TATA-
423-431
SGGGYGGDR (SEQ ID NO: 697)
6
Transcription







binding protein-



factors,







associated factor



Regulators







2N










Q12788
TBL3
755-766
AALEALLPYTER 
13 8
Uncategorized







Transducin beta-

(SEQ ID NO: 698)









like protein 3










P52888
THOP1 Thimet
67-79
ALADVEVTYTVQR
14 8
Enzymes







oligopeptidase

(SEQ ID NO: 699)








P52888
THOP1 Thimet
105-115
LSEFDVEMSMR (SEQ ID NO: 700)
14
Enzymes







oligopeptidase










P52888
THOP1 Thimet
499-520
DFVEAPSQMLENWVWEQEPLLR
14
Enzymes







oligopeptidase

(SEQ ID NO: 701)








P62072
TIMM10
42545
AQQLAAELEVEMMADMYNR
13 14
Chaperones







Mitochondrial

(SEQ ID NO: 702)
9 8








import inner











membrane











translocase











su










Q99595
TIMM17A
13-35
IVDDCGGAFTMGTIGGGIFQAIK
14 15
Channels







Mitochondrial

(SEQ ID NO: 703)
2 4 8
Transporters,







import inner



Receptors







membrane











translocase su










O60830
TIMM17B
13-35
IVDDCGGAFTMGVIGGGVFQAIK
14 15
Channels







Mitochondrial

(SEQ ID NO: 704)
2 4 3
Transporters,







import inner


13 6 9
Receptors







membrane











translocase su










O43615
TIMM44
428-439
DQDELNPYAAWR
13 
Channels







Mitochondrial

(SEQ ID NO: 705)

Transporters,







import inner



Receptors







membrane











translocase su










P49755
TMED10
154-169
LEDLSESIVNDFAYMK
14 3 9
Channels







Transmembrane

(SEQ ID NO: 706)

Transporters,







emp24 domain-



Receptors







containg











protein 10










Q9BVK6
TMED9
49-65
CFIEEIPDETMVIGNYR
9
Channels







Transmembrane

(SEQ ID NO: 707)

Transporters,







emp24 domain-



Receptors







containg











protein 9










Q9H061
TMEM126A
85-105
CFVSFPLNTGDLDCETCRITR
14
Uncategorized







Transmembrane

(SEQ ID NO: 708)









protein 126A










P42166
TMPO Lamina-
621-637
TYDAASYICEAAFDEVK
4
Transcription







associated

(SEQ ID NO: 709)

factors,







polypeptide 2, 



Regulators







isoform alpha










Q92973
TNPO1
273-298
TQDQDENVALEACEFWLTLAEQP
9
Channels







Transportin-1

ICK (SEQ ID NO: 710)

Transporters,











Receptors






Q92973
TNPO1
45-64
LEQLNQYPDFNNYLIFVLTK
13 14
Channels







Transportin-1

(SEQ ID NO: 711)
2
Transporters,











Receptors






Q9NS69
TOMM22
106-117
LQMEQQQQLQQR
14
Channels







Mitochondrial

(SEQ ID NO: 712)

Transporters,







import receptor



Receptors







subunit TOM22











homolog










Q9NS69
TOMM22
61-76
SAAGATFDLSLFVAQK
14 4 2
Channels







Mitochondrial

(SEQ ID NO: 713)
13
Transporters,







import receptor



Receptors







subunit TOM22











homolog










O96008
TOMM40
278-293
ASDQLQVGVEFEASTR
14
Channels







Mitochondrial

(SEQ ID NO: 714)

Transporters,







import receptor



Receptors







subunit TOM40











homolog










O94826
TOMM70A
475-494
CAEGYALYAQALTDQQQFGK
14
Uncategorized







Mitochondrial

(SEQ ID NO: 715)









import receptor











subunit TOM70










P67936
TPM4
170-177
SLEAASEK (SEQ ID NO: 716)
13 3
Adapter,







Tropomyosin



Scaffolding,







alpha-4-chain



Modulator











Proteins






O14773
TPP1
521-558
GCHESCLDEEVEGQGFCSGPGWD
13 14
Enzymes







Tripeptidyl-

PVTGWGTPNFPALLK
15 4 9








peptidase 1

(SEQ ID NO: 89)








Q9H4I3
TRABD TraB
235-253
DLLEQMMAEMIGEFPDLHR
14
Uncategorized







domain-

(SEQ ID NO: 717)









containing











protein










Q12931
TRAP1 Heat
603-619
LDTHPAMVTVLEMGAAR
13
Chaperones







shock protein 75

(SEQ ID NO: 718)









kDa,











mitochondrial










Q15631
TSN Translin
205-215
VEEVVYDLSIR (SEQ ID NO: 719)
2
Transcription











factors,











Regulators






Q6DKK2
TTC19
134-149
AITYTYDLMANLAFIR
6
Adapter,







Tetratricopeptide

(SEQ ID NO: 720)

Scaffolding,







repeat protein



Modulator







19,



Proteins







mitochondrial










Q14166
TTLL12
254-287
CMLLPWAPTDMLDLSSCTPEPPA
4
Enzymes







Tubulin--

EHYQAILEENK (SEQ ID NO: 721)









tyrosine ligase-











like protein 12










Q71U36
TUBA1A
353-370
VGINYQPPTVVPGGDLAK
4
Adapter,







Tubulin alpha-

(SEQ ID NO: 722)

Scaffolding,







1A chain



Modulator











Proteins






Q71U36
TUBA1A
244-264
FDGALNVDLTEFQTNLVPYPR
13 3 2
Adapter,







Tubulin alpha-

(SEQ ID NO: 723)
4
Scaffolding,







1A chain



Modulator











Proteins






Q71U36
TUBA1A
281-304
AYHEQLSVAEITNACFEPANQMV
4 3 13
Adapter,







Tubulin alpha-

K (SEQ ID NO: 724)

Scaffolding,







1A chain



Modulator











Proteins






Q71U36
TUBA1A
374-390
AVCMLSNTTAIAEAWAR
4
Adapter,







Tubulin alpha-

(SEQ ID NO: 725)

Scaffolding,







1A chain



Modulator











Proteins






Q71U36
TUBA1A
65-79
AVFVDLEPTVIDEVR
13 4 3
Adapter,







Tubulin alpha-

(SEQ ID NO: 726)
2
Scaffolding,







1A chain



Modulator











Proteins






Q71U36
TUBA1A
403-422
AFVHWYVGEGMEEGEFSEAR
4
Adapter,







Tubulin alpha-

(SEQ ID NO: 727)

Scaffolding,







1A chain



Modulator











Proteins






Q71U36
TUBA1A
41-60
TIGGGDDSFNTFFSETGAGK
13
Adapter,







Tubulin alpha-

(SEQ ID NO: 728)

Scaffolding,







1A chain



Modulator











Proteins






Q13748
TUBA3D
244-264
FDGALNVDLTEFQTNLVPYPR
14 13
Adapter,







Tubulin alpha-

(SEQ ID NO: 729)
9 4
Scaffolding,







3C/D chain



Modulator











Proteins






Q13748
TUBA3D
281-304
AYHEQLSVAEITNACFEPANQMV
14 13
Adapter,







Tubulin alpha-

K (SEQ ID NO: 730)
4
Scaffolding,







3C/D chain



Modulator











Proteins






Q13748
TUBA3D
41-60
TIGGGDDSFNTFFSETGAGK
13 9
Adapter,







Tubulin alpha-

(SEQ ID NO: 731)

Scaffolding,







3C/D chain



Modulator











Proteins






P68366
TUBA4A
244-264
FDGALNVDLTEFQTNLVPYPR
3 2 13
Adapter,







Tubulin alpha-

(SEQ ID NO: 732)
9 4
Scaffolding,







4A chain



Modulator











Proteins






P68366
TUBA4A
281-304
AYHEQLSVAEITNACFEPANQMV
3 13 4
Adapter,







Tubulin alpha-

K (SEQ ID NO: 733)

Scaffolding,







4A chain



Modulator











Proteins






P68366
TUBA4A
340-352
SIQFVDWCPTGFK
13
Adapter,







Tubulin alpha-

(SEQ ID NO: 734)

Scaffolding,







4A chain



Modulator











Proteins






Q9NY65
TUBA8 Tubulin
244-264
FDGALNVDLTEFQTNLVPYPR
3 2 13
Adapter,







alpha-8 chain

(SEQ ID NO: 735)
9 4
Scaffolding,











Modulator











Proteins






P07437
TUBB Tubulin
104-121
GHYTEGAELVDSVLDVVR
13 9
Adapter,







beta chain

(SEQ ID NO: 736)
6
Scaffolding,











Modulator











Proteins






P07437
TUBB Tubulin
175-213
VSDTVVEPYNATLSVHQLVENTD
6
Adapter,







beta chain

ETYCIDNEALYDICFR

Scaffolding,









(SEQ ID NO: 737)

Modulator











Proteins






P07437
TUBB Tubulin
20-46
FWEVISDEHGIDPTGTYHGDSDLQ
6
Adapter,







beta chain

LDR (SEQ ID NO: 738)

Scaffolding,











Modulator











Proteins






P07437
TUBB Tubulin
217-241
LTTPTYGDLNHLVSATMSGVTTC
3 13 6
Adapter,







beta chain

LR (SEQ ID NO: 739)

Scaffolding,











Modulator











Proteins






P07437
TUBB Tubulin
283-297
ALTVPELTQQVFDAK
6
Adapter,







beta chain

(SEQ ID NO: 740)

Scaffolding,











Modulator











Proteins






P07437
TUBB Tubulin
310-318
YLTVAAVFR
6
Adapter,







beta chain

(SEQ ID NO: 741)

Scaffolding,











Modulator











Proteins






P07437
TUBB Tubulin
337-350
NSSYFVEWIPNNVK
13 3 9
Adapter,







beta chain

(SEQ ID NO: 742)
8 6
Scaffolding,











Modulator











Proteins






P07437
TUBB Tubulin
381-390
ISEQFTAMFR (SEQ ID NO: 743)
6
Adapter,







beta chain



Scaffolding,











Modulator











Proteins






P07437
TUBB Tubulin
47-58
ISVYYNEATGGK
13 6
Adapter,







beta chain

(SEQ ID NO: 744)

Scaffolding,











Modulator











Proteins






P07437
TUBB Tubulin
63-77
AILVDLEPGTMDSVR
6
Adapter,







beta chain

(SEQ ID NO: 745)

Scaffolding,











Modulator











Proteins






Q9BVA1
TUBB2B
381-390
ISEQFTAMFR (SEQ ID NO: 746)
6
Adapter,







Tubulin beta-2B



Scaffolding,







chain



Modulator











Proteins






Q9BVA1
TUBB2B
63-77
AILVDLEPGTMDSVR
6
Adapter,







Tubulin beta-2B

(SEQ ID NO: 747)

Scaffolding,







chain



Modulator











Proteins






Q9BVA1
TUBB2B
175-213
VSDTVVEPYNATLSVHQLVENTD
6
Adapter,







Tubulin beta-2B

ETYCIDNEALYDICFR

Scaffolding,







chain

(SEQ ID NO: 748)

Modulator











Proteins






Q9BVA1
TUBB2B
337-350
NSSYFVEWIPNNVK
3 13 9
Adapter,







Tubulin beta-2B

(SEQ ID NO: 749)
8 6
Scaffolding,







chain



Modulator











Proteins






Q9BVA1
TUBB2B
104-121
GHYTEGAELVDSVLDVVR
9 6
Adapter,







Tubulin beta-2B

(SEQ ID NO: 750)

Scaffolding,







chain



Modulator











Proteins






Q9BVA1
TUBB2B
217-241
LTTPTYGDLNHLVSATMSGCTTC
3 13
Adapter,







Tubulin beta-2B

LR (SEQ ID NO: 751)

Scaffolding,







chain



Modulator











Proteins






Q13509
TUBB3 Tubulin
104-121
GHYTEGAELVDSVLDVVR
9 4 6
Adapter,







beta-3 chain

(SEQ ID NO: 752)

Scaffolding,











Modulator











Proteins






Q13509
TUBB3 Tubulin
337-350
NSSYFVEWIPNNVK
4 6
Adapter,







beta-3 chain

(SEQ ID NO: 753)

Scaffolding,











Modulator











Proteins






Q13509
TUBB3 Tubulin
63-77
AILVDLEPGTMDSVR
4
Adapter,







beta-3 chain

(SEQ ID NO: 754)

Scaffolding,











Modulator











Proteins






P68371
TUBB4B 
104-121
GHYTEGAELVDSVLDVVR
9 6
Adapter,







Tubulin beta-4B

(SEQ ID NO: 755)

Scaffolding,







chain



Modulator











Proteins






P68371
TUBB4B 
175-213
VSDTVVEPYNATLSVHQLVENTD
6
Adapter,







Tubulin beta-4B

ETYCIDNEALYDICFR

Scaffolding,







chain

(SEQ ID NO: 756)

Modulator











Proteins






P68371
TUBB4B 
217-241
LTTPTYGDLNHLVSATMSGVTTC
3 13 6
Adapter,







Tubulin beta-4B

LR (SEQ ID NO: 757)

Scaffolding,







chain



Modulator











Proteins






P68371
TUBB4B 
310-318
YLTVAAVFR (SEQ ID NO: 758)
6
Adapter,







Tubulin beta-4B



Scaffolding,







chain



Modulator











Proteins






P68371
TUBB4B 
337-350
NSSYFVEWIPNNVK
3 13 9
Adapter,







Tubulin beta-4B

(SEQ ID NO: 759)
8 6
Scaffolding,







chain



Modulator











Proteins






P68371
TUBB4B 
381-390
ISEQFTAMFR (SEQ ID NO: 760)
6
Adapter,







Tubulin beta-4B



Scaffolding,







chain



Modulator











Proteins






Q9BUF5
TUBB6 Tubulin
217-241
LTTPTYGDLNHLVSATMSGVTTS
4
Adapter,







beta-6 chain

LR (SEQ ID NO: 761)

Scaffolding,











Modulator











Proteins






Q9BUF5
TUBB6 Tubulin
175-213
VSDTVVEPYNATLSVHQLVENTD
6
Adapter,







beta-6 chain

ETYCIDNEALYDICFR

Scaffolding,









(SEQ ID NO: 762)

Modulator











Proteins






Q9BUF5
TUBB6 Tubulin
337-350
NSSYFVEWIPNNVK
4 2 13
Adapter,







beta-6 chain

(SEQ ID NO: 763)
6
Scaffolding,











Modulator











Proteins






P49411
TUFM
183-200
ADAVQDSEMVELVELEIR
13 4 3
Transciption







Elongation

(SEQ ID NO: 764)
2 8 6
factors,







factor Tu, 



Regulators







mitochondrial










P49411
TUFM
239-252
LLDAVDTYIPVPAR
6
Transciption







Elongation

(SEQ ID NO: 765)

factors,







factor Tu, 



Regulators







mitochondrial










P49411
TUFM
272-281
GTVVTGTLER (SEQ ID NO: 766)
3 4 15
Transciption







Elongation


6 8
factors,







factor Tu, 



Regulators







mitochondrial










Q9BRA2
TXNDC17
42477
YEEVSVSGFEEFHR
14
Uncategorized







Thioredoxin

(SEQ ID NO: 90)









domain-











containing











protein 17










Q14157
UBAP2L
239-257
TATEEWGTEDWNEDLSETK
8
Uncategorized







Ubiquitin-

(SEQ ID NO: 767)









associated











protein 2-like










P31930
UQCRC1
397-415
NALVSHLDGTTPVCEDIGR
13 4 3
Channels,







Cytochrome b-

(SEQ ID NO: 768)
2 8
Transporters,







c1 complex



Receptors







subunit 1,











mitochondrial










P21796
VDAC1
140-161
GALVLGYEGWLAGYQMNFETAK
14 2
Channels,







Voltage-

(SEQ ID NO: 769)
13 4 6
Transporters,







dependent



Receptors







anion-slective











channel protein










P21796
VDAC1
121-139
EHINLGCDMDFDIAGPSIR
13 14
Channels,







Voltage-

(SEQ ID NO: 770)
2 4 8
Transporters,







dependent



Receptors







anion-slective











channel protein










P21796
VDAC1
75-93
WNTDNTLGTEITVEDQLAR
13 14
Channels,







Voltage-

(SEQ ID NO: 771)
15 3 2
Transporters,







dependent


4 6 9
Receptors







anion-slective


8








channel protein










P21796
VDAC1
164-174
VTQSNFAVGYK (SEQ ID NO: 772)
14 4 8
Channels,







Voltage-


6
Transporters,







dependent



Receptors







anion-slective











channel protein










P21796
VDAC1
64-74
WTEYGLTFTEK (SEQ ID NO: 773)
13 14
Channels,







Voltage-


15 3 2
Transporters,







dependent


4 6 9
Receptors







anion-slective


8








channel protein










P21796
VDAC1
35-53
SENGLEFTSSGSANTETTK
4 8 9
Channels,







Voltage-

(SEQ ID NO: 774)

Transporters,







dependent



Receptors







anion-slective











channel protein










P21796
VDAC1
175-197
TDEFQLHTNVNDGTEFGGSIYQK
14 4 8
Channels,







Voltage-

(SEQ ID NO: 775)

Transporters,







dependent



Receptors







anion-slective











channel protein










P21796
VDAC1
225-236
YQIDPDACFSAK 
4 8
Channels,







Voltage-

(SEQ ID NO: 776)

Transporters,







dependent



Receptors







anion-slective











channel protein










P45880
VDAC2
86-107
WNTDNTLGTEIAIEDQICQGLK
13 14
Channels,







Voltage-

(SEQ ID NO: 777)
15 3 2
Transporters,







dependent


4 6 9
Receptors







anion-slective


8








channel protein










P45880
VDAC2
178-185
NNFAVGYR (SEQ ID NO: 778)
14 2
Channels,







Voltage-


13 8 4
Transporters,







dependent


6
Receptors







anion-slective











channel protein










P45880
VDAC2
186-208
TGDFQLHTNVNDGTEFGGSIYQK
14 4 2
Channels,







Voltage-

(SEQ ID NO: 779)

Transporters,







dependent



Receptors







anion-slective











channel protein










P45880
VDAC2
209-229
VCEDLDTSVNLAWTSGTNCTR
13 14
Channels,







Voltage-

(SEQ ID NO: 780)
15 2 9
Transporters,







dependent


8 4
Receptors







anion-slective











channel protein










P45880
VDAC2
236-247
TQLDPTASISAK 
13 14
Channels,







Voltage-

(SEQ ID NO: 781)
4
Transporters,







dependent



Receptors







anion-slective











channel protein










P45880
VDAC2
75-85
WCEYGLTFTEK (SEQ ID NO: 782)
13 14
Channels,







Voltage-


15 3 2
Transporters,







dependent


4 6 9
Receptors







anion-slective











channel protein










Q9Y277
VDAC3
164-174
LSQNNFALGYK (SEQ ID NO: 783)
14
Channels,







Voltage-



Transporters,







dependent



Receptors







anion-slective











channel protein










P08670
VIM Vimentin
283-292
NLQEAEEWK (SEQ ID NO: 784)
13 14
Uncategorized










3 2 4











15 6 9











8







P08670
VIM Vimentin
322-334
QVQSLTCEVDALK
4 9 6
Uncategorized









(SEQ ID NO: 785)








P08670
VIM Vimentin
176-184
DNAEDIMR (SEQ ID NO: 786)
6
Uncategorized






P08670
VIM Vimentin
197-207
EEAENTLQSFR (SEQ ID NO: 787)
13 14
Uncategorized










3 2 15











9 6







P08670
VIM Vimentin
130-139
ILLAELEQLK (SEQ ID NO: 788)
14 3 2
Uncategorized










4 69







P08670
VIM Vimentin
29-36
SYVTTSTR (SEQ ID NO: 789)
14 6
Uncategorized






P08670
VIM Vimentin
146-155
LGDLYEEEMR (SEQ ID NO: 790)
6
Uncategorized






P08670
VIM Vimentin
42502
SVSSSSYR (SEQ ID NO: 791)
6
Uncategorized






P08670
VIM Vimentin
189-196
LQEEMLQR (SEQ ID NO: 792)
3 6
Uncategorized






P08670
VIM Vimentin
105-113
VELQELNDR (SEQ ID NO: 793)
4 6
Uncategorized






P08670
VIM Vimentin
79-97
LLQDSVDFSLADAINTEFK
13 14
Uncategorized









(SEQ ID NO: 794)
15 4 3











2 6 9











8







P08670
VIM Vimentin
295-304
FADLSEAANR (SEQ ID NO: 795)
6
Uncategorized






P08670
VIM Vimentin
346-364
EMEENFAVEAANYQDTIGR
13 14
Uncategorized









(SEQ ID NO: 796)
15 3 2











4 6 9











8







P08670
VIM Vimentin
335-342
GTNESLER (SEQ ID NO: 797)
6
Uncategorized






P08670
VIM Vimentin
114-120
GANYIDK (SEQ ID NO: 798)
4 3 8
Uncategorized










6







P08670
VIM Vimentin
365-373
LQDEIQNMK (SEQ ID NO: 799)
4 3 2
Uncategorized










6







P08670
VIM Vimentin
382-390
EYQDLLNVK(SEQ ID NO: 800)
3
Uncategorized






P08670
VIM Vimentin
51-64
SLYASSPGGVYATR

Uncategorized









(SEQ ID NO: 801)
14 2 4











3 13 8











6







P08670
VIM Vimentin
224-235
VESLQEEIAFLK 
14 4 6
Uncategorized









(SEQ ID NO: 802)








Q96GC9
VMP1 Vacuole
214-243
LSGAEPDDEEYQEFEEMLEHAES
14
Uncategorized







membrane

AQDFASR (SEQ ID NO: 803)









protein 1










Q96AX1
VPS33A
233-262
NVDLLTPLATQLTYEGLIDEIYGIQ
14
Channels,







Vacuolar protein

NSYVK (SEQ ID NO: 804)

Transporters,







sorting-



Receptors







associated











protein 33A










Q9UID3
VPS51 Vacuolar
742-763
FVADEELVHLLLDEVVASAALR
14
Channels,







protein sorting-

(SEQ ID NO: 805)

Transporters,







associated



Receptors







protein 51 hom










O43592
XPOT Exportin-
825-843
VLVTVIQGAVEYPDPIAQK
132
Channels,







T

(SEQ ID NO: 806)

Transporters,











Receptors






P12956
XRCC6 X-ray
475-488
SDSFENPVLQQHFR
3 4 2
Transcription







repair cross-

(SEQ ID NO: 807)
13 8
factors,







complementing



Regulators







protein 6










P12956
XRCC6 X-ray
489-510
NLEALALDLMEPQAVDLTLPK
13 3 2
Transcription







repair cross-

(SEQ ID NO: 808)
4 8
factors,







complementing



Regulators







protein 6










P67809
YBX1
102-118
SVGDGETVEFDVVEGEK
6
Transcription







Nuclease-

(SEQ ID NO: 809)

factors,







sensitive



Regulators







element-binding











protein 1










P62258
YWHAE 14-3-3
197-215
AAFDDAIAELDTLSEESYK
13
Uncategorized







protein epsilon

(SEQ ID NO: 92)








P62258
YWHAE 14-3-3
143-153
EAAENSLVAYK
13
Uncategorized







protein epsilon

(SEQ ID NO: 91)








P27348
YWHAQ 14-3-3
194-212
TAFDEAIAELDTLNEDSYK
14
Uncategorized







protein theta

(SEQ ID NO: 93)








P63104
YWHAZ 14-3-3
194-212
TAFDEAIAELDTLSEESY
14 13
Uncategorized







protein

(SEQ ID NO: 94)









zelta/delta






















TABLE 2










Predicted
Overlapping







pocket
pockets



Labeled



residue
(fpocket


Protein Name
Peptide
Peptide Sequence
Probes
PDB
overlap
designation)







ACP1 Low
42-59
VDSAATSGYEIGNPPD
13
3N8I
47.A,50.A
1


molecular weight

YR






phosphotyrosine

(SEQ ID NO: 1)






protein phosp








ADCK3 Chaperone
277-295
LGQMLSIQDDAFINPH
14
4PED
278.A,283.A
1


activity of bc1

LAK






complex-like,

(SEQ ID NO: 2)






mitochondr








ADK Adenosine
209-224
IFTLNLSAPFISQFYK
2
4O1L
200.A,205.A,
1, 5, 10, 12,


kinase

(SEQ ID NO: 3)


207.A,200.B,
20, 21







201.B,206.B,








207.B



ADSS
431-441
FIEDELQIPVK
14
2V40
435.A
8


Adenylosuccinate

(SEQ ID NO: 4)






synthetase isozyme








2








AHCYL2 Putative
331-342
GIVEESVTGVHR (SEQ
6
3GVP
335.A,336.A,
5, 6, 16, 17,


adenosylhomocysteinase

ID NO: 810)


337.A,338.A,
18, 22, 27,


3




341.A,342.A,
38, 51







335.B,








336.B,337.B,








338.B,332.C.








334.C,








335.C,337.C,








339.C,342.C,








332.D,334.D,








335.D,336.D,








337.D,








339.D,342.D



AIFM1 Apoptosis-
475-510
PYWHQSMFWSDLGP
2 3 4 6
4LII
480.A,482.A,
1


inducing factor 1,

DVGYEAIGLVDSSLPT


492.A



mitochondrial

VGVFAK








(SEQ ID NO: 5)






ALDH7A1 Alpha-
139-162
ILVEGVGEVQEYVDIC
8 13
4ZUL
117.A,118.A,
3, 6, 26, 37


aminoadipic

DYAVGLSR


1220.A,123.A,
47, 48, 52,


semialdehyde

(SEQ ID NO: 6)


127.A,l28.A,
69, 71, 84,


dehydrogenase




130.A,
86, 93, 95,







131.A,132.A,
102, 115







133.A,134.A,








111.B,112.B,








113.B,








114.B,117.B,








120.B,123.B,








127.B,128.B,








130.B,








132.B,133.B,








134.B,120.C,








123.C,127.C,








128.C,








120.D,123.D,








127.D,128.D,








130.D,132.D,








133.D,








134.D,116.E,








120.E,124.E,








128.E,120.F,








127.F,128.F,








130.F,








131.F,132.F,








133.F,134.F,








128.G,130.G,








132.G,133.G,








134.G,120.H,








124.H,








128.H,130.H,








133.H,134.H



ANP32A Acidic
117-132
SLDLFNCEVTNLNDY
13
4XOS
No Overlap



leucine-rich nuclear

R (SEQ ID NO: 811)






phosphoprotein 32








fami








API5 Apoptosis
182-196
VLEDVTGEEFVLFMK
4
3U0R
187.A,193.A
3


inhibitor 5

(SEQ ID NO: 812)






API5 Apoptosis
131-148
GTLGGLFSQILQGEDI
4
3U0R
145.A
3


inhibitors

VR (SEQ ID NO: 813)






API5 Apoptosis
211-237
QQLVELVAEQADLEQ






inhibitor 5

TFNPSDPDCVDR (SEQ
4
3U0R
No Overlap





ID NO: 814)






ARF1 ADP-
39-59
LGEIVTTIPTIGFNVET
2 3 8 13
3O47
175.A,176.A,
1, 2, 9, 11, 15


ribosylation factor 1

VEYK


177.A,178.A,





(SEQ ID NO: 7)


179.A,181.A,








183,A,








185.A,172.B,








173.B,174.B,








185.B,187.B,








189.B



ARF4 ADP-
39-59
LGEIVTTIPTIGFNVET
2 3 8 13
1Z6X
48.A,49.A,50.A,
1, 5, 6


ribosylation factor 4

VEYK


51.A,52.A,





(SEQ ID NO: 7)


54.A,49.B,








52.B,54.B



ARF5 ADP-
39-59
LGEIVTTIPTIGFNVET
2 3 4 8
2B6H
44.A,52.A,53.A,



ribosylation factor 5

VEYK
13

54.A,57.A,
1, 2




(SEQ ID NO: 7)


59.A



ARL1 ADP-
163-178
GTGLDEAMEWLVET
13 14
4DCN
No Overlap



ribosylation factor-

LK






like protein 1

(SEQ ID NO: 9)






ARL1 ADP-
37-59
LQVGEVVTTIPTIGFN
13
4DCN
38.A,44.A,4.6A,
1, 2, 3, 4


ribosylation factor-

VETVTYK


47.A,51.A,



like protein 1

(SEQ ID NO: 10)


52.A,53.A,








54.A,38.B,








43.B,44.B,








46.B,47.B,








48.B,52.B,








54.B



ATIC Bifunctional
178-194
AFTHTAQYDEAISDY
13
1PKX
183.A,184.A,
5, 12, 17, 18,


purine biosynthesis

FR


187.A,191.A,
49, 51, 54


protein PURH

(SEQ ID NO: 11)


194.A,183.B,








187.B,








188.B,190.B,








191.B,194.B,








180.C,181.C,








183.C,








184.C,185.C,








187.C,188.C,








191.C,








194.C,181.D,








183.D,184.D,








185.D,187.D,








188.D,190.D,








194.D



BAX Apoptosis
66-78
IGDELDSNMELQR
13
4ZIG
No Overlap



regulator BAX

(SEQ ID NO: 815)






BLMH Bleomycin
203-218
GEISATQDVMMEEIFR
13
1CB5
210.A,213.A,
29, 30, 31, 78


hydrolase

(SEQ ID NO: 13)


217.A,218.A,








210.B,







'
213.B,217.B,








210.C,213.C,








217.C



BLMH Bleomycin
111-124
CYFFLSAFVDTAQR
14
1CB5
112.A,122.A,
4, 29, 30, 31


hydrolase

(SEQ ID NO: 12)


123.A,112.B,
67, 76, 77







113.B,








122.B,123.B,








112.C,122.C,








123.C



C1QBP
247-276
GVDNTFADELVELST
3 9 13
3RPX
264.A,265.A,
1, 2, 6, 8


Complement

ALEHQEYITFLEDLK
14

268.A,274.A,



component 1 Q

(SEQ ID NO: 816)


260.C,261.C,



subcomponent-




264.C,



binding prot




265.C,268.C



C1QBP
105-119
MSGGWELELNGTEA
9
3RPX
108.A,110.
7


Complement

K (SEQ ID NO: 817)


A,111.A



component 1 Q








subcomponent-








binding prot








CALM3
39-75
SLGQNPTEAELQDMI
14
4UPU
No Overlap



Calmodulin

NEVDADGNGTIDFPE








FLTMMAR 








NO: 818)






CALR Calreticulin
323-351
SGTIFDNFLITNDEAY
6 9 13
3POW
329.A,345.A,





AEEFGNETWGVTK


346.A,349.A
4




(SEQ ID NO: 14)






CALR Calreticulin
99-111
HEQNIDCGGGYVK
6
3POW
No Overlap





(SEQ ID NO: 15)






CAPN1 Calpain-1
175-193
LVFVHSAEGNEFWSA
14
2ARY
175.A,179.A,
1, 14


catalytic subunit

LLEK


180.A,181.A,





(SEQ ID NO: 16)


182.A,183.A,








186.A,








175.B,176.B,








179.B,180.B,








181.B,186.B



CKB Creatine
224-236
TFLVWVNEEDHLR
3
3B6R
228.A,232.A,
1, 2, 4


kinase B-type

(SEQ ID NO: 19)


233.A,228.B,








232.B



CKB Creatine
342-358
LGFSEVELVQMVVDG
3 13
3B6R
342.A
21


kinase B-type

VK








(SEQ ID NO: 21)






CKB Creatine
367-381
LEQGQAIDDLMPAQK
13
3B6R
No Overlap



kinase B-type

(SEQ ID NO: 22)






CKB Creatine
14-32
FPAEDEFPDLSAHNN
3
3B6R
29.B
5


kinase B-type

HMAK








(SEQ ID NO: 17)






CKB Creatine
157-172
LAVEALSSLDGDLAG
13
3B6R
159.B,160.B,
3, 10, 15


kinase B-type

R


163.B,164.B,





(SEQ ID NO: 18)


168.B,169.B,








170.B,








171.B,172.B



CKB Creatine
253-265
FCTGLTQIETLFK
13
3B6R
261.A,265.A,
7, 17


kinase B-type

(SEQ ID NO: 20)


261.B,265.B



CKMT1B Creatine
257-269
SFLIWVNEEDHTR
3
1QK1
223.B,227.B,
4, 6, 38, 78,


kinase U-type,

(SEQ ID NO: 23)


223.C,221.D,
80, 82


mitochondria'




223.D,226.D,








221.H,








223.H,226.H



CLPP Putative
215-226
QSLQVIESAMER
6
1TG6
166.A,167.A,
20, 41, 49,


ATP-dependent Clp

(SEQ ID NO: 24)


168.A,169.A,
53, 56, 58,


protease proteolyticsu




170.A,169.B,
60, 62, 65







170.B,








159.C,167.C,








168.C,169.C,








170,C,159.E,








159.G,








161.G,163.G,








165.G,167.G,








168.G,169.G,








170.G



COPS4 COP9
154-170
LYLEDDDPVQAEAYI
13 15
4D18
157.D,158.D
178


signalosome

NR (SEQ ID NO: 819)






complex subunit 4








CSNK1A1 Casein
84-106
DYNVLVMDLLGPSLE
14
5FQD
95.C,100.C,
14, 15, 69, 88,


kinase I isoform

DLFNFCSR


88.F,90.F,



alpha

(SEQ ID NO: 25)


91.F,93.F,94.F,








95.F,99.F



CSNK2B Casein
112-134
VYCENQPMLPIGLSDI
14
4NH1
126.C,126.D
1, 4


kinase II subunit 

PGEAMVK






beta

(SEQ ID NO: 26)






CTNNB1 Catenin
648-661
NEGVATYAAAVLFR
13 14
3TX7
660.A,661.A
28


beta-1

(SEQ ID NO: 820)






CTSB Cathepsin B
315-331
GQDHCGIESEVVAGIP
2 49 13
3K9M
237.A,238.A,
4, 10, 13, 19




R


240.A,241.A,





(SEQ ID NO: 27)


251.A,252.A,








251.B,252.B



CTSD Cathepsin D
236-253
DPDAQPGGELMLGGT
9
4OD9
173.B
10




DSK








(SEQ ID NO: 28)






CTSD Cathepsin D
288-309
EGCEAIVDTGTSLMV
4 6 8 9
4OD9
231.B,233.B,
1, 2, 7




GPVDEVR
13 14

234.B,238.B,





(SEQ ID NO: 29)
15

241.B,242.B,








245.B,








231.D,233.D,








234.D,235.D,








236.D,238.D



CTSD Cathepsin D
314-331
AIGAVPLIQGEYMIPC
2 3 4 6
4OD9
258.B,260.B,
1, 2




EK
8 9 13

258.D,260.D





(SEQ ID NO: 30)
14 15





CYB5R3 NADH-cytochrome b5
235-241
LWYTLDR
3
lUMK
237.A,238.A,
1


reductase 3

(SEQ ID NO: 31)


239.A



DECR1 2,4-
299-315
FDGGEEVLISGEFNDL
6
1W6U
306.A,307.A,
1, 2, 9, 10,


dienoyl-CoA

R


308.A,309.A,
14, 23, 25,


reductase,

(SEQ ID NO: 32)


311.A,312.A,
27, 35, 42, 47


mitochondrial




313.A,








314.A,315.A,








304.B,305.B,








308.B,310.B,








311.B,








313.B,314.B,








315.B,303.C,








304.C,305.C,








306 C,








308.C,310.C,








311.C,312.C,








315.C,305.D,








306.D,








307.D,308.D,








309.D,310.D,








311.D,312.D,








313.D,








314.D,315.D



DHX9 ATP-
448-456
ISAVSVAER
3
3LLM
449.B,453.B,
6


dependent RNA

(SEQ ID NO: 33)


456.B



helicase A








DIABLO Diablo
124-140
MNSEEEDEVWQVIIG
13
4TX5
78.A,82.A,85.A,
5, 11, 12


homolog,

AR (SEQ ID NO: 821)


71.B,74.B,



mitochondrial




75.B,78.B,








84.B



DLD Dihydrolipoyl
450-482
VLGAHILGPGAGEMV
4 13 14
3RNM
416.A,417.A,
2, 3, 8, 9, 10,


dehydrogenase,

NEAALALEYGASCED


418.A,423.A,
11, 12, 17,


mitochondrial

IAR


424.A,443.A,
37, 44, 50,




(SEQ ID NO: 34)


444.A,
54, 67







445.A,446.A,








447.A,415.B,








416.B,423.B,








424 B,








428.B,433.B,








436.B,437.B,








446.B,421.C,








423.C,








424.C,427.C,








436.C,437.C,








443.C,447.C,








421.D,








423.D,424.D,








446.D



ECH1 Delta(3,5)-
197-211
EVDVGLAADVGTLQ
3 4 6 8
2VRE
171.A,174.A,
1, 3, 4, 24


Delta(2,4)-dienoyl-

R
13 14

171.B,176.B,



CoA isomerase,

(SEQ ID NO: 37)
15

179.B,180.B,



mitoc




171.C,174.C



ECH1 Delta(3,5)-
149-158
YQETFNVIER
6
2VRE
123.A,124.A,
1, 2, 3, 4, 5,


Delta(2,4)-dienoyl-

(SEQ ID NO: 36)


128.A,131.A,
8, 9, 12


CoA isomerase,




123.B,125.B,



mitoc




126.B,








128.B,131.B,








123.C,124.C,








125.C,128.C,








131.C



ECH1 Delta(3,5)-
113-131
MFTAG1DLMDMASDI
6
2VRE
98.A,100.A,
1, 3, 4, 12,


Delta(2,4)-dienoyl-

LQPK


101.A,102.A,
23, 24


CoA isomerase,

(SEQ ID NO: 35)


92.B,97.B,



mitoc




98.B,100.B,








90.C,92.0,








93.C,94.C,








95.C,97.C,








98.C,100.C,








101.C,104.C



EIF4A1 Eukaryotic
69-82
GYDVIAQAQSGTGK
9 13 14
2ZU6
75.A,76.A,
1, 5, 10, 53,


initiation factor 4A-

(SEQ ID NO: 39)


78.A,82.A,
84


I




75.C,78.C,








79.C,80.C,82.C



EIF4A1 Eukaryotic
178-190
MFVLDEADEML SR
13
2ZU6
178.C,190.C,1
2, 4, 5


initiation factor 4A-

(SEQ ID NO: 38)


85.D,186.D,



I




188.D,189.D,








190.D



EIF4A2
Eukaryotic
GYDVIAQAQSGTGK
13
3BOR
76.A,82.A,
1


initiation factor 4A-
70-83
(SEQ ID NO: 40)


83.A



II








ELAVL1 ELAV-
20-37
TNLIVNYLPQNMTQD
2 4 13
4FXV
33.A,26.B,
1, 2, 4, 5, 6


like protein 1

ELR (SEQ ID NO: 822)


28.B,30.B,








32.B,34.B,








35.B,37.B,20.C,








21.C,32.D,








34.D,35.D,37.D



ERH Enhancer of
18-34
TYADYESVNECMEG
13
2NML
18.A
2


rudimentary

VCK (SEQ ID NO: 823)






homolog








ETFB Electron
36-51
HSMNPFCEIAVEEAV
3
2A1T
36.S,37.S,
1, 4


transfer

R


39.S,40.S,



flavoprotein subunit

(SEQ ID NO: 41)


41.S,43.S,44.S



beta








EXO1 Exonuclease
139-160
SQGVDCLVAPYEADA
2 6 8 9
3QEB
143.Z,144.Z,
1, 9, 11


1

QLAYLNK
13

145.Z,149.Z,





(SEQ ID NO: 95)


150.Z



FARSB
72-82
YDLLCLEGLVR (SEQ
9
3L4G
72.B,72.D,
1, 5, 8, 9, 43,


Phenylalanine--

ID NO: 824)


76.D,72.F,



tRNA ligase beta




76.F,72.H,72.J,



subunit




74.J,76.J,75.L, 
44, 45, 53,







76.N,78.N,
57, 99, 113,







72.P,75.P,
124, 273, 279







76.P



FARSB
518-530
IMQLLDVPPGEDK
2
3L4G
519.B,520.B,
2, 7, 35, 54,


Phenylalanine--

(SEQ ID NO: 825)


524.B,526.B,
97, 106, 107,


tRNA ligase beta




528.B,530.B,
134, 136,


subunit




520.D,
181, 215,







521.D,523.D,
218, 224,







524.D,525.D,
267, 288,







530.D,519.F,
295, 308







520.F,








523.F,524.F,








525.F,520.H,








521.H,523.H,








524.H,526.H,








530.H,








519.J,520.J,








523.J,524.J,








525.J,526.J,








529.J,530.J,








523.N,520.P,








523.P



FDFT1 Squalene
78-92
ALDTLEDDMTISVEK
15
3VJ9
80.A,83.A
1


synthase

(SEQ ID NO: 826)






FECH
254-272
SEVVILFSAHSLPMSV
4
3HCN
255.A,263.A,
1, 2, 3, 8, 12,


Ferrochelatase,

VNR


270.A,271.A,
17, 21, 27


mitochondrial

(SEQ ID NO: 42)


754.B,755.B,








763.B,








764.B,766.B,








768.B,770.B,








771.B



FKBP4 Peptidyl-

FEIGEGENLDLPYGLE
13
4LAY
No Overlap



prolyl cis-trans
190-206
R (SEQ ID NO: 827)






isomerase FKBP4








GLA Alpha-
241-252
SILDWTSFNQER
9
3S5Z
244.A,247.A,
11, 20


galactosidase A

(SEQ ID NO: 43)


250.B,251.B,








252.B



GLA Alpha-
68-82
LFMEMAELMVSEGW


70.A,68.B,
13, 16


galactosidase A

K
4
3S5Z
71.B





(SEQ ID NO: 45)






GLA Alpha-
50-67
FMCNLDCQEEPDSCIS
9
3S5Z
50.A,51.A,
1, 3, 13, 16


galactosidase A

EK


52.A,53.A,





(SEQ ID NO: 44)


59.A,60.A,








61.A,62.A,








66.A,50.B,








51.B,52.B,53.B,








55.B,59.B,








60.B,61.B,








62.B,63.B,








65.B,67.B



GLB1 Beta-
286-299
TEAVASSLYDILAR
9
3THC
No Overlap



galactosidase

(SEQ ID NO: 46)






GLO1
160-179
GLAFIQDPDGYWIEIL
3 14
3W0T
159.A,164.A,
1, 2, 4, 8, 12,


Lactoylglutathione

NPNK


165.A,166.A,
19


lyase

(SEQ ID NO: 47)


175.A,178.A,








160.B,








162.B,170.B,








172.B,160.C,








162.C,170.C,








172.C,








162.D,164.D,








166.D,167.D,








168.D,170.D,








172.D



GLUD1 Glutamate
481-496
HGGTIPIVPTAEFQDR
6
1L1F
443.A,440.B,
35, 39, 66


dehydrogenase 1,

(SEQ ID NO: 49)


443.B,439.F,



mitochondria




443.F



GLUD1 Glutamate
152-162
YSTDVSVDEVK
6
1L1F
99.A,100.A,
4, 17, 44, 55,


dehydrogenase 1,

(SEQ ID NO: 48)


101.A,102.A,
57, 60, 61, 65


mitochondrial




99.B,100.B,








101.B,107.B,








99.C,101.C,








102.C,99.D,








100.D,101.D,








102.D,








109.D,99.E,








100.E,101.E,








102.E,99.F,








100.F,101.F,








102.F,109.F



GOLPH3 Golgi
75-90
EGYTSFWNDCISSGLR
14
3KN1
76.A,79.A,
1, 5, 9


phosphoprotein 3

(SEQ ID NO: 50)


80.A,81.A,83.A,








84.A,85.A,








87.A,90.A



GSTP1 Glutathione
56-71
FQDGDLTLYQSNTILR
2
2A2R
61.B,63.B,
1, 14


S-transferase P

(SEQ ID NO: 51)


64.B



H2AFZ Histone
47-75
VGATAAVYSAAILEY
3
5FUG
48.A,49.A,
1, 2, 3, 4, 5,


H2A.Z

LTAEVLELAGNASK


52.A,53.A,
7, 8, 10, 11,




(SEQ ID NO: 828)


71.A,72.A,73.A,
18, 27







74.A,49.D,








67.D,70.D,








71.D,46.G,








54.G,57.G,








59.G,60.G,








63.G,71.G,








72.G,73.G,








74.G,48.J,








49.J,52.J



HADH
250-271
LGAGYPMGPFELLDY
2 13
3HAD
238.A,239.A,
1, 2, 3


Hydroxyacyl-

VGLDTTK (SEQ ID


240.A,242.A,



coenzyme A

NO: 829)


243.A,252.A,



dehydrogenase,




256.A,



mitochondria




257.A,239.B,








240.B,242.B,








243.B,245.B,








252.B,








253.B,256.B,








257.B



HARS Histidine--
170-193
EFYQCDFDIAGNFDP
4 14 15
4PHC
171.A,173.A,
1, 2, 4, 5, 32,


tRNA ligase,

MIPDAECLK(SEQ ID


171.B,172.B,
43


cytoplasmic

NO: 830)


173.B,171.C,








172.C,








173.C,177.C,








180.C,181.C,








182.C,184.C,








185.C,








188.C,170.D,








171.D,172.D



HBA2 Hemoglobin
18-32
VGAHAGEYGAEALE
4
4X0L
27.A,31.A
6


subunit alpha

R








(SEQ ID NO: 52)






HBA2 Hemoglobin
94-100
VDPVNFK
4
4X0L
96.A
2


subunit alpha

(SEQ ID NO: 53)






HEXA Beta-
489-499
LTSDLTFAYER
9
2GJX
497.E,497.H,
47, 70


hexosaminidase

(SEQ ID NO: 54)


498.H



subunit alpha








HLA-A HLA class
46-59
FIAVGYVDDTQFVR
14
5EU3
23.A,30.A,
1, 5


I histocompatibility 

(SEQ ID NO: 831)


31.A,32.A



antigen, A-2 alpha








HMOX2 Heme
48-55
AENTQFVK
2 3 4 6
4WMH
52.A,54.A
1


oxygenase 2

(SEQ ID NO: 55)
8 14 15





HMOX2 Heme
69-87
LATTALYFTYSALEEE
14
4WMH
73.A,74.A,
1, 5


oxygenase 2

MER


76.A,77.A,





(SEQ ID NO: 56)


79.A,80.A



HNRNPA1
56-75
GFGFVTYATVEEVDA
3
2UP1
No Overlap
-


Heterogeneous

AMNAR (SEQ ID NO:






nuclear

832)






ribonucleoprotein








Al








HNRNPA1
16-31
LFIGGLSFETTDESLR
23 14
2UP1
27.A
2


Heterogeneous

(SEQ ID NO: 833)






nuclear








ribonucleoprotein








Al








HNRNPA1
131-140
IEVIEIMTDR (SEQ ID
3 8 9
2UP1
No Overlap
-


Heterogeneous

NO: 834)






nuclear








ribonucleoprotein








Al








HNRNPK
423-433
IDEPLEGSEDR (SEQ
4
1ZZK
No Overlap
-


Heterogeneous

ID NO: 835)






nuclear








ribonucleoprotein K








HNRNPK
397-405
DLAGSIIGK (SEQ ID
3 4
1ZZK
No Overlap
-


Heterogeneous

NO: 836)






nuclear








ribonucleoprotein K








HNRNPK
415-422
HESGASIK (SEQ ID
3 4 13
1ZZK
42.A
2


Heterogeneous

NO: 837)






nuclear








ribonucleoprotein K








HNRNPK
434-456
IITITGTQDQIQNAQYL
2 3 4 8
1ZZK
75.A,76.A,
1, 2


Heterogeneous

LQNSVK (SEQ ID NO:
9 13 14

78.A,79.A,



nuclear

838)


80.A



ribonucleoprotein K








HNRNPL
399-411
VFNVFCLYGNVEK
2
3TO8
405.A,406.A
2


Heterogeneous

(SEQ ID NO: 839)






nuclear








ribonucleoprotein L








HSD17B10 3-

GLVAVITGGASGLGL






hydroxyacyl-CoA
10-29
ATAER (SEQ ID NO:
2 3
2O23
20.A,29.A,
1, 2, 16


dehydrogenase

840)


20.B,29.B



type-2








HSD17B4
169-183
LGLLGLANSLAIEGR
3
1ZBQ
175.A,176.A,
10, 12, 15,


Peroxisomal

(SEQ ID NO: 57)


179.A,180.A,
22, 44


multifunctional




183.A,169.B,



enzyme type 2




175.B,








176.B,169.C,








172.C,176.C,








179.C,180.C,








169.D,








172.D,179.D,








180.D,183.D,








179.F,180.F



HSP90AB1 Heat
360-378
VFIMDSCDELIPEYLN
13 14
3PRY
361.A,362.A,
1,2, 3, 4, 5,


shock protein HSP

FIR


363.A,364.A,
7, 12, 30


90-beta

(SEQ ID NO: 58)


365.A,366.A,








367.A,








368.A,370.A,








371.A,375.A,








365.B,366.B,








367.B,








362.C,365.C,








366.C,367.C,








370.C,371.C,








372.C,








373.C,375.C



HSP90AB1 Heat
507-526
GFEVVYMTEPIDEYC
13 14
3PRY
508.A,512.A,
1, 3, 4, 5, 6,


shock protein HSP

VQQLK


513.A,514.A,
7, 11, 13, 18,


90-beta

(SEQ ID NO: 59)


515.A,516.A,
20, 21, 25,







517.A,
28, 33







518.A,519.A,








520.A,523.A,








525.A,








514.B,516.B,








518.B,525.B,








508.C,512.C,








513.C,514.C,








515.C,








516.C,517.C,








518.C,519.C,








520.C,524.C



HSP90B1
117-135
LISLTDENALSGNEEL
9
4NH9
No Overlap
-


Endoplasmin

TVK








(SEQ ID NO: 60)






HSP90B1
271-285
YSQFINFPIYVWSSK
6
4NH9
No Overlap
-


Endoplasmin

(SEQ ID NO: 61)






HSPA1B Heat
424-447
QTQIFTTYSDNQPGVL
3 13
4WV5
429.A,431.A,
1, 3, 7, 10, 11


shock 70 kDa

IQVYEGER (SEQ ID


432.A,433.A,



protein 1A/1B

NO: 841)


434.A,435.A,








436.A,








439.A,444.A,








436.B,439.B,








445.B,446.B,








447.B



HSPA5 78 kDa
602-617
IEWLESHQDADIEDFK
6
5E85
602.A,605.A,
6


glucose-regulated

(SEQ ID NO: 842)


606.A,609.A



protein








HSPA5 78 kDa
475-492
DNHLLGTFDLTGIPPA
6
5E85
490.A,491.A,
6, 7


glucose-regulated

PR (SEQ ID NO: 843)


492.A



protein








HSPA8 Heat shock
113-126
SFYPEEVSSMVLIK
13 14
3LDQ
115.A,116.A,
15


cognate 71 kDa

(SEQ ID NO: 62)


117.A



protein








HSPA9 Stress-70
266-284
STNGDTFLGGEDFDQ
8 13
4KBO
268.A,269.A,
1, 3


protein,

ALLR (SEQ ID NO:


270.A,271.A,



mitochondrial

844)


279.A,283.A



HSPD1 60 Da heat
206-218
TLNDELEIIEGMK
3 13
4PJ1
183.A,184.A,
1, 2, 3, 4, 7,


shock protein,

(SEQ ID NO: 845)


188.A,190.A,
14, 17, 21,


mitochondrial




194.A,183.B,
27, 29, 34,







193.B,
36, 37, 44,







183.C,186.C,
46, 47, 48,







188.C,190.C,
49, 53, 54,







194.C,183.D,
55, 57, 58,







184.D,
60, 63, 64,







188.D,190.D,
67, 72, 73,







191.D,193.D,
82, 84, 95,







183.E,184.E,
100, 103,







190.E,
117, 118,







192.E,193.E,
119, 129,







183.F,184.F,
131, 135,







188.F,190.F,
154, 160,







191.F,192.F,
204, 244,







193.F,183.G,
269, 277,







184.G,
281, 310,







189.G,191.G,
369, 371,







193.G,183.H,
381, 382,







184.H,190.H,
385, 472







194.H,








183.I,191.I,








193.I,183.J,








192.J,193.J,








183.K,184.K,








192.K,193.K,








182.L,183.L,








184.L,








185.L,186.L,








187.L,188.L,








190.L,193.L,








194.L,183.M,








184.M,








190.M,193.M,








183.N,184.N,








188.N,189.N,








190.N,








191.N,192.N,








193.N,194.N



HSPD1 60 kDa heat
222-233
GYISPYFINTSK (SEQ
13
4PJ1
199.A,200.



shock protein,

ID NO: 846)


A,201.A,202.A,



mitochondrial




203.A,205.A,








208.A,
18, 19, 22,







200.B,201.B,
25, 28, 30,







202.B,203.B,
38, 39, 43,







204.B,205.B,
61, 98, 102,







206.B,
123, 124,







207.B,208.B,
152, 179,







209.B,200.D,
184, 188,







201.D,202.D,
201, 202,







203.D,
209, 222,







208.D,199.E,
229, 282,







202.E,203.E,
283, 314,







205.E,206.E,
332, 340,







208.E,209.E,
388, 409,







199.F,
429, 460,







200.F,201.F,
468, 471, 482







202.F,203.F,








206.F,198.G,








199.G,200.G,








201.G,202.G,








205.G,








206.G,200.H,








201.H,202.H,








203.H,206.H,








207.H,








199.I,201.I,








202.I,203.I,








205.I,199.J,








200.I,201.J,








202.J,203.J,








200.K,201.K,








202.K,203.K,








206.K,207K,








198.L,








199.L,200.L,








202.L,203.L,








205.L,199.M,








200.M,201.M,








202.M,








203.M,205.M,








208.M



HSPD1 60 kDa heat
251-268
ISSIQSIVPALEIANAH
3 13
4PJ1
230.B,231.B,



shock protein,

R (SEQ ID NO: 847)


234.B,235.B,



mitochondrial




238.B,243.B,








229.C,








230.C,231.C,








234.C,235.C,








238.C,243.C,








244.C,








237.E,240.E,
18, 66, 83,







241.E,244.E,
96, 147, 176,







231.F,234.F,
179, 183,







235.F,238.F,
216, 217,







244.F,228.G,
305, 362,







238.G,
374, 377,







240.G,241.G,
380, 391,







242.G,244.G,
439, 473, 482







237.H,240.H,








241.H,








244.H,237.K,








238.K,241.K,








231.M,








240.M,241.M,








244.M,238.N,








239.N,242.N,








244.N



HSPD1 60 kDa heat
371-387
IQEIIEQLDVTTSEYEK
13
4PJ1
349.A,350.A,
1, 2, 3, 4, 7,


shock protein,

(SEQ ID NO: 848)


353.A,361.A,
14, 17, 24,


mitochondrial




363.A,352.B,
25, 30, 37,







353.B,
43, 46, 49,







355.B,356.B,
52, 53, 54,







357.B,358.B, 
55, 57, 63,







359.B,361.B,
73, 85, 95,







362.B,
98, 100, 102,







363.B,348.C,
111, 118,







350.C,351.C,
128, 134,







352.C,353.C,
150, 154,







354.C,
173, 181,







355.C,361.C,
184, 197,







363.C,348.D,
198, 201,







349.D,350.D,
202, 206,







352.D,
222, 229,







353.D,358.D,
237, 240,







361.D,362.D,
242, 278,







363.D,350.E,
280, 282,







352.E,
283, 297,







353.E,354.E,
330, 332,







355.E,356.E,
353, 371,







357.E,358.E,
389, 449







359.E,361.E,








362.E,363.E,








348.F,








351.F,354.F,








355.F,358.F,








361.F,363.F,








348.G,361.G,








362.G,348.H,








349.H,








350.H,353.H,








355.H,361.H,








363.H,348.I,








349.I,352.I,








353.I,361.I,








349.J,350.J,








351.J,353.J,








354.J,355.J,








361.J,362.J,








363.J,348.K,








349.K,








353.K,354.K,








355.K,361.K,








348.L,349.L,








351.L,








355.L,356.L,








358.L,361.L,








362.L,348.M,








349.M,350.M,








353.M,








354.M,355.M,








361.M,362.M,








363.M



HSPD1 60 kDa heat
494-516
IMQSSSEVGYDAMAG
8 13
4PJ1
479.A,480.A,
5, 6, 8, 9, 10,


shock protein,

DFVNMVEK (SEQ ID


482.A,483.A,
11, 12, 15,


mitochondrial

NO: 849)


479.B,481.B,
16, 33, 106,







482.B,
133, 213,







484.B,479.C,
252, 279,







484.C,486.C,
334, 390,







479.E,480.E,
469, 477







481.E,








482.E,491.E,








492.E,479.F,








479.G,483.G,








479.H,482.H,








483.H,484.H,








491.H,








492.I,484.J,








471.K,472.K,








475.K,484.K,








479.L,481.L,








482.L,








483.L,489.L,








471.M,479.M,








479.N,481.N



HSPD1 60 kDa heat
97-121
LVQDVANNTNEEAG
8 13
4PJ1
80.A,84.A,94.A,
6, 8, 9, 10,


shock protein,

DGTTTATVLAR (SEQ


79.B,84.B,
11, 12, 14,


mitochondrial

ID NO: 850)


91.B,97.B,
15, 16 ,17







80.C,84.C,
20' 21' 31'







87.C,90.C,
37' 41' 46'







94.C,97.C,80.D,
48, 49, 53,







82.D,84.D,
55, 56, 57,







88.D,91.D,
60, 63, 67,







75.E,76.E,
71, 73, 84,







80.E,83.E,94.E,
121, 142,







80.F,84.F,
145, 178,







87.F,88.F,
181, 214







90.F,80.G,82.G,
237, 240,




'


84.G,87.G,
250, 253,







90.G,94.G,
255, 275,







97.G,80.H,
280, 319, 371







82.H,83.H,








85.H,86.H,








89.H,80.I,








83.I,91.I,80.J,








83.J,84.J,86.J,








94.J,79.K,








83.K,84.K,








85.K,86.K,








94.K,97.K,








80.L,84.L,








85.L,86.L,87.L,








94.L,97.L,








80.M,82.M,








83.M,88.M,








90.M,94.M,








97.M,80.N,








82.N,84.N,








87.N,90.N



IDE Insulin-
312-324
NLYVTFPIPDLQK
4
4RAL
316.A,319.A,
3, 4, 7, 39,


degrading enzyme

(SEQ ID NO: 851)


320.A,321.A,
41, 67, 93







322.A,323.A,








324.A,








316.B,319.B,








320.B



IGF2BP1 Insulin-
509-525
TVNELQNLTAAEVVV
3 13
3KRM
525.A,515.B,
1, 2, 12, 13,


like growth factor 2 

PR (SEQ ID NO: 852)


518.B,524.B,
17


mRNA-binding




525.B,524.C,



protein




525.C



IMPDH2 Inosine-5-
110-124
YEQGFITDPVVLSPK
13
1NF7
110.A,111.A,
3, 9, 16, 21,


monophosphate

(SEQ ID NO: 63)


112.A,113.A,
30, 33


dehydrogenase 2




114.A,116.A,








120.A,








122.A,110.B,








111.B,112.B,








119.B,121.B



KPNA2 Importin
203-227
YGAVDPLLALLAVPD
13 14
4WV6
No Overlap



subunit alpha-2

MSSLACGYLR (SEQ








ID NO: 853)






KPNA2 Importin
301-315
LLGASELPIVTPALR
13
4WV6
No Overlap



subunit alpha-2

(SEQ ID NO: 854)






KPNB1 Importin
317-332
GALQYLVPILTQTLTK
13 14
3W5K
318.A,330.A
1,23


subunit beta-1

(SEQ ID NO: 855)






KPNB1 Importin
28-42
AAVENLPTFLVELSR
13 14
3W5K
29.A,30.A,34.A,
28, 33, 39


subunit beta-1

(SEQ ID NO: 856)


3 5.A,36.A,








38.A,39.A,








40.A,42.A



KPNB1 Importin
526-537
SSAYESLMEIVK (SEQ
13 14
3W5K
537.A
3


subunit beta-1

ID NO: 857)






LDHA L-lactate
43-57
DLADELALVDVIEDK
9
4JNK
42.A,43.A,44.A,
1, 2, 3, 4, 6,


dehydrogenase A

(SEQ ID NO: 64)


45.A,46.A,
7, 10, 12, 14,


chain




5 1.A,42.B,
16, 27, 29







45.B,46.B,








51.B,53.B,








56.B,42.C,








45.C,50.C,51.C,








42.D,43.D,








44.D,45.D,








46.D,51.D,








52.D



LDHB L-lactate
234-244
MVVESAYEVIK
4
1I0Z
233.A,238.A,



dehydrogenase B

(SEQ ID NO: 65)


242.A,238.B,
1, 2, 4, 5


chain




239.B



LGMN Legumain
102-118
DYTGEDVTPQNFLAV
9
4N6O
No Overlap





LR








(SEQ ID NO: 66)






LMNA Prelamin-
352-366
MQQQLDEYQELLDIK
6 13
3V5B
No Overlap



A/C

(SEQ ID NO: 96)






LTA4H
366-386
LVVDLTDIDPDVAYS
4 8 13
3U9W
1367.A,1369.A,
1


Leukotriene A-4

SVPYEK


1377.A,



hydrolase

(SEQ ID NO: 67)


1380.A,1383.A



NAGLU Alpha-N-
566-580
QAVQELVSLYYEEAR
9
4XWH
No Overlap



acetylglucosaminidase

(SEQ ID NO: 858)






NAGLU Alpha-N-
594-615
AGGVLAYELLPALDE
13 15
4XWH
No Overlap



acetylglucosaminidase

VLASDSR (SEQ ID








NO: 859)






NAMPT
175-189
YLLETSGNLDGLEYK
3 6 8 13
4LVF
185.A,187.A,
2, 5, 6


Nicotinamide

(SEQ ID NO: 68)
14 15

188.A,189.A,



phosphoribosyltrans 




184.B,185.B,



ferase




189.B



NCBP1 Nuclear
42-65
SACSLESNLEGLAGV
2 3 13
3FEY
No Overlap



cap-binding protein

LEADLPNYK (SEQ ID
14





subunit 1

NO: 860)






NHP2L1 NHP2-
114-125
QQIQSIQQSIER (SEQ
236
3S1V
118.A,119.A,
4, 36, 61


like protein 1

ID NO: 861)


118.J,119J,








120.J,121.J, 








125.J



NONO Non-POU
127-135
VELDNMPLR (SEQ ID
3
3SDE
127.B,131.B
1


domain-containing

NO: 862)






octamer-binding








protein








NONO Non-POU
257-270
FAQPGSFEYEYAMR
6
3SDE
257.B,258.B,
6, 13


domain-containing

(SEQ ID NO: 863)


259.B,260.B,



octamer-binding




265.B,267.B



protein








NONO  Non-POU
296-304
LEMEMEAAR (SEQ ID
6
3SDE
No Overlap



domain-containing

NO: 864)






octamer-binding








protein








NONO Non-POU
154-176
NLPQYVSNELLEEAFS
2 3 6 9
3SDE
154.B,173.B,
2, 5


domain-containing

VFGQVER (SEQ ID
13 14

174.B,175.B



octamer-binding

NO: 865)






protein








NONO Non-POU
177-184
AVVIVDDR (SEQ ID 
6
3SDE
181.B,183.B,
3


domain-containing

NO: 866)


184.B



octamer-binding








protein








NPM1
55-73
DELHIVEAEAMNYEG
13
2P1B
55.A,56.A,57.A,
1, 4, 5, 7, 12,


Nucleophosmin

SPIK


55.B,56.B,
13, 19, 23,




(SEQ ID NO: 69)


66.B,67.B,
25, 27, 32, 38







68.B,72.B,








55.C,56.C,








57.C,64.C,65.C,








67.C,68.C,








55.D,56.D,








57.D,61.D,








73.D,55.E,








56.E,57.E,








61.E,72.E,73.E,








55.F,56.F,








57.F,64.F,








65.F,67.F,68.F,








55.G,56.G,








57.G,61.G,








63 .G,64.G,








65.G,73.G,








55.H,56.H,








57.H,63.H,








64.H,65.H,








55.I,56.I,








57.I,55.J,56.J,








57.J



NPM1
81-101
MSVQPTVSLGGFEITP
13
2P1B
81.A,82.A,83.A,
1, 9, 12, 13,


Nucleophosmin

PVVLR


84.A,86.A,
14, 15, 16




(SEQ ID NO: 70)


87.A,88.A,
17, 25, 29,







89.A,90.A,
30, 31, 32,







95.A,81.B,
33, 36, 37, 38







82.B,83.B,








84.B,86.B,








87.B,88.B,89.B,








90.B,81.C,








82.C,83.C,








84.C,86.0,








87.C,88.C,








89.C,90.0,








92 C 93.C,94.C,








95.C,96.C,








81.D,82.D,








83.D,84.D,








88.D,89.D,








90.D,81.E,








82.E,83.E,








84.E,88.E,89.E,








90.E,93.E,








81.F,82.F,








83.F,84.F,86.F,








87.F,88.F,








89.F,90.F,








94.F,95.F,97.F,








99.F,81.G,








82.G,83.G,








84.G,85.G,








86.G,88.G,








89.G,90.G,








94.G,95.G,








96.G,97.G,








99.G,101.G,








81.H,82.H,








83.H,84.H,








86.H,87.H,








88.H,89.H,








90.H,94.H,








95.H,96.H,








98.H,81.I,








82.I,83.I,84.I,








86.I,87.I,88.I,








89.I,90.I,








95.I,96.I,81.J,








82.J,83.J,84.J,








86.J,87.J,








88.J,89.J,90.J,








95.J,96.J



NTMT1 N-terminal
167-185
DNMAQEGVILDDVDS
13
5E2B
168.A,180.A,
1, 4, 7, 18


Xaa-Pro-Lys N-

SVCR (SEQ ID NO: 


182.A,183.A,



methyltransferase 1

867)


184.A,185.A,








167.B,








168.B,169.B,








170.B,171.B,








178.B,179.B,








182.B,








183.B,184.B,








185.B



OAT Ornithine
332-351
VAIAALEVLEEENLAE
13 14
2OAT
340.A,341.A,
6, 21, 24, 43


aminotransferase,

NADK (SEQ ID NO:


344.A,346.A,



mitochondrial

868)


334.B,338.B,








344.B,








347.B,350.B,








340.C,341.C,








344.C,350.C



P4HB Protein
171-195
QFLQAAEAIDDIPFGIT
9
4JU5
178.A,179.A,
4, 13


disulfide-isomerase

SNSDVFSK (SEQ ID


181.A,181.B





NO: 869)






P4HB Protein
231-247
HNQLPLVIEFTEQTAP
2 13 14
4JU5
231.A,232.A,
2, 3, 4, 5, 7,


disulfide-isomerase

K (SEQ ID NO: 870)


233.A,234.A,
11, 13, 15







235.A,236.A,








242.A,








244.A,245.A,








246.A,231.B,








233.B,234.B,








235.B,








238.B,239.B,








240.B,241.B,








244.B,245.B,








247.B



PABPC1
114-129
ALYDTFSAFGNILSCK
14
1CVJ
116.A,127.A,
1, 2, 9, 12,


Polyadenylate-

(SEQ ID NO: 871)


128.A,116.B,
13, 16, 27


binding protein 1




126.B,127.B,








128.B,








129.B,116.C,








125.C,126.C,








127.C,128.C,








116.D,








116.E,117.E,








125.E,127.E,








128.E,116.G,








126.G,128.G



PABPC1
51-67
SLGYAYVNFQQPADA
314
1CVJ
51.A,52.A,58.A,
2, 3, 4, 5, 7,


Polyadenylate-

ER (SEQ ID NO: 872)


60.A,58.B,
11, 14, 22,


binding protein 1




61.C,64.C,
30, 32, 33,







67.C,51.E,
43, 51, 52, 53







52.E,56.E,64.E,








66.E,67.E,








52.F,60.F,








60.G,67.G,51.H,








56.H,60.H



PARP1 Poly
762-779
VEMLDNLLDIEVAYS
3 6
4ZZZ
762.A,763.A,
1 2 29




LLR (SEQ ID NO: 873)


766.A,767.A,








769.A,773.A,








763.B,








766.B,769.B



PARP1 Poly
954-
TTPDPSANISLDGVDV
3
4ZZZ
962.A,964.A,
3, 4, 6, 8, 9,



1000
PLGTGISSGVNDTSLL


965.A,967.A,
13, 16, 22,




YNEYIVYDIAQVNLK


980.A,981.A,
24, 25, 26




(SEQ ID NO: 874)


983.A,








985.A,988.A,








992.A,993.A,








996.A,1000.A,








955B,








961.B,968.B,








970.B,981.B,








982.B,








983.B,985.B,








986.B,988.B,








989.B,992.B,








993.B,








996.B



PCMT1 Protein-L-
179-197
LILPVGPAGGNQMLE
2 3 14
1I1N
183.A,185.A
5


isoaspartate(D-

QYDK






aspartate) O-

(SEQ ID NO: 71)






methyltransf








PCNA Proliferating
118-138
LMDLDVEQLGIPEQE
14
5E0V
121.A,122.A,
1, 2, 11


cell nuclear antigen

YSCVVK (SEQ ID NO:


123.A,122.B,





875)


123.B,124.B,








125.B,








126.B



PDCD4
246-256
DLPELALDTPR (SEQ
13
3EIJ
256.A
1


Programmed cell

ID NO: 876)






death protein 4








PDHB Pyruvate
53-68
VFLLGEEVAQYDGAY
2 3 13
3EXE
31.B,32.B,37.B,
1,2, 3, 12,


dehydrogenase El

K
14

23.D,28.D,
13, 14, 18,


component subunit

(SEQ ID NO: 72)


33.D,36.D,
19, 21, 29


beta,




37.D,28.F,








31.F,32.F,33.F,








36.F,37.F,








24.H,28.H,








31.H,32.H,








36.H,37.H



PGK1
333-350
QIVWNGPVGVFEWE
3
2WZB
No Overlap 



Phosphoglycerate

AFAR






kinase 1

(SEQ ID NO: 73)






PGRMC1
106-119
FYGPEGPYGVFAGR
2 34 13
4X8Y
108.A,109.A,
12


Membrane-

(SEQ ID NO: 877)
14

110.A



associated








progesterone








receptor componen








PKM Pyruvate
174-186
IYVDDGLISLQVK
2 9
4FXF
177.D,180.D,
1, 2 4, 16,


kinase isozymes

(SEQ ID NO: 74)


175.C,177.C,
47, 62


M1/M2




178.C,179.C,








180.C,








182.C,175.B,








177.B,180.B,








182.B



PKM Pyruvate
401-422
LAPITSDPTEATAVGA
2 9
4FXF
401.A,403.A,
3, 9, 37, 39,


kinase isozymes

VEASFK


418.A,420.A,
41, 53, 68


M1/M2

(SEQ ID NO: 75)


421.A,422.A,
75, 78







401.D,








408.D,409.D,








420.D,421.D,








404.C,407.C,








408.C,








409.C,410.C,








402.B,403.B,








404.B,414.B



POR NADPH--
369-382
TALTYYLDITNPPR
13 14
3QFS
375.A,376.A
2, 5


cytochrome P450

(SEQ ID NO: 76)






reductase








PPP1CA
133-141
IYGFYDECK
2
4XPN
134.C,139.C
3,4


Serine/threonine-

(SEQ ID NO: 77)






protein phosphatase








PP1-alpha cat








PPP1CC
44-60
EIFLSQPILLELEAPLK
14
4UT2
55.A,56.A,47.B,
13, 14


Serine/threonine-

(SEQ ID NO: 79)


48.B,49.B,



protein phosphatase




50.B,51.B



PP1-gamma cat








PPP1CC
133-141
IYGFYDECK
2
4UT2
No
Overlap


Serine/threonine-

(SEQ ID NO: 78)






protein phosphatase





-


PP1-gamma cat








PPT1 Palmitoyl-
75-101
TLMEDVENSFFLNVN
2 4 8 9
3GRO
75.A,76.A,78.A,
1,2, 5, 10


protein thioesterase

SQVTTVCQALAK
13 14

75.B,76.B,



1

(SEQ ID NO: 80)
15

80.B,81.B,








85.B,86.B,








87.B,90.B



PRDX2
120-127
TDEGIAYR
13
1QMV
121.A,122.
3, 5, 6, 7, 8,


Peroxiredoxin-2

(SEQ ID NO: 81)


A,121.B,122.B,
14, 15, 18,







124.B,121.C,
20, 26, 27,







122.C,
29, 30, 32,







124.C,127.C,
36, 42, 43,







120.D,121.D,
54,72, 111







122.D,124.D,








126.D,








127.D,121.E,








122.E,124.E,








127.E,121.F,








122.F,124.F,








127.F,








121.G,122.G,








124.G,120.H,








121.H,122.H,








124.H,








126.H,127.H,








120.I,121.I,








122.I,124.I,








126.I,127.I,








120.J,121.J,








122.J,124.J,








126.J,127.J



PSMA2 Proteasome
144-159
PYLFQSDPSGAYFAW
2
4R3O
144.B,152.B,
1 28, 63 69,


subunit alpha type-2

K (SEQ ID NO: 878)


154.B,155.B,
93







156.B,147.P,








149.P,








154.P,155.P,








157.P



PSMA2 Proteasome
19-39
LVQIEYALAAVAGGA
3
4R3O
25.P,28.P
28


subunit alpha type-2

PSVGIK (SEQ ID NO:








879)






PSMA4 Proteasome
68-91
LNEDMACSVAGITSD
3 6 8 13
4R3O
71.C,80.C,81.C,
1, 69, 93,


subunit alpha type-4

ANVLTNELR (SEQID
14

84.C,85.C,
145, 147




NO: 880)


88.C,70.Q,








81.Q,85.Q,








87.Q,88.Q,








89.Q,91.Q



PSMB1 Proteasome
129-146
FFPYYVYNIIGGLDEE
2 13 14
4R3O
107.M,108.M,
4, 11, 57


subunit beta type-1

GK (SEQID NO: 881)
15

109.M,118.M,








107.1,109.1



PSMB2 Proteasome
96-126
TPYHVNLLLAGYDEH
2 6
4R3O
96.K,97.K,98.K,
97, 145, 180,


subunit beta type-2

EGPALYYMDYLAAL


99.K,101.K,
181, 191




AK (SEQ ID NO: 882)


108.K,110K








111K,








119.K,101.Y,








116.Y,119.Y,








124.Y,125.Y,








126.Y



PSMB2 Proteasome
42-62
ILLLCVGEAGDTVQF
6
4R3O
49.K,52.K,48.Y,
97, 149, 163


subunit beta type-2

AEYIQK (SEQ ID NO:


54.Y,58.Y,





883)


61.Y



PSMB3 Proteasome
100-115
FGPYYTEPVIAGLDPK
3 6 13
4R3O
100.J,106.X,
1, 3, 198


subunit beta type-3

(SEQ ID NO: 884)
14 15

113.X,114.X



PSMB4 Proteasome
61-80
FEGGVVIAADMLGSY
6
4R3O
30.2,35.2
67


subunit beta type-4

GSLAR








(SEQ ID NO: 82)






PSMB5 Proteasome

LLANMVYQYK
3 4 6
4R3O
88.L,91.L
97


subunit beta type-5
141-150
(SEQ ID NO: 83)






PSMB5 Proteasome
226-239
DAYSGGAVNLYHVR
6
4R3O
No Overlap



subunit beta type-5

(SEQ ID NO: 84)






PSMB6 Proteasome
80-118
SGSAADTQAVADAVT
3614
4R3O
48.H,50.H,51.H,
10, 23, 50,


subunit beta type-6

YQLGFHSIELNEPPLV


60.H,61.H,6
67, 152, 155,




HTAASLFK


4.H,65.H,
160, 169,




(SEQ ID NO: 85)


67.H,68.H,
187, 200,







69.H,70.H,
204, 206, 212







71.H,75.H,








77.H,78.H,








84.H,46.V,








48.V,50.V,








51.V,53.V,








62.V,65.V,








70.V,72.V,








75.V,77.V,








78.V,82.V,








84.V



PSPC1 Paraspeckle
229-247
PVIVEPMEQFDDEDG
6 14
3SDE
229.A,231.A,
1, 2, 3, 4, 6


component 1

LPEK (SEQ ID NO:


232.A,233.A,





885)


235.A,237.A,








239.A,








240.A,241.A



PTGR2








Prostaglandin
93-106
GDFVTSFYWPWQTK
14
2ZB4
97.A
2


reductase 2

(SEQ ID NO: 886)






PTGR2
262-278
DVPYPPPLSPAIEAIQK
2 3 14
2ZB4
263.A,265.A,



Prostaglandin

(SEQ ID NO: 887)


267.A,277.A,
1, 4


reductase 2




278.A



RAB7A Ras-related
104-113
DEFLIQASPR
14
1YHN
NoOverlap



protein Rab-7a

(SEQ ID NO: 86)






RARS Arginine--
528-540
GNTAAYLLYAFTR
14
4ZAJ
464.A,467.A,
1


tRNA ligase,

(SEQ ID NO: 888)


468.A



cytoplasmic








RPL30 60S
58-68
SEIEYYAMLAK (SEQ
13
3VI6
No verlap



ribosomal protein

ID NO: 889)






L30








RUVBL1 RuvB-
318-333
ALESSIAPIVIFASNR
2
2XSZ
229.A,231.A,
2, 3, 4, 5, 12,


like 1

(SEQ ID NO: 890)


232.A,233.A,
20, 37, 39,







234.A,235.A,
45, 47, 48,







236.A,
52, 53, 55,







237.A,243.A,
58, 69







229.B,231.B,








232.B,233.B,








236.B,








229.C,230.C,








231.C,232.C,








233.C,234.C,








235.C,








236.C,237.C,








238.C,243.C,








244.0



RUVBL1 RuvB-
91-107
VPFCPMVGSEVYSTEI
2
2XSZ
105.A,106.A,
1, 2, 3, 7, 8,


like 1

K (SEQ ID NO: 891)


107.A,108.A,
13, 33







115.A,116.A,








117.A,








118.A,119.A,








120.A,121.A,








105.B,106.B,








107.B,








108.B,120.B,








105.C,106.C,








107.C,113.C,








120.0



RUVBL2 RuvB-like 2
315-330
ALESDMAPVLIMATN
14
3UK6
316.A,317.A,
1, 3, 4, 5, 6,




R


318.A,319.A,
7, 8, 9, 10,




(SEQ ID NO: 87)


320.A,322.A,
11, 12, 13,







323.A,
15, 18, 19,







329.A,315.B,
20, 22, 23,







317.B,318.B,
24, 25, 30,







319.B,320.B,
31, 33, 37,







321.B,
48, 54, 64,







322.B,323.B,
65, 72, 75,







329.B,315.C,
80, 83, 98,







316.C,318.C,
100, 117,







322.C,
121, 147,







315.D,316.D,
154, 155







317.D,318.D,








319.D,321.D,








322.D,








323.D,329.D,








318.E,319.E,








322.E,323.E,








329.E,








315.F,316.F,








317.F,318.F,








319.F,320.F,








322.F,323.F,








329.F,315.G,








320.G,








329.G,330.G,








315.H,318.H,








320.H,322.H,








323.H,








324.H,327.H,








329.H,315.I,








317.I,318.I,








319.I,320.I,








322.I,329.I,








315.J,318.J,








320.J,322.J,








323.J,327.J,








329.J,318.K,








319.K,322.K,








323.K,








325.K,328.K,








318.L,319.L,








320.L,322.L,








323.L,








325.L,329.L



SFPQ
Splicing
NL SPYVSNELLEEAFS
2 3 4 9
4WIK
377.A,380.A,
1, 3, 6, 12


factor, proline- and
377-399
QFGPIER (SEQ ID NO:
13 14

381.A,399.A,



glutamine-rich

892)


377.B



SFPQ Splicing
444-462
PVIVEPLEQLDDEDGL
2 4 14
4WIK
450.A,452.A,
5, 8, 12


factor, proline- and

PEK (SEQ ID NO: 893)


455.A,446.B,



glutamine-rich




447.B,448.B,








449.B,








455.B



SLC25Al2
260-283
YGQVTPLEIDILYQLA
4 14
4P5X
No Overlap



Calcium-binding

DLYNASGR (SEQ ID






mitochondrial

NO: 894)






carrier protein Aral








SLC25A13
261-282
FGQVTPMEVDILFQL
2 3 4 6
4P5W
262.A,263.A,
1,4, 10, 11


Calcium-binding

ADLYEPR (SEQ ID
14 15

264.A,265.A,



mitochondrial

NO: 895)


267.A,273.A,



carrier protein Aral




274.A,








276.A,277.A,








280.A,282.A,








261.B,262.B,








263.B,








264.B,265.B,








267.B,268.B,








270.B,273.B,








274.B,








276.B,280.B



SLC25A13
642-653
LAVATFAGIENK (SEQ
3 4 6 8
4P5W
647.A,649.A,
1, 2, 3, 4, 8


Calcium-binding

ID NO: 896)
14 15

650.A,651.A,



mitochondrial




653.A,647.B,



carrier protein Aral




648.B,








649.B,650.B,








651.B



SMYD3 SET and
255-265
DQYCFECDCFR
9
5HQ8
255.A,256.A,
1, 2, 5, 6, 8,


MYND domain-

(SEQ ID NO: 88)


258.A,259.A,
23


containing protein 3




260.A,264.A,








255.B,








256.B,258.B,








259.B



SPTBN1 Spectrin
1706-
EVDDLEQWIAER
13
3EDV
No Overlap



beta chain, non-
1717
(SEQ ID NO: 897)






erythrocytic 1








STAG2 Cohesin
273-290
ELQENQDEIENMMNA
13
4PK7
No Overlap



subunit SA-2

JFK (SEQ ID NO: 898)






TIMM10
6-24
AQQLAAELEVEMMA
8 9 13
2BSK
15.D,19.D,20.D,
3, 11


Mitochondrial

DMYNR (SEQ ID NO:
14

22.D,23.D,



import inner

899)


24.D



membrane








translocase su








TIMM44
428-439
DQDELNPYAAWR
13
2CW9
434.A,435.A,
1


Mitochondrial

(SEQ ID NO: 900)


438.A



import inner








membrane








translocase su








TNPO1
273-298
TQDQDENVALEACEF
9
4OO6
265.A,266.A,
3


Transportin-1

WLTLAEQPICK (SEQ


267.A





ID NO: 901)






TNPO1
45-64
LEQLNQYPDFNNYLIF
2 13 14
4OO6
37.A,38.A,40.A,
11, 16, 18


Transportin-1

VLTK (SEQ ID NO:


43.A,45.A,





902)


46.A



TPP1 Tripeptidyl-
521-558
GCHESCLDEEVEGQG
4 9 13
3EDY
522.A,524.
4, 8, 13


peptidase 1

FCSGPGWDPVTGWG
14 15

A,531.A,53





TPNFPALLK


2.A,534.A,5





(SEQ ID NO: 89)


35.A,537.A,








540.A,541.A,








543.A,548.A



TSN Translin
205-215
VEEVVYDLSIR (SEQ
2
3PJA
206.A,207.A,
1, 2, 3, 4, 6,




ID NO: 903)


209.A,210.A,
11, 15,49,







211.A,215.A,
54, 65, 68,







207.B,
94, 101, 116







211.B,213.B,








215.B,207.C,








209.C,213.C,








214.C,








215.C,206.D,








207.D,209.D,








210.D,211.D,








213.D,








214.D,215.D,








207.E,211.E,








214.E,215.E,








207.F,








208.F,211.F,








215.F,207.G,








210.G,211.G,








214.G,215.G,








207.H,








211.H,215.H,








207.I,209.I,








211.I,213.I,








214.I,215.I



TXNDC17
4-17
YEEVSVSGFEEFHR
14
1WOU
No Overlap



Thioredoxin

(SEQ ID NO: 90)






domain-containing








protein 17








VDAC1 Voltage-
140-161
GALVLGYEGWLAGY
2 4 6 13
2JK4
144.A,146.A,
1 2 6


dependent anion-

QMNEETAK (SEQ ID
14

149.A,152.A,



selective channel

NO: 904)


153.A,155.A,



protein




157.A



VDAC1 Voltage-
121-139
EHINLGCDMDFDIAG
2 4 8 13
2JK4
126.A,127.A,
1, 6


dependent anion-

PSIR (SEQ ID NO: 905)
14

131.A,142.A



selective channel








protein








VDAC1 Voltage-
75-93
WNTDNTLGTEITVED
2 3 4 6
2JK4
84.A,85.A,86.A,
5


dependent anion-

QLAR (SEQ ID
8 9 13

87.A



selective channel

NO: 906)
14 15





protein








VDAC1 Voltage-
164-174
VTQSNFAVGYK (SEQ
4 6 8 14
2JK4
173.A,174.A,
1, 2


dependent anion-

ID NO: 907)


175.A,176.A,



selective channel




177.A



protein








VDAC1 Voltage-
64-74
WTEYGLTFTEK (SEQ
2 3 4 6
2JK4
No Overlap



dependent anion-

ID NO: 908)
8 9 13





selective channel


14 15





protein








VDAC1 Voltage-
35-53
SENGLEFTSSGSANTE
4 8 9
2JK4
45.A,47.A
7


dependent anion-

TTK (SEQ ID NO: 909)






selective channel








protein








VDAC1 Voltage- 
175-197
TDEFQLHTNVNDGTE
4 8 14
2JK4
180.A,181.A,
1, 2, 4


dependent anion-

FGGSIYQK(SEQ ID


184.A,185.A,



selective channel

NO: 910)


186.A,196.A,



protein




198.A



VDAC1 Voltage-
225-236
YQIDPDACFSAK (SEQ
4 8
2JK4
229.A
4


dependent anion-

ID NO: 911)






selective channel








protein








VIM Vimentin
176-184
DNLAEDIMR (SEQ ID
6
4YPC
No Overlap





NO: 912)






VIM Vimentin
197-207
EEAENTLQSFR (SEQ 
2 3 6 9
4YPC
No Overlap





ID NO: 913)
13 14








15





VIM Vimentin
189-196
LQEEMLQR (SEQ ID
3 6
4YPC
No Overlap





NO: 914)






VIM Vimentin
224-235
VESLQEEIAFLK (SEQ
4 6 14
4YPC
No Overlap





ID NO: 915)






VPS33A Vacuolar
233-262
NVDLLTPLATQLTYE
14
4BX9
No Overlap



protein sorting-

GLIDEIYGIQNSYVK






associated protein

(SEQ ID NO: 916)






33A








XRCC6 X-ray
475-488
SDSFENPVLQQHFR
2 3 4 8
1JEY
476.A,486.A,
1, 25


repair cross-

(SEQ ID NO: 917)
13

488.A



complementing








protein 6








XRCC6 X-ray
489-510
NLEALALDLMEPEQA
2 3 4 8
1JEY
491.A,497.
14, 25


repair cross-

VDLTLPK (SEQ ID
13

A,508.A,50



complementing

NO: 176)


9.A



protein 6








YWHAE 14-3-3
197-215
AAFDDAIAELDTLSEE
13
3UBW
212.A
1


protein epsilon

SYK








(SEQ ID NO: 92)






YWHAE 14-3-3
143-153
EAAENSLVAYK
13
3UBW
No Overlap



protein epsilon

(SEQ ID NO: 91)






YWHAQ 14-3-3
194-212
TAFDEAIAELDTLNED
14
5IQP
196.A,197.
2, 6, 10


protein theta

SYK


A,196.B,197.B,





(SEQ ID NO: 93)


209.B,210.B



YWHAZ 14-3-3
194-212
TAFDEAIAELDTLSEE
13 14
5D2D
196.A,197.A,



protein zeta/delta

SYK


200.A,203.A,





(SEQ ID NO: 94)


211.A,194.B,
1,6, 13, 19







198.B,








211.B,212.B
























TABLE 3











Annotated
Estimated
SEQ


Accension

Labeled



Functional
Distance
ID


#
Protein Name
Peptide
Peptide Sequence
Probes
PDB
Site
from Site
NO:























P24666
ACP1 Low
42-59
VDSAATSGYEIG
13
3N81
ACT_SITE
2.995
1



molecular

NPPDYR


13 13, 





weight




ACT_SITE





phosphotyrosine 




19 19, 





protein phosp




ACT_SITE










130 130







Q8NI60
ADCK3
277-295
LGQMLSIQDDAFI
14
4PED
NP_BIND
2.639
2



Chaperone

NPHLAK


336 344, 





activity of 




ACT_SITE





bc1 




488 488, 





complex-like, 




BINDING





mitochondr




358 358







P55263
ADK
209-224
IFTLNLSAPFISQF
2
4O1L
ACT_SITE
5.239
3



Adenosine

YK


317 317, 





kinase




METAL 49










49, METAL










147 147, 










METAL 148










148







P30520
ADSS
431-441
FIEDELQIPVK
14
2V40
NP_BIND 39
6.392
4



Adenylosuccin




45, 





ate synthetase




NP_BIND 67





isozyme 2




69, 










NP_BIND










362 364, 










NP_BIND










444 447, 










ACT_SITE










40 40, 










ACT_SITE










68 68, 










METAL 40










40, METAL










67 67, 










BINDING 40










40, 










BINDING










162 162, 










BINDING










176 176, 










BINDING










255 255, 










BINDING










270 270, 










BINDING










334 334, 










BINDING










336 336







O95831
AIFM1
475-510
PYWHQSMFWSD
3 2
4LII
NP_BIND
0
5



Apoptosis-

LGPDVGYEAIGL
4 6

138 142, 





inducing 

VDSSLPTVGVFA


NP_BIND





factor 1, 

K


164 165, 





mitochondrial




NP_BIND










454 455, 










BINDING










172 172, 










BINDING










177 177, 










BINDING










233 233, 










BINDING










285 285, 










BINDING










438 438, 










BINDING










483 483







P49419
ALDH7A1
139-162
ILVEGVGEVQEY
13 8
4ZUL
NP_BIND
4.14
6



Alpha-

VDICDYAVGLSR


274 279, 





aminoadipic




ACT_SITE





semialdehyde




296 296, 





dehydrogenase




ACT_SITE










330 330, 










SITE 195










195







P18085
ARF4 ADP-
39-59
LGEIVTTIPTIGFN
13 3
1Z6X
NP_BIND 24
2.742
7



ribosylation




31, 





factor 4




NP_BIND 67







VETVEYK
2 8

71, 










NP_BIND










126 129







P84085
ARF5 ADP-
39-59
LGEIVTTIPTIGFN
13 3
2B6H
NP_BIND 24
2.639
8



ribosylation

VETVEYK
4 2 8

31, 





factor 5




NP_BIND 67










71, 










NP_BIND










126 129







P40616
ARL1 ADP-
163-178
GTGLDEAMEWL
14 13
4DCN
NP_BIND 24
3.491
9



ribosylation

VETLK


31, 





factor-like




NP_BIND 45





protein 1




48, 










NP_BIND 67










71, 










NP_BIND










126 129, 










NP_BIND










160 161, 










METAL 31










31, METAL










48 48, 










BINDING 70










70







P40616
ARL1 ADP-
37-59
LQVGEVVTTIPTI
13
4DCN
NP_BIND 24
0
10



ribosylation

GFNVETVTYK


31, 





factor-like




NP_BIND 45










48, 










NP_BIND 67










71, 










NP_BIND










126 129, 










NP_BIND










protein 1










160 161, 










METAL 31










31, METAL










48 48, 










BINDING 70










70







P31939
ATIC
178-194
AFTHTAQYDEAI
13
1PKX
NP_BIND 12
2.81
11



Bifunctional

SDYFR


14, 





purine




NP_BIND 34





biosynthesis




NP_BIND 64





protein PURH




67, 










NP_BIND










101 104, 










NP_BIND










125 127, 










ACT_SITE










137 137, 










ACT_SITE










267 267, 










BINDING










316 316, 










BINDING










339 339, 










BINDING










431 431, 










BINDING










451 451, 










BINDING










541 541, 










BINDING










588 588, 










SITE 266 266





















Q13867
BLMH
111-124
CYFFLSAFVDTA
14
1CB5
ACT_SITE
15.919
12

















Bleomycin

QR


73 73, 





hydrolase




ACT_SITE










372 372, 










ACT_SITE










396 396







Q13867
BLMH
203-218
GEISATQDVMME
13
1CB5
ACT_SITE
19.295
13



Bleomycin

EIFR


73 73, 





hydrolase




ACT_SITE










372 372, 










ACT_SITE










396 396







P27797
CALR
323-351
SGTIFDNFLITND
13 9
3POW
METAL 26
0
14



Calreticulin

EAYAEEFGNETW
6

26, METAL







GVTK


62 62, 










METAL 64










64, METAL










328 328, 










BINDING










109 109, 










BINDING










111 111, 










BINDING










128 128, 










BINDING










135 135, 










BINDING










317 317







P27797
CALR
 99-111
HEQNEDCGGGYV
6
3POW
METAL 26
0
15



Calreticulin

K


26, METAL










62 62, 










METAL 64










64, METAL










328 328, 










BINDING










109 109, 










BINDING










111 111, 










BINDING










128 128, 










BINDING










135 135, 










BINDING










317 317







P07384
CAPN1
175-193
LVFVHSAEGNEF
14
2ARY
ACT_SITE
7.409
16



Calpain-1

WSALLEK


115 115, 





catalytic




ACT_SITE





subunit




272 272, 










ACT_SITE










296 296, 










SITE 15 16, 










SITE 27 28







P12277
CKB Creatine
14-32
FPAEDEFPDLSAH
3
3B6R
NP_BIND
2.797
17



kinase B-type

NNHMAK


128 132, 










NP_BIND










320 325, 










BINDING 72










72, 










BINDING










130 130, 










BINDING










132 132, 










BINDING










191 191, 










BINDING










232 232, 










BINDING










236 236, 










BINDING










285 285, 










BINDING










292 292, 










BINDING










320 320, 










BINDING










335 335







P12277
CKB Creatine
157-172
LAVEALSSLDGD
13
3B6R
NP_BIND
7.719
18



kmase B-type

LAGR


128 132, 










NP_BIND










320 325, 










BINDING 72










72, 










BINDING










130 130, 










BINDING










132 132, 










BINDING










191 191, 










BINDING










232 232, 










BINDING










236 236, 










BINDING










285 285, 










BINDING










292 292, 










BINDING










320 320, 










BINDING










335 335







P12277
CKB Creatine
224-236
TFLVWVNEEDHL
3
3B6R
NP_BIND
0
19



kinase B-type

R


128 132, 










NP_BIND










320 325, 










BINDING 72










72, 










BINDING










130 130, 










BINDING










132 132, 










BINDING










191 191, 










BINDING










232 232, 










BINDING










236 236, 










BINDING










285 285, 










BINDING










292 292, 










BINDING










320 320, 










BINDING










335 335







P12277
CKB Creatine
253-265
FCTGLTQIETLFK
13
3B6R
NP_BIND
3.569
20



kmase B-type




128 132, 










NP_BIND










320 325, 










BINDING 72










72, 










BINDING










130 130, 










BINDING










132 132, 










BINDING










191 191, 










BINDING










232 232, 










BINDING










236 236, 










BINDING










285 285, 










BINDING










292 292, 










BINDING










320 320, 










BINDING










335 335







P12277
CKB Creatine
342-358
LGFSEVELVQMV
3 13
3B6R
NP_BIND
4.632
21



kinase B-type

VDGVK


128 132, 










NP_BIND










320 325, 










BINDING 72










72, 










BINDING










130 130, 










BINDING










132 132, 










BINDING










191 191, 










BINDING










232 232, 










BINDING










236 236, 










BINDING










285 285, 










BINDING










292 292, 










BINDING










320 320, 










BINDING










335 335







P12277
CKB Creatine
367-381
LEQGQAIDDLMP
13
3B6R
NP_BIND
15.156
22



kinase B-type

AQK


128 132, 










NP_BIND










320 325, 










BINDING 72










72, 










BINDING










130 130, 










BINDING










132 132, 










BINDING










191 191, 










BINDING










232 232, 










BINDING










236 236, 










BINDING










285 285, 










BINDING










292 292, 










BINDING










320 320, 










BINDING










335 335







P12532
CKMT1B
257-269
SFLIWVNEEDHT
3
1QK1
NP_BIND
0
2



Creatine kinase

R


161 165, 





U-type, 




NP_BIND





mitochondrial




353 358, 










BINDING










224 224, 










BINDING










269 269, 










BINDING










325 325, 










BINDING










368 368







Q16740
CLPP Putative
215-226
QSLQVIESAMER
6
1TG6
ACT_SITE
3.045
24



ATP-




153 153, 





dependent Clp




ACT_SITE





protease




178 178





proteolytic su












P48729
CSNK1A1
84-106
DYNVLVMDLLG
14
5FQD
NP_BIND 23
2.833
25



Casein kinase I

PSLEDLFNFCSR


31, 





isoform alpha




ACT_SITE










136 136, 










BINDING 46










46







P67870
CSNK2B
112-134
VYCENQPMLPIG
14
4NH1
METAL 109
0
26



Casein kinase

LSDIPGEAMVK


109, METAL





II subunit beta




114 114, 










METAL 137










137, METAL










140 140







P07858
CTSB
315-331
GQDHCGIESEVV
13 4
3K9M
ACT_SITE
6.662
27



Cathepsin B

AGIPR
2 9

108 108, 










ACT_SITE










278 278, 










ACT_SITE










298 298







P07339
CTSD
236-253
DPDAQPGGELML
9
4OD9
ACT_SITE
11.321
28



Cathepsin D

GGTDSK


97 97, 










ACT_SITE










295 295







P07339
CTSD
288-309
EGCEAIVDTGTSL
13 14
4OD9
ACT_SITE
0
29



Cathepsin D

MVGPVDEVR
15 4 

97 97, 








6 9 8

ACT SITE










295 295







P07339
CTSD
314-331
AIGAVPLIQGEY
14 15
4OD9
ACT_SITE
13.281
30



Cathepsin D

MIPCEK
3 2 4

97 97, 








13 6

ACT_SITE








9 8

295 295







P00387
CYB5R3
235-241
LWYTLDR
3
1UMK
NP_BIND
2.96
31



NADH-




132 147, 





cytochrome b5




NP_BIND





reductase 3




171 206







Q16698
DECR1 2,4-
299-315
FDGGEEVLISGEF
6
1W6U
NP_BIND 66
2.779
32



dienoyl-CoA

NDLR


71, 





reductase, 




NP_BIND





mitochondrial




240 243, 










ACT_SITE










199 199, 










BINDING 91










91, 










BINDING 91










91, 










BINDING










117 117, 










BINDING










119 119, 










BINDING










149 149, 










BINDING










157 157, 










BINDING










214 214, 










BINDING










251 251







Q08211
DHX9 ATP-
448-456
ISAVSVAER
3
3LLM
NP_BIND
3.525
33



dependent




411 419





RNA helicase










A












P09622
DLD
450-482
VLGAHILGPGAG
14 4
3RNM
NP_BIND 71
6.842
34



Dihydrolipoyl

EMVNEAALALEY
13

80, 





dehydrogenase, 

GASCEDIAR


NP_BIND





mitochondrial




183 185, 










NP_BIND










220 227, 










NP_BIND










361 364, 










ACT_SITE










487 487, 










BINDING 89










89, 










BINDING










154 154, 










BINDING










243 243, 










BINDING










278 278, 










BINDING










314 314, 










BINDING










355 355







Q13011
ECH1
113-131
MFTAGIDLMDM
6
2VRE
BINDING
3.9
35



Delta(3,5)-

ASDILQPK


174 174, 





Delta(2,4)-




SITE 197





dienoyl-CoA




197, SITE





isomerase, 




205 205





mitoc












Q13011
ECH1
149-158
YQETFNVIER
6
2VRE
BINDING
2.823
36



Delta(3,5)-




174 174, 





Delta(2,4)-




SITE 197





dienoyl-CoA




197, SITE





isomerase, 




205 205





mitoc












Q13011
ECH1
197-211
EVDVGLAADVG
13 14
2VRE
BINDING
0
37



Delta(3,5)-

TLQR
15 3

174 174, 





Delta(2,4)-


4 6 8

SITE 197





dienoyl-CoA




197, SITE





isomerase, 




205 205





mitoc












P60842
EIF4A1
178-190
MFVLDEADEMLS
13
2ZU6
NP_BIND 76
2.797
38



Eukaryotic

R


83





initiation factor










4A-I












P60842
EIF4A1
69-82
GYDVIAQAQSGT
14 13
2ZU
NP_BIND 76
0
39



Eukaryotic

GK
9 6

83





initiation factor










4A-I












Q14240
EIF4A2










Eukaryotic
70-83
GYDVIAQAQSGT
13
3B0R
NP_BIND 77
0
40



initiation factor

GK


84





4A-II












P38117
ETFB Electron
36-51
HSMNPFCEIAVEE
3
2A1T
BINDING 16
5.189
41



transfer

AVR


16





flavoprotein










subunit beta












P22830
FECH
254-272
SEVVILFSAHSLP
4
3HCN
ACT_SITE
3.373
42



Fen-ochelatase, 

MSVVNR


230 230, 





mitochondrial




ACT_SITE










383 383, 










METAL 196










196, METAL










403 403, 










METAL 406










406, METAL










411 411







P06280
GLA Alpha-
241-252
SILDWTSFNQER
9
355Z
ACT_SITE
5.4
43



galactosidase




170 170, 





A




AC_SITE










231 231







P06280
GLA Alpha-
50-67
FMCNLDCQEEPD
9
3S5Z
ACT_SITE
8.622
44



galactosidase

SCISEK


170 170, 





A




ACT_SITE










231 231







P06280
GLA Alpha-
68-82
LFMEMAELMVSE
4
355Z
ACT_SITE
14.579
45



galactosidase

GWK


170 170, 





A




ACT_SITE










231 231







P16278
GLB1 Beta-
286-299
TEAVASSLYDILA
9
3THC
ACT_SITE
7.48
46



galactosidase

R


188 188, 










ACT_SITE










268 268







Q04760
GLO1
160-179
GLAFIQDPDGYW
14 3
3W0T
ACT_SITE
0
47



Lactoyl-

IEILNPNK


173 173, 





glutathione-




METAL 34





lyase




34, METAL










100 100, 










METAL 127










127, METAL










173 173, 










BINDING 34










34, 










BINDING 38










38, 










BINDING










104 104, 










BINDING










123 123, 










BINDING










127 127







P00367
GLUD1
152-162
YSTDVSVDEVK
6
1L1F
NP_BIND
3.908
48



Glutamate




141 143, 





dehydrogenase




ACT_SITE





1, 




183 183, 





mitochondrial




BINDING










147 147, 










BINDING










171 171, 










BINDING










176 176, 










BINDING










252 252, 










BINDING










266 266, 










BINDING










270 270, 










BINDING










319 319, 










BINDING










322 322, 










BINDING










438 438, 










BINDING










444 444, 










BINDING










450 450, 










BINDING










516 516







P00367
GLUD1
481-496
HGGTIPIVPTAEF
6
1L1F
NP_BIND
10.438
49



Glutamate

QDR


141 143, 





dehydrogenase




ACT_SITE





1, 




183 183, 





mitochondrial




BINDING










147 147, 










BINDING










171 171, 










BINDING










176 176, 










BINDING










252 252, 










BINDING










266 266, 










BINDING










270 270, 










BINDING










319 319, 










BINDING










322 322, 










BINDING










438 438, 










BINDING










444 444, 










BINDING










450 450, 










BINDING










516 516







Q9H4A6
GOLPH3
75-90
EGYTSFWNDCISS
14
3KN1
BINDING 81
0
50



Golgi

GLR


81, 





phosphoprotein




BINDING 90





3




90, 










BINDING










171 171, 










BINDING










174 174







P09211
GSTP1
56-71
FQDGDLTLYQSN
2
2A2R
BINDING 8
3.198
51



Glutathione S-

TILR


8, BINDING





transferase P




14 14, 










BINDING 39










39, 










BINDING 45










45







P69905
HBA2
18-32
VGAHAGEYGAE
4
4X0L
METAL 59
3.717
52



Hemoglobin

ALER


59, METAL





subunit alpha




88 88, SITE










12 12, SITE










57 57, SITE










61 61, SITE










91 91, SITE










100 100







P69905
HBA2
 94-100
VDPVNFK
4
4X0L
METAL 59
0
53



Hemoglobin




59, METAL





subunit alpha




88 88, SITE










12 12, SITE










57 57, SITE










61 61, SITE










91 91, SITE










100 100







P06865
HEXA Beta-
489-499
LTSDLTFAYER
9
2GJX
ACT_SITE
28.463
54



hexosaminidase




323 323





subunit alpha












P30519
HMOX2 Heme
48-55
AENTQFVK
15 14
4WMB
METAL 45
3.21
55



oxygenase 2


3 4 2

45








6 8









P30519
HMOX2 Heme
69-87
LATTALYFTYSA
14
4WMH
METAL 45
11.935
56



oxygenase 2

LEEEMER


45







P51659
HSD17B4
169-183
LGLLGLANSLAIE
3
1ZBQ
NP_BIND 13
1.327
57



multifunctional

GR


37, 





enzyme type 2




NP_BIND 75










76, 










NP_BIND










164 168, 










NP_BIND










196 199, 










ACT_SITE










164 164, 










BINDING 21










21, 










BINDING 40










40, 










BINDING 99










99, 










BINDING










151 151, 










BINDING










435 435, 










BINDING










533 533, 










BINDING










563 563, 










BINDING










706 706, 










BINDING










724 724







P08238
HSP90AB1  
360-378
VFIMDSCDELIPE
14 13
3PRY
BINDING 46
12.676
58



Heat shock

YLNFIR


46, 





protein HSP




BINDING 88





90-beta




88, 










BINDING










107 107, 










BINDING










133 133, 










BINDING










392 392







P08238
HSP90AB1 
507-526
GFEVVYMTEPID
13 14
3PRY
BINDING 46
35.151
59



Heat shock

EYCVQQLK


46, 





protein HSP




BINDING 88





90-beta




88, 










BINDING










107 107, 










BINDING










133 133, 










BINDING










392 392







P14625
HSP90B1
117-135
LISLTDENALSGN
9
4NH9
BINDING
3.486
60



Endoplasmin

EELTVK


107 107, 










BINDING










149 149, 










BINDING










162 162, 










BINDING










168 168, 










BINDING










199 199, 










BINDING










448 448







P14625
HSP90B1
271-285
YSQFINFPIYVWS
6
4NH9
BINDING
7.026
61



Endoplasmin

SK


107 107, 










BINDING










149 149, 










BINDING










162 162, 










BINDING










168 168, 










BINDING










199 199, 










BINDING










448 448







P11142
HSPA8 Heat
113-126
SFYPEEVSSMVLT
13 14
3LDQ
NP_BIND 12
4.637
62



shock cognate

K


15, 





71 kDa protein




NP_BIND










202 204, 










NP_BIND










268 275, 










NP_BIND










339 342, 










BINDING 71










71







P12268
IMPDH2
110-124
YEQGFITDPVVLS
13
1NF7
NP_BIND
21.6
63



Inosine-5-

PK


274 276, 





monophosphate




NP_BIND





dehydrogenase




324 326, 





2




ACT_SITE










331 331, 










ACT_SITE










429 429, 










METAL 326










326, METAL










328 328, 










METAL 331










331, METAL










500 500, 










METAL 501










501, METAL










502 502, 










BINDING










329 329, 










BINDING










441 441







P00338
LDHA L- 
43-57
DLADELALVDVI
9
4JNK
NP_BIND 29
0
64



lactate

EDK


57, 





dehydrogenase




ACT_SITE





A chain




193 193, 










BINDING 99










99,










BINDING










106 106, 










BINDING










138 138, 










BINDING










169 169, 










BINDING










248 248







P07195
LDHB L-
234-244
MVVESAYEVEK
4
1I0Z
NP_BIND 31
3.118
65



lactate




53, 





dehydrogenase




ACT_SITE





B chain




194 194, 










BINDING










100 100, 










BINDING










107 107, 










BINDING










139 139, 










BINDING










170 170, 










BINDING










249 249







Q99538
LGMN
102-118
DYTGEDVTPQNF
9
4N6O
ACT_SITE
10.316
66



Legumain

LAVLR


148 148, 










ACT_SITE










189 189, 










SITE 323 324







P09960
LTA4H
366-386
LVVDLTDIDPDV
13 4
3U9W
ACT_SITE
0
67



Leukotriene A-

AYSSVPYEK
8

297 297, 





4 hydrolase




ACT_SITE










384 384, 










METAL 296










296, METAL










300 300, 










METAL 319










319, SITE










376 376, 










SITE 379 379







P43490
NAMPT
175-189
YLLETSGNLDGL
13 14
4LVF
BINDING
9.786
68



Nicotinamide

EYK
15 3

196 196, 





phosphoribosyl


6 8

BINDING





transferase




219 219, 










BINDING










247 247, 










BINDING










311 311, 










BINDING










384 384, 










BINDING










392 392







P06748
NPM1
55-73
DELHIVEAEAMN
13
2P1B
SITE 55 55, 
0
69



Nucleophosmin

YEGSPIK


SITE 80 80, 










SITE 175 176







P06748
NPM1
 81-101
MSVQPTVSLGGF
13
2P1B
SITE 55 55, 
1.327
70



Nucleophosmin

EITPPVVLR


SITE 80 80, 










SITE 175 176







P22061
PCMT1
179-197
LILPVGPAGGNQ
14 3
1I1N
ACT_SITE
8.729
71



Protein-L-

MLEQYDK
2

60 60





isoaspartate(D-










aspartate) O-










methyhransf












P11177
PDHB
53-68
VFLLGEEVAQYD
13 14
3EXE
BINDING 89
2.492
72



Pyruvate

GAYK
3 2

89





dehydrogenase










E1 component










subunit beta, 












P00558
PGK1
333-350
QIVWNGPVGVFE
3
2WZB
NP_BIND
0
73



Phosphoglycerate

WEAFAR


373 376, 





kinase 1




BINDING 39










39, 










BINDING










123 123, 










BINDING










171 171, 










BINDING










220 220, 










BINDING










313 313, 










BINDING










344 344







P14618
PKM Pyruyate
174-186
IYVDDGLISLQVK
2 9
 4FXF
NP_BIND 75
3.318
74



kinase




78, METAL





isozymes




75 75, 





M1/M2




METAL 77










77, METAL










113 113, 










METAL 114










114, METAL










272 272, 










METAL 296










296, 










BINDING 70










70, 










BINDING 73










73, 










BINDING










106 106, 










BINDING










120 120, 










BINDING










207 207, 










BINDING










270 270, 










BINDING










295 295, 










BINDING










296 296, 










BINDING










328 328, 










BINDING










464 464, 










BINDING










482 482, 










BINDING










489 489, 










SITE 270










270, SITE










433 433







P14618
PKM Pyruvate
401-422
LAPITSDPTEATA
2 9
4FXF
NP_BIND 75
9.657
75



kinase

VGAVEASFK


78, METAL





isozymes




75 75, 





M1/M2




METAL 77










77, METAL










113 113, 










METAL 114










114, METAL










272 272, 










METAL 296










296, 










BINDING 70










70, 










BINDING 73










73, 










BINDING










106 106, 










BINDING










120 120, 










BINDING










207 207, 










BINDING










270 270, 










BINDING










295 295, 










BINDING










296 296, 










BINDING










328 328, 










BINDING










464 464, 










BINDING










482 482, 










BINDING










489 489, 










SITE 270










270, SITE










433 433







P16435
POR NADPH--
369-382
TALTYYLDITNPP
13 14
3QFS
NP_BIND 86
3.068
76



cytochrome

R


91, 





P450 reductase




NP_BIND










138 141, 










NP_BIND










173 182, 










NP_BIND










454 457, 










NP_BIND










472 474, 










NP_BIND










488 491, 










NP_BIND










596 597, 










NP_BIND










602 606, 










BINDING










208 208, 










BINDING










298 298, 










BINDING










424 424, 










BINDING










478 478, 










BINDING










535 535, 










BINDING










638 638, 










BINDING










676 676







P62136
PPP1CA
133-141
IYGFYDECK
2
4XPN
ACT_SITE
4.098
77



Serine/




125 125, 





threonine-




METAL 64





protein




64, METAL





phosphatase




66 66, 





PP1-alpha cat




METAL 92










92, METAL










92 92, 










METAL 92










92, METAL










124 124, 










METAL 124










124, METAL










173 173, 










METAL 173










173, METAL










248 248, 










METAL 248










248







P36873
PPP1CC
133-141
IYGFYDECK
2 
4UT2
ACT_SITE
4.153
78



Serine/




125 125, 





threonine-




METAL 64





protein




64, METAL





phosphatase




66 66, 





PP1-gamma




METAL 92





cat




92, METAL










92 92, 










METAL 124










124, METAL










173 173, 










METAL 248










248, SITE










273 273







P36873
PPP1CC
44-60
EIFLSQPILLELEA
14
4UT2
ACT_SITE
10.048
79



Serine/

PLK


125 125, 





threonine-




METAL 64





protein




64, METAL





phosphatase




66 66, 





PP1-gamma




METAL 92





cat




92, METAL










92 92, 










METAL 124










124, METAL










173 173, 










METAL 248










248, SITE










273 273







P50897
PPT1
 75-101
TLMEDVENSFFL
13 14
3GRO
ACT_SITE
9.259
80



Palmitoyl-

NVNSQVTTVCQA
15 4

115 115, 





protein

LAK
2 9 8

ACT_SITE





thioesterase 1




233 233, 










ACT_SITE










289 289







P32119
PRDX2
120-127
TDEGIAYR
13
1QMV
ACT_SITE
2.624
81



Peroxiredoxin-




51 51





2












P28070
PSMB4
61-80
FEGGVVIAADML
6
4R3O
ACT_SITE
11.396
82



Proteasome

GSYGSLAR


46 46





subunit beta










type-4












P28074
PSMB5
141-150
LLANMVYQYK
4 3
4R3O
ACT_SITE
10.794
83



Proteasome


6

60 60, 





subunit beta




BINDING





type-5




108 108







P28074
PSMB5
226-239
DAYSGGAVNLY
6
4R3O
ACT_SITE
2.795
84



Proteasome

HVR


60 60, 





subunit beta




BINDING





type-5




108 108







P28072
PSMB6
 80-118
SGSAADTQAVAD
14 3
4R3O
ACT_SITE
3.784
85



Proteasome

AVTYQLGFHSIEL
6

35 35





subunit beta

NEPPLVHTAASLF








type-6

K










P51149
RAB7A Ras-
104-113
DEFLIQASPR
14
1YHN
NP_BIND 15
8.675
86



related protein




22, 





Rab-7a




NP_BIND 34










40, 










NP_BIND 63










67, 










NP_BIND










125 128, 










NP_BIND










156 157







Q9Y230
RUVBL2
315-330
ALESDMAPVLIM
14
3UK6
NP_BIND 77
3.038
87



RuyB-like 2

ATNR


84







Q9H7B4
SMYD3 SET
255-265
DQYCLECDCFR
9
5HQ8
BINDING
0
88



and MYND




124 124, 





domain-




BINDING





containing




132 132, 





protein 3




BINDING










181 181, 










BINDING










239 239, 










BINDING










259 259







O14773
TPP1
521-558
GCHESCLDEEVE
13 14
3EDY
ACT_SITE
0
89



Tripeptidyl-

GQGFCSGPGWDP
15 4

272 272, 





peptidase 1

VTGWGTPNFPAL
9

ACT_SITE







LK


276 276, 










ACT_SITE










475 475, 










METAL 517










517, METAL










518 518, 










METAL 539










539, METAL










541 541, 










METAL 543










543







Q9BRA2
TXNDC17
42477
YEEVSVSGFEEFH
14
IWOU
ACT_SITE
12.278
90



Thioredoxin

R


43 43, 





domain-




ACT_SITE





containing




46 46, SITE





protein 17




44 44, SITE










45 45







P62258
YWHAE 14-3-
143-153
EAAENSLVAYK
13
3UBW
SITE 57 57, 
2.851
91



3 protein




SITE 130 130





epsilon












P62258
YWHAE 14-3-
197-215
AAFDDAIAELDT
13
3UBW
SITE 57 57, 
14.177
92



3 protein

LSEESYK


SITE 130 130





epsilon












P27348
YWHAQ 14-3-
194-212
TAFDEAIAELDTL
14
5IQP
SITE 56 56, 
14.319
93



3 protein theta

NEDSYK


SITE 127 127







P63104
YWHAZ 14-3-
194-212
TAFDEAIAELDTL
14 13
5D2D
SITE 56 56, 
14.87
94



3 protein

SEESYK


SITE 127 127





zeta/delta












Q9UQ84
EX01
139-160
SQGVDCLVAPYE
13 2
3QEB
METAL 30
0
95



Exonuclease 1

ADAQLAYLNK
6 9 8

30, METAL










78 78, 










METAL 150










150, METAL










152 152, 










METAL 171










171, METAL










173 173, 










METAL 225










225







P02545
LMNA
352-366
MQQQLDEYQELL
13 6
3V5B
SITE 266
28.999
96



Prelamin-A/C

DIK


SITE 330










266, SITE










325 325, 










330, SITE










646 647











Table 4 illustrates exemplary list of proteins identified by a method described herein.
















Accession #
Protein Name









P01023
A2M Alpha-2-macroglobulin



Q9NRG9
AAAS Aladin



Q13685
AAMP Angio-associated migratory cell




protein



P49588
AARS Alanine--tRNA ligase, cytoplasmic



Q5JTZ9
AARS2 Alanine--tRNA ligase,




mitochondrial



Q9NRN7
AASDHPPT L-aminoadipate-




semialdehyde dehydrogenase-phosphop



P08183
ABCB1 Multidrug resistance protein 1



Q9NRK6
ABCB10 ATP-binding cassette sub-family




B member 10, mitoc



O75027
ABCB7 ATP-binding cassette sub-family




B member 7, mitoch



Q9NUT2
ABCB8 ATP-binding cassette sub-family




B member 8, mitoch



P28288
ABCD3 ATP-binding cassette sub-family




D member 3



P61221
ABCE1 ATP-binding cassette sub-family




E member 1



Q8NE71
ABCF1 ATP-binding cassette sub-family F




member 1



Q9UG63
ABCF2 ATP-binding cassette sub-family F




member 2



Q9NUJ1
ABHD10 Abhydrolase domain-containing




protein 10, mitochon



Q8N2K0
ABHD12 Monoacylglycerol lipase




ABHD12



O95870
ABHD16A Abhydrolase domain-




containing protein 16A



P09110
ACAA1 3-ketoacyl-CoA thiolase,




peroxisomal



Q9H845
ACAD9 Acyl-CoA dehydrogenase family




member 9, mitochondr



P11310
ACADM Medium-chain specific acyl-CoA




dehydrogenase, mito



P45954
ACADSB Short/branched chain specific




acyl-CoA dehydrogena



P49748
ACADVL Very long-chain specific acyl-




CoA dehydrogenase, m



P24752
ACAT1 Acetyl-CoA acetyltransferase,




mitochondrial



Q9BWD1
ACAT2 Acetyl-CoA acetyltransferase,




cytosolic



Q9H3P7
ACBD3 Golgi resident protein GCP60



Q9UKV3
ACIN1 Apoptotic chromatin condensation




inducer in the nu



P53396
ACLY ATP-citrate synthase



Q99798
ACO2 Aconitate hydratase, mitochondrial



P49753
ACOT2 Acyl-coenzyme A thioesterase 2,




mitochondrial



O00154
ACOT7 Cytosolic acyl coenzyme A




thioester hydrolase



Q9Y305
ACOT9 Acyl-coenzyme A thioesterase 9,




mitochondrial



Q15067
ACOX1 Peroxisomal acyl-coenzyme A




oxidase 1



P24666
ACP1 Low molecular weight




phosphotyrosine protein phosp



P11117
ACP2 Lysosomal acid phosphatase



Q9NPH0
ACP6 Lysophosphatidic acid phosphatase




type 6



P33121
ACSL1 Long-chain-fatty-acid--CoA ligase 1



O95573
ACSL3 Long-chain-fatty-acid--CoA ligase 3



O60488
ACSL4 Long-chain-fatty-acid--CoA ligase 4



Q53FZ2
ACSM3 Acyl-coenzyme A synthetase




ACSM3, mitochondrial



P68133
ACTA1 Actin, alpha skeletal muscle



P62736
ACTA2 Actin, aortic smooth muscle



P60709
ACTB Actin, cytoplasmic 1



Q562R1
ACTBL2 Beta-actin-like protein 2



P68032
ACTC1 Actin, alpha cardiac muscle 1



P63261
ACTG1 Actin, cytoplasmic 2



O96019
ACTL6A Actin-like protein 6A



P12814
ACTN1 Alpha-actinin-1



Q08043
ACTN3 Alpha-actinin-3



O43707
ACTN4 Alpha-actinin-4



P61163
ACTR1A Alpha-centractin



P61160
ACTR2 Actin-related protein 2



P61158
ACTR3 Actin-related protein 3



P55265
ADAR Double-stranded RNA-specific




adenosine deaminase



Q8NI60
ADCK3 Chaperone activity of bc1




complex-like, mitochondr



Q96D53
ADCK4 Uncharacterized aarF domain-




containing protein kin



P35611
ADD1 Alpha-adducin



P55263
ADK Adenosine kinase



Q9BRR6
ADPGK ADP-dependent glucokinase



P30520
ADSS Adenylosuccinate synthetase




isozyme 2



Q9Y4W6
AFG3L2 AFG3-like protein 2



Q53H12
AGK Acylglycerol kinase, mitochondrial



P35573
AGL Glycogen debranching enzyme



Q99943
AGPAT1 1-acyl-sn-glycerol-3-phosphate




acyltransferase alp



Q9NUQ2
AGPAT5 1-acyl-sn-glycerol-3-phosphate




acyltransferase eps



Q86UL3
AGPAT6 Glycerol-3-phosphate




acyltransferase 4



O00116
AGPS Alkyldihydroxyacetonephosphate




synthase, peroxisom



P23526
AHCY Adenosylhomocysteinase



O43865
AHCYL1 Putative




adenosylhomocysteinase 2



Q96HN2
AHCYL2 Putative




adenosylhomocysteinase 3



O95433
AHSA1 Activator of 90 kDa heat shock




protein ATPase homo



O95831
AIFM1 Apoptosis-inducing factor 1,




mitochondrial



Q12904
AIMP1 Aminoacyl tRNA synthase




complex-interacting multif



Q13155
AIMP2 Aminoacyl tRNA synthase




complex-interacting multif



O00170
AIP AH receptor-interacting protein



P54819
AK2 Adenylate kinase 2, mitochondrial



Q92667
AKAP1 A-kinase anchor protein 1,




mitochondrial



Q02952
AKAP12 A-kinase anchor protein 12



O43823
AKAP8 A-kinase anchor protein 8



Q9ULX6
AKAP8L A-kinase anchor protein 8-like



Q04828
AKR1C1 Aldo-keto reductase family 1




member C1



P31751
AKT2 RAC-beta serine/threonine-protein




kinase



P54886
ALDH18A1 Delta-1-pyrroline-5-




carboxylate synthase



P00352
ALDH1A1 Retinal dehydrogenase 1



P30837
ALDH1B1 Aldehyde dehydrogenase X,




mitochondrial



Q3SY69
ALDH1L2 Mitochondrial 10-




formyltetrahydrofolate dehydrogen



P05091
ALDH2 Aldehyde dehydrogenase,




mitochondrial



P51648
ALDH3A2 Fatty aldehyde dehydrogenase



Q02252
ALDH6A1 Methylmalonate-semialdehyde




dehydrogenase [acylati



P49419
ALDH7A1 Alpha-aminoadipic




semialdehyde dehydrogenase



P49189
ALDH9A1 4-trimethylaminobutyraldehyde




dehydrogenase



Q9BT22
ALG1 Chitobiosyldiphosphodolichol beta-




mannosyltransfer



Q9BV10
ALG12 Dol-P-Man:Man(7)GlcNAc(2)-PP-




Dol alpha-1,6-mannosy



Q9Y673
ALG5 Dolichyl-phosphate beta-




glucosyltransferase



Q9Y672
ALG6 Dolichyl pyrophosphate




Man9GlcNAc2 alpha-1,3-gluco



Q86V81
ALYREF THO complex subunit 4



Q9UJX4
ANAPC5 Anaphase-promoting complex




subunit 5



Q9UJX3
ANAPC7 Anaphase-promoting complex




subunit 7



Q86XL3
ANKLE2 Ankyrin repeat and LEM




domain-containing protein 2



Q8IZ07
ANKRD13A Ankyin repeat domain-




containing protein 13A



Q9NW15
ANO10 Anoctamin-10



P39687
ANP32A Acidic leucine-rich nuclear




phosphoprotein 32 fami



Q92688
ANP32B Acidic leucine-rich nuclear




phosphoprotein 32 fami



Q9BTT0
ANP32E Acidic leucine-rich nuclear




phosphoprotein 32 fami



P04083
ANXA1 Annexin A1



P50995
ANXA11 Annexin A11



P07355
ANXA2 Annexin A2



P08758
ANXA5 Annexin A5



P08133
ANXA6 Annexin A6



P20073
ANXA7 Annexin A7



Q10567
AP1B1 AP-1 complex subunit beta-1



P63010
AP2B1 AP-2 complex subunit beta



Q96CW1
AP2M1 AP-2 complex subunit mu



O00203
AP3B1 AP-3 complex subunit beta-1



O14617
AP3D1 AP-3 complex subunit delta-1



Q9Y2T2
AP3M1 AP-3 complex subunit mu-1



P13798
APEH Acylamino-acid-releasing enzyme



Q9BZZ5
API5 Apoptosis inhibitor 5



Q06481
APLP2 Amyloid-like protein 2



Q9HDC9
APMAP Adipocyte plasma membrane-




associated protein



Q8NCW5
APOA1BP NAD(P)H-hydrate epimerase



Q9UH17
APOBEC3B Probable DNA dC-dU-




editing enzyme APOBEC-3B



P02649
APOE Apolipoprotein E



Q9BQE5
APOL2 Apolipoprotein L2



Q9BUR5
APOO Apolipoprotein O



Q6UXV4
APOOL Apolipoprotein O-like



P05067
APP Amyloid beta A4 protein



P07741
APRT Adenine phosphoribosyltransferase



P10398
ARAF Serine/threonine-protein kinase A-




Raf



P48444
ARCN1 Coatomer subunit delta



P84077
ARF1 ADP-ribosylation factor 1



P61204
ARF3 ADP-ribosylation factor 3



P18085
ARF4 ADP-ribosylation factor 4



P84085
ARF5 ADP-ribosylation factor 5



E7EV62
ARFGAP1 ADP-ribosylation factor




GTPase-activating protein



Q8N6H7
ARFGAP2 ADP-ribosylation factor




GTPase-activating protein



P53367
ARFIP1 Arfaptin-1



Q92888
ARHGEF1 Rho guanine nucleotide




exchange factor 1



Q92974
ARHGEF2 Rho guanine nucleotide




exchange factor 2



O14497
ARID1A AT-rich interactive domain-




containing protein 1A



P40616
ARL1 ADP-ribosylation factor-like protein 1



Q8N6S5
ARL6IP6 ADP-ribosylation factor-like




protein 6-interacting



Q9NVJ2
ARL8B ADP-ribosylation factor-like




protein 8B



Q9NVT9
ARMC1 Armadillo repeat-containing




protein 1



Q8N2F6
ARMC10 Armadillo repeat-containing




protein 10



Q9UH62
ARMCX3 Armadillo repeat-containing X-




linked protein 3



Q13510
ASAH1 Acid ceramidase



Q9UBL3
ASH2L Set1/Ash2 histone




methyltransferase complex subuni



O43681
ASNA1 ATPase ASNA1



P08243
ASNS Asparagine synthetase [glutamine-




hydrolyzing]



Q12797
ASPH Aspartyl/asparaginyl beta-




hydroxylase



Q8NBU5
ATAD1 ATPase family AAA domain-




containing protein 1



Q9NVI7
ATAD3A ATPase family AAA domain-




containing protein 3A



Q5T9A4
ATAD3B ATPase family AAA domain-




containing protein 3B



Q5T2N8
ATAD3C ATPase family AAA domain-




containing protein 3C



Q7Z3C6
ATG9A Autophagy-related protein 9A



P31939
ATIC Bifunctional purine biosynthesis




protein PURH



Q8NHH9
ATL2 Atlastin-2



Q6DD88
ATL3 Atlastin-3



Q9HD20
ATP13A1 Probable cation-transporting




ATPase 13A1



P05023
ATP1A1 Sodium/potassium-transporting




ATPase subunit alpha



P13637
ATP1A3 Sodium/potassium-transporting




ATPase subunit alpha



P54709
ATP1B3 Sodium/potassium-transporting




ATPase subunit beta-



P16615
ATP2A2 Sarcoplasmic/endoplasmic




reticulum calcium ATPase



Q93084
ATP2A3 Sarcoplasmic/endoplasmic




reticulum calcium ATPase



P20020
ATP2B1 Plasma membrane calcium-




transporting ATPase 1



P23634
ATP2B4 Plasma membrane calcium-




transporting ATPase 4



P25705
ATP5A1 ATP synthase subunit alpha,




mitochondrial



P06576
ATP5B ATP synthase subunit beta,




mitochondrial



P36542
ATP5C1 ATP synthase subunit gamma,




mitochondrial



P24539
ATP5F1 ATP synthase subunit b,




mitochondrial



O75947
ATP5H ATP synthase subunit d,




mitochondrial



O75964
ATP5L ATP synthase subunit g,




mitochondrial



P48047
ATP5O ATP synthase subunit O,




mitochondrial



Q93050
ATP6V0A1 V-type proton ATPase 116 kDa




subunit a isoform 1



Q9Y487
ATP6V0A2 V-type proton ATPase 116 kDa




subunit a isoform 2



P61421
ATP6V0D1 V-type proton ATPase subunit




d 1



P38606
ATP6V1A V-type proton ATPase catalytic




subunit A



P21281
ATP6V1B2 V-type proton ATPase subunit




B, brain isoform



P36543
ATP6V1E1 V-type proton ATPase subunit




E 1



Q9UBB4
ATXN10 Ataxin-10



Q99700
ATXN2 Ataxin-2



Q8WWM7
ATXN2L Ataxin-2-like protein



Q9Y679
AUP1 Ancient ubiquitous protein 1



O14965
AURKA Aurora kinase A



O43505
B3GNT1 N-acetyllactosaminide beta-1,3-




N-acetylglucosaminy



O95817
BAG3 BAG family molecular chaperone




regulator 3



O95429
BAG4 BAG family molecular chaperone




regulator 4



Q9UL15
BAG5 BAG family molecular chaperone




regulator 5



P46379
BAG6 Large proline-rich protein BAG6



Q9UQB8
BAIAP2 Brain-specific angiogenesis




inhibitor 1-associated



B0UX83
BAT3 HLA-B associated transcript 3



B0UXB6
BAT5 Abhydrolase domain-containing




protein 16A



Q07812
BAX Apoptosis regulator BAX



Q9NRL2
BAZ1A Bromodomain adjacent to zinc




finger domain protein



Q9UIG0
BAZ1B Tyrosine-protein kinase BAZ1B



P51572
BCAP31 B-cell receptor-associated protein




31



O75934
BCAS2 Pre-mRNA-splicing factor SPF27



Q9P287
BCCIP BRCA2 and CDKN1A-interacting




protein



P12694
BCKDHA 2-oxoisovalerate dehydrogenase




subunit alpha, mito



Q9BXK5
BCL2L13 Bcl-2-like protein 13



Q9NYF8
BCLAF1 Bcl-2-associated transcription




factor 1



Q9Y276
BCS1L Mitochondrial chaperone BCS1



P55957
BID BH3-interacting domain death agonist



Q13867
BLMH Bleomycin hydrolase



P53004
BLVRA Biliverdin reductase A



P30043
BLVRB Flavin reductase (NADPH)



Q9NSY1
BMP2K BMP-2-inducible protein kinase



Q14137
BOP1 Ribosome biogenesis protein BOP1



Q6PJG6
BRAT1 BRCA1-associated ATM activator 1



P25440
BRD2 Bromodomain-containing protein 2



Q8WY22
BRI3BP BRI3-binding protein



Q8TDN6
BRIX1 Ribosome biogenesis protein




BRX1 homolog



Q5VW32
BROX BRO1 domain-containing protein




BROX



Q9NW68
BSDC1 BSD domain-containing protein 1



P35613
BSG Basigin



Q06187
BTK Tyrosine-protein kinase BTK



O60566
BUB1B Mitotic checkpoint




serine/threonine-protein kinase



O43684
BUB3 Mitotic checkpoint protein BUB3



Q13895
BYSL Bystin



Q7L1Q6
BZW1 Basic leucine zipper and W2




domain-containing prot



Q69YU5
C12orf73 Uncharacterized protein




C12orf73



Q9Y224
C14orf166 UPF0568 protein C14orf166



Q96GQ5
C16orf58 UPF0420 protein C16orf58



Q9BSF4
C19orf52 Uncharacterized protein




C19orf52



Q4ZIN3
C19orf6 Membralin



E9PFR7
C1orf27 Protein C1orf27



Q07021
C1QBP Complement component 1 Q




subcomponent-binding prot



Q9BQP7
C20orf72 Uncharacterized protein




C20orf72



P30042
C21orf33 ES1 protein homolog,




mitochondrial



Q9H6V9
C2orf43 UPF0554 protein C2orf43



Q8WWC4
C2orf47 Uncharacterized protein C2orf47,




mitochondrial



Q96FZ2
C3orf37 UPF0361 protein C3orf37



Q9H993
C6orf211 UPF0364 protein C6orf211



Q9H7E9
C8orf33 UPF0488 protein C8orf33



Q5T6V5
C9orf64 UPF0553 protein C9orf64



Q9Y376
CAB39 Calcium-binding protein 39



Q9HB71
CACYBP Calcyclin-binding protein



P27708
CAD CAD protein



Q9BY67
CADM1 Cell adhesion molecule 1



P05937
CALB1 Calbindin



P62158
CALM1 Calmodulin



P27797
CALR Calreticulin



O43852
CALU Calumenin



Q14012
CAMK1 Calcium/calmodulin-dependent




protein kinase type 1



Q13557
CAMK2D Calcium/calmodulin-dependent




protein kinase type I



Q13555
CAMK2G Calcium/calmodulin-dependent




protein kinase type I



P27824
CANX Calnexin



P07384
CAPN1 Calpain-1 catalytic subunit



P17655
CAPN2 Calpain-2 catalytic subunit



P04632
CAPNS1 Calpain small subunit 1



Q14444
CAPRIN1 Caprin-1



P47756
CAPZB F-actin-capping protein subunit




beta



Q86X55
CARM1 Histone-arginine




methyltransferase CARM1



P49589
CARS Cysteine--tRNA ligase, cytoplasmic



P20810
CAST Calpastatin



P04040
CAT Catalase



P35520
CBS Cystathionine beta-synthase



Q13185
CBX3 Chromobox protein homolog 3



P45973
CBX5 Chromobox protein homolog 5



Q96G28
CCDC104 Coiled-coil domain-containing




protein 104



O60826
CCDC22 Coiled-coil domain-containing




protein 22



Q96A33
CCDC47 Coiled-coil domain-containing




protein 47



Q96ER9
CCDC51 Coiled-coil domain-containing




protein 51



Q16204
CCDC6 Coiled-coil domain-containing




protein 6



P78371
CCT2 T-complex protein 1 subunit beta



P49368
CCT3 T-complex protein 1 subunit gamma



P50991
CCT4 T-complex protein 1 subunit delta



P48643
CCT5 T-complex protein 1 subunit epsilon



P40227
CCT6A T-complex protein 1 subunit zeta



Q99832
CCT7 T-complex protein 1 subunit eta



P50990
CCT8 T-complex protein 1 subunit theta



O95400
CD2BP2 CD2 antigen cytoplasmic tail-




binding protein 2



P60033
CD81 CD81 antigen



Q9UJX2
CDC23 Cell division cycle protein 23




homolog



P30260
CDC27 Cell division cycle protein 27




homolog



Q16543
CDC37 Hsp90 co-chaperone Cdc37



P60953
CDC42 Cell division control protein 42




homolog



Q99459
CDC5L Cell division cycle 5-like protein



P19022
CDH2 Cadherin-2



O14735
CDIPT CDP-diacylglycerol--inositol 3-




phosphatidyltransfe



P06493
CDK1 Cyclin-dependent kinase 1



P24941
CDK2 Cyclin-dependent kinase 2



P11802
CDK4 Cyclin-dependent kinase 4



Q96JB5
CDK5RAP3 CDK5 regulatory subunit-




associated protein 3



Q00534
CDK6 Cyclin-dependent kinase 6



P50750
CDK9 Cyclin-dependent kinase 9



Q5VV42
CDKAL1 Threonylcarbamoyladenosine




tRNA methylthiotransfer



O95674
CDS2 Phosphatidate cytidylyltransferase 2



Q03701
CEBPZ CCAAT/enhancer-binding protein




zeta



Q9BXW7
CECR5 Cat eye syndrome critical region




protein 5



Q92879
CELF1 CUGBP Elav-like family member 1



Q5SW79
CEP170 Centrosomal protein of 170 kDa



Q9C0F1
CEP44 Centrosomal protein of 44 kDa



Q9Y6K0
CEPT1




Choline/ethanolaminephosphotransferase 1



P27544
CERS1 Ceramide synthase 1



Q96G23
CERS2 Ceramide synthase 2



Q6ZMG9
CERS6 Ceramide synthase 6



Q9NX63
CHCHD3 Coiled-coil-helix-coiled-coil-




helix domain-contain



O14646
CHD1 Chromodomain-helicase-DNA-




binding protein 1



Q14839
CHD4 Chromodomain-helicase-DNA-




binding protein 4



O14757
CHEK1 Serine/threonine-protein kinase




Chk1



Q8IWX8
CHERP Calcium homeostasis endoplasmic




reticulum protein



Q9NZZ3
CHMP5 Charged multiyesicular body




protein 5



Q14011
CIRBP Cold-inducible RNA-binding




protein



Q9NZ45
CISD1 CDGSH iron-sulfur domain-




containing protein 1



Q8N5K1
CISD2 CDGSH iron-sulfur domain-




containing protein 2



Q8WWK9
CKAP2 Cytoskeleton-associated protein 2



Q07065
CKAP4 Cytoskeleton-associated protein 4



P12277
CKB Creatine kinase B-type



P12532
CKMT1B Creatine kinase U-type,




mitochondrial



F5H604
CLASP2 CLIP-associating protein 2



Q96S66
CLCC1 Chloride channel CLIC-like




protein 1



O14967
CLGN Calmegin



O15247
CLIC2 Chloride intracellular channel




protein 2



Q9Y696
CLIC4 Chloride intracellular channel




protein 4



O75503
CLN5 Ceroid-lipofuscinosis neuronal




protein 5



P54105
CLNS1A Methylosome subunit pICIn



Q9H078
CLPB Caseinolytic peptidase B protein




homolog



Q16740
CLPP Putative ATP-dependent Clp




protease proteolytic su



O96005
CLPTM1 Cleft lip and palate




transmembrane protein 1



Q96KA5
CLPTM1L Cleft lip and palate




transmembrane protein 1-like



P30085
CMPK1 UMP-CMP kinase



Q99439
CNN2 Calponin-2



Q15417
CNN3 Calponin-3



A5YKK6
CNOT1 CCR4-NOT transcription complex




subunit 1



Q9NZN8
CNOT2 CCR4-NOT transcription complex




subunit 2



P09543
CNP 2,3-cyclic-nucleotide 3-




phosphodiesterase



Q9BT09
CNPY3 Protein canopy homolog 3



Q9Y2R0
COA3 Cytochrome C oxidase assembly




factor 3 homolog, mi



Q13057
COASY Bifunctional coenzyme A




synthase



P21964
COMT Catechol O-methyltransferase



P53618
COPB1 Coatomer subunit beta



P35606
COPB2 Coatomer subunit beta



014579
COPE Coatomer subunit epsilon



Q9Y678
COPG1 Coatomer subunit gamma-1



Q9UBF2
COPG2 Coatomer subunit gamma-2



P61201
COPS2 COP9 signalosome complex




subunit 2



Q9UNS2
COPS3 COP9 signalosome complex




subunit 3



Q9BT78
COPS4 COP9 signalosome complex




subunit 4



Q92905
COPS5 COP9 signalosome complex




subunit 5



Q7L5N1
COPS6 COP9 signalosome complex




subunit 6



Q5HYK3
COQ5 2-methoxy-6-polyprenyl-1,4-




benzoquinol methylase,



Q9ULV4
CORO1C Coronin-1C



I3L416
CORO7 Coronin



Q9Y6N1
COX11 Cytochrome c oxidase assembly




protein COX11, mitoc



Q7KZN9
COX15 Cytochrome c oxidase assembly




protein COX15 homolo



P13073
COX4I1 Cytochrome c oxidase subunit 4




isoform 1, mitochon



P20674
COX5A Cytochrome c oxidase subunit 5A,




mitochondrial



O75976
CPD Carboxypeptidase D



Q99829
CPNE1 Copine-1



O75131
CPNE3 Copine-3



P36551
CPOX Coproporphyrinogen-III oxidase,




mitochondrial



Q9BRF8
CPPED1 Calcineurin-like phosphoesterase




domain-containing



Q9UKF6
CPSF3 Cleavage and polyadenylation




specificity factor su



Q16630
CPSF6 Cleavage and polyadenylation




specificity factor su



Q8N684
CPSF7 Cleavage and polyadenylation




specificity factor su



P50416
CPT1A Carnitine O-palmitoyltransferase




1, liver isoform



P23786
CPT2 Carnitine O-palmitoyltransferase 2,




mitochondrial



Q9H3G5
CPVL Probable serine carboxypeptidase




CPVL



P46108
CRK Adapter molecule crk



P46109
CRKL Crk-like protein



O75390
CS Citrate synthase, mitochondrial



P16989
CSDA DNA-binding protein A



O75534
CSDE1 Cold shock domain-containing




protein E1



P55060
CSE1L Exportin-2



P41240
CSK Tyrosine-protein kinase CSK



P48729
CSNK1A1 Casein kinase I isoform alpha



P49674
CSNK1E Casein kinase I isoform epsilon



P68400
CSNK2A1 Casein kinase II subunit alpha



P19784
CSNK2A2 Casein kinase II subunit alpha



P67870
CSNK2B Casein kinase II subunit beta



P04080
CSTB Cystatin-B



Q05048
CSTF1 Cleavage stimulation factor subunit 1



P33240
CSTF2 Cleavage stimulation factor subunit 2



Q12996
CSTF3 Cleavage stimulation factor subunit 3



O15320
CTAGE5 Cutaneous T-cell lymphoma-




associated antigen 5



Q13363
CTBP1 C-terminal-binding protein 1



P32929
CTH Cystathionine gamma-lyase



P35221
CTNNA1 Catenin alpha-1



P35222
CTNNB1 Catenin beta-1



O60716
CTNND1 Catenin delta-1



P17812
CTPS1 CTP synthase 1



P10619
CTSA Lysosomal protective protein



P07858
CTSB Cathepsin B



P53634
CTSC Dipeptidyl peptidase 1



P07339
CTSD Cathepsin D



Q14247
CTTN Src substrate cortactin



Q13620
CUL4B Cullin-4B



O60888
CUTA Protein CutA



Q69YN2
CWF19L1 CWF19-like protein 1



Q9BVG4
CXorf26 UPF0368 protein Cxorf26



P00167
CYB5A Cytochrome b5



O43169
CYB5B Cytochrome b5 type B



Q8WUJ1
CYB5D2 Neuferricin



Q9UHQ9
CYB5R1 NADH-cytochrome b5 reductase 1



P00387
CYB5R3 NADH-cytochrome b5 reductase 3



P08574
CYC1 Cytochrome c1, heme protein,




mitochondrial



Q7L576
CYFIP1 Cytoplasmic FMR1-interacting




protein 1



Q6UW02
CYP20A1 Cytochrome P450 20A1



Q16850
CYP51A1 Lanosterol 14-alpha




demethylase



P51398
DAP3 28S ribosomal protein S29,




mitochondrial



P14868
DARS Aspartate--tRNA ligase,




cytoplasmic



Q6PI48
DARS2 Aspartate--tRNA ligase,




mitochondrial



Q96EP5
DAZAP1 DAZ-associated protein 1



Q16643
DBN1 Drebrin



Q9UJU6
DBNL Drebrin-like protein



P61962
DCAF7 DDB1- and CUL4-associated




factor 7



Q8WVC6
DCAKD Dephospho-CoA kinase domain-




containing protein



P81605
DCD Dermcidin



Q14203
DCTN1 Dynactin subunit 1



Q13561
DCTN2 Dynactin subunit 2



Q9UJW0
DCTN4 Dynactin subunit 4



Q9H773
DCTPP1 dCTP pyrophosphatase 1



Q92564
DCUN1D4 DCN1-like protein 4



Q7Z4W1
DCXR L-xylulose reductase



Q16531
DDB1 DNA damage-binding protein 1



P39656
DDOST Dolichyl-




diphosphooligosaccharide--protein glycosy



Q96HY6
DDRGK1 DDRGK domain-containing




protein 1



Q13206
DDX10 Probable ATP-dependent RNA




helicase DDX10



Q92841
DDX17 Probable ATP-dependent RNA




helicase DDX17



Q9NVP1
DDX18 ATP-dependent RNA helicase




DDX18



Q9UHI6
DDX20 Probable ATP-dependent RNA




helicase DDX20



Q9NR30
DDX21 Nucleolar RNA helicase 2



Q9BUQ8
DDX23 Probable ATP-dependent RNA




helicase DDX23



Q9GZR7
DDX24 ATP-dependent RNA helicase




DDX24



O00148
DDX39A ATP-dependent RNA helicase




DDX39A



Q13838
DDX39B Spliceosome RNA helicase




DDX39B



O00571
DDX3X ATP-dependent RNA helicase




DDX3X



Q86XP3
DDX42 ATP-dependent RNA helicase




DDX42



Q7L014
DDX46 Probable ATP-dependent RNA




helicase DDX46



P17844
DDX5 Probable ATP-dependent RNA




helicase DDX5



Q9BQ39
DDX50 ATP-dependent RNA helicase




DDX50



Q8TDD1
DDX54 ATP-dependent RNA helicase




DDX54



P26196
DDX6 Probable ATP-dependent RNA




helicase DDX6



Q16698
DECR1 2,4-dienoyl-CoA reductase,




mitochondrial



O15121
DEGS1 Sphingolipid delta(4)-desaturase




DES1



Q9BUN8
DERL1 Derlin-1



Q9BSY9
DESI2 Desumoylating isopeptidase 2



O00273
DFFA DNA fragmentation factor subunit




alpha



Q96DF8
DGCR14 Protein DGCR14



Q15392
DHCR24 Delta(24)-sterol reductase



P00374
DHFR Dihydrofolate reductase



P49366
DHPS Deoxyhypusine synthase



Q9Y394
DHRS7 Dehydrogenase/reductase SDR




family member 7



Q6IAN0
DHRS7B Dehydrogenase/reductase SDR




family member 7B



O43143
DHX15 Putative pre-mRNA-splicing




factor ATP-dependent RN



Q7Z478
DHX29 ATP-dependent RNA helicase




DHX29



Q7L2E3
DHX30 Putative ATP-dependent RNA




helicase DHX30



Q9H2U1
DHX36 Probable ATP-dependent RNA




helicase DHX36



Q14562
DHX8 ATP-dependent RNA helicase




DHX8



Q08211
DHX9 ATP-dependent RNA helicase A



Q9NR28
DIABLO Diablo homolog, mitochondrial



O60610
DIAPH1 Protein diaphanous homolog 1



Q9Y2L1
DIS3 Exosome complex exonuclease




RRP44



P10515
DLAT Dihydrolipoyllysine-residue




acetyltransferase comp



P09622
DLD Dihydrolipoyl dehydrogenase,




mitochondrial



Q15398
DLGAP5 Disks large-associated protein 5



P31689
DNAJA1 DnaJ homolog subfamily A




member 1



O60884
DNAJA2 DnaJ homolog subfamily A




member 2



Q96EY1
DNAJA3 DnaJ homolog subfamily A




member 3, mitochondrial



P25685
DNAJB1 DnaJ homolog subfamily B




member 1



Q9NXW2
DNAJB12 DnaJ homolog subfamily B




member 12



Q96KC8
DNAJC1 DnaJ homolog subfamily C




member 1



Q8IXB1
DNAJC10 DnaJ homolog subfamily C




member 10



Q9NVH1
DNAJC11 DnaJ homolog subfamily C




member 11



Q99543
DNAJC2 DnaJ homolog subfamily C




member 2



Q9H3Z4
DNAJC5 DnaJ homolog subfamily C




member 5



Q99615
DNAJC7 DnaJ homolog subfamily C




member 7



O75937
DNAJC8 DnaJ homolog subfamily C




member 8



Q8WXX5
DNAJC9 DnaJ homolog subfamily C




member 9



O00115
DNASE2 Deoxyribonuclease-2-alpha



Q05193
DNM1 Dynamin-1



O00429
DNM1L Dynamin-1-like protein



P50570
DNM2 Dynamin-2



Q9UQ16
DNM3 Dynamin-3



Q9BU89
DOHH Deoxyhypusine hydroxylase



Q9UPQ8
DOLK Dolichol kinase



Q86YN1
DOLPP1 Dolichyldiphosphatase 1



O60762
DPM1 Dolichol-phosphate




mannosyltransferase



Q9NY33
DPP3 Dipeptidyl peptidase 3



Q9UHL4
DPP7 Dipeptidyl peptidase 2



Q9Y295
DRG1 Developmentally-regulated GTP-




binding protein 1



Q08554
DSC1 Desmocollin-1



Q02413
DSG1 Desmoglein-1



P15924
DSP Desmoplakin



P60981
DSTN Destrin



Q14204
DYNC1H1 Cytoplasmic dynein 1 heavy




chain 1



Q13409
DYNC1I2 Cytoplasmic dynein 1




intermediate chain 2



Q9Y6G9
DYNC1LI1 Cytoplasmic dynein 1 light




intermediate chain 1



P63167
DYNLL1 Dynein light chain 1,




cytoplasmic



Q96FJ2
DYNLL2 Dynein light chain 2,




cytoplasmic



Q99848
EBNA1BP2 Probable rRNA-processing




protein EBP2



O95905
ECD Protein SGT1



P42892
ECE1 Endothelin-converting enzyme 1



Q13011
ECH1 Delta(3,5)-Delta(2,4)-dienoyl-CoA




isomerase, mitoc



Q9NTX5
ECHDC1 Ethylmalonyl-CoA




decarboxylase



P30084
ECHS1 Enoyl-CoA hydratase,




mitochondrial



P42126
ECI1 Enoyl-CoA delta isomerase 1,




mitochondrial



O75521
ECI2 Enoyl-CoA delta isomerase 2,




mitochondrial



Q5VYK3
ECM29 Proteasome-associated protein




ECM29 homolog



Q6P2E9
EDC4 Enhancer of mRNA-decapping




protein 4



P68104
EEF1A1 Elongation factor 1-alpha 1



Q5VTE0
EEF1A1P5 Putative elongation factor 1-




alpha-like 3



P24534
EEF1B2 Elongation factor 1-beta



E9PRY8
EEF1D Elongation factor 1-delta



P26641
EEF1G Elongation factor 1-gamma



P13639
EEF2 Elongation factor 2



Q8IYU8
EFHA1 EF-hand domain-containing family




member A1



Q15029
EFTUD2 116 kDa U5 small nuclear




ribonucleoprotein compone



Q9H4M9
EHD1 EH domain-containing protein 1



Q9H223
EHD4 EH domain-containing protein 4



O14681
EI24 Etoposide-induced protein 2.4




homolog



Q9BY44
EIF2A Eukaryotic translation initiation




factor 2A



P19525
EIF2AK2 Interferon-induced, double-




stranded RNA-activated



Q9NR50
EIF2B3 Translation initiation factor eIF-




2B subunit gamma



P05198
EIF2S1 Eukaryotic translation initiation




factor 2 subunit



P20042
EIF2S2 Eukaryotic translation initiation




factor 2 subunit



P41091
EIF2S3 Eukaryotic translation initiation




factor 2 subunit



Q14152
EIF3A Eukaryotic translation initiation




factor 3 subunit



P55884
EIF3B Eukaryotic translation initiation




factor 3 subunit



B5ME19
EIF3CL Eukaryotic translation initiation




factor 3 subunit



O15371
EIF3D Eukaryotic translation initiation




factor 3 subunit



P60228
EIF3E Eukaryotic translation initiation




factor 3 subunit



B0QY89
EIF3EIP Eukaryotic translation initiation




factor 3 subunit



O00303
EIF3F Eukaryotic translation initiation




factor 3 subunit



O75821
EIF3G Eukaryotic translation initiation




factor 3 subunit



O15372
EIF3H Eukaryotic translation initiation




factor 3 subunit



Q13347
EIF3I Eukaryotic translation initiation




factor 3 subunit



O75822
EIF3J Eukaryotic translation initiation




factor 3 subunit



Q9Y262
EIF3L Eukaryotic translation initiation




factor 3 subunit



Q7L2H7
EIF3M Eukaryotic translation initiation




factor 3 subunit



P60842
EIF4A1 Eukaryotic initiation factor 4A-I



Q14240
EIF4A2 Eukaryotic initiation factor 4A-II



P38919
EIF4A3 Eukaryotic initiation factor 4A-III



P23588
EIF4B Eukaryotic translation initiation




factor 4B



P06730
EIF4E Eukaryotic translation initiation




factor 4E



Q04637
EIF4G1 Eukaryotic translation initiation




factor 4 gamma 1



P78344
EIF4G2 Eukaryotic translation initiation




factor 4 gamma 2



Q15056
EIF4H Eukaryotic translation initiation




factor 4H



P55010
EIF5 Eukaryotic translation initiation




factor 5



P63241
EIF5A Eukaryotic translation initiation




factor 5A-1



Q9GZV4
EIF5A2 Eukaryotic translation initiation




factor 5A-2



O60841
EIF5B Eukaryotic translation initiation




factor 5B



P56537
EIF6 Eukaryotic translation initiation




factor 6



Q9BQ52
ELAC2 Zinc phosphodiesterase ELAC




protein 2



Q15717
ELAVL1 ELAV-like protein 1



Q8IZ81
ELMOD2 ELMO domain-containing




protein 2



Q9NXB9
ELOVL2 Elongation of very long chain




fatty acids protein



Q8N766
EMC1 ER membrane protein complex




subunit 1



Q9NPA0
EMC7 ER membrane protein complex




subunit 7



P50402
EMD Emerin



O94919
ENDOD1 Endonuclease domain-




containing 1 protein



Q9UHY7
ENOPH1 Enolase-phosphatase E1



P11171
EPB41 Protein 4.1



O43491
EPB41L2 Band 4.1-like protein 2



Q9UM22
EPDR1 Mammalian ependymin-related




protein 1



P07099
EPHX1 Epoxide hydrolase 1



P34913
EPHX2 Bifunctional epoxide hydrolase 2



P07814
EPRS Bifunctional glutamate/proline--




tRNA ligase



P42566
EPS15 Epidermal growth factor receptor




substrate 15



Q9UBC2
EPS15L1 Epidermal growth factor receptor




substrate 15-like



Q9NZ08
ERAP1 Endoplasmic reticulum




aminopeptidase 1



Q9Y282
ERGIC3 Endoplasmic reticulum-Golgi




intermediate compartme



P84090
ERH Enhancer of rudimentary homolog



O75477
ERLIN1 Erlin-1



O94905
ERLIN2 Erlin-2



Q96HE7
ERO1L ERO1-like protein alpha



P30040
ERP29 Endoplasmic reticulum resident




protein 29



Q9B526
ERP44 Endoplasmic reticulum resident




protein 44



Q9BSJ8
ESYT1 Extended synaptotagmin-1



A0FGR8
ESYT2 Extended synaptotagmin-2



P62495
ETF1 Eukaryotic peptide chain release




factor subunit 1



P13804
ETFA Electron transfer flavoprotein




subunit alpha, mito



P38117
ETFB Electron transfer flavoprotein




subunit beta



Q16134
ETFDH Electron transfer flavoprotein-




ubiquinone oxidored



Q01844
EWSR1 RNA-binding protein EWS



Q9UQ84
EXO1 Exonuclease 1



Q96KP1
EXOC2 Exocyst complex component 2



Q96A65
EXOC4 Exocyst complex component 4



O00471
EXOC5 Exocyst complex component 5



Q01780
EXOSC10 Exosome component 10



Q9NQT5
EXOSC3 Exosome complex component




RRP40



P15311
EZR Ezrin



Q9Y624
F11R Junctional adhesion molecule A



O60427
FADS1 Fatty acid desaturase 1



O95864
FADS2 Fatty acid desaturase 2



Q9UNN5
FAF1 FAS-associated factor 1



Q96CS3
FAF2 FAS-associated factor 2



P16930
FAH Fumarylacetoacetase



Q9NRY5
FAM114A2 Protein FAM114A2



Q96TA1
FAM129B Niban-like protein 1



Q96A26
FAM162A Protein FAM162A



Q9BTY7
FAM203A Protein FAM203A



P0CB43
FAM203B Protein FAM203B



Q9UK61
FAM208A Protein FAM208A



Q9BRX8
FAM213A Redox-regulatory protein




FAM213A



Q92520
FAM3C Protein FAM3C



Q9NUQ9
FAM49B Protein FAM49B



Q9H019
FAM54B Protein FAM54B



Q96TC7
FAM82A2 Regulator of microtubule




dynamics protein 3



Q96DB5
FAM82B Regulator of microtubule




dynamics protein 1



Q9UBU6
FAM8A1 Protein FAM8A1



Q8NCA5
FAM98A Protein FAM98A



Q52LJ0
FAM98B Protein FAM98B



Q9NVI1
FANCI Fanconi anemia group I protein



Q8WVX9
FAR1 Fatty acyl-CoA reductase 1



Q9Y285
FARSA Phenylalanine--tRNA ligase alpha




subunit



Q9NSD9
FARSB Phenylalanine--tRNA ligase beta




subunit



P49327
FASN Fatty acid synthase



P22087
FBL rRNA 2-O-methyltransferase




fibrillarin



P37268
FDFT1 Squalene synthase



P22830
FECH Ferrochelatase, mitochondrial



P39748
FEN1 Flap endonuclease 1



Q86UX7
FERMT3 Fermitin family homolog 3



O95684
FGFR1OP FGFR1 oncogene partner



P07954
FH Fumarate hydratase, mitochondrial



Q9Y613
FHOD1 FH1/FH2 domain-containing




protein 1



Q6UN15
FIP1L1 Pre-mRNA 3-end-processing




factor FIP1



Q96AY3
FKBP10 Peptidyl-prolyl cis-trans




isomerase FKBP10



Q9NWM8
FKBP14 Peptidyl-prolyl cis-trans




isomerase FKBP14



P62942
FKBP1A Peptidyl-prolyl cis-trans




isomerase FKBP1A



Q00688
FKBP3 Peptidyl-prolyl cis-trans isomerase




FKBP3



Q02790
FKBP4 Peptidyl-prolyl cis-trans isomerase




FKBP4



Q13451
FKBP5 Peptidyl-prolyl cis-trans isomerase




FKBP5



Q9Y680
FKBP7 Peptidyl-prolyl cis-trans isomerase




FKBP7



Q14318
FKBP8 Peptidyl-prolyl cis-trans isomerase




FKBP8



Q8NFF5
FLAD1 FAD synthase



Q13045
FLII Protein flightless-1 homolog



Q14315
FLNC Filamin-C



O75955
FLOT1 Flotillin-1



Q14254
FLOT2 Flotillin-2



Q06787
FMR1 Fragile X mental retardation protein 1



Q9H479
FN3K Fructosamine-3-kinase



P49354
FNTA Protein




farnesyltransferase/geranylgeranyltransfer



Q96CU9
FOXRED1 FAD-dependent




oxidoreductase domain-containing pro



Q16658
FSCN1 Fascin



Q8IY81
FTSJ3 pre-rRNA processing protein FTSJ3



Q96AE4
FUBP1 Far upstream element-binding




protein 1



Q96I24
FUBP3 Far upstream element-binding




protein 3



P04066
FUCA1 Tissue alpha-L-fucosidase



Q9BTY2
FUCA2 Plasma alpha-L-fucosidase



P35637
FUS RNA-binding protein FUS



P51114
FXR1 Fragile X mental retardation




syndrome-related prot



P51116
FXR2 Fragile X mental retardation




syndrome-related prot



Q13283
G3BP1 Ras GTPase-activating protein-




binding protein 1



Q9UN86
G3BP2 Ras GTPase-activating protein-




binding protein 2



P11413
G6PD Glucose-6-phosphate 1-




dehydrogenase



P10253
GAA Lysosomal alpha-glucosidase



O14976
GAK Cyclin-G-associated kinase



Q10472
GALNT1 Polypeptide N-




acetylgalactosaminyltransferase 1



Q10471
GALNT2 Polypeptide N-




acetylgalactosaminyltransferase 2



Q8N4A0
GALNT4 Polypeptide N-




acetylgalactosaminyltransferase 4



Q14697
GANAB Neutral alpha-glucosidase AB



Q14C86
GAPVD1 GTPase-activating protein and




VPS9 domain-containi



P41250
GARS Glycine--tRNA ligase



P22102
GART Trifunctional purine biosynthetic




protein adenosin



P04062
GBA Glucosylceramidase



O75323
GBAS Protein NipSnap homolog 2



Q92538
GBF1 Golgi-specific brefeldin A-




resistance guanine nucl



O75600
GCAT 2-amino-3-ketobutyrate coenzyme




A ligase, mitochon



Q92616
GCN1L1 Translational activator GCN1



P31150
GDI1 Rab GDP dissociation inhibitor




alpha



P50395
GDI2 Rab GDP dissociation inhibitor beta



Q8N9F7
GDPD1 Glycerophosphodiester




phosphodiesterase domain-con



Q7L5D6
GET4 Golgi to ER traffic protein 4




homolog



Q96RP9
GFM1 Elongation factor G, mitochondrial



Q06210
GFPT1 Glucosamine--fructose-6-




phosphate aminotransferase



P38435
GGCX Vitamin K-dependent gamma-




carboxylase



Q92820
GGH Gamma-glutamyl hydrolase



Q9UJ14
GGT7 Gamma-glutamyltransferase 7



Q9H3K2
GHITM Growth hormone-inducible




transmembrane protein



Q6Y7W6
GIGYF2 PERQ amino acid-rich with GYF




domain-containing pr



P32189
GK Glycerol kinase



P06280
GLA Alpha-galactosidase A



P16278
GLB1 Beta-galactosidase



Q92896
GLG1 Golgi apparatus protein 1



Q04760
GLO1 Lactoylglutathione lyase



Q9HC38
GLOD4 Glyoxalase domain-containing




protein 4



O76003
GLRX3 Glutaredoxin-3



O94925
GLS Glutaminase kidney isoform,




mitochondrial



Q68CQ7
GLT8D1 Glycosyltransferase 8 domain-




containing protein 1



P00367
GLUD1 Glutamate dehydrogenase 1,




mitochondrial



P49448
GLUD2 Glutamate dehydrogenase 2,




mitochondrial



P17900
GM2A Ganglioside GM2 activator



P49915
GMPS GMP synthase [glutamine-




hydrolyzing]



P04899
GNAI2 Guanine nucleotide-binding




protein G(i) subunit al



P08754
GNAI3 Guanine nucleotide-binding




protein G(k) subunit al



P62873
GNB1 Guanine nucleotide-binding protein




G(I)/G(S)/G(T)



P62879
GNB2 Guanine nucleotide-binding protein




G(I)/G(S)/G(T)



P63244
GNB2L1 Guanine nucleotide-binding




protein subunit beta-2-



Q13823
GNL2 Nucleolar GTP-binding protein 2



Q9BVP2
GNL3 Guanine nucleotide-binding protein-




like 3



O15228
GNPAT Dihydroxyacetone phosphate




acyltransferase



P15586
GNS N-acetylglucosamine-6-sulfatase



Q08378
GOLGA3 Golgin subfamily A member 3



Q8TBA6
GOLGA5 Golgin subfamily A member 5



O00461
GOLIM4 Golgi integral membrane protein 4



Q8NBJ4
GOLM1 Golgi membrane protein 1



Q9H4A6
GOLPH3 Golgi phosphoprotein 3



Q9H4A5
GOLPH3L Golgi phosphoprotein 3-like



Q9HD26
GOPC Golgi-associated PDZ and coiled-




coil motif-contain



O95249
GOSR1 Golgi SNAP receptor complex




member 1



P00505
GOT2 Aspartate aminotransferase,




mitochondrial



O43292
GPAA1 Glycosylphosphatidylinositol




anchor attachment 1 p



Q9HCL2
GPAM Glycerol-3-phosphate




acyltransferase 1, mitochondr



P43304
GPD2 Glycerol-3-phosphate




dehydrogenase, mitochondrial



Q5VW38
GPR107 Protein GPR107



P0CG08
GPR89B Golgi pH regulator B



P36969
GPX4 Phospholipid hydroperoxide




glutathione peroxidase,



Q8TED1
GPX8 Probable glutathione peroxidase 8



P62993
GRB2 Growth factor receptor-bound




protein 2



Q9UBQ7
GRHPR Glyoxylate




reductase/hydroxypyruvate reductase



Q9HAV7
GRPEL1 GrpE protein homolog 1,




mitochondrial



Q12849
GRSF1 G-rich sequence factor 1



Q9BQ67
GRWD1 Glutamate-rich WD repeat-




containing protein 1



P15170
GSPT1 Eukaryotic peptide chain release




factor GTP-bindin



Q8IYD1
GSPT2 Eukaryotic peptide chain release




factor GTP-bindin



P00390
GSR Glutathione reductase, mitochondrial



P48637
GSS Glutathione synthetase



Q9Y2Q3
GSTK1 Glutathione S-transferase kappa 1



P21266
GSTM3 Glutathione S-transferase Mu 3



P78417
GSTO1 Glutathione S-transferase omega-1



P09211
GSTP1 Glutathione S-transferase P



P78347
GTF21 General transcription factor II-I



Q9Y5Q9
GTF3C3 General transcription factor 3C




polypeptide 3



O00178
GTPBP1 GTP-binding protein 1



Q9BZE4
GTPBP4 Nucleolar GTP-binding protein 1



P08236
GUSB Beta-glucuronidase



P13807
GYS1 Glycogen



P16104
H2AFX Histone H2A.x



O75367
H2AFY Core histone macro-H2A.1



P0C0S5
H2AFZ Histone H2A.Z



Q16836
HADH Hydroxyacyl-coenzyme A




dehydrogenase, mitochondria



P40939
HADHA Trifunctional enzyme subunit




alpha, mitochondrial



P55084
HADHB Trifunctional enzyme subunit




beta, mitochondrial



P12081
HARS Histidine--tRNA ligase,




cytoplasmic



O14929
HAT1 Histone acetyltransferase type B




catalytic subunit



Q96CS2
HAUS1 HAUS augmin-like complex




subunit 1



Q9NVX0
HAUS2 HAUS augmin-like complex




subunit 2



Q68CZ6
HAUS3 HAUS augmin-like complex




subunit 3



Q9H6D7
HAUS4 HAUS augmin-like complex




subunit 4



O94927
HAUS5 HAUS augmin-like complex




subunit 5



O00165
HAX1 HCLS1-associated protein X-1



P69905
HBA2 Hemoglobin subunit alpha



P68871
HBB Hemoglobin subunit beta



P02100
HBE1 Hemoglobin subunit epsilon



P69891
HBG1 Hemoglobin subunit gamma-1



P69892
HBG2 Hemoglobin subunit gamma-2



Q9Y450
HBS1L HBS1-like protein



P02008
HBZ Hemoglobin subunit zeta



P53701
HCCS Cytochrome c-type heme lyase



Q13547
HDAC1 Histone deacetylase 1



Q92769
HDAC2 Histone deacetylase 2



P51858
HDGF Hepatoma-derived growth factor



Q9BSH5
HDHD3 Haloacid dehalogenase-like




hydrolase domain-contai



Q00341
HDLBP Vigilin



Q9H583
HEATR1 HEAT repeat-containing protein 1



Q86Y56
HEATR2 HEAT repeat-containing protein 2



Q7Z4Q2
HEATR3 HEAT repeat-containing protein 3



Q9NRZ9
HELLS Lymphoid-specific helicase



Q9BXL5
HEMGN Hemogen



P06865
HEXA Beta-hexosaminidase subunit alpha



P07686
HEXB Beta-hexosaminidase subunit beta



P31937
HIBADH 3-hydroxyisobutyrate




dehydrogenase, mitochondrial



Q6NVY1
HIBCH 3-hydroxyisobutyryl-CoA




hydrolase, mitochondrial



Q9Y241
HIGD1A HIG1 domain family member 1A



P49773
HINT1 Histidine triad nucleotide-binding




protein 1



Q9NQE9
HINT3 Histidine triad nucleotide-binding




protein 3



P16403
HIST1H1C Histone H1.2



P16402
HIST1H1D Histone H1.3



Q16777
HIST2H2AC Histone H2A type 2-C



P19367
HK1 Hexokinase-1



P52789
HK2 Hexokinase-2



P30443
HLA-A HLA class I histocompatibility




antigen, A-1 alpha



P01892
HLA-A HLA class I histocompatibility




antigen, A-2 alpha



P04439
HLA-A HLA class I histocompatibility




antigen, A-3 alpha



P01891
HLA-A HLA class I histocompatibility




antigen, A-68 alpha



P30462
HLA-B HLA class I histocompatibility




antigen, B-14 alpha



P18463
HLA-B HLA class I histocompatibility




antigen, B-37 alpha



Q29940
HLA-B HLA class I histocompatibility




antigen, B-59 alpha



Q31612
HLA-B HLA class I histocompatibility




antigen, B-73 alpha



P30460
HLA-B HLA class I histocompatibility




antigen, B-8 alpha



P30499
HLA-C HLA class I histocompatibility




antigen, Cw-1 alpha



F8VZB9
HLA-C HLA class I histocompatibility




antigen, Cw-14 alph



Q07000
HLA-C HLA class I histocompatibility




antigen, Cw-15 alph



Q29963
HLA-C HLA class I histocompatibility




antigen, Cw-6 alpha



P10321
HLA-C HLA class I histocompatibility




antigen, Cw-7 alpha



Q8TCT9
HM13 Minor histocompatibility antigen




H13



P09429
HMGB1 High mobility group protein B1



P26583
HMGB2 High mobility group protein B2



O15347
HMGB3 High mobility group protein B3



Q01581
HMGCS1 Hydroxymethylglutaryl-CoA




synthase, cytoplasmic



P09601
HMOX1 Heme oxygenase 1



P30519
HMOX2 Heme oxygenase 2



Q13151
HNRNPA0 Heterogeneous nuclear




ribonucleoprotein A0



P09651
HNRNPA1 Heterogeneous nuclear




ribonucleoprotein A1



Q32P51
HNRNPA1L2 Heterogeneous nuclear




ribonucleoprotein A1-like 2



P22626
HNRNPA2B1 Heterogeneous nuclear




ribonucleoproteins A2/B1



P51991
HNRNPA3 Heterogeneous nuclear




ribonucleoprotein A3



Q99729
HNRNPAB Heterogeneous nuclear




ribonucleoprotein A/B



P07910
HNRNPC Heterogeneous nuclear




ribonucleoproteins C1/C2



O60812
HNRNPCL1 Heterogeneous nuclear




ribonucleoprotein C-like 1



Q14103
HNRNPD Heterogeneous nuclear




ribonucleoprotein D0



P52597
HNRNPF Heterogeneous nuclear




ribonucleoprotein F



P31943
HNRNPH1 Heterogeneous nuclear




ribonucleoprotein H



P55795
HNRNPH2 Heterogeneous nuclear




ribonucleoprotein H2



P31942
HNRNPH3 Heterogeneous nuclear




ribonucleoprotein H3



P61978
HNRNPK Heterogeneous nuclear




ribonucleoprotein K



P14866
HNRNPL Heterogeneous nuclear




ribonucleoprotein L



P52272
HNRNPM Heterogeneous nuclear




ribonucleoprotein M



O43390
HNRNPR Heterogeneous nuclear




ribonucleoprotein R



Q00839
HNRNPU Heterogeneous nuclear




ribonucleoprotein U



Q9BUJ2
HNRNPUL1 Heterogeneous nuclear




ribonucleoprotein U-like pro



Q1KMD3
HNRNPUL2 Heterogeneous nuclear




ribonucleoprotein U-like pro



O14979
HNRPDL Heterogeneous nuclear




ribonucleoprotein D-like



Q8WVV9
HNRPLL Heterogeneous nuclear




ribonucleoprotein L-like



Q5SSJ5
HP1BP3 Heterochromatin protein 1-




binding protein 3



P37235
HPCAL1 Hippocalcin-like protein 1



P00492
HPRT1 Hypoxanthine-guanine




phosphoribosyltransferase



Q86YZ3
HRNR Hornerin



Q7LGA3
HS2ST1 Heparan sulfate 2-O-




sulfotransferase 1



Q99714
HSD17B10 3-hydroxyacyl-CoA




dehydrogenase type-2



Q8NBQ5
HSD17B11 Estradiol 17-beta-




dehydrogenase 11



Q53GQ0
HSD17B12 Estradiol 17-beta-




dehydrogenase 12



P51659
HSD17B4 Peroxisomal multifunctional




enzyme type 2



Q3SXM5
HSDL1 Inactive hydroxysteroid




dehydrogenase-like protein



Q6YN16
HSDL2 Hydroxysteroid dehydrogenase-




like protein 2



P07900
HSP90AA1 Heat shock protein HSP 90-




alpha



P08238
HSP90AB1 Heat shock protein HSP 90-




beta



P14625
HSP90B1 Endoplasmin



Q0VDF9
HSPA14 Heat shock 70 kDa protein 14



P08107
HSPA1A Heat shock 70 kDa protein




1A/1B



P34931
HSPA1L Heat shock 70 kDa protein 1-like



P11021
HSPA5 78 kDa glucose-regulated protein



P17066
HSPA6 Heat shock 70 kDa protein 6



P11142
HSPA8 Heat shock cognate 71 kDa protein



P38646
HSPA9 Stress-70 protein, mitochondrial



P04792
HSPB1 Heat shock protein beta-1



Q9NZL4
HSPBP1 Hsp70-binding protein 1



P10809
HSPD1 60 kDa heat shock protein,




mitochondrial



P61604
HSPE1 10 kDa heat shock protein,




mitochondrial



Q92598
HSPH1 Heat shock protein 105 kDa



O43719
HTATSF1 HIV Tat-specific factor 1



Q7Z6Z7
HUWE1 E3 ubiquitin-protein ligase




HUWE1



Q9Y4L1
HYOU1 Hypoxia up-regulated protein 1



P41252
IARS Isoleucine--tRNA ligase,




cytoplasmic



Q9NSE4
IARS2 Isoleucine--tRNA ligase,




mitochondrial



O60725
ICMT Protein-S-isoprenylcysteine O-




methyltransferase



P14735
IDE Insulin-degrading enzyme



O75874
IDH1 Isocitrate dehydrogenase [NADP]




cytoplasmic



P48735
IDH2 Isocitrate dehydrogenase



P50213
IDH3A Isocitrate dehydrogenase



O43837
IDH3B Isocitrate dehydrogenase [NAD]




subunit beta, mitoc



P13284
IF130 Gamma-interferon-inducible




lysosomal thiol reducta



Q9NZI8
IGF2BP1 Insulin-like growth factor 2




mRNA-binding protein



Q9Y6M1
IGF2BP2 Insulin-like growth factor 2




mRNA-binding protein



O00425
IGF2BP3 Insulin-like growth factor 2




mRNA-binding protein



Q13123
IK Protein Red



Q12905
ILF2 Interleukin enhancer-binding factor 2



Q12906
ILF3 Interleukin enhancer-binding factor 3



A1L0T0
ILVBL Acetolactate synthase-like protein



Q16891
IMMT Mitochondrial inner membrane




protein



Q9NX62
IMPAD1 Inositol monophosphatase 3



P12268
IMPDH2 Inosine-5-monophosphate




dehydrogenase 2



Q16352
INA Alpha-internexin



Q9UI26
IPO11 Importin-11



Q8IEX9
IPO4 Importin-4



O00410
IPO5 Importin-5



O95373
IPO7 Importin-7



O15397
IPO8 Importin-8



Q96P70
IPO9 Importin-9



P46940
IQGAP1 Ras GTPase-activating-like




protein IQGAP1



O14654
IRS4 Insulin receptor substrate 4



Q96CN7
ISOC1 Isochorismatase domain-containing




protein 1



Q96J02
ITCH E3 ubiquitin-protein ligase Itchy




homolog



Q9Y287
ITM2B Integral membrane protein 2B



Q8N5M9
JAGN1 Protein jagunal homolog 1



P14923
JUP Junction plakoglobin



Q15046
KARS Lysine-tRNA ligase



Q96CX2
KCTD12 BTB/POZ domain-containing




protein KCTD12



P24390
KDELR1 ER lumen protein retaining




receptor 1



P33947
KDELR2 ER lumen protein retaining




receptor 2



O43731
KDELR3 ER lumen protein retaining




receptor 3



Q8NB78
KDM1B Lysine-specific histone




demethylase 1B



Q06136
KDSR 3-ketodihydrosphingosine reductase



Q07666
KHDRBS1 KH domain-containing, RNA-




binding, signal transduc



Q92945
KHSRP Far upstream element-binding




protein 2



Q15397
KIAA0020 Pumilio domain-containing




protein KIAA0020



O75153
KIAA0664 Clustered mitochondria protein




homolog



Q2M389
KIAA1033 WASH complex subunit 7



Q96EK5
KIAA1279 KIF1-binding protein



Q8N163
KIAA1967 DBIRD complex subunit




KIAA1967



Q8IYS2
KIAA2013 Uncharacterized protein




KIAA2013



P52732
KIF11 Kinesin-like protein KIF11



Q14807
KIF22 Kinesin-like protein KIF22



Q99661
KIF2C Kinesin-like protein KIF2C



P33176
KIF5B Kinesin-1 heavy chain



Q07866
KLC1 Kinesin light chain 1



Q9H0B6
KLC2 Kinesin light chain 2



P50748
KNTC1 Kinetochore-associated protein 1



P52294
KPNA1 Importin subunit alpha-1



P52292
KPNA2 Importin subunit alpha-2



O00505
KPNA3 Importin subunit alpha-3



O00629
KPNA4 Importin subunit alpha-4



O60684
KPNA6 Importin subunit alpha-7



Q14974
KPNB1 Importin subunit beta-1



Q5T749
KPRP Keratinocyte proline-rich protein



Q86UP2
KTN1 Kinectin



Q9H9P8
L2HGDH L-2-hydroxyglutarate




dehydrogenase, mitochondrial



P11279
LAMP1 Lysosome-associated membrane




glycoprotein 1



P13473
LAMP2 Lysosome-associated membrane




glycoprotein 2



Q6IAA8
LAMTOR1 Ragulator complex protein




LAMTOR1



P28838
LAP3 Cytosol aminopeptidase



Q6PKG0
LARP1 La-related protein 1



Q71RC2
LARP4 La-related protein 4



Q92615
LARP4B La-related protein 4B



Q9P2J5
LARS Leucine--tRNA ligase, cytoplasmic



Q15031
LARS2 Probable leucine--tRNA ligase,




mitochondrial



Q9Y4W2
LAS1L Ribosomal biogenesis protein




LAS1L



Q14739
LBR Lamin-B receptor



P00338
LDHA L-lactate dehydrogenase A chain



P07195
LDHB L-lactate dehydrogenase B chain



Q9Y2U8
LEMD3 Inner nuclear membrane protein




Man1



Q32P28
LEPRE1 Prolyl 3-hydroxylase 1



O95202
LETM1 LETM1 and EF-hand domain-




containing protein 1, mit



Q08380
LGALS3BP Galectin-3-binding protein



Q99538
LGMN Legumain



P18858
LIG1 DNA ligase 1



P38571
LIPA Lysosomal acid lipase/cholesteryl




ester hydrolase



P49257
LMAN1 Protein ERGIC-53



Q12907
LMAN2 Vesicular integral-membrane




protein VIP36



Q8WVP7
LMBR1 Limb region 1 protein homolog



Q68DH5
LMBRD2 LMBR1 domain-containing




protein 2



Q9BU23
LMF2 Lipase maturation factor 2



P02545
LMNA Prelamin-A/C



P20700
LMNB1 Lamin-B1



Q03252
LMNB2 Lamin-B2



Q9UIQ6
LNPEP Leucyl-cystinyl aminopeptidase



P36776
LONP1 Lon protease homolog,




mitochondrial



Q8NF37
LPCAT1 Lysophosphatidylcholine




acyltransferase 1



Q6P1A2
LPCAT3 Lysophospholipid acyltransferase 5



Q92604
LPGAT1 Acyl-




CoA:lysophosphatidylglycerol




acyltransferase



P42704
LRPPRC Leucine-rich PPR motif-




containing protein, mitocho



Q8N1G4
LRRC47 Leucine-rich repeat-containing




protein 47



Q96AG4
LRRC59 Leucine-rich repeat-containing




protein 59



Q9UFC0
LRWD1 Leucine-rich repeat and WD




repeat-containing prote



Q8ND56
LSM14A Protein LSM14 homolog A



Q9BX40
LSM14B Protein LSM14 homolog B



P48449
LSS Lanosterol synthase



P09960
LTA4H Leukotriene A-4 hydrolase



Q96GA3
LTV1 Protein LTV1 homolog



O95232
LUC7L3 Luc7-like protein 3



P07948
LYN Tyrosine-protein kinase Lyn



Q9UPN3
MACF1 Microtubule-actin cross-linking




factor 1, isoforms



P43366
MAGEB1 Melanoma-associated antigen




B1



O15479
MAGEB2 Melanoma-associated antigen




B2



O60732
MAGEC1 Melanoma-associated antigen




C1



Q9UBF1
MAGEC2 Melanoma-associated antigen




C2



Q9Y5V3
MAGED1 Melanoma-associated antigen




D1



Q9UNF1
MAGED2 Melanoma-associated antigen




D2



Q96A72
MAGOHB Protein mago nashi homolog 2



Q9H0U3
MAGT1 Magnesium transporter protein 1



P33908
MAN1A1 Mannosyl-oligosaccharide 1,2-




alpha-mannosidase IA



O00754
MAN2B1 Lysosomal alpha-mannosidase



Q9Y2E5
MAN2B2 Epididymis-specific alpha-




mannosidase



P46821
MAP1B Microtubule-associated protein 1B



Q02750
MAP2K1 Dual specificity mitogen-




activated protein kinase



P36507
MAP2K2 Dual specificity mitogen-




activated protein kinase



P27816
MAP4 Microtubule-associated protein 4



P28482
MAPK1 Mitogen-activated protein kinase 1



P27361
MAPK3 Mitogen-activated protein kinase 3



Q15691
MAPRE1 Microtubule-associated protein




RP/EB family member



Q15555
MAPRE2 Microtubule-associated protein




RP/EB family member



Q9NX47
MARCH5 E3 ubiquitin-protein ligase




MARCH5



P56192
MARS Methionine--tRNA ligase,




cytoplasmic



Q96GX5
MASTL Serine/threonine-protein kinase




greatwall



P43243
MATR3 Matrin-3



Q7Z434
MAVS Mitochondrial antiviral-signaling




protein



Q96N66
MBOAT7 Lysophospholipid




acyltransferase 7



Q8IVS2
MCAT Malonyl-CoA-acyl carrier protein




transacylase, mit



Q9HCC0
MCCC2 Methylcrotonoyl-CoA




carboxylase beta chain, mitoch



Q8NI22
MCFD2 Multiple coagulation factor




deficiency protein 2



P49736
MCM2 DNA replication licensing factor




MCM2



P25205
MCM3 DNA replication licensing factor




MCM3



P33991
MCM4 DNA replication licensing factor




MCM4



P33992
MCM5 DNA replication licensing factor




MCM5



Q14566
MCM6 DNA replication licensing factor




MCM6



P33993
MCM7 DNA replication licensing factor




MCM7



Q9BTE3
MCMBP Mini-chromosome maintenance




complex-binding protei



Q9ULC4
MCTS1 Malignant T-cell-amplified




sequence 1



Q14676
MDC1 Mediator of DNA damage




checkpoint protein 1



P40926
MDH2 Malate dehydrogenase,




mitochondrial



P23368
ME2 NAD-dependent malic enzyme,




mitochondrial



O00470
MEIS1 Homeobox protein Meisl



O14770
MEIS2 Homeobox protein Meis2



Q7L2J0
MEPCE 7SK snRNA methylphosphate




capping enzyme



Q14696
MESDC2 LDLR chaperone MESD



Q8N6R0
METTL13 Methyltransferase-like protein




13



Q9H8H3
METTL7A Methyltransferase-like protein




7A



Q9GZY8
MFF Mitochondrial fission factor



O95140
MFN2 Mitofusin-2



Q6N075
MFSD5 Major facilitator superfamily




domain-containing pr



Q8NHS3
MFSD8 Major facilitator superfamily




domain-containing pr



O60502
MGEA5 Bifunctional protein NCOAT



O14880
MGST3 Microsomal glutathione S-




transferase 3



Q5JRA6
MIA3 Melanoma inhibitory activity




protein 3



Q9BPX6
MICU1 Calcium uptake protein 1,




mitochondrial



Q99797
MIPEP Mitochondrial intermediate




peptidase



P46013
MKI67 Antigen KI-67



Q9BYG3
MKI67IP MKI67 FHA domain-interacting




nucleolar phosphoprot



P55196
MLLT4 Afadin



Q96EY8
MMAB Cob(I)yrinic acid a,c-diamide




adenosyltransferase,



Q8N4V1
MMGT1 Membrane magnesium




transporter 1



Q96T76
MMS19 MMS19 nucleotide excision




repair protein homolog



Q13724
MOGS Mannosyl-oligosaccharide




glucosidase



Q9UBU8
MORF4L1 Mortality factor 4-like protein 1



Q15014
MORF4L2 Mortality factor 4-like protein 2



Q9HCE1
MOV10 Putative helicase MOV-10



O00566
MPHOSPH10 U3 small nucleolar




ribonucleoprotein protein MPP10



Q00013
MPP1 55 kDa erythrocyte membrane




protein



Q14168
MPP2 MAGUK p55 subfamily member 2



Q9NZW5
MPP6 MAGUK p55 subfamily member 6



P25325
MPST 3-mercaptopyruvate




sulfurtransferase



P39210
MPV17 Protein Mpv17



Q567V2
MPV17L2 Mpv17-like protein 2



Q7Z7H8
MRPL10 39S ribosomal protein L10,




mitochondrial



Q13084
MRPL28 39S ribosomal protein L28,




mitochondrial



Q9BZE1
MRPL37 39S ribosomal protein L37,




mitochondrial



Q9NYK5
MRPL39 39S ribosomal protein L39,




mitochondrial



Q9NQ50
MRPL40 39S ribosomal protein L40,




mitochondrial



Q9H9J2
MRPL44 39S ribosomal protein L44,




mitochondrial



Q9BRJ2
MRPL45 39S ribosomal protein L45,




mitochondrial



Q9H2W6
MRPL46 39S ribosomal protein L46,




mitochondrial



Q7Z7F7
MRPL55 39S ribosomal protein L55,




mitochondrial



Q9Y676
MRPS18B 28S ribosomal protein S18b,




mitochondrial



P82650
MRPS22 28S ribosomal protein S22,




mitochondrial



Q92552
MRPS27 28S ribosomal protein S27,




mitochondrial



Q92665
MRPS31 28S ribosomal protein S31,




mitochondrial



P82673
MRPS35 28S ribosomal protein S35,




mitochondrial



P82933
MRPS9 28S ribosomal protein S9,




mitochondrial



P43246
MSH2 DNA mismatch repair protein




Msh2



P52701
MSH6 DNA mismatch repair protein




Msh6



O43347
MSI1 RNA-binding protein Musashi




homolog 1



Q96DH6
MSI2 RNA-binding protein Musashi




homolog 2



P26038
MSN Moesin



Q9P289
MST4 Serine/threonine-protein kinase




MST4



Q9BUK6
MSTO1 Protein misato homolog 1



P00395
MT-CO1 Cytochrome c oxidase subunit 1



P00403
MT-CO2 Cytochrome c oxidase subunit 2



P03886
MT-ND1 NADH-ubiquinone




oxidoreductase chain 1



P03891
MT-ND2 NADH-ubiquinone




oxidoreductase chain 2



P03905
MT-ND4 NADH-ubiquinone




oxidoreductase chain 4



P03915
MT-ND5 NADH-ubiquinone




oxidoreductase chain 5



O94776
MTA2 Metastasis-associated protein




MTA2



Q13126
MTAP S-methyl-5-thioadenosine




phosphorylase



Q9NZJ7
MTCH1 Mitochondrial carrier homolog 1



Q9Y6C9
MTCH2 Mitochondrial carrier homolog 2



Q86UE4
MTDH Protein LYRIC



Q9UDX5
MTFP1 Mitochondrial fission process




protein 1



P11586
MTHFD1 C-1-tetrahydrofolate synthase,




cytoplasmic



Q6UB35
MTHFD1L Monofunctional C1-




tetrahydrofolate synthase, mitoc



P13995
MTHFD2 Bifunctional




methylenetetrahydrofolate dehydrogena



P42898
MTHFR Methylenetetrahydrofolate




reductase



Q13505
MTX1 Metaxin-1



O75431
MTX2 Metaxin-2



Q969V5
MUL1 Mitochondrial ubiquitin ligase




activator of NFKB 1



Q9BQG0
MYBBP1A Myb-binding protein 1A



P35580
MYH10 Myosin-10



P35749
MYH11 Myosin-11



P35579
MYH9 Myosin-9



O14950
MYL12B Myosin regulatory light chain




12B



P60660
MYL6 Myosin light polypeptide 6



Q96H55
MYO19 Unconventional myosin-XIX



P41227
NAA10 N-alpha-acetyltransferase 10



Q9BXJ9
NAA15 N-alpha-acetyltransferase 15,




NatA auxiliary subun



Q6N069
NAA16 N-alpha-acetyltransferase 16,




NatA auxiliary subun



Q14CX7
NAA25 N-alpha-acetyltransferase 25,




NatB auxiliary subun



Q86UY6
NAA40 N-alpha-acetyltransferase 40



Q13765
NACA Nascent polypeptide-associated




complex subunit alp



Q4G0N4
NADKD1 NAD kinase domain-




containing protein 1



Q13564
NAE1 NEDD8-activating enzyme E1




regulatory subunit



P54802
NAGLU Alpha-N-acetylglucosaminidase



P43490
NAMPT Nicotinamide




phosphoribosyltransferase



P55209
NAP1L1 Nucleosome assembly protein




1-like 1



Q99733
NAP1L4 Nucleosome assembly protein




1-like 4



F5HFY4
NAP1L4b Nucleosome assembly protein




1-like 4



P54920
NAPA Alpha-soluble NSF attachment




protein



P49321
NASP Nuclear autoantigenic sperm




protein



Q9H0A0
NAT10 N-acetyltransferase 10



Q15021
NCAPD2 Condensin complex subunit 1



Q9BPX3
NCAPG Condensin complex subunit 3



Q15003
NCAPH Condensin complex subunit 2



Q09161
NCBP1 Nuclear cap-binding protein




subunit 1



Q9UBB6
NCDN Neurochondrin



Q6PIU2
NCEH1 Neutral cholesterol ester




hydrolase 1



Q969V3
NCLN Nicalin



Q9HCD5
NCOA5 Nuclear receptor coactivator 5



Q92542
NCSTN Nicastrin



O95299
NDUFA10 NADH dehydrogenase




[ubiquinone] 1 alpha subcomplex



Q86Y39
NDUFA11 NADH dehydrogenase




[ubiquinone] 1 alpha subcomplex



Q9P0J0
NDUFA13 NADH dehydrogenase




[ubiquinone] 1 alpha subcomplex



O95167
NDUFA3 NADH dehydrogenase



P51970
NDUFA8 NADH dehydrogenase




[ubiquinone] 1 alpha subcomplex



Q16795
NDUFA9 NADH dehydrogenase




[ubiquinone] 1 alpha subcomplex



O14561
NDUFAB1 Acyl carrier protein,




mitochondrial



O96000
NDUFB10 NADH dehydrogenase




[ubiquinone] 1 beta subcomplex



O43676
NDUFB3 NADH dehydrogenase



O95168
NDUFB4 NADH dehydrogenase




[ubiquinone] 1 beta subcomplex



O95169
NDUFB8 NADH dehydrogenase




[ubiquinone] 1 beta subcomplex



Q9Y6M9
NDUFB9 NADH dehydrogenase




[ubiquinone] 1 beta subcomplex



P28331
NDUFS1 NADH-ubiquinone




oxidoreductase 75 kDa subunit, mit



O75306
NDUFS2 NADH dehydrogenase




[ubiquinone] iron-sulfur protei



O75489
NDUFS3 NADH dehydrogenase




[ubiquinone] iron-sulfur protei



O75251
NDUFS7 NADH dehydrogenase



O00217
NDUFS8 NADH dehydrogenase




[ubiquinone] iron-sulfur protei



P49821
NDUFV1 NADH dehydrogenase



P19404
NDUFV2 NADH dehydrogenase




[ubiquinone] flavoprotein 2, mi



P07197
NEFM Neurofilament medium




polypeptide



Q9UMX5
NENF Neudesin



Q8NBF2
NHLRC2 NHL repeat-containing protein 2



P55769
NHP2L1 NHP2-like protein 1



Q9Y221
NIP7 60S ribosome subunit biogenesis




protein NIP7 homol



Q9BPW8
NIPSNAP1 Protein NipSnap homolog 1



O15226
NKRF NF-kappa-B-repressing factor



Q9BYT8
NLN Neurolysin, mitochondrial



P30419
NMT1 Glycylpeptide N-




tetradecanoyltransferase 1



P40261
NNMT Nicotinamide N-




methyltransferase



Q13423
NNT NAD(P) transhydrogenase,




mitochondrial



Q9Y3T9
NOC2L Nucleolar complex protein 2




homolog



Q8WTT2
NOC3L Nucleolar complex protein 3




homolog



Q9BVI4
NOC4L Nucleolar complex protein 4




homolog



Q5SY16
NOL9 Polynucleotide 5-hydroxyl-kinase




NOL9



Q15155
NOMO1 Nodal modulator 1



Q5JPE7
NOMO2 Nodal modulator 2



P69849
NOMO3 Nodal modulator 3



Q15233
NONO Non-POU domain-containing




octamer-binding protein



O00567
NOP56 Nucleolar protein 56



Q9Y2X3
NOP58 Nucleolar protein 58



Q8IVI9
NOSTRIN Nostrin



O15118
NPC1 Niemann-Pick C1 protein



P61916
NPC2 Epididymal secretory protein E1



P55786
NPEPPS Puromycin-sensitive




aminopeptidase



P06748
NPM1 Nucleophosmin



O75607
NPM3 Nucleoplasmin-3



P15559
NQO1 NAD(P)H dehydrogenase




[quinone] 1



P04150
NR3C1 Glucocorticoid receptor



P01111
NRAS GTPase NRas



O43847
NRD1 Nardilysin



Q8IXM6
NRM Nurim



Q15738
NSDHL Sterol-4-alpha-carboxylate 3-




dehydrogenase, decath



P46459
NSF Vesicle-fusing ATPase



Q08J23
NSUN2 tRNA (cytosine(34)-C(5))-




methyltransferase



P49902
NT5C2 Cytosolic purine 5-nucleotidase



Q9HOP0
NT5C3 Cytosolic 5-nucleotidase 3



Q969T7
NT5C3L Cytosolic 5-nucleotidase III-like




protein



Q5TFE4
NT5DC1 5-nucleotidase domain-




containing protein 1



Q9H857
NT5DC2 5-nucleotidase domain-




containing protein 2



Q86UY8
NT5DC3 5-nucleotidase domain-




containing protein 3



Q9BV86
NTMT1 N-terminal Xaa-Pro-Lys N-




methyltransferase 1



Q9BSD7
NTPCR Cancer-related nucleoside-




triphosphatase



Q02818
NUCB1 Nucleobindin-1



P80303
NUCB2 Nucleobindin-2



Q9Y266
NUDC Nuclear migration protein nudC



Q96RS6
NUDCD1 NudC domain-containing




protein 1



Q9BQG2
NUDT12 Peroxisomal NADH




pyrophosphatase NUDT12



Q9NV35
NUDT15 Probable 8-oxo-dGTP




diphosphatase NUDT15



A8MXV4
NUDT19 Nucleoside diphosphate-linked




moiety X motif 19, m



O43809
NUDT21 Cleavage and polyadenylation




specificity factor su



Q9BW91
NUDT9 ADP-ribose pyrophosphatase,




mitochondrial



Q14980
NUMA1 Nuclear mitotic apparatus




protein 1



P57740
NUP107 Nuclear pore complex protein




Nup107



Q8WUM0
NUP133 Nuclear pore complex protein




Nup133



P49790
NUP153 Nuclear pore complex protein




Nup153



O75694
NUP155 Nuclear pore complex protein




Nup155



Q12769
NUP160 Nuclear pore complex protein




Nup160



Q92621
NUP205 Nuclear pore complex protein




Nup205



Q8TEM1
NUP210 Nuclear pore membrane




glycoprotein 210



P35658
NUP214 Nuclear pore complex protein




Nup214



Q8NFH5
NUP35 Nucleoporin NUP53



Q8NFH4
NUP37 Nucleoporin Nup37



Q8NFH3
NUP43 Nucleoporin Nup43



Q9UKX7
NUP50 Nuclear pore complex protein




Nup50



Q7Z3B4
NUP54 Nucleoporin p54



P37198
NUP62 Nuclear pore glycoprotein p62



Q9BW27
NUP85 Nuclear pore complex protein




Nup85



Q99567
NUP88 Nuclear pore complex protein




Nup88



Q8N1F7
NUP93 Nuclear pore complex protein




Nup93



P52948
NUP98 Nuclear pore complex protein




Nup98-Nup96



P61970
NUTF2 Nuclear transport factor 2



Q9UBU9
NXF1 Nuclear RNA export factor 1



Q6DKJ4
NXN Nucleoredoxin



P04181
OAT Ornithine aminotransfemse,




mitochondrial



Q9NX40
OCIAD1 OCIA domain-containing




protein 1



Q5SWX8
ODR4 Protein odr-4 homolog



Q02218
OGDH 2-oxoglutarate dehydrogenase,




mitochondrial



O15294
OGT UDP-N-acetylglucosamine--peptide




N-acetylglucosami



Q9NTK5
OLA1 Obg-like ATPase 1



Q96E52
OMA1 Metalloendopeptidase OMA1,




mitochondrial



O60313
OPA1 Dynamin-like 120 kDa protein,




mitochondrial



Q9H6K4
OPA3 Optic atrophy 3 protein



Q9UBD5
ORC3 Origin recognition complex




subunit 3



P22059
OSBP Oxysterol-binding protein 1



Q9BZF1
OSBPL8 Oxysterol-binding protein-




related protein 8



Q96SU4
OSBPL9 Oxysterol-binding protein-




related protein 9



Q96FW1
OTUB1 Ubiquitin thioestemse OTUB1



Q15070
OXA1L Mitochondrial inner membrane




protein OXA1L



P13674
P4HA1 Prolyl 4-hydroxylase subunit




alpha-1



P07237
P4HB Protein disulfide-isomerase



Q9UQ80
PA2G4 Proliferation-associated protein




2G4



P11940
PABPC1 Polyadenylate-binding protein 1



Q9H361
PABPC3 Polyadenylate-binding protein 3



Q13310
PABPC4 Polyadenylate-binding protein 4



Q86U42
PABPN1 Polyadenylate-binding protein 2



P68402
PAFAH1B2 Platelet-activating factor




acetylhydrolase IB subu



P22234
PAICS Multifunctional protein ADE2



Q9H074
PAIP1 Polyadenylate-binding protein-




interacting protein



Q13177
PAK2 Serine/threonine-protein kinase




PAK 2



Q9NVE7
PANK4 Pantothenate kinase 4



P51003
PAPOLA Poly(A) polymerase alpha



O43252
PAPSS1 Bifunctional 3-




phosphoadenosine 5-phosphosulfate



P09874
PARP1 Poly [ADP-ribose] polymerase 1



Q96KB5
PBK Lymphokine-activated killer T-cell-




originated prot



Q86U86
PBRM1 Protein polybromo-1



Q15365
PCBP1 Poly(rC)-binding protein 1



Q15366
PCBP2 Poly(rC)-binding protein 2



P57721
PCBP3 Poly(rC)-binding protein 3



Q16822
PCK2 Phosphoenolpyruvate




carboxykinase [GTP], mitochond



Q15154
PCM1 Pericentriolar material 1 protein



P22061
PCMT1 Protein-L-isoaspartate(D-




aspartate) O-methyltransf



P12004
PCNA Proliferating cell nuclear antigen



Q9UHG3
PCYOX1 Prenylcysteine oxidase 1



Q8NBM8
PCYOX1L Prenylcysteine oxidase-like



P49585
PCYT1A Choline-phosphate




cytidylyltransferase A



Q14690
PDCD11 Protein RRP5 homolog



Q53EL6
PDCD4 Programmed cell death protein 4



O14737
PDCD5 Programmed cell death protein 5



O75340
PDCD6 Programmed cell death protein 6



Q8WUM4
PDCD6IP Programmed cell death 6-




interacting protein



Q9H2J4
PDCL3 Phosducin-like protein 3



Q6L8Q7
PDE12 2,5-phosphodiesterase 12



P08559
PDHA1 Pyruvate dehydrogenase E1




component subunit alpha,



P11177
PDHB Pyruvate dehydrogenase E1




component subunit beta,



P30101
PDIA3 Protein disulfide-isomerase A3



P13667
PDIA4 Protein disulfide-isomerase A4



Q15084
PDIA6 Protein disulfide-isomerase A6



O00151
PDLIM1 PDZ and LIM domain protein 1



Q9P0J1
PDP1



Q9NUG6
PDRG1 p53 and DNA damage-regulated




protein 1



Q29RF7
PDS5A Sister chromatid cohesion protein




PDS5 homolog A



O00764
PDXK Pyridoxal kinase



P30086
PEBP1 Phosphatidylethanolamine-




binding protein 1



Q9BY49
PECR Peroxisomal trans-2-enoyl-CoA




reductase



Q9UBV8
PEF1 Peflin



Q9BRX2
PELO Protein pelota homolog



Q8IZL8
PELP1 Proline-, glutamic acid- and




leucine-rich protein



O00541
PES1 Pescadillo homolog



O96011
PEX11B Peroxisomal membrane protein




11B



Q9Y5Y5
PEX16 Peroxisomal membrane protein




PEX16



P40855
PEX19 Peroxisomal biogenesis factor 19



PFAS



O15067
Phosphoribosylformylglycinamidine




synthase



Q9UHV9
PFDN2 Prefoldin subunit 2



Q99471
PFDN5 Prefoldin subunit 5



P17858
PFKL 6-phosphofructokinase, liver type



P08237
PFKM 6-phosphofructokinase, muscle




type



Q01813
PFKP 6-phosphofructokinase type C



P07737
PFN1 Profilin-1



Q96HS1
PGAM5 Serine/threonine-protein




phosphatase PGAM5, mitoch



P00558
PGK1 Phosphoglycerate kinase 1



P07205
PGK2 Phosphoglycerate kinase 2



P36871
PGM1 Phosphoglucomutase-1



O95394
PGM3 Phosphoacetylglucosamine mutase



O00264
PGRMC1 Membrane-associated




progesterone receptor componen



O15173
PGRMC2 Membrane-associated




progesterone receptor componen



P35232
PHB Prohibitin



Q99623
PHB2 Prohibitin-2



O43175
PHGDH D-3-phosphoglycerate




dehydrogenase



Q9BTU6
PI4K2A Phosphatidylinositol 4-kinase




type 2-alpha



Q9UBF8
PI4KB Phosphatidylinositol 4-kinase beta



Q13492
PICALM Phosphatidylinositol-binding




clathrin assembly pro



Q92643
PIGK GPI-anchor transamidase



Q969N2
PIGT GPI transamidase component PIG-T



Q9H490
PIGU Phosphatidylinositol glycan anchor




biosynthesis cl



Q13526
PIN1 Peptidyl-prolyl cis-trans isomerase




NIMA-interacti



Q9UG56
PISD Phosphatidylserine decarboxylase




proenzyme



Q00169
PITPNA Phosphatidylinositol transfer




protein alpha isofor



P48739
PITPNB Phosphatidylinositol transfer




protein beta isoform



Q5JRX3
PITRM1 Presequence protease,




mitochondrial



P30613
PKLR Pyruvate kinase isozymes R/L



P14618
PKM Pyruvate kinase isozymes M1/M2



Q99640
PKMYT1 Membrane-associated tyrosine-




and threonine-specif



Q16512
PKN1 Serine/threonine-protein kinase N1



Q16513
PKN2 Serine/threonine-protein kinase N2



Q9Y446
PKP3 Plakophilin-3



Q8NCC3
PLA2G15 Group XV phospholipase A2



Q8NHP8
PLBD2 Putative phospholipase B-like 2



P19174
PLCG1 1-phosphatidylinositol 4,5-




bisphosphate phosphodie



Q8IV08
PLD3 Phospholipase D3



Q15149
PLEC Plectin



Q99541
PLIN2 Perilipin-2



O60664
PLIN3 Perilipin-3



P53350
PLK1 Serine/threonine-protein kinase




PLK1



Q02809
PLOD1 Procollagen-lysine, 2-




oxoglutarate 5-dioxygenase 1



P13797
PLS3 Plastin-3



Q10713
PMPCA Mitochondrial-processing




peptidase subunit alpha



O75439
PMPCB Mitochondrial-processing




peptidase subunit beta



Q9H307
PNN Pinin



Q96AD5
PNPLA2 Patatin-like phospholipase




domain-containing prote



Q8TCS8
PNPT1 Polyribonucleotide




nucleotidyltransferase 1, mitoc



F8VUJ3
POC1B-GALNT4 Protein POC1B-




GALNT4



Q14181
POLA2 DNA polymerase alpha subunit B



P28340
POLD1 DNA polymerase delta catalytic




subunit



Q9Y257
POLDIP2 Polymerase delta-interacting




protein 2



P24928
POLR2A DNA-directed RNA




polymerase II subunit RPB1



P30876
POLR2B DNA-directed RNA polymerase




II subunit RPB2



O00411
POLRMT DNA-directed RNA




polymerase, mitochondrial



Q15165
PON2 Serum paraoxonase/arylesterase 2



Q99575
POP1 Ribonucleases P/MRP protein




subunit POP1



P16435
POR NADPH--cytochrome P450




reductase



Q9H2U2
PPA2 Inorganic pyrophosphatase 2,




mitochondrial



Q9NQ55
PPAN Suppressor of SWI4 1 homolog



C9J3F9
PPAN-P2RY11 Protein PPAN-P2RY11



Q06203
PPAT Amidophosphoribosyltransferase



Q13356
PPIL2 Peptidyl-prolyl cis-trans




isomerase-like 2



P49593
PPM1F Protein phosphatase 1F



O15355
PPM1G Protein phosphatase 1G



Q9Y570
PPME1 Protein phosphatase




methylesterase 1



P62136
PPP1CA Serine/threonine-protein




phosphatase PP1-alpha cat



P62140
PPP1CB Serine/threonine-protein




phosphatase PP1-beta cata



P36873
PPP1CC Serine/threonine-protein




phosphatase PP1-gamma cat



P67775
PPP2CA Serine/threonine-protein




phosphatase 2A catalytic



P62714
PPP2CB Serine/threonine-protein




phosphatase 2A catalytic



P30153
PPP2R1A Serine/threonine-protein




phosphatase 2A 65 kDa reg



P30154
PPP2R1B Serine/threonine-protein




phosphatase 2A 65 kDa reg



P63151
PPP2R2A Serine/threonine-protein




phosphatase 2A 55 kDa reg



Q15172
PPP2R5A Serine/threonine-protein




phosphatase 2A 56 kDa reg



Q13362
PPP2R5C Serine/threonine-protein




phosphatase 2A 56 kDa reg



Q14738
PPP2R5D Serine/threonine-protein




phosphatase 2A 56 kDa reg



P60510
PPP4C Serine/threonine-protein




phosphatase 4 catalytic s



O00743
PPP6C Serine/threonine-protein




phosphatase 6 catalytic s



Q9UPN7
PPP6R1 Serine/threonine-protein




phosphatase 6 regulatory



Q5H9R7
PPP6R3 Serine/threonine-protein




phosphatase 6 regulatory



P50897
PPT1 Palmitoyl-protein thioesterase 1



Q9UMR5
PPT2 Lysosomal thioesterase PPT2



O43663
PRC1 Protein regulator of cytokinesis 1



P42785
PRCP Lysosomal Pro-X




carboxypeptidase



Q06830
PRDX1 Peroxiredoxin-1



P32119
PRDX2 Peroxiredoxin-2



P30048
PRDX3 Thioredoxin-dependent peroxide




reductase, mitochon



Q13162
PRDX4 Peroxiredoxin-4



P30044
PRDX5 Peroxiredoxin-5, mitochondrial



P30041
PRDX6 Peroxiredoxin-6



Q9HCU5
PREB Prolactin regulatory element-




binding protein



P48147
PREP Prolyl endopeptidase



Q4J6C6
PREPL Prolyl endopeptidase-like



P49643
PRIM2 DNA primase large subunit



P17612
PRKACA cAMP-dependent protein




kinase catalytic subunit a1



P54619
PRKAG1 5-AMP-activated protein




kinase subunit gamma-1



P10644
PRKAR1A cAMP-dependent protein




kinase type I-alpha regulat



P13861
PRKAR2A cAMP-dependent protein




kinase type II-alpha regula



P31323
PRKAR2B cAMP-dependent protein




kinase type II-beta regulat



P05771
PRKCB Protein kinase C beta type



P14314
PRKCSH Glucosidase 2 subunit beta



P78527
PRKDC DNA-dependent protein kinase




catalytic subunit



O75569
PRKRA Interferon-inducible double




stranded RNA-dependent



Q99873
PRMT1 Protein arginine N-




methyltransferase 1



O60678
PRMT3 Protein arginine N-




methyltransferase 3



O14744
PRMT5 Protein arginine N-




methyltransferase 5



Q9UMS4
PRPF19 Pre-mRNA-processing factor 19



Q5VTL8
PRPF38B Pre-mRNA-splicing factor 38B



O75400
PRPF40A Pre-mRNA-processing factor




40 homolog A



O94906
PRPF6 Pre-mRNA-processing factor 6



Q6P2Q9
PRPF8 Pre-mRNA-processing-splicing




factor 8



P48634
PRRC2A Protein PRRC2A



Q9Y520
PRRC2C Protein PRRC2C



P07602
PSAP Proactivator polypeptide



P49768
PSEN1 Presenilin-1



P49810
PSEN2 Presenilin-2



O75475
PSIP1 PC4 and SFRS1-interacting




protein



P25786
PSMA1 Proteasome subunit alpha type-1



P25787
PSMA2 Proteasome subunit alpha type-2



P25788
PSMA3 Proteasome subunit alpha type-3



P25789
PSMA4 Proteasome subunit alpha type-4



P28066
PSMA5 Proteasome subunit alpha type-5



P60900
PSMA6 Proteasome subunit alpha type-6



O14818
PSMA7 Proteasome subunit alpha type-7



P20618
PSMB1 Proteasome subunit beta type-1



P49721
PSMB2 Proteasome subunit beta type-2



P49720
PSMB3 Proteasome subunit beta type-3



P28070
PSMB4 Proteasome subunit beta type-4



P28074
PSMB5 Proteasome subunit beta type-5



P28072
PSMB6 Proteasome subunit beta type-6



Q99436
PSMB7 Proteasome subunit beta type-7



P62191
PSMC1 26S protease regulatory subunit 4



P35998
PSMC2 26S protease regulatory subunit 7



P17980
PSMC3 26S protease regulatory subunit




6A



P43686
PSMC4 26S protease regulatory subunit




6B



P62195
PSMC5 26S protease regulatory subunit 8



P62333
PSMC6 26S protease regulatory subunit




10B



Q99460
PSMD1 26S proteasome non-ATPase




regulatory subunit 1



O75832
PSMD10 26S proteasome non-ATPase




regulatory subunit 10



O00231
PSMD11 26S proteasome non-ATPase




regulatory subunit 11



Q9UNM6
PSMD13 26S proteasome non-ATPase




regulatory subunit 13



O00487
PSMD14 26S proteasome non-ATPase




regulatory subunit 14



Q13200
PSMD2 26S proteasome non-ATPase




regulatory subunit 2



O43242
PSMD3 26S proteasome non-ATPase




regulatory subunit 3



Q16401
PSMD5 26S proteasome non-ATPase




regulatory subunit 5



Q15008
PSMD6 26S proteasome non-ATPase




regulatory subunit 6



P51665
PSMD7 26S proteasome non-ATPase




regulatory subunit 7



P48556
PSMD8 26S proteasome non-ATPase




regulatory subunit 8



Q06323
PSME1 Proteasome activator complex




subunit 1



Q9UL46
PSME2 Proteasome activator complex




subunit 2



P61289
PSME3 Proteasome activator complex




subunit 3



Q92530
PSMF1 Proteasome inhibitor PI31




subunit



O95456
PSMG1 Proteasome assembly chaperone 1



Q8WXF1
PSPC1 Paraspeckle component 1



P26599
PTBP1 Polypyrimidine tract-binding




protein 1



O95758
PTBP3 Polypyrimidine tract-binding




protein 3



Q96EY7
PTCD3 Pentatricopeptide repeat-




containing protein 3, mit



P48651
PTDSS1 Phosphatidylserine synthase 1



Q9BVG9
PTDSS2 Phosphatidylserine synthase 2



Q9H7Z7
PTGES2 Prostaglandin E synthase 2



Q15185
PTGES3 Prostaglandin E synthase 3



Q8N8N7
PTGR2 Prostaglandin reductase 2



Q9P035
PTPLAD1 3-hydroxyacyl-CoA




dehydratase 3



P18031
PTPN1 Tyrosine-protein phosphatase




non-receptor type 1



Q06124
PTPN11 Tyrosine-protein phosphatase




non-receptor type 11



Q9H3S7
PTPN23 Tyrosine-protein phosphatase




non-receptor type 23



Q6NZI2
PTRF Polymerase I and transcript release




factor



Q9Y3E5
PTRH2 Peptidyl-tRNA hydrolase 2,




mitochondrial



Q9UHX1
PUF60 Poly(U)-binding-splicing factor




PUF60



Q14671
PUM1 Pumilio homolog 1



Q96PZ0
PUS7 Pseudouridylate synthase 7




homolog



Q15269
PWP2 Periodic tryptophan protein 2




homolog



Q9NR77
PXMP2 Peroxisomal membrane protein 2



P32322
PYCR1 Pyrroline-5-carboxylate




reductase 1, mitochondrial



Q96C36
PYCR2 Pyrroline-5-carboxylate




reductase 2



P11216
PYGB Glycogen phosphorylase, brain




form



P06737
PYGL Glycogen phosphorylase, liver




form



P20742
PZP Pregnancy zone protein



Q5XKP0
QIL1 Protein QIL1



Q96PU8
QKI Protein quaking



P61026
RAB10 Ras-related protein Rab-10



P62491
RAB11A Ras-related protein Rab-11A



Q15907
RAB11B Ras-related protein Rab-11B



P61106
RAB14 Ras-related protein Rab-14



Q9NP72
RAB18 Ras-related protein Rab-18



P62820
RAB1A Ras-related protein Rab-1A



Q9H0U4
RAB1B Ras-related protein Rab-1B



Q9UL25
RAB21 Ras-related protein Rab-21



Q969Q5
RAB24 Ras-related protein Rab-24



P61019
RAB2A Ras-related protein Rab-2A



Q8WUD1
RAB2B Ras-related protein Rab-2B



Q15042
RAB3GAP1 Rab3 GTPase-activating




protein catalytic subunit



Q9H2M9
RAB3GAP2 Rab3 GTPase-activating




protein non-catalytic subun



Q8TBN0
RAB3IL1 Guanine nucleotide exchange




factor for Rab-3A



P20339
RAB5A Ras-related protein Rab-5A



P61020
RAB5B Ras-related protein Rab-5B



P51148
RAB5C Ras-related protein Rab-5C



P51149
RAB7A Ras-related protein Rab-7a



P51151
RAB9A Ras-related protein Rab-9A



Q7Z6M1
RABEPK Rab9 effector protein with




kelch motifs



P54727
RAD23B UV excision repair protein




RAD23 homolog B



Q92878
RAD50 DNA repair protein RAD50



P78406
RAE1 mRNA export factor



P11233
RALA Ras-related protein Ral-A



Q9UKM9
RALY RNA-binding protein Raly



P62826
RAN GTP-binding nuclear protein Ran



P43487
RANBP1 Ran-specific GTPase-activating




protein



P49792
RANBP2 E3 SUMO-protein ligase




RanBP2



P62834
RAP1A Ras-related protein Rap-1A



P61224
RAP1B Ras-related protein Rap-1b



P61225
RAP2B Ras-related protein Rap-2b



Q9Y3L5
RAP2C Ras-related protein Rap-2c



P54136
RARS Arginine--tRNA ligase,




cytoplasmic



Q8IY67
RAVER1 Ribonucleoprotein PTB-




binding 1



Q09028
RBBP4 Histone-binding protein RBBP4



Q16576
RBBP7 Histone-binding protein RBBP7



Q9NWB1
RBFOX1 RNA binding protein fox-1




homolog 1



O43251
RBFOX2 RNA binding protein fox-1




homolog 2



P98175
RBM10 RNA-binding protein 10



Q8IXT5
RBM12B RNA-binding protein 12B



Q96PK6
RBM14 RNA-binding protein 14



B0LM41
RBM14/RBM4 Protein RBM14-RBM4



Q96T37
RBM15 Putative RNA-binding protein 15



P49756
RBM25 RNA-binding protein 25



Q9NW13
RBM28 RNA-binding protein 28



P98179
RBM3 Putative RNA-binding protein 3



Q14498
RBM39 RNA-binding protein 39



Q9BWF3
RBM4 RNA-binding protein 4



Q9BQ04
RBM4B RNA-binding protein 4B



P29558
RBMS1 RNA-binding motif, single-




stranded-interacting pro



P38159
RBMX RNA-binding motif protein, X




chromosome



Q96E39
RBMXL1 RNA binding motif protein, X-




linked-like-1



Q15293
RCN1 Reticulocalbin-1



Q14257
RCN2 Reticulocalbin-2



Q8TC12
RDH11 Retinol dehydrogenase 11



Q8NBN7
RDH13 Retinol dehydrogenase 13



Q9HBH5
RDH14 Retinol dehydrogenase 14



P35241
RDX Radixin



P46063
RECQL ATP-dependent DNA helicase




Q1



Q00765
REEP5 Receptor expression-enhancing




protein 5



O15258
RER1 Protein RER1



Q6NUM9
RETSAT All-trans-retinol 13,14-




reductase



P35250
RFC2 Replication factor C subunit 2



P40938
RFC3 Replication factor C subunit 3



P35249
RFC4 Replication factor C subunit 4



P40937
RFC5 Replication factor C subunit 5



Q96AA3
RFT1 Protein RFT1 homolog



Q15382
RHEB GTP-binding protein Rheb



P61586
RHOA Transforming protein RhoA



P08134
RHOC Rho-related GTP-binding protein




RhoC



Q8IXI1
RHOT2 Mitochondrial Rho GTPase 2



Q5UIP0
RIF1 Telomere-associated protein RIF1



Q6NUQ1
RINT1 RAD50-interacting protein 1



Q9BVS4
RIOK2 Serine/threonine-protein kinase




RIO2



O43353
RIPK2 Receptor-interacting




serine/threonine-protein kina



Q9NWS8
RMND1 Required for meiotic nuclear




division protein 1 ho



O00584
RNASET2 Ribonuclease T2



Q9H920
RNF121 RING finger protein 121



Q9UBS8
RNF14 E3 ubiquitin-protein ligase




RNF14



Q5VTR2
RNF20 E3 ubiquitin-protein ligase




BRE1A



Q9H4A4
RNPEP Aminopeptidase B



P27694
RPA1 Replication protein A 70 kDa




DNA-binding subunit



P15927
RPA2 Replication protein A 32 kDa




subunit



P62906
RPL10A 60S ribosomal protein L10a



Q02543
RPL18A 60S ribosomal protein L18a



P62750
RPL23 A 60S ribosomal protein L23a



P61254
RPL26 60S ribosomal protein L26



P62888
RPL30 60S ribosomal protein L30



P36578
RPL4 60S ribosomal protein L4



P18124
RPL7 60S ribosomal protein L7



P62424
RPL7A 60S ribosomal protein L7a



Q6DKI1
RPL7L1 60S ribosomal protein L7-like 1



P62917
RPL8 60S ribosomal protein L8



P05387
RPLP2 60S acidic ribosomal protein P2



P04843
RPN1 Dolichyl-




diphosphooligosaccharide-protein




glycosy



P04844
RPN2 Dolichyl-




diphosphooligosaccharide-protein




glycosy



Q9NQG5
RPRD1B Regulation of nuclear pre-




mRNA domain-containing p



P46783
RPS10 40S ribosomal protein S10



P62277
RPS13 40S ribosomal protein S13



P62244
RPS15A 40S ribosomal protein S15a



P62249
RPS16 40S ribosomal protein S16



P62269
RPS18 40S ribosomal protein S18



P15880
RPS2 40S ribosomal protein S2



P62266
RPS23 40S ribosomal protein S23



P62847
RPS24 40S ribosomal protein S24



P62979
RPS27A Ubiquitin-40S ribosomal protein




S27a



P23396
RPS3 40S ribosomal protein S3



P61247
RPS3A 40S ribosomal protein S3a



Q15418
RPS6KA1 Ribosomal protein S6 kinase




alpha-1



Q15349
RPS6KA2 Ribosomal protein S6 kinase




alpha-2



P51812
RPS6KA3 Ribosomal protein S6 kinase




alpha-3



P62241
RPS8 40S ribosomal protein S8



A6NE09
RPSAP58 Protein RPSAP58



Q8IZ73
RPUSD2 RNA pseudouridylate synthase




domain-containing pro



Q9HB90
RRAGC Ras-related GTP-binding protein C



Q9P2E9
RRBP1 Ribosome-binding protein 1



P23921
RRM1 Ribonucleoside-diphosphate




reductase large subunit



P31350
RRM2 Ribonucleoside-diphosphate




reductase subunit M2



P56182
RRP1 Ribosomal RNA processing




protein 1 homolog A



Q5JTH9
RRP12 RRP12-like protein



Q14684
RRP1B Ribosomal RNA processing




protein 1 homolog B



O76021
RSL1D1 Ribosomal L1 domain-




containing protein 1



Q92541
RTF1 RNA polymerase-associated




protein RTF1 homolog



O95197
RTN3 Reticulon-3



Q9NQC3
RTN4 Reticulon-4



Q8WWV3
RTN4IP1 Reticulon-4-interacting protein




1, mitochondrial



Q9Y265
RUVBL1 RuvB-like 1



Q9Y230
RUVBL2 RuvB-like 2



Q9NTJ5
SACM1L Phosphatidylinositide




phosphatase SAC1



Q15424
SAFB Scaffold attachment factor B1



Q14151
SAFB2 Scaffold attachment factor B2



Q9Y512
SAMM50 Sorting and assembly




machinery component 50 homolo



Q9NSI8
SAMSN1 SAM domain-containing




protein SAMSN-1



Q9NR31
SAR1A GTP-binding protein SAR1a



Q9Y6B6
SAR1B GTP-binding protein SAR1b



P49591
SARS Serine--tRNA ligase, cytoplasmic



Q9NP81
SARS2 Serine--tRNA ligase,




mitochondrial



O43290
SART1 U4/U6.U5 tri-snRNP-associated




protein 1



Q15020
SART3 Squamous cell carcinoma antigen




recognized by T-ce



O14828
SCAMP3 Secretory carrier-associated




membrane protein 3



Q8WTV0
SCARB1 Scavenger receptor class B




member 1



Q14108
SCARB2 Lysosome membrane protein 2



Q8NBX0
SCCPDH Saccharopine dehydrogenase-




like oxidoreductase



O00767
SCD Acyl-CoA desaturase



Q8WVM8
SCFD1 Sec1 family domain-containing




protein 1



O75880
SCO1 Protein SCO1 homolog,




mitochondrial



O43819
SCO2 Protein SCO2 homolog,




mitochondrial



P22307
SCP2 Non-specific lipid-transfer protein



Q9HB40
SCPEP1 Retinoid-inducible serine




carboxypeptidase



O00560
SDCBP Syntenin-1



Q9BRK5
SDF4 45 kDa calcium-binding protein



P31040
SDHA Succinate dehydrogenase




[ubiquinone] flavoprotein



P21912
SDHB Succinate dehydrogenase




[ubiquinone] iron-sulfur s



P67812
SEC11A Signal peptidase complex




catalytic subunit SEC11A



P55735
SEC13 Protein SEC13 homolog



O15027
SEC16A Protein transport protein




Sec16A



O75396
SEC22B Vesicle-trafficking protein




SEC22b



Q15436
SEC23A Protein transport protein




Sec23A



Q15437
SEC23B Protein transport protein Sec23B



Q9Y6Y8
SEC23IP 5EC23-interacting protein



P53992
SEC24C Protein transport protein Sec24C



O94979
SEC31A Protein transport protein




Sec31A



P61619
SEC61A1 Protein transport protein Sec61




subunit alpha isof



Q99442
SEC62 Translocation protein SEC62



Q9UGP8
SEC63 Translocation protein SEC63




homolog



Q9UBV2
SEL1L Protein sel-1 homolog 1



Q15019
SEPT2 Septin-2



Q16181
SEPT7 Septin-7



Q8NC51
SERBP1 Plasminogen activator inhibitor




1 RNA-binding prot



P30740
SERPINB1 Leukocyte elastase inhibitor



P29508
SERPINB3 Serpin B3



P35237
SERPINB6 Serpin B6



P50454
SERPINH1 Serpin H1



P58004
SESN2 Sestrin-2



Q01105
SET Protein SET



Q15637
SF1 Splicing factor 1



Q15459
SF3A1 Splicing factor 3A subunit 1



Q12874
SF3A3 Splicing factor 3A subunit 3



O75533
SF3B1 Splicing factor 3B subunit 1



Q13435
SF3B2 Splicing factor 3B subunit 2



Q9BWJ5
SF3B5 Splicing factor 3B subunit 5



P23246
SFPQ Splicing factor, proline- and




glutamine-rich



Q9H9B4
SFXN1 Sideroflexin-1



Q96NB2
SFXN2 Sideroflexin-2



Q6P4A7
SFXN4 Sideroflexin-4



O95470
SGPL1 Sphingosine-1-phosphate lyase 1



O43765
SGTA Small glutamine-rich




tetratricopeptide repeat-cont



Q99961
SH3GL1 Endophilin-A2



Q9Y371
SH3GLB1 Endophilin-B1



P34896
SHMT1 Serine




hydroxymethyltransferase, cytosolic



P34897
SHMT2 Serine




hydroxymethyltransferase, mitochondrial



Q9HAT2
SIAE Sialate O-acetylesterase



Q99720
SIGMAR1 Sigma non-opioid intracellular




receptor 1



Q96ST3
SIN3A Paired amphipathic helix protein




Sin3a



P42285
SKIV2L2 Superkiller viralicidic activity




2-like 2



P63208
SKP1 S-phase kinase-associated protein 1



P41440
SLC19A1 Folate transporter 1



P43007
SLC1A4 Neutral amino acid transporter A



Q15758
SLC1A5 Neutral amino acid transporter




B(0)



P53007
SLC25A1 Tricarboxylate transport




protein, mitochondrial



Q9UBX3
SLC25A10 Mitochondrial dicarboxylate




carrier



Q02978
SLC25A11 Mitochondrial 2-




oxoglutarate/malate carrier protei



O75746
SLC25A12 Calcium-binding




mitochondrial carrier protein Aral



Q9UJS0
SLC25A13 Calcium-binding




mitochondrial carrier protein Aral



Q9Y619
SLC25A15 Mitochondrial ornithine




transporter 1



P16260
SLC25A16 Graves disease carrier protein



Q9HC21
SLC25A19 Mitochondrial thiamine




pyrophosphate carrier



O43772
SLC25A20 Mitochondrial




carnitine/acylcarnitine carrier prot



Q9H936
SLC25A22 Mitochondrial glutamate




carrier 1



Q6NUK1
SLC25A24 Calcium-binding




mitochondrial carrier protein SCaM



Q70HW3
SLC25A26 S-adenosylmethionine




mitochondrial carrier protein



Q00325
SLC25A3 Phosphate carrier protein,




mitochondrial



Q5SVS4
SLC25A30 Kidney mitochondrial carrier




protein 1



Q9H2D1
SLC25A32 Mitochondrial folate




transporter/carrier



Q9BSK2
SLC25A33 Solute carrier family 25




member 33



P12235
SLC25A4 ADP/ATP translocase 1



Q8TBP6
SLC25A40 Solute carrier family 25




member 40



P05141
SLC25A5 ADP/ATP translocase 2



P12236
SLC25A6 ADP/ATP translocase 3



O14975
SLC27A2 Very long-chain acyl-CoA




synthetase



P11166
SLC2A1 Solute carrier family 2,




facilitated glucose trans



Q8TAD4
SLC30A5 Zinc transporter 5



Q6NXT4
SLC30A6 Zinc transporter 6



Q8NEW0
SLC30A7 Zinc transporter 7



Q6PML9
SLC30A9 Zinc transporter 9



O00400
SLC33A1 Acetyl-coenzyme A transporter 1



Q8TB61
SLC35B2 Adenosine 3-phospho 5-




phosphosulfate transporter



Q8IXU6
SLC35F2 Solute carrier family 35




member F2



Q96QD8
SLC38A2 Sodium-coupled neutral amino




acid transporter 2



P08195
SLC3A2 4F2 cell-surface antigen heavy




chain



P30825
SLC7A1 High affinity cationic amino




acid transporter 1



Q9H2G2
SLK STE20-like serine/threonine-protein




kinase



Q8WU79
SMAP2 Stromal membrane-associated




protein 2



P28370
SMARCA1 Probable global transcription




activator SNF2L1



P51532
SMARCA4 Transcription activator BRG1



O60264
SMARCA5 SWI/SNF-related matrix-




associated actin-dependent



Q12824
SMARCB1 SWI/SNF-related matrix-




associated actin-dependent



Q92922
SMARCC1 SWI/SNF complex subunit




SMARCC1



Q14683
SMC1A Structural maintenance of




chromosomes protein 1A



O95347
SMC2 Structural maintenance of




chromosomes protein 2



Q9UQE7
SMC3 Structural maintenance of




chromosomes protein 3



Q9NTJ3
SMC4 Structural maintenance of




chromosomes protein 4



A6NHR9
SMCHD1 Structural maintenance of




chromosomes flexible hin



Q16637
SMN1 Survival motor neuron protein



P17405
SMPD1 Sphingomyelin




phosphodiesterase



Q9NXE4
SMPD4 Sphingomyelin




phosphodiesterase 4



Q2TAY7
SMU1 WD40 repeat-containing protein




SMU1



Q9H7B4
SMYD3 SET and MYND domain-




containing protein 3



O00161
SNAP23 Synaptosomal-associated




protein 23



O95721
SNAP29 Synaptosomal-associated




protein 29



Q7KZF4
SND1 Staphylococcal nuclease domain-




containing protein



O75643
SNRNP200 U5 small nuclear




ribonucleoprotein 200 kDa helicas



Q96DI7
SNRNP40 U5 small nuclear




ribonucleoprotein 40 kDa protein



P08621
SNRNP70 U1 small nuclear




ribonucleoprotein 70 kDa



P09012
SNRPA U1 small nuclear




ribonucleoprotein A



P62314
SNRPD1 Small nuclear ribonucleoprotein




Sm D1



Q13573
SNW1 SNW domain-containing protein 1



Q13596
SNX1 Sorting nexin-1



O60749
SNX2 Sorting nexin-2



Q96L92
SNX27 Sorting nexin-27



Q9Y5X3
SNX5 Sorting nexin-5



Q9UNH7
SNX6 Sorting nexin-6



Q9Y5X1
SNX9 Sorting nexin-9



P35610
SOAT1 Sterol O-acyltransferase 1



P04179
SOD2 Superoxide dismutase



P18583
SON Protein SON



Q99523
SORT1 Sortilin



O60271
SPAG9 C-Jun-amino-terminal kinase-




interacting protein 4



Q8NB90
SPATA5 Spermatogenesis-associated




protein 5



Q8NBT2
SPC24 Kinetochore protein Spc24



Q9HBM1
SPC25 Kinetochore protein Spc25



Q15005
SPCS2 Signal peptidase complex subunit 2



Q8N0X7
SPG20 Spartin



Q9H2V7
SPNS1 Protein spinster homolog 1



P35270
SPR Sepiapterin reductase



P02549
SPTA1 Spectrin alpha chain, erythrocytic 1



Q13813
SPTAN1 Spectrin alpha chain, non-




erythrocytic 1



Q01082
SPTBN1 Spectrin beta chain, non-




erythrocytic 1



O15269
SPTLC1 Serine palmitoyltransferase 1



O15270
SPTLC2 Serine palmitoyltransferase 2



Q14534
SQLE Squalene monooxygenase



P30626
SRI Sorcin



P19623
SRM Spermidine synthase



P61011
SRP54 Signal recognition particle 54 kDa




protein



Q9UHB9
SRP68 Signal recognition particle 68 kDa




protein



O76094
SRP72 Signal recognition particle 72 kDa




protein



Q965B4
SRPK1 SRSF protein kinase 1



P08240
SRPR Signal recognition particle receptor




subunit alpha



Q9Y5M8
SRPRB Signal recognition particle




receptor subunit beta



Q9UQ35
SRRM2 Serine/arginine repetitive matrix




protein 2



Q9BXP5
SRRT Serrate RNA effector molecule




homolog



O75494
SRSF10 Serine/arginine-rich splicing




factor 10



P84103
SRSF3 Serine/arginine-rich splicing




factor 3



Q16629
SRSF7 Serine/arginine-rich splicing




factor 7



Q13242
SRSF9 Serine/arginine-rich splicing




factor 9



Q04837
SSBP1 Single-stranded DNA-binding




protein, mitochondrial



P43307
SSR1 Translocon-associated protein




subunit alpha



P51571
SSR4 Translocon-associated protein




subunit delta



Q08945
SSRP1 FACT complex subunit SSRP1



P50502
ST13 Hsc70-interacting protein



Q8N3U4
STAG2 Cohesin subunit SA-2



Q92783
STAM Signal transducing adapter




molecule 1



O95772
STARD3NL MLN64 N-terminal domain




homolog



Q9NQZ5
STARD7 StAR-related lipid transfer




protein 7, mitochondri



P42224
STAT1 Signal transducer and activator of




transcription 1



P52630
STAT2 Signal transducer and activator of




transcription 2



P40763
STAT3 Signal transducer and activator of




transcription 3



P42229
STAT5A Signal transducer and activator




of transcription 5



P51692
STAT5B Signal transducer and activator




of transcription 5



O95793
STAU1 Double-stranded RNA-binding




protein Staufen homolo



Q13586
STIM1 Stromal interaction molecule 1



P31948
STIP1 Stress-induced-phosphoprotein 1



Q9Y6E0
STK24 Serine/threonine-protein kinase




24



Q13188
STK3 Serine/threonine-protein kinase 3



Q13043
STK4 Serine/threonine-protein kinase 4



P16949
STMN1 Stathmin



Q9UJZ1
STOML2 Stomatin-like protein 2



Q9Y3F4
STRAP Serine-threonine kinase receptor-




associated protei



Q96519
STRBP Spermatid perinuclear RNA-




binding protein



P46977
STT3A Dolichyl-




diphosphooligosaccharide--protein




glycosy



Q8TCJ2
STT3B Dolichyl-




diphosphooligosaccharide--protein




glycosy



Q9UNE7
STUB1 E3 ubiquitin-protein ligase CHIP



O60499
STX10 Syntaxin-10



Q86Y82
STX12 Syntaxin-12



Q9P2W9
STX18 Syntaxin-18



Q13190
STX5 Syntaxin-5



O43752
STX6 Syntaxin-6



Q15833
STXBP2 Syntaxin-binding protein 2



O00186
STXBP3 Syntaxin-binding protein 3



Q96I99
SUCLG2 Succinyl-CoA ligase [GDP-




forming] subunit beta, mi



Q8IWZ8
SUGP1 SURP and G-patch domain-




containing protein 1



O94901
SUN1 SUN domain-containing protein 1



Q9UH99
SUN2 SUN domain-containing protein 2



Q9Y5B9
SUPT16H FACT complex subunit SPT16



O00267
SUPT5H Transcription elongation factor




SPT5



Q7KZ85
SUPT6H Transcription elongation factor




SPT6



O15260
SURF4 Surfeit locus protein 4



Q15022
SUZ12 Polycomb protein SUZ12



Q96A49
SYAP1 Synapse-associated protein 1



Q92797
SYMPK Symplekin



O60506
SYNCRIP Heterogeneous nuclear




ribonucleoprotein Q



Q9Y6A5
TACC3 Transforming acidic coiled-coil-




containing protein



Q9BSH4
TACO1 Translational activator of




cytochrome c oxidase 1



Q92804
TAF15 TATA-binding protein-associated




factor 2N



P37802
TAGLN2 Transgelin-2



Q13148
TARDBP TAR DNA-binding protein 43



P26639
TARS Threonine--tRNA ligase,




cytoplasmic



Q9BW92
TARS2 Threonine--tRNA ligase,




mitochondrial



Q8TC07
TBC1D15 TBC1 domain family member




15



Q99426
TBCB Tubulin-folding cofactor B



Q9BTW9
TBCD Tubulin-specific chaperone D



Q15813
TBCE Tubulin-specific chaperone E



Q9Y4P3
TBL2 Transducin beta-like protein 2



Q12788
TBL3 Transducin beta-like protein 3



Q969Z0
TBRG4 Protein TBRG4



P23193
TCEA1 Transcription elongation factor A




protein 1



Q13428
TCOF1 Treacle protein



P17987
TCP1 T-complex protein 1 subunit alpha



Q9Y2W6
TDRKH Tudor and KH domain-




containing protein



Q9NZ01
TECR Trans-2,3-enoyl-CoA reductase



Q9Y4R8
TELO2 Telomere length regulation




protein TEL2 homolog



Q9NXF1
TEX10 Testis-expressed sequence 10




protein



Q00059
TFAM Transcription factor A,




mitochondrial



Q92734
TFG Protein TFG



P02786
TFRC Transferrin receptor protein 1



P21980
TGM2 Protein-glutamine gamma-




glutamyltransferase 2



Q08188
TGM3 Protein-glutamine gamma-




glutamyltransferase E



Q96RS0
TGS1 Trimethylguanosine synthase



Q8IXH7
TH1L Negative elongation factor C/D



Q96FV9
THOC1 THO complex subunit 1



Q96J01
THOC3 THO complex subunit 3



P52888
THOP1 Thimet oligopeptidase



Q9Y2W1
THRAP3 Thyroid hormone receptor-




associated protein 3



Q9BV44
THUMPD3 THUMP domain-containing




protein 3



P31483
TIA1 Nucleolysin TIA-1 isoform p40



Q01085
TIAL1 Nucleolysin TIAR



P62072
TIMM10 Mitochondrial import inner




membrane translocase su



Q9Y5L4
TIMM13 Mitochondrial import inner




membrane translocase su



Q99595
TIMM17A Mitochondrial import inner




membrane translocase su



O60830
TIMM17B Mitochondrial import inner




membrane translocase su



O14925
TIMM23 Mitochondrial import inner




membrane translocase su



Q5SRD1
TIMM23B Putative mitochondrial import




inner membrane trans



O43615
TIMM44 Mitochondrial import inner




membrane translocase su



Q3ZCQ8
TIMM50 Mitochondrial import inner




membrane translocase su



Q9NPL8
TIMMDC1 Translocase of inner




mitochondrial membmne domain



O75663
TIPRL TIP41-like protein



Q6JUT2
TIRAP3 TIR domain-containing adapter




molecule 2



Q86UE8
TLK2 Serine/threonine-protein kinase




tousled-like 2



E9PSI1
TM9SF1 Transmembrane 9 superfamily




member 1



Q99805
TM9SF2 Transmembrane 9 superfamily




member 2



Q9HD45
TM9SF3 Transmembrane 9 superfamily




member 3



Q92544
TM9SF4 Transmembrane 9 superfamily




member 4



P55061
TMBIM6 Bax inhibitor 1



Q9UM00
TMCO1 Transmembrane and coiled-coil




domain-containing pr



Q13445
TMED1 Transmembrane emp24 domain-




containing protein 1



P49755
TMED10 Transmembrane emp24




domain-containing protein 10



Q15363
TMED2 Transmembrane emp24 domain-




containing protein 2



Q9Y3A6
TMED5 Transmembrane emp24 domain-




containing protein 5



Q9Y3B3
TMED7 Transmembrane emp24 domain-




containing protein 7



Q9BVK6
TMED9 Transmembrane emp24 domain-




containing protein 9



Q9H061
TMEM126A Transmembrane protein




126A



Q8IUX1
TMEM126B Transmembrane protein




126B



Q9P0S9
TMEM14C Transmembrane protein 14C



Q9NX00
TMEM160 Transmembrane protein 160



Q9NX61
TMEM161A Transmembrane protein




161A



Q9HC07
TMEM165 Transmembrane protein 165



Q86WV6
TMEM173 Transmembrane protein 173



O14524
TMEM194A Transmembrane protein




194A



Q8N511
TMEM199 Transmembrane protein 199



Q6UW68
TMEM205 Transmembrane protein 205



Q9H813
TMEM206 Transmembrane protein 206



Q96SK2
TMEM209 Transmembrane protein 209



Q6NUQ4
TMEM214 Transmembrane protein 214



P57088
TMEM33 Transmembrane protein 33



Q9NVV0
TMEM38B Trimeric intracellular cation




channel type B



Q9BTV4
TMEM43 Transmembrane protein 43



Q9BTX1
TMEM48 Nucleoporin NDC1



Q9BXS4
TMEM59 Transmembrane protein 59



Q6PI78
TMEM65 Transmembrane protein 65



Q96MH6
TMEM68 Transmembrane protein 68



Q9BUB7
TMEM70 Transmembrane protein 70,




mitochondrial



Q8NBN3
TMEM87A Transmembrane protein 87A



Q5BJF2
TMEM97 Transmembrane protein 97



P28289
TMOD1 Tropomodulin-1



Q9NYL9
TMOD3 Tropomodulin-3



P42166
TMPO Lamina-associated polypeptide 2,




isoform alpha



P42167
TMPO Lamina-associated polypeptide 2,




isoforms beta/gam



Q6ZXV5
TMTC3 Transmembrane and TPR repeat-




containing protein 3



Q9H3N1
TMX1 Thioredoxin-related




transmembrane protein 1



Q96JJ7
TMX3 Protein disulfide-isomerase TMX3



Q9H1E5
TMX4 Thioredoxin-related




transmembrane protein 4



Q9C0C2
TNKS1BP1 182 kDa tankyrase-1-binding




protein



Q92973
TNPO1 Transportin-1



O14787
TNPO2 Transportin-2



Q9Y5L0
TNPO3 Transportin-3



O60784
TOM1 Target of Myb protein 1



Q9NS69
TOMM22 Mitochondrial import receptor




subunit TOM22 homolo



O96008
TOMM40 Mitochondrial import receptor




subunit TOM40 homolo



O94826
TOMM70A Mitochondrial import




receptor subunit TOM70



P11388
TOP2A DNA topoisomerase 2-alpha



Q02880
TOP2B DNA topoisomerase 2-beta



O14656
TOR1A Torsin-1A



Q5JTV8
TOR1AIP1 Torsin-1A-interacting protein 1



P04637
TP53 Cellular tumor antigen p53



O43399
TPD52L2 Tumor protein D54



P06753
TPM3 Tropomyosin alpha-3 chain



P67936
TPM4 Tropomyosin alpha-4 chain



O14773
TPP1 Tripeptidyl-peptidase 1



P12270
TPR Nucleoprotein TPR



P13693
TPT1 Tmnslationally-controlled tumor




protein



Q9ULW0
TPX2 Targeting protein for Xklp2



Q13595
TRA2A Transformer-2 protein homolog




alpha



P62995
TRA2B Transformer-2 protein homolog




beta



Q9H4I3
TRABD TraB domain-containing protein



Q15629
TRAM1 Translocating chain-associated




membrane protein 1



Q12931
TRAP1 Heat shock protein 75 kDa,




mitochondrial



Q13263
TRIM28 Transcription intermediary




factor 1-beta



Q9UPN9
TRIM33 E3 ubiquitin-protein ligase




TRIM33



Q14669
TRIP12 E3 ubiquitin-protein ligase




TRIP12



Q15645
TRIP13 Pachytene checkpoint protein 2




homolog



Q7L0Y3
TRMT10C Mitochondrial ribonuclease P




protein 1



Q7Z4G4
TRMT11 tRNA (guanine(10)-N2)-




methyltransferase homolog



Q7Z2T5
TRMT1L TRMT1-like protein



Q8IZ69
TRMT2A tRNA (uracil-5-)-




methyltransferase homolog A



P10155
TROVE2 60 kDa SS-A/Ro




ribonucleoprotein



P43897
TSFM Elongation factor Ts,




mitochondrial



Q99816
TSG101 Tumor susceptibility gene 101




protein



Q15631
TSN Translin



Q99598
TSNAX Translin-associated protein X



O43657
TSPAN6 Tetraspanin-6



Q2NL82
TSR1 Pre-rRNA-processing protein




TSR1 homolog



Q99614
TTC1 Tetratricopeptide repeat protein 1



Q6DKK2
TTC19 Tetratricopeptide repeat protein




19, mitochondrial



Q6P3X3
TTC27 Tetratricopeptide repeat protein




27



Q6PGP7
TTC37 Tetratricopeptide repeat protein




37



Q5R3I4
TTC38 Tetratricopeptide repeat protein




38



O95801
TTC4 Tetratricopeptide repeat protein 4



Q14166
TTLL12 Tubulin-tyrosine ligase-like




protein 12



Q9C0H2
TTYH3 Protein tweety homolog 3



Q71U36
TUBA1A Tubulin alpha-1A chain



P68363
TUBA1B Tubulin alpha-1B chain



Q9BQE3
TUBA1C Tubulin alpha-1C chain



Q13748
TUBA3C Tubulin alpha-3C/D chain



P68366
TUBA4A Tubulin alpha-4A chain



Q9NY65
TUBA8 Tubulin alpha-8 chain



P07437
TUBB Tubulin beta chain



Q9H4B7
TUBB1 Tubulin beta-1 chain



Q13885
TUBB2A Tubulin beta-2A chain



Q9BVA1
TUBB2B Tubulin beta-2B chain



Q13509
TUBB3 Tubulin beta-3 chain



P04350
TUBB4A Tubulin beta-4A chain



P68371
TUBB4B Tubulin beta-4B chain



Q9BUF5
TUBB6 Tubulin beta-6 chain



Q3ZCM7
TUBB8 Tubulin beta-8 chain



P23258
TUBG1 Tubulin gamma-1 chain



Q9BSJ2
TUBGCP2 Gamma-tubulin complex




component 2



P49411
TUFM Elongation factor Tu,




mitochondrial



Q6IBS0
TWF2 Twinfilin-2



P40222
TXLNA Alpha-taxilin



P10599
TXN Thioredoxin



Q99757
TXN2 Thioredoxin, mitochondrial



O95881
TXNDC12 Thioredoxin domain-




containing protein 12



Q9BRA2
TXNDC17 Thioredoxin domain-




containing protein 17



Q8NBS9
TXNDC5 Thioredoxin domain-




containing protein 5



O43396
TXNL1 Thioredoxin-like protein 1



Q16881
TXNRD1 Thioredoxin reductase 1,




cytoplasmic



P04818
TYMS Thymidylate synthase



Q2T9J0
TYSND1 Peroxisomal leader peptide-




processing protease



Q01081
U2AF1 Splicing factor U2AF 35 kDa




subunit



P26368
U2AF2 Splicing factor U2AF 65 kDa




subunit



O15042
U2SURP U2 snRNP-associated SURP




motif-containing protein



P22314
UBA1 Ubiquitin-like modifier-activating




enzyme 1



Q9UBT2
UBA2 SUMO-activating enzyme subunit 2



P62987
UBA52 Ubiquitin-60S ribosomal protein




L40



A0AVT1
UBA6 Ubiquitin-like modifier-activating




enzyme 6



Q9BSL1
UBAC1 Ubiquitin-associated domain-




containing protein 1



Q5T6F2
UBAP2 Ubiquitin-associated protein 2



Q14157
UBAP2L Ubiquitin-associated protein 2-




like



J3QRK5
UBBP4 Protein UBBP4



P63279
UBE2I SUMO-conjugating enzyme




UBC9



P68036
UBE2L3 Ubiquitin-conjugating enzyme




E2 L3



P61081
UBE2M NEDD8-conjugating enzyme




Ubc12



P61088
UBE2N Ubiquitin-conjugating enzyme




E2 N



Q9C0C9
UBE2O Ubiquitin-conjugating enzyme




E2 O



Q7Z7E8
UBE2Q1 Ubiquitin-conjugating enzyme




E2 Q1



Q15386
UBE3C Ubiquitin-protein ligase E3C



Q9UMX0
UBQLN1 Ubiquilin-1



Q9UHD9
UBQLN2 Ubiquilin-2



Q9NRR5
UBQLN4 Ubiquilin-4



P17480
UBTF Nucleolar transcription factor 1



Q04323
UBXN1 UBX domain-containing protein 1



P09936
UCHL1 Ubiquitin carboxyl-terminal




hydrolase isozyme L1



P15374
UCHL3 Ubiquitin carboxyl-terminal




hydrolase isozyme L3



Q9Y5K5
UCHL5 Ubiquitin carboxyl-terminal




hydrolase isozyme L5



O94874
UFL1 E3 UFM1-protein ligase 1



Q16739
UGCG Ceramide glucosyltransferase



Q9NYU2
UGGT1 UDP-glucose:glycoprotein




glucosyltransferase 1



Q6BDS2
UHRF1BP1 UHRF1-binding protein 1



Q13432
UNC119 Protein unc-119 homolog A



A6NIH7
UNC119B Protein unc-119 homolog B



Q70J99
UNC13D Protein unc-13 homolog D



Q9H3U1
UNC45A Protein unc-45 homolog A



A4D2Q0
UNC84A SUN domain-containing




protein 1



E9PBQ3
Uncharacterized protein



H3BQZ7
Uncharacterized protein



H7C417
Uncharacterized protein



H7C455
Uncharacterized protein



H7C469
Uncharacterized protein



I3L2F9
Uncharacterized protein



Q92900
UPF1 Regulator of nonsense transcripts 1



Q9BZI7
UPF3B Regulator of nonsense transcripts




3B



Q9NVA1
UQCC Ubiquinol-cytochrome c reductase




complex chaperone



P31930
UQCRC1 Cytochrome b-c1 complex




subunit 1, mitochondrial



P22695
UQCRC2 Cytochrome b-c1 complex




subunit 2, mitochondrial



P47985
UQCRFS1 Cytochrome b-c1 complex




subunit Rieske, mitochondr



O14949
UQCRQ Cytochrome b-cl complex




subunit 8



Q14694
USP10 Ubiquitin carboxyl-terminal




hydrolase 10



P51784
USP11 Ubiquitin carboxyl-terminal




hydrolase 11



P54578
USP14 Ubiquitin carboxyl-terminal




hydrolase 14



Q9Y4E8
USP15 Ubiquitin carboxyl-terminal




hydrolase 15



Q53GS9
USP39 U4/U6.U5 tri-snRNP-associated




protein 2



Q96K76
USP47 Ubiquitin carboxyl-terminal




hydrolase 47



Q86UV5
USP48 Ubiquitin carboxyl-terminal




hydrolase 48



P45974
USP5 Ubiquitin carboxyl-terminal




hydrolase 5



Q93009
USP7 Ubiquitin carboxyl-terminal




hydrolase 7



Q9NQZ2
UTP3 Something about silencing protein




10



Q9NYH9
UTP6 U3 small nucleolar RNA-




associated protein 6 homolo



P51809
VAMP7 Vesicle-associated membrane




protein 7



Q9P0L0
VAPA Vesicle-associated membrane




protein-associated pro



O95292
VAPB Vesicle-associated membrane




protein-associated pro



P26640
VARS Valine-tRNA ligase



Q99536
VAT1 Synaptic vesicle membrane protein




VAT-1 homolog



P61758
VBP1 Prefoldin subunit 3



P18206
VCL Vinculin



P55072
VCP Transitional endoplasmic reticulum




ATPase



Q96JH7
VCPIP1 Deubiquitinating protein




VCIP135



P21796
VDAC1 Voltage-dependent anion-




selective channel protein



P45880
VDAC2 Voltage-dependent anion-




selective channel protein



Q9Y277
VDAC3 Voltage-dependent anion-




selective channel protein



P08670
VIM Vimentin



Q96GC9
VMP1 Vacuole membrane protein 1



Q96RL7
VPS13A Vacuolar protein sorting-




associated protein 13A



Q9P253
VPS18 Vacuolar protein sorting-




associated protein 18 hom



Q96AX1
VPS33A Vacuolar protein sorting-




associated protein 33A



Q96QK1
VPS35 Vacuolar protein sorting-




associated protein 35



Q9UN37
VPS4A Vacuolar protein sorting-




associated protein 4A



O75351
VPS4B Vacuolar protein sorting-




associated protein 4B



Q9UID3
VPS51 Vacuolar protein sorting-




associated protein 51 hom



Q99986
VRK1 Serine/threonine-protein kinase




VRK1



Q7Z5K2
WAPAL Wings apart-like protein




homolog



P23381
WARS Tryptophan--tRNA ligase,




cytoplasmic



Q969T9
WBP2 WW domain-binding protein 2



O75083
WDR1 WD repeat-containing protein 1



Q9UNX4
WDR3 WD repeat-containing protein 3



Q8NI36
WDR36 WD repeat-containing protein 36



Q15061
WDR43 WD repeat-containing protein 43



Q9NNW5
WDR6 WD repeat-containing protein 6



Q9GZS3
WDR61 WD repeat-containing protein 61



Q9BQA1
WDR77 Methylosome protein 50



Q6UXN9
WDR82 WD repeat-containing protein 82



O96028
WHSC1 Probable histone-lysine N-




methyltransferase NSD2



Q5T9L3
WLS Protein wntless homolog



Q9NQW7
XPNPEP1 Xaa-Pro aminopeptidase 1



Q9NQH7
XPNPEP3 Probable Xaa-Pro




aminopeptidase 3



O14980
XPO1 Exportin-1



Q9HAV4
XPO5 Exportin-5



Q96QU8
XPO6 Exportin-6



O43592
XPOT Exportin-T



P13010
XRCC5 X-ray repair cross-




complementing protein 5



P12956
XRCC6 X-ray repair cross-




complementing protein 6



Q9H0D6
XRN2 5-3 exoribonuclease 2



P54577
YARS Tyrosine--tRNA ligase,




cytoplasmic



P67809
YBX1 Nuclease-sensitive element-




binding protein 1



P07947
YES1 Tyrosine-protein kinase Yes



O95070
YIF1A Protein YIF1A



Q5BJH7
YIF1B Protein YIF1B



P49750
YLPM1 YLP motif-containing protein 1



Q96TA2
YME1L1 ATP-dependent zinc




metalloprotease YME1L1



Q96MU7
YTHDC1 YTH domain-containing




protein 1



Q9Y5A9
YTHDF2 YTH domain family protein 2



P31946
YWHAB 14-3-3 protein beta/alpha



P62258
YWHAE 14-3-3 protein epsilon



P61981
YWHAG 14-3-3 protein gamma



Q04917
YWHAH 14-3-3 protein eta



P27348
YWHAQ 14-3-3 protein theta



P63104
YWHAZ 14-3-3 protein zeta/delta



Q8N4Q0
ZADH2 Zinc-binding alcohol




dehydrogenase domain-containi



Q8WU90
ZC3H15 Zinc finger CCCH domain-




containing protein 15



Q7Z2W4
ZC3HAV1 Zinc finger CCCH-type




antiviral protein 1



Q9NUD5
ZCCHC3 Zinc finger CCHC domain-




containing protein 3



Q6NZY4
ZCCHC8 Zinc finger CCHC domain-




containing protein 8



Q96KR1
ZFR Zinc finger RNA-binding protein



O75844
ZMPSTE24 CAAX prenyl protease 1




homolog



P17028
ZNF24 Zinc finger protein 24



O75312
ZNF259 Zinc finger protein ZPR1



Q5BKZ1
ZNF326 DBIRD complex subunit




ZNF326



Q96F45
ZNF503 Zinc finger protein 503



Q86UK7
ZNF598 Zinc finger protein 598



Q15942
ZYX Zyxin










While preferred embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby.

Claims
  • 1. A small molecule ligand having a structure represented by Formula (I):
  • 2. A small molecule ligand which binds to a ligand binding site of Cathepsin B (CTSB), the small molecule ligand is, selected from the group consisting of
  • 3. The small molecule ligand of claim 2, wherein the ligand binding site is defined by the following residues: GQDHCGIESEVVAGIPR of the CTSB protein having the UniProtKB accession number P07858.
CROSS-REFERENCE

This application is a U.S. National Phase of International Application No. PCT/US2018/014104, filed on Jan. 17, 2018, which claims the benefit of U.S. Provisional Application No. 62/447,882, filed Jan. 18, 2017, both of which are incorporated herein by reference in their entireties.

STATEMENT AS TO FEDERALLY SPONSORED RESEARCH

This invention was made with government support under grant numbers DK099810, CA132630 and 1S10OD16357 awarded by the National Institutes of Health. The government has certain rights in the invention.

PCT Information
Filing Document Filing Date Country Kind
PCT/US2018/014104 1/17/2018 WO
Publishing Document Publishing Date Country Kind
WO2018/136555 7/26/2018 WO A
US Referenced Citations (28)
Number Name Date Kind
4469863 Ts'o et al. Sep 1984 A
5034506 Summerton et al. Jul 1991 A
5216141 Benner Jun 1993 A
5235033 Summerton et al. Aug 1993 A
5386023 Sanghvi et al. Jan 1995 A
5602240 Mesmaeker et al. Feb 1997 A
5637684 Cook et al. Jun 1997 A
5644048 Yau et al. Jul 1997 A
6344330 Ellman et al. Feb 2002 B1
7348437 Cravatt et al. Mar 2008 B2
8669065 Hansen et al. Mar 2014 B1
8778302 Tai et al. Jul 2014 B2
10168342 Cravatt et al. Jan 2019 B2
20090068107 Cravatt et al. Mar 2009 A1
20100021950 Lammert et al. Jan 2010 A1
20100179118 Ozawa et al. Jul 2010 A1
20100184661 Luo et al. Jul 2010 A1
20100203647 Hang et al. Aug 2010 A1
20110020837 Haberkant et al. Jan 2011 A1
20110195527 O'Neill et al. Aug 2011 A1
20120225434 Ciufolini et al. Sep 2012 A1
20130165337 Robinson et al. Jun 2013 A1
20140243430 Geho et al. Aug 2014 A1
20140357512 Yang et al. Dec 2014 A1
20150157686 Janssen-Heininger et al. Jun 2015 A1
20160252509 Cravatt et al. Sep 2016 A1
20170115303 Cravatt et al. Apr 2017 A1
20200292555 Cravatt et al. Sep 2020 A1
Foreign Referenced Citations (10)
Number Date Country
2011517315 Jun 2011 JP
WO-0077184 Dec 2000 WO
WO-0242773 May 2002 WO
WO-2005118833 Dec 2005 WO
WO-2006112841 Oct 2006 WO
WO-2009142678 Nov 2009 WO
WO-2015023724 Feb 2015 WO
WO-2016029037 Feb 2016 WO
WO-2017070611 Apr 2017 WO
WO-2018136555 Jul 2018 WO
Non-Patent Literature Citations (138)
Entry
Abegg et al. Proteome-Wide Profiling of Targets of Cysteine reactive Small molecules by Using Ethynyl Benziodoxolone Reagents. Angewandte Chemie International Edition 54:10852-10857 (2015).
Ahmad et al. Structure Based Molecular Inhibition of Caspase-8 For Treatment of Multi-Neurodegenerative Disease Using Known Natural Compounds. Bioinformatics 10(4):191-195 (2014).
Aldini et al. Identification of actin as a 15-deoxy-Delta12,14-prostaglandin J2 target in neuroblastoma cells: mass spectrometric, computational, and functional approaches to investigate the effect on cytoskeletal derangement. Biochemistry 46:2707-2718 (2007).
Bachovchin et al. Academic cross-fertilization by public screening yields a remarkable class of protein phosphatase methylesterase-1 inhibitors. PNAS USA 108:6811-6816 (2011).
Bachovchin et al. The Pharmacological Landscape and Therapeutic Potential of Serine Hydrolases. Nature Reviews 11:52-68 (2012).
Backus et al. Proteome-wide covalent ligand discovery in native biological systems. Nature 534(7608):570-574 (2016).
Ban et al. Tyrosine bioconjugation through aqueous ene-type reactions: a click-like reaction for tyrosine. J Am Chem Soc 132:1523-1525 (2010).
Barelier et al. Discovery of Fragment Molecules That Bind The Human Peroxiredoxin 5 Active Site. PLoS One 5(3):e9744 (2010).
Beaucage, et al. The functionalization of oligonucleotides via phosphoramidite derivative. Tetrahedron. 1993;49(10):1925-63.
Bennaars-Eiden et al. Covalent modification of epithelial fatty acid-binding protein by 4-hydroxynonenal in vitro and in vivo. Evidence for a role in antioxidant biology. J Biol Chem 277:50693-50702 (2002).
Bischoff et al. Amino Acids: Chemistry, Functionality and Selected Non-Enzymatic Post-Translational Modifications. J Proteomics 75:2275-2296 (2012).
Bloem et al. Tissue distribution and functional expression of a cDNA encoding a novel mixed lineage kinase. J Mol Cell Cardiol 33:1739-1750 (2001).
Brill et al. Synthesis of oligodeoxynucleoside phosphorodithioates via thioamidites. J. Am. Chem. Soc. 111:2321-2322 (1989).
Carbone et al. Inhibition of Hsp72-mediated protein refolding by 4-hydroxy-2-nonenal. Chem Res Toxicol 17:1459-1467 (2004).
Carbone et al. Modification of heat shock protein 90 by 4-hydroxynonenal in a rat model of chronic alcoholic liver disease. J Pharmacol Exp Ther 315:8-15 (2005).
Carlsson et al. Screening for genetic mutations. Nature 380(6571):207 (1996).
Chalker et al. Chemical modification of proteins at cysteine: opportunities in chemistry and biology. Chem Asian J 4(5):630-640 (2009).
Chaudhary et al. Probing the phosphoinositide 4,5-bisphosphate binding site of human profilin I. Chemistry & Biology 5(5):273-281 (1998).
Chipuk et al. Sphingolipid metabolism cooperates with BAK and BAX to promote the mitochondrial pathway of apoptosis. Cell 148:988-1000 (2012).
Codreanu et al. Global analysis of protein damage by the lipid electrophile 4-hydroxy-2-nonenal. Mol Cell Proteomics 8:670-680 (2009).
Cohen et al. Structural bioinformatics-based design of selective, irreversible kinase inhibitors. Science 308:1318-1321 (2005).
De Mesmaeker et al. Comparison of Rigid and Flexible Backbones in Antisense Oligonucleotides Bioorg Med Chem Lett 4(3):395-398 (1994).
Dempcy et al. Synthesis of a thymidyl pentamer of deoxyribonucleic guanidine and binding studies with DNA homopolynucleotides PNAS USa 92:6097-6101 (1995).
Deng et al. Proteome-wide Quantification and Characterization of Oxidation-Sensitive Cysteines in Pathogenic Bacteria. Cell Host Microbe 13:358-370 (2013).
Derakhshan et al. Unbiased Identification of Cyteine S-Nitrosylation Sites on Proteins. Nat Protocol 2(7):1685-1691 (2007).
Doorn et al. Covalent modification of amino acid nucleophiles by the lipid peroxidation products 4-hydroxy-2-nonenal and 4-oxo-2-nonenal. Chem Res Toxicol 15:1445-1450 (2002).
Dubinina et al. Role of 4-hydroxy-trans-2-nonenal in cell functions. Biochemistry (Most) 75:1069-1087 (2010).
Egholm et al. Peptide nucleic acids (PNA) oligonucleotide analogues with an achiral peptide backbone. J Am Chem Soc 114:1895-1897 (1992).
Erlanson et al. Tethering: Fragment-Based Drug Discovery. Annu Rev Biophys Biomol Structure 33:199-223 (2004).
Forman. Reactive oxygen species and alpha,beta-unsaturated aldehydes as second messengers in signal transduction. Ann N Y Acad Sci 1203:35-44 (2010).
Frei et al. Fast and Highly Chemoselective Alkynylation of Thiols with Hypervalent Iodine Reagents Enabled through a Low Energy Barrier Concerted Mechanism. J Am Chem Soc 136:16563-16573 (2014).
Fritz et al. An overview of the chemistry and biology of reactive aldehydes. Free Radic Biol Med 59:85-91 (2012).
Fritz et al. Exploring the biology of lipid peroxidation-derived protein carbonylation. Chem Res Toxicol 24:1411-1419 (2011).
Fujishima et al. Ligand-directed acyl imidazole chemistry for labeling of membrane-bound proteins on live cells. J Am Chem Soc 134:3961-3964 (2012).
Gao et al. Unusual conformation of a 3′-thioformacetal linkage in a DNA duplex. J. Biomolecular NMR.34:17-34 (1994).
Giron et al. Cysteine Tagging for MS-based Proteomics. Mass spectrometry Reviews 30:366-395 (2011).
Gotoh et al. Identification and characterization of a novel MAP kinase kinase kinase, MLTK. J Biol Chem 276:4276-4286 (2001).
Gubbens et al. Photocrosslinking and click chemistry enable the specific detection of proteins interacting with phospholipids at the membrane interface. Chem Biol. 16(1):3-14 (2009).
Gubbens et al. Proteome-wide detection of phospholipid-protein interactions in mitochondria by photocrosslinking and click chemistry. Mol Biosyst 6(10):1751-1759 (2010).
Gueraud et al. Chemistry and biochemistry of lipid peroxidation products. Free Radic Res 44:1098-1124 (2010).
Gushwa et al. Selective targeting of distinct active site nucleophiles by irreversible SRC-family kinase inhibitors. J Am Chem Soc 134:20214-20217 (2012).
Haberkant et al. Protein-lipid interactions: paparazzi hunting for snap-shots. Biol Chem 390:795-803 (2009).
Han et al. A comparative ‘bottom up’ proteomics strategy for the site-specific identification and quantification of protein modifications by electrophilic lipids. J Proteomics 75:5724-5733 (2012).
Hang et al. Exploring protein lipidation with chemical biology. Chem Rev 111:6341-6358 (2011).
Higdon et al. Methods for imaging and detecting modification of proteins by reactive lipid species. Free Radic Biol Med 47:201-212 (2009.
Horn et al. Oligonucleotides with alternating anionic and cationic phosphoramidate linkages: Synthesis and hybridization of stereo-uniform isomers. Tetrahedron Lett 37:743-746 (1996).
Huang et al. Crystal structure of an inactive Akt2 kinase domain. Structure 11:21-30 (2003).
Hulce et al. Proteome-wide mapping of cholesterol-interacting proteins in mammalian cells. Nat Methods 10(3):259-264 (2013).
Jacob et al. Control of Oxidative Posttranslational Cysteine Modifications: From Intricate Chemistry to Widespread Biological and Medical Applications. Chem Res Toxicol 25:588-604 (2012).
Jacobs et al. Heat shock factor 1 attenuates 4-Hydroxynonenal mediated apoptosis: critical role for heat shock protein 70 induction and stabilization of Bcl-XL. J Biol Chem 282:33412-33420 (2007).
Jacobs et al. Systems analysis of protein modification and cellular responses induced by electrophile stress. Acc Chem Res. 43(5):673-683 (2010).
Jenkins et al. The Biosynthesis of Carbocyclic Nucleosides Chem Soc Re 24:169-176 (1995).
Jung et al. Hybridization of Alternating Cationic/Anionic Oligonucleotides to RNA Segments. Nucleosides & Nucleotides 13(6&7):1597-1605 (1994).
Kambe et al. Supporting Information—Mapping the Protein Interaction Landscape for Fully Functionalized Small-Molecule Probes in Human Cells. J Am Chem Soc 136(30):10777-10782 (2014).
Keshet et al. Chapter 1: The MAP kinase signaling cascades: a system of hundreds of components regulates a diverse array of physiological functions. Methods Mol Biol 661:3-38 (2010).
Kiedrowshi et al. Parabolic Growth of a Self-Replicating Hexadeoxynucleotide Bearing a 3′-5′-Phosphoamidate Linkage Angew. Chem. Intl. Ed. English 30(4):423-426 (1991).
Kim et al. An azido-biotin reagent for use in the isolation of protein adducts of lipid-derived electrophiles by streptavidin catch and photorelease. Mol Cell Proteomics 8:2080-2089 (2009).
Knight et al. Features of selective kinase inhibitors. Chem Biol 12:621-637 (2005).
Koshkin et al. LNA (locked nucleic acids): An RNA mimic forming exceedingly stable LNA: LNA duplexes. J Am Chem Soc 120:13252-13253 (1998).
Kumagai et al. CERT Mediates Intermembrane Transfer of Various Molecular Species of Ceramides. JBC 280:6488-6495 (2005).
Kutuk et al. Apoptosis signalling by 4-hydroxynonenal: a role for JNK-c-Jun/AP-1 pathway. Redox Rep 12:30-34 (2007).
Leitner et al. Chemistry meets proteomics: the use of chemical tagging reactions for MS-based proteomics. Proteomics 6:5418-5434 (2006).
Leonard et al. Chemical ‘omics’ approaches for understanding protein cysteine oxidation in biology. Curr Opin Chem Biol 15:88-102 (2011).
Leonarduzzi et al. Signaling kinases modulated by 4-hydroxynonenal. Free Radic Biol Med 37:1694-1702 (2004).
Letsinger et al. Cationic Oligonucleotides J Am Chem Soc 110:4470-4471 (1988).
Letsinger et al. Effects of pendant groups at phosphorus on binding properties of d-ApA analogues. Nucl Acids Res14(8):3487-3499 (1986).
Letsinger et al. Phosphoramidate Analogs of Oligonucleotides J Org Chem 35(11):3800-3803 (1970).
Liu et al. Developing irreversible inhibitors of the protein kinase cysteinome. Chem Biol 20:146-159 (2013).
Long et al. The Metabolic Serine Hydrolases and Their Functions in Mammalian Physiology and Disease. Chemical Reviews 111:6022-6063 (2011).
Lopachin et al. Molecular mechanisms of 4-hydroxy-2-nonenal and acrolein toxicity: nucleophilic targets and adduct formation. Chem Res Toxicol 22:1499-1508 (2009).
Mag et al. Synthesis and selective cleavage of an oligodeoxynucleotide containing a bridged internucleotide 5′-phosphorothioate linkage. Nucleic Acids Res 19(7):1437-1441 (1991).
Marino et al. Proteomics: mapping reactive cysteines. Nat Chem Biol. 7(2):72-73 (2011).
Meier et al. Peptide Nucleic Acids (PNAs)—Unusual Properties of Nonionic Oligonucleotide Analogues. Angew. Chem. Int. Ed. Engl.31(8):1008-1010 (1992).
Ngo et al. Mutant methionyl-tRNA synthetase from bacteria enables site-selective N-terminal labeling of proteins expressed in mammalian cells. PNAS USA 110:4992-4997 (2013).
Niphakis et al. A Global Map of Lipid-Binding Proteins and Their Ligandability in Cells. Cell 161(7):1668-1680 (2015).
Pace et al. Diverse functional roles of reactive cysteines. ACS Chem Biol 8(2):283-296 (2013).
Parker et al. Ligand and Target Discovery by Fragment-Based Screening in Human Cells. Cell 168(3):527-541 (2017).
Parola et al. HNE interacts directly with JNK isoforms in human hepatic stellate cells. J Clin Invest 102:1942-1950 (1998).
Patricelli et al. Functional interrogation of the kinome using nucleotide acyl phosphates. Biochemistry 46:350-358 (2007).
Pauwels et al. Biological activity of new 2-5A analogues. Chemica Scripta 26:141-149 (1986).
PCT/US2014/050828 International Preliminary Report on Patentability dated Feb. 25, 2016.
PCT/US2014/050828 International Search Report and Written Opinion dated Dec. 12, 2014.
PCT/US2016/024148 International Preliminary Report on Patentability dated Oct. 12, 2017.
PCT/US2016/024148 International Search Report and Written Opinion dated Jul. 25, 2016.
PCT/US2016/058308 International Preliminary Report on Patentability dated May 3, 2018.
PCT/US2016/058308 International Search Report and Written Opinion dated Jan. 17, 2017.
PCT/US2018/014104 International Preliminary Report on Patentability dated Aug. 1, 2019.
PCT/US2018/14104 International Search Report and Written Opinion dated Jul. 26, 2018.
PCT/US2018/14104 Invitation to Pay Additional Fees dated May 31, 2018.
Peng et al. Turning the spotlight on protein-lipid interactions in cells. Curr Opin Chem Biol 21:144-153 (2014).
Perluigi et al. 4-Hydroxy-2-nonenal, a reactive product of lipid peroxidation, and neurodegenerative diseases: a toxic combination illuminated by redox proteomics studies. Antioxid Redox Signal 17:1590-1609 (2012).
Prescher et al. Chemistry in living systems. Nat Chem Biol. 1(1):13-21 (2005).
Rawls, Rebecca L. Optimistic about antisense. Promising clinical results and chemical strategies for further improvements delight antisense drug researchers. C & E News. 35-59 (Jun. 2, 1997).
Roe et al. Proteomic mapping of 4-hydroxynonenal protein modification sites by solid-phase hydrazide chemistry and mass spectrometry. Anal Chem 79:3747-3756 (2007).
Rostovtsev et al. A stepwise huisgen cycloaddition process: copper (i)-catalyzed regioselective “ligation” of azides and terminal alkynes. Angew. Chem. Int. Ed. Engl. 41(14):2596-2599 (2002).
Rudolph et al. Transduction of redox signaling by electrophile-protein reactions. Sci Signal 2:re7 (2009).
Sadaghiani et al. Tagging and detection strategies for activity-based proteomics. Curr Opin Chem Biol. 11(1):20-28 (2007).
Saghatelian et al. Assignment of endogenous substrates to enzymes by global metabolite profiling. Biochemistry 43:14332-14339 (2004).
Sawai et al. Synthesis and Properties of Oligoadenylic Acids Containing 2′-5′ Phosphoramide Linkage. Chem. Lett. 13(5):805-808 (1984).
Scotcher et al. Identification of Two Reactive Cysteine Residues in the Tumor Suppressor Protein p53 Using Top-Down FTICR Mass Spectrometry. 22:888-897 (2011).
Shearn et al. Modification of Akt2 by 4-hydroxynonenal inhibits insulin-dependent Akt signaling in HepG2 cells. Biochemistry 50:3984-3996 (2011).
Shen et al. JNK signaling pathway is a key modulator in cell death mediated by reactive oxygen and nitrogen species. Free Radic Biol Med 40:928-939 (2006).
Simon et al. Determining target engagement in living systems. Nat Chem Biol 9(4):200-205 (2013).
Singh et al. The resurgence of covalent drugs. Nat Rev Drug Discov 10(4):307-317 (2011).
Speers et al. Activity-based protein profiling in vivo using a copper(i)-catalyzed azide-alkyne [3+2] cycloaddition. J Am Chem Soc. 125(16):4686-4687 (2003).
Sprinzel et al. Enzymatic incorporation of ATP and CTP analogues into the 3′ end of tRNA. Eur J Biochem 81(3):579-589 (1977).
Surh et al. 15-Deoxy-Δ(12,14)-prostaglandin J(2), an electrophilic lipid mediator of anti-inflammatory and pro-resolving signaling. Biochem Pharmacol 82:1335-1351 (2011).
Tate. Recent advances in chemical proteomics: exploring the posttranslational proteome. J Chem Biol 1:17-26 (2008).
U.S. Appl. No. 14/911,316 Office Action dated Jan. 12, 2018.
U.S. Appl. No. 14/911,316 Office Action dated Mar. 27, 2019.
U.S. Appl. No. 15/080,767 Office Action dated Dec. 19, 2017.
U.S. Appl. No. 15/080,767 Office Action dated May 17, 2017.
U.S. Appl. No. 15/080,767 Office Action dated Oct. 28, 2016.
U.S. Appl. No. 15/331,745 Office Action dated Apr. 22, 2019.
U.S. Appl. No. 15/331,745 Office Action dated Dec. 29, 2017.
U.S. Appl. No. 15/331,745 Office Action dated Jul. 17, 2017.
U.S. Appl. No. 15/331,745 Office Action dated Jun. 12, 2018.
U.S. Appl. No. 15/331,745 Office Action dated Oct. 9, 2018.
Uchida. 4-Hydroxy-2-nonenal: a product and mediator of oxidative stress. Prog Lipid Res 42:318-343 (2003).
Vila et al. Identification of protein targets of 4-hydroxynonenal using click chemistry for ex vivo biotinylation of azido and alkynyl derivatives. Chem Res Toxicol. 21(2):432-444 (2008).
Wang et al. A chemoproteomic platform to quantitatively map targets of lipid-derived electrophiles. Nat Methods. 11(1):79-85 (2014).
Wang et al. Complete inhibition of anisomycin and UV radiation but not cytokine induced JNK and p38 activation by an aryl-substituted dihydropyrrolopyrazole quinoline and mixed lineage kinase 7 small interfering RNA. J Biol Chem 280:19298-19305 (2005).
Wang et al. Exploring post-translational arginine modification using chemically synthesized methylglyoxal hydroimidazolones (MG-Hs). J Am Chem Soc 134:8958-8967 (2012).
Washburn et al. Large-scale analysis of the yeast proteome by multidimensional protein identification technology. Nat Biotechnol 19(3):242-247 (2001).
Weerapana et al. Disparate proteome reactivity profiles of carbon electrophiles. Nat Chem Biol 4:405-407 (2008).
Weerapana et al. Quantitative reactivity profiling predicts functional cysteines in proteomes. Nature 468:790-795 (2010).
Wong et al. Small molecule kinase inhibitors block the ZAK-dependent inflammatory effects of doxorubicin. Cancer Biol Ther. 14(1):56-63 (2013).
Xia et al. Photoactivatable lipid probes for studying biomembranes by photoaffinity labeling. Chem Rev. 113(10):7880-7929 (2013).
Yang et al. ZAK inhibits human lung cancer cell growth via ERK and JNK activation in an AP-1-dependent manner. Cancer Sci 101:1374-1381 (2010).
Yu et al. Effect of C-terminal truncations on MLK7 catalytic activity and JNK activation. Biochem Biophys Res Commun 310:452-457 (2003).
Zhou et al. A structure-guided approach to creating covalent FGFR inhibitors. Chem Biol 17:285-295 (2010).
Fu et al. Accelerated cellular on- and off-target screening of bioactive compounds using microarrays. Org Biomol Chem 14(1):59-64 (2015).
Li et al. Design and Synthesis of Minimalist Terminal Alkyne-Containing Diazirine Photo-Crosslinkers and Their Incorporation into Kinase Inhibitors for Cell- and Tissue-Based Proteome Profiling. Angewandte Chemie 125(33):8713-8718 (2103).
Li et al. Minimalist Cyclopropene-Containing Photo-Cross-Linkers Suitable for Live-Cell Imaging and Affinity-Based Protein Labeling. J Am Chem Soc 136(28):9990-9998 (2014).
U.S. Appl. No. 14/911,316 Office Action dated Oct. 4, 2019.
Wang et al. Fluorescein Derivatives as Bifunctional Molecules for the Simultaneous Inhibiting and Labeling of FTO Protein. J Am Chem Soc 137(43):13736-13739 (2015).
Weerapana et al. Supplementary Information, Quantitative reactivity profiling predicts functional cysteines in proteomes. Nature 468:1-263 (2010).
Xie et al. Fluorescent Probes for Single-Step Detection and Proteomic Profiling of Histone Deacetylases. J Am Chem Soc 138(48):15596-15604 (2016).
Related Publications (1)
Number Date Country
20200071277 A1 Mar 2020 US
Provisional Applications (1)
Number Date Country
62447882 Jan 2017 US