A computer readable form of the Sequence Listing is filed with this application by electronic submission and is incorporated into this application by reference in its entirety. The Sequence Listing is contained in the file created on Nov. 28, 2023, having the file name “11348-054US1_ST26” and is 68,924 bytes in size.
This disclosure relates to reagents and methods for assessing receptor binding as well as determining functional consequences of receptor-ligand binding, particularly by using yeast cells genetically engineered to express peptide-, protein-, and nanobody-based ligands that are trapped between the yeast cell wall and cell membrane, i.e., periplasmic space.
The yeast Saccharomyces cerevisiae is an established model for the heterologous study of human transmembrane receptors such as G protein-coupled receptors (GPCRs) and receptor tyrosine kinases (RTKs). Through genetic engineering, the yeast model has become a robust and highly scalable drug-discovery platform for identifying small molecule pharmacological ligands. By contrast, efforts to study and discover genetically encoded peptide ligands, protein ligands, and pharmacological tools selective for protein conformation and function remains a major challenge for both human- and yeast-based discovery platforms. The new technologies described in this application address this important unmet need.
Existing technologies display proteins on the outside of the yeast cell wall to discover interactions between a protein bait (e.g., nanobody) and target (e.g., receptor or viral component). In these approaches, selection is based solely on bait and target binding using a pull-down assay with the purified protein target immobilized on a bead. Although these methods have identified many new protein-protein interactions, they suffer from several critical limitations that have hindered their further development. These include 1) the need to over-express and purify sufficient amounts of a protein target, which may not be possible, 2) non-specific binding of the purified target to the assay beads, which creates the need for several rounds of enrichment, and most critically 3) little to no information regarding the functional selectivity of the bait-target interaction.
Thus, there remains a need in the art to develop methods for assessing receptor binding as well as determining functional consequences of receptor-ligand binding for human proteins in yeast cells.
Provided herein are systems and methods for assessing receptor binding as well as determining functional consequences of receptor-ligand binding in yeast cells that both utilize and overcome the limitations of the cell wall as a situs for displaying receptors and any other proteins of interest with sufficient and necessary periplasmic accessibility and spatial exclusivity. The methods disclosed herein describe a cell wall anchoring protein for displaying peptide-, protein-, and nanobody-based ligands in the periplasmic space (i.e., the space between cell wall and cell membrane) rather than on the yeast surface. In this way, the periplasmic-displayed ligand is presented to the ligand-binding interface of the receptor, where it is capable of modulating receptor function for the purposes of studying ligand-receptor biology or identifying new ligand-receptor or protein-protein interactions via high-throughput screening experiments.
In some aspects, disclosed herein is a yeast periplasmic ligand-trapping yeast display system comprising: a plurality of yeast cells, wherein each cell comprises:
In some embodiments, the cell-wall anchoring protein comprises an N terminus projected in the periplasmic space, and wherein the ligand is fused to the N terminus of the cell-wall anchoring protein.
In some embodiments, the yeast cells are Saccharomyces cerevisiae. In some embodiments, the ligand is a genetically encoded peptide, protein, or nanobody.
In some embodiments, the signal sequence comprises a sequence at least 80% identical to SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, or SEQ ID NO:26. In some embodiments, the signal sequence comprises SEQ ID NO:21.
In some embodiments, the cell-wall anchoring protein comprises a sequence at least 80% identical to SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, or SEQ ID NO:19. In some embodiments, the cell-wall anchoring protein comprises SEQ ID NO:13.
In some embodiments, the inducible reporter produces a fluorescent protein. In some embodiments, the fluorescent protein is mTurqoise2.
In some embodiments, the inducible reporter is dependent on activation of a receptor.
In some aspects, disclosed herein is a method of screening for a ligand that modulates a receptor function, comprising:
In some embodiments, the yeast cells are Saccharomyces cerevisiae. In some embodiments, the ligand is a genetically encodable peptide, protein, or nanobody.
In some embodiments, the signal sequence comprises a sequence at least 80% identical to SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, or SEQ ID NO:26. In some embodiments, the signal sequence comprises SEQ ID NO:21.
In some embodiments, the cell-wall anchoring protein comprises a sequence at least 80% identical to SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, or SEQ ID NO:19. In some embodiments, the cell-wall anchoring protein comprises SEQ ID NO:13.
In some embodiments, the inducible reporter produces a fluorescent protein. In some embodiments, the fluorescent protein is mTurqoise2.
In some embodiments, the inducible reporter is dependent on activation of a receptor.
The disclosure will be better understood and features, aspects, and advantages other than those set forth above will become apparent when consideration is given to the following detailed description thereof. Such detailed description refers to the following drawings.
For the purposes of promoting an understanding of the disclosure, reference will now be made to embodiments and specific language will be used to describe the same. It will nevertheless be understood that no limitation of the scope of the disclosure is thereby intended, such alteration and further modifications of the disclosure as illustrated herein, being contemplated as would normally occur to one skilled in the art to which the disclosure relates.
As used in the specification, articles “a” and “an” are used herein to refer to one or to more than one (i.e., at least one) of the grammatical object of the article. By way of example, “an element” means at least one element and can include more than one element.
“About” is used to provide flexibility to a numerical range endpoint by providing that a given value can be “slightly above” or “slightly below” the endpoint without affecting the desired result. The term “about” in association with a numerical value means that the numerical value can vary by plus or minus 5% or less of the numerical value.
Throughout this specification, unless the context requires otherwise, the word “comprise” and “include” and variations (e.g., “comprises,” “comprising,” “includes,” “including”) will be understood to imply the inclusion of a stated component, feature, element, or step or group of components, features, elements, or steps but not the exclusion of any other integer or step or group of integers or steps.
As used herein, “and/or” refers to and encompasses any and all possible combinations of one or more of the associated listed items, as well as the lack of combinations where interpreted in the alternative (“or”).
Recitation of ranges of values herein are merely intended to serve as a succinct method of referring individually to each separate value falling within the range, unless otherwise indicated herein. Furthermore, each separate value is incorporated into the specification as if it were individually recited herein. For example, if a range is stated as 1 to 50, it is intended that values such as 2 to 4, 10 to 30, or 1 to 3, etc., are expressly enumerated in this disclosure. These are only examples of what is specifically intended, and all possible combinations of numerical values between and including the lowest value and the highest value enumerated are to be considered to be expressly stated in this disclosure.
Unless otherwise defined, all technical terms used herein have the same meaning as commonly understood by a person of ordinary skill in the art to which this disclosure belongs.
In some aspects, disclosed herein is a yeast periplasmic ligand-trapping yeast display system comprising: a plurality of yeast cells, wherein each cell comprises:
In some embodiments, the cell-wall anchoring protein is fused to the N terminus of the cell-wall anchoring protein. In some embodiments, the cell-wall anchoring protein is fused to the C terminus of the cell-wall anchoring protein.
In some embodiments, the cell-wall anchoring protein comprises an N terminus projected in the periplasmic space, and wherein the ligand is fused to the N terminus of the cell-wall anchoring protein.
In some aspects, disclosed herein is a yeast periplasmic ligand-trapping yeast display system comprising: a plurality of yeast cells, wherein each cell comprises:
In some embodiments, the yeast cells are Saccharomyces cerevisiae. In some embodiments, the ligand is a genetically encodable peptide, protein, or nanobody.
In some embodiments, the signal sequence comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, or SEQ ID NO:26. In some embodiments, the signal sequence comprises a sequence at least 80% identical to SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, or SEQ ID NO:26. In some embodiments, the signal sequence comprises SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, or SEQ ID NO:26.
In some embodiments, the signal sequence comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:21. In some embodiments, the signal sequence comprises SEQ ID NO:21.
In some embodiments, the signal sequence comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:24. In some embodiments, the signal sequence comprises SEQ ID NO:24.
In some embodiments, the signal sequence comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:26. In some embodiments, the signal sequence comprises SEQ ID NO:26.
In some embodiments, the cell-wall anchoring protein comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, or SEQ ID NO:19. In some embodiments, the cell-wall anchoring protein comprises a sequence at least 80% identical to SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, or SEQ ID NO:19. In some embodiments, the cell-wall anchoring protein comprises SEQ ID NO: 13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO: 18, or SEQ ID NO:19.
In some embodiments, the cell-wall anchoring protein comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:13. In some embodiments, the cell-wall anchoring protein comprises SEQ ID NO:13.
In some embodiments, the cell-wall anchoring protein comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:14. In some embodiments, the cell-wall anchoring protein comprises SEQ ID NO:14.
In some embodiments, the cell-wall anchoring protein comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:15. In some embodiments, the cell-wall anchoring protein comprises SEQ ID NO:15.
In some embodiments, the cell-wall anchoring protein comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:17. In some embodiments, the cell-wall anchoring protein comprises SEQ ID NO:17.
In some embodiments, the inducible reporter produces a fluorescent protein. In some embodiments, the fluorescent protein is mTurqoise2.
In some embodiments, the inducible reporter is dependent on activation of a receptor.
In some aspects, disclosed herein is a method of screening for a ligand that modulates a receptor function, comprising:
In some aspects, disclosed herein is a method of screening for a ligand that modulates a receptor function, comprising:
In some embodiments, the yeast cells are Saccharomyces cerevisiae. In some embodiments, the ligand is a genetically encodable peptide, protein, or nanobody.
In some embodiments, the signal sequence comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, or SEQ ID NO:26. In some embodiments, the signal sequence comprises a sequence at least 80% identical to SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, or SEQ ID NO:26. In some embodiments, the signal sequence comprises SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, or SEQ ID NO:26.
In some embodiments, the signal sequence comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:21. In some embodiments, the signal sequence comprises SEQ ID NO:21.
In some embodiments, the signal sequence comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:24. In some embodiments, the signal sequence comprises SEQ ID NO:24.
In some embodiments, the signal sequence comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:26. In some embodiments, the signal sequence comprises SEQ ID NO:26.
In some embodiments, the cell-wall anchoring protein comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, or SEQ ID NO:19. In some embodiments, the cell-wall anchoring protein comprises a sequence at least 80% identical to SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, or SEQ ID NO:19. In some embodiments, the cell-wall anchoring protein comprises SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, or SEQ ID NO:19.
In some embodiments, the cell-wall anchoring protein comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:13. In some embodiments, the cell-wall anchoring protein comprises SEQ ID NO:13.
In some embodiments, the cell-wall anchoring protein comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:14. In some embodiments, the cell-wall anchoring protein comprises SEQ ID NO: 14.
In some embodiments, the cell-wall anchoring protein comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:15. In some embodiments, the cell-wall anchoring protein comprises SEQ ID NO:15.
In some embodiments, the cell-wall anchoring protein comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:17. In some embodiments, the cell-wall anchoring protein comprises SEQ ID NO:17.
In some embodiments, the inducible reporter produces a fluorescent protein. In some embodiments, the fluorescent protein is mTurqoise2.
In some embodiments, the inducible reporter is dependent on activation of a receptor.
In some embodiments, the method allows screening of a nanobody library. In some embodiments, the method allows screening of a peptide library. In some embodiments, the method allows screening of a small molecule library.
In some embodiments, the systems and methods use yeast engineered for the functional expression of human G-protein coupled receptors (GPCRs), the utility of the anchored ligand design using known peptide, protein, and nanobody ligands is demonstrated, whose effects on receptor functionality are directly assessed inter alia via fluorescent reporting of receptor activation using microplate readers, fluorescence activated cell sorting, high-content microscopy, and any other methods that provide quantification of the fluorescent reporter. The platform can also be used to detect intracellular ligands and nanobodies selective for specific receptor conformations and multimeric states.
In some embodiments, the systems and methods use yeast engineered for the functional expression of a Chemokine Receptor (for example, the C—X—C Motif Chemokine Receptor 4 (CXCR4)). In some embodiments, the systems and methods use yeast engineered for the functional expression of Angiotensin II Receptor Type I (AGTR1).
In some embodiments, herein is described a novel platform, ENTRAP, that overcomes these barriers to identify and select conformation-specific ligands and nanobodies for GPCRs based not only on binding, but also on functional and pharmacological outcomes, including receptor agonism, inverse agonism, antagonism, allosteric modulation and conformational blocking.
In some embodiments, the systems and methods use yeast engineered for the functional expression of human receptor tyrosine kinases (RTKs).
Also provided herein is a method of identifying receptor function, the method comprising: providing a plurality of yeast cells, wherein each cell comprises, one or more ligands displayed in the periplasm of the yeast cells, wherein the one or more ligands are anchored to the cell wall by a cell-wall anchoring protein; and wherein the ligand or ligands are capable of ligand-receptor binding to a transmembrane receptor expressed at the yeast cell wall surface; wherein the transmembrane receptor is bound to an intracellular protein, wherein the intracellular protein is encoded with an inducible reporter; wherein the inducible reporter is fluorescent protein, and wherein expression of the inducible reporter is dependent on activation of the transmembrane receptor by the one or more ligands.
The ligand trapping technology disclosed herein was designed with compatibility for yeast-based platforms used to study human transmembrane receptors and other cell surface proteins. Displaying peptides, proteins, or nanobodies in the periplasm near the receptor's ligand-binding site (i.e., orthosteric site) permits ligand-receptor interactions to occur. As shown in
Combining these technologies created a pharmacological screening platform with broad applications. First, the ligand trapping technology allows engineering of autocrine signaling systems, where endogenous peptide/protein ligands can be displayed in yeast strains harboring their cognate receptors. These strains can then be used for screening any chemical library to identify potential antagonists, providing a unique advantage over most other screening platforms commonly biased towards agonist discovery. These strains can also be used for screening libraries of purified peptides and proteins to identify potential antagonists.
This combinatorial platform also allows screening of gene-based libraries encoding a variety of peptide, protein, or nanobody ligands and libraries. The libraries can comprise DNA or RNA, and can comprise a genetically encodeable ligand for GPCRs or other receptors that can be expressed to the yeast membrane. This approach resulted in a major time savings and financial advantage over conventional screening approaches for these ligand types, which required initial ligand purification with limited yields. Additionally, many of these ligands cannot be overexpressed and purified from conventional sources or traverse the yeast cell wall when added exogenously, limiting the use of yeast screening platforms prior to the periplasmic ligand trapping technology disclosed herein.
Lastly, in some embodiments, each individual cell contains a single genetically encoded receptor and ligand. These genetic components serve as identity barcodes for deconvolution of functional ligand-receptor interactions in pooled experiments. As a consequence, pooling of both ligand and receptor libraries can be performed simultaneously, with functional interactions being identified merely through fluorescence- and sequence-based methods.
As used herein the term “recombinant expression construct” will be understood by a person having ordinary skill in the art to mean a product of genetic engineering techniques for introducing into a cell a nucleic acid encoding a protein of interest, particularly a protein heterologous to the cell into which the recombinant expression construct is introduced.
The term “G Protein-Coupled Receptor” or “GPCR” refers to any member of the large family of transmembrane receptors that typically function to bind molecules outside the cell and activate inside signal transduction pathways, ultimately inducing one or more cellular responses. G protein-coupled receptors are found only in eukaryotes, including yeast and animals.
Binding and activation of a GPCR typically involves signal transduction pathways including the cAMP signal pathway and the phosphatidylinositol signal pathway. When a ligand binds to the GPCR it causes a conformational change in the GPCR, which allows it to act as a guanine nucleotide exchange factor (GEF). The GPCR can then activate an associated G-protein by exchanging its bound GDP for a GTP. The G-protein's a subunit, together with the bound GTP, can then dissociate from the β and γ subunits to further affect intracellular signaling proteins or target functional proteins directly depending on the a subunit type (Gαs, Gαi/o, Gαq/11, Gα12/13).
In some embodiments, the intracellular protein comprises a sequence at least 60% identical (for example, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%) to SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 11, or SEQ ID NO: 12. In some embodiments, the cell-wall anchoring protein comprises a sequence at least 80% identical to SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, or SEQ ID NO: 12. In some embodiments, the cell-wall anchoring protein comprises SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:11, or SEQ ID NO: 12.
All GPCRs share a common structure and mechanism of signal transduction. Generally, GPCRs can be grouped into 6 classes based on sequence homology and functional similarity: Class A (or 1) (Rhodopsin-like), Class B (or 2) (Secretin receptor family), Class C (or 3) (Metabotropic glutamate/pheromone), Class D (or 4) (Fungal mating pheromone receptors), Class E (or 5) (Cyclic AMP receptors), Class F (or 6) (Frizzled/Smoothened). Many G protein-coupled receptors are involved in detection of endogenous ligands (e.g., hormones, growth factors, etc.).
In some embodiments, the GPCR is localized to the cell membrane. In some embodiments, the GPCR gene is integrated into the cell's genome. In some embodiments, the inducible reporter is integrated into the cell's genome.
In some embodiments, the target domain gene comprises a “barcode” or a unique sequence. The barcode is used to uniquely identify or distinguish the target domain. The barcode may be of any suitable length for unambiguously identifying the target domain gene. The length of the barcode sequence is not critical and may be of any length sufficient to distinguish the barcode sequence from other barcode sequences. In some embodiments, the target domain gene is heterologous to the yeast system and represents a unique DNA sequence that can be identified by quantitative polymerase chain reaction, NanoString, sequencing, and similar methods.
In some embodiments, the reporter is induced by signal transduction upon activation of the GPCR (shown, e.g. in
In some embodiments, the cells are yeast cells selected from Saccharomyces cerevisiae, Schizosaccharomyces pombe, Yarrowia lipolytica, Candida glabrata, Ashbya gossypii, Cyberlindnera jadinii, Pichia pastoris, Kluyveromyces lactis, Hansenula polymorpha, Candida boidinii, Arxula adeninivorans, Xanthophyllomyces dendrorhous, or Candida albicans.
In some embodiments, the cells can comprise any other yeast species that has a cell wall and pheromone-sensing pathway. In some embodiments, the cells can comprise yeast from the evolutionary kingdom Fungi and division Ascomycota that have a cell wall and conserved pheromone-sensing pathway.
In some embodiments, the cell-wall anchoring protein is any of Ccw1423-238, Sed119-338, Flo11496-1537, Flo525-1075, Suc220-532, Ecm3320-429, and Yps122-569. In some embodiments, the cell-wall anchoring protein is Ccw1423-238. Ccw 1423-238 is localized to the inner cell wall in yeast.
Various exemplary embodiments of compositions and methods according to this invention are now described in the following non-limiting Examples. The Examples are offered for illustrative purposes only and are not intended to limit the scope of the present invention in any way. Indeed, various modifications of the invention in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing description and the following examples and fall within the scope of the appended claims.
The Examples set forth herein incorporate and rely on certain experimental and preparatory methods and techniques preformed as exemplified herein.
NEB® 5-alpha Competent Escherichia coli (E. coli), a derivative of the DH5α strain, were used for all cloning experiments and general plasmid propagation. Initial selection of E. coli was performed at 37° C. on Lysogeny Broth (LB) agar medium supplemented with 100 mg/L carbenicillin: 10.0 g/L tryptone, 5.0 g/L yeast extract, 5.0 g/L sodium chloride (Sigma), 0.1 g/L sodium hydroxide, 18.0 g/L Bacto agar. Select transformants were then grown at 37° C., shaking at 200 rpm, in LB medium supplemented with 50 mg/L carbenicillin.
All yeast strains used are derivatives of BY4741 (MATa his3Δ1 leu2Δ0 met15Δ0 ura3Δ0), are listed in Table 1 and have been described previously (Kapolka et al., 2020, Proc. Natl. Acad. Sci. USA 117: 13117-13126). Specifically, these strains are derivatives of the DI3Δfig1Δ::mTq2 P1 model strain (BY4741 far1Δ0 sst2Δ0 ste2Δ0 fig1Δ0::mTq2 X-2:PTEF1a-UnTS-TCYC1b) containing humanized Ga proteins.
Prior to transformation or other experimentation, yeast cells were grown at 30° C. on Yeast extract Peptone Dextrose (YPD) agar medium: 10.0 g/L yeast extract (RPI), 20.0 g/L peptone (RPI), 20.0 g/L dextrose (RPI), 15.0 g/L Bacto agar (RPI). Select colonies were propagated at 30° C., static or shaking at 200 rpm, in liquid cultures of YPD or pH-adjusted Synthetic Complete Dextrose low fluorescence medium (SCD Screening Medium) as specified comprising 5.0 g/L ammonium sulfate, 0.79 g/L CSM supplement mixture, 20.0 g/L dextrose, 9.76 g/L MES monohydrate, 8.72 g/L potassium phosphate dibasic, 1.70 g/L low fluorescence yeast nitrogen base without amino acids, folic acid, and riboflavin, pH adjusted with potassium hydroxide or hydrochloric acid.
For yeast transformations, transformants were initially selected at 30° C. for three days on SCD dropout agar medium comprising 5.0 g/L ammonium sulfate, 20.0 g/L dextrose, 1.70 g/L yeast nitrogen base without amino acids and ammonium sulfate, 0.1 g/L sodium hydroxide, 15.0 g/L Bacto agar, and 0.69 g/L of Complete Supplement Mixture (CSM)-Leu or 0.67 g/L of CSM-Leu-Ura. Select transformants were propagated at 30° C., static or shaking at 200 rpm, in liquid cultures of SCD dropout medium.
A CLARIOstar multimode microplate reader was used for collecting all microplate-based absorbance (A600) and fluorescence data with the following instrument settings: absorbance (excitation 600 nm; 22 flashes/well), mTq2-based fluorescence (excitation 430/10 nm, dichroic LP 458 nm, emission 482/16 nm; 10 flashes/well; bottom read).
All plasmids used and constructed throughout this study were purified using EZ Plasmid Miniprep Kit and sequences validated via Sanger sequencing.
Plasmids used for testing yeast anchoring proteins were constructed by amplifying the respective genes via PCR from the BY4741 genome, starting immediately downstream of the signal peptide cleavage site, except for Flo11496-1537, which contains only the 42 C-terminal amino acids of Flo1, synthesized as a gBlock (IDT) as described previously (Rowe et al., 2020, J. Biol. Chem. 295: 8262-8271). Amplicons were cloned into pYEplac181 αPrenat using NEBuilder® HiFi DNA Assembly Master Mix (NEB; catalog no. E2621L), where αPrenat is the pre α-factor signal sequence. The gene for the fluorescent protein mTurquoise2, mTq2, was then inserted between αPrenat and the anchor protein-encoding gene in similar fashion, with yeast codon-optimized mTq2 being sourced from Addgene (#86424). Sequences for all anchor proteins tested are provided in Table 2.
Six unique sequences were selected for comparison with αPrenat used initially for secretion, which included: the native pre-pro α-factor signal sequence (αPrePronat), three optimized pre-pro α-factor variants (αPreProv2 (Aza et al., 2021, Cell Mol Life Sci 78: 3691-3707)), αPreProv3 (Id.), and αPreProv4 (Rakestraw et al., 2009, Biotechnol Bioeng 103: 1192-1201)), the signal sequence native to the S. cerevisiae oligosaccharyltransferase complex alpha subunit (Ost1; YJL002C), and a chimeric pre-pro α-factor sequence where the pre α-factor sequence has been replaced with that from Ost1 (OstlaPro) (Bcan et al., 2022, Nat Commun. 13: 2882; Fitzgerald et al., 2014, Microbial Cell Factories 13: 125). Complete signal sequences were first generated via Assembly PCR using 2-4 overlapping oligonucleotides 40-100 bp long. Once assembled, amplicons were directly cloned into pYEplac181 as described above, followed by cloning of mTq2 downstream of the specified signal sequence. Sequences for all signal sequences tested are provided in Table 3.
GPCRs via periplasmic ligand display (
A novel vector pYEplac181M2 was designed, which provided both constitutive (TEF1A promoter) and inducible (GAL1 promoter) expression similarly to that described previously (Vickers et al., 2013, Microbial Cell Factories 12: 96). This was done by PCR-amplifying the GAL1 promoter and ADH1 terminator sequences from pYDS649HM (McMahon et al., 2018, Nat Struct Mol Biol 25: 289-296) and simultaneously cloning into pYEplac181. Then, pYEplac181M2_αPrePronat-CXCL12a was created by simultaneously cloning αPrePronat and CXCL12a genes downstream of the GAL1 promoter using HiFi DNA Assembly. A similar approach was used for creating pYEplac181M2_αPrePronat-AGTII, but αPrePronat-AGTII was first assembled via Assembly PCR using a yeast codon-optimized (IDT) AGT-II sequence and subsequently cloned into pYEplac181M2 under GAL1 regulation. Lastly, periplasmic-displayed nanobody CA4139 (αPrePronat-CA4139-Ccw1423-238) or intracellular nanobodies AT110, AT110i103, AT110i1, and BV025 (McMahon et al., 2018, Nat Struct Mol Biol 25: 289-296) were cloned into pYEplac181M2 αPrePronat-CXCL12a or pYEplac181M2_αPrePronat-AGTII, respectively, under TEF1A regulation using HiFi DNA Assembly.
A Mix & Go E. coli Transformation Kit (Zymo Research, Irvine, CA) and associated protocol was used to make competent bacterial cells. Competent cells were stored as 210-uL aliquots in sterile 1.5-mL Eppendorf tubes at −80° ° C. for long-term storage. For transformation, cells were thawed on ice, gently mixed, and 50 uL aliquoted to sterile 1.5-mL round-bottom Eppendorf tubes on ice. Desired DNA (1-5 uL) was added to cells, gently flicked 4-5 times, and placed on ice for 15 min. Cell mixtures were then heat shocked at 42° C. for 30 sec, recovered on ice for 5 minutes, and 900 uL room-temperature Super Optimal broth with Catabolite repression (SOC) was added prior to rotating at 37° ° C. for 45 min. Once transformation was complete, 100 uL transformant cells were plated onto LB agar medium supplemented with appropriate antibiotics.
Prior to transformation, select yeast colonies were picked from YPD agar medium into 5 mL YPD liquid medium in sterile 50-mL conical tubes and grown to saturation overnight. Cultures were then diluted 1:50 in 5-15 mL YPD in sterile 50-mL conical tubes and grown for 2.5-3 h to A600 0.4-1.0. Cells were made chemically competent following a standard lithium acetate protocol as described previously (Kapolka et al., 2020, Proc. Natl. Acad. Sci. USA 117: 13117-13126).
50 uL competent yeast cells were added to DNA transformation mixtures containing 175 uL PEG mix (400 g/L PEG-3350 dissolved in LiOAc mix), 5 uL salmon sperm DNA (stored on ice after initial boiling at 100° ° C. for 10 minutes) (Thermo Fisher) and 150-200 ng plasmid DNA (150 ng for single plasmid transformations; 100 ng of each plasmid for co-transforming two plasmids) in sterile 1.5-mL Eppendorf tubes. Mixtures were vortexed, incubated at room temperature for 30 minutes, spiked with 12 uL DMSO (Sigma), and heat shocked at 42° C. for 15 minutes. Transformant yeast cells were then centrifuged (5000 g for 1 minute), harvested, and resuspended in 200-400 uL YPD liquid medium. 35-50 uL or 100 uL resuspended cells were plated onto small (22-mm, 12-well) or large (100-mm) petri dishes, respectively, containing appropriate SCD dropout agar medium and placed at 30° ° C. for three days.
For autocrine activation of receptors SSTR5 and CXCR4 with periplasmic-displayed ligands SRIF-14 and CXCL12a, respectively, the 10 yeast strains harboring humanized Ga proteins (DI DCyFIR P1 I-DI DCyFIR P1 S (2); see Table 1) were co-transformed with pYEplac195 receptor and pYEplac181 αPrePronat-ligand-Ccw 14.
Four transformant yeast colonies per strain were picked from SCD-Ura-Leu agar medium into 1 mL SCD-Ura-Leu medium in sterile 96-well deep-well blocks, covered with porous film, shaken on a MixMate microplate shaker (1200 rpm for 30 seconds), and grown at 30° C. static for 18-19 h. Cells were then resuspended via shaking (1500 rpm for 1 minute), A600 measured, and used to prepare 80 uL cultures normalized to A600 0.1 in SCD Screening Medium pH 7.0 in a sterile black 384-well clear-bottom plate. 384-well plate was then centrifuged (3000 g for 1 minute), shaken (2000 rpm for 30 seconds), and placed in a microplate reader pre-warmed to 30° C. and programmed to collect A600 and fluorescence (gain 1200) measurements every 1 h for 15 h.
The yeast strain harboring the humanized Gαi protein (DI DCyFIR P1 I) was co-transformed with pYEplac195 CXCR4 and one of the following pYEplac181M2 constructs: pYEplac181M2_αPrePronat-CXCL12a (CXCL12a (Gal induced)), pYEplac181M2 αPrePronat-CA4139-Ccw14_αPrePronat-CXCL12a (CXCL12a+Nb CA4139), or empty pYEplac181M2 (neg ctrl) for testing nanobody-based modulation of CXCR4 using periplasm-displayed nanobody CA4139. For each pYEplac181M2 construct listed, the components under regulation of the constitutive TEF1A promoter are listed first, where applicable, followed by components under the inducible GAL1 promoter (i.e., pYEplac181M2 componentconstitutive_componentinducible).
Four transformant yeast colonies per transformation were picked from SCD-Ura-Leu agar medium into 1 mL SCD-Ura-Leu medium in a sterile 96-well deep-well block, covered with porous film, shaken on a MixMate microplate shaker (1200 rpm for 30 seconds), and grown at 30° C. static for 21 h. Cells were then resuspended via shaking (1500 rpm for 1 minute), A600 measured, and used to prepare 1.5 mL cultures normalized to A600 0.2 in low-glucose SCD-Ura-Leu medium (1.9% galactose, 0.1% glucose) in a sterile 96-well deep-well block. Block was covered with porous film and grown at 30° C. static for 21 h. Cells were then resuspended again and A600 measured. Using a Biomek NXP liquid-handling robot, the volume of cells needed for preparing 100-uL cultures normalized to A600 0.2 were transferred to seven individual sterile 96-well plates. Cells were centrifuged (3000 g for 5 min), harvested, washed with 100 uL sterile H2O, and resuspended in 100 uL SCD Screening Medium pH 7.0 with varying ratios of galactose/glucose. 80 uL of resuspended cells were then transferred to a sterile black 384-well clear-bottom plate, centrifuged (3000 g for 1 minute), shaken (2000 rpm for 30 seconds), and placed in a microplate reader pre-warmed to 30° C. and programmed to collect A600 and fluorescence (gain 1200) measurements every 1 h for 15 h.
Testing modulation of AGTR1 using intracellular nanobodies was performed, using a yeast strain harboring the humanized Gαi protein (DI DCyFIR P1 I) that was co-transformed with pYEplac195 AGTR1 and one of the following pYEplac181M2 constructs: pYEplac181M2_αPrePronat-AGT-II (AGT-II (Gal induced)), pYEplac181M2 AT110_αPrePronat-AGT-II (AGT-II+Nb AT110), pYEplac181M2 AT110i103_αPrePronat-AGT-II (AGT-II+Nb AT1101103), pYEplac181M2 AT110i1_αPrePronat-AGT-II (AGT-II+Nb AT110i1), or empty pYEplac181M2 (negative control). Four transformant yeast colonies per transformation were prepared, experiments performed, and data collected as described above in CXCR4 modulation via Periplasmic Display of Nanobody CA4139.
The sequences of all protein and nucleic acid chains deposited in the PDB (Berman et al., 2003, Nat Struct Biol 10: 980; Berman et al., 2000, Nucl Acids Res 28: 235-242) were downloaded from the PDB on 10/6/2022. Using a custom Python script, protein-only sequence entries were parsed into a new FASTA file that served as input for the program BLAST+ (Camacho et al., 2009, BMC Bioinformatics 10: 421) to create a local BLAST library of all protein chains in the PDB (738,016 total chains).
Collecting and Processing 1,336 Nanobody Structural Chains from the PDB
PDB entries containing nanobodies were identified using a 2-step procedure. In step one, an initial set of nanobody structures in the PDB was identified. In step two, the sequences of these structures to exhaustively collect all nanobody chains in the PDB using sequence similarity.
A web-based PDB query using the search term “nanobody” and source organism names Lama glama, Camelus dromedarius, Camelidae mixed library, Camelidae, Camelus bactrianus, and synthetic constructs returned 550 results that were downloaded in a custom table containing the PDB code, sequence, and sequence length of each PDB entry.
Using BLAST+, the seed nanobody library file created in step one, and custom Python scripts, all sequences in the PDB that matched at least one member of the seed nanobody library using cutoffs of >50% sequence similarity, >100 aligned residues, and E-value <le-30 were identified. Using the SIFTS taxonomy resource, the matching sequences were filtered for nanobody chains using the taxonomic ids: 9844, Lama glama; 9838, Camelus dromedarius; 1579311, Camelidae mixed library; 9835, Camelidae; 9837, Camelus bactrianus; and 32630 synthetic constructs. Using a pipeline of custom Python scripts, the individual nanobody chain(s) in each PDB file were parsed, retained chains with <150 residues, and aligned the resultant set of 1,336 nanobody chains to a reference nanobody structure (PDB code 3P0G chain B) using the TMalign algorithm (Zhang et al., 2005, Nucl Acids Res 33: 2302-2309).
The three major subclasses of nanobody structures can be classified by the size and relative position of their CDR3 loops. Nanobody subclasses were numbered in order of their prevalence in the PDB. Nanobodies in subclass 1 typically have a longer CDR3 loop (>8 residues) that folds against or adjacent to β-strands 3, 4, 8, and 9, often adopting some degree α-helical substructure. In contrast, nanobodies in subclasses 2 and 3 form upright CDR3 loops that can be shorter (subclass 2) or longer (subclass 3). As such, nanobody subclasses 2 and 3 can be differentiated from subclass 1 by the lack residues adjacent to β-strands 3, 4, 8, and 9. The 1,336 nanobody chains in a 3-step process as described below, were classified.
Using structures of each nanobody subclass that was visually confirmed in PyMOL (Schrodinger, 2002, The PyMOL Molecular Graphics System, Version 2.4.2.), Cα atom waypoints for each of the three CDR3 subtypes were selected. Waypoints used for subclass 1 were: 1F2X, chain L, Cα-1105; 6WAQ, chain C, Cα-419; 1IEH, chain A, Cα-103; 1JTP, chain B, Cα-107; 2WZP, chain E, Cα-107. Waypoints used for subclass 2 were: 3POG, chain B, Cα-101; 3POG, chain B, Cα-102; 3POG, chain B, Cα-103; 5HM1, chain C, Cα-372; 3EZJ, chain H, Cα-103. Waypoints used for subclass 3 were: 3KIK, chain D, Cα-658; 1G9E, chain A, Cα-100.
Using a custom Python script and set of waypoint atoms, each terminal side chain (TSC) atom that was within 5 Å of any waypoint was collected. Using consensus network analysis (CNA) (see, Isom et al., 2015, Proc Natl Acad Sci USA 12: 5702-5707; Isom et al., 2016, Biochemistry 55: 534-542; Rowe et al., 2021, J Biol Chem 296: 100167), the set of 4,068 TSCs was triangulated and reduced to three clusters of 2,125, 1,737, and 31 TSCs. The CNA parameters used were minimum cluster size of 5, minimum split cluster size of 50, vacuum network size limit of 500, vacuum network distance limit of 4 Å, and split clusters selected. Using pHinder (Id.), cluster surfaces was calculated, establishing CDR3 classification voxels for the three nanobody subclasses.
The TSCs of the 1,336 were evaluated using a custom python script nanobody chains against each CDR3 voxel. A subset of TSCs residing within CDR3 voxel were identified for each nanobody. These TSC location hits were used to discern the subclass of each nanobody chain. TSC location hits for subclass 3 were conclusive and distinct from subclasses 1 and 2. While most TSC location hits for subclasses 1 and 2 were definitive, two scenarios caused the heuristic to be further defined. In scenario one, a nanobody had <3 TSC locations hits that were typically located at the interface of the subclass 1 and 2 voxels. Using two addition Cα atom waypoints, nanobodies were assigned as belonging to subclass 1 or 2 (using waypoint 5JMO, chain D, Cα-100), or subclass 2 or 3 (using waypoint 7ME7, chain A, Cα-102). In scenario two, a nanobody had several TSC location hits in both subclass 1 and 2 voxels. By default, these nanobodies were assigned to subclass 1 unless their CDR3s had <9 TSC hits and the number of TSC hits in the subclass 2 voxel was more than double the number of TSC hits in the subclass 1 voxel. All 1,336 nanobody chains were definitively classified. Amino acid sequences for the 988 subclass 1, 301 subclass 2, and 47 subclass 3 nanobodies were saved in three FASTA-formatted libraries and animated GIF files depicting the TSC hits and classification voxels for each nanobody subclass were generated for visual confirmation.
Synthetic nanobody libraries were built by selecting particularly advantageous consensus scaffolds from nanobody subclasses 1 and 2. The protein sequences of subclasses 1 and 2 using SnapGene (San Diego, CA) were aligned and the results in FASTA format. The FASTA files and custom Python scripts were used to calculate the amino acid consensus at each residue position to build Python regular expressions of consensus amino acid sequences flanking CDRs 1, 2, and 3: subclass 1 (consensus before CDR1 “L.LSC..S” (SEQ ID NO:37) and after CDR1 “ . . . W.RQA” (SEQ ID NO:38)), (consensus before CDR2 “RFT.S.” (SEQ ID NO:39) and after CDR2 “ . . . . L.M..L”), (consensus before CDR3 “DT..Y.C..” (SEQ ID NO:40) and after CDR3 “..WG.G”); subclass 2 (consensus before CDR1 “LSC..S” (SEQ ID NO:41) and after CDR1 “ . . . W.R..P”), (consensus before CDR2 “FTI . . . ” and after CDR2 “N..YL”), (consensus before CDR3 “DT..Y.C..” (SEQ ID NO:40) and after CDR3 “ . . . WG.G”). CDR 1, 2, and 3 sequences of each nanobody in subclasses 1 and 2 were parsed to aggregate sets of nanobody chains with identical CDR 1, 2, and 3 loops and consensus CDR3 flanking regions. One subclass 1 (7KLW chain C: 7KLW.C) and two subclass 2 (3P0G chain B and 6DO1 chain C: 3P0G.B and 6DO1.C) nanobody chains were selected as particularly advantageous consensus scaffolds for building three distinct synthetic nanobody libraries. Each consensus scaffold was selected to have CDR3 loops of differing length (9, 3P0G.B; 14, 7KLW.C; and 15, 6DO1.C) that equally subdivided and sampled the nanobody binding interface defined by subclass 1 and 2 CDR3 classification voxels.
Three synthetic nanobody libraries were prepared using a 3-step process. In step one, the CDR1 residue sequences matched across three nanobody scaffolds by changing the CDR1 of 7KLW.C (SISSI) (SEQ ID NO:42) and 6DO1.C (NIFDV) (SEQ ID NO:43) to match the CDR1 of 3P0G.B (SIFSI) (SEQ ID NO:44). Additionally, F41 of 7KLW.C was changed to Y41 to match the upstream CDR1 flanks across all three scaffolds. By matching CDR1 regions, the same primer pools could be used to introduce CDR1 diversity. In step two, each scaffold gene was codon optimized for yeast expression using the web-based optimization tool by IDT-DNA (Coralville, IA), ordered as a gBlock from IDT-DNA, and subcloned into a 2μ pYEplac181 yeast expression plasmid using a NEBuilder® HiFi DNA assembly kit (E2621, New England Biolabs, Ipswich, MA). In step 3, each synthetic library was manufactured and delivered as 10 μg of linear dsDNA by Twist Bioscience (South San Francisco, CA) using the nanobody scaffold plasmids described above, and the residue positions and amino acid frequencies listed in Tables 4-6. The resultant libraries had variable CDR1 and CDR3 regions and theoretical diversities of 6.9×108 (3P0G.B), 1.35×1011 (6DO1.C), and 2.04×1012 (7KLW.C).
In order to design the periplasmic ligand trapping technology, identification of a suitable cell wall protein, as shown in
Fluorescent protein mTurqoise2 was used as a surrogate ligand to quantify total protein output, secretion efficiency, and trapping efficiency of each candidate signal sequence and anchoring protein based on fluorescence-based measurements. Results identified αPrePro (SEQ ID NO:1 and SEQ ID NO:21) as a superior signal sequence for secretion and Ccw1423-238 (SEQ ID NO:2 and SEQ ID NO:13) as an optimal cell wall anchoring protein motif, whose amino acid sequences are shown in
To confirm the ability of the yeast periplasmic trapping design for displaying ligands that activate human membrane receptors, the invention was combined with previously engineered yeast strains designed for human GPCR studies (Kapolka et al., 2020, Proc. Natl. Acad. Sci. USA 117: 13117-13126) (see Table 1). These strains contain 1) a human GPCR, 2) one of ten unique intracellular Ga proteins (humanized), and 3) a fluorescent reporter gene induced upon GPCR/Ga activation. The ability to activate Somatostatin Receptor 5 (SSTR5) through secretion and trapping of its cognate peptide ligand somatostatin (SRIF-14) was tested. As shown in
To confirm that larger protein ligands are also compatible with periplasmic display and functional modulation of receptors, the same design was used to display a chemokine protein ligand (CXCL12a) in strains harboring the C—X—C Motif Chemokine Receptor 4 (CXCR4). This resulted in activation of CXCR4 and two Gα proteins (Gαi, Gαt) through activated GPCR-Ga coupling, as shown in
The periplasmic ligand trapping applications from
Using engineered yeast strains harboring CXCR4 as described in
Incorporating the ligand trapping technology into the yeast platform for human membrane receptor studies allows assessment of receptor function from peptide-, protein-, and nanobody-receptor interactions occurring at/near the orthosteric binding site. However, identifying similar interactions that occur at the receptor's intracellular face can also be advantageous and used for modulating receptor function. This is especially true for nanobodies, which can recognize specific receptor conformations and influence interactions between receptors and intracellular signaling proteins that control downstream signaling pathways.
Using engineered yeast strains harboring Angiotensin II Receptor Type I (AGTR1), autocrine activation of this receptor was tested to see co-expression/secretion of its cognate peptide ligand angiotensin II (AGT-II). As shown in
Initial studies identifying AT110 indicated this nanobody stabilizes the active state of AGTR1 (Wingler et al., 2019, Cell 176: 479-490), correlating with the increased signaling observed above. Next, two additional active-state stabilizing intracellular nanobodies for AGTR1, i.e., AT1101103 (Wellner et al., 2021, Nat Chem Biol 17: 1057-1064) and AT110i1 (Wingler et al., 2019, Cell 176: 479-490) were tested, to confirm the platform was capable of reporting similar nanobody-receptor interactions. As shown in
A major application of the nanobody trapping platform disclosed herein is the discovery of extracellular nanobodies that regulate GPCR function by acting as full and partial agonists, inverse agonists, positive and negative allosteric modulators, and antagonists. Additionally, this platform is particularly advantageous for discovering intracellular nanobodies that function as GPCR chaperones, are selective for ligand-dependent active states, are selective for inactive states, and can identify specific GPCR isoforms. In both cases, such nanobodies could serve as novel and essential tools for studying GPCR pharmacology, developing GPCR-targeted nanobody therapies, and providing new biotechnologies based on specific GPCR-nanobody interactions.
Three synthetic nanobody libraries that are compatible with the nanobody discovery platform were designed and manufactured. As detailed in the materials and methods, particularly advantageous consensus nanobody scaffolds from the successful examples available in the Protein Data Bank (PDB) (Berman et al., 2003, Nat Struct Biol 10: 980; Berman et al., 2000, Nucl Acids Res 28: 235-242) were identified. Using custom Python code 1,336 nanobody chains from the PDB into subclasses 1 [988 (74%) chains], 2, [301 (23%) chains], and 3 [47 (3%) chains] based on the position and conformation of their third complementarity determining region (CDR3) loops (
Each library can be screened for new nanobody-GPCR interactions by PCR amplifying and transforming a pool of nanobody amplicons into DCyFIR yeast strains (Kapolka et al., 2020, Proc. Natl. Acad. Sci. USA 117: 13117-13126; Rowe et al., 2020, J. Biol. Chem. 295: 8262-8271). Using this approach, nanobody genes can either be integrated into expression vectors or integrated directly into the genome via CRISPR (
All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, and patent application was specifically and individually indicated to be incorporated by reference.
While some embodiments have been illustrated and described in detail in the appended drawings and the foregoing description, such illustration and description are to be considered illustrative and not restrictive. Other variations to the disclosed embodiments can be understood and effected in practicing the claims, from a study of the drawings the disclosure, and the appended claims. The mere fact that certain measures or features are recited in mutually different dependent claims does not indicate that the combination of these measures or features cannot be used. Any reference signs in the claims should not be construed as limiting the scope.
This application claims the benefit of U.S. Provisional Patent Application Ser. No. 63/385,395 filed Nov. 29, 2022, which is expressly incorporated herein by reference.
This invention was made with government support under R35GM119518 awarded by the National Institutes of Health. The government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
63385395 | Nov 2022 | US |