A computer readable form of the Sequence Listing is filed with this application by electronic submission and is incorporated into this application by reference in its entirety. The Sequence Listing is contained in the file created on Apr. 8, 2021, having the file name “20-596-WO_SeqList_ST25.txt” and is 8 kb in size.
Protein pores play key roles in fundamental biological processes and biotechnological applications such as DNA nanopore sequencing, and hence the design of pore-containing proteins is of considerable scientific and biotechnological interest.
In one aspect, the disclosure provides polypeptides comprising an amino acid sequence at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of any one of SEQ ID NOS: 1-4. In one embodiment, the polypeptide comprises an amino acid sequence at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 1, wherein the polypeptide is identical to the polypeptide of SEQ ID NO:1 at least at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, or more of residues 3, 13-14, 16-18, 20, 21, 23-25, 27-28, 30-32, 40, 44, 47, 49, 56, 65, and 68 in SEQ ID NO:1. In another embodiment, the polypeptide comprises an amino acid sequence at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 1, wherein changes in the polypeptide amino acid sequence relative to SEQ ID NO:1 comprise:
(a) changes in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, or more of residues 1-4, 6-7, 9-10, 32-40, 42, and 70-72 to other polar amino acids;
(b) changes in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, or more of residues 14, 16-18, 20-21, 23-25, 27-28, 30, 49, 56, and 63 to other hydrophobic amino acids; and/or
(c) a change at residue 44.
In a further embodiment, the polypeptide is identical to SEQ ID NO:1 at least at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or all of residues 5, 8, 11-12, 15, 19, 22, 26, 29, 41, 43, 45-48, 50, 51-55, 57-62, 64, and 67-69 in SEQ ID NO:1. In one embodiment, the polypeptide is identical to SEQ ID NO:1 at 1, 2, or all 3 of residues 13, 31, and 65 in SEQ ID NO:1.
In another embodiment, the polypeptide comprises an amino acid sequence at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 2, wherein changes in the polypeptide amino acid sequence relative to SEQ ID NO:2 comprise changes in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more of residues 1-4, 7-8, 10-12, 14-15, 17-19, 21-26, 28-29, 32-33, 36-37, 39-40, 43-44, 46-55, 58, 60, 67, 71, 78, 82, 96, and 98-100 to other polar amino acids. In one embodiment, the polypeptide is identical to SEQ ID NO:2 at least at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30,31, 32, 33, 34, 35, 36, 37, 38, 39, or all of residues 5-6, 9, 13, 16, 20, 27, 30-31, 34-35, 38, 41-42, 45, 56-57, 59, 61, 63-65, 68, 70, 72, 74-75, 77, 79, 81, 84-86, 88-90, 92-93, 95, and 97 in SEQ ID NO:2. In another embodiment, the polypeptide is identical to SEQ ID NO:2 at least at 1, 2, 3, 4, 5, 6, 7, 8, 9, or more of residues 62, 66, 69, 73, 76, 80, 83, 87, 91, and 94 in SEQ ID NO:2.
In one embodiment, the polypeptide comprises an amino acid sequence at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 3, wherein changes in the polypeptide amino acid sequence relative to SEQ ID NO:3 comprise
(a) changes in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more of residues 1-4, 7-8, 10-12, 14, 37, 39-40, 43-44, 46-55, 58, 60, 96, and 98-100 to other polar amino acids; and/or
(b) changes in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or more of residues 17-19, 21-26, 28-29, 32-33, 71, 78, and 82 to other hydrophobic amino acids.
In another embodiment, the polypeptide is identical to SEQ ID NO:3 at least at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30,31, 32, 33, 34, 35, 36, 37, 38, 39, or all of residues 5-6, 9, 13, 16, 20, 27, 30-31, 34-35, 38, 41-42, 45, 56-57, 59, 61, 63-65, 68, 70, 72, 74-75, 77, 79, 81, 84-86, 88-90, 92-93, 95, and 97 in SEQ ID NO:3. In a further embodiment, the polypeptide is identical to SEQ ID NO:3 at 1, 2, or all 3 of residues 15, 36, and 66 in SEQ ID NO:3. In one embodiment, the polypeptide is identical to SEQ ID NO:3 at least at 1, 2, 3, 4, 5, 6, 7, 8, 9, or more of residues 62, 66, 69, 73, 76, 80, 83, 87, 91, and 94 in SEQ ID NO:3. In another embodiment, all optional amino acid residue positions in the polypeptide relative to SEQ ID NO:3 are present.
In one embodiment, the polypeptide is of the formula X1-Z—X2, wherein X1 and X2 independently comprise the polypeptide of any embodiment or combination of embodiments of the polypeptides disclosed herein, and Z comprises an amino acid linker. In another embodiment, the polypeptide is of the formula X1-Z—X2, wherein X1 and X2 independently comprise any embodiment or combination of embodiments of the polypeptides disclosed herein, and Z comprises an amino acid linker. In a further embodiment, the polypeptide comprises an amino acid sequence at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO:4.
In another embodiment, the disclosure provides polypeptide oligomers comprising 2, 3, 4, 5, 6, or more copies of the polypeptide of any embodiment or combination of embodiments of the polypeptides disclosed herein. In one embodiment, the oligomer comprises a homo-hexamer of the polypeptide of any embodiment or combination of embodiments of the polypeptides disclosed herein.
In other aspects, the disclosure provides nucleic acids encoding the polypeptide of any embodiment or combination of embodiments of the polypeptides disclosed herein, expression vector comprising a nucleic acid of the disclosure operatively linked to a suitable control sequence, and host cells comprising the nucleic acid, expression vector, polypeptide, or polypeptide oligomer of any embodiment or combination of embodiments disclosed herein. In one embodiment of host cells of the disclosure, the polypeptide oligomer forms a transmembrane pore in the host cell. In another aspect, the disclosure provides liposomes comprising the polypeptide oligomer of any embodiment herein, wherein the polypeptide oligomer forms a transmembrane channel in the liposome. In a further aspect, the disclosure provides uses or methods of use of the polypeptides, polypeptide oligomers, nucleic acids, vectors, host cells, and liposomes described herein for any suitable purpose, including but not limited to selective transmembrane ion conductance, DNA nanopore sequencing, generating membranes with selective permeabilities, sensing molecules in the environment, and controlling cellular behavior.
Representative gel filtration chromatography and SDS-PAGE of TMH4C4. TMH4C4 protein was purified in the buffer containing 0.2% (weight/volume) n-decyl-β-D-maltopyranoside (DM, Anatrace).
As used herein, the singular forms “a”, “an” and “the” include plural referents unless the context clearly dictates otherwise. “And” as used herein is interchangeably used with “or” unless expressly stated otherwise.
All embodiments of any aspect of the disclosure and appendices can be used in combination, unless the context clearly dictates otherwise.
Unless the context clearly requires otherwise, throughout the description, appendix, and the claims, the words ‘comprise’, ‘comprising’, and the like are to be construed in an inclusive sense as opposed to an exclusive or exhaustive sense; that is to say, in the sense of “including, but not limited to”. Words using the singular or plural number also include the plural and singular number, respectively. Additionally, the words “herein,” “above,” and “below” and words of similar import, when used in this application, shall refer to this application as a whole and not to any particular portions of the application.
As used herein, the amino acid residues are abbreviated as follows: alanine (Ala; A), asparagine (Asn; N), aspartic acid (Asp; D), arginine (Arg; R), cysteine (Cys; C), glutamic acid (Glu; E), glutamine (Gln; Q), glycine (Gly; G), histidine (His; H), isoleucine (Ile; I), leucine (Leu; L), lysine (Lys; K), methionine (Met; M), phenylalanine (Phe; F), proline (Pro; P), serine (Ser; S), threonine (Thr; T), tryptophan (Trp; W), tyrosine (Tyr; Y), and valine (Val; V).
As referred to herein, polar amino acid residues are Gly (G), Ser (S), Thr (T), Cys (C), Tyr (Y), Asn (N), Gln (Q); Asp (D), Glu (E); Lys (K), Arg (R), and His (H). As referred to herein, hydrophobic residues are Ala (A), Val (V), Leu (L), Ile (I), Pro (P), Phe (F), Trp (W), and Met (M).
In a first aspect, the disclosure provides polypeptides comprising an amino acid sequence at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of any one of SEQ ID NOS: 1-4. As described in detail herein, the inventors describe the computational design of protein channels or pores formed by two concentric rings of α-helices that are stable and mono-disperse in both water-soluble and membrane-embedded forms. The protein channels or pores are comprised of polypeptide subunits as claimed herein.
WTSKTDDDPSAQSEQLVAMSLMLIAASLLI
As described in the examples, the polypeptides can tolerate significant substitutions, as described in detail below. For example with respect to SEQ ID NO:1 (TMHC6):
Residues 1-4, 6-7, 9-10, 32-40, 42, and 70-72: Exposed surface residues that can be mutated to other polar residues
Residues 14, 16-18, 20-21, 23-25, 27-28, 30, 49, 56, 63: Exposed lipid interacting residues in the TM domains that can be mutated to other hydrophobic residues.
Residues 5, 8, 11-12, 15, 19, 22, 26, 29, 41, 43, 45-48, 50, 51-55, 57-62, 64, 67-69: Core residues buried inside the protein; useful to hold the hexameric topology / pore formation.
Residues 13-31; 44-65: Transmembrane (TM) domains
Residues 13, 31, and 65: Useful residues to set the orientation of the transmembrane protein in the lipid bilayer.
Residue 44: Ion selectivity filter. This residue provides K+ion selectivity; substitution of this residue can be used to modify ion selectivity or eliminate ion selectivity altogether.
In one embodiment, the polypeptide comprises an amino acid sequence at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 1, wherein the polypeptide is identical to the polypeptide of SEQ ID NO:1 at least at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, or more of residues 3, 13-14, 16-18, 20, 21, 23-25, 27-28, 30-32, 40, 44, 47, 49, 56, 65, and 68 in SEQ ID NO:1. These residues are highlighted in bold in SEQ ID NO:1.
In another embodiment, the polypeptide comprises an amino acid sequence at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 1, wherein changes in the polypeptide amino acid sequence relative to SEQ ID NO:1 comprise:
(a) changes in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, or more of residues 1-4, 6-7, 9-10, 32-40, 42, and 70-72 to other polar amino acids;
(b) changes in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, or more of residues 14, 16-18, 20-21, 23-25, 27-28, 30, 49, 56, and 63 to other hydrophobic amino acids; and/or
(c) a change at residue 44.
In a further embodiment, the polypeptide is identical to SEQ ID NO:1 at least at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or all of residues 5, 8, 11-12, 15, 19, 22, 26, 29, 41, 43, 45-48, 50, 51-55, 57-62, 64, and 67-69 in SEQ ID NO:1. These are the core residues of TMHC6.
In a further embodiment, the polypeptide is identical to SEQ ID NO:1 at 1, 2, or all 3 of residues 13, 31, and 65 in SEQ ID NO:1.
In one embodiment, the polypeptide comprises an amino acid sequence at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of any one of SEQ ID NOS: 2-4. In one embodiment, the polypeptide comprises an amino acid sequence at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 2, wherein changes in the polypeptide amino acid sequence relative to SEQ ID NO:2 comprise changes in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more of residues 1-4, 7-8, 10-12, 14-15, 17-19, 21-26, 28-29, 32-33, 36-37, 39-40, 43-44, 46-55, 58, 60, 67, 71, 78, 82, 96, and 98-100 to other polar amino acids.
As described herein with respect to SEQ ID NO:2 (WSHC8):
Residues 1-4, 7-8, 10-12, 14-15, 17-19, 21-26, 28-29, 32-33, 36-37, 39-40, 43-44, 46-55, 58, 60, 67, 71, 78, 82, 96, 98-100: exposed surface residues; can be mutated to other polar residues.
Residues 5-6, 9, 13, 16, 20, 27, 30-31, 34-35, 38, 41-42, 45, 56-57, 59, 61, 63-65, 68, 70, 72, 74-75, 77, 79, 81, 84-86, 88-90, 92-93, 95, 97: core residues buried inside the protein; useful residues to hold the hexameric topology/pore formation.
Residues 62, 66, 69, 73, 76, 80, 83, 87, 91, 94: pore-facing residues; too many hydrophobic residues may collapse the pore.
In one embodiment, the polypeptide is identical to SEQ ID NO:2 at least at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30,31, 32, 33, 34, 35, 36, 37, 38, 39, or all of residues 5-6, 9, 13, 16, 20, 27, 30-31, 34-35, 38, 41-42, 45, 56-57, 59, 61, 63-65, 68, 70, 72, 74-75, 77, 79, 81, 84-86, 88-90, 92-93, 95, and 97 in SEQ ID NO:2. These are the core residues.
In another embodiment, the polypeptide is identical to SEQ ID NO:2 at least at 1, 2, 3, 4, 5, 6, 7, 8, 9, or more of residues 62, 66, 69, 73, 76, 80, 83, 87, 91, and 94 in SEQ ID NO:2. These are the pore-facing residues.
In another embodiment, the polypeptide comprises an amino acid sequence at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO: 3, wherein changes in the polypeptide amino acid sequence relative to SEQ ID NO:3 comprise
(a) changes in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more of residues 1-4, 7-8, 10-12, 14, 37, 39-40, 43-44, 46-55, 58, 60, 96, and 98-100 to other polar amino acids; and/or
(b) changes in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or more of residues 17-19, 21-26, 28-29, 32-33, 71, 78, and 82 to other hydrophobic amino acids; and
wherein residues in parentheses are optional.
As described herein with respect to SEQ ID NO:3 (TMHC8):
Residues 1-4, 7-8, 10-12, 14, 37, 39-40, 43-44, 46-55, 58, 60, 96, 98-100: exposed surface residues; can be mutated to other polar residues.
Residues 17-19, 21-26, 28-29, 32-33, 71, 78, 82: exposed lipid interacting residues in the TM domains; can be mutated to other hydrophobic residues.
Residues 5-6, 9, 13, 16, 20, 27, 30-31, 34-35, 38, 41-42, 45, 56-57, 59, 61, 63-65, 68, 70, 72, 74-75, 77, 79, 81, 84-86, 88-90, 92-93, 95, 97: core residues buried inside the protein; useful residues to hold the hexameric topology / pore formation.
Residues 62, 66, 69, 73, 76, 80, 83, 87, 91, 94: pore-facing residues; too many hydrophobic residues may collapse the pore.
Residues 16-35 and residues 68-87: TM domains.
Residues 15, 36, 66: useful residues to set the orientation of the transmembrane protein in the lipid bilayer.
In one embodiment, the polypeptide is identical to SEQ ID NO:3 at least at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30,31, 32, 33, 34, 35, 36, 37, 38, 39, or all of residues 5-6, 9, 13, 16, 20, 27, 30-31, 34-35, 38, 41-42, 45, 56-57, 59, 61, 63-65, 68, 70, 72, 74-75, 77, 79, 81, 84-86, 88-90, 92-93, 95, and 97 in SEQ ID NO:3. These are the core residues.
In another embodiment, the polypeptide is identical to SEQ ID NO:3 at 1, 2, or all 3 of residues 15, 36, and 66 in SEQ ID NO:3.
In a further embodiment, the polypeptide is identical to SEQ ID NO:3 at least at 1, 2, 3, 4, 5, 6, 7, 8, 9, or more of residues 62, 66, 69, 73, 76, 80, 83, 87, 91, and 94 in SEQ ID NO:3. These are the pore-facing residues.
In one embodiment, all optional amino acid residue positions in the polypeptide relative to SEQ ID NO:3 are present. In other embodiments, one or more of the optional amino acid residues are absent. In one embodiment, residue 1 can be S or V. In another embodiment, the C-terminal optional residues (98-100) may RHI, RH, R, or absent, or may be absent or may be substituted with other amino acids.
In another embodiment, the polypeptide has the formula X1-Z—X2, wherein X1 and X2 independently comprise the polypeptide of any embodiment or embodiments described above related to SEQ ID NOS:1-3, and Z comprises an amino acid linker. Thus, in various embodiments X1 may comprise any embodiment disclosed herein where SEQ ID NO:1 is the reference sequence, where SEQ ID NO:2 is the reference sequence, or where SEQ ID NO:3 is the reference sequence, and X3 may independently comprise any embodiment disclosed herein where SEQ ID NO:1 is the reference sequence, where SEQ ID NO:2 is the reference sequence, or where SEQ ID NO:3 is the reference sequence. Any suitable linker may be used, including but not limited to those described herein. In certain embodiments, the linker comprises a flexible linker of at least 4 amino acids in length.
In one embodiment, X1 and X3 independently comprise any embodiment disclosed herein where SEQ ID NO:3 is the reference sequence. In these embodiments, X1 and X3 may comprise the same amino acid sequence or may differ. In another embodiment, the polypeptide comprises an amino acid sequence at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the amino acid sequence of SEQ ID NO:4.
ILLSWRSEELIRELEEKGAASEAELARMKQQ
LVIAFVFLILLILLSWRSEELIRELEEKGAA
In another embodiment, the disclosure provides polypeptide oligomers comprising 2, 3, 4, 5, 6, or more copies of the polypeptide of any embodiment or combination of embodiments disclosed herein. The polypeptide oligomers can function, for example, as protein channels or pores in water-soluble or membrane-embedded forms, as discussed in detail in the examples that follow.
In one embodiment, the polypeptide copies in the oligomer are identical, and the oligomer may comprise a homo-oligomer, including but not limited to a homo-dimer, homo-trimer, homo-tetramer, homo-pentamer, or homo-hexamer. In one specific embodiment, the oligomer comprises a homo-hexamer.
As used throughout the present application, the term “polypeptide” is used in its broadest sense to refer to a sequence of subunit D- or L-amino acids, including canonical and non-canonical amino acids. The polypeptides described herein may be chemically synthesized or recombinantly expressed. The polypeptides may be linked to other compounds to promote an increased half-life in vivo, such as by PEGylation, HESylation, PASylation, glycosylation, or may be produced as an Fc-fusion or in deimmunized variants. Such linkage can be covalent or non-covalent as is understood by those of skill in the art.
As will be understood by those of skill in the art, the polypeptides of the invention may include additional residues at the N-terminus, C-terminus, or both that are not present in the polypeptides disclosed herein; these additional residues are not included in determining the percent identity of the polypeptides of the invention relative to the reference polypeptide.
In another aspect the disclosure provides nucleic acids encoding the polypeptides of any embodiment or combination of embodiments of the disclosure. The nucleic acid sequence may comprise single stranded or double stranded RNA or DNA in genomic or cDNA form, or DNA-RNA hybrids, each of which may include chemically or biochemically modified, non-natural, or derivatized nucleotide bases. Such nucleic acid sequences may comprise additional sequences useful for promoting expression and/or purification of the encoded polypeptide, including but not limited to polyA sequences, modified Kozak sequences, and sequences encoding epitope tags, export signals, and secretory signals, nuclear localization signals, and plasma membrane localization signals. It will be apparent to those of skill in the art, based on the teachings herein, what nucleic acid sequences will encode the polypeptides of the disclosure.
In a further aspect, the disclosure provides expression vectors comprising the nucleic acid of any aspect of the disclosure operatively linked to a suitable control sequence. “Expression vector” includes vectors that operatively link a nucleic acid coding region or gene to any control sequences capable of effecting expression of the gene product. “Control sequences” operably linked to the nucleic acid sequences of the disclosure are nucleic acid sequences capable of effecting the expression of the nucleic acid molecules. The control sequences need not be contiguous with the nucleic acid sequences, so long as they function to direct the expression thereof. Thus, for example, intervening untranslated yet transcribed sequences can be present between a promoter sequence and the nucleic acid sequences and the promoter sequence can still be considered “operably linked” to the coding sequence. Other such control sequences include, but are not limited to, polyadenylation signals, termination signals, and ribosome binding sites. Such expression vectors can be of any type, including but not limited plasmid and viral-based expression vectors. The control sequence used to drive expression of the disclosed nucleic acid sequences in a mammalian system may be constitutive (driven by any of a variety of promoters, including but not limited to, CMV, SV40, RSV, actin, EF) or inducible (driven by any of a number of inducible promoters including, but not limited to, tetracycline, ecdysone, steroid-responsive). The expression vector must be replicable in the host organisms either as an episome or by integration into host chromosomal DNA. In various embodiments, the expression vector may comprise a plasmid, viral-based vector, or any other suitable expression vector.
In another aspect, the disclosure provides host cells that comprise the nucleic acids, expression vectors (i.e.: episomal or chromosomally integrated), polypeptide, or polypeptide oligomer of any embodiment or combination of embodiments disclosed herein, wherein the host cells can be either prokaryotic or eukaryotic. For example, the cells can be transiently or stably engineered to incorporate the expression vector of the disclosure, using techniques including but not limited to bacterial transformations, calcium phosphate co-precipitation, electroporation, or liposome mediated-, DEAE dextran mediated-, polycationic mediated-, or viral mediated transfection.
In one embodiment, the polypeptide oligomer forms a transmembrane pore in the host cell as described above and as detailed in the examples. In one such embodiment, the host cell is a eukaryotic cell.
In another aspect, the disclosure provides liposomes comprising the polypeptide oligomer of any embodiment herein, wherein the polypeptide oligomer forms a transmembrane channel in the liposome, as described in detail in the examples that follow. As used herein, a liposome is any spherical vesicle having at least one lipid bilayer.
In another aspect, the disclosure provides methods for use of the polypeptides, polypeptide oligomers, nucleic acids, vectors, host cells, and liposomes described herein for any suitable purpose, including but not limited to selective transmembrane ion conductance,
DNA nanopore sequencing, generating membranes with selective permeabilities, sensing molecules in the environment, and controlling cellular behavior.
Protein pores play key roles in fundamental biological processes and biotechnological applications such as DNA nanopore sequencing, and hence the design of pore-containing proteins is of considerable scientific and biotechnological interest. The de novo design of stable, well-defined transmembrane protein pores capable of conducting ions selectively or large enough to allow passage of small-molecule fluorophores remains an outstanding challenge. Here, we report the computational design of protein pores formed by two concentric rings of α-helices that are stable and mono-disperse in both water-soluble and membrane-embedded forms. Crystal structures of the water-soluble forms of a 12 helical pore with an inner diameter at the narrowest constriction of 4 Å, and of a 16 helical pore with a constriction diameter of 10 Å, are close to the computational design models (0.89 and 2.51 Å C-α RMSD respectively). Patch clamp electrophysiology experiments show that the transmembrane form of the 12-helix pore expressed in insect cells allows passage of ions across the membrane with high selectivity for potassium over sodium, which is blocked by specific chemical modification at the pore entrance. The transmembrane form of the 16-helix pore, but not the 12-helix pore, allows passage of biotinylated Alexa™ Fluor 488 when incorporated into liposomes using in vitro protein synthesis. A cryo-EM structure of the 16-helix transmembrane pore closely matches the design model. The ability to produce structurally and functionally well-defined transmembrane pores opens the door to the creation of designer pores for a wide variety of applications.
Introduction
De novo design of transmembrane proteins with large pores presents a stringent thermodynamic challenge, as there is a larger surface area to volume ratio and hence a lower density of stabilizing interactions. A homo-oligomeric architecture in which a cyclic arrangement of multiple identical copies of a single subunit surrounds the pore is attractive for design as the subunit need not be a large protein. The transformation of soluble oligomeric protein pores into their membrane counterparts is challenging because of the altered thermodynamics of folding in the membrane and possible undesired interactions between non-polar residues introduced to interact with lipids and those at intersubunit interfaces.
We set out to design transmembrane protein pores using a two-step approach. We reasoned that protein pores formed from two concentric rings of α-helices containing buried hydrogen bond networks for structural specificity could be stable in both soluble and, after resurfacing the lipid exposed residues with membrane-compatible hydrophobic residues, transmembrane forms. The increased interhelical interaction surfaces in such two-ring bundles could provide greater stability than single-ring structures, particularly for larger pore sizes, and could act together with buried hydrogen bond networks to yield greater structural robustness to conversion between soluble and transmembrane forms. We then sought to design cylindrical two-ring bundles consisting of a minimum of 12 closely packed helices that are assembled from 2-4 helix monomers, to avoid having to build such large structures from a single polypeptide.
We explored the design of water-soluble pores containing helical bundles with a two-ring 6-fold symmetric topology formed from monomeric subunits composed of an inner and outer helix bridged with a short loop. Backbones were generated by sampling α-helical and superhelical parameters of the assemblies using a generalization of the Crick coiled-coil parameterization'. For each topology, the α-helical phase, superhelical radius, superhelical twist, and axial offset between helices were allowed to vary, while the superhelical phase was predetermined by the symmetry (360/6 =60° for the hexamer). After backbone generation and loop closure, we searched for hydrogen-bond networks across intermolecular interfaces using Rosetta™ HBNet and carried out combinatorial sequence optimization for the remaining residue positions keeping the polar networks found by HBNet fixed. The core residues and backbone parameters of the inner helices were set to those of a previously characterized “single-ring” helical bundle to reduce the design space. Rosetta™ “fold-and-dock” structure prediction calculations were used to investigate the extent to which the designed sequences encode the design target topologies. Designs (
We obtained synthetic genes encoding the selected designs and expressed them in Escherichia coli (E. coli). All 3 hexameric designs selected for experimental validation were well expressed and soluble in E. coli and purified efficiently by nickel-affinity chromatography and size-exclusion chromatography (SEC,
To investigate ion conductance by TMHC6, we performed whole-cell patch clamp experiments on Trichoplusia ni insect cells (Hi5) expressing the designed transmembrane proteins. Using extracellular (bath) and intracellular (electrode) solutions containing 100 mM NaCl and 100 mM NaF, respectively, the TMHC6 construct exhibited a symmetric current/voltage relationship for inward and outward Na current as a function of membrane potential (
In the design model, the extracellular ring of six Glu44 residues is the site of cation entry; we tested this hypothesis by site-directed mutagenesis and chemical modification. Mutation of Glu44 to Phe (E44F) removed the negative charge at the extracellular entry to the pore and reduced the conductance to 308±81 pA/pF, 51.3% of the control (
Taken together, the high ion selectivity, specific inhibition by multivalent cations, and complete block by MTS reagents acting at the extracellular entry to the pore strongly suggest that ion passage is through the designed central transmembrane pore. Leak conductances would not be expected to have these properties. Single channel recordings on TMHC6 did not yield clear signals. The single channel conductance may be too low, perhaps because of the central non-polar lined portion of the pore.
Why is TMHC6 selective for K+? The TMHC6 pore diameter of 3.3 Å is well tuned for the conductance of dehydrated K+. In contrast, voltage-gated sodium and calcium channels, which conduct hydrated Na and Ca' have pores that are 4.6 Å in diameter, well designed for hydrated sodium and calcium ions but not for K+. The ability to design transmembrane pores de novo lays the foundation for broad exploration of the pore diameters and chemical interactions that are required for selective conductance of a wide range of ions. This understanding enables the design of channels which directly modulate cell function. Towards that end, we expressed TMHC6 in a yeast strain that requires potassium for growth, and we found that TMHC6 considerably boosted the growth rate in a K+ dependent fashion (data not shown).
To explore the generation of larger transmembrane pores capable of transporting organic molecules larger than single ions, we designed water-soluble helical bundles with 8-fold symmetry using an approach similar to that described above for WSHC6 except that a starting “single-ring” template was not employed; instead, the structure and sequence of the inner ring of helices was sampled de novo in parallel with those of the outer ring. We obtained synthetic genes encoding selected octameric pore designs and expressed them in E. coli. Four of five of the octameric designs were well expressed and soluble in E. coli, and they could be purified by nickel-affinity chromatography and SEC. However, none of the designs populated only the target octameric state. There are only small differences in interface geometry between C7, C8 and C9 assemblies (the angles between subunits are 51°, 45° and 40°, respectively), and mixtures were observed for most of the designs. We reasoned that achieving a well-defined C8 state would require more precise interface definition; therefore, we carried out a second round of C8 designs with larger inter-subunit interaction surface areas and more hydrogen bond networks across the interface (
We determined the crystal structure for WSHC8 (
Expression of the designed larger transmembrane nanopore in insect cells resulted in cell death, probably because of induced cell permeability, and hence we were unable to assess the channel activity in these cells. Instead, we used a liposome-based assay coupled to in vitro protein synthesis28-30. TMHC6 and TMH4C4 were produced inside liposomes containing streptavidin, and biotinylated Alexa™ Fluor 488 (MW ˜1 kDa) was added outside (data not shown). In vitro synthesized TMHC6 and TMH4C4 had similar elution volumes as the bacterially expressed and purified proteins when analyzed by SEC (data not shown). Consistent with the much larger pore radius of TMH4C4 compared to TMHC6, we observed considerably more accumulation of dye within proteoliposomes containing TMH4C4 than in proteoliposomes containing TMHC6, or empty liposomes (we controlled for differences in protein expression and membrane incorporation of the two proteins by adjusting the template DNA concentration such that the amount of protein present in the membrane was the same). The narrowest dimension of the head group of the fluorophore is approximately 12 Å (
Seeking to obtain an overall structure of the 16-helix transmembrane channel, we used cryogenic electron microscopy (cryo-EM) to determine the three-dimensional (3D) structure of TMH4C4. The protein was concentrated to ˜6 mg/ml, applied to EM grids, and cryo-EM images collected and processed following standard protocols (data not shown). To avoid potential bias, Cl symmetry was applied for automatic image processing and classification, which yielded a dominant 16-helix form containing ˜40% of all the 3D classified particles, with the most continuous and intact map among all classes. Further classification and refinement focused on this set of particles resulted in a 5.9 Å resolution map from 64,739 (out of a total of 1,559,110 for 3D classification) selected particles (data not shown). The cryo-EM structure clearly shows the formation of a 16-helix assembly with a large central pore, consistent with the design model built from the crystal structure of the soluble form. Density encircling the membrane-spanning region likely originates from surrounding detergent molecules. A structure model of TMH4C4 built based on the EM map had some deviation among the four protomers so the tetramer is not perfectly symmetric, but the central pore-containing 16-helix structure corresponding to the original TMH4C4 design is clearly resolved, and aligns well to the design model.
Our advances in designing structurally well-defined transmembrane pores, both inform our understanding of general principles of protein biophysics, and open the door for a wide range of applications. From the membrane protein folding perspective, our success in designing transmembrane homo-oligomeric structures with hydrogen bond network containing subunit interfaces suggests that the combination of the substantial buried surface area and polarity of the interaction surfaces makes them robust to changes in the surrounding environment and reduces confounding by interactions with the purely hydrophobic lipid facing residues during folding. This approach allows the construction of quite substantial pores with environments very different from that of the surrounding lipids: the TMHC6 pore clearly shows selective transmembrane ion conductance, and the 10 Å diameter TMH4C4 pore clearly evident in the cryo-EM structure is lined with polar residues and provides passage to solutes as large as biotinylated Alexa™ Fluor 488.
Custom design now provides a route to understanding the underlying chemistry and physics of solute permeation and selectivity by modulating pore structures and selectivity filters in ways that are not possible with native channels. Our strategy of first designing soluble pore-containing structures, and then converting the stable designs into transmembrane proteins following crystal structure determination, leverages the considerably more straightforward structural characterization of soluble proteins as a key step toward building complex transmembrane proteins with a high success rate (see the Supplementary Information). Additional selectivity filters and gating mechanisms can be incorporated both to test basic concepts and to enable a wide range of applications. For example, in nanotechnology, custom designed pores could provide new routes to generating membranes with selective permeabilities, sensing molecules in the environment, and controlling cellular behavior.
Synthetic peptides as models for ion channel proteins. Acc. Chem. Res. 26, 191-197 (1993).
Helical backbones used for design calculations were generated by sampling the generalized Crick coiled-coil parameters 15,16 as described previously17,18. Only supercoil and, alpha-helical parameters of the inner and outer helices of the monomers were sampled. These parameters include supercoil twist (ω0), supercoil radius (R0), alpha-helical phase (φ1), and the offset along the z-axis (Zoffset). The supercoil phases (φ0) were fixed at 0° for the inner helices and −30° and −22.5° for the outer helices of hexamers and octamers, respectively. The supercoil pitch of the outer helices was constrained to match that of the inner helices to maintain contacts to the inner helices throughout the length of the helical bundles18. The Alpha-helical twists (ω1) were fixed at ideal values of 102.85°/aa for left-handed supercoils (e.g. WSHC6) and 100°/aa for straight helical bundles (e.g. WSHC8). Cyclic symmetries were applied during design calculations. The parameters for the inner helices of WSHC6 were preset according to the fitting result of a previously reported six-helix bundle13 using the Coiled-coil Crick Parameterization web server. Supercoil and alpha-helical parameters used to generate the backbones of WSHC6 and WSHC8 (2nd & 3rd columns) and the fitted parameters for the final design model and the crystal structure of WSHC8 (4th & 5th columns) are shown in Table 1:
We learned from the failed first round of design of helical bundles with large pores that enforcing supercoil twist and knobs-into-holes packing of conventional left-handed coiled coils will cause a big curvature of the generated helices, as shown in
The parametrically generated inner and outer helical backbones were connected before design calculations with short 2-5-residue loops using methods described previously18,35.
Cyclic symmetries were applied at the beginning of the calculations. The backbone geometry was then regularized by minimization in Cartesian space. HBNet18 was used to search for hydrogen-bond networks that span the helical interfaces. With atom pair constraints applied to the newly identified hydrogen-bond networks, Monte Carlo rotamer packing and minimization were performed to design the remaining positions aiming to lower the total energy and optimize the hydrophobic packing around the hydrogen-bond networks. A final minimization and side-chain repacking step was carried out without atom pair restraints on hydrogen-bonding residues to evaluate how well the networks remained intact in the absence of the constraints. Low energy designs with intact hydrogen-bond networks were tested using Rosetta™ “fold-and-dock” structure prediction calculations19. Sequences for which the predicted lowest-energy structures converge on the design models without additional energy minima were selected for experimental characterizations (Table 2).
We designed transmembrane versions of WSHC6 and WSHC8 pores by resurfacing the outside of the crystal structures with patterned hydrophobic residues, and RK- and YW-rings at the intracellular and extracellular boundary region, respectively8. Briefly, the N- and C-termini of the transmembrane designs are designed to localize in the cytoplasmic side, by adding an RK ring with Arg and Lys residues close to the termini. YW ring, designed at the lipid-water boundary on the periplasmic side, set up the register in the membrane. We designed the hydrophobic transmembrane span in between the YW and RK rings with a length of 26 and 31 Å for TMHC6 and TMHC8, respectively. Hydrophobic residues are designed based on amino acid propensity in the membrane, replacing all polar residues exposed to the membrane. The crystal structure of WSHC8 was used as the starting model.
The residues surrounding the central pores were also redesigned to modulate compatibility within the membrane and channel conductance: if these residues are too hydrophobic, some lipids may block the pores; if too polar, the designed TM segment may not be able to insert into the membrane. Polar residues were designed in the TM pore region, while membrane compatibility was predicted by TMHMM36 software. For TMHC6, we designed a ring of glutamate residues (E-ring) and two rings of lysine residues (K-rings) at the opening of the central channel on the extracellular and intracellular side, respectively, to modulate channel conductance. For TMHC8, in each monomer, we redesigned 4 charged residues surrounding the pore to apolar residues for insertion into the membrane, leaving 4 polar residues in the TM region to make a relative hydrophilic central channel. We ordered two designs for TMHC8, both of which expressed well, enriched in the membrane fraction. TMHMM36 software predicts that both TMHC6 and TMHC8 have two transmembrane segments.
All the figures for structures and maps are prepared in ChimeraTM34 or Pymol™. The constriction sites in the channels and pores are calculated using HOLE20. For the WSHC6 channel structure, the narrowest constriction is at Leu51 with a diameter of approximately 4 Å as calculated. For the TMHC6 design model, the narrowest constriction is at Glu44 with a diameter of approximately 3.3 Å. For the WSHC8 channel structure, the narrowest constriction of this channel has an inner diameter of approximately 10 Å at Lys80. For the TMH4C4 pore, the narrowest constriction is at Lys80 and Lys184 with a diameter of approximately 10 Å, calculated from EM structure.
Chemicals used were of the highest grade commercially available and were purchased from Sigma-Aldrich (St. Louis, Mo., USA), Invitrogen (Carlsbad, Calif,, USA), or Qiagen (Hilden, Germany). Detergents were from Anatrace (Maumee, Ohio, USA).
Synthetic genes were obtained from Genscript Inc. (Piscataway, N.J., USA) or IDT (Coralville, Iowa, USA) and cloned into the pET28b expression vector via NdeI/XhoI restriction sites. The plasmids were transformed into chemically competent E. coli Lemo21(DE3) cells (NEB, Ipswich, Mass.). Gene expression was facilitated by growing pre-cultures in Terrific Broth (TB) medium with a final concentration of 50 μg/ml kanamycin overnight at 37° C. 10 ml pre-cultures were used to inoculate 1 L of autoinduction medium37, again containing 50 μg/ml kanamycin for plasmid selection. The cultures were grown at 37° C. for 3 hours and then 18° C. overnight for protein expression. Cells were harvested by centrifugation.
2.3 Cell lysis and Purification of Water-Soluble Proteins
Cells were resuspended and homogenized in lysis buffer containing 50 mM Tris-HCl pH 8.0 and 300 mM NaCl 20 mM imidazole, and lysed by sonication in the presence of DNAse and EDTA-free protease inhibitor tablet (ThermoFisher Scientific). Lysates were cleared by centrifugation at 4° C. 18,000 g for at least 30 minutes and applied to Ni-NTA (Qiagen) columns pre-equilibrated in the lysis buffer. The column was washed with 5-10 column volumes (CV) of wash buffer (50 mM Tris, 300 mM NaCl, 30 mM Imidazole, pH 8.0), followed by 5 CV of high-salt wash buffer (50 mM Tris, 1 M NaCl, 30 mM Imidazole, pH 8.0), and then 5 CV of high-imidazole wash buffer (50 mM Tris, 300 mM NaCl, 60 mM
Imidazole, pH 8.0). Protein was eluted with 50 mM Tris, 300 mM NaCl, 500 mM Imidazole, pH 8.0. After concentration, proteins were further purified by gel filtration (Superdex-200 increase 10/300 GL; GE Healthcare) using lx phosphate buffer saline (PBS), pH 7.4. The N-terminal hexa-histidine tag was removed with restriction grade thrombin (EMD Millipore) overnight at 30° C. After full cleavage, the reaction is stopped by addition of phenylmethanesulfonyl fluoride (PMSF), followed by another round of gel filtration purification.
Synthetic genes were obtained from Genscript Inc. (Piscataway, N.J., USA) or IDT (Coralville, Iowa, USA) and cloned into the pET29b expression vector via Ndel/Xhol restriction sites. The plasmids were transformed into chemically competent E. coli Lemo21(DE3) cells (NEB, Ipswich, Mass.). Gene expression was facilitated by growing pre-cultures in Luria-Bertani broth (LB) medium with a final concentration of 50 μg/ml kanamycin overnight at 37° C. 10 ml pre-cultures were used to inoculate 1 L of LB medium, again containing 50 μg/ml kanamycin for plasmid selection. The cultures were grown at 37° C. for 3-4 hours until an OD600 of 0.8-1.0 was reached and then 18° C. overnight for protein expression. Cells were harvested by centrifugation.
Cells were resuspended and homogenized in lysis buffer containing 25 mM Tris-HCl pH 8.0 and 150 mM NaCl, and lysed by sonication in the presence of serine protease inhibitor phenylmethanesulfonyl fluoride (PMSF). Cell debris was removed by low-speed centrifugation for 10 minutes. The supernatant was collected and ultracentrifuged for 1 h at 150,000 g. The membrane fraction was collected and homogenized in buffer containing 25 mM Tris-HCl pH 8.0 and 150 mM NaCl. n-Decyl-β-D-Maltopyranoside (DM; Anatrace) was added to the membrane suspension to a final concentration of 1.5% (w/v) and then incubated for 2 h at 4° C. After another ultracentrifugation step at 150,000g for 30 min, the supernatant was collected and loaded on Ni2+-nitrilotriacetate affinity resin (Ni-NTA; Qiagen), followed by a wash with 25 mM Tris-HCl pH 8.0, 150 mM NaCl, 30 mM imidazole and 0.2% DM. Proteins were eluted with buffer containing 25 mM Tris-HCl pH 8.0, 150 mM NaCl, 30 mM imidazole and 0.2% DM. After concentration to 2-3 mg ml−1, proteins were further purified by gel filtration (Superdex™-200 10/30; GE Healthcare). The buffer for gel filtration contained 25 mM Tris-HCl pH 8.0, 150 mM NaCl and 0.2% DM. To prepare samples for Cryo-EM, TMH4C4 protein was purified in the buffer containing 0.02% (weight/volume) n-dodecyl-β-D-maltopyranoside (DDM, Anatrace). The major peak fractions were taken for further Cryo-EM investigation. For AUC experiments, the proteins, originally in 0.2% DM buffer, were buffer exchanged into the buffer for AUC by gel filtration, which contained 20 mM sodium phosphate, pH 7.0 and 200 mM NaCl supplemented with 0.5% Pentaethylene Glycol Monooctyl Ether (C8E5).
CD wavelength-scan measurements were made on an AVIV™ CD spectrometer model 420. Protein concentrations ranged from 0.1-0.2 mg/ml in PBS (pH 7.4) for soluble proteins and PBS (pH 7.4) plus 0.2% DM for transmembrane proteins. Wavelength-scan spectra from 260 to 190 nm were recorded in triplets and averaged. The scanning increment for full wavelength scans was 1 nm. Temperature melts were conducted in 2° C. steps (heating rate of 2° C./min) and recorded by following the absorption signal at a wavelength of 220 nm. Three sets of wavelength scan spectra were recorded at 25° C., 95° C. and after cooling down to 25° C.
Analytical ultracentrifugation (sedimentation velocity and sedimentation equilibrium) experiments were carried out using a Beckman XL-I analytical ultracentrifuge (Beckman Coulter) equipped with an eight-cell An-50 Ti rotor. The protein samples were centrifuged at the speed of 40 k rpm for sedimentation velocity (SV) experiments. Sedimentation-equilibrium (SE) experiments were then performed on samples that showed monodisperse distribution (>90%) of sedimentation coefficients by SV to determine the MW of the discrete species. The water-soluble proteins were centrifuged in PBS (pH7.4). The transmembrane proteins were centrifuged in a buffer containing 20 mM sodium phosphate, pH 7.0, 200 mM NaCl and 0.5% C8E5. Density matching was unnecessary as the solvent density was equal to that of C8E5. For sedimentation equilibrium, data were collected by UV absorbance at 230 nm at 20° C. for three protein concentrations in the range of 0.2-0.8 OD23o. Samples were centrifuged at three different speeds corresponding to 6, the reduced molecular weight38, of 2, 3 and 4.5 cm−2, respectively. Data were globally fit to a single ideal species model using the program SEDPHAT. The partial specific volume, buffer density, and viscosity were calculated using SEDNTERP.
Samples were purified by gel filtration in PBS (pH7.4) with 1% glycerol; fractions preceding the void volume of the column were used as blanks for buffer subtraction. Scattering measurements were performed at the SIBYLS™ 12.3.1 beamline at the Advanced Light Source39. For each sample, data were collected for two different concentrations to test for concentration-dependent effects; “high” concentration samples ranged from 4-7 mg/ml and “low” concentration samples ranged from 1-2 mg/ml. Data were analyzed using ATSAS™ software40 and the experimental scattering profiles were fit to design models using CRYSOLTM41.
2.9 X-Ray Crystallography
WSHC8 samples for crystallization were purified with an additional gel filtration step using a Superdex™ 200 Increase 10/300 GL column (GE Healthcare) with 20 mM Tris-HCl buffer pH8.0, 100 mN NaCl. Pure fractions were pooled and concentrated to 34 mg/mL. Crystallization screens were prepared using a 5-position deck Mosquito™ Crystal (ttplabtech) with an active humidity chamber. Crystals were grown by vapor diffusion in hanging drops at 17° C. Single crystals for data collection were grown in the No. 85 crystallization solution of JCSG+ suite (Qiagen) containing 0.3 M magnesium formate and 0.1 M Bis-Tris pH5.5. WSHC8 crystals were frozen in liquid nitrogen in the crystal growing buffer with additional 20% glycerol as cryo-protectant. The X-ray datasets were collected at the Advanced Light Source at Lawrence Berkeley National Laboratory with beamlines 8.2.1. Crystals are grown in the space group of P21212. The structure of WSHC8 was determined using molecular replacement with the Rosetta™ design model. The model building was done with COOTTM45. Structure refinement performed with PHENIXTM42.
# I/σ(I) (along a*, b* and c* axes)
Recombinant baculovirus was generated by using the Bac-to-Bac™ system (Invitrogen). Trichoplusia ni insect cells (HiS, Thermo Fisher) were grown on 35-mm Petri dishes in Grace's™ insect medium (Gibco) supplemented with FBS (10%) and antibiotics (100 μg/ml streptomycin and 100 U/ml penicillin). Cells were infected by replacing the incubation medium with a medium containing the baculovirus encoding the designed channels protein constructs. After 1 h, 2 ml incubation medium was added to the virus-containing medium. Cells were maintained at 25-27° C. for at least 24 h before the study.
Whole-cell patch clamp currents were recorded using an amplifier (Axopatch™ 200; Molecular Devices) with glass micropipettes resistance (1.5-3 MΩ). Capacitance was subtracted and series resistance was compensated using internal amplifier circuitry. To test the ion selectivity of the designed channels, Hi5 cells expressing the design of interest were bathed in a solution containing 100 mM of the chloride salt of the monovalent cations K+, Na+, Cs+, and CH3NH3+. The patch pipettes contained the equivalent concentration of the fluoride salt of the same cation, except for CH3NH3+, for which the pipette contained 100 mM CsF. The standard voltage-clamp protocol for measuring ionic currents consisted of 20-ms test pulses from a holding potential of 0 mV to voltages ranging from −100 mV to +100 mV in 10-mV steps. No leak subtraction protocols were used during the measuring of currents. The data were filtered at 2 KHz. For analysis of measured ionic current, the value of the current at the end of each pulse was measured. All current measurements were normalized to the cell capacitances. Each measurement is carried out on a different cell and the reported values are for the number of cells studied. The error bars in figures indicate the standard error of measurement (s.e.m.). Voltage-clamp pulses were generated, and currents were recorded using Pulse™ software controlling an Instrutech™ ITC18 interface (HEKA). Data was analyzed using Igor Pro 6.37 software (WaveMetrics).
The methanethiosulfonate (MTS) reagents [2-(trimethy-lammonium)ethyl] methanethiosulfonate (MTSET), N,N-Dibutyl-N-[2-(2-methyl-2,2-dioxidodithio)ethyl]-1-butanaminium Bromide; (MTS-TBAE) and sodium(2-sulfonatoethyl) methanethiosulfonate (MTSES) were purchased from Toronto Research Chemicals (North York, Canada). A 10 mM stock solution of MTS reagents dissolved in water was prepared, and aliquots of the stock solution were kept on ice until used. Before perfusing MTS reagents, the previously mentioned pulse protocol was applied to measure the peak current at +100 mV. Then, cells were perfused with 2.5 mM MTS reagent for 5-10 min while they are being held at 0 mV. This time was needed to allow the reagent to react with the SH groups of cysteine residues.
This application claims priority to U.S. Provisional Patent Application Ser. No. 63/017810 filed Apr. 30, 2020, incorporated by reference herein in its entirety.
This invention was made with government support under Grant No. FA9550-18-1-0297, awarded by the Air Force Office of Scientific Research. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2021/029533 | 4/28/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63017810 | Apr 2020 | US |