 
                 Patent Application
 Patent Application
                     20250163400
 20250163400
                    The instant application contains a Sequence Listing which has been submitted electronically in XML format and is hereby incorporated by reference in its entirety. Said XML copy created on Nov. 14, 2024, is named 061291-522001WO-new.xml and is 160 KB in size.
The invention relates to protein nanostructures, computational methods used to design protein nanostructures, and uses thereof in, for example, vaccines.
Protein nanostructures may be used to display vaccine antigens, as gene therapy vectors, or for other purposes.
The nanostructure I53-50, which is described in US 2016/0122392 A1, is one example of a computationally designed nanostructure. To design I53-50, the structures of the trimeric 2-keto-3-deoxy-6-phosphogluconate (KDPG) aldolase from Thermotoga maritima, (Protein Data Bank entry 1WA3) and the pentameric Lumazine synthase RibH2 from Mesorhizobium loti (20BX) were computationally docked to one another, such that the symmetry axes of the trimeric components and pentameric components align to the shared symmetry elements of an icosahedron; then the protein-protein interfaces between the trimeric components and the pentameric components were modified in silico to drive self-assembly of the two components into a nanostructure with icosahedral symmetry composed of trimers and pentamers having 3-fold and 5-fold symmetry axes-termed an 153 architecture. The resulting polypeptide sequences were expressed and purified, and it was shown experimentally that the polypeptides, as predicted, would self-assemble into the intended two-component nanostructure having 153 architecture. This two-component nanostructure (I53-50) comprised 60 copies of each polypeptide component, that is 20 copies of the designed trimeric component based on 1WA3 (termed “I53-50A”) and 12 copies of the designed pentameric component (termed “I53-50B”).
Another example of designed nanostructure, based on the same trimeric component and termed I3-01, is described in US 2018/0030429 A1. To design I3-01, the structure of KDPG (PDB entry 1WA3) was docked against itself alone; then the interfaces between the trimeric components were modified in silico to drive self-assembly of the designed nanostructure. The interface residues selected were different from the interface residues in the two-component nanostructure. The resulting single polypeptide sequence was expressed and purified, and it was shown experimentally that this polypeptide, as predicted would spontaneously self-assemble into a one-component icosahedral nanostructure having subunits aligned to the icosahedral 3-fold and new protein-protein interfaces on the icosahedral 2-fold symmetry axes-termed an 13 architecture. This one-component nanostructure (I3-01) comprised 60 copies of the polypeptide, which is 20 copies of the designed trimeric component.
Both of these designed nanostructures, I53-50 and I3-01, and variants thereof, have been employed to make vaccine candidates. For example, US 2020/0392187 A1 describes a two-component nanostructure composed of I53-50A fused the fusion (F) protein of a pneumovirus, and self-assembled with I53-50B to form an icosahedral nanostructure having a F protein trimer display on each three-fold axis. As another example, WO 2019/241483 A1 describes a one-component nanostructure composed of I3-01 C-terminally fused engineered envelope (Env) proteins of HIV-1. Nanostructures of this type have been shown to be effective vaccines and are currently in human clinical trials. Specifically, I53-50 nanostructures displaying F proteins from Respiratory Syncytial Virus (RSV) or human Metapneumovirus (hMPV), or the Spike (S) protein of SARS-CoV2, are in clinical trials as vaccines.
Nonetheless, there remains a need in the art for novel protein nanostructures capable of displaying other antigens. The present disclosure addresses that need.
The present disclosure provides a polypeptide that is a circular permutation of I53-50A, comprising an assembly domain, comprising, in N- to C-terminal order, a N-terminal polypeptide segment, a linking polypeptide segment, and a C-terminal polypeptide segment, wherein the N-terminal polypeptide segment comprises residues 74-201 of SEQ ID NO: 1 or a variant thereof, and the C-terminal polypeptide segment comprises residues 1-73 of SEQ ID NO: 1 or a variant thereof.
The present disclosure provides a polypeptide that is a circular permutation of I53-50A, comprising an assembly domain, comprising, in N- to C-terminal order, a N-terminal polypeptide segment, a linking polypeptide segment, and a C-terminal polypeptide segment, wherein the N-terminal polypeptide segment comprises residues 107-201 of SEQ ID NO: 1 or a variant thereof, and the C-terminal polypeptide segment comprises residues 1-106 of SEQ ID NO: 1 or a variant thereof.
The present disclosure provides a polypeptide that is a circular permutation of I53-50A, comprising an assembly domain, comprising, in N- to C-terminal order, a N-terminal polypeptide segment, a linking polypeptide segment, and a C-terminal polypeptide segment, wherein the N-terminal polypeptide segment comprises residues 128-201 of SEQ ID NO: 1 or a variant thereof, and the C-terminal polypeptide segment comprises residues 1-127 of SEQ ID NO: 1 or a variant thereof.
In some embodiments, variants of SEQ ID NO: 1 are at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 1.
In some embodiments, the N-terminal polypeptide segment and the C-terminal polypeptide segment comprises polypeptide sequences each selected from pairs A, B, or C provided in the Sequence Table, or from variants thereof having at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical thereto.
In some embodiments, the linking polypeptide segment comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to any of one of SEQ ID NOs: 8-21.
In some embodiments, the assembly domain comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to any one of SEQ ID NOs: 22-24.
In some embodiments, the assembly domain comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a first polypeptide sequence in Table 2 or Table 4.
The present disclosure provides a polypeptide that is a variant of I53-50A having a C-terminal extension, comprising an assembly domain, the assembly domain comprising, in N- to C-terminal order, a base polypeptide segment and an extending polypeptide segment, wherein the base polypeptide segment comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical to residues 1-201 of SEQ ID NO: 1.
In some embodiments, the extending polypeptide segment comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a polypeptide sequence in Table 1.
In some embodiments, the assembly domain comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to any one of SEQ ID NOs: 22-25.
In some embodiments, the assembly domain comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a first polypeptide sequence in Table 2 or Table 4.
In some embodiments, the polypeptide comprises one or more amino acid residues at interface positions such that the polypeptide is capable of self-assemble to form a trimeric component of one-component nanostructure.
In some embodiments, the polypeptide comprises one or more amino acid residues at interface positions such that the polypeptide is capable of self-assemble to form a trimeric component of two-component nanostructure.
In some embodiments, the polypeptide self-assembles to form a trimeric component of a nanostructure, wherein the C terminus of the assembly domain is accessible on the surface of the nanostructure.
In some embodiments, the polypeptide self-assembles to form a trimeric component, optionally wherein the distance from the C terminus of the assembly domain to the three-fold axis is less than 30 Å, less than 25 Å, or less than 20 Å, or between 10 Å and 30 Å, between 15 Å and 30 Å, between 15 Å and 25 Å, or between 20 Å and 25 Å.
In some embodiments, the polypeptide self-assembles to form a soluble trimer, wherein the C terminus of the assembly domain is accessible on the surface of the soluble trimer.
In some embodiments, the polypeptide self-assembles to form a soluble trimer, wherein the C terminus of the assembly domain is proximal to the three-fold axis of the soluble trimer, optionally wherein the distance from the C terminus to the three-fold axis is less than 30 Å, less than 25 Å, or less than 20 Å, or between 10 Å and 30 Å, between 15 Å and 30 Å, between 15 Å and 25 Å, or between 20 Å and 25 Å.
In some embodiments, the polypeptide is a fusion protein comprising, in N- to C-terminal order, the assembly domain, optionally a polypeptide linker, and a heterologous polypeptide.
In some embodiments, the heterologous polypeptide is an antigen.
In some embodiments, the antigen is an ectodomain of a surface protein of a pathogenic organism, optionally a virus, or an antigenic fragment thereof.
In some embodiments, the antigen is an OspA or antigenic fragment thereof, preferably an OspA of Borrelia burgdorferi sensu lato.
In some embodiments, the antigen is an ectodomain of viral glycoprotein, or an antigenic fragment thereof.
In some embodiments, the antigen is an ectodomain of bacterial protein, or an antigenic fragment thereof.
The present disclosure provides a protein nanostructure, comprising a first component comprising a first polypeptide, and optionally a second component comprising a second polypeptide, wherein the first polypeptide is a polypeptide according to the present disclosure.
In some embodiments, the first component is a trimeric component comprising three copies of the first polypeptide.
In some embodiments, the nanostructure comprises the second component, and the second component comprises a second assembly domain comprising a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to: SEQ ID NO: 26 or 27.
In some embodiments, the second component is a pentamer comprising five copies of the second polypeptide.
In some embodiments, the nanostructure comprises 20 copies of the first component.
In some embodiments, the nanostructure further comprises 12 copies of the second component.
In some embodiments, the C terminus of the first polypeptide is accessible on the surface of the nanostructure.
In some embodiments, the first polypeptide is a fusion protein comprising, in N- to C-terminal order, the first assembly domain, optionally a polypeptide linker, and a heterologous polypeptide.
In some embodiments, the heterologous polypeptide is an antigen.
In some embodiments, the antigen is an ectodomain of a surface protein of a pathogenic organism, optionally a virus, or an antigenic fragment thereof.
In some embodiments, the antigen is an OspA or antigenic fragment thereof, preferably an OspA of Borrelia burgdorferi sensu lato.
In some embodiments, the antigen is an ectodomain of viral glycoprotein, or an antigenic fragment thereof.
In some embodiments, the antigen is an ectodomain of bacterial protein, or an antigenic fragment thereof.
In some embodiments, the nanostructure has an 153 architecture and/or a quaternary structure substantially similar to I53-50.
The present disclosure provides a polynucleotide encoding a nanostructure disclosed herein or a polypeptide disclosed herein.
The present disclosure provides a delivery vehicle, comprising a polynucleotide disclosed herein, optionally a viral vector or a lipid nanoparticle.
The present disclosure provides a pharmaceutical composition, comprising a nanostructure disclosed herein, a polynucleotide disclosed herein, or a delivery vehicle disclosed herein, and a pharmaceutically acceptable carrier.
The present disclosure provides a vaccine, comprising a nanostructure disclosed herein, a polynucleotide disclosed herein, or a delivery vehicle disclosed herein, a pharmaceutically acceptable carrier, and optionally an adjuvant.
The present disclosure provides a host cell suitable for expression of a nanostructure disclosed herein or a polypeptide disclosed herein; and/or comprising a polynucleotide disclosed herein.
The present disclosure provides a method of making a polypeptide or nanostructure, comprising culturing a host cell disclosed herein under conditions suitable for expression of the polypeptide or nanostructure.
The present disclosure provides a method of generating an immune response to an antigen or to a pathogenic organism in a subject in need thereof, comprising administering to the subject a vaccine as disclosed herein, optionally via intramuscular injection or inhalation.
The present disclosure provides a method of immunizing a subject against infection by a pathogen, comprising administering to the subject a vaccine as disclosed herein, optionally via intramuscular injection or inhalation.
The present disclosure provides a composition or method as described herein.
Any aspect or embodiment described herein can be combined with any other aspect or embodiment as disclosed herein.
    
    
    
    
    
    
    
    
    
The present disclosure relates generally to polypeptides for forming nanostructures, nanostructures, and uses thereof. In some embodiments, the disclosure provides polypeptides having disclosed sequences. In some embodiments, the polypeptides form nanostructure components in which the C terminus of the polypeptide is accessible on the surface of the nanostructure. In some embodiments, the polypeptide is a fusion protein comprising, in N- to C-terminal order, an assembly domain, optionally a linker, and a heterologous polypeptide, such as an antigen or antigenic fragment thereof.
Computationally designed protein nanomaterials are useful platforms for delivery of macromolecules, and vaccine design. The characteristics that make a particular nanomaterial useful include, but are not limited to, modularity, spontaneous self-assembly across a useful range of concentrations, stability, accessible termini, and particle size. Termini availability is constrained by the components used for designing a particular nanomaterial, and the orientation of the component within the designed architecture. Without wishing to be bound by theory, to ensure that any genetically linked domain is properly oriented with respect to the surface of the nanomaterial, the local structure of the termini is a contributing element. The present disclosure demonstrates that circular permutation can be an effective method for changing the accessibility of termini. In some embodiments, de novo designed termini extensions that are well ordered can also change termini accessibility. In some embodiments, both techniques are used to change the termini availability of a nanostructure (e.g., the protein nanomaterial I53-50). In some embodiments, a nanomaterial designed using circular permutation and/or de novo designed termini extensions may display the Borrelia burgdorferi sensu lato antigen OspA. In some embodiments, the techniques to change the termini availability described herein may be applied to a I3-01 protein nanomaterial.
The nanostructures of the present disclosure provide an antigen fused to the C terminus of a first component such that the antigen is displayed on the surface of the nanostructure. Without wishing to be bound by theory, fusion to the C terminus may increase or alter the immune response to the antigen. In some cases, fusion to the C terminus may promote induction of a protective and/or functional immune response in the subject. In embodiments, the nanostructures comprise a fusion between the C terminus and the N terminus of the first component via a novel linking polypeptide sequences as shown in Table 3. In embodiments, the nanostructures comprise sequence breaks which generate novel N- and C-termini as compared to a reference sequence fused to antigens or antigenic fragments.
The present disclosure provides a polypeptide that is a circular permutation of I53-50A, comprising an assembly domain, comprising, in N- to C-terminal order, a N-terminal polypeptide segment, a linking polypeptide segment, and a C-terminal polypeptide segment, wherein the N-terminal polypeptide segment comprises residues 74-201 of SEQ ID NO: 1 or a variant thereof, and the C-terminal polypeptide segment comprises residues 1-73 of SEQ ID NO: 1 or a variant thereof.
  
    
      
        
        
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
        
      
    
  
The present disclosure provides a polypeptide that is a circular permutation of 153-50A, comprising an assembly domain, comprising, in N- to C-terminal order, a N-terminal polypeptide segment, a linking polypeptide segment, and a C-terminal polypeptide segment, wherein the N-terminal polypeptide segment comprises residues 107-201 of SEQ ID NO: 1 or a variant thereof, and the C-terminal polypeptide segment comprises residues 1-106 of SEQ ID NO: 1 or a variant thereof.
The present disclosure provides a polypeptide that is a circular permutation of I53-50A, comprising an assembly domain, comprising, in N- to C-terminal order, a N-terminal polypeptide segment, a linking polypeptide segment, and a C-terminal polypeptide segment, wherein the N-terminal polypeptide segment comprises residues 128-201 of SEQ ID NO: 1 or a variant thereof, and the C-terminal polypeptide segment comprises residues 1-127 of SEQ ID NO: 1 or a variant thereof.
In some embodiments, variants of SEQ ID NO: 1 are at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 1.
In some embodiments, the N-terminal polypeptide segment and the C-terminal polypeptide segment comprises polypeptide sequences each selected from pairs A, B, or C provided in the Sequence Table, or from variants thereof having at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical thereto.
In some embodiments, the N-terminal polypeptide segment and the C-terminal polypeptide segment comprises polypeptide sequences each selected from pairs A, B, or C, or from variants thereof at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical thereto:
  
    
      
        
        
        
        
          
            
          
          
            
            
            
          
          
            
          
        
        
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
          
          
            
          
        
      
    
  
In some embodiments, the linking polypeptide segment comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to any of one of SEQ ID NOs: 8-21.
In some embodiments, the assembly domain comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to any one of SEQ ID NOs: 22-24.
In some embodiments, the assembly domain comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a first polypeptide sequence in Table 2 or Table 4.
The present disclosure provides a polypeptide that extends the C terminus of I53-50A, comprising an assembly domain, the assembly domain comprising, in N- to C-terminal order, a N-terminal polypeptide segment and an extending polypeptide segment, wherein the N-terminal polypeptide segment comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identical to residues 1-201 of SEQ ID NO: 1.
In some embodiments, the extending polypeptide segment comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a polypeptide sequence in Table 1.
As used herein, the term “linking polypeptide segment” refers to a polypeptide that connects the C terminus of a C-terminal polypeptide segment (e.g., a C-terminal polypeptide segment of I53-50A or a variant thereof) to an N-terminal polypeptide segment, to computationally generate a circular polypeptide chain in the process of circular permutation. After a circular polypeptide is computationally generated, breakpoints between the secondary structure elements are identified to create an N terminus for the designed polypeptide, which then may be expressed.
As used herein, the term “extending polypeptide segment” refers to a polypeptide that extends the C terminus of a base polypeptide segment (e.g., a polypeptide segment I53-50A or a variant thereof. In embodiments, the extending polypeptide segment may extend the C terminus to near to the N terminus of the base polypeptide segment without connecting the C terminus to the N terminus.
In some embodiments, the assembly domain comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to any one of SEQ ID NOs: 22-25.
In some embodiments, the assembly domain comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a first polypeptide sequence in Table 2 or Table 4.
In some embodiments, the polypeptide comprises one or more amino acid residues at interface positions such that the polypeptide is capable of self-assemble to form a trimeric component of one-component nanostructure.
In some embodiments, the polypeptide comprises one or more amino acid residues at interface positions such that the polypeptide is capable of self-assemble to form a trimeric component of two-component nanostructure.
In some embodiments, the polypeptide self-assembles to form a trimeric component of a nanostructure, the C terminus of the assembly domain is accessible on the surface of the nanostructure.
In some embodiments, the polypeptide self-assembles to form a trimeric component, optionally wherein the distance from the C terminus of the assembly domain to the three-fold axis is less than 30 Å, less than 25 Å, or less than 20 Å, or between 10 Å and 30 Å, between 15 Å and 30 Å, between 15 Å and 25 Å, or between 20 Å and 25 Å.
In some embodiments, the polypeptide self-assembles to form a soluble trimer, the C terminus of the assembly domain is accessible on the surface of the soluble trimer.
In some embodiments, the polypeptide self-assembles to form a soluble trimer, the C terminus of the assembly domain is proximal to the three-fold axis of the soluble trimer, optionally wherein the distance from the C terminus to the three-fold axis is less than 30 Å, less than 25 Å, or less than 20 Å, or between 10 Å and 30 Å, between 15 Å and 30 Å, between 15 Å and 25 Å, or between 20 Å and 25 Å.
In some embodiments, the polypeptide is a fusion protein comprising, in N- to C-terminal order, the assembly domain, optionally a polypeptide linker, and a heterologous polypeptide.
In some embodiments, the heterologous polypeptide is an antigen.
In some embodiments, the antigen is an ectodomain of a surface protein of a pathogenic organism, optionally a virus, or an antigenic fragment thereof.
In some embodiments, the antigen is an OspA or antigenic fragment thereof, preferably an OspA of Borrelia burgdorferi sensu lato.
In some embodiments, the antigen is an ectodomain of viral glycoprotein, or an antigenic fragment thereof.
In some embodiments, the antigen is an ectodomain of bacterial protein, or an antigenic fragment thereof.
The present disclosure provides a protein nanostructure, comprising a first component comprising a first polypeptide, and optionally a second component comprising a second polypeptide, wherein the first polypeptide is a polypeptide according to the present disclosure.
In some embodiments, the first component is a trimeric component comprising three copies of the first polypeptide.
In some embodiments, the nanostructure comprises the second component, and the second component comprises a second assembly domain comprising a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to: SEQ ID NO: 26 or 27.
In some embodiments, the second component is a pentamer comprising five copies of the second polypeptide.
In some embodiments, the nanostructure comprises 20 copies of the first component.
In some embodiments, the nanostructure further comprises 12 copies of the second component.
In some embodiments, the C terminus of the first polypeptide is accessible on the surface of the nanostructure.
In some embodiments, the first polypeptide is a fusion protein comprising, in N- to C-terminal order, the first assembly domain, optionally a polypeptide linker, and a heterologous polypeptide.
In some embodiments, the heterologous polypeptide is an antigen.
In some embodiments, the antigen is an ectodomain of a surface protein of a pathogenic organism, optionally a virus, or an antigenic fragment thereof.
In some embodiments, the antigen is an OspA or antigenic fragment thereof, preferably an OspA of Borrelia burgdorferi sensu lato.
In some embodiments, the antigen is an ectodomain of viral glycoprotein, or an antigenic fragment thereof.
In some embodiments, the antigen is an ectodomain of bacterial protein, or an antigenic fragment thereof.
In some embodiments, the nanostructure has an 153 architecture and/or a quaternary structure substantially similar to I53-50.
The present disclosure provides a polynucleotide encoding a nanostructure disclosed herein or a polypeptide disclosed herein.
The present disclosure provides a delivery vehicle, comprising a polynucleotide disclosed herein, optionally a viral vector or a lipid nanoparticle.
The present disclosure provides a pharmaceutical composition, comprising a nanostructure disclosed herein, a polynucleotide disclosed herein, or a delivery vehicle disclosed herein, and a pharmaceutically acceptable carrier.
The present disclosure provides a vaccine, comprising a nanostructure disclosed herein, a polynucleotide disclosed herein, or a delivery vehicle disclosed herein, a pharmaceutically acceptable carrier, and optionally an adjuvant.
The present disclosure provides a host cell suitable for expression of a nanostructure disclosed herein or a polypeptide disclosed herein; and/or comprising a polynucleotide disclosed herein.
The present disclosure provides a method of making a polypeptide or nanostructure, comprising culturing a host cell disclosed herein under conditions suitable for expression of the polypeptide or nanostructure.
The present disclosure provides a method of generating an immune response to an antigen or to a pathogenic organism in a subject in need thereof, comprising administering to the subject a vaccine as disclosed herein, optionally via intramuscular injection or inhalation.
The present disclosure provides a method of immunizing a subject against infection by a pathogen, comprising administering to the subject a vaccine as disclosed herein, optionally via intramuscular injection or inhalation.
The present disclosure provides a composition or method as described herein.
Patent Pub No. US 2015/0356240 A1 describes various methods for designing protein assemblies. As described in US Patent Pub No. US 2016/0122392 A1 and in International Patent Pub. No. WO 2014/124301 A1, isolated polypeptides were designed for their ability to self-assemble in pairs to form protein nanostructures, such as icosahedral particles. The design involved design of suitable interface residues for each member of the polypeptide pair that can be assembled to form the protein nanostructures. The protein nanostructures so formed include symmetrically repeated, non-natural, non-covalent polypeptide-polypeptide interfaces that orient a first assembly domain and a second assembly domain into protein nanostructures, such as one with an icosahedral symmetry.
Thus, in one embodiment a first assembly domain and second assembly domain of the component are selected from the Sequence Table. In each case, an N-terminal methionine residue present in the full-length protein is included, but may be removed to make a fusion that is not included in the sequence. The identified residues in the Sequence Table are numbered beginning with an N-terminal methionine. In various embodiments, one or more additional residues are deleted from the N terminus and/or additional residues are added to the N terminus. In some embodiments, the interface residues of I53-50A (SEQ ID NO: 1) first assembly domain are 25, 29, 33, 54, and 57. In some embodiments, the interface residues of I53-50B (SEQ ID NO: 27) or I53-50B.4PosT1 (SEQ ID NO: 26) second assembly domain are 24, 28, 36, 124, 125, 127, 128, 129, 131, 132, 133, 135, and 139.
The pair of sequences together form an 153 multimer with icosahedral symmetry. The interface residues identified are residue numbers in each illustrative polypeptide that were identified as present at the interface of resulting assembled protein nanostructures (i.e., “identified interface residues”). As can be seen, the number of interface residues for the illustrative polypeptides of SEQ ID NOs: 1 and (26 or 27) range from 4-13. In various embodiments, a first assembly domain and second assembly domain comprise an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical over its length, and identical at least at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, or 13 identified interface positions (depending on the number of interface residues for a given polypeptide), to the amino acid sequence of a polypeptide selected from the group consisting of SEQ ID NOs: 1 and (26 or 27).
In some embodiments, a polypeptide for forming a nanostructure comprises a first assembly domain. In some embodiments, a polypeptide for forming a nanostructure comprises a first assembly domain comprising a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a first polypeptide sequence in Table 2 or Table 4.
In some embodiments, a protein nanostructure comprises a first component and optionally, a second component. In some embodiments, the first component comprises a first polypeptide comprising a first assembly domain. In some embodiments, a protein nanostructure, comprises a first component, and optionally a second component, wherein the first component comprises a first polypeptide comprising a first assembly domain comprising a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a polypeptide sequence in Table 2 or Table 4.
In some embodiments, the first component is a trimeric component comprising three copies of the first polypeptide.
In some embodiments, the nanostructure comprises the second component. In some embodiments, the second component comprises a second polypeptide comprising a second assembly domain. In some embodiments, the nanostructure comprises the second component comprising a second polypeptide comprising a second assembly domain comprising a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90% at least 95% at least 96% at least 97% at least 98% at least 99% or 100% identical to:
  
    
      
        
        
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
        
      
    
  
In some embodiments, the second component is a pentamer comprising five copies of the second polypeptide.
In some embodiments, the first component comprises a first polypeptide comprising a first assembly domain comprising a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a polypeptide sequence in Table 2 or Table 4 and the second component comprising a second polypeptide comprising a second assembly domain comprising a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to:
  
    
      
        
        
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
        
      
    
  
In some embodiments, the nanostructure comprises 20 copies of the first component.
In some embodiments, the nanostructure further comprises 12 copies of the second component.
In some embodiments, the C terminus of the first polypeptide is accessible on the surface of the nanostructure.
In some embodiments, the first polypeptide comprises one or more amino acid residues at interface positions such that the polypeptide is capable of self-assembling to form a trimeric component of a one-component nanostructure.
In some embodiments, the first polypeptide comprises one or more amino acid residues at interface positions such that the polypeptide is capable of self-assembling to form a trimeric component of a two-component nanostructure.
In some embodiments, when the first polypeptide self-assembles to form a trimeric component of a nanostructure, the C terminus of the assembly domain is accessible on the surface of the nanostructure.
In some embodiments, when the first polypeptide self-assembles to form a trimeric component, the C terminus of the assembly domain is proximal to the three-fold axis of the trimeric component. In some embodiments, the distance from the C terminus to the three-fold axis is less than 30 Å, less than 25 Å, or less than 20 Å, or between 10 Å and 30 Å, between 15 Å and 30 Å, between 15 Å and 25 Å, or between 20 Å and 25 Å.
In some embodiments, a protein nanostructure comprises a first component and optionally, a second component. In such embodiment, the first component comprises a first polypeptide comprising a first assembly domain and optionally, a second component comprising a second polypeptide comprising a second assembly domain.
In some embodiments, the first polypeptide is a fusion protein comprising, in N- to C-terminal order, the first assembly domain, optionally a linker, and a heterologous polypeptide sequence, preferably an antigen.
In some embodiments, the antigen is an ectodomain of a surface protein of a pathogenic organism, or an antigenic fragment thereof. In some embodiments, the antigen is an ectodomain of a surface protein of a virus, or an antigenic fragment thereof.
Without wishing to be bound by theory, antigens may be natively anchored at or near their N terminus.
Antigens that have N-terminal attachments in their native orientations may exhibit improved stability or antigenicity when fused to the C terminus of a first assembly domain as compared to an antigen fused to the N terminus of a first assembly domain.
In some embodiments, the antigen is fused to the C terminus of the first assembly domain.
Illustrative bacterial surface proteins that are natively anchored at their N-termini include, but are not limited to, OspA of Borrelia burgdorferi sensu lato, OspB of Borrelia burgdorferi sensu lato, fHbp of N. Meningitidis, bacterial type II membrane proteins that are asymmetric or comprise a C3-symmetric oligomer, and bacterial proteins with lipidation sites towards the N-terminal side of the protein that are asymmetric or comprise a C3-symmetric oligomer. Illustrative non-bacterial surface proteins that are natively anchored at their N-termini include, but are not limited to, paramyxovirus and/or pneumovirus G proteins.
In some embodiments, the antigen is an OspA or antigenic fragment thereof. In some embodiments, the antigen is an OspA of Borrelia burgdorferi sensu lato (SEQ ID NO: 28). In some embodiments, the antigen is an OspB or antigenic fragment thereof. In some embodiments, the antigen is an OspB of Borrelia burgdorferi sensu lato (SEQ ID NO: 29).
In some embodiments, the antigen is fHbp of N. Meningitidis (SEQ ID NO: 30) or an antigenic fragment thereof.
In some embodiments, the antigen is an antigen derived from a bacterial pathogen that exhibits asymmetric type II membrane geometry.
In some embodiments, the antigen is an antigen derived from a bacterial pathogen that exhibits C3-symmetric oligomer geometry.
In some embodiments, the antigen is an antigen derived from a bacterial pathogen that comprises lipidation sites on the N-terminal portion of the protein and exhibits asymmetric type II membrane geometry.
In some embodiments, the antigen is an antigen derived from a bacterial pathogen that comprises lipidation sites on the N-terminal portion of the protein and exhibits C3-symmetric oligomer geometry.
In some embodiments, the antigen is an RSV G protein (SEQ ID NO: 31) or an antigenic fragment thereof. In some embodiments, the antigen is an hMPV G protein (SEQ ID NO: 32) or an antigenic fragment thereof.
In some embodiments, the antigen is a paramyxovirus and/or pneumovirus G protein or an antigenic fragment thereof.
In some embodiments, the antigen is a S1 C-terminal domains of coronavirus, a RBD of paramyxovirus G, H or HN proteins (e.g., Nipah/Hendra G, PIV3 HN, Measles H, Mumps HN), an HA head domain of influenza, a fusion domain of a class III fusion protein (e.g., CMV, EBV, HSV, VZV, Rabies), gp120 of HIV Env, an engineered antigen (e.g., eOD), rotavirus VP8 domain, a segment/domain of P. Falciparum CSP, a segment/domain of B. Burgdorferi sensu lato OspA (e.g., the C-terminal domain).
In some embodiments, the antigen is an ectodomain of a viral glycoprotein, or an antigenic fragment thereof.
In some embodiments, the antigen is an ectodomain of a parasitic protein, or an antigenic fragment thereof. In some embodiments, the parasitic protein is from a plasmodium parasite.
In some embodiments, the antigen is an ectodomain of bacterial protein, or an antigenic fragment thereof.
The present disclosure provides a polynucleotide encoding a nanostructure of any of the embodiments herein or a polypeptide of any of the embodiments herein.
The present disclosure provides a vector comprising a polynucleotide encoding a nanostructure of any of the embodiments herein or a polypeptide of any of the embodiments herein.
The present disclosure provides a host cell suitable for expression of a nanostructure of any of the embodiments herein or a polypeptide of any of the embodiments herein; and/or comprising a polynucleotide of any of the embodiments herein.
The present disclosure provides a method of making a polypeptide or nanostructure, comprising culturing a host cell of any of the embodiments herein under conditions suitable for expression of a polypeptide or nanostructure of any of the embodiments herein.
In another aspect, the disclosure provides a polynucleotide encoding any of the foregoing polypeptides. The polynucleotide may be an mRNA, such as a modified mRNA. The disclosure further provides vectors that include any of these polynucleotides. The vector may be a viral vector, such as an adenovirus vector, or a non-viral vector, such as a lipid nanoparticle (LNP). The disclosure further provides host cells that are transfected or transformed with any of the foregoing polynucleotides.
In an aspect, the disclosure provides a method of making a protein nanostructure involving culturing a host cell under conditions suitable to cause the expression of one or more components of a nanostructure, alone or separately; purifying the components, alone or separately; contacting solutions of the purified components; and/or incubating the components under condition suitable for self-assembly of the components to form a nanostructure.
The disclosure also provides pharmaceutical compositions. Such pharmaceutical compositions can be used for generating an immune response against an infectious disease in a subject. The pharmaceutical compositions of the disclosure may include a pharmaceutically acceptable carrier. A thorough discussion of such carriers is available in Chapter 30 of Remington: The Science and Practice of Pharmacy (23rd ed., 2021).
In some embodiments, the pharmaceutical composition can also include excipients and/or additives. Examples of these are surfactants, stabilizers, complexing agents, antioxidants, or preservatives which prolong the duration of use of the finished pharmaceutical formulation, flavorings, vitamins, or other additives known in the art. Complexing agents include, but are not limited to, ethylenediaminetetraacetic acid (EDTA) or a salt thereof, such as the disodium salt, citric acid, nitrilotriacetic acid and the salts thereof. In some embodiments, preservatives include, but are not limited to, those that protect the solution from contamination with pathogenic particles, including benzalkonium chloride or benzoic acid, or benzoates such as sodium benzoate. Antioxidants include, but are not limited to, vitamins, provitamins, ascorbic acid, vitamin E, salts or esters thereof.
Non-limiting examples of pharmaceutically acceptable excipients include water, NaCl, normal saline solutions, lactated Ringer's, normal sucrose, normal glucose, binders, fillers, disintegrants, lubricants, coatings, sweeteners, flavors, salt solutions (such as Ringer's solution), alcohols, oils, gelatins, carbohydrates such as lactose, amylose or starch, fatty acid esters, hydroxymethyl cellulose, polyvinyl pyrrolidine, and colors, and the like. Such preparations can be sterilized and, if desired, mixed with auxiliary agents such as lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, coloring, and/or aromatic substances and the like that do not deleteriously react with the compounds of the disclosure. One of skill in the art will recognize that other pharmaceutical excipients are useful in the present disclosure.
In some embodiments, one or more tonicity agents may be added to provide the desired ionic strength. Tonicity agents for use herein include those which display no or only negligible pharmacological activity after administration. Both inorganic and organic tonicity adjusting agents may be used.
In another aspect, the disclosure provides a pharmaceutical composition comprising a polypeptide, a protein complex, or a nanostructure of the disclosure.
The present disclosure provides a pharmaceutical composition comprising a nanostructure of any of the embodiments herein.
The present disclosure provides a vaccine comprising a nanostructure of any of the embodiments herein.
In another aspect, the disclosure provides pharmaceutical composition or vaccines. In embodiments, pharmaceutical composition includes a nanostructure as described herein in therapeutically effective amount. In embodiments, the vaccine includes a nanostructure in an amount effective to generate an immune response in a subject. Nanostructures used in vaccines may be complexed with, conjugated to, or fused to an antigen. The antigen may be a polypeptide derived from a pathogenic organism, or an antigenic fragment thereof.
In other embodiments, the pharmaceutical composition or vaccines includes a polynucleotide encoding a nanostructure as disclosed herein, such as an mRNA, or vector as disclosed herein, such an LNP—in each case in a therapeutically effective amount or in an amount effective to generate an immune response.
In another aspect, the disclosure provides methods of treating and/or preventing a disease or disorder in a subject in need thereof, as well as methods of generating an immune response to a pathogenic organism in a subject. Such methods may comprise administering a nanostructure, pharmaceutical composition, or vaccine according to the disclosure to the subject by intramuscular, intravenous, or intranasal.
Further provided are kits and pre-filled syringes that include any of the foregoing compositions.
In another aspect, the disclosure provides a vaccine comprising a polypeptide, a protein complex, or a nanostructure as disclosed herein.
In some embodiments, the vaccine comprises an adjuvant.
Adjuvants or immune potentiators may also be administered with or in combination with lipid nanoparticle composition. Advantages of adjuvants include, but are not limited to, the enhancement of the immunogenicity of antigens, modification of the nature of the immune response, the reduction of the antigen amount needed for a successful immunization, the reduction of the frequency of booster immunizations needed and an improved immune response in elderly and immunocompromised vaccinees. These may be co-administered by any route, e.g., intramuscular, subcutaneous, intravenous, or intradermal injections.
Adjuvants may include, but are not limited to, a natural or a synthetic adjuvant. Adjuvants may be organic or inorganic.
Adjuvants may be selected from any of the classes (1) mineral salts, e.g., aluminum hydroxide and aluminum or calcium phosphate gels; (2) emulsions including: oil emulsions and surfactant based formulations, e.g., microfluidised detergent stabilized oil-in-water emulsion, purified saponin, oil-in-water emulsion, stabilized water-in-oil emulsion; (3) particulate adjuvants, e.g., virosomes (unilamellar liposomal vehicles incorporating influenza hemagglutinin), structured complex of saponins and lipids, polylactide co-glycolide (PLG); (4) microbial derivatives; (5) endogenous human immunomodulators; (6) inert vehicles, such as gold particles; (7) microorganism derived adjuvants; (8) tensoactive compounds; (9) carbohydrates; or combinations thereof.
Adjuvants for nucleic acid vaccines (DNA) have been disclosed in, for example, Kobiyama, et al., Vaccines, 2013, 1(3), 278-292, the contents of which are incorporated herein by reference in their entirety. Any of the adjuvants disclosed by Kobiyama et al., may be used in the vaccines as described herein.
Other adjuvants which may be utilized include any of those listed on the web-based vaccine adjuvant database, on the World Wide Web at violinet.org/vaxjo/and described in for example Sayers, et al., J. Biomedicine and Biotechnology, volume 2012 (2012), Article ID 831486, 13 pages, the contents of which are incorporated herein by reference in their entirety.
Specific adjuvants may include cationic liposome-DNA complex JVRS-100, aluminum hydroxide vaccine adjuvant, aluminum phosphate vaccine adjuvant, aluminum potassium sulfate adjuvant, alhydrogel, ISCOM(s)™, Freund's Complete Adjuvant, Freund's Incomplete Adjuvant, CpG DNA Vaccine Adjuvant, Cholera toxin, Cholera toxin B subunit, Liposomes, Saponin Vaccine Adjuvant, DDA Adjuvant, Squalene-based Adjuvants, Etx B subunit Adjuvant, IL-12 Vaccine Adjuvant, LTK63 Vaccine Mutant Adjuvant, TiterMax Gold Adjuvant, Ribi Vaccine Adjuvant, Montanide ISA 720 Adjuvant, Corynebacterium-derived P40 Vaccine Adjuvant, MPL™ Adjuvant, AS04, AS02, AS01E, Lipopolysaccharide Vaccine Adjuvant, Muramyl Dipeptide Adjuvant, CRL1005, Killed Corynebacterium parvum Vaccine Adjuvant, Montanide ISA 51, Bordetella pertussis component Vaccine Adjuvant, Cationic Liposomal Vaccine Adjuvant, Adamantylamide Dipeptide Vaccine Adjuvant, Arlacel A, VSA-3 Adjuvant, Aluminum vaccine adjuvant, Polygen Vaccine Adjuvant, ADJUMER™, Algal Glucan, Bay R1005, Theramide®, Stearyl Tyrosine, Specol, Algammulin, AVRIDINE®, Calcium Phosphate Gel, CTA1-DD gene fusion protein, DOC/Alum Complex, Gamma Inulin, Gerbu Adjuvant, GM-CSF, GMDP, Recombinant hIFN-gamma/Interferon-g, Interleukin-10, Interleukin-2, Interleukin-7, Sclavo peptide, Rehydragel LV, Rehydragel HPA, Loxoribine, MF59, MTP-PE Liposomes, Murametide, Murapalmitine, D-Murapalmitine, NAGO, Non-Ionic Surfactant Vesicles, PMMA, Protein Cochleates, QS-21, SPT (Antigen Formulation), nanoemulsion vaccine adjuvant, AS03, Quil-A vaccine adjuvant, RC529 vaccine adjuvant, LTR192G Vaccine Adjuvant, E. coli heat-labile toxin, LT, amorphous aluminum hydroxyphosphate sulfate adjuvant, Calcium phosphate vaccine adjuvant, Montanide Incomplete Seppic Adjuvant, Imiquimod, Resiquimod, AF03, Flagellin, Poly(I:C), ISCOMATRIX®, Abisco-100 vaccine adjuvant, Albumin-heparin microparticles vaccine adjuvant, AS-2 vaccine adjuvant, B7-2 vaccine adjuvant, DHEA vaccine adjuvant, Immunoliposomes Containing Antibodies to Costimulatory Molecules, SAF-1, Sendai Proteoliposomes, Sendai-containing Lipid Matrices, Threonyl muramyl dipeptide (TMDP), Ty Particles vaccine adjuvant, Bupivacaine vaccine adjuvant, DL-PGL (Polyester poly (DL-lactide-co-glycolide)) vaccine adjuvant, IL-15 vaccine adjuvant, LTK72 vaccine adjuvant, MPL-SE vaccine adjuvant, non-toxic mutant E112K of Cholera Toxin mCT-E112K, and/or Matrix-S.
In some embodiments, the adjuvant comprises squalene. In some embodiments, the adjuvant comprises aluminum hydroxide. In some embodiments, the adjuvant comprises AS01E.
The present disclosure provides a method of making a polypeptide or nanostructure, comprising culturing a host cell disclosed herein under conditions suitable for expression of the polypeptide or nanostructure.
The present disclosure provides a method of generating an immune response to an antigen or to a pathogenic organism in a subject in need thereof, comprising administering to the subject a nanostructure of the present disclosure.
The present disclosure provides a method of generating an immune response to an antigen or to a pathogenic organism in a subject in need thereof, comprising administering to the subject a vaccine as disclosed herein, optionally via intramuscular injection or inhalation.
The present disclosure provides a method of generating high titers of functional antibodies against an antigen or to a pathogenic organism.
The present disclosure provides a method of immunizing a subject against infection by a pathogen, comprising administering to the subject a vaccine as disclosed herein, optionally via intramuscular injection or inhalation.
In another aspect, the disclosure provides methods of administration for the composition, the pharmaceutical composition, or the vaccine described herein.
In another aspect, the disclosure provides a method of vaccinating a subject, comprising administering to the subject a composition described herein. In another aspect, the disclosure provides a method of generating an immune response in a subject, comprising administering to the subject a composition described herein. In another aspect, the disclosure provides a method of treating or preventing disease in a subject, comprising administering to the subject a composition described herein. In another aspect, the disclosure provides a composition of the disclosure for use in vaccinating, generating an immune response, or treating or preventing disease. In another aspect, the disclosure provides a composition, method, or use as described herein. In another aspect, the disclosure provides a method of making a composition, comprising culturing host cells modified to express one or more polypeptides as described herein.
In some embodiments, the method comprising administering the vaccine described herein. In some embodiments, the vaccine is administered by subcutaneous injection. In some embodiments, the vaccine is administered by intramuscular injection. In some embodiments, the vaccine is administered by intradermal injection. In some embodiments, the vaccine is administered intranasally. In one aspect, the disclosure provides a pre-filled syringe comprising the vaccine described herein. In one aspect, the disclosure provides a kit comprising the vaccine described herein or the pre-filled syringe described herein.
In some embodiments, the unit dose of the pharmaceutical composition comprises about 0.5 μg to about 1 μg, about 20 μg to about 25 μg, about 70 μg to about 75 μg, about 100 μg to about 125 μg, about 100 μg to about 150 μg, about 125 μg to about 175 μg, about 200 μg to about 250 μg, about 225 μg to about 300 μg, or about 250 μg to about 350 μg of the protein nanostructures.
In some embodiments, the unit dose of the pharmaceutical composition comprises about 0.5 μg to about 1 μg, about 20 μg to about 25 μg, about 25 μg to about 50 μg, about 50 μg to about 70 μg, about 70 μg to about 75 μg, about 75 μg to about 100 μg, about 100 μg to about 125 μg, about 125 μg to about 150 μg, about 150 μg to about 175 μg, about 175 μg to about 200 μg, about 200 μg to about 250 μg, or about 250 μg to about 300 μg of the protein nanostructures.
The singular forms “a,” “an,” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise.
As used herein, the term “about” means a range of values including the specified value, which a person of ordinary skill in the art would consider reasonably similar to the specified value. For example, about means within a standard deviation using measurements generally acceptable in the art. For example, about means a range extending to +/−10%, +/−5%, +/−3%, or +/−1% of the specified value.
The term “at least” followed by a number is used herein to denote the start of a range beginning with that number (which may be a range having an upper limit or no upper limit, depending on the variable being defined). For example, “at least 1” means 1 or more than 1.
The term “at most” followed by a number is used herein to denote the end of a range ending with that number (which may be a range having 1 or 0 as its lower limit, or a range having no lower limit, depending upon the variable being defined). For example, “at most 4” means 4 or less than 4, and “at most 40%” means 40% or less than 40%. When, in this specification, a range is given as “(a first number) to (a second number)” or “(a first number)-(a second number)” this means a range whose lower limit is the first number and whose upper limit is the second number. For example, 25 to 100 mm means a range whose lower limit is 25 mm, and whose upper limit is 100 mm.
The term “identical” or percent “identity,” in the context of two or more nucleic acid or polypeptide sequences, refers to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same, when compared and aligned for maximum correspondence. Methods of alignment of sequences for comparison are well known in the art. Once aligned, the number of matches is determined by counting the number of positions where an identical nucleotide or amino acid residue is present in both sequences. The percent sequence identity is determined by dividing the number of matches in the alignment by the length of the reference sequence, followed by multiplying the resulting value by 100. For example, a peptide sequence that has 1166 matches when aligned with a reference sequence having 1554 amino acids is 75.0 percent identical to the test sequence (1166÷1554*100=75.0). As the terms are used herein, gaps in the alignment do not decrease the percent sequence identity. Unless otherwise specified, optimal alignment of sequences for comparison is conducted by the global alignment algorithm of Needleman and Wunsch, Mol. Biol. 48:443 (1970) as implemented by EMBOSS Needle (on the World Wide Web at ebi.ac.uk/Tools/psa/emboss_needle/) (Madeira et al. Nucleic Acids Res. 50(W1):W276-W279 (2022)). Other alignment methods may be used, including without limitation those described in Devereux, et al, Nucleic Acids Res. 12:387-95 (1984); Atschul et al. J. Mo. Biol. 215:403-10 (1990) (BLAST); Carrillo and Lipman Siam J. Appl. Math. 48(5) (1988); Computational Molecular Biology (Lesk, AM, ed., 1989); Biocomputing Informatics and Genome Projects, (Smith, DW, ed., 1993); Computer Analysis of Sequence Data, Part I, (Griffin and Griffin, eds., 1994); Sequence Analysis in Molecular Biology (von Heinje, 2012); Sequence Analysis Primer (Gribskov and Devereux, J., eds. 1993). Sequence identity is calculated using the implementation of the Needleman-Wunsch algorithm provided by the National Library of Medicine (on the World Wide Web at blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE_TYPE=BlastSearch&BLAST_SPEC=GlobalAln).
For example, sequence identity can be determined by standard methods that are commonly used to compare the similarity of two polypeptide or two polynucleotide sequences. Using a computer program such as EMBOSS Needle or BLAST, two polypeptide or two polynucleotide sequences are aligned for optimal matching of their respective residues (either along the full length of one or both sequences, or along a pre-determined portion of one or both sequences). The programs provide a default opening penalty and a default gap penalty, and a scoring matrix such as PAM 250 (a standard scoring matrix; see Dayhoff et al., in Atlas of Protein Sequence and Structure, vol. 5, supp. 3 (1978)) that can be used in conjunction with the computer program.
The term “substantially similar” refers to two polypeptides, proteins, assemblies, nanostructures, or other physical embodiments of the present that may differ in architecture, sequence, configuration, associations, and the like yet provide about the same or similar properties, structure, activity, and/or function. For example, a nanostructure having an 153 architecture and/or a quaternary structure provides properties, activity and/or function as nanostructures having about the same or similar to a nanostructure having the I53-50 architecture and/or a quaternary structure. In other words, embodiments of the present disclosure may be exchanged, yet achieve a desired outcome, e.g., in properties, structures, activities, and/or functions.
Throughout this specification and the claims which follow, unless the context requires otherwise, the word “comprise”, and variations such as “comprises” and “comprising,” as well as “has” or “having” and “includes” or “including,” will be understood to imply the inclusion of a stated element or step or group of elements or steps but not the exclusion of any other element or step or group of elements or steps. “Consisting essentially of” or “consists essentially” indicates exclusion of elements or steps that materially affect the basic and novel characteristics of the claimed invention.
Any aspect or embodiment described herein can be combined with any other aspect or embodiment as disclosed herein.
Characteristics that make a particular nanomaterial useful include modularity, spontaneous self-assembly across a useful range of concentrations, stability, accessible termini, and particle size. Circular permutation is one method for changing the accessibility of termini. Alternatively, de novo designed termini extensions that are well ordered can also change termini accessibility. Here we have used both techniques to change the termini availability of the protein nanomaterial I53-50 and demonstrated the utility of these new constructs by displaying the Borrelia burgdorferi sensu lato antigen OspA.
This Example demonstrates computational design of a I53-50A variant (termed “CompAext”) in which the C terminus of the protein is accessible on the surface of the trimeric component, which enables polypeptide fusion at the N terminus of the protein to be displayed, such as an antigen. After modelling this extension, contacts between the extension and adjacent segments were improved by remodeling the protein. This resulted in extensive remodeling of the N terminus residues of I53-50A, and modification of residues scattered throughout the primary structure (sequence) of I53-50A.
A de novo helical segment was designed off of the C terminus using RFDesign inpainting. Adjacent segment loop lengths were preserved, but still allowed to be designed. This enables optimization of the original loop with any contacting de novo segments. De novo segment lengths between 1 and 30 amino acids were sampled. To build the final helical segment Rfdiffusion was used. The de novo segment and adjacent regions were allowed to diffuse. Where de novo elements were introduced a range of lengths were sampled around the lengths identified by inpainting. The highest scoring designs from RFdiffusion were selected for sequence design with ProteinMPNN. Structures for top sequences were predicted with ColabFold and compared to the design model. The result of the design process was polypeptide sequences as disclosed in Table 1. A flexible linker sequence was included at the C terminus of each design to further facilitate fusion of other polypeptides (e.g., an antigen or purification tag) to the C terminus. This C-terminal linker and N-terminal leader sequences are underlined; the underline sequences are optional as they could be replaced with other linkers and leaders.
The resulting constructs are the sequences in Table 2, below, that have ‘None’ in the permutation column.
The extension for each design is listed in Table 1.
  
    
      
        
        
          
            
          
        
        
          
            
          
          
            
          
          
            
          
        
      
      
        
        
        
        
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
        
      
    
  
This Example describes circular permutation of I53-50. Various permutations were modelled computationally. The preferred results involved circular permutation of CompAext. Conceptually, first, the C terminus of I53-50Aext was connected to its N terminus, to generate a circular polypeptide chain using an extending polypeptide segment. Second, breakpoints between the secondary structure elements were identified to create a N terminus. Three preferred breakpoints were identified, at approximately residues 73, 106, and 127. These permutations were termed, respectively, Permutation 1, Permutation 2, and Permutation 3. Lastly, computational modelling was applied to design novel contacts between secondary structure elements within the tertiary structure of the Permutation 1, Permutation 2, and Permutation 3. Resulting sequences are provided in Table 2.
I53-50 CompA (or “I53-50A”) is a TIM barrel fold derived from the protein 1WA3. TIM barrels are an approximately circular repeat protein consisting of eight pairs of beta-strands and alpha-helices. The beta-strands are parallel and oriented around a solvated central lumen, with the helices are on the external surface. The lumen is often capped on one or both ends with an additional, terminal helical segment. This is true for CompA, which has an N-terminal capping helix (H1). Therefore, CompA secondary structure elements are, in order from N to C terminus, H1, E1, H2, E2, . . . , H8, E8, H9. Because of the structure of TIM barrels, the protein can be divided into pairs of secondary structure elements and recombined in any order simply by designing new connecting loops. However, some connectivity pairs are more likely to successfully fold into the desired structure than others. 1WA3 is also a C3-symmetric homotrimer and much of the interface is formed by loops between strands and helices which further limits the number of possible connections. Applying those limitations, the order of the helices and strands within the peptide sequence where permuted with the further constraint that helices and strands must alternate, resulting in 3474 possible permutations.
To evaluate these permutations, loops were closed using RFDesign inpainting. When permuted segments remain adjacent to its original neighboring segments, the connecting loop length is preserved, but still allowed to be designed. This enables optimization of the original loop with any contacting de novo loops. Where the order is not the same as the original sequence loop lengths between 1 and 30 amino acids were sampled.
Most permutations resulted in poor quality loop closures or no viable solution could be found. Of the closed permutations, the simplest permutation, (i.e., connecting the C terminus to the N terminus and then introducing a cut point elsewhere in the sequence) produced the highest scoring results. Permutations with a minimum 1ddt>0.75 were selected. Some of these permutations introduced irreconcilable clashes with either CompB or symmetric copies of CompA in the I53-50 assembly and were discarded.
To build the final loops Rfdiffusion was used. Loops and adjacent regions were allowed to diffuse. Where de novo elements were introduced a range of lengths were sampled around the lengths identified by inpainting. The highest scoring designs from RFdiffusion were selected for sequence design with ProteinMPNN. Structures for top sequences were predicted with ColabFold and compared to the design model. The result of the design process was polypeptide sequences as disclosed in Table 2. A linker was included at the C terminus of each design to further facilitate fusion of other polypeptides (e.g., an antigen) to the C terminus. This C-terminal linker and N-terminal leader sequences are underlined; the underline sequences are optional as they could be replaced with other linkers and leaders.
  
    
      
        
        
          
            
          
        
        
          
            
          
          
            
          
        
      
      
        
        
        
        
          
            
            
            
          
          
            
          
          
            
            
            
              MKMEELFKKHKIVAVLRANSVEEAIEKAVAVFAGGVHLIEITFTVP
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGSGSVEQCRKAVEAGAEYIVSPHLDEEISQFCKEKGIPYMPGVMT
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              GT (SEQ ID NO: 63)
          
          
            
          
          
            
            
            
              MGSIPYMPGVMTPTELVKAMKLGHLLLKLFPGEVVGPQFVKAMKK
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGSGHLLLKLFPGEVVGPQFVKAMKKTFPKARFVPTGGVNLDNVC
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              GT (SEQ ID NO: 65)
          
          
            
          
          
            
            
            
              MGAPADRELLRKLLENRIVAVLRANSVEEAIEKAVAVFAGGVTIIEI
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGAPEEKKMIALLAENPIVAVLRANSVEEAIEKAVAVFAGGVTIIEI
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              SGT (SEQ ID NO: 68)
          
          
            
          
          
            
            
            
              MGSGVPYMPGVMTPTELVKAMKLGHLLLKLFPGEVVGPQFVKAM
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              SGSGT (SEQ ID NO: 69)
          
          
            
          
          
            
            
            
              MGDPKELAMLKAFLEEKIVAVLRANSVEEAIEKAVAVFAGGVKIIEI
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MPEKEREIMIAFLKNRIVAVLRANSVEEAIEKAVAVFAGGVKIIEITF
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGSEADLKMLKKLYEEKIVAVLRANSVEEAIEKAVAVFAGGVKIIEI
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGLPEVELKMIEKIMEEGIVAVLRANSVEEAIEKAVAVFAGGVKIIEI
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGVDEKDLKLLEALAANRIVAVLRANSVEEAIEKAVAVFAGGVTII
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGVSEKEIEMLKKFNEARIVAVLRANSVEEAIEKAVAVFAGGVTIIE
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGLSPAEQAMLLAVVENRIVAVLRANSVEEAIEKAVAVFAGGVTII
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              T (SEQ ID NO: 76)
          
          
            
          
          
            
            
            
              MGYPEAQIELLDKVIKEGIVAVLRANSVEEAIEKAVAVFAGGVTIIEI
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGLSEKEIAIIEAFLENPIVAVLRANSVEEAIEKAVAVFAGGVTIIEIT
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGVSEADLALLKALAENQIVAVLRANSVEEAIEKAVAVFAGGVTII
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGSGSVEQCRKAVEAGAEYIVSPHLDEEISQFCKEKGIPYMPGVMT
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              SGT (SEQ ID NO: 80)
          
          
            
          
          
            
            
            
              MGSGIPYMPGVMTPTELVKAMKLGHRVLKLFPGEVVGPQFVKAM
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              GSGT (SEQ ID NO: 81)
          
          
            
          
          
            
            
            
              MGSGGHRVLKLFPGEVVGPQFVKAMKKTFPDARFVPTGGVNLDNV
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              GSGT (SEQ ID NO: 82)
          
          
            
          
          
            
            
            
              MGSGSVEQCRKAVEAGAEYIVSPHLDEEISQFCKEKGIPYMPGVMT
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              GT (SEQ ID NO: 83)
          
          
            
          
          
            
            
            
              MGSGIPYMPGVMTPTELVKAMKLGHKLLKLFPGEVVGPQFVKAM
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              SGT (SEQ ID NO: 84)
          
          
            
          
          
            
            
            
              MGSGGHKLLKLFPGEVVGPQFVKAMKKTFPDAAFVPTGGVNLDNV
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              GT (SEQ ID NO: 85)
          
          
            
          
          
            
            
            
              MGDKAMASMAKQFCKNKIVAVLRANSVEEAIEKAVAVFAGGVAII
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              GSGT (SEQ ID NO: 86)
          
          
            
          
          
            
            
            
              MGSGSVEQCRKAVEAGADYIVSPHLDEEISQFCKEKGVAYMPGVM
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGSGVAYMPGVMTPTELVKAMKLGHLVLKLFPGEVVGPQFVKAM
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGSSGHLVLKLFPGEVVGPQFVKAMKKTFPDVFFVPTGGVNLDNV
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MTEAQALMLKAFVEEKIVAVLRANSVEEAIEKAVAVFAGGVNIIEIT
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGTEANALMLKRFVEEKIVAVLRANSVEEAIEKAVAVFAGGVNIIEI
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MTPAEALMLKRFVKEKIVAVLRANSVEEAIEKAVAVFAGGVNIIEIT
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              GT (SEQ ID NO: 92)
          
          
            
          
          
            
            
            
              MTPAEALMLKRFVKEKIVA VLRANSVEEAIEKAVAVFAGGVNIIEIT
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              SGSGT (SEQ ID NO: 93)
          
          
            
          
          
            
            
            
              MTPAEALMLKRFVEEKIVAVLRANSVEEAIEKAVAVFAGGVNIIEIT
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              SGSGT (SEQ ID NO: 94)
          
          
            
          
          
            
            
            
              MTPAEALMLKRFVEEKIVAVLRANSVEEAIEKAVAVFAGGVNIIEIT
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
              SGT (SEQ ID NO: 95)
          
          
            
          
          
            
            
            
              MGNKEEIEEKFAREKIVAVLRANSVEEAIEKAVAVFAGGVGIIEITFT
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGNEEIEEKFAKEKIVAVLRANSVEEAIEKAVAVFAGGVGIIEITFTV
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGNKEIIEKFAKEKIVAVLRANSVEEAIEKAVAVFAGGVGIIEITFTV
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGNVEIIEKFAKEKIVAVLRANSVEEAIEKAVAVFAGGVGIIEITFTV
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGNPKEIIEKFAKEKIVAVLRANSVEEAIEKAVAVFAGGVGIIEITFT
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGNKEIGEKFAREKIVAVLRANSVEEAIEKAVAVFAGGVGIIEITFT
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGDLKMAKAFAREKIVAVLRANSVEEAIEKAVAVFAGGVGIIEITFT
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGDKKMAKAFAKEKIVAVLRANSVEEAIEKAVAVFAGGVGIIEITF
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGTDEKMAKAFAREKIVAVLRANSVEEAIEKAVAVFAGGVGIIEITF
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
              MGDEKMAKAFAREKIVAVLRANSVEEAIEKAVAVFAGGVGIIEITFT
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
        
      
    
  
The linking polypeptide segment for each of the permuted designs is listed in Table 3.
  
    
      
        
        
          
            
          
          
            
          
          
            
          
          
            
          
        
        
          
            
          
        
      
      
        
        
        
          
            
            
          
          
            
          
          
            
            
          
          
            
          
          
            
            
          
          
            
          
          
            
            
          
          
            
          
          
            
            
          
          
            
          
        
      
      
        
        
          
            
          
        
      
      
        
        
        
          
            
            
          
          
            
          
          
            
            
          
          
            
          
          
            
            
          
          
            
          
          
            
            
          
          
            
          
          
            
            
          
          
            
          
        
      
      
        
        
          
            
          
        
      
      
        
        
        
          
            
            
          
          
            
          
          
            
            
          
          
            
          
          
            
            
          
          
            
          
          
            
            
          
          
            
          
        
      
    
  
Designs with pLDDT>0.90 were ordered as bicistronic plasmids for cytosolic expression in E. coli. In one set of constructs, one open reading frame encoded a CompB (153-50B); the other encoded each of the CompA (153-50A variants) in Table 2, above, with a 6×His tag on the C terminus of CompA. In another set of experiment, one open reading frame encoded a full-length wild-type OspA fused the C terminus of each the CompA (I53-50A variants) in Table 2 (in 5′ to 3′ order, an I53-50A variant, a polypeptide linker, and OspA); and the other open reading frame encoded CompB (I53-50B.4PosT1) having a 6×His tag on the C terminus. Successful designs are expected to assemble into 153-50-derived nanostructures when expressed in E. coli cytosols. Table 2 lists the selected designs. In each sequence in Table 2, the start codon at the beginning of each sequence and the polypeptide linker sequence at the C terminus, used to connect to OspA, is underlined.
Constructs were screened by expressing in E. coli at 2 ml scale and lysed using sonication. Clarified lysates were purified by Ni-NTA MagBeads. Purification was characterized by SDS-PAGE. Constructs where both components were in the eluate fraction were considered passing, failing where only CompB is in the eluate fraction, and ambiguous where only CompA or not components were observed in the eluate.
Constructs were further characterized by expression in E. coli at 500 mL scale and lysed using sonication or microfluidization, as bicistronic constructs with CompB, or as monocistronic constructs where the open reading frame only encoded CompA. Clarified lysates were purified over a Ni-NTA gravity column, and further purified by SEC. Purified VLPs were characterized by DLS and negative-stain EM (
To improve in vitro assembly a second set of designs obtained using multiple approaches was ordered (TABLE 4). These designs were expressed in E. coli as a monocistronic construct and purified by IMAC. CompB was mixed with IMAC eluate and VLP assembly measured by DLS (
  
    
      
        
        
          
            
          
        
        
          
            
          
          
            
          
        
      
      
        
        
        
        
        
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSEADLKMLKKLYEERIVAVLRANSVE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSEADLKMLKKLYQERIVAVLRANSVE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSEADLKMLKKLYTEKIVAVLRANSVE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSEADLKMLKKLYTEKIVAVLRANSVE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGGHKLLKLFPGEVVGPQFVKAMKKTFP
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
              T
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSTEAQALMLKAFVEEKIVAVLRANSVEE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
              T
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNEEIEEKFASEKIVAVLRANSVEEAI
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNEEIEEKFAKEKIVAVLRANSVEEAI
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGNVEIIEKFAKEKIVAVLRANSVEEAIEK
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGNVEIIEKFASEKIVAVLRANSVEEAIEK
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGNVEIIEKFASEKIVAVLRANSVEEAIEK
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGNPKEIIEKFASEKIVAVLRANSVEEAIE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGNPKEIIEKFASEKIVAVLRANSVEEAIE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNKEIGEKFAEEKIVAVLRANSVEEAI
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNKEIGEKFATEKIVAVLRANSVEEAI
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNKEIGEKFATEKIVAVLRANSVEEAI
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNKEIGEKFASEKIVAVLRANSVEEAI
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNEEMEELFASHKIVAVLRANSVEEA
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNEEMEELFAKHKIVAVLRANSVEE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNEEMEELFASHKIVAVLRANSVEEA
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNEEMEELFAKHKIVAVLRANSVEE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSEADLKMLKLFYQHRIVAVLRANSVE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSEADLKMLKLFYQHRIVAVLRANSVE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGGHKILKLFPGEVVGPQFVKAMKGPFP
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
              T
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSEADLKMLKKFYTHKIVAVLRANSVE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSEADLKMLKKFYTHKIVAVLRANSVE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSTEAQALMLKLFVEHKIVAVLRANSVEE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
              T
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGNPKEMIELFASHKIVAVLRANSVEEAIE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNKEMGELFATHKIVAVLRANSVEE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNKEMGELFASHKIVAVLRANSVEE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNKEMGELFATHKIVAVLRANSVEE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNKEMGELFASHKIVAVLRANSVEE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSEADLKMLKKLYQERIVAVLRANSVE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSEADLKMLKKLYTEKIVAVLRANSVE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNEEIEEKFASEKIVAVLRANSVEEAI
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGNVEIIEKFAKEKIVAVLRANSVEEAIEK
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGNVEIIEKFASEKIVAVLRANSVEEAIEK
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGNVEIIEKFASEKIVAVLRANSVEEAIEK
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGNPKEIIEKFASEKIVAVLRANSVEEAIE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGNPKEIIEKFASEKIVAVLRANSVEEAIE
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNKEIGEKFAKEKIVAVLRANSVEEA
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNKEIGEKFATEKIVAVLRANSVEEAI
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
          
          
            
            
            
              MGSSHHHHHHGSGSGNKEIGEKFASEKIVAVLRANSVEEAI
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
            
          
          
            
            
            
          
          
            
          
        
      
    
  
Clause 1. A polypeptide for forming a nanostructure, comprising an assembly domain comprising a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a first polypeptide sequence in Table 2.
Clause 2. A protein nanostructure, comprising a first component, and optionally a second component, wherein the first component comprises a first polypeptide comprising a first assembly domain comprising a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a polypeptide sequence in Table 2.
Clause 3. The nanostructure of clause 2, wherein the first component is a trimeric component comprising three copies of the first polypeptide.
Clause 4. The nanostructure of clause 1 or clause 2, wherein the nanostructure comprises the second component and wherein the second component comprises a second assembly domain comprising a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to:
  
    
      
        
        
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
          
            
          
        
      
    
  
Clause 5. The nanostructure of clause 4, wherein the second component is a pentamer comprising five copies of the second polypeptide.
Clause 6. The nanostructure of any one of clauses 2-5, wherein the nanostructure comprises 20 copies of the first component.
Clause 7. The nanostructure of any one of clauses 2-6, wherein the nanostructure further comprises 12 copies of the second component.
Clause 8. The nanostructure of any one of clause 2-7, wherein the C terminus of the first polypeptide is accessible on the surface of the nanostructure.
Clause 9. The nanostructure of any one of clauses 2-8, wherein the first polypeptide is a fusion protein comprising, in N- to C-terminal order, the first assembly domain, optionally a linker, and a heterologous polypeptide sequence, preferably an antigen.
Clause 10. The nanostructure of clause 9, wherein the antigen is an ectodomain of a surface protein of a pathogenic organism, optionally a virus, or an antigenic fragment thereof.
Clause 11. The nanostructure of clause 10, wherein the antigen is an OspA or antigenic fragment thereof, preferably an OspA of Borrelia burgdorferi sensu lato.
Clause 12. The nanostructure of clause 9, wherein the antigen is an ectodomain of viral glycoprotein, or an antigenic fragment thereof.
Clause 13. The nanostructure of clause 9, wherein the antigen is an ectodomain of bacterial protein, or an antigenic fragment thereof.
Clause 14. A method of generating an immune response to an antigen or to a pathogenic organism in a subject in need thereof, comprising administering to the subject the nanostructure of any of clauses 2-13.
Clause 15. A pharmaceutical composition comprising the nanostructure of any of clauses 2-13.
Clause 16. A vaccine comprising the nanostructure of any of clauses 2-13.
Clause 17. A polynucleotide encoding the nanostructure of any of clauses 2-13 or the polypeptide of clause 1.
Clause 18. A host cell suitable for expression of the nanostructure of any of clauses 2-13 or the polypeptide of clause 1; and/or comprising the polynucleotide of clause 17.
Clause 19. A method of making a polypeptide or nanostructure, comprising culturing the host cell of clause 18 under conditions suitable for expression of the polypeptide or nanostructure.
  
    
      
        
        
          
            
          
          
            
          
        
      
      
        
        
        
        
          
            
            
            
          
          
            
            
            
          
          
            
          
        
      
      
        
        
        
        
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
              burgdorferi sensu lato
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
              burgdorferi sensu lato
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSEADLKMLKKLYEERIVAVLRANSVEEAIEKAVA
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSEADLKMLKKLYQERIVAVLRANSVEEAIEKAVA
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSEADLKMLKKLYTEKIVAVLRANSVEEAIEKAVA
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSEADLKMLKKLYTEKIVAVLRANSVEEAIEKAVA
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGGHKLLKLFPGEVVGPQFVKAMKKTFPDAAFVPT
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSTEAQALMLKAFVEEKIVAVLRANSVEEAIEKAVAV
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNEEIEEKFASEKIVAVLRANSVEEAIEKAVAVFA
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNEEIEEKFAKEKIVAVLRANSVEEAIEKAVAVF
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGNVEIIEKFAKEKIVAVLRANSVEEAIEKAVAVFAG
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGNVEIIEKFASEKIVAVLRANSVEEAIEKAVAVFAGG
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGNVEIIEKFASEKIVAVLRANSVEEAIEKAVAVFAGG
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGNPKEIIEKFASEKIVAVLRANSVEEAIEKAVAVFAG
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGNPKEIIEKFASEKIVAVLRANSVEEAIEKAVAVFAG
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNKEIGEKFAEEKIVAVLRANSVEEAIEKAVAVF
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNKEIGEKFATEKIVAVLRANSVEEAIEKAVAVF
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNKEIGEKFATEKIVAVLRANSVEEAIEKAVAVF
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNKEIGEKFASEKIVAVLRANSVEEAIEKAVAVF
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNEEMEELFASHKIVAVLRANSVEEAIEKAVAVF
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNEEMEELFAKHKIVAVLRANSVEEAIEKAVAVF
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSEADLKMLKLFYQHRIVAVLRANSVEEAIEKAVA
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
              GT
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSEADLKMLKLFYQHRIVAVLRANSVEEAIEKAVA
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
              GT
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGGHKILKLFPGEVVGPQFVKAMKGPFPDVAFVPTG
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSEADLKMLKKFYTHKIVAVLRANSVEEAIEKAVA
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
              GT
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSEADLKMLKKFYTHKIVAVLRANSVEEAIEKAVA
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
              GT
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSTEAQALMLKLFVEHKIVAVLRANSVEEAIEKAVAV
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGNPKEMIELFASHKIVAVLRANSVEEAIEKAVAVFA
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNKEMGELFATHKIVAVLRANSVEEAIEKAVAVF
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNKEMGELFASHKIVAVLRANSVEEAIEKAVAV
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNKEMGELFATHKIVAVLRANSVEEAIEKAVAVF
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNKEMGELFASHKIVAVLRANSVEEAIEKAVAV
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSEADLKMLKKLYQERIVAVLRANSVEEAIEKAVA
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
              GT
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSEADLKMLKKLYTEKIVAVLRANSVEEAIEKAVA
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
              GT
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNEEIEEKFASEKIVAVLRANSVEEAIEKAVAVFA
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGNVEIIEKFAKEKIVAVLRANSVEEAIEKAVAVFAG
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGNVEIIEKFASEKIVAVLRANSVEEAIEKAVAVFAGG
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGNVEIIEKFASEKIVAVLRANSVEEAIEKAVAVFAGG
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGNPKEIIEKFASEKIVAVLRANSVEEAIEKAVAVFAG
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGNPKEIIEKFASEKIVAVLRANSVEEAIEKAVAVFAG
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNKEIGEKFAKEKIVAVLRANSVEEAIEKAVAVF
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNKEIGEKFATEKIVAVLRANSVEEAIEKAVAVF
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNKEIGEKFATEKIVAVLRANSVEEAIEKAVAVF
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
          
          
            
            
              MGSSHHHHHHGSGSGNKEIGEKFASEKIVAVLRANSVEEAIEKAVAVF
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
            
          
          
            
            
          
          
            
          
        
      
    
  
The entire disclosure of each of the patent and scientific documents referred to herein is incorporated by reference for all purposes.
The reference in this specification to any prior publication (or information derived from it), or to any matter which is known, is not, and should not be taken as an acknowledgment or admission or any form of suggestion that prior publication (or information derived from it) or known matter forms part of the common general knowledge in the field of endeavor to which this specification relates.
The invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The foregoing embodiments are therefore to be considered in all respects illustrative rather than limiting on the invention described herein. The scope of the invention is thus indicated by the appended claims rather than by the foregoing description, and all changes that come within the meaning and range of equivalency of the claims are intended to be embraced therein.
This application claims priority to U.S. provisional patent applications 63/601,517, filed on Nov. 21, 2023, and 63/552,288, filed on Feb. 12, 2024, the contents of each of which are incorporated herein by reference in their entireties.
| Number | Date | Country | |
|---|---|---|---|
| 63601517 | Nov 2023 | US | |
| 63552288 | Feb 2024 | US |