NANOSTRUCTURE-FORMING POLYPEPTIDES AND USES THEREOF

SEQUENCE LISTING

The instant application contains a Sequence Listing which has been submitted electronically in XML format and is hereby incorporated by reference in its entirety. Said XML copy created on Nov. 14, 2024, is named 061291-522001WO-new.xml and is 160 KB in size.

TECHNICAL FIELD

The invention relates to protein nanostructures, computational methods used to design protein nanostructures, and uses thereof in, for example, vaccines.

BACKGROUND

Protein nanostructures may be used to display vaccine antigens, as gene therapy vectors, or for other purposes.

The nanostructure I53-50, which is described in US 2016/0122392 A1, is one example of a computationally designed nanostructure. To design I53-50, the structures of the trimeric 2-keto-3-deoxy-6-phosphogluconate (KDPG) aldolase from Thermotoga maritima (Protein Data Bank entry 1WA3) and the pentameric lumazine synthase RibH2 from Mesorhizobium loti (Protein Data Bank entry 2OBX) were computationally docked to one another, such that the symmetry axes of the trimeric components and pentameric components aligned to the shared symmetry elements of an icosahedron. The protein-protein interfaces between the trimeric components and the pentameric components were then modified in silico to drive self-assembly of the two components into a nanostructure with icosahedral symmetry composed of trimers and pentamers having 3-fold and 5-fold symmetry axes, respectively, termed an I53 architecture. The resulting polypeptide sequences were expressed and purified, and it was shown experimentally that the polypeptides, as predicted, would self-assemble into the intended two-component nanostructure having I53 architecture. This two-component nanostructure (I53-50) comprised 60 copies of each polypeptide component, that is, 20 copies of the designed trimeric component based on 1WA3 (termed “I53-50A”) and 12 copies of the designed pentameric component (termed “I53-50B”).
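The copy numbers stated above follow from simple multiplication. The short Python sketch below is illustrative only; the function name is hypothetical and is not part of any cited design software.

```python
# Subunit bookkeeping for a two-component assembly built from
# 20 trimeric components and 12 pentameric components.
# `subunit_count` is a hypothetical helper used only for illustration.

def subunit_count(n_components: int, chains_per_component: int) -> int:
    """Total polypeptide chains contributed by one kind of component."""
    return n_components * chains_per_component

trimer_chains = subunit_count(20, 3)    # 20 trimers x 3 chains each
pentamer_chains = subunit_count(12, 5)  # 12 pentamers x 5 chains each

print(trimer_chains, pentamer_chains)   # 60 60
```

Both products equal 60, consistent with the statement that the nanostructure comprises 60 copies of each polypeptide component.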

Another example of a designed nanostructure, based on the same trimeric component and termed I3-01, is described in US 2018/0030429 A1. To design I3-01, the structure of KDPG aldolase (PDB entry 1WA3) was docked against itself alone; then the interfaces between the trimeric components were modified in silico to drive self-assembly of the designed nanostructure. The interface residues selected were different from the interface residues in the two-component nanostructure. The resulting single polypeptide sequence was expressed and purified, and it was shown experimentally that this polypeptide, as predicted, would spontaneously self-assemble into a one-component icosahedral nanostructure having subunits aligned to the icosahedral 3-fold symmetry axes and new protein-protein interfaces on the icosahedral 2-fold symmetry axes, termed an I3 architecture. This one-component nanostructure (I3-01) comprised 60 copies of the polypeptide, that is, 20 copies of the designed trimeric component.

Both of these designed nanostructures, I53-50 and I3-01, and variants thereof, have been employed to make vaccine candidates. For example, US 2020/0392187 A1 describes a two-component nanostructure composed of I53-50A fused to the fusion (F) protein of a pneumovirus, and self-assembled with I53-50B to form an icosahedral nanostructure having an F protein trimer displayed on each three-fold axis. As another example, WO 2019/241483 A1 describes a one-component nanostructure composed of I3-01 C-terminally fused to engineered envelope (Env) proteins of HIV-1. Nanostructures of this type have been shown to be effective vaccines and are currently in human clinical trials. Specifically, I53-50 nanostructures displaying F proteins from Respiratory Syncytial Virus (RSV) or human Metapneumovirus (hMPV), or the Spike (S) protein of SARS-CoV-2, are in clinical trials as vaccines.

Nonetheless, there remains a need in the art for novel protein nanostructures capable of displaying other antigens. The present disclosure addresses that need.

SUMMARY

The present disclosure provides a polypeptide that is a circular permutation of I53-50A, comprising an assembly domain, comprising, in N- to C-terminal order, an N-terminal polypeptide segment, a linking polypeptide segment, and a C-terminal polypeptide segment, wherein the N-terminal polypeptide segment comprises residues 74-201 of SEQ ID NO: 1 or a variant thereof, and the C-terminal polypeptide segment comprises residues 1-73 of SEQ ID NO: 1 or a variant thereof.

The present disclosure provides a polypeptide that is a circular permutation of I53-50A, comprising an assembly domain, comprising, in N- to C-terminal order, an N-terminal polypeptide segment, a linking polypeptide segment, and a C-terminal polypeptide segment, wherein the N-terminal polypeptide segment comprises residues 107-201 of SEQ ID NO: 1 or a variant thereof, and the C-terminal polypeptide segment comprises residues 1-106 of SEQ ID NO: 1 or a variant thereof.

The present disclosure provides a polypeptide that is a circular permutation of I53-50A, comprising an assembly domain, comprising, in N- to C-terminal order, an N-terminal polypeptide segment, a linking polypeptide segment, and a C-terminal polypeptide segment, wherein the N-terminal polypeptide segment comprises residues 128-201 of SEQ ID NO: 1 or a variant thereof, and the C-terminal polypeptide segment comprises residues 1-127 of SEQ ID NO: 1 or a variant thereof.
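The segment order recited in the three circular permutations above can be sketched at the sequence level as follows. This is a minimal illustration on a toy sequence, not the design procedure itself: the function name, the toy parent string, and the two-letter linker are all hypothetical, and the sketch makes no assumption about how residues of SEQ ID NO: 1 are numbered.

```python
# Sequence-level sketch of a circular permutation.
# Assumption: `cut_after` is a 1-based residue number of the parent chain;
# the permuted chain starts at residue cut_after + 1.

def circular_permutation(parent: str, cut_after: int, linker: str) -> str:
    """Return parent[cut_after+1..end] + linker + parent[1..cut_after]."""
    if not 1 <= cut_after < len(parent):
        raise ValueError("cut_after must fall strictly inside the parent sequence")
    n_terminal_segment = parent[cut_after:]  # parent residues cut_after+1..end
    c_terminal_segment = parent[:cut_after]  # parent residues 1..cut_after
    return n_terminal_segment + linker + c_terminal_segment

toy_parent = "ABCDEFGHIJ"  # hypothetical 10-residue stand-in for a parent chain
print(circular_permutation(toy_parent, cut_after=4, linker="xx"))  # EFGHIJxxABCD
```

With a cut after residue 73, 106, or 127 of a 201-residue parent, the same function would place residues 74-201, 107-201, or 128-201 first, followed by the linker and then residues 1-73, 1-106, or 1-127, matching the N- to C-terminal order recited above.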

In some embodiments, variants of SEQ ID NO: 1 are at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 1.

In some embodiments, the N-terminal polypeptide segment and the C-terminal polypeptide segment comprise polypeptide sequences each selected from pairs A, B, or C provided in the Sequence Table, or from variants thereof that are at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical thereto.

In some embodiments, the linking polypeptide segment comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to any one of SEQ ID NOs: 8-21.

In some embodiments, the assembly domain comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to any one of SEQ ID NOs: 22-24.
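As an illustration of how a percent-identity threshold of this kind might be checked, the Python sketch below scores two sequences that have already been aligned. It is a simplified example under stated assumptions: this section does not define the alignment method or the identity calculation, the function name is hypothetical, and the choice to divide by the number of aligned columns (ignoring columns that are gaps in both sequences) is only one common convention.

```python
# Percent identity between two PRE-ALIGNED sequences of equal length.
# Assumptions (not taken from the disclosure): '-' marks a gap, and the
# denominator is the number of columns that are not gaps in both sequences.

def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap and res_b == gap:
            continue  # column carries no information
        compared += 1
        if res_a == res_b:
            matches += 1  # identical residues (a gap never matches a residue)
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared

# Toy example: one substitution in ten aligned positions.
identity = percent_identity("MEELFKKHKI", "MEELFKRHKI")
print(identity)          # 90.0
print(identity >= 90.0)  # True: satisfies an "at least 90%" threshold
```

In practice the two sequences would first be aligned with an established alignment tool; the sketch covers only the counting step.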

The present disclosure provides a polypeptide that is a variant of I53-50A having a C-terminal extension, comprising an assembly domain, the assembly domain comprising, in N- to C-terminal order, a base polypeptide segment and an extending polypeptide segment, wherein the base polypeptide segment comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to residues 1-201 of SEQ ID NO: 1.

In some embodiments, the extending polypeptide segment comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to a polypeptide sequence in Table 1.

In some embodiments, the polypeptide comprises one or more amino acid residues at interface positions such that the polypeptide is capable of self-assembling to form a trimeric component of a one-component nanostructure.

In some embodiments, the polypeptide self-assembles to form a trimeric component of a nanostructure, wherein the C terminus of the assembly domain is accessible on the surface of the nanostructure.

In some embodiments, the polypeptide self-assembles to form a trimeric component, optionally wherein the distance from the C terminus of the assembly domain to the three-fold axis is less than 30 Å, less than 25 Å, or less than 20 Å, or between 10 Å and 30 Å, between 15 Å and 30 Å, between 15 Å and 25 Å, or between 20 Å and 25 Å.

In some embodiments, the polypeptide self-assembles to form a soluble trimer, wherein the C terminus of the assembly domain is accessible on the surface of the soluble trimer.

In some embodiments, the polypeptide self-assembles to form a soluble trimer, wherein the C terminus of the assembly domain is proximal to the three-fold axis of the soluble trimer, optionally wherein the distance from the C terminus to the three-fold axis is less than 30 Å, less than 25 Å, or less than 20 Å, or between 10 Å and 30 Å, between 15 Å and 30 Å, between 15 Å and 25 Å, or between 20 Å and 25 Å.

In some embodiments, the polypeptide is a fusion protein comprising, in N- to C-terminal order, the assembly domain, optionally a polypeptide linker, and a heterologous polypeptide.

In some embodiments, the heterologous polypeptide is an antigen.

In some embodiments, the antigen is an ectodomain of a surface protein of a pathogenic organism, optionally a virus, or an antigenic fragment thereof.

In some embodiments, the antigen is an OspA or antigenic fragment thereof, preferably an OspA of Borrelia burgdorferi sensu lato.

In some embodiments, the antigen is an ectodomain of a viral glycoprotein, or an antigenic fragment thereof.

In some embodiments, the antigen is an ectodomain of a bacterial protein, or an antigenic fragment thereof.

The present disclosure provides a protein nanostructure, comprising a first component comprising a first polypeptide, and optionally a second component comprising a second polypeptide, wherein the first polypeptide is a polypeptide according to the present disclosure.

In some embodiments, the first component is a trimeric component comprising three copies of the first polypeptide.

In some embodiments, the nanostructure comprises the second component, and the second component comprises a second assembly domain comprising a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to: SEQ ID NO: 26 or 27.

In some embodiments, the second component is a pentamer comprising five copies of the second polypeptide.

In some embodiments, the nanostructure comprises 20 copies of the first component.

In some embodiments, the nanostructure further comprises 12 copies of the second component.

In some embodiments, the C terminus of the first polypeptide is accessible on the surface of the nanostructure.

In some embodiments, the first polypeptide is a fusion protein comprising, in N- to C-terminal order, the first assembly domain, optionally a polypeptide linker, and a heterologous polypeptide.

In some embodiments, the heterologous polypeptide is an antigen.

In some embodiments, the antigen is an ectodomain of a surface protein of a pathogenic organism, optionally a virus, or an antigenic fragment thereof.

In some embodiments, the antigen is an OspA or antigenic fragment thereof, preferably an OspA of Borrelia burgdorferi sensu lato.

In some embodiments, the antigen is an ectodomain of a viral glycoprotein, or an antigenic fragment thereof.

In some embodiments, the antigen is an ectodomain of a bacterial protein, or an antigenic fragment thereof.

In some embodiments, the nanostructure has an I53 architecture and/or a quaternary structure substantially similar to I53-50.

The present disclosure provides a polynucleotide encoding a nanostructure disclosed herein or a polypeptide disclosed herein.

The present disclosure provides a delivery vehicle, comprising a polynucleotide disclosed herein, optionally a viral vector or a lipid nanoparticle.

The present disclosure provides a pharmaceutical composition, comprising a nanostructure disclosed herein, a polynucleotide disclosed herein, or a delivery vehicle disclosed herein, and a pharmaceutically acceptable carrier.

The present disclosure provides a vaccine, comprising a nanostructure disclosed herein, a polynucleotide disclosed herein, or a delivery vehicle disclosed herein, a pharmaceutically acceptable carrier, and optionally an adjuvant.

The present disclosure provides a host cell suitable for expression of a nanostructure disclosed herein or a polypeptide disclosed herein; and/or comprising a polynucleotide disclosed herein.

The present disclosure provides a method of making a polypeptide or nanostructure, comprising culturing a host cell disclosed herein under conditions suitable for expression of the polypeptide or nanostructure.

The present disclosure provides a method of generating an immune response to an antigen or to a pathogenic organism in a subject in need thereof, comprising administering to the subject a vaccine as disclosed herein, optionally via intramuscular injection or inhalation.

The present disclosure provides a method of immunizing a subject against infection by a pathogen, comprising administering to the subject a vaccine as disclosed herein, optionally via intramuscular injection or inhalation.

The present disclosure provides a composition or method as described herein.

Any aspect or embodiment described herein can be combined with any other aspect or embodiment as disclosed herein.

BRIEF DESCRIPTION OF FIGURES

FIGS. 1A-1E show topology maps of CompA and circularly permuted versions. FIG. 1A: WT CompA topology. Alpha-helices are indicated by gray bars, labeled H1-H10. Beta-strands are indicated with white arrows, labeled E1-E8. FIG. 1B: Topology map of CompA with a de novo helical extension, H11, indicated with a dashed bar. FIG. 1C: Circular permutation of CompA with a cut-point between E3 and H4 (residues 73 and 74). FIG. 1D: Circular permutation of CompA with a cut-point between H5 and E5 (residues 106 and 107). FIG. 1E: Circular permutation of CompA with a cut-point between H6 and E6 (residues 127 and 128).

FIG. 2 shows small-scale immobilized metal affinity chromatography (IMAC) pull-down assay SDS-PAGE. IMAC pull-down flow through (FT), and elution (E) samples for select constructs, showing two bands of the expected size in the elution fraction.

FIG. 3 shows an example negative stain electron microscopy micrograph of a representative construct (CompA.024) with an OspA antigen genetically fused to the carboxy terminus demonstrating assembly into monodisperse VLPs of the expected size.

FIG. 4 shows an illustrative dynamic light scattering size distribution of a representative construct (CompA.024) with an OspA antigen genetically fused to the carboxy terminus demonstrating assembly into monodisperse VLPs of the expected size.

FIGS. 5A-5B show biolayer interferometry of antibody binding to an antigen fused to the carboxy terminus of a representative construct (CompA.024), or to the amino terminus of I53-50 CompA. FIG. 5A: Binding to LA-2, which binds to an epitope on the carboxy-terminal end of the antigen. FIG. 5B: Binding to 221-7, which binds to an epitope along the central domain of the antigen.

FIGS. 6A-6C show structure models of representative designs. FIG. 6A: Extension of the carboxy terminus with parallel helical segments in grey. FIG. 6B: Extensions with an extended loop between the native carboxy terminus and the termini-extending helical segment. FIG. 6C: A circularly permuted design.

FIG. 7 shows representative particle size distributions for twenty-four constructs, designed to have improved assembly characteristics, assembled after IMAC purification. I53-50 is provided as a reference in each plot.

FIG. 8 shows the SEC chromatogram for four representative assemblies from purified components.

FIG. 9 shows particle size distributions of representative SEC-purified VLPs compared to purified I53-50 VLP.

DETAILED DESCRIPTION

The present disclosure relates generally to polypeptides for forming nanostructures, nanostructures, and uses thereof. In some embodiments, the disclosure provides polypeptides having disclosed sequences. In some embodiments, the polypeptides form nanostructure components in which the C terminus of the polypeptide is accessible on the surface of the nanostructure. In some embodiments, the polypeptide is a fusion protein comprising, in N- to C-terminal order, an assembly domain, optionally a linker, and a heterologous polypeptide, such as an antigen or antigenic fragment thereof.

Computationally designed protein nanomaterials are useful platforms for delivery of macromolecules and for vaccine design. The characteristics that make a particular nanomaterial useful include, but are not limited to, modularity, spontaneous self-assembly across a useful range of concentrations, stability, accessible termini, and particle size. Termini availability is constrained by the components used for designing a particular nanomaterial, and by the orientation of the component within the designed architecture. Without wishing to be bound by theory, to ensure that any genetically linked domain is properly oriented with respect to the surface of the nanomaterial, the local structure of the termini is a contributing element. The present disclosure demonstrates that circular permutation can be an effective method for changing the accessibility of termini. In some embodiments, de novo designed termini extensions that are well ordered can also change termini accessibility. In some embodiments, both techniques are used to change the termini availability of a nanostructure (e.g., the protein nanomaterial I53-50). In some embodiments, a nanomaterial designed using circular permutation and/or de novo designed termini extensions may display the Borrelia burgdorferi sensu lato antigen OspA. In some embodiments, the techniques to change the termini availability described herein may be applied to an I3-01 protein nanomaterial.

The nanostructures of the present disclosure provide an antigen fused to the C terminus of a first component such that the antigen is displayed on the surface of the nanostructure. Without wishing to be bound by theory, fusion to the C terminus may increase or alter the immune response to the antigen. In some cases, fusion to the C terminus may promote induction of a protective and/or functional immune response in the subject. In embodiments, the nanostructures comprise a fusion between the C terminus and the N terminus of the first component via a novel linking polypeptide sequence as shown in Table 3. In embodiments, the nanostructures comprise sequence breaks which generate novel N- and C-termini as compared to a reference sequence fused to antigens or antigenic fragments.

(I53-50A; SEQ ID NO: 1)

MEELFKKHKIVAVLRANSVEEAIEKAVAVFAGGVHLIEITFTVPDADTV
IKALSVLKEKGAIIGAGTVTSVEQCRKAVESGAEFIVSPHLDEEISQFC
KEKGVFYMPGVMTPTELVKAMKLGHTILKLFPGEVVGPQFVKAMKGPFP
NVKFVPTGGVNLDNVCEWFKAGVLAVGVGSALVKGTPDEVREKAKAFVE
KIRGCTE

The present disclosure provides a polypeptide that is a circular permutation of I53-50A, comprising an assembly domain, comprising, in N- to C-terminal order, an N-terminal polypeptide segment, a linking polypeptide segment, and a C-terminal polypeptide segment, wherein the N-terminal polypeptide segment comprises residues 107-201 of SEQ ID NO: 1 or a variant thereof, and the C-terminal polypeptide segment comprises residues 1-106 of SEQ ID NO: 1 or a variant thereof.

The present disclosure provides a polypeptide that is a circular permutation of I53-50A, comprising an assembly domain, comprising, in N- to C-terminal order, an N-terminal polypeptide segment, a linking polypeptide segment, and a C-terminal polypeptide segment, wherein the N-terminal polypeptide segment comprises residues 128-201 of SEQ ID NO: 1 or a variant thereof, and the C-terminal polypeptide segment comprises residues 1-127 of SEQ ID NO: 1 or a variant thereof.

In some embodiments, the N-terminal polypeptide segment and the C-terminal polypeptide segment comprise polypeptide sequences each selected from pairs A, B, or C, or from variants thereof that are at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical thereto:

Pair A
N-terminal polypeptide segment (SEQ ID NO: 2):
MKMEELFKKHKIVAVLRANSVEEAIEKAVAVFAGGVHLIEITFTVPDADTVIKALSVLKEKGAIIGAGTVTSV
C-terminal polypeptide segment (SEQ ID NO: 5):
EQCRKAVESGAEFIVSPHLDEEISQFCKEKGVFYMPGVMTPTELVKAMKLGHDILKLFPGEVVGPQFVKAMKGPFPNVKFVPTGGVNLDNVCKWFKAGVLAVGVGKALVKGKPDEVREKAKKFVKKIR

Pair B
N-terminal polypeptide segment (SEQ ID NO: 3):
MKMEELFKKHKIVAVLRANSVEEAIEKAVAVFAGGVHLIEITFTVPDADTVIKALSVLKEKGAIIGAGTVTSVEQCRKAVESGAEFIVSPHLDEEISQFCKEKGVF
C-terminal polypeptide segment (SEQ ID NO: 6):
YMPGVMTPTELVKAMKLGHDILKLFPGEVVGPQFVKAMKGPFPNVKFVPTGGVNLDNVCKWFKAGVLAVGVGKALVKGKPDEVREKAKKFVKKIR

Pair C
N-terminal polypeptide segment (SEQ ID NO: 4):
MKMEELFKKHKIVAVLRANSVEEAIEKAVAVFAGGVHLIEITFTVPDADTVIKALSVLKEKGAIIGAGTVTSVEQCRKAVESGAEFIVSPHLDEEISQFCKEKGVFYMPGVMTPTELVKAMKLGHDI
C-terminal polypeptide segment (SEQ ID NO: 7):
LKLFPGEVVGPQFVKAMKGPFPNVKFVPTGGVNLDNVCKWFKAGVLAVGVGKALVKGKPDEVREKAKKFVKKIR

The present disclosure provides a polypeptide that extends the C terminus of I53-50A, comprising an assembly domain, the assembly domain comprising, in N- to C-terminal order, an N-terminal polypeptide segment and an extending polypeptide segment, wherein the N-terminal polypeptide segment comprises a polypeptide sequence at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to residues 1-201 of SEQ ID NO: 1.

As used herein, the term “linking polypeptide segment” refers to a polypeptide that connects the C terminus of a C-terminal polypeptide segment (e.g., a C-terminal polypeptide segment of I53-50A or a variant thereof) to an N-terminal polypeptide segment, to computationally generate a circular polypeptide chain in the process of circular permutation. After a circular polypeptide is computationally generated, breakpoints between the secondary structure elements are identified to create an N terminus for the designed polypeptide, which then may be expressed.

As used herein, the term “extending polypeptide segment” refers to a polypeptide that extends the C terminus of a base polypeptide segment (e.g., a polypeptide segment of I53-50A or a variant thereof). In embodiments, the extending polypeptide segment may extend the C terminus to near the N terminus of the base polypeptide segment without connecting the C terminus to the N terminus.

In some embodiments, the polypeptide self-assembles to form a trimeric component of a nanostructure, wherein the C terminus of the assembly domain is accessible on the surface of the nanostructure.

In some embodiments, the polypeptide self-assembles to form a soluble trimer, wherein the C terminus of the assembly domain is accessible on the surface of the soluble trimer.

In some embodiments, the polypeptide self-assembles to form a soluble trimer, wherein the C terminus of the assembly domain is proximal to the three-fold axis of the soluble trimer, optionally wherein the distance from the C terminus to the three-fold axis is less than 30 Å, less than 25 Å, or less than 20 Å, or between 10 Å and 30 Å, between 15 Å and 30 Å, between 15 Å and 25 Å, or between 20 Å and 25 Å.
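One way such a distance might be estimated from a structural model is sketched below in Python. The sketch rests on assumptions not stated in this section: that the trimer has exact three-fold symmetry, and that one representative atom of the C-terminal residue (for example, an alpha carbon) is taken from each of the three chains. Under those assumptions the three atoms lie in a plane perpendicular to the three-fold axis, their centroid lies on the axis, and the distance from each atom to the axis equals its distance to the centroid. The function name and the coordinates are hypothetical.

```python
import math

# Distance from symmetry-related C-terminal atoms to the three-fold axis,
# assuming exact C3 symmetry so that the centroid of the three atoms lies
# on the axis. Coordinates are in angstroms.

def distances_to_threefold_axis(p1, p2, p3):
    points = (p1, p2, p3)
    centroid = tuple(sum(values) / 3.0 for values in zip(*points))
    return [math.dist(point, centroid) for point in points]

# Hypothetical coordinates: three copies placed 20 angstroms from the z-axis.
radius = 20.0
copies = [
    (radius * math.cos(2 * math.pi * k / 3),
     radius * math.sin(2 * math.pi * k / 3),
     5.0)
    for k in range(3)
]

distances = distances_to_threefold_axis(*copies)
print([round(d, 3) for d in distances])
print(all(d < 30.0 for d in distances))  # within the "less than 30 angstroms" range
```

For a model that deviates from ideal symmetry, the three values would differ slightly, and an average or an explicit axis fit could be used instead.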