The present invention relates to the field of nanoparticles. Especially the invention relates to a lipoprotein cage comprising a protein cage and a surfactant composition for intracellular delivery of cargo.
Well-defined nanoparticles can be well suited for protective transport of molecules within biological systems.
Protein cages have recently emerged as an important platform for nanotechnology development. Of the naturally existing protein cages, viruses are among the most efficient nanomachines, achieving component replication and efficient self-assembly in complex biological milieu. An artificial system that can carry out the most basic steps of viral particle assembly in vivo has been designed based on patchwork cages formed from Aquifex aeolicus lumazine synthase and a circularly permuted variant with appended cationic peptides. These two-component protein containers self-assemble in vivo, capturing endogenous RNA molecules in a size-selective manner (Azuma et al., Modular Protein Cages for Size-Selective RNA Packaging in Vivo, J. Am. Chem. Soc. 2018, vol. 140, 566-569)
In a further approach, nanoparticulate phospholipid bilayer disks were assembled from phospholipid and a class of amphipathic helical proteins using a synthetic gene. The self-assembly process begins with a mixture of the phospholipid and protein in the presence of a detergent. Upon removal of detergent, particles form containing either saturated or unsaturated phospholipid (Bayburt et al., Self-Assembly of Discoidal Phospholipid Bilayer Nanoparticles with Membrane Scaffold Proteins Nano Letters 2002, vol. 2(8), pp. 853-85)
The advent of de novo-designed protein cages offers an alternative strategy for creating cargo transport vehicles. A designed nonfunctional protein cage was transformed into a nucleic acid delivery vehicle, which can encapsulate oligonucleotides in vitro with high binding affinity (Edwardson et al., Rational Engineering of a Designed Protein Cage for siRNA Delivery. J. Am. Chem. Soc. 2018, vol. 140, 10439-10442).
However, proteinaceous compartments are limited in the types of cargo that they carry. Incorporation of molecules other than proteins and nucleic acids requires developing an additional set of self-assembly rules. Development of protein cages by incorporation of other molecular species has the potential to expand applicability into new areas, one of which is the transport of small molecules as possible when using the present invention.
The present invention provides porous lipoprotein cages (also called herein lipoprotein scaffolds) capable of encapsulating poorly water-soluble molecule cargo. In the lipoprotein cages of the invention, hydrophobic compartmentalization is achieved by combining the engineered proteins that stabilize the cage and act as biorecognizable barcodes for the cargo to be delivered by the cage via the specific pores with amphiphiles that create a hydrophobic interior of the cage (
In a two-tier host-guest approach, a designed protein cage with a highly positively charged lumen is used to nucleate anionic surfactant molecules into micellar aggregates within its interior cavity at concentrations well below their critical aggregation concentration. Electrostatic attraction drives the encapsulation of anionic surfactants, which phase separate due to the high effective concentration to form micellar aggregates within the protein cage. The non-polar core of this stable protein-surfactant complex can then sequester small molecules through the hydrophobic effect.
The protein cage is highly stable and acts as a template to form the lipidic/micellar core within its inner cavity, meaning a previous formulation step is not required. Therefore, the amphiphiles do not need to form a stable particle on their own before addition of the protein.
Through their unique architecture, the resulting lipoprotein cages of the invention can recruit and sequester small, preferably small hydrophobic molecules through the protein-scaffolded hydrophobic core. The inventors showed that these lipoprotein cages are stable, monodisperse and protect their cargo from sequestration by serum proteins, thus enhancing the cellular uptake of poorly soluble fluorescent probes and cytotoxic drugs. These findings demonstrate the beneficial combination of electrostatically-driven and amphiphilic self-assembly within stable protein compartments using a protein and surfactant molecule.
Due to the generality of the hydrophobic effect, this system could be used to encapsulate all manner of small, preferably hydrophobic molecules, including hormones, hydrophobic peptides, luminescent metal complexes, therapeutic agents, and vitamins etc.
Thus, in a first aspect, the present invention provides for a lipoprotein cage for intracellular delivery of cargo, said lipoprotein cage comprises:
In a further aspect, the invention relates to a complex comprising the lipoprotein cage of the invention and one or more cargo molecules.
In a further aspect, the invention relates to a method for manufacturing the lipoprotein cage of the invention comprising the steps of self-assembling a protein cage from at least one polypeptide comprising the amino acid sequence I, preferably from 24 polypeptides each comprising the amino acid sequence I, and encapsulating the surfactant composition of the invention into the protein cage, without disassembly of the protein cage.
In a further aspect, the invention provides a method for manufacturing the complex of the invention comprising the step of mixing the lipoprotein cage of the invention with one or more cargo molecules, wherein said cargo is encapsulated into the lipoprotein cage of the invention without disassembly of the lipoprotein cage.
In a further aspect, the invention provides a method for treating cells with the complex of the invention comprising the step of contacting said cell with the complex of the invention.
Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art to which this invention belongs.
Throughout this specification and the claims which follow, unless the context requires otherwise, the word “comprise” or the word “include”, and variations such as “comprises/includes” and “comprising/including”, are to be understood to imply the inclusion of an element, stated integer, step or a group thereof but not the exclusion of any other element, stated integer, step or a group thereof.
As used in this specification and the appended claims, the singular forms “a”, “an”, and “the” include plural referents, unless the content clearly dictates otherwise.
The term “about” or “approximately” when used in connection with a numerical value is meant to encompass numerical values within a range having a lower limit that is 0-10% smaller than the indicated numerical value and having an upper limit that is 0-10% larger than the indicated numerical value. The term “about” or “approximately” means preferably ±10%, more preferably ±5%, again more preferably ±3% or most preferably ±0% (referring to the given numeric value, respectively). In each of the invention embodiments, “about” can be deleted. All ranges of values disclosed herein should refer to and include any and all values falling within said range including the values defining the range.
In a first aspect, the invention relates to a lipoprotein cage for intracellular delivery of cargo, said lipoprotein cage comprises:
As used herein, the terms protein cage and lipoprotein cage relate to cage-like nanoparticles. The protein cage and lipoprotein cage of the invention are preferably in the nanometer size range (i.e. from about 1 nm to about 1000 nanometers). In a further preferred embodiment, said protein or lipoprotein cage has an external diameter up to about 50 nm. In a very preferred embodiment, said lipoprotein cage has an external diameter of about 13 nm.
Said polypeptide of the invention is selected such that it is capable of forming a protein cage by self-assembly of said at least one polypeptide of the invention. In a preferred embodiment, the protein cage of the invention comprises exactly 24 polypeptides of the invention (i.e. said at least one polypeptide is defined as exactly 24 polypeptides of the invention). In another preferred embodiment, the protein cage of the invention has an octahedral geometry (octahedral point group symmetry). In a preferred embodiment of the invention, the protein cage has a quaternary structure of multiple subunits, preferably of 8 subunits, each subunit comprises 3 polypeptides of the invention.
In a preferred embodiment said protein cage comprises (i) an exterior protein scaffold and (ii) a central cavity (also mentioned herein as cavity, internal cavity, interior or lumen). Preferably, said exterior scaffold surrounds, i.e. is assembled around said central cavity.
The term “negatively charged” or “positively charged” as used herein includes and preferably refers to a molecule that has a negatively or positively charged group. The term “anion” as used herein refers to negatively charged ions. More preferably, said anion or negatively charged molecule has a negatively charged group at neutral or physiological pH. The protein cage of the invention has a positively charged cavity. In a preferred embodiment, the protein cage is positively charged on the cavity surface. Said positive charges stem from multiple positively charged amino acids, preferably arginines or lysines. Thus, the protein cage of the invention has a very strong affinity to encapsulate negatively charged molecules. In a preferred embodiment, said cavity of the protein cage has a diameter from about 6.5 nm to about 8 nm, preferably of about 8 nm.
In a preferred embodiment, said protein cage has a porous structure, i.e. said protein cage includes pores. Preferably, the exterior scaffold of the protein cage includes pores that are connected to the cavity of said protein cage. A pore is defined herein as an opening or gap in the protein cage or in the exterior scaffold of the protein cage.
In a preferred embodiment, said protein or lipoprotein cage comprises six pores. Preferably, said exterior scaffold includes six pores, which are connected to the cavity of the protein or lipoprotein cage. In another preferred embodiment, said pores have a diameter from about 3 nm to about 4 nm. In another preferred embodiment, said protein or lipoprotein cage includes 6 pores having a diameter of 3-4 nm
In another further preferred embodiment, the protein cage of the invention includes 6 pores having a diameter of about 3-4 nm, the internal cavity of said protein cage has a diameter of about 8 nm, and the external diameter of said protein cage is about 13 nm.
Loading (or encapsulation) and unloading (or release) of cargo into the lipoprotein or of the surfactant composition into the protein cage works via the pores of the protein cage and the surfactant composition affects the loading and unloading of cargo. The terms loading or encapsulation relates to any uptake of cargo, composition or amphiphile into the lipoprotein or protein cage. The term unloading (or release) relates to liberation or displacement/replacement of the cargo, either in part or completely, preferably completely.
Said surfactant composition of the invention is encapsulated into the assembled protein cage, i.e. without disassembly of the protein cage. In a preferred embodiment, the assembled protein cage of the invention is loadable with the surfactant composition, without disassembly of the protein cage. In a preferred embodiment of the invention, electrostatic attraction drives the encapsulation of the surfactant composition. Preferably in the lipoprotein cage of the invention, said surfactant composition phase separates due to its high effective concentration within the lipoprotein cage. Preferably said surfactant composition is capable of forming a micellar aggregate within the protein cage.
More preferably said surfactant composition according to the invention creates a hydrophobic core within the protein cage. In a preferred embodiment, the lipoprotein cage of the invention is loadable and unloadable with cargo without disassembly of the lipoprotein cage. Further preferably, said lipoprotein cage of the invention, especially said non-polar core of said lipoprotein cage of the invention is capable of sequestering cargo molecules loaded into the lipoprotein cage of the invention. Preferably, said cargo is encapsulated within the lipoprotein cage by non-covalent interactions. Said non-covalent interactions are preferably hydrophobic interactions (hydrophobic effect). In a preferred embodiment, the lipoprotein cage of the invention is capable of delivering cargo intracellularly. Preferably, said cargo is delivered intracellularly, without disassembly of the lipoprotein cage. In a preferred embodiment, the lipoprotein cage of the invention is capable of encapsulating cargo, without disassembly of the lipoprotein cage, and to release said encapsulated cargo, preferably intracellularly, without disassembly of the lipoprotein cage. This is possible due to the porous structure of the lipoprotein cage. In a further preferred embodiment, the lipoprotein cage of the invention is loadable with cargo extracellularly and unloadable from said cargo intracellularly, without disassembly of the lipoprotein cage.
In a preferred embodiment, the lipoprotein cage of the invention is capable of encapsulating cargo extracellularly, to enter cells with said encapsulated cargo and to release said encapsulated cargo into the cells, without disassembly of the lipoprotein cage. In a preferred embodiment, the lipoprotein cage of the invention is capable of encapsulating cargo, to enter cells with said encapsulated cargo, and to release said encapsulated cargo into the cells, more preferably into the cytoplasm of a cell, each step without disassembly of the lipoprotein cage. In another preferred embodiment, the lipoprotein cage of the invention is capable of encapsulating cargo, to enter cells with said encapsulated cargo, and to release encapsulated hydrophobic cargo into the cells, wherein said released cargo escapes to the cytoplasm of the cell, each step without disassembly of the lipoprotein cage.
Preferably, said cargo is hydrophobic cargo, more preferably non-polar cargo.
Preferably, said cargo is small cargo. Small cargo has preferably a size of 1000 Da or below. In a preferred embodiment, a size of 1000 Da or below means that said cargo has a size of 1000 Da or lower, preferably 800 Da or lower, more preferably 600 Da or lower, again more preferably 500 Da or lower, again more preferably 400 Da or lower, again more preferably 300 Da or lower, again more preferably 200 Da or lower, again more preferably 100 Da or lower.
In another embodiment, said cargo is small hydrophobic cargo having a size of 1000 Da or below, again more preferably small non-polar cargo having a size of 1000 Da or below.
Preferably, said cargo has a low solubility in aqueous media. Preferably, said small cargo having a size of 1000 Da or below and has a low solubility in aqueous media. More preferably said cargo of low solubility is included in Class II or Class IV of the Biopharmaceutics Classification System (BC S). Again more preferably, said low solubility cargo has a lower solubility than a highly soluble cargo for which the highest strength dose is soluble in 250 mL or less of aqueous media over the pH range of 1.0-7.5, more preferably over the pH range of 1.0-6.8, at 37±1° C. Preferred methods for determining solubility are the USP Dissolution Apparatus, shake-flask method or acid or base titration methods.
Preferably said cargo is an active agent, preferably a therapeutic or diagnostic agent. More preferably said cargo is selected from the group consisting of a chemotherapeutic agent, such doxorubicin or paclitaxel, an antifungal agent, such as bifonazole or amphotericin B, an antiviral agent such as indinavir or ritonavir, and an antibiotic.
The surfactant composition of the lipoprotein comprises one or more amphiphiles selected such that the net charge of the composition is negative. The net charge of a composition is the overall charge contributed by all compounds included in the composition. Preferably, the net charge of the composition is negative at a physiological pH.
Based on the positive charge of the protein cage assembled according to the examples and the number of amphiphiles encapsulated therein, the inventors found that at least 20 mol % of the compounds of the surfactant composition having at least one negative charge is a value sufficient for cargo encapsulation, especially for small cargo molecules that have a size of 1000 Da or below, or hydrophobic cargo molecules, or poorly low water soluble cargo molecules included in Class II or Class IV of BCS.
Thus, in a further preferred embodiment, at least 20 mol % of the compounds included in the surfactant composition have at least one negative charge. In a further preferred embodiment, at least 20 mol % of the amphiphiles included in the surfactant composition have at least one negative charge. In case the compound has more than one negative charge, i.e. an amount of N negative charges, said value of 20 mol % can be divide by N.
In a further preferred embodiment, at least 20 mol %, preferably at least 30 mol %, more preferably at least 40 mol %, again more preferably at least 50 mol %, again more preferably at least 60 mol %, again more preferably at least 70 mol %, again more preferably at least 80 mol %, again more preferably at least 90 mol %, again more preferably at least 100 mol %, of the compounds or amphiphiles included in the surfactant composition have at least one negative charge.
Amphiphile or amphiphilic compounds (herein also called surfactants) are defined herein as organic compounds that comprise at least one hydrophobic group and at least one hydrophilic group. In a preferred embodiment, said amphiphile is a diblock compound comprising a hydrophilic “head” group, and a hydrophobic “tail” region. In a preferred embodiment, the amphiphile consists of at least one hydrophobic and at least one hydrophilic group. In a preferred embodiment, said amphiphile is a diblock compound comprising, preferably consisting of a hydrophilic “head” group, and a hydrophobic “tail” region.
The hydrophobic group comprises at least one hydrocarbon moiety. In a preferred embodiment, the hydrophobic group consists of at least one hydrocarbon moiety. The term “hydrocarbon moiety”, as used herein, encompasses compounds that consist only of hydrogen and carbon, joined by covalent bonds. The term encompasses open chain (aliphatic) hydrocarbons, including straight (unbranched) chain and branched hydrocarbons, and saturated as well as mono- and polyunsaturated hydrocarbons. The term also encompasses hydrocarbons containing one or more cyclic or aromatic ring.
The at least one hydrocarbon moiety is selected from the group consisting of linear or branched C4-C30 alkyl, C4-C30 alkenyl, C4-C30 alkynyl, C4-C30 alkoxy and C5-C30 cycloalkyl. In a preferred embodiment, said hydrocarbon moiety is selected from the group consisting of linear C4-C30 alkyl, C4-C30 alkenyl, C4-C30 alkynyl, C4-C30 alkoxy and C5-C30 cycloalkyl. In a preferred embodiment, said hydrocarbon moiety is selected from the group consisting of linear or branched alkyl, alkenyl, alkynyl, alkoxy, or cycloalkyl of C8 to C20, preferably C10 to 18, more preferably C12 to C18, most preferably C12, C14, C16, C17 or C18. In a preferred embodiment, said hydrocarbon moiety is selected from the group consisting of linear alkyl, linear alkenyl or cycloalkyl of C8 to C20, preferably C10 to 18, more preferably C12 to C18, most preferably C12, C14, C16, C17 or C18. In a preferred embodiment, said hydrocarbon moiety further comprises an aryl moiety in addition to the alkyl, alkenyl, alkynyl, alkoxy and cycloalkyl moiety.
In a preferred embodiment, said hydrocarbon moiety is selected from the group consisting of branched or linear alkyl, branched or linear alkenyl or cycloalkyl of C5-C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, most preferably C12, C14, C16, C17 or C18. In a preferred embodiment, said anionic moiety is a sulfate moiety, and said hydrocarbon moiety is a linear alkyl, linear alkyl-ether or cycloalkyl residue, wherein the linear alkyl, linear alkyl-ether or cycloalkyl residue is C5-C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, most preferably C12, C14, C16, C17 or C18.
The term “alkyl” or “alkyl residue”, as used herein, refers to a straight or branched hydrocarbon chain radical consisting solely of carbon and hydrogen atoms, containing no unsaturation, having from 4-30 carbon atoms (e.g., C4-C30 alkyl), and which may be or typically is attached to the rest of the molecule by a single bond. Whenever it appears herein, a numerical range such as “4-30” refers to each integer in the given range. For example, “C4-C30” means that the alkyl group may consist of 4 carbon atom, 5 carbon atoms, 6 carbon atoms, etc., up to and including 30 carbon atoms, although the definition is also intended to cover the occurrence of the term “alkyl” where no numerical range is specifically designated. Typical alkyl groups include, but are not limited to, alkylether, methyl, ethyl, n-propyl, 1-methylethyl (interchangeably used with iso-propyl; interchangeably abbreviated herein as iPr or Pri), n-butyl, isobutyl, sec-butyl, isobutyl, tertiary butyl (interchangeably used with 1,1-dimethylethyl or tert-butyl), n-pentyl, isopentyl, neopentyl, hexyl, septyl, octyl, nonyl and decyl. Unless stated otherwise specifically in the specification, an alkyl group is optionally substituted by one or more of substituents which are independently alkenyl, alkoxy, carboxylic group (—COOH), heteroalkyl, heteroalkenyl, hydroxyl, phosphate group (—OP(O)(OH)O—), phosphonate group (—OP(O)O—), phenyl group (—C6H4) optionally substituted with a halogen, preferably iodine, or a carboxylic group. Preferably, the term “alkyl”, as used herein, refers to an unsubstituted alkyl as defined herein.
The term “alkenyl” or “alkenyl residue”, as used herein, refers to a straight or branched hydrocarbon chain radical group consisting solely of carbon and hydrogen atoms, containing at least one double bond, and having from 4 to 30 carbon atoms (i.e., C4-C30 alkenyl), which may be or typically is attached to the rest of the molecule by a single bond. Whenever it appears herein, a numerical range such as “4-30” refers to each integer in the given range—e.g., “C4-C30” means that the alkenyl group may consist of 4 carbon atoms, 5 carbon atoms, etc., up to and including 30 carbon atoms. Typical alkenyl groups include, but are not limited to ethenyl (i.e., vinyl), prop-1-enyl (i.e., allyl), but-1-enyl, pent-1-enyl and penta-1,4-dienyl, alkenyl ether. Each double bond can be of either the (E)- or (Z)-configuration. Alkenyl, thus, may include, if applicable, either each of said double bond in its (E)-configuration, in its (Z)-configuration and mixtures thereof in any ratio. Unless stated otherwise specifically in the specification, an alkenyl group is optionally substituted by one or more of substituents which are independently alkenyl, alkoxy, carboxylic group (—COOH), heteroalkyl, heteroalkenyl, hydroxyl, phosphate group (—OP(O)(OH)O—), phosphonate group (—OP(O)O—), phenyl group (—C6H4) optionally substituted with a halogen, preferably iodine, or a carboxylic group. Preferably, the term “alkenyl”, as used herein, refers to an unsubstituted alkenyl as defined herein.
The term “alkynyl” or “alkynyl residue”, as used herein, refers to a straight or branched hydrocarbon chain radical group consisting solely of carbon and hydrogen atoms, containing at least one triple bond, having from two to ten carbon atoms (i.e., C4-C30 alkynyl). Whenever it appears herein, a numerical range such as “4-30” refers to each integer in the given range—e.g., “C4-C30” means that the alkynyl group may consist of 4 carbon atoms, 5 carbon atoms, etc., up to and including 30 carbon atoms. Typical alkynyl groups include, but are not limited to ethynyl, propynyl, butynyl, pentynyl and hexynyl. Unless stated otherwise specifically in the specification, an alkynyl group is optionally substituted by one or more of substituents which are independently alkenyl, carboxylic group (—COOH), heteroalkyl, heteroalkenyl, phosphate group (—OP(O)(OH)O—), phosphonate group (—OP(O)O—), phenyl group (—C6H4) optionally substituted with a halogen, preferably iodine, or a carboxylic group. Preferably, the term “alkynyl”, as used herein, refers to an unsubstituted alkynyl as defined herein.
The term “alkoxy” or “alkoxy residue”, as used herein, refers to the group —O-alkyl, including from 4 to 30 carbon atoms of a straight, branched configuration and combinations thereof attached to the parent structure through an oxygen. Examples include, but are not limited to, methoxy, ethoxy, propoxy, isopropoxy, cyclopropyloxy and cyclohexyloxy. The term “alkoxy” includes substituted alkoxy which refers to alkoxy wherein the alkyl constituent is substituted (i.e., —O-(substituted alkyl)). Unless stated otherwise specifically in the specification, the alkyl moiety of an alkoxy group is optionally substituted by one or more of substituents which are independently alkenyl, carboxylic group (—COOH), heteroalkyl, heteroalkenyl, phosphate group (—OP(O)(OH)O—), phosphonate group (—OP(O)O—), phenyl group (—C6H4) optionally substituted with a halogen, preferably iodine, or a carboxylic group.
The term “aryl” or “aryl residue”, as used herein, refers to an aromatic radical with six to ten ring atoms (e.g., C6-C10 aromatic or C6-C10 aryl) which has at least one ring having a conjugated pi electron system which is carbocyclic (e.g., phenyl, fluorenyl, and naphthyl). Bivalent radicals formed from substituted benzene derivatives and having the free valences at ring atoms are named as substituted phenylene radicals. Bivalent radicals derived from univalent polycyclic hydrocarbon radicals whose names end in “-yl” by removal of one hydrogen atom from the carbon atom with the free valence are named by adding “-idene” to the name of the corresponding univalent radical, e.g., a naphthyl group with two points of attachment is termed naphthylidene. The term includes monocyclic or fused-ring polycyclic (i.e., rings which share adjacent pairs of ring atoms) groups.
The term “cycloalkyl” or “cycloalkyl residue”, as used herein, refers to a monocyclic or polycyclic radical that contains only carbon and hydrogen, and may be saturated, or partially unsaturated. Cycloalkyl groups include groups having from 5 to 30 ring carbon atoms (i.e. C5-30 cycloalkyl). Whenever it appears herein, a numerical range such as “5 to 30” refers to each integer in the given range, e.g., “C5-30 cycloalkyl” means that the cycloalkyl group may consist of 5 carbon atoms, etc., up to and including 30 carbon atoms. Illustrative examples of cycloalkyl groups include, but are not limited to the following moieties: cyclopropyl, cyclobutyl, cyclopentyl, cyclopentenyl, cyclohexyl, cyclohexenyl, cycloheptyl, cyclooctyl, cyclononyl, cyclodecyl, norbornyl, and the like. The term “cycloalkyl” also relates to a monocyclic or polycyclic radical that contains further hydrocarbon moieties, such as linear or branched alkyl, alkenyl, alkynyl, alkoxy or aryl. The most preferred cycloalkyl is a saturated C17 polycyclic cycloalkyl.
The term “amphiphile” as used herein comprises in a preferred embodiment amphiphilic compounds selected from the group consisting of phospholipids, sphingolipids, glycerolipids, saccharolipids, fatty acids, fatty acid esters, steroids, sterols, steroid esters, polyketides, amphiphilic block copolymers, peptides, amphiphiles comprising peptides or oligonucleotides, peptide nucleic acids, carboxylates, sulfates, sulfonates, boronates, phosphonates, and phosphates.
In a preferred embodiment, the surfactant composition comprises a mixture of different amphiphiles.
In a preferred embodiment, the hydrophilic group of said amphiphile is an anionic hydrophilic group, i.e. comprising at least one anion. Preferably, said anionic hydrophilic group is such that said amphiphile has a negative molecular net charge (herein mentioned as anionic amphiphile).
In a preferred embodiment, the surfactant composition comprises at least one anionic amphiphile, i.e. an amphiphile having a negative molecular net charge. In another preferred embodiment, the surfactant composition comprises at least one anionic amphiphile having a negative molecular net charge and a cationic, uncharged and/or zwitterionic amphiphile. In a preferred embodiment, all amphiphiles included in the surfactant composition are anionic amphiphiles.
In another preferred embodiment, said surfactant composition comprises a steroid. Preferably, said steroid is an anionic steroid.
Preferably, said anionic amphiphile comprises an anionic moiety and a positively charged counterion, such as an ammonium, alkali or alkaline earth metal ion. More preferably, said positively charged counterion is a sodium, potassium, or ammonium ion, again more preferably a sodium ion.
In a preferred embodiment, said hydrophilic group of said amphiphile is selected from the group consisting of a carboxylate, sulfate, sulfonate, boronate, phosphonate, phosphate, peptide, nucleic acid, amino acid moiety or peptide nucleic acid. In a preferred embodiment, said hydrophilic group of said anionic amphiphile is selected from the group consisting of anions of carboxylate, sulfate, sulfonate, boronate, phosphonate, phosphate, peptide, nucleic acid, amino acid moiety or peptide nucleic acid.
In a preferred embodiment, said amino acid moiety comprises, preferably consists of one or two covalently coupled amino acids. In a preferred embodiment, said peptide moiety comprises, preferably consists of more than two amino acids. Said amino acids of the amino acid or peptide moiety of the hydrophilic group comprises hydrophilic amino acids, such as negatively charged Asp and Glu, or positively charged Lys, His and Arg, preferably negatively charged Asp and Glu.
In a preferred embodiment, said nucleic acid moiety comprises, preferably consists of DNA, RNA including analogs thereof, in the size of 5-500 base pairs, preferably 5-300 base pairs, more preferably from 5-200 base pairs, or again more preferably 5-100 base pairs, again more preferably 5-50 base pairs. Analogs of DNA or RNA are structurally similar to the native nucleic acid, but differ from the native nucleic acid (e.g., through chemical modification) at one or more of the nucleic acid backbone (e.g., phosphate in native nucleic acids), nucleic acid sugar (e.g., deoxyribose for native DNA and ribose in native RNA), and nucleic acid base (e.g., adenosine, cytosine, guanine, thymidine, or purine in native nucleic acids.) Nucleic acid analogs and mimics commonly result from modifications of native nucleic acids at the nucleobase (e.g., modified base), the sugar (e.g., fluorinated or deoxy sugars), and/or the phosphodiester backbone (e.g., peptide or thioester backbones). Nucleic acid analogs and mimics are known to those of skill in the art and include, for example, locked nucleic acids (LNAs), peptide nucleic acids (PNAs), and morpholinos.
In a more preferred embodiment, said hydrophilic group is selected from the group consisting of a carboxylate, sulfate, sulfonate, boronate, phosphonate and phosphate moiety. In an again more preferred embodiment, said hydrophilic group is selected from the group consisting of a carboxylate, sulfate, sulfonate and phosphate moiety. Again more preferably said hydrophilic group is a sulfate or phosphate moiety. Again more preferably, said hydrophilic group is a sulfate moiety.
In another preferred embodiment, said hydrophilic group comprises an anionic moiety. In another preferred embodiment, said hydrophilic group comprises an anionic moiety selected from the group consisting of a carboxylate, sulfate, sulfonate, phosphonate, boronate, phosphate and amino acid moiety. In another preferred embodiment, said hydrophilic group consists of an anionic moiety is selected from the group consisting of a carboxylate, sulfate, sulfonate, phosphonate, boronate, phosphate and amino acid moiety. Preferably, said anionic moiety is a sulfate.
In another preferred embodiment, at least one of said one or more amphiphiles included in the surfactant composition consists of (i) a hydrophilic group selected from the group consisting of a carboxylate, sulfate, sulfonate, boronate, phosphonate, phosphate moiety and amino acid, and (ii) a hydrophobic group comprising at least one hydrocarbon moiety selected from the group consisting of linear or branched C4-C30 alkyl, C4-C30 alkenyl, C4-C30 alkynyl, C4-C30 alkoxy or C5-C30 cycloalkyl.
The term “carboxy moiety” or “carboxylate moiety” as used herein refers preferably to groups of R—CO2−. The term “sulfate moiety” as used herein refers to groups of R—SO4−. The term “sulfonate moiety” as used herein refers preferably to groups of R—SO3−. The term “phosphonate moiety” as used herein refers preferably to groups of R—PO3−—R or R—PO32−. The term “phosphonate moiety” as used herein refers preferably to groups of R—PO4−—R, R—PO42−. The term “boronate moiety” as used herein refers preferably to groups of R—BO22−,
In another preferred embodiment, said amphiphile is defined as a salt of R—CO2−, R—PO42−, R—PO32−, R—BO22−, wherein R consists of a hydrocarbon moiety selected from the group consisting of linear or branched C4-C30 alkyl, C4-C30 alkenyl, C4-C30 alkynyl, C4-C30 alkoxy or C5-C30 cycloalkyl. In a more preferred embodiment, said amphiphile is defined as a salt of R—CO2−, R—SO4−, R—SO3−, R—PO4−—, R—PO42−, R—PO3−—R, R—PO32−, R—BO22−, wherein R consists of a hydrocarbon moiety selected from the group consisting of linear C4-C30 alkyl, C4-C30 alkenyl, C4-C30 alkynyl, C4-C30 alkoxy or C5-C30 cycloalkyl.
In a preferred embodiment, said hydrophilic group of at least one of said one or more amphiphiles included in the surfactant composition is a sulfate moiety. In a preferred embodiment, said at least one of said one or more amphiphiles included in the surfactant composition is a salt of a sulfate. In a preferred embodiment, said hydrophilic group of said amphiphile comprises a sulfate moiety, and the hydrocarbon moiety of the hydrophobic group is selected from the group consisting of linear or branched alkyl or alkyl-ether residue, wherein the alkyl or alkyl-ether is C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, most preferably C12, C14, C16 or C18 or a mixture thereof. In a preferred embodiment, said hydrophilic group comprises a sulfate moiety, and said hydrocarbon moiety is selected from the group consisting of linear or branched alkyl, alkyl-ether or alkenyl residue, wherein the alkyl or alkyl-ether residue is C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, most preferably C12, C14, C16 or C18 or a mixture thereof. In a preferred embodiment, said hydrophilic group consists of a sulfate moiety, and said hydrocarbon moiety is selected from the group consisting of linear alkyl, alkyl-ether or alkenyl residue, wherein the linear alkyl or alkyl-ether residue is C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, most preferably C12, C14, C16 or C18 or a mixture thereof.
In another preferred embodiment, said amphiphile is a salt of a primary or secondary alkyl sulfate (fatty alcohol) or alkyl-ether sulfate (fatty alcohol ether).
In another preferred embodiment, said amphiphile is a salt of a primary or secondary alkyl sulfate (fatty alcohol) or alkyl-ether sulfate (fatty alcohol ether), said sulfate comprises a sulfate moiety (as a hydrophilic group) and a linear alkyl residue, wherein the linear alkyl residue is C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, most preferably C12, C14, C16 or C18 or a mixture thereof. In another preferred embodiment, said amphiphile is a salt of a primary or secondary alkyl sulfate (fatty alcohol) or alkyl-ether sulfate (fatty alcohol ether), said hydrophilic group of said amphiphile consists of a sulfate moiety, and said hydrocarbon moiety is a linear alkyl residue, wherein the linear alkyl residue is C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, most preferably C12, C14, C16 or C18 or a mixture thereof. The fatty alcohols and fatty alcohol ethers can be synthetic or they can be derived from natural fats. Primary alkyl sulfate amphiphiles are defined herein as those compounds, which have the sulfate moiety at the terminal of the carbon chain. Secondary alkyl sulfate amphiphiles are defined herein as those compounds, which have the sulfate moiety distributed randomly along the carbon chain.
In a preferred embodiment, at least one of said one or more amphiphiles included in the surfactant composition is a salt of a primary or secondary alkyl sulfate (fatty alcohol comprising a sulfate moiety (as hydrophilic moiety) and a linear alkyl residue, wherein the linear alkyl residue is C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, again even more preferably C12, C14, C16 or C18 or a mixture thereof. In another preferred embodiment, said amphiphile is a salt of a primary alkyl sulfate (fatty alcohol) comprising a sulfate moiety and a linear alkyl residue, wherein the linear alkyl residue is C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, again even more preferably C12, C14, C16 or C18 or a mixture thereof. In another preferred embodiment, said amphiphile is a salt of a primary alkyl sulfate (fatty alcohol), said sulfate comprises a sulfate moiety and a linear alkyl residue, wherein the linear alkyl residue is C6 to C20, preferably C8 to C16, more preferably C10 to 12, even more preferably C12.
In a preferred embodiment, at least one of said one or more amphiphiles included in the surfactant composition is a salt of a primary or secondary alkyl sulfate (fatty alcohol), said sulfate comprises a sulfate moiety (as hydrophilic moiety), and a linear alkyl residue, wherein the linear alkyl residue is C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, again even more preferably C12, C14, C16 or C18; and a salt of a steroid comprising an anionic moiety selected from the group consisting of sulfonate, sulfate, carboxylate, phosphonate, boronate, phosphate ester or amino acids. More preferably said steroid comprises an anionic moiety selected from the group consisting of a sulfonate, sulfate, carboxylate, phosphonate, boronate or phosphate moiety, and a steroid moiety. Again more preferably said steroid comprises an anionic moiety selected from the group consisting of a sulfonate, sulfate, carboxylate, phosphonate, or phosphate moiety, and a sterol, preferably a cholesterol moiety.
Preferably, said amphiphile is a salt of a sulfate of formula H3C—(CH2)n—CH2—O—SO3− or H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is a value from 1 to 4 and n is a value from 4 to 20, preferably 6 to 20, more preferably 8 to 18, even more preferably from 10 to 16, again even more preferably from 10 to 14, again even more preferably 10-12, most preferably 10. In another preferred embodiment m is 2 or 3 and n is a value from 4 to 20, preferably 6 to 20, more preferably 8 to 18, even more preferably from 10 to 16, again even more preferably from 10 to 14, again even more preferably 10-12, most preferably 10. In another preferred embodiment m is 2 and n is a value from 4 to 20, preferably 6 to 20, more preferably 8 to 18, even more preferably from 10 to 16, again even more preferably from 10 to 14, again even more preferably 10-12, most preferably 10.
Preferably, said amphiphile is a salt of a sulfate of formula H3C—(CH2)n—CH2—O—SO3−, wherein n is a value from 4 to 20, preferably 6 to 20, more preferably 8 to 18, even more preferably from 10 to 16, again even more preferably from 10 to 14, again even more preferably 10-12, most preferably 10.
In another preferred embodiment, said amphiphile is a salt of a sulfate of formula H3C—(CH2)n—CH2—O—SO3−, wherein n is a value from 4 to 20, preferably 6 to 20, more preferably 8 to 18, even more preferably from 10 to 16, again even more preferably from 10 to 14, again even more preferably 10-12, most preferably 10; and a steroid sulfonate, sulfate or carboxylate, phosphonate, boronate, phosphate ester of a steroid. More preferably said anionic steroid for these amphiphiles is a sulfonate, sulfate, carboxylate, phosphonate, boronate or phosphate ester of cholesterol.
In another preferred embodiment, said amphiphile is selected from the group consisting of salt of lauryl sulfate, laureth sulfate, pareth sulfate, myreth sulfate, n-octyl sulfate, 8-hexadecylsulfate, and tetradecyl sulfate. In another preferred embodiment, said amphiphile is selected from the group consisting of sodium lauryl sulfate, ammonium lauryl sulfate, potassium lauryl sulfate, sodium laureth sulfate, ammonium laureth sulfate, sodium pareth sulfate, sodium myreth sulfate, sodium n-octyl sulfate, sodium 8-hexadecylsulfate, and sodium tetradecyl sulfate. In another preferred embodiment, said amphiphile is a salt of lauryl sulfate or laureth sulfate.
Preferably, said salt of a sulfate is selected from an ammonium, alkali or alkaline earth salts, more preferably from a sodium, potassium, or ammonium salt, again more preferably a sodium salt.
In a preferred embodiment, said amphiphile is selected from the group consisting of sodium lauryl sulfate (sodium dodecyl sulfate, SLS, or SDS, herein typically referred to as SDS), ammonium lauryl sulfate, potassium lauryl sulfate, sodium laureth sulfate (sodium lauryl ether sulfate or SLES), ammonium laureth sulfate, sodium pareth sulfate, sodium myreth sulfate, sodium n-octyl sulfate, sodium 8-hexadecylsulfate, sodium dodecylbenzenesulfonate (SDBS) and sodium tetradecyl sulfate.
In another preferred embodiment, said surfactant composition comprises an amphiphile selected from the group consisting of SDS, ammonium lauryl sulfate, potassium lauryl sulfate, SLES, ammonium laureth sulfate, sodium pareth sulfate, sodium myreth sulfate, sodium n-octyl sulfate, sodium 8-hexadecylsulfate, and sodium tetradecyl sulfate combined with a steroid comprising an anionic moiety selected from the group consisting of a sulfonate, sulfate, carboxylate, phosphonate, boronate, and phosphate ester. More preferably said steroid is a sulfonate, sulfate, carboxylate, phosphonate, boronate or phosphate ester of cholesterol.
In a preferred embodiment, said amphiphile is an ammonium, alkali or alkaline earth salt of lauryl sulfate or laureth sulfate. In a more preferred embodiment, said amphiphile is a sodium, potassium, or ammonium salt of lauryl sulfate or laureth sulfate. In another preferred embodiment, said amphiphile is a sodium salt of lauryl sulfate or laureth sulfate.
In a preferred embodiment, said amphiphile is an ammonium, alkali or alkaline earth salt of lauryl sulfate (sodium dodecyl sulfate, SLS, or SDS). In a more preferred embodiment, said amphiphile is a sodium, potassium, or ammonium salt of lauryl sulfate. Most preferably, said amphiphile is sodium lauryl sulfate.
In a preferred embodiment, said hydrophilic group of at least one of said one or more amphiphiles included in the surfactant composition is a sulfonate, boronate or phosphonate moiety. In a preferred embodiment, said at least one of said one or more amphiphiles included in the surfactant composition is a salt of a sulfonate, boronate or phosphonate.
In a preferred embodiment, said hydrophilic group of at least one of said one or more amphiphiles included in the surfactant composition is a sulfonate, boronate or phosphonate moiety; and said hydrocarbon moiety of said hydrophobic group is selected from the group consisting of alkyl, alkyl-benzene, benzene-alky, alky-ester, alkenyl, alkyl-succinate, alkyl-acetate and alkyl-tauride, wherein alkyl or alkenyl is linear or branched, preferably linear C6 to C30. Said hydrophilic group of at least one of said one or more amphiphiles included in the surfactant composition is a sulfonate, boronate or phosphonate moiety; and said hydrocarbon moiety is selected from the group consisting of alkyl, alkyl-benzene, benzene-alky, alky-ester, alkenyl, alkyl-succinate, alkyl-acetate and alkyl-tauride, wherein alkyl or alkenyl is linear or branched, preferably linear C8 to C20. said hydrophilic group of at least one of said one or more amphiphiles included in the surfactant composition is a sulfonate, boronate or phosphonate moiety and said hydrocarbon moiety is selected from the group consisting of alkyl, alkyl-benzene, benzene-alky, alky-ester, alkenyl, alkyl-succinate, alkyl-acetate and alkyl-tauride, wherein alkyl or alkenyl is linear or branched, preferably linear C10 to 18. said hydrophilic group of at least one of said one or more amphiphiles included in the surfactant composition is a sulfonate, boronate or phosphonate moiety and said hydrocarbon moiety is selected from the group consisting of alkyl, alkyl-benzene, benzene-alky, alky-ester, alkenyl, alkyl-succinate, alkyl-acetate and alkyl-tauride, wherein alkyl or alkenyl is linear or branched, preferably linear C12 to C18 or a mixture thereof said hydrophilic group of at least one of said one or more amphiphiles included in the surfactant composition is a sulfonate, boronate or phosphonate moiety and said hydrocarbon moiety is selected from the group consisting of alkyl, alkyl-benzene, benzene-alky, alky-ester, alkenyl, alkyl-succinate, alkyl-acetate and alkyl-tauride, wherein alkyl or alkenyl is linear or branched, preferably linear C12, C14, C16 or C18 or a mixture thereof said hydrophilic group of at least one of said one or more amphiphiles included in the surfactant composition is a sulfonate, boronate or phosphonate moiety and said hydrocarbon moiety is selected from the group consisting of alkyl, alkyl-benzene, benzene-alky, alky-ester, alkenyl, alkyl-succinate, alkyl-acetate and alkyl-tauride, wherein alkyl or alkenyl is linear or branched, preferably linear C12 or C14 or a mixture thereof.
In a preferred embodiment, at least one of said one or more amphiphiles included in the surfactant composition is a salt of a sulfonate, wherein said sulfonate is selected from the group consisting of primary alkylsulfonate, secondary alkylsulfonate, sulfonated alkylester, alkylestersuphonates, alpha olefin sulfonate, arylalkylsulfonates, alkylarylsulfonates, alkylbenzenesulfonates (ABS), benzenealkylsulfonates, alkylsulfoacetates, alkylsulfosuccinates, alkyltaurides, sulfolipids and sulfoglycolipids. In another preferred embodiment, said amphiphile is a salt of a sulfonate selected from the group consisting of primary alkylsulfonate, secondary alkylsulfonate, sulfonated alkyl ester, alkylestersuphonates, arylalkyl sulfonates, alkylarylsulfonates, alkylbenzenesulfonates, benzenealkylsulfonates, alkylsulfoacetates, alkylsulfosuccinates, and alkyltaurides. In a more preferred embodiment, said amphiphile is selected from the group consisting of primary alkylsulfonate, secondary alkylsulfonate, sulfonated alkylester, alkylestersuphonates, alpha olefin sulfonate, alkylbenzenesulfonates, benzenealkyl sulfonates, alkyl sulfoacetates, alkyl sulfosuccinates, and alkyltaurides. In an even more preferred embodiment, said amphiphile is a salt of a primary alkylsulfonate or secondary alkylsulfonate
In a preferred embodiment, said amphiphile is a salt of a sulfonate comprising a linear or branched, preferably linear alkyl or alkenyl residue, said salt of the sulfonate is selected from the group consisting of primary alkylsulfonate, secondary alkylsulfonate, sulfonated alkylester, alkylestersulfonates, alpha olefin (alkenyl) sulfonate, alkylbenzenesulfonates (ABS), benzenealkylsulfonates, alkylsulfoacetates, alkylsulfosuccinates, and alkyltaurides, wherein the alkyl residue is C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, most preferably C12, C14, C16 or C18 or a mixture thereof. In another preferred embodiment, said amphiphile is a salt of a primary or secondary alkylsulfonate comprising a linear or branched alkyl residue of C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, most preferably C12, C14, C16 or C18 or a mixture thereof. In another preferred embodiment, said amphiphile is a salt of a primary alkylsulfonate or secondary alkylsulfonate comprising a linear alkyl residue of C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, most preferably C12, C14, C16 or C18 or a mixture thereof.
Preferably, said salt of a sulfate, sulfonate, boronate or phosphonate is selected from an ammonium, alkali or alkaline earth salts, more preferably from a sodium, potassium, or ammonium salt, again more preferably a sodium salt.
In a preferred embodiment, at least one of said one or more amphiphiles included in the surfactant composition is a salt of a primary or secondary alkylsulfate, or a salt of a primary or secondary alkylsulfonate, wherein the alkyl residue of said alkylsulfate or alkylsulfonate is linear C6 to C30, or salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is a value from 1 to 4 and n is a value from 4 to 20. Preferably, said amphiphile is a salt of a primary or secondary alkylsulfate or a salt of a primary or secondary alkylsulfonate, wherein the alkyl residue of said alkylsulfate or alkylsulfonate is linear C8 to C20, or a salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is a value from 1 to 4 and n is a value from 6 to 18. Preferably, said amphiphile is a salt of a primary or secondary alkylsulfate or a salt of a primary or secondary alkylsulfonate, wherein the alkyl residue of said alkylsulfate or alkylsulfonate is linear C10 to C18, or a salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is a value from 2 or 3 and n is a value from 8 to 16. Preferably, said amphiphile is a salt of a primary or secondary alkylsulfate or a salt of a primary or secondary alkylsulfonate, wherein the alkylsulfate or alkyl residue of said alkylsulfonate is linear C10 to C16, or a salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is a value from 2 or 3 and n is a value from 8 to 14. Preferably, said amphiphile is a salt of a primary or secondary alkylsulfate or a salt of a primary or secondary alkylsulfonate, wherein the alkyl residue of said alkylsulfate or alkylsulfonate is linear C10 to C14, or a salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is a value from 2 or 3 and n is a value from 8 to 12. Preferably, said amphiphile is a salt of a primary or secondary alkylsulfate or a salt of a primary or secondary alkylsulfonate, wherein the alkyl residue of said alkylsulfate or alkylsulfonate is linear C12, or a salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is a value from 2 or 3 and n is 10. Preferably, said salts of said sulfates or sulfonates are selected from an ammonium, alkali or alkaline earth salts, more preferably from a sodium, potassium, or ammonium salt, again more preferably a sodium salt.
In a preferred embodiment, said hydrophilic group of at least one of said one or more amphiphiles included in the surfactant composition is a carboxyl moiety. In a preferred embodiment, at least one of said one or more amphiphiles included in the surfactant composition is a salt of a carboxylate.
In a preferred embodiment, said hydrophilic group is a carboxy moiety, and said hydrocarbon moiety of said hydrophobic group is selected from the group consisting of alkyl, alkynyl, alkenyl or fatty acid residue, preferably a linear alkyl, alkenyl or fatty acid residue. In a preferred embodiment, said hydrophilic group is a carboxy moiety, and said hydrocarbon moiety of said hydrophobic group is selected from the group consisting of linear alkyl, alkynyl, alkenyl or fatty acid residue, wherein the alkyl or alkenyl residue is C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, most preferably C12, C14, C16 or C18 or a mixture thereof.
In a preferred embodiment, said hydrophilic group is a carboxy moiety, and said hydrocarbon moiety of said hydrophobic group is a linear alkyl residue, wherein the linear alkyl residue is C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, most preferably C12, C14, C16 or C18 or a mixture thereof.
In a preferred embodiment, said hydrophilic group is a carboxy moiety, and said hydrocarbon moiety of said hydrophobic group is a linear or branched moiety selected from alkyl, alkynyl, alkenyl or fatty acid, alkyl ester, alkyl ether, alkyl polyglycol or alkysarcosinate, wherein alkyl or alkenyl is C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, most preferably C12, C14, C16 or C18 or a mixture thereof. In a preferred embodiment, said hydrophilic group is a carboxy moiety, and said hydrocarbon moiety of said hydrophobic group is a linear alkyl residue, wherein the linear alkyl residue is C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, most preferably C12, C14, C16 or C18 or a mixture thereof.
In a preferred embodiment, at least one of said one or more amphiphiles included in the surfactant composition is a amphiphile selected from the group consisting of primary or secondary alkylsulfate, primary or secondary alkylsulfonate, alkylsarcosinate or alkylcarboxylate, wherein the alkyl residue of said primary or secondary alkylsulfate, alkylsulfonate, alkylsarcosinate or alkylcarboxylate is linear or branched, preferably linear C6 to C30; or a salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is a value from 1 to 4 and n is a value from 4 to 20. More preferably, said amphiphile is a salt selected from the group consisting of primary or secondary alkylsulfate, primary or secondary alkylsulfonate, alkylsarcosinate or alkylcarboxylate, wherein the alkyl residue of said primary or secondary alkylsulfate, alkylsulfonate, alkylsarcosinate or alkylcarboxylate is linear or branched, preferably linear C8 to C20; or a salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is 2 or 3 and n is a value from 6 to 18. More preferably, said amphiphile is a salt selected from the group consisting of primary or secondary alkylsulfate, primary or secondary alkylsulfonate, alkylsarcosinate or alkylcarboxylate, wherein the alkyl residue of said primary or secondary alkylsulfate, alkylsulfonate, alkylsarcosinate or alkylcarboxylate is linear or branched, preferably linear C10 to C18; or a salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is 2 or 3 and n is a value from 8 to 16. More preferably, said amphiphile is a salt selected from the group consisting of primary or secondary alkylsulfate, primary or secondary alkylsulfonate, alkylsarcosinate or alkylcarboxylate, wherein the alkyl residue of said primary or secondary alkylsulfate, alkylsulfonate, alkylsarcosinate or alkylcarboxylate is linear or branched, preferably linear C12 to C18; or a salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is 2 or 3 and n is a value from 10 to 16. More preferably, said amphiphile is a salt selected from the group consisting of primary or secondary alkylsulfate, primary or secondary alkylsulfonate, alkylsarcosinate or alkylcarboxylate, wherein the alkyl residue of said primary or secondary alkylsulfate, alkylsulfonate, alkylsarcosinate or alkylcarboxylate is linear or branched, preferably linear C12 to C16; or a salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is 2 or 3 and n is a value from 10 to 14.
In a preferred embodiment, at least one of said one or more amphiphiles included in the surfactant composition is an amphiphilic lipid. The term “amphiphilic lipid” as used herein refers to amphiphilic compounds which contain both hydrophilic moieties and hydrophobic moieties comprising hydrocarbons selected from oils, fats (such as fatty acids, glycerides), sterols, steroids, and derivative forms of these compounds. Suitable amphiphilic lipids include moieties derived from fatty acids and their derivatives, hydrocarbons and their derivatives, and sterols, such as cholesterol. In a preferred embodiment, said amphiphilic lipid is selected from the group consisting of phospholipids, sphingolipids, glycerolipids, and saccharolipids.
In a preferred embodiment, said hydrophilic group of at least one of said one or more amphiphiles included in the surfactant composition is a phosphate moiety. In a preferred embodiment, at least one of said one or more amphiphiles included in the surfactant composition is a phosphate salt.
In a preferred embodiment, said hydrophilic group is a phosphate moiety, and said hydrocarbon moiety of said hydrophobic group is selected from the group consisting of fatty acid, fatty acid ester, alkyl, alkenyl, glyceride moiety. Preferably, said fatty acid, fatty acid ester, alkyl or alkenyl is linear. Preferably, said hydrophobic group is selected from the group consisting of saturated fatty acid, fatty acid ester, linear alkyl or linear alkenyl.
The term “fatty acid”, as used herein, refers to a hydrocarbon chain that terminates with a carboxylic acid group, wherein said hydrocarbon chain is typically and preferably either an alkyl or alkenyl of typically 3 to 32 carbons long, and that are, thus, saturated or unsaturated, and that are optionally substituted by one or more, preferably one, carboxylic group (-COOH), one or more, preferably one, C1-32 alkyl, one or more, preferably one, phosphate group (HOP(O)(OH)O—), one or more, preferably one, phosphonate group (HOP(O)O—), one or more, preferably one, thiophosphate group (HOP(O)(SH)O—), one or more, preferably one, dithiophosphate group (HOP(S)(SH)O—), one or more, preferably one, diphosphate group (HO—P(O)(OH)—O—P(O)(OH)—O—), one or more, preferably one, triphosphate group (HO—P(O)(OH)—O—P(O)(OH)—O—P(O)(OH)—O—), one or more phenyl group (—C6H5), one or more phenyl group substituted with a halogen, preferably iodine, or a carboxylic group. If a fatty acid contains one or more double bonds, and is thus unsaturated, there is the possibility of either a cis or trans geometric isomerism. The term “fatty acid moiety”, as used herein, refers to a moiety derived from a fatty acid, as defined herein, wherein one carboxylic group (—COOH) of said fatty acid becomes and is a —C(O)— group of said fatty acid moiety, which —C(O)— group is linked to said oligonucleotide either directly or via spacer in accordance with the present invention. The term “fatty acid” includes fatty diacids which refer to fatty acids as defined herein but with an additional carboxylic acid group in the omega position. In a preferred embodiment, said amphiphile is a phosphate salt comprising a phosphate moiety and at least one linear or branched, preferably linear C6 to C30 chain of fatty acid, fatty acid ester, alkyl or alkenyl. In a further preferred embodiment, said amphiphile is a phosphate salt comprising a phosphate moiety and at least one C8 to C20 chain of a fatty acid, alkyl or alkenyl. In a further preferred embodiment, said amphiphile is a phosphate salt comprising a phosphate moiety and at least one C10 to C18 chain of fatty acid, alkyl or alkenyl. In further preferred embodiment, said amphiphile is a phosphate salt comprising a phosphate moiety and at least one C12 to C18 chain of fatty acid, alkyl or alkenyl. In further preferred embodiment, said amphiphile is a phosphate salt comprising a phosphate moiety and at least one C12 to C18 chain of fatty acid, alkyl or alkenyl. In further preferred embodiment, said amphiphile is a phosphate salt comprising a phosphate moiety and at least one C12, C14, C16 or C18 chain or a mixture thereof of fatty acid, alkyl or alkenyl. Preferably, said alkyl or alkenyl is linear. More preferably said alkyl or alkenyl is linear and said fatty acid is unsaturated.
In a preferred embodiment, at least one of said one or more amphiphiles included in the surfactant composition is a phosphate salt selected from the group consisting of mono-alkyl phosphate ester salts, di-alkyl phosphate ester salts, mono-alkenyl phosphate ester salts, di-alkenyl phosphate ester salts and phospholipids.
In a preferred embodiment, at least one of said one or more amphiphiles included in the surfactant composition is a phospholipid. Preferably, said phospholipid consists of at least two hydrophobic fatty acid moieties, preferably exactly two fatty acid moieties, and a hydrophilic phosphate moiety. The moieties are preferably covalently linked by a glycerol moiety. The phosphate moiety is preferably modified with simple organic molecules such as choline, ethanolamine or serine.
In a preferred embodiment, at least one of said one or more amphiphiles included in the surfactant composition is a phosphate salt selected from the group consisting of mono-alkyl phosphate ester salts, di-alkyl phosphate ester salts, mono-alkenyl phosphate ester salts, di-alkenyl phosphate ester salts and phospholipids, wherein said alkyl or alkenyl of the phosphate ester salts or the fatty acids of the phospholipids is linear C6 to C30, preferably C8 to C20, more preferably C10 to 18, even more preferably C12 to C18, most preferably C12, C14, C16 or C18 or a mixture thereof.
In a preferred embodiment, at least one of said one or more amphiphiles included in the surfactant composition is a salt selected from the group consisting of primary or secondary alkylsulfate, primary or secondary alkylsulfonate, alkylphosphonate, alkylboronate, alkylsarcosinate, alkylcarboxylate, mono-alkyl phosphate ester, or di-alkyl phosphate ester, wherein the alkyl residue of said alkylsulfate, alkylsulfonate, alkylphosphonate, alkylboronate, alkylsarcosinate or alkylcarboxylate or phosphate ester is linear C6 to C30; or a salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is a value from 1 to 4 and n is a value from 4 to 20. More preferably, said amphiphile is a salt selected from the group consisting of primary or secondary alkyl sulfate, primary or secondary alkylsulfonate, alkylphosphonate, alkylboronate, alkylsarcosinate or alkylcarboxylate, mono-alkyl phosphate ester, or di-alkyl phosphate ester, wherein the alkyl residue of said alkylsulfate, alkylsulfonate, alkylphosphonate, alkylboronate, alkylsarcosinate or alkylcarboxylate or phosphate ester is linear C8 to C30; or a salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is 2 or 3 and n is a value from 6 to 18. More preferably, said amphiphile is a salt selected from the group consisting of primary or secondary alkylsulfate, primary or secondary alkylsulfonate, alkylphosphonate, alkylboronate, alkylsarcosinate or alkylcarboxylate, mono-alkyl phosphate ester, or di-alkyl phosphate ester, wherein the alkyl residue of said alkylsulfate, alkylsulfonate, alkylphosphonate, alkylboronate, alkylsarcosinate or alkylcarboxylate or phosphate ester is linear C10 to C18; or a salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is 2 or 3 and n is a value from 8 to 16. More preferably, said amphiphile is a salt selected from the group consisting of primary or secondary alkylsulfate, primary or secondary alkylsulfonate, alkylphosphonate, alkylboronate, alkylsarcosinate or alkylcarboxylate, mono-alkyl phosphate ester, or di-alkyl phosphate ester, wherein the alkyl residue of said primary or secondary alkylsulfate, alkylsulfonate, alkylphosphonate, alkylboronate, alkylsarcosinate or alkylcarboxylate or phosphate ester is linear C12 to C18; or a salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is 2 or 3 and n is a value from 10 to 16. More preferably, said amphiphile is a salt selected from the group consisting of primary or secondary alkylsulfate, primary or secondary alkylsulfonate, alkylphosphonate, alkylboronate, alkylsarcosinate or alkylcarboxylate, mono-alkyl phosphate ester, or di-alkyl phosphate ester, wherein the alkyl residue of said primary or secondary alkylsulfate, alkylsulfonate, alkylphosphonate, alkylboronate, alkylsarcosinate or alkylcarboxylate or phosphate ester is linear C12 to C16; or a salt of a sulfate of formula H3C—(CH2)n—CH2—(O—CH2CH2)m—O—SO3−, wherein m is 2 or 3 and n is a value from 10 to 14.
In a preferred embodiment the molar ratio of total surfactant molecules (i.e. anionic amphiphile and optional anionic steroid) to protein cages encapsulated into the assembled protein cage is up to about 1000:1. In a preferred embodiment, the molar ratio of total surfactant molecules to protein cages is about 800:1.
In another preferred embodiment said anionic amphiphile is a salt of dodecylsulfate, preferably sodium dodecyl sulfate, and the molar ratio of dodecylsulfate molecules encapsulated into the assembled protein cage to protein cages is up to about 1000:1. In a preferred embodiment said anionic amphiphile is a salt of dodecylsulfate, preferably sodium dodecyl sulfate (SDS), and the molar ratio of dodecylsulfate molecules encapsulated into the assembled protein cage to protein cages is up to about 800:1. This is based on the data that indicate that no change to the quaternary structure of the protein cage of the invention occurs at a concentration of about 800 equivalent of SDS, while more than 1000 equivalents gave rise to bands of increased mobility, potentially due to external association of the SDS molecules with the protein cage.
In a preferred embodiment, said hydrocarbon moiety is selected from the group consisting of a fatty acid, fatty acid ester, steroid, sterol, steroid ester.
In another preferred embodiment, said one or more amphiphile included in the surfactant composition comprises an amphiphilic steroid. In another preferred embodiment, said one or more amphiphile included in the surfactant composition comprises an anionic amphiphilic steroid. In another preferred embodiment, said one or more amphiphile included in the surfactant composition comprises an anionic amphiphile and an anionic steroid. In another preferred embodiment, said one or more amphiphile included in the surfactant composition comprises an anionic amphiphile and an anionic steroid, wherein the hydrophilic group of said anionic amphiphile and anionic steroid is selected from the group consisting of sulfonate, sulfate, carboxylate, phosphonate, boronate, and phosphate moiety. More preferably, in another preferred embodiment, said one or more amphiphile included in the surfactant composition comprises an anionic amphiphile and an anionic steroid, wherein the hydrophilic group of said anionic amphiphile and anionic steroid is a sulfate moiety.
The term “steroid” as used herein preferably comprises steroids, steroid esters and sterols, more preferably anionic comprises steroids, steroid esters and sterols.
Preferably, said anionic steroid is selected from the group consisting of a steroid comprising a sulfonate, sulfate, carboxylate, phosphonate, boronate, phosphate, phosphate ester or hydrophilic amino acids. More preferably, said anionic steroid is selected from the group consisting of steroid sulfonate, steroid sulfate and steroid phosphate ester. More preferably said anionic steroid comprises an anionic moiety selected from the group consisting of sulfonate, sulfate, carboxylate, phosphonate, boronate, phosphate; and a steroid moiety selected from the group consisting of estradiol, estriol, diethylstilbestrol, dehydroepiandrosterone, cholesterol, pregnenolone, DHEA, androstenediol, androsterone, estrone, testosterone. In another preferred embodiment, said anionic steroid is selected from the group consisting of estriol sulfate, estradiol sulfate, estradiol disulfate (EDS), diethylstilbestrol disulfate, dehydroepiandrosterone sulfate, cholesterol sulfate, pregnenolone sulfate, DHEA sulfate, androstenediol sulfate, androsterone sulfate, estrone sulfate, estradiol sulfate, testosterone sulfate. Most preferably, said anionic steroid is cholesterol sulfate (CS).
In a preferred embodiment, said anionic amphiphile of the surfactant composition of the invention is sodium dodecyl sulfate (SDS) and said anionic steroid is cholesterol sulfate (CS).
In a preferred embodiment, said a surfactant composition comprises an anionic amphiphile and an anionic steroid encapsulated into the assembled protein cage, wherein the molar ratio of the anionic amphiphile and anionic steroid within the surfactant composition is from 50% anionic amphiphile/50% anionic steroid to 100% anionic amphiphile/0% anionic steroid, more preferably from 75% anionic amphiphile/25% anionic steroid to 100% anionic amphiphile/0% anionic steroid. In a preferred embodiment, said anionic steroid is used encapsulation of large planar cargo molecules, wherein preferably the surfactant composition comprises 75% of anionic amphiphile and 25% anionic steroid, indicated as molar ratio.
In a preferred embodiment, said anionic amphiphile is SDS and said anionic steroid is CS, wherein SDS is included in the surfactant composition from 100% to 50%, preferably from 100% to 75% in relation to CS, indicated as molar ratio.
In a preferred embodiment, said surfactant composition comprises SDS and CS. In a preferred embodiment, said surfactant composition consists of SDS and CS. In a preferred embodiment, said surfactant composition comprises SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1. In a preferred embodiment, said surfactant composition consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1. In a preferred embodiment, said surfactant composition comprises SDS and CS, and wherein said molar ratio of said SDS to CS is 3 to 1. In a preferred embodiment, said surfactant composition consists of SDS and CS, and wherein said molar ratio of said SDS to CS is 3 to 1.
Said a protein cage comprising at least one polypeptide comprising an amino acid sequence I consisting of:
wherein any of X1 to X29 are independently of each other an amino acid, provided that at least 3 of X1 to X6 are independently of each other a positively charged amino acid, and wherein optionally up to 5 amino acids in positions other than denoted by X1 to X29 in SEQ ID NO: 1 are exchanged by any amino acid;
The term “polypeptide” as used herein refers to any peptide-bond-linked polymer of amino acids, regardless of size, length, secondary and tertiary structure, number of subunits or post-translational modification. Thus, the term “polypeptide” is to be understood as covering the terms “peptide”, “protein”, “amino acid chain”, “amino acid sequence”. Polypeptides in accordance with the invention can be an open linear peptide chain or cyclic peptides; alternatively or additionally, peptides of the invention may include at least one chemical modification, such as lipidation, glycosylation and phosphorylation. Peptides, as understood herein, especially peptides of the invention, are isolated or, preferably can be produced by chemical synthesis, RNA translation and/or recombinant processes.
The term “amino acid”, as used herein, refers to organic compounds containing the functional groups amine (—NH2) and carboxylic acid (—COOH) and its zwitterions, typically and preferably, along with a side chain specific to each amino acid. The term “amino acid” typically and preferably includes amino acids that occur naturally, such as proteinogenic amino acids (produced by RNA-translation), non-proteinogenic amino acids (produced by other metabolic mechanisms, e.g. posttranslational modification), standard or canonical amino acids (that are directly encoded by the codons of the genetic code) and non-standard or non-canonical amino acids (not directly encoded by the genetic code). Naturally occurring amino acids include proteinogenic and non-proteinogenic amino acids. The term “amino acid”, as used herein, also includes unnatural amino acids that are chemically synthesized. Moreover, the term covers alpha- (α-), beta- (β-), gamma- (γ-) and delta- (δ-) etc. amino acids as well as mixtures thereof in any ratio, and, if applicable, any isomeric form of an amino acid, i.e. its D- and L-stereoisomers (alternatively addressed by the (R) and (S) nomenclature) as well as mixtures thereof in any ratio, preferably in a racemic ratio of 1:1. Amino acids in this invention are typically and preferably in L-configuration. The term “D-stereoisomer”, “L-stereoisomer”, “D-amino acid” or “L-amino acid” refers to the chiral alpha carbon of the amino acids. Amino acid can include modifications and/or attached compounds and residues, for example residues used for peptide synthesis, such as Boc, Fmoc or both.
The terms “Xn to Xm” and “Xn-m” are used interchangeably herein for denoting certain amino acid positions in SEQ ID NO: 1.
Optionally up to five amino acids in positions other than denoted by X1-29 in SEQ ID NO: 1 can be exchanged by any amino acid. The term “amino acid exchange” or “exchanged by any amino acid” used interchangeably herein includes or preferably refers to an exchange by deletion of one amino acid or substitution of a single amino acid by one or more amino acids (addition), more preferably by one, two or three amino acids. Most preferably, the term “amino acid exchange” refers to deletion of a single amino acid or substitution of a single amino acid by one, two or three amino acids. In a preferred embodiment, said amino acid exchange is a substitution. In another preferred embodiment, said amino acid exchange is a conservative amino acid substitution.
The term “conservative substitution” is an amino acid substitution that changes a given amino acid to a different amino acid with similar biochemical properties. Conservative substitutions include and preferably refer to isosteric substitutions and substitutions where the charged, polar, aromatic, aliphatic or hydrophobic nature of the amino acid is maintained. Conservative substitutions refer to substitutions that maintain the capability of the polypeptide of the invention to self-assemble into the protein cage of the invention.
The term “positively charged” as used herein includes and preferably refers to a molecule that has a positively charged group. More preferably, said positively charged molecule has a positively charged group at neutral or physiological pH.
In preferred embodiments of the present invention, optionally up to 4 amino acids, more preferably up to 3 amino acids, again more preferably up to 2 amino acids, most preferably 1 amino acid in positions other than denoted by X1-29 in SEQ ID NO: 1 is/are exchanged by any amino acid.
In a preferred embodiment, said polypeptide has a length of about 500 amino acids or less, preferably about 400 amino acids or less, more preferably 300 amino acids or less, again more preferably 250 amino acids or less, again more preferably 200 amino acids or less, again more preferably from 188 to 230 amino acids, most preferably 192 amino acids.
In another preferred embodiment, said polypeptide is an isolated polypeptide.
In another preferred embodiment, said polypeptide consists of an amino acid sequence consisting of SEQ ID NO: 1. In another preferred embodiment, said polypeptide consists of an amino acid sequence consisting of SEQ ID NO: 1, wherein optionally up to 5 amino acids in positions other than denoted by X1-29 in SEQ ID NO: 1 are exchanged by any amino acid.
In another preferred embodiment, said positively charged amino acids of said at least 3 of X1 to X6 are independently of each other arginine or a conservative substitution of arginine. In another preferred embodiment, said positively charged amino acids of said at least 3 of said X1 to X6 are independently of each other selected from the group consisting of arginine, 5-methyl-arginine, gamma-hydroxy arginine, 2-amino-4-guanidinobutryric acid, 2-amino-3-guanidinopropionic acid, canavanine, homoarginine, lysine, diaminobutyric acid, 2,3-diaminopropanoic acid, (2S)-2,8-diaminooctanoic acid, ornithine, thialysine and histidine.
In another preferred embodiment, said positively charged amino acids of said at least 3 of X1 to X6 are independently of each other selected from the group consisting of arginine, 5-methyl-arginine, gamma-hydroxy arginine, 2-amino-4-guanidinobutryric acid, 2-amino-3-guanidinopropionic acid, canavanine, homoarginine, lysine, diaminobutyric acid, 2,3-diaminopropanoic acid, (2S)-2,8-diaminooctanoic acid, ornithine and thialysine.
In another more preferred embodiment, said positively charged amino acid of said at least 3 of X1 to X6 is independently of each other histidine, arginine or lysine. In another more preferred embodiment, said positively charged amino acid of said at least 3 of X1 to X6 is independently of each other arginine or lysine. In another more preferred embodiment, said at least 3 positively charged amino acid of said X1 to X6 are independently of each arginine. In one embodiment, each of said positively charged amino acids X1 to X6 is arginine. In one embodiment, said positively charged amino acids X1 to X6 is lysine.
X7 is cysteine, a conservative substitution thereof or a positively charged amino acid.
In a preferred embodiment, said conservative substitution of cysteine is selected from the group consisting of methionine, homocysteine, selenocysteine, serine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, homoserine, 3-hydroxy-L-valine, 4,5-dihydroxy-isoleucine, 6-hydroxy-L-norleucine, S-(2-hydroxyethyl)-L-cysteine, phosphoserine, 4-hydroxy-L-threonine, threonine, and phosphothreonine. In a preferred embodiment, said conservative substitution of cysteine is selected from the group consisting of homocysteine, selenocysteine, serine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, homoserine, 3-hydroxy-L-valine, 4,5-dihydroxy-isoleucine, 6-hydroxy-L-norleucine, S-(2-hydroxyethyl)-L-cysteine, phosphoserine, 4-hydroxy-L-threonine, threonine, and phosphothreonine. More preferably, said conservative substitution of cysteine is selected from the group consisting of homocysteine, selenocysteine, and serine. Again more preferably, said conservative substitution of cysteine is homocysteine or selenocysteine.
In another preferred embodiment, X7 is selected from the group consisting of homocysteine, selenocysteine, cysteine, arginine, 5-methyl-arginine, gamma-hydroxy arginine, 2-amino-4-guanidinobutryric acid, 2-amino-3-guanidinopropionic acid, canavanine, homoarginine, lysine, diaminobutyric acid, 2,3-diaminopropanoic acid, (2S)-2,8-diaminooctanoic acid, ornithine and thialysine.
In another preferred embodiment, X7 is selected from the group consisting of homocysteine, selenocysteine, serine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3 -dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, homoserine, 3-hydroxy-L-valine, 4,5-dihydroxy-isoleucine, 6-hydroxy-L-norleucine, S-(2-hydroxyethyl)-L-cysteine, phosphoserine, 4-hydroxy-L-threonine, threonine, and phosphothreonine, arginine, 5-methyl-arginine, gamma-hydroxy arginine, 2-amino-4-guanidinobutryric acid, 2-amino-3-guanidinopropionic acid, canavanine, homoarginine, lysine, diaminobutyric acid, 2,3-diaminopropanoic acid, (2S)-2,8-diaminooctanoic acid, ornithine and thialysine. In another preferred embodiment, said X7 is selected from arginine, lysine, serine, homocysteine, or cysteine. In another further preferred embodiment, said X7 is selected from arginine, lysine, homocysteine, or cysteine.
If X7 is cysteine or a conservative substitution of cysteine it could be used to append positively charged groups via disulfide formation (e.g. cysteamine, 1-(3-mercaptopropyl)guanidine etc.) to the polypeptide of the invention.
In one embodiment, said at least 3 of said X1 to X6 are independently of each other lysine or arginine, and said X7 is selected from arginine, lysine, serine, homocysteine or cysteine.
In amino acid sequence I consisting of SEQ ID NO: 1, at least 3 of said amino acids X1 to X6 are independently of each other positively charged amino acids. In a preferred embodiment, at least 4, more preferably at least 5, again more preferably 6, i.e. each of said amino acids X1 to X6 is/are independently of each other a positively charged amino acid.
In a preferred embodiment, at least 4, preferably at least 5, more preferably 6, i.e. each of said amino acids X1 to X6 is/are independently of each other a positively charged amino acid, wherein said positively charged amino acid is arginine or lysine. In a preferred embodiment, at least 4, preferably at least 5, more preferably each of said amino acids of X1 to X6 is/are independently of each other a positively charged amino acids wherein said positively charged amino acids are arginine. In a preferred embodiment, at least 4, preferably at least 5, more preferably 6 of said amino acids X1 to X6 are independently of each other positively charged amino acids, wherein said positively charged amino acids are lysine.
In a preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1 to X3. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1, X2, and X4. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1, X2, and X5. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1, X2, and X6.
In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X2, X3, and X4. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X2, X3, and X5. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X2, X3, and X6.
In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X3, X4, and X5. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X3, X4, and X6. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X3, X4, and X1.
In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X4, X5, and X6. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X4, X5, and X1. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X4, X5, and X2.
In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X5, X6, and X1. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X5, X6, and X2. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X5, X6, and X3.
In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1, X3, and X5. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1, X3, and X6. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X2, X4, and X6. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1, X4, and X6.
In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1, X2, X3, and X4. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1, X2, X3, and X5. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1, X2, X3, and X6. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X2, X3, X4, and X5. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X2, X3, X4, and X6. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X3 to X6. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X2, X4, X5, and X6. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1, X4, X5, and X6. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1, X3, X5, and X6. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1, X3, X4, and X6. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X2, X3, X4, and X6.
In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1-5. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1-4 and X6. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1-3, X5 and X6. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1, X2, and X4 to X6. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1, and X3-6. In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X2 to X6.
In another preferred embodiment, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1 to X6 are independently of each other positively charged amino acids.
More preferably, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1 to X6, or X4 to X6, or X1, X2 and X5, or X1, X2, X4, and X5, or X1, X3, X4, and X6, or X1 and X4 to X6, or X1 to X3 and X6, or X1 to X5. More preferably, said at least 3 of X1 to X6 being independently of each other a positively charged amino acid are X1 to X6, or X4 to X6, or X1, X2 and X5, or X1, X2, X4, and X5, or X1, X3, X4, and X6, or X1 and X4 to X6, or X1 to X3 and X6, or X1 to X5.
In a preferred embodiment, X4 to X6 are independently of each other lysine or arginine. In another preferred embodiment, X1, X2, and X4 are independently of each other lysine or arginine. In another preferred embodiment, X1, X2, X4, and X5 are independently of each other lysine or arginine. In another preferred embodiment, X1, X3, X4, and X6 are independently of each other lysine or arginine. In another preferred embodiment, X1, and X3 to X6 are independently of each other independently of each other lysine or arginine. In another preferred embodiment, X1-3, X5 and X6 are independently of each other lysine or arginine. In another preferred embodiment, X1-5 are independently of each other lysine or arginine. In another preferred embodiment, X1 to X6 are independently of each other lysine or arginine.
In a preferred embodiment, each of X1 to X6 is independently arginine or lysine. In a preferred embodiment, each of X1 to X6 is arginine. In a preferred embodiment, each of X1 to X6 is lysine.
In another preferred embodiment, X1 to X6 are independently of each other lysine or arginine and X7 is selected from arginine, lysine or cysteine.
The inventors found out that amino acids at positions X8 to X12 define pore size of the protein cage formed via self-assembly by the polypeptide of the invention. Larger amino acid at positions X8 to X12 reduced pore size, while smaller amino acid at these positions increased pore size.
Thus, in a preferred embodiment, said amino acids at positions X8 to X12 in the polypeptide of the invention are selected such that the surfactant composition can be loaded into and released from the protein cage and cargo into/from the lipoprotein cage, without disassembly. In another preferred embodiment, said amino acids at positions X8 to X12 in the polypeptide of the invention are selected such that cargo can be loaded extracellularly into the protein cage and released intracellularly, preferably, into the cytoplasm of a cell, without disassembly.
Preferably, said cargo is small cargo. Small cargo has preferably a size of 1000 Da or below. In a preferred embodiment, a size of 1000 Da or below means that said cargo has a size of 1000 Da or lower, preferably 800 Da or lower, more preferably 600 Da or lower, again more preferably 500 Da or lower, again more preferably 400 Da or lower, again more preferably 300 Da or lower, again more preferably 200 Da or lower, again more preferably 100 Da or lower. Preferably, said cargo is hydrophobic cargo, more preferably non-polar cargo.
Preferably, said cargo has a low solubility in aqueous media. Preferably, said small cargo having a size of 1000 Da or below has a low solubility in aqueous media. More preferably said cargo of low solubility is included in Class II or Class IV of the Biopharmaceutics Classification System (BCS). Again more preferably, said low soluble cargo has a lower solubility as a highly soluble cargo for which the highest strength dose is soluble in 250 mL or less of aqueous media over the pH range of 1.0-7.5, more preferably, 1.0-6.8, at 37±1° C.
In a preferred embodiment, said X8 is glycine or a conservative substitution thereof. Preferably, said conservative substitution of glycine is selected from the group consisting of, alanine, leucine, valine, tert-leucine, homoleucine, isoleucine, alloisoleucine 2-aminobutyric acid, diethylalanine, norleucine, and norvaline. More preferably, said conservative substitution of glycine is selected from the group consisting of, alanine, leucine, valine, tert-leucine, homoleucine, isoleucine, alloisoleucine 2-aminobutyric acid, diethylalanine, norleucine, and norvaline. In a further preferred embodiment, said X8 is selected from the group consisting of glycine, alanine, leucine, and valine. In a preferred embodiment, said X8 is glycine. In another preferred embodiment, said X9 and X12 are independently of each other glutamine or a conservative substitution thereof. Preferably said conservative substitution of glutamine is selected from the group consisting of asparagine, beta-hydroxyasparagine, 3-methyl-L-glutamine, (2S,4S)-2,5-diamino-4-hydroxy-5-oxopentanoic acid, and n-methyl-asparagine. In another preferred embodiment, said X9 and X12 are independently of each other glutamine or asparagine.
In another preferred embodiment, said X9 and X12 are both glutamine. In another preferred embodiment, said X9 and X12 are both asparagine. In another preferred embodiment, said X9 or X12 is glutamine. In another preferred embodiment, said X9 is glutamine. In another preferred embodiment, said X12 is glutamine.
In another preferred embodiment, said X10 is glutamate or a conservative substitution thereof. Preferably, said conservative substitution of glutamate is selected from the group consisting of glutamate, aspartate, (2S,4R)-4-methylglutamate, (3S)-3-methyl-L-glutamic acid, (3R)-3-methyl-L-glutamic acid, 5-O-methyl-glutamic acid, 4-hydroxy-glutamic acid, 6-carboxylysine, beta-hydroxyaspartic acid, 2-amino-propanedioic acid, 3,3-dimethyl aspartic acid, 2-aminoadipic acid and 3-methyl-aspartic acid. In a further preferred embodiment, said X10 is glutamate or aspartate. In another further preferred embodiment, said X10 is glutamate.
In another preferred embodiment, said X11 is selected from the group consisting of serine or a conservative substitution thereof. Preferably said conservative substitution of serine is selected from the group consisting of cysteine, methionine, homocysteine, selenocysteine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, homoserine, 3-hydroxy-L-valine, 4,5-dihydroxy-isoleucine, 6-hydroxy-L-norleucine, S-(2-hydroxyethyl)-L-cysteine, phosphoserine, 4-hydroxy-L-threonine, threonine, and phosphothreonine. More preferably said conservative substitution of serine is selected from the group consisting of cysteine, homocysteine, selenocysteine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, homoserine, 3-hydroxy-L-valine, 4,5-dihydroxy-isoleucine, 6-hydroxy-L-norleucine, S-(2-hydroxyethyl)-L-cysteine, phosphoserine, 4-hydroxy-L-threonine, threonine, and phosphothreonine. Again more preferably said conservative substitution of serine is selected from the group consisting of hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, homoserine, 3-hydroxy-L-valine, 4,5-dihydroxy-isoleucine, 6-hydroxy-L-norleucine, S-(2-hydroxyethyl)-L-cysteine, phosphoserine, 4-hydroxy-L-threonine, threonine, and phosphothreonine. Again more preferably said conservative substitution of serine is selected from the group consisting of homoserine, threonine, 4-hydroxy-L-threonine, 6-hydroxy-L-norleucine, 4,5-dihydroxy-isoleucine, 3-hydroxy-L-valine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, and allo-threonine.
In a further preferred embodiment, said X11 is selected from the group consisting of serine, homoserine, and threonine. In another further preferred embodiment, said X11 is serine.
In a preferred embodiment, said X8 is selected from the group consisting of glycine, alanine, leucine, valine, tert-leucine, homoleucine, isoleucine, alloisoleucine 2-aminobutyric acid, diethylalanine, norleucine, and norvaline; X9 and X12 are independently of each other selected from the group consisting of glutamine, asparagine, beta-hydroxyasparagine, 3-methyl-L-glutamine, (2S,4S)-2,5-diamino-4-hydroxy-5-oxopentanoic acid, and n-methyl-asparagine; X10 is selected from the group consisting of glutamate, aspartate (2S,4R) methylglutamate, (3S)-3-methyl-L-glutamic acid, (3R)-3-methyl-L-glutamic acid, 5-O-methyl-glutamic acid, 4-hydroxy-glutamic-acid, 6-carboxylysine, beta-hydroxyaspartic acid, 2-amino-propanedioic acid, 3,3-dimethyl aspartic acid, 2-aminoadipic acid and 3-methyl-aspartic acid; and X11 is selected from the group consisting of serine, homoserine, threonine, 4-hydroxy-L-threonine, 6-hydroxy-L-norleucine, 4,5-dihydroxy-i soleucine, 3 -hydroxy-L-valine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, and allo-threonine.
In a further preferred embodiment, said X8 is selected from the group consisting of glycine, alanine, leucine, and valine; X9 and X12 are independently of each other glutamine or asparagine; X10 is glutamate or aspartate; and X11 is selected from the group consisting of serine, threonine, and homoserine.
In a further preferred embodiment, said X8 is glycine, X9 and X12 are is glutamine, X10 is glutamate and X11 is serine.
In a preferred embodiment, each of X8, X9, X10, X11 and X12 is independently selected from the group consisting of glycine, alanine, leucine, valine, tert-leucine, homoleucine, isoleucine, alloisoleucine 2-aminobutyric acid, diethylalanine, norleucine, norvaline; glutamine, asparagine, beta-hydroxyasparagine, 3-methyl-L-glutamine, (2S,4S)-2,5-diamino-4-hydroxy-5-oxopentanoic acid, n-methyl-asparagine; glutamate, aspartate (2S,4R)-4-methylglutamate, (3 S)-3-methyl-L-glutamic acid, (3R)-3-methyl-L-glutamic acid, 5-O-methyl-glutamic acid, 4-hydroxy-glutamic-acid, 6-carboxylysine, beta-hydroxyaspartic acid, 2-amino-propanedioic acid, 3,3-dimethyl aspartic acid, 2-aminoadipic acid, 3-methyl-aspartic acid; serine, homoserine, threonine, 4-hydroxy-L-threonine, 6-hydroxy-L-norleucine, 4,5-dihydroxy-isoleucine, 3 -hydroxy-L-valine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, and allo-threonine.
In a further preferred embodiment, each of X8, X9, X10, X11 and X12 is independently selected from the group consisting of glycine, alanine, leucine, valine; glutamate, aspartate; glutamine, asparagine; serine, threonine, and homoserine.
In a further preferred embodiment, each of X8, X9, X10, X11 and X12 is independently selected from the group consisting of serine, glycine, glutamine, and glutamate.
In a preferred embodiment, in sequence I X1-6 are lysine or arginine, X7 is selected from arginine, lysine or cysteine and each of X8, X9, X10, X11 and X12 is independently selected from the group consisting of glycine, alanine, leucine, valine, tert-leucine, homoleucine, isoleucine, alloisoleucine 2-aminobutyric acid, diethylalanine, norleucine, norvaline; glutamine, asparagine, beta-hydroxyasparagine, 3 -methyl-L-glutamine, (2S,4S)-2,5-diamino-4-hydroxy-5-oxopentanoic acid, n-methyl-asparagine; glutamate, aspartate (2S,4R)-4-methylglutamate, (3S)-3-methyl-L-glutamic acid, (3R)-3-methyl-L-glutamic acid, 5-O-methyl-glutamic acid, 4-hydroxy-glutamic-acid, 6-carboxylysine, beta-hydroxyaspartic acid, 2-amino-propanedioic acid, 3,3-dimethyl aspartic acid, 2-aminoadipic acid, 3-methyl-aspartic acid; serine, homoserine, threonine, 4-hydroxy-L-threonine, 6-hydroxy-L-norleucine, 4,5-dihydroxy-isoleucine, 3-hydroxy-L-valine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2 S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, and allo-threonine. In a preferred embodiment, preferably the C-terminus of the amino acid sequence I, and wherein in sequence I X1-6 are lysine or arginine, X7 is selected from arginine, lysine or cysteine and each of X8, X9, X10, X11 and X12 is independently selected from the group consisting of glycine, alanine, leucine, valine, tert-leucine, homoleucine, isoleucine, alloisoleucine 2-aminobutyric acid, diethylalanine, norleucine, norvaline; glutamine, asparagine, beta-hydroxyasparagine, 3-methyl-L-glutamine, (2S,4S)-2,5-diamino-4-hydroxy-5-oxopentanoic acid, n-methyl-asparagine; glutamate, aspartate (2S,4R)-4-methyl glutamate, (3S)-3-methyl-L-glutamic acid, (3R)-3-methyl-L-glutamic acid, 5-O-methyl-glutamic acid, 4-hydroxy-glutamic-acid, 6-carboxylysine, beta-hydroxyaspartic acid, 2-amino-propanedioic acid, 3,3-dimethyl aspartic acid, 2-aminoadipic acid, 3-methyl-aspartic acid; serine, homoserine, threonine, 4-hydroxy-L-threonine, 6-hydroxy-L-norleucine, 4,5-dihydroxy-isoleucine, 3-hydroxy-L-valine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyl eucine, and allo-threonine.
In another preferred embodiment, at least three of X1-6 are independently of each other lysine or arginine, X7 is selected from arginine, lysine or cysteine, X8 is selected from the group consisting of glycine, alanine, leucine, and valine; X9 and X12 are independently of each other glutamine or asparagine; X10 is glutamate or aspartate; and X11 is selected from the group consisting of serine, threonine, and homoserine. In another preferred embodiment, at least three of X1-6 are independently of each other lysine or arginine, X8 is glycine, X9 and X12 are independently of each other glutamine, X10 is glutamate and X11 is serine. In another preferred embodiment, X1-6 are lysine or arginine, X7 is selected from arginine, lysine or cysteine and X8 is glycine, X9 and X12 are independently of each other glutamine, X10 is glutamate and X11 is serine. In another preferred embodiment, X1-6 are lysine or arginine, X7 is selected from arginine, lysine or cysteine and X8 is glycine, X9 and X12 are independently of each other is glutamine, X10 is glutamate and X11 is serine.
In one embodiment, at least 3 of said X1-6 are independently of each other lysine or arginine, said X7 is selected from arginine, lysine or cysteine, X8 is glycine, X9 and X12 are independently of each other glutamine, X10 is glutamate and X11 is serine.
In another preferred embodiment, X1 to X6 are independently of each other lysine or arginine, X7 is selected from arginine, lysine or cysteine, X8 is glycine, X9 and X12 are independently of each other glutamine, X10 is glutamate and X11 is serine and said polypeptide.
In a preferred embodiment, in sequence I X1-6 are lysine or arginine, X7 is selected from arginine, lysine or cysteine and X8 is glycine, X9 and X12 are independently of each other glutamine, X10 is glutamate and X11 is serine. In a preferred embodiment, in sequence I X1-6 are lysine or arginine, X7 is selected from arginine, lysine or cysteine and X8 is glycine, X9 and X12 are independently of each other glutamine, X10 is glutamate and X11 is serine; and said protein cage or said complex of the invention comprising said polypeptide of the invention is capable of endosomal escape.
When the polypeptide of the invention forms a protein cage by self-assembly, amino acids X13 to X29 are exposed to the external surface of the protein cage and are therefore most prone to tolerate mutations while still keeping ability to self-assemble into a cage-like protein cage.
Thus, in a preferred embodiment, amino acids X13 to X29 exposed to the external surface of the protein cage are any amino acid.
In a preferred embodiment, X13, X17 and X19 are independently of each other serine or a conservative substitution thereof. Preferably said conservative substitution of serine is selected from the group consisting of cysteine, methionine, homocysteine, selenocysteine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, homoserine, 3-hydroxy-L-valine, 4,5-dihydroxy-isoleucine, 6-hydroxy-L-norleucine, S-(2-hydroxyethyl)-L-cysteine, phosphoserine, 4-hydroxy-L-threonine, threonine, and phosphothreonine. More preferably said conservative substitution of serine is selected from the group consisting of cysteine, homocysteine, selenocysteine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, homoserine, 3-hydroxy-L-valine, 4,5-dihydroxy-isoleucine, 6-hydroxy-L-norleucine, S-(2-hydroxyethyl)-L-cysteine, phosphoserine, 4-hydroxy-L-threonine, threonine, and phosphothreonine. Again more preferably said conservative substitution of serine is selected from the group consisting of hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, homoserine, 3-hydroxy-L-valine, 4,5-dihydroxy-isoleucine, 6-hydroxy-L-norleucine, S-(2-hydroxyethyl)-L-cysteine, phosphoserine, 4-hydroxy-L-threonine, threonine, and phosphothreonine. Again more preferably said conservative substitution of serine is selected from the group consisting of homoserine, threonine, 4-hydroxy-L-threonine, 6-hydroxy-L-norleucine, 4,5-dihydroxy-isoleucine, 3-hydroxy-L-valine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, and allo-threonine.
In another preferred embodiment, X13, X17 and X19, are independently of each other selected from the group consisting of serine, homoserine, and threonine. In another preferred embodiment, X13, X17 and X19, are independently of each other serine.
In another preferred embodiment, X14, X16 and X24 are independently of each other asparagine or a conservative substitution thereof. Preferably, said conservative substitution of asparagine is selected from the group consisting of glutamine, beta-hydroxyasparagine, 3-methyl-L-glutamine, (2S,4S)-2,5-diamino-4-hydroxy-5-oxopentanoic acid, and n-methyl-asparagine
In another preferred embodiment, X14, X16 and X24 are independently of each other asparagine. In another preferred embodiment, X14, X16 and X24 are independently of each other asparagine or glutamine.
In another preferred embodiment, X15, X20 X26 and X28 are independently of each other aspartate, glutamate or a conservative substitution thereof. Preferably said conservative substitution of aspartate or glutamate is selected from the group consisting of (2S,4R)-4-methylglutamate, (3S)-3-methyl-L-glutamic acid, (3R)-3-methyl-L-glutamic acid, 5-O-methyl-glutamic acid, 4-hydroxy-glutamic-acid, 6-carboxylysine, beta-hydroxyaspartic acid, 2-amino-propanedioic acid, 3,3-dimethyl aspartic acid, 2-aminoadipic acid and 3-methyl-aspartic acid.
In another preferred embodiment, X15 and X20 are independently of each other aspartate. In another preferred embodiment, X26 and X28 are independently of each other glutamate.
In another preferred embodiment, X15, X20 X26 and X28 are independently of each other aspartate or glutamate. In another preferred embodiment, X15, X20 X26 and X28 are independently of each other selected from the group consisting of glutamate, aspartate, 2S,4R-4-methylglutamate, (3S)-3-methyl-L-glutamic acid, (3R)-3-methyl-L-glutamic acid, 5-O-methyl-glutamic acid, 4-hydroxy-glutamic-acid, 6-carboxylysine, beta-hydroxyaspartic acid, 2-amino-propanedioic acid, 3,3-dimethyl aspartic acid, 2-aminoadipic acid and 3-methyl-aspartic acid.
In another preferred embodiment, X18 and X29 are independently of each other leucine or a conservative substitution thereof. Preferably said conservative substitution of leucine is selected from the group consisting of glycine, alanine, valine, tert-leucine, homoleucine, isoleucine, alloisoleucine 2-aminobutyric acid, diethylalanine, norleucine, and norvaline.
In another preferred embodiment, X18 and X29 are independently of each other leucine. In a preferred embodiment, said X18 and X29 are independently of each other selected from the group consisting of glycine, alanine, leucine, and valine. In a preferred embodiment, said X18, and X29 are independently of each other selected from the group consisting of glycine, alanine, leucine, valine, tert-leucine, homoleucine, isoleucine, alloisoleucine 2-aminobutyric acid, diethylalanine, norleucine, and norvaline.
In another preferred embodiment, X21, X22, X23, X25 and X27 are independently of each other a positively charged amino acid.
In another preferred embodiment, X21, X22, X23, X25 and X27 are independently of each other arginine, lysine or a conservative substitution thereof. Preferably said conservative substitution of arginine or leucine is selected from the group consisting of histidine, 5-methyl-arginine, gamma-hydroxy arginine, 2-amino-4-guanidinobutryric acid, 2-amino-3-guanidinopropionic acid, canavanine, homoarginine, diaminobutyric acid, 2,3-diaminopropanoic acid, (2S)-2,8-diaminooctanoic acid, ornithine and thialysine. More preferably said conservative substitution of arginine or leucine is selected from the group consisting of 5-methyl-arginine, gamma-hydroxy arginine, 2-amino-4-guanidinobutryric acid, 2-amino-3-guanidinopropionic acid, canavanine, homoarginine, diaminobutyric acid, 2,3-diaminopropanoic acid, (2S)-2,8-diaminooctanoic acid, ornithine and thialysine.
In another preferred embodiment, X21, X22, X23, X25 and X27 are independently of each other histidine, arginine or lysine. In another preferred embodiment, X21, X22, X23, X25 and X27 are independently of each other arginine or lysine. In another preferred embodiment, X21, X22, X23, X25 and X27 are independently of each other is arginine. In another preferred embodiment, X21, X22, X23, X25 and X27 are independently of each other 6 is lysine.
In another preferred embodiment, X21, X22, X25 and X27 are independently of each other arginine. In another preferred embodiment, X23 is lysine. In another preferred embodiment, X21, X22, X23, X25 and X27 are independently of each other selected from the group consisting of arginine, 5-methyl-arginine, gamma-hydroxy arginine, 2-amino-4-guanidinobutryric acid, 2-amino-3 -guanidinopropionic acid, canavanine, homoarginine, lysine, diaminobutyric acid, 2,3-diaminopropanoic acid, (2S)-2,8-diaminooctanoic acid, ornithine and thialysine.
In another preferred embodiment, X13, X17 and X19, are independently of each other serine; X14, X16 and X24 are independently of each other asparagine; X15 and X20 are independently of each other aspartate; X18 and X29 are independently of each other leucine; X26 and X28 are independently of each other glutamate; X21, X22, X25 and X27 are independently of each other arginine; and X23 is lysine.
In another preferred embodiment, X13, X17 and X19, are independently of each other selected from the group consisting of serine, homoserine, and threonine; X14, X16 and X24 are independently of each other asparagine or glutamine; X15, X20 X26 and X28 are independently of each other aspartate or glutamate; said X18 and X29 are independently of each other selected from the group consisting of glycine, alanine, leucine, and valine; X21, X22, X23, X25 and X27 are independently of each other arginine or lysine.
In another preferred embodiment, X13, X17, and X19, are independently of each other selected from the group consisting of serine, homoserine, threonine, 4-hydroxy-L-threonine, 6-hydroxy-L-norleucine, 4,5-dihydroxy-isoleucine, 3 -hydroxy-L-valine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, and allo-threonine;
X14, X16, and X24 are independently of each other selected from the group consisting of glutamine, asparagine, beta-hydroxyasparagine, 3-methyl-L-glutamine, (2S,4S)-2,5-diamino-4-hydroxy-5-oxopentanoic acid, and n-methyl-asparagine;
X15, X20 X26 and X28 are independently of each other selected from the group consisting of glutamate, aspartate, (2S,4R)-4-methylglutamate, (3S)-3-methyl-L-glutamic acid, (3R)-3-methyl-L-glutamic acid, 5-O-methyl-glutamic acid, 4-hydroxy-glutamic-acid, 6-carboxylysine, beta-hydroxyaspartic acid, 2-amino-propanedioic acid, 3,3-dimethyl aspartic acid, 2-aminoadipic acid and 3-methyl-aspartic acid;
X18, and X29 are independently of each other selected from the group consisting of glycine, alanine, leucine, valine, tert-leucine, homoleucine, isoleucine, alloisoleucine 2-aminobutyric acid, diethylalanine, norleucine, and norvaline; and
X21, X22, X23, X25 and X27 are independently of each other selected from the group consisting of arginine, 5-methyl-arginine, gamma-hydroxy arginine, 2-amino guanidinobutryric acid, 2-amino-3-guanidinopropionic acid, canavanine, homoarginine, lysine, diaminobutyric acid, 2,3-diaminopropanoic acid, (2S)-2,8-diaminooctanoic acid, ornithine and thialysine.
In another preferred embodiment, X8 is glycine or a conservative substitution thereof, wherein preferably said conservative substitution of glycine is selected from the group consisting of alanine, leucine, valine, tert-leucine, homoleucine, isoleucine, alloisoleucine 2-aminobutyric acid, diethylalanine, norleucine, and norvaline;
X9 and X12 are independently of each other glutamine or a conservative substitution thereof, wherein preferably said conservative substitution of glutamine is selected from the group consisting of asparagine, beta-hydroxyasparagine, 3 -methyl-L-glutamine, (2 S,4 S)-2,5-diamino-4-hydroxy-5-oxopentanoic acid, and n-methyl-asparagine;
X10 is glutamate or a conservative substitution thereof, wherein preferably said conservative substitution of glutamate is selected from the group consisting of aspartate (2S,4R)-4-methylglutamate, (3S)-3-methyl-L-glutamic acid, (3R)-3-methyl-L-glutamic acid, 5-O-methyl-glutamic acid, 4-hydroxy-glutamic-acid, 6-carboxylysine, beta-hydroxyaspartic acid, 2-amino-propanedioic acid, 3,3-dimethyl aspartic acid, 2-aminoadipic acid, and 3-methyl-aspartic acid; and
X11 is serine or a conservative substitution thereof, wherein preferably said conservative substitution of serine is selected from the group consisting of homoserine, threonine, 4-hydroxy-L-threonine, 6-hydroxy-L-norleucine, 4,5-dihydroxy-isoleucine, 3 -hydroxy-L-valine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, and allo-threonine; more preferably X8 is glycine, X9 and X12 are independently of each other glutamine or asparagine, X10 is glutamate or aspartate, and X11 is serine.
In another preferred embodiment, X13, X17, and X19 are independently of each other serine or a conservative substitution thereof, wherein preferably said conservative substitution of serine is selected from the group consisting of homoserine, threonine, 4-hydroxy-L-threonine, 6-hydroxy-L-norleucine, 4,5-dihydroxy-i soleucine, 3-hydroxy-L-valine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, and allo-threonine;
X14, X16, and X24 are independently of each other asparagine, glutamine or a conservative substitution thereof, wherein preferably said conservative substitution of asparagine or glutamine is selected from the group consisting of beta-hydroxyasparagine, 3-methyl-L-glutamine, (2S,4S)-2,5-diamino-4-hydroxy-5-oxopentanoic acid, and n-methyl-asparagine;
X15, X20 X26 and X28 are independently of each other aspartate, glutamate or a conservative substitution thereof, wherein preferably said conservative substitution of aspartate or glutamate is selected from the group consisting of (2S,4R)-4-methylglutamate, (3S)-3-methyl-L-glutamic acid, (3R)-3-methyl-L-glutamic acid, 5-O-methyl-glutamic acid, 4-hydroxy-glutamic-acid, 6-carboxylysine, beta-hydroxyaspartic acid, 2-amino-propanedioic acid, 3,3-dimethyl aspartic acid, 2-aminoadipic acid and 3-methyl-aspartic acid;
X18 and X29 are independently of each other leucine or a conservative substitution thereof, wherein preferably said conservative substitution of leucine is selected from the group consisting of glycine, alanine, valine, tert-leucine, homoleucine, isoleucine, alloisoleucine 2-aminobutyric acid, diethylalanine, norleucine, and norvaline; and
X21, X22, X23, X25 and X27 are independently of each other arginine, lysine or a conservative substitution thereof, wherein preferably said conservative substitution of arginine or leucine is selected from the group consisting of histidine, 5-methyl-arginine, gamma-hydroxy arginine, 2-amino-4-guanidinobutryric acid, 2-amino-3-guanidinopropionic acid, canavanine, homoarginine, diaminobutyric acid, 2,3-diaminopropanoic acid, (2S)-2,8-diaminooctanoic acid, ornithine and thialysine.
In a preferred embodiment, X8 is glycine or a conservative substitution thereof; X9 and X12 are independently of each other glutamine, asparagine or a conservative substitution thereof; X10 is glutamate, aspartate or a conservative substitution thereof; X11 is serine or a conservative substitution thereof; X13, X17, and X19 are independently of each other serine or a conservative substitution thereof; X14, X16, and X24 are independently of each other asparagine, glutamine or a conservative substitution thereof; X15, X20 X26 and X28 are independently of each other aspartate, glutamate or a conservative substitution thereof; X18 and X29 are independently of each other leucine or a conservative substitution thereof; and X21, X22, X23, X25 and X27 are independently of each other arginine, lysine or a conservative substitution thereof.
In another preferred embodiment, said positively charged amino acid of said at least 3 of X1 to X6 is independently of each other selected from the group consisting of arginine, 5-methyl-arginine, gamma-hydroxy arginine, 2-amino-4-guanidinobutyric acid, 2-amino guanidinopropionic acid, canavanine, homoarginine, lysine, diaminobutyric acid, 2,3-diaminopropanoic acid, (2S)-2,8-diaminooctanoic acid, ornithine, and thialysine;
X7 is cysteine, a conservative substitution thereof or a positively charged amino acid; preferably X7 is selected from the group consisting of homocysteine, cysteine, selenocysteine, arginine, 5-methyl-arginine, gamma-hydroxy arginine, 2-amino-4-guanidinobutyric acid, 2-amino-3-guanidinopropionic acid, canavanine, homoarginine, lysine, diaminobutyric acid, 2,3-diaminopropanoic acid, (2S)-2,8-diaminooctanoic acid, ornithine, thialysine and serine;
X8 is glycine or a conservative substitution thereof selected from the group consisting of, alanine, leucine, valine, tert-leucine, homoleucine, isoleucine, alloisoleucine 2-aminobutyric acid, diethylalanine, norleucine, and norvaline;
X9 and X12 are independently of each other glutamine or a conservative substitution thereof selected from the group consisting of asparagine, beta-hydroxyasparagine, 3-methyl-L-glutamine, (2S,4S)-2,5-diamino-4-hydroxy-5-oxopentanoic acid, and n-methyl-asparagine;
X10 is glutamate or a conservative substitution thereof selected from the group consisting of aspartate (2S,4R)-4-methylglutamate, (3S)-3-methyl-L-glutamic acid, (3R)-3-methyl-L-glutamic acid, 5-O-methyl-glutamic acid, 4-hydroxy-glutamic-acid, 6-carboxylysine, beta-hydroxyaspartic acid, 2-amino-propanedioic acid, 3,3 -dimethyl aspartic acid, 2-aminoadipic acid, and 3-methyl-aspartic acid; and
X11 is serine or a conservative substitution thereof selected from the group consisting of homoserine, threonine, 4-hydroxy-L-threonine, 6-hydroxy-L-norleucine, 4,5-dihydroxy-isoleucine, 3 -hydroxy-L-valine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3 -hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, and allo-threonine;
X13, X17, and X19, are independently of each other selected from the group consisting of serine, homoserine, threonine, 4-hydroxy-L-threonine, 6-hydroxy-L-norleucine, 4,5-dihydroxy-isoleucine, 3-hydroxy-L-valine, hydroxynorvaline, 2-amino-5-hydroxypentanoic acid, allo-threonine, 3,3-dihydroxy-alanine, 4-hydroxy-L-isoleucine, (2S,3R)-2-amino-3-hydroxy-4-methylpentanoic acid, beta-hydroxyleucine, and allo-threonine;
X14, X16, and X24 are independently of each other selected from the group consisting of glutamine, asparagine, b eta-hydroxyasparagine, 3-methyl-L-glutamine, (2S,4S)-2,5-diamino-4-hydroxy-5-oxopentanoic acid, and n-methyl-asparagine;
X15, X20 X26 and X28 are independently of each other selected from the group consisting of glutamate, aspartate, (2 S,4R)-4-methylglutamate, (3S)-3-methyl-L-glutamic acid, (3R)-3-methyl-L-glutamic acid, 5-O-methyl-glutamic acid, 4-hydroxy-glutamic-acid, 6-carboxylysine, beta-hydroxyaspartic acid, 2-amino-propanedioic acid, 3,3 -dimethyl aspartic acid, 2-aminoadipic acid and 3-methyl-aspartic acid;
X18, and X29 are independently of each other selected from the group consisting of glycine, alanine, leucine, valine, tert-leucine, homoleucine, isoleucine, alloisoleucine 2-aminobutyric acid, diethylalanine, norleucine, and norvaline; and
X21, X22, X23, X25 and X27 are independently of each other selected from the group consisting of arginine, 5-methyl-arginine, gamma-hydroxy arginine, 2-amino guanidinobutryric acid, 2-amino-3-guanidinopropionic acid, canavanine, homoarginine, lysine, diaminobutyric acid, 2,3-diaminopropanoic acid, (2S)-2,8-diaminooctanoic acid, ornithine and thialysine.
In another preferred embodiment, said positively charged amino acid of said at least 3 of X1 to X6 is independently of each other arginine or lysine, X7 is selected from the group consisting of homocysteine, cysteine, arginine, and lysine, X8 is selected from the group consisting of glycine, alanine, leucine, and valine, X9 and X12 are independently of each other glutamine or asparagine, X10 is glutamate or aspartate, X11, X13, X17, and X19 are independently of each other selected from the group consisting of serine, threonine, and homoserine, X14, X16, and X24 are independently of each other asparagine or glutamine, X15, X20 X26 and X28 are independently of each other aspartate or glutamate, X18 and X29 are independently of each other selected from the group consisting of leucine, glycine, alanine, and valine, and X21, X22, X23, X25 and X27 are independently of each other arginine or lysine.
In another preferred embodiment, said positively charged amino acid of said at least 3 of X1 to X6 is independently of each other arginine or lysine, X7 is selected from the group consisting of homocysteine, cysteine, arginine, and lysine, X8 is glycine, X9 and X12 are independently of each other glutamine or asparagine, X10 is glutamate or aspartate, X11 is serine, X13, X17, and X19 are independently of each other serine, X14, X16, and X24 are independently of each other asparagine, X15, X20 X26 and X28 are independently of each other aspartate or glutamate, X18 and X29 are independently of each other leucine, and X21, X22, X23, X25 and X27 are independently of each other arginine or lysine.
In another preferred embodiment, at least three of X1-6 are independently of each other lysine or arginine, X7 is selected from arginine, lysine or cysteine, X8 is selected from the group consisting of glycine, alanine, leucine, and valine; X9 and X12 are independently of each other glutamine or asparagine; X10 is glutamate or aspartate; and X11 is selected from the group consisting of serine, threonine, and homoserine; X18 and X29 are independently of each other leucine; X15 and X20 are independently of each other aspartate; X26 and X28 are independently of each other glutamate; X14, X16 and X24 are independently of each other asparagine; X13, X17 and X19, are independently of each other serine; X21, X22, X25 and X27 are independently of each other arginine; and X23 is lysine.
In another preferred embodiment, at least three of X1-6 are independently of each other lysine or arginine, X7 is selected from arginine, lysine or cysteine, X8 is selected from the group consisting of glycine, alanine, leucine, and valine; X9 and X12 are independently of each other glutamine or asparagine; X10 is glutamate or aspartate; and X11 is selected from the group consisting of serine, threonine, and homoserine; X13, X17 and X19, are independently of each other selected from the group consisting of serine, homoserine, and threonine; X14, X16 and X24 are independently of each other asparagine or glutamine; X15, X20 X26 and X28 are independently of each other aspartate or glutamate; said X18 and X29 are independently of each other selected from the group consisting of glycine, alanine, leucine, and valine; X21, X22, X23, X25 and X27 are independently of each other arginine or lysine.
In another preferred embodiment, X1-6 are independently of each other lysine or arginine, X7 is selected from arginine, lysine or cysteine, X8 is selected from the group consisting of glycine, alanine, leucine, and valine; X9 and X12 are independently of each other glutamine or asparagine; X10 is glutamate or aspartate; and X11 is selected from the group consisting of serine, threonine, and homoserine; X13, X17 and X19, are independently of each other selected from the group consisting of serine, homoserine, and threonine; X14, X16 and X24 are independently of each other asparagine or glutamine; X15, X20 X26 and X28 are independently of each other aspartate or glutamate; said X18 and X29 are independently of each other selected from the group consisting of glycine, alanine, leucine, and valine; X21, X22, X23, X25 and X27 are independently of each other arginine or lysine.
In another preferred embodiment, X1-6 are independently of each other lysine or arginine, X7 is selected from arginine, lysine or cysteine, X8 is selected from the group consisting of glycine, alanine, leucine, and valine; X9 and X12 are independently of each other glutamine or asparagine; X10 is glutamate or aspartate; and X11 is selected from the group consisting of serine, threonine, and homoserine; X13, X17 and X19, are independently of each other serine; X14, X16 and X24 are independently of each other asparagine; X15 and X20 are independently of each other aspartate; X18 and X29 are independently of each other leucine; X26 and X28 are independently of each other glutamate; X21, X22, X25 and X27 are independently of each other arginine; and X23 is lysine.
In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 16 and 20 to 27. In a preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 2 to 16 and 20 to 27.
In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16. In a preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16.
In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16 and 22. In a preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 2 to 5, SEQ ID NO: 10 to 16, and SEQ ID NO: 20 to 22.
In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16, 20 and 21. In a preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16, 20 and 21. In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16 and 22. In a preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16, 20, 21 and 22.
In a preferred embodiment, said amino acid sequence I is SEQ ID NO: 2. In a preferred embodiment, said amino acid sequence I is SEQ ID NO: 3. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 4. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 5. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 10. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 11. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 12. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 13. In a preferred embodiment, said amino acid sequence I is SEQ ID NO: 14. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 15. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 16. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 20. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 21. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 22.
In a preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 2. In another preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 3. In another preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 4. In another preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 5. In another preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 10. In another preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 11. In another preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 12. In another preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 13. In another preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 14. In another preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 15. In another preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 16. In another preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 20. In another preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 21. In another preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 22. In a preferred embodiment, said polypeptide of the invention consists of a sequence selected from the group consisting of SEQ ID NO: 3, wherein said polypeptide of the invention comprises a histidine tag (His tag) consisting of 3 or more consecutive histidines, wherein said His tag is attached to the C- or N-terminus, preferably the C-terminus of the amino acid sequence I.
In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 16 and 20 to 27, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1.
In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1.
In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16 and 22, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1.
In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16, 20 and 21, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1.
In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16 and 22, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1.
In a preferred embodiment, said amino acid sequence I is SEQ ID NO: 2, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1. In a preferred embodiment, said amino acid sequence I is SEQ ID NO: 3, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 4, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 5, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 10, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 11, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 12, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 13, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1. In a preferred embodiment, said amino acid sequence I is SEQ ID NO: 14, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 15, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 16, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 20, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 21, and said surfactant composition comprises, preferably consists of SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1. In another preferred embodiment, said amino acid sequence I is SEQ ID NO: 22, and said surfactant composition comprises, preferably consists of, SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1.
In a preferred embodiment, said polypeptide of the invention comprises a tag, i.e. a peptide or non-peptide tag, preferably a peptide tag (i.e. a functional amino acid sequence). In a preferred embodiment, said tag is located at the C- or N-terminal end of the polypeptide of the invention. In a preferred embodiment, said tag is a non-peptide tag, preferably polyethylene glycol (PEG). Preferably, said PEG is coupled via the amino acid serine or cysteine to said amino acid sequence I. PEG coupled to the polypeptide of the invention will increase stability and reduce immunogenicity.
In a preferred embodiment, said polypeptide of the invention comprises a tag, preferably a peptide tag, wherein said tag is located at the C- or N-terminal end, preferably at the C-terminal end, of said polypeptide, preferably of said amino acid sequence I, and wherein said tag is preferably fused to said C- or N-terminal end of said polypeptide, preferably of said amino acid sequence I.
In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 16 and SEQ ID NO: 20 to 27, and wherein said polypeptide of the invention comprises a tag, preferably a peptide tag, wherein said tag is located at the C- or N-terminal end, preferably at the C-terminal end, of said polypeptide, preferably of said amino acid sequence I, and wherein said tag is preferably fused to said C- or N-terminal end of said polypeptide, preferably of said amino acid sequence I.
In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16, and wherein said polypeptide of the invention comprises a tag, preferably a peptide tag, wherein said tag is located at the C- or N-terminal end, preferably at the C-terminal end, of said polypeptide, preferably of said amino acid sequence I, and wherein said tag is preferably fused to said C- or N-terminal end of said polypeptide, preferably of said amino acid sequence I.
In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16 and SEQ ID NO: 20 to 22, and wherein said polypeptide of the invention comprises a tag, preferably a peptide tag, wherein said tag is located at the C- or N-terminal end, preferably at the C-terminal end, of said polypeptide, preferably of said amino acid sequence I, and wherein said tag is preferably fused to said C- or N-terminal end of said polypeptide, preferably of said amino acid sequence I.
In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 16 and SEQ ID NO: 20 to 27, and said surfactant composition comprises, preferably consists of, SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1, and wherein said polypeptide of the invention comprises a tag, preferably a peptide tag, wherein said tag is located at the C- or N-terminal end, preferably at the C-terminal end, of said polypeptide, preferably of said amino acid sequence I, and wherein said tag is preferably fused to said C- or N-terminal end of said polypeptide, preferably of said amino acid sequence I.
In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16, and said surfactant composition comprises, preferably consists of, SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1, and wherein said polypeptide of the invention comprises a tag, preferably a peptide tag, wherein said tag is located at the C- or N-terminal end, preferably at the C-terminal end, of said polypeptide, preferably of said amino acid sequence I, and wherein said tag is preferably fused to said C- or N-terminal end of said polypeptide, preferably of said amino acid sequence I.
In a preferred embodiment, said amino acid sequence I is a sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16 and SEQ ID NO: 20 to 22, and said surfactant composition comprises, preferably consists of, SDS and CS, and wherein said molar ratio of said SDS to CS is equal or between 4 to 1 and 2 to 1, preferably wherein said molar ratio of said SDS to CS is 3 to 1, and wherein said polypeptide of the invention comprises a tag, preferably a peptide tag, wherein said tag is located at the C- or N-terminal end, preferably at the C-terminal end, preferably at the C-terminal end, of said polypeptide, preferably of said amino acid sequence I, and wherein said tag is preferably fused to said C- or N-terminal end of said polypeptide, preferably of said amino acid sequence I.
In another preferred embodiment, said tag is selected from the group consisting of polyhistidine (His tag, i.e., an amino acid sequence consisting of two or more consecutively linked histidines), degradation tag, targeting tag, cell penetration tag, and endosomal escape tag. In a preferred embodiment, said additional tag included in the polypeptide of the invention is connected via a releasable linkage, such as a photo-cleavable linkage, or via reversible coupling.
Said targeting tag preferably binds to a cancer target, i.e. said targeting tag is a cancer targeting tag. Said cancer target includes receptors with an increased level of expression in/on certain tumor cells and tumor antigens.
In a preferred embodiment, said targeting tag is a peptide ligand, peptidomimetic, affibody, antibody binding domain or antibody. Said targeting tag is preferably folic acid.
In a preferred embodiment, said His tag, preferably said His6 tag comprises halogenated, preferably fluorinated histidines. In a preferred embodiment, said His tag comprises a fluorophore. In a preferred embodiment, said polypeptide of the invention comprises a fluorophore.
In a preferred embodiment, said polypeptide of the invention comprises a degradation tag. Preferably said degradation tag is functional in mammalian cells. In a preferred embodiment, said degradation tag is a C-terminal sequence of ornithine decarboxylase (cODC), preferably of SEQ ID NO: 17, or (poly)ubiquitin comprising or consisting of at least two consecutively linked ubiquitins. In another preferred embodiment, said degradation tag consists of cODC of SEQ ID NO: 17 (EFPPEVEEQDDGTLPMSCAQESGMDRHPAACASARINV). In another preferred embodiment, said degradation tag comprises at least two consecutively linked ubiquitins. More preferably, said degradation tag consists of at least two consecutively linked ubiquitins.
In a preferred embodiment, said polypeptide of the invention comprises a polyhistidine (His tag). In a preferred embodiment, said polyhistidine is an amino acid sequence comprising two or more histidines. In another preferred embodiment, said polyhistidine is an amino acid sequence comprising two or more consecutively linked histidines. In another preferred embodiment, said polyhistidine is an amino acid sequence comprising 3 or more consecutively linked histidines. In another preferred embodiment, said polyhistidine is an amino acid sequence comprising 3 to 9 consecutively linked histidines. In a preferred embodiment, said
His tag is attached to the C- or N-terminus, preferably the C-terminus, of the amino acid sequence I.
In a further embodiment, the polypeptide of the invention comprises an endosomal escape peptide or cell-penetrating peptide (CPP). The endosomal escape peptide is for example a dimerized disulfide-linked TAT or a thiol group. The term “cell-penetrating peptide” or CPP, as used herein, refers to a group of peptides with the ability to penetrate the plasma membrane for delivery of cargo into cells. Preferably, said CPP used in the polypeptide of the invention is a hydrophilic or cationic peptide. In another embodiment, said CPP is selected from an amphiphilic, anionic or hydrophobic peptide. A database of more than 1,600 CPPs is described by Agrawal et al. (Agrawal P, Bhalla S, Usmani S S, Singh S, CHaudhary K, Raghava G P S, et al. CPPsite 2.0: a repository of experimentally validated cell-penetrating peptides. Nucl Acids Res. 2016, 44:D1098-D103).
In a preferred embodiment, said additional tag included in the polypeptide of the invention is connected via a releasable linkage, such as a photo-cleavable linkage, or via reversible coupling.
In a preferred embodiment, said polypeptide of the invention comprises an amino acid sequence I selected from the group consisting of SEQ ID NO: 2 to 27, and wherein said lipoprotein cage is loadable and unloadable with cargo without disassembly of said lipoprotein cage.
In a preferred embodiment, said polypeptide of the invention comprises an amino acid sequence I selected from the group consisting of SEQ ID NO: 2 to 5, 10 to 16 and 18 to 22, and wherein said lipoprotein cage is loadable and unloadable with cargo without disassembly of said lipoprotein cage.
In a preferred embodiment, said polypeptide of the invention comprises an amino acid sequence I selected from the group consisting of SEQ ID NO: 2 to 5, 10 to 16, 18 or 19, and wherein said lipoprotein cage is loadable and unloadable with cargo without disassembly of said lipoprotein cage.
In a preferred embodiment, the polypeptide of the invention is modified genetically (for direct fusion of peptides) or chemically after production.
In a preferred embodiment, said lipoprotein cage of the invention is included in a composition, preferably in a pharmaceutical composition comprising a pharmaceutically acceptable carrier.
In a further aspect, the invention relates to a complex comprising the lipoprotein cage of the invention and one or more cargo molecules.
Preferably, said cargo is hydrophobic cargo, more preferably non-polar cargo. Preferably, said cargo is small cargo. Small cargo has preferably a size of 1000 Da or below.
In a preferred embodiment, a size of 1000 Da or below means that said cargo has a size of 1000 Da or lower, preferably 800 Da or lower, more preferably 600 Da or lower, again more preferably 500 Da or lower, again more preferably 400 Da or lower, again more preferably 300 Da or lower, again more preferably 200 Da or lower, again more preferably 100 Da or lower. In another embodiment, said cargo is small hydrophobic cargo having a size of 1000 Da or below, again more preferably small non-polar cargo having a size of 1000 Da or below. Preferably, said cargo has a low solubility in aqueous media. Preferably, said small cargo having a size of 1000 Da or below has a low solubility in aqueous media. More preferably said cargo of low solubility is included in Class II or Class IV of the Biopharmaceutics Classification System (BCS). Again more preferably, said low soluble cargo has a lower solubility than a highly soluble cargo for which the highest strength dose is soluble in 250 mL or less of aqueous media over the pH range of 1.0-7.5, more preferably, 1.0-6.8, at 37±1° C. Preferred methods for determining solubility are the USP Dissolution Apparatus, shake-flask method or acid or base titration methods.
Preferably said cargo is an active agent, preferably a therapeutically and/or diagnostically active agent. In a preferred embodiment, said cargo is an imaging agent, such as a fluorescent agent. More preferably said cargo is selected from the group consisting of a chemotherapeutic agent, an antifungal agent, such as bifonazole or amphotericin B, an antiviral agent such as indinavir or ritonavir, and an antibiotic. Preferably said cargo is included in Class II or Class IV of the Biopharmaceutics Classification System (BCS). Preferably said chemotherapeutic agent is a small cargo molecule selected from the group consisting of doxorubicin, paclitaxel, dasatinib, imatinib, lapatinib, camptothecin, daunorubicin, buparlisib, amsacrine, bifonazole, glibenclamide, bicalutamide, celecoxib, fenofibrate, and danazol.
The treatment of cells with the complex of the invention leads to intracellular delivery of the complex and release of its cargo into the cytosol of treated cells, without disassembly of the lipoprotein cage. Thus, in a preferred embodiment, said complex of the invention comprises the lipoprotein cage of the invention and one or more cargo molecule, wherein said cargo molecule is encapsulated in the lipoprotein cage without disassembly of the lipoprotein cage. In a further preferred embodiment, said complex of the invention comprises a lipoprotein cage of the invention and one or more cargo molecule, wherein said cargo molecule is encapsulated into the lipoprotein cage extracellularly without disassembly of the lipoprotein cage, and said lipoprotein cage is capable of release said cargo intracellularly without disassembly of the lipoprotein cage. In a further preferred embodiment, said complex of the invention comprises a lipoprotein cage of the invention and one or more cargo molecule, wherein said cargo molecule is encapsulated into the lipoprotein cage extracellularly without disassembly of the lipoprotein cage, and said lipoprotein cage is capable of be taken-up by a cell and to release said encapsulated cargo molecule intracellularly, preferably into the cytosol of said cell, without disassembly of the lipoprotein cage.
Encapsulation of cargo into the lipoprotein cage of the invention is reversible in the presence of competing host molecules or environments, while the lipoprotein cage and complex of the invention are stable and not disassembled extracellularly. Thus, in a further preferred embodiment, the lipoprotein cage of the invention is capable of reversibly encapsulating and releasing cargo molecules, without disassembly of the lipoprotein cage. The release of cargo is triggered by a protein that can bind the cargo more tightly than the surfactant composition can bind the cargo, or a lipid bilayer into which the cargo preferentially partitions.
In a further aspect, the invention provides a method for manufacturing the lipoprotein cage of the invention comprising the steps of:
wherein any of X1 to X29 are independently of each other an amino acid, provided that at least 3 of X1 to X6 are independently of each other a positively charged amino acid, and wherein optionally up to 5 amino acids in positions other than denoted by X1 to X29 in SEQ ID NO: 1 are exchanged by any amino acid; and encapsulating the surfactant composition of the invention into the protein cage, without disassembly of the protein cage.
In a preferred embodiment, said amino acid sequence I is an amino acid sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16, and 20 to 22. In a preferred embodiment, said amino acid sequence I is an amino acid sequence selected from the group consisting of SEQ ID NO: 2 to 5 and SEQ ID NO: 10 to 16, 20 and 21. In a preferred embodiment, said at least at least one polypeptide is defined as 24 polypeptides.
In a preferred embodiment, the method for manufacturing the lipoprotein cage of the invention comprises the further steps of producing the polypeptide of the invention, preferably by recombinant expression (e.g. in bacterial cells, preferably in Escherichia coli cells). Upon expression, the polypeptide of the invention self-assembles into a well-defined protein cage which can be isolated.
In a further aspect, the invention provides a method for manufacturing the complex of the invention comprising the step of mixing the lipoprotein cage of the invention with one or more cargo molecules, wherein said cargo is encapsulated into the lipoprotein cage of the invention without disassembly of the lipoprotein cage.
In a further aspect, the invention provides a method for treating cells with the complex of the invention comprising the step of contacting said cell with the complex of the invention. Preferably, said method for treating cells is an in vitro method. In a preferred embodiment, said cell is a eukaryotic cell. More preferably, said cell is an animal cell, again more preferably a mammalian cell.
Materials. All chemicals were used as supplied without further purification. Isopropyl-beta-D-thiogalactopyranoside (IPTG) was purchased from Fluorochem (UK). Lysozyme was purchased from PanReac Axon Lab AG (Switzerland). For His-tagged protein isolation, Ni-NTA Agarose from Qiagen (Germany) was used. DNase I was from Roche (Switzerland) and RNase A was from Merck (Germany). GelRed was purchased from Biotium, Inc. (USA). Sodium dodecyl sulfate and cholesterol sulfate were purchased from Sigma-Aldrich (Merck, Germany).
Instrumentation. Protein quantification was carried out using a NanoDrop 2000c spectrophotometer from ThermoFisher Scientific Inc. (USA). All size-exclusion chromatography was carried out on an NGC™ Medium-Pressure Chromatography System from Bio-Rad Laboratories, Inc. (USA). Agarose gel electrophoresis (AGE) was performed on Mini-Sub® cell GT from Bio-Rad Laboratories, Inc. (USA). Gel images were captured using an EOS 1100D from Canon (Japan). Transmission electron microscopy (TEM) images were obtained on a Morgagni 268 from FEI (USA). Fluorimetry was carried out on a QuantaMaster™ 50 fluorometer from Photon Technology International (USA). Confocal fluorescence microscopy images were obtained on an SP8-AOBS from Leica (Germany). Flow cytometry was carried out on an LSRFortessa from BD Biosciences (USA).
Protein production. Proteins were expressed in E. coli strain BL21-Gold(DE3). Cells were grown at 37° C. in LB medium containing kanamycin sulfate (86 μM) until OD600 reached 0.6-0.8, and protein over-expression was induced with IPTG (0.1 mM). After culturing for ˜18 h at 25° C., cells were harvested by centrifugation (5,000×g) at 4° C. for 15 min. Cell pellets were stored at −20° C. until purification. OP cages were isolated from E. coli cell pellets and purified by Ni-affinity and size-exclusion chromatography as previously reported (Edwardson et al., 2018, op. cit.).
Preparation of OP:SDS and OP:SDS:CS complexes. Protein cage-micelle complexes were formed directly from purified, empty OP cages and aqueous solutions of anionic surfactants. Unless otherwise specified, the molar ratio of total surfactant molecules to OP cages is 800:1 in all experiments. Buffers used were PBS (9.5 mM Na2HPO4, 1.4 mM KH2PO4, 136 mM NaCl, 2.7 mM KCl, pH 7.4) and TSEC (25 mM Tris-HCl, 200 mM NaCl, 5 mM EDTA, pH 7.4). To form complexes, appropriate volumes of concentrated SDS solution (1-100 mM in PBS) or CS solution (8-16 mM, DMSO) were first diluted in PBS buffer to concentrations below 1 mM to avoid protein denaturation. Then, the necessary volume of OP solution (2-20 μM capsid, PBS or TSEC buffer) was added and the mixture was incubated for 1 hour at room temperature to allow complete complex formation. For small molecule encapsulation, concentrated solutions of fluorescent probe/drug in acetone or DMSO were added to the pre-formed OP: SDS:CS complexes and incubated for a further 15 minutes at room temperature. In each case, the total fraction of organic solvent was kept below 10% v/v.
Native agarose gel electrophoresis. All native gel electrophoresis was carried out using 2% (w/v) agarose gels in Tris-acetate-EDTA buffer (40 mM Tris-HCl, 19 mM acetic acid, 1 mM EDTA, pH 8.3). After visualization of fluorescent molecules by UV transillumination, gels were stained with Coomassie Blue for protein visualization. In a typical experiment, ca. 100 pmol of capsid (with respect to monomer) was loaded per lane in 10 μL of buffer with an additional 2 μL of 70% (v/v) aqueous glycerol for loading.
Size-exclusion chromatography. Analytical SEC was carried out on a Superose 6 Increase 10/300 GL column (GE Healthcare, USA). Samples were 800 μL of 10-50 μM protein monomer and the mobile phase was 0.75×TSEC buffer. Peaks were detected by absorbance at 280 nm.
Dynamic light scattering. DLS measurements were carried out on Zetasizer Nano (Malvern Instruments, UK) at 25° C. using samples prepared from 0.22 μm filtered solutions of protein and surfactants. Sample concentrations were 30-100 μM of protein monomer.
Transmission electron microscopy. Negatively-stained transmission electron microscopy (TEM) was carried out as reported previously (Beck, T., Tetter, S., Kiinzle, M. & Hilvert, D. Construction of Matryoshka-Type Structures from Supercharged Protein Nanocages. Angew. Chem. Int. Ed. 54, 937-940 (2015)). For all TEM experiments, samples were between 2-4 μM of OP monomer in PBS buffer.
Nile Red fluorescence. For a typical fluorimetry experiment, 800 μL samples in PBS buffer were used. For the surfactant loading kinetics the ionic strength was adjusted by dissolving appropriate amounts of NaCl in PBS buffer. Stock solutions of Nile Red in acetone:water (1:1 v/v), or DMSO at concentrations of 50-500 μM were used. The excitation wavelength was set to 535 nm for all experiments.
Effective concentration calculations. The volume of the OP lumenal cavity (255,528 A) was estimated as a sphere with a radius of 39.4 A. This radius determined by averaging the distances between lumenally exposed residues from the reported crystal structure (Edwardson et al., 2018, op. cit.), using the UCSF Chimera software (Pettersen, E. F. et al. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605-1612 (2004)). The effective concentration of SDS was simply estimated as the number of moles SDS per lumenal cavity volume. The expected number of SDS molecules was estimated by first determining the volume occupied by an individual SDS molecule in a micelle, from the reported average values of SDS micelle radius (17.5 Å) and aggregation number (n=64). Division of the OP cavity volume by the average volume of a single SDS molecule packed into micellar aggregates gives an estimated value of 729 molecules per OP cage, which is within error of the volume estimations and experimental results, which approximate 800 molecules.
Cell culture. HeLa cells were maintained in Dulbecco's Modified Eagle Medium (high glucose) supplemented with 10% fetal bovine serum (FBS), 2 mM L-glutamine, 2 mM GlutaMAX and 1 μg/mL gentamicin. Cells were cultured in 5% CO2 at 37° C. and typically split in a 1:4 ratio every 3 days. Only passage numbers between 7-20 were used for all experiments.
Flow cytometry. HeLa cells were seeded at a density of 30,000 cells per well in a 24-well plate in 500 μL of culture medium and allowed to recover at 37° C. and 5% CO2 for 24 hours to reach 60-80% confluency. Both OP protein and surfactant solutions were sterilized by filtration through a 0.22 μm membrane, and stocks were prepared in sterile PBS. Nile Red solution (50 μM) in 1:1 EtOH:H2O was used without sterilization. OP capsid-micelle complexes were prepared as described above. For each well 20 μL of sample in PBS was added to 200 μL of culture media to give the final concentration of 200 nM. Cells were incubated for 16-20 h in 5% CO2 at 37° C. before washing with PBS and trypsinization (0.05% Trypsin-EDTA (Thermo Fisher Scientific, USA), 4 minutes at 37° C.). Cells were collected in cold culture medium and washed twice with cold PBS before resuspension in flow cytometry buffer (PBS with 5% FBS). A representative cytometry analysis with all gating is shown in
Fluorescent labelling of OP cages. To provide a specific handle for fluorophore conjugation, a single serine to cysteine mutation was introduced at residue 38 of the OP protein through ‘QuikChange’ (Agilent) site-directed mutagenesis. This lumenally presented residue was chosen to avoid interfering with the exterior surface of the OP cage, which could disrupt the cellular uptake profile. Successful molecular cloning was confirmed by Sanger sequencing (Microsynth AG, Switzerland) of the pET29b(+)_OPS38C plasmid used for protein expression and the protein was produced as previously reported.19 Labelling of the OP cage with the Atto425-maleimide (Sigma-Aldrich) was carried out by simply mixing purified protein in TSEC buffer with dye solution (10 mM, DMSO) and incubating in the dark overnight at room temperature. To terminate the reaction, 2 equiv. (w.r.t. maleimide) of β-mercaptoethanol was added and after 30 minutes incubation, the protein was purified using a PD minitrap G-10 column (GE Healthcare, USA). The labelling efficiency was determined from UV-Vis absorbance measurement and the ε280 and ε439 values of Atto425 and the OP protein. For the experiments shown in
Confocal microscopy. Cells were seeded at a density of 15-20,000 cells per well in a μ slide 8-well chambered coverslip with ibiTreat surface from ibidi GmbH (Germany). Cells were incubated in 200 μL of culture medium at 37° C. and 5% CO2 for 24 hours before sample addition. Sterile samples were prepared in PBS and for each well 10 μL of sample solution was added to 100 μL of culture media to give the desired final concentrations of protein and fluorophore. Cells were incubated with samples for 24 h in 5% CO2 at 37° C. before washing with PBS and nuclear staining with 100 μL of Hoechst 33342 solution (5 μg/mL in PBS) at 37° C. for 10-15 mins. Cells were then washed twice with PBS and microscopy was carried out at 37° C. in PBS containing 10% FBS.
Cell viability assay. Cytotoxicity was assessed using the WST-8-based Cell Counting Kit-8 from Sigma according to the manufacturer's instructions. HeLa cells were seeded at a density of 5,000 cells per well in a 96-well plate in 100 μL of culture medium and allowed to recover at 37° C. and 5% CO2 for 24 hours. Protein, surfactant and drug samples were prepared by serial dilution in sterile PBS. A total volume of 25 μL sample was added to each well to provide the final concentrations shown in
Self-assembly of lipoprotein-mimetic cages. For preparing an exemplarily lipoprotein cage according to the invention the artificial assembly OP was chosen as protein scaffold, which is a small porous capsid with a positively charged interior cavity (Edwardson et al., 2018, op. cit.). After expression in Escherichia coli, the OP protein is isolated as a complete octahedral assembly comprising 24 monomers with a ˜3.5 nm pore on each of its six faces (
Starting from the positively charged OP cage, negatively charged amphiphiles were encapsulated, which in turn phase separate and create a hydrophobic core within the protein cage. The resulting protein-scaffolded lipid droplet then acts as a hydrophobic compartment that can sequester small molecules.
Cage-templated micelle formation. Sodium dodecyl sulfate (SDS) was chosen as the anionic surfactant (
In order to better understand the tolerance to SDS, OP was further interrogated in the presence of 800 equivalents of SDS per cage, through a combination of biophysical techniques (
To determine whether SDS molecules were drawn within the interior cavity of the OP cage as envisioned, a fluorescently labelled oligonucleotide probe was used. It was expected that if the SDS molecules were localized in the lumen via the intended sulfate-guanidinium interactions, other potential negatively charged guests would be blocked from entry. This obstruction would be both due to negation of the high positive charge, which is the driving force for encapsulation, and occlusion of the entry pores. As oligonucleotides are internalized by OP cages rapidly with high affinity (Edwardson et al., 2018, op. cit.), they provided an ideal probe to test this hypothesis. Atto488-labelled 21 nt DNA was added to either empty OP cages or those pre-incubated with SDS, and the complexes were analyzed by native gel electrophoresis (
Cryogenic electron microscopy offers a means to probe both protein cage structure and the presence of internalized guests directly. As such, the inventors analyzed both empty and surfactant (SDS)-filled OP cages (
Hydrophobic core formation, cargo capacity and kinetics. To gain a deeper understanding of the internal structure of the OP-templated micelles, the inventors used the solvatochromic dye Nile Red. This small molecule fluorophore is nearly non-emissive in aqueous media, but exhibits fluorescence in nonpolar environments (Greenspan, P., Mayer, E. P. & Fowler, S. D. Nile red: a selective fluorescent stain for intracellular lipid droplets. J. Cell Biol. 1985, vol. 100, pp. 965-973). To discern if the OP cage chaperoned SDS molecules into micelle-like aggregates with a hydrophobic core, Nile Red fluorescence was measured in the presence of each component of the system.
Below its critical micelle concentration (4-5 mM in PBS buffer)(De Paula, R., da Hora Machado, A. E. & de Miranda, J. A. 3-Benzoxazol-2-yl-7-(N,N-diethylamino)-chromen-2-one as a fluorescence probe for the investigation of micellar microenvironments. J. Photochem. Photobiol. A: Chem. 2004, vol. 165, pp. 109-114), SDS had negligible effect on the fluorescence of a 500 nM aqueous solution of Nile Red (
With the suitability and utility of Nile Red established, the inventors carried out further experiments to determine the optimal number of SDS molecules per capsid and the number of Nile Red guests that could be encapsulated.
Fluorescence monitoring of two equivalents of Nile Red in the presence of OP:SDS complexes with increasing SDS content revealed a plateau at around 800 surfactant molecules per cage (
As cargo-carrying is an important characteristic of the protein-micelle complex, the inventors also used fluorescence titration to determine the number of Nile Red molecules that could be accommodated per cage. Single equivalents of Nile Red were added stepwise to a solution of OP:SDS complexes and the fluorescence spectra measured for each addition (
Measurement of biological activity. The OP assembly is capable of delivering short interfering RNA to the cytosol of mammalian cells, inducing efficient gene knockdown. As such, the inventors were interested if the micelle containing OP cages could improve the cellular uptake of poorly soluble compounds. Human cancer cells (HeLa) were treated with OP: SDS complexes carrying Nile Red or the free fluorophore itself. Analysis by flow cytometry (
To test the encapsulation, transport and intracellular release of a bioactive small molecule, the inventors chose the dual tyrosine kinase inhibitor, lapatinib (Moy, B. & Goss, P. E. Lapatinib: Current Status and Future Directions in Breast Cancer. The Oncologist 2006, vol. 11, pp. 1047-1057), as a model compound. Further preferred and suitable cargo molecules having a preferred size of below 600 Da are listed in the table 3 below. Lapatinib is used as a therapy for solid tumours, and has been shown to benefit from nanoparticle-mediated delivery due to its poor solubility and serum protein binding (Bonde, G. V. et al. Lapatinib nano-delivery systems: a promising future for breast cancer treatment. Expert Opin. Drug. Deliv. 2018, vol. 15, pp. 495-507). To reduce the thermodynamic cost of encapsulation and favour formation of a more stable complex the inventors modified the surfactant composition with the endogenous steroid, cholesterol sulfate (CS), to promote loading of large planar guest molecules. We found that a surfactant composition of 75% SDS and 25% CS was well-tolerated by the OP cage (
To determine how stably the OP:SDS:CS complexes retained their lapatinib cargo, they were dialyzed against media containing 10% fetal bovine serum. After 72 hours, fluorescence spectroscopy showed the lapatinib content in the OP:SDS:CS sample, revealing signals in the range of control samples (
Finally, the inventors assayed the ability of OP:SDS:CS complexes to increase the effective cytotoxicity of lapatinib in human cancer cells. Cells were treated with either free lapatinib or lapatinib packaged in OP:SDS:CS cages and cell viability was monitored after an 18 hour incubation (
Energy transfer between chemically conjugate fluorophores and Nile Red. To demonstrate the compatibility of lipoprotein cage formation and small molecule cargo loading with cysteine containing proteins and their chemical conjugation, the inventors tested complex formation with two protein variants OP-K93C (SEQ ID NO: 4) and OP-S38C (SEQ ID NO: 22), which contain a single cysteine mutation per monomer, displayed on either the exterior or interior surface, respectively. These proteins form the same cage structure as OP (SEQ ID NO: 2), as shown by transmission electron microscopy (
Formation of lipoprotein cages with C-terminal appendages. The attachment of peptides, such as degradation tags, targeting tags, cell penetration tags, and endosomal escape tags, to the C-terminus of the protein allows to tune the functionality of this encapsulation system. To demonstrate the compatibility of lipoprotein cage formation with this type of protein modification, the inventors tested four protein variants with differing C-terminal peptide tags, which were created via genetic fusion. The four proteins tested were OP-96 (SEQ ID NO: 24), OP-ZHER2 (SEQ ID NO: 25), OP-ZEGFR (SEQ ID NO: 26), and OP-SP94 (SEQ ID NO: 27)/ Each of these four variants formed the protein cage structure, as demonstrated by size-exclusion chromatography (
Formation of a lipoprotein cage with N-terminal appendages. In addition to the C-terminus of the protein, the attachment of peptides, such as degradation tags, targeting tags, cell penetration tags, and endosomal escape tags, to the N-terminus of the protein allows also to tune the functionality of this encapsulation system. To demonstrate the compatibility of lipoprotein cage formation with this type of protein modification, the inventors tested a protein variant, OP-93 (SEQ ID NO: 23) with a N-terminal peptide tag, which was created via genetic fusion. This protein formed the protein cage structure as well, as demonstrated by size-exclusion chromatography (
Formation of lipoprotein cages with sodium dodecylbenzenesulfonate. The protein cage is highly stable and acts as a template to form the lipidic/micellar core within its inner cavity, meaning a previous surfactant formulation step is not required. Therefore, the amphiphiles do not need to form a stable particle on their own before addition of the protein, allowing the use of various different amphiphiles, as long as the mixture possess sufficient negative charge. To further demonstrate the generality of the system to different surfactants, sodium dodecylbenzenesulfonate (SDBS) was used as a 1:1 mixture with SDS to form lipoprotein cages with the OP protein (SEQ ID NO: 2). Native agarose gel analysis revealed the formation of stable lipoprotein complexes, which can encapsulate three different types of small molecule cargo (
Formation of lipoprotein cages with alternative small molecule cargo. To further demonstrate that the two-tier encapsulation concept can be generalized to alternative cargo, taking advantage of the hydrophobic effect, the inventors tested the encapsulation of five different small molecules: Nile Red, lapatinib, daunorubicin, curcumin and laurdan. These molecules have diverse molecular structures as well as unique biological and photophysical properties. Nevertheless, each of these molecules could be encapsulated and efficiently delivered to cells. Native agarose gel analysis of OP:SDS:SDBS (1:400:400) protein cages in the presence of curcumin, lapatinib and daunorubicin reveals encapsulation of these bioactive compounds within the lipoprotein cages (
Number | Date | Country | Kind |
---|---|---|---|
20157250.0 | Feb 2020 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2021/053571 | 2/14/2021 | WO |