Aldehyde-tagged protein-based drug carriers and methods of use

Description

INTRODUCTION

The field of protein and small molecule therapeutics has advanced greatly, providing a number of clinically beneficial drugs and promising to provide more in the years to come. Protein therapeutics can provide several advantages in therapies, due to, for example, exquisite specificity, multiplicity of functions and relatively low off-target activity, resulting in fewer side effects. With the development of sophisticated screening methods, small molecule drugs have also advanced in specificity of action.

Often, though, such therapeutics can be further improved by providing for enhanced activity following administration. For example, it is often desirable to increase the serum half-life of the therapeutic (e.g., in order to reduce the overall dose and/or the number of administrations required over a dosage period). Alternatively or in addition, therapeutics could benefit from improved bioavailability. For example, some drugs may benefit from improved solubility in the relevant physiological environment and/or from modifications that facilitate formulation (e.g., to increase shelf-life). Moreover, conjugation of a drug to a carrier protein can be difficult to control, resulting in a heterogeneous mixture of conjugates that differ in the number of drug molecules attached. This can make controlling the amount administered to a patient difficult.

There is a need for methods and compositions that provide drug conjugates.

SUMMARY

The disclosure provides aldehyde-tagged protein carriers that can be covalently and site-specifically bound to drug to provide a drug-containing scaffold. The disclosure also provides methods of production of such drug-containing scaffolds and intermediates, as well as methods of use.

Accordingly, the present disclosure provides carrier protein-drug conjugates composed of a carrier protein and a covalently bound drug, wherein the carrier protein comprises a modified sulfatase motif of the formula:

X₁(FGly′)X₂Z₂X₃Z₃

where FGly′ is of the formula:

embedded image

wherein J¹ is the covalently bound drug;

each L¹ is a divalent moiety independently selected from alkylene, substituted alkylene, alkenylene, substituted alkenylene, alkynylene, substituted alkynylene, arylene, substituted arylene, cycloalkylene, substituted cycloalkylene, heteroarylene, substituted heteroarylene, heterocyclene, substituted heterocyclene, acyl, amido, acyloxy, urethanylene, thioester, sulfonyl, sulfonamide, sulfonyl ester, —O—, —S—, —NH—, and substituted amine;

n is a number selected from zero to 40;

Z₂ is a proline or alanine residue;

X₁ is present or absent and, when present, is any amino acid, with the proviso that when the sulfatase motif is at an N-terminus of the polypeptide, X₁ is present;

X₂ and X₃ are each independently any amino acid; and

Z₃ is a basic amino acid, and

wherein the carrier protein presents the covalently bound drug on a solvent-accessible surface when in a folded state.

In some embodiments, the carrier protein-drug conjugate contains two or more modified sulfatase motifs, and can contain three or more modified sulfatase motifs.

In further embodiments the modified sulfatase motifs of the carrier protein-drug conjugate are positioned in the carrier protein-drug conjugate at at least one of an N-terminus of the carrier protein, a C-terminus of the carrier protein, and a solvent-accessible loop of the carrier protein.

The modified sulfatase motifs of the carrier protein-drug conjugate can be provided as a concatamer composed of modified sulfatase motifs separated by a flexible linker.

In one exemplar, the carrier protein of the carrier protein-drug conjugate is albumin. The covalently bound drug can be a peptide drug, such as glucagon-like peptide 1 (GLP-1) or a biologically active variant thereof, or calcitonin or a biologically active variant thereof. The covalently bound drug of the carrier protein-drug conjugate can be a small molecule drug (e.g., doxorubicin).

Exemplary carrier protein-drug conjugates include those where Z₃ is arginine (R). In exemplary embodiments, X₁, when present, X₂, and X₃ are each independently an aliphatic amino acid, a sulfur-containing amino acid, or a polar, uncharged amino acid. In specific examples, X₁, when present, is L, M, V, S or T. In specific examples, X₂ and X₃ are each independently S, T, A, V, G, or C.

The disclosure also provides aldehyde-tagged carrier proteins having an amino acid sequence of:

X₁Z₁X₂Z₂X₃Z₃

where

Z₁ is a cysteine, a serine, or a 2-formylglycine residue;

Z₂ is a proline or alanine residue;

X₁ is present or absent and, when present, is any amino acid, with the proviso that when the heterologous sulfatase motif is at an N-terminus of the aldehyde tagged polypeptide, X₁ is present;

X₂ and X₃ are each independently any amino acid; and

Z₃ is a basic amino acid;

wherein the carrier protein presents the covalently bound drug on a solvent-accessible surface when in a folded state.

In some examples the aldehyde-tagged carrier protein contains two or more modified sulfatase motifs, and can contain three or more modified sulfatase motifs. In some examples, the modified sulfatase motifs are positioned in the aldehyde-tagged carrier protein at at least one of an N-terminus of the carrier protein, a C-terminus of the carrier protein, and a solvent-accessible loop of the carrier protein. In one example, the carrier protein is albumin.

Exemplary aldehyde-tagged carrier proteins include those in which Z₃ is arginine (R). Exemplary aldehyde-tagged carrier proteins include those in which X₁, when present, X₂, and X₃ are each independently an aliphatic amino acid, a sulfur-containing amino acid, or a polar, uncharged amino acid. In specific examples of aldehyde-tagged carrier protein, X₁, when present, is L, M, V, S or T. In specific examples of aldehyde-tagged carrier protein, X₂ and X₃ are each independently S, T, A, V, G, or C.

The disclosure also provides libraries of modified carrier proteins containing a population of aldehyde-tagged carrier proteins according to the present disclosure, or nucleic acid constructs encoding the aldehyde-tagged carrier proteins, where Z₁ is a cysteine or serine residue, wherein the population comprises members having differently aldehyde-tagged carrier proteins. In some examples, the population of the library includes aldehyde-tagged carrier proteins having two or more aldehyde tags. In some examples, the population of the library includes aldehyde-tagged carrier proteins having at least one aldehyde tag at one or more of the N-terminus, the C-terminus, or an interior loop of the carrier protein. In some embodiments, the library is provided as a population of recombinant cells genetically modified to express the nucleic acid constructs.

The disclosure also provides methods of producing a carrier protein-drug conjugate by combining in a reaction mixture an aldehyde-tagged carrier protein having a 2-formyl-glycine residue (FGly′ at Z₁) and a drug for conjugation to the carrier protein, wherein the drug has an aminooxy or hydrazide reactive group. The drug is provided in the reaction mixture in an amount sufficient to provide for a desired ratio of drug to carrier protein, said combining being under conditions suitable to promote reaction between an aldehyde of the carrier protein and reactive group of the drug to generate a carrier protein-drug conjugate. The carrier protein-drug conjugate is then isolated from the reaction mixture. In specific embodiments, the aldehyde-tagged carrier protein is folded prior to said combining.

The disclosure also provides formulations containing a carrier protein-drug conjugate of the present disclosure and a pharmaceutically acceptable excipient.

The disclosure also provides methods of treating a subject having or at risk of having a condition amenable to treatment with glucagon-like peptide 1 (GLP-1) by administering to a subject in need of treatment a carrier protein-drug conjugate of the present disclosure in which the covalently bound drug is glucagon-like peptide 1 (GLP-1) or a biologically active variant thereof, where administration is effective to treat the condition in the subject. The disclosure also provides methods of treating a subject having or at risk of having a condition amenable to treatment with calcitonin by administering to a subject in need of treatment a carrier protein-drug conjugate of the present disclosure in which the covalently bound drug is calcitonin or a biologically active variant thereof, where administration is effective to treat the condition in the subject.

The disclosure also provides recombinant nucleic acids having nucleic acid encoding an aldehyde-tagged carrier protein of the present disclosure in which Z₁ is a cysteine residue or a serine residue.

Other features are provided below, and will be readily apparent to the ordinarily skilled artisan upon reading the present disclosure.

BRIEF DESCRIPTION OF THE DRAWINGS

It is emphasized that, according to common practice, the various features of the drawings may not be to-scale. On the contrary, the dimensions of the various features are arbitrarily expanded or reduced for clarity. Included in the drawings are the following figures:

FIG. 1A is a schematic illustrating production of an ald-tagged carrier protein containing a 2-formylglycine (FGly), which is reacted with an aminooxy-containing peptide to form a drug-conjugate of the present disclosure. The aldehyde tag is exemplified in FIG. 1A by LCTPSR (SEQ ID NO:1).

FIG. 1B is a schematic illustrating a library of ald-tagged carrier proteins, exemplified by an ald-tagged human serum albumin (HSA) (top panel), and ald-tagged HSA-drug conjugates (bottom panel) conjugated to a drug. By changing the location of the aldehyde tag, the spatial display of the peptide relative to the surface of the rHSA can be altered.

FIG. 2 is a schematic illustrating an exemplary synthesis of peptides to contain a reactive partner for reaction with an aldehyde group of an ald-tagged carrier protein.

FIG. 3 is a schematic providing an amino acid sequence (SEQ ID NO:70) and nucleic acid sequence (SEQ ID NO:71) of human serum albumin (HSA).

FIG. 4 is a schematic providing amino acid sequences of exemplary ald-tagged HSA proteins (SEQ ID NO:72-76). The prepro leader sequence is indicated by a single underline. The sulfatase motif is indicated by a double underline.

FIGS. 5-9 are schematics providing the nucleic acid sequences (SEQ ID NO:77-81) of the exemplary ald-tagged HSA proteins of FIG. 4.

FIG. 10 is a schematic illustrating (top panel) the crystal structure of a recombinant HSA and (bottom panel) a carrier protein-drug conjugate of an ald-tagged recombinant HSA and GLP-1, with the GLP-1 peptide shown in the same scale as the HSA carrier protein. In each panel, the N-terminal end of the protein is on the right side of the schematic; the C-terminal end of the protein is on the left side of the schematic.

FIG. 11 provides amino acid sequences (SEQ ID NO:82-89) of exemplary ald-tagged Fc proteins. The sulfatase motif is indicated by double-underlined text.

FIG. 12 is a schematic of an exemplary ald-tagged carrier protein modified by conjugation to a small molecule drug. The N-terminal end of the protein is on the right side of the schematic; the C-terminal end of the protein is on the left side of the schematic. The aldehyde tag is exemplified in FIG. 12 by LCTPSR (SEQ ID NO:1).

FIG. 13 is a picture of a protein gel illustrating HSA, purified using Ni/NTA, containing an aldehyde tag at the C terminus and conjugated with a fluorophore. The negative control, the CtoA construct, is not converted to a formylglycine and consequently is not conjugated when reacted with the fluorophore.

FIG. 14 is a picture of a gel showing Ald tag-HSA was expressed in and secreted from the yeast Pichia pastoris. A Pichia strain expressing no HSA (none), wild-type HSA (WT HSA), or ald tag-HSA (clones #7, 11, 24, 25) was grown in methanol-containing medium to induce expression of HSA. After 6 days, the media was collected, cleared of cells, run on an SDS-PAGE gel, and stained with Coomassie Blue.

FIG. 15 is a picture of a gel showing aldehyde-tagged-HSA was expressed and secreted from CHO cells. After 72 h, the media was collected, cleared of cells, and purified on Ni-NTA resin. Flow-through (FT), wash (W) and elution (E) fractions were collected, run on an SDS-PAGE gel and stained with Coomassie Blue.

DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS

Before the present invention is described, it is to be understood that this invention is not limited to particular embodiments described, as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present invention will be limited only by the appended claims.

Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limits of that range is also specifically disclosed. Each smaller range between any stated value or intervening value in a stated range and any other stated or intervening value in that stated range is encompassed within the invention. The upper and lower limits of these smaller ranges may independently be included or excluded in the range, and each range where either, neither or both limits are included in the smaller ranges is also encompassed within the invention, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the invention.

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, some potential and exemplary methods and materials are now described. All publications mentioned herein are incorporated herein by reference to disclose and describe the methods and/or materials in connection with which the publications are cited. It is understood that the present disclosure supersedes any disclosure of an incorporated publication to the extent there is a contradiction.

It must be noted that as used herein and in the appended claims, the singular forms “a”, “an”, and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “an aldehyde tag” includes a plurality of such tags and reference to “the polypeptide” includes reference to one or more polypeptides and equivalents thereof known to those skilled in the art, and so forth.

It is further noted that the claims may be drafted to exclude any element which may be optional. As such, this statement is intended to serve as antecedent basis for use of such exclusive terminology as “solely”, “only” and the like in connection with the recitation of claim elements, or the use of a “negative” limitation.

It is appreciated that certain features of the invention, which are, for clarity, described in the context of separate embodiments, may also be provided in combination in a single embodiment. Conversely, various features of the invention, which are, for brevity, described in the context of a single embodiment, may also be provided separately or in any suitable sub-combination. All combinations of the embodiments pertaining to the invention are specifically embraced by the present invention and are disclosed herein just as if each and every combination was individually and explicitly disclosed, to the extent that such combinations embrace subject matter that are, for example, compounds that are stable compounds (i.e., compounds that can be made, isolated, characterized, and tested for biological activity). In addition, all sub-combinations of the various embodiments and elements thereof (e.g., elements of the chemical groups listed in the embodiments describing such variables) are also specifically embraced by the present invention and are disclosed herein just as if each and every such sub-combination was individually and explicitly disclosed herein.

The publications discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that the present invention is not entitled to antedate such publication by virtue of prior invention. Further, the dates of publication provided may be different from the actual publication dates which may need to be independently confirmed.

Definitions

The terms “polypeptide”, “peptide” and “protein” are used interchangeably herein to refer to a polymeric form of amino acids of any length. Unless specifically indicated otherwise, “polypeptide”, “peptide” and “protein” can include genetically coded and non-coded amino acids, chemically or biochemically modified or derivatized amino acids, and polypeptides having modified peptide backbones. The term includes fusion proteins, including, but not limited to, fusion proteins with a heterologous amino acid sequence, fusions with heterologous and homologous leader sequences, proteins which contain at least one N-terminal methionine residue (e.g., to facilitate production in a recombinant bacterial host cell); immunologically tagged proteins; and the like.

“Native amino acid sequence” or “parent amino acid sequence” are used interchangeably herein in the context of a carrier protein to refer to the amino acid sequence of the carrier protein prior to modification to include a heterologous aldehyde tag.

By “aldehyde tag” or “ald-tag” is meant an amino acid sequence that contains an amino acid sequence derived from a sulfatase motif which is capable of being converted, or which has been converted, by action of a formylglycine generating enzyme (FGE) to contain a 2-formylglycine residue (referred to herein as “FGly”). Although this is technically incorrect, the FGly residue generated by an FGE is often referred to in the literature as a “formylglycine”. Stated differently, the term “aldehyde tag” is used herein to refer to an amino acid sequence comprising an “unconverted” sulfatase motif (i.e., a sulfatase motif in which the cysteine or serine residue has not been converted to FGly by an FGE, but is capable of being converted) as well as to an amino acid sequence comprising a “converted” sulfatase motif (i.e., a sulfatase motif in which the cysteine or serine residue has been converted to FGly by action of an FGE).

“Conversion” as used in the context of action of a formylglycine generating enzyme (FGE) on a sulfatase motif refers to biochemical modification of a cysteine or serine residue in a sulfatase motif to a formylglycine (FGly) residue (e.g., Cys to FGly, or Ser to FGly).

“Modification” encompasses addition, removal, or alteration of a moiety. As used in the context of a polypeptide having a converted sulfatase motif, “modification” is meant to refer to chemical or biochemical modification of an FGly residue of an aldehyde tag of a polypeptide through reaction of the FGly aldehyde moiety with a reactive partner. As discussed above, the term “conversion” refers to a type of biochemical modification of an FGly residue of an aldehyde tag mediated by an FGE. An aldehyde tag that is modified by reaction of an FGly with a reactive partner as described herein is sometimes referred to as a “modified ald tag” or an aldehyde tag containing “FGly′”.

By “genetically-encodable” as used in reference to an amino acid sequence of a polypeptide, peptide or protein is meant that the amino acid sequence is composed of amino acid residues that are capable of production by transcription and translation of a nucleic acid encoding the amino acid sequence, where transcription and/or translation may occur in a cell or in a cell-free in vitro transcription/translation system.

The term “control sequences” refers to DNA sequences that facilitate expression of an operably linked coding sequence in a particular expression system, e.g., mammalian cell, bacterial cell, cell-free synthesis, etc. The control sequences that are suitable for prokaryote systems, for example, include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cell systems may utilize promoters, polyadenylation signals, and enhancers.

A nucleic acid is “operably linked” when it is placed into a functional relationship with another nucleic acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate the initiation of translation. Generally, “operably linked” means that the DNA sequences being linked are contiguous, and, in the case of a secretory leader, contiguous and in reading frame. Linking is accomplished by ligation or through amplification reactions. Synthetic oligonucleotide adaptors or linkers may be used for linking sequences in accordance with conventional practice.

The term “expression cassette” as used herein refers to a segment of nucleic acid, usually DNA, that can be inserted into a nucleic acid (e.g., by use of restriction sites compatible with ligation into a construct of interest or by homologous recombination into a construct of interest or into a host cell genome). In general, the nucleic acid segment comprises a polynucleotide that encodes a polypeptide of interest (e.g., an aldehyde tagged-carrier protein), and the cassette and restriction sites are designed to facilitate insertion of the cassette in the proper reading frame for transcription and translation. Expression cassettes can also comprise elements that facilitate expression of a polynucleotide encoding a polypeptide of interest in a host cell. These elements may include, but are not limited to: a promoter, a minimal promoter, an enhancer, a response element, a terminator sequence, a polyadenylation sequence, and the like.

As used herein the term “isolated” is meant to describe a compound of interest that is in an environment different from that in which the compound naturally occurs. “Isolated” is meant to include compounds that are within samples that are substantially enriched for the compound of interest and/or in which the compound of interest is partially or substantially purified.

As used herein, the term “substantially purified” refers to a compound that is removed from its natural environment and is at least 60% free, usually 75% free, and most usually 90% free from other components with which it is naturally associated.

The term “physiological conditions” is meant to encompass those conditions compatible with living cells, e.g., predominantly aqueous conditions of a temperature, pH, salinity, etc. that are compatible with living cells.

By “heterologous” is meant that a first entity and second entity are provided in an association that is not normally found in nature. For example, a protein containing a “heterologous” sulfatase motif or “heterologous” ald-tag is a protein that does not normally contain a sulfatase motif at that position within its amino acid sequence (e.g., proteins which have a single, native sulfatase motif can contain a second sulfatase motif that is “heterologous”; further, proteins which contain a sulfatase motif can be modified so as to reposition the sulfatase motif, rendering the re-positioned sulfatase motif “heterologous” to the protein). In some embodiments, a heterologous sulfatase motif is present in a polypeptide which contains no native sulfatase motif.

By “reactive partner” is meant a molecule or molecular moiety that specifically reacts with another reactive partner to produce a reaction product. Exemplary reactive partners include a cysteine or serine of a sulfatase motif and a formylglycine generating enzyme (FGE), which react to form a reaction product of a converted aldehyde tag containing an FGly in lieu of cysteine or serine in the motif. Other exemplary reactive partners include an aldehyde of a formylglycine (FGly) residue of a converted aldehyde tag and a reactive partner reagent comprising a moiety of interest, which react to form a reaction product of a modified aldehyde tagged polypeptide having the moiety of interest conjugated to the aldehyde tagged polypeptide at the FGly residue.

“N-terminus” refers to the terminal amino acid residue of a polypeptide having a free amine group, which amine group in non-N-terminus amino acid residues normally forms part of the covalent backbone of the polypeptide.

“C-terminus” refers to the terminal amino acid residue of a polypeptide having a free carboxyl group, which carboxyl group in non-C-terminus amino acid residues normally forms part of the covalent backbone of the polypeptide.

By “N-terminal” is meant the region of a polypeptide that is closer to the N-terminus than to the C-terminus.

By “C-terminal” is meant the region of a polypeptide that is closer to the C-terminus than to the N-terminus.

By “internal site” as used in reference to a polypeptide or an amino acid sequence of a polypeptide is meant a region of the polypeptide that is not at the N-terminus or at the C-terminus, and includes both N-terminal and C-terminal regions of the polypeptide.

Introduction

The present disclosure provides aldehyde-tagged protein carriers that can be covalently and site-specifically bound to drug to provide a drug-containing scaffold, methods of production of such drug-containing scaffolds and intermediates, and methods of use. Aldehyde-tagged carrier proteins may also be referred to herein as “ald-tagged carrier proteins”, “ald-tagged protein scaffolds” or “ald-tagged scaffolds”. Such ald-tagged scaffolds can be site-specifically decorated with a covalently bound molecule of interest, such as a drug (e.g., a peptide, a small molecule drug, and the like). Such drug-scaffold conjugates can provide for enhanced serum half-life of the drug.

The compositions and methods of the present disclosure exploit a naturally-occurring, genetically-encodable sulfatase motif for use as a tag, referred to herein as an “aldehyde tag” or “ald tag”, to direct site-specific modification of the carrier protein. The sulfatase motif of the aldehyde tag, which is based on a motif found in active sites of sulfatases, contains a serine or cysteine residue that is capable of being converted (oxidized) to a 2-formylglycine (FGly) residue by action of a formylglycine generating enzyme (FGE) either in vivo (e.g., at the time of translation of an ald tag-containing protein in a cell) or in vitro (e.g., by contacting an ald tag-containing protein with an FGE in a cell-free system). The aldehyde moiety of the resulting FGly residue can be used as a “chemical handle” to facilitate site-specific chemical modification of the protein, and thus site-specific attachment of a drug of interest. For example, a peptide modified to contain an α-nucleophile-containing moiety (e.g., an aminooxy or hydrazide moiety) can be reacted with the FGly-containing carrier protein to yield a conjugate in which the carrier protein and peptide are linked by an oxime or hydrazone bond, respectively. The reactivity of the aldehyde thus allows for bioorthogonal and chemoselective modification of the carrier protein, and thus provides a site-specific means for chemical modification that in turn can be exploited to provide for site-specific attachment of a moiety of interest in the final conjugate.

For illustrative purposes, a schematic of production of an ald-tagged carrier protein is provided in FIG. 1A. In this example, a construct encoding a carrier protein having an ald tag (exemplified by LCTPSR (SEQ ID NO:1)) is expressed in a host cell (exemplified by the yeast S. cerevisiae) which is genetically modified to contain an FGE of M. tuberculosis. When expressed in yeast not modified with FGE, the recombinant protein is produced without the cysteine being converted to FGly. In that case, the carrier protein can be purified and added to recombinant, purified FGE, generating the aldehyde tag on the carrier protein. The resulting carrier protein contains an FGly having an aldehyde moiety (arrow). The ald-tagged carrier protein is then reacted with a peptide having a reactive aminooxy moiety. The reaction product is a drug-carrier protein conjugate having the peptide bound to the carrier protein through an oxime bond. FIG. 1B illustrates how the ald-tag can be placed at various positions on the carrier protein, thus providing for drug-carrier protein conjugates having bound drug (exemplified by a peptide drug) at different positions on the carrier protein.

Exemplary methods and compositions for practice of the invention will now be described in more detail.

Aldehyde Tags

In general, an aldehyde tag (“ald tag”) can be based on any amino acid sequence derived from a sulfatase motif (also referred to as a “sulfatase domain”), which is capable of being converted by action of a formylglycine generating enzyme (FGE) to contain a formylglycine (FGly). Action of FGE is directed in a sequence-specific manner in that the FGE acts at a sulfatase motif, but this sulfatase motif can be positioned within any region of the carrier protein. Thus, FGE-mediated conversion of a sulfatase motif is site-specific (i.e., in that FGE acts at the amino acid sequence of a sulfatase motif) but the ability of FGE to act upon the sulfatase motif is sequence context-independent (i.e., the ability of the FGE to convert a cysteine/serine of a sulfatase motif is independent of the sequence context in which the sulfatase motif is presented in the carrier protein).
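
As a minimal sketch of tag placement (an illustration only, not a method described in this disclosure), the following Python snippet inserts the exemplary tag LCTPSR at the N-terminus, the C-terminus, or a chosen internal position of a carrier sequence. The function names, the 0-based position convention, and the toy carrier string are assumptions made for illustration; a real design would also need to account for leader sequences and for whether the chosen site is solvent-accessible in the folded protein.

```python
# Exemplary aldehyde tag named in FIG. 1A (SEQ ID NO:1).
ALD_TAG = "LCTPSR"


def insert_tag(carrier: str, position: int, tag: str = ALD_TAG) -> str:
    """Insert `tag` before the residue at 0-based `position` (hypothetical helper).

    position == 0 gives an N-terminal tag; position == len(carrier) gives a
    C-terminal tag; anything in between gives an internal tag.
    """
    if not 0 <= position <= len(carrier):
        raise ValueError("position must lie within the carrier sequence")
    return carrier[:position] + tag + carrier[position:]


def tagged_variants(carrier: str, internal_positions: list[int]) -> dict[str, str]:
    """Build a small set of differently tagged variants of one carrier."""
    variants = {
        "N-terminus": insert_tag(carrier, 0),
        "C-terminus": insert_tag(carrier, len(carrier)),
    }
    for pos in internal_positions:
        variants[f"internal@{pos}"] = insert_tag(carrier, pos)
    return variants


# Toy placeholder sequence, not a real carrier protein.
library = tagged_variants("AAAAGGGG", internal_positions=[4])
```

Each variant carries the same sulfatase motif in a different sequence context, which is the property the FGE is described above as tolerating.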

Exemplary Aldehyde Tags

A minimal sulfatase motif of an aldehyde tag is usually about 5 or 6 amino acid residues in length, usually no more than 6 amino acid residues in length. In general, it is normally desirable to minimize the extent of modification of the native amino acid sequence of the carrier protein, so as to minimize the number of amino acid residues that are inserted, deleted, substituted (replaced), or added (e.g., to the N- or C-terminus). Minimizing the extent of amino acid sequence modification of the carrier protein is usually desirable so as to minimize the impact such modifications may have upon carrier protein structure and/or immunogenicity. Thus, aldehyde tags of particular interest include those that require modification (insertion, addition, deletion, substitution/replacement) of less than 16, 15, 14, 13, 12, 11, 10, 9, 8, or 7 amino acid residues of the amino acid sequence of the carrier protein (e.g., the carrier polypeptide).

It should be noted that while aldehyde tags of particular interest are those based on a minimal sulfatase motif, it will be readily appreciated that longer aldehyde tags are both contemplated and encompassed by the present disclosure and can find use in the compositions and methods of the invention. Aldehyde tags can thus comprise a minimal sulfatase motif of 5 or 6 residues, or can be longer and comprise a minimal sulfatase motif which can be flanked at the N- and/or C-terminal sides of the motif by additional amino acid residues. Aldehyde tags of, for example, 5 or 6 amino acid residues are contemplated, as well as longer amino acid sequences of more than 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or more amino acid residues.

In general, sulfatase motifs useful in aldehyde tags as described herein are of the formula:

X₁Z₁X₂Z₂X₃Z₃ (I)

or, in an exemplary embodiment

X₁Z₁X₂Z₂X₃R (Ia)

where

Z₁ is cysteine or serine (which can also be represented by (C/S));

Z₂ is either a proline or alanine residue (which can also be represented by (P/A));

Z₃ is a basic amino acid (which may be arginine (R), lysine (K) or histidine (H), usually lysine), or an aliphatic amino acid (alanine (A), glycine (G), leucine (L), valine (V), isoleucine (I), or proline (P), usually A, G, L, V, or I), where Z₃ is arginine (R) in Formula (Ia);

X₁ is present or absent and, when present, can be any amino acid, though usually an aliphatic amino acid, a sulfur-containing amino acid, or a polar, uncharged amino acid (i.e., other than an aromatic amino acid or a charged amino acid), usually L, M, V, S or T, more usually L, M, S or V, with the proviso that when the sulfatase motif is at the N-terminus of the carrier protein, X₁ is present; and

X₂ and X₃ independently can be any amino acid, though usually an aliphatic amino acid, a polar, uncharged amino acid, or a sulfur-containing amino acid (i.e., other than an aromatic amino acid or a charged amino acid), usually S, T, A, V, G or C, more usually S, T, A, V or G.

It should be noted that, following action of an FGE on the sulfatase motif, Z₁ is oxidized to generate a 2-formylglycine (FGly) residue. Furthermore, following both FGE-mediated conversion and reaction with a reactive partner of a drug of interest, the FGly at position Z₁ in the formula above is covalently bound to the drug (e.g., a peptide drug, etc.). The reactive partner generally is an α-nucleophile, such as an aminooxy or hydrazide group, and provides for linkage of the carrier protein to the drug through an oxime or hydrazone linkage. Thus the carrier protein and drug are not linked through an amide bond, as may be found in other drug conjugates based on recombinant fusion protein technology.
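By way of illustration only, the motif definition above can be expressed as a sequence pattern. The following Python sketch locates candidate sulfatase motifs in a one-letter amino acid sequence and writes the converted form with the Z₁ residue shown as FGly. The function names, the example sequence, and the choice to match the five-residue core beginning at Z₁ (leaving the optional X₁ unconstrained) are assumptions made for this sketch; a sequence match does not by itself establish that a site is accessible to, or converted by, an FGE.

```python
import re

# Core of the motif, starting at Z1: Z1 X2 Z2 X3 Z3 (X1 is optional and unconstrained).
# Z1 = C/S, Z2 = P/A, Z3 = basic (R, K, H) or aliphatic (A, G, L, V, I, P).
# The lookahead lets overlapping candidate sites be reported.
CORE_MOTIF = re.compile(r"(?=([CS][A-Z][PA][A-Z][RKHAGLVIP]))")


def find_sulfatase_motifs(sequence):
    """Return (index of Z1, five-residue core) for every candidate motif."""
    return [(m.start(), m.group(1)) for m in CORE_MOTIF.finditer(sequence)]


def show_converted(sequence, z1_index):
    """Write the sequence with the Z1 residue replaced by the label (FGly)."""
    if sequence[z1_index] not in "CS":
        raise ValueError("Z1 must be cysteine or serine")
    return sequence[:z1_index] + "(FGly)" + sequence[z1_index + 1:]


carrier = "MKLCTPSRGG"  # hypothetical sequence, for illustration only
sites = find_sulfatase_motifs(carrier)
print(sites)
print(show_converted(carrier, sites[0][0]))
```

In this hypothetical sequence the only candidate site is the cysteine of LCTPSR, and the converted form is written with (FGly) in place of that cysteine.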

Where the aldehyde tag is present at a location other than the N-terminus of the carrier protein, X₁ of the formula above can be provided by an amino acid residue of the native amino acid sequence of the carrier protein. Therefore, in some embodiments, and when present at a location other than the N-terminus of a carrier protein, sulfatase motifs are of the formula:

(C/S)X₁(P/A)X₂Z₃ (II)

or, in an exemplary embodiment

(C/S)X₁(P/A)X₂R (IIa)

where X₁ and X₂ independently can be any amino acid, though usually an aliphatic amino acid, a polar, uncharged amino acid, or a sulfur-containing amino acid (i.e., other than an aromatic amino acid or a charged amino acid), usually S, T, A, V, or C, more usually S, T, A, or V. Z₃ in Formula II is defined as above.

As noted above, the sulfatase motif can contain additional residues at one or both of the N- and C-terminus of the sequence, e.g., such that the aldehyde tag includes both a sulfatase motif and an “auxiliary motif”. In one embodiment, the sulfatase motif includes an auxiliary motif at the C-terminus (i.e., following the arginine residue in the formula above) comprising 1, 2, 3, 4, 5, 6, or all 7 of the contiguous residues of an amino acid sequence of AALLTGR (SEQ ID NO:46), SQLLTGR (SEQ ID NO:47), AAFMTGR (SEQ ID NO:48), AAFLTGR (SEQ ID NO:49), SAFLTGR (SEQ ID NO:50), ASILTGK (SEQ ID NO:51), VSFLTGR (SEQ ID NO:52), ASLLTGL (SEQ ID NO:53), ASILITG (SEQ ID NO:54), VSFLTGR (SEQ ID NO:55), SAIMTGR (SEQ ID NO:56), SAIVTGR (SEQ ID NO:57), TNLWRG (SEQ ID NO:58), TNLWRGQ (SEQ ID NO:59), TNLCAAS (SEQ ID NO:60), VSLWTGK (SEQ ID NO:61), SMLLTG (SEQ ID NO:62), SMLLTGN (SEQ ID NO:63), SMLLTGT (SEQ ID NO:64), ASFMAGQ (SEQ ID NO:65), or ASLLTGL (SEQ ID NO:66) (see, e.g., Dierks et al. (1999) EMBO J 18(8): 2084-2091), or of GSLFTGR (SEQ ID NO:67). Additional C-terminal amino acid residues are not required for FGE-mediated conversion of the sulfatase motif of the aldehyde tag, and thus are optional and may be specifically excluded from the aldehyde tags described herein. In some embodiments the aldehyde tag does not contain an amino acid sequence CGPSR(M/A)S (SEQ ID NO:68) or CGPSR(M/A) (SEQ ID NO:69), which may be present as a native amino acid sequence in phosphonate monoester hydrolases.

The sulfatase motif of the aldehyde tag is generally selected so as to be capable of conversion by a selected FGE, e.g., an FGE present in a host cell in which the aldehyde tagged polypeptide is expressed or an FGE which is to be contacted with the aldehyde tagged polypeptide in a cell-free in vitro method.

Selection of aldehyde tags and an FGE that provide for suitable reactive partners to provide for generation of an FGly in the aldehyde tagged carrier protein can be readily accomplished in light of information available in the art. In general, sulfatase motifs susceptible to conversion by a eukaryotic FGE contain a cysteine and a proline (i.e., a cysteine and proline at Z₁ and Z₂, respectively, in Formula I above (e.g., X₁CX₂PX₃R); CX₁PX₂R in Formula II above) and are modified by the “SUMF1-type” FGE (Cosma et al. Cell 2003, 113, (4), 445-56; Dierks et al. Cell 2003, 113, (4), 435-44). Sulfatase motifs susceptible to conversion by a prokaryotic FGE contain either a cysteine or a serine, and a proline in the sulfatase motif (i.e., a cysteine or serine at Z₁, and a proline at Z₂, respectively, in Formula I above (e.g., X₁(C/S)X₂PX₃R); (C/S)X₁PX₂R in Formula II above) and are modified either by the “SUMF1-type” FGE or the “AtsB-type” FGE, respectively (Szameit et al. J Biol Chem 1999, 274, (22), 15375-81). Other sulfatase motifs susceptible to conversion by a prokaryotic FGE contain either a cysteine or a serine, and either a proline or an alanine in the sulfatase motif (i.e., a cysteine or serine at Z₁, and a proline or alanine at Z₂, respectively, e.g., SX₁AX₂R; X₁CX₂PX₃Z₃; X₁SX₂PX₃Z₃; X₁CX₂AX₃Z₃; X₁SX₂AX₃Z₃; CX₁PX₂Z₃; SX₁PX₂Z₃; CX₁AX₂Z₃; SX₁AX₂Z₃ (in Formula I above); CX₁PX₂Z₃ (in Formula II above); X₁CX₂PX₃R; X₁SX₂PX₃R; X₁CX₂AX₃R; X₁SX₂AX₃R (in Formula Ia above); CX₁PX₂R; SX₁PX₂R; CX₁AX₂R; SX₁AX₂R (in Formula IIa above)), and are susceptible to modification by, for example, an FGE of a Firmicutes (e.g., Clostridium perfringens) (see Berteau et al. J. Biol. Chem. 2006; 281:22464-22470).
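The pairing of motif and FGE described above can be summarized, purely as a sketch, by inspecting the residues at Z₁ and Z₂. In the Python example below, the function name and the text labels for the FGE classes are hypothetical conveniences, and the rules encode only the general statements of the preceding paragraph; actual conversion by a given FGE would need to be confirmed experimentally.

```python
def compatible_fge_classes(core):
    """Suggest FGE classes for a five-residue core written Z1 X Z2 X Z3.

    The rules follow the general statements in the text:
      C at Z1 with P at Z2 -> eukaryotic or prokaryotic SUMF1-type FGE
      S at Z1 with P at Z2 -> prokaryotic AtsB-type FGE
      C/S at Z1 with P/A at Z2 -> FGE of a Firmicutes (e.g., C. perfringens)
    The returned labels are illustrative names, not established nomenclature.
    """
    if len(core) != 5:
        raise ValueError("expected a five-residue core starting at Z1")
    z1, z2 = core[0], core[2]
    if z1 not in "CS" or z2 not in "PA":
        return []
    classes = []
    if z1 == "C" and z2 == "P":
        classes += ["eukaryotic SUMF1-type", "prokaryotic SUMF1-type"]
    if z1 == "S" and z2 == "P":
        classes.append("prokaryotic AtsB-type")
    classes.append("Firmicutes FGE")
    return classes


for core in ("CTPSR", "STPSR", "CTASR"):
    print(core, compatible_fge_classes(core))
```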

Therefore, for example, where the FGE is a eukaryotic FGE (e.g., a mammalian FGE, including a human FGE), the sulfatase motif is usually of the formula:

X₁CX₂PX₃Z₃ (III)

or, in an exemplary embodiment

X₁CX₂PX₃R (IIIa)

where

X₁ may be present or absent and, when present, can be any amino acid, though usually an aliphatic amino acid, a sulfur-containing amino acid, or a polar, uncharged amino acid (i.e., other than an aromatic amino acid or a charged amino acid), usually L, M, S or V, with the proviso that when the sulfatase motif is at the N-terminus of the carrier protein, X₁ is present;

X₂ and X₃ independently can be any amino acid, though usually an aliphatic amino acid, a sulfur-containing amino acid, or a polar, uncharged amino acid (i.e., other than an aromatic amino acid or a charged amino acid), usually S, T, A, V, G, or C, more usually S, T, A, V or G; and

Z₃ is a basic amino acid (which may be other than arginine (R), and may be lysine (K) or histidine (H), usually lysine), or an aliphatic amino acid (alanine (A), glycine (G), leucine (L), valine (V), isoleucine (I), or proline (P), usually A, G, L, V, or I), where Z₃ is arginine (R) in Formula IIIa.

Specific examples of sulfatase motifs include LCTPSR (SEQ ID NO:1), MCTPSR (SEQ ID NO:2), VCTPSR (SEQ ID NO:3), LCSPSR (SEQ ID NO:4), LCAPSR (SEQ ID NO:5), LCVPSR (SEQ ID NO:6), LCGPSR (SEQ ID NO:7), ICTPAR (SEQ ID NO:8), LCTPSK (SEQ ID NO:9), MCTPSK (SEQ ID NO:10), VCTPSK (SEQ ID NO:11), LCSPSK (SEQ ID NO:12), LCAPSK (SEQ ID NO:13), LCVPSK (SEQ ID NO:14), LCGPSK (SEQ ID NO:15), LCTPSA (SEQ ID NO:16), ICTPAA (SEQ ID NO:17), MCTPSA (SEQ ID NO:18), VCTPSA (SEQ ID NO:19), LCSPSA (SEQ ID NO:20), LCAPSA (SEQ ID NO:21), LCVPSA (SEQ ID NO:22), and LCGPSA (SEQ ID NO:23). Other specific sulfatase motifs are readily apparent from the disclosure provided herein.

As described in more detail below, a converted aldehyde tagged polypeptide is reacted with a reactive partner of a moiety of interest to provide for conjugation of the moiety of interest to the FGly residue of the converted aldehyde tagged polypeptide, and production of a modified polypeptide (e.g., a conjugate of the ald-tagged carrier protein and a peptide drug). Modified polypeptides having a modified aldehyde tag are generally described as comprising a modified sulfatase motif of the formula:

X₁(FGly′)X₂Z₂X₃Z₃ (I′)

or, in an exemplary embodiment

X₁(FGly′)X₂Z₂X₃R (Ia′)

where

FGly′ is a formylglycine residue having a covalently attached moiety (e.g., a peptide drug);

Z₂ is either a proline or alanine residue (which can also be represented by (P/A));

Z₃ in Formula I′ is a basic amino acid (which may be arginine (R) (as in Formula Ia′), lysine (K) or histidine (H), usually lysine), or an aliphatic amino acid (alanine (A), glycine (G), leucine (L), valine (V), isoleucine (I), or proline (P), usually A, G, L, V, or I);

X₁ may be present or absent and, when present, can be any amino acid, though usually an aliphatic amino acid, a sulfur-containing amino acid, or a polar, uncharged amino acid (i.e., other than an aromatic amino acid or a charged amino acid), usually L, M, V, S or T, more usually L, M or V, with the proviso that when the sulfatase motif is at the N-terminus of the carrier protein, X₁ is present; and

X₂ and X₃ are as defined above for Formula I.

Specific examples of converted sulfatase motifs include L(FGly)TPSR (SEQ ID NO:24), M(FGly)TPSR (SEQ ID NO:25), V(FGly)TPSR (SEQ ID NO:26), L(FGly)SPSR (SEQ ID NO:27), L(FGly)APSR (SEQ ID NO:28), L(FGly)VPSR (SEQ ID NO:29), L(FGly)GPSR (SEQ ID NO:30), I(FGly)TPAR (SEQ ID NO:31), L(FGly)TPSK (SEQ ID NO:32), M(FGly)TPSK (SEQ ID NO:33), V(FGly)TPSK (SEQ ID NO:34), L(FGly)SPSK (SEQ ID NO:35), L(FGly)APSK (SEQ ID NO:36), L(FGly)VPSK (SEQ ID NO:37), L(FGly)GPSK (SEQ ID NO:38), L(FGly)TPSA (SEQ ID NO:39), M(FGly)TPSA (SEQ ID NO:40), V(FGly)TPSA (SEQ ID NO:41), L(FGly)SPSA (SEQ ID NO:42), L(FGly)APSA (SEQ ID NO:43), L(FGly)VPSA (SEQ ID NO:44), and L(FGly)GPSA (SEQ ID NO:45). It will be appreciated that exemplary carrier proteins that are covalently bound to drug through reaction with the aldehyde of the FGly residue include those having the amino acid sequences described above, but with the modified FGly (represented above by FGly′) in lieu of the unmodified FGly.

Carrier Proteins

In general a “carrier protein” is a protein that is biologically inert, is susceptible to modification by use of the ald tag technology as disclosed herein, and which can provide for solvent-accessible presentation of drug conjugated to the carrier protein through a modified ald-tag positioned in the carrier protein (e.g., through an oxime or hydrazone bond within the converted sulfatase motif of the ald tagged carrier protein) in a physiological environment. “Biologically inert” is meant to indicate the carrier protein exhibits clinically insignificant or no detectable biological activity when administered to the appropriate subject, particularly when administered to a human subject. Thus, carrier proteins are biologically inert in that they, for example, are of low immunogenicity, do not exhibit significant or detectable targeting properties (e.g., do not exhibit significant or detectable activity in binding to a specific receptor), and exhibit little or no detectable biological activity that may interfere with activity of a drug to be conjugated to the ald-tagged carrier protein. By “low immunogenicity” is meant that the carrier protein elicits little or no detectable immune response upon administration to a subject, especially a mammalian subject, more especially a human subject. Carrier proteins can be provided in monomeric or multimeric (e.g., dimeric) forms.

Carrier proteins having a three-dimensional structure when folded that provides for multiple different solvent-accessible sites that are amenable to ald-tag modification (and thus conjugation to a drug) are of particular interest. In general, carrier proteins of interest are those that are of a size and three-dimensional folded structure so as to provide for presentation of conjugated drug on solvent-accessible surfaces in a manner that is sufficiently spatially separated so as to provide for activity and bioavailability of the conjugated drug molecules. The carrier protein will be selected according to a variety of factors including, but not limited to, the drug to be conjugated to the carrier.

Accordingly, any of a wide variety of polypeptides can be suitable for use as ald-tagged carrier proteins in the drug-carrier protein conjugates of the present disclosure. Such carrier proteins can include those having a naturally-occurring amino acid sequence, a native amino acid sequence having an N-terminal methionine, fragments of naturally-occurring polypeptides, and non-naturally occurring polypeptides and fragments thereof.

Exemplary carrier proteins include, but are not necessarily limited to, albumin and fragments thereof (e.g., human serum albumin, bovine serum albumin, and the like), transferrin and fragments thereof (e.g., human transferrin), and Fc fragments having reduced binding to a mammalian Fc receptor, particularly a human Fc receptor (e.g., a modified Fc fragment of an antibody (e.g., IgG), particularly a mammalian antibody, e.g., a human antibody). Exemplary modified Fc fragments having reduced Fc receptor binding are exemplified by the Fc fragments of Herceptin (trastuzumab) and Rituxan (Rituximab), which contain point mutations that provide for reduced Fc receptor binding (see, e.g., Clynes et al. Nature Medicine 2000, 6, 443-446). Alternatively or in addition, the isotype of the Fc fragment can be selected according to a desired level of Fc receptor binding (e.g., use of an Fc fragment of an IgG4 isotype human heavy chain constant region rather than from IgG1 or IgG3) (see, e.g., Fridman FASEB J 1991 September; 5 (12): 2684-90). In general, carrier proteins can be at least about 4 kDa (e.g., about 50 amino acid residues in length), usually at least about 25 kDa, and can be larger in size (e.g., transferrin has a molecular weight of 90 kDa while Fc fragments can have molecular weights of 30 kDa to 50 kDa).

Modification of Carrier Proteins to Contain an Aldehyde Tag

An aldehyde tag can be provided in a carrier protein by insertion (e.g., so as to provide a 5 or 6 amino acid residue insertion within the native amino acid sequence) and/or by addition (e.g., at an N- or C-terminus of the carrier protein). An aldehyde tag can also be provided by complete or partial substitution of native amino acid residues of the carrier protein with the contiguous amino acid sequence of an aldehyde tag. For example, a heterologous aldehyde tag of 5 (or 6) amino acid residues can be provided in a carrier protein by replacing 1, 2, 3, 4, or 5 (or 1, 2, 3, 4, 5, or 6) amino acid residues of the native amino acid sequence with the corresponding amino acid residues of the aldehyde tag.
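The three routes described above (insertion, terminal addition, and substitution) can be sketched as simple string operations on a one-letter amino acid sequence. The Python example below is illustrative only; the function names and the short test sequence are hypothetical, and positions are zero-based indices into the native sequence. The substitution helper also reports how many native residues actually change, which relates to the stated preference for minimizing modification of the carrier protein.

```python
def insert_tag(sequence, tag, position):
    """Insert the tag before the residue at `position`.

    position == 0 gives an N-terminal addition and
    position == len(sequence) gives a C-terminal addition.
    """
    if not 0 <= position <= len(sequence):
        raise ValueError("position outside the sequence")
    return sequence[:position] + tag + sequence[position:]


def substitute_tag(sequence, tag, position):
    """Overwrite len(tag) native residues starting at `position`.

    Returns the modified sequence and the number of residues that differ
    from the native sequence (matching residues need no replacement).
    """
    window = sequence[position:position + len(tag)]
    if position < 0 or len(window) != len(tag):
        raise ValueError("tag does not fit at this position")
    changed = sum(1 for native, new in zip(window, tag) if native != new)
    return sequence[:position] + tag + sequence[position + len(tag):], changed


native = "AAALCTASRAAA"  # hypothetical sequence, for illustration only
print(insert_tag(native, "LCTPSR", len(native)))  # C-terminal addition
print(substitute_tag(native, "LCTPSR", 3))        # only one residue changes
```

In the substitution example, five of the six tag residues already match the hypothetical native sequence, so a single replacement (A to P) yields the complete tag.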

Modification of a carrier protein to include one or more aldehyde tags can be accomplished using recombinant molecular genetic techniques, so as to produce nucleic acid encoding the desired aldehyde tagged carrier protein. Such methods are well known in the art, and include cloning methods, site-specific mutation methods, and the like (see, e.g., Sambrook et al., In “Molecular Cloning: A Laboratory Manual” (Cold Spring Harbor Laboratory Press 1989); “Current Protocols in Molecular Biology” (eds., Ausubel et al.; Greene Publishing Associates, Inc., and John Wiley & Sons, Inc. 1990 and supplements)). Alternatively, an aldehyde tag can be added using non-recombinant techniques, e.g., using native chemical ligation or pseudo-native chemical ligation, e.g., to add an aldehyde tag to a C-terminus of the carrier protein (see, e.g., U.S. Pat. Nos. 6,184,344; 6,307,018; 6,451,543; 6,570,040; US 2006/0173159; US 2006/0149039). See also Rush et al. (Jan. 5, 2006) Org Lett. 8(1):131-4.

Aldehyde tags can be positioned at any suitable location within a carrier protein, with the proviso that the site of the aldehyde tag is accessible for conversion by an FGE and subsequent modification at the FGly, or can be rendered accessible (e.g., by denaturing the protein). The carrier protein can include one or more aldehyde tags. The number of aldehyde tags that can be present in a carrier protein will vary with the carrier protein selected, and may include 1, 2, 3, 4, 5, or more aldehyde tags.

Carrier Proteins Containing Multiple Ald Tags

Multiple ald tags can be positioned in the ald-tagged carrier protein so as to distribute the tags over the surface of the folded carrier protein. Where the carrier protein is modified to contain multiple ald tags, the ald tags can be spaced apart in the carrier protein by amino acid residues native to the carrier protein. Alternatively or in addition, the carrier protein can include ald tags spaced apart by a linker, where the linker has an amino acid sequence heterologous to the carrier protein.

Alternatively or in addition, the ald tags can be provided in the ald-tagged carrier protein as a concatameric construct of 2, 3, 4 or more ald tags, where the expression construct thus encodes 2, 3, 4 or more sulfatase motifs in a contiguous sequence of the modified carrier protein, wherein the sulfatase motifs are separated by a linker. The linkers of the concatameric constructs may be designed so as to facilitate presentation of drug conjugated to the ald tag in the final carrier protein-drug conjugate. For example, the linker can be selected so as to provide flexibility between the ald tags, thus allowing for rotation of covalently-bound drug molecules so as to enhance presentation of biologically active drug on the carrier protein-drug conjugate surface. Such linkers can also be used where the ald tags are not provided as a concatamer, e.g., where an ald tag is positioned at a C- or N-terminus of a carrier protein. Ald tags, including those provided as concatamers, can be positioned at or near the C-terminus of the carrier protein, at or near the N-terminus of the carrier protein, and/or in one or more solvent-accessible loops of the carrier protein.

Linkers will be selected according to a variety of factors (e.g., the ald tag used, the number of ald tags in the concatamer, the degree of flexibility desired), and can be of variable length, such as from about 3 amino acids to about 25 amino acids, including about 4 amino acids to about 23 amino acids, about 5 amino acids to about 20 amino acids, about 6 amino acids to about 18 amino acids, about 7 amino acids to about 16 amino acids, about 8 amino acids to about 14 amino acids, and about 9 amino acids to about 12 amino acids. Exemplary flexible linkers include glycine polymers (G)_n, glycine-serine polymers (including, for example, (GS)_n, (GSGGS)_n (SEQ ID NO:90) and (GGGS)_n (SEQ ID NO:91), where n is an integer of at least one), glycine-alanine polymers, alanine-serine polymers, and other flexible linkers such as the tether for the shaker potassium channel, and a large variety of other flexible linkers, as will be appreciated by those in the art. Glycine and glycine-serine polymers are of interest since both of these amino acids are relatively unstructured, and therefore may be able to serve as a neutral tether between components. Glycine polymers are of particular interest since glycine accesses significantly more phi-psi space than even alanine, and is much less restricted than residues with longer side chains (see Scheraga, Rev. Computational Chem. III:73-142 (1992)). Exemplary flexible linkers include, but are not limited to, Gly-Gly-Ser-Gly-Gly (SEQ ID NO:92), Gly-Ser-Gly-Ser-Gly (SEQ ID NO:93), Gly-Ser-Gly-Gly-Gly (SEQ ID NO:94), Gly-Gly-Gly-Ser-Gly (SEQ ID NO:95), Gly-Ser-Ser-Ser-Gly (SEQ ID NO:96), and the like.
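As a sketch of how the repeat-based linkers above relate to the stated length range, the following Python example enumerates the repeats of a given unit whose total length falls between about 3 and about 25 residues. The function name and the default bounds are assumptions chosen to mirror the broadest range recited above; the narrower ranges can be explored by passing different bounds.

```python
def linkers_in_range(unit, min_len=3, max_len=25):
    """List (unit)_n linkers, n >= 1, whose length lies in [min_len, max_len]."""
    if not unit:
        raise ValueError("repeat unit must not be empty")
    linkers = []
    n = 1
    while len(unit) * n <= max_len:
        if len(unit) * n >= min_len:
            linkers.append(unit * n)
        n += 1
    return linkers


# Repeat units taken from the text: (G)_n, (GS)_n, (GSGGS)_n and (GGGS)_n.
for unit in ("G", "GS", "GSGGS", "GGGS"):
    candidates = linkers_in_range(unit)
    print(unit, [len(linker) for linker in candidates])
```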

Concatameric ald tag constructs containing a linker can be described by the general formula:

T₁-L_n-T₂

where T₁ and T₂ are the same or different ald tags as described herein (see, e.g., formulae I, Ia, I′, Ia′, II, IIa, III, and IIIa), L is a linker peptide, and n is an integer of 1 or more, and may be 2, 3, 4, 5, 6, 7, 8 or more. An exemplary amino acid sequence of a concatameric ald tag containing a linker is LCTPSR GGGG LCTPSR (SEQ ID NO:97), where the cysteine (C) is modified to an FGly by action of an FGE, and can be reacted with a reactive partner-containing drug to provide for covalently bound drug as described herein.
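The general formula T₁-L_n-T₂ can be illustrated with a short Python sketch. Here L_n is read as n consecutive copies of the linker unit L, which is an interpretive assumption, and the function name is hypothetical. With T₁ = T₂ = LCTPSR, L = G and n = 4, the sketch reproduces the exemplary sequence given above.

```python
def build_concatamer(tag_1, linker_unit, n, tag_2):
    """Assemble T1-L_n-T2, reading L_n as n consecutive copies of the linker unit."""
    if n < 1:
        raise ValueError("n must be an integer of 1 or more")
    return tag_1 + linker_unit * n + tag_2


concatamer = build_concatamer("LCTPSR", "G", 4, "LCTPSR")
print(concatamer)

# Indices of the cysteines that an FGE would act upon in this construct.
print([i for i, residue in enumerate(concatamer) if residue == "C"])
```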

The aldehyde tag(s) can be positioned in the carrier protein so as to take into account its structure when folded (e.g., in a cell-free environment, usually a cell-free physiological environment), e.g., so as to provide the aldehyde tag at a solvent accessible site in the folded carrier protein. The solvent accessible aldehyde tag can thus be accessed in the folded, unconverted ald-tagged carrier protein so as to be accessible to an FGE for conversion of the serine or cysteine to an FGly and/or to a reactive partner reagent for conjugation to a drug of interest. Where an aldehyde tag is positioned at a solvent accessible site, in vitro FGE-mediated conversion and modification with a moiety by reaction with a reactive partner can be performed without the need to denature the protein. Solvent accessible sites can also include carrier protein regions that are exposed at an extracellular or intracellular cell surface when expressed in a host cell.

Accordingly, one or more aldehyde tags can be provided at sites independently selected from, for example, a solvent accessible N-terminus, a solvent accessible N-terminal region, a solvent accessible C-terminus, a solvent accessible C-terminal region, and/or a loop structure. In some embodiments, the aldehyde tag is positioned at a site other than the C-terminus of the polypeptide. In other embodiments, the polypeptide in which the aldehyde tag is positioned is a full-length polypeptide.

In other embodiments, an aldehyde tag is positioned at a site which is post-translationally modified in the parent carrier protein (e.g., a naturally-occurring site). For example, an aldehyde tag can be introduced at a site of glycosylation (e.g., N-glycosylation, O-glycosylation), phosphorylation, sulfation, ubiquitination, acylation, methylation, prenylation, hydroxylation, carboxylation, and the like in the native carrier protein. In addition or alternatively, the site of post-translational modification can be one that has been engineered (e.g., through recombinant techniques) and does not occur naturally in the carrier protein.

Nucleic and amino acid sequences of polypeptides suitable for use as ald-tagged carrier proteins as described herein are available in the art. For example, FIG. 3 provides the amino acid sequence and encoding nucleic acid sequence for human serum albumin (HSA). Once provided with the guidance of the present disclosure, the ordinarily skilled artisan can readily generate ald-tagged HSA useful in the methods and compositions disclosed herein. Exemplary ald-tagged HSA amino acid sequences are provided in FIG. 4, with the corresponding encoding nucleic acid sequences provided in FIGS. 5-9. The three-dimensional structure of HSA is provided in the top panel of FIG. 10.

Further exemplary ald-tagged carrier proteins include ald-tagged Fc fragments. FIG. 11 provides the amino acid sequences of exemplary ald-tagged mouse IgG1 Fc fragments having single and multiple ald tags, including exemplary ald-tagged Fc fragments containing an ald tag concatamer with two ald tags separated by a linker.

Ald-tagged Carrier Protein Libraries

As exemplified in the schematic of FIG. 1B, the carrier protein can be modified to contain an ald tag at different positions to provide a library composed of differently ald-tagged carrier proteins, e.g., ald-tagged carrier proteins having an ald-tag at one or more of the N-terminus, the C-terminus, an interior loop and the like. The members of the ald-tagged carrier protein library can contain 1, 2, 3, 4, 5, or more ald-tags. The library can be provided as a population of expression constructs encoding such ald-tagged carrier proteins for introduction into host cells for expression, e.g., a host cell that expresses a compatible FGE to provide for production of FGly-containing carrier proteins. Alternatively or in addition, the library can be provided as a population of recombinant host cells that are genetically modified to express the ald-tagged carrier protein and which, optionally, express a compatible FGE.
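One way to picture such a library is to enumerate, in silico, every combination of candidate tag sites up to a chosen number of tags per member. The Python sketch below does this by insertion at named positions; the function name, the site names, the placeholder sequence, and the limit of two tags per member are all hypothetical choices for illustration, and candidate sites would in practice be chosen with reference to the folded structure of the carrier protein.

```python
from itertools import combinations


def build_library(sequence, tag, sites, max_tags=2):
    """Enumerate tagged variants for every combination of 1..max_tags sites.

    `sites` maps a site name to a zero-based insertion index in the native
    sequence. Returns a dict of variant name -> tagged sequence.
    """
    ordered = sorted(sites.items(), key=lambda item: item[1])
    library = {}
    for k in range(1, max_tags + 1):
        for combo in combinations(ordered, k):
            variant = sequence
            # Insert from the highest index down so lower indices stay valid.
            for _, position in sorted(combo, key=lambda item: item[1], reverse=True):
                variant = variant[:position] + tag + variant[position:]
            library["+".join(name for name, _ in combo)] = variant
    return library


native = "AAAABBBBCCCC"  # placeholder sequence, for illustration only
sites = {"N-term": 0, "loop": 6, "C-term": len(native)}
for name, variant in build_library(native, "LCTPSR", sites).items():
    print(name, variant)
```

With three candidate sites and up to two tags per member, the sketch yields six variants (three singly tagged and three doubly tagged).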

Such libraries can serve as a “plug and play” system for reaction of the produced ald-tagged carrier proteins with a candidate drug having a reactive partner (e.g., an aminooxy or hydrazide moiety). The reaction products (drug-carrier protein conjugates) can then be screened for desired characteristics (e.g., biological activity of the drug, low immunogenicity of the conjugate, and the like).

Formylglycine Generating Enzymes (FGEs)

A formylglycine generating enzyme (FGE) is an enzyme that oxidizes cysteine or serine in a sulfatase motif to FGly. It should be noted that in general, the literature refers to FGly-generating enzymes that convert a cysteine (C) to FGly in a sulfatase motif as FGEs, and refers to enzymes that convert serine (S) to FGly in a sulfatase motif as AtsB-like. However, for purposes of the present disclosure “FGE” is used generically to refer to both types of FGly-generating enzymes, with the understanding that an appropriate FGE will be selected according to the sulfatase motif (i.e., C-containing or S-containing) present in the modified carrier protein.

In general, the FGE used to facilitate conversion of cysteine or serine to FGly in a sulfatase motif of an aldehyde tag of a carrier protein is selected according to the sulfatase motif present in the aldehyde tag. The FGE can be native to the host cell in which the aldehyde tagged carrier protein is expressed, or the host cell can be genetically modified to express an appropriate FGE. Eukaryotic sulfatases generally contain a cysteine in their sulfatase motif and are modified by the “SUMF1-type” FGE (Cosma et al. Cell 2003, 113, (4), 445-56; Dierks et al. Cell 2003, 113, (4), 435-44). Prokaryotic sulfatases generally contain either a cysteine or a serine in their sulfatase motif and are modified either by the “SUMF1-type” FGE or the “AtsB-type” FGE, respectively (Szameit et al. J Biol Chem 1999, 274, (22), 15375-81). An FGE has been described in Mycobacterium tuberculosis (see, e.g., GenBank Acc. No. NP_215226 (gi:15607852) and WO 2008/036350). FGEs have also been described in deuterostomia, including vertebrates and echinodermata (see, e.g., Pepe et al. (2003) Cell 113, 445-456, Dierks et al. (2003) Cell 113, 435-444; Cosma et al. (2004) Hum. Mutat. 23, 576-581). In some embodiments it may be desired to use a sulfatase motif compatible with a human FGE (e.g., the SUMF1-type FGE, see, e.g., Cosma et al. Cell 113, 445-56 (2003); Dierks et al. Cell 113, 435-44 (2003)), and express the aldehyde tagged protein in a human cell that expresses the FGE or in a host cell, usually a mammalian cell, genetically modified to express a human FGE.

In general, an FGE for use in the methods disclosed herein can be obtained from naturally occurring sources or synthetically produced. For example, an appropriate FGE can be derived from biological sources which naturally produce an FGE or which are genetically modified to express a recombinant gene encoding an FGE. Nucleic acids encoding a number of FGEs are known in the art and readily available (see, e.g., Preusser et al. 2005 J. Biol. Chem. 280(15):14900-10 (Epub 2005 Jan. 18); Fang et al. 2004 J Biol Chem. 279(15):14570-8 (Epub 2004 Jan. 28); Landgrebe et al. Gene. 2003 Oct. 16; 316:47-56; Dierks et al. 1998 FEBS Lett. 423(1):61-5; Dierks et al. Cell. 2003 May 16; 113(4):435-44; Cosma et al. (2003 May 16) Cell 113(4):445-56; Baenziger (2003 May 16) Cell 113(4):421-2 (review); Dierks et al. Cell. 2005 May 20; 121(4):541-52; Roeser et al. (2006 Jan. 3) Proc Natl Acad Sci USA 103(1):81-6; Sardiello et al. (2005 Nov. 1) Hum Mol Genet. 14(21):3203-17; WO 2004/072275; GenBank Accession No. NM_182760; and WO 2008/036350). Accordingly, the disclosure here provides for recombinant host cells genetically modified to express an FGE that is compatible for use with an aldehyde tag of a tagged carrier protein.

Where a cell-free method is used to convert a sulfatase motif-containing carrier protein, an isolated FGE can be used. Any convenient protein purification procedures may be used to isolate an FGE, see, e.g., Guide to Protein Purification, (Deutscher ed.) (Academic Press, 1990). For example, a lysate may be prepared from a cell that produces a desired FGE, and the FGE purified using HPLC, exclusion chromatography, gel electrophoresis, affinity chromatography, and the like.

Expression Vectors and Host Cells for Production of Aldehyde Tagged-Carrier Polypeptides

The present disclosure provides nucleic acid encoding ald-tagged carrier polypeptides, as well as constructs and host cells containing such nucleic acid. Such nucleic acids comprise a sequence of DNA having an open reading frame that encodes an aldehyde tagged carrier protein and, in most embodiments, is capable, under appropriate conditions, of being expressed. “Nucleic acid” encompasses DNA, cDNA, mRNA, and vectors comprising such nucleic acids.

Nucleic acids contemplated herein can be provided as part of a vector (also referred to as a construct), a wide variety of which are known in the art and need not be elaborated upon herein. Exemplary vectors include, but are not limited to, plasmids; cosmids; viral vectors (e.g., retroviral vectors); non-viral vectors; artificial chromosomes (YAC's, BAC's, etc.); mini-chromosomes; and the like. The choice of vector will depend upon a variety of factors such as the type of cell in which propagation is desired and the purpose of propagation.

Vectors can provide for extrachromosomal maintenance in a host cell or can provide for integration into the host cell genome. Vectors are amply described in numerous publications well known to those in the art, including, e.g., Short Protocols in Molecular Biology, (1999) F. Ausubel, et al., eds., Wiley & Sons. Vectors may provide for expression of the nucleic acids encoding a polypeptide of interest (e.g., an aldehyde tagged polypeptide, an FGE, etc.), may provide for propagating the subject nucleic acids, or both.

Exemplary vectors that may be used include but are not limited to those derived from recombinant bacteriophage DNA, plasmid DNA or cosmid DNA. For example, plasmid vectors such as pBR322, pUC 19/18, pUC 118, 119 and the M13 mp series of vectors may be used. Bacteriophage vectors may include λgt10, λgt11, λgt18-23, λZAP/R and the EMBL series of bacteriophage vectors. Cosmid vectors that may be utilized include, but are not limited to, pJB8, pCV 103, pCV 107, pCV 108, pTM, pMCS, pNNL, pHSG274, COS202, COS203, pWE15, pWE16 and the charomid 9 series of vectors. Alternatively, recombinant virus vectors may be engineered, including but not limited to those derived from viruses such as herpes virus, retroviruses, vaccinia virus, poxviruses, adenoviruses, adeno-associated viruses or bovine papilloma virus.

For expression of a polypeptide of interest, an expression cassette may be employed. Thus, the present invention provides a recombinant expression vector comprising a subject nucleic acid. The expression vector provides a transcriptional and translational regulatory sequence, and may provide for inducible or constitutive expression, where the coding region is operably linked under the transcriptional control of the transcriptional initiation region, and a transcriptional and translational termination region. These control regions may be native to the gene encoding the polypeptide (e.g., the carrier protein or the FGE), or may be derived from exogenous sources. In general, the transcriptional and translational regulatory sequences may include, but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop sequences, translational start and stop sequences, and enhancer or activator sequences. In addition to constitutive and inducible promoters, strong promoters (e.g., T7, CMV, and the like) find use in the constructs described herein, particularly where high expression levels are desired in an in vivo (cell-based) or in an in vitro expression system. Further exemplary promoters include mouse mammary tumor virus (MMTV) promoters, Rous sarcoma virus (RSV) promoters, adenovirus promoters, the promoter from the immediate early gene of human CMV (Boshart et al., Cell 41:521-530, 1985), and the promoter from the long terminal repeat (LTR) of RSV (Gorman et al., Proc. Natl. Acad. Sci. USA 79:6777-6781, 1982). The promoter can also be provided by, for example, a 5′UTR of a retrovirus.

Expression vectors generally have convenient restriction sites located near the promoter sequence to provide for the insertion of nucleic acid sequences encoding proteins of interest. A selectable marker operative in the expression host may be present to facilitate selection of cells containing the vector. In addition, the expression construct may include additional elements. For example, the expression vector may have one or two replication systems, thus allowing it to be maintained in organisms, for example in mammalian or insect cells for expression and in a prokaryotic host for cloning and amplification. In addition the expression construct may contain a selectable marker gene to allow the selection of transformed host cells. Selection genes are well known in the art and will vary with the host cell used.

Expression constructs encoding aldehyde tagged carrier proteins can also be generated using amplification methods (e.g., polymerase chain reaction (PCR)), where at least one amplification primer (i.e., at least one of a forward or reverse primer) includes a nucleic acid sequence encoding an aldehyde tag. For example, an amplification primer having an aldehyde tag-encoding sequence is designed to provide for amplification of a nucleic acid encoding a carrier protein of interest. The extension product that results from polymerase-mediated synthesis from the aldehyde tag-containing forward primer produces a nucleic acid amplification product encoding a fusion protein composed of an aldehyde tagged-carrier protein. The amplification product is then inserted into an expression construct of choice to provide an aldehyde tagged polypeptide expression construct.
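The primer design described above can be sketched as follows. This is a minimal illustration only: the six-residue tag LCTPSR, the codon choices, the helper names, and the carrier coding sequence are assumptions made for the example and are not taken from the present disclosure.

```python
# Hypothetical codon choices for an example six-residue aldehyde tag (LCTPSR).
# Any synonymous codons suited to the expression host could be substituted.
TAG_CODONS = {"L": "CTG", "C": "TGC", "T": "ACC", "P": "CCG", "S": "AGC", "R": "CGT"}


def tag_encoding_sequence(tag_peptide):
    """Return a DNA sequence encoding the aldehyde tag peptide."""
    return "".join(TAG_CODONS[residue] for residue in tag_peptide)


def forward_primer(tag_peptide, carrier_cds, anneal_len=18):
    """Build a forward primer: start codon, tag codons, then the region
    that anneals to the carrier protein coding sequence (CDS)."""
    if not carrier_cds.startswith("ATG"):
        raise ValueError("carrier CDS is expected to begin with a start codon")
    # Skip the carrier's own start codon so the tag is fused in frame after ATG.
    annealing_region = carrier_cds[3:3 + anneal_len]
    return "ATG" + tag_encoding_sequence(tag_peptide) + annealing_region


# Illustrative (made-up) carrier coding sequence fragment.
example_cds = "ATGAAATGGGTGACCTTTATTAGC"
primer = forward_primer("LCTPSR", example_cds, anneal_len=12)
```

In practice the annealing length, codon usage, and any flanking restriction sites would be chosen for the particular template and expression construct.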

Host Cells

Any of a number of suitable host cells can be used in the production of an aldehyde tagged carrier protein. The host cell used for production of an aldehyde tagged-carrier protein can optionally provide for FGE-mediated conversion, so that the polypeptide produced contains an FGly-containing aldehyde tag following expression and post-translational modification by FGE. Alternatively the host cell can provide for production of an unconverted aldehyde tagged carrier protein (e.g., due to lack of expression of an FGE that facilitates conversion of the aldehyde tag).

In general, the polypeptides described herein may be expressed in prokaryotes or eukaryotes in accordance with conventional ways, depending upon the purpose for expression. Thus, the present invention further provides a host cell, e.g., a genetically modified host cell that comprises a nucleic acid encoding an aldehyde tagged polypeptide. The host cell can further optionally comprise a recombinant FGE, which may be endogenous or heterologous to the host cell.

Host cells for production (including large scale production) of an unconverted or (where the host cell expresses a suitable FGE) converted aldehyde tagged carrier protein, or for production of an FGE (e.g., for use in a cell-free method) can be selected from any of a variety of available host cells. Exemplary host cells include those of a prokaryotic or eukaryotic unicellular organism, such as bacteria (e.g., Escherichia coli strains, Bacillus spp. (e.g., B. subtilis), and the like) and yeast or fungi (e.g., S. cerevisiae, Pichia spp., and the like). Host cells originally derived from a higher organism, such as insects or vertebrates, particularly mammals (e.g., CHO, HEK, and the like), may also be used as the expression host cells.

Specific expression systems of interest include bacterial, yeast, insect cell and mammalian cell derived expression systems. Representative systems from each of these categories are provided below.

The product can be recovered by any appropriate means known in the art. Further, any convenient protein purification procedures may be employed, where suitable protein purification methodologies are described in Guide to Protein Purification, (Deutscher ed.) (Academic Press, 1990). For example, a lysate may be prepared from a cell comprising the expression vector expressing the ald-tagged carrier protein, and purified using HPLC, exclusion chromatography, gel electrophoresis, affinity chromatography, and the like.

Methods for Conversion and Modification of an Aldehyde Tag

Conversion of an aldehyde tag present in an aldehyde tagged carrier protein can be accomplished by cell-based (in vivo) or cell-free methods (in vitro). Similarly, modification of a converted aldehyde tag of an aldehyde tagged polypeptide can be accomplished by cell-based (in vivo) or cell-free methods (in vitro). These are described in more detail below.

“In Vivo” Host Cells Conversion and Modification

Conversion of an aldehyde tag of an aldehyde tagged polypeptide can be accomplished by expression of the aldehyde tagged polypeptide in a cell that contains a suitable FGE. In this embodiment, conversion of the cysteine or serine of the aldehyde tag occurs during or following translation in the host cell. The FGE of the host cell can be endogenous to the host cell, or the host cell can be recombinant for a suitable FGE that is heterologous to the host cell. FGE expression can be provided by an expression system endogenous to the FGE gene (e.g., expression is provided by a promoter and other control elements present in the native FGE gene of the host cell), or can be provided by a recombinant expression system in which the FGE coding sequence is operably linked to a heterologous promoter to provide for constitutive or inducible expression.

Conditions suitable for use to accomplish conjugation of a reactive partner moiety to an aldehyde tagged polypeptide are similar to those described in Mahal et al. (1997 May 16) Science 276(5315):1125-8.

“In Vitro” (Cell-Free) Conversion and Modification

In vitro (cell-free) conversion of an aldehyde tag of an aldehyde tagged carrier protein can be accomplished by contacting an aldehyde tagged polypeptide with an FGE under conditions suitable for conversion of a cysteine or serine of a sulfatase motif of the aldehyde tag to an FGly. For example, nucleic acid encoding an aldehyde tagged polypeptide can be expressed in an in vitro transcription/translation system in the presence of a suitable FGE to provide for production of converted aldehyde tagged polypeptides.

Alternatively, unconverted aldehyde tagged carrier protein can be isolated following recombinant production in a host cell lacking a suitable FGE or following synthetic production. The isolated aldehyde tagged carrier protein is then contacted with a suitable FGE under conditions to provide for aldehyde tag conversion. The aldehyde tagged carrier protein can be unfolded by methods known in the art (e.g., using heat, adjustment of pH, chaotropic agents (e.g., urea, and the like), organic solvents (e.g., hydrocarbons: octane, benzene, chloroform), etc.) and the denatured protein contacted with a suitable FGE. The ald-tagged carrier protein can then be refolded under suitable conditions.

With respect to modification of converted aldehyde tagged carrier protein, modification is normally carried out in vitro. Converted aldehyde tagged carrier protein is isolated from a production source (e.g., recombinant host cell production, synthetic production), and contacted with a reactive partner-containing drug under conditions suitable to provide for conjugation of a moiety of the reactive partner to the FGly of the aldehyde tag.

Drugs for Conjugation to Ald-Tagged Carrier Proteins

Any of a number of drugs are suitable for use, or can be modified to be rendered suitable for use, as a reactive partner to conjugate to an ald tagged-carrier protein. Exemplary drugs include small molecule drugs and peptide drugs.

“Small molecule drug” as used herein refers to a compound, usually an organic compound, which exhibits a pharmaceutical activity of interest and which is generally of a molecular weight of no greater than about 800 Da, and usually no greater than 2000 Da, but can encompass molecules of up to 5 kDa and can be as large as about 10 kDa. A small inorganic molecule refers to a molecule containing no carbon atoms, while a small organic molecule refers to a compound containing at least one carbon atom.

“Peptide drug” as used herein refers to amino-acid containing polymeric compounds, and is meant to encompass naturally-occurring and non-naturally-occurring peptides, oligopeptides, cyclic peptides, polypeptides, and proteins, as well as peptide mimetics. The peptide drugs may be obtained by chemical synthesis or be produced from a genetically encoded source (e.g., recombinant source). Peptide drugs can range in molecular weight, and can be from 200 Da to 10 kDa or greater in molecular weight.

Glucagon-like peptide 1 (GLP-1), calcitonin, and biologically active fragments and variants thereof are exemplary peptide drugs. By “variant” is meant a polypeptide that has an amino acid sequence that is not found in nature, and includes polypeptides having one or more amino acid substitutions, insertions and/or deletions relative to a naturally-occurring parent polypeptide. “Variant” polypeptides thus encompass a polypeptide having an N- or C-terminal truncation relative to a parent polypeptide. A “fragment” of a polypeptide is one that shares an amino acid sequence of a naturally-occurring polypeptide, but that is truncated at the N-terminus, C-terminus or both relative to a naturally-occurring parent polypeptide.

GLP-1 is one of several naturally occurring incretin compounds that possess biologic activity when released from the gut during digestion. GLP-1 naturally works on several deficient organs to lower blood sugar levels. It is able to significantly delay the progression of Type 2 diabetes, and is useful in treatment of hyperglycemia. Currently, GLP-1 is less useful as a drug because it is broken down within minutes by the enzyme DPP-4, which is present throughout the human body. Coupling of GLP-1 to an ald-tagged carrier protein can provide for increased serum half-life. GLP-1 and biologically active fragments and variants thereof represent an exemplary peptide drug of interest for conjugation to an ald-tagged carrier protein of the present disclosure. Exemplary fragments and variants of GLP-1 include, but are not necessarily limited to, those described in Green et al. 2007 Best Pract Res Clin Endocrinol Metab 21:497-516; Brubaker et al. 2007 Trends Endocrinol Metab 18:240-245; Boyle et al. 2007 J Am Osteopath Assoc 107(Suppl):S10-S16; and Drucker et al. 2006 The incretin system: glucagon-like peptide-1 receptor agonists and dipeptidyl peptidase-4 inhibitors in type 2 diabetes. Lancet 368:1696-1705. Exemplary biologically active GLP-1 variants include those having, for example, an amino acid substitution at amino acid residues His(7), Ala(8), or Glu(9) of the native GLP-1 amino acid sequence. Specific examples include (D-His7)GLP-1, (D-Ala8)GLP-1, (Gly8)GLP-1, (Ser8)GLP-1, (Aha8)GLP-1, (Thr8)GLP-1, (Aib8)GLP-1, (Abu8)GLP-1, (Val8)GLP-1, (Asp9)GLP-1, (Ala9)GLP-1, (Pro9)GLP-1, (Phe9)GLP-1, and (Lys9)GLP-1. Specific exemplary biologically active GLP-1 variants are known as Exenatide, LY548806, CJC-1131, and Liraglutide.

Calcitonin and biologically active variants thereof represent an exemplary peptide drug of interest for conjugation to an ald-tagged carrier protein of the present disclosure. Calcitonin is a 32 amino-acid linear polypeptide hormone that is produced in humans primarily by the parafollicular cells (also known as C-cells) of the thyroid. Calcitonin has short absorption and elimination half-lives of 10-15 minutes and 50-80 minutes, respectively, and can be used therapeutically for the treatment of hypercalcaemia or osteoporosis. Conjugation of calcitonin to an ald-tagged carrier protein as disclosed herein can provide for enhanced serum half-life, and thus provide for a therapeutic that can be administered much less frequently than the peptide alone. Exemplary biologically active calcitonin variants include, but are not necessarily limited to, those described in Fowler et al. Proc Natl Acad Sci USA. 2005 Jul. 19; 102(29):10105-10.

The biological activity of drug conjugated to an ald-tagged carrier protein as disclosed herein can be assayed according to methods known in the art. Such conjugated drugs that retain at least one desired pharmacologic activity of the corresponding parent compound are of interest.

Methods for Modification of Drugs to Contain Reactive Partner for Reaction with 2-formylglycine

Peptide drugs to be conjugated to an ald-tagged carrier protein are modified to incorporate a reactive partner for reaction with an aldehyde of the FGly residue of the ald-tagged carrier protein. Since the methods of ald-tagged polypeptide modification are compatible with conventional chemical processes, any of a wide variety of commercially available reagents can be used to accomplish conjugation. For example, aminooxy, hydrazide, hydrazine, or thiosemicarbazide derivatives of a number of moieties of interest are suitable reactive partners, and are readily available or can be generated using standard chemical methods.

Where the drug is a peptide drug, the reactive moiety (e.g., aminooxy or hydrazide) can be positioned at an N-terminal region, the N-terminus, a C-terminal region, the C-terminus, or at a position internal to the peptide. FIG. 2 provides a schematic of an exemplary method for synthesizing a peptide drug having an aminooxy group. In this example, the peptide is synthesized from a Boc-protected precursor. An amino group of a peptide can react with a compound comprising a carboxylic acid group and oxy-N-Boc group. As shown in FIG. 2 for example, the amino group of the peptide reacts with 3-(2,5-dioxopyrrolidin-1-yloxy)propanoic acid. Other variations on the compound comprising a carboxylic acid group and oxy-N-protecting group can include different numbers of carbons in the alkylene linker and substituents on the alkylene linker. The reaction between the amino group of the peptide and the compound comprising a carboxylic acid group and oxy-N-protecting group occurs through standard peptide coupling chemistry.
Examples of peptide coupling reagents that can be used include, but are not limited to, DCC (dicyclohexylcarbodiimide), DIC (diisopropylcarbodiimide), di-p-toluoylcarbodiimide, BDP (1-benzotriazole diethylphosphate-1-cyclohexyl-3-(2-morpholinylethyl)carbodiimide), EDC (1-(3-dimethylaminopropyl)-3-ethylcarbodiimide hydrochloride), cyanuric fluoride, cyanuric chloride, TFFH (tetramethyl fluoroformamidinium hexafluorophosphate), DPPA (diphenylphosphorazidate), BOP (benzotriazol-1-yloxytris(dimethylamino)phosphonium hexafluorophosphate), HBTU (O-benzotriazol-1-yl-N,N,N′,N′-tetramethyluronium hexafluorophosphate), TBTU (O-benzotriazol-1-yl-N,N,N′,N′-tetramethyluronium tetrafluoroborate), TSTU (O-(N-succinimidyl)-N,N,N′,N′-tetramethyluronium tetrafluoroborate), HATU (N-[(dimethylamino)-1-H-1,2,3-triazolo[4,5,6]-pyridin-1-ylmethylene]-N-methylmethanaminium hexafluorophosphate N-oxide), BOP-Cl (bis(2-oxo-3-oxazolidinyl)phosphinic chloride), PyBOP ((1-H-1,2,3-benzotriazol-1-yloxy)-tris(pyrrolidino)phosphonium hexafluorophosphate), BrOP (bromotris(dimethylamino)phosphonium hexafluorophosphate), DEPBT (3-(diethoxyphosphoryloxy)-1,2,3-benzotriazin-4(3H)-one), and PyBrOP (bromotris(pyrrolidino)phosphonium hexafluorophosphate). In FIG. 2, HOBt and DIC are used as peptide coupling reagents.

Deprotection to expose the amino-oxy functionality is performed on the peptide comprising an N-protecting group. Deprotection of the N-oxysuccinimide group, for example, occurs according to standard deprotection conditions for a cyclic amide group. Deprotecting conditions can be found in Greene and Wuts, Protective Groups in Organic Synthesis, 3rd Ed., 1999, John Wiley & Sons, NY and Harrison et al. Certain deprotection conditions include a hydrazine reagent, amino reagent, or sodium borohydride. In FIG. 2, the deprotection of the Boc protecting group occurs with TFA. Other reagents for deprotection include, but are not limited to, hydrazine, methylhydrazine, phenylhydrazine, sodium borohydride, and methylamine. The product and intermediates can be purified by conventional means, such as HPLC purification.

The ordinarily skilled artisan will appreciate that factors such as pH and steric hindrance (i.e., the accessibility of the aldehyde tag to reaction with a reactive partner of interest) are of importance. Modifying reaction conditions to provide for optimal conjugation conditions is well within the skill of the ordinary artisan, and is routine in the art. In general, it is normally desirable to conduct conjugation reactions at a pH below 7, with a pH of about 5.5, about 6, about 6.5, usually about 5.5 being optimal. Where conjugation is conducted with an aldehyde tagged polypeptide present in or on a living cell, the conditions are selected so as to be physiologically compatible. For example, the pH can be dropped temporarily for a time sufficient to allow for the reaction to occur but within a period tolerated by the cell having an aldehyde tag (e.g., from about 30 min to 1 hour). Physiological conditions for conducting modification of aldehyde tagged polypeptides on a cell surface can be similar to those used in a ketone-azide reaction in modification of cells bearing cell-surface azides (see, e.g., U.S. Pat. No. 6,570,040).

Small molecule compounds containing, or modified to contain, an α-nucleophilic group that serves as a reactive partner with an aldehyde of an FGly of an ald tag are also contemplated for use as drugs in the carrier protein-drug conjugates of the present disclosure. General methods are known in the art for chemical synthetic schemes and conditions useful for synthesizing a compound of interest (see, e.g., Smith and March, March's Advanced Organic Chemistry: Reactions, Mechanisms, and Structure, Fifth Edition, Wiley-Interscience, 2001; or Vogel, A Textbook of Practical Organic Chemistry, Including Qualitative Organic Analysis, Fourth Edition, New York: Longman, 1978).

Thus small molecules having an aminooxy or hydrazide group for reaction with an aldehyde of an FGly of an ald-tagged carrier protein are available or can be readily synthesized. An aminooxy or hydrazide group can be installed onto a small molecule using standard synthetic chemistry techniques. FIG. 12 provides a schematic of an exemplary ald-tagged carrier protein (represented by HSA) modified by conjugation to a small molecule drug (represented by doxorubicin).

Peptide Drug—Carrier Protein Conjugates

The conjugates of the present disclosure are site-specifically decorated with covalently bound drug. The site-specificity of reaction of a reactive partner-containing drug with an aldehyde tag of the carrier protein allows for production of carrier proteins having multiple sites for chemical conjugation, thus providing a scaffold for production of carrier protein-drug conjugates having a desired drug payload per protein ratio. Moreover, the relative position of the ald tags in the ald-tagged carrier protein can be designed so as to provide for a desired presentation of covalently bound drug molecules on the surface of the final carrier protein-drug conjugate, thus allowing for control of spatial orientation of the displayed drug payload.

Further, the site-specific nature of chemical modification of ald tags to attach drug to the carrier protein can be exploited to provide for a composition composed of a substantially homogenous population of carrier protein-drug conjugates. Such carrier protein-drug conjugates can provide for control of the stoichiometry of drug delivery.

Carrier protein-drug conjugates of the present disclosure are composed of a carrier protein and one or more covalently bound drug molecules, where the carrier protein comprises a modified sulfatase motif of the formula:

X₁(FGly′)X₂Z₂X₃Z₃

where FGly′ is of the formula:

embedded image

wherein J¹ is the covalently bound drug;

n is a number selected from zero to 40;

Z₂ is a proline or alanine residue;

X₁ is present or absent and, when present, is any amino acid, with the proviso that when the sulfatase motif is at an N-terminus of the polypeptide, X₁ is present;

X₂ and X₃ are each independently any amino acid; and

Z₃ is a basic amino acid, and

wherein the carrier protein presents the covalently bound drug on a solvent-accessible surface when in a folded state. The X₁, X₂, Z₂, X₃, and Z₃ can be further defined as discussed above.
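As an illustration of the motif definition above, the following minimal sketch scans an unconverted carrier protein sequence for candidate sulfatase motifs. It assumes that the residue to be converted to FGly is a cysteine or serine, that the basic residue Z₃ is arginine, lysine, or histidine, and that sequences use one-letter amino acid codes; the function name and example sequences are hypothetical.

```python
import re

# Core of the unconverted motif: (C or S) X2 (P or A) X3 (basic residue).
# A lookahead is used so that overlapping candidates are all reported.
CORE_MOTIF = re.compile(r"(?=([CS][A-Z][PA][A-Z][RKH]))")


def find_sulfatase_motifs(protein_seq):
    """Return 0-based positions of the Cys/Ser residue of each candidate motif."""
    hits = []
    for match in CORE_MOTIF.finditer(protein_seq.upper()):
        position = match.start()
        # Per the proviso, X1 must be present when the motif is N-terminal,
        # so a core motif starting at the first residue is not reported.
        if position == 0:
            continue
        hits.append(position)
    return hits


# Illustrative (made-up) sequences.
print(find_sulfatase_motifs("MLCTPSRGG"))     # one candidate, Cys at index 2
print(find_sulfatase_motifs("GSAAAKLCTPSR"))  # Ser- and Cys-based candidates
```

A match indicates only that the sequence fits the pattern; whether a given site is converted by an FGE and is solvent-accessible in the folded protein would still need to be determined experimentally.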

As noted above, the ald-tagged carrier protein can be designed so as to provide for multiple sites for chemical conjugation, thus providing a scaffold for production of carrier protein-drug conjugates having a desired drug payload per protein ratio. The carrier protein-drug conjugates contemplated by the present disclosure generally include at least 2 modified sulfatase motifs having covalently bound drug molecules, and usually include 3 or more modified sulfatase motifs having covalently bound drug molecules. The carrier protein-drug conjugates of the present disclosure can provide for 4 or more, 5 or more, or 6 or more covalently bound drug molecules in the carrier protein-drug conjugate. Carrier protein-drug conjugates of the present disclosure thus include those having a drug payload to protein carrier ratio of at least 2:1, at least 3:1, at least 4:1, or at least 5:1, depending upon, for example, the size of the drug molecule relative to the size of the carrier protein and/or the available sites for insertion of an ald tag on the solvent-accessible surface area of the folded carrier protein.

As noted above, the relative position of the ald tags in the ald-tagged carrier protein can be designed so as to provide for a desired presentation of covalently bound drug molecules on the surface of the final carrier protein-drug conjugate. This feature allows for control of spatial orientation of the displayed drug payload on the surface of the final carrier protein-drug conjugate. Carrier protein-drug conjugates containing multiple ald tags, which may include concatameric ald tags separated by flexible linkers as described herein, can provide for greater drug payload:carrier protein ratios and enhanced presentation of drug to a physiological environment in which the carrier protein-drug conjugate is present. As such, the carrier protein-drug conjugates can be described as a modified carrier protein “decorated” with drug covalently bound to the carrier protein through an oxime or hydrazone linkage to the peptide backbone of the carrier protein.

For example, the ald tags of the carrier protein-drug conjugate can be positioned in the carrier protein-drug conjugate at at least one of an N-terminus of the carrier protein, a C-terminus of the carrier protein, and a solvent-accessible loop of the carrier protein. The ald tags can optionally be provided in connection with a linker, e.g., a flexible linker, as described above. The multiple ald tags can be localized to a particular region(s) of the carrier protein (e.g., provided in one or more of a solvent-accessible loop, N-terminal region (including N-terminus), C-terminal region (including C-terminus)), or can be distributed over the solvent-accessible surface area of the folded modified carrier protein.

In general, it may be desirable to space the ald tags of the ald-tagged carrier protein so that the final carrier protein-drug conjugate has covalently bound drug spaced apart at a distance sufficient to avoid interaction between the covalently bound drug molecules, e.g., so that drug molecules do not contact one another or otherwise interfere with their respective biological activities. The spatial orientation and positioning within the carrier protein will vary according to a variety of factors including the relative sizes of the drug to be conjugated and the carrier protein. FIG. 10, bottom panel, provides a schematic of the three-dimensional structure exemplifying an ald-tagged recombinant HSA having a covalently bound GLP-1 peptide at its N-terminus.

As noted above, the site-specific nature of chemical modification of ald tags to attach drug to the carrier protein can be exploited to provide for a composition composed of a substantially homogenous population of carrier protein-drug conjugates. Such carrier protein-drug conjugates can provide for control of the stoichiometry of drug delivery. Such homogenous populations of carrier protein-drug conjugates include those in which at least 60%, at least 70%, at least 80%, at least 90% or more of the carrier protein-drug conjugates of the population have the same drug payload to carrier protein ratio.
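The homogeneity criterion above can be expressed as a simple calculation. The sketch below assumes that the number of drug molecules per carrier protein has already been measured for a sample of conjugates (for example, by mass analysis); the function name and the example counts are hypothetical.

```python
from collections import Counter


def payload_homogeneity(drug_counts):
    """Return the most common drug:carrier ratio in a population and the
    fraction of conjugates that share it."""
    if not drug_counts:
        raise ValueError("at least one conjugate is required")
    tally = Counter(drug_counts)
    most_common_ratio, occurrences = tally.most_common(1)[0]
    return most_common_ratio, occurrences / len(drug_counts)


# Made-up population of ten conjugates: eight carry three drug molecules each.
ratio, fraction = payload_homogeneity([3, 3, 3, 3, 2, 3, 4, 3, 3, 3])
print(f"{fraction:.0%} of conjugates have a {ratio}:1 drug payload")
```

Under this measure, the example population would meet the "at least 80%" threshold but not the "at least 90%" threshold.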

Methods of Making Carrier Protein-drug Conjugates

Methods of conjugation of an FGly-containing ald-tagged carrier protein with a reactive partner-containing drug to provide a carrier protein-drug conjugate having a desired drug payload:carrier protein ratio are contemplated by the present disclosure. In general, such methods involve combining an FGly-containing, ald-tagged carrier protein with a reactive partner-containing drug (e.g., an aminooxy- or hydrazide-containing drug) in a reaction mixture under conditions suitable to promote reaction between the aldehyde(s) of the FGly(s) of the ald-tagged carrier protein with the reactive partner of the drug molecule(s), thereby producing a reaction product of a carrier protein-drug conjugate having drug covalently bound to the peptide backbone of the carrier protein through an oxime bond, a hydrazone bond, or a bond formed by other aldehyde-specific chemistries such as reductive amination or Wittig reactions.

After production of the ald-tagged carrier protein, it is isolated using any of a variety of techniques available in the art (e.g., chromatography, e.g., HPLC, FPLC, immunoaffinity purification, and the like). In some embodiments, the carrier protein of the carrier protein-drug conjugate contains an immunotag (e.g., His tag, FLAG tag), usually positioned at an N- or C-terminus to facilitate isolation and purification prior to conjugation with drug. The FGly-containing ald-tagged carrier protein for use in a conjugation reaction with drug can be provided in denatured form or can be folded prior to combining in the reaction mixture. Usually, the FGly-containing ald-tagged carrier protein is provided in folded form in the conjugation reaction mixture. Where obtained from cells expressing the ald-tagged carrier protein and a compatible FGE, the FGly-containing ald-tagged carrier protein can be isolated in folded form from cells or, where secreted, from culture supernatant. Where needed, methods for folding of proteins are available in the art, and can be readily applied to the methods here.

In general, the ald-tagged carrier protein having FGly residues is isolated, and usually is purified. The FGly-containing ald-tagged carrier protein is combined in a reaction mixture in buffered solution with a reactive partner-containing drug. The buffered solution can be at a physiological or near physiological pH, e.g., a pH of about 5 to 7, usually a pH of about 6.5. The reactive partner-containing drug is provided in the reaction mixture in excess to the aldehyde moieties of the FGly-containing ald-tagged carrier protein, usually at least 2 fold, 3 fold, 4 fold, 5 fold or more excess, in order to drive the reaction to completion. After addition of reactive partner-containing drug to the reaction mixture, the mixture is stirred under suitable conditions of time and temperature (e.g., at room temperature for about 2 h). The resulting carrier protein-drug conjugate is isolated from the reaction mixture and can be further purified using standard techniques (e.g., chromatography, e.g., HPLC, FPLC).
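The amount of reactive partner-containing drug needed to reach a given fold excess over aldehyde moieties follows from simple arithmetic, sketched below. The function name and the example quantities are hypothetical, and the calculation assumes that every ald tag on the carrier protein has been converted to FGly.

```python
def drug_amount_nmol(protein_nmol, tags_per_protein, fold_excess):
    """Return nmol of reactive partner-containing drug to add so that the drug
    is in the stated fold excess over FGly aldehyde groups."""
    if fold_excess < 1:
        raise ValueError("fold_excess should be at least 1")
    # Each converted ald tag contributes one aldehyde group.
    aldehyde_nmol = protein_nmol * tags_per_protein
    return aldehyde_nmol * fold_excess


# Example: 10 nmol of a carrier protein bearing 3 converted ald tags,
# with the drug supplied in 5-fold excess over aldehyde.
needed = drug_amount_nmol(protein_nmol=10, tags_per_protein=3, fold_excess=5)
print(f"Add {needed} nmol of drug")
```

If conversion to FGly is incomplete, the aldehyde amount would be lower than assumed here, so the effective excess would be correspondingly higher.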

Assessment of Carrier Protein-drug Conjugate Activity

Following isolation of a carrier protein-drug conjugate from a reaction mixture, the carrier protein-drug conjugate can be screened for activity in one or more assays. Such assays can be for one or more biological activities of the drug conjugated to the carrier protein-drug conjugate and/or for one or more characteristics of the carrier protein-drug conjugate (e.g., immunogenicity).

Methods for assessing immunogenicity are available in the art and can be adapted for use in assessing carrier protein-drug conjugates of the present disclosure. For example, the carrier protein-drug conjugate can be administered to a non-human animal (e.g., an animal that can serve as a model for a human immune response), and the immune response to the carrier protein-drug conjugate assessed. Carrier protein-drug conjugates can be assessed for their activity in eliciting a humoral and/or cellular immune response in a non-human animal. Of particular interest is the production of anti-carrier protein-drug conjugate antibodies by the immunized host. Methods for assessing antibody production in a host are well known in the art.

Methods for assessing activity of the drug conjugated to the carrier protein-drug conjugate are selected according to the drug bound to the carrier protein-drug conjugate and are available in the art. Such assays can be in vitro cell-free assays, in vitro cell-based assays, or in vivo assays (e.g., in an animal model). Usually the assay is a cell-based in vitro functional assay or an in vivo assay using a non-human animal model (e.g., an animal model of human disease).

For example, activity of a carrier protein-GLP-1 conjugate of the present disclosure can be assayed in a cellular receptor activity assay, as exemplified in the Example below. Activity of a carrier protein-calcitonin conjugate of the present disclosure can be assayed in a bone cell culture system to assess bone resorption of calcium.

Formulations

The carrier protein-drug conjugates of the present disclosure can be formulated in a variety of different ways. In general, the carrier protein-drug conjugate is formulated in a manner compatible with the drug conjugated to the carrier protein-drug conjugate, the condition to be treated, and the route of administration to be used.

The carrier protein-drug conjugate can be provided in any suitable form, e.g., in the form of a pharmaceutically acceptable salt, and can be formulated for any suitable route of administration, e.g., oral, topical or parenteral administration. Where the carrier protein-drug conjugate is provided as a liquid injectable (such as in those embodiments where they are administered intravenously or directly into a tissue), the carrier protein-drug conjugate can be provided as a ready-to-use dosage form, or as a reconstitutable storage-stable powder or liquid composed of pharmaceutically acceptable carriers and excipients.

Methods for formulating carrier protein-drug conjugates can be adapted from those available in the art. For example, carrier protein-drug conjugates can be provided in a pharmaceutical composition comprising an effective amount of a carrier protein-drug conjugate and a pharmaceutically acceptable carrier (e.g., saline). The pharmaceutical composition may optionally include other additives (e.g., buffers, stabilizers, preservatives, and the like). Of particular interest are formulations that are suitable for administration to a mammal, particularly those that are suitable for administration to a human.

Methods of Treatment

The carrier protein-drug conjugates of the present disclosure find use in treatment of a condition or disease in a subject that is amenable to treatment by administration of the parent drug (i.e., the drug prior to conjugation to the carrier protein). By “treatment” is meant that at least an amelioration of the symptoms associated with the condition afflicting the host is achieved, where amelioration is used in a broad sense to refer to at least a reduction in the magnitude of a parameter, e.g. symptom, associated with the condition being treated. As such, treatment also includes situations where the pathological condition, or at least symptoms associated therewith, are completely inhibited, e.g., prevented from happening, or stopped, e.g. terminated, such that the host no longer suffers from the condition, or at least the symptoms that characterize the condition. Thus treatment includes: (i) prevention, that is, reducing the risk of development of clinical symptoms, including causing the clinical symptoms not to develop, e.g., preventing disease progression to a harmful state; (ii) inhibition, that is, arresting the development or further development of clinical symptoms, e.g., mitigating or completely inhibiting an active disease; and/or (iii) relief, that is, causing the regression of clinical symptoms.

The subject to be treated can be one that is in need of therapy, where the host to be treated is one amenable to treatment using the parent drug. Accordingly, a variety of subjects may be amenable to treatment using the carrier protein-drug conjugates disclosed herein. Generally such subjects are “mammals”, with humans being of particular interest. Other subjects can include domestic pets (e.g., dogs and cats), livestock (e.g., cows, pigs, goats, horses, and the like), rodents (e.g., mice, guinea pigs, and rats, e.g., as in animal models of disease), as well as other primates (e.g., chimpanzees and monkeys).

The amount of carrier protein-drug conjugate administered can be initially determined based on guidance of a dose and/or dosage regimen of the parent drug. In general, the carrier protein-drug conjugates can provide for enhanced blood serum half-life of the bound drug, thus providing for at least one of reduced dose or reduced administrations in a dosage regimen. Thus the carrier protein-drug conjugates can provide for reduced dose and/or reduced administration in a dosage regimen relative to the parent drug prior to being conjugated in a carrier protein-drug conjugate of the present disclosure.

Furthermore, as noted above, because the carrier protein-drug conjugates can provide for controlled stoichiometry of drug delivery, dosages of carrier protein-drug conjugates can be calculated based on the number of drug molecules provided on a per carrier protein-drug conjugate basis.
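As an illustration of calculating a dosage on a per-conjugate basis, the following minimal sketch converts a target mass of bound drug into the mass of conjugate that carries it. The function name, the molecular weights, and the simplification that the conjugate mass is the carrier mass plus the mass of the attached drug molecules (linker mass ignored) are assumptions made for illustration only and are not taken from the disclosure.

```python
def conjugate_mass_for_drug_dose(drug_dose_mg, drug_mw, carrier_mw, drugs_per_conjugate):
    """Return the mass of conjugate (mg) carrying drug_dose_mg of bound drug.

    Assumes every conjugate molecule carries exactly drugs_per_conjugate drug
    molecules and approximates the conjugate molecular weight as
    carrier_mw + drugs_per_conjugate * drug_mw (linker mass ignored).
    """
    if drugs_per_conjugate < 1:
        raise ValueError("a conjugate must carry at least one drug molecule")
    conjugate_mw = carrier_mw + drugs_per_conjugate * drug_mw
    drug_mmol = drug_dose_mg / drug_mw                 # mg / (g/mol) = mmol
    conjugate_mmol = drug_mmol / drugs_per_conjugate   # n drug molecules per conjugate
    return conjugate_mmol * conjugate_mw               # mmol * (g/mol) = mg


# Hypothetical molecular weights (g/mol), chosen only to exercise the function.
DRUG_MW = 3300.0
CARRIER_MW = 66500.0

single_tag_mg = conjugate_mass_for_drug_dose(1.0, DRUG_MW, CARRIER_MW, 1)
triple_tag_mg = conjugate_mass_for_drug_dose(1.0, DRUG_MW, CARRIER_MW, 3)
print(f"1 drug per conjugate: {single_tag_mg:.2f} mg conjugate per mg drug")
print(f"3 drugs per conjugate: {triple_tag_mg:.2f} mg conjugate per mg drug")
```

Under these assumptions, a conjugate carrying more drug molecules delivers the same amount of drug with less total conjugate mass, which is only meaningful when the number of drug molecules per carrier is well defined.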

Accordingly, the carrier protein-drug conjugates of the present disclosure wherein the drug is GLP-1, or a biologically active variant thereof, can be used in treatment of conditions amenable to therapy by administration of GLP-1. Such conditions include Type II diabetes and hyperglycemia. Such methods involve administration of an effective amount of a carrier protein-GLP-1 conjugate (or a carrier protein-drug conjugate having a covalently bound variant of GLP-1) to a subject in need of treatment (e.g., a subject having or at risk of Type II diabetes and/or hyperglycemia), wherein administration of the carrier protein-drug conjugate is effective to treat the condition.

Where the drug of the carrier protein-drug conjugates of the present disclosure is Calcitonin, or a biologically active variant thereof, the conjugates can be used in treatment of conditions amenable to therapy by administration of Calcitonin. Such conditions include osteoporosis and hypercalcaemia. Such methods involve administration of an effective amount of a carrier protein-Calcitonin conjugate (or a carrier protein-drug conjugate having a covalently bound variant of Calcitonin) to a subject in need of treatment (e.g., a subject having or at risk of osteoporosis or hypercalcaemia), wherein administration of the carrier protein-drug conjugate is effective to treat the condition.

Kits and Systems

Kits and systems are provided to facilitate and, where desired, standardize the compositions of the invention and the uses thereof. Kits contemplated herein can include one or more of a construct encoding an aldehyde tagged carrier protein (and may encompass a library composed of constructs encoding a population of differently ald-tagged carrier proteins) for expression in a host cell; a host cell that produces an FGE compatible with an aldehyde tag of the kit, where the FGE may be endogenous, recombinant, or heterologous; a host cell genetically modified to express an aldehyde tagged carrier protein (and may encompass a library composed of recombinant host cells containing constructs encoding a population of differently ald-tagged carrier proteins), which host cell can further express an endogenous, recombinant, or heterologous FGE compatible for conversion of the aldehyde tag of the tagged polypeptide; reagents to provide for production of a reactive partner-containing drug; and reagents to promote a reaction between an ald-tagged carrier protein and a reactive partner-containing drug.

In addition, the kit can contain instructions for using the components of the kit, particularly the compositions of the invention that are contained in the kit.

EXAMPLES

The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how to make and use the present invention, and are not intended to limit the scope of what the inventors regard as their invention nor are they intended to represent that the experiments below are all or the only experiments performed. Efforts have been made to ensure accuracy with respect to numbers used (e.g. amounts, temperature, etc.) but some experimental errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, molecular weight is weight average molecular weight, temperature is in degrees Centigrade, and pressure is at or near atmospheric.

Example 1
Production of Panel of HSA Carrier Proteins

The ald-tag technology is used to provide for high-efficiency modification of secreted carrier proteins in a mammalian cell culture system. An FGE and sulfatase motifs are used to install aldehyde tags in a recombinant human serum albumin (rHSA) in a yeast expression system, e.g., Saccharomyces cerevisiae. The FGly-containing ald tag will be exploited to conjugate drugs (e.g., small molecule drugs) to the HSA carrier protein. HSA is trafficked through the secretory pathway, similar to native sulfatases and the previously studied secreted Fc proteins, and will therefore be readily recognized as a substrate by ER-resident FGE when expressed in mammalian cells. When expressed in yeast, the recombinant HSA containing the aldehyde tag motif is purified and reacted with purified recombinant FGE to convert the cysteine to formylglycine ex vivo; that is, purified FGE is added to the purified recombinant protein to effect the enzymatic transformation. The converted aldehyde tagged HSA was chemically modified with small molecules to afford a final conjugated protein construct.

C-terminal Modified HSA as a Model Protein

The vector construction utilized the native (human) HSA leader sequence. Modifications to the sequence were made after initial cloning of HSA into a plasmid. A strong promoter was used (for example, a galactose-inducible GAL promoter or a constitutively active GPD promoter with -LEU2). For example, the vector p425-GAL1 or the vector p425GalL has a galactose-inducible strong promoter in place with an MCS that can receive the HSA sequence.

Yeast cell lines containing the HSA-Ald₆tag construct were generated using standard molecular biology techniques. After establishing and confirming transformed colonies by PCR, the expression of the Ald₆tagged-HSA was induced and production assayed by immunoblotting. In order to probe directly the aldehyde-modified protein, rHSA was reacted with aminooxy-FLAG peptide and analyzed by Western blot with an anti-FLAG antibody as well as an anti-HSA antibody. The percent conversion of Cys to FGly was quantified by isolation of the protein followed by tryptic digestion and mass spectrometry analysis.
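The quantification of Cys to FGly conversion from mass spectrometry data can be sketched as follows. The sketch derives the expected mass difference between a Cys-containing and an FGly-containing tryptic peptide from the elemental compositions of the two residues, and then computes percent conversion as the FGly signal over the summed signal. The peak intensities are hypothetical, and the simple ratio assumes both peptide forms are detected with equal efficiency and that no other forms of the peptide are present; the disclosure does not specify the calculation used.

```python
# Monoisotopic masses of the elements (Da).
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.0030740,
                "O": 15.9949146, "S": 31.9720707}

# Elemental compositions of the residues as they occur within a peptide chain.
CYS_RESIDUE = {"C": 3, "H": 5, "N": 1, "O": 1, "S": 1}
FGLY_RESIDUE = {"C": 3, "H": 3, "N": 1, "O": 2}


def residue_mass(composition):
    """Sum the monoisotopic masses for an elemental composition."""
    return sum(MONOISOTOPIC[element] * count for element, count in composition.items())


def percent_conversion(fgly_intensity, cys_intensity):
    """Percent of tagged peptide detected in the FGly (aldehyde) form."""
    total = fgly_intensity + cys_intensity
    if total <= 0:
        raise ValueError("no signal for either peptide form")
    return 100.0 * fgly_intensity / total


# Mass shift expected for the tagged peptide on conversion of Cys to FGly.
mass_shift = residue_mass(FGLY_RESIDUE) - residue_mass(CYS_RESIDUE)

# Hypothetical extracted peak intensities for the two forms of the peptide.
conversion = percent_conversion(fgly_intensity=8.6e5, cys_intensity=1.4e5)
print(f"Cys -> FGly mass shift: {mass_shift:.4f} Da")
print(f"Conversion: {conversion:.1f}%")
```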

Example 2
Production of Carrier Proteins Having Multiple Ald Tags

Carrier proteins conjugated with multiple peptide drugs or small molecules can greatly enhance the efficacy of the biotherapeutic of interest. Thus multiple aldehyde tags will be installed into a single HSA carrier protein. An HSA carrier protein having Ald₆ tags (LCTPSR (SEQ ID NO:1)) placed in different locations along the peptide backbone of the carrier protein will be generated.

Three different Ald₆ tag sequences were appended to the recombinant HSA carrier protein. These constructs, containing Ald₆ sequences, are shown in FIG. 6 (depicting three uniquely tagged proteins). The constructs were expressed in S. cerevisiae, purified and reacted with purified M. tuberculosis FGE. Reaction conditions were optimized to maximize the conversion of the cysteine to FGly. The converted protein was purified and analyzed for the presence of FGly by reaction with aminooxy-biotin or aminooxy-FLAG peptide followed by Western blot. A CHO expression system was developed for the production of HSA, and E. coli expression systems can also be developed for production of ald-tagged rHSAs. The percent conversion of Cys to FGly for each individual tag in the series of multi-ald-tagged rHSAs is assessed by isolation of the protein followed by tryptic digestion and mass spectrometry analysis.

A panel of ald-tagged rHSA carrier proteins was generated, with differently ald-tagged rHSA carrier proteins having aldehydes placed strategically throughout the scaffold. The panel thus serves as a library of differently ald-tagged rHSAs, where the members of the library differ in the number and/or position of ald tags in the scaffold. For example, as exemplified in FIG. 1B, one simple panel was generated where the members include an ald-tagged rHSA having an ald tag positioned at the N-terminus, an ald-tagged rHSA having an ald tag positioned at the C-terminus, and an ald-tagged rHSA having an ald tag positioned within a loop of the protein. Another library includes these members, as well as ald-tagged rHSAs having ald tags at both the N- and C-termini, at the N-terminus and a solvent-accessible loop, at the C-terminus and a solvent-accessible loop, and at each of the N-terminus, the C-terminus and a solvent-accessible loop. FIG. 10 provides the three-dimensional structure of HSA, which can be used for guidance in selecting sites for ald-tag insertion. Exemplary carrier protein-drug conjugates of HSA having a peptide drug positioned at exemplary solvent-accessible sites on HSA are provided in the bottom portion of FIG. 10.
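The membership of such a library can be enumerated mechanically. The short sketch below lists every non-empty combination of three candidate tag positions, which reproduces the three singly tagged members of the simple panel plus the doubly and triply tagged members of the larger library described above. The site labels and function name are illustrative only.

```python
from itertools import combinations

# Candidate ald-tag positions named in the text above.
TAG_SITES = ("N-terminus", "C-terminus", "solvent-accessible loop")


def enumerate_library(sites):
    """Return every non-empty combination of tag positions, smallest first."""
    return [combo
            for size in range(1, len(sites) + 1)
            for combo in combinations(sites, size)]


library = enumerate_library(TAG_SITES)
for member in library:
    print(f"{len(member)} tag(s): " + " + ".join(member))
```

For n candidate positions the library has 2^n - 1 members, so the number of constructs grows quickly if further insertion sites are considered.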

Example 3
Conjugation of Peptides to the Protein Scaffold

Short serum half-life has been a challenge in the development of peptide therapeutics. Peptides are typically cleared from the bloodstream within minutes to hours after administration, and thus may not be sufficiently exposed in the target tissue for a desired clinical effect. Aldehyde tagged carrier proteins, such as ald-tagged HSA, can be used to increase the serum half-life of the peptides.

Two carrier protein-drug conjugates are generated—one a conjugate with Calcitonin and one a conjugate with GLP-1. As mentioned previously, by coupling the peptides to the HSA carrier protein the absorption and elimination half-lives will be increased.

The peptides were synthesized via standard Fmoc-based solid phase peptide synthesis protocols. The final residue added at the N terminus was (t-Boc-aminooxy)acetic acid followed by cleavage under standard conditions. Deprotection to expose the amino-oxy functionality is followed by HPLC purification. Purified ald-tagged HSA is added to a buffered solution of peptide that has been functionalized with an N-terminus amino-oxy functionality. Upon coupling to the ald-tagged HSA, the final protein-peptide complex is purified using FPLC.

Example 4
Assessment of HSA-GLP-1 Conjugates

The HSA-GLP-1 conjugate is assayed for activity as compared to native GLP-1. GLP-1, released from intestinal L-cells, is known for its potent stimulation of insulin biosynthesis and release from pancreatic β-cells. For the identification of GLP-1 receptor agonists, a cellular receptor activation assay based on the formation of cAMP occurring due to receptor activation is used. Receptor activation studies are performed by incubating RINm5F cells, a rat insulinoma cell line, with or without the test peptides or the HSA-peptide conjugates at increasing concentrations. Activation of the GLP-1 receptor is measured by quantification of the intracellular cAMP after cell lysis. EC₅₀ values (concentration of test compound leading to a half maximal stimulation of cAMP production) are calculated from the resulting dose response curves.
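One simple way to obtain an EC₅₀ estimate from a dose response curve is sketched below: the half maximal response is located between the two bracketing concentrations and interpolated on a log-concentration scale. This is a simplified stand-in for the curve fitting (for example, a four-parameter logistic fit) commonly applied to such data; the disclosure does not state which method is used, and the data here are synthetic values generated from an assumed EC₅₀ purely to exercise the function.

```python
import math


def ec50_by_interpolation(concentrations, responses):
    """Estimate EC50 by log-linear interpolation at the half maximal response.

    Takes the lowest and highest observed responses as the bottom and top of
    the curve, so the tested range must reach both plateaus.
    """
    pairs = sorted(zip(concentrations, responses))
    bottom = min(r for _, r in pairs)
    top = max(r for _, r in pairs)
    half = bottom + (top - bottom) / 2.0
    for (c_lo, r_lo), (c_hi, r_hi) in zip(pairs, pairs[1:]):
        if r_lo <= half <= r_hi and r_hi > r_lo:
            fraction = (half - r_lo) / (r_hi - r_lo)
            log_ec50 = math.log10(c_lo) + fraction * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ec50
    raise ValueError("response never crosses the half maximal level")


# Synthetic cAMP responses from a simple saturation curve (assumed EC50 = 1 nM).
ASSUMED_EC50 = 1e-9
test_concentrations = [10.0 ** exponent for exponent in range(-12, -5)]  # 1 pM to 1 uM
camp_responses = [c / (c + ASSUMED_EC50) for c in test_concentrations]

estimated_ec50 = ec50_by_interpolation(test_concentrations, camp_responses)
print(f"Estimated EC50: {estimated_ec50:.2e} M")
```

Comparing the value obtained for the HSA-peptide conjugate with that of the free peptide, using the same procedure for both, gives a relative measure of retained receptor activation.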

Example 5
Assessment of HSA-Calcitonin Conjugates

The HSA-Calcitonin conjugate is assayed for osteoclast activity as compared to native Calcitonin. The BD BioCoat Osteologic Bone Cell Culture System is used to assess the effect of treatment with the HSA-peptide conjugate and the native Calcitonin peptide on bone resorption of calcium. The BD BioCoat Osteologic Bone Cell Culture System involves sub-micron synthetic calcium phosphate thin films coated onto various culture vessels. This system has been used as an alternative method for compound screening for direct assessment of osteoclast and osteoblast activity in vitro. The thin film design permits easy and reliable quantification of results.

Example 6
Construction of Aldehyde Tagged Human Serum Albumin Carrier Proteins

The following describes production of an exemplary ald-tagged HSA.

A. Primer Design:

A nucleic acid encoding the wildtype HSA was inserted into a vector which can be exploited as a template for subcloning. Using a vector with the appropriate internal restriction sites, the first PCR product was:

- -xmaI-----HSA-stop_codon-----xhoI-

After insertion of this sequence in a vector, variants were made using longer primers such as:

- -xmaI------HSA-6×his-stop_codon-----xhoI-
- -xmaI------HSA-LCTPSR-stop_codon-----xhoI

The following primers were used for PCR cDNA amplification.

Forward:

(SEQ ID NO: 98)

5′-AATCCCGGGATGAAGTGGGTAACCTTTATTTCCC-3′

Reverse:

(SEQ ID NO: 99)

5′-TGACTCGAGTTATAAGCCTAAGGCAGCTTGACTTG-3′

The double underline represents the native sequence, with the single underline the newly introduced restriction sites for further cloning.
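The restriction sites carried by the two primers can be confirmed with a short check. The sketch below searches each primer for the XmaI (CCCGGG) and XhoI (CTCGAG) recognition sequences and reads the reverse primer on the sense strand to confirm that a stop codon sits immediately upstream of the XhoI site. The helper names are illustrative; the primer sequences are those listed above.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Recognition sequences of the two enzymes used for cloning.
RECOGNITION_SITES = {"XmaI": "CCCGGG", "XhoI": "CTCGAG"}

FORWARD_PRIMER = "AATCCCGGGATGAAGTGGGTAACCTTTATTTCCC"   # SEQ ID NO: 98
REVERSE_PRIMER = "TGACTCGAGTTATAAGCCTAAGGCAGCTTGACTTG"  # SEQ ID NO: 99


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence."""
    return sequence.upper().translate(COMPLEMENT)[::-1]


def find_sites(sequence, sites=RECOGNITION_SITES):
    """Map each enzyme whose site is present to the 0-based position of the site."""
    sequence = sequence.upper()
    return {name: sequence.find(site) for name, site in sites.items() if site in sequence}


forward_sites = find_sites(FORWARD_PRIMER)
reverse_sites = find_sites(REVERSE_PRIMER)

# The start codon should follow the XmaI site directly in the forward primer.
start_after_xmai = FORWARD_PRIMER[forward_sites["XmaI"] + 6:][:3] == "ATG"

# On the sense strand, the reverse primer should read ...TAA (stop) then CTCGAG.
sense_strand_end = reverse_complement(REVERSE_PRIMER)
stop_before_xhoi = "TAACTCGAG" in sense_strand_end

print(forward_sites, reverse_sites, start_after_xmai, stop_before_xhoi)
```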

A 1830 bp fragment was isolated after gel purification. This was followed by digestion with XmaI and XhoI, and the DNA fragment inserted in the expression vector. The open reading frame for HSA in the expression vector was as follows.

HSA-Encoding Nucleic Acid Sequence:

(SEQ ID NO: 100)

aatcccgggatgaagtgggtaacctttatttcccttctttttctctt

tagctcggcttattccaggggtgtgatcgtcgagatgcacacaagag

tgaggttgctcatcggataaagatttgggagaagaaaatttcaaagc

caggtgttgattgcctttgctcagtatcttcagcagtgtccatttga

agatcatgtaaaattagtgaatgaagtaactgaatttgcaaaaacat

gtgttgctgatgagtcagctgaaaattgtgacaaatcacttcatacc

ctttttggagacaaattatgcacagttgcaactcttcgtgaaaccta

tggtgaaatggctgactgctgtgcaaaacaagaacctgagagaaatg

aatgcttcttgcaacacaaagatgacaacccaaacctcccccgattg

gtgagaccagaggttgatgtgatgtgcactgatttcatgacaatgaa

gagacatattgaaaaaatacttatatgaaattgccagaagacatcct

tacttttatgccccggaactccttttctttgctaaaaggtataaagc

tgcttttacagaatgttgccaagctgctgataaagctgcctgcctga

gccaaagctcgatgaacttcgggatgaagggaaggcttcgtctgcca

aacagagactcaagtgtgccagtctccaaaaatttggagaaagagct

ttcaaagcatgggcagtagctcgcctgagccagagatttcccaaagc

tgagtttgcagaagtttccaagttagtgacagatcttaccaaagtcc

acacggaatgctgccatggagatctgcttgaatgtgctgatgacagg

gcggaccagccaagtatatctgtgaaaatcaagattcgatctccagt

aaactgaaggaatgctgtgaaaaacctctgttggaaaaatcccactg

cattgccgaagtggaaaatgatgagatgcctgctgacttgccttcat

tagctgctgattttgttgaaagtaaggatgtttgcaaaaactatgct

gaggcaaaggatgtcttcctgggcatgtttttgtatgaatatgcaag

aaggcatcctgattactctgtcgtgctgctgctgagacttgccaaga

catatgaaaccactctagagaagtgctgtgccgctgcagatcctcat

gaatgctatgccaaagtgacgatgaatttaaacctcttgtggaagag

cctcagaatttaatcaaacaaaattgtgagctattgagcagcttgga

gagtacaaattccagaatgcgctattagttcgttacaccaagaaagt

accccaagtgtcaactccaactcagtagaggtctcaagaaacctagg

aaaagtgggcagcaaatgagtaaacatcctgaagcaaaaagaatgcc

ctgtgcagaagactatctatccgtggtcctgaaccagttatgtgtga

gcatgagaaaacgccagtaagtgacagagtcaccaaatgctgcacag

aatccttggtgaacaggcgaccatgcttttcagctctggaagtcgat

gaaacatacgttcccaaagagtttaatgctgaaacattcaccaccat

gcagatatatgcacactactgagaaggagagacaaatcaagaaacaa

actgcacttgttgagctcgtgaaacacaagcccaaggcaacaaaaga

gcaactgaaagctgttatggatgatttcgcagcttttgtagagaagt

gctgcaaggctgacgataaggagacctgctttgccgaggagggtaaa

aaacttgttgctgcaagtcaagctgccttaggcttaactcgag

Amino Acid Sequence of Encoded HSA

The HSA native leader sequence (single and double underlined residues) is removed in a 2-step process (in humans) before secretion of the mature protein:

(SEQ ID NO: 101)

NPGMKWVTFI SLLFLFSSAY SRGVFRRDAH KSEVAHRFKD

LGEENFKALV LIAFAQYLQQ CPFEDHVKLV NEVTEFAKTC

VADESAENCD KSLHTLFGDK LCTVATLRET YGEMADCCAK

QEPERNECFL QHKDDNPNLP RLVRPEVDVM CTAFHDNEET

FLKKYLYEIA RRHPYFYAPE LLFFAKRYKA AFTECCQAAD

KAACLLPKLD ELRDEGKASS AKQRLKCASL QKFGERAFKA

WAVARLSQRF PKAEFAEVSK LVTDLTKVHT ECCHGDLLEC

ADDRADLAKY ICENQDSISS KLKECCEKPL LEKSHCIAEV

ENDEMPADLP SLAADFVESK DVCKNYAEAK DVFLGMFLYE

YARRHPDYSV VLLLRLAKTY ETTLEKCCAA ADPHECYAKV

FDEFKPLVEE PQNLIKQNCE LFEQLGEYKF QNALLVRYTK

KVPQVSTPTL VEVSRNLGKV GSKCCKHPEA KRMPCAEDYL

SVVLNQLCVL HEKTPVSDRV TKCCTESLVN RRPCFSALEV

DETYVPKEFN AETFTFHADI CTLSEKERQI KKQTALVELV

KHKPKATKEQ LKAVMDDFAA FVEKCCKADD KETCFAEEGK

KLVAASQAAL GLTR

B. Construction of C-terminal Modified HSA

Using the plasmid with native HSA as a PCR template a new 3′ HSA PCR primer with additional restriction sites for appending C-terminal tags onto recombinant HSA was designed as follows:

(SEQ ID NO: 102)

5′-ATACTCGAGTTAGTCGACTTCAAGCTTTAAGCCTAAGGCAGCTTG

ACTTG-3′

Double underline: native C-terminus of HSA sequence.

Single underline adjacent double underline: HindIII site

Bold residues=Stop codon

Single underline 3′ of stop codon: SalI site

Single underline 5′ of stop codon: XhoI site

The SalI and HindIII sites were provided in the primer as these are not in the plasmid constructs. Used in conjunction with the same Forward primer used for original HSA amplification from cDNA, an 1863 residue PCR product was obtained as follows (with the predicted amino acid sequence following):

(SEQ ID NO: 103)

aatcccgggatgaagtgggtaaccatatacccttctattctctttagctcggcttattccagggg

tgtgatcgtcgagatgcacacaagagtgaggagctcatcggtttaaagatagggagaagaaaatt

tcaaagccaggtgagattgcctttgctcagtatcttcagcagtgtccatttgaagatcatgtaaa

attagtgaatgaagtaactgaatttgcaaaaacatgtgttgctgatgagtcagctgaaaattgtg

acaaatcacttcataccctattggagacaaattatgcacagagcaactcttcgtgaaacctatgg

tgaaatggctgactgctgtgcaaaacaagaacctgagagaaatgaatgcacttgcaacacaaaga

tgacaacccaaacctcccccgattggtgagaccagaggttgatgtgatgtgcactgcattcatga

caatgaagagacatttagaaaaaatacttatatgaaattgccagaagacatccttactatatgcc

ccggaactccattattgctaaaaggtataaagctgctatacagaatgagccaagctgctgataaa

gctgcctgcctgagccaaagctcgatgaacttcgggatgaagggaaggcttcgtctgccaaacag

agactcaagtgtgccagtctccaaaaatttggagaaagagctttcaaagcatgggcagtagctcg

cctgagccagagatttcccaaagctgagtttgcagaagtaccaagttagtgacagatcttaccaa

agtccacacggaatgctgccatggagatctgcttgaatgtgctgatgacagggcggaccagccaa

gtatatctgtgaaaatcaagattcgatctccagtaaactgaaggaatgctgtgaaaaacctctgt

tggaaaaatcccactgcattgccgaagtggaaaatgatgagatgcctgctgacttgccttcatta

gctgctgattagttgaaagtaaggatgatgcaaaaactatgctgaggcaaaggatgtcttcctgg

gcatgatagtatgaatatgcaagaaggcatcctgattactctgtcgtgctgctgctgagacttgc

caagacatatgaaaccactctagagaagtgctgtgccgctgcagatcctcatgaatgctatgcca

aagtgacgatgaatttaaacctcagtggaagagcctcagaatttaatcaaacaaaattgtgagct

attgagcagcaggagagtacaaattccagaatgcgctattagttcgttacaccaagaaagtaccc

caagtgtcaactccaactcagtagaggtctcaagaaacctaggaaaagtgggcagcaaatgagta

aacatcctgaagcaaaaagaatgccctgtgcagaagactatctatccgtggtcctgaaccagtta

tgtgtgagcatgagaaaacgccagtaagtgacagagtcaccaaatgctgcacagaatccaggtga

acaggcgaccatgctatcagctctggaagtcgatgaaacatacgacccaaagagataatgctgaa

acattcaccaccatgcagatatatgcacactactgagaaggagagacaaatcaagaaacaaactg

cacttgttgagctcgtgaaacacaagcccaaggcaacaaaagagcaactgaaagctgttatggat

gatttcgcagcttttgtagagaagtgctgcaaggctgacgataaggagacctgctagccgaggag

ggtaaaaaacttgagctgcaagtcaagctgccttaggcttaaagcttgaagtcgactaactcgag

ata

(SEQ ID NO: 104)

NPGMKWVTFI SLLFLFSSAY SRGVFRRDAH KSEVAHRFKD LGEENFKALV LIAFAQYLQQ

CPFEDHVKLV NEVTEFAKTC VADESAENCD KSLHTLFGDK LCTVATLRET YGEMADCCAK

QEPERNECFL QHKDDNPNLP RLVRPEVDVM CTAFHDNEET FLKKYLYEIA RRHPYFYAPE

LLFFAKRYKA AFTECCQAAD KAACLLPKLD ELRDEGKASS AKQRLKCASL QKFGERAFKA

WAVARLSQRF PKAEFAEVSK LVTDLTKVHT ECCHGDLLEC ADDRADLAKY ICENQDSISS

KLKECCEKPL LEKSHCIAEV ENDEMPADLP SLAADFVESK DVCKNYAEAK DVFLGMFLYE

YARRHPDYSV VLLLRLAKTY ETTLEKCCAA ADPHECYAKV FDEFKPLVEE PQNLIKQNCE

LFEQLGEYKF QNALLVRYTK KVPQVSTPTL VEVSRNLGKV GSKCCKHPEA KRMPCAEDYL

SVVLNQLCVL HEKTPVSDRV TKCCTESLVN RRPCFSALEV DETYVPKEFN AETFTFHADI

CTLSEKERQI KKQTALVELV KHKPKATKEQ LKAVMDDFAA FVEKCCKADD KETCFAEEGK

KLVAASQAAL GLKLEVDLEI

The product was digested with XmaI and XhoI and inserted into the expression vector, then digested with HindIII and SalI (sequentially) for insertion of a synthetic piece of double-stranded DNA with complementary sticky ends. The sequence of the synthetic DNA encoding the HSA having an ald tag LCTPSR (SEQ ID NO:1) at the C terminus is provided below (with the predicted amino acid sequence following):

(SEQ ID NO: 105)

aacccgggcatgaaatgggtgacctttattagcctgctgtttctgtttagcagcgcgtatagccg

cggcgtgtttcgccgcgatgcgcataaaagcgaagtggcgcatcgctttaaagatctgggcgaag

aaaactttaaagcgctggtgctgattgcgtttgcgcagtatctgcagcagtgcccgatgaagatc

atgtgaaactggtgaacgaagtgaccgaatttgcgaaaacctgcgtggcggatgaaagcgcggaa

aactgcgataaaagcctgcataccctgtaggcgataaactgtgcaccgtggcgaccctgcgcgaa

acctatggcgaaatggcggattgctgcgcgaaacaggaaccggaacgcaacgaatgctactgcag

cataaagatgataacccgaacctgccgcgcctggtgcgcccggaagtggatgtgatgtgcaccgc

gatcatgataacgaagaaacctactgaaaaaatatctgtatgaaattgcgcgccgccatccgtat

tatatgcgccggaactgctgttttttgcgaaacgctataaagcggcgtttaccgaatgctgccag

gcggcggataaagcggcgtgcctgctgccgaaactggatgaactgcgcgatgaaggcaaagcgag

cagcgcgaaacagcgcctgaaatgcgcgagcctgcagaaatttggcgaacgcgcgtttaaagcgt

gggcggtggcgcgcctgagccagcgctttccgaaagcggaatttgcggaagtgagcaaactggtg

accgatctgaccaaagtgcataccgaatgctgccatggcgatctgctggaatgcgcggatgatcg

cgcggatctggcgaaatatatttgcgaaaaccaggatagcattagcagcaaactgaaagaatgct

gcgaaaaaccgctgctggaaaaaagccattgcattgcggaagtggaaaacgatgaaatgccggcg

gatctgccgagcctggcggcggattttgtggaaagcaaagatgtgtgcaaaaactatgcggaagc

gaaagatgtgtttctgggcatgtttctgtatgaatatgcgcgccgccatccggattatagcgtgg

tgctgctgctgcgcctggcgaaaacctatgaaaccaccctggaaaaatgctgcgcggcggcggat

ccgcatgaatgctatgcgaaagtgatgatgaatttaaaccgctggtggaagaaccgcagaacctg

attaaacagaactgcgaactgatgaacagctgggcgaatataaatttcagaacgcgctgctggtg

cgctataccaaaaaagtgccgcaggtgagcaccccgaccctggtggaagtgagccgcaacctggg

caaagtgggcagcaaatgctgcaaacatccggaagcgaaacgcatgccgtgcgcggaagattatc

tgagcgtggtgctgaaccagctgtgcgtgctgcatgaaaaaaccccggtgagcgatcgcgtgacc

aaatgctgcaccgaaagcctggtgaaccgccgcccgtgctttagcgcgctggaagtggatgaaac

ctatgtgccgaaagaatttaacgcggaaacctttacctttcatgcggatatttgcaccctgagcg

aaaaagaacgccagattaaaaaacagaccgcgctggtggaactggtgaaacataaaccgaaagcg

accaaagaacagctgaaagcggtgatggatgattagcggcgtagtggaaaaatgctgcaaagcgg

atgataaagaaacctgctttgcggaagaaggcaaaaaactgctgtgcaccccgagccgcgtggat

ctggaaatt

(SEQ ID NO: 106)

NPGMKWVTFI SLLFLFSSAY SRGVFRRDAH KSEVAHRFKD LGEENFKALV LIAFAQYLQQ

CPFEDHVKLV NEVTEFAKTC VADESAENCD KSLHTLFGDK LCTVATLRET YGEMADCCAK

QEPERNECFL QHKDDNPNLP RLVRPEVDVM CTAFHDNEET FLKKYLYEIA RRHPYFYAPE

LLFFAKRYKA AFTECCQAAD KAACLLPKLD ELRDEGKASS AKQRLKCASL QKFGERAFKA

WAVARLSQRF PKAEFAEVSK LVTDLTKVHT ECCHGDLLEC ADDRADLAKY ICENQDSISS

KLKECCEKPL LEKSHCIAEV ENDEMPADLP SLAADFVESK DVCKNYAEAK DVFLGMFLYE

YARRHPDYSV VLLLRLAKTY ETTLEKCCAA ADPHECYAKV FDEFKPLVEE PQNLIKQNCE

LFEQLGEYKF QNALLVRYTK KVPQVSTPTL VEVSRNLGKV GSKCCKHPEA KRMPCAEDYL

SVVLNQLCVL HEKTPVSDRV TKCCTESLVN RRPCFSALEV DETYVPKEFN AETFTFHADI

CTLSEKERQI KKQTALVELV KHKPKATKEQ LKAVMDDFAA FVEKCCKADD KETCFAEEGK

KLLCTPSRVD LEI

The plasmid encoding recombinant HSA was further modified to include the FGE motif at the C-terminus. Primers, designed for the insertion of FGE motif and thrombin-cleavable affinity tag at C-terminus, were ligated into the vector using standard molecular biology techniques. The insert design was as follows:

(SEQ ID NO: 107)

HinDIII-LCTPSR-LVPRGS-PstI-HHHHHH-SalI

(SEQ ID NO: 108)

5′ AGCTTCTTTGTACCCCTAGCAGGCTGGTGCCGCGCGGCAGCCTGCAG

CATCATCACCACCATCACG

(SEQ ID NO: 109)

5′ AGAAACATGGGGATCGTCCGACCACGGCGCGCCGTCGGACGTCGTAG

TAGTGGTGGTAGTGCAGCT

The PstI site allows for detection of the insert via diagnostic digestion instead of sequencing each miniprep. LVPRGS is a thrombin cleavage site. The ORF translates to:

(SEQ ID NO: 110)

MKWTFISLLF LFSSAYSRGV FRRDAHKSEV AHRFKDLGEE

NFKALVLIAF AQYLQQCPFE DHVKLVNEVT EFAKTCVADE

SAENCDKSLH TLFGDKLCTVA TLRETYGEMA DCCAKQEPER

NECFLQHKDD NPNLPRLVRP EVDVMCTAFH DNEETFLKKY

LYEIARRHPY FYAPELLFFAK RYKAAFTECC QAADKAACLLP

KLDELRDEGK ASSAKQRLKCA SLQKFGERA FKAWAVARLSQ

RFPKAEFAEV SKLVTDLTKV HTECCHGDLL ECADDRADLA

KYICENQDSI SSKLKECCEK PLLEKSHCIA EVENDEMPAD

LPSLAADFVE SKDVCKNYAE AKDVFLGMFL YEYARRHPDY

SVVLLLRLAK TYETTLEKCC AAADPHECYAK VFDEFKPLVE

EPQNLIKQNCE LFEQLGEYKFQ NALLVRYTKK VPQVSTPTLV

EVSRNLGKVG SKCCKHPEAK RMPCAEDYLS VVLNQLCVLH

EKTPVSDRVT KCCTESLVNR RPCFSALEVD ETYVPKEFNA

ETFTFHADIC TLSEKERQIK KQTALVELVK HKPKATKEQL

KAVMDDFAAF VEKCCKADDK ETCFAEEGKK LVAASQAALG

LKLLCTPSRLV PRGSLQHHHH HHVD
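The insert design (HindIII-LCTPSR-LVPRGS-PstI-HHHHHH-SalI) can be checked by translating the upper-strand oligonucleotide of SEQ ID NO: 108 with the standard genetic code. The sketch below assumes the reading frame implied by the C-terminal modified HSA sequence above, in which the HindIII site AAGCTT is read as the codons AAG CTT (Lys-Leu), so that the first two bases of the oligonucleotide complete the AAG codon. The function and variable names are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): amino_acid
               for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna):
    """Translate DNA in frame 1, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Upper-strand oligonucleotide (SEQ ID NO: 108), beginning with the HindIII overhang.
UPPER_OLIGO = ("AGCTTCTTTGTACCCCTAGCAGGCTGGTGCCGCGCGGCAGCCTGCAG"
               "CATCATCACCACCATCACG")

# Assumed frame: the leading "AG" completes the AAG (Lys) codon of the HindIII site.
in_frame = UPPER_OLIGO[2:]
peptide = translate(in_frame)

print(peptide)
print("ald tag present:", "LCTPSR" in peptide)
print("thrombin site present:", "LVPRGS" in peptide)
print("PstI site present:", "CTGCAG" in UPPER_OLIGO)
```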

The construct was expressed in S. cerevisiae and purified using affinity column purification (FIG. 13). The modified protein was reacted with purified FGE to convert the cysteine to formylglycine. The recombinant HSA was reacted with a fluorophore containing a hydrazide, and the conversion and conjugation were quantified by measuring the resulting fluorescence of the modified protein (FIG. 13).

C. Construction of N-terminal Modified HSA

Generating the N-terminal aldehyde tagged HSA was accomplished by inserting an in-frame synthetic gene where the N-terminus of mature HSA was modified with the aldehyde tag. The synthetic gene was cloned into a yeast expression vector using standard molecular biology techniques. The designed sequence is as follows. Bold/underline=restriction site arrays

aaacgatg = Kozak (Shine-Dalgarno) sequence

aagtgggtaacctttatttcccttctattctctttagctcggcttattccaggggtgtgtttcgtcga

(SEQ ID NO: 111) = prepro region (removed from mature protein)

caccatcatcaccaccatcac (SEQ ID NO: 112) = 7xHIS tag

ctggtgccgcgcggcagc (SEQ ID NO: 113) = thrombin recognition site

ctttgtacccctagcagg (SEQ ID NO: 114) = LCTPSR motif (SEQ ID NO: 1)

ggaggc = diglycine linker

taa = stop codon

(SEQ ID NO: 115)

aaataaagcttcccgggggatccaaacgatgaagtgggtaacctttatttcccttctttttctcttta

gctcggcttattccaggggtgtgtttcgtcgagatgcacacaagcaccatcatcaccaccatcacctg

gtgccgcgcggcagcctttgtacccctagcaggggaggcagtgaggttgctcatcggtttaaagattt

gggagaagaaaatttcaaagccttggtgttgattgcctttgctcagtatcttcagcagtgtccatttg

aagatcatgtaaaattagtgaatgaagtaactgaatttgcaaaaacatgtgttgctgatgagtcagct

gaaaattgtgacaaatcacttcataccctttttggagacaaattatgcacagttgcaactcttcgtga

aacctatggtgaaatggctgactgctgtgcaaaacaagaacctgagagaaatgaatgcttcttgcaac

acaaagatgacaacccaaacctcccccgattggtgagaccagaggttgatgtgatgtgcactgctttt

catgacaatgaagagacatttttgaaaaaatacttatatgaaattgccagaagacatccttactttta

tgccccggaactccttttctttgctaaaaggtataaagctgcttttacagaatgttgccaagctgctg

ataaagctgcctgcctgagccaaagctcgatgaacttcgggatgaagggaaggcttcgtctgccaaac

agagactcaagtgtgccagtctccaaaaatttggagaaagagctttcaaagcatgggcagtagctcgc

ctgagccagagatttcccaaagctgagtttgcagaagtttccaagttagtgacagaccttaccaaagt

ccacacggaatgctgccatggagacctgcttgaatgtgctgatgacagggcggaccttgccaagtata

tctgtgaaaatcaagattcgatctccagtaaactgaaggaatgctgtgaaaaacctctgttggaaaaa

tcccactgcattgccgaagtggaaaatgatgagatgcctgctgacttgccttcattagctgctgattt

tgttgaaagtaaggatgtttgcaaaaactatgctgaggcaaaggatgtcttcctgggcatgtttttgt

atgaatatgcaagaaggcatcctgattactctgtcgtgctgctgctgagacttgccaagacatatgaa

accactctagagaagtgctgtgccgctgcagatcctcatgaatgctatgccaaagtgttcgatgaatt

taaacctcttgtggaagagcctcagaatttaatcaaacaaaattgtgagctttttgagcagcttggag

agtacaaattccagaatgcgctattagttcgttacaccaagaaagtaccccaagtgtcaactccaact

cttgtagaggtctcaagaaacctaggaaaagtgggcagcaaatgttgtaaacatcctgaagcaaaaag

aatgccctgtgcagaagactatctatccgtggtcctgaaccagttatgtgtgttgcatgagaaaacgc

cagtaagtgacagagtcaccaaatgctgcacagaatccttggtgaacaggcgaccatgcttttcagct

ctggaagtcgatgaaacatacgttcccaaagagtttaatgctgaaacattcaccttccatgcagatat

atgcacactttctgagaaggagagacaaatcaagaaacaaactgcacttgttgagctcgtgaaacaca

agcccaaggcaacaaaagagcaactgaaagctgttatggatgatttcgcagcttttgtagagaagtgc

tgcaaggctgacgataaggagacctgctttgccgaggagggtaaaaaacttgttgctgcaagtcaagc

tgccttaggcttataatgaattcgtcgacctcgaggatatcacaag

The expected ORF product was

(SEQ ID NO: 116)

MKWVTFISLL FLFSSAYSRG VFRRDAHKHH HHHHHLVPRG

SLCTPSRGGS EVAHRFKDLG EENFKALVLI AFAQYLQQCP

FEDHVKLVNE VTEFAKTCVA DESAENCDKS LHTLFGDKLC

TVATLRETYG EMADCCAKQE PERNECFLQH KDDNPNLPRL

VRPEVDVMCT AFHDNEETFL KKYLYEIARR HPYFYAPELL

FFAKRYKAAF TECCQAADKA ACLLPKLDEL RDEGKASSAK

QRLKCASLQK FGERAFKAWA VARLSQRFPKA EFAEVSKLVT

DLTKVHTECC HGDLLECADDRADLAKYICE NQDSISSKLK

ECCEKPLLEK SHCIAEVEND EMPADLPSLA ADFVESKDVC

KNYAEAKDVF LGMFLYEYAR RHPDYSVVLL LRLAKTYETT

LEKCCAAADP HECYAKVFDE FKPLVEEPQN LIKQNCELFE

QLGEYKFQNA LLVRYTKKVP QVSTPTLVEV SRNLGKVGSK

CCKHPEAKRM PCAEDYLSVV LNQLCVLHEK TPVSDRVTKC

CTESLVNRRP CFSALEVDET YVPKEFNAET FTFHADICTL

SEKERQIKKQ TALVELVKHK PKATKEQLKA VMDDFAAFVE

KCCKADDKET CFAEEGKKLV AASQAALGL

The construct was expressed in S. cerevisiae and purified using affinity column purification. The modified protein was reacted with purified FGE to convert the cysteine to formylglycine. The recombinant HSA was reacted with a fluorophore containing a hydrazide, and the conversion and conjugation were quantified by measuring the resulting fluorescence of the modified protein.

D. Construction of Internal Modified HSA

Generating the internal aldehyde tagged HSA was accomplished by inserting an in-frame synthetic gene where key restriction sites are placed where mature HSA is to be modified with the aldehyde tag. The synthetic gene was cloned into a yeast expression vector using standard molecular biology techniques. The designed sequence was as follows:

(SEQ ID NO: 117)

CGAAGGATCCAAACGATGAAGTGGGTAACCTTTATTTCCCTTCTTTTTC

TCTTTAGCTCGGCTTATTCCAGGGGTGTGTTTCGTCGAGATGCACACAA

GAGTGAGGTTGCTCATCGGTTTAAAGATTTGGGAGAAGAAAATTTCAAA

GCCTTGGTGTTGATTGCCTTTGCTCAGTATCTTCAGCAGTGTCCATTTG

AAGATCATGTAAAATTAGTGAATGAAGTAACTGAATTTGCAAAAACATG

TGTTGCTGATGAGTCAGCTGAAAATTGTGACAAATCACTTCATACCCTT

TTTGGAGACAAATTATGCACAGTTGCAACTCTTCGTGAAACCTATGGTG

AAATGGCTGACTGCTGTGCAAAACAAGAACCTGAGAGAAATGAATGCTT

CTTGCAACACAAAGATGACAACCCAAACCTCCCCCGATTGGTGAGACCA

GAGGTTGATGTGATGTGCACTGCTTTTCATGACAATGAAGAGACATTTT

TGAAAAAATACTTATATGAAATTGCCAGAAGACATCCTTACTTTTATGC

CCCGGAACTCCTTTTCTTTGCTAAAAGGTATAAAGCTGCTTTTACAGAA

TGTTGCCAAGCTGCTGATAAAGCTGCCTGCCTGTTGCCAAAGCTCGATG

AACTTCGGGATGAAGGGAAGGCTTCGTCTGCCAAACAGAGACTCAAGTG

TGCCAGTCTCCAAAAATTTGGAGAAAGAGCTTTCAAAGCATGGGCAGTA

GCTCGCCTGAGCCAGAGATTTCCCAAGGCTGAGTTTGCAGAAGTTTCCA

AGTTAGTGACTGACCTTACCAAAGTCCACACGGAATGCTGTCACGGAGA

CCTGCTTGAATGTGCTGATGACAGGGCGGACCTTGCCAAGTATATCTGT

GAAAATCAAGATTCGATCTCCAGTAAACTGAAGGAATGCTGTGAAAAAC

CTCTGTTGGAAAAATCCCACTGCATTGCCGAAGTGGAAAATGATGAGAT

GCCTGCTCTCGAGCCTTCTACTAGTGCTGATTTTGTTGAAAGTAAGGAT

GTTTGCAAAAACTATGCTGAGGCAAAGGATGTCTTCCTGGGCATGTTTT

TGTATGAATATGCAAGAAGGCATCCTGATTACTCTGTCGTGCTGCTGCT

GAGACTTGCCAAGACATATGAAACCACACTTGAGAAGTGCTGTGCCGCC

GCTGATCCTCATGAATGCTATGCCAAAGTGTTCGATGAATTTAAACCTC

TTGTGGAAGAGCCTCAGAATTTAATCAAACAAAATTGTGAGCTTTTTGA

GCAGCTTGGAGAGTACAAATTCCAGAATGCGCTATTAGTTCGTTACACC

AAGAAAGTACCCCAAGTGTCAACTCCAACTCTTGTAGAGGTCTCAAGAA

ACCTAGGAAAAGTGGGCAGCAAATGTTGTAAACATCCTGAAGCAAAAAG

AATGCCCTGTGCAGAAGACTATCTATCCGTGGTCCTGAACCAGTTATGT

GTGTTGCATGAGAAAACGCCAGTAAGTGACAGAGTCACCAAATGCTGCA

CAGAATCCTTGGTGAACAGGCGACCATGCTTTTCAGCTCTGGAAGTCGA

TGAAACATACGTTCCCAAAGAGTTTAATGCTGAAACATTCACCTTCCAT

GCAGATATATGCACACTTTCTGAGAAGGAGAGACAAATCAAGAAACAAA

CTGCACTTGTTGAGCTTGTGAAACACAAGCCCAAGGCAACAAAAGAGCA

ACTGAAAGCTGTTATGGATGATTTCGCAGCTTTTGTAGAGAAGTGCTGC

AAGGCTGACGATAAGGAGACCTGCTTTGCCGAGGAGGGTAAAAAACTTG

TTGCTGCAAGTCAAGCTGCCTTAGGCTTACCCGGGTCTACTCCGCGGCT

GGTGCCGCGCGGCAGCCTTCAACATCATCACCACCATCACGTCGACTAA

TGGAATTCCCTA

The expected ORF was:

(SEQ ID NO: 118)

MKWVTFISLL FLFSSAYSRG VFRRDAHKSE VAHRFKDLGE

ENFKALVLIA FAQYLQQCPF EDHVKLVNEV TEFAKTCVAD

ESAENCDKSL HTLFGDKLCT VATLRETYGE MADCCAKQEP

ERNECFLQHK DDNPNLPRLV RPEVDVMCTA FHDNEETFLK

KYLYEIARRH PYFYAPELLF FAKRYKAAFT ECCQAADKAA

CLLPKLDELR DEGKASSAKQ RLKCASLQKF GERAFKAWAV

ARLSQRFPKA EFAEVSKLVT DLTKVHTECC HGDLLECADD

RADLAKYICE NQDSISSKLK ECCEKPLLEK SHCIAEVEND

EMPALEPSTS ADFVESKDVC KNYAEAKDVF LGMFLYEYAR

RHPDYSVVLL LRLAKTYETTL EKCCAAADPH ECYAKVFDEF

KPLVEEPQNL IKQNCELFEQ LGEYKFQNAL LVRYTKKVPQ

VSTPTLVEVS RNLGKVGSKC CKHPEAKRMP CAEDYLSVVL

NQLCVLHEKT PVSDRVTKCC TESLVNRRPC FSALEVDETY

VPKEFNAETF TFHADICTLS EKERQIKKQT ALVELVKHKP

KATKEQLKAV MDDFAAFVEK CCKADDKETC FAEEGKKLVA

ASQAALGLPG STPRLVPRGS LQHHHHHHVD

The 6×His-HSA synthetic gene was ligated into the pCR blunt II-TOPO vector, followed by digestion of pRW33 with EcoRI and BamHI to cut out 6×His-HSA, which was purified and then ligated into pcDNA3.1 using the EcoRI and BamHI sites. The resulting vector was digested with XhoI/SpeI and the annealed primers:

(SEQ ID NO: 119)

5′-CTAGCCTTTGTACCCCTAGCAGGG-3′

and

(SEQ ID NO: 120)

5′-CTAGCCCTGCTAGGGGTACAAAGA-3′

were ligated in, generating the aldehyde tag. The designed sequence was as follows:

(SEQ ID NO: 121)

CCGGAACTCCTTTTCTTTGCTAAAAGGTATAAAGCTGCTTTTACAGAAT

GTTGCCAAGCTGCTGATAAAGCTGCCTGCCTGTTGCCAAAGCTCGATGA

ACTTCGGGATGAAGGGAAGGCTTCGTCTGCCAAACAGAGACTCAAGTGT

GCCAGTCTCCAAAAATTTGGAGAAAGAGCTTTCAAAGCATGGGCAGTAG

CTCGCCTGAGCCAGAGATTTCCCAAGGCTGAGTTTGCAGAAGTTTCCAA

GTTAGTGACTGACCTTACCAAAGTCCACACGGAATGCTGTCACGGAGAC

CTGCTTGAATGTGCTGATGACAGGGCGGACCTTGCCAAGTATATCTGTG

AAAATCAAGATTCGATCTCCAGTAAACTGAAGGAATGCTGTGAAAAACC

TCTGTTGGAAAAATCCCACTGCATTGCCGAAGTGGAAAATGATGAGATG

CCTGCTCTCGATCTTTGTACCCCTAGCAGGGCTACTAGTGCTGATTTTG

TTGAAAGTAAGGATGTTTGCAAAAACTATGCTGAGGCAAAGGATGTCTT

CCTGGGCATGTTTTTGTATGAATATGCAAGAAGGCATCCTGATTACTCT

GTCGTGCTGCTGCTGAGACTTGCCAAGACATATGAAACCACACTTGAGA

AGTGCTGTGCCGCCGCTGATCCTCATGAATGCTATGCCAAAGTGTTCGA

TGAATTTAAACCTCTTGTGGAAGAGCCTCAGAATTTAATCAAACAAAAT

TGTGAGCTTTTTGAGCAGCTTGGAGAGTACAAATTCCAGAATGCGCTAT

TAGTTCGTTACACCAAGAAAGTACCCCAAGTGTCAACTCCAACTCTTGT

AGAGGTCTCAAGAAACCTAGGAAAAGTGGGCAGCAAATGTTGTAAACAT

CCTGAAGCAAAAAGAATGCCCTGTGCAGAAGACTATCTATCCGTGGTCC

TGAACCAGTTATGTGTGTTGCATGAGAAAACGCCAGTAAGTGACAGAGT

CACCAAATGCTGCACAGAATCCTTGGTGAACAGGCGACCATGCTTTTCA

GCTCTGGAAGTCGATGAAACATACGTTCCCAAAGAGTTTAATGCTGAAA

CATTCACCTTCCATGCAGATATATGCACACTTTCTGAGAAGGAGAGACA

AATCAAGAAACAAACTGCACTTGTTGAGCTTGTGAAACACAAGCCCAAG

GCAACAAAAGAGCAACTGAAAGCTGTTATGGATGATTTCGCAGCTTTTG

TAGAGAAGTGCTGCAAGGCTGACGATAAGGAGACCTGCTTTGCCGAGGA

GGGTAAAAAACTTGTTGCTGCAAGTCAAGCTGCCTTAGGCTTACCCGGG

TCTACTCCGCGGCTGGTGCCGCGCGGCAGCCTTCAACATCATCACCACC

ATCACGTCGACTAATGGAATTCCCTA

The expected ORF product was:

(SEQ ID NO: 122)

MKWVTFISLL FLFSSAYSRG VFRRDAHKSE VAHRFKDLGE

ENFKALVLIA FAQYLQQCPF EDHVKLVNEV TEFAKTCVAD

ESAENCDKSL HTLFGDKLCT VATLRETYGE MADCCAKQEP

ERNECFLQHK DDNPNLPRLV RPEVDVMCTA FHDNEETFLK

KYLYEIARRH PYFYAPELLF FAKRYKAAFT ECCQAADKAA

CLLPKLDELR DEGKASSAKQ RLKCASLQKF GERAFKAWAV

ARLSQRFPKA EFAEVSKLVT DLTKVHTECC HGDLLECADD

RADLAKYICE NQDSISSKLK ECCEKPLLEK SHCIAEVEND

EMPALDLCTP SRATSADFVE SKDVCKNYAE AKDVFLGMFL

YEYARRHPDY SVVLLLRLAK TYETTLEKCC AAADPHECYA

KVFDEFKPLV EEPQNLIKQN CELFEQLGEY KFQNALLVRY

TKKVPQVSTP TLVEVSRNLG KVGSKCCKHP EAKRMPCAED

YLSVVLNQLC VLHEKTPVSD RVTKCCTESL VNRRPCFSAL

EVDETYVPKE FNAETFTFHA DICTLSEKER QIKKQTALVE

LVKHKPKATK EQLKAVMDDF AAFVEKCCKA DDKETCFAEE

GKKLVAASQA ALGLPGSTPR LVPRGSLQHH HHHHVD
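As a quick consistency check between the designed nucleotide sequence (SEQ ID NO:121) and the expected ORF product (SEQ ID NO:122), the region spanning the ligated insert can be translated in silico. The sketch below is illustrative only: the helper name `translate` is hypothetical, the 48-nucleotide fragment is copied from the SEQ ID NO:121 listing above, and the reading frame is assumed to begin at the first base of that fragment.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Fragment of SEQ ID NO:121 around the XhoI/SpeI insert (frame assumed).
fragment = "CCTGCTCTCGATCTTTGTACCCCTAGCAGGGCTACTAGTGCTGATTTT"
peptide = translate(fragment)
print(peptide)              # PALDLCTPSRATSADF
print("LCTPSR" in peptide)  # True
```

The translated fragment agrees with the ...EMPALDLCTP SRATSADFVE... stretch of SEQ ID NO:122, in which the LCTPSR tag replaces the EPS residues of the untagged construct (SEQ ID NO:118).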

6×His-LCTPSR-HSA (HHHHHHLCTPSR (SEQ ID NO:127)) was expressed in and purified from CHO cells. 6×His-LCTPSR-HSA (SEQ ID NO:127) in pcDNA3.1 was transfected into CHO cells in Opti-MEM serum-free medium using Lipofectin transfection reagent in a 10 cm dish. After 3 h at 37° C., the Opti-MEM medium was removed and 10 mL of Ham's F12 serum-free medium was added. After 3 days at 37° C., the media was collected, and 10 mL Binding Buffer (20 mM Na2PO4, 500 mM NaCl, 20 mM Imidazole, pH 7.5) and 200 μl of Ni-NTA resin were added. After incubating with rotation for 1 h at 4° C., the mixture was applied to a column. The resin was washed with 5 mL Binding Buffer and then eluted with 5×500 μl Elution Buffer (20 mM Na2PO4, 500 mM NaCl, 500 mM Imidazole, pH 7.5). The samples were run on 10% Tris-HCl gels and either stained with Coomassie or transferred to nitrocellulose for immunoblotting with an anti-His antibody to verify the presence of protein.

E. Construction of HSA Modified with Two Aldehyde Tags: Internally Modified and C-terminally Modified HSA

The vector containing recombinant HSA was digested with XmaI/SacII, and the annealed primers 5′-CCGGACTTTGTACCCCTAGCAGGGGGC-3′ (SEQ ID NO:123) and 5′-CCCCTGCTAGGGGTACAAAGT-3′ (SEQ ID NO:124) were ligated in, resulting in the insertion of the aldehyde tag. The designed sequence was as follows:

(SEQ ID NO: 125)

GAGACATTTTTGAAAAAATACTTATATGAAATTGCCAGAAGACATCCTT

ACTTTTATGCCCCGGAACTCCTTTTCTTTGCTAAAAGGTATAAAGCTGC

TTTTACAGAATGTTGCCAAGCTGCTGATAAAGCTGCCTGCCTGTTGCCA

AAGCTCGATGAACTTCGGGATGAAGGGAAGGCTTCGTCTGCCAAACAGA

GACTCAAGTGTGCCAGTCTCCAAAAATTTGGAGAAAGAGCTTTCAAAGC

ATGGGCAGTAGCTCGCCTGAGCCAGAGATTTCCCAAGGCTGAGTTTGCA

GAAGTTTCCAAGTTAGTGACTGACCTTACCAAAGTCCACACGGAATGCT

GTCACGGAGACCTGCTTGAATGTGCTGATGACAGGGCGGACCTTGCCAA

GTATATCTGTGAAAATCAAGATTCGATCTCCAGTAAACTGAAGGAATGC

TGTGAAAAACCTCTGTTGGAAAAATCCCACTGCATTGCCGAAGTGGAAA

ATGATGAGATGCCTGCTCTCGATCTTTGTACCCCTAGCAGGGCTACTAG

TGCTGATTTTGTTGAAAGTAAGGATGTTTGCAAAAACTATGCTGAGGCA

AAGGATGTCTTCCTGGGCATGTTTTTGTATGAATATGCAAGAAGGCATC

CTGATTACTCTGTCGTGCTGCTGCTGAGACTTGCCAAGACATATGAAAC

CACACTTGAGAAGTGCTGTGCCGCCGCTGATCCTCATGAATGCTATGCC

AAAGTGTTCGATGAATTTAAACCTCTTGTGGAAGAGCCTCAGAATTTAA

TCAAACAAAATTGTGAGCTTTTTGAGCAGCTTGGAGAGTACAAATTCCA

GAATGCGCTATTAGTTCGTTACACCAAGAAAGTACCCCAAGTGTCAACT

CCAACTCTTGTAGAGGTCTCAAGAAACCTAGGAAAAGTGGGCAGCAAAT

GTTGTAAACATCCTGAAGCAAAAAGAATGCCCTGTGCAGAAGACTATCT

ATCCGTGGTCCTGAACCAGTTATGTGTGTTGCATGAGAAAACGCCAGTA

AGTGACAGAGTCACCAAATGCTGCACAGAATCCTTGGTGAACAGGCGAC

CATGCTTTTCAGCTCTGGAAGTCGATGAAACATACGTTCCCAAAGAGTT

TAATGCTGAAACATTCACCTTCCATGCAGATATATGCACACTTTCTGAG

AAGGAGAGACAAATCAAGAAACAAACTGCACTTGTTGAGCTTGTGAAAC

ACAAGCCCAAGGCAACAAAAGAGCAACTGAAAGCTGTTATGGATGATTT

CGCAGCTTTTGTAGAGAAGTGCTGCAAGGCTGACGATAAGGAGACCTGC

TTTGCCGAGGAGGGTAAAAAACTTGTTGCTGCAAGTCAAGCTGCCTTAG

GCTTACCCGGACTTTGTACCCCTAGCAGGGGGCGGCTGGTGCCGCGCGG

CAGCCTTCAACATCATCACCACCATCACGTCGACTAATGGAATTCCCTA

The expected ORF product was:

(SEQ ID NO: 126)

MKWVTFISLL FLFSSAYSRG VFRRDAHKSE VAHRFKDLGE

ENFKALVLIA FAQYLQQCPF EDHVKLVNEV TEFAKTCVAD

ESAENCDKSL HTLFGDKLCT VATLRETYGE MADCCAKQEP

ERNECFLQHK DDNPNLPRLV RPEVDVMCTA FHDNEETFLK

KYLYEIARRH PYFYAPELLF FAKRYKAAFT ECCQAADKAA

CLLPKLDELR DEGKASSAKQ RLKCASLQKF GERAFKAWAV

ARLSQRFPKA EFAEVSKLVT DLTKVHTECC HGDLLECADD

RADLAKYICE NQDSISSKLK ECCEKPLLEK SHCIAEVEND

EMPALDLCTP SRATSADFVE SKDVCKNYAE AKDVFLGMFL

YEYARRHPDY SVVLLLRLAK TYETTLEKCC AAADPHECYA

KVFDEFKPLV EEPQNLIKQN CELFEQLGEY KFQNALLVRY

TKKVPQVSTP TLVEVSRNLG KVGSKCCKHP EAKRMPCAED

YLSVVLNQLC VLHEKTPVSD RVTKCCTESL VNRRPCFSAL

EVDETYVPKE FNAETFTFHA DICTLSEKER QIKKQTALVE

LVKHKPKATK EQLKAVMDDF AAFVEKCCKA DDKETCFAEE

GKKLVAASQA ALGLPGLCTP SRGRLVPRGS LQHHHHHHVD
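The presence of both aldehyde tags in the expected product (SEQ ID NO:126) can be checked with a simple pattern scan. The following is a minimal sketch under stated assumptions: the regular expression is an illustrative reading of the unconverted (cysteine-containing) sulfatase motif, with Z2 taken as proline or alanine and Z3 as a basic residue (R, K or H); the function name `find_motifs` is hypothetical; and the two input strings are excerpts copied from the SEQ ID NO:126 listing above rather than the full-length sequence.

```python
import re

# Assumed precursor pattern: Cys, any residue, Pro/Ala, any residue, basic residue.
# The lookahead lets overlapping candidates be reported as well.
MOTIF = re.compile(r"(?=(C.[PA].[RKH]))")


def find_motifs(protein: str):
    """Return (zero-based index, matched residues) for each candidate motif."""
    return [(m.start(), m.group(1)) for m in MOTIF.finditer(protein.upper())]


# Excerpts of SEQ ID NO:126 around the internal and C-terminal insertion sites.
internal_site = "EMPALDLCTPSRATSADFVE"
c_terminal_site = "ALGLPGLCTPSRGRLVPRGSLQHHHHHHVD"

for label, excerpt in (("internal", internal_site), ("C-terminal", c_terminal_site)):
    print(label, find_motifs(excerpt))
```

A pattern hit only flags a candidate site. It does not show that the cysteine is converted to formylglycine, and a permissive pattern of this kind may also match native stretches when run over a full-length sequence.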

Example 7
Production of Drug-HSA Conjugate

Purified HSA modified with aldehyde tags was added to a buffered solution of peptide that had been functionalized with an N-terminal aminooxy group. The conjugation reaction was carried out in a buffered solution at a pH of 6.0 with 100 mM aniline added. A three-fold excess of aminooxy peptide relative to aldehyde was added to the reaction mixture to drive the reaction to completion. After addition of peptide to a solution of the ald-tagged HSA, the mixture was stirred at room temperature for 2 h and dialyzed, and the protein-peptide conjugate was purified using FPLC.
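The three-fold excess of aminooxy peptide over aldehyde can be worked out from the amount of tagged protein and the number of tags per molecule. The sketch below is illustrative arithmetic only: the function name is hypothetical, the molecular weight of about 66,500 Da is an assumed round figure for HSA (the tagged construct will differ somewhat), and every tag is assumed to carry an aldehyde, that is, complete conversion to formylglycine.

```python
def peptide_needed_nmol(protein_mg: float,
                        protein_mw_da: float,
                        tags_per_protein: int,
                        fold_excess: float = 3.0) -> float:
    """Amount of aminooxy peptide (nmol) for a given fold excess over aldehyde."""
    # mg divided by g/mol gives mmol; multiply by 1e6 to express in nmol.
    protein_nmol = protein_mg / protein_mw_da * 1e6
    # Assumption: one reactive aldehyde per tag.
    aldehyde_nmol = protein_nmol * tags_per_protein
    return aldehyde_nmol * fold_excess


# Example: 1 mg of doubly tagged HSA, assumed MW 66,500 Da, 3-fold excess.
print(round(peptide_needed_nmol(1.0, 66_500.0, 2), 1))  # 90.2
```

If conversion to the aldehyde is incomplete, the same amount of peptide corresponds to a larger effective excess, so the figure above is an upper bound on the peptide required for a three-fold excess.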

Example 8
Expression of Aldehyde-Tagged HSA with Pichia Pastoris

6×His-LCTPSR-HSA (SEQ ID NO:127) was transformed into the Pichia strain GS115 using pRW39 (6×His-LCTPSR-HSA (SEQ ID NO:127) in pPIC3.5K). pRW39 was linearized with BglII. 20 μg of DNA in 10 μL water was added to 80 μL of freshly competent GS115 cells and electroporated in a 2 mm cuvette (2000 V). 1 mL of ice-cold 1 M sorbitol was added immediately after electroporation. The cells were plated on regeneration dextrose Bacto agar plates (lacking histidine to select for HIS+ transformants) and incubated at 30° C. for 3 days. Colonies were isolated and tested for resistance to G418 to select for those colonies containing multiple copies of aldehyde-tagged-HSA integrated into the Pichia genome.

Colonies were grown in 10 mL of buffered glycerol-complex medium overnight at 30° C. Cultures were centrifuged and the cells were resuspended in buffered methanol-complex medium to induce expression of 6×His-LCTPSR-HSA (SEQ ID NO:127), integrated into the Pichia genome under the control of a methanol-inducible promoter. The cells were grown for 6 days at 30° C. Methanol was added to each culture every 24 h to 0.5%. After 6 days, cells were cleared from the media by centrifugation and 10 μL of the media was run on an SDS-PAGE gel, and the gel was stained with Coomassie Blue. As a negative control, the original untransformed GS115 strain was also grown and taken through the same procedure. As a positive control for methanol induction and secretion of a protein into the media, a GS115 strain containing wild-type HSA integrated into the Pichia genome under control of the methanol-inducible promoter was also grown. The colonies expressed aldehyde-tagged-HSA and secreted it into the media (see FIG. 14).
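The daily methanol addition can be estimated with a simple dilution calculation. This is a rough sketch under several assumptions that are not stated in the procedure: the induction culture volume is taken as 10 mL, the 0.5% is read as volume per volume, the stock is neat methanol, and the methanol added the previous day is treated as fully consumed. The function name is hypothetical.

```python
def methanol_to_add_ul(culture_ml: float,
                       target_fraction: float = 0.005,
                       stock_fraction: float = 1.0) -> float:
    """Volume of methanol stock (in microliters) that brings a culture to the
    target v/v fraction, accounting for the volume added by the stock itself."""
    if not 0 < target_fraction < stock_fraction <= 1.0:
        raise ValueError("require 0 < target_fraction < stock_fraction <= 1")
    added_ml = target_fraction * culture_ml / (stock_fraction - target_fraction)
    return added_ml * 1000.0


# Example: assumed 10 mL culture, neat methanol, 0.5% (v/v) target.
print(round(methanol_to_add_ul(10.0), 1))  # 50.3
```

In practice the difference from the simpler estimate of 0.5% of the culture volume (50 μL per 10 mL) is negligible at this scale.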

Example 9
Expressing and Purifying Aldehyde-Tagged-HSA from CHO Cells

24 μg of a DNA construct containing aldehyde-tagged-HSA in pcDNA3.1 (pRW38) was transfected into CHO-K1 cells in Opti-MEM serum-free medium using Lipofectin transfection reagent in a 10 cm dish. After 5 h at 37° C., the Opti-MEM was removed and Ex-Cell 325 protein-free medium (+1% FBS+L-glut+Pen/Strep) was added. After 72 h at 37° C., the media was collected and cleared of debris. 10 mL Binding Buffer (20 mM Na₂PO₄, 500 mM NaCl, 20 mM Imidazole, pH 7.5) and 200 μl of Ni-NTA resin were added. After incubating with rotation for 1 h at 4° C., the mixture was added to a column and the flow-through fraction was collected. The resin was washed with 4 mL Binding Buffer and then eluted 5 times with 500 μL Elution Buffer (20 mM Na₂PO₄, 500 mM NaCl, 500 mM Imidazole, pH 7.5). 10 μL of the media was run on an SDS-PAGE gel, and the gel was stained with Coomassie Blue (FIG. 15).
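The Binding Buffer and Elution Buffer differ only in imidazole concentration, so both can be assembled from the same concentrated stocks using C1·V1 = C2·V2. The sketch below is illustrative: the stock concentrations (200 mM phosphate, 5 M NaCl, 1 M imidazole) and the function name are assumptions rather than values from the procedure, and pH adjustment to 7.5 is not modeled.

```python
def stock_volumes_ml(final_ml: float, targets_mm: dict, stocks_mm: dict) -> dict:
    """Volume of each stock (mL), plus water, needed to reach the target
    concentrations (mM) in a given final volume, using C1*V1 = C2*V2."""
    volumes = {
        name: final_ml * targets_mm[name] / stocks_mm[name]
        for name in targets_mm
    }
    water = final_ml - sum(volumes.values())
    if water < 0:
        raise ValueError("stocks are too dilute for the requested final volume")
    volumes["water"] = water
    return volumes


# Hypothetical stock concentrations in mM (assumptions for illustration).
stocks = {"phosphate": 200.0, "NaCl": 5000.0, "imidazole": 1000.0}
binding = {"phosphate": 20.0, "NaCl": 500.0, "imidazole": 20.0}
elution = {"phosphate": 20.0, "NaCl": 500.0, "imidazole": 500.0}

print(stock_volumes_ml(10.0, binding, stocks))
print(stock_volumes_ml(2.5, elution, stocks))  # 5 elutions of 500 μL each
```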

Claims

1. A method of producing a carrier protein-drug conjugate, the method comprising: (a) combining in a reaction mixture: (1) an aldehyde-tagged carrier protein comprising a heterologous sulfatase motif, wherein the heterologous sulfatase motif is less than 13 amino acid residues and contains a sequence of the formula: X1Z1X2Z2X3Z3 wherein: Z1 is a 2-formylglycine residue; Z2 is a proline or alanine residue; X1 is present or absent and, when present, is any amino acid, wherein X1 is present when the sulfatase motif is at the N-terminus of the polypeptide; X2 and X3 are each independently any amino acid; and Z3 is a basic amino acid; and (2) a drug for conjugation to the carrier protein, wherein the drug comprises a reactive partner for an aldehyde of the carrier protein; wherein the drug is provided in the reaction mixture in an amount sufficient to provide for a desired drug to carrier protein ratio, said combining being under conditions suitable to promote reaction between the aldehyde of the carrier protein and the reactive partner of the drug to generate a carrier protein-drug conjugate; and (b) isolating the carrier protein-drug conjugate from the reaction mixture; and wherein the heterologous sulfatase motif of the carrier protein-drug conjugate contains a sequence of the formula: X1(FGly′)X2Z2X3Z3 where FGly′ is of the formula:
2. The method of claim 1, wherein the carrier protein is folded prior to said combining.
3. The method of claim 2, wherein the carrier protein presents the drug on a solvent-accessible surface of the carrier protein-drug conjugate when the carrier protein is folded.
4. The method of claim 1, wherein the carrier protein-drug conjugate comprises two or more heterologous sulfatase motifs.
5. The method of claim 4, wherein the carrier protein-drug conjugate comprises three or more heterologous sulfatase motifs.
6. The method of claim 4, wherein the heterologous sulfatase motifs are positioned in the carrier protein-drug conjugate at at least one of the N-terminus of the carrier protein, the C-terminus of the carrier protein, and a solvent-accessible loop of the carrier protein.
7. The method of claim 4, wherein the two or more heterologous sulfatase motifs are provided as a concatamer and are separated by a flexible linker.
8. The method of claim 1, wherein the carrier protein-drug conjugate has a drug to carrier protein ratio of 2:1 or more.
9. The method of claim 1, wherein the carrier protein-drug conjugate has a drug to carrier protein ratio of 3:1 or more.
10. The method of claim 1, wherein the carrier protein-drug conjugate has a drug to carrier protein ratio of 4:1 or more.
11. The method of claim 1, wherein the carrier protein-drug conjugate has a drug to carrier protein ratio of 5:1 or more.
12. The method of claim 1, wherein the carrier protein is albumin.
13. The method of claim 1, wherein the covalently bound drug is a peptide drug.
14. The method of claim 13, wherein the peptide drug is glucagon-like peptide 1 (GLP-1) or a biologically active variant thereof.
15. The method of claim 13, wherein the peptide drug is calcitonin or a biologically active variant thereof.
16. The method of claim 1, wherein the covalently bound drug is a small molecule drug.
17. The method of claim 1, wherein Z3 is arginine (R).
18. The method of claim 1, wherein X1, when present, is an aliphatic amino acid, a sulfur-containing amino acid, or a polar, uncharged amino acid; and X2 and X3 are each independently an aliphatic amino acid, a sulfur-containing amino acid, or a polar, uncharged amino acid.
19. The method of claim 1, wherein X1, when present, is L, M, V, S or T.
20. The method of claim 1, wherein X2 and X3 are each independently S, T, A, V, G, or C.
21. The method of claim 1, wherein the heterologous sulfatase motif is less than 12 amino acid residues.
22. The method of claim 1, wherein the heterologous sulfatase motif is less than 11 amino acid residues.
23. The method of claim 1, wherein the heterologous sulfatase motif is less than 10 amino acid residues.
24. The method of claim 1, wherein the heterologous sulfatase motif is less than 9 amino acid residues.
25. The method of claim 1, wherein the heterologous sulfatase motif is less than 8 amino acid residues.
26. The method of claim 1, wherein the heterologous sulfatase motif is less than 7 amino acid residues.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No. 12/706,679 filed Feb. 16, 2010, now U.S. Pat. No. 9,238,878, which claims priority benefit of U.S. provisional application Ser. No. 61/153,162, filed Feb. 17, 2009, which applications are incorporated herein by reference in their entirety.

US Referenced Citations (396)

Number	Name	Date	Kind
4332717	Kanaoka et al.	Jun 1982	A
4342832	Goeddel et al.	Aug 1982	A
4671958	Rodwell et al.	Jun 1987	A
4952394	Senter	Aug 1990	A
5204449	Puri	Apr 1993	A
5208020	Chari et al.	May 1993	A
5416064	Cheri et al.	May 1995	A
5428130	Capon et al.	Jun 1995	A
5455165	Capon et al.	Oct 1995	A
5475092	Chari et al.	Dec 1995	A
5484892	Tedder et al.	Jan 1996	A
5514582	Capon et al.	May 1996	A
5585499	Chari et al.	Dec 1996	A
5624821	Winter et al.	Apr 1997	A
5681566	Stevenson	Oct 1997	A
5686072	Uhr et al.	Nov 1997	A
5698672	Labroo et al.	Dec 1997	A
5789554	Leung et al.	Aug 1998	A
5831000	Murayama et al.	Nov 1998	A
5846545	Chari et al.	Dec 1998	A
5885573	Bluestone et al.	Mar 1999	A
5980895	Pastan et al.	Nov 1999	A
5981485	O'Connor et al.	Nov 1999	A
5981488	Hoffman	Nov 1999	A
6103236	Suzawa et al.	Aug 2000	A
6121022	Presta et al.	Sep 2000	A
6183744	Goldenberg	Feb 2001	B1
6187287	Leung et al.	Feb 2001	B1
6194551	Idusogie et al.	Feb 2001	B1
6254868	Leung et al.	Jul 2001	B1
6268343	Knudsen et al.	Jul 2001	B1
6284727	Kim et al.	Sep 2001	B1
6395226	Plunkett	May 2002	B1
6441163	Chari et al.	Aug 2002	B1
6528624	Idusogie et al.	Mar 2003	B1
6538124	Idusogie et al.	Mar 2003	B1
6548644	Pettit	Apr 2003	B1
6570040	Saxon et al.	May 2003	B2
6576744	Presnell et al.	Jun 2003	B1
6608183	Cox, III	Aug 2003	B1
6608196	Wang et al.	Aug 2003	B2
6653104	Goldenberg	Nov 2003	B2
6660843	Feige et al.	Dec 2003	B1
6692924	Presnell et al.	Feb 2004	B2
6710169	Capon et al.	Mar 2004	B2
6716821	Zhao et al.	Apr 2004	B2
6737056	Presta	May 2004	B1
6756480	Kostenuik et al.	Jun 2004	B2
6770625	Soltero et al.	Aug 2004	B2
6777539	Sprecher et al.	Aug 2004	B2
6800740	Cunningham et al.	Oct 2004	B1
6803451	Presnell et al.	Oct 2004	B2
6825166	McChesney et al.	Nov 2004	B2
6875845	Presnell et al.	Apr 2005	B2
6884869	Senter et al.	Apr 2005	B2
6897044	Braslawsky et al.	May 2005	B1
6897292	Presnell et al.	May 2005	B2
6900218	Wang et al.	May 2005	B2
6913748	Widdison	Jul 2005	B2
6927042	Schultz et al.	Aug 2005	B2
6989366	Beeley et al.	Jan 2006	B2
6989452	Ng et al.	Jan 2006	B2
6998253	Presta et al.	Feb 2006	B1
7045337	Schultz et al.	May 2006	B2
7045498	Kindsvogel et al.	May 2006	B2
7045605	Bader et al.	May 2006	B2
7049316	Zhao et al.	May 2006	B2
7056701	Fleer et al.	Jun 2006	B2
7087600	Ng et al.	Aug 2006	B2
7091186	Senter et al.	Aug 2006	B2
7097840	Erickson et al.	Aug 2006	B2
7098308	Senter et al.	Aug 2006	B2
7112439	Johnson et al.	Sep 2006	B2
7115573	Pickford et al.	Oct 2006	B2
7122637	Presta	Oct 2006	B2
7138371	DeFrees	Nov 2006	B2
7141547	Rosen et al.	Nov 2006	B2
7176278	Prior	Feb 2007	B2
7183387	Presta	Feb 2007	B1
7189811	Panda et al.	Mar 2007	B2
7189835	Raymond et al.	Mar 2007	B2
7189839	Presnell et al.	Mar 2007	B2
7214663	Bebbington et al.	May 2007	B2
7214685	Tietze et al.	May 2007	B2
7226990	Knudsen et al.	Jun 2007	B2
7230068	Wilson	Jun 2007	B2
7255012	Hedtke	Aug 2007	B2
7256257	Doronina et al.	Aug 2007	B2
7265203	Presnell et al.	Sep 2007	B2
7271149	Glaesner et al.	Sep 2007	B2
7271255	Raymond et al.	Sep 2007	B2
7276497	Chari et al.	Oct 2007	B2
7276585	Lazar et al.	Oct 2007	B2
7276947	Becker et al.	Oct 2007	B2
7297775	Idusogie et al.	Nov 2007	B2
7317091	Lazar et al.	Jan 2008	B2
7321026	Leung	Jan 2008	B2
7332571	Miao et al.	Feb 2008	B2
7332581	Presta	Feb 2008	B2
7335742	Presta	Feb 2008	B2
7338659	Leung	Mar 2008	B2
7351555	Presnell et al.	Apr 2008	B2
7355008	Stavenhagen et al.	Apr 2008	B2
7355011	Popplewell et al.	Apr 2008	B2
7355012	Pastan et al.	Apr 2008	B2
7361347	Adolf et al.	Apr 2008	B2
7368565	Chari et al.	May 2008	B2
7371826	Presta	May 2008	B2
7374762	Amphlett	May 2008	B2
7381408	Mezo et al.	Jun 2008	B2
7385028	Miao et al.	Jun 2008	B2
7388026	Zhao et al.	Jun 2008	B2
7404956	Peters et al.	Jul 2008	B2
7364731	Idusogie et al.	Aug 2008	B2
7410779	Fleer et al.	Aug 2008	B2
7411056	Presnell et al.	Aug 2008	B2
7416727	Presta	Aug 2008	B2
7423116	Doronina et al.	Sep 2008	B2
7425541	Dubois et al.	Sep 2008	B2
7435416	Devaux et al.	Oct 2008	B2
7435550	Novak et al.	Oct 2008	B2
7442778	Gegg et al.	Oct 2008	B2
7445764	Kratz	Nov 2008	B1
7456260	Rybak et al.	Nov 2008	B2
7473796	Chari et al.	Jan 2009	B2
7488590	Feige et al.	Feb 2009	B2
7491809	Presnell et al.	Feb 2009	B2
7494649	Amphlett et al.	Feb 2009	B2
7498298	Doronina et al.	Mar 2009	B2
7498302	Ng et al.	Mar 2009	B2
7501120	Amphlett et al.	Mar 2009	B2
7501497	Rixon et al.	Mar 2009	B2
7507420	Ng et al.	Mar 2009	B2
7514080	Amphlett et al.	Apr 2009	B2
7517903	Chen et al.	Apr 2009	B2
7521541	Eigenbrot et al.	Apr 2009	B2
7521542	Johnson et al.	Apr 2009	B2
7534427	Goldenberg et al.	May 2009	B2
7541034	Fitzgerald et al.	Jun 2009	B1
7553816	Senter et al.	Jun 2009	B2
7572456	Johnson et al.	Aug 2009	B2
7572892	Novak et al.	Aug 2009	B2
7575748	Erickson et al.	Aug 2009	B1
7601354	Chari	Oct 2009	B2
7608686	Gross et al.	Oct 2009	B2
7618628	Johnson et al.	Nov 2009	B2
7622116	Kuestner et al.	Nov 2009	B2
7629452	Sprecher et al.	Dec 2009	B2
7632497	Stavenhagen	Dec 2009	B2
7635767	Rixon et al.	Dec 2009	B2
7642228	Carter et al.	Jan 2010	B2
7655660	Zhao et al.	Feb 2010	B2
7655661	Zhao et al.	Feb 2010	B2
7657380	Lazar et al.	Feb 2010	B2
7659241	Senter et al.	Feb 2010	B2
7662387	Law et al.	Feb 2010	B2
7662925	Lazar et al.	Feb 2010	B2
7662936	Kadkhodayan et al.	Feb 2010	B2
7691962	Boyd et al.	Apr 2010	B2
7722865	Vellard et al.	May 2010	B2
7723485	Junutula et al.	May 2010	B2
7745394	Doronina et al.	Jun 2010	B2
7750116	Doronina et al.	Jul 2010	B1
7754681	Feng	Jul 2010	B2
7771727	Fuselier et al.	Aug 2010	B2
7777019	Pastan et al.	Aug 2010	B2
7803915	Cairns et al.	Sep 2010	B2
7816317	Bebbington et al.	Oct 2010	B2
7829086	Hilbert et al.	Nov 2010	B2
7829531	Senter et al.	Nov 2010	B2
7837980	Alley et al.	Nov 2010	B2
7847105	Gangwar et al.	Dec 2010	B2
7851432	Chari et al.	Dec 2010	B2
7851437	Senter et al.	Dec 2010	B2
7855275	Eigenbrot et al.	Dec 2010	B2
7858759	Brandt et al.	Dec 2010	B2
7893023	Tronet et al.	Feb 2011	B2
7906545	Zhao et al.	Mar 2011	B2
7960512	Stavenhagen et al.	Jun 2011	B2
7964195	Papkoff et al.	Jun 2011	B2
7964566	Doronina et al.	Jun 2011	B2
7964567	Doronina et al.	Jun 2011	B2
7978319	Okabe et al.	Jul 2011	B2
8163882	Presta	Apr 2012	B2
8192737	Stavenhagen et al.	Jun 2012	B2
20020103345	Zhu	Aug 2002	A1
20020128448	Reff	Sep 2002	A1
20020146504	Schwartz	Oct 2002	A1
20020177756	Pierre Godinot et al.	Nov 2002	A1
20030082575	Schultz et al.	May 2003	A1
20030108609	Berry et al.	Jun 2003	A1
20030109682	Santi et al.	Jun 2003	A1
20030124669	Pan et al.	Jul 2003	A1
20030157108	Presta	Aug 2003	A1
20030166868	Presta et al.	Sep 2003	A1
20030171285	Finn et al.	Sep 2003	A1
20040010124	Johnson et al.	Jan 2004	A1
20040048395	Lee et al.	Mar 2004	A1
20040110226	Lazar et al.	Jun 2004	A1
20040115774	Kochendorfer et al.	Jun 2004	A1
20040132101	Lazar et al.	Jul 2004	A1
20040197866	Johnson et al.	Oct 2004	A1
20040219156	Goldenberg et al.	Nov 2004	A1
20040229250	Figura et al.	Nov 2004	A1
20040265952	Deiters et al.	Dec 2004	A1
20050037000	Stavenhagen et al.	Feb 2005	A1
20050064514	Stavenhagen et al.	Mar 2005	A1
20050084862	Lee et al.	Apr 2005	A1
20050118174	Presta	Jun 2005	A1
20050118182	Pastan et al.	Jun 2005	A1
20050142133	Lazar	Jun 2005	A1
20050147618	Rivera et al.	Jul 2005	A1
20050170404	Cho et al.	Aug 2005	A1
20050177878	Melo et al.	Aug 2005	A1
20050220762	Cho et al.	Oct 2005	A1
20050249723	Lazar	Nov 2005	A1
20050260711	Datta et al.	Nov 2005	A1
20050281829	Hehir et al.	Dec 2005	A1
20060013810	Johnson et al.	Jan 2006	A1
20060014212	Benkovic et al.	Jan 2006	A1
20060019347	Cho et al.	Jan 2006	A1
20060024298	Lazar et al.	Feb 2006	A1
20060035305	Bertozzi	Feb 2006	A1
20060057149	Johnson et al.	Mar 2006	A1
20060121032	Dahiyat et al.	Jun 2006	A1
20060134105	Lazar et al.	Jun 2006	A1
20060134709	Stavehagen et al.	Jun 2006	A1
20060135427	Hays et al.	Jun 2006	A1
20060153860	Cho et al.	Jul 2006	A1
20060173170	Chamberlain et al.	Aug 2006	A1
20060182750	Chari et al.	Aug 2006	A1
20060183198	Buechler et al.	Aug 2006	A1
20060189529	Cho et al.	Aug 2006	A1
20060194290	Presta	Aug 2006	A1
20060194957	Presta	Aug 2006	A1
20060217289	Miao et al.	Sep 2006	A1
20060235208	Lazar et al.	Oct 2006	A1
20060275254	Kim et al.	Dec 2006	A1
20070004909	Johnson et al.	Jan 2007	A1
20070009523	Presta	Jan 2007	A1
20070020258	Jardieu et al.	Jan 2007	A1
20070020260	Presta	Jan 2007	A1
20070024389	Mizutani	Feb 2007	A1
20070031922	Presta et al.	Feb 2007	A1
20070036799	Stavehagen et al.	Feb 2007	A1
20070037216	Johnson et al.	Feb 2007	A1
20070053901	Lazar et al.	Mar 2007	A1
20070077429	Mirkin et al.	Apr 2007	A1
20070122408	Barbas, III	May 2007	A1
20070123691	Wilson	May 2007	A1
20070123693	Wilson	May 2007	A1
20070135620	Chamberlain et al.	Jun 2007	A1
20070148170	Desjarlais et al.	Jun 2007	A1
20070148171	Lazar et al.	Jun 2007	A1
20070160597	Lazar et al.	Jul 2007	A1
20070166309	Lazar et al.	Jul 2007	A1
20070189962	Pastan et al.	Aug 2007	A1
20070198996	Chiu et al.	Aug 2007	A1
20070202098	Lazar et al.	Aug 2007	A1
20070219133	Lazar et al.	Sep 2007	A1
20070224189	Lazar et al.	Sep 2007	A1
20070224192	Lazar et al.	Sep 2007	A1
20070231329	Lazar et al.	Oct 2007	A1
20070237765	Lazar et al.	Oct 2007	A1
20070237766	Lazar et al.	Oct 2007	A1
20070237767	Lazar et al.	Oct 2007	A1
20070238665	Lazar et al.	Oct 2007	A1
20070243188	Lazar et al.	Oct 2007	A1
20070244303	Johnson et al.	Oct 2007	A1
20070248602	Lazar et al.	Oct 2007	A1
20070248603	Lazar et al.	Oct 2007	A1
20070264260	Tuscano et al.	Nov 2007	A1
20070269369	Gegg et al.	Nov 2007	A1
20070275460	Desjarlais et al.	Nov 2007	A1
20070286859	Lazar et al.	Dec 2007	A1
20080044417	Johnson et al.	Feb 2008	A1
20080044429	Johnson et al.	Feb 2008	A1
20080050310	Ebens et al.	Feb 2008	A1
20080050371	Johnson et al.	Feb 2008	A1
20080050374	Cho et al.	Feb 2008	A1
20080051563	Lazar et al.	Feb 2008	A1
20080057056	Lazar et al.	Mar 2008	A1
20080081038	Cho et al.	Apr 2008	A1
20080085277	Cho et al.	Apr 2008	A1
20080085538	Buechler et al.	Apr 2008	A1
20080095762	Presta	Apr 2008	A1
20080096819	Grabstein et al.	Apr 2008	A1
20080097083	Cho et al.	Apr 2008	A1
20080102124	Cho et al.	May 2008	A1
20080102125	Cho et al.	May 2008	A1
20080103293	Cho et al.	May 2008	A1
20080103294	Cho et al.	May 2008	A1
20080108791	Cho et al.	May 2008	A1
20080108792	Hays et al.	May 2008	A1
20080108797	Cho et al.	May 2008	A1
20080112943	Mariani et al.	May 2008	A1
20080112961	Stavehagen et al.	May 2008	A1
20080113408	Mariani et al.	May 2008	A1
20080113411	Sheffer et al.	May 2008	A1
20080113412	Sheffer et al.	May 2008	A1
20080113457	Tsay et al.	May 2008	A1
20080113912	Hays et al.	May 2008	A1
20080113913	Hays et al.	May 2008	A1
20080113914	Hays et al.	May 2008	A1
20080114154	Cho et al.	May 2008	A1
20080114155	Cho et al.	May 2008	A1
20080118505	Tedder	May 2008	A1
20080119640	Hays et al.	May 2008	A1
20080125574	Sheffer et al.	May 2008	A1
20080131435	Stavehagen et al.	Jun 2008	A1
20080132681	Hays et al.	Jun 2008	A1
20080138338	Idusogie et al.	Jun 2008	A1
20080138344	Stavenhagen et al.	Jun 2008	A1
20080138349	Stavenhagen et al.	Jun 2008	A1
20080146781	Cho et al.	Jun 2008	A1
20080152649	Chamberlain et al.	Jun 2008	A1
20080154025	Lazar et al.	Jun 2008	A1
20080161539	Cho et al.	Jul 2008	A1
20080161541	Lazar et al.	Jul 2008	A1
20080167452	Maiti et al.	Jul 2008	A1
20080177027	Miao et al.	Jul 2008	A1
20080177038	Miao et al.	Jul 2008	A1
20080181890	Lazar et al.	Jul 2008	A1
20080182968	Lazar et al.	Jul 2008	A1
20080182969	Miao et al.	Jul 2008	A1
20080187491	Miao et al.	Aug 2008	A1
20080187956	Carrico et al.	Aug 2008	A1
20080194459	Miao et al.	Aug 2008	A1
20080199471	Bernett et al.	Aug 2008	A1
20080199909	Buechler et al.	Aug 2008	A1
20080206242	Lawrence et al.	Aug 2008	A1
20080206853	Lee et al.	Aug 2008	A1
20080206867	Desjarlais et al.	Aug 2008	A1
20080207877	Cho et al.	Aug 2008	A1
20080213840	Miao et al.	Sep 2008	A1
20080219974	Bernett et al.	Sep 2008	A1
20080225287	Mirkin et al.	Sep 2008	A1
20080227205	Cho	Sep 2008	A1
20080242845	Lazar et al.	Oct 2008	A1
20080244222	Supalov et al.	Oct 2008	A1
20080248028	Lazar et al.	Oct 2008	A1
20080249288	Mezo et al.	Oct 2008	A1
20080254027	Bernett et al.	Oct 2008	A1
20080255045	Cujec et al.	Oct 2008	A1
20080260731	Bernett et al.	Oct 2008	A1
20080268518	Miao et al.	Oct 2008	A1
20080268519	Miao et al.	Oct 2008	A1
20080274105	Presta	Nov 2008	A1
20080274108	Presta	Nov 2008	A1
20080274506	Presta	Nov 2008	A1
20080292621	Lazar et al.	Nov 2008	A1
20080317758	Presta	Dec 2008	A9
20090004734	Pastan et al.	Jan 2009	A1
20090005312	Hansen et al.	Jan 2009	A1
20090010920	Lazar et al.	Jan 2009	A1
20090041758	Glaser et al.	Feb 2009	A1
20090041770	Chamberlain et al.	Feb 2009	A1
20090042291	Chu et al.	Feb 2009	A1
20090053211	Lazar et al.	Feb 2009	A9
20090053240	Lazar et al.	Feb 2009	A1
20090060910	Johnson et al.	Mar 2009	A1
20090068175	Lazar et al.	Mar 2009	A1
20090068177	Lazar et al.	Mar 2009	A1
20090081208	Lazar et al.	Mar 2009	A1
20090092599	Lazar et al.	Apr 2009	A1
20090098124	Stavehagen et al.	Apr 2009	A1
20090136485	Chu et al.	May 2009	A1
20090142340	Lazar et al.	Jun 2009	A1
20090143246	Mirkin et al.	Jun 2009	A1
20090155587	Mirkin et al.	Jun 2009	A1
20090162353	Johnson et al.	Jun 2009	A1
20090162382	Bernett et al.	Jun 2009	A1
20090163699	Chamberlain et al.	Jun 2009	A1
20090185290	Li et al.	Jul 2009	A1
20090202536	Ebens et al.	Aug 2009	A1
20090202537	Johnson et al.	Aug 2009	A1
20090214526	Lazar et al.	Aug 2009	A1
20090215991	Lazar et al.	Aug 2009	A1
20090281286	Gregg et al.	Nov 2009	A1
20090286964	Gregg et al.	Nov 2009	A1
20090305411	FitzGerald et al.	Dec 2009	A1
20090324593	Johnson et al.	Dec 2009	A1
20100129908	Fang et al.	May 2010	A1
20100143368	King et al.	Jun 2010	A1
20100204454	Chamberlain et al.	Aug 2010	A1
20100210543	Rabuka et al.	Aug 2010	A1
20100234571	Chamberlain et al.	Sep 2010	A1
20100234572	Chamberlain et al.	Sep 2010	A1
20100234573	Chamberlain et al.	Sep 2010	A1
20100234574	Chamberlain et al.	Sep 2010	A1
20100234575	Chamberlain et al.	Sep 2010	A1
20100311954	Chamberlain et al.	Dec 2010	A1
20110020344	Dimitrov et al.	Jan 2011	A1
20110065185	Pastan et al.	Mar 2011	A1
20110142859	Ebens et al.	Jun 2011	A1
20110293632	Presta	Dec 2011	A1

Foreign Referenced Citations (52)

Number	Date	Country
WO 1990004413	May 1990	WO
WO 1993012812	Jul 1993	WO
WO 1994026778	Nov 1994	WO
WO 1996004925	Feb 1996	WO
WO 1999058572	Nov 1999	WO
WO 2000042072	Jul 2000	WO
WO 2001081415	Jan 2001	WO
WO 2001060991	Aug 2001	WO
WO 2003027135	Apr 2003	WO
WO 2003105782	Dec 2003	WO
WO 2004072275	Aug 2004	WO
WO 2004082640	Sep 2004	WO
WO 2004099249	Nov 2004	WO
WO 2005000892	Jan 2005	WO
WO 2005035727	Apr 2005	WO
WO 2005047336	May 2005	WO
WO 2005052006	Jun 2005	WO
WO 2005074524	Aug 2005	WO
WO 2005074546	Aug 2005	WO
WO 2005074650	Aug 2005	WO
WO 2006009901	Jan 2006	WO
WO 2006068802	Jun 2006	WO
WO 2006069220	Jun 2006	WO
WO 2006071840	Jul 2006	WO
WO 2006073846	Jul 2006	WO
WO 2006091231	Aug 2006	WO
WO 2006069246	Sep 2006	WO
WO 2006132969	Dec 2006	WO
WO 2006133089	Dec 2006	WO
WO 2007021297	Feb 2007	WO
WO 2007056083	May 2007	WO
WO 2007056448	May 2007	WO
WO 2007059312	May 2007	WO
WO 2007070659	Jun 2007	WO
WO 2007079130	Jul 2007	WO
WO 2007094916	Aug 2007	WO
WO 2007103470	Sep 2007	WO
WO 2007140371	Dec 2007	WO
WO 2008011446	Jan 2008	WO
WO 2008030558	Mar 2008	WO
WO 2008030612	Mar 2008	WO
WO 2008030613	Mar 2008	WO
WO 2008030614	Mar 2008	WO
WO 2008036350	Mar 2008	WO
WO 2008070569	Jun 2008	WO
WO 2008077079	Jun 2008	WO
WO 2008083346	Jul 2008	WO
WO 2008121563	Oct 2008	WO
WO 2008137471	Nov 2008	WO
WO 2009058492	May 2009	WO
WO 2009120611	Oct 2009	WO
WO 2010096394	Aug 2010	WO

Non-Patent Literature Citations (101)

Entry
Baenziger (2003) “A major step on the road to understanding a unique posttranslational modification and its role in a genetic disease” Cell 113(4):421-422.
Berteau et al. (2006) “A new type of bacterial sulfatase reveals a novel maturation pathway in prokaryotes” J. Biol. Chem. 281(32):22464-22470.
Cosma et al. (2003) “The multiple sulfatase deficiency gene encodes an essential and limiting factor for the activity of sulfatases” Cell 113(4):445-456.
Cosma et al. (2004) “Molecular and functional analysis of SUMF1 mutations in multiple sulfatase deficiency” Hum. Mutat. 23, 576-581.
Dierks et al. (1997) “Conversion of cysteine to formylglycine: a protein modification in the endoplasmic reticulum” Proc Natl Acad Sci U S A 94(22):11963-8.
Dierks et al. (1998) “Conversion of cysteine to formylglycine in eukaryotic sulfatases occurs by a common mechanism in the endoplasmic reticulum” FEBS Lett. 423(1):61-5.
Dierks et al. (1999) “Sequence determinants directing conversion of cysteine to formylglycine in eukaryotic sulfatases” EMBO J 18(8):2084-2091.
Dierks et al. (2003) “Multiple sulfatase deficiency is caused by mutations in the gene encoding the human Cα-formylglycine generating enzyme” Cell 113(4):435-444.
Dierks et al. (2005) “Molecular basis for multiple sulfatase deficiency and mechanism for formylglycine generation of the human formylglycine-generating enzyme” Cell. 121(4):541-552.
Fang et al. (2004) “Post-translational formylglycine modification of bacterial sulfatases by the radical S-adenosylmethionine protein AtsB” J Biol Chem. 79(15):14570-8.
GenBank Accession No. NM—182760 “Homo sapiens sulfatase modifying factor 1 (SUMF1), transcript variant 1, mRNA” dated Nov. 28, 2012.
Jefferis & Lefranc (2009) “Human Immunoglobulin Allotypes” MAbs 1(4):332-338.
Landgrebe et al. (2003) “The human SUMF1 gene, required for posttranslational sulfatase modification, defines a new gene family which is conserved from pro- to eukaryotes” Gene. 316:47-56.
Preusser-Kunze et al. (2005) “Molecular characterization of the human Cα-formylglycine-generating enzyme” J. Biol. Chem. 280(15):14900-10.
Roeser et al. (2006) “A general binding mechanism for all human sulfatases by the formylglycine-generating enzyme” Proc Natl Acad Sci USA 103(1):81-86.
Sardiello et al. (2005) “Sulfatases and sulfatase modifying factors: an exclusive and promiscuous relationship” Hum Mol Genet. 14(21):3203-3217.
Szameit et al. (1999) “The iron sulfur protein AtsB is required for posttranslational formation of formylglycine in the Klebsiella sulfatase” J Biol Chem 274(22):15375-15381.
Connolly “Analytical molecular surface calculation” J. Appl. Cryst. (1983) 16:548-558.
GenBank Accession No. AAG00909 “recombinant IgG1 heavy chain [Homo sapiens]” dated May 11, 2001.
Kabsch & Sander (1983) “Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features” Biopolymers 22: 2577-637.
Lee & Richards (1971) “The interpretation of protein structures: estimation of static accessibility” J. Mol. Biol. 55(3):379-400.
Mahal et al. (1997) “Engineering chemical reactivity on cell surfaces through oligosaccharide biosynthesis” Science 276(5315):1125-1128.
Adams, et al. (2003) “Safety and Utilization of Blood Components as Therapeutic Delivery Systems” Curr Pharm Biotechnol 4(5):275-282.
Andreotti, et al. (2006) “Structural determinants of salmon calcitonin bioactivity: the role of the Leu-based amphipathic α-helix” J. Biol. Chem. 281(34):24193-24203.
Baggio, et al. (2008) “An albumin-exendin-4 conjugate engages central and peripheral circuits regulating murine energy and glucose homeostasis” Gastroenterology 134(4):1137-1147.
Baker (2002) “Albumin, steroid hormones and the origin of vertebrates” J Endocrinol 175(1):121-127.
Brubaker (2007) “Incretin-based therapies: mimetics versus protease inhibitors” Trends Endocrinol. Metab. 18(6):240-245.
Carter & Senter (2008) “Antibody-Drug Conjugates for Cancer Therapy” Cancer J 14(3):154-169.
Doronina, et al. (2008) “Novel peptide linkers for highly potent antibody-auristatin conjugate” Bioconjugate Chem 19(10):1960-1963.
Dou, et al. (2008) “Expression, purification, and characterization of recombinant human serum albumin fusion protein with two human glucagon-like peptide-1 mutants in Pichia pastoris” Protein Expr Purif 61(1):45-49.
Harohalli, et al. (2002) “Site-directed mutagenesis studies of human serum albumin define tryptophan at amino acid position 214 as the principal site for nitrosation” J Biomed Sci 9(1):47-58.
Junutula, et al. (2008) “Site-specific conjugation of a cytotoxic drug to an antibody improves the therapeutic index” Nat Biotechnol 26(8):925-932.
Komarova (2003) “Regulation of Osteoclasts by Calcitonin and Amphiphilic Calcitonin Conjugates: Role of Cytosolic Calcium” Calcif Tissue Int 73(3):265-273.
Kumar, et al. (2007) “Gene therapy of diabetes using a novel GLP-1/IgG1-Fc fusion construct normalizes glucose levels in db/db mice” Gene Ther. 14(2):162-172.
Léger, et al. (2004) “Identification of CJC-1131-albumin bioconjugate as a stable and bioactive GLP-1(7-36) analog” Bioorg. Med. Chem. Lett. 14(17):4395-4398.
Matthews, et al. (2008) “Pharmacodynamics, Pharmacokinetics, Safety, and Tolerability of Albiglutide, a Long-Acting Glucagon-Like Peptide-1 Mimetic, in Patients with Type 2 Diabetes” J. Clin. Endocrinol. Metab. 93(12):4810-4817.
McDonagh, et al. (2006) “Engineered antibody-drug conjugates with defined sites and stoichiometries of drug attachment” Protein Eng Des Sel 19(7):299-307.
Müller, et al. (2007) “Improved pharmacokinetics of recombinant bispecific antibody molecules by fusion to human serum albumin” J Biol Chem 282(17):12650-12660.
Peterson, et al. (2002) “Probing the structure of the warfarin-binding site on human serum albumin using site-directed mutagenesis” Proteins 47(2):116-125.
Picha, et al. (2008) “Protein Engineering Strategies for Sustained Glucagon-Like Peptide-1 Receptor-Dependent Control of Glucose Homeostasis” Diabetes 57(7):1926-1934.
Wu & Senter (2005) “Arming antibodies: prospects and challenges for immunoconjugates” Nat Biotechnol 23(9):1137-1146.
Youn, et al. (2007) “High-yield production of biologically active mono-PEGylated salmon calcitonin by site-specific PEGylation” J. Control. Release 117(3):371-379.
Arslan et al. (1997) “Structurally Modified Firefly Luciferase. Effects of Amino Acid Substitution at Position 286” J Am Chem Soc 119(45):10877-10887.
Bain et al. (1989) “Biosynthetic site-specific incorporation of a non-natural amino acid into a polypeptide” J Am Chem Soc 111(20):8013-8014.
Boer et al. (2003) “The genome-wide transcriptional responses of Saccharomyces cerevisiae grown on glucose in aerobic chemostat cultures limited for carbon, nitrogen, phosphorus, or sulfur” J Biol Chem 278(5):3265-3274.
Cornish et al. (1994) “Site-specific incorporation of biophysical probes into proteins” Proc Natl Acad Sci USA 91(8):2910-2914.
Cornish et al. (1995) “Probing Protein Structure and Function with an Expanded Genetic Code” Angew Chem Int Ed Engl 34:621-633.
Deiters et al. (2003) “Adding amino acids with novel reactivity to the genetic code of Saccharomyces cerevisiae” J Am Chem Soc 125(39):11782-11783.
Hall et al. (2005) “Contribution of horizontal gene transfer to the evolution of Saccharomyces cerevisiae” Eukaryot Cell 4(6):1102-1115.
Hecht (1992) “Probing the Synthetic Capabilities of a Center of Biochemical Catalysis” Acc Chem Res 25(12):545-552.
Hortin & Boime (1983) “Applications of amino acid analogs for studying co- and posttranslational modifications of proteins” Meth Enzymol 96:777-784.
Kirshenbaum et al. (2002) “Biosynthesis of proteins incorporating a versatile set of phenylalanine analogues” Chembiochem 3(2-3):235-237.
Takebe (1988) “SR alpha promoter: an efficient and versatile mammalian cDNA expression system composed of the simian virus 40 early promoter and the R-U5 segment of human T-cell leukemia virus type 1 long terminal repeat” Mol Cell Biol 8(1):466-472.
Abdel-Magid et al. (2006) “A review on the use of Sodium Triacetoxyborohydride in the Reductive Amination of Ketones and Aldehydes” Organic Process Research & Development 10: 971-1031.
Carrico (2008) “Chemoselective modification of proteins: hitting the target,” Chemical Society Reviews, 37: 1423-1431 (Epub Jun. 5, 2008).
Carter et al. (1992) “Humanization of an anti-p185 her2 antibody for human cancer therapy” Proceedings of the National Academy of Sciences U.S.A 89: 4285-4289.
Presta (2008) “Molecular engineering and design of therapeutic antibodies” Current Opinion in Immunology 20: 460-470.
Prescher et al. (2005) “Chemistry in living systems,” Nature Chemical Biology 1(1):13-21.
Rush et al. (2008) “New Aldehyde Tag Sequences Identified by Screening Formylglycine Generating Enzymes in Vitro and in Vivo.” J. Am. Chem. Soc. 130: 12240-12241.
Carrico et al. (2007) “Introducing genetically encoded aldehydes into proteins” Nature Chemical Biology, 3(6):321-322.
Smith et al. (2014) “Chemoenzymatic Fc Glycosylation via Engineered Aldehyde Tags” Bioconjugate Chemistry 25(4): 788-795.
Wu et al. (2009) “Site-specific chemical modification of recombinant proteins produced in mammalian cells by using the genetically encoded aldehyde tag” PNAS 106(9): 3000-3005.
Advani et al. (2010) “Safety, pharmacokinetics, and preliminary clinical activity of inotuzumab ozogamicin, a novel immunoconjugate for the treatment of B-cell non-Hodgkin's lymphoma: results of a phase I study” J Clin Oncol 28(12):2085-2093.
Amlot et al. (1993) “A phase I study of an anti-CD22-deglycosylated ricin A chain immunotoxin in the treatment of B-cell lymphomas resistant to conventional therapy” Blood 82(9):2624-2633.
Asai et al. (1999) “Synthesis and antitumor activity of water-soluble duocarmycin B1 prodrugs” Bioorg Med Chem Lett 9(20):2995-2998.
Baird & Holowka (1985) “Structural mapping of Fc receptor bound immunoglobulin E: proximity to the membrane surface of the antibody combining site and another site in the Fab segments” Biochem 24(22):6252-6259.
Boghaert et al. (2008) “Determination of pharmacokinetic values of calicheamicin-antibody conjugates in mice by plasmon resonance analysis of small (5 microl) blood samples” Cancer Chemother Pharmacol 61(6):1027-1035.
Dijoseph et al. (2004) “Antibody-targeted chemotherapy with CMC-544: a CD22-targeted immunoconjugate of calicheamicin for the treatment of B-lymphoid malignancies” Blood 103(5):1807-1814.
Dijoseph et al. (2004) “Potent and specific antitumor efficacy of CMC-544, a CD22-targeted immunoconjugate of calicheamicin, against systemically disseminated B-cell lymphoma” Clin Cancer Res 10:8620-8629.
Dijoseph et al. (2006) “Antitumor efficacy of a combination of CMC-544 (inotuzumab ozogamicin), a CD22-targeted cytotoxic immunoconjugate of calicheamicin, and rituximab against non-Hodgkin's B-cell lymphoma” Clin Cancer Res 12(1):242-249.
Dijoseph et al. (2007) “Therapeutic potential of CD22-specific antibody-targeted chemotherapy using inotuzumab ozogamicin (CMC-544) for the treatment of acute lymphoblastic leukemia” Leukemia 21(11):2240-2245.
Fanslow et al. (1992) “Soluble forms of CD40 inhibit biologic responses of human B cells” J Immunol 149(2):655-660.
Ghetie et al. (1991) “Antitumor activity of Fab′ and IgG-anti-CD22 immunotoxins in disseminated human B lymphoma grown in mice with severe combined immunodeficiency disease: effect on tumor cells in extranodal sites” Cancer Res 51(21):5876-5880.
Gilon et al. (1967) “Synthesis of ω-aminooxy acids by oxygen-alkyl fission of lactones: An improved synthesis of DL-canaline” Tetrahedron 23(11):4441-4447.
Idusogie et al. (2000) “Mapping of the C1q binding site on rituxan, a chimeric antibody with a human IgG1 Fc” J Immunol 164(8):4178-4184.
ImmunoGen, Inc. (2008) “ImmunoGen, Inc. Announces Clinical Findings Reported at ASCO with Targeted Anticancer Compounds IMGN242 and AVE1642” http://www.drugs.com/clinical_trials/immunogen-inc-announces-clinical-findings-reported-asco-targeted-anticancer-compounds-imgn242-4545.html#ixzz0r9nPIIXM.
Jeffrey et al. (2005) “Design, synthesis, and in vitro evaluation of dipeptide-based antibody minor groove binder conjugates” J Med Chem 48(5):1344-1358.
Johnson & Wu (2000) “Kabat database and its applications: 30 years after the first variability plot” Nucl Acids Res 28(1):214-218.
Jones et al. (2000) “A convenient synthesis of N-(tert-butyloxycarbonyl)aminooxy ethers” Tetrahedron Lett 41(10):1531-1533.
Kan (2001) “Thioether-bonded constructs of Fab′gamma and Fc gamma modules utilizing differential reduction of interchain disulfide bonds” J Immunol 166(2):1320-1326.
Ogura et al. (2010) “Phase I Study of Inotuzumab Ozogamicin (CMC-544) in Japanese Patients with Follicular Lymphoma Pretreated with Rituximab-Based Therapy” Cancer Sci 101(8):1840-1845. Epub Apr. 23, 2012 doi:10.1111/j.1349-7006.2010.01601.x.
Passerini Reaction (2012) Wikipedia, available online at http://en.wikipedia.org/wiki/Passerini_reaction.
Pettit (1996) “Progress in the discovery of biosynthetic anticancer drugs” J Nat Prod 59(8):812-821.
Pleass (1999) “Identification of residues in the CH2/CH3 domain interface of IgA essential for interaction with the human fcalpha receptor (FcalphaR) CD89” J Biol Chem 274(33):23508-23514.
Presta (2002) “Engineering therapeutic antibodies for improved function” Biochem Soc Trans 30(4):487-490.
Rakestraw et al. (1990) “Preparation and characterization of immunoconjugates for antibody-targeted photolysis” Bioconjugate Chem 1(3):212-221.
Rutishauser et al. (1968) “Amino Acid Sequence of the Fc Region of a Human γ G-Immunoglobulin” Proc Natl Acad Sci USA 61(4):1414-1421.
Sayers et al. (1998) “Amino acid residues that influence Fc epsilon RI-mediated effector functions of human immunoglobulin E” Biochemistry 37(46):16152-16164.
Shields (2001) “High resolution mapping of the binding site on human IgG1 for Fc gamma RI, Fc gamma RII, Fc gamma RIII, and FcRn and design of IgG1 variants with improved binding to the Fc gamma R” J Biol Chem 276(9):6591-6604.
Singh & Francis (1978) “A direct binding assay for rheumatoid factor serum antiglobulins using fluorescein-labelled Fc fragment of human immunoglobulin-G” J Clin Path 31(10):963-973.
Singh et al. (2008) “Recent Trends in Targeted Anticancer Prodrug and Conjugate Design” Curr Med Chem 15(18):1802-1826.
Sondermann & Oosthuizen (2002) “Mediation and Modulation of Antibody Function” Biochem Soc Trans 30(pt.4):481-486.
Stevenson et al. (1997) “Conjugation of human Fc gamma in closed-hinge or open-hinge configuration to Fab′gamma and analogous ligands” J Immunol 158(5):2242-2250.
Stevenson et al. (1999) “Preparation of fcgamma for addition to sulfhydryl-expressing ligands with minimal disturbance of the hinge” J Immunol Meth 231(1-2):169-175.
Stimmel et al. (2000) “Site-specific conjugation on serine → cysteine variant monoclonal antibodies” J Biol Chem 275(39):30445-30450.
Takeshita (2009) “CMC-544 (inotuzumab ozogamicin) shows less effect on multidrug resistant cells: analyses in cell lines and cells from patients with B-cell chronic lymphocytic leukaemia and lymphoma” Br J Haematol 146:34-43.
Taylor (2010) “Mutations in an avian IgY-Fc fragment reveal the locations of monocyte Fc receptor binding sites” Dev Comp Immunol 34(2):97-101.
Thrasher et al. (1975) “The effect of fluorescein conjugation on Fc-dependent properties of rabbit antibody” J Immunol 114(2 pt. 2):762-764.
Vitetta et al. (1991) “Phase I immunotoxin trial in patients with B-cell lymphoma” Cancer Res 51(15):4052-4058.
Wooley et al. (1993) “Influence of a recombinant human soluble tumor necrosis factor receptor FC fusion protein on type II collagen-induced arthritis in mice” J Immunol 151(11):6602-6607.
Xu et al. (1999) “Bis(Hydroxamamide)-Based Bifunctional Chelating Agent 99mTc Labeling of Polypeptides” Bioconjug Chem 10(1):9-17.

Related Publications (1)

	Number	Date	Country
	20160160206 A1	Jun 2016	US

Provisional Applications (1)

	Number	Date	Country
	61153162	Feb 2009	US

Continuations (1)

	Number	Date	Country
Parent	12706679	Feb 2010	US
Child	14959360		US
