FUSION PROTEIN OF ANTIGEN-BINDING PROTEIN AND FLUORESCENT PROTEIN OR FLUORESCENCE-LABELED TAG PROTEIN

Abstract
The present invention provides a fusion protein of an antigen-binding protein and a fluorescent protein or a fluorescence-labeled tag protein. The present invention provides a fusion protein of an antigen-binding protein and a fluorescent protein or a fluorescence-labeled tag, wherein the antigen-binding protein is an antigen-binding protein having a helix structure or a β-sheet structure at a terminus, the fluorescent protein or the fluorescence-labeled tag is a fluorescent protein or a fluorescence-labeled tag having a helix structure or a β-sheet structure at a terminus, and the helix at the terminus and the helix at the terminus are linked, or the β-sheet structure at the terminus and the β-sheet structure at the terminus are linked.
Description
TECHNICAL FIELD

The present invention relates to a fusion protein of an antigen-binding protein and a fluorescent protein or a fluorescence-labeled tag protein.


BACKGROUND ART

Techniques have been developed to link a fluorescent protein or a fluorescence-labeled tag protein to a target protein for visualizing the protein. These techniques are used for observing localization of the target protein by fluorescence microscopy.


Fluorescence is known to have polarization. For example, it is disclosed that GFP, which is a green fluorescent protein, has polarization (Non Patent Literature 1). Further, a case in which polarization of a protein called septin was observed by linking GFP to septin is disclosed (Non Patent Literature 2). In Non Patent Literature 2, it was revealed for the first time that the orientation of septin fibers on the cell division plane of Saccharomyces cerevisiae changed over time, by observing its polarization.


CITATION LIST
Non Patent Literature



  • Non Patent Literature 1: Inoue et al., PNAS, 99: 4272-4277, 2002

  • Non Patent Literature 2: Vrabioiu and Mitchison, Nature, 28, 466-469, 2006



SUMMARY OF INVENTION

The present invention provides a fusion protein of an antigen-binding protein and a fluorescent protein or a fluorescence-labeled tag protein.


The inventors have obtained a large number of fusion proteins of an antigen-binding protein and a fluorescent protein or a fluorescence-labeled tag protein—those in which helices of two proteins are linked and those in which β-sheets are linked. The inventors have found that the obtained fusion proteins retain the binding property to antigens and are suitable for observation of fluorescence. The inventors have also found that the fusion proteins obtained are suitable for observation of fluorescence polarization.


That is, the present invention provides the following industrially applicable invention.


(1) A fusion protein of an antigen-binding protein and a fluorescent protein or a fluorescence-labeled tag, wherein the antigen-binding protein is an antigen-binding protein having a helix structure or a β-sheet structure at a terminus, the fluorescent protein or the fluorescence-labeled tag is a fluorescent protein or a fluorescence-labeled tag having a helix structure or a β-sheet structure at a terminus, and the helix at the terminus and the helix at the terminus are linked, or the β-sheet structure at the terminus and the β-sheet structure at the terminus are linked.


(2) The fusion protein according to (1) above, wherein the antigen-binding protein has a helix structure at the N-terminus, the fluorescent protein or the fluorescence-labeled tag has a helix structure at the C-terminus, and the helix at the terminus of the antigen-binding protein and the helix at the terminus of the fluorescent protein or the fluorescence-labeled tag are linked.


(3) The fusion protein according to (2) above, wherein the antigen-binding protein is an Affimer.


(4) The fusion protein according to (2) or (3) above, wherein the fluorescent protein is a fluorescent protein having a helix structure at the C-terminus, selected from GFP and a fluorescent protein having a GFP-like β-barrel structure, and a circular permutated mutant of a point mutant.


(5) The fusion protein according to (1) above, wherein the antigen-binding protein has a β-sheet structure at the C-terminus, the fluorescent protein has a β-sheet structure at the N-terminus, and the β-sheet structure at the C-terminus and the β-sheet structure at the N-terminus are linked.


(6) The fusion protein according to (5) above, wherein the antigen-binding protein is a Nanobody or a scFv.


(7) The fusion protein according to (5) or (6) above, wherein the fluorescent protein is a fluorescent protein having a β-sheet structure at the N-terminus, selected from the group consisting of GFP and a fluorescent protein having a GFP-like β-barrel structure, and a point mutant, and a circular permutated mutant and a deletion mutant thereof.


(8) The fusion protein according to (2) or (3) above, wherein the fluorescence-labeled tag is a dehalogenation domain of haloalkane dehalogenase having an amino acid sequence corresponding to the amino acid sequence set forth in SEQ ID No: 9 or 29, having a helix structure at the C-terminus.


(9) The fusion protein according to (2) or (3) above, wherein the fluorescent protein is a phytochrome- or cyanobacteriochrome-based near-infrared fluorescent protein having a helix structure at the C-terminus.


(10) A composition comprising the fusion protein according to any one of (1) to (9) above.


(11) A nucleic acid encoding the fusion protein according to any one of (1) to (9) above.


(12) A gene expression vector, or a cell transformed with the vector, for use in expressing a fusion protein in a cell, comprising the nucleic acid of (11) above, wherein the nucleic acid is operably linked to a regulatory sequence, or a cell introduced with a mRNA comprising the nucleic acid of (11) above.


(13) A method for observing an antigen, comprising observing an antigen bound to the fusion protein according to any one of (1) to (9) above.


(14) The method according to (13) above, wherein the antigen is an antigen in an aqueous solution or a cell, or on a cell surface.


(15) The method according to (13) or (14) above, comprising:


observing each of: a first antigen bound to a first fusion protein which is the fusion protein according to any one of (1) to (9) above; and a second antigen bound to a second fusion protein which is the fusion protein according to any one of (1) to (9) above {with the proviso that the first fusion protein and the second fusion protein have different fluorescence wavelengths}.


(16) The method according to (15) above, comprising performing fluorescence correlation spectroscopy and fluorescence cross-correlation spectroscopy on the first fusion protein and the second fusion protein.


(17) The method according to (16) above, wherein the fluorescence correlation spectroscopy and the fluorescence cross-correlation spectroscopy are each fluorescence polarization correlation spectroscopy.


(18) A method for designing an amino acid sequence of a fusion protein, comprising: providing a first amino acid sequence which is an amino acid sequence of an antigen-binding protein and a second amino acid sequence which is an amino acid sequence of a fluorescent protein, wherein the N-terminus of the first amino acid sequence and the C-terminus of the second amino acid sequence or the C-terminus of the first amino acid sequence and the N-terminus of the second amino acid sequence are both helix structures or β-sheet structures, together form a helix structure upon excision of a part of the N-terminus or the C-terminus, or together form a β-sheet structure upon excision of a part of the N-terminus or the C-terminus; and linking the N-terminus of the first amino acid sequence and the C-terminus of the second amino acid sequence or the C-terminus of the first amino acid sequence and the N-terminus of the second amino acid sequence, to obtain an amino acid sequence with the junction serving as a part of a helix structure or a β-sheet structure.


(18A) The method according to (18) above, comprising obtaining a nucleic acid encoding the amino acid sequence obtained.


(18B) A method for producing a gene expression vector for expressing the nucleic acid, comprising operably linking the nucleic acid obtained in (18A) above to a promoter.


(18C) An amino acid sequence to be obtained by the method according to (18) above.


(18D) A nucleic acid to be obtained by the method according to (18A) above.


(18E) A gene expression vector to be obtained by the method according to (18B) above.





BRIEF DESCRIPTION OF DRAWINGS


FIG. 1A illustrates a method for linking two proteins in the fusion protein of the present invention (method for linking helices to each other).



FIG. 1B shows a three-dimensional structural model of a fusion protein of cp-sfGFP and Affimer obtained by estimation from the crystal structure of each protein.



FIG. 1C shows the results of crystal structure analysis for the fusion protein of cp-sfGFP and Affimer.



FIG. 2 shows the results of fluorescence polarization microscopic observation of cells expressing the fusion protein of the present invention in which the fluorescent protein is cp-sfGFP, and the Affimer is an Affimer that binds to actin.



FIG. 3 shows the results of fluorescence polarization microscopic observation of cells expressing the fusion protein of the present invention in which the fluorescent protein is cpmVenus, and the Affimer is an Affimer that binds to actin.



FIG. 4 shows the results of fluorescence polarization microscopic observation of cells expressing the fusion protein of the present invention in which the fluorescent protein is cpmTurquoise2, and the Affimer is an Affimer that binds to actin.



FIG. 5 illustrates a method of linking the fusion protein of the present invention in which the fluorescence-labeled tag is a HaloTag, and the Affimer is an Affimer that binds to actin.



FIG. 6A shows a three-dimensional structural model of a fusion protein of HaloTag and Affimer to actin and the results of fluorescence polarization microscopic observation of cells expressing the fusion protein.



FIG. 6B shows a three-dimensional structural model of a fusion protein of HaloTag and Affimer to actin and the results of fluorescence polarization microscopic observation of cells expressing the fusion protein. The number of turns of the helix around the junction is different from that in FIG. 6A.



FIG. 6C shows a three-dimensional structural model of a fusion protein of HaloTag and Affimer to actin and the results of fluorescence polarization microscopic observation of cells expressing the fusion protein. The number of turns around the junction of the helix is different from that in FIG. 6A and FIG. 6B.



FIG. 7A illustrates a linking method of a fusion protein of cp-sfGFP and Nanobody.



FIG. 7B shows the results of fluorescence polarization microscopic observation of cells expressing a fusion protein of cp-sfGFP and Nanobody that binds to vimentin linked by the linking method of FIG. 7A.



FIG. 7C shows the results of fluorescence polarization microscopic observation of cells expressing a fusion protein of Nanobody that binds to vimentin and cp176-172mVenus linked by the same linking method as in FIG. 7A.



FIG. 8A illustrates a method of linking a fusion protein of mApple and Nanobody.



FIG. 8B shows the results of fluorescence polarization microscopic observation of cells expressing a fusion protein of mApple and Nanobody that binds to vimentin linked by the linking method of FIG. 8A.



FIG. 9A illustrates a linking method of a fusion protein of cp-sfGFP and scFv.



FIG. 9B shows the results of fluorescence polarization microscopic observation of cells expressing a fusion protein of cp-sfGFP and scFv that binds to non-muscle myosin IIA linked by the linking method of FIG. 9A.



FIG. 9C shows the results of fluorescence polarization microscopic observation of cells expressing a fusion protein of scFv that binds to myosin and cp176-172mVenus linked by the same linking method as in FIG. 7C.



FIG. 10 shows a three-dimensional structural model of a fusion protein (SEQ ID No: 35) of the actin binding domain of utrophin shown in Table 1 and mEGFP and the results of fluorescence polarization microscopic observation of cells expressing the fusion protein.



FIG. 11 shows a three-dimensional structural model of a fusion protein of Affimer that binds to actin and mScarlet and the results of fluorescence polarization microscopic observation of cells using the fusion protein.



FIG. 12 shows a three-dimensional structural model of a fusion protein of Affimer that binds to actin and mNeonGreen and the results of fluorescence polarization microscopic observation of cells using the fusion protein.



FIG. 13 shows a three-dimensional structural model of a fusion protein of Nanobody that binds to an ALFA-tag and cp176-172mVenus and the results of fluorescence polarization microscopic observation of cells co-expressing the fusion protein and a fusion protein of Affimer and an ALFA-tag (that is, a fusion protein in which an alpha helix at the N-terminus of the Affimer that binds to actin is truncated, and an ALFA-tag sequence is connected to the truncated alpha helix so as to form consecutive alpha helixes).



FIG. 14 shows a three-dimensional structural model of an Affimer that binds to actin linked to an ALFA-tag. In the figure, the ALFA-tag is shown by dark gray, and the Affimer is shown by light gray.





DESCRIPTION OF EMBODIMENTS

In this description, the “antigen-binding protein” refers to a protein that binds to a specific entity. The antigen-binding protein can be a modified protein or a non-natural protein, or an antigen-binding fragment thereof. The antigen-binding protein is not specifically limited as long as it can obtain a binding affinity to an antigen by modification of an amino acid sequence and is a protein having a β-sheet structure or a helix structure at the N-terminus or the C-terminus (for example, a scaffold protein). Examples thereof include antibodies, antigen binding fragments thereof (such as scFv), single-chain variable domains (VHH domains) of heavy-chain antibodies of camels and llamas (such as Nanobody (trademark)), scaffold proteins with an albumin-binding domain of bacteria serving as the basic skeleton (such as ABD (trademark)), scaffold proteins with the 10th domain of human fibronectin serving as the basic skeleton (such as Adnectin (trademark), FingR (trademark), or Monobody (trademark)), scaffold proteins with the Z domain of Staphylococcus protein A serving as the basic skeleton (such as Affibody (trademark)), scaffold proteins with human γ-B-crystallin serving as the basic skeleton (such as Affilin (trademark)), scaffold proteins with a DNA-binding protein, Sac7d, of archaebacteria serving as the basic skeleton (such as Affitin (trademark)), scaffold proteins with a triple antiparallel helix serving as the basic skeleton (such as Alphabody (trademark)), scaffold proteins with human or insect lipocalin serving as the basic skeleton (such as Anticalin (trademark)), scaffold proteins with armadillo protein serving as the basic skeleton (such as armadillo repeat protein), scaffold proteins with human C-type lectin domain CTLD3 serving as the basic skeleton (such as Atrimer (trademark)), scaffold proteins with a polymerized LDLR-A module serving as the basic skeleton (such as Avimer (trademark)), scaffold proteins with the Fn3 domain of human tenascin C serving as the basic skeleton (such as Centyrin (trademark)), scaffold proteins with the SH3 domain of human Fyn tyrosine kinase serving as the basic skeleton (such as Fynomer (trademark)), scaffold proteins with human BPTI/LACI-D1/ITI-D2/APPI serving as the basic skeleton (such as Knitz domain (trademark)), scaffold proteins with the OB shape of aspartyl-tRNA synthetase of Pyrobaculum aerophilum serving as the basic skeleton (such as Obody (trademark)), scaffold proteins with the 14th extracellular domain of human fibronectin III serving as the basic skeleton (such as Pronectin (trademark)), scaffold proteins with a leucine-rich repeat module of a variable lymphocyte receptor of Agnatha serving as the basic skeleton (such as Repebody (trademark)), scaffold proteins with a human ankyrin repeat protein serving as the basic skeleton (such as DARPin (trademark)), and scaffold proteins with stefin A or cystatin serving as the basic skeleton (such as Affimer (trademark)) (Skrlec, K. et al., Trends in Biotechnology, Vol. 33, No. 7: 408-418, 2015). The β-sheet structure or the helix structure at the N-terminus or the C-terminus of each of these proteins can be used for the linkage in the fusion protein of the present invention. In such a scaffold protein, the site of the amino acid involved in the antigen-binding affinity is known, and a scaffold protein having a desired antigen-binding property can be obtained by selecting an amino acid at the site suitable for binding to the antigen by ribosome display or phage display. Among other antigen-binding proteins, an antigen-binding protein having a helix or β-sheet at the N-terminus or the C-terminus may be used in a fusion protein with the fluorescent protein in the present invention. The scaffold protein refers to a protein having a function of retaining a paratope (particularly, retaining a three-dimensional shape thereof) and having a skeleton similar to that of the parent protein (protein before modification) in the tertiary structure. The basic skeleton refers to a part of the scaffold protein other than the intended modification site (such as a paratope). The paratope refers to a part that binds to the antigen in the antigen-binding protein.


In this description, the “fluorescent protein” refers to a protein which absorbs energy by irradiation with short wavelength electromagnetic waves to excite electrons and emits longer wavelength electromagnetic waves when the electrons return to the ground state. Examples of the fluorescent protein include fluorescent proteins emitting visible light and near-infrared fluorescent proteins emitting near-infrared light. Fluorescent proteins suitable for observation of fluorescence polarization can be used preferably for observation of fluorescence polarization. Fluorescent proteins suitable for observation of fluorescence polarization and fluorescent proteins unsuitable for observation of fluorescence polarization can be used simply for fluorescence observation.


In this description, the “Affimer” is a protein which is biologically inactive and has a variant of physically stable stefin A or cystatin as the basic skeleton wherein the protein has antigen-binding sites in two loops presented on the same side from the four β-sheet structures possessed by the basic skeleton. The amino acid sequences of the loop portions can have diversity. The diversity of the amino acid sequences of the loop portions enables Affimers that bind to various antigens to be obtained by phage display. As the basic skeleton of the Affimer, a variant of stefin A having the amino acid sequence corresponding to SEQ ID No: 1 disclosed in WO2009/136182 can be used, for example, and the variant of stefin A in which the 4th glycine of stefin A is optionally substituted with arginine is a protein having heterologous amino acid sequences in two loops presented on the same side from the four β-sheet structures possessed by the basic skeleton (such as the 46th to 54th amino acid sites and the 67th to 84th amino acid sites of stefin A). The Affimer has an α-helix structure at the N-terminus. The Affimer can also be a protein with plant-derived cystatin serving as the basic skeleton and may also be a protein having antigen-binding sites in two loops presented on the same side from the four β-sheet structures possessed by the basic skeleton. Examples of the Affimer with cystatin serving as the basic skeleton include any one sequence of SEQ ID Nos: 1 to 6 disclosed in WO2014/125290. The molecular weight of the Affimer can be about 12 to 14 kDa.


In the present invention, the “Nanobody” is an antigen-binding protein based on variable region domains of an antibody consisting only of heavy chains found in certain animals, whereas antibodies generally consist of heavy chains and light chains. The antibody consisting only of heavy chains is an antibody commonly found in Arabian camels, Bactrian camels, llamas, and alpacas and can bind to an antigen only in the variable region domains of the above heavy chains. In recent years, similar antibodies consisting only of heavy chains have been found in cartilaginous fishes (sharks or the like). The Nanobody has three complementarity determining regions (CDR) and binds to an antigen in these three CDRs. The Nanobody can be obtained by immunizing the above animals which produce an antibody consisting only of the heavy chains with an antigen, isolating B cells from the animals immunized, obtaining a cDNA library containing variable regions, incorporating them into a phage display library using M13 phage, and screening them with the antigen. The Nanobody has a β-sheet structure to which a fluorescent entity can be connected in the present invention at each of the N-terminus and the C-terminus. In the present invention, the β-sheet structure at the N-terminus and/or the C-terminus of Nanobody, for example, the β-sheet structure at the C-terminus can be linked to the β-sheet structure of the fluorescent protein or the fluorescence-labeled tag.


In this description, the “scFv” is a single-chain antigen-binding protein in which a heavy-chain variable region (VH) and a light-chain variable region (VL) of an antibody are linked by a flexible peptide linker. The heavy-chain variable region and the light-chain variable region are connected by the flexible linker, in order to promote the association between the heavy-chain variable region and the light-chain variable region while retaining the antigen specificity of the original antibody by antigen recognition in the two regions. The flexible linker which can freely change its structure corresponding to the association state between the scFv and the antigen, and the like may be used. For example, a glycine-rich sequence of about 15 amino acids (for example, serine may be inserted to ensure hydrophilicity) may be preferably used. As the flexible linker, a linker having an amino acid sequence of -(GGGGS)3- can be used, for example. The scFv can be obtained by selecting, from a phage library, using a desired antigen binding property as an index. The phage library can be obtained by randomizing antigen-binding sites of scFv, which has framework sequences (for example, framework sequences of mammals such as humans). The scFv has a β-sheet structure at the N-terminus and the C-terminus to which the fluorescent entity can be connected in the present invention. In the present invention, the β-sheet structure at the N-terminus and/or the C-terminus of the scFv, for example, the β-sheet structure at the C-terminus can be linked to the β-sheet structure of the fluorescent protein or the fluorescence-labeled tag.


In this description, the “DARPin” is an antigen-binding protein developed by Molecular partners AG. The DARPin is an artificial protein having ankyrin repeat units (generally, about 2 to 30), and each repeat unit contains skeleton residues and target interaction residues (see WO2002/020565, for example). The ankyrin repeat units have a common folded structure consisting of two antiparallel α-helices followed by a β-hairpin {where the β-hairpin has a loop that binds to the next repeat unit}. The DARPin forms a curved structure by the ankyrin repeat units stacked on one another. The target interaction residues can be present in the β-hairpin of the ankyrin repeat units and a part where the first α-helix exposes. The DARPin can be obtained by selection based on the binding property to the antigen. The DARPin can be obtained, for example, using a method such as phage display, ribosome display, and plasmid display. The DARPin has an α-helix at one end and a β-sheet at the other end and provides a site to which the fluorescent entity is connected in the present invention.


In this description, the “Monobody” is also called FingR or Adnectin and is a scaffold protein with the 10th domain of human fibronectin (fibronectin type III domain) serving as the basic skeleton. This domain has the same structure as the variable domains of an antibody, that is, it has seven β-sheet structures forming a β-sandwich and three loops corresponding to three complementarity determining regions at each side. The binding specificity to an antigen can be modified by the amino acid sequences of the loop BC between the 2nd and 3rd β-sheets, the loop DE between the 4th and 5th β-sheets, and the loop FG between the 6th and 7th β-sheets. Alternatively, it can be modified by the amino acid sequences of the 3rd, 4th, 6th, and 7th β-sheets, in addition to the loop CD between the 3rd and the 4th β-sheets and the loop FG between the 6th and 7th β-sheets. The Monobody has β-sheets at the N-terminus and the C-terminus and can provide a site to which the fluorescent entity is connected.


As used herein, the phrase “having an amino acid sequence corresponding to a certain amino acid sequence” means to include not only having the amino acid sequence concerned but also having an amino acid sequence which has the same function corresponding to the amino acid sequence concerned. For example, in the case where some species have different sequences, not only the amino acid sequence of a specific species, but also an amino acid sequence having the same function of a closely related species with a high similarity to the amino acid sequence of the specific species are included. The phrase “having an amino acid sequence corresponding to a certain amino acid sequence” can mean having an amino acid sequence which has the same function corresponding to the certain amino acid sequence. For example, in the case of a protein, it can mean a protein of another species having an ortholog relationship with a protein having the certain amino acid sequence.


In the present invention, the helix structure (such as α-helix and 3/10 helix) at the C-terminus of the fluorescent protein can be linked to the helix structure at the N-terminus of an Affimer. The two helices linked can form a large helix structure. The fusion protein of the present invention can provide a fusion protein of a fluorescent protein and the Affimer thus linked. The linkage may be carried out by directly binding the helix structures to each other or may be carried out via a linker. As the linker, a linker having a helix structure (helix linker) can be used, for example. In the case of linking the helix structures to each other, the bond angle of the fluorescent protein and the antigen-binding protein can be adjusted by adjusting the lengths of the helix structures and/or the length of the helix linker. The helix linker is not specifically limited but can have an amino acid sequence according to any of SEQ ID Nos: 3, 10, 12, 14, and 23, for example.


In the present invention, the β-sheet structure at the N-terminus of the fluorescent protein and the β-sheet structure at the C-terminus of Nanobody or scFv can be linked. The two β-sheets linked can form one large β-sheet structure. The fusion protein of the present invention can provide a fusion protein of a fluorescent protein and the Nanobody or the scFv thus linked. The linkage may be carried out by directly binding the β-sheets to each other or may be carried out via a linker. As the linker, a linker having a β-sheet structure can be used, or a structurally inflexible amino acid such as one or two valines (V) can be used, for example.


As the fluorescent protein having a helix structure at the C-terminus, a circular permutated mutant of GFP (cpGFP) can be used. Various modified fluorescent proteins of GFP are known. The circular permutated mutant fluorescent protein is a mutant in which the positions of the sequences at the N-terminus and the C-terminus are changed by linking the N-terminus and the C-terminus of the protein with or without a linker and dividing the protein at another suitable position into two. For the circular permutated mutant, it can be an important condition that the N-terminus and the C-terminus are located in the vicinity in the three-dimensional structure. Many fluorescent proteins satisfy this condition, and modification of fluorescent proteins with circular permutated mutants is a standard technique for modifying fluorescent proteins. In the present invention, by using a circular permutated mutant, a fluorescent protein having a helix structure at the C-terminus can be used for producing a fusion protein with an Affimer. In the present invention, by using a circular permutated mutant, a fluorescent protein having a β-sheet structure at the N-terminus can be used for producing a fusion protein with a Nanobody or a scFv.


As the modified fluorescent proteins of GFP {more specifically, examples thereof include GFP, fluorescent proteins having a GFP-like β-barrel structure (that is, GFP-like proteins), and point mutants thereof, or circular permutated mutants of these}, GFP-like fluorescent proteins such as superfolder GFP (sfGFP), EGFP, Citrine, Venus, mVenus, YFP, mApple, mOrange, mCherry, BFP, TagBFP, mTurquoise, Cerulean, mHoneydew, mBanana, tdTomato, mTangerine, mStrawberry, mPlum, mScarlet, mNeonGreen, mNeptune, and NirFP, and circular permutated mutants of these variant fluorescent proteins (such as fluorescent proteins which are circular permutated mutants having a helix structure at the C-terminus; it is clearly indicated that the proteins are circular permutated mutants by prefixing the notation “cp-” to the protein names) can be used, for example. There are many examples known as the modified fluorescent proteins of GFP, such as substitutes in which the order of the β-sheets constituting the β-barrel is changed and fluorescent proteins having a β-barrel structure in which the number of β-sheets is reduced. These can be used for the fusion protein of the present invention. As the fluorescent proteins having a helix structure at the C-terminus, a circular permutated mutant of mTurquiose (cpmTurquoise) can be used. Since the mTurquiose has a helix structure at the N-terminus, cpmTurquiose which is a circular permutated mutant having a helix structure at the C-terminus can be used. There are examples known as the mTurquiose, such as mTurquiose-DR, mTurquiose-GL, mTurquiose-GV, mTurquiose-RA, mTurquiose2, mTurquiose2-G, mTurquiose-146G, and mTurquiose-1465, and variant fluorescent proteins thereof. One or more selected from the group consisting of these examples can be used by converting them into circular permutated mutants having a helix structure at the C-terminus, for example. The crystal structures of these proteins are published, for example, in the Protein Data Bank (PDB), and the β-sheet structures or the helix structures present at the N-terminus or the C-terminus of these proteins can be used for the linkage in the fusion protein of the present invention.


The cp-sfGFP having a helix structure at the C-terminus to be used in the present invention is not specifically limited, as long as it has a helix structure at the C-terminus. For example, cp-sfGFP having an amino acid sequence corresponding to the amino acid sequence of SEQ ID No: 1 can be used.


Examples of the fluorescent proteins having a helix structure at the C-terminus include phytochrome-based near-infrared fluorescent proteins. Examples of the phytochrome-based near-infrared fluorescent proteins include iRFP and miRFP. The iRFP contains the PAS domain and the GAF domain (PAS-GAF) of bacterio-phytochrome RpBphP2 of Rhodopseudomonas palustris, emits near-infrared fluorescence by incorporating biliverdin IXa, lacks the α-helix at the C-terminus required for dimerization, and has mutations of S13L, A92T, V104I, V114I, E161K, Y193K, F198Y, D202T, I203V, Y258F, A283V, K288T, and N290Y (see Filonov et al., Nat. Biotechnol., 29(8): 757-761, 2011). The miRFP contains the PAS domain and the GAF domain (PAS-GAF) of bacterio-phytochrome RpBphP1 of Rhodopseudomonas palustris, emits near-infrared fluorescence by incorporating biliverdin IXa, lacks the α-helix at the C-terminus required for dimerization, and has various mutations (see Shcherbakova D M et al., Nature Communication, 7, Article number: 12405, 2016). The miRFP is not specifically limited, but examples thereof include miRFP670 (excitation/emission at 642/670 nm), miRFP709 (excitation/emission at 683/709 nm), and miRFP703 (excitation/emission at 673/703 nm), and miRFPs induced from these miRFPs. The miRFP670 can have an amino acid sequence corresponding to the amino acid sequence set forth in SEQ ID No: 22, for example. These phytochrome-based near-infrared fluorescent proteins each have a helix structure at the C-terminus of the GAF domain and can be used for producing a fusion protein with an Affimer. Further, miRFPs derived from the GAF domain of Nostoc punctiforme-derived cyanobacteriochrome NpR3784 such as miRFP670nano also can be used for producing the fusion protein of the present invention. As the miRFP670nano, those with 18 mutations of V7M, F25C, M26V, Y27F, P31E, S41A, A48S, N51K, Q55R, T57R, I72Y, G82N, H87Y, N99I, N117H, C119L, L136Q, and Q139V introduced into the GAF domain of NpR3784 can be used. The miRFP670nano has a helix structure at the N-terminus and can be used for the linkage with the fluorescent protein in the present invention.


Examples of the fluorescence-labeled tag protein having a helix structure at the C-terminus include a modified protein derived from the dehalogenation domain of haloalkane dehalogenase of Rhodococcus bacteria. The dehalogenation domain of haloalkane dehalogenase has a pocket in which a ligand is inserted near the active center, Asp106. The group, —NH—CH2CH2—O—CH2CH2—O—(CH2)6—Cl, contained in the ligand is inserted in this pocket. A fluorescent entity is used as the ligand, and the group, —NH—CH2CH2—O—CH2CH2—O—(CH2)6—Cl, is introduced into this, thereby enabling the fluorescent entity to be bound to the protein with the above tag. Examples of the dehalogenation domain of haloalkane dehalogenase which can be used for such a purpose include the dehalogenation domain of haloalkane dehalogenase which has lost the dehalogenase activity. For example, an activity mutant (H272F) in which histidine at the active center is converted to phenylalanine can be used, and examples thereof include a dehalogenation domain having an amino acid sequence corresponding to the amino acid sequence set forth in SEQ ID No: 9 or 29. The dehalogenation domain of haloalkane dehalogenase has an α-helix at the C-terminus. The binding of the ligand to the dehalogenation domain is suitable for observing polarization. The present invention can provide use of the dehalogenation domain of haloalkane dehalogenase and an activity mutant thereof in observation of fluorescence polarization. Examples of the fluorescent entity which can be used as the ligand include coumarin, Oregon Green, diAcFAM, tetramethyl rhodamine (TMR), STELLA Fluor 650, STELLA Fluor700, STELLA Fluor 720, indocyanine green, Alexa Fluor 488, and Alexa Fluor 660. They are each commercially available as a HaloTag ligand.


The length of the helix structure is not particularly limited as long as the helix structure is formed. In the linkage between helix structures, the bond angle with an Affimer can be changed by changing the lengths of the amino acids of the helices at the linkage site, which can change the polarization planes of the fluorescent protein with respect to the Affimer. In the case of an α-helix, one turn is composed of about 3.5 amino acids, and therefore the bond angle is changed by about 100 degrees by increasing the α-helix by 1 amino acid. In the case of a 3/10 helix, one turn is composed by about 3 amino acids, and therefore the bond angle is changed by about 120 degrees by increasing the helix by 1 amino acid. The length of the helix can be adjusted by one or more selected from addition, deletion, and insertion of amino acids. Further, the length of the helix can be adjusted as appropriate according to its purpose. The adjustment method is not specifically limited but may be performed by changing the length of any of the helices of the fusion protein or may be performed by inserting a peptide having a helix structure into the linkage site of the fusion protein, for example. In any case, a new helix structure containing regions before and after the linkage site is formed, thereby allowing the fusion protein to be linked more tightly.


The length of the β-sheet structure is not specifically limited as long as the β-sheet structure is formed. In the linkage between β-sheet structures, a new structurally inflexible structure containing the β-sheet structures before and after the linkage site is formed after the linkage, thereby allowing the fusion protein to be linked more tightly.


The fusion protein of the present invention can be as follow, for example:


(A) a fusion protein of a fluorescent protein having a helix structure at the C-terminus and Affimer having a helix structure at the N-terminus {with the proviso that the helix structures of the two proteins are linked to each other};


(B) a fusion protein of the dehalogenation domain of haloalkane dehalogenase having a helix structure at the C-terminus and Affimer having a helix structure at the N-terminus {with the proviso that the helix structures of the two proteins are linked to each other};


(C) a fusion protein of Nanobody having a β-sheet structure at the C-terminus and a fluorescent protein having a β-sheet structure at the N-terminus {with the proviso that the β-sheet structures of the two proteins are linked to each other}; and


(D) a fusion protein of scFv having a β-sheet structure at the C-terminus and a fluorescent protein having a β-sheet structure at the N-terminus {with the proviso that the β-sheet structures of the two proteins are linked to each other}.


The present invention provides a nucleic acid (such as DNA) encoding the fusion protein of the present invention. The nucleic acid encoding the fusion protein of the present invention can be operably linked to at least one regulatory sequence (such as promoter). The present invention provides an expression vector (such as a vector which allows mammalian cells to express the fusion protein) containing a nucleic acid encoding the fusion protein of the present invention and operably linked to at least one regulatory sequence (such as promoter). The regulatory sequence is not specifically limited, but examples thereof include RNA polymerase II promoters such as CMV immediate/early promoter, HSV thymidine kinase promoter, early/late SV40 promoter, retrovirus LTR promoter, and metallothionein I promoter. Examples of promoters to cause expression in Escherichia coli include promoters such as lac, trp, lacI, lacZ, T3, T7, gpt, lambda PR, lambda PL, and 3-phosphoglycerate kinase. Examples of promoters to cause expression in plants include promotors such as cauliflower mosaic virus (CaMV) 35S transcription initiation region, 1′ or 2′ promoter derived from T-DNA of Agrobacterium tumefaciens, and auxin. Examples of promoters to cause expression in Saccharomyces cerevisiae include ADH1, TDH3, PGK1, and GAL1. Examples of promoters to cause expression in fission yeasts include promoters such as adh, tef, nmt1, and CMV. Examples of the expression vector include plasmids, phages, phagemids, cosmids, fosmids, artificial chromosomes, retroviruses, lentiviruses, measles viruses, vaccinia viruses, adenoviruses, adeno-associated viruses, and Sendai viruses. The present invention provides cells (such as animal cells, plant cells, bacteria, archaebacteria, and fungi cells) containing these expression cassettes in their genomes. Other than above, a method for introducing linear or circular DNA encoding the fusion protein of the present invention and operably linked to a regulatory sequence into a cell (after the introduction, a stable expression line may be obtained), a method for introducing linear or circular DNA encoding the fusion protein of the present invention and operably linked to a regulatory sequence into a cellular genome using genome editing techniques (such as TALEN, ZFN, or CRISPR/Cas9 system), or the like may be used.


A method for introducing a nucleic acid into a cell without using a gene expression vector and a method for expressing a protein encoded by a nucleic acid are also known. The fusion protein of the present invention can be expressed in a cell by introducing a nucleic acid (such as a linear nucleic acid, e.g., linear DNA) encoding the fusion protein of the present invention and operably linked to a regulatory sequence (such as a promoter) into the cell. The fusion protein of the present invention can also be expressed in a cell by introducing mRNA encoding the fusion protein of the present invention into a cell.


According to the present invention, the nucleic acid encoding the fusion protein of the present invention can be DNA or messenger RNA (mRNA).


The present invention provides a composition containing a nucleic acid encoding the fusion protein of the present invention. The present invention provides a composition containing a gene expression vector which contains a nucleic acid encoding the fusion protein of the present invention. Such a composition can contain a nucleic acid and/or a gene expression vector, and an excipient. Examples of the excipient include excipients for stabilizing a nucleic acid and/or a gene expression vector. Examples of the excipients include salts, chelating agents, and pH adjusters.


The fusion protein of the present invention is a protein and thus can be introduced into a cell or be forcibly expressed by introduction of a nucleic acid encoding the fusion protein into a cell. Accordingly, the fusion protein of the present invention can be used for observing an antigen in a cell. The present invention can provide a composition containing the fusion protein of the present invention. The present invention can provide a composition (such as a composition for intracellular expression of the fusion protein of the present invention) containing a nucleic acid encoding the fusion protein of the present invention.


The present invention provides a method for observing an antigen (which can be hereinafter read as “method for observing the fusion protein of the present invention”). The method for observing an antigen can include observing the antigen bound to the fusion protein of the present invention. The “antigen” herein is not specifically limited and may mean an entity which can be an immunogen of an antibody, an entity which can be an immunogen of Affimer, an entity which can be an immunogen of Nanobody, or an entity which can be an immunogen of scFv. Such an antigen may bind to the fusion protein of the present invention, for example, in a solution or a cell, or on a cell surface. The antigen may or may not form a complex with another protein. The observation can be generally performed on the antigen bound to the fusion protein of the present invention in a solution or a cell, or on a cell surface. The observation can be performed by microscopy (such as fluorescence microscopy and fluorescence polarization microscopy). In the fusion protein of the present invention, the fluorescent protein and the antigen-binding protein are bound at a constant rotation angle, and the polarization of the fluorescence is constant with respect to the antigen-binding protein. Accordingly, the fusion protein of the present invention can be observed, preferably, by fluorescence polarization microscopy.


According to the present invention, examples of the antigen include antigens of animals, plants, bacteria, fungi, and archaebacteria. Examples of the antigen include intracellular antigens, extracellular antigens, and antigens on cell membranes. The antigen may be in the free form or may form a complex with any other structural component. The antigen may be in the monomer form or in the polymer form. In the present invention, the cell may be any of fertilized eggs, pluripotent stem cells, stem cells, progenitor cells, somatic cells, and germ cells (such as oocytes). In the present invention, the antigen may be components other than cells in organisms, such as components of flagella (for example, sperm or microbial flagella). According to the present invention, the antigen can be a cytoskeleton such as actin, microtubules, septin, and intermediate filaments. According to the present invention, the antigen can be an cytoskeleton such as actins (such as α-actin, β-actin, and γ-actin), microtubule tubulins (such as α-tubulin, β-tubulin, and γ-tubulin), septin, and intermediate filaments (such as keratin, GFAP (glial cell fibrous acidic protein), vimentin, neurofilament-M, neurofilament-L, neurofilament-H, α-internexin, lamin B1, lamin B2, and lamin A) in a cell. According to the present invention, the antigen is not specifically limited, but examples thereof include myosins (myosins such as myosin I, myosin II, myosin III, myosin IV, myosin V, myosin VI, myosin VIII, and myosin IX and heavy chains or light chains thereof), dyneins (such as axonemal dyneins and cytoplasmic dyneins), and kinesins (for example, but not specifically limited to, KIF1A, KIF1Bα, KIF1Bβ, KIF1C, KIF2A, KIF3, KIF4, KIF5, KIF13B, KIF17, KIF26A, KIFC1, KIFC2, and KIFC3, and other kinesins belonging to the kinesin superfamily).


According to the present invention, the antigen is not specifically limited, but examples thereof include membrane proteins (such as receptor proteins, G proteins, ion transporters (ion channels and ion pumps), and transporter proteins). According to the present invention, the antigen is not specifically limited, but examples thereof include BAR domain proteins (such as Amphiphysin, EndophilinA1, FCHO2, Pacsin2, and IRSp53) and ESCRT-III proteins (such as CHMP2A, CHMP2B, CHMP4A, CHMP4B, and CHMP4C). In addition, the antigen is not particularly limited to the above examples and can be various biomolecules (such as proteins, lipids, nucleic acids, hormones, and sugars). According to the present invention, the antigen is not specifically limited and may be a lipid membrane or a lipid membrane domain {for example, fat-soluble fluorescent dyes and lipid-binding motifs can be used}. According to the present invention, cellular components other than the cytoskeleton can also be observed. For example, in the case of performing fluorescence polarization correlation spectroscopy or fluorescence polarization cross-correlation spectroscopy in the present invention, the rotational diffusion of cell components other than the cytoskeleton can be evaluated.


According to the present invention, the method for observing an antigen can include fluorescence correlation spectroscopy (FCS). The fluorescence correlation spectroscopy can comprise observing the molecular motion of the fluorescent entity by microscopy (such as confocal microscopy or a two-photon microscopy). The fluorescence correlation spectroscopy can comprise measuring the fluctuation in fluorescence intensity by irradiating the fusion protein of the present invention with light (such as laser light). The fluorescence correlation spectroscopy can comprise applying inverse Fourier transform to an intensity spectrum (time spectrum) to obtain temporal autocorrelation. The fluorescence correlation spectroscopy can evaluate the diffusion motion (translational diffusion motion) of the fusion protein of the present invention and the number of molecules in the measurement region from the results obtained. The method of the present invention can comprise determining the diffusion coefficient of the fusion protein of the present invention. According to the present invention, changes in fluorescence intensity derived from a plurality of types of fluorescent molecules can be simultaneously measured for detecting binding between a plurality of types of fusion proteins of the present invention. According to the present invention, the method for observing an antigen can comprise observing fluorescence polarization in an embodiment. The fluorescence polarization can be observed by fluorescence polarization microscopy. The rotational diffusion motion of the fusion protein can be evaluated by analyzing the intensity of the polarization through a polarizing plate. The method of the present invention can comprise determining the hydrodynamic radius of the fusion protein of the present invention. According to the present invention, the hydrodynamic radius can be used for evaluating the fluctuation in the structure of the fusion protein of the present invention, the fluctuation in the surrounding microscopic environment, the binding between the fusion protein of the present invention and other proteins, or the binding between the plurality of types of fusion proteins of the present invention.


According to the present invention, the method for observing an antigen can comprise observing each of an antigen bound to a first fusion protein which is the fusion protein of the present invention and a second antigen bound to a second fusion protein which is the fusion protein of the present invention {with the proviso that the first fusion protein and the second fusion protein have different fluorescence wavelengths}, in an aspect. According to the present invention, the method for observing an antigen can involve fluorescence correlation spectroscopy (FCS) of the antigen bound to a first fusion protein which is the fusion protein of the present invention and the second antigen bound to a second fusion protein which is the fusion protein of the present invention {with the proviso that the first fusion protein and the second fusion protein have different fluorescence wavelengths}. According to the present invention, the method for observing an antigen can comprise analyzing simultaneity of fluctuations in fluorescence intensity of two or more antigens (fluorescence cross-correlation spectroscopy (FCCS)). In order to measure the fluorescence intensity of two or more antigens, two or more fusion proteins of the present invention may be used (that is, the first fusion protein and the second fusion protein can be used, and a third fusion protein may be used, and a fourth fusion protein may further be used, and more than four fusion proteins of the present invention may be used, as required, for example (however, they preferably have different fluorescence wavelengths). According to the present invention, the method for observing an antigen can comprise observing fluorescence polarization in an aspect. The fluorescence polarization can be observed by fluorescence polarization microscopy. The advantage of performing fluorescence correlation spectroscopy (FCS) and fluorescence cross-correlation spectroscopy (FCCS) using a fluorescence polarization microscope is that the rotational diffusion motion of fluorescence can be detected. Accordingly, the method for observing an antigen in the present invention may further comprise detecting the rotational diffusion motion of molecules by observing fluorescence polarization, in an aspect. Since the rotational diffusion motion is sensitive to changes in particle size, it is expected that the binding state of two or more antigens can be evaluated more precisely by fluorescence polarization correlation spectroscopy. In the present invention, the method for observing an antigen can comprise evaluating the binding state of two or more antigens in cells or evaluating the binding state of two or more antigens in a solution, in an aspect.


The fluorescence polarization can be appropriately observed by those skilled in the art using a fluorescence polarization microscope. The polarization can be observed, for example, on two or more polarization planes including a first polarization plane and a second polarization plane. The polarization can be observed on four or more polarization planes including a first polarization plane, a second polarization plane, a third polarization plane, and a fourth polarization plane. It can be expected that, as the number of polarization planes to be observed increases, the sensitivity to detect the rotation of fluorescent molecules increases.


In the present invention, the fusion protein of the present invention can be observed by super-resolution microscopy. Examples of the super-resolution microscopy include stimulated emission depletion microscopy (STED), photoactivated localization microscopy (PALM), stochastic optical reconstruction microscopy, scanning near-field optical microscopy (SNOM), structured illumination microscopy, confocal laser microscopy, and two-photon excitation microscopy. In the super-resolution microscopy, the observation target can be observed with a resolution equal to or less than the diffraction limit of light (for example, a resolution of 200 nm or less). Under appropriate observation conditions such as total reflection illumination, fluorescent molecules can be detected as spots at the single molecular level. Also in super-resolution microscopy, antigens can be observed in the same manner as in fluorescence correlation spectroscopy (FCS) and fluorescence cross-correlation spectroscopy (FCCS), and the method using a luminescent entity instead of fluorescence. In the conventional super-resolution microscopy, only a position of an antigen is measured, whereas the structural correlation of antigens can be evaluated in the present invention by simultaneously observing the direction of polarization in addition to the positions of a plurality of antigens. It can be expected that this will be the experimental basis of the point image conversion algorithm which has not been conventionally present in super-resolution microscopy.


The present invention provides a protein having an amino acid sequence according to one sequence number selected from the group consisting of SEQ ID Nos: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, and 43, and a nucleic acid encoding such a protein. The present invention provides a vector for intracellular expression comprising the above nucleic acid.


The present invention provides a method for designing an amino acid sequence of a fusion protein, comprising: providing a first amino acid sequence which is an amino acid sequence of an antigen-binding protein and a second amino acid sequence which is an amino acid sequence of a fluorescent protein, wherein the N-terminus of the first amino acid sequence and the C-terminus of the second amino acid sequence or the C-terminus of the first amino acid sequence and the N-terminus of the second amino acid sequence are both helix structures or together form a helix structure upon excision of a part of the N-terminus or the C-terminus; and linking the N-terminus of the first amino acid sequence and the C-terminus of the second amino acid sequence or the C-terminus of the first amino acid sequence and the N-terminus of the second amino acid sequence, to obtain an amino acid sequence with the junction serving as a part of a helix structure. The fusion protein having the amino acid sequence designed can be suitable for observation of fluorescence polarization. Preferably, the excision of a part of the N-terminus or the C-terminus does not abolish the binding property of the antigen-binding protein and the fluorescence property of the fluorescent protein.


The present invention provides a method for designing an amino acid sequence of a fusion protein, comprising: providing a first amino acid sequence which is an amino acid sequence of an antigen-binding protein and a second amino acid sequence which is an amino acid sequence of a fluorescent protein, wherein the N-terminus of the first amino acid sequence and the C-terminus of the second amino acid sequence or the C-terminus of the first amino acid sequence and the N-terminus of the second amino acid sequence are both β-sheets or together form a β-sheet upon excision of a part of the N-terminus or the C-terminus; and linking the N-terminus of the first amino acid sequence and the C-terminus of the second amino acid sequence or the C-terminus of the first amino acid sequence and the N-terminus of the second amino acid sequence, to obtain an amino acid sequence with the junction serving as a part of a β-sheet structure. The fusion protein having the amino acid sequence designed can be suitable for observation of fluorescence polarization. Preferably, the excision of a part of the N-terminus or the C-terminus does not abolish the binding property of the antigen-binding protein and the fluorescence property of the fluorescent protein.


The present invention provides a method for designing a fusion protein, comprising: providing an amino acid sequence of a protein, wherein at least the N-terminus or the C-terminus of the amino acid sequence has a helix structure; and linking the helix and an amino acid sequence of a helix of an ALFA-tag (the ALFA-tag can have the amino acid sequence set forth in SEQ ID No: 40 or an amino acid sequence of a ALFA-tag corresponding thereto, for example), to obtain an amino acid sequence with the junction serving as a part of a helix structure. The designed fusion protein having the amino acid sequence can be detected by a fluorescent protein linked tightly to an antigen-binding protein that binds to the ALFA-tag and can be used for observation of fluorescence polarization. Preferably, the truncation of a part of the N-terminus or the C-terminus does not delete the functional region of the protein. In an aspect, the above protein can be an antigen-binding protein. In an aspect, the above protein can be a protein to be observed. Examples of the antigen-binding protein that binds to the ALFA-tag include Nanobody, scFv, Affimer, and the aforementioned examples of the antigen-binding protein that binds to the ALFA-tag, and examples of the fluorescent protein also include the aforementioned examples of the fluorescent protein. A fusion protein in which these are linked to each other via the helix structures or the β-sheets (for example, the fusion protein according to any one of (A) to (D) above that binds to the ALFA-tag) can be used for detecting the ALFA-tag. Further, the protein to be detected preferably has a helix structure at the N-terminus or the C-terminus and can be a fusion protein in which the helix structure is linked to the helix structure of the ALFA-tag so that the linkage site serves as a part of a helix structure.


The present invention provides a method comprising obtaining a nucleic acid sequence encoding the fusion protein, based on the amino acid sequence designed. The present invention can provide a method for producing a nucleic acid having a nucleic acid sequence encoding the amino acid sequence designed. The present invention can provide a nucleic acid (for example, DNA or mRNA) having a nucleic acid sequence encoding the amino acid sequence designed. The present invention further provides a gene expression vector comprising the nucleic acid having the nucleic acid sequence obtained. In the gene expression vector, the nucleic acid can be operably linked to a promoter. The gene expression vector obtained can be introduced into a cell so that the vector can be expressed, and thus can be used for synthesizing a fusion protein having the amino acid sequence designed in a cell. When the fusion protein obtained is suitable for observation of fluorescence polarization, the fusion protein and the nucleic acid encoding the protein can be useful for observation of fluorescence polarization.


The present invention provides a method for producing a fusion protein, based on the amino acid sequence designed as above.


EXAMPLES
Example 1: Design and Observation of Fusion Protein of Circular Permutated Mutant, cp-sfGFP, and Affimer

In this example, a fusion protein of a circular permutated mutant, cp-sfGFP, and an Affimer was designed.


The Affimer is a protein derived from stefin or cystatin and consisting of an α-helix and four folded β-sheet structures in contact with the α-helix. The four folded β-sheets are stabilized by the α-helix, and have a rigid tertiary structure. It is known that the four β-sheets form three loops, and amino acids can be introduced into two of them on the same side. It is also known that the Affimer can acquire selective binding affinity for antigens like antibodies, depending on the amino acids introduced. It has been also revealed that an Affimer exhibiting binding affinity for a desired antigen can be designed by techniques such as phage display.


In this example, a cp-sfGFP having a helix (considered to be a 3/10 helix) at the C-terminus and an Affimer having an α-helix at the N-terminus were linked through the helix at the C-terminus and the helix at the N-terminus via a helix linker, to obtain a fusion protein (see FIG. 1A).



FIG. 1B shows an expected three-dimensional structural model of the fusion protein obtained.


As the cp-sfGFP, a cp-sfGFP having the amino acid sequence set forth in SEQ ID No: 1 was used. As the Affimer, an Affimer having the amino acid sequence set forth in SEQ ID No: 2 was used. This Affimer can bind selectively to actin fibers (F-actin). Using a genetic engineering technique, the amino acid sequence set forth in SEQ ID No: 1 and the amino acid sequence set forth in SEQ ID No: 2 were linked in-frame via a helix linker having the amino acid sequence set forth in SEQ ID No: 3, thereby designing a fusion protein having the amino acid sequence set forth in SEQ ID No: 4. SEQ ID No: 4 contains the amino acid sequence of a HA tag at the C-terminus.


The fusion protein designed was produced in Escherichia coli and purified, and the fusion protein obtained was crystallized and subjected to X-ray crystal structure analysis. From the resultant X-ray diffraction image, it was revealed that the fusion protein had the crystal structure shown in FIG. 1C. As shown in FIG. 1C, it was revealed that the cp-sfGFP and the Affimer formed one new helix by linking the helix at the C-terminus and the helix at the N-terminus.


An expression vector in which DNA encoding the fusion protein designed was placed under an appropriate promoter was introduced by a lipofection method (Lipofectamine 3000 reagent), thereby allowing HeLa cells to express the above fusion protein. The cells expressing the fusion protein were observed by fluorescence polarization microscopy. It was expected that, when the fusion protein was bound to the actin fibers, if the fluorescence polarization planes of the fusion protein bound were aligned, the polarization (anisotropy) of fluorescence would be observed. Using a polarization beam splitter, images of the vertical observation polarization planes and the horizontal observation polarization planes were simultaneously obtained and compared. The results were as shown in FIG. 2. As shown in FIG. 2, an actin fiber bundle (see the arrowhead) in which a low fluorescence intensity was observed when the observation polarization plane was horizontal showed a high fluorescence intensity when the observation polarization plane was vertical, whereas an actin fiber bundle (see the arrow) in which a high fluorescence intensity was observed when the observation polarization plane was horizontal showed a low fluorescence intensity when the observation polarization plane was vertical. These facts indicated that the polarization planes of the fusion protein bound to actin fibers were aligned. The same experiment was performed using LLC-PK1 cells, Ptk2 cells, yeast cells, starfish oocytes, or fertilized eggs instead of HeLa cells, and then, it was confirmed that polarization of the above fusion protein can be observed in these cells.


Affimers are considered to bind to actin fibers in a uniformly aligned orientation. This is because the actin fibers are polymers composed of repeating actin monomers. The fact that the polarization planes of the fusion protein are aligned indicates that a cp-sfGFP in the fusion protein is firmly immobilized on an Affimer. Accordingly, the method for linking the helix at the C-terminus and the helix at the N-terminus in the fusion protein of the present invention provides a methodology in which a cp-sfGFP is immobilized on an Affimer, and the polarization planes of the cp-sfGFP are immobilized on the Affimer. It was shown that production of a circular permutated mutant in which the helix at the N-terminus of sfGFP is moved to the C-terminus so as to be linked to the α-helix at the N-terminus of Affimer can be useful for the methodology for immobilization.


This enables construction of a fluorescence labeling method in which binding to a desired antigen and polarized emission occur. The effectiveness of the strategy of the fluorescence polarization labeling method was further confirmed by the following examples.


Example 2: Design and Observation of Fusion Protein of Circular Permutated Mutant, cpmVenus, and Affimer

Using the strategy disclosed in Example 1, a cpmVenus and an Affimer were linked. Specifically, the 3/10 helix present at the C-terminus of the cpmVenus and the α-helix present at the N-terminus of the Affimer were linked in-frame, to design a fusion protein.


As the cpmVenus, a fluorescent protein having the amino acid sequence set forth in SEQ ID No: 5 was used. As the Affimer, an Affimer having the same amino acid sequence as in Example 1 was used. Using a genetic engineering technique, the amino acid sequence set forth in SEQ ID No: 5 and the amino acid sequence set forth in SEQ ID No: 2 were linked in-frame via alanine (A), thereby designing a fusion protein of cpmVenus and Affimer having the amino acid sequence set forth in SEQ ID No: 6. SEQ ID No: 6 contains the amino acid sequence of a HA tag at the C-terminus.


An expression vector in which DNA encoding the fusion protein designed was disposed under an appropriate promoter was introduced by a lipofection method (Avalanche-Everyday Transfection Reagent), thereby allowing HeLa cells to express the above fusion protein. The cells expressing the fusion protein were observed by fluorescence polarization microscopy. The results were as shown in FIG. 3. As shown in FIG. 3, an actin fiber bundle in which a low fluorescence intensity was observed when the observation polarization plane was horizontal showed a high fluorescence intensity when the observation polarization plane was vertical, whereas an actin fiber bundle in which a high fluorescence intensity was observed when the observation polarization plane was horizontal showed a low fluorescence intensity when the observation polarization plane was vertical. This indicated that the fusion protein obtained by directly linking the 3/10 helix present at the C-terminus of the cpmVenus and the α-helix present at the N-terminus of the Affimer in-frame also showed polarization.


Example 3: Design and Observation of Fusion Protein of Circular Permutated Mutant, cpmTurquoise2, and Affimer

Using the strategy disclosed in Example 1, a cpmTurquoise2 and an Affimer were linked. Specifically, the 3/10 helix present at the C-terminus of the cpmTurquoise2 and the α-helix present at the N-terminus of the Affimer were linked in-frame, to design a fusion protein.


As the cpmTurquoise2, a fluorescent protein having the amino acid sequence set forth in SEQ ID No: 7 was used. As the Affimer, an Affimer having the same amino acid sequence as in Example 1 was used. Using a genetic engineering technique, the amino acid sequence set forth in SEQ ID No: 7 and the amino acid sequence set forth in SEQ ID No: 2 were linked in-frame via alanine (A), thereby designing a fusion protein of cpmTurquoise2 and Affimer having the amino acid sequence set forth in SEQ ID No: 8. SEQ ID No: 8 contains the amino acid sequence of a HA tag at the C-terminus.


An expression vector in which DNA encoding the fusion protein designed was disposed under an appropriate promoter was introduced by a lipofection method (Avalanche-Everyday Transfection Reagent), thereby allowing HeLa cells to express the above fusion protein. The cells expressing the fusion protein were observed by fluorescence polarization microscopy. The results were as shown in FIG. 4. As shown in FIG. 4, an actin fiber bundle in which a low fluorescence intensity was observed when the observation polarization plane was horizontal showed a high fluorescence intensity when the observation polarization plane was vertical, whereas an actin fiber bundle in which a high fluorescence intensity was observed when the observation polarization plane was horizontal showed a low fluorescence intensity when the observation polarization plane was vertical. This indicated that the fusion protein obtained by linking the 3/10 helix present at the C-terminus of the cpmTurquoise2 and the α-helix present at the N-terminus of the Affimer in-frame also showed polarization.


From Examples 1 to 3 above, the strategy of linking the helix at the C-terminus of a fluorescent protein and the helix at the N-terminus of an Affimer enables the fluorescence polarization planes to be immobilized on the Affimer and provides a new technique for observation of fluorescence polarization of target proteins (that is, Affimer antigens).


Example 4: Design and Observation of Fusion Protein of Fluorescence-Labeled Tag and Affimer

Using the strategy disclosed in Examples above, a fluorescence-labeled tag and an Affimer were linked. As the fluorescence-labeled tag, a HaloTag was used. The HaloTag is a protein labeling technique which enables binding to various cell membrane-permeable fluorescent ligands. The HaloTag can be recruited to a desired antigen by linking the HaloTag and an Affimer. An attempt was made to apply the HaloTag to observation of fluorescence polarization by using the strategy disclosed in Examples above.


(1) Fusion Protein (First)

As the HaloTag, a protein having the amino acid sequence set forth in SEQ ID No: 9 was used. As the Affimer, an Affimer having the same amino acid sequence as in Example 1 was used. Using a genetic engineering technique, the amino acid sequence set forth in SEQ ID No: 9 and the amino acid sequence set forth in SEQ ID No: 2 were linked in-frame via a helix linker having the amino acid sequence of SEQ ID No: 10, thereby designing a fusion protein of HaloTag and Affimer having the amino acid sequence set forth in SEQ ID No: 11. The HaloTag has an α-helix structure at the C-terminus, as shown in FIG. 5. In this example, the α-helix at the C-terminus and the α-helix of the Affimer at the N-terminus were linked via the above helix linker (see FIG. 5).


An expression vector in which DNA encoding the fusion protein designed was disposed under an appropriate promoter was introduced by a lipofection method (Lipofectamine 3000 reagent), thereby allowing HeLa cells to express the above fusion protein. As the HaloTag ligand, an Oregon Green ligand was used, and the cells expressing the fusion protein were observed by fluorescence polarization microscopy. The results were as shown in FIG. 6A. As shown in FIG. 6A, an actin fiber bundle in which a low fluorescence intensity was observed when the observation polarization plane was horizontal showed a high fluorescence intensity when the observation polarization plane was vertical, whereas an actin fiber bundle in which a high fluorescence intensity was observed when the observation polarization plane was horizontal showed a low fluorescence intensity when the observation polarization plane was vertical. This indicated that the fusion protein obtained by directly linking the α-helix present at the C-terminus of the HaloTag and the α-helix present at the N-terminus of the Affimer in-frame also showed polarization. From the fact that polarization was shown, it was considered that there was tightly immobilized binding among the fluorescence dye, the HaloTag, and the Affimer, and the binding had almost no flexibility.


It was revealed from this that, when the α-helix present at the C-terminus of the HaloTag and the α-helix present at the N-terminus of the Affimer were linked by the strategy of this example, the HaloTag firmly bound to the Affimer, thereby enabling observation of fluorescence polarization via the fluorescence label bound to the HaloTag.


(2) Fusion Protein (Second; with the Number of Turns Changed)


The α-helix is a secondary structure of a protein in which one turn is composed of 3.5 amino acids. It was examined whether the polarization planes rotate by linking α-helices to each other with an insertion of an amino acid (the number of turns changed).


As the HaloTag, a fluorescent protein having the amino acid sequence set forth in SEQ ID No: 9 was used. As the Affimer, an Affimer having the same amino acid sequence as in Example 1 was used. Using a genetic engineering technique, the amino acid sequence set forth in SEQ ID No: 9 and the amino acid sequence set forth in SEQ ID No: 2 were linked in-frame via a helix linker having the amino acid sequence of SEQ ID No: 12, thereby designing a fusion protein of HaloTag and Affimer having the amino acid sequence set forth in SEQ ID No: 13. The length of the helix linker was different from the fusion protein shown in FIG. 6A by one amino acid (that is, the rotation angle was different by about 100 degrees). As will be understood by comparing FIG. 6A and FIG. 6B, the bond angle of the Affimer to the HaloTag was different between the fusion proteins of FIG. 6A and FIG. 6B.


An expression vector in which DNA encoding the fusion protein designed was placed under an appropriate promoter was introduced by a lipofection method (Lipofectamine 3000 reagent), thereby allowing HeLa cells to express the above fusion protein. As the HaloTag ligand, an Oregon Green ligand was used, and the cells expressing the fusion protein were observed by fluorescence polarization microscopy. The results were as shown in FIG. 6B. As shown in FIG. 6B, an actin fiber bundle in which a low fluorescence intensity was observed when the observation polarization plane was horizontal showed a high fluorescence intensity when the observation polarization plane was vertical, whereas an actin fiber bundle in which a high fluorescence intensity was observed when the observation polarization plane was horizontal showed a low fluorescence intensity when the observation polarization plane was vertical. This indicated that the fusion protein obtained by directly linking the α-helix present at the C-terminus of the HaloTag and the α-helix present at the N-terminus of the Affimer in-frame also showed polarization.


It was clear from this that, even if the α-helix present at the C-terminus of the HaloTag and the α-helix present at the N-terminus of the Affimer were linked with the number of turns changed, the fluorescence polarization labeling strategy was established. The models disclosed in FIG. 6A and FIG. 6B demonstrated that the rotation angle of polarization could be freely changed in principle by changing the number of turns.


(3) Fusion Protein (Third; with the Number of Turns Further Changed)


The α-helices were linked to each other with an insertion of a further amino acid (the number of turns further changed).


As the HaloTag, a fluorescent protein having the amino acid sequence set forth in SEQ ID No: 9 was used. As the Affimer, an Affimer having the same amino acid sequence as in Example 1 was used. Using a genetic engineering technique, the amino acid sequence set forth in SEQ ID No: 9 and the amino acid sequence set forth in SEQ ID No: 2 were linked in-frame via a helix linker having the amino acid sequence of SEQ ID No: 14, thereby designing a fusion protein of HaloTag and Affimer having the amino acid sequence set forth in SEQ ID No: 15. The length of the helix linker was different from the fusion protein shown in FIG. 6A by 2 amino acids (that is, the rotation angle was different by about 200 degrees). As will be understood by comparing FIG. 6A, FIG. 6B, and FIG. 6C, the bond angle of the Affimer to the HaloTag was different among the fusion proteins of FIG. 6A, FIG. 6B, and FIG. 6C.


An expression vector in which DNA encoding the fusion protein designed was placed under an appropriate promoter was introduced by a lipofection method (Lipofectamine 3000 reagent), thereby allowing HeLa cells to express the above fusion protein. As the HaloTag ligand, an Oregon Green ligand was used, and the cells expressing the fusion protein were observed by fluorescence polarization microscopy. The results were as shown in FIG. 6C. As shown in FIG. 6C, an actin fiber bundle in which a low fluorescence intensity was observed when the observation polarization plane was horizontal showed a high fluorescence intensity when the observation polarization plane was vertical, whereas an actin fiber bundle in which a high fluorescence intensity was observed when the observation polarization plane was horizontal showed a low fluorescence intensity when the observation polarization plane was vertical. This indicated that the fusion protein obtained by directly linking the α-helix present at the C-terminus of the HaloTag and the α-helix present at the N-terminus of the Affimer in-frame also showed polarization. It should be noted that, even if a TMR ligand or a HMSiR ligand was used as the HaloTag ligand instead of the Oregon Green ligand, strong polarization was observed, and polarization could be observed regardless of the fluorescent entity used as the HaloTag ligand.


It was further confirmed from this that, even if the α-helix present at the C-terminus of the HaloTag and the α-helix present at the N-terminus of the Affimer were linked with the number of turns changed, the fusion proteins can work in the fluorescence polarization labeling strategy.


Example 5: Design and Observation of Fusion Protein of Circular Permutated Mutant, Cp-sfGFP, and Nanobody

In this example, a fusion protein of a fluorescent protein and a Nanobody was designed.


Being different from common antibodies found in humans, rabbits, and mice (antibodies with heavy and light chains), Nanobodies, which are antibodies consisting only of heavy chains (for example, an antibody of cartilaginous fishes such as sharks and chimaeras, and antibodies of camels, Arabian camels, llamas, alpacas, and the like), include those capable of binding to antigens only in the heavy chain variable regions, and those derived from the heavy chain variable regions and the light chain variable regions. Peptides containing these variable regions have a molecular weight of about 1/10 (approximately 12 to 15 kDa) of ordinary IgG antibodies and are called Nanobodies. It has also been revealed that Nanobodies exhibiting binding affinity for desired antigens can be designed by a technique such as phage display.


The sfGFP (superfolder GFP) is a variant of GFP obtained from Aequorea victoria. It is characterized by rapidly forming a structure (folding) in cells. In this example, an attempt to link a sfGFP and a Nanobody through their β-sheets was made. Since the Nanobody had a β-sheet at the C-terminus, a circular permutated mutant having a β-sheet at the N-terminus, cp-sfGFP (Specifically, cp10/8 (10/9) sfGFP), was constructed.


As the circular permutated mutant having a β-sheet at the N-terminus, cp-sfGFP, a fluorescent protein having the amino acid sequence set forth in SEQ ID No: 1 was used. As the Nanobody, a vimentin-binding Nanobody having the amino acid sequence set forth in SEQ ID No: 16 was used. Using a genetic engineering technique, the β-sheet at the N-terminus of the cp-sfGFP having the amino acid sequence set forth in SEQ ID No: 1 and the β-sheet at the C-terminus of the vimentin-binding Nanobody having the amino acid sequence set forth in SEQ ID No: 16 were linked, to design a fusion protein having the amino acid sequence set forth in SEQ ID No: 17. The amino acid sequence set forth in SEQ ID No: 17 had a His tag and a T7 tag at the N-terminus. FIG. 7A shows an expected structural model of the fusion protein. In this model, the site where the Nanobody bound to antigens was present in the opposite direction to that of the fluorescent protein, thereby ensuring the accessibility to the antigens.


An expression vector in which DNA encoding the fusion protein designed was placed under an appropriate promoter was introduced by a lipofection method (Avalanche-Everyday Transfection Reagent), thereby allowing HeLa cells to express the above fusion protein. The cells expressing the fusion protein were observed by fluorescence polarization microscopy. The results were as shown in FIG. 7B. As shown in FIG. 7B, vimentin fibers in which a low fluorescence intensity was observed when the observation polarization plane was horizontal showed a high fluorescence intensity when the observation polarization plane was vertical, whereas vimentin fibers in which a high fluorescence intensity was observed when the observation polarization plane was horizontal showed a low fluorescence intensity when the observation polarization plane was vertical. In this way, the fusion protein produced could stain vimentin fibers, and its fluorescence exhibited polarization.


This suggested that the strategy to obtain a fusion protein by linking the β-sheet at the N-terminus of the cp-sfGFP and the β-sheet at the C-terminus of the Nanobody could be useful for fluorescence polarization labeling.


Example 6: Design and Observation of Fusion Protein of Nanobody and Cp176-172mVenus

Further, a fusion protein (SEQ ID No: 31; the amino acid sequence set forth in SEQ ID No: 31 had a His tag and a T7 tag at the N-terminus) in which a Nanobody that binds to vimentin (SEQ ID No: 16) and a cp176-172mVenus (SEQ ID No: 30) were linked via VV was designed (see the panel in the upper part of FIG. 7C) and expressed in HeLa cells in the same manner as in Example 5. As the cells expressing the fusion protein were observed by fluorescence polarization microscopy, intracellular vimentin could be observed, and its fluorescence exhibited polarization, as shown in FIG. 7C. The orientation of polarization was different from that in the previous cp-sfGFP. The cp-sfGFP of Example 5 and the cp176-172mVenus used in this example were basically the same type of fluorescent proteins but different in the position of cp. Accordingly, this example indicated that the angle of polarization can be changed by creating circular permutated mutants of the fluorescent protein at various positions.


Example 7: Design and Observation of Fusion Protein of mApple and Nanobody

Then, a fusion protein of another fluorescent protein, mApple, and a Nanobody was designed.


As the mApple, a fluorescent protein having the amino acid sequence set forth in SEQ ID No: 18 was used. As the Nanobody, a vimentin-binding Nanobody having the amino acid sequence set forth in SEQ ID No: 16 was used. Using a genetic engineering technique, the β-sheet at the N-terminus of the mApple having the amino acid sequence set forth in SEQ ID No: 18 and the β-sheet at the C-terminus of the vimentin-binding Nanobody having the amino acid sequence set forth in SEQ ID No: 16 were linked via an inflexible linker (“rigid linker”), thereby designing a fusion protein having the amino acid sequence set forth in SEQ ID No: 19. The amino acid sequence set forth in SEQ ID No: 19 had a His tag and a T7 tag at the N-terminus. FIG. 8A shows an expected structural model of the fusion protein. In this model, the site where the Nanobody binds to antigens is present in the opposite direction to the fluorescent protein, thereby ensuring the accessibility to the antigens.


An expression vector in which DNA encoding the fusion protein designed was disposed under an appropriate promoter was introduced by a lipofection method (Avalanche-Everyday Transfection Reagent), thereby allowing HeLa cells to express the above fusion protein. The cells expressing the fusion protein were observed by fluorescence polarization microscopy. The results were as shown in FIG. 8B. As shown in FIG. 8B, vimentin fibers in which a low fluorescence intensity was observed when the observation polarization plane was horizontal showed a high fluorescence intensity when the observation polarization plane was vertical, whereas vimentin fibers in which a high fluorescence intensity was observed when the observation polarization plane was horizontal showed a low fluorescence intensity when the observation polarization plane was vertical. In this way, the fusion protein produced could stain vimentin fibers, and its fluorescence exhibited polarization.


In this way, it was revealed that the fluorescent protein can be labeled with the Nanobody without losing the polarization by utilizing the β-sheet structure at the C-terminus of the Nanobody and linking it to the β-sheet structure present at the N-terminus of the fluorescent protein.


Example 8: Design and Observation of Fusion Protein of cp-sfGFP and scFv

In this example, a cp-sfGFP having a β-sheet structure at the N-terminus was designed and linked to the β-sheet structure present at the C-terminus of a scFv, to obtain a fusion protein.


The scFv is a protein having a structure in which heavy-chain variable regions and light-chain variable regions of antibodies are linked by a flexible linker. In the case of production from antibodies by a genetic engineering technique, the binding property of the antibodies can be maintained. At present, a technique for producing a scFv by phage display is well known, which enables a scFv having a desired binding property to a desired antigen to be obtained.


As the cp-sfGFP, a circular permutated mutant of sfGFP having the amino acid sequence set forth in SEQ ID No: 1 was used. As the scFv, a scFv that has the amino acid sequence set forth in SEQ ID No: 20 and binds to non-muscle myosin IIA was used. Using a genetic engineering technique, the β-sheet structure at the N-terminus of the sfGFP having the amino acid sequence set forth in SEQ ID No: 1 and the β-sheet structure at the C-terminus of the scFv having the amino acid sequence set forth in SEQ ID No: 20 were directly linked, to design a fusion protein having the amino acid sequence set forth in SEQ ID No: 21. The amino acid sequence set forth in SEQ ID No: 21 had a His tag and a T7 tag at the N-terminus. FIG. 9A shows an expected three-dimensional structure of the fusion protein. The antigen recognition site of the scFv fusion protein was exposed on the opposite side of the cp-sfGFP fused, thereby ensuring the accessibility of antigens to the antigen recognition site.


An expression vector in which DNA encoding the fusion protein designed was placed under an appropriate promoter was introduced by a lipofection method (Avalanche-Everyday Transfection Reagent), thereby allowing HeLa cells to express the above fusion protein. The cells expressing the fusion protein were observed by fluorescence polarization microscopy. The results were as shown in FIG. 9B. As shown in FIG. 9B, a part in which a low fluorescence intensity was observed when the observation polarization plane was horizontal showed a high fluorescence intensity when the observation polarization plane was vertical, whereas a part in which a high fluorescence intensity was observed when the observation polarization plane was horizontal showed a low fluorescence intensity when the observation polarization plane was vertical. In this way, the fusion protein produced could stain myosin, and its fluorescence exhibited polarization.


Further, a fusion protein (SEQ ID No: 32) in which a scFv (SEQ ID No: 20) and a cp176-172mVenus (SEQ ID No: 30) were linked via VV was designed (see the panel in the upper part of FIG. 9C) and expressed in HeLa cells in the same manner. As the cells expressing the fusion protein were observed by fluorescence polarization microscopy, intracellular myosin could be observed, and its fluorescence exhibited polarization, as shown in FIG. 9C. The orientation of polarization was different from that in the above-described cp-sfGFP-based fusion protein. The cp-sfGFP and the cp176-172mVenus in this example were basically the same type of fluorescent proteins but different in the position of cp. Accordingly, this example indicated that the angle of polarization can be changed by creating circular permutated mutants of the fluorescent protein at various positions.


In this way, it was revealed that the fluorescent protein can be labeled with the scFv without losing the polarization by utilizing the β-sheet structure at the C-terminus of the scFv and linking it to the β-sheet structure present at the N-terminus of the fluorescent protein.


Example 9: Design and Observation of Fusion Protein of Affimer and mScarlet

The N-terminus of a mScarlet was truncated and linked to the β-sheet at the C-terminus of an Affimer, to obtain a fusion protein of Affimer and mScarlet. As the mScarlet, a protein having the amino acid sequence set forth in SEQ ID No: 36, created by truncating amino acids at the N-terminus was used. As the Affimer, an actin-binding Affimer having the amino acid sequence set forth in SEQ ID No: 2 was used. Using a genetic engineering technique, these were linked, to design a fusion protein having the amino acid sequence set forth in SEQ ID No: 37.


The fusion protein obtained is shown by a three-dimensional structural model in FIG. 11. An expression vector in which DNA encoding the fusion protein designed was placed under an appropriate promoter was introduced by a lipofection method (Lipofectamine 3000 reagent), thereby allowing HeLa cells to express the above fusion protein. The cells expressing the fusion protein were observed by fluorescence polarization microscopy.


The results were as shown in FIG. 11. As shown in FIG. 11, the images observed with vertical polarization and horizontal polarization were different, and this revealed that the fusion protein obtained was a fluorescent protein that binds to an antigen (actin) and exhibit polarization under polarization microscopic observation.


Example 10: Design and Observation of Fusion Protein of Affimer and mNeonGreen

The β-sheet at the N-terminus of a mNeonGreen was exposed and linked to the β-sheet at the C-terminus of an Affimer, to obtain a fusion protein of Affimer and mNeonGreen. As the mNeonGreen, a protein having the amino acid sequence set forth in SEQ ID No: 38, created by truncating amino acids at the N-terminus was used. As the Affimer, an actin-binding Affimer having the amino acid sequence set forth in SEQ ID No: 2 was used. Using a genetic engineering technique, these were linked, to design a fusion protein having the amino acid sequence set forth in SEQ ID No: 39.


The fusion protein obtained is shown by a three-dimensional structural model in FIG. 12. In FIG. 12, it can be seen that the β-sheet at the C-terminus of the Affimer and the β-sheet at the N-terminus of the mNeonGreen form a stretch of β-sheets. An expression vector in which DNA encoding the fusion protein designed was placed under an appropriate promoter was introduced by a lipofection method (Lipofectamine 3000 reagent), thereby allowing HeLa cells to express the above fusion protein. The cells expressing the fusion protein were observed by fluorescence polarization microscopy.


The results were as shown in FIG. 12. As shown in FIG. 12, the images observed with vertical polarization and horizontal polarization were different, and this revealed that the fusion protein obtained was a fluorescent protein that binds to an antigen (actin) and exhibit polarization under polarization microscopic observation.


Example 11: Technique for Observing Fluorescence Polarization by Introducing ALFA-Tag into Protein

In this example, an attempt was made to establish a technique for enabling fluorescence polarization observation of a protein of interest without obtaining a protein binding to the protein. Specifically, a system to label the protein to be observed with an ALFA-tag and detect the protein using a Nanobody that binds to the ALFA-tag was established.


Specifically, an ALFA-tag (SEQ ID No: 40) was introduced into an Affimer (SEQ ID No: 2) that binds to F-actin as a model system for verifying the above detection system, to obtain a fusion protein having the amino acid sequence set forth in SEQ ID No: 41. The three-dimensional structural model of the fusion protein was as shown in FIG. 13. This fusion protein was expressed in HeLaM cells. A fusion protein of an anti-ALFA-tag Nanobody (SEQ ID No: 42) and cp176-172mVenus (SEQ ID No: 43) were brought into contact, and fluorescence polarization was observed. The fusion protein of anti-ALFA-tagged Nanobody and cp176-172mVenus was a fusion protein suitable for observation of fluorescence polarization, in which the Nanobody and the cp176-172mVenus were tightly linked (c.f. Example 5).


The results were as shown in FIG. 13. As shown in FIG. 13, the images observed with vertical polarization and horizontal polarization were different, and this revealed that the fusion protein obtained was a fluorescent protein that binds to an antigen (actin) to exhibit polarization under polarization microscopic observation. In an embodiment of this example, first, the Affimer firmly recognized the F-actin. Then, the Affimer and the ALFA-tag were tightly linked, and the Nanobody firmly recognized the ALFA-tag. Thereby, the interaction and linkage between any proteins became tight, therefore it was concluded the F-actin could be observed using the fluorescence polarization of the cp176-172mVenus.


Example 12: Other Examples

In addition, it was examined whether fluorescence polarization labeling was possible in the same manner by designing the fusion proteins shown in Table 1 below. In any of the following cases, it was confirmed that antigens could be fluorescently labeled, and the fluorescence clearly showed polarization and could be used as a fluorescence polarization labeling. Further, the Affimer, the Nanobody, and the scFv could be used as antigen-binding proteins. In Table 1, utrophin is a protein known as a protein that binds to actin.









TABLE 1







Examples of fluorescence polarization labeling









Protein at N terminus
Protein at C terminus
Linkage form


(Number of amino acid
(Number of amino acid
(Number of amino acid


sequence)
sequence)
sequence of fusion protein)





miRFP670
Affimer that binds to actin
Linkage between α-helices


(SEQ ID No: 22)
(SEQ ID No: 2)
(SEQ ID No: 24)




*α-helices were linked via




helix linker (SEQ ID No: 23).


Nanobody that binds to
cpmTurquoise2
Linkage between β-sheets


vimentin
(SEQ ID No: 7)
(SEQ ID No: 25)


(SEQ ID No: 16)

*Amino acid sequence set




forth in SEQ ID No: 25 had




His tag and T7 tag at N




terminus.


Nanobody that binds to
cpmVenus
Linkage between β-sheets


vimentin
(SEQ ID No: 5)
(SEQ ID No: 26)


(SEQ ID No: 16)

*Amino acid sequence set




forth in SEQ ID No: 26 had




His tag and T7 tag at N




terminus.


scFv that binds to non-
cpmTurquoise2
Linkage between β-sheets


muscle myosin IIA
(SEQ ID No: 7)
(SEQ ID No: 27)


(SEQ ID No: 20)

*Amino acid sequence set




forth in SEQ ID No: 27 had




His tag and T7 tag at N




terminus.


Actin binding site of utrophin
mEGFP
Linkage between α-helices


(SEQ ID No: 33)
(SEQ ID No: 34)
(SEQ ID No: 35)




*Helix linker of SEQ ID No: 3




was used as linker.




Results are shown in FIG.




10.









As has been described above, all of the fusion proteins designed in Examples were suitable for fluorescence observation of antigens and all were suitable for fluorescence polarization observation. It was considered to be advantageous in the observation of polarization that fluorescence labels and antigens were linked in a constrained manner.


According to the results of Examples, it can be effective in fluorescence polarization labeling that a helix structure at a terminus of a fluorescent protein or a fluorescence-labeled tag protein and a helix structure at a terminus of an antigen-binding protein are linked, and that a β-sheet structure at a terminus of a fluorescent protein and a β-sheet structure at a terminus of an antigen-binding protein are linked. Regarding fluorescent proteins with no helix or no β-sheet structure at a terminus, it can be a useful technique for protein modification to produce a circular permutated mutant of a fluorescent protein so as to have a helix or a β-sheet structure at a terminus, for example.


Further, it was revealed that helices might be directly linked to each other or might be linked via a linker that may have a helix structure. In the case of linkage via a linker that may have a helix structure, the bond angle of the two protein parts in the fusion protein can be rotated by about 100 degrees by changing the length of the linker by one amino acid, thereby possibly widening the range of the strategy of techniques for producing fusion proteins. It was revealed that, also in the case of linkage between β-sheet structures, they might be directly linked, might be linked by deleting a terminus, or might be linked via a rigid amino acid residue such as valine (V). It is advantageous for observing polarization that a fluorescence label and an antigen were linked in a constrained manner, and it turned out that all the various linking methods shown in Examples were effective for observing polarization.


SEQUENCE LISTING

SEQ ID No: 1: Example of amino acid sequence of cp-sfGFP


SEQ ID No: 2: Example of amino acid sequence of Affimer that binds to actin


SEQ ID No: 3: Example of amino acid sequence of helix linker


SEQ ID No: 4: Example of amino acid sequence of fusion protein of cp-sfGFP and Affimer


SEQ ID No: 5: Example of amino acid sequence of cpmVenus


SEQ ID No: 6: Example of amino acid sequence of fusion protein of cpmVenus and Affimer


SEQ ID No: 7: Example of amino acid sequence of cpmTurquoise2


SEQ ID No: 8: Example of amino acid sequence of fusion protein of cpmTurquoise2 and Affimer


SEQ ID No: 9: Example of amino acid sequence of HaloTag


SEQ ID No: 10: Example of amino acid sequence of helix linker 2


SEQ ID No: 11: Example of amino acid sequence of fusion protein of HaloTag and Affimer


SEQ ID No: 12: Example of amino acid sequence of helix linker 3


SEQ ID No: 13: Example of amino acid sequence of fusion protein of HaloTag and Affimer


SEQ ID No: 14: Example of amino acid sequence of helix linker 4


SEQ ID No: 15: Example of amino acid sequence of fusion protein of HaloTag and Affimer


SEQ ID No: 16: Example of amino acid sequence of Nanobody that binds to vimentin


SEQ ID No: 17: Example of amino acid sequence of fusion protein of Nanobody and cp-sfGFP


SEQ ID No: 18: Amino acid sequence of mApple


SEQ ID No: 19: Example of amino acid sequence of fusion protein of Nanobody and mApple


SEQ ID No: 20: Example of amino acid sequence of scFv that binds to non-muscle myosin IIA


SEQ ID No: 21: Example of amino acid sequence of fusion protein of scFv and cp-sfGFP


SEQ ID No: 22: Amino acid sequence of miRFP670


SEQ ID No: 23: Example of amino acid sequence of helix linker 5


SEQ ID No: 24: Example of amino acid sequence of fusion protein of miRFP670 and Affimer


SEQ ID No: 25: Example of amino acid sequence of fusion protein of Nanobody and cpmTurquoise2


SEQ ID No: 26: Example of amino acid sequence of fusion protein of Nanobody and cpmVenus


SEQ ID No: 27: Example of amino acid sequence of fusion protein of scFv and cpmTurquoise2


SEQ ID No: 28: Amino acid sequence of Haloalkane dehalogenase


SEQ ID No: 29: Amino acid sequence of H272F mutant of Haloalkane dehalogenase


SEQ ID No: 30: Example of amino acid sequence of cp176-172mVenus


SEQ ID No: 31: Example of amino acid sequence of fusion protein of Nanobody and cp176-172mVenus


SEQ ID No: 32: Example of amino acid sequence of fusion protein of scFV and cp176-172mVenus


SEQ ID No: 33: Example of amino acid sequence of actin binding domain of utrophin


SEQ ID No: 34: Example of amino acid sequence of mEGFP


SEQ ID No: 35: Example of amino acid sequence of fusion protein of actin binding domain of utrophin and mEGFP


SEQ ID No: 36: Example of amino acid sequence of mScarlet fragment with N-terminus excised


SEQ ID No: 37: Example of amino acid sequence of fusion protein of Affimer that binds to actin and mScarlet


SEQ ID No: 38: Example of amino acid sequence of mNeonGreen fragment with N-terminus excised


SEQ ID No: 39: Example of amino acid sequence of fusion protein of Affimer that binds to actin and mNeonGreen


SEQ ID No: 40: Example of amino acid sequence of ALFA-tag


SEQ ID No: 41: Example of amino acid sequence of Affimer with ALFA-tag introduced


SEQ ID No: 42: Example of amino acid sequence of anti-ALFA-tag Nanobody


SEQ ID No: 43: Example of amino acid sequence of fusion protein of anti-ALFA-tag Nanobody and cp176-172mVenus

Claims
  • 1. A fusion protein of an antigen-binding protein and a fluorescent protein or a fluorescence-labeled tag, wherein the antigen-binding protein is an antigen-binding protein having a helix structure or a β-sheet structure at a terminus,the fluorescent protein or the fluorescence-labeled tag is a fluorescent protein or a fluorescence-labeled tag having a helix structure or a β-sheet structure at a terminus, andthe helix at the terminus and the helix at the terminus are linked, orthe β-sheet structure at the terminus and the β-sheet structure at the terminus are linked.
  • 2. The fusion protein according to claim 1, wherein the antigen-binding protein has a helix structure at the N-terminus,the fluorescent protein or the fluorescence-labeled tag has a helix structure at the C-terminus, andthe helix at the terminus of the antigen-binding protein and the helix at the terminus of the fluorescent protein or the fluorescence-labeled tag are linked.
  • 3. The fusion protein according to claim 2, wherein the antigen-binding protein is an Affimer.
  • 4. The fusion protein according to claim 2 or 3, wherein the fluorescent protein is a fluorescent protein having a helix structure at the C-terminus, selected from GFP and a fluorescent protein having a GFP-like β-barrel structure, and a circular permutated mutant of a point mutant.
  • 5. The fusion protein according to claim 1, wherein the antigen-binding protein has a β-sheet structure at the C-terminus,the fluorescent protein has a β-sheet structure at the N-terminus, andthe β-sheet structure at the C-terminus and the β-sheet structure at the N-terminus are linked.
  • 6. The fusion protein according to claim 5, wherein the antigen-binding protein is a Nanobody or a scFv.
  • 7. The fusion protein according to claim 5 or 6, wherein the fluorescent protein is a fluorescent protein having a β-sheet structure at the N-terminus, selected from the group consisting of GFP and a fluorescent protein having a GFP-like β-barrel structure, and a point mutant thereof, and a circular permutated mutant and a deletion mutant thereof.
  • 8. The fusion protein according to claim 2 or 3, wherein the fluorescence-labeled tag is a dehalogenation domain of haloalkane dehalogenase having an amino acid sequence corresponding to the amino acid sequence set forth in SEQ ID No: 9 or 29, having a helix structure at the C-terminus.
  • 9. The fusion protein according to claim 2 or 3, wherein the fluorescent protein is a phytochrome- or cyanobacteriochrome-based near-infrared fluorescent protein having a helix structure at the C-terminus.
  • 10. A composition comprising the fusion protein according to any one of claims 1 to 9.
  • 11. A nucleic acid encoding the fusion protein according to any one of claims 1 to 9.
  • 12. A gene expression vector for use in expressing a fusion protein in a cell, or a cell transformed with the vector, comprising the nucleic acid of claim 11,wherein the nucleic acid is operably linked to a regulatory sequence, ora cell introduced with a mRNA comprising the nucleic acid of claim 11.
  • 13. A method for observing an antigen, comprising observing an antigen bound to the fusion protein according to any one of claims 1 to 9.
  • 14. The method according to claim 13, wherein the antigen is an antigen in an aqueous solution or a cell, or on a cell surface.
  • 15. The method according to claim 13 or 14, comprising observing each of:a first antigen bound to a first fusion protein which is the fusion protein according to any one of claims 1 to 9; anda second antigen bound to a second fusion protein which is the fusion protein according to any one of claims 1 to 9, with the proviso that the first fusion protein and the second fusion protein have different fluorescence wavelengths.
  • 16. The method according to claim 15, comprising performing fluorescence correlation spectroscopy or fluorescence cross-correlation spectroscopy on the first fusion protein and the second fusion protein.
  • 17. The method according to claim 16, wherein the fluorescence correlation spectroscopy and the fluorescence cross-correlation spectroscopy are each fluorescence polarization correlation spectroscopy.
  • 18. A method for designing an amino acid sequence of a fusion protein, comprising: providing a first amino acid sequence which is an amino acid sequence of an antigen-binding protein and a second amino acid sequence which is an amino acid sequence of a fluorescent protein,wherein the N-terminus of the first amino acid sequence and the C-terminus of the second amino acid sequence or the C-terminus of the first amino acid sequence and the N-terminus of the second amino acid sequence are both helix structures or β-sheet structures, together form a helix structure upon excision of a part of the N-terminus or the C-terminus, or together form a β-sheet structure upon excision of a part of the N-terminus or the C-terminus; andlinking the N-terminus of the first amino acid sequence and the C-terminus of the second amino acid sequence or the C-terminus of the first amino acid sequence and the N-terminus of the second amino acid sequence, to obtain an amino acid sequence with the junction serving as a part of a helix structures or a β-sheet structures.
Priority Claims (1)
Number Date Country Kind
2019-034315 Feb 2019 JP national
PCT Information
Filing Document Filing Date Country Kind
PCT/JP2020/007568 2/26/2020 WO 00