MUTANT VSV ECTODOMAIN POLYPEPTIDE AND USES THEREOF

Abstract
The present invention relates to a mutant polypeptide comprising the amino acid sequence of the ectodomain of glycoprotein G of a vesicular stomatitis vims (VSV) strain, wherein said ectodomain comprises the amino acid sequence as set forth in SEQ ID NO: 2 or a sequence having at least 50% of identity with the amino acid sequence as set forth in SEQ ID NO: 2, with at least one substitution of an amino acid residue selected from the group consisting of: a) any amino acid residue located from position 421 to position 429 and any amino acid residue located from position 17 to position 25 of SEQ ID NO:2; or b) any amino acid residue located from a position equivalent to position 421 to a position equivalent to position 429 of SEQ ID NO: 2 and any amino acid residue located from a position equivalent to position 17 to a position equivalent to position 25 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2.
Description
TECHNICAL FIELD OF THE INVENTION

The present invention is in the field of viruses with improved properties, and more particularly of VSV viruses with improved properties, in particular for use in oncolytic cancer therapy, gene therapy, immunotherapy and selective delivery of a cargo to a chosen cell type, and relates to a mutant polypeptide comprising the amino acid sequence of the ectodomain of glycoprotein G of a Vesicular stomatitis virus (VSV) strain, wherein said ectodomain comprises the amino acid sequence as set forth in SEQ ID NO: 2 or a sequence having at least 50% of identity with the amino acid sequence as set forth in SEQ ID NO: 2, with at least one substitution of an amino acid residue selected from the group consisting of:

    • a) any amino acid residue located from position 421 to position 429 and any amino acid residue located from position 17 to position 25 of SEQ ID NO:2; or
    • b) any amino acid residue located from a position equivalent to position 421 to a position equivalent to position 429 of SEQ ID NO: 2 and any amino acid residue located from a position equivalent to position 17 to a position equivalent to position 25 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2.


BACKGROUND ART

Vesicular stomatitis virus (VSV) is an enveloped, negative strand RNA virus that belongs to the Vesiculovirus genus of the Rhabdovirus family. It is an arbovirus which can infect insects, cattle, horses, and pigs. In mammals, its ability to infect and kill tumor cells, while sparing normal cells makes it a promising oncolytic virus for the treatment of cancer (Barber. Oncogene 24, 7710-7719 (2005); Fernandez et al. J. Virol. 76, 895-904 (2002); Hastie et al. Virus Res. 176, 16-32 (2013)).


VSV genome consists of 11 kb single-stranded negative-sense RNA (FIG. 1A). It contains 5 genes encoding 5 structural proteins; one of them codes in particular for a transmembrane glycoprotein, the glycoprotein G (FIGS. 1A and 1B). Glycoprotein G plays a key role during the early stages of infection (Albertini et al. Viruses 4, 117-139 (2012)). First, by interacting with its cellular receptor (the LDL-R or members of the LDL-R family), it allows viral attachment. Then, after endocytosis of the virion, glycoprotein G undergoes a conformational change, induced by the acidification of the interior of the endosome, from a pre-fusion form to a post-fusion form. This structural transition catalyzes the fusion of the viral and endosomal membranes.


Glycoprotein G is the only protein responsible for VSV tropism. This tropism is wide due to the ubiquitous nature of LDL-R family members, allowing VSV to enter many cell types. This is why for example VSV glycoprotein G is used to pseudotype many retrovirals vectors. Nevertheless, despite this broad tropism, several cell types only have a low expression level or do not carry the VSV receptors (such as resting T cells, B cells, and CD341 cells) (Amirache. Blood 123, 1422-4 (2014))


Hence, there is a need for modified VSV viruses with an altered tropism. Such modified VSV viruses would infect cell types not carrying the natural VSV receptors.


In order to overcome this problem, the inventors sought to identify sites allowing the insertion of the sequence of a targeting moiety (such as a nanobody), specifically recognizing a given receptor, in the amino acid sequence of VSV glycoprotein G. In particular, this insertion should not interfere with the membrane fusion properties of glycoprotein G.


They analyzed several insertion sites and demonstrated that the amino-terminal site tolerated the insertion of a nanobody. The resulting first chimera (GNano described precisely in FIG. 2) was correctly folded and transported to the cell membrane. It had retained its fusion properties (which were nevertheless attenuated).


SUMMARY OF THE INVENTION

In the context of the invention, the inventors then built a recombinant VSV, in which the gene encoding the wild-type glycoprotein G was replaced by that encoding GNano (first chimera). The amplification of this initial recombinant VSV virus was however very difficult; and only very low infectious titers ˜1.8·106 pfu/ml were obtained after amplification in BSR cells after 24 hours. Such a low titer is not compatible with industrial production of a targeted VSV virus intended for therapy.


The inventors then performed several passages of this recombinant virus on BSR cells to select mutations improving its amplification. After 10 passages, the sequencing of the genomes of the viral population showed the progressive invasion of this population by a variant containing two mutations in the nucleotide sequence of the glycoprotein, resulting in the change of the histidine residue in position 22 into asparagine (H22N substitution) and of the serine 422 residue in position into isoleucine (S422I substitution). The titer of the recombinant virus obtained after this optimization was around ˜108 pfu/ml.


Surprisingly, the inventors thus found that two mutations (H22N and S422I), in combination, allowed the recombinant virus (GNano H22N S422I, or improved chimera, see FIG. 2) to be produced at ˜108 pfu/ml after amplification in BSR cells for 24 hours (see Example 1).


Moreover, the inventors also found that substitution S422I was able to rescue G fusion properties, which were completely abolished by substitution H407A. In this context, they demonstrated that S422I facilitates G folding and stabilizes G prefusion form by showing that it improves LDL-R CR domains recognition (the structural transition toward the postfusion state disrupts the CR domain binding site).


In the context of GNano H22N, they also demonstrated that substitutions S422F, S422M, S422L or S422V have a similar phenotype as S422I.


The inventors also verified that, in the presence of H22N substitution and/or S422I, S422F, S422M, S422L or S422V substitution, a recombinant VSV expressing a fusion polypeptide consisting of VSV Indiana G signal peptide, a dipeptide linker, an anti-GFP nanobody, a GGGGSGGGGS (SEQ ID NO:81) linker and VSV Indiana G ectodomain with one of substitutions S422I, S422F, S422M, S422L and S422V, was able to form syncytia when exposed to low pH, between 5.5 and 6.3, similarly to wild-type VSV Indiana. This also suggests that the presence of a hydrophobic residue in position 422 is key in the regulation of the pH-dependent structural transition, and facilitates G folding and stabilizes the prefusion form of a fusion polypeptide of G with a nanobody inserted N-terminal of the ectodomain (see Example 3). This is also confirmed by the crystal structure of VSV G ectodomain, solved by the inventors. Indeed, the inventors surprisingly showed that the C-terminal part (from residue 407 to residue 429, more particularly the region located from residue 421 to residue 429, even more particularly the region from residue 421 to residue 425) largely interacts with the fusion domain and therefore contributes to the stabilization of the pre-fusion complex. The crystal structure further revealed that compensatory substitution of S422 residue with a hydrophobic residue (such as I, F, M, L or V) stabilizes the beta-hairpin structure (and thus the pre-fusion state) through hydrophobic interactions (see Example 4).


In a first aspect, the present invention thus relates to a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence of the ectodomain of glycoprotein G of a vesicular stomatitis virus (VSV) strain, wherein said ectodomain comprises, or consists essentially of, or consists of, the amino acid sequence as set forth in SEQ ID NO: 2 or a sequence having at least 50% of identity with the amino acid sequence as set forth in SEQ ID NO: 2, with at least one substitution of an amino acid residue selected from the group consisting of:

    • a) any amino acid residue located from position 421 to position 429 and any amino acid residue located from position 17 to position 25 of SEQ ID NO:2, preferably any amino acid residue located from position 421 to position 425 and any amino acid residue located from position 17 to position 25 of SEQ ID NO:2, more preferably any amino acid residue located in positions 422 and 22 of SEQ ID NO:2; or
    • b) any amino acid residue located from a position equivalent to position 421 to a position equivalent to position 429 of SEQ ID NO: 2 and any amino acid residue located from a position equivalent to position 17 to a position equivalent to position 25 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2, preferably any amino acid residue located from a position equivalent to position 421 to a position equivalent to position 425 of SEQ ID NO: 2 and any amino acid residue located from a position equivalent to position 17 to a position equivalent to position 25 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2, more preferably any amino acid residue located in positions equivalent to positions 422 and 22 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2.


The present invention also relates to fusion polypeptide comprising, or consisting essentially of, or consisting of, the mutant polypeptide according to the invention, and an additional peptide, polypeptide or protein, wherein said additional peptide, polypeptide or protein is inserted:

    • a) in N-terminal of the mutant polypeptide,
    • b) between any two consecutive amino acids located in regions corresponding to positions 192 to 202 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2,
    • c) between any two consecutive amino acids located in regions corresponding to positions 240 to 257 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2,
    • d) between any two consecutive amino acids located in regions corresponding to positions 347 to 353 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2,
    • e) between any two consecutive amino acids located in regions corresponding to positions 364 to 366 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2,
    • f) between any two consecutive amino acids located in regions corresponding to positions 376 to 379 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2,


      preferably said additional peptide, polypeptide or protein is inserted in N-terminal of the mutant polypeptide or between the amino acids at positions 351 and 352 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2.


The present invention also relates to a nucleic acid molecule encoding the mutant polypeptide or the fusion polypeptide according to the invention.


The present invention also relates to a vector comprising at least one nucleic acid according to the invention or expressing the mutant polypeptide or the fusion polypeptide according to the invention.


The present invention also relates to a host cell containing or expressing the mutant polypeptide or the fusion polypeptide according to the invention, or containing the nucleic acid molecule according to the invention, or containing the vector (in particular a VSV vector) according to the invention.


The present invention also relates to a composition comprising, or consisting essentially of, or consisting of, the mutant polypeptide or the fusion polypeptide according to the invention, the nucleic acid molecule according to the invention, the vector according to the invention, the host cell according to the invention, or any combination thereof.


The present invention also relates to therapeutic uses of the mutant polypeptide or the fusion polypeptide according to the invention, the nucleic acid molecule according to the invention, the vector according to the invention, the host cell according to the invention, the composition according to the invention, or any combination thereof.


The present invention also relates to the mutant polypeptide according to the invention or the fusion polypeptide according to the invention for use for targeting a lipid membrane in a subject to a specific target, for instance a cell, in particular a cell to be killed, such as a cancer cell, wherein said mutant polypeptide or fusion polypeptide is anchored in said lipid membrane.


The present invention also relates to an in vitro use of the mutant polypeptide according to the invention or the fusion polypeptide according to the invention for targeting a lipid membrane to a specific target, for instance a cell, in particular a cell to be killed, such as a cancer cell, wherein said mutant polypeptide or fusion polypeptide is anchored in said lipid membrane.





DESCRIPTION OF THE FIGURES


FIG. 1. A. VSV genome is a single RNA molecule of negative polarity; it encodes 5 structural proteins N (nucleoprotein), P (phosphoprotein), M (matrix protein), G (glycoprotein) and L (Large RNA-dependent RNA polymerase). B. VSV is an enveloped virus, glycoprotein G is the only protein external to the virus. It is involved in the steps of virus entry (receptor interaction and membrane fusion).



FIG. 2. Schematic representation of first chimera (GNano) and improved chimera (GNano H22N S422I). The mutations that have been selected are indicated. (SP: signal peptide of VSV glycoprotein G, Nano: nanobody, G: glycoprotein G of VSV). The additional inserts (dipeptide QF and the linker which has the sequence GGGGSGGGGS (SEQ ID NO:81)) are also indicated.



FIG. 3A. Optimal global alignment of the amino acid sequences of the ectodomain of glycoprotein G of VSV strains Indiana (SEQ ID NO:2) and Maraba (SEQ ID NO:4) using Emboss Needle software with default parameters. Positions 17-25 and 421-429 of SEQ ID NO:2 and equivalent positions of SEQ ID NO:4 (also positions 17-25 and 421-429) are boxed. The preferred substitution positions (positions 22 and 422 of both SEQ ID NO:2 and SEQ ID NO:4) are further boxed. The default parameters used are indicated. The obtained identity percentage (“Identity”), similarity percentage (“Similarity”), and similarity score (“Score”) are indicated.



FIG. 3B. Optimal global alignment of the amino acid sequences of the ectodomain of glycoprotein G of VSV strains Indiana (SEQ ID NO:2) and Cocal (SEQ ID NO:6) using Emboss Needle software with default parameters. Positions 17-25 and 421-429 of SEQ ID NO:2 and equivalent positions of SEQ ID NO:6 (also positions 17-25 and 421-429) are boxed. The preferred substitution positions (positions 22 and 422 of both SEQ ID NO:2 and SEQ ID NO:6) are further boxed. The default parameters used are indicated. The obtained identity percentage (“Identity”), similarity percentage (“Similarity”), and similarity score (“Score”) are indicated.



FIG. 3C. Optimal global alignment of the amino acid sequences of the ectodomain of glycoprotein G of VSV strains Indiana (SEQ ID NO:2) and Morreton (SEQ ID NO:8) using Emboss Needle software with default parameters. Positions 17-25 and 421-429 of SEQ ID NO:2 and equivalent positions of SEQ ID NO:8 (also positions 17-25 and 421-429) are boxed. The preferred substitution positions (positions 22 and 422 of both SEQ ID NO:2 and SEQ ID NO:8) are further boxed. The default parameters used are indicated. The obtained identity percentage (“Identity”), similarity percentage (“Similarity”), and similarity score (“Score”) are indicated.



FIG. 3D. Optimal global alignment of the amino acid sequences of the ectodomain of glycoprotein G of VSV strains Indiana (SEQ ID NO:2) and Alagoa (SEQ ID NO:10) using Emboss Needle software with default parameters. Positions 17-25 and 421-429 of SEQ ID NO:2 and equivalent positions of SEQ ID NO:10 (also positions 17-25 and 421-429) are boxed. The preferred substitution positions (positions 22 and 422 of both SEQ ID NO:2 and SEQ ID NO:10) are further boxed. The default parameters used are indicated. The obtained identity percentage (“Identity”), similarity percentage (“Similarity”), and similarity score (“Score”) are indicated.



FIG. 3E. Optimal global alignment of the amino acid sequences of the ectodomain of glycoprotein G of VSV strains Indiana (SEQ ID NO:2) and New Jersey (SEQ ID NO:12) using Emboss Needle software with default parameters. Positions 17-25 and 421-429 of SEQ ID NO:2 and equivalent positions of SEQ ID NO:12 (positions 17-25 and 428-436) are boxed. The preferred substitution positions (positions 22 and 422 of SEQ ID NO:2 and positions 22 and 429 of SEQ ID NO:12) are further boxed. The default parameters used are indicated. The obtained identity percentage (“Identity”), similarity percentage (“Similarity”), and similarity score (“Score”) are indicated.



FIG. 3F. Optimal global alignment of the amino acid sequences of the ectodomains of glycoprotein G of VSV strains Indiana (SEQ ID NO:2) and Carajas (SEQ ID NO:14) using Emboss Needle software with default parameters. Positions 17-25 and 421-429 of SEQ ID NO:2 and equivalent positions of SEQ ID NO:14 (positions 17-25 and 425-433) are boxed. The preferred substitution positions (positions 22 and 422 of SEQ ID NO:2 and positions 22 and 426 of SEQ ID NO:14) are further boxed. The default parameters used are indicated. The obtained identity percentage (“Identity”), similarity percentage (“Similarity”), and similarity score (“Score”) are indicated.



FIG. 4. Multiple sequence alignment of amino acid sequences of the ectodomains of glycoprotein G of VSV strains Indiana (SEQ ID NO:2), Maraba (SEQ ID NO:4), Cocal (SEQ ID NO:6), Morreton (SEQ ID NO:8), Alagoa (SEQ ID NO:10), New Jersey (SEQ ID NO:12), and Carajas (SEQ ID NO:14) generated using clustal omega (https://www.ebi.ac.uk/Tools/msa/clustalo/) and espript (http://espript.ibcp.fr/ESPript/ESPript/esp_userguide.php). Strictly Conserved amino acids (identical for all strains) are written in white on grey background. Boxed areas correspond to highly similar residues. The consensus sequence (similarity score >0.7) indicated at the bottom was generated using criteria from MultAlin: uppercase is identity, lowercase is consensus level >0.5, ! is anyone of IV, $ is anyone of LM, % is anyone of FY, # is anyone of NDQEBZ.



FIG. 5. Histidine cluster in the prefusion protomer of VSV Indiana G. The mutation of Histidine 407 into an alanine abolishes G fusion properties.



FIG. 6. Top: Flow cytometry analysis of the expression of WT and mutant glycoproteins at the surface of HEK293T cells and of the binding of fluorescent GST-CR2 (CR2 with an amino-terminal GST fusion). After 24 h of transfection, cell surface expression of WT and mutant G was assessed using monoclonal anti-G antibody 8G5F11 directly on living cells at 4° C. during 1 h. Cells were then incubated simultaneously with anti-mouse Alexa Fluor 488 and the indicated GST-CR2ATTO550. Cells transfected with a G construct that was still able to bind GST-CR proteins exhibited red fluorescence due the ATTO550 dye. Mutant R354A is a mutant unable to bind CR domains.


Bottom: Flow cytometry analysis of surface expression of mutant glycoproteins G-H407A and G.H407A/S422I. The surface G protein expression was detected with monoclonal antibody 8G5F11 (KeraFAST) and a goat anti-mouse IgG secondary antibody conjugated to Alexa Fluor 488. A total of 10,000 cells were counted by flow cytometry. In each frame, the black curve corresponds to the fluorescence of untransfected cells, the blue curve corresponds to the fluorescence of cells transfected by pCAGGS plasmids encoding WT G, and the red curve corresponds to the fluorescence of cells transfected by pCAGGS plasmids encoding the indicated mutant. In each frame, the surface expression of mutant G is expressed as a percentage of that of WT G.



FIG. 7. Cell-cell fusion assays. Fusion activity of VSV G Indiana WT and VSV GNano after optimization (ie. Mutation of residue H22 to N and residue S422 to I, F, M, L, or V).



FIG. 8. Crystal structure of VSV G1-440 (residues 1 to 440 of VSV G Indiana WT) and environment of residue S422.





(A) Crystalline structure of the pre-fusion trimer of G1-440. The chain is traced up to residue 432. Residues 410 to 432 are in dark grey.


(B) Structure of segment 410-432. The two close-up detail the organization of segment 410-432 and its interactions with the fusion domain.


(C) Close-up view of residue S422 in G pre-fusion state.


DETAILED DESCRIPTION OF THE INVENTION

In the context of the present invention, the inventors surprisingly found that, while a recombinant VSV virus comprising a nanobody inserted between the signal peptide and the ectodomain of VSV Indiana glycoprotein G retained its fusion properties and infectivity, it was very difficult to amplify, not permitting an industrial use thereof. They further surprisingly found that insertion of at least one substitution in position 22 or position 422 of the ectodomain of VSV Indiana glycoprotein G, while retaining fusion properties and infectivity, permitted to dramatically increase the amplification of the recombinant virus, thus permitting an industrial use thereof (see Example 1). They also obtained results showing that the presence of a hydrophobic residue in position 422 is key in the regulation of the pH-dependent structural transition, and facilitates G folding and stabilizes the prefusion form of a fusion polypeptide of G with a nanobody inserted N-terminal of the ectodomain (see Examples 2, 3 and 4).


Mutant Polypeptide with Improved Production


The present invention thus firstly relates to a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence of the ectodomain of glycoprotein G of a Vesicular stomatitis virus (VSV) strain, wherein said ectodomain comprises, or consists essentially of, or consists of, the amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least 50% of identity with the amino acid sequence as set forth in SEQ ID NO: 2, with at least one substitution of an amino acid residue selected from the group consisting of:

    • a) any amino acid residue located from position 421 to position 429 and any amino acid residue located from position 17 to position 25 of SEQ ID NO:2, preferably any amino acid residue located from position 421 to position 425 and any amino acid residue located from position 17 to position 25 of SEQ ID NO:2, more preferably any amino acid residue located in positions 422 and 22 of SEQ ID NO:2; or
    • b) any amino acid residue located from a position equivalent to position 421 to a position equivalent to position 429 of SEQ ID NO: 2 and any amino acid residue located from a position equivalent to position 17 to a position equivalent to position 25 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2, preferably any amino acid residue located from a position equivalent to position 421 to a position equivalent to position 425 of SEQ ID NO: 2 and any amino acid residue located from a position equivalent to position 17 to a position equivalent to position 25 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2, more preferably any amino acid residue located in positions equivalent to positions 422 and 22 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2.


In a preferred embodiment, the mutant polypeptide of the invention is an isolated and/or non-naturally occurring polypeptide.


Reference Sequences and Positions for Substitution


The mutant polypeptide is based on the amino acid sequence of the ectodomain of glycoprotein G of a Vesicular stomatitis virus (VSV) strain, which corresponds to the amino acid sequence of glycoprotein G without the signal peptide. In the present description, the reference VSV strain is the Indiana strain. The amino acid sequence of glycoprotein G of VSV Indiana strain is as set forth in SEQ ID NO:1. The amino acid sequence of the ectodomain of glycoprotein G of VSV Indiana strain is as set forth in SEQ ID NO:2, and corresponds to amino acids 17-511 of SEQ ID NO:1, amino acids 1-16 of SEQ ID NO:1 corresponding to the signal peptide. In a preferred embodiment, the mutant polypeptide according to the invention is derived from the ectodomain of glycoprotein G of VSV Indiana strain and thus comprises, or consists essentially of, or consists of, the amino acid sequence as set forth in SEQ ID NO: 2, with at least one substitution of an amino acid residue selected from the group consisting of any amino acid residue located from position 421 to position 429 and any amino acid residue located from position 17 to position 25 of SEQ ID NO:2, preferably any amino acid residue located from position 421 to position 425 and any amino acid residue located from position 17 to position 25 of SEQ ID NO:2, more preferably any amino acid residue located in positions 422 and 22 of SEQ ID NO:2. However, the mutant polypeptide according to the invention may also be derived from the ectodomain of glycoprotein G of a VSV strain other than Indiana or may comprise further mutations, provided that it comprises, or consists essentially of, or consists of, an amino acid sequence having at least 50% of identity, preferably at least 55% of identity, at least 60% of identity, at least 65% of identity, more preferably at least 70% of identity, at least 75% of identity, at least 78% of identity, at least 79% of identity, at least 80% of identity, at least 81% of identity, at least 82% of identity, at least 83% of identity, at least 84% of identity, at least 85% of identity, at least 86% of identity, at least 87% of identity, at least 88% of identity, at least 89% of identity, at least 90% of identity, at least 91% of identity, at least 92% of identity, at least 93% of identity, at least 94% of identity, at least 95% of identity, at least 96% of identity, at least 97% of identity, at least 98% of identity, or even at least 99% of identity with the amino acid sequence as set forth in SEQ ID NO: 2, with at least one substitution of an amino acid residue selected from the group consisting of any amino acid residue located from a position equivalent to position 421 to a position equivalent to position 429 of SEQ ID NO: 2 and any amino acid residue located from a position equivalent to position 17 to a position equivalent to position 25 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2, preferably any amino acid residue located from a position equivalent to position 421 to a position equivalent to position 425 of SEQ ID NO: 2 and any amino acid residue located from a position equivalent to position 17 to a position equivalent to position 25 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2, more preferably any amino acid residue located in positions equivalent to positions 422 and 22 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2. In this case, an optimal global alignment of SEQ ID NO:2 and of the initial amino acid sequence (for instance amino acid sequence of the ectodomain of glycoprotein G of a VSV strain other than Indiana, provided that this sequence has at least 50% of identity or any other preferred minimal percentage of identity defined above with SEQ ID NO:2 after optimal global alignment) is made, and the initial amino acid sequence has at least one substitution of an amino acid residue selected from the group consisting of any amino acid residue located from a position equivalent to position 421 to a position equivalent to position 429 of SEQ ID NO: 2 and any amino acid residue located from a position equivalent to position 17 to a position equivalent to position 25 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2, preferably any amino acid residue located from a position equivalent to position 421 to a position equivalent to position 425 of SEQ ID NO: 2 and any amino acid residue located from a position equivalent to position 17 to a position equivalent to position 25 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2, more preferably any amino acid residue located in positions equivalent to positions 422 and 22 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2.


The percent identities referred to in the context of the disclosure of the present invention are determined on the basis of an optimal global alignment of sequences to be compared, i.e., on an optimal alignment of the sequences taken in their entirety over their entire length using any algorithm well-known to a person skilled in the art, such as the algorithm of Needleman and Wunsch (1970). This sequence comparison may be performed using any software well-known to a person skilled in the art, for example the Emboss Needle software. The Emboss Needle software is for example available at https://www.ebi.ac.uk/Tools/psa/emboss_needle/. This software reads two input sequences and writes their optimal global sequence alignment to file. It uses the Needleman-Wunsch alignment algorithm to find the optimum alignment (including gaps) of two sequences along their entire length. The algorithm uses a dynamic programming method to ensure the alignment is optimum, by exploring all possible alignments and choosing the best. A scoring matrix is read that contains values for every possible residue or nucleotide match. Needle finds the alignment with the maximum possible score where the score of an alignment is equal to the sum of the matches taken from the scoring matrix, minus penalties arising from opening and extending gaps in the aligned sequences. The substitution matrix and gap opening and extension penalties are user-specified. In the context of the invention, in order to obtain an optimal global alignment, the Emboss Needle software may be used with default parameters, i.e. using the “Gap open” parameter equal to 10.0, the “Gap extend” parameter equal to 0.5, the “End gap penalty” parameter to “false”, the “End gap open” parameter to 10.0, and a “Blosum 62” matrix. When entering two amino acid sequences, the Emboss Needle software returns an optimal global alignment, as well as several values characterizing the alignment:

    • “Identity” is the percentage of identity, i.e. the percentage of identical matches between the two sequences over the reported aligned region (including any gaps in the length),
    • “Similarity” is the percentage of similarity, i.e. the percentage of matches between the two sequences over the reported aligned region (including any gaps in the length), and
    • “Score” is the total score of the alignment, corresponding to the best score obtained by the software for all tested global alignments (i.e. over the entire length of both sequences).
    • This score may be referred to as a “similarity score”, since the higher is the value of the score between SEQ ID NO:2 and another initial sequence, the higher is the similarity between this initial sequence and SEQ ID NO:2, taking into account not only aligned identical amino acids, but also conservative substitutions that are not expected to significantly alter the function of the protein.


Once an optimal global alignment has been obtained between SEQ ID NO:2 and another initial sequence, the percentage of identity and position equivalent to specific position(s) of SEQ ID NO:2 are defined. In particular, a position equivalent to position N of SEQ ID NO:2 is the position of the other initial sequence that is aligned with position N of SEQ ID NO:2.


Examples of VSV strains other than Indiana also include 7 serotypes such as Maraba, Cocal, Morreton, Alagoa, New Jersey., and Carajas strains. The non-mutated (wild-type) amino acid sequences of the full-length glycoprotein G and of the ectodomain of glycoprotein G of these strains and Indiana strain are presented in Table 1 below:









TABLE 1







non-mutated (wild-type) amino acid sequences of the full-length


glycoprotein G and of the ectodomain of glycoprotein G of Indiana,


Maraba, Cocal, Morreton, Alagoa, New Jersey, and Carajas VSV strains.


In the full-length amino acid sequences, the signal peptide is


underlined. Positions of the signal peptide in the full-length


amino acid sequences is further indicated.









VSV
Full-length glycoprotein G
Ectodomain of glycoprotein G


strain
amino acid sequence
amino acid sequence





Indiana

MKCLLYLAFLFIGVNCKFTIVFPHNQKGNWK

KFTIVFPHNQKGNWKNVPSNYHYCPSSSDLN



NVPSNYHYCPSSSDLNWHNDLIGTAIQVKMP
WHNDLIGTAIQVKMPKSHKAIQADGWMCHA



KSHKAIQADGWMCHASKWVTTCDFRWYGP
SKWVTTCDFRWYGPKYITQSIRSFTPSVEQC



KYITQSIRSFTPSVEQCKESIEQTKQGTWLNP
KESIEQTKQGTWLNPGFPPQSCGYATVTDAE



GFPPQSCGYATVTDAEAVIVQVTPHHVLVDE
AVIVQVTPHHVLVDEYTGEWVDSQFINGKCS



YTGEWVDSQFINGKCSNYICPTVHNSTTWHS
NYICPTVHNSTTWHSDYKVKGLCDSNLISMDI



DYKVKGLCDSNLISMDITFFSEDGELSSLGKE
TFFSEDGELSSLGKEGTGFRSNYFAYETGGK



GTGFRSNYFAYETGGKACKMQYCKHWGVR
ACKMQYCKHWGVRLPSGVWFEMADKDLFA



LPSGVWFEMADKDLFAAARFPECPEGSSISAP
AARFPECPEGSSISAPSQTSVDVSLIQDVERIL



SQTSVDVSLIQDVERILDYSLCQETWSKIRAGL
DYSLCQETWSKIRAGLPISPVDLSYLAPKNPG



PISPVDLSYLAPKNPGTGPAFTIINGTLKYFET
TGPAFTIINGTLKYFETRYIRVDIAAPILSRMV



RYIRVDIAAPILSRMVGMISGTTTERELWDDW
GMISGTTTERELWDDWAPYEDVEIGPNGVLR



APYEDVEIGPNGVLRTSSGYKFPLYMIGHGM
TSSGYKFPLYMIGHGMLDSDLHLSSKAQVFE



LDSDLHLSSKAQVFEHPHIQDAASQLPDDESL
HPHIQDAASQLPDDESLFFGDTGLSKNPIELV



FFGDTGLSKNPIELVEGWFSSWKSSIASFFFII
EGWFSSWKSSIASFFFIIGLIIGLFLVLRVGIHL



GLIIGLFLVLRVGIHLCIKLKHTKKRQIYTDIEM
CIKLKHTKKRQIYTDIEMNRLGK



NRLGK (SEQ ID NO: 1)
(SEQ ID NO: 2)



Positions of signal peptide: 1-16






Maraba

MLRLFLFCFLALGAHSKFTIVFPHHQKGNWK

KFTIVFPHHQKGNWKNVPSTYHYCPSSSDQN



NVPSTYHYCPSSSDQNWHNDLTGVSLHVKIP
WHNDLTGVSLHVKIPKSHKAIQADGWMCHA



KSHKAIQADGWMCHAAKWVTTCDFRWYGP
AKWVTTCDFRWYGPKYITHSIHSMSPTLEQC



KYITHSIHSMSPTLEQCKTSIEQTKQGVWINP
KTSIEQTKQGVWINPGFPPQSCGYATVTDAE



GFPPQSCGYATVTDAEVVVVQATPHHVLVD
VVVVQATPHHVLVDEYTGEWIDSQLVGGKC



EYTGEWIDSQLVGGKCSKEVCQTVHNSTVW
SKEVCQTVHNSTVWHADYKITGLCESNLASV



HADYKITGLCESNLASVDITFFSEDGQKTSLG
DITFFSEDGQKTSLGKPNTGFRSNHFAYESGE



KPNTGFRSNHFAYESGEKACRMQYCTQWGI
KACRMQYCTQWGIRLPSGVWFELVDKDLFQ



RLPSGVWFELVDKDLFQAAKLPECPRGSSISA
AAKLPECPRGSSISAPSQTSVDVSLIQDVERIL



PSQTSVDVSLIQDVERILDYSLCQETWSKIRAK
DYSLCQETWSKIRAKLPVSPVDLSYLAPKNPG



LPVSPVDLSYLAPKNPGSGPAFTIINGTLKYFE
SGPAFTIINGTLKYFETRYIRVDISNPIIPHMVG



TRYIRVDISNPIIPHMVGTMSGTTTERELWND
TMSGTTTERELWNDWYPYEDVEIGPNGVLK



WYPYEDVEIGPNGVLKTPTGFKFPLYMIGHG
TPTGFKFPLYMIGHGMLDSDLHKSSQAQVFE



MLDSDLHKSSQAQVFEHPHAKDAASQLPDDE
HPHAKDAASQLPDDETLFFGDTGLSKNPVEL



TLFFGDTGLSKNPVELVEGWFSSWKSTLASF
VEGWFSSWKSTLASFFLIIGLGVALIFIIRIIVAI



FLIIGLGVALIFIIRIIVAIRYKYKGRKTQKIYND
RYKYKGRKTQKIYNDVEMSRLGNK



VEMSRLGNK (SEQ ID NO: 3)
(SEQ ID NO: 4)



Positions of signal peptide: 1-16






Cocal

MNFLLLTFIVLPLCSHAKFSIVFPQSQKGNWK

KFSIVFPQSQKGNWKNVPSSYHYCPSSSDQN



NVPSSYHYCPSSSDQNWHNDLLGITMKVKMP
WHNDLLGITMKVKMPKTHKAIQADGWMCH



KTHKAIQADGWMCHAAKWITTCDFRWYGP
AAKWITTCDFRWYGPKYITHSIHSIQPTSEQC



KYITHSIHSIQPTSEQCKESIKQTKQGTWMSP
KESIKQTKQGTWMSPGFPPQNCGYATVTDS



GFPPQNCGYATVTDSVAVVVQATPHHVLVD
VAVVVQATPHHVLVDEYTGEWIDSQFPNGK



EYTGEWIDSQFPNGKCETEECETVHNSTVW
CETEECETVHNSTVWYSDYKVTGLCDATLVD



YSDYKVTGLCDATLVDTEITFFSEDGKKESIG
TEITFFSEDGKKESIGKPNTGYRSNYFAYEKG



KPNTGYRSNYFAYEKGDKVCKMNYCKHAGV
DKVCKMNYCKHAGVRLPSGVWFEFVDQDVY



RLPSGVWFEFVDQDVYAAAKLPECPVGATIS
AAAKLPECPVGATISAPTQTSVDVSLILDVERI



APTQTSVDVSLILDVERILDYSLCQETWSKIRS
LDYSLCQETWSKIRSKQPVSPVDLSYLAPKNP



KQPVSPVDLSYLAPKNPGTGPAFTIINGTLKY
GTGPAFTIINGTLKYFETRYIRIDIDNPIISKMV



FETRYIRIDIDNPIISKMVGKISGSQTERELWTE
GKISGSQTERELWTEWFPYEGVEIGPNGILKT



WFPYEGVEIGPNGILKTPTGYKFPLFMIGHG
PTGYKFPLFMIGHGMLDSDLHKTSQAEVFEH



MLDSDLHKTSQAEVFEHPHLAEAPKQLPEEE
PHLAEAPKQLPEEETLFFGDTGISKNPVELIE



TLFFGDTGISKNPVELIEGWFSSWKSTVVTFF
GWFSSWKSTVVTFFFAIGVFILLYVVARIVIAV



FAIGVFILLYVVARIVIAVRYRYQGSNNKRIYN
RYRYQGSNNKRIYNDIEMSRFRK



DIEMSRFRK (SEQ ID NO: 5)
(SEQ ID NO: 6)



Positions of signal peptide: 1-17






Morreton

MLVLYLLLSLLALGAQCKFTIVFPHNQKGNW

KFTIVFPHNQKGNWKNVPANYQYCPSSSDLN



KNVPANYQYCPSSSDLNWHNGLIGTSLQVKM
WHNGLIGTSLQVKMPKSHKAIQADGWMCHA



PKSHKAIQADGWMCHAAKWVTTCDFRWYG
AKWVTTCDFRWYGPKYVTHSIKSMIPTVDQC



PKYVTHSIKSMIPTVDQCKESIAQTKQGTWLN
KESIAQTKQGTWLNPGFPPQSCGYASVTDAE



PGFPPQSCGYASVTDAEAVIVKATPHQVLVD
AVIVKATPHQVLVDEYTGEWVDSQFPTGKC



EYTGEWVDSQFPTGKCNKDICPTVHNSTTW
NKDICPTVHNSTTWHSDYKVTGLCDANLISM



HSDYKVTGLCDANLISMDITFFSEDGKLTSLG
DITFFSEDGKLTSLGKEGTGFRSNYFAYENGD



KEGTGFRSNYFAYENGDKACRMQYCKHWG
KACRMQYCKHWGVRLPSGVWFEMADKDIYN



VRLPSGVWFEMADKDIYNDAKFPDCPEGSSIA
DAKFPDCPEGSSIAAPSQTSVDVSLIQDVERIL



APSQTSVDVSLIQDVERILDYSLCQETWSKIRA
DYSLCQETWSKIRAHLPISPVDLSYLSPKNPG



HLPISPVDLSYLSPKNPGTGPAFTIINGTLKYF
TGPAFTIINGTLKYFETRYIRVDIAGPIIPQMR



ETRYIRVDIAGPIIPQMRGVISGTTTERELWTD
GVISGTTTERELWTDWYPYEDVEIGPNGVLK



WYPYEDVEIGPNGVLKTATGYKFPLYMIGHG
TATGYKFPLYMIGHGMLDSDLHISSKAQVFEH



MLDSDLHISSKAQVFEHPHIQDAASQLPDDET
PHIQDAASQLPDDETLFFGDTGLSKNPIELVE



LFFGDTGLSKNPIELVEGWFSGWKSTIASFFF
GWFSGWKSTIASFFFIIGLVIGLYLVLRIGIALC



IIGLVIGLYLVLRIGIALCIKCRVQEKRPKIYTD
IKCRVQEKRPKIYTDVEMNRLDR



VEMNRLDR (SEQ ID NO: 7)
(SEQ ID NO: 8)



Positions of signal peptide: 1-17






Alagoa

MTPAFILCMLLAGSSWAKFTIVFPQSQKGDW

KFTIVFPQSQKGDWKDVPPNYRYCPSSADQN



KDVPPNYRYCPSSADQNWHGDLLGVNIRAK
WHGDLLGVNIRAKMPKVHKAIKADGWMCHA



MPKVHKAIKADGWMCHAAKWVTTCDYRWY
AKWVTTCDYRWYGPQYITHSIHSFIPTKAQC



GPQYITHSIHSFIPTKAQCEESIKQTKEGVWIN
EESIKQTKEGVWINPGFPPKNCGYASVSDAES



PGFPPKNCGYASVSDAESIIVQATAHSVMIDE
IIVQATAHSVMIDEYSGDWLDSQFPTGRCTGS



YSGDWLDSQFPTGRCTGSTCETIHNSTLWYA
TCETIHNSTLWYADYQVTGLCDSALVSTEVT



DYQVTGLCDSALVSTEVTFYSEDGLMTSIGR
FYSEDGLMTSIGRQNTGYRSNYFPYEKGAAA



QNTGYRSNYFPYEKGAAACRMKYCTHEGIRL
CRMKYCTHEGIRLPSGVWFEMVDKELLESVQ



PSGVWFEMVDKELLESVQMPECPAGLTISAP
MPECPAGLTISAPTQTSVDVSLILDVERMLDY



TQTSVDVSLILDVERMLDYSLCQETWSKVHS
SLCQETWSKVHSGLPISPVDLGYIAPKNPGAG



GLPISPVDLGYIAPKNPGAGPAFTIVNGTLKY
PAFTIVNGTLKYFDTRYLRIDIEGPVLKKMTG



FDTRYLRIDIEGPVLKKMTGKVSGTPTKREL
KVSGTPTKRELWTEWFPYDDVEIGPNGVLKT



WTEWFPYDDVEIGPNGVLKTPEGYKFPLYMI
PEGYKFPLYMIGHGLLDSDLQKTSQAEVFHH



GHGLLDSDLQKTSQAEVFHHPQIAEAVQKLP
PQIAEAVQKLPDDETLFFGDTGISKNPVEVIE



DDETLFFGDTGISKNPVEVIEGWFSNWRSSV
GWFSNWRSSVMAIVFAILLLVITVLMVRLCVA



MAIVFAILLLVITVLMVRLCVAFRHFCCQKRH
FRHFCCQKRHKIYNDLEMNQLRR



KIYNDLEMNQLRR (SEQ ID NO: 9)
(SEQ ID NO: 10)



Positions of signal peptide: 1-17






New

MLSYLIFALVVSPILGKIEIVFPQHTTGDWKRV

KIEIVFPQHTTGDWKRVPHEYNYCPTSADKN


Jersey
PHEYNYCPTSADKNSHGTQTGIPVELTMPKG
SHGTQTGIPVELTMPKGLTTHQVDGFMCHS



LTTHQVDGFMCHSALWMTTCDFRWYGPKYI
ALWMTTCDFRWYGPKYITHSIHNEEPTDYQC



THSIHNEEPTDYQCLEAIKAYKDGVSFNPGFP
LEAIKAYKDGVSFNPGFPPQSCGYGTVTDAE



PQSCGYGTVTDAEAHIVTVTPHSVKVDEYTG
AHIVTVTPHSVKVDEYTGEWIDPHFIGGRCK



EWIDPHFIGGRCKGQICETVHNSTKWFTSSD
GQICETVHNSTKWFTSSDGESVCSQLFTLVG



GESVCSQLFTLVGGTFFSDSEEITSMGLPETG
GTFFSDSEEITSMGLPETGIRSNYFPYVSTEGI



IRSNYFPYVSTEGICKMPFCRKPGYKLKNDLW
CKMPFCRKPGYKLKNDLWFQITDPDLDKTVR



FQITDPDLDKTVRDLPHIKDCDLSSSIVTPGEH
DLPHIKDCDLSSSIVTPGEHATDISLISDVERIL



ATDISLISDVERILDYALCQNTWSKIEAGEPITP
DYALCQNTWSKIEAGEPITPVDLSYLGPKNP



VDLSYLGPKNPGAGPVFTIINGSLHYFMSKYL
GAGPVFTIINGSLHYFMSKYLRVELESPVIPRM



RVELESPVIPRMEGKVAGTRIVRQLWDQWFP
EGKVAGTRIVRQLWDQWFPFGEVEIGPNGV



FGEVEIGPNGVLKTKQGYKFPLHIIGTGEVDN
LKTKQGYKFPLHIIGTGEVDNDIKMERIVKHW



DIKMERIVKHWEHPHIEAAQTFLKKDDTEEVL
EHPHIEAAQTFLKKDDTEEVLYYGDTGVSKN



YYGDTGVSKNPVELVEGWFSGWRSSIMGVL
PVELVEGWFSGWRSSIMGVLAVIIGFVILIFLIR



AVIIGFVILIFLIRLIGVLSSLFRQKRRPIYKSDV
LIGVLSSLFRQKRRPIYKSDVEMAHFR



EMAHFR (SEQ ID NO: 11)
(SEQ ID NO: 12)



Positions of signal peptide: 1-16






Carajas

MKMKMVIAGLILCIGILPAIGKITISFPQSLKGD

KITISFPQSLKGDWRPVPKGYNYCPTSADKNL



WRPVPKGYNYCPTSADKNLHGDLIDIGLRLR
HGDLIDIGLRLRAPKSFKGISADGWMCHAAR



APKSFKGISADGWMCHAARWITTCDFRWYG
WITTCDFRWYGPKYITHSIHSFRPSNDQCKEA



PKYITHSIHSFRPSNDQCKEAIRLTNEGNWIN
IRLTNEGNWINPGFPPQSCGYASVTDSESVVV



PGFPPQSCGYASVTDSESVVVTVTKHQVLVD
TVTKHQVLVDEYSGSWIDSQFPGGSCTSPIC



EYSGSWIDSQFPGGSCTSPICDTVHNSTLWH
DTVHNSTLWHADHTLDSICDQEFVAMDAVLF



ADHTLDSICDQEFVAMDAVLFTESGKFEEFG
TESGKFEEFGKPNSGIRSNYFPYESLKDVCQM



KPNSGIRSNYFPYESLKDVCQMDFCKRKGFKL
DFCKRKGFKLPSGVWFEIEDAEKSHKAQVEL



PSGVWFEIEDAEKSHKAQVELKIKRCPHGAVI
KIKRCPHGAVISAPNQNAADINLIMDVERILDY



SAPNQNAADINLIMDVERILDYSLCQATWSKI
SLCQATWSKIQNKEALTPIDISYLGPKNPGPG



QNKEALTPIDISYLGPKNPGPGPAFTIINGTLH
PAFTIINGTLHYFNTRYIRVDIAGPVTKEITGF



YFNTRYIRVDIAGPVTKEITGFVSGTSTSRVL
VSGTSTSRVLWDQWFPYGENSIGPNGLLKTA



WDQWFPYGENSIGPNGLLKTASGYKYPLFM
SGYKYPLFMVGTGVLDADIHKLGEATVIEHP



VGTGVLDADIHKLGEATVIEHPHAKEAQKVV
HAKEAQKVVDDSEVIFFGDTGVSKNPVEVVE



DDSEVIFFGDTGVSKNPVEVVEGWFSGWRSS
GWFSGWRSSLMSIFGIILLIVCLVLIVRILIALKY



LMSIFGIILLIVCLVLIVRILIALKYCCVRHKKRTI
CCVRHKKRTIYKEDLEMGRIPRRA



YKEDLEMGRIPRRA (SEQ ID NO: 13)
(SEQ ID NO: 14)



Positions of signal peptide: 1-21









An optimal global alignment of the non-mutated the amino acid sequences of the ectodomain of the glycoprotein G of VSV strain Indiana versus each of Maraba, Cocal, Morreton, Alagoa, New Jersey, and Carajas strains is presented in FIGS. 3A to 3F. Based on these optimal global alignments, the percentage of identity, the similarity score (obtained using the Emboss Needle software with default parameters as defined above), and the positions in the amino acid sequences of the ectodomain of the glycoprotein G of VSV strains Maraba, Cocal, Morreton, Alagoa, New Jersey, and Carajas equivalent to positions 17-25 and 421-429 of SEQ ID NO:2 (VSV Indiana glycoprotein G ectodomain) may be determined. They are listed in Table 2 below:









TABLE 2







Positions in the amino acid sequences of the ectodomain of the glycoprotein G


of VSV strains Maraba, Cocal, Morreton, Alagoa, New Jersey, and Carajas equivalent to


positions 17-25 and 421-429 of SEQ ID NO: 2 (VSV Indiana glycoprotein G ectodomain), %


identity with SEQ ID NO: 2 after optimal global alignment and similarity score (as calculated


by Emboss Needle, using default parameters: Gap open = 10.0, Gap extend = 0.5,


End gap penalty = false, End gap open = 10.0, Blosum 62 matrix).











VSV strain






(sequence
Positions in
Equivalent

Similarity


identifier)
SEQ ID NO: 2
positions
% identity
score














Indiana
/
/
100%
2693


(SEQ ID NO: 2)






Maraba
17-25
17-25
78.8
2227


(SEQ ID NO: 4)
421-429
421-429




Cocal
17-25
17-25
73.1
2085


(SEQ ID NO: 6)
421-429
421-429




Morreton
17-25
17-25
86.3
2385


(SEQ ID NO: 8)
421-429
421-429




Alagoa
17-25
17-25
64.7
1916.5


(SEQ ID NO: 10)
421-429
421-429




New Jersey
17-25
17-25
51.6
1404


(SEQ ID NO: 12)
421-429
428-436




Carajas
17-25
17-25
56.4
1622


(SEQ ID NO: 14)
421-429
425-433











The alignments of FIGS. 3A to 3F also show that, in addition to identical amino acids between two amino acid sequences, some substitutions are conservative, ensuring the function of the ectodomain. The presence of such conservative substitutions is taken into account when calculating the similarity score of two sequences after optimal global alignment, in particular when using the Emboss Needle software with default parameters. Therefore, when an initial sequence different from SEQ ID NO:2 is used, in addition to an identity of at least 50% (or any other minimal identity percentage defined above) with SEQ ID NO:2, the initial sequence preferably has a similarity score after optimal global alignment (preferably using the Emboss Needle software with default parameters as defined above) of at least 1400, more preferably at least 1500, at least 1600, at least 1700, at least 1800, even more preferably at least 1900, at least 2000, at least 2100, at least 2200, or even at least 2300, knowing that the similarity score of SEQ ID NO:2 with itself is 2693.


A multiple amino acid sequences alignment of the ectodomains of the glycoprotein G of VSV strains Indiana, Maraba, Cocal, Morreton, Alagoa, New Jersey, and Carajas is presented in FIG. 4. As appears from FIG. 4, all amino sequences of the ectodomain of glycoprotein G of VSV, no matter the specific strain, contain conserved amino acids and domains (i.e. amino acids or domains that are identical between SEQ ID NO:2 and the other aligned amino acid sequences).


In particular, positions 17-25 of SEQ ID NO:2 contain conserved amino acids in positions 17-18, 21 and 23-25. A consensus sequence of positions 17-25 of amino sequences of the ectodomain of glycoprotein G of VSV strains Indiana, Maraba, Cocal, Morreton, Alagoa, New Jersey, and Carajas may be defined as VPX1X2YX3YCP (SEQ ID NO:15), wherein:

    • X1 is selected from serine (S), histidine (H), lysine (K), proline (P), and alanine (A),
    • X2 is selected from asparagine (N), glutamic acid (E), threonine (T), glycine (G), and serine (S), and
    • X3 is selected from histidine (H), asparagine (N), arginine (R), and glutamine (Q).


Similarly, positions 421-429 of SEQ ID NO:2 contain conserved amino acids in positions 421 and 426-429. A consensus sequence of positions 421-429 of amino sequences of the ectodomain of glycoprotein G of VSV strains Indiana, Maraba, Cocal, Morreton, Alagoa, New Jersey, and Carajas may be defined as EX4X5X6X7GDTG (SEQ ID NO:16), wherein:

    • X4 is selected from serine (S), valine (V), and threonine (T),
    • X5 is selected from leucine (L) and isoleucine (I),
    • X6 is selected from phenylalanine (F) and tyrosine (Y), and
    • X7 is selected from phenylalanine (F) and tyrosine (Y).


Therefore, in a preferred embodiment, the mutant polypeptide according to the invention comprises, or consists essentially of, or consists of:

    • a) the amino acid sequence as set forth in SEQ ID NO: 2 or a sequence having at least 50% of identity with the amino acid sequence as set forth in SEQ ID NO: 2 after optimal global alignment with SEQ ID NO:2, and
    • b) two domains of sequence VPX1X2YX3YCP (SEQ ID NO:15) and EX4X5X6X7GDTG (SEQ ID NO:16),


wherein:

    • i) one of X4 to X7 (corresponding to positions 422-425 of SEQ ID NO:2), preferably X4 (corresponding to position 422 of SEQ ID NO:2);
    • ii) one of X1 to X3 (corresponding to positions 19-20 and 22 of SEQ ID NO:2), preferably X3 (corresponding to position 22 of SEQ ID NO:2); or
    • iii) one of X1 to X3 (corresponding to positions 19-20 and 22 of SEQ ID NO:2) and one of X4 to X7 (corresponding to positions 422-425 of SEQ ID NO:2), preferably X3 and X4 (corresponding to positions 22 and 422 of SEQ ID NO:2),


is(are) substituted.


Moreover, amino acids conserved in all amino sequences of the ectodomain of glycoprotein G of VSV strains Indiana, Maraba, Cocal, Morreton, Alagoa, New Jersey, and Carajas correspond to positions 1, 4, 6-7, 12, 14, 17-18, 21, 23-25, 27, 29, 31, 33, 46-4755-56, 58-60, 64, 66-69, 71-75, 77, 79, 81-82, 87, 91-92, 96, 102, 107-111, 114-116, 119, 121, 127, 130, 132, 134, 137-139, 141, 143, 145, 151, 153, 158, 160, 162-165, 167, 177, 199, 204, 206-208, 210, 212, 219, 221, 224, 228, 231, 236-237, 241, 253, 259, 262, 268, 271-272, 274-277, 279-281, 283-285, 287-290, 299, 301, 304, 307-311, 313-314, 316-318, 320-321, 323, 325-326, 330, 332, 338, 345, 349, 354, 356-357, 360, 362, 368-372, 374, 376, 379, 381, 383-384, 388, 390, 393, 395, 407-408, 413, 421, 426-429, 431-434, 436, 439-443, 445, 447, 456, 467, 487, and 489-490 of SEQ ID NO:2. In the following, these positions are referred to as “conserved positions”.


Therefore, in some embodiments of the mutated polypeptide according to the invention, the amino acid sequence having at least 50% of identity with the amino acid sequence as set forth in SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2, has the same amino acids as SEQ ID NO: 2 in the above defined conserved positions.


An exception may be made for conserved positions 47 and 354, when it is desired to alter the ability of the ectodomain to bind to the low-density lipoprotein receptor (LDL-R) (see below section entitled “Further optional mutations”). In this case, the amino acid sequence having at least 50% of identity (or any other minimal % of identity disclosed above) with the amino acid sequence as set forth in SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2, may have the same amino acids as SEQ ID NO: 2 in the above defined conserved positions, except for possible additional substitution(s) in position 47 and/or 354 of SEQ ID NO:2.


Type of Amino Acid for Substitution


While the substitute amino acid(s) in the mutant polypeptide according to the invention may be selected from any amino acid different from the original amino acid, depending on the location(s) of the substitution(s), some particular types of amino acids are preferred.


In particular, when the mutant polypeptide according to the invention comprises a substitution in any amino acid residue located from position 421 to position 429 of SEQ ID NO:2 or equivalent position after optimal global alignment with SEQ ID NO:2, the original amino acid is preferably substituted by a hydrophobic amino acid, preferably selected for the group consisting of glycine (G), alanine (A), valine (V), leucine (L), isoleucine (I), methionine (M), proline (P), phenylalanine (F) and tryptophan (W), more preferably isoleucine (I), leucine (L), methionine (M), valine (V) and phenylalanine (F), even more preferably isoleucine (I). Indeed, the present inventors have shown that the presence of a hydrophobic amino acid in any amino acid residue located from position 421 to position 429 of SEQ ID NO:2 or equivalent position after optimal global alignment with SEQ ID NO:2 results in the stabilization of the glycoprotein G prefusion conformation.


When the mutant polypeptide according to the invention comprises a substitution in any amino acid residue located from position 17 to position 25 of SEQ ID NO:2 or equivalent position after optimal global alignment with SEQ ID NO:2, the original amino acid is preferably substituted by a polar amino acid, preferably selected from the group consisting of asparagine (N), glutamine (Q), cysteine (C), tyrosine (Y), threonine (T), serine (S), more preferably asparagine (N). Indeed, it is believed that the presence of a polar amino acid in any amino acid residue located from position 17 to position 25 of SEQ ID NO:2 or equivalent position after optimal global alignment with SEQ ID NO:2 results in the stabilization of the glycoprotein G prefusion conformation.


When the mutant polypeptide according to the invention comprises a substitution in any amino acid residue located from position 17 to position 25 of SEQ ID NO:2 or equivalent position after optimal global alignment with SEQ ID NO:2, and also a substitution in any amino acid residue located from position 421 to position 429 of SEQ ID NO:2 or equivalent position after optimal global alignment with SEQ ID NO:2:

    • a) the original amino acid located from position 17 to position 25 of SEQ ID NO:2 or equivalent position after optimal global alignment with SEQ ID NO:2 is preferably substituted by a polar amino acid, preferably selected from the group consisting of asparagine (N), glutamine (Q), cysteine (C), tyrosine (Y), threonine (T), serine (S), more preferably asparagine (N); and
    • b) the original amino acid located from position 421 to position 429 of SEQ ID NO:2 or equivalent position after optimal global alignment with SEQ ID NO:2 is preferably substituted by a hydrophobic amino acid, preferably selected for the group consisting of glycine (G), alanine (A), valine (V), leucine (L), isoleucine (I), methionine (M), proline (P), phenylalanine (F) and tryptophan (W), more preferably isoleucine (I), leucine (L), methionine (M), valine (V) and phenylalanine (F), even more preferably isoleucine (I) (FIG. 7).


Preferred Mutant Polypeptides


Some preferred mutant polypeptides according to the invention are based on the amino acid sequence of the ectodomain of glycoprotein G of VSV Indiana strain, and may be selected from:

    • a) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:17 corresponding to SEQ ID NO: 2 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue;
    • b) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:18 corresponding to SEQ ID NO: 2 with a substitution of the serine (S) amino acid residue in position 422 by an isoleucine (I) amino acid residue;
    • c) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:19 corresponding to SEQ ID NO: 2 with a substitution of the serine (S) amino acid residue in position 422 by a leucine (L) amino acid residue;
    • d) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:20 corresponding to SEQ ID NO: 2 with a substitution of the serine (S) amino acid residue in position 422 by a phenylalanine (F) amino acid residue;
    • e) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:21 corresponding to SEQ ID NO: 2 with a substitution of the serine (S) amino acid residue in position 422 by a methionine (M) amino acid residue;
    • f) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:22 corresponding to SEQ ID NO: 2 with a substitution of the serine (S) amino acid residue in position 422 by a valine (V) amino acid residue;
    • g) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:23 corresponding to SEQ ID NO: 2 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the serine (S) amino acid residue in position 422 by an isoleucine (I) amino acid residue;
    • h) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:24 corresponding to SEQ ID NO: 2 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the serine (S) amino acid residue in position 422 by a leucine (L) amino acid residue;
    • i) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:25 corresponding to SEQ ID NO: 2 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the serine (S) amino acid residue in position 422 by a phenylalanine (F) amino acid residue;
    • j) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:26 corresponding to SEQ ID NO: 2 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the serine (S) amino acid residue in position 422 by a methionine (M) amino acid residue;
    • k) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:27 corresponding to SEQ ID NO: 2 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the serine (S) amino acid residue in position 422 by a valine (V) amino acid residue; and
    • l) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as defined in any one of a) to k) above, and further comprising a signal peptide in N-terminal, preferably the signal peptide corresponding to positions 1-16 of SEQ ID NO:1.


Most preferred mutant polypeptides based on the amino acid sequence of the ectodomain of glycoprotein G of VSV Indiana strain are those comprising at least a substitution in position 422 (SEQ ID NO: 18 to 27 or a mutant polypeptide comprising one of these amino acid sequences and further comprising a signal peptide in N-terminal, preferably the signal peptide corresponding to positions 1-16 of SEQ ID NO:1).


The amino acid sequences SEQ ID NO:17 to 27 of preferred mutant polypeptide derived from VSV Indiana glycoprotein G ectodomain are presented in Table 3 below.









TABLE 3







Amino acid sequences SEQ ID NO: 17 to 27 based on Indiana VSV strain


glycoprotein G ectodonnain (SEQ ID NO: 2) and associated substitution


compared to SEQ ID NO: 2. For each substitution, the initial amino


acid in SEQ ID NO: 2 is indicated in one letter code and followed by


its position in SEQ ID NO: 2 and the substitute amino acid in one


letter code.









Sequence




identifier
Substitution(s)
Amino acid sequence





SEQ ID
H22N
KFTIVFPHNQKGNWKNVPSNYNYCPSSSDLNWHNDLIGTAIQVKMPKSH


NO: 17

KAIQADGWMCHASKWVTTCDFRWYGPKYITQSIRSFTPSVEQCKESIEQ




TKQGTWLNPGFPPQSCGYATVTDAEAVIVQVTPHHVLVDEYTGEWVDS




QFINGKCSNYICPTVHNSTTWHSDYKVKGLCDSNLISMDITFFSEDGELSS




LGKEGTGFRSNYFAYETGGKACKMQYCKHWGVRLPSGVWFEMADKDL




FAAARFPECPEGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAGLP




ISPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRVDIAAPILSRMVGMIS




GTTTERELWDDWAPYEDVEIGPNGVLRTSSGYKFPLYMIGHGMLDSDLH




LSSKAQVFEHPHIQDAASQLPDDESLFFGDTGLSKNPIELVEGWFSSWKS




SIASFFFIIGLIIGLFLVLRVGIHLCIKLKHTKKRQIYTDIEMNRLGK





SEQ ID
S422I
KFTIVFPHNQKGNWKNVPSNYHYCPSSSDLNWHNDLIGTAIQVKMPKSH


NO: 18

KAIQADGWMCHASKWVTTCDFRWYGPKYITQSIRSFTPSVEQCKESIEQ




TKQGTWLNPGFPPQSCGYATVTDAEAVIVQVTPHHVLVDEYTGEWVDS




QFINGKCSNYICPTVHNSTTWHSDYKVKGLCDSNLISMDITFFSEDGELSS




LGKEGTGFRSNYFAYETGGKACKMQYCKHWGVRLPSGVWFEMADKDL




FAAARFPECPEGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAGLP




ISPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRVDIAAPILSRMVGMIS




GTTTERELWDDWAPYEDVEIGPNGVLRTSSGYKFPLYMIGHGMLDSDLH




LSSKAQVFEHPHIQDAASQLPDDEILFFGDTGLSKNPIELVEGWFSSWKSS




IASFFFIIGLIIGLFLVLRVGIHLCIKLKHTKKRQIYTDIEMNRLGK





SEQ ID
S422L
KFTIVFPHNQKGNWKNVPSNYHYCPSSSDLNWHNDLIGTAIQVKMPKSH


NO: 19

KAIQADGWMCHASKWVTTCDFRWYGPKYITQSIRSFTPSVEQCKESIEQ




TKQGTWLNPGFPPQSCGYATVTDAEAVIVQVTPHHVLVDEYTGEWVDS




QFINGKCSNYICPTVHNSTTWHSDYKVKGLCDSNLISMDITFFSEDGELSS




LGKEGTGFRSNYFAYETGGKACKMQYCKHWGVRLPSGVWFEMADKDL




FAAARFPECPEGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAGLP




ISPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRVDIAAPILSRMVGMIS




GTTTERELWDDWAPYEDVEIGPNGVLRTSSGYKFPLYMIGHGMLDSDLH




LSSKAQVFEHPHIQDAASQLPDDELLFFGDTGLSKNPIELVEGWFSSWKS




SIASFFFIIGLIIGLFLVLRVGIHLCIKLKHTKKRQIYTDIEMNRLGK





SEQ ID
S422F
KFTIVFPHNQKGNWKNVPSNYHYCPSSSDLNWHNDLIGTAIQVKMPKSH


NO: 20

KAIQADGWMCHASKWVTTCDFRWYGPKYITQSIRSFTPSVEQCKESIEQ




TKQGTWLNPGFPPQSCGYATVTDAEAVIVQVTPHHVLVDEYTGEWVDS




QFINGKCSNYICPTVHNSTTWHSDYKVKGLCDSNLISMDITFFSEDGELSS




LGKEGTGFRSNYFAYETGGKACKMQYCKHWGVRLPSGVWFEMADKDL




FAAARFPECPEGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAGLP




ISPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRVDIAAPILSRMVGMIS




GTTTERELWDDWAPYEDVEIGPNGVLRTSSGYKFPLYMIGHGMLDSDLH




LSSKAQVFEHPHIQDAASQLPDDEFLFGDTGLSKNPIELVEGWFSSWKSSI




ASFFFIIGLIIGLFLVLRVGIHLCIKLKHTKKRQIYTDIEMNRLGK





SEQ ID
S422M
KFTIVFPHNQKGNWKNVPSNYHYCPSSSDLNWHNDLIGTAIQVKMPKSH


NO: 21

KAIQADGWMCHASKWVTTCDFRWYGPKYITQSIRSFTPSVEQCKESIEQ




TKQGTWLNPGFPPQSCGYATVTDAEAVIVQVTPHHVLVDEYTGEWVDS




QFINGKCSNYICPTVHNSTTWHSDYKVKGLCDSNLISMDITFFSEDGELSS




LGKEGTGFRSNYFAYETGGKACKMQYCKHWGVRLPSGVWFEMADKDL




FAAARFPECPEGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAGLP




ISPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRVDIAAPILSRMVGMIS




GTTTERELWDDWAPYEDVEIGPNGVLRTSSGYKFPLYMIGHGMLDSDLH




LSSKAQVFEHPHIQDAASQLPDDEMLFFGDTGLSKNPIELVEGWFSSWKS




SIASFFFIIGLIIGLFLVLRVGIHLCIKLKHTKKRQIYTDIEMNRLGK





SEQ ID
S422V
KFTIVFPHNQKGNWKNVPSNYHYCPSSSDLNWHNDLIGTAIQVKMPKSH


NO: 22

KAIQADGWMCHASKWVTTCDFRWYGPKYITQSIRSFTPSVEQCKESIEQ




TKQGTWLNPGFPPQSCGYATVTDAEAVIVQVTPHHVLVDEYTGEWVDS




QFINGKCSNYICPTVHNSTTWHSDYKVKGLCDSNLISMDITFFSEDGELSS




LGKEGTGFRSNYFAYETGGKACKMQYCKHWGVRLPSGVWFEMADKDL




FAAARFPECPEGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAGLP




ISPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRVDIAAPILSRMVGMIS




GTTTERELWDDWAPYEDVEIGPNGVLRTSSGYKFPLYMIGHGMLDSDLH




LSSKAQVFEHPHIQDAASQLPDDEVLFFGDTGLSKNPIELVEGWFSSWKS




SIASFFFIIGLIIGLFLVLRVGIHLCIKLKHTKKRQIYTDIEMNRLGK





SEQ ID
H22N and
KFTIVFPHNQKGNWKNVPSNYNYCPSSSDLNWHNDLIGTAIQVKMPKSH


NO: 23
S422I
KAIQADGWMCHASKWVTTCDFRWYGPKYITQSIRSFTPSVEQCKESIEQ




TKQGTWLNPGFPPQSCGYATVTDAEAVIVQVTPHHVLVDEYTGEWVDS




QFINGKCSNYICPTVHNSTTWHSDYKVKGLCDSNLISMDITFFSEDGELSS




LGKEGTGFRSNYFAYETGGKACKMQYCKHWGVRLPSGVWFEMADKDL




FAAARFPECPEGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAGLP




ISPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRVDIAAPILSRMVGMIS




GTTTERELWDDWAPYEDVEIGPNGVLRTSSGYKFPLYMIGHGMLDSDLH




LSSKAQVFEHPHIQDAASQLPDDEILFFGDTGLSKNPIELVEGWFSSWKSS




IASFFFIIGLIIGLFLVLRVGIHLCIKLKHTKKRQIYTDIEMNRLGK





SEQ ID
H22N and
KFTIVFPHNQKGNWKNVPSNYNYCPSSSDLNWHNDLIGTAIQVKMPKSH


NO: 24
S422L
KAIQADGWMCHASKWVTTCDFRWYGPKYITQSIRSFTPSVEQCKESIEQ




TKQGTWLNPGFPPQSCGYATVTDAEAVIVQVTPHHVLVDEYTGEWVDS




QFINGKCSNYICPTVHNSTTWHSDYKVKGLCDSNLISMDITFFSEDGELSS




LGKEGTGFRSNYFAYETGGKACKMQYCKHWGVRLPSGVWFEMADKDL




FAAARFPECPEGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAGLP




ISPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRVDIAAPILSRMVGMIS




GTTTERELWDDWAPYEDVEIGPNGVLRTSSGYKFPLYMIGHGMLDSDLH




LSSKAQVFEHPHIQDAASQLPDDELLFFGDTGLSKNPIELVEGWFSSWKS




SIASFFFIIGLIIGLFLVLRVGIHLCIKLKHTKKRQIYTDIEMNRLGK





SEQ ID
H22N and
KFTIVFPHNQKGNWKNVPSNYNYCPSSSDLNWHNDLIGTAIQVKMPKSH


NO: 25
S422F
KAIQADGWMCHASKWVTTCDFRWYGPKYITQSIRSFTPSVEQCKESIEQ




TKQGTWLNPGFPPQSCGYATVTDAEAVIVQVTPHHVLVDEYTGEWVDS




QFINGKCSNYICPTVHNSTTWHSDYKVKGLCDSNLISMDITFFSEDGELSS




LGKEGTGFRSNYFAYETGGKACKMQYCKHWGVRLPSGVWFEMADKDL




FAAARFPECPEGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAGLP




ISPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRVDIAAPILSRMVGMIS




GTTTERELWDDWAPYEDVEIGPNGVLRTSSGYKFPLYMIGHGMLDSDLH




LSSKAQVFEHPHIQDAASQLPDDEFLFGDTGLSKNPIELVEGWFSSWKSSI




ASFFFIIGLIIGLFLVLRVGIHLCIKLKHTKKRQIYTDIEMNRLGK





SEQ ID
H22N and
KFTIVFPHNQKGNWKNVPSNYNYCPSSSDLNWHNDLIGTAIQVKMPKSH


NO: 26
S422M
KAIQADGWMCHASKWVTTCDFRWYGPKYITQSIRSFTPSVEQCKESIEQ




TKQGTWLNPGFPPQSCGYATVTDAEAVIVQVTPHHVLVDEYTGEWVDS




QFINGKCSNYICPTVHNSTTWHSDYKVKGLCDSNLISMDITFFSEDGELSS




LGKEGTGFRSNYFAYETGGKACKMQYCKHWGVRLPSGVWFEMADKDL




FAAARFPECPEGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAGLP




ISPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRVDIAAPILSRMVGMIS




GTTTERELWDDWAPYEDVEIGPNGVLRTSSGYKFPLYMIGHGMLDSDLH




LSSKAQVFEHPHIQDAASQLPDDEMLFFGDTGLSKNPIELVEGWFSSWKS




SIASFFFIIGLIIGLFLVLRVGIHLCIKLKHTKKRQIYTDIEMNRLGK





SEQ ID
H22N and
KFTIVFPHNQKGNWKNVPSNYNYCPSSSDLNWHNDLIGTAIQVKMPKSH


NO: 27
S422V
KAIQADGWMCHASKWVTTCDFRWYGPKYITQSIRSFTPSVEQCKESIEQ




TKQGTWLNPGFPPQSCGYATVTDAEAVIVQVTPHHVLVDEYTGEWVDS




QFINGKCSNYICPTVHNSTTWHSDYKVKGLCDSNLISMDITFFSEDGELSS




LGKEGTGFRSNYFAYETGGKACKMQYCKHWGVRLPSGVWFEMADKDL




FAAARFPECPEGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAGLP




ISPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRVDIAAPILSRMVGMIS




GTTTERELWDDWAPYEDVEIGPNGVLRTSSGYKFPLYMIGHGMLDSDLH




LSSKAQVFEHPHIQDAASQLPDDEVLFFGDTGLSKNPIELVEGWFSSWKS




SIASFFFIIGLIIGLFLVLRVGIHLCIKLKHTKKRQIYTDIEMNRLGK









Other preferred mutant polypeptides according to the invention are based on the amino acid sequence of the ectodomain of glycoprotein G of VSV Maraba strain, and may be selected from:

    • a) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:28 corresponding to SEQ ID NO: 4 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue;
    • b) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:29 corresponding to SEQ ID NO: 4 with a substitution of the threonine (T) amino acid residue in position 422 by an isoleucine (I) amino acid residue;
    • c) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:30 corresponding to SEQ ID NO: 4 with a substitution of the threonine (T) amino acid residue in position 422 by a leucine (L) amino acid residue;
    • d) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:31 corresponding to SEQ ID NO: 4 with a substitution of the threonine (T) amino acid residue in position 422 by a phenylalanine (F) amino acid residue;
    • e) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:32 corresponding to SEQ ID NO: 4 with a substitution of the threonine (T) amino acid residue in position 422 by a methionine (M) amino acid residue;
    • f) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:33 corresponding to SEQ ID NO: 4 with a substitution of the threonine (T) amino acid residue in position 422 by a valine (V) amino acid residue;
    • g) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:34 corresponding to SEQ ID NO: 4 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by an isoleucine (I) amino acid residue;
    • h) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:35 corresponding to SEQ ID NO: 4 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a leucine (L) amino acid residue;
    • i) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:36 corresponding to SEQ ID NO: 4 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a phenylalanine (F) amino acid residue;
    • j) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:37 corresponding to SEQ ID NO: 4 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a methionine (M) amino acid residue; and
    • k) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:38 corresponding to SEQ ID NO: 4 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a valine (V) amino acid residue; and
    • l) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as defined in any one of a) to k) above, and further comprising a signal peptide in N-terminal, preferably the signal peptide corresponding to positions 1-16 of SEQ ID NO:3.


Most preferred mutant polypeptides based on the amino acid sequence of the ectodomain of glycoprotein G of VSV Maraba strain are those comprising at least a substitution in position 422 (SEQ ID NO: 29 to 38 or a mutant polypeptide comprising one of these amino acid sequences and further comprising a signal peptide in N-terminal, preferably the signal peptide corresponding to positions 1-16 of SEQ ID NO:3).


The amino acid sequences SEQ ID NO:28 to 38 of preferred mutant polypeptide derived from VSV Maraba glycoprotein G ectodomain are presented in Table 4 below.









TABLE 4







Amino acid sequences SEQ ID NO: 28 to 38 based on Maraba VSV strain


glycoprotein G ectodomain (SEQ ID NO: 4) and associated substitution


compared to SEQ ID NO: 4. For each substitution, the initial amino


acid in SEQ ID NO: 4 is indicated in one letter code and followed by


its position in SEQ ID NO: 4 and the substitute amino acid in one


letter code.









Sequence




identifier
Substitution(s)
Amino acid sequence





SEQ ID
H22N
KFTIVFPHHQKGNWKNVPSTYNYCPSSSDQNWHNDLTGVSLHVKIPKSH


NO: 28

KAIQADGWMCHAAKWVTTCDFRWYGPKYITHSIHSMSPTLEQCKTSIEQ




TKQGVWINPGFPPQSCGYATVTDAEVVVVQATPHHVLVDEYTGEWIDS




QLVGGKCSKEVCQTVHNSTVWHADYKITGLCESNLASVDITFFSEDGQK




TSLGKPNTGFRSNHFAYESGEKACRMQVCTQWGIRLPSGVWFELVDKD




LFQAAKLPECPRGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAKL




PVSPVDLSYLAPKNPGSGPAFTIINGTLKYFETRYIRVDISNPIIPHMVGTM




SGTTTERELWNDWYPYEDVEIGPNGVLKTPTGFKFPLYMIGHGMLDSDL




HKSSQAQVFEHPHAKDAASQLPDDETLFFGDTGLSKNPVELVEGWFSS




WKSTLASFFLIIGLGVALIFIIRIIVAIRYKYKGRKTQKIYNDVEMSRLGNK





SEQ ID
T422I
KFTIVFPHHQKGNWKNVPSTYHYCPSSSDQNWHNDLTGVSLHVKIPKSH


NO: 29

KAIQADGWMCHAAKWVTTCDFRWYGPKYITHSIHSMSPTLEQCKTSIEQ




TKQGVWINPGFPPQSCGYATVTDAEVVVVQATPHHVLVDEYTGEWIDS




QLVGGKCSKEVCQTVHNSTVWHADYKITGLCESNLASVDITFFSEDGQK




TSLGKPNTGFRSNHFAYESGEKACRMQYCTQWGIRLPSGVWFELVDKD




LFQAAKLPECPRGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAKL




PVSPVDLSYLAPKNPGSGPAFTIINGTLKYFETRYIRVDISNPIIPHMVGTM




SGTTTERELWNDWYPYEDVEIGPNGVLKTPTGFKFPLYMIGHGMLDSDL




HKSSQAQVFEHPHAKDAASQLPDDEILFFGDTGLSKNPVELVEGWFSSW




KSTLASFFLIIGLGVALIFIIRIIVAIRYKYKGRKTQKIYNDVEMSRLGNK





SEQ ID
T422L
KFTIVFPHHQKGNWKNVPSTYHYCPSSSDQNWHNDLTGVSLHVKIPKSH


NO: 30

KAIQADGWMCHAAKWVTTCDFRWYGPKYITHSIHSMSPTLEQCKTSIEQ




TKQGVWINPGFPPQSCGYATVTDAEVVVVQATPHHVLVDEYTGEWIDS




QLVGGKCSKEVCQTVHNSTVWHADYKITGLCESNLASVDITFFSEDGQK




TSLGKPNTGFRSNHFAYESGEKACRMQYCTQWGIRLPSGVWFELVDKD




LFQAAKLPECPRGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAKL




PVSPVDLSYLAPKNPGSGPAFTIINGTLKYFETRYIRVDISNPIIPHMVGTM




SGTTTERELWNDWYPYEDVEIGPNGVLKTPTGFKFPLYMIGHGMLDSDL




HKSSQAQVFEHPHAKDAASQLPDDELLFFGDTGLSKNPVELVEGWFSSW




KSTLASFFLIIGLGVALIFIIRIIVAIRYKYKGRKTQKIYNDVEMSRLGNK





SEQ ID
T422F
KFTIVFPHHQKGNWKNVPSTYHYCPSSSDQNWHNDLTGVSLHVKIPKSH


NO: 31

KAIQADGWMCHAAKWVTTCDFRWYGPKYITHSIHSMSPTLEQCKTSIEQ




TKQGVWINPGFPPQSCGYATVTDAEVVVVQATPHHVLVDEYTGEWIDS




QLVGGKCSKEVCQTVHNSTVWHADYKITGLCESNLASVDITFFSEDGQK




TSLGKPNTGFRSNHFAYESGEKACRMQYCTQWGIRLPSGVWFELVDKD




LFQAAKLPECPRGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAKL




PVSPVDLSYLAPKNPGSGPAFTIINGTLKYFETRYIRVDISNPIIPHMVGTM




SGTTTERELWNDWYPYEDVEIGPNGVLKTPTGFKFPLYMIGHGMLDSDL




HKSSQAQVFEHPHAKDAASQLPDDEFLFFGDTGLSKNPVELVEGWFSSW




KSTLASFFLIIGLGVALIFIIRIIVAIRYKYKGRKTQKIYNDVEMSRLGNK





SEQ ID
T422M
KFTIVFPHHQKGNWKNVPSTYHYCPSSSDQNWHNDLTGVSLHVKIPKSH


NO: 32

KAIQADGWMCHAAKWVTTCDFRWYGPKYITHSIHSMSPTLEQCKTSIEQ




TKQGVWINPGFPPQSCGYATVTDAEVVVVQATPHHVLVDEYTGEWIDS




QLVGGKCSKEVCQTVHNSTVWHADYKITGLCESNLASVDITFFSEDGQK




TSLGKPNTGFRSNHFAYESGEKACRMQYCTQWGIRLPSGVWFELVDKD




LFQAAKLPECPRGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAKL




PVSPVDLSYLAPKNPGSGPAFTIINGTLKYFETRYIRVDISNPIIPHMVGTM




SGTTTERELWNDWYPYEDVEIGPNGVLKTPTGFKFPLYMIGHGMLDSDL




HKSSQAQVFEHPHAKDAASQLPDDEMLFFGDTGLSKNPVELVEGWFSS




WKSTLASFFLIIGLGVALIFIIRIIVAIRYKYKGRKTQKIYNDVEMSRLGNK





SEQ ID
T422V
KFTIVFPHHQKGNWKNVPSTYHYCPSSSDQNWHNDLTGVSLHVKIPKSH


NO: 33

KAIQADGWMCHAAKWVTTCDFRWYGPKYITHSIHSMSPTLEQCKTSIEQ




TKQGVWINPGFPPQSCGYATVTDAEVVVVQATPHHVLVDEYTGEWIDS




QLVGGKCSKEVCQTVHNSTVWHADYKITGLCESNLASVDITFFSEDGQK




TSLGKPNTGFRSNHFAYESGEKACRMQYCTQWGIRLPSGVWFELVDKD




LFQAAKLPECPRGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAKL




PVSPVDLSYLAPKNPGSGPAFTIINGTLKYFETRYIRVDISNPIIPHMVGTM




SGTTTERELWNDWYPYEDVEIGPNGVLKTPTGFKFPLYMIGHGMLDSDL




HKSSQAQVFEHPHAKDAASQLPDDEVLFFGDTGLSKNPVELVEGWFSS




WKSTLASFFLIIGLGVALIFIIRIIVAIRYKYKGRKTQKIYNDVEMSRLGNK





SEQ ID
H22N and
KFTIVFPHHQKGNWKNVPSTYNYCPSSSDQNWHNDLTGVSLHVKIPKSH


NO: 34
T422I
KAIQADGWMCHAAKWVTTCDFRWYGPKYITHSIHSMSPTLEQCKTSIEQ




TKQGVWINPGFPPQSCGYATVTDAEVVVVQATPHHVLVDEYTGEWIDS




QLVGGKCSKEVCQTVHNSTVWHADYKITGLCESNLASVDITFFSEDGQK




TSLGKPNTGFRSNHFAYESGEKACRMQVCTQWGIRLPSGVWFELVDKD




LFQAAKLPECPRGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAKL




PVSPVDLSYLAPKNPGSGPAFTIINGTLKYFETRYIRVDISNPIIPHMVGTM




SGTTTERELWNDWYPYEDVEIGPNGVLKTPTGFKFPLYMIGHGMLDSDL




HKSSQAQVFEHPHAKDAASQLPDDEILFFGDTGLSKNPVELVEGWFSSW




KSTLASFFLIIGLGVALIFlIRIIVAIRYKYKGRKTQKIYNDVEMSRLGNK





SEQ ID
H22N and
KFTIVFPHHQKGNWKNVPSTYNYCPSSSDQNWHNDLTGVSLHVKIPKSH


NO: 35
T422L
KAIQADGWMCHAAKWVTTCDFRWYGPKYITHSIHSMSPTLEQCKTSIEQ




TKQGVWINPGFPPQSCGYATVTDAEVVVVQATPHHVLVDEYTGEWIDS




QLVGGKCSKEVCQTVHNSTVWHADYKITGLCESNLASVDITFFSEDGQK




TSLGKPNTGFRSNHFAYESGEKACRMQVCTQWGIRLPSGVWFELVDKD




LFQAAKLPECPRGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAKL




PVSPVDLSYLAPKNPGSGPAFTIINGTLKYFETRYIRVDISNPIIPHMVGTM




SGTTTERELWNDWYPYEDVEIGPNGVLKTPTGFKFPLYMIGHGMLDSDL




HKSSQAQVFEHPHAKDAASQLPDDELLFFGDTGLSKNPVELVEGWFSSW




KSTLASFFLIIGLGVALIFlIRIIVAIRYKYKGRKTQKIYNDVEMSRLGNK





SEQ ID
H22N and
KFTIVFPHHQKGNWKNVPSTYNYCPSSSDQNWHNDLTGVSLHVKIPKSH


NO: 36
T422F
KAIQADGWMCHAAKWVTTCDFRWYGPKYITHSIHSMSPTLEQCKTSIEQ




TKQGVWINPGFPPQSCGYATVTDAEVVVVQATPHHVLVDEYTGEWIDS




QLVGGKCSKEVCQTVHNSTVWHADYKITGLCESNLASVDITFFSEDGQK




TSLGKPNTGFRSNHFAYESGEKACRMQYCTQWGIRLPSGVWFELVDKD




LFQAAKLPECPRGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAKL




PVSPVDLSYLAPKNPGSGPAFTIINGTLKYFETRYIRVDISNPIIPHMVGTM




SGTTTERELWNDWYPYEDVEIGPNGVLKTPTGFKFPLYMIGHGMLDSDL




HKSSQAQVFEHPHAKDAASQLPDDEFLFFGDTGLSKNPVELVEGWFSSW




KSTLASFFLIIGLGVALIFIIRIIVAIRYKYKGRKTQKIYNDVEMSRLGNK





SEQ ID
H22N and
KFTIVFPHHQKGNWKNVPSTYNYCPSSSDQNWHNDLTGVSLHVKIPKSH


NO: 37
T422M
KAIQADGWMCHAAKWVTTCDFRWYGPKYITHSIHSMSPTLEQCKTSIEQ




TKQGVWINPGFPPQSCGYATVTDAEVVVVQATPHHVLVDEYTGEWIDS




QLVGGKCSKEVCQTVHNSTVWHADYKITGLCESNLASVDITFFSEDGQK




TSLGKPNTGFRSNHFAYESGEKACRMQYCTQWGIRLPSGVWFELVDKD




LFQAAKLPECPRGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAKL




PVSPVDLSYLAPKNPGSGPAFTIINGTLKYFETRYIRVDISNPIIPHMVGTM




SGTTTERELWNDWYPYEDVEIGPNGVLKTPTGFKFPLYMIGHGMLDSDL




HKSSQAQVFEHPHAKDAASQLPDDEMLFFGDTGLSKNPVELVEGWFSS




WKSTLASFFLIIGLGVALIFIIRIIVAIRYKYKGRKTQKIYNDVEMSRLGNK





SEQ ID
H22N and
KFTIVFPHHQKGNWKNVPSTYNYCPSSSDQNWHNDLTGVSLHVKIPKSH


NO: 38
T422V
KAIQADGWMCHAAKWVTTCDFRWYGPKYITHSIHSMSPTLEQCKTSIEQ




TKQGVWINPGFPPQSCGYATVTDAEVVVVQATPHHVLVDEYTGEWIDS




QLVGGKCSKEVCQTVHNSTVWHADYKITGLCESNLASVDITFFSEDGQK




TSLGKPNTGFRSNHFAYESGEKACRMQYCTQWGIRLPSGVWFELVDKD




LFQAAKLPECPRGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAKL




PVSPVDLSYLAPKNPGSGPAFTIINGTLKYFETRYIRVDISNPIIPHMVGTM




SGTTTERELWNDWYPYEDVEIGPNGVLKTPTGFKFPLYMIGHGMLDSDL




HKSSQAQVFEHPHAKDAASQLPDDEVLFFGDTGLSKNPVELVEGWFSS




WKSTLASFFLIIGLGVALIFIIRIIVAIRYKYKGRKTQKIYNDVEMSRLGNK









Other preferred mutant polypeptides according to the invention are based on the amino acid sequence of the ectodomain of glycoprotein G of VSV Cocal strain, and may be selected from:

    • a) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:39 corresponding to SEQ ID NO: 6 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue;
    • b) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:40 corresponding to SEQ ID NO: 6 with a substitution of the threonine (T) amino acid residue in position 422 by an isoleucine (I) amino acid residue;
    • c) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:41 corresponding to SEQ ID NO: 6 with a substitution of the threonine (T) amino acid residue in position 422 by a leucine (L) amino acid residue;
    • d) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:42 corresponding to SEQ ID NO: 6 with a substitution of the threonine (T) amino acid residue in position 422 by a phenylalanine (F) amino acid residue;
    • e) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:43 corresponding to SEQ ID NO: 6 with a substitution of the threonine (T) amino acid residue in position 422 by a methionine (M) amino acid residue;
    • f) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:44 corresponding to SEQ ID NO: 6 with a substitution of the threonine (T) amino acid residue in position 422 by a valine (V) amino acid residue;
    • g) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:45 corresponding to SEQ ID NO: 6 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by an isoleucine (I) amino acid residue;
    • h) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:46 corresponding to SEQ ID NO: 6 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a leucine (L) amino acid residue;
    • i) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:47 corresponding to SEQ ID NO: 6 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a phenylalanine (F) amino acid residue;
    • j) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:48 corresponding to SEQ ID NO: 6 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a methionine (M) amino acid residue; and
    • k) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:49 corresponding to SEQ ID NO: 6 with a substitution of the histidine (H) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a valine (V) amino acid residue; and
    • l) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as defined in any one of a) to k) above, and further comprising a signal peptide in N-terminal, preferably the signal peptide corresponding to positions 1-17 of SEQ ID NO:5.


Most preferred mutant polypeptides based on the amino acid sequence of the ectodomain of glycoprotein G of VSV Cocal strain are those comprising at least a substitution in position 422 (SEQ ID NO: 40 to 49 or a mutant polypeptide comprising one of these amino acid sequences and further comprising a signal peptide in N-terminal, preferably the signal peptide corresponding to positions 1-17 of SEQ ID NO:5).


The amino acid sequences SEQ ID NO:39 to 49 of preferred mutant polypeptide derived from VSV Cocal glycoprotein G ectodomain are presented in Table 5 below.









TABLE 5







Amino acid sequences SEQ ID NO: 39 to 49 based on Cocal VSV strain


glycoprotein G ectodomain (SEQ ID NO: 6) and associated substitution


compared to SEQ ID NO: 6. For each substitution, the initial amino acid


in SEQ ID NO: 6 is indicated in one letter code and followed by its


position in SEQ ID NO: 6 and the substitute amino acid in one letter code.









Sequence




identifier
Substitution(s)
Amino acid sequence





SEQ ID
H22N
KFSIVFPQSQKGNWKNVPSSYNYCPSSSDQNWHNDLLGITMKVKMPKTH


NO: 39

KAIQADGWMCHAAKWITTCDFRWYGPKYITHSIHSIQPTSEQCKESIKQT




KQGTWMSPGFPPQNCGYATVTDSVAVVVQATPHHVLVDEYTGEWIDS




QFPNGKCETEECETVHNSTVWYSDYKVTGLCDATLVDTEITFFSEDGKK




ESIGKPNTGYRSNYFAYEKGDKVCKMNYCKHAGVRLPSGVWFEFVDQD




VYAAAKLPECPVGATISAPTQTSVDVSLILDVERILDYSLCQETWSKIRSK




QPVSPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRIDIDNPIISKMVGKI




SGSQTERELWTEWFPYEGVEIGPNGILKTPTGYKFPLFMIGHGMLDSDL




HKTSQAEVFEHPHLAEAPKQLPEEETLFFGDTGISKNPVELIEGWFSSWK




STVVTFFFAIGVFILLYVVARIVIAVRYRYQGSNNKRIYNDIEMSRFRK





SEQ ID
T422I
KFSIVFPQSQKGNWKNVPSSYHYCPSSSDQNWHNDLLGITMKVKMPKTH


NO: 40

KAIQADGWMCHAAKWITTCDFRWYGPKYITHSIHSIQPTSEQCKESIKQT




KQGTWMSPGFPPQNCGYATVTDSVAVVVQATPHHVLVDEYTGEWIDS




QFPNGKCETEECETVHNSTVWYSDYKVTGLCDATLVDTEITFFSEDGKK




ESIGKPNTGYRSNYFAYEKGDKVCKMNYCKHAGVRLPSGVWFEFVDQD




VYAAAKLPECPVGATISAPTQTSVDVSLILDVERILDYSLCQETWSKIRSK




QPVSPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRIDIDNPIISKMVGKI




SGSQTERELWTEWFPYEGVEIGPNGILKTPTGYKFPLFMIGHGMLDSDL




HKTSQAEVFEHPHLAEAPKQLPEEEILFFGDTGISKNPVELIEGWFSSWKS




TVVTFFFAIGVFILLYVVARIVIAVRYRYQGSNNKRIYNDIEMSRFRK





SEQ ID
T422L
KFSIVFPQSQKGNWKNVPSSYHYCPSSSDQNWHNDLLGITMKVKMPKTH


NO: 41

KAIQADGWMCHAAKWITTCDFRWYGPKYITHSIHSIQPTSEQCKESIKQT




KQGTWMSPGFPPQNCGYATVTDSVAVVVQATPHHVLVDEYTGEWIDS




QFPNGKCETEECETVHNSTVWYSDYKVTGLCDATLVDTEITFFSEDGKK




ESIGKPNTGYRSNYFAYEKGDKVCKMNYCKHAGVRLPSGVWFEFVDQD




VYAAAKLPECPVGATISAPTQTSVDVSLILDVERILDYSLCQETWSKIRSK




QPVSPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRIDIDNPIISKMVGKI




SGSQTERELWTEWFPYEGVEIGPNGILKTPTGYKFPLFMIGHGMLDSDL




HKTSQAEVFEHPHLAEAPKQLPEEELLFFGDTGISKNPVELIEGWFSSWK




STVVTFFFAIGVFILLYVVARIVIAVRYRYQGSNNKRIYNDIEMSRFRK





SEQ ID
T422F
KFSIVFPQSQKGNWKNVPSSYHYCPSSSDQNWHNDLLGITMKVKMPKTH


NO: 42

KAIQADGWMCHAAKWITTCDFRWYGPKYITHSIHSIQPTSEQCKESIKQT




KQGTWMSPGFPPQNCGYATVTDSVAVVVQATPHHVLVDEYTGEWIDS




QFPNGKCETEECETVHNSTVWYSDYKVTGLCDATLVDTEITFFSEDGKK




ESIGKPNTGYRSNYFAYEKGDKVCKMNYCKHAGVRLPSGVWFEFVDQD




VYAAAKLPECPVGATISAPTQTSVDVSLILDVERILDYSLCQETWSKIRSK




QPVSPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRIDIDNPIISKMVGKI




SGSQTERELWTEWFPYEGVEIGPNGILKTPTGYKFPLFMIGHGMLDSDL




HKTSQAEVFEHPHLAEAPKQLPEEEFLFFGDTGISKNPVELIEGWFSSWK




STVVTFFFAIGVFILLYVVARIVIAVRYRYQGSNNKRIYNDIEMSRFRK





SEQ ID
T422M
KFSIVFPQSQKGNWKNVPSSYHYCPSSSDQNWHNDLLGITMKVKMPKTH


NO: 43

KAIQADGWMCHAAKWITTCDFRWYGPKYITHSIHSIQPTSEQCKESIKQT




KQGTWMSPGFPPQNCGYATVTDSVAVVVQATPHHVLVDEYTGEWIDS




QFPNGKCETEECETVHNSTVWYSDYKVTGLCDATLVDTEITFFSEDGKK




ESIGKPNTGYRSNYFAYEKGDKVCKMNYCKHAGVRLPSGVWFEFVDQD




VYAAAKLPECPVGATISAPTQTSVDVSLILDVERILDYSLCQETWSKIRSK




QPVSPVDLSYLAPKNPGTGAFTIINGTLKYFETRYIRIDIDNPIISKMVGKI




SGSQTERELWTEWFPYEGVEIGPNGILKTPTGYKFPLFMIGHGMLDSDL




HKTSQAEVFEHPHLAEAPKQLPEEEMLFFGDTGISKNPVELIEGWFSSWK




STVVTFFFAIGVFILLYVVARIVIAVRYRYQGSNNKRIYNDIMSRFRK





SEQ ID
T422V
KFSIVFPQSQKGNWKNVPSSYHYCPSSSDQNWHNDLLGITMKVKMPKTH


NO: 44

KAIQADGWMCHAAKWITTCDFRWYGPKYITHSIHSIQPTSEQCKESIKQT




KQGTWMSPGFPPQNCGYATVTDSVAVVVQATPHHVLVDEYTGEWIDS




QFPNGKCETEECETVHNSTVWYSDYKVTGLCDATLVDTEITFFSEDGKK




ESIGKPNTGYRSNYFAYEKGDKVCKMNYCKHAGVRLPSGVWFEFVDQD




VYAAAKLPECPVGATISAPTQTSVDVSLILDVERILDYSLCQETWSKIRSK




QPVSPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRIDIDNPIISKMVGKI




SGSQTERELWTEWFPYEGVEIGPNGILKTPTGYKFPLFMIGHGMLDSDL




HKTSQAEVFEHPHLAEAPKQLPEEEVLFFGDTGISKNPVELIEGWFSSWK




STVVTFFFAIGVFILLYVVARIVIAVRYRYQGSNNKRIYNDIEMSRFRK





SEQ ID
H22N and
KFSIVFPQSQKGNWKNVPSSYNYCPSSSDQNWHNDLLGITMKVKMPKTH


NO: 45
T422I
KAIQADGWMCHAAKWITTCDFRWYGPKYITHSIHSIQPTSEQCKESIKQT




KQGTWMSPGFPPQNCGYATVTDSVAVVVQATPHHVLVDEYTGEWIDS




QFPNGKCETEECETVHNSTVWYSDYKVTGLCDATLVDTEITFFSEDGKK




ESIGKPNTGYRSNYFAYEKGDKVCKMNYCKHAGVRLPSGVWFEFVDQD




VYAAAKLPECPVGATISAPTQTSVDVSLILDVERILDYSLCQETWSKIRSK




QPVSPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRIDIDNPIISKMVGKI




SGSQTERELWTEWFPYEGVEIGPNGILKTPTGYKFPLFMIGHGMLDSDL




HKTSQAEVFEHPHLAEAPKQLPEEEILFFGDTGISKNPVELIEGWFSSWKS




TVVTFFFAIGVFILLYVVARIVIAVRYRYQGSNNKRIYNDIEMSRFRK





SEQ ID
H22N and
KFSIVFPQSQKGNWKNVPSSYNYCPSSSDQNWHNDLLGITMKVKMPKTH


NO: 46
T422L
KAIQADGWMCHAAKWITTCDFRWYGPKYITHSIHSIQPTSEQCKESIKQT




KQGTWMSPGFPPQNCGYATVTDSVAVVVQATPHHVLVDEYTGEWIDS




QFPNGKCETEECETVHNSTVWYSDYKVTGLCDATLVDTEITFFSEDGKK




ESIGKPNTGYRSNYFAYEKGDKVCKMNYCKHAGVRLPSGVWFEFVDQD




VYAAAKLPECPVGATISAPTQTSVDVSLILDVERILDYSLCQETWSKIRSK




QPVSPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRIDIDNPIISKMVGKI




SGSQTERELWTEWFPYEGVEIGPNGILKTPTGYKFPLFMIGHGMLDSDL




HKTSQAEVFEHPHLAEAPKQLPEEELLFFGDTGISKNPVELIEGWFSSWK




STVVTFFFAIGVFILLYVVARIVIAVRYRYQGSNNKRIYNDIEMSRFRK





SEQ ID
H22N and
KFSIVFPQSQKGNWKNVPSSYNYCPSSSDQNWHNDLLGITMKVKMPKTH


NO: 47
T422F
KAIQADGWMCHAAKWITTCDFRWYGPKYITHSIHSIQPTSEQCKESIKQT




KQGTWMSPGFPPQNCGYATVTDSVAVVVQATPHHVLVDEYTGEWIDS




QFPNGKCETEECETVHNSTVWYSDYKVTGLCDATLVDTEITFFSEDGKK




ESIGKPNTGYRSNYFAYEKGDKVCKMNYCKHAGVRLPSGVWFEFVDQD




VYAAAKLPECPVGATISAPTQTSVDVSLILDVERILDYSLCQETWSKIRSK




QPVSPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRIDIDNPIISKMVGKI




SGSQTERELWTEWFPYEGVEIGPNGILKTPTGYKFPLFMIGHGMLDSDL




HKTSQAEVFEHPHLAEAPKQLPEEEFLFFGDTGISKNPVELIEGWFSSWK




STVVTFFFAIGVFILLYVVARIVIAVRYRYQGSNNKRIYNDIEMSRFRK





SEQ ID
H22N and
KFSIVFPQSQKGNWKNVPSSYNYCPSSSDQNWHNDLLGITMKVKMPKTH


NO: 48
T422M
KAIQADGWMCHAAKWITTCDFRWYGPKYITHSIHSIQPTSEQCKESIKQT




KQGTWMSPGFPPQNCGYATVTDSVAVVVQATPHHVLVDEYTGEWIDS




QFPNGKCETEECETVHNSTVWYSDYKVTGLCDATLVDTEITFFSEDGKK




ESIGKPNTGYRSNYFAYEKGDKVCKMNYCKHAGVRLPSGVWFEFVDQD




VYAAAKLPECPVGATISAPTQTSVDVSLILDVERILDYSLCQETWSKIRSK




QPVSPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRIDIDNPIISKMVGKI




SGSQTERELWTEWFPYEGVEIGPNGILKTPTGYKFPLFMIGHGMLDSDL




HKTSQAEVFEHPHLAEAPKQLPEEEMLFFGDTGISKNPVELIEGWFSSWK




STVVTFFFAIGVFILLYVVARIVIAVRYRYQGSNNKRIYNDIEMSRFRK





SEQ ID
H22N and
KFSIVFPQSQKGNWKNVPSSYNYCPSSSDQNWHNDLLGITMKVKMPKTH


NO: 49
T422V
KAIQADGWMCHAAKWITTCDFRWYGPKYITHSIHSIQPTSEQCKESIKQT




KQGTWMSPGFPPQNCGYATVTDSVAVVVQATPHHVLVDEYTGEWIDS




QFPNGKCETEECETVHNSTVWYSDYKVTGLCDATLVDTEITFFSEDGKK




ESIGKPNTGYRSNYFAYEKGDKVCKMNYCKHAGVRLPSGVWFEFVDQD




VYAAAKLPECPVGATISAPTQTSVDVSLILDVERILDYSLCQETWSKIRSK




QPVSPVDLSYLAPKNPGTGPAFTIINGTLKYFETRYIRIDIDNPIISKMVGKI




SGSQTERELWTEWFPYEGVEIGPNGILKTPTGYKFPLFMIGHGMLDSDL




HKTSQAEVFEHPHLAEAPKQLPEEEVLFFGDTGISKNPVELIEGWFSSWK




STVVTFFFAIGVFILLYVVARIVIAVRYRYQGSNNKRIYNDIEMSRFRK









Other preferred mutant polypeptides according to the invention are based on the amino acid sequence of the ectodomain of glycoprotein G of VSV Morreton strain, and may be selected from:

    • a) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:50 corresponding to SEQ ID NO:8 with a substitution of the glutamine (Q) amino acid residue in position 22 by an asparagine (N) amino acid residue;
    • b) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:51 corresponding to SEQ ID NO:8 with a substitution of the threonine (T) amino acid residue in position 422 by an isoleucine (I) amino acid residue;
    • c) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:52 corresponding to SEQ ID NO:8 with a substitution of the threonine (T) amino acid residue in position 422 by a leucine (L) amino acid residue;
    • d) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:53 corresponding to SEQ ID NO:8 with a substitution of the threonine (T) amino acid residue in position 422 by a phenylalanine (F) amino acid residue;
    • e) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:54 corresponding to SEQ ID NO:8 with a substitution of the threonine (T) amino acid residue in position 422 by a methionine (M) amino acid residue;
    • f) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:55 corresponding to SEQ ID NO:8 with a substitution of the threonine (T) amino acid residue in position 422 by a valine (V) amino acid residue;
    • g) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:56 corresponding to SEQ ID NO:8 with a substitution of the glutamine (Q) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by an isoleucine (I) amino acid residue;
    • h) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:57 corresponding to SEQ ID NO:8 with a substitution of the glutamine (Q) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a leucine (L) amino acid residue;
    • i) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:58 corresponding to SEQ ID NO:8 with a substitution of the glutamine (Q) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a phenylalanine (F) amino acid residue;
    • j) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:59 corresponding to SEQ ID NO:8 with a substitution of the glutamine (Q) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a methionine (M) amino acid residue; and
    • k) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:60 corresponding to SEQ ID NO:8 with a substitution of the glutamine (Q) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a valine (V) amino acid residue; and
    • l) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as defined in any one of a) to k) above, and further comprising a signal peptide in N-terminal, preferably the signal peptide corresponding to positions 1-17 of SEQ ID NO:7.


Most preferred mutant polypeptides based on the amino acid sequence of the ectodomain of glycoprotein G of VSV Morreton strain are those comprising at least a substitution in position 422 (SEQ ID NO: 51 to 60 or a mutant polypeptide comprising one of these amino acid sequences and further comprising a signal peptide in N-terminal, preferably the signal peptide corresponding to positions 1-17 of SEQ ID NO:7).


The amino acid sequences SEQ ID NO:50 to 60 of preferred mutant polypeptide derived from VSV Morreton glycoprotein G ectodomain are presented in Table 6 below.









TABLE 6







Amino acid sequences SEQ ID NO: 50 to 60 based on Morreton VSV strain


glycoprotein G ectodomain (SEQ ID NO: 8) and associated substitution


compared to SEQ ID NO: 8. For each substitution, the initial amino acid


in SEQ ID NO: 8 is indicated in one letter code and followed by its


position in SEQ ID NO: 8 and the substitute amino acid in one letter code.









Sequence




identifier
Substitution(s)
Amino acid sequence





SEQ ID
Q22N
KFTIVFPHNQKGNWKNVPANYNYCPSSSDLNWHNGLIGTSLQVKMPKS


NO: 50

HKAIQADGWMCHAAKWVTTCDFRWYGPKYVTHSIKSMIPTVDQCKESI




AQTKQGTWLNPGFPPQSCGYASVTDAEAVIVKATPHQVLVDEYTGEWV




DSQFPTGKCNKDICPTVHNSTTWHSDYKVTGLCDANLISMDITFFSEDGK




LTSLGKEGTGFRSNYFAYENGDKACRMQYCKHWGVRLPSGVWFEMAD




KDIYNDAKFPDCPEGSSIAAPSQTSVDVSLIQDVERILDYSLCQETWSKIRA




HLPISPVDLSYLSPKNPGTGPAFTIINGTLKYFETRYIRVDIAGPIIPQMRG




VISGTTTERELWTDWYPYEDVEIGPNGVLKTATGYKFPLYMIGHGMLDS




DLHISSKAQVFEHPHIQDAASQLPDDETLFFGDTGLSKNPIELVEGWFSG




WKSTIASFFFIIGLVIGLYLVLRIGIALCIKCRVQEKRPKIYTDVEMNRLDR





SEQ ID
T422I
KFTIVFPHNQKGNWKNVPANYQYCPSSSDLNWHNGLIGTSLQVKMPKS


NO: 51

HKAIQADGWMCHAAKWVTTCDFRWYGPKYVTHSIKSMIPTVDQCKESI




AQTKQGTWLNPGFPPQSCGYASVTDAEAVIVKATPHQVLVDEYTGEWV




DSQFPTGKCNKDICPTVHNSTTWHSDYKVTGLCDANLISMDITFFSEDGK




LTSLGKEGTGFRSNYFAYENGDKACRMQYCKHWGVRLPSGVWFEMAD




KDIYNDAKFPDCPEGSSIAAPSQTSVDVSLIQDVERILDYSLCQETWSKIRA




HLPISPVDLSYLSPKNPGTGPAFTIINGTLKYFETRYIRVDIAGPIIPQMRG




VISGTTTERELWTDWYPYEDVEIGPNGVLKTATGYKFPLYMIGHGMLDS




DLHISSKAQVFEHPHIQDAASQLPDDEILFFGDTGLSKNPIELVEGWFSG




WKSTIASFFFIIGLVIGLYLVLRIGIALCIKCRVQEKRPKIYTDVEMNRLDR





SEQ ID
T422L
KFTIVFPHNQKGNWKNVPANYQYCPSSSDLNWHNGLIGTSLQVKMPKS


NO: 52

HKAIQADGWMCHAAKWVTTCDFRWYGPKYVTHSIKSMIPTVDQCKESI




AQTKQGTWLNPGFPPQSCGYASVTDAEAVIVKATPHQVLVDEYTGEWV




DSQFPTGKCNKDICPTVHNSTTWHSDYKVTGLCDANLISMDITFFSEDGK




LTSLGKEGTGFRSNYFAYENGDKACRMQYCKHWGVRLPSGVWFEMAD




KDIYNDAKFPDCPEGSSIAAPSQTSVDVSLIQDVERILDYSLCQETWSKIRA




HLPISPVDLSYLSPKNPGTGPAFTIINGTLKYFETRYIRVDIAGPIIPQMRG




VISGTTTERELWTDWYPYEDVEIGPNGVLKTATGYKFPLYMIGHGMLDS




DLHISSKAQVFEHPHIQDAASQLPDDELLFFGDTGLSKNPIELVEGWFSG




WKSTIASFFFIIGLVIGLYLVLRIGIALCIKCRVQEKRPKIYTDVEMNRLDR





SEQ ID
T422F
KFTIVFPHNQKGNWKNVPANYQYCPSSSDLNWHNGLIGTSLQVKMPKS


NO: 53

HKAIQADGWMCHAAKWVTTCDFRWYGPKYVTHSIKSMIPTVDQCKESI




AQTKQGTWLNPGFPPQSCGYASVTDAEAVIVKATPHQVLVDEYTGEWV




DSQFPTGKCNKDICPTVHNSTTWHSDYKVTGLCDANLISMDITFFSEDGK




LTSLGKEGTGFRSNYFAYENGDKACRMQYCKHWGVRLPSGVWFEMAD




KDIYNDAKFPDCPEGSSIAAPSQTSVDVSLIQDVERILDYSLCQETWSKIRA




HLPISPVDLSYLSPKNPGTGPAFTIINGTLKYFETRYIRVDIAGPIIPQMRG




VISGTTTERELWTDWYPYEDVEIGPNGVLKTATGYKFPLYMIGHGMLDS




DLHISSKAQVFEHPHIQDAASQLPDDEFLFFGDTGLSKNPIELVEGWFSG




WKSTIASFFFIIGLVIGLYLVLRIGIALCIKCRVQEKRPKIYTDVEMNRLDR





SEQ ID
T422M
KFTIVFPHNQKGNWKNVPANYQYCPSSSDLNWHNGLIGTSLQVKMPKS


NO: 54

HKAIQADGWMCHAAKWVTTCDFRWYGPKYVTHSIKSMIPTVDQCKESI




AQTKQGTWLNPGFPPQSCGYASVTDAEAVIVKATPHQVLVDEYTGEWV




DSQFPTGKCNKDICPTVHNSTTWHSDYKVTGLCDANLISMDITFFSEDGK




LTSLGKEGTGFRSNYFAYENGDKACRMQYCKHWGVRLPSGVWFEMAD




KDIYNDAKFPDCPEGSSIAAPSQTSVDVSLIQDVERILDYSLCQETWSKIRA




HLPISPVDLSYLSPKNPGTGPAFTIINGTLKYFETRYIRVDIAGPIIPQMRG




VISGTTTERELWTDWYPYEDVEIGPNGVLKTATGYKFPLYMIGHGMLDS




DLHISSKAQVFEHPHIQDAASQLPDDEMLFFGDTGLSKNPIELVEGWFSG




WKSTIASFFFIIGLVIGLYLVLRIGIALCIKCRVQEKRPKIYTDVEMNRLDR





SEQ ID
T422V
KFTIVFPHNQKGNWKNVPANYQYCPSSSDLNWHNGLIGTSLQVKMPKS


NO: 55

HKAIQADGWMCHAAKWVTTCDFRWYGPKYVTHSIKSMIPTVDQCKESI




AQTKQGTWLNPGFPPQSCGYASVTDAEAVIVKATPHQVLVDEYTGEWV




DSQFPTGKCNKDICPTVHNSTTWHSDYKVTGLCDANLISMDITFFSEDGK




LTSLGKEGTGFRSNYFAYENGDKACRMQYCKHWGVRLPSGVWFEMAD




KDIYNDAKFPDCPEGSSIAAPSQTSVDVSLIQDVERILDYSLCQETWSKIRA




HLPISPVDLSYLSPKNPGTGPAFTIINGTLKYFETRYIRVDIAGPIIPQMRG




VISGTTTERELWTDWYPYEDVEIGPNGVLKTATGYKFPLYMIGHGMLDS




DLHISSKAQVFEHPHIQDAASQLPDDEVLFFGDTGLSKNPIELVEGWFSG




WKSTIASFFFIIGLVIGLYLVLRIGIALCIKCRVQEKRPKIYTDVEMNRLDR





SEQ ID
Q22N and
KFTIVFPHNQKGNWKNVPANYNYCPSSSDLNWHNGLIGTSLQVKMPKS


NO: 56
T422I
HKAIQADGWMCHAAKWVTTCDFRWYGPKYVTHSIKSMIPTVDQCKESI




AQTKQGTWLNPGFPPQSCGYASVTDAEAVIVKATPHQVLVDEYTGEWV




DSQFPTGKCNKDICPTVHNSTTWHSDYKVTGLCDANLISMDITFFSEDGK




LTSLGKEGTGFRSNYFAYENGDKACRMQYCKHWGVRLPSGVWFEMAD




KDIYNDAKFPDCPEGSSIAAPSQTSVDVSLIQDVERILDYSLCQETWSKIRA




HLPISPVDLSYLSPKNPGTGPAFTIINGTLKYFETRYIRVDIAGPIIPQMRG




VISGTTTERELWTDWYPYEDVEIGPNGVLKTATGYKFPLYMIGHGMLDS




DLHISSKAQVFEHPHIQDAASQLPDDEILFFGDTGLSKNPIELVEGWFSG




WKSTIASFFFIIGLVIGLYLVLRIGIALCIKCRVQEKRPKIYTDVEMNRLDR





SEQ ID
Q22N and
KFTIVFPHNQKGNWKNVPANYNYCPSSSDLNWHNGLIGTSLQVKMPKS


NO: 57
T422L
HKAIQADGWMCHAAKWVTTCDFRWYGPKYVTHSIKSMIPTVDQCKESI




AQTKQGTWLNPGFPPQSCGYASVTDAEAVIVKATPHQVLVDEYTGEWV




DSQFPTGKCNKDICPTVHNSTTWHSDYKVTGLCDANLISMDITFFSEDGK




LTSLGKEGTGFRSNYFAYENGDKACRMQYCKHWGVRLPSGVWFEMAD




KDIYNDAKFPDCPEGSSIAAPSQTSVDVSLIQDVERILDYSLCQETWSKIRA




HLPISPVDLSYLSPKNPGTGPAFTIINGTLKYFETRYIRVDIAGPIIPQMRG




VISGTTTERELWTDWYPYEDVEIGPNGVLKTATGYKFPLYMIGHGMLDS




DLHISSKAQVFEHPHIQDAASQLPDDELLFFGDTGLSKNPIELVEGWFSG




WKSTIASFFFIIGLVIGLYLVLRIGIALCIKCRVQEKRPKIYTDVEMNRLDR





SEQ ID
Q22N and
KFTIVFPHNQKGNWKNVPANYNYCPSSSDLNWHNGLIGTSLQVKMPKS


NO: 58
T422F
HKAIQADGWMCHAAKWVTTCDFRWYGPKYVTHSIKSMIPTVDQCKESI




AQTKQGTWLNPGFPPQSCGYASVTDAEAVIVKATPHQVLVDEYTGEWV




DSQFPTGKCNKDICPTVHNSTTWHSDYKVTGLCDANLISMDITFFSEDGK




LTSLGKEGTGFRSNYFAYENGDKACRMQYCKHWGVRLPSGVWFEMAD




KDIYNDAKFPDCPEGSSIAAPSQTSVDVSLIQDVERILDYSLCQETWSKIRA




HLPISPVDLSYLSPKNPGTGPAFTIINGTLKYFETRYIRVDIAGPIIPQMRG




VISGTTTERELWTDWYPYEDVEIGPNGVLKTATGYKFPLYMIGHGMLDS




DLHISSKAQVFEHPHIQDAASQLPDDEFLFFGDTGLSKNPIELVEGWFSG




WKSTIASFFFIIGLVIGLYLVLRIGIALCIKCRVQEKRPKIYTDVEMNRLDR





SEQ ID
Q22N and
KFTIVFPHNQKGNWKNVPANYNYCPSSSDLNWHNGLIGTSLQVKMPKS


NO: 59
T422M
HKAIQADGWMCHAAKWVTTCDFRWYGPKYVTHSIKSMIPTVDQCKESI




AQTKQGTWLNPGFPPQSCGYASVTDAEAVIVKATPHQVLVDEYTGEWV




DSQFPTGKCNKDICPTVHNSTTWHSDYKVTGLCDANLISMDITFFSEDGK




LTSLGKEGTGFRSNYFAYENGDKACRMQYCKHWGVRLPSGVWFEMAD




KDIYNDAKFPDCPEGSSIAAPSQTSVDVSLIQDVERILDYSLCQETWSKIRA




HLPISPVDLSYLSPKNPGTGPAFTIINGTLKYFETRYIRVDIAGPIIPQMRG




VISGTTTERELWTDWYPYEDVEIGPNGVLKTATGYKFPLYMIGHGMLDS




DLHISSKAQVFEHPHIQDAASQLPDDEMLFFGDTGLSKNPIELVEGWFSG




WKSTIASFFFIIGLVIGLYLVLRIGIALCIKCRVQEKRPKIYTDVEMNRLDR





SEQ ID
Q22N and
KFTIVFPHNQKGNWKNVPANYNYCPSSSDLNWHNGLIGTSLQVKMPKS


NO: 60
T422V
HKAIQADGWMCHAAKWVTTCDFRWYGPKYVTHSIKSMIPTVDQCKESI




AQTKQGTWLNPGFPPQSCGYASVTDAEAVIVKATPHQVLVDEYTGEWV




DSQFPTGKCNKDICPTVHNSTTWHSDYKVTGLCDANLISMDITFFSEDGK




LTSLGKEGTGFRSNYFAYENGDKACRMQYCKHWGVRLPSGVWFEMAD




KDIYNDAKFPDCPEGSSIAAPSQTSVDVSLIQDVERILDYSLCQETWSKIRA




HLPISPVDLSYLSPKNPGTGPAFTIINGTLKYFETRYIRVDIAGPIIPQMRG




VISGTTTERELWTDWYPYEDVEIGPNGVLKTATGYKFPLYMIGHGMLDS




DLHISSKAQVFEHPHIQDAASQLPDDEVLFFGDTGLSKNPIELVEGWFSG




WKSTIASFFFIIGLVIGLYLVLRIGIALCIKCRVQEKRPKIYTDVEMNRLDR









Other preferred mutant polypeptides according to the invention are based on the amino acid sequence of the ectodomain of glycoprotein G of VSV Alagoa strain, and may be selected from:

    • a) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:61 corresponding to SEQ ID NO:10 with a substitution of the arginine (R) amino acid residue in position 22 by an asparagine (N) amino acid residue;
    • b) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:62 corresponding to SEQ ID NO:10 with a substitution of the threonine (T) amino acid residue in position 422 by an isoleucine (I) amino acid residue;
    • c) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:63 corresponding to SEQ ID NO:10 with a substitution of the threonine (T) amino acid residue in position 422 by a leucine (L) amino acid residue;
    • d) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:64 corresponding to SEQ ID NO:10 with a substitution of the threonine (T) amino acid residue in position 422 by a phenylalanine (F) amino acid residue;
    • e) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:65 corresponding to SEQ ID NO:10 with a substitution of the threonine (T) amino acid residue in position 422 by a methionine (M) amino acid residue;
    • f) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:66 corresponding to SEQ ID NO:10 with a substitution of the threonine (T) amino acid residue in position 422 by a valine (V) amino acid residue;
    • g) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:67 corresponding to SEQ ID NO:10 with a substitution of the arginine (R) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by an isoleucine (I) amino acid residue;
    • h) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:68 corresponding to SEQ ID NO:10 with a substitution of the arginine (R) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a leucine (L) amino acid residue;
    • i) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:69 corresponding to SEQ ID NO:10 with a substitution of the arginine (R) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a phenylalanine (F) amino acid residue;
    • j) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:70 corresponding to SEQ ID NO:10 with a substitution of the arginine (R) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a methionine (M) amino acid residue; and
    • k) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:71 corresponding to SEQ ID NO:10 with a substitution of the arginine (R) amino acid residue in position 22 by an asparagine (N) amino acid residue and a substitution of the threonine (T) amino acid residue in position 422 by a valine (V) amino acid residue; and
    • l) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as defined in any one of a) to k) above, and further comprising a signal peptide in N-terminal, preferably the signal peptide corresponding to positions 1-17 of SEQ ID NO:9.


Most preferred mutant polypeptides based on the amino acid sequence of the ectodomain of glycoprotein G of VSV Alagoa strain are those comprising at least a substitution in position 422 (SEQ ID NO: 62 to 71 or a mutant polypeptide comprising one of these amino acid sequences and further comprising a signal peptide in N-terminal, preferably the signal peptide corresponding to positions 1-17 of SEQ ID NO:9).


The amino acid sequences SEQ ID NO:61 to 71 of preferred mutant polypeptide derived from VSV Alagoa glycoprotein G ectodomain are presented in Table 7 below.









TABLE 7







Amino acid sequences SEQ ID NO: 61 to 71 based on Alagoa VSV


strain glycoprotein G ectodomain (SEQ ID NO: 10) and associated


substitution compared to SEQ ID NO: 10. For each substitution,


the initial amino acid in SEQ ID NO: 10 is indicated in one letter


code and followed by its position in SEQ ID NO: 10 and the


substitute amino acid in one letter code.









Sequence




identifier
Substitution(s)
Amino acid sequence





SEQ ID
R22N
KFTIVFPQSQKGDWKDVPPNYNYCPSSADQNWHGDLLGVNIRAKMPKV


NO: 61

HKAIKADGWMCHAAKWVTTCDYRWYGPQYITHSIHSFIPTKAQCEESIK




QTKEGVWINPGFPPKNCGYASVSDAESIIVQATAHSVMIDEYSGDWLDSQ




FPTGRCTGSTCETIHNSTLWYADYQVTGLCDSALVSTEVTFYSEDGLMT




SIGRQNTGYRSNYFPYEKGAAACRMKYCTHEGIRLPSGVWFEMVDKELL




ESVQMPECPAGLTISAPTQTSVDVSLILDVERMLDYSLCQETWSKVHSGL




PISPVDLGYIAPKNPGAGPAFTIVNGTLKYFDTRYLRIDIEGPVLKKMTGK




VSGTPTKRELWTEWFPYDDVEIGPNGVLKTPEGYKFPLYMIGHGLLDSD




LQKTSQAEVFHHPQIAEAVQKLPDDETLFFGDTGISKNPVEVIEGWFSN




WRSSVMAIVFAILLLVITVLMVRLCVAFRHFCCQKRHKIYNDLEMNQLRR





SEQ ID
T422I
KFTIVFPQSQKGDWKDVPPNYRYCPSSADQNWHGDLLGVNIRAKMPKV


NO: 62

HKAIKADGWMCHAAKWVTTCDYRWYGPQYITHSIHSFIPTKAQCEESIK




QTKEGVWINPGFPPKNCGYASVSDAESIIVQATAHSVMIDEYSGDWLDSQ




FPTGRCTGSTCETIHNSTLWYADYQVTGLCDSALVSTEVTFYSEDGLMT




SIGRQNTGYRSNYFPYEKGAAACRMKYCTHEGIRLPSGVWFEMVDKELL




ESVQMPECPAGLTISAPTQTSVDVSLILDVERMLDYSLCQETWSKVHSGL




PISPVDLGYIAPKNPGAGPAFTIVNGTLKYFDTRYLRIDIEGPVLKKMTGK




VSGTPTKRELWTEWFPYDDVEIGPNGVLKTPEGYKFPLYMIGHGLLDSD




LQKTSQAEVFHHPQIAEAVQKLPDDEILFFGDTGISKNPVEVIEGWFSNW




RSSVMAIVFAILLLVITVLMVRLCVAFRHFCCQKRHKIYNDLEMNQLRR





SEQ ID
T422L
KFTIVFPQSQKGDWKDVPPNYRYCPSSADQNWHGDLLGVNIRAKMPKV


NO: 63

HKAIKADGWMCHAAKWVTTCDYRWYGPQYITHSIHSFIPTKAQCEESIK




QTKEGVWINPGFPPKNCGYASVSDAESIIVQATAHSVMIDEYSGDWLDSQ




FPTGRCTGSTCETIHNSTLWYADYQVTGLCDSALVSTEVTFYSEDGLMT




SIGRQNTGYRSNYFPYEKGAAACRMKYCTHEGIRLPSGVWFEMVDKELL




ESVQMPECPAGLTISAPTQTSVDVSLILDVERMLDYSLCQETWSKVHSGL




PISPVDLGYIAPKNPGAGPAFTIVNGTLKYFDTRYLRIDIEGPVLKKMTGK




VSGTPTKRELWTEWFPYDDVEIGPNGVLKTPEGYKFPLYMIGHGLLDSD




LQKTSQAEVFHHPQIAEAVQKLPDDELLFFGDTGISKNPVEVIEGWFSN




WRSSVMAIVFAILLLVITVLMVRLCVAFRHFCCQKRHKIYNDLEMNQLRR





SEQ ID
T422F
KFTIVFPQSQKGDWKDVPPNYRYCPSSADQNWHGDLLGVNIRAKMPKV


NO: 64

HKAIKADGWMCHAAKWVTTCDYRWYGPQYITHSIHSFIPTKAQCEESIK




QTKEGVWINPGFPPKNCGYASVSDAESIIVQATAHSVMIDEYSGDWLDSQ




FPTGRCTGSTCETIHNSTLWYADYQVTGLCDSALVSTEVTFYSEDGLMT




SIGRQNTGYRSNYFPYEKGAAACRMKYCTHEGIRLPSGVWFEMVDKELL




ESVQMPECPAGLTISAPTQTSVDVSLILDVERMLDYSLCQETWSKVHSGL




PISPVDLGYIAPKNPGAGPAFTIVNGTLKYFDTRYLRIDIEGPVLKKMTGK




VSGTPTKRELWTEWFPYDDVEIGPNGVLKTPEGYKFPLYMIGHGLLDSD




LQKTSQAEVFHHPQIAEAVQKLPDDEFLFFGDTGISKNPVEVIEGWFSN




WRSSVMAIVFAILLLVITVLMVRLCVAFRHFCCQKRHKIYNDLEMNQLRR





SEQ ID
T422M
KFTIVFPQSQKGDWKDVPPNYRYCPSSADQNWHGDLLGVNIRAKMPKV


NO: 65

HKAIKADGWMCHAAKWVTTCDYRWYGPQYITHSIHSFIPTKAQCEESIK




QTKEGVWINPGFPPKNCGYASVSDAESIIVQATAHSVMIDEYSGDWLDSQ




FPTGRCTGSTCETIHNSTLWYADYQVTGLCDSALVSTEVTFYSEDGLMT




SIGRQNTGYRSNYFPYEKGAAACRMKYCTHEGIRLPSGVWFEMVDKELL




ESVQMPECPAGLTISAPTQTSVDVSLILDVERMLDYSLCQETWSKVHSGL




PISPVDLGYIAPKNPGAGPAFTIVNGTLKYFDTRYLRIDIEGPVLKKMTGK




VSGTPTKRELWTEWFPYDDVEIGPNGVLKTPEGYKFPLYMIGHGLLDSD




LQKTSQAEVFHHPQIAEAVQKLPDDEMLFFGDTGISKNPVEVIEGWFSN




WRSSVMAIVFAILLLVITVLMVRLCVAFRHFCCQKRHKIYNDLEMNQLRR





SEQ ID
T422V
KFTIVFPQSQKGDWKDVPPNYRYCPSSADQNWHGDLLGVNIRAKMPKV


NO: 66

HKAIKADGWMCHAAKWVTTCDYRWYGPQYITHSIHSFIPTKAQCEESIK




QTKEGVWINPGFPPKNCGYASVSDAESIIVQATAHSVMIDEYSGDWLDSQ




FPTGRCTGSTCETIHNSTLWYADYQVTGLCDSALVSTEVTFYSEDGLMT




SIGRQNTGYRSNYFPYEKGAAACRMKYCTHEGIRLPSGVWFEMVDKELL




ESVQMPECPAGLTISAPTQTSVDVSLILDVERMLDYSLCQETWSKVHSGL




PISPVDLGYIAPKNPGAGPAFTIVNGTLKYFDTRYLRIDIEGPVLKKMTGK




VSGTPTKRELWTEWFPYDDVEIGPNGVLKTPEGYKFPLYMIGHGLLDSD




LQKTSQAEVFHHPQIAEAVQKLPDDEVLFFGDTGISKNPVEVIEGWFSN




WRSSVMAIVFAILLLVITVLMVRLCVAFRHFCCQKRHKIYNDLEMNQLRR





SEQ ID
R22N and
KFTIVFPQSQKGDWKDVPPNYNYCPSSADQNWHGDLLGVNIRAKMPKV


NO: 67
T422I
HKAIKADGWMCHAAKWVTTCDYRWYGPQYITHSIHSFIPTKAQCEESIK




QTKEGVWINPGFPPKNCGYASVSDAESIIVQATAHSVMIDEYSGDWLDSQ




FPTGRCTGSTCETIHNSTLWYADYQVTGLCDSALVSTEVTFYSEDGLMT




SIGRQNTGYRSNYFPYEKGAAACRMKYCTHEGIRLPSGVWFEMVDKELL




ESVQMPECPAGLTISAPTQTSVDVSLILDVERMLDYSLCQETWSKVHSGL




PISPVDLGYIAPKNPGAGPAFTIVNGTLKYFDTRYLRIDIEGPVLKKMTGK




VSGTPTKRELWTEWFPYDDVEIGPNGVLKTPEGYKFPLYMIGHGLLDSD




LQKTSQAEVFHHPQIAEAVQKLPDDEILFFGDTGISKNPVEVIEGWFSNW




RSSVMAIVFAILLLVITVLMVRLCVAFRHFCCQKRHKIYNDLEMNQLRR





SEQ ID
R22N and
KFTIVFPQSQKGDWKDVPPNYNYCPSSADQNWHGDLLGVNIRAKMPKV


NO: 68
T422L
HKAIKADGWMCHAAKWVTTCDYRWYGPQYITHSIHSFIPTKAQCEESIK




QTKEGVWINPGFPPKNCGYASVSDAESIIVQATAHSVMIDEYSGDWLDSQ




FPTGRCTGSTCETIHNSTLWYADYQVTGLCDSALVSTEVTFYSEDGLMT




SIGRQNTGYRSNYFPYEKGAAACRMKYCTHEGIRLPSGVWFEMVDKELL




ESVQMPECPAGLTISAPTQTSVDVSLILDVERMLDYSLCQETWSKVHSGL




PISPVDLGYIAPKNPGAGPAFTIVNGTLKYFDTRYLRIDIEGPVLKKMTGK




VSGTPTKRELWTEWFPYDDVEIGPNGVLKTPEGYKFPLYMIGHGLLDSD




LQKTSQAEVFHHPQIAEAVQKLPDDELLFFGDTGISKNPVEVIEGWFSN




WRSSVMAIVFAILLLVITVLMVRLCVAFRHFCCQKRHKIYNDLEMNQLRR





SEQ ID
R22N and
KFTIVFPQSQKGDWKDVPPNYNYCPSSADQNWHGDLLGVNIRAKMPKV


NO: 69
T422F
HKAIKADGWMCHAAKWVTTCDYRWYGPQYITHSIHSFIPTKAQCEESIK




QTKEGVWINPGFPPKNCGYASVSDAESIIVQATAHSVMIDEYSGDWLDSQ




FPTGRCTGSTCETIHNSTLWYADYQVTGLCDSALVSTEVTFYSEDGLMT




SIGRQNTGYRSNYFPYEKGAAACRMKYCTHEGIRLPSGVWFEMVDKELL




ESVQMPECPAGLTISAPTQTSVDVSLILDVERMLDYSLCQETWSKVHSGL




PISPVDLGYIAPKNPGAGPAFTIVNGTLKYFDTRYLRIDIEGPVLKKMTGK




VSGTPTKRELWTEWFPYDDVEIGPNGVLKTPEGYKFPLYMIGHGLLDSD




LQKTSQAEVFHHPQIAEAVQKLPDDEFLFFGDTGISKNPVEVIEGWFSN




WRSSVMAIVFAILLLVITVLMVRLCVAFRHFCCQKRHKIYNDLEMNQLRR





SEQ ID
R22N and
KFTIVFPQSQKGDWKDVPPNYNYCPSSADQNWHGDLLGVNIRAKMPKV


NO: 70
T422M
HKAIKADGWMCHAAKWVTTCDYRWYGPQYITHSIHSFIPTKAQCEESIK




QTKEGVWINPGFPPKNCGYASVSDAESIIVQATAHSVMIDEYSGDWLDSQ




FPTGRCTGSTCETIHNSTLWYADYQVTGLCDSALVSTEVTFYSEDGLMT




SIGRQNTGYRSNYFPYEKGAAACRMKYCTHEGIRLPSGVWFEMVDKELL




ESVQMPECPAGLTISAPTQTSVDVSLILDVERMLDYSLCQETWSKVHSGL




PISPVDLGYIAPKNPGAGPAFTIVNGTLKYFDTRYLRIDIEGPVLKKMTGK




VSGTPTKRELWTEWFPYDDVEIGPNGVLKTPEGYKFPLYMIGHGLLDSD




LQKTSQAEVFHHPQIAEAVQKLPDDEMLFFGDTGISKNPVEVIEGWFSN




WRSSVMAIVFAILLLVITVLMVRLCVAFRHFCCQKRHKIYNDLEMNQLRR





SEQ ID
R22N and
KFTIVFPQSQKGDWKDVPPNYNYCPSSADQNWHGDLLGVNIRAKMPKV


NO: 71
T422V
HKAIKADGWMCHAAKWVTTCDYRWYGPQYITHSIHSFIPTKAQCEESIK




QTKEGVWINPGFPPKNCGYASVSDAESIIVQATAHSVMIDEYSGDWLDSQ




FPTGRCTGSTCETIHNSTLWYADYQVTGLCDSALVSTEVTFYSEDGLMT




SIGRQNTGYRSNYFPYEKGAAACRMKYCTHEGIRLPSGVWFEMVDKELL




ESVQMPECPAGLTISAPTQTSVDVSLILDVERMLDYSLCQETWSKVHSGL




PISPVDLGYIAPKNPGAGPAFTIVNGTLKYFDTRYLRIDIEGPVLKKMTGK




VSGTPTKRELWTEWFPYDDVEIGPNGVLKTPEGYKFPLYMIGHGLLDSD




LQKTSQAEVFHHPQIAEAVQKLPDDEVLFFGDTGISKNPVEVIEGWFSN




WRSSVMAIVFAILLLVITVLMVRLCVAFRHFCCQKRHKIYNDLEMNQLRR









Other preferred mutant polypeptides according to the invention are based on the amino acid sequence of the ectodomain of glycoprotein G of VSV New Jersey strain, and may be selected from:

    • a) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:72 corresponding to SEQ ID NO: 12 with a substitution of the valine (V) amino acid residue in position 429 by an isoleucine (I) amino acid residue;
    • b) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:73 corresponding to SEQ ID NO: 12 with a substitution of the valine (V) amino acid residue in position 429 by a leucine (L) amino acid residue;
    • c) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:74 corresponding to SEQ ID NO: 12 with a substitution of the valine (V) amino acid residue in position 429 by a phenylalanine (F) amino acid residue; and
    • d) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:75 corresponding to SEQ ID NO: 12 with a substitution of the valine (V) amino acid residue in position 429 by a methionine (M) amino acid residue; and
    • e) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as defined in any one of a) to d) above, and further comprising a signal peptide in N-terminal, preferably the signal peptide corresponding to positions 1-16 of SEQ ID NO:11.


The amino acid sequences SEQ ID NO:72 to 75 of preferred mutant polypeptide derived from VSV New Jersey glycoprotein G ectodomain are presented in Table 8 below.









TABLE 8







Amino acid sequences SEQ ID NO: 72 to 75 based on New Jersey VSV


strain glycoprotein G ectodomain (SEQ ID NO: 12) and associated


substitution compared to SEQ ID NO: 12. For each substitution,


the initial amino acid in SEQ ID NO:12 is indicated in one


letter code and followed by its position in SEQ ID NO: 12


and the substitute amino acid in one letter code.









Sequence




identifier
Substitution(s)
Amino acid sequence





SEQ ID
V429I
KIEIVFPQHTTGDWKRVPHEYNYCPTSADKNSHGTQTGIPVELTMPKGL


NO: 72

TTHQVDGFMCHSALWMTTCDFRWYGPKYITHSIHNEEPTDYQCLEAIKA




YKDGVSFNPGFPPQSCGYGTVTDAEAHIVTVTPHSVKVDEYTGEWIDPH




FIGGRCKGQICETVHNSTKWFTSSDGESVCSQLFTLVGGTFFSDSEEITS




MGLPETGIRSNYFPYVSTEGICKMPFCRKPGYKLKNDLWFQITDPDLDKT




VRDLPHIKDCDLSSSIVTPGEHATDISLISDVERILDYALCQNTWSKIEAGE




PITPVDLSYLGPKNPGAGPVFTIINGSLHYFMSKYLRVELESPVIPRMEGK




VAGTRIVRQLWDQWFPFGEVEIGPNGVLKTKQGYKFPLHIIGTGEVDND




IKMERIVKHWEHPHIEAAQTFLKKDDTEEILYYGDTGVSKNPVELVEGWF




SGWRSSIMGVLAVIIGFVILIFLIRLIGVLSSLFRQKRRPIYKSDVEMAHFR





SEQ ID
V429L
KIEIVFPQHTTGDWKRVPHEYNYCPTSADKNSHGTQTGIPVELTMPKGL


NO: 73

TTHQVDGFMCHSALWMTTCDFRWYGPKYITHSIHNEEPTDYQCLEAIKA




YKDGVSFNPGFPPQSCGYGTVTDAEAHIVTVTPHSVKVDEYTGEWIDPH




FIGGRCKGQICETVHNSTKWFTSSDGESVCSQLFTLVGGTFFSDSEEITS




MGLPETGIRSNYFPYVSTEGICKMPFCRKPGYKLKNDLWFQITDPDLDKT




VRDLPHIKDCDLSSSIVTPGEHATDISLISDVERILDYALCQNTWSKIEAGE




PITPVDLSYLGPKNPGAGPVFTIINGSLHYFMSKYLRVELESPVIPRMEGK




VAGTRIVRQLWDQWFPFGEVEIGPNGVLKTKQGYKFPLHIIGTGEVDND




IKMERIVKHWEHPHIEAAQTFLKKDDTEEVLYYGDTGVSKNPVELVEGW




FSGWRSSIMGVLAVIIGFVILIFLIRLIGVLSSLFRQKRRPIYKSDVEMAHFR





SEQ ID
V429F
KIEIVFPQHTTGDWKRVPHEYNYCPTSADKNSHGTQTGIPVELTMPKGL


NO: 74

TTHQVDGFMCHSALWMTTCDFRWYGPKYITHSIHNEEPTDYQCLEAIKA




YKDGVSFNPGFPPQSCGYGTVTDAEAHIVTVTPHSVKVDEYTGEWIDPH




FIGGRCKGQICETVHNSTKWFTSSDGESVCSQLFTLVGGTFFSDSEEITS




MGLPETGIRSNYFPYVSTEGICKMPFCRKPGYKLKNDLWFQITDPDLDKT




VRDLPHIKDCDLSSSIVTPGEHATDISLISDVERILDYALCQNTWSKIEAGE




PITPVDLSYLGPKNPGAGPVFTIINGSLHYFMSKYLRVELESPVIPRMEGK




VAGTRIVRQLWDQWFPFGEVEIGPNGVLKTKQGYKFPLHIIGTGEVDND




IKMERIVKHWEHPHIEAAQTFLKKDDTEEFLYYGDTGVSKNPVELVEGW




FSGWRSSIMGVLAVIIGFVILIFLIRLIGVLSSLFRQKRRPIYKSDVEMAHFR





SEQ ID
V429M
KIEIVFPQHTTGDWKRVPHEYNYCPTSADKNSHGTQTGIPVELTMPKGL


NO: 75

TTHQVDGFMCHSALWMTTCDFRWYGPKYITHSIHNEEPTDYQCLEAIKA




YKDGVSFNPGFPPQSCGYGTVTDAEAHIVTVTPHSVKVDEYTGEWIDPH




FIGGRCKGQICETVHNSTKWFTSSDGESVCSQLFTLVGGTFFSDSEEITS




MGLPETGIRSNYFPYVSTEGICKMPFCRKPGYKLKNDLWFQITDPDLDKT




VRDLPHIKDCDLSSSIVTPGEHATDISLISDVERILDYALCQNTWSKIEAGE




PITPVDLSYLGPKNPGAGPVFTIINGSLHYFMSKYLRVELESPVIPRMEGK




VAGTRIVRQLWDQWFPFGEVEIGPNGVLKTKQGYKFPLHIIGTGEVDND




IKMERIVKHWEHPHIEAAQTFLKKDDTEEMLYYGDTGVSKNPVELVEGW




FSGWRSSIMGVLAVIIGFVILIFLIRLIGVLSSLFRQKRRPIYKSDVEMAHFR









Other preferred mutant polypeptides according to the invention are based on the amino acid sequence of the ectodomain of glycoprotein G of VSV Carajas strain, and may be selected from:

    • a) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:76 corresponding to SEQ ID NO: 14 with a substitution of the valine (V) amino acid residue in position 426 by an isoleucine (I) amino acid residue;
    • b) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:77 corresponding to SEQ ID NO: 14 with a substitution of the valine (V) amino acid residue in position 426 by a leucine (L) amino acid residue;
    • c) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:78 corresponding to SEQ ID NO: 14 with a substitution of the valine (V) amino acid residue in position 426 by a phenylalanine (F) amino acid residue;
    • d) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as set forth in SEQ ID NO:79 corresponding to SEQ ID NO: 14 with a substitution of the valine (V) amino acid residue in position 426 by a methionine (M) amino acid residue; and
    • e) a mutant polypeptide comprising, or consisting essentially of, or consisting of, the amino acid sequence as defined in any one of a) to d) above, and further comprising a signal peptide in N-terminal, preferably the signal peptide corresponding to positions 1-21 of SEQ ID NO:13.


The amino acid sequences SEQ ID NO:76 to 79 of preferred mutant polypeptide derived from VSV Carajas glycoprotein G ectodomain are presented in Table 9 below.









TABLE 9







Amino acid sequences SEQ ID NO: 76 to 79 based on Carajas VSV strain


glycoprotein G ectodomain (SEQ ID NO: 14) and associated substitution


compared to SEQ ID NO: 14. For each substitution, the initial amino


acid in SEQ ID NO: 14 is indicated in one letter code and followed by


its position in SEQ ID NO: 14 and the substitute amino acid in


one letter code.









Sequence




identifier
Substitution(s)
Amino acid sequence





SEQ ID
V426I
KITISFPQSLKGDWRPVPKGYNYCPTSADKNLHGDLIDIGLRLRAPKSFKG


NO: 76

ISADGWMCHAARWITTCDFRWYGPKYITHSIHSFRPSNDQCKEAIRLTNE




GNWINPGFPPQSCGYASVTDSESVVVTVTKHQVLVDEYSGSWIDSQFPG




GSCTSPICDTVHNSTLWHADHTLDSICDQEFVAMDAVLFTESGKFEEFGK




PNSGIRSNYFPYESLKDVCQMDFCKRKGFKLPSGVWFEIEDAEKSHKAQV




ELKIKRCPHGAVISAPNQNAADINLIMDVERILDYSLCQATWSKIQNKEAL




TPIDISYLGPKNPGPGPAFTIINGTLHYFNTRYIRVDIAGPVTKEITGFVSG




TSTSRVLWDQWFPYGENSIGPNGLLKTASGYKYPLFMVGTGVLDADIHK




LGEATVIEHPHAKEAQKVVDDSEIIFFGDTGVSKNPVEVVEGWFSGWRSS




LMSIFGIILLIVCLVLIVRILIALKYCCVRHKKRTIYKEDLEMGRIPRRA





SEQ ID
V426L
KITISFPQSLKGDWRPVPKGYNYCPTSADKNLHGDLIDIGLRLRAPKSFKG


NO: 77

ISADGWMCHAARWITTCDFRWYGPKYITHSIHSFRPSNDQCKEAIRLTNE




GNWINPGFPPQSCGYASVTDSESVVVTVTKHQVLVDEYSGSWIDSQFPG




GSCTSPICDTVHNSTLWHADHTLDSICDQEFVAMDAVLFTESGKFEEFGK




PNSGIRSNYFPYESLKDVCQMDFCKRKGFKLPSGVWFEIEDAEKSHKAQV




ELKIKRCPHGAVISAPNQNAADINLIMDVERILDYSLCQATWSKIQNKEAL




TPIDISYLGPKNPGPGPAFTIINGTLHYFNTRYIRVDIAGPVTKEITGFVSG




TSTSRVLWDQWFPYGENSIGPNGLLKTASGYKYPLFMVGTGVLDADIHK




LGEATVIEHPHAKEAQKVVDDSEILFFGDTGVSKNPVEVVEGWFSGWRS




SLMSIFGIILLIVCLVLIVRILIALKYCCVRHKKRTIYKEDLEMGRIPRRA





SEQ ID
V426F
KITISFPQSLKGDWRPVPKGYNYCPTSADKNLHGDLIDIGLRLRAPKSFKG


NO: 78

ISADGWMCHAARWITTCDFRWYGPKYITHSIHSFRPSNDQCKEAIRLTNE




GNWINPGFPPQSCGYASVTDSESVVVTVTKHQVLVDEYSGSWIDSQFPG




GSCTSPICDTVHNSTLWHADHTLDSICDQEFVAMDAVLFTESGKFEEFGK




PNSGIRSNYFPYESLKDVCQMDFCKRKGFKLPSGVWFEIEDAEKSHKAQV




ELKIKRCPHGAVISAPNQNAADINLIMDVERILDYSLCQATWSKIQNKEAL




TPIDISYLGPKNPGPGPAFTIINGTLHYFNTRYIRVDIAGPVTKEITGFVSG




TSTSRVLWDQWFPYGENSIGPNGLLKTASGYKYPLFMVGTGVLDADIHK




LGEATVIEHPHAKEAQKVVDDSEIVFFGDTGVSKNPVEVVEGWFSGWRS




SLMSIFGIILLIVCLVLIVRILIALKYCCVRHKKRTIYKEDLEMGRIPRRA





SEQ ID
V426M
KITISFPQSLKGDWRPVPKGYNYCPTSADKNLHGDLIDIGLRLRAPKSFKG


NO: 79

ISADGWMCHAARWITTCDFRWYGPKYITHSIHSFRPSNDQCKEAIRLTNE




GNWINPGFPPQSCGYASVTDSESVVVTVTKHQVLVDEYSGSWIDSQFPG




GSCTSPICDTVHNSTLWHADHTLDSICDQEFVAMDAVLFTESGKFEEFGK




PNSGIRSNYFPYESLKDVCQMDFCKRKGFKLPSGVWFEIEDAEKSHKAQV




ELKIKRCPHGAVISAPNQNAADINLIMDVERILDYSLCQATWSKIQNKEAL




TPIDISYLGPKNPGPGPAFTIINGTLHYFNTRYIRVDIAGPVTKEITGFVSG




TSTSRVLWDQWFPYGENSIGPNGLLKTASGYKYPLFMVGTGVLDADIHK




LGEATVIEHPHAKEAQKVVDDSEIMFFGDTGVSKNPVEVVEGWFSGWRS




SLMSIFGIILLIVCLVLIVRILIALKYCCVRHKKRTIYKEDLEMGRIPRRA









Thus, the mutant polypeptide of the invention advantageously comprises, or consists essentially of, or consists of, an amino acid sequence selected from the group consisting of SEQ ID NO: 17-79, preferably selected from the group consisting of SEQ ID NO: 17-72 and SEQ ID NO: 74-79.


Further Optional Mutations


The mutant polypeptide according to the invention is derived from the amino acid sequence of the ectodomain of glycoprotein G of a VSV strain, and thus comprises, or consists essentially of, or consists of, an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least 50% of identity (or any other minimal % of identity disclosed above) with the amino acid sequence as set forth in SEQ ID NO: 2, with at least one substitution of an amino acid residue as defined above.


The mutant polypeptide according to the invention may further contain one or more (such as 1, 2, or more) additional substitutions at additional positions of SEQ ID NO:2, or equivalent positions after optimal global alignment with SEQ ID NO:2.


When it is desired not to alter the function of the ectodomain, the possible one or more additional substitutions will preferably be conservative, meaning that the substitute amino acid has a structure that is similar to that of the original amino acid and is therefore unlikely to change the biological activity of the polypeptide. Examples of such conservative substitutions are presented in Table 10 below:









TABLE 10







Examples of conservative substitutions.










Original amino acid
Conservative substitution(s)







Ala (A)
Val, Gly, Pro



Arg (R)
Lys, His



Asn (N)
Gln



Asp (D)
Glu



Cys (C)
Ser



Gln (Q)
Asn



Glu (G)
Asp



Gly (G)
Ala



His (H)
Arg



Ile (I)
Leu



Leu (L)
Ile, Val, Met



Lys (K)
Arg



Met (M)
Leu



Phe (F)
Tyr



Pro (P)
Ala



Ser (S)
Thr, Cys



Thr (T)
Ser



Trp (W)
Tyr



Tyr (Y)
Phe, Trp



Val (V)
Leu, Ala










In addition, as explained above, the alignment presented in FIG. 4 shows that positions 1, 4, 6-7, 12, 14, 17-18, 21, 23-25, 27, 29, 31, 33, 46-47, 55-56, 58-60, 64, 66-69, 71-75, 77, 79, 81-82, 87, 91-92, 96, 102, 107-111, 114-116, 119, 121, 127, 130, 132, 134, 137-139, 141, 143, 145, 151, 153, 158, 160, 162-165, 167, 177, 199, 204, 206-208, 210, 212, 219, 221, 224, 228, 231, 236-237, 241, 253, 259, 262, 268, 271-272, 274-277, 279-281, 283-285, 287-290, 299, 301, 304, 307-311, 313-314, 316-318, 320-321, 323, 325-326, 330, 332, 338, 345, 349, 354, 356-357, 360, 362, 368-372, 374, 376, 379, 381, 383-384, 388, 390, 393, 395, 407-408, 413, 421, 426-429, 431-434, 436, 439-443, 445, 447, 456, 467, 487, and 489-490 of SEQ ID NO:2 are conserved in the amino acid sequences of the ectodomain of glycoprotein G of VSV strains Indiana, Maraba, Cocal, Morreton, Alagoa, New Jersey, and Carajas. Conserved amino acid positions are often associated with conserved function, and the mutant polypeptide according to the invention comprising an amino acid sequence having at least 50% of identity (or any other minimal % of identity disclosed above) with the amino acid sequence as set forth in SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2, preferably has the same amino acids as SEQ ID NO: 2 in one or more of the above defined conserved positions, such as in all the above defined conserved positions when it is desired not to alter the function of the ectodomain.


However, in some cases, it may be desired to alter the function of the ectodomain, for instance in order to alter the ability of the ectodomain to bind to the low-density lipoprotein receptor (LDL-R), as described in the application filed under number PCT/EP2018/075824 and in Nikolic et al. Nat Commun. 2018 Mar. 12; 9(1):1029. This permits to alter the tropism of a VSV expressing an ectodomain unable to interact with LDL membrane receptor.


Therefore, in an embodiment of the present invention, the mutant polypeptide according to the invention further comprises at least one substitution of an amino acid residue selected from the group consisting of amino acid residues located at positions 8, 47, 209 and 354 of SEQ ID NO:2 or amino acid residues located at positions equivalent to positions 8, 47, 209 and 354 of SEQ ID NO:2 after optimal global alignment with SEQ ID NO:2.


For substitution in position 8 or equivalent position after optimal global alignment with SEQ ID NO:2, the original amino acid is preferably substituted by any amino acid except histidine (H), glutamine (Q) or tyrosine (Y), more preferably the original amino acid is substituted by alanine (A).


For substitution in position 47 of SEQ ID NO:2 or equivalent position after optimal global alignment with SEQ ID NO:2, the original amino acid is preferably substituted by any amino acid except lysine (K) or arginine (R), more preferably the original amino acid is substituted by alanine (A) or glutamine (Q).


For substitution in position 209 or equivalent position after optimal global alignment with SEQ ID NO:2, the original amino acid is preferably substituted by any amino acid except tyrosine (Y) or histidine (H), more preferably the original amino acid is substituted by alanine (A).


For substitution in position 354 or equivalent position after optimal global alignment with SEQ ID NO:2, the original amino acid is preferably substituted by any amino acid except lysine (K) or arginine (R), more preferably the original amino acid is substituted by alanine (A) or glutamine (Q).


Positions 47 and 354 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2 are preferred for obtaining an ectodomain unable to interact with LDL membrane receptor. When it is desired that the mutant polypeptide according to the invention is unable to interact with LDL membrane receptor; the mutant polypeptide further thus preferably comprises a substitution at position 47 of SEQ ID NO:2 or equivalent position after optimal global alignment with SEQ ID NO:2, a substitution at position 354 of SEQ ID NO:2 or equivalent position after optimal global alignment with SEQ ID NO:2, or two substitutions at positions 47 and 354 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2. More preferably, the substitutions at position(s) 47 and/or 354 are those defined as preferred above.


Fusion Polypeptide


The present invention also relates to a fusion polypeptide comprising, or consisting essentially of, or consisting of, the mutant polypeptide of the invention, and an additional peptide, polypeptide or protein, wherein said additional peptide, polypeptide or protein is inserted:

    • a) in N-terminal of the mutant polypeptide,
    • b) between any two consecutive amino acids located in regions corresponding to positions 192 to 202 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2,
    • c) between any two consecutive amino acids located in regions corresponding to positions 240 to 257 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2,
    • d) between any two consecutive amino acids located in regions corresponding to positions 347 to 353 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2,
    • e) between any two consecutive amino acids located in regions corresponding to positions 364 to 366 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2,
    • f) between any two consecutive amino acids located in regions corresponding to positions 376 to 379 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2,


preferably said additional peptide, polypeptide or protein is inserted in N-terminal of the mutant polypeptide or between the amino acids at positions 351 and 352 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2.


The additional peptide, polypeptide or protein may have various functions, but may notably be used in order to target a recombinant VSV virus to specific target cells. In this case, any additional peptide, polypeptide or protein that is able to specifically bind to an antigen expressed at the extracellular surface of the target cells may be used. In particular, the additional peptide, polypeptide or protein may comprise at least part of a ligand of a cellular receptor.


When the target cell expresses at its surface one of the components of a (receptor-ligand) couple (for instance: EGF-EGFR, TNFa-TNFaR, PD1-PDL1 or PDL2, etc. . . . ) (which will happen often, since molecules expressed at the cell surface generally have a cognate receptor or ligand), then at least part (provided that it is still able to bind its cognate receptor or ligand) of the second component of the (receptor-ligand) couple (i.e. the natural receptor or ligand of the targeted cell surface antigen) may be used.


Alternatively, an antibody, a functional antibody fragment.


By “antibody” or “immunoglobulin” is meant a molecule comprising at least one binding domain for a given antigen and a constant domain comprising an Fc fragment capable of binding to Fc receptors (FcR). In most mammals, like humans and mice, an antibody consists of four polypeptide chains: two heavy chains and two light chains bound together by a variable number of disulfide bridges providing flexibility to the molecule. Each light chain consists of a constant domain (CL) and a variable domain (VL); the heavy chains consisting of a variable domain (VH) and three or four constant domains (CH1 to CH3 or CH1 to CH4) according to the isotype of the antibody. In a few rare mammals, such as camels and llamas, the antibodies consist of only two heavy chains, each heavy chain comprising a variable domain (VH) and a constant region.


The variable domains are involved in antigen recognition, while the constant domains are involved in the biological, pharmacokinetic and effector properties of the antibody.


By “functional antibody fragment” is meant an antibody fragment retaining the antigen-binding domain and thus having the same antigen specificity as the original antibody. In most mammals, such functional antibody fragments comprise the fragments Fv, ScFv, Fab, F(ab′)2, Fab′, and scFv-Fc. In the case of camels and llamas, in which the antibodies consist of only two heavy chains, each heavy chain comprising a variable domain (referred to as VHH) and a constant region, such functional antibody corresponds to the VHH fragment, also known as nanobody.


In the context of the invention, the additional peptide, polypeptide or protein will preferably be a peptide or relatively short polypeptide (preferably less than 200 amino acids), in order not to alter the function of the mutated polypeptide according to the invention. As a result, when antibodies or functional antibody fragments are the selected type of additional peptide, polypeptide or protein, functional antibody fragments will preferably be used. While any suitable functional antibody fragment may be used in the context of the invention, a nanobody (or VHH fragment) will preferably be used, because of its small size, globular shape and robustness.


Depending on the targeted cell, the antigen targeted at the cell surface will vary. For instance, when a recombinant VSV virus expressing the mutant polypeptide according to the invention is used as on oncolytic virus, the mutated polypeptide according to the invention may be fused as described above with an antibody or functional antibody fragment (in particular a nanobody) targeting the G ectodomain to a cancer cell antigen. For instance, an anti-HER2 or anti-MUC18 or anti-EGFR or anti-CD20 or anti-CD52 antibody or functional antibody fragment (in particular a nanobody) may be used. In another context, an antibody or functional antibody fragment (in particular a nanobody) targeting an inhibitory immune checkpoint (such as PD1 or its ligands PD-L1 and PD-L2, or CTLA4) may be used, when it is desired to target immune cells expressing such inhibitory immune checkpoints. Also, an antibody or functional antibody fragment (in particular a nanobody) targeting an inhibitory immune checkpoint (such as PD1 or its ligands PD-L1 and PD-L2, or CTLA4) may be used, when it is desired to target immune cells expressing such inhibitory immune checkpoints. Likewise, an antibody or functional antibody fragment (in particular a nanobody) targeting the B-lymphocyte antigen CD19 may be used, when it is desired to target B cells. An exemplary nanobody directed to GFP has the amino acid sequence SEQ ID NO:80











(VQLVESGGALVQPGGSLRLSCAASGFPVNRYSMRWYRQAPGK







EREWVAGMSSAGDRSSYEDSVKGRFTISRDDARNTVYLQMNSL







KPEDTAVYYCNVNVGFEYWGQGTQVTVSS).






When the additional peptide, polypeptide or protein (preferably a functional antibody fragment such as a nanobody) is inserted in N-terminal of the mutant polypeptide according to the invention, it may be fused directly to the N-terminal extremity of the mutant polypeptide, or separated from the N-terminal extremity of the mutant polypeptide by a linker peptide, preferably it is separated from the N-terminal extremity of the mutant polypeptide by a first linker peptide.


Typically, linkers are 1 to 30 amino acids long peptides composed of amino acid residues such as glycine (G), serine (S), threonine (T), asparagine (N), glutamine (Q), alanine (A) proline (P), and/or phenylalanine (F). Preferred linkers in the context of this invention comprise 2 to 15 amino acids, with a preference for 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or 12 amino acids.


The first linker peptide between the additional peptide, polypeptide or protein (preferably a functional antibody fragment such as a nanobody) and the mutated polypeptide according to the invention is 2 to 15 amino acids long, preferably 4 to 12 or 6 to 10 amino acids long, and notably 8 amino acids long. Alternatively or in combination, its amino acid sequence is preferably constituted of amino acids selected from glycine (G), serine (S), threonine (T), asparagine (N), glutamine (Q), alanine (A), proline (P), and phenylalanine (F), more preferably selected from glycine (G), serine (S), threonine (T), and alanine (A), or from glycine (G), serine (S), threonine (T), or from glycine (G), serine (S), and alanine (A), even more preferably from glycine (G) and serine (S). In a preferred embodiment, the first linker peptide is constituted of 6 to 10 amino acids selected from glycine (G) and serine (S), and may notably be of amino acid sequence GGGGSGGGGS (SEQ ID NO:81).


When the additional peptide, polypeptide or protein (preferably a functional antibody fragment such as a nanobody) is inserted in N-terminal of the mutant polypeptide according to the invention, the fusion polypeptide may further comprise, in N-terminal of the additional peptide, polypeptide or protein (preferably a functional antibody fragment such as a nanobody), a signal peptide in order to address the fusion polypeptide to the virus envelope (when the fusion polypeptide is expressed by a virus) or to the cell membrane (when the fusion polypeptide is expressed by a host cell).


In this case, the fusion polypeptide according to the invention comprises, or consists essentially of, or consists of, from N- to C-terminal: a signal peptide, optionally a second linker peptide, an additional peptide, polypeptide or protein (preferably a functional antibody fragment such as a nanobody), optionally a first linker peptide, and the mutated polypeptide according to the invention. In this fusion polypeptide, the first linker peptide, when present, is as described above. The second linker peptide (between the signal peptide and the additional peptide, polypeptide or protein (preferably a functional antibody fragment such as a nanobody)), when present, is 2 to 15 amino acids long, preferably 2 to 6 amino acids long, 2 to 4 amino acids long, and notably 2 or 3 amino acids long. Alternatively or in combination, its amino acid sequence is preferably constituted of amino acids selected from glycine (G), serine (S), threonine (T), asparagine (N), glutamine (Q), alanine (A), proline (P), phenylalanine (F), more preferably selected from serine (S), threonine (T), asparagine (N), glutamine (Q), and phenylalanine (F), and in particular from serine (S), threonine (T), asparagine (N), and glutamine (Q), or from glutamine (Q) and serine (S), or glutamine (Q) and phenylalanine (F). In a preferred embodiment, the second linker peptide is constituted of 2 to 4 amino acids selected from serine (S), threonine (T), asparagine (N), glutamine (Q), and phenylalanine (F), and in particular from serine (S), threonine (T), asparagine (N), and glutamine (Q), or from glutamine (Q) and serine (S), or glutamine (Q) and phenylalanine (F), and may notably be of amino acid sequence QF or QS.


In a preferred embodiment, the fusion polypeptide according to the invention comprises a first linker peptide or a second linker peptide, and more preferably both a first linker peptide and a second linker peptide.


When the fusion polypeptide according to the invention comprises both a first linker peptide and a second linker peptide:

    • the first linker peptide (between the additional peptide, polypeptide or protein (preferably a functional antibody fragment such as a nanobody) and the mutated polypeptide according to the invention) is preferably constituted of 6 to 10 amino acids selected from glycine (G) and serine (S), and may notably be of amino acid sequence GGGGSGGGGS (SEQ ID NO:81); and
    • the second linker peptide (between the signal peptide and the additional peptide, polypeptide or protein (preferably a functional antibody fragment such as a nanobody)) is preferably constituted of 2 to 4 amino acids selected from serine (S), threonine (T), asparagine (N), glutamine (Q), and phenylalanine (F), and in particular from serine (S), threonine (T), asparagine (N), and glutamine (Q), or from glutamine (Q) and serine (S), or glutamine (Q) and phenylalanine (F), and may notably be of amino acid sequence QF or QS.


In a preferred embodiment, when the fusion polypeptide according to the invention comprises both a first linker peptide and a second linker peptide, the first linker peptide (between the additional peptide, polypeptide or protein (preferably a functional antibody fragment such as a nanobody) and the mutated polypeptide according to the invention) has the amino acid sequence GGGGSGGGGS (SEQ ID NO:81)., and the second linker peptide (between the signal peptide and the additional peptide, polypeptide or protein (preferably a functional antibody fragment such as a nanobody)) has the amino acid sequence QF or QS.


An exemplary preferred fusion polypeptide has the sequence SEQ ID NO:82











(MKCLLYLAFLFIGVNCQSVQLVESGGALVQPGGSLRLSCAAS







GFPVNRYSMRWYRQAPGKEREWVAGMSSAGDRSSYEDSVKGRF







TISRDDARNTVYLQMNSLKPEDTAVYYCNVNVGFEYWGQGTQV







TVSSGGGGSGGGGSKFTIVFPHNQKGNWKNVPSNYNYCPSSSD







LNWHNDLIGTALQVKMPKSHKAIQADGWMCHASKWVTTCDFRW







YGPKYITHSIRSFTPSVEQCKESIEQTKQGTWLNPGFPPQSCG







YATVTDAEAVIVQVTPHHVLVDEYTGEWVDSQFINGKCSNYIC







PTVHNSTTWHSDYKVKGLCDSNLISMDITFFSEDGELSSLGKE







GTGFRSNYFAYETGGKACKMQYCKHWGVRLPSGVWFEMADKDL







FAAARFPECPEGSSISAPSQTSVDVSLIQDVERILDYSLCQET







WSKIRAGLPISPVDLSYLAPKNPGTGPAFTIINGTLKYFETRY







IRVDIAAPILSRMVGMISGTTTERELWDDWAPYEDVEIGPNGV







LRTSSGYKFPLYMIGHGMLDSDLHLSSKAQVFEHPHIQDAASQ







LPDDEILFFGDTGLSKNPIELVEGWFSSWKSSIASFFFIIGLI







IGLFLVLRVGIHLCIKLKHTKKRQIYTDIEMNRLGK).






Nucleic Acid Molecule


The present invention also relates to a nucleic acid molecule encoding the mutant polypeptide or the fusion polypeptide according to the invention.


All the different nucleic sequences encoding a particular amino acid sequence, because of degeneration of the genetic code, are within the scope of the invention.


In particular, the sequence of a nucleic acid according to the invention may be optimized to promote the expression thereof in a host cell or any other production host. Indeed, there are in general several three-nucleotide combinations encoding the same amino acid (except for methionine and tryptophan), called synonymous codons. However, some of these combinations are in general used preferentially by a cell or a given organism (this is referred to as genetic code usage bias). This preference depends notably on the producing organism from which the cell is derived. Consequently, when a protein derived from one or more organisms is produced in a heterologous organism or a cell of such a heterologous organism, it may be useful to modify the nucleic sequence encoding the protein to use mainly the preferred codons of the heterologous organism. Data are available in the literature concerning the use of codons preferred by different species and a person skilled in the art knows how to optimize the expression of a given protein in a heterologous organism or a cell of a heterologous organism.


Preferred nucleic acid molecules include those based on the natural nucleotide sequence encoding the ectodomain of glycoprotein G of VSV Indiana strain (SEQ ID NO:83), mutated at one or more nucleotides in order to generate a substitution as defined above. Preferred nucleic acid molecules include those having a nucleotide sequence selected from SEQ ID NO:83 to SEQ ID NO:86, which are more precisely defined in Table 11 below.









TABLE 11







Preferred nucleotide sequences encoding wild-type (SEQ ID NO: 83)


or mutated (SEQ ID NO: 84: c112a nucleotide substitution encoding


H22N mutant of amino acid sequence SEQ ID NO: 17; SEQ ID NO: 85:


g1313t nucleotide substitution encoding S422I mutant of amino


acid sequence SEQ ID NO: 18; and SEQ ID NO: 86: c112a g1313t


nucleotide substitutions encoding H22N S422I mutant of amino acid


sequence SEQ ID NO: 23) VSV Indiana glycoprotein G ectodomain.










Sequence
VSV




identifier
strain
Mutation
Nucleotide sequence





SEQ ID
Indiana
None
ATGAAGTGCCTTTTGTACTTAGCCTTTTTATTCATTGGGGTGA


NO: 83


ATTGCAAGTTCACCATAGTTTTTCCACACAACCAAAAAGGAAAC





TGGAAAAATGTTCCTTCTAATTACCATTATTGCCCGTCAAGCTC





AGATTTAAATTGGCATAATGACTTAATAGGCACAGCATTACAA





GTCAAAATGCCCAAGAGTCACAAGGCTATTCAAGCAGACGGTT





GGATGTGTCATGCTTCCAAATGGGTCACTACTTGTGATTTCCG





CTGGTATGGACCGAAGTATATAACACATTCCATCCGATCCTTCA





CTCCATCTGTAGAACAATGCAAGGAAAGCATTGAACAAACGAA





ACAAGGAACTTGGCTGAATCCAGGCTTCCCTCCTCAAAGTTGT





GGATATGCAACTGTGACGGATGCCGAAGCAGTGATTGTCCAG





GTGACTCCTCACCATGTGCTGGTTGATGAATACACAGGAGAAT





GGGTTGATTCACAGTTCATCAACGGAAAATGCAGCAATTACAT





ATGCCCCACTGTCCATAACTCTACAACCTGGCATTCTGACTATA





AGGTCAAAGGGCTATGTGATTCTAACCTCATTTCCATGGACAT





CACCTTCTTCTCAGAGGACGGAGAGCTATCATCCCTGGGAAAG





GAGGGCACAGGGTTCAGAAGTAACTACTTTGCTTATGAAACTG





GAGGCAAGGCCTGCAAAATGCAATACTGCAAGCATTGGGGAG





TCAGACTCCCATCAGGTGTCTGGTTCGAGATGGCTGATAAGGA





TCTCTTTGCTGCAGCCAGATTCCCTGAATGCCCAGAAGGGTCA





AGTATCTCTGCTCCATCTCAGACCTCAGTGGATGTAAGTCTAA





TTCAGGACGTTGAGAGGATCTTGGATTATTCCCTCTGCCAAGA





AACCTGGAGCAAAATCAGAGCGGGTCTTCCAATCTCTCCAGTG





GATCTCAGCTATCTTGCTCCTAAAAACCCAGGAACCGGTCCTG





CTTTCACCATAATCAATGGTACCCTAAAATACTTTGAGACCAGA





TACATCAGAGTCGATATTGCTGCTCCAATCCTCTCAAGAATGG





TCGGAATGATCAGTGGAACTACCACAGAAAGGGAACTGTGGG





ATGACTGGGCACCATATGAAGACGTGGAAATTGGACCCAATGG





AGTTCTGAGGACCAGTTCAGGATATAAGTTTCCTTTATACATG





ATTGGACATGGTATGTTGGACTCCGATCTTCATCTTAGCTCAA





AGGCTCAGGTGTTCGAACATCCTCACATTCAAGACGCTGCTTC





GCAACTTCCTGATGATGAGAGTTTATTTTTTGGTGATACTGGG





CTATCCAAAAATCCAATCGAGCTTGTAGAAGGTTGGTTCAGTA





GTTGGAAAAGCTCTATTGCCTCTTTTTTCTTTATCATAGGGTTA





ATCATTGGACTATTCTTGGTTCTCCGAGTTGGTATCCATCTTT





GCATTAAATTAAAGCACACCAAGAAAAGACAGATTTATACAGAC





ATAGAGATGAACCGACTTGGAAAGTAA





SEQ ID
Indiana
c112a
ATGAAGTGCCTTTTGTACTTAGCCTTTTTATTCATTGGGGTGA


NO: 84

(H22N)
ATTGCAAGTTCACCATAGTTTTTCCACACAACCAAAAAGGAAAC





TGGAAAAATGTTCCTTCTAATTACaATTATTGCCCGTCAAGCTC





AGATTTAAATTGGCATAATGACTTAATAGGCACAGCATTACAA





GTCAAAATGCCCAAGAGTCACAAGGCTATTCAAGCAGACGGTT





GGATGTGTCATGCTTCCAAATGGGTCACTACTTGTGATTTCCG





CTGGTATGGACCGAAGTATATAACACATTCCATCCGATCCTTCA





CTCCATCTGTAGAACAATGCAAGGAAAGCATTGAACAAACGAA





ACAAGGAACTTGGCTGAATCCAGGCTTCCCTCCTCAAAGTTGT





GGATATGCAACTGTGACGGATGCCGAAGCAGTGATTGTCCAG





GTGACTCCTCACCATGTGCTGGTTGATGAATACACAGGAGAAT





GGGTTGATTCACAGTTCATCAACGGAAAATGCAGCAATTACAT





ATGCCCCACTGTCCATAACTCTACAACCTGGCATTCTGACTATA





AGGTCAAAGGGCTATGTGATTCTAACCTCATTTCCATGGACAT





CACCTTCTTCTCAGAGGACGGAGAGCTATCATCCCTGGGAAAG





GAGGGCACAGGGTTCAGAAGTAACTACTTTGCTTATGAAACTG





GAGGCAAGGCCTGCAAAATGCAATACTGCAAGCATTGGGGAG





TCAGACTCCCATCAGGTGTCTGGTTCGAGATGGCTGATAAGGA





TCTCTTTGCTGCAGCCAGATTCCCTGAATGCCCAGAAGGGTCA





AGTATCTCTGCTCCATCTCAGACCTCAGTGGATGTAAGTCTAA





TTCAGGACGTTGAGAGGATCTTGGATTATTCCCTCTGCCAAGA





AACCTGGAGCAAAATCAGAGCGGGTCTTCCAATCTCTCCAGTG





GATCTCAGCTATCTTGCTCCTAAAAACCCAGGAACCGGTCCTG





CTTTCACCATAATCAATGGTACCCTAAAATACTTTGAGACCAGA





TACATCAGAGTCGATATTGCTGCTCCAATCCTCTCAAGAATGG





TCGGAATGATCAGTGGAACTACCACAGAAAGGGAACTGTGGG





ATGACTGGGCACCATATGAAGACGTGGAAATTGGACCCAATGG





AGTTCTGAGGACCAGTTCAGGATATAAGTTTCCTTTATACATG





ATTGGACATGGTATGTTGGACTCCGATCTTCATCTTAGCTCAA





AGGCTCAGGTGTTCGAACATCCTCACATTCAAGACGCTGCTTC





GCAACTTCCTGATGATGAGAGTTTATTTTTTGGTGATACTGGG





CTATCCAAAAATCCAATCGAGCTTGTAGAAGGTTGGTTCAGTA





GTTGGAAAAGCTCTATTGCCTCTTTTTTCTTTATCATAGGGTTA





ATCATTGGACTATTCTTGGTTCTCCGAGTTGGTATCCATCTTT





GCATTAAATTAAAGCACACCAAGAAAAGACAGATTTATACAGAC





ATAGAGATGAACCGACTTGGAAAGTAA





SEQ ID
Indiana
g1313t
ATGAAGTGCCTTTTGTACTTAGCCTTTTTATTCATTGGGGTGA


NO: 85

(S422I)
ATTGCAAGTTCACCATAGTTTTTCCACACAACCAAAAAGGAAAC





TGGAAAAATGTTCCTTCTAATTACCATTATTGCCCGTCAAGCTC





AGATTTAAATTGGCATAATGACTTAATAGGCACAGCATTACAA





GTCAAAATGCCCAAGAGTCACAAGGCTATTCAAGCAGACGGTT





GGATGTGTCATGCTTCCAAATGGGTCACTACTTGTGATTTCCG





CTGGTATGGACCGAAGTATATAACACATTCCATCCGATCCTTCA





CTCCATCTGTAGAACAATGCAAGGAAAGCATTGAACAAACGAA





ACAAGGAACTTGGCTGAATCCAGGCTTCCCTCCTCAAAGTTGT





GGATATGCAACTGTGACGGATGCCGAAGCAGTGATTGTCCAG





GTGACTCCTCACCATGTGCTGGTTGATGAATACACAGGAGAAT





GGGTTGATTCACAGTTCATCAACGGAAAATGCAGCAATTACAT





ATGCCCCACTGTCCATAACTCTACAACCTGGCATTCTGACTATA





AGGTCAAAGGGCTATGTGATTCTAACCTCATTTCCATGGACAT





CACCTTCTTCTCAGAGGACGGAGAGCTATCATCCCTGGGAAAG





GAGGGCACAGGGTTCAGAAGTAACTACTTTGCTTATGAAACTG





GAGGCAAGGCCTGCAAAATGCAATACTGCAAGCATTGGGGAG





TCAGACTCCCATCAGGTGTCTGGTTCGAGATGGCTGATAAGGA





TCTCTTTGCTGCAGCCAGATTCCCTGAATGCCCAGAAGGGTCA





AGTATCTCTGCTCCATCTCAGACCTCAGTGGATGTAAGTCTAA





TTCAGGACGTTGAGAGGATCTTGGATTATTCCCTCTGCCAAGA





AACCTGGAGCAAAATCAGAGCGGGTCTTCCAATCTCTCCAGTG





GATCTCAGCTATCTTGCTCCTAAAAACCCAGGAACCGGTCCTG





CTTTCACCATAATCAATGGTACCCTAAAATACTTTGAGACCAGA





TACATCAGAGTCGATATTGCTGCTCCAATCCTCTCAAGAATGG





TCGGAATGATCAGTGGAACTACCACAGAAAGGGAACTGTGGG





ATGACTGGGCACCATATGAAGACGTGGAAATTGGACCCAATGG





AGTTCTGAGGACCAGTTCAGGATATAAGTTTCCTTTATACATG





ATTGGACATGGTATGTTGGACTCCGATCTTCATCTTAGCTCAA





AGGCTCAGGTGTTCGAACATCCTCACATTCAAGACGCTGCTTC





GCAACTTCCTGATGATGAGAtTTTATTTTTTGGTGATACTGGG





CTATCCAAAAATCCAATCGAGCTTGTAGAAGGTTGGTTCAGTA





GTTGGAAAAGCTCTATTGCCTCTTTTTTCTTTATCATAGGGTTA





ATCATTGGACTATTCTTGGTTCTCCGAGTTGGTATCCATCTTT





GCATTAAATTAAAGCACACCAAGAAAAGACAGATTTATACAGAC





ATAGAGATGAACCGACTTGGAAAGTAA





SEQ ID
Indiana
c112a
ATGAAGTGCCTTTTGTACTTAGCCTTTTTATTCATTGGGGTGA


NO: 86

g1313t
ATTGCAAGTTCACCATAGTTTTTCCACACAACCAAAAAGGAAAC




(H22N
TGGAAAAATGTTCCTTCTAATTACaATTATTGCCCGTCAAGCTC




S4221)
AGATTTAAATTGGCATAATGACTTAATAGGCACAGCATTACAA





GTCAAAATGCCCAAGAGTCACAAGGCTATTCAAGCAGACGGTT





GGATGTGTCATGCTTCCAAATGGGTCACTACTTGTGATTTCCG





CTGGTATGGACCGAAGTATATAACACATTCCATCCGATCCTTCA





CTCCATCTGTAGAACAATGCAAGGAAAGCATTGAACAAACGAA





ACAAGGAACTTGGCTGAATCCAGGCTTCCCTCCTCAAAGTTGT





GGATATGCAACTGTGACGGATGCCGAAGCAGTGATTGTCCAG





GTGACTCCTCACCATGTGCTGGTTGATGAATACACAGGAGAAT





GGGTTGATTCACAGTTCATCAACGGAAAATGCAGCAATTACAT





ATGCCCCACTGTCCATAACTCTACAACCTGGCATTCTGACTATA





AGGTCAAAGGGCTATGTGATTCTAACCTCATTTCCATGGACAT





CACCTTCTTCTCAGAGGACGGAGAGCTATCATCCCTGGGAAAG





GAGGGCACAGGGTTCAGAAGTAACTACTTTGCTTATGAAACTG





GAGGCAAGGCCTGCAAAATGCAATACTGCAAGCATTGGGGAG





TCAGACTCCCATCAGGTGTCTGGTTCGAGATGGCTGATAAGGA





TCTCTTTGCTGCAGCCAGATTCCCTGAATGCCCAGAAGGGTCA





AGTATCTCTGCTCCATCTCAGACCTCAGTGGATGTAAGTCTAA





TTCAGGACGTTGAGAGGATCTTGGATTATTCCCTCTGCCAAGA





AACCTGGAGCAAAATCAGAGCGGGTCTTCCAATCTCTCCAGTG





GATCTCAGCTATCTTGCTCCTAAAAACCCAGGAACCGGTCCTG





CTTTCACCATAATCAATGGTACCCTAAAATACTTTGAGACCAGA





TACATCAGAGTCGATATTGCTGCTCCAATCCTCTCAAGAATGG





TCGGAATGATCAGTGGAACTACCACAGAAAGGGAACTGTGGG





ATGACTGGGCACCATATGAAGACGTGGAAATTGGACCCAATGG





AGTTCTGAGGACCAGTTCAGGATATAAGTTTCCTTTATACATG





ATTGGACATGGTATGTTGGACTCCGATCTTCATCTTAGCTCAA





AGGCTCAGGTGTTCGAACATCCTCACATTCAAGACGCTGCTTC





GCAACTTCCTGATGATGAGAtTTTATTTTTTGGTGATACTGGG





CTATCCAAAAATCCAATCGAGCTTGTAGAAGGTTGGTTCAGTA





GTTGGAAAAGCTCTATTGCCTCTTTTTTCTTTATCATAGGGTTA





ATCATTGGACTATTCTTGGTTCTCCGAGTTGGTATCCATCTTT





GCATTAAATTAAAGCACACCAAGAAAAGACAGATTTATACAGAC





ATAGAGATGAACCGACTTGGAAAGTAA









The nucleic acid molecule according to the invention may further contain any modification aimed to improve cloning and/or expression of the encoded mutated polypeptide or fusion polypeptide according to the invention, as well as its folding and stability.


Vector


The present invention also relates to a vector comprising at least one nucleic acid according to the invention or expressing the mutant polypeptide or the fusion polypeptide according to the invention.


Such a vector comprises the elements necessary for the expression of said nucleic sequence, and notably a promoter, a transcription initiation codon, termination sequences, and suitable transcription regulatory sequences. These elements vary according to the host used for the expression and are easily selected by persons skilled in the art based on their general knowledge.


In the context of the invention, the term “vector” has to be understood broadly as including plasmid and viral vectors. Vectors which are appropriate in the context of the present invention, include, without limitation, bacteriophage, plasmid or cosmid vectors for expression in prokaryotic host cells such as bacteria (e.g. E. coli, BCG or Listeria); vectors for expression in yeast (e.g. Saccharomyces cerevisiae, Schizosaccharomyces pombe, Pichia pastoris); baculovirus vectors for expression in insect cell systems (e.g. Sf9 cells); as well as plasmid and viral vectors for expression in higher eukaryotic cells or subjects. Typically, such vectors are commercially available (e.g. in Invitrogen, Stratagene, Amersham Biosciences, Promega, etc.) or available from depositary institutions such as the American Type Culture Collection (ATCC, Rockville, Md.) or have been the subject of numerous publications describing their sequence, organization and methods of producing, allowing the skilled person to apply them.


A “plasmid vector” as used herein refers to a replicable DNA construct. Usually plasmid vectors contain selectable marker genes that allow host cells carrying the plasmid vector to be selected for or against in the presence of a corresponding selective drug. A variety of positive and negative selectable marker genes are known in the art. By way of illustration, an antibiotic resistance gene can be used as a positive selectable marker gene that allows a host cell to be selected in the presence of the corresponding antibiotic. Suitable plasmid vectors are commercially available.


The term “viral vector” as used herein refers to a nucleic acid vector that includes at least one element of a virus genome and may be packaged into a viral particle or to a viral particle. The terms “virus”, “virions”, “viral particles” and “viral vector particle” are used interchangeably to refer to viral particles that are formed when the nucleic acid vector is transduced into an appropriate cell or cell line according to suitable conditions allowing the generation of viral particles. In the context of the present invention, the term “viral vector” has to be understood broadly as including nucleic acid vector (e.g. DNA viral vector) as well as viral particles generated thereof. The term “infectious” refers to the ability of a viral vector to infect and enter into a host cell or subject.


The viral vector may preferably be a Vesicular stomatitis virus (VSV) vector comprising at least one nucleic acid according to the invention or expressing the mutant polypeptide or the fusion polypeptide according to the invention.


VSV is an enveloped, negative-strand RNA virus that belongs to the Vesiculovirus genus of the Rhabdovirus family. VSV negative-strand RNA genome encodes five structural proteins and comprises, from 3′ to 5′: a leader region, the genes for the nucleoprotein (N), the phosphoprotein (P), the matrix protein (M), the glycoprotein (G), the polymerase protein (L), and the trailer region. A mutated VSV vector according to the invention may thus be obtained by replacing the natural wild-type sequence encoding the natural wild-type glycoprotein G by a nucleic acid sequence according to the invention, encoding a mutant polypeptide or a fusion polypeptide according to the invention. Standard techniques for generating mutated VSV vectors may be used for this purpose. In particular, reverse genetic for the recovery of recombinant viruses may be used.


With respect to other regions of the VSV genome (leader, genes encoding the nucleoprotein (N), the phosphoprotein (P), the matrix protein (M), the glycoprotein (G), the polymerase protein (L), and trailer region), the mutated VSV vector may contain any suitable natural or artificial sequence. In particular, sequences of the complete genome of various Indiana strain isolates (Rodriguez et al. J Gen Virol. 2002 October; 83(Pt 10):2475-83) and of New Jersey strain isolates (Velazquez-Salinas et al. Genome Announc. 2017 Sep. 14; 5(37)) are known in the art. In addition, various VSV strains are available at the American Type Culture Collection (ATCC) under references VR-1238, VR-1239, VR-159, and VR-1415 to VR-1421.


In an embodiment, the vector and preferably the VSV vector according to the invention may preferably be oncolytic, meaning that it preferentially infects and kills cancer cells. All strains of VSV previously listed (Indiana, Maraba, Cocal, Morreton, Alagoa, New Jersey, and Carajas) are potentially oncolytic and may thus be used.


Alternatively or in combination, the vector and preferably the VSV vector according to the invention may also be used for gene therapy, immunotherapy and selective delivery of a cargo to a chosen cell type.


When used for gene therapy, the vector and preferably the VSV vector according to the invention further encodes a gene of therapeutic interest, in order to restore the activity of this gene in cells that are deficient for this activity.


When used for immunotherapy, the vector and preferably the VSV vector according to the invention further encodes at least one antigen of interest and/or molecules involved in regulation of immune responses, such as cytokines, adjuvants, immune checkpoint modulators, antibodies targeting cytokines . . . .


When intended for delivery of a cargo to a chosen cell type such as unstimulated T cells, B cells, and hematopoietic cells, the recombinant mutated glycoprotein can be inserted at the surface of VLP containing a therapeutic molecule a marker or any drug.


According to a preferred embodiment, the viral vector according to the invention (in particular the VSV vector) is in the form of infectious viral particles. Typically, such viral particles are produced by a process comprising the steps of (i) introducing the viral vector of the invention into a suitable producer cell, (ii) culturing said producer cell under suitable conditions allowing the production of said infectious viral particle, (iii) recovering the produced viral particles from the culture of said producer cell, and (iv) optionally purifying said recovered viral particle.


Host Cell


The present invention also relates to a host cell containing or expressing the mutant polypeptide or the fusion polypeptide according to the invention, or containing the nucleic acid molecule according to the invention, or containing the vector (in particular a VSV vector) according to the invention.


When the host cell contains the vector (in particular a VSV vector) according to the invention, the host cell is preferably an eukaryotic cell suitable for VSV replication, such as BSR cells (a clone of BHK-21: Baby Hamster Kidney cells; ATCC CCL-10), BHK-21 cells (ATCC CCL-10), or Vero cells (ATCC CCL-81).


Composition


The present invention also relates to a composition comprising, or consisting essentially of, or consisting of, the mutant polypeptide or the fusion polypeptide according to the invention, the nucleic acid molecule according to the invention, the vector according to the invention, the host cell according to the invention, or any combination thereof.


Preferably, the composition is a pharmaceutical composition which comprises a therapeutically effective amount of the active agent(s) (the mutant polypeptide or the fusion polypeptide according to the invention, the nucleic acid molecule according to the invention, the vector according to the invention, the host cell according to the invention, or any combination thereof), and one or more pharmaceutically acceptable vehicle(s).


As used herein, a “pharmaceutically acceptable vehicle” is intended to include any and all carriers, solvents, diluents, excipients, adjuvants, dispersion media, coatings, antibacterial and antifungal agents, and absorption delaying agents, and the like, compatible with administration in a subject and in particular in a human.


As used herein a “therapeutically effective amount” is a dose sufficient for the intended use.


The composition preferably comprises a vector according to the invention, and more particularly a VSV vector according to the invention.


Therapeutic/In Vivo Uses


The present invention also relates to a vector according to the invention (in particular a VSV vector according to the invention, preferably an oncolytic VSV vector according to the invention) or a (pharmaceutical) composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention, preferably an oncolytic VSV vector according to the invention), for use as a drug.


The present invention also relates to a vector according to the invention (in particular a VSV vector according to the invention, preferably an oncolytic VSV vector according to the invention) or a (pharmaceutical) composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention, preferably an oncolytic VSV vector according to the invention), for use for treating cancer.


The present invention also relates to the use of a vector according to the invention (in particular a VSV vector according to the invention, preferably an oncolytic VSV vector according to the invention) or of a (pharmaceutical) composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention, preferably an oncolytic VSV vector according to the invention), for the manufacture of a drug for treating cancer.


The present invention also relates to the use of a vector according to the invention (in particular a VSV vector according to the invention, preferably an oncolytic VSV vector according to the invention) or of a (pharmaceutical) composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention, preferably an oncolytic VSV vector according to the invention), for treating cancer.


The present invention also relates to a method for treating cancer in a subject in need thereof, comprising administering to said subject a therapeutically effective amount of a vector according to the invention (in particular a VSV vector according to the invention, preferably an oncolytic VSV vector according to the invention) or of a (pharmaceutical) composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention, preferably an oncolytic VSV vector according to the invention).


Since the VSV virus is an oncolytic virus, it may be used in the treatment of various types of cancers, including prostate cancers, breast cancers, pancreatic cancers and melanoma (Bishnoi. Viruses 10, (2018)).


The present invention also relates to a vector according to the invention (in particular a VSV vector according to the invention) or a (pharmaceutical) composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention), for use in gene therapy or immunotherapy (in particular as a vaccine).


The present invention also relates to the use of a vector according to the invention (in particular a VSV vector according to the invention) or of a (pharmaceutical) composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention), for the manufacture of a drug for gene therapy or immunotherapy (in particular as a vaccine).


The present invention also relates to the use of a vector according to the invention (in particular a VSV vector according to the invention) or of a (pharmaceutical) composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention), for gene therapy or immunotherapy (in particular as a vaccine).


The present invention also relates to a method of gene therapy or immunotherapy in a subject in need thereof, comprising administering to said subject a therapeutically effective amount of a vector according to the invention (in particular a VSV vector according to the invention) or of a (pharmaceutical) composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention).


Vectors according to the invention (and in particular VSV vectors according to the invention) used in gene therapy or immunotherapy are preferably as disclosed above in “Vector” section.


Gene therapy may be used each time it is necessary to correct deficient gene activity in deficient cells.


Immunotherapy may be used for the treatment of various types of diseases, including cancers, infectious diseases, and inflammatory diseases (in particular autoimmune diseases).


The present invention also relates to a vector according to the invention (in particular a VSV vector according to the invention) or a (pharmaceutical) composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention), for use for selective delivery in vivo of a cargo to a chosen cell type.


The present invention also relates to the use of a vector according to the invention (in particular a VSV vector according to the invention) or of a (pharmaceutical) composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention), for the manufacture of a drug for selective delivery in vivo of a cargo to a chosen cell type.


The present invention also relates to the use of a vector according to the invention (in particular a VSV vector according to the invention) or of a (pharmaceutical) composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention), for selective delivery in vivo of a cargo to a chosen cell type.


The present invention also relates to a method for selective delivery in vivo of a cargo to a chosen cell type in a subject in need thereof, comprising administering to said subject a therapeutically effective amount of a vector according to the invention (in particular a VSV vector according to the invention) or of a (pharmaceutical) composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention). Vectors according to the invention (and in particular VSV vectors according to the invention) used for selective delivery in vivo of a cargo to a chosen cell type are preferably as disclosed above in “Vector” section.


The present invention also relates to the mutant polypeptide according to the invention or the fusion polypeptide according to the invention or the vector according to the invention or the host cell according to the invention or of a composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention) or of a combination thereof, for use for targeting a lipid membrane in a subject to a specific target, for instance a cell, in particular a cell to be killed (such as a cancer cell) or unstimulated T cells, B cells, and hematopoietic cells, wherein said mutant polypeptide or fusion polypeptide is anchored in said lipid membrane.


In Vitro Uses


The present invention also relates to the in vitro use of the mutant polypeptide according to the invention or the fusion polypeptide according to the invention or the vector according to the invention (in particular a VSV vector according to the invention) or the host cell according to the invention, or of a composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention) or of a combination thereof, for selective delivery of a cargo to a chosen cell type, such as unstimulated T cells, B cells, and hematopoietic cells.


Vectors according to the invention (and in particular VSV vectors according to the invention) used for selective in vitro delivery of a cargo to a chosen cell type are preferably as disclosed above in “Vector” section.


The present invention also relates to an in vitro use of the mutant polypeptide according to the invention or the fusion polypeptide according to the invention or the vector according to the invention or the host cell according to the invention or of a composition according to the invention (in particular a composition comprising a vector according to the invention, and more particularly a composition comprising VSV vector according to the invention) or of a combination thereof, for targeting a lipid membrane to a specific target, for instance a cell, in particular a cell to be killed (such as a cancer cell) or unstimulated T cells, B cells, and hematopoietic cells, wherein said mutant polypeptide or fusion polypeptide is anchored in said lipid membrane.


The following examples merely intend to illustrate the present invention.


EXAMPLES
Example 1: Selection of Mutations H22N and S422I in Indiana VSV G for Production of Recombinant VSV Virus Expressing an Anti-GFP Targeted G Protein

Materials and Methods


Construction of GNano (First Chimera)


GNano constructions were created starting from the cloned VSV G gene (Indiana strain) in the pCAGGS plasmid. pCAGGS plasmids containing the desired coding GNano sequence with the nanobody inserted at various position were generated using Gibson assembly method. The empty vector pCAGGS was linearized using EcoRI restriction enzyme. Then 3 PCR products with overlapping parts were generated. The product I is the fragment of G before the insertion site of the nanobody. The product II is the nanobody gene. The product III is the fragment of G after the insertion site. PCR products and linearized vector were combined and joint by incubation with Gibson Assembly® Master Mix (NEB).


Obtention and Amplification of Recombinant VSV Virus in BSR Cells


Recombinant VSV were obtained as described in Schnell et al. J. virol. 1996. A Plasmid pVSVFL(+)-GNano expressing the 11,161-nucleotide (nt) positive-strand full-length VSV RNA sequence with the mutant GNano in place of the wt VSV G was generated using two unique restriction sites present on pVSVFL(+), MluI In the 5′ noncoding sequence of the G gene and NheI present in a sequence introduced between the G gene and the L gene. BSR cells were infected with vaccinia virus at a moi of 10. After one hour plasmids pBS-N, pBS-P, pBS-L and pcDNA3.1-G, respectively, encoding N, P, L and G proteins plus the pVSVFL(+)-GNano plasmid were transfected into the cells using lipofectamine 2000 (Invitrogen) in the presence of 10 μg/ml AraC (Sigma). Plasmids and amounts were as follows: 5 μg of pVSVFL(+)_GNano, 2.5 μg of pBS-N, 2 μg of pBS-P, 1 μg of pCDNA3.1-G and 0.5 μg of pBS-L. After 48 h of incubation at 37° C. in 5% CO2, supernatants from transfected cells were collected and debris were pelleted from the cell lysates by centrifugation. Supernatants were filtered through 0.2 μm to remove vaccinia virus, and 2.5 ml of this filtrate was added to approximately 106 BSR cells in a 3.5-cm-diameter plate. After 24 h, the supernatant containing the recombinant VSV was clarified by centrifugation and stored for further use at −80°.


Titer in pfu/ml of the obtained recombinant VSV supernatants were determined by plaque assay in 6 well plates on BSR cells by counting the plaques in each wheel 24 h after infection using crystal violet coloration.


Then, the harvested recombinant virus was amplified by infecting 3 106 BSR cells at moi 0.1 on a 6 cm-diameter cell plate. This new supernatant was tittered by plaque assay and stored for further use at −80°.


To obtain higher titers, 3 106 BSR cells in a 6-cm-diameter plate were infected with 10 000 pfu in order to favor apparition of mutation susceptible to increase the growth rate of the recombinant virus. Ten subsequent passages of the viral supernatant on BSR cells were made. Titer was again estimated using plaque assay.


Titration by Plaque Assay


The day before, 106 BSR cells were seed in 6 wheel plate to ensure a ˜90% confluent monolayer the day after. Then cells were infected with 200 μL of tenfold serial dilutions for 1 h at room temperature. After the infection, cells are overlaid with immobilizing medium added directly to the inoculums containing 0.8% agarose in the wheel.


After addition of the overlay, plates are incubated at 37° C. in 5% CO2 for 24 h. To determine viral titer plaques are counted in wells containing from 5 to 100 plaques. The viral titer is determined as follow: pfu/ml=(average number of plaque)/(dilution×volume of dilution added to the plate).


Sequencing of the G Gene in the Viral Population


3 106 BSR cells on a 6 cm-diameter plate were infected at MOI 3 and total RNA extracts was done using TRIzol (Invitrogen). Then RT-PCR using the primer TTTTTTTTTTTTCAT (SEQ ID NO: 87) specific of all viral mRNAs and was performed on total RNA extracts. Then VSVG gene was amplified by PCR and the PCR product obtained was sent to sequencing to identify compensatory mutations.


Results


A recombinant VSV virus, in which the gene encoding the wild-type glycoprotein G was replaced by that encoding GNano (first chimera, consisting of the signal peptide of VSV Indiana G protein, a two amino acids QF linker, an anti-GFP nanobody of sequence SEQ ID NO: 80, a 10 amino acids linker GGGGSGGGGS (SEQ ID NO:81) and wild-type VSV Indiana G protein ectodomain (SEQ ID NO:2), see top of FIG. 2) was constructed. The amplification of this initial recombinant VSV virus was however very difficult; and only very low infectious titers ˜8·104 pfu/ml were obtained after amplification in BSR cells after 24 hours. Such a low titer is not compatible with industrial production of a targeted VSV virus intended for therapy.


Several passages of this recombinant virus on BSR cells were then performed in order to select mutations improving its amplification. After 10 passages, the sequencing of the genomes of the viral population showed the progressive invasion of this population by a variant containing two mutations in the nucleotide sequence of the glycoprotein resulting in the change of:

    • the histidine residue in position 22 into asparagine (H22N substitution), and
    • the serine 422 residue in position into isoleucine (S422I substitution).


The titer of the recombinant virus obtained after this optimization was around ˜108 pfu/ml.


The resulting improved chimera (Mutated GNano, see bottom of FIG. 2) thus shows high amplification in BSR cells after 24 hours, compatible with industrial production of a targeted VSV virus intended for therapy.


The titers obtained in BSR cells at 24 hours after infection are presented in Table 12 below:









TABLE 12







Recombinant VSV titers in BSR cells at 24 hours after infection.









Titer in BSR cells at 24 hours after


Recombinant VSV virus
infection (pfu/ml)











First chimera (see top of FIG. 2)
1.8 106


Mutated chimera (H22N and S4221)
9.107









It is worth noting that a third mutation subsequently appeared in about 50% of cases in the sequence encoding the dipeptide linker consisting of glutamine and phenylalanine (QF) and located between the signal peptide of VSV glycoprotein G and the inserted nanobody. This mutation led to the replacement of phenylalanine to serine (QF->QS; data not shown).


Importantly, this third mutation had no effect on the kinetics and on the viral titer (data not shown). Indeed, the titer of the recombinant viruses having the 3 mutations was still ˜108 pfu/ml. Therefore, this third mutation did not affect the improved amplification of the recombinant virus in BSR cells.


Conclusions

The insertion of mutations H22N and S422I in the ectodomain of VSV Indiana glycoprotein G surprisingly resulted in a dramatic increase in the ability to amplify a recombinant VSV virus comprising a GFP targeted glycoprotein G (by insertion of an anti-GFP nanobody).


Example 2: The Mutation S422I in Indiana VSV G was Also Selected in Another Context which Explains its Role

In an attempt to identify the histidines which play the role of pH sensitive molecular switch, we replaced the histidines of G ectodomain by an alanine.


Particularly, in VSV Indiana G, in the prefusion form of the protomer, there is a cluster of four histidines (H60, H162, H407) (FIG. 5). We made the hypothesis that protonation of these histidines create a cluster of positive charge which induces a local repulsive force which initiates the structural transition.


Materials and Methods


Plasmids and Cloning.


Point mutations were created starting from the cloned VSV G gene (Indiana Mudd-Summer strain) in the pCAGGS plasmid. Briefly, forward and reverse primers containing the desired mutation were combined separately with one of the primers flanking the G gene to generate two PCR products. These two G gene fragments overlap in the region containing the mutation and were assembled into the pCAGGS linearized vector (by EcoRI) using Gibson assembly reaction kit (New England Biolabs).


Cells and Antibodies.


BSR, clones of BHK-21 (baby hamster kidney; ATCC CCL-10), and HEK-293T (human embryonic kidney expressing simian virus 40 T antigen [SV40T]; ATCC CRL-3216) cells were grown in Dulbecco's modified Eagle's medium (DMEM) supplemented with 8% fetal calf serum (FCS). Mouse monoclonal antibody directed against G ectodomain was supplied by KeraFAST (8G5F11);


Transfection.


For fusion assays and indirect immunofluorescence, BSR cells, grown in six-well plate at 70% confluence, were transfected by the phosphate calcium method (5 μg of the appropriate plasmid per well). For pseudotyped virus production, HEK-293 cells were grown in 10-cm dishes; they were transfected by 6 μg of the appropriate plasmid with polyethylenimine (PEI; Sigma-Aldrich).


Cell Surface Expression.


In order to quantify the expression of G protein on the cell surface, HEK-293 cells plated on six-well dishes at 70% confluence were transfected as described above. At 24 h after transfection, cells were collected by scraping into 1 mM EDTA-PBS, followed by centrifugation at 600 g for 5 min. Cells were incubated with a 1:2,000 dilution of mouse monoclonal anti-G ectodomain antibody (8G5F11; KeraFAST) in PBS on ice for 1 h. Cells were washed twice in PBS, fixed at 4° C. in paraformaldehyde, incubated with a 1:100 dilution of goat anti-mouse Alexa Fluor 488 (Invitrogen) on ice for 1 h, and rinsed in PBS. After resuspension in 500 ml of 0.5 mM EDTA-PBS, the fluorescence of 10,000 cells from each population was determined by flow cytometry using a BD Accuri C6 fluorescence-activated cell sorter (FACS). The mean fluorescence intensity (MFI) of the transfected cells expressing G was quantified by flow cytometry. The relative cell surface expression of transfected cells was determined as follows: (MFI for the mutant)/(MFI for the WT). For each mutant, the percentage given in the bottom of FIG. 6 is the average of three independent experiments.


Results


Mutation H407A completely abolishes G fusion properties. This mutation is lethal for the virus. Therefore, the recombinant viruses (VSV G H407A) cannot be generated spontaneously. We have employed a complementation strategy to support its growth.


Briefly, the recovery of the recombinant VSV was supported by expression of functional WT VSV G protein in trans from a transfected plasmid. To generate virus particles containing only the fusion-defective VSV G expressed from the viral genome, viruses present in the supernatant were amplified in cells that lack the trans-complementing VSV G plasmid. Sequence analysis of independent clones revealed two distinct compensatory mutations: one of them resulted in the replacement of serine 422 by an isoleucine (leading to the double mutant VSV G H407A S422I).


Conclusions

The comparison of the characteristics of VSV G H407A with those of VSV G H407A/54221 revealed that, in the context of mutation H407A, S422I improves the efficiency of VSV G transport efficiency and G recognition by a soluble form of LDL-R CR2 domain (FIG. 6). This strongly suggests that the mutation S422I (or similar substitutions, such as S422F, S422M, S422L or S422V, or equivalent substitutions in equivalent positions of G ectodomain of other VSV strains) facilitates G folding and stabilizes the prefusion form (as the structural transition disrupts the CR domain binding site) in the context of mutation H407A.


Example 3: Cell-Cell Fusion Assays. Comparison of the Fusion Properties of VSV Indiana WTG Glycoprotein with VSV Indiana GNano Optimized Glycoprotein

Materials and Methods


Cell-Cell Fusion Assay.


BSR cells plated on glass coverslips at 70% confluence were cotransfected with pCAGGS plasmids encoding wild-type (WT) G or mutant G, and P-GFP plasmid encoding the phosphoprotein of Rabies virus fused to GFP (cytoplasmic marker). Twenty-four hours after transfection, cells were incubated with fusion buffer (DMEM+10 mM MES) at various pH values (from 5.0 to 6.5) for 10 min at 37°. Cells were then washed once and incubated with DMEM+10 mM HEPES-NaOH buffered at pH 7.4, 1% BSA at 37° C. for 1 h. Cells were fixed with 4% paraformaldehyde in 1×PBS for 15 min. Cells nuclei were stained with DAPI, and syncytium formation was analyzed with Zeiss Axio vert 200 fluorescence microscope with a 20× lens.


Results


Fusion properties of VSV G Indiana WT and VSV GNano after optimization (ie. Mutation of residue H22 to N and residue S422 to I, F, M, L, or V) were analysed in cell-cell fusion assay. For this, BSR cells were transfected with pCAGGS plasmids expressing VSV G (either WT or mutant) and P-GFP (a protein exclusively cytoplasmic allowing an easy observation of syncytia). At 24 h post-transfection, the cells were exposed for 10 min to DMEM adjusted to the indicated pH, which was then replaced by DMEM at pH 7.4. The cells were then kept at 37° C. for 1 h before fixation.


Cells expressing wild-type G protein formed massive syncytia when exposed to low pH, between 5.5 and 6.3 (FIG. 7). Cells expressing GNano after optimization were all able to form syncytia when exposed at low pH (FIG. 7).


Conclusions

The results are consistent with the fact that the presence of a hydrophobic residue in position 422 is key in the regulation of the pH-dependent structural transition. This strongly suggests that the mutation S422I (or S422F, S422M, S422L or S422V, or equivalent substitutions in equivalent positions of G ectodomain of other VSV strains) facilitates G folding and stabilizes the prefusion form of GNano.


Example 4: VSV G C-Terminal Part Contributes to the Locking of the Ectodomain in its Pre-Fusion State

Results


The structure of WT Indiana VSV G ectodomain (G1-440 comprising residues 1 to 440) was determined at 2.1 Å resolution. The ectodomain was crystallized in its trimeric pre-fusion conformation and its structure was determined at 2.1 Å resolution (FIG. 8). The polypeptidic chain could be traced up to residue 432. The C-terminal part of G1-440 is made of an alpha-helix (residues 409 to 416) followed by a kinked region ending with a beta-turn between Pro 418 and Glu 421 and a short beta-hairpin structure (residues 422 to 432). The hairpin structure interacts with strand 79-84 of VSV G fusion domain beta-sheet through hydrogen bonds between the main chain of both segments and between the lateral chain of residues H80 and E421.


Conclusions

This X-ray structure of VSV G ectodomain reveals that the C-terminal part (from residue 407 to residue 425) largely interacts with the fusion domain and therefore contributes to the stabilization of the pre-fusion complex. Particularly relevant for the present invention, is the fact that, in this pre-fusion structure, residue S422 is pointing toward F424 and L430. Therefore, the compensatory mutation S422I stabilizes the beta-hairpin structure (and thus the pre-fusion state) through hydrophobic interactions between 1422 and residues F424 and L430. This also explained why the replacement of S422 by other hydrophobic residues (L, M, V or F) has the same effect (i.e. stabilizes the pre-fusion conformation).


BIBLIOGRAPHIC REFERENCES



  • Albertini et al. Viruses 4, 117-139 (2012).

  • Amirache. Blood 123, 1422-4 (2014).

  • Barber. Oncogene 24, 7710-7719 (2005).

  • Bishnoi et al. Viruses 10, 90 (2018). Fernandez et al. J. Virol. 76, 895-904 (2002).

  • Hastie et al. Virus Res. 176, 16-32 (2013).

  • Needleman et Wunsch. J. Mol. Biol. 48,443-453, 1970.

  • Nikolic et al. Nat Commun. 2018 Mar. 12; 9(1):1029.

  • Rodriguez et al. J Gen Virol. 2002 October; 83(Pt 10):2475-83.

  • Schnell et al. J. virol. 1996 70:2318-2323.

  • Velazquez-Salinas et al. Genome Announc. 2017 Sep. 14; 5(37).


Claims
  • 1. A mutant polypeptide comprising the amino acid sequence of the ectodomain of glycoprotein G of a Vesicular stomatitis virus (VSV) strain, wherein said ectodomain comprises the amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least 50% of identity with the amino acid sequence as set forth in SEQ ID NO: 2, with at least one substitution of an amino acid residue selected from the group consisting of: a) any amino acid residue located from position 421 to position 429 and any amino acid residue located from position 17 to position 25 of SEQ ID NO:2; orb) any amino acid residue located from a position equivalent to position 421 to a position equivalent to position 429 of SEQ ID NO: 2 and any amino acid residue located from a position equivalent to position 17 to a position equivalent to position 25 of SEQ ID NO:2, after optimal global alignment with SEQ ID NO:2.
  • 2. The mutant polypeptide according to claim 1, wherein: a) any amino acid residue located from position 421 to position 429 of SEQ ID NO:2 or equivalent position after optimal global alignment with SEQ ID NO:2 is substituted by a hydrophobic amino acid; and/orb) any amino acid residue located from position 17 to position 25 of SEQ ID NO:2 or equivalent position after optimal global alignment with SEQ ID NO:2 is substituted by a polar amino acid.
  • 3. The mutant polypeptide according to claim 1, which comprises, or consists essentially of, or consists of, an amino acid sequence selected from the group consisting of SEQ ID NOs 17-79.
  • 4. The mutant polypeptide according to claim 1, which further comprises at least one substitution of an amino acid residue selected from the group consisting of amino acid residues located at positions 8, 47, 209 and 354 of SEQ ID NO:2 or amino acid residues located at positions equivalent to positions 8, 47, 209, and 354 of SEQ ID NO:2 after optimal global alignment with SEQ ID NO:2.
  • 5. A fusion polypeptide comprising the mutant polypeptide according to claim 1, and an additional peptide, polypeptide, or protein, wherein the additional peptide, polypeptide or protein is inserted in N-terminal of the mutant polypeptide, or between any two consecutive amino acids located in regions corresponding to positions 192 to 202, 240 to 257, 347 to 353, 364 to 366, or 376 to 379 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2.
  • 6. The fusion polypeptide according to claim 5, wherein the additional peptide, polypeptide, or protein is at least a part of a ligand of a cellular receptor.
  • 7. A nucleic acid molecule encoding the mutant polypeptide according to claim 1 or a fusion polypeptide comprising the mutant polypeptide according to claim 1 and an additional peptide.
  • 8. A vector comprising at least one nucleic acid selected from the group consisting of a nucleic acid encoding the mutant polypeptide according to claim 1 and a nucleic acid encoding a fusion polypeptide comprising the mutant polypeptide according to claim 1 and an additional peptide, or a vector expressing at least one selected from the group consisting of the mutant polypeptide according to claim 1 and a fusion polypeptide comprising the mutant polypeptide according to claim 1 and an additional peptide.
  • 9. The vector according to claim 8, which is a viral vector.
  • 10. A host cell containing or expressing at least one selected from: the mutant polypeptide according to claim 1,a fusion polypeptide comprising the mutant polypeptide according to claim 1 and an additional peptide,a nucleic acid encoding the mutant polypeptide according to claim 1,a nucleic acid encoding a fusion polypeptide comprising the mutant polypeptide according to claim 1 and an additional peptide,a vector comprising at least one nucleic acid selected from the group consisting of a nucleic acid encoding the mutant polypeptide according to claim 1 and a nucleic acid encoding a fusion polypeptide comprising the mutant polypeptide according to claim 1 and an additional peptide,a vector expressing at least one selected from the group consisting of the mutant polypeptide according to claim 1 and a fusion polypeptide comprising the mutant polypeptide according to claim 1 and an additional peptide, andany combination thereof.
  • 11. A composition comprising at least one selected from: the mutant polypeptide according to claim 1,a fusion polypeptide comprising the mutant polypeptide according to claim 1 and an additional peptide,a nucleic acid encoding the mutant polypeptide according to claim 1,a nucleic acid encoding a fusion polypeptide comprising the mutant polypeptide according to claim 1 and an additional peptide,a vector comprising at least one nucleic acid selected from the group consisting of a nucleic acid encoding the mutant polypeptide according to claim 1 and a nucleic acid encoding a fusion polypeptide comprising the mutant polypeptide according to claim 1 and an additional peptide,a vector expressing at least one selected from the group consisting of the mutant polypeptide according to claim 1 and a fusion polypeptide comprising the mutant polypeptide according to claim 1 and an additional peptide,a host cell containing or expressing at least one of the above, andany combination thereof.
  • 12. A method for treating a disease, comprising administering, to a subject in need thereof, the composition according to claim 11.
  • 13. The method according to claim 12, for treating cancer, for gene therapy, or for immunotherapy.
  • 14. The method according to claim 12, for selective delivery in vivo of a cargo to a chosen cell type or for targeting a lipid membrane in a subject to a specific target, wherein the mutant polypeptide or fusion polypeptide is anchored to the cargo or anchored in the lipid membrane.
  • 15. An in vitro method of using the composition according to claim 11, for selective delivery in vitro of a cargo to a chosen cell type or for targeting a lipid membrane to a specific target, wherein the mutant polypeptide or fusion polypeptide is anchored to the cargo or anchored in the lipid membrane.
  • 16. The mutant polypeptide according to claim 1, wherein the at least one substitution of an amino acid residue is selected from the group consisting of: a) any amino acid residue located from position 421 to position 425 and any amino acid residue located from position 17 to position 25 of SEQ ID NO:2; orb) any amino acid residue located from a position equivalent to position 421 to a position equivalent to position 425 of SEQ ID NO: 2 and any amino acid residue located from a position equivalent to position 17 to a position equivalent to position 25 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2.
  • 17. The mutant polypeptide according to claim 1, wherein the at least one substitution of an amino acid residue is selected from the group consisting of: a) any amino acid residue located in positions 422 and 22 of SEQ ID NO:2; orb) any amino acid residue located in positions equivalent to positions 422 and 22 of SEQ ID NO: 2, after optimal global alignment with SEQ ID NO:2.
  • 18. The mutant polypeptide according to claim 1, wherein: a) the hydrophobic amino acid is selected for the group consisting of glycine (G), alanine (A), valine (V), leucine (L), isoleucine (I), methionine (M), proline (P), phenylalanine (F), and tryptophan (W); and/orb) the polar amino acid is selected from the group consisting of asparagine (N), glutamine (Q), cysteine (C), tyrosine (Y), threonine (T), and serine (S).
  • 19. The mutant polypeptide according to claim 1, which further comprises a substitution at position 47 of SEQ ID NO:2 or equivalent position after optimal global alignment with SEQ ID NO:2, or a substitution at position 354 of SEQ ID NO:2 or equivalent position after optimal global alignment with SEQ ID NO:2, or two substitutions at positions 47 and 354 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2.
  • 20. The fusion polypeptide according to claim 5, wherein the additional peptide, polypeptide, or protein is inserted in the N-terminal of the mutant polypeptide or between the amino acids at positions 351 and 352 of SEQ ID NO:2 or equivalent positions after optimal global alignment with SEQ ID NO:2.
  • 21. The fusion polypeptide according to claim 6, wherein the at least a part of a ligand of a cellular receptor is an antibody or a functional antibody fragment or a nanobody.
Priority Claims (1)
Number Date Country Kind
19305317.0 Mar 2019 EP regional
PCT Information
Filing Document Filing Date Country Kind
PCT/EP2020/057144 3/16/2020 WO 00