CoV-2 (CoV-n) antibody neutralizing and CTL vaccines using protein scaffolds and molecular evolution

Abstract
The embodiment of the invention is to innovate immunogenic CoV-2, CoV-n B cell epitopes, which are selected from the loop regions constrained by the two constraining beta strands; or between one beta strand and one alpha-helical strand; or between two alpha-helical strands of CoV-2, CoV-n proteins, and such selected loops replace the native loops of the thermostable protein scaffolds, which provide thermostable constraint of the transplanted CoV-2, CoV-n loop antigens as well as CD4 helper T cell determinants to elicit CD4 dependent neutralizing and blocking antibodies against viral entry, replication and viral clearance. The B cell loops can be cleaved and processed as CD8 T cell epitopes for eliciting cytotoxic T lymphocyte (CTL) responses against and clear viral infected cells. MHCI viral peptide epitopes in nonamers, octamers, or decamers will be used as peptide vaccines along with viral CD4 helper peptide epitopes. These CTL peptides will be also inserted into the loop or replacing the native loop of the protein scaffolds. All the above sequences can be embodied in RNA vaccines to augment protection or suppress cytokine storms. And the embodiment of the invention is to employ CTL peptides and CD4 helper peptides inserted into each respective candidate loop of the protein scaffolds as CTL vaccines. Thus, the CTL vaccines safely eliminate infectious foci and reservoir of the offending virus and mutant viral strains.
Description
LEGEND TO FIG. 1

The protein folding is dictated by primary sequence and the secondary structure of beta strands or beta-sheet accommodating the loop structures, which assume conformations as an antigenic site, a pharmacophore, or enzymatic sites. The scaffold protein is defined as a candidate protein with such beta strands that can constrain a given loop of protein, or a beta (β) sheet that can accommodate multiple loops in adjacent orientations. Beta strands of FIG. 1A to 1D are colored yellow bar, and the loop sequences are left blank, while the alpha helical sequences are colored. The lipocalins is a family of proteins which transport small hydrophobic molecules such as steroids, billins, retinoids and lipids. They share limited regions of sequence homology and a common tertiary structure architecture. This is an eight stranded antiparallel β barrel with a repeated+1 topology enclosing an internal ligand binding site. GFP consists of a stable beta (β) barrel consisting of 8 anti-parallel beta strands that can accommodate the loop structure wherein. Fibronectin exhibit six beta strands in a barrel. Immunoglobulin heavy and light chain consist of six beta strands in a barrel. The complementarity-determining region (CDR) are typical loop regions scaffolded by the two beta (β)-strands. The invention embodies the replacement of the native loop sequences of immunoglobulin CDRs with loop sequences of CoV-2. These scaffold proteins are embodied to exhibit and conformationally constrained antigenic B cell epitopes of CoV-2, CoV-1 and CoV-n variants, defined as a CoV-2 mutant, or a next generation of CoV-2 with distinct which can be a recombinant with zoonotic origin.


LEGEND TO FIG. 2

Due to selective pressure of CoV-2 facing the herd immunity, mutant CoV-2 ensued according to antibody-mediated immunity against attachment site to host cell receptors such as the cognate interactions between the spike protein and ACE2 receptor. The viral mutant can be predicted a prior by selective binding pressure using ribosome display. Ribosome display (RD) is illustrated as an example for selecting optimized Covid-19 sequences, based on the pre-existing sequences. Other protein displays such as phage display, yeast display are equally applied. RD is a straightforward in vitro phenotype-genotype linked selection. FIG. 2 diagrammatically illustrates the principle of RD as a selection strategy in the laboratory. (i) A highly stable complex of Aptameric or Antibody-Ribosome-Message, i.e., ARM is first formed; (ii) it can be selected on the ligand-coated solid phase; (iii) via RT-PCR, the mRNA can be amplified and undergo molecular evolution at the same time, and improved aptameric protein phenotype is again physically linked to mRNA ready for the next re-iterative round selection. In the embodiment of the invention, RD is used on VHH nanobody platform as well as fibronectin, CTL-A4, GFP and ankyrin repeated (AR) domains as the library scaffold.


Embodiment and Biochemical enablement: The biochemical mechanisms were built into the IgV/VHH, AR, and aforementioned protein scaffolds: (i) Cκ or any protein as a stuffer region rendering this a stalling interaction with the introduction of termination codon in the C-terminus, fused to the viral sequences containing CDRs along with cognate mRNA. (ii) To enable reiteration of phenotype-library linked selection, T7 promoter and the Kozak sequence of the eukaryotic system were built into 5′ terminal PCR primer. (iii) The cognate message amplified by RT-PCR and further follow through repeated rounds of selection, and the final product cloned and expressed, and protein purified.


LEGEND TO FIG. 3

Diagram of the spike protein and ACE2 interaction. The spike protein exhibits multiple functions: attachment to ACE2 via the RBD (RBM); and the opening up or conformational change of the spike protein to reveal the enzymatic furin site for maturation and finally exposure of the fusion peptides, constrained by the conformationally rearranged HR regions, that permit target cell membrane fusion that facilitate viral entry. The invention embodies eliciting neutralizing antibodies that block all three aspects of the viral life cycles. Moreover, the invention embodies elicitation of CTL against peptide fragment decorated on MHCI that lyse the infected targets. Such CTL targeted peptides are derived from all parts of the spike proteins. FIG. 3c also helps to illustrate the embodiment of the invention producing highly competitive spike protein ACE2 receptor-binding variants; notice four contact regions of the spike protein to the receptors. Mutation can be introduced by error prone PCR an DNA shuffle and the high affinity winning competitors, commensurate with the successfully evolving mutants. High affinity is defined by accessibility of RBE/RBM to ACE2 as well as NTD, e.g., domain N-terminal to RBD/RBM domain permits the conformational flexibility of NTD, and consequently modulate the binding of open form (versus close form inaccessible to the ACE2). Vaccines embodied by this invention, directing such NTD mutants also play an equal important role in defense similar to mutants of RBD/RBE.


LEGEND TO FIG. 4

Vessler's sequences reports full length of spike protein, and is presented herein. As described in FIG. 1, the embodiment of the invention is rendered equivalent pertaining to conformational constraint in that the loop present in the spike protein or any other structural proteins of CoV-2 can be swapped between the beta strands of the four protein scaffolds and replacing the native loop of the scaffold protein. Therefore, the loop in between the beta-beta strands, is employed as candidates of antigenic B cell epitopes. Moreover, the invention also embodies the constraining scaffolding principle between the beta strand and an alpha helix; and between alpha helix and alpha helix.


LEGEND TO FIG. 5

Amino acid sequence alignment of the S1 protein of the 2019-nCoV with SARS-CoV and selected bat SARSr-CoVs. The receptor-binding motifs of SARS-CoV and the homologous regions of other coronaviruses are indicated by the red box. The key amino acid residues involved in the interaction with human ACE2 are numbered on top of the aligned sequences. The short insertions in the N-terminal domain of the novel coronavirus are indicated by the blue boxes. Bat CoV RaTG13 was identified from R. affinis in Yunnan Province. Bat CoV ZC45 was identified from R. sinicus in Zhejiang Province.


LEGEND TO FIG. 6

Examples of scaffolding immunoglobulin heavy chain protein scaffold for presenting Cov19 Spike protein fragment: Three firewall precision vaccines constructed in CDRs of camelid VHH in pET45b.


Fragments of Cov19 were inserted into CDR2 or CDR3 domain of Camelid-VHH-GFP by using site-directed mutagenesis with primers, attaccaccaaccttagaatcaagattgttagaattgctcaccagttcac, tataattacctgtatagattggcaaattatgccggc for VHH-Cov1-3. attaccaccaaccttagaatcaagattgttagaatttgctgcgcaataataaac, tataattacctgtatagattgtggggccagggcacc for VHH-Cov2-3. tccatcattgcctacactatgtcacttggttggggccagggcacc, ttgactagctacactacgtgcccgccgaggagaattagtctgagtctgatatgctgcgcaataataaac for Camelid-Cov-3- 2. Aaagtgacacttgcagatgctggcttcatcaaatggggccagggcacc, gttgaaaagtagatcttcaataaatgacctcttgcttggttttgatggtgctgcgcaataataaac for Camelid-Cov-4-2. Aaaaccaccaaaatctttaattggtggtgttttgtaaatgctcaccagttcacattc, aatttttcacaaatattagcaaattatgccggc for Camelid-CoV-5-3. aaaaccaccaaaatctttaattggtggtgttttgtaaattgctgcgcaataataaac, aatttttcacaaatattatggggccagggcacc for Camelid-Cov 6-1. ccgccgaggagagctcaccagttcacattc, gcacgtagtgtagatggcagtgcaaattatgcc; and gttgaaaagtagatcttcaataaatgatgctgcgcaataataaacggc, aaagtgacacttgcagatgctggcttctggggccagggcacccaggttaccfor Camelid-Cov-S1-2. ttcagttgaaatatctctctcaaaaggtttgagattagacttcctaaagctcaccagttcaca, aatatctctctcaaaaggtttgagattagacttcctaaagctcaccagttcacattctttacc; and ttggaaaccatatgattgtaaaggaaagtaacaattaaaaccttcaacacctgctgcgcaataataaac, cccactaatggtgttggttaccaaccatacagagtatggggccagggcacccaggttacfor Camelid-Cov-469. attaaagattttggtggttttaatttttcacaaatattaccatggggccagggcacc, aaaatctttaattggtggtgttttgtaaatttgtttgacttgtgctgctgcgcaataataaac for Camelid-Cov-Fu-2


LEGEND TO FIG. 7

The Table consists replacing the above RBD regions into the immunoglobulin CDR1, 2, 3


LEGEND TO FIG. 8

Different Vaccine construct data. Although GFP as a fusion protein to the viral vaccine B cell epitope in VHH. The helper CD4 sequences can be any immunogenic carrier protein such as OVA, BSA, KLH, BGG or promiscuous helper T cell determinants PADRE and also promiscuous helper protein of infectious origins of measles viral protein, diphtheria toxin, and tetanus toxin for enhancing antibody responses to Covid-19 spike protein, and other proteins. For example, NSNNLDSKVGGNYNYLYRL, conformationally constrained to the native conformation of the RBD regions, embracing a large contact area of the ACE2 receptor in the spike protein. Since CoV-2 is lethal to patients elicited with CD4 helper T cell epitopes induced cytokine storm, the embodiment of the invention utilizes minimal IgE B cell epitope, which even if can be processed to a CD4 helper epitope, result in minimal CD4 T cell activation.


LEGEND TO FIG. 9

As discussed in the Vessler's sequences, a Table consists of B cell epitopes of a complete spike protein is made wherein; moreover, this Table consists of B cell epitopes from the RBD region of the Vessler's sequences distributed into the CDR1, 2, 3, of the immunoglobulin.


FIG. 10. CoV-2 Total Genomic Sequences

Severe acute respiratory syndrome (SARS) CoV-2, complete genome (NCBI Reference Sequence: NC_045512.2), including in the translation frame: ORF1a, ORF-1ab, leader protein, nsp-2, nsp-3, nsp-4, 3C-like proteinase, nsp-5, nsp-6, nsp-7, nsp-8, nsp-9, nsp-10, RNA-dependent RNA polymerase, helicase, 3->5′ exonuclease, endoRNAse, 2-O′ ribose methyltransferase, Surface Glycoprotein (S), ORF-3a, E, M, ORF-6, N, ORF-7a, ORF-7b, ORF-8, nucleocapsid (NC).


Sequences of Table 1 to Table 11

1) Name of the Sequence ASCII text file: CoV-2 Vaccine Sequences


2) Date of creation: July 2nd 2020


3) Size of the Sequence File: 295 KB


4) Note: The highlighted columns in yellow are the only ones included in the Sequence Listing CRF/Text File









TABLE 1







RDRP















Sequence

Sequence






Allele
ID No.
Start
Annotation
Length
Sequence
Score
Percentile_rank

















HLA-A*01:01
396
450
RDRP A1.01_450
9
ISDYDYYRY
0.987873
0.01





HLA-A*01:01
397
738
RDRP A1.01_738
9
DTDFVNEFY
0.987646
0.01





HLA-A*01:01
398
907
RDRP A1.01 907
9
LTNDNTSRY
0.973398
0.01





HLA-A*01:01
399
475
RDRP A1.01_475
9
VVDKYFDCY
0.93711
0.02





HLA-A*01:01
400
681
RDRP A1.01_681
9
SSGDATTAY
0.855722
0.05





HLA-A*01:01
401
895
RDRP A1.01 895
9
LTGHMLDMY
0.666481
0.1





HLA-A*01:01
402
869
RDRP A1.01 869
9
LTKHPNQEY
0.648646
0.11





HLA-A*01:01
403
366
RDRP A1.01_366
9
LSFKELLVY
0.62167
0.13





HLA-A*01:01
404
758
RDRP A1.01_758
9
LSDDAVVCF
0.574853
0.15





HLA-A*01:01
405
877
RDRP A1.01_877
9
YADVFHLYL
0.452167
0.23





HLA-A*01:01
406
209
RDRP A1.01_209
9
NQDLNGNWY
0.448436
0.23





HLA-A*01:01
407
606
RDRP A1.01_606
9
YSDVENPHL
0.443161
0.23





HLA-A*01:01
408
27
RDRP A1.01_27
9
STDVVYRAF
0.44026
0.24





HLA-A*01:01
409
471
RDRP A1.01_471
9
FVVEVVDKY
0.431406
0.24





HLA-A*01:01
410
538
RDRP A1.01_538
9
TITQMNLKY
0.388855
0.28





HLA-A*01:01
411
859
RDRP A1.01_859
9
FVSLAI DAY
0.361526
0.3





HLA-A*01:01
412
448
RDRP A1.01_448
9
AAISDYDYY
0.35683
0.3





HLA-A*01:01
413
281
RDRP A1.01_281
9
KLFDRYFKY
0.336405
0.32





HLA-A*02:01
414
123
RDRP A2.01_123
9
TMADLVYAL
0.933263
0.03





HLA-A*02:01
415
854
RDRP A2.01_854
9
LMIERFVSL
0.906906
0.03





HLA-A*02:01
416
334
RDRP A2.01_334
9
FVDGVPFVV
0.902701
0.03





HLA-A*02:01
417
64
RDRP A2.01_64
9
NLIDSYFVV
0.822701
0.07





HLA-A*02:01
418
239
RDRP A2.01_239
9
SLLMPILTL
0.793618
0.08





HLA-A*02:01
419
741
RDRP A2.01_741
9
FVNEFYAYL
0.760859
0.1





HLA-A*02:01
420
899
RDRP A2.01_899
9
MLDMYSVML
0.701191
0.13





HLA-A*02:01
421
654
RDRP A2.01_654
9
RLANECAQV
0.659383
0.15





HLA-A*02:01
422
467
RDRP A2.01_467
9
RQLLFVVEV
0.65728
0.16





HLA-A*02:01
423
88
RDRP A2.01_88
9
NLLKDCPAV
0.553623
0.23





HLA-A*02:01
424
240
RDRP A2.01_240
9
LLMPILTLT
0.514208
0.26





HLA-A*02:01
425
667
RDRP A2.01_667
9
VMCGGSLYV
0.482022
0.27





HLA-A*02:01
426
821
RDRP A2.01_821
9
KQGDDYVYL
0.449688
0.32





HLA-A*02:01
427
122
RDRP A2.01_122
9
YTMADLVYA
0.448847
0.32





HLA-A*02:01
428
647
RDRP A2.01_647
9
SLSHRFYRL
0.438669
0.32





HLA-A*02:01
429
365
RDRP A2.01_365
9
RLSFKELLV
0.406213
0.35





HLA-A*02:01
430
923
RDRP A2.01_923
9
AMYTPHTVL
0.33343
0.48





HLA-A*02:01
431
397
RDRP A2.01_397
9
SVAALTNNV
0.322593
0.5





HLA-A*02:01
432
307
RDRP A2.01_307
9
ILHCANFNV
0.311923
0.53





HLA-A*02:01
433
574
RDRP A2.01_574
9
KLLKSIAAT
0.298377
0.56





HLA-A*03:01
434
281
RDRP A3.01_281
9
KLFDRYFKY
0.94431
0.02





HLA-A*03:01
435
324
RDRP A3.01_324
9
TSFGPLVRK
0.940452
0.02





HLA-A*03:01
436
500
RDRP A3.01_500
9
KSAGFPFNK
0.938515
0.02





HLA-A*03:01
437
569
RDRP A3.01_569
9
RQFHQKLLK
0.936158
0.02





HLA-A*03:01
438
513
RDRP A3.01_513
9
RLYYDSMSY
0.933607
0.02





HLA-A*03:01
439
882
RDRP A3.01_882
9
HLYLQYIRK
0.930444
0.02





HLA-A*03:01
440
863
RDRP A3.01_863
9
AIDAYPLTK
0.849649
0.05





HLA-A*03:01
441
409
RDRP A3.01_409
9
TVKPGNFNK
0.830192
0.06





HLA-A*03:01
442
95
RDRP A3.01_95
9
AVAKHDFFK
0.813978
0.07





HLA-A*03:01
443
566
RDRP A3.01_566
9
MTNRQFHQK
0.779259
0.1





HLA-A*03:01
444
173
RDRP A3.01_173
9
RVYANLGER
0.738206
0.14





HLA-A*03:01
445
585
RDRP A3.01_585
9
ATVVIGTSK
0.724568
0.14





HLA-A*03:01
446
113
RDRP A3.01_113
9
HISRQRLTK
0.710757
0.15





HLA-A*03:01
447
775
RDRP A3.01_775
9
LVASIKNFK
0.669788
0.19





HLA-A*03:01
448
33
RDRP A3.01_33
9
RAFDIYNDK
0.50112
0.33





HLA-A*03:01
449
613
RDRP A3.01_613
9
HLMGWDYPK
0.486599
0.36





HLA-A*03:01
450
706
RDRP A3.01_706
9
ALLSTDGNK
0.409837
0.47





HLA-A*03:01
451
141
RDRP A3.01_141
9
TLKEILVTY
0.406267
0.47





HLA-A*03:01
452
543
RDRP A3.01_543
9
NLKYAISAK
0.37021
0.53





HLA-A*03:01
453
890
RDRP A3.01_890
9
KLHDELTGH
0.366656
0.53





HLA-A*03:01
454
632
RDRP A3.01_632
9
IMASLVLAR
0.361638
0.54





HLA-A*03:01
455
65
RDRP A3.01_65
9
LIDSYFVVK
0.336463
0.59





HLA-A*24:02
456
37
RDRP A24.02_37
9
IYNDKVAGF
0.947951
0.01





HLA-A*24:02
457
883
RDRP A24.02_883
9
LYLQYIRKL
0.838016
0.04





HLA-A*24:02
458
688
RDRP A24.02_688
9
AYANSVFNI
0 809659
0.06





HLA-A*24:02
459
69
RDRP A24.02_69
9
YFVVKRHTF
0.807841
0.06





HLA-A*24:02
460
876
RDRP A24.02_876
9
EYADVFHLY
0.790376
0.06





HLA-A*24:02
461
745
RDRP A24.02_745
9
FYAYLRKHF
0.687371
0.11





HLA-A*24:02
462
747
RDRP A24.02_747
9
AYLRKHFSM
0.651172
0.13





HLA-A*24:02
463
520
RDRP A24.02_520
9
SYEDQDALF
0.62922
0.13





HLA-A*24:02
464
594
RDRP A24.02_594
9
FYGGWHNML
0.610031
0.14





HLA-A*24:02
465
593
RDRP A24.02 593
9
KFYGGWHNM
0.606663
0.14





HLA-A*24:02
466
455
RDRP A24.02_455
9
YYRYNLPTM
0.599652
0.15





HLA-A*24:02
467
414
RDRP A24.02_414
9
NFNKDFYDF
0.560985
0.16





HLA-A*24:02
468
786
RDRP A24.02_786
9
LYYQNNVFM
0.542169
0.18





HLA-A*24:02
469
237
RDRP A24.02_237
9
YYSLLMPIL
0.539305
0.18





HLA-A*24:02
470
325
RDRP A24.02_325
9
SFGPLVRKI
0.471972
0.21





HLA-A*24:02
471
174
RDRP A24.02_174
9
VYANLGERV
0.450562
0.22





HLA-A*24:02
472
830
RDRP A24.02_830
9
PYPDPSRIL
0.445807
0.23





HLA-A*24:02
473
236
RDRP A24.02_236
9
SYYSLLMPI
0.406355
0.26





HLA-A*24:02
474
267
RDRP A24.02_267
9
KWDLLKYDF
0.357142
0.29





HLA-A*24:02
475
272
RDRP A24.02_272
9
KYDFTEERL
0.340771
0.31





HLA-A*26:01
476
471
RDRP A26.01_471
9
FVVEVVDKY
0.974688
0.01





HLA-A*26:01
477
534
RDRP A26.01_534
9
NVIPTITQM
0.958845
0.01





HLA-A*26:01
478
879
RDRP A26.01_879
9
DVFHLYLQY
0.93494
0.01





HLA-A*26:01
479
141
RDRP A26.01_141
9
TLKEILVTY
0.786238
0.04





HLA-A*26:01
480
907
RDRP A26.01_907
9
LINDNTSRY
0.775465
0.05





HLA-A*26:01
481
318
RDRP A26.01_318
9
STVFPPTSF
0.717593
0.06





HLA-A*26:01
482
587
RDRP A26.01_587
9
VVIGTSKFY
0.686358
0.06





HLA-A*26:01
483
586
RDRP A26.01_586
9
TVVIGTSKF
0.685805
0.06





HLA-A*26:01
484
876
RDRP A26.01_876
9
EYADVFHLY
0.648839
0.07





HLA-A*26:01
485
265
RDRP A26.01_265
9
YIKWDLLKY
0.592281
0.08





HLA-A*26:01
486
666
RDRP A26.01_666
9
MVMCGGSLY
0.528754
0.1





HLA-A*26:01
487
340
RDRP A26.01_340
9
FVVSTGYHF
0.528092
0.1





HLA-A*26:01
488
859
RDRP A26.01_859
9
FVSLAIDAY
0.517477
0.11





HLA-A*26:01
489
167
RDRP A26.01_167
9
ENPDILRVY
0.468802
0.13





HLA-A*26:01
490
869
RDRP A26.01_869
9
LTKHPNQEY
0.463446
0.13





HLA-A*26:01
491
126
RDRP A26.01_126
9
DLVYALRHF
0.407151
0.17





HLA-A*26:01
492
762
RDRP A26.01_762
9
AVVCFNSTY
0.379217
0.19





HLA-A*26:01
493
538
RDRP A26.01_538
9
TITQMNLKY
0.372912
0.19





HLA-A*26:01
494
738
RDRP A26.01_738
9
DTDFVNEFY
0.360999
0.2





HLA-A*26:01
495
686
RDRP A26.01_686
9
TTAYANSVF
0.353076
0.21





HLA-A*26:01
496
741
RDRP A26.01_741
9
FVNEFYAYL
0.325335
0.22





HLA-A*26:01
497
358
RDRP A26.01_358
9
DVNLHSSRL
0.307842
0.24





HLA-B*07:02
498
111
RDRP B07.02_111
9
VPHISRQRL
0.97802
0.02





HLA-B*07:02
499
263
RDRP B07.02_263
9
KPYIKWDLL
0.713089
0.12





HLA-B*07:02
500
536
RDRP B07.02_536
9
IPTITQMNL
0.663521
0.14





HLA-B*07:02
501
829
RDRP B07.02_829
9
LPYPDPSRI
0.562533
0.2





HLA-B*07:02
502
321
RDRP B07.02_321
9
FPPTSFGPL
0.535352
0.21





HLA-B*07:02
503
226
RDRP B07.02_226
9
TPGSGVPVV
0.376591
0.37





HLA-B*07:02
504
778
RDRP B07.02 778
9
SIKNFKSVL
0.343102
0.42





HLA-B*07:02
505
887
RDRP B07.02_887
9
YIRKLHDEL
0.34074
0.43





HLA-B*07:02
506
626
RDRP B07.02_626
9
MPNMLRIMA
0.334597
0.43





HLA-B*07:02
507
719
RDRP B07.02_719
9
YVRNLQHRL
0.320658
0.45





HLA-B*15:01
508
141
RDRP B15.01_141
9
TLKEILVTY
0.974655
0.01





HLA-B*15:01
509
513
RDRP B15.01_513
9
RLYYDSMSY
0.97292
0.01





HLA-B*15:01
510
869
RDRP B15.01_869
9
LTKHPNQEY
0.858889
0.03





HLA-B*15:01
511
785
RDRP B15.01_785
9
VLYYQNNVF
0.836058
0.05





HLA-B*15:01
512
281
RDRP B15.01_281
9
KLFDRYFKY
0.826664
0.05





HLA-B*15:01
513
360
RDRP B15.01_360
9
NLHSSRLSF
0.761224
0.09





HLA-B*15:01
514
265
RDRP B15.01_265
9
YIKWDLLKY
0.757898
0.1





HLA-B*15:01
515
318
RDRP B15.01_318
9
STVFPPTSF
0.735881
0.11





HLA-B*15:01
516
116
RDRP B15.01_116
9
RQRLTKYTM
0.712443
0.12





HLA-B*15:01
517
587
RDRP B15.01_587
9
VVIGTSKFY
0.70216
0.13





HLA-B*15:01
518
923
RDRP B15.01_923
9
AMYTPHTVL
0.694507
0.13





HLA-B*15:01
519
366
RDRP B15.01_366
9
LSFKELLVY
0.692667
0.14





HLA-B*15:01
520
471
RDRP B15.01_471
9
FVVEVVDKY
0.666634
0.16





HLA-B*15:01
521
660
RDRP B15.01_660
9
AQVLSEMVM
0.660662
0.17





HLA-B*15:01
522
279
RDRP B15.01_279
9
RLKLFDRYF
0.629707
0.19





HLA-B*15:01
523
332
RDRP B15.01_332
9
KIFVDGVPF
0.609134
0.21





HLA-B*15:01
524
907
RDRP B15.01_907
9
LINDNTSRY
0.597212
0.21





HLA-B*15:01
525
681
RDRP B15.01_681
9
SSGDATTAY
0.555694
0.24





HLA-B*15:01
526
340
RDRP B15.01_340
9
FVVSTGYHF
0.544647
0.25





HLA-B*15:01
527
30
RDRP B15.01_30
9
VVYRAFDIY
0.533353
0.26





HLA-B*15:01
528
774
RDRP B15.01_774
9
GLVASIKNF
0.529503
0.26





HLA-B*15:01
529
762
RDRP B15.01_762
9
AVVCFNSTY
0.512271
0.28





HLA-B*15:01
530
586
RDRP B15.01_586
9
TVVIGTSKF
0.511355
0.29





HLA-B*15:01
531
114
RDRP B15.01_114
9
ISRQRLTKY
0.505383
0.29





HLA-B*15:01
532
229
RDRP B15.01_229
9
SGVPVVDSY
0.469878
0.33





HLA-B*15:01
533
666
RDRP B15.01_666
9
MVMCGGSLY
0.463589
0.34





HLA-B*15:01
534
859
RDRP B15.01_859
9
FVSLAIDAY
0.44269
0.39





HLA-B*15:01
535
748
RDRP B15.01_748
9
YLRKHFSMM
0.442028
0.39





HLA-B*15:01
536
854
RDRP B15.01_854
9
LMIERFVSL
0.415997
0.41





HLA-B*15:01
537
407
RDRP B15.01_407
9
FQTVKPGNF
0.413296
0.42





HLA-B*15:01
538
534
RDRP B15.01_534
9
NVIPTITQM
0.392871
0.44





HLA-B*15:01
539
847
RDRP B15.01_847
9
IVKTDGTLM
0.37484
0.47





HLA-B*15:01
540
780
RDRP B15.01_780
9
KNFKSVLYY
0.347435
0.52





HLA-B*15:01
541
538
RDRP B15.01_538
9
TITQMNLKY
0.347019
0.52





HLA-B*15:01
542
286
RDRP B15.01_286
9
YFKYWDQTY
0.319992
0.59





HLA-B*15:01
543
818
RDRP B15.01_818
9
MLVKQGDDY
0.315119
0.59





HLA-B*15:01
544
898
RDRP B15.01_898
9
HMLDMYSVM
0.306896
0.62





HLA-B*40:01
545
875
RDRP B40.01_875
9
QEYADVFHL
0.976681
0.01





HLA-B*40:01
546
82
RDRP B40.01_82
9
HEETIYNLL
0.948908
0.04





HLA-B*40:01
547
810
RDRP B40.01_810
9
HEFCSQHTM
0.927691
0.05





HLA-B*40:01
548
253
RDRP B40.01_253
9
AESHVDTDL
0.90571
0.06





HLA-B*40:01
549
166
RDRP B40.01_166
9
VENPDILRV
0.652113
0.18





HLA-B*40:01
550
57
RDRP B40.01_57
9
QEKDEDDNL
0.623482
0.2





HLA-B*40:01
551
916
RDRP B40.01_916
9
WEPEFYEAM
0.388761
0.39





HLA-B*40:01
552
179
RDRP B40.01_179
9
GERVRQALL
0.365028
0.41





HLA-B*58:01
553
501
RDRP B58.01_501
9
SAGFPFNKW
0.902772
0.06





HLA-B*58:01
554
433
RDRP B58.01_433
9
SSVELKHFF
0.89824
0.06





HLA-B*58:01
555
366
RDRP B58.01_366
9
LSFKELLVY
0.853406
0.09





HLA-B*58:01
556
96
RDRP B58.01_96
9
VAKHDFFKF
0.656886
0.21





HLA-B*58:01
557
624
RDRP B58.01_624
9
RAMPNMLRI
0.631663
0.23





HLA-B*58:01
558
318
RDRP B58.01_318
9
STVFPPTSF
0.630451
0.23





HLA-B*58:01
559
590
RDRP B58.01_590
9
GTSKFYGGW
0.619113
0.24





HLA-B*58:01
560
184
RDRP B58.01_184
9
QALLKTVQF
0.512542
0.31





HLA-B*58:01
561
432
RDRP B58.01_432
9
GSSVELKHF
0.487645
0.33





HLA-B*58:01
562
908
RDRP B58.01_908
9
TNDNTSRYW
0.443616
0.38





HLA-B*58:01
563
758
RDRP B58.01_758
9
LSDDAVVCF
0.431197
0.38





HLA-B*58:01
564
27
RDRP B58.01_27
9
STDVVYRAF
0.417409
0.4





HLA-B*58:01
565
780
RDRP B58.01_780
9
KNFKSVLYY
0.382597
0.42





HLA-B*58:01
566
609
RDRP B58.01_609
9
VENPHLMGW
0.366443
0.44





HLA-B*58:01
567
281
RDRP B58.01_281
9
KLFDRYFKY
0.35087
0.47





HLA-B*58:01
568
907
RDRP B58.01_907
9
LINDNTSRY
0.34387
0.48





HLA-B*58:01
569
869
RDRP B58.01_869
9
LTKHPNQEY
0.341203
0.49





HLA-B*58:01
570
450
RDRP B58.01_450
9
ISDYDYYRY
0.329408
0.51





HLA-B*58:01
571
563
RDRP B58.01_563
9
CSTMTNRQF
0.302352
0.55
















TABLE 2







Helicase















Sequence

Sequence



Percentile_


Allele
ID No.
Start
Annotation
Length
Sequence
Score
rank





HLA-A*01:01
572
 56
Heli A1.01_56
9
DVTDVTQLY
0.673792
0.1





HLA-A*01:01
573
 57
Heli A1.01_57
9
VTDVTQLYL
0.669117
0.1





HLA-A*01:01
574
535
Heli A1.01_535
9
SSQGSEYDY
0.659933
0.1





HLA-A*01:01
575
347
Heli A1.01_347
9
KVNSTLEQY
0.456717
0.22





HLA-A*01:01
576
291
Heli A1.01_291
9
FAIGLALYY
0.42898
0.25





HLA-A*01:01
577
238
Heli A1.01_238
9
PTLVPQEHY
0.38219
0.29





HLA-A*01:01
578
316
Heli A1.01_316
9
ALCEKALKY
0.366728
0.29





HLA-A*01:01
579
533
Heli A1.01_533
9
VDSSQGSEY
0.350748
0.31





HLA-A*01:01
580
209
Heli A1.01_209
9
VVYRGTTTY
0.337049
0.32





HLA-A*01:01
581
258
Heli A1.01_258
9
ISDEFSSNV
0.334115
0.32





HLA-A*01:01
582
141
Heli A1.01_141
9
TEETFKLSY
0.286378
0.39





HLA-A*01:01
583
 62
Heli A1.01_62
9
QLYLGGMSY
0.265626
0.43





HLA-A*01:01
584
190
Heli A1.01_190
9
NSKVQIGEY
0.257774
0.44





HLA-A*01:01
585
574
Heli A1.01_574
9
CIMSDRDLY
0.192125
0.56





HLA-A*01:01
586
468
Heli A1.01_468
9
SAQCFKMFY
0.164928
0.64





HLA-A*02:01
587
239
Heli A2.01_239
9
TLVPQEHYV
0.94681
0.02





HLA-A*02:01
588
146
Heli A2.01_146
9
KLSYGIATV
0.89166
0.03





HLA-A*02:01
589
157
Heli A2.01_157
9
VLSDRELHL
0.834765
0.06





HLA-A*02:01
590
525
Heli A2.01_525
9
ILGLPTQTV
0.699124
0.13





HLA-A*02:01
591
218
Heli A2.01_218
9
KLNVGDYFV
0.692351
0.14





HLA-A*02:01
592
 41
Heli A2.01_41
9
LVLSVNPYV
0.641882
0.17





HLA-A*02:01
593
520
Heli A2.01_520
9
AVASKILGL
0.445682
0.32





HLA-A*02:01
594
448
Heli A2.01_448
9
IVDTVSALV
0.413249
0.35





HLA-A*02:01
595
371
Heli A2.01_371
9
VVFDEISMA
0.359348
0.43





HLA-A*02:01
596
296
Heli A2.01_296
9
ALYYPSARI
0.320686
0.51





HLA-A*02:01
597
248
Heli A2.01_248
9
RITGLYPTL
0.319478
0.51





HLA-A*02:01
598
294
Heli A2.01_294
9
GLALYYPSA
0.314282
0.52





HLA-A*02:01
599
403
Heli A2.01_403
9
AQLPAPRTL
0.313863
0.52





HLA-A*02:01
600
584
Heli A2.01_584
9
KLQFTSLEI
0.28788
0.57





HLA-A*02:01
601
232
Heli A2.01_232
9
VMPLSAPTL
0.262038
0.62





HLA-A*02:01
602
  6
Heli A2.01_6
9
VLCNSQTSL
0.260918
0.62





HLA-A*02:01
603
362
Heli A2.01_362
9
ALPETTADI
0.257268
0.63





HLA-A*02:01
604
33
Heli A2.01_33
9
HVISTSHKL
0.232777
0.7





HLA-A*02:01
605
258
Heli A2.01_258
9
ISDEFSSNV
0.208091
0.76





HLA-A*02:01
606
430
Heli A2.01_430
9
KTIGPDMFL
0.185051
0.86





HLA-A*02:01
607
487
Heli A2.01_487
9
AINRPQIGV
0.185002
0.86





HLA-A*02:01
608
332
Heli A2.01_332
9
RIIPARARV
0.163014
0.97





HLA-A*03:01
609
131
Heli A3.01_131
9
KLFAAETLK
0.957259
0.01





HLA-A*03:01
610
209
Heli A3.01_209
9
VVYRGTTTY
0.827356
0.06





HLA-A*03:01
611
386
Heli A3.01_386
9
VVNARLRAK
0.776827
0.11





HLA-A*03:01
612
321
Heli A3.01_321
9
ALKYLPIDK
0.742179
0.14





HLA-A*03:01
613
 62
Heli A3.01_62
9
QLYLGGMSY
0.676025
0.18





HLA-A*03:01
614
347
Heli A3.01_347
9
KVNSTLEQY
0.65183
0.2





HLA-A*03:01
615
500
Heli A3.01_500
9
LTRNPAWRK
0.619996
0.23





HLA-A*03:01
616
316
Heli A3.01_316
9
ALCEKALKY
0.589629
0.26





HLA-A*03:01
617
469
Heli A3.01_469
9
AQCFKMFYK
0.549939
0.29





HLA-A*03:01
618
280
Heli A3.01_280
9
LQGPPGTGK
0.501505
0.33





HLA-A*03:01
619
 68
Heli A3.01_68
9
MSYYCKSHK
0.484421
0.36





HLA-A*03:01
620
194
Heli A3.01_194
9
QIGEYTFEK
0.39373
0.49





HLA-A*03:01
621
390
Heli A3.01_390
9
RLRAKHYVY
0.333977
0.59





HLA-A*03:01
622
263
Heli A3.01_263
9
SSNVANYQK
0.328995
0.6





HLA-A*03:01
623
454
Heli A3.01_454
9
ALVYDNKLK
0.228612
0.84





HLA-A*03:01
624
147
Heli A3.01_147
9
LSYGIATVR
0.183323
0.99





HLA-A*24:02
625
397
Heli A24.02_397
9
VYIGDPAQL
0.913596
0.02





HLA-A*24:02
626
420
Heli A24.02_420
9
EYFNSVCRL
0.465424
0.21





HLA-A*24:02
627
298
Heli A24.02_298
9
YYPSARIVY
0.400243
0.26





HLA-A*24:02
628
297
Heli A24.02_297
9
LYYPSARIV
0.334529
0.32





HLA-A*24:02
629
232
Heli A24.02_232
9
VMPLSAPTL
0.311964
0.33





HLA-A*24:02
630
192
Heli A24.02_192
9
KVQIGEYTF
0.301849
0.34





HLA-A*24:02
631
 73
Heli A24.02_73
9
KSHKPPISF
0.255341
0.4





HLA-A*24:02
632
179
Heli A24.02_179
9
NYVFTGYRV
0.209981
0.47





HLA-A*24:02
633
 63
Heli A24.02_63
9
LYLGGMSYY
0.191939
0.52





HLA-A*24:02
634
498
Heli A24.02_498
9
EFLTRNPAW
0.177523
0.55





HLA-A*24:02
635
224
Heli A24.02_224
9
YFVLTSHTV
0.161541
0.6





HLA-A*26:01
636
 56
Heli A26.01_56
9
DVTDVTQLY
0.981717
0.01





HLA-A*26:01
637
447
Heli A26.01_447
9
EIVDTVSAL
0.866408
0.03





HLA-A*26:01
638
365
Heli A26.01_365
9
ETTADIVVF
0.69931
0.06





HLA-A*26:01
639
209
Heli A26.01_209
9
VVYRGTTTY
0.665769
0.06





HLA-A*26:01
640
291
Heli A26.01_291
9
FAIGLALYY
0.65294
0.07





HLA-A*26:01
641
 33
Heli A26.01_33
9
HVISTSHKL
0.577552
0.09





HLA-A*26:01
642
261
Heli A26.01_261
9
EFSSNVANY
0.365121
0.2





HLA-A*26:01
643
 62
Heli A26.01_62
9
QLYLGGMSY
0.348444
0.21





HLA-A*26:01
644
347
Heli A26.01_347
9
KVNSTLEQY
0.276517
0.27





HLA-A*26:01
645
190
Heli A26.01_190
9
NSKVQIGEY
0.274137
0.27





HLA-A*26:01
646
143
Heli A26.01_143
9
ETFKLSYGI
0.180406
0.41





HLA-A*26:01
647
269
Heli A26.01_269
9
YQKVGMQKY
0.163497
0.45





HLA-A*26:01
648
290
Heli A26.01_290
9
HFAIGLALY
0.160078
0.46





HLA-B*07:02
649
592
Heli B7.02_592
9
IPRRNVATL
0.991193
0.01





HLA-B*07:02
650
513
Heli B7.02_513
9
SPYNSQNAV
0.884275
0.05





HLA-B*07:02
651
503
Heli B7.02_503
9
NPAWRKAVF
0.856265
0.06





HLA-B*07:02
652
334
Heli B7.02_334
9
IPARARVEC
0.550566
0.21





HLA-B*07:02
653
173
Heli B7.02_173
9
RPPLNRNYV
0.548195
0.21





HLA-B*07:02
654
241
Heli B7.02_241
9
VPQEHYVRI
0.503526
0.24





HLA-B*07:02
655
233
Heli B7.02_233
9
MPLSAPTLV
0.384993
0.35





HLA-B*07:02
656
405
Heli B7.02_405
9
LPAPRTLLT
0.340578
0.43





HLA-B*07:02
657
168
Heli B7.02_168
9
EVGKPRPPL
0.249123
0.57





HLA-B*07:02
658
 73
Heli B7.02_73
9
KSHKPPISF
0.2024
0.67





HLA-B*07:02
659
325
Heli B7.02_325
9
LPIDKCSRI
0.200774
0.68





HLA-B*07:02
660
407
Heli B7.02_407
9
APRTLLTKG
0.160167
0.82





HLA-B*15:01
661
269
Heli B15.01_269
9
YQKVGMQKY
0.977243
0.01





HLA-B*15:01
662
209
Heli B15.01_209
9
VVYRGTTTY
0.975234
0.01





HLA-B*15:01
663
 62
Heli B15.01_62
9
QLYLGGMSY
0.958133
0.01





HLA-B*15:01
664
347
Heli B15.01_347
9
KVNSTLEQY
0.888728
0.03





HLA-B*15:01
665
137
Heli B15.01_137
9
TLKATEETF
0.87839
0.03





HLA-B*15:01
666
390
Heli B15.01_390
9
RLRAKHYVY
0.791589
0.08





HLA-B*15:01
667
 73
Heli B15.01_73
9
KSHKPPISF
0.769419
0.08





HLA-B*15:01
668
403
Heli B15.01_403
9
AQLPAPRTL
0.767849
0.09





HLA-B*15:01
669
 40
Heli B15.01_40
9
KLVLSVNPY
0.722868
0.12





HLA-B*15:01
670
316
Heli B15.01_316
9
ALCEKALKY
0.708448
0.13





HLA-B*15:01
671
491
Heli B15.01_491
9
PQIGVVREF
0.447628
0.38





HLA-B*15:01
672
291
Heli B15.01_291
9
FAIGLALYY
0.384168
0.44





HLA-B*15:01
673
192
Heli B15.01_192
9
KVQIGEYTF
0.320994
0.59





HLA-B*15:01
674
190
Heli B15.01_190
9
NSKVQIGEY
0.261432
0.74





HLA-B*15:01
675
428
Heli B15.01_428
9
LMKTIGPDM
0.248093
0.77





HLA-B*15:01
676
517
Heli B15.01_517
9
SQNAVASKI
0.207199
0.91





HLA-B*15:01
677
225
Heli B15.01_225
9
FVLTSHTVM
0.202742
0.94





HLA-B*15:01
678
 33
Heli B15.01_33
9
HVISTSHKL
0.190861
0.98





HLA-B*15:01
679
507
Heli B15.01_507
9
RKAVFISPY
0.179546
1.1





HLA-B*40:01
680
155
Heli B40.01_155
9
REVLSDREL
0.952984
0.03





HLA-B*40:01
681
161
Heli B40.01_161
9
RELHLSWEV
0.756562
0.13





HLA-B*40:01
682
403
Heli B40.01_403
9
AQLPAPRTL
0.490454
0.28





HLA-B*40:01
683
446
Heli B40.01_446
9
AEIVDTVSA
0.278573
0.49





HLA-B*40:01
684
417
Heli B40.01_417
9
LEPEYFNSV
0.23923
0.56





HLA-B*40:01
685
539
Heli B40.01_539
9
SEYDYVIFT
0.167767
0.72





HLA-B*58:01
686
 73
Heli B58.01_73
9
KSHKPPISF
0.972889
0.02





HLA-B*58:01
687
467
Heli B58.01_467
9
KSAQCFKMF
0.67623
0.2





HLA-B*58:01
688
347
Heli B58.01_347
9
KVNSTLEQY
0.629461
0.23





HLA-B*58:01
689
192
Heli B58.01_192
9
KVQIGEYTF
0.628101
0.23





HLA-B*58:01
690
430
Heli B58.01_430
9
KTIGPDMFL
0.560184
0.28





HLA-B*58:01
691
139
Heli B58.01_139
9
KATEETFKL
0.505766
0.32





HLA-B*58:01
692
209
Heli B58.01_209
9
VVYRGTTTY
0.495186
0.32





HLA-B*58:01
693
291
Heli B58.01_291
9
FAIGLALYY
0.409308
0.4





HLA-B*58:01
694
 57
Heli B58.01_57
9
VTDVTQLYL
0.318403
0.53





HLA-B*58:01
695
159
Heli B58.01_159
9
SDRELHLSW
0.274911
0.58





HLA-B*58:01
696
 35
Heli B58.01_35
9
ISTSHKLVL
0.267303
0.62





HLA-B*58:01
697
349
Heli B58.01_349
9
NSTLEQYVF
0.267298
0.62





HLA-B*58:01
698
365
Heli B58.01_365
9
ETTADIVVF
0.172291
0.87





HLA-B*58:01
699
414
Heli B58.01_414
9
KGTLEPEYF
0.169262
0.88
















TABLE 3







Exonuclease















Sequence

Sequence






Allele
ID No.
Start
Annotation
Length
Sequence
Score
Rank





HLA-A*01:01
700
343
ExNuc A1.01_343
9
QADVEWKFY
0.845088
0.05





HLA-A*01:01
701
362
ExNuc A1.01_362
9
KIEELFYSY
0.580712
0.15





HLA-A*01:01
702
252
ExNuc A1.01_252
9
NLQSNHDLY
0.565641
0.16





HLA-A*01:01
703
447
ExNuc A1.01_447
9
YSDSPCESH
0.529832
0.18





HLA-A*01:01
704
229
ExNuc A1.01_229
9
HSIGFDYVY
0.411767
0.26





HLA-A*01:01
705
322
ExNuc A1.01_322
9
LADKFPVLH
0.392505
0.28





HLA-A*01:01
706
377
ExNuc A1.01_377
9
FTDGVCLFW
0.345315
0.31





HLA-A*01:01
707
509
ExNuc A1.01_509
9
WVYKQFDTY
0.228543
0.5





HLA-A*01:01
708
373
ExNuc A1.01_373
9
HSDKFTDGV
0.214528
0.51





HLA-A*01:01
709
241
ExNuc A1.01_241
9
MIDVQQWGF
0.212108
0.51





HLA-A*01:01
710
438
ExNuc A1.01_438
9
NLKQLPFFY
0.204092
0.54





HLA-A*01:01
711
503
ExNuc A1.01_503
9
SAGFSLWVY
0.203487
0.54





HLA-A*02:01
712
321
ExNuc A2.01_321
9
LLADKFPVL
0.936725
0.03





HLA-A*02:01
713
176
ExNuc A2.01_176
9
NLSDRVVFV
0.931365
0.03





HLA-A*02:01
714
184
ExNuc A2.01_184
9
VLWAHGFEL
0.920969
0.03





HLA-A*02:01
715
494
ExNuc A2.01_494
9
YLDAYNMMI
0.909759
0.03





HLA-A*02:01
716
169
ExNuc A2.01_169
9
MLSDTLKNL
0.760198
0.1





HLA-A*02:01
717
320
ExNuc A2.01_320
9
ALLADKFPV
0.73306
0.11





HLA-A*02:01
718
518
ExNuc A2.01_518
9
NLWNTFTRL
0.700692
0.13





HLA-A*02:01
719
  6
ExNuc A2.01_6
9
GLFKDCSKV
0.583242
0.21





HLA-A*02:01
720
500
ExNuc A2.01_500
9
MMISAGFSL
0.447826
0.32





HLA-A*02:01
721
107
ExNuc A2.01_107
9
LQLGFSTGV
0.431342
0.33





HLA-A*02:01
722
 21
ExNuc A2.01_21
9
TQAPTHLSV
0.428025
0.33





HLA-A*02:01
723
156
ExNuc A2.01_156
9
GLPWNVVRI
0.2757
0.59





HLA-A*02:01
724
 37
ExNuc A2.01_37
9
GLCVDIPGI
0.249963
0.64





HLA-A*02:01
725
492
ExNuc A2.01_492
9
RLYLDAYNM
0.223591
0.72





HLA-A*03:01
726
 53
ExNuc A3.01_53
9
RLISMMGFK
0.880959
0.04





HLA-A*03:01
727
328
ExNuc A3.01_328
9
VLHDIGNPK
0.784051
0.1





HLA-A*03:01
728
 56
ExNuc A3.01_56
9
SMMGFKMNY
0.704771
0.15





HLA-A*03:01
729
 61
ExNuc A3.01_61
9
KMNYQVNGY
0.513948
0.32





HLA-A*03:01
730
504
ExNuc A3.01_504
9
AGFSLWVYK
0.450184
0.41





HLA-A*03:01
731
368
ExNuc A3.01_368
9
YSYATHSDK
0.395871
0.49





HLA-A*03:01
732
281
ExNuc A3.01_281
9
AVHECFVKR
0.331708
0.59





HLA-A*03:01
733
 26
ExNuc A3.01_26
9
HLSVDTKFK
0.321534
0.61





HLA-A*03:01
734
 39
ExNuc A3.01_39
9
CVDIPGIPK
0.303197
0.65





HLA-A*03:01
735
188
ExNuc A3.01_188
9
HGFELTSMK
0.232477
0.84





HLA-A*03:01
736
270
ExNuc A3.01_270
9
ASCDAIMTR
0.215693
0.88





HLA-A*03:01
737
192
ExNuc A3.01_192
9
LTSMKYFVK
0.201816
0.92





HLA-A*24:02
738
369
ExNuc A24.02_369
9
SYATHSDKF
0.87261
0.04





HLA-A*24:02
739
223
ExNuc A24.02_223
9
TYACWHHSI
0.6881
0.11





HLA-A*24:02
740
493
ExNuc A24.02_493
9
LYLDAYNMM
0.66176
0.12





HLA-A*24:02
741
376
ExNuc A24.02_376
9
KFTDGVCLF
0.607745
0.14





HLA-A*24:02
742
153
ExNuc A24.02_153
9
MYKGLPWNV
0.521798
0.18





HLA-A*24:02
743
182
ExNuc A24.02_182
9
VFVLWAHGF
0.520171
0.18





HLA-A*24:02
744
512
ExNuc A24.02_512
9
KQFDTYNLW
0.459256
0.22





HLA-A*24:02
745
234
ExNuc A24.02_234
9
DYVYNPFMI
0.39123
0.26





HLA-A*24:02
746
239
ExNuc A24.02_239
9
PFMIDVQQW
0.358467
0.29





HLA-A*24:02
747
236
ExNuc A24.02_236
9
VYNPFMIDV
0.318057
0.33





HLA-A*24:02
748
295
ExNuc A24.02_295
9
EYPIIGDEL
0.311669
0.33





HLA-A*24:02
749
359
ExNuc A24.02_359
9
KAYKIEELF
0.202243
0.49





HLA-A*24:02
750
 50
ExNuc A24.02_50
9
TYRRLISMM
0.192555
0.52





HLA-A*24:02
751
423
ExNuc A24.02_423
9
KHAFHTPAF
0.191351
0.52





HLA-A*24:02
752
418
ExNuc A24.02_418
9
SLYVNKHAF
0.187194
0.53





HLA-A*26:01
753
515
ExNuc A26.01_515
9
DTYNLWNTF
0.81449
0.04





HLA-A*26:01
754
509
ExNuc A26.01_509
9
WVYKQFDIY
0.625643
0.07





HLA-A*26:01
755
116
ExNuc A26.01_116
9
NLVAVPTGY
0.476247
0.12





HLA-A*26:01
756
229
ExNuc A26.01_229
9
HSIGFDYVY
0.445885
0.14





HLA-A*26:01
757
 49
ExNuc A26.01_49
9
MTYRRLISM
0.441044
0.14





HLA-A*26:01
758
 65
ExNuc A26.01_65
9
QVNGYPNMF
0.393098
0.18





HLA-A*26:01
759
 78
ExNuc A26.01_78
9
EAIRHVRAW
0.358489
0.2





HLA-A*26:01
760
 56
ExNuc A26.01_56
9
SMMGFKMNY
0.276548
0.27





HLA-A*26:01
761
438
ExNuc A26.01_438
9
NLKQLPFFY
0.216819
0.34





HLA-B*07:02
762
 19
ExNuc B07.02_19
9
HPTQAPTHL
0.944978
0.04





HLA-B*07:02
763
428
ExNuc B07.02_428
9
TPAFDKSAF
0.82409
0.07





HLA-B*07:02
764
466
ExNuc B07.02_466
9
VPLKSATCI
0.343923
0.42





HLA-B*07:02
765
487
ExNuc B07.02_487
9
HANEYRLYL
0.274952
0.51





HLA-B*07:02
766
161
ExNuc B07.02_161
9
VVRIKIVQM
0.268223
0.53





HLA-B*07:02
767
411
ExNuc B07.02_411
9
LPGCDGGSL
0.265949
0.53





HLA-B*15:01
768
418
ExNuc B15.01_418
9
SLYVNKHAF
0.837903
0.05





HLA-B*15:01
769
 56
ExNuc B15.01_56
9
SMMGFKMNY
0.81477
0.06





HLA-B*15:01
770
 61
ExNuc B15.01_61
9
KMNYQVNGY
0.771234
0.08





HLA-B*15:01
771
509
ExNuc B15.01_509
9
WVYKQFDIY
0.679954
0.15





HLA-B*15:01
772
457
ExNuc B15.01_457
9
KQVVSDIDY
0.679003
0.15





HLA-B*15:01
773
353
ExNuc B15.01_353
9
AQPCSDKAY
0.636589
0.19





HLA-B*15:01
774
116
ExNuc B15.01_116
9
NLVAVPTGY
0.536427
0.25





HLA-B*15:01
775
512
ExNuc B15.01_512
9
KQFDTYNLW
0.534301
0.26





HLA-B*15:01
776
362
ExNuc B15.01_362
9
KIEELFYSY
0.465824
0.33





HLA-B*15:01
777
438
ExNuc B15.01_438
9
NLKQLPFFY
0.462862
0.35





HLA-B*15:01
778
 64
ExNuc B15.01_64
9
YQVNGYPNM
0.45844
0.36





HLA-B*15:01
779
 21
ExNuc B15.01_21
9
TQAPTHLSV
0.432814
0.4





HLA-B*15:01
780
229
ExNuc B15.01_229
9
HSIGFDYVY
0.416088
0.41





HLA-B*15:01
781
436
ExNuc B15.01_436
9
FVNLKQLPF
0.402551
0.43





HLA-B*15:01
782
359
ExNuc B15.01_359
9
KAYKIEELF
0.373895
0.47





HLA-B*15:01
783
 65
ExNuc B15.01_65
9
QVNGYPNMF
0.341338
0.54





HLA-B*15:01
784
500
ExNuc B15.01_500
9
MMISAGFSL
0.287737
0.67





HLA-B*15:01
785
321
ExNuc B15.01_321
9
LLADKFPVL
0.240636
0.8





HLA-B*15:01
786
161
ExNuc B15.01_161
9
VVRIKIVQM
0.229016
0.84





HLA-B*15:01
787
146
ExNuc B15.01_146
9
FKHLIPLMY
0.195674
0.96





HLA-B*40:01
788
190
ExNuc B40.01_190
9
FELTSMKYF
0.242418
0.56





HLA-B*40:01
789
452
ExNuc B40.01_452
9
CESHGKQVV
0.195196
0.65





HLA-B*58:01
790
359
ExNuc B58.01_359
9
KAYKIEELF
0.981071
0.01





HLA-B*58:01
791
501
ExNuc B58.01_501
9
MISAGFSLW
0.880824
0.07





HLA-B*58:01
792
219
ExNuc B58.01_219
9
TASDTYACW
0.877406
0.07





HLA-B*58:01
793
512
ExNuc B58.01_512
9
KQFDTYNLW
0.875971
0.07





HLA-B*58:01
794
377
ExNuc B58.01_377
9
FTDGVCLFW
0.823576
0.1





HLA-B*58:01
795
318
ExNuc B58.01_318
9
KAALLADKF
0.682783
0.2





HLA-B*58:01
796
 78
ExNuc B58.01_78
9
EAIRHVRAW
0.671935
0.21





HLA-B*58:01
797
229
ExNuc B58.01_229
9
HSIGFDYVY
0.67101
0.21





HLA-B*58:01
798
506
ExNuc B58.01_506
9
FSLWVYKQF
0.479531
0.34





HLA-B*58:01
799
177
ExNuc B58.01_177
9
LSDRVVFVL
0.330223
0.51





HLA-B*58:01
800
340
ExNuc B58.01_340
9
CVPQADVEW
0.295757
0.56





HLA-B*58:01
801
 49
ExNuc B58.01_49
9
MTYRRLISM
0.247743
0.64





HLA-B*58:01
802
193
ExNuc B58.01_193
9
TSMKYFVKI
0.219788
0.72





HLA-B*58:01
803
278
ExNuc B58.01_278
9
RCLAVHECF
0.204788
0.76
















TABLE 4







EndoRNAse















Sequence

Sequence






Allele
ID No.
Start
Annotation
Length
Sequence
Score
Rank





HLA-A*01:01
804
217
EndoRNA A1.01_217
9
AMDEFIERY
0.957737
0.02





HLA-A*01:01
805
270
EndoRNA A1.01_270
9
PMDSTVKNY
0.565302
0.16





HLA-A*01:01
806
185
EndoRNA A1.01_185
9
VVQQLPETY
0.455162
0.22





HLA-A*01:01
807
126
EndoRNA A1.01_126
9
RVDGQVDLF
0.403582
0.27





HLA-A*01:01
808
 80
EndoRNA A1.01_80
9
AANTVIWDY
0.340249
0.32





HLA-A*01:01
809
324
EndoRNA A1.01_324
9
YTEISFMLW
0.313596
0.35





HLA-A*01:01
810
 24
EndoRNA A1.01_24
9
VSIINNTVY
0.278296
0.41





HLA-A*01:01
811
170
EndoRNA A1.01_170
9
EAVKTQFNY
0.229558
0.49





HLA-A*01:01
812
321
EndoRNA A1.01_321
9
TIDYTEISF
0.205817
0.53





HLA-A*01:01
813
171
EndoRNA A1.01_171
9
AVKTQFNYY
0.193908
0.55





HLA-A*02:01
814
297
EndoRNA A2.01_297
9
LLLDDFVEI
0.949682
0.02





HLA-A*02:01
815
181
EndoRNA A2.01_181
9
KVDGVVQQL
0.836673
0.06





HLA-A*02:01
816
312
EndoRNA A2.01_312
9
SVVSKVVKV
0.798599
0.08





HLA-A*02:01
817
243
EndoRNA A2.01_243
9
SQLGGLHLL
0.787679
0.09





HLA-A*02:01
818
298
EndoRNA A2.01_298
9
LLDDFVEll
0.677666
0.15





HLA-A*02:01
819
 34
EndoRNA A2.01_34
9
KVDGVDVEL
0.580429
0.21





HLA-A*02:01
820
  1
EndoRNA A2.01_1
9
SLENVAFNV
0.531107
0.24





HLA-A*02:01
821
 30
EndoRNA A2.01_30
9
TVYTKVDGV
0.38397
0.39





HLA-A*02:01
822
 41
EndoRNA A2.01_41
9
ELFENKTTL
0.368489
0.41





HLA-A*02:01
823
 18
EndoRNA A2.01_18
9
QQGEVPVSI
0.290526
0.57





HLA-A*02:01
824
147
EndoRNA A2.01_147
9
SVKGLQPSV
0.248532
0.65





HLA-A*02:01
825
244
EndoRNA A2.01_244
9
QLGGLHLLI
0.205953
0.77





HLA-A*03:01
826
 26
EndoRNA A3.01_26
9
IINNTVYTK
0.854165
0.05





HLA-A*03:01
827
150
EndoRNA A3.01_150
9
GLQPSVGPK
0.821194
0.07





HLA-A*03:01
828
173
EndoRNA A3.01_173
9
KTQFNYYKK
0.658832
0.2





HLA-A*03:01
829
308
EndoRNA A3.01_308
9
SQDLSVVSK
0.411216
0.47





HLA-A*03:01
830
316
EndoRNA A3.01_316
9
KVVKVTIDY
0.386608
0.51





HLA-A*03:01
831
141
EndoRNA A3.01_141
9
VLITEGSVK
0.353311
0.55





HLA-A*03:01
832
  4
EndoRNA A3.01_4
9
NVAFNVVNK
0.332838
0.59





HLA-A*03:01
833
171
EndoRNA A3.01_171
9
AVKTQFNYY
0.301184
0.66





HLA-A*03:01
834
 62
EndoRNA A3.01_62
9
NIKPVPEVK
0.196395
0.94





HLA-A*24:02
835
224
EndoRNA A24.02_224
9
RYKLEGYAF
0.809296
0.06





HLA-A*24:02
836
192
EndoRNA A24.02_192
9
TYFTQSRNL
0.711481
0.09





HLA-A*24:02
837
257
EndoRNA A24.02_257
9
RFKESPFEL
0.692999
0.11





HLA-A*24:02
838
323
EndoRNA A24.02_323
9
DYTEISFML
0.55007
0.17





HLA-A*24:02
839
195
EndoRNA A24.02_195
9
TQSRNLQEF
0.192654
0.52





HLA-A*26:01
840
170
EndoRNA A26.01_170
9
EAVKTQFNY
0.817141
0.04





HLA-A*26:01
841
171
EndoRNA A26.01_171
9
AVKTQFNYY
0.546803
0.1





HLA-A*26:01
842
 47
EndoRNA A26.01_47
9
TTLPVNVAF
0.441299
0.14





HLA-A*26:01
843
312
EndoRNA A26.01_312
9
SVVSKVVKV
0.285044
0.26





HLA-A*26:01
844
 41
EndoRNA A26.01_41
9
ELFENKTTL
0.23014
0.32





HLA-A*26:01
845
 78
EndoRNA A26.01_78
9
DIAANTVIW
0.2014
0.38





HLA-A*26:01
846
217
EndoRNA A26.01_217
9
AMDEFIERY
0.192512
0.39





HLA-A*26:01
847
185
EndoRNA A26.01_185
9
VVQQLPETY
0.190377
0.39





HLA-B*07:02
848
 64
EndoRNA A07.02_64
9
KPVPEVKIL
0.87903
0.05





HLA-B*07:02
849
154
EndoRNA A07.02_154
9
SVGPKQASL
0.468325
0.27





HLA-B*07:02
850
 49
EndoRNA A07.02_49
9
LPVNVAFEL
0.449505
0.29





HLA-B*07:02
851
152
EndoRNA A07.02_152
9
QPSVGPKQA
0.195335
0.69





HLA-B*15:01
852
195
EndoRNA A15.01_195
9
TQSRNLQEF
0.846768
0.04





HLA-B*15:01
853
185
EndoRNA A15.01_185
9
VVQQLPETY
0.832035
0.05





HLA-B*15:01
854
186
EndoRNA A15.01_186
9
VQQLPETYF
0.831024
0.05





HLA-B*15:01
855
171
EndoRNA A15.01_171
9
AVKTQFNYY
0.785894
0.08





HLA-B*15:01
856
316
EndoRNA A15.01_316
9
KVVKVTIDY
0.7276
0.12





HLA-B*15:01
857
 24
EndoRNA A15.01_24
9
VSIINNTVY
0.567129
0.23





HLA-B*15:01
858
217
EndoRNA A15.01_217
9
AMDEFIERY
0.491104
0.31





HLA-B*15:01
859
114
EndoRNA A15.01_114
9
TICAPLTVF
0.478278
0.32





HLA-B*15:01
860
250
EndoRNA A15.01_250
9
LLIGLAKRF
0.467424
0.33





HLA-B*15:01
861
243
EndoRNA A15.01_243
9
SQLGGLHLL
0.446004
0.38





HLA-B*15:01
862
 47
EndoRNA A15.01_47
9
TTLPVNVAF
0.41661
0.41





HLA-B*40:01
863
201
EndoRNA A40.01_201
9
QEFKPRSQM
0.899475
0.06





HLA-B*40:01
864
303
EndoRNA A40.01_303
9
VEIIKSQDL
0.768551
0.13





HLA-B*40:01
865
263
EndoRNA A40.01_263
9
FELEDFIPM
0.68244/1
0.16





HLA-B*40:01
866
219
EndoRNA A40.01_219
9
DEFIERYKL
0.436831
0.32





HLA-B*40:01
867
243
EndoRNA A40.01_243
9
SQLGGLHLL
0.410667
0.36





HLA-B*40:01
868
 43
EndoRNA A40.01_43
9
FENKTTLPV
0.355108
0.42





HLA-B*40:01
869
  2
EndoRNA A40.01_2
9
LENVAFNVV
0.323778
0.45





HLA-B*40:01
870
 67
EndoRNA A40.01_67
9
PEVKILNNL
0.321118
0.46





HLA-B*40:01
871
 55
EndoRNA A40.01_55
9
FELWAKRNI
0.24051
0.56





HLA-B*58:01
872
324
EndoRNA A58.01_324
9
YTEISFMLW
0.93187
0.04





HLA-B*58:01
873
 47
EndoRNA A58.01_47
9
TTLPVNVAF
0.878973
0.07





HLA-B*58:01
874
115
EndoRNA A58.01_115
9
ICAPLTVFF
0.688753
0.2





HLA-B*58:01
875
 24
EndoRNA A58.01_24
9
VSIINNTVY
0.474435
0.34





HLA-B*58:01
876
 50
EndoRNA A58.01_50
9
PVNVAFELW
0.457307
0.35





HLA-B*58:01
877
185
EndoRNA A58.01_185
9
VVQQLPETY
0.407242
0.4





HLA-B*58:01
878
316
EndoRNA A58.01_316
9
KVVKVTIDY
0.347361
0.48





HLA-B*58:01
879
181
EndoRNA A58.01_181
9
KVDGVVQQL
0.305934
0.55





HLA-B*58:01
880
 80
EndoRNA A58.01_80
9
AANTVIWDY
0.300606
0.55





HLA-B*58:01
881
126
EndoRNA A58.01_126
9
RVDGQVDLF
0.266188
0.62





HLA-B*58:01
882
107
EndoRNA A58.01_107
9
IAKKPTETI
0.217414
0.73
















TABLE 5







Methyl-Transferase















Sequence

Sequence






Allele
ID No.
Start
Annotation
Length
Sequence
Score
Rank





HLA-A*01:01
883
143
MthlTrns A1.01_143
9
DSKEGFFTY
0.454043
0.22





HLA-A*01:01
884
 55
MthlTrns A1.01_55
9
NTLTLAVPY
0.221872
0.5





HLA-A*01:01
885
103
MthlTrns A1.01_103
9
VSDADSTLI
0.214979
0.51





HLA-A*02:01
886
 53
MthlTrns A2.01_53
9
YLNTLTLAV
0.893078
0.03





HLA-A*02:01
887
109
MthlTrns A2.01_109
9
TLIGDCATV
0.667919
0.15





HLA-A*02:01
888
265
MthlTrns A2.01_265
9
QINDMILSL
0.569069
0.22





HLA-A*02:01
889
 87
MthlTrns A2.01_87
9
WLPTGTLLV
0.419197
0.34





HLA-A*02:01
890
102
MthlTrns A2.01_102
9
FVSDADSTL
0.404239
0.36





HLA-A*02:01
891
169
MthlTrns A2.01_169
9
KITEHSWNA
0.348693
0.45





HLA-A*02:01
892
 76
MthlTrns A2.01_76
9
KVAPGTAVL
0.315817
0.52





HLA-A*02:01
893
210
MthlTrns A2.01_210
9
YLGKPREQI
0.29128
0.57





HLA-A*02:01
894
 49
MthlTrns A2.01_49
9
QLCQYLNTL
0.286361
0.57





HLA-A*03:01
895
  8
MthlTrns A3.01_8
9
GVAMPNLYK
0.924863
0.02





HLA-A*03:01
896
 16
MthlTrns A3.01_16
9
KMQRMLLEK
0.887509
0.04





HLA-A*03:01
897
161
MthlTrns A3.01_161
9
ALGGSVAIK
0.779901
0.1





HLA-A*03:01
898
173
MthlTrns A3.01_173
9
HSWNADLYK
0.471426
0.38





HLA-A*03:01
899
151
MthlTrns A3.01_151
9
YICGFIQQK
0.453062
0.4





HLA-A*03:01
900
254
MthlTrns A3.01_254
9
RGTAVMSLK
0.333629
0.59





HLA-A*03:01
901
165
MthlTrns A3.01_165
9
SVAIKITEH
0.209014
0.9





HLA-A*24:02
902
 46
MthlTrns A24.02_46
9
KYTQLCQYL
0.825165
0.05





HLA-A*24:02
903
 62
MthlTrns A24.02_62
9
PYNMRVIHF
0.798089
0.06





HLA-A*24:02
904
236
MthlTrns A24.02_236
9
IQLSSYSLF
0.543077
0.18





HLA-A*24:02
905
 86
MthlTrns A24.02_86
9
QWLPTGTLL
0.511035
0.19





HLA-A*24:02
906
130
MthlTrns A24.02_130
9
MYDPKTKNV
0.460873
0.22





HLA-A*24:02
907
174
MthlTrns A24.02_174
9
SWNADLYKL
0.423192
0.24





HLA-A*24:02
908
222
MthlTrns A24.02_222
9
VMHANYIFW
0.318127
0.33





HLA-A*24:02
909
 14
MthlTrns A24.02_14
9
LYKMQRMLL
0.266761
0.38





HLA-A*24:02
910
221
MthlTrns A24.02_221
9
YVMHANYIF
0.25404
0.4





HLA-A*24:02
911
220
MthlTrns A24.02_220
9
GYVMHANYI
0.209373
0.47





HLA-A*24:02
912
181
MthlTrns A24.02_181
9
KLMGHFAWW
0.153946
0.64





HLA-A*26:01
913
143
MthlTrns A26.01_143
9
DSKEGFFTY
0.845441
0.03





HLA-A*26:01
914
202
MthlTrns A26.01_202
9
EAFLIGCNY
0.541176
0.1





HLA-A*26:01
915
178
MthlTrns A26.01_178
9
DLYKLMGHF
0.444497
0.14





HLA-A*26:01
916
 55
MthlTrns A26.01_55
9
NTLTLAVPY
0.229069
0.32





HLA-A*26:01
917
 39
MthlTrns A26.01_39
9
GIMMNVAKY
0.220047
0.34





HLA-A*26:01
918
221
MthlTrns A26.01_221
9
YVMHANYIF
0.16875
0.44





HLA-A*26:01
919
265
MthlTrns A26.01_265
9
QINDMILSL
0.16064
0.46





HLA-A*26:01
920
  3
MthlTrns A26.01_3
9
QAWQPGVAM
0.158924
0.47





HLA-B*07:02
921
 76
MthlTrns B07.02_76
9
KVAPGTAVL
0.64873
0.15





HLA-B07:02
922
 36
MthlTrns B07.02_36
9
LPKGIMMNV
0.485039
0.25





HLA-B*07:02
923
249
MthlTrns B07.02_249
9
FPLKLRGTA
0.459763
0.29





HLA-B*07:02
924
  6
MthlTrns B07.02_6
9
QPGVAMPNL
0.432742
0.31





HLA-B*07:02
925
  3
MthlTrns B07.02_3
9
QAWQPGVAM
0.401898
0.34





HLA-B*07:02
926
213
MthlTrns B07.02_213
9
KPREQIDGY
0.322174
0.44





HLA-B*15:01
927
236
MthlTrns B15.01_236
9
IQLSSYSLF
0.79202
0.08





HLA-B*15:01
928
 85
MthlTrns B15.01_85
9
RQWLPTGTL
0.59587
0.21





HLA-B*15:01
929
 39
MthlTrns B15.01_39
9
GIMMNVAKY
0.487521
0.31





HLA-B*15:01
930
 76
MthlTrns B15.01_76
9
KVAPGTAVL
0.478756
0.32





HLA-B*15:01
931
 51
MthlTrns B15.01_51
9
CQYLNTLTL
0.342603
0.54





HLA-B*15:01
932
221
MthlTrns B15.01_221
9
YVMHANYIF
0.307316
0.62





HLA-B*15:01
933
143
MthlTrns B15.01_143
9
DSKEGFFTY
0.241386
0.8





HLA-B*15:01
934
 45
MthlTrns B15.01_45
9
AKYTQLCQY
0.237921
0.8





HLA-B*15:01
935
  3
MthlTrns B15.01_3
9
QAWQPGVAM
0.207896
0.91





HLA-B*15:01
936
167
MthlTrns B15.01_167
9
AIKITEHSW
0.16209
1.2





HLA-B*40:01
937
215
MthlTrns B40.01_215
9
REQIDGYVM
0.880693
0.07





HLA-B*40:01
938
140
MthlTrns B40.01_140
9
KENDSKEGF
0.753159
0.13





HLA-B*40:01
939
171
MthlTrns B40.01_171
9
TEHSWNADL
0.752718
0.13





HLA-B*40:01
940
262
MthlTrns B40.01_262
9
KEGQINDMI
0.41942
0.35





HLA-B*40:01
941
 85
MthlTrns B40.01_85
9
RQWLPTGTL
0.332816
0.45





HLA-B*58:01
942
115
MthlTrns B58.01_115
9
ATVHTANKW
0.968816
0.02





HLA-B*58:01
943
167
MthlTrns B58.01_167
9
AIKITEHSW
0.695131
0.19





HLA-B*58:01
944
181
MthlTrns B58.01_181
9
KLMGHFAWW
0.641824
0.22





HLA-B*58:01
945
241
MthlTrns B58.01_241
9
YSLFDMSKF
0.631228
0.23





HLA-B*58:01
946
222
MthlTrns B58.01_222
9
VMHANYIFW
0.581609
0.27





HLA-B*58:01
947
  9
MthlTrns B58.01_9
9
VAMPNLYKM
0.543539
0.29





HLA-B*58:01
948
 57
MthlTrns B58.01_57
9
LTLAVPYNM
0.448643
0.37





HLA-B*58:01
949
 79
MthlTrns B58.01_79
9
PGTAVLRQW
0.325426
0.52





HLA-B*58:01
950
221
MthlTrns B58.01_221
9
YVMHANYIF
0.237627
0.68





HLA-B*58:01
951
198
MthlTrns B58.01_198
9
ASSSEAFLI
0.232174
0.69





HLA-B*58:01
952
 34
MthlTrns B58.01_34
9
ATLPKGIMM
0.210574
0.75





HLA-B*58:01
953
 76
MthlTrns B58.01_76
9
KVAPGTAVL
0.170736
0.87
















TABLE 6







Membrane















Sequence

Sequence






Allele
ID No.
Start
Annotation
Length
Sequence
Score
Rank





HLA-A*01:01
 954
171
Mem A1.01_171
9
ATSRTLSYY
0.903381
0.03





HLA-A*01:01
 955
 39
Mem A1.01_39
9
YANRNRFLY
0.69359
0.1





HLA-A*01:01
 956
188
Mem A1.01_188
9
AGDSGFAAY
0.664115
0.1





HLA-A*01:01
 957
213
Mem A1.01_213
9
SSDNIALLV
0.59507
0.14





HLA-A*01:01
 958
170
Mem A1.01_170
9
VATSRTLSY
0.591769
0.14





HLA-A*02:01
 959
 15
Mem A2.01_15
9
KLLEQWNLV
0.892896
0.03





HLA-A*02:01
 960
 65
Mem A2.01_65
9
FVLAAVYRI
0.460313
0.29





HLA-A*02:01
 961
108
Mem A2.01_108
9
SMWSFNPET
0.447798
0.32





HLA-A*02:01
 962
 89
Mem A2.01_89
9
GLMWLSYFI
0.432387
0.33





HLA-A*02:01
 963
 56
Mem A2.01_56
9
LLWPVTLAC
0.256837
0.63





HLA-A*02:01
 964
 55
Mem A2.01_55
9
WLLWPVTLA
0.177647
0.89





HLA-A*02:01
 965
101
Mem A2.01_101
9
RLFARTRSM
0.173774
0.91





HLA-A*03:01
 966
150
Mem A3.01_150
9
RIAGHHLGR
0.795487
0.09





HLA-A*03:01
 967
171
Mem A3.01_171
9
ATSRTLSYY
0.410177
0.47





HLA-A*03:01
 968
172
Mem A3.01_172
9
TSRTLSYYK
0.319722
0.62





HLA-A*03:01
 969
  6
Mem A3.01_6
9
GTITVEELK
0.240085
0.82





HLA-A*03:01
 970
 93
Mem A3.01_93
9
LSYFIASFR
0.235381
0.83





HLA-A*03:01
 971
138
Mem A3.01_138
9
LVIGAVILR
0.201656
0.93





HLA-A*24:02
 972
 95
Mem A24.02_95
9
YFIASFRLF
0.913594
0.02





HLA-A*24:02
 973
 94
Mem A24.02_94
9
SYFIASFRL
0.83484
0.05





HLA-A*24:02
 974
 46
Mem A24.02_46
9
LYIIKLIFL
0.685153
0.11





HLA-A*24:02
 975
198
Mem A24.02_198
9
RYRIGNYKL
0.670153
0.12





HLA-A*24:02
 976
 54
Mem A24.02_54
9
LWLLWPVTL
0.522979
0.18





HLA-A*24:02
 977
 57
Mem A24.02_57
9
LWPVTLACF
0.472421
0.21





HLA-A*24:02
 978
111
Mem A24.02_111
9
SFNPETNIL
0.364757
0.28





HLA-A*24:02
 979
 38
Mem A24.02_38
9
AYANRNRFL
0.364322
0.28





HLA-A*24:02
 980
102
Mem A24.02_102
9
LFARTRSMW
0.264358
0.39





HLA-A*24:02
 981
 44
Mem A24.02_44
9
RFLYIIKLI
0.237445
0.42





HLA-A*26:01
 982
171
Mem A26.01_171
9
ATSRTLSYY
0.465163
0.13





HLA-A*26:01
 983
170
Mem A26.01_170
9
VATSRTLSY
0.253246
0.29





HLA-A*26:01
 984
196
Mem A26.01_196
9
YSRYRIGNY
0.22437
0.33





HLA-A*26:01
 985
 39
Mem A26.01_39
9
YANRNRFLY
0.172251
0.43





HLA-A*26:01
 986
 37
Mem A26.01_37
9
FAYANRNRF
0.167471
0.44





HLA-B*07:02
 987
164
Mem B7.02_164
9
LPKEITVAT
0.452806
0.29





HLA-B*07:02
 988
148
Mem B7.02_148
9
HLRIAGHHL
0.369452
0.37





HLA-B*07:02
 989
101
Mem B7.02_101
9
RLFARTRSM
0.33272
0.43





HLA-B*15:01
 990
101
Mem B15.01_101
9
RLFARTRSM
0.715954
0.12





HLA-B*15:01
 991
170
Mem B15.01_170
9
VATSRTLSY
0.649814
0.18





HLA-B*15:01
 992
171
Mem B15.01_171
9
ATSRTLSYY
0.454935
0.36





HLA-B*15:01
 993
 37
Mem B15.01_37
9
FAYANRNRF
0.373977
0.47





HLA-B*15:01
 994
191
Mem B15.01_191
9
SGFAAYSRY
0.344812
0.53





HLA-B*15:01
 995
 39
Mem B15.01_39
9
YANRNRFLY
0.292655
0.65





HLA-B*15:01
 996
196
Mem B15.01_196
9
YSRYRIGNY
0.285031
0.68





HLA-B*15:01
 997
148
Mem B15.01_148
9
HLRIAGHHL
0.279509
0.68





HLA-B*15:01
 998
 18
Mem B15.01_18
9
EQWNLVIGF
0.244266
0.78





HLA-B*15:01
 999
 45
Mem B15.01_45
9
FLYIIKLIF
0.20731
0.91





HLA-B*40:01
1000
136
Mem B40 01_136
9
SELVIGAVI
0.729021
0.14





HLA-B*58:01
1001
 84
Mem B58.01_84
9
MACLVGLMW
0.913908
0.06





HLA-B*58:01
1002
 67
Mem B58.01_67
9
LAAVYRINW
0.87252
0.08





HLA-B*58:01
1003
 47
Mem B58.01_47
9
YIIKLIFLW
0.737525
0.15





HLA-B*58:01
1004
 39
Mem B58.01_39
9
YANRNRFLY
0.532269
0.29





HLA-B*58:01
1005
171
Mem B58.01_171
9
ATSRTLSYY
0.512223
0.31





HLA-B*58:01
1006
 23
Mem B58.01_23
9
VIGFLFLTW
0.459109
0.35





HLA-B*58:01
1007
 12
Mem B58.01_12
9
ELKKLLEQW
0.447047
0.37





HLA-B*58:01
1008
 50
Mem B58.01_50
9
KLIFLWLLW
0.402934
0.41





HLA-B*58:01
1009
170
Mem B58.01_170
9
VATSRTLSY
0.388936
0.42





HLA-B*58:01
1010
168
Mem B58.01_168
9
ITVATSRTL
0.308282
0.54





HLA-B*58:01
1011
102
Mem B58.01_102
9
LFARTRSMW
0.265216
0.62





HLA-B*58:01
1012
 37
Mem B58.01_37
9
FAYANRNRF
0.225301
0.71





HLA-B*58:01
1013
 29
Mem B58.01_29
9
LTWICLLQF
0.196817
0.79





HLA-B*58:01
1014
  8
Mem B58.01_8
9
ITVEELKKL
0.181947
0.83
















TABLE 7







Envelope















Sequence

Sequence






Allele
ID No.
Start
Annotation
Length
Sequence
Score
Rank





HLA-A*01:01
1015
49
Evlp A1.01_49
9
VSLVKPSFY
0.462205
0.21





HLA-A*01:01
1016
34
Evlp A1.01_34
9
LTALRLCAY
0.395652
0.27





HLA-A*01:01
1017
51
Evlp A1.01_51
9
LVKPSFYVY
0.150715
0.68





HLA-A*02:01
1018
50
Evlp A2.01_50
9
SLVKPSFYV
0.846768
0.06





HLA-A*02:01
1019
57
Evlp A2.01_57
9
YVYSRVKNL
0.49818
0.26





HLA-A*02:01
1020
20
Evlp A2.01_20
9
FLAFVVFLL
0.482968
0.27





HLA-A*02:01
1021
13
Evlp A2.01_13
9
IVNSVLLFL
0.358211
0.43





HLA-A*02:01
1022
11
Evlp A2.01_11
9
TLIVNSVLL
0.319511
0.51





HLA-A*02:01
1023
 4
Evlp A2.01_4
9
FVSEETGTL
0.299489
0.55





HLA-A*02:01
1024
26
Evlp A2.01_26
9
FLLVTLAIL
0.284749
0.58





HLA-A*02:01
1025
16
Evlp A2.01_16
9
SVLLFLAFV
0.207941
0.76





HLA-A*03:01
1026
61
Evlp A3.01_61
9
RVKNLNSSR
0.496778
0.34





HLA-A*03:01
1027
45
Evlp A3.01_45
9
NIVNVSLVK
0.234744
0.83





HLA-A*26:01
1028
51
Evlp A26.01_51
9
LVKPSFYVY
0.462542
0.13





HLA-A*26:01
1029
57
Evlp A26.01_57
9
YVYSRVKNL
0.290721
0.26





HLA-A*26:01
1030
48
Evlp A26.01_48
9
NVSLVKPSF
0.265747
0.27





HLA-A*26:01
1031
 4
Evlp A26.01_4
9
FVSEETGTL
0.190459
0.39





HLA-A*26:01
1032
34
Evlp A26.01_34
9
LTALRLCAY
0.188337
0.4





HLA-A*26:01
1033
12
Evlp A26.01_12
9
LIVNSVLLF
0.171493
0.43





HLA-B*07:02
1034
57
Evlp B07.02_57
9
YVYSRVKNL
0.154246
0.83





HLA-B*15:01
1035
51
Evlp B15.01_51
9
LVKPSFYVY
0.890774
0.03





HLA-B*15:01
1036
12
Evlp B15.01_12
9
LIVNSVLLF
0.427014
0.41





HLA-B*15:01
1037
18
Evlp B15.01_18
9
LLFLAFVVF
0.271964
0.7





HLA-B*15:01
1038
34
Evlp B15.01_34
9
LTALRLCAY
0.259594
0.74





HLA-B*15:01
1039
49
Evlp B15.01_49
9
VSLVKPSFY
0.231883
0.83





HLA-B*40:01
1040
 6
Evlp B40.01_6
9
SEETGTLIV
0.553646
0.24





HLA-B*58:01
1041
49
Evlp B58.01_49
9
VSLVKPSFY
0.498101
0.32





HLA-B*58:01
1042
51
Evlp B58.01_51
9
LVKPSFYVY
0.274628
0.59





HLA-B*58:01
1043
12
Evlp B58.01_12
9
LIVNSVLLF
0.18442
0.82





HLA-B*58:01
1044
29
Evlp B58.01_29
9
VTLAILTAL
0.175332
0.86
















TABLE 8







ORF-3A















Sequence

Sequence






Allele
ID No.
Start
Annotation
Length
Sequence
Score
Rank





HLA-A*01:01
1045
207
ORF3a A1.01_207
9
FTSDYYQLY
0.980978
0.01





HLA-A*01:01
1046
204
ORF3a A1.01_204
9
HSYFTSDYY
0.789539
0.07





HLA-A*01:01
1047
176
ORF3a A1.01_176
9
TSPISEHDY
0.409915
0.26





HLA-A*01:01
1048
220
ORF3a A1.01_220
9
STDTGVEHV
0.348237
0.31





HLA-A*01:01
1049
101
ORF3a A1.01_101
9
LEAPFLYLY
0.214665
0.51





HLA-A*01:01
1050
146
ORF3a A1.01_146
9
FLCWHTNCY
0.20996
0.52





HLA-A*02:01
1051
139
ORF3a A2.01_139
9
LLYDANYFL
0.961842
0.02





HLA-A*02:01
1052
 72
ORF3a A2.01_72
9
ALSKGVHFV
0.953159
0.02





HLA-A*02:01
1053
 89
ORF3a A2.01_89
9
TVYSHLLLV
0.839324
0.06





HLA-A*02:01
1054
107
ORF3a A2.01_107
9
YLYALVYFL
0.826661
0.06





HLA-A*02:01
1055
236
ORF3a A2.01_236
9
IVDEPEEHV
0.636237
0.17





HLA-A*02:01
1056
110
ORF3a A2.01_110
9
ALVYFLQSI
0.573805
0.21





HLA-A*02:01
1057
 51
ORF3a A2.01_51
9
ALLAVFQSA
0.558561
0.23





HLA-A*02:01
1058
100
ORF3a A2.01_100
9
GLEAPFLYL
0.430573
0.33





HLA-A*02:01
1059
 82
ORF3a A2.01_82
9
NLLLLFVTV
0.354378
0.44





HLA-A*02:01
1060
 33
ORF3a A2.01_33
9
ATIPIQASL
0.32159
0.51





HLA-A*02:01
1061
 87
ORF3a A2.01_87
9
FVTVYSHLL
0.274081
0.6





HLA-A*02:01
1062
 45
ORF3a A2.01_45
9
WLIVGVALL
0.263005
0.62





HLA-A*02:01
1063
247
ORF3a A2.01_247
9
HTIDGSSGV
0.205907
0.77





HLA-A*02:01
1064
220
ORF3a A2.01_220
9
STDTGVEHV
0.180098
0.88





HLA-A*03:01
1065
 58
ORF3a A3.01_58
9
SASKIITLK
0.830102
0.06





HLA-A*03:01
1066
 59
ORF3a A3.01_59
9
ASKIITLKK
0.784036
0.1





HLA-A*03:01
1067
  8
ORF3a A3.01_8
9
FTIGTVTLK
0.654487
0.2





HLA-A*03:01
1068
227
ORF3a A3.01_227
9
HVTFFIYNK
0.444929
0.41





HLA-A*03:01
1069
184
ORF3a A3.01_184
9
YQIGGYTEK
0.257177
0.76





HLA-A*03:01
1070
 13
ORF3a A3.01_13
9
VTLKQGEIK
0.182774
0.99





HLA-A*24:02
1071
112
ORF3a A24.02_112
9
VYFLQSINF
0.969883
0.01





HLA-A*24:02
1072
211
ORF3a A24.02_211
9
YYQLYSTQL
0.862532
0.04





HLA-A*24:02
1073
106
ORF3a A24.02_106
9
LYLYALVYF
0.772667
0.06





HLA-A*24:02
1074
159
ORF3a A24.02_159
9
PYNSVTSSI
0.669199
0.12





HLA-A*24:02
1075
206
ORF3a A24.02_206
9
YFTSDYYQL
0.598801
0.15





HLA-A*24:02
1076
  7
ORF3a A24.02_7
9
IFTIGTVTL
0.46568
0.21





HLA-A*24:02
1077
 86
ORF3a A24.02_86
9
LFVTVYSHL
0.271524
0.38





HLA-A*24:02
1078
 37
ORF3a A24.02_37
9
IQASLPFGW
0.203184
0.48





HLA-A*24:02
1079
119
ORF3a A24.02_119
9
NFVRIIMRL
0.180996
0.54





HLA-A*24:02
1080
 55
ORF3a A24.02_55
9
VFQSASKII
0.179447
0.54





HLA-A*26:01
1081
207
ORF3a A26.01_207
9
FTSDYYQLY
0.92003
0.02





HLA-A*26:01
1082
247
ORF3a A26.01_247
9
HTIDGSSGV
0.441513
0.14





HLA-A*26:01
1083
 33
ORF3a A26.01_33
9
ATIPIQASL
0.386256
0.18





HLA-A*26:01
1084
204
ORF3a A26.01_204
9
HSYFTSDYY
0.355037
0.21





HLA-A*26:01
1085
 89
ORF3a A26.01_89
9
TVYSHLLLV
0.217806
0.34





HLA-B*07:02
1086
 35
ORF3a B07.02_35
9
IPIQASLPF
0.765631
0.1





HLA-B*07:02
1087
103
ORF3a B07.02_103
9
APFLYLYAL
0.646458
0.15





HLA-B*07:02
1088
 33
ORF3a B07.02_33
9
ATIPIQASL
0.179015
0.74





HLA-B*15:01
1089
207
ORF3a B15.01_207
9
FTSDYYQLY
0.436442
0.4





HLA-B*15:01
1090
204
ORF3a B15.01_204
9
HSYFTSDYY
0.396492
0.43





HLA-B*15:01
1091
105
ORF3a B15.01_105
9
FLYLYALVY
0.351637
0.51





HLA-B*15:01
1092
 71
ORF3a B15.01_71
9
LALSKGVHF
0.329022
0.56





HLA-B*15:01
1093
146
ORF3a B15.01_146
9
FLCWHTNCY
0.324971
0.58





HLA-B*15:01
1094
 33
ORF3a B15.01_33
9
ATIPIQASL
0.236974
0.81





HLA-B*15:01
1095
 83
ORF3a B15.01_83
9
LLLLFVTVY
0.217874
0.87





HLA-B*40:01
1096
241
ORF3a B40.01_241
9
EEHVQIHTI
0.56881
0.22





HLA-B*40:01
1097
193
ORF3a B40.01_193
9
WESGVKDCV
0.189099
0.66





HLA-B*58:01
1098
 61
ORF3a B58.01_61
9
KIITLKKRW
0.792637
0.12





HLA-B*58:01
1099
 37
ORF3a B58.01_37
9
IQASLPFGW
0.770898
0.14





HLA-B*58:01
1100
 39
ORF3a B58.01_39
9
ASLPFGWLI
0.600839
0.25





HLA-B*58:01
1101
185
ORF3a B58.01_185
9
QIGGYTEKW
0.594673
0.25





HLA-B*58:01
1102
 33
ORF3a B58.01_33
9
ATIPIQASL
0.593727
0.26





HLA-B*58:01
1103
 71
ORF3a B58.01_71
9
LALSKGVHF
0.526496
0.3





HLA-B*58:01
1104
207
ORF3a B58.01_207
9
FTSDYYQLY
0.50603
0.32





HLA-B*58:01
1105
 63
ORF3a B58.01_63
9
ITLKKRWQL
0.388217
0.42





HLA-B*58:01
1106
 57
ORF3a B58.01_57
9
QSASKIITL
0.385737
0.42





HLA-B*58:01
1107
120
ORF3a B58.01_120
9
FVRIIMRLW
0.362631
0.45





HLA-B*58:01
1108
123
ORF3a B58.01_123
9
IIMRLWLCW
0.362597
0.45





HLA-B*58:01
1109
204
ORF3a B58.01_204
9
HSYFTSDYY
0.35909
0.45





HLA-B*58:01
1110
 99
ORF3a B58.01_99
9
AGLEAPFLY
0.302287
0.55





HLA-B*58:01
1111
 88
ORF3a B58.01_88
9
VTVYSHLLL
0.206627
0.76





HLA-B*58:01
1112
 79
ORF3a B58.01_79
9
FVCNLLLLF
0.194224
0.79
















TABLE 9







ORF-6















Sequence

Sequence






Allele
ID No.
Start
Annotation
Length
Sequence
Score
Rank





HLA-A*01:01
1113
23
ORF6 A1.01_23
9
KVSIWNLDY
0.175977
0.61





HLA-A*02:01
1114
 3
ORF6 A2.01_3
9
HLVDFQVTI
0.910832
0.03





HLA-A*02:01
1115
28
ORF6 A2.01_28
9
NLDYIINLI
0.366871
0.41





HLA-A*02:01
1116
10
ORF6 A2.01_10
9
TIAEILLII
0.252135
0.64





HLA-A*02:01
1117
16
ORF6 A2.01_16
9
LIIMRTFKV
0.204711
0.77





HLA-A*03:01
1118
15
ORF6 A3.01_15
9
LLIIMRTFK
0.435841
0.43





HLA-A*03:01
1119
34
ORF6 A3.01_34
9
NLIIKNLSK
0.378646
0.53





HLA-A*03:01
1120
23
ORF6 A3.01_23
9
KVSIWNLDY
0.237379
0.82





HLA-A*24:02
1121
21
0RF6 A24.02_21
9
TFKVSIWNL
0.422545
0.25





HLA-B*07:02
1122
36
ORF6 B07.01_36
9
IIKNLSKSL
0.241687
0.59





HLA-B*15:01
1123
50
ORF6 B15.01_50
9
SQLDEEQPM
0.368542
0.48





HLA-B*15:01
1124
23
ORF6 B15.01_23
9
KVSIWNLDY
0.189586
0.99





HLA-B*58:01
1125
 9
0RF6 B58.01_9
9
VTIAEILLI
0.45441
0.36
















TABLE 10







ORF-7















Sequence

Sequence






Allele
ID No.
Start
Annotation
Length
Sequence
Score
Rank





HLA-A*02:01
1126
13
ORF7 A2.01_13
9
FLAFLLFLV
0.391043
0.38





HLA-A*02:01
1127
26
ORF7 A2.01_26
9
IIFWFSLEL
0.311542
0.53





HLA-A*02:01
1128
10
ORF7 A2.01_10
9
YLCFLAFLL
0.238134
0.67





HLA-A*02:01
1129
 3
ORF7 A2.01_3
9
ELSLIDFYL
0.173084
0.91





HLA-A*02:01
1130
17
ORF7 A2.01_17
9
LLFLVLIML
0.114475
1.3





HLA-A*24:02
1131
 5
ORF7 A24.02_5
9
SLIDFYLCF
0.162262
0.6





HLA-A*24:02
1132
 9
ORF7 A24.02_9
9
FYLCFLAFL
0.098668
0.86





HLA-A*26:01
1133
 5
ORF7 A26.01_5
9
SLIDFYLCF
0.158145
0.47





HLA-B*15:01
1134
 5
ORF7 B15.01_5
9
SLIDFYLCF
0.494646
0.31





HLA-B*58:01
1135
21
ORF7 B58.01_21
9
VLIMLIIFW
0.30633
0.54
















TABLE 11







ORF-8















Sequence

Sequence






Allele
ID No.
Start
Annotation
Length
Sequence
Score
Rank





HLA-A*01:01
1136
 23
ORF8 A1.01_23
9
QSCTQHQPY
0.336403
0.32





HLA-A*01:01
1137
 65
ORF8 A1.01_65
9
AGSKSPIQY
0.187336
0.57





HLA-A*01:01
1138
102
ORF8 A1.01_102
9
SFYEDFLEY
0.168125
0.62





HLA-A*01:01
1139
 32
ORF8 A1.01_32
9
VVDDPCPIH
0.147161
0.69





HLA-A*01:01
1140
 33
ORF8 A1.01_33
9
VDDPCPIHF
0.07727
1.2





HLA-A*01:01
1141
 96
ORF8 A1.01_96
9
SLVVRCSFY
0.073703
1.2





HLA-A*02:01
1142
107
ORF8 A2.01_107
9
FLEYHDVRV
0.675619
0.15





HLA-A*02:01
1143
 93
ORF8 A2.01_93
9
KLGSLVVRC
0.277075
0.59





HLA-A*02:01
1144
 31
ORF8 A2.01_31
9
YVVDDPCPI
0.197175
0.8





HLA-A*02:01
1145
  6
ORF8 A2.01_6
9
FLGIITTVA
0.182085
0.88





HLA-A*02:01
1146
  5
ORF8 A2.01_5
9
VFLGIITTV
0.161779
0.97





HLA-A*02:01
1147
 72
ORF8 A2.01_72
9
QYIDINYTV
0.075501
1.6





HLA-A*03:01
1148
102
ORF8 A3.01_102
9
SFYEDFLEY
0.16552
1.1





HLA-A*24:02
1149
 72
ORF8 A24.02_72
9
QYIDINYTV
0.871876
0.04





HLA-A*24:02
1150
109
ORF8 A24.02_109
9
EYHDVRVVL
0.515969
0.19





HLA-A*24:02
1151
 41
ORF8 A24.02_41
9
FYSKWYIRV
0.421822
0.25





HLA-A*24:02
1152
 77
ORF8 A24.02_77
9
NYTVSCLPF
0.324175
0.32





HLA-A*24:02
1153
102
ORF8 A24.02_102
9
SFYEDFLEY
0.222279
0.45





HLA-A*24:02
1154
  5
ORF8 A24.02_5
9
VFLGIITIV
0.190632
0.53





HLA-A*26:01
1155
102
ORF8 A26.01_102
9
SFYEDFLEY
0.318827
0.23





HLA-A*26:01
1156
  8
ORF8 A26.01_8
9
GIITTVAAF
0.192488
0.39





HLA-A*26:01
1157
 75
ORF8 A26.01_75
9
DINYTVSCL
0.083624
0.81





HLA-A*26:01
1158
 34
ORF8 A26.01_34
9
DDPCPIHFY
0.071197
0.92





HLA-B*07:02
1159
 91
ORF8 B07.02_91
9
EPKLGSLVV
0.422674
0.32





HLA-B*15:01
1160
  8
ORF8 B15.01_8
9
GIITIVAAF
0.702601
0.13





HLA-B*15:01
1161
102
ORF8 B15.01_102
9
SFYEDFLEY
0.47413
0.33





HLA-B*15:01
1162
 65
ORF8 B15.01_65
9
AGSKSPIQY
0.386821
0.44





HLA-B*15:01
1163
 96
ORF8 B15.01_96
9
SLVVRCSFY
0.260667
0.74





HLA-B*15:01
1164
 23
ORF8 B15.01_23
9
QSCTQHQPY
0.143528
1.3





HLA-B*15:01
1165
 89
ORF8 B15.01_89
9
CQEPKLGSL
0.101927
1.6





HLA-B*40:01
1166
108
ORF8 B40.01_108
9
LEYHDVRVV
0.478756
0.29





HLA-B*40:01
1167
 90
ORF8 B40.01_90
9
QEPKLGSLV
0.346977
0.44





HLA-B*40:01
1168
 52
ORF8 B40.01_52
9
RKSAPLIEL
0.070944
1.2





HLA-B*58:01
1169
 53
ORF8 B58.01_53
9
KSAPLIELC
0.331298
0.5





HLA-B*58:01
1170
 95
ORF8 B58.01_95
9
GSLVVRCSF
0.245213
0.65





HLA-B*58:01
1171
 65
ORF8 B58.01_65
9
AGSKSPIQY
0.215598
0.73





HLA-B*58:01
1172
 37
ORF8 B58.01_37
9
CPIHFYSKW
0.165769
0.89





HLA-B*58:01
1173
 66
ORF8 B58.01_66
9
GSKSPIQYI
0.137132
1.1





HLA-B*58:01
1174
 23
ORF8 B58.01_23
9
QSCTQHQPY
0.097082
1.3
















TABLE 12







Mutant Covid-19 RNA Frames










Sequence





ID No.
Sequence Annotation
Length
Sequence





1175
MutRNA 1
 864
CTGGTACCGCCACCATCGAGTGGCGGCGGAGGATCCATGTGGCTGCAGAA





TTTACTTTTCCTGGGCATTGTGGTCTACAGCCTCTCAGCACCCACCCGCTCA





CCCATCACTGTCACCCGGCCTTGGAAGCATGTAGAGGCCATCAAAGAAGC





CCTGAACCTCCTGGATGACATGCCTGTCACGTTGAATGAAGAGGTAGAAG





TCGTCTCTAACGAGTTCTCCTTCAAGAAGCTAACATGTGTGCAGACCCGCC





TGAAGATATTCGAGCAGGGTCTACGGGGCAATTTCACCAAACTCAAGGGC





GCCTTGAACATGACAGCCAGCTACTACCAGACATACTGCCCCCCAACTCC





GGAAACGGACTGTGAAACACAAGTTACCACCTATGCGGATTTCATAGACA





GCCTTAAAACCTTTCTGACTGATATCCCCTTTGAATGCAAAAAACCAGGCC





AAAAATAAAGCTCGCTITCTTGCTGTCCAATTTCTATTAAAGGTTCCTTTGT





TCCCTAAGTCCAACTACTAAACTGGGGGATATTATGAAGGGCCTTGAGCA





TCTGGATTCTGCCTAATAAAAAACATTTATTTTCATTGCAGCTCGCTTTCTT





GCTGTCCAATTTCTATTAAAGGTTCCTTTGTTCCCTAAGTCCAACTACTAAA





CTGGGGGATATTATGAAGGGCCTTGAGCATCTGGATTCTGCCTAATAAAA





AACATTTATTTTCATTGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG





CATATGACTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA





AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGAAGAGCTCTAGATG





1176
MutRNA 1-Complete
1284
CTGGTACCGCCACCATGCAGGTTCAGCTGGTTGAAAGTGGCGGCGGCAGC





GTTCAGGCCGGCGGTAGCTTACGCCTGAGTTGTGCCGCAAGCGGTTATAC





CTTTAGTAGCTATCCGATGGGTTGGTATCGTCAGGCCCCGGGTAAAGAAT





GTGAACTGGTGAGCAACCACAGCACAAGAAAGGAGGAGAAGCAGAGAA





ACGGCACCCTGACAGTGACAAGCACACTGCCTGCAAATTATGCCGGCAGC





GTTAAAGGTCGTTTTACCATTAGTCGCGATAATGCCAAAAATACCGCCTAT





CTGCAGATGGATAGCCTGAAACCGGAAGATACCGCCGTTTATTATTGCGC





AGCAGAAACCTATCAGTGCCGTGTTACCCATCCGCATCTGCCGCGCGCCCT





GATGAGTACCACCAAATGGGGCCAGGGCACCCAGGTTACCGTTAGCAGT





AGTGGCGGCGGAGGATCCATGTGGCTGCAGAATTTACTTTTCCTGGGCATT





GTGGTCTACAGCCTCTCAGCACCCACCCGCTCACCCATCACTGTCACCCGG





CCTTGGAAGCATGTAGAGGCCATCAAAGAAGCCCTGAACCTCCTGGATGA





CATGCCTGTCACGTTGAATGAAGAGGTAGAAGTCGTCTCTAACGAGTTCTC





CTTCAAGAAGCTAACATGTGTGCAGACCCGCCTGAAGATATTCGAGCAGG





GTCTACGGGGCAATTTCACCAAACTCAAGGGCGCCTTGAACATGACAGCC





AGCTACTACCAGACATACTGCCCCCCAACTCCGGAAACGGACTGTGAAAC





ACAAGTTACCACCTATGCGGATTTCATAGACAGCCTTAAAACCTTTCTGAC





TGATATCCCCTTTGAATGCAAAAAACCAGGCCAAAAATAAAGCTCGCTTT





CTTGCTGTCCAATTTCTATTAAAGGTTCCTTTGTTCCCTAAGTCCAACTACT





AAACTGGGGGATATTATGAAGGGCCTTGAGCATCTGGATTCTGCCTAATA





AAAAACATTTATTTTCATTGCAGCTCGCTTTCTTGCTGTCCAATTTCTATTAA





AGGTTCCTTTGTTCCCTAAGTCCAACTACTAAACTGGGGGATATTATGAAG





GGCCTTGAGCATCTGGATTCTGCCTAATAAAAAACATTTATTTTCATTGCA





AAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCATATGACTAAAAAAAA





AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA





AAAAAAAAAAAAAAAAGAAGAGCTCTAGATG





1177
MutRNA 2
 553
CTGGTACCTCTCAACACAACATATACAAAACAAACGAATCTCAAGCAATC





AAGCATTCTACTTCTATTGCAGCAATTTAAATCATTTCTTTTAAAGCAAAA





GCAATTTTCTGAAAATTTTCACCATTTACGAACGATAGCCTGGTACTGCAT





GCACGCAATGCTAGCTGCCCCTTTCCCGTCCTGGGTACCCCGAGTCTCCCC





CGACCTCGGGTCCCAGGTATGCTCCCACCTCCACCTGCCCCACTCACCACC





TCTGCTAGTTCCAGACACCTCCCAAGCACGCAGCAATGCAGCTCAAAACG





CTTAGCCTAGCCACACCCCCACGGGAAACAGCAGTGATTAACCTTTAGCA





ATAAACGAAAGTTTAACTAAGCTATACTAACCCCAGGGTGGTCAATTTCG





TGCCAGCCACACCCTCGAGCTAGCAAAAAAAAAAAAAAAAAAAAAAAA





AAAAAAGCATATGACTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA





AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGAAGAGCT





CTAGATG





1178
MutRNA 2-Complete
 995
CTGGTACCTCTCAACACAACATATACAAAACAAACGAATCTCAAGCAATC





AAGCATTCTACTTCTATTGCAGCAATTTAAATCATTTCTTTTAAAGCAAAA





GCAATTTTCTGAAAATTTTCACCATTTACGAACGATAGCGCCACCATGGCC





ACCATGGGCCAGGTTCAGCTGGTTGAAAGTGGCGGCGGCAGCGTTCAGGC





CGGCGGTAGCTTACGCCTGAGTTGTGCCGCAAGCGGTTATACCTTTAGTAG





CTATCCGATGGGTTGGTATCGTCAGGCCCCGGGTAAAGAATGTGAACTGG





TGAGCTACCAGTGCAGGGTGACCCACCCCGTGGACCTGGCACCCGCCCTC





ATGCGGTCCACGGCAAATTATGCCGGCAGCGTTAAAGGTCGTTTTACCATT





AGTCGCGATAATGCCAAAAATACCGCCTATCTGCAGATGGATAGCCTGAA





ACCGGAAGATACCGCCGTTTATTATTGCGCAGCATACCAGTGCAGGGTGA





CCCACCCCAACCCGAGAGGGGTGAGCGCCCTCATGCGGTCCACGTGGGGC





CAGGGCACCCAGGTTACCGTTAGCAGTCTGGTACTGCATGCACGCAATGC





TAGCTGCCCCTTTCCCGTCCTGGGTACCCCGAGTCTCCCCCGACCTCGGGTC





CCAGGTATGCTCCCACCTCCACCTGCCCCACTCACCACCTCTGCTAGTTCCA





GACACCTCCCAAGCACGCAGCAATGCAGCTCAAAACGCTTAGCCTAGCCA





CACCCCCACGGGAAACAGCAGTGATTAACCTTTAGCAATAAACGAAAGTT





TAACTAAGCTATACTAACCCCAGGGTTGGTCAATTTCGTGCCAGCCACACC





CTCGAGCTAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCATATG





ACTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA





AAAAAAAAAAAAAAAAAAAAAAAAAAGAAGAGCTCTAGATG





1179
SP1-2-A32-mRNA
1093
CTGGTACCGCCACCATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTG





CTCCTCCTGTGGGTGCCCGGGTCCAGAGGAGCCAAGTTCGTGGCTGCCTG





GACCCTGAAGGCTGCCGCTATTTCGATCTCCGAGATCAAGGGAGTCATCG





TGCACAAAATCGAGACCATCCTCTTCCATCATCATCATCACCATTGAGGAA





GCGGAGCCACGAACTTCTCTCTGTTAAAGCAAGCAGGAGATGTTGAAGAA





AACCCCGGGCCTATGTGGCTGCAGAATTTACTTTTCCTGGGCATTGTGGTC





TACAGCCTCTCAGCACCCACCCGCTCACCCATCACTGTCACCCGGCCTTGG





AAGCATGTAGAGGCCATCAAAGAAGCCCTGAACCTCCTGGATGACATGCC





TGTCACGTTGAATGAAGAGGTAGAAGTCGTCTCTAACGAGTTCTCCTTCAA





GAAGCTAACATGTGTGCAGACCCGCCTGAAGATATTCGAGCAGGGTCTAC





GGGGCAATTTCACCAAACTCAAGGGCGCCTTGAACATGACAGCCAGCTAC





TACCAGACATACTGCCCCCCAACTCCGGAAACGGACTGTGAAACACAAGT





TACCACCTATGCGGATTTCATAGACAGCCTTAAAACCTTTCTGACTGATAT





CCCCTTTGAATGCAAAAAACCAGGCCAAAAATGACGGAATTCCGTAAAG





CTCGCTTTCTTGCTGTCCAATTTCTATTAAAGGTTCCTTTGTTCCCTAAGTCC





AACTACTAAACTGGGGGATATTATGAAGGGCCTTGAGCATCTGGATTCTG





CCTAATAAAAAACATTTATTTTCATTGCAGCTCGCTTTCTTGCTGTCCAATT





TCTATTAAAGGTTCCTITGTTCCCTAAGTCCAACTACTAAACTGGGGGATA





TTATGAAGGGCCTTGAGCATCTGGATTCTGCCTAATAAAAAACATTTATTT





TCATTGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCATATGACTA





AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA





AAAAAAAAAAAAAAAAAAAAAAAGAAGAGCTCTAGATG





1180
SP1-2-A32-mRNA-
1177
CTGGTACCGCCACCATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTG



Complete 

CTCCTCCTGTGGGTGCCCGGGTCCAGAGGAAACCCGAGAGGGGTGAGCG





CCTACCTAGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTCACC





CCCACCTGCCCAGGGCCCTCATGATTTCGATCTCCGAGATCAAGGGAGTC





ATCGTGCACAAAATCGAGACCATCCTCTTCGTGACTCTGGGCTGCCTGGCC





ACGGGCTACCATCATCATCATCACCATTGAGGAAGCGGAGCCACGAACTT





CTCTCTGTTAAAGCAAGCAGGAGATGTTGAAGAAAACCCCGGGCCTATGT





GGCTGCAGAATTTACTTTTCCTGGGCATTGTGGTCTACAGCCTCTCAGCAC





CCACCCGCTCACCCATCACTGTCACCCGGCCTTGGAAGCATGTAGAGGCC





ATCAAAGAAGCCCTGAACCTCCTGGATGACATGCCTGTCACGTTGAATGA





AGAGGTAGAAGTCGTCTCTAACGAGTTCTCCTTCAAGAAGCTAACATGTG





TGCAGACCCGCCTGAAGATATTCGAGCAGGGTCTACGGGGCAATTTCACC





AAACTCAAGGGCGCCTTGAACATGACAGCCAGCTACTACCAGACATACTG





CCCCCCAACTCCGGAAACGGACTGTGAAACACAAGTTACCACCTATGCGG





ATTTCATAGACAGCCTTAAAACCTTTCTGACTGATATCCCCTTTGAATGCA





AAAAACCAGGCCAAAAATGACGGAATTCCGTAAAGCTCGCTTTCTTGCTG





TCCAATTTCTATTAAAGGTTCCTITGTTCCCTAAGTCCAACTACTAAACTGG





GGGATATTATGAAGGGCCTTGAGCATCTGGATTCTGCCTAATAAAAAACA





TTTATTTTCATTGCAGCTCGCTTTCTTGCTGTCCAATTTCTATTAAAGGTTCC





TTTGTTCCCTAAGTCCAACTACTAAACTGGGGGATATTATGAAGGGCCTTG





AGCATCTGGATTCTGCCTAATAAAAAACATTTATTTTCATTGCAAAAAAAA





AAAAAAAAAAAAAAAAAAAAAAGCATATGACTAAAAAAAAAAAAAAA





AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA





AAAAAAAAAGAAGAGCTCTAGATG














SPECIFICATION
Embodiment of the Invention

Structure SARS-CoV-2 or Covid-19 is classified within the subgenus Sarbecovirus of the genus Betacoronavirus of a genome size of ˜29,000 ribonucleotides (+ssRNA), comprising six open reading frames with 5′-cap and 3′-poly-A. The first ORF (ORF 1 a/b) about two-thirds of the whole genome, encodes 16 non-structural proteins (NSP 1-16) (Gralinski, L. E., 2020. Viruses 12, 135; Forni, D., 2017. Trends Microbiol 25, 35-48). ORF near the 3′ end encode the four main structure proteins including spike (S) and membrane (M), envelope (E) and nucleocapsid (N) protein (+9 differentially spliced proteins), as well as nonstructural protease, RNA-dependent RNA polymerase complex (RdRp, 5 proteins), as the most complex antigenic (B and T cell epitopes, 34 proteins) viral universe know to date risking cytokine storm (Gralinski, L. E., 2020. Viruses 12, 135). The complete genome is shown in FIG. 11, and all these genes are vaccine candidates for neutralizing antibody vaccines and cytotoxic T lymphocytes (CTL) vaccines.


Origin and phylogeny 3,713 complete genomic sequences have been deposited (up to Apr. 10, 2020; in NSAID: Zhang at Fudan deposited the first the full-length sequence (Jan. 10, 2020 in GenBank, Zhang, 2020 Nature). SARS CoV-2 maintains a relatively distant ˜80% nucleotide identity to the original SARS epidemic viruses. Shi showed that SARS-CoV-2 had 96.2% overall genome sequence (nt) identity to bat RaTG13 (Zhou, P., 2020. bioRxiv, 2020.2001.2022.914952; Wu, F., 2020. Nature 579, 265-269), a betacoronavirus, collected from horse-shoe bat, Rhinolophus affinis (Hu, B., 2017. PLoS path 13, e1006698-e1006698) upon a cave expedition by Shi at the Yunnan province, China (Zhou, P., 2020. Nature 579, 270-273). The S protein accounts for the host range. The subregion S1 contains the receptor-binding domain (RBD) for huACE2. Notably, Guan reported that receptor-binding domain (RBD) of pangolin-CoV of the Guangdong Province, China, exhibits 97.4% amino acid identity to that of human SARS CoV-2 (although ˜92.4% in nt identity to CoV-2) (Lam, T. T.-Y., 2020. Nature 583, 282-285), including identity of all five key amino acid contact residues to ACE2 as later shown the identical five amino acids in CoV-2 RBD all contact importantly to huACR2 cocrystal (Lam, T. T.-Y., 2020. Nature 583, 282-285; Yan, R., 2020. Science 367, 1444; Lan, J., 2020. Nature 581, 215-220).


The structural and genetic observations indicate that a niche ecology of genetic exchange among natural reservoirs of bats, and pangolins (or masked palm civets) in the Wuhan's exotic animal food market, substantiating zoonotic to human transmission. This transmission is also supported by two early clinical observations in that (i) the high proportion of earlier patents admitted in Wuhan hospitals (before Jan. 1, 2020) with history of market visits, tapered to human-to-human transmission exclusively afterwards (Li, Q., 2020. NEJM 382, 1199-1207); and in that (ii) 14 of the 41 patients (34%) has no contact with this marketplace (Huang, C., 2020. Lancet 395, 497-506). Thus, the clinical pattern suggests an alternative behavior contact with the intermediate hosts (pet companionship; lab environment and the patient zero) raising the possibility relating to accidental source of contamination with RaTG13 stock in the Wuhan's P4 laboratory. Therefore, modifying cultural behavior in the country indigenous with the intermediate hosts is central to future emergence and infectious disease control.


Epidemiology plays a central role to trace the origins and the modalities of the pandemic spread of the Covid-19 infectious disease. Epidemiological prevalence seroconversion data of Santa Clara County indicate up to 5.7% (2.58-5.7%), 85-fold more cases than current voluntary testing based on clinical urgency (Bendavid E., 2020. medRxiv, 2020.2004.2014.20062463). Therefore, an effective vaccine is in a dire need to break the transmission chain.


Recognition of the ACE2 receptor by the spike (S) membrane glycoprotein of SARS-CoV-2 is a major ligand for virus binding to host cells, infectivity, initiation of cytokine storm pathogenesis, and its mutations and adaptation to the various host range. S is a trimeric assembled protein, consisting of a central helical stalk, made of three interacting S2 portion, diverging at the surface using the S1 portion. Each S1 component consists of two large domains, the N-terminal domain (NTD) and receptor-binding domain (RBD), which contains highly immunogenic B cell epitopes. In virus membranes, Spike protein can exist in open and close form, regulating by the NTD, and the open form is accessible to ACE2 binding from cryoEM studies on the S of SARS-CoV.


Of the two RBDs per trimer that are not engaged with the receptor, either both are closed or one of the RBDs remains closed and one is in the open conformation. Trimers can bind two to three ACE2 receptors bound. ACE2 binding affects the bound RBD by a torqued forced, rigid-body rotation ˜5.5 Å away from the trimeric center, along with affected the NTD shift as well, and the rest two NTDs of all three S1 components move by ˜1.5-3.0 Å. Binding of more than one ACE2 remains the same altered configuration. Thus, the monomers are separated, make less contact with each other in the receptor-bound state. The stoichiometry is two RBD binding to three ACE receptors and leaving one RBD idling in the close conformation. Thus, one embodiment of the invention is to render vaccine induced-antibodies that lock the RBD open or close in a fixed state that do not permit the RBD binding to receptors, or do not permit rigid body torque force transmission and separation from the neighboring monomer for re-anchoring at the S2 moiety. ACE2-stabilised S1 opening therefore leads to opening up of S2 structure, exposing the S1′ and S1-S2 furin or proteolytic enzyme site, and the unleashing further conformation opening up of the fusion peptide for host cell fusion. Therefore, the embodiment of the invention is to have vaccine-elicited antibodies, anticipating the appearance of the aforementioned three sequences and arrest the viral infectivity due to torque force/rigid body induced conformational change.


Moreover, the torque also affects helix-loop-helix approximately, 980-990 within the HR1 region in CoV-1 studies: at the end of the S2 domain in that the torque creates 50×65 Å2 open cavity around the trimer axis that is for solvent exposed HR1, serving as additional target site for the vaccine-elicited antibodies in one embodiment of this invention. Noticeably, the embodiment of the invention focuses on the early and late stage of RBD phenomenology, namely, first neutralizing the open conformation of the monomer to prevent the ACE2 binding as a prerequisite; and second, focusing on maturation producing B cell epitopes, ignoring the cleaved or left over RBD-ACE since these are vestiges or left-over on the cell membrane without any protective significance. The left-over or hanging the S trimers permit this RBD in a more open ACE2-binding conformation of no more infectious disease significance, or passively, these vestiges can absorb neutralizing antibodies as a sink. In the process, the interaction of the closed form of S1 with a segment of the S2 chain that precedes the putative fusion peptide region is lost in the open form. Thus, the embodiment of the invention is to provide strong T cell help such as that from the herd immunity memory response to diphtheria or tetanus or viral measles CD4 helper T cells or Covid-19 viral CD4 helper T cells by using Covid-19 B cell epitopes integrated to a library of helper peptides. Successive trimeric RBD opening and ACE2 binding leads to a fully open and ACE2-bound form where the trimeric S1 ring remains bound to the core S2 trimer by limited contacts through the intermediate subdomains of S1. This structural and physiological arrangement leaves the top of the S2 helices fully exposed.


In recognition of the thermodynamic rule of folding as categorically governed by the primary amino acid sequence, the removal of the RBD due to cleavage, lead to the reconfiguration of the rest of the S2 polypeptide according to the remaining sequences, which are free from the torque force due to the RBD binding to the ACE2 receptors. The rest of the sequences can be under the constraint of the non-binding two free RBD or with one free and one bound RBD, which is yet to be cleaved and released. In the embodiment of the invention of using a series of protein scaffold such as beta-strands of lipocalin, fibronectin, CDR regions, equivalent to the “beta strand-loop-beta strand” motif of conventional immunoglobulins or CDR regions of an antibody light chain or heavy chain scaffolding, and N-, C-terminals of thermostable GFP for the primary furin site and fusion site sequences in the reformation of the natural folding of furin or fusion peptides similar to the devoid of RBD (de-RBDed) S2 using said protein scaffolds of FIG. 1.


Another embodiment of the invention is to use the “alpha helix-loop-alpha helix” motif of ankyrin repeat (AR) motif as yet another protein scaffold for alpha helices B cell epitopes, and beta-hairpin loop or a large or long B cell loop epitope. The AR is a modular, protein-protein interaction motif in nature, by compiling a given AR protein containing multiple repeats, being monomeric, of high thermostability. The ankyrin scaffold is organized in a unit of five, consisting of 5′ and 3′ capped and central three ankyrin will be synthesized, and the library will be replaced with amino acids in random in the three alpha helical regions and at three the beta turns, The consensus amino acid sequence contains all information required to define the ankyrin repeat fold for engineering B cell epitopes of Covid-19, Covid-n. One main embodiment is the 33-residue sequence motif into a helix-loop-helix structure with a beta hairpin/loop region projecting outward from the helices at a 90° angle. The repeats stack together to form a concave L-shaped structure with the inserted B cell epitope accessible as a concave recognizable structure by antibodies as receptor or pharmacological receptor. Moreover, the beta-hairpin loop or a longer loop can also contain a B cell epitope or a pharmacophore. Thus, the AR scaffold is a bifunctional representation of B cell epitopic conformations of secondary a helical structures and loop structure. Another versatility of the AR is for scaffolding coiled coils distorting away from the alpha helix or 3-10 helix.


Additionally, the ankyrin scaffold is also adapted into the eukaryotic ribosome display system, and the prokaryotic transcription/translation sequences, replaced with the eukaryotic enabling sequences. This system will be compared among protein scaffolds by the ribosome display (RD). We will test whether anti-ANK antibodies are elicited by random aptamers on ANK vs VHH scaffolded constructs, and antibodies to human VHIII family of immunoglobulin. Humanization of non-human immunoglobulins if employed such as VHH, will be conducted to eliminate immunogenicity. It is prudent to establish an efficacious RD technology platform, which is also safe for discovery of IgE drug or others.


Herein we further illustrate the principle of the loop scaffolding by two adjacent alpha-helices. The beta hairpin/loop region and the short alpha helices comprising the concave face have been previously characterized as the recognition surface. Positions classified as nonconserved and semiconserved of different type are mainly present on the recognition surface, whereas the opposite face shows mostly semiconserved positions of the same type. The embodiment of the invention is to replace the loop and short alpha helices with the B cell epitopes, or insert B cell epitopes into the loop or extend the short alpha helices as protruding concave B cell epitope so that AR then exhibits the highest variability to accommodate a diverse group of potential B cell epitopes or pharmacophore. Because the resilience of AR backbone and the foreign AR equivalent (from 20 to 40) will be replacing the constituent repeat for B cell epitopic or pharmacophore display. Thus, the consensus sequence carries the necessary structural integrity in the presence of a foreign alpha-helix, which can be subject to molecular evolution and protein display for more effective B cell epitopes and pharmacophore as a blueprint for reshaping CoV-2 and CoV-n protein engineering or to create novel biological functions.


Embodying B Cell Epitopic Vaccines of Cov-2, Cov-N
1. Foundation Paradigm:

The precision vaccine to target viral host range and infectivity is based on the foundation work of Harrison/Farzan/Li who established huACE2 as the receptor; and later elucidated RBD/ACE2 cocrystal for SARS CoV-1 and offered the molecular biology pathways leading to a clinical infection (Li, W., 2003. Nature 426, 450-454; Li, F., 2005, Science 309, 1864). Similarly, this strategy is rapidly established for SARS Cov-2 infection in human ACE2+ cells (Lam, T. T.-Y., 2020. Nature 583, 282-285; Yan, R., 2020. Science 367, 1444; Lan, J., 2020. Nature 581, 215-220; V*app, D., 2020. Science 367, 1260). Coronavirus S glycoprotein exists as a metastable prefusion homotrimer, comprising two functional subunits, S1 and S2 (Wrapp, D., 2020. Science 367, 1260). S protein exhibits the most diversity among coronaviruses, accounting for its wide host range (Li, F., 2013. Antiviral Res 100, 246-254). Thus, its threefold infection strategy unfolds, culminating in a productive infection:


(i) S1 Binding to ACE2 receptor: RBD or its core RBM or RBE (Receptor Binding B cell Epitope, 438-506 aa) in S1 exhibits extensive contact with ACE2 receptor oriented, by the N-terminus domain, NTD (aa 1-332 aa) (Wrapp, D., 2020. Science 367, 1260; Song, W., 2018. PLoS path 14, e1007236).


(ii) S2 cleavage and global conformational change: Receptor binding triggers proteolytic processing by the host surface enzymes, promoting a global (tectonic) conformational change to permit exposure of proteolytic furin sites (Wrapp, D., 2020. Science 367, 1260; Song, W., 2018. PLoS path 14, e1007236; Harrison, S. C., 2015. Virology 479-480, 498-507);


(iii) Fusion: the remaining trimeric S2 subunit arranging around a central triple-helical bundle to juxtapose the fusion peptide (trimeric) to the host membrane lipid bilayer, culminating in the delivery of the viral genome into the host cells (Wrapp, D., 2020. Science 367, 1260; Song, W., 2018. PLoS path 14, e1007236; Harrison, S. C., 2015. Virology 479-480, 498-507). Notably, the three-fold sophisticate strategy adopted by SARS CoV-2 reveals its natural prowess as well as its triple Achilles' heels subject to protective immune attack. Herein we will make three firewall vaccines in the proposed NIH-NOSI application. (i) Firewall 1st The first neutralizing antibody firewall blocks, competes or dissociates viral S trimer to the ACE2 receptor; (ii) Firewall 2nd The second neutralizing antibody firewall blocks furin cleavage sites to prevent S trimer maturation; (iii) Firewall 3rd the third neutralizing antibody firewall blocks and dissociates fusion peptide docking onto the lipid bilayer or micelles. Next, we will describe the technology platforms to build the protective shell via the three aforementioned firewall vaccines.


2. Vaccine Platform: The Technology Platform to Construct the Three Successive Layers of Firewall Vaccines

The protein folding is dictated by primary sequence and the secondary structure of beta strands or beta-sheet, and the most thermostable beta barrel thereof, accommodating the loop structures, which assume conformations as an antigenic site, a pharmacophore, or enzymatic sites. The scaffold protein is defined as a candidate protein with such beta strands that can constrain a given loop of protein, or a beta sheets that can accommodate multiple loops in adjacent orientations. Beta strands of FIG. 1A to 1D are colored yellow bar, and the loop sequences are left blank, while the alpha helical sequences are colored. The lipocalins are a family of proteins which transport small hydrophobic molecules. They share limited regions of sequence homology and a common tertiary structure architecture. This is an eight stranded antiparallel highly thermostable beta barrel. GFP consists of a stable beta barrel consisting of 8 anti-parallel beta strands that can accommodate the loop structure wherein. Fibronectin exhibits six beta strands in a barrel. Immunoglobulin heavy and light chain consist of six beta strands in a barrel. The complementarity-determining region (CDR) are typical loop region scaffolded by the two beta-strands. The invention embodies the replacement of the native loop sequences of immunoglobulin CDRs with loop sequences of CoV-2. These scaffold proteins are embodied to exhibit and conformationally constrained antigenic B cell epitopes of CoV-2, CoV-1 and CoV-n variants, defined as a CoV-2 mutant, or a next generation of CoV-2 with distinct which can be a recombinant with zoonotic origins, as warranted by general consumption of exotic food in China, e.g., civets, bats, pangolins.


Protein scaffolds, and Immunoglobulin heavy or light chain in humans or rodents as a constraining molecular clamp (chaperone), or a single heavy chain VHH antibody that is highly soluble due to the deletion of two hydrophobic residues in the hinge region mediating light chain pairing (Ewert, S., 2002. Biochem 41, 3628-3636). Human heavy chains and camelid heavy chains in particular CDR2 and CDR3 can accommodate longer, between 12 to 33 amino acids: the fact hypervariable regions can accommodate diverse amino acid sequences at will, are two-fold important in the embodiment of the invention (i) stably scaffolded within evolutionarily conserved thermostable framework regions as a highly thermostable β-barrel, acting as molecular scaffolds or chaperones; thus likewise (ii) the sequence space within the CDR1-CDR3 antibody binding loops can be replaced by antigenic B cell epitopes, or by conformation-delineated sequences acting as receptor agonists/antagonists, depending on the context of application.


The molecularly clamped, substituted CDR2 and/or CDR3 heavy or light chain, as antigenized constructs, can stabilize native configurations of the replaced foreign antigenic loop sequence to its native conformation on the parent protein. In this invention, the above three-firewall protective B cell epitopes will be clamped and constrained in the CDRs of heavy or light chain proven technology platform.


Furthermore, the embodiment of the invention expands the diversity of the constrained enclosed CoV-2 loop sequences via mutations and molecular evolution. the molecular clamped antigenized vaccine B cell epitopes constrained by the secondary structures, beta-strands and alpha-helices can in turn be displayed along with its mRNA on ribosomes, e.g., forming an Antigen/Ribosome/mRNA (ARM) ternary complexes, captured by huACE2 receptor-coated solid phase. Therefore phenotype-genotype linked selection can be subjected to molecular Darwinian evolution. And antigens can therefore be readily re-shaped for better fit antigens for eliciting neutralizing antibodies. The embodiment of the three firewall vaccines (including the Cov-2 S, NC, M, E compositions, and all the proteins indicated by FIG. 11) on the aforementioned said protein scaffolds, responding to the Covid-19 epidemic/pandemic challenge, and emerging mutants and continually evolving CoV-19, Covid-20, 21 or CoV-n. The embodiment of the invention covers all the mutant strain sequences, generated in the molecular evolution, Darwinian selection platform of the said protein display in the invention. The efficacies of the three firewall neutralizing antibodies to block or abrogate infection of huACE2+BHK-21 or HEK293 by CoV-2 S pseudotype virus as well as protection of pseudotype infection in huACE2 Tg mice.


Safety Margin: Need for a Precision SARS CoV-2, CoV-n Vaccine to Avoid the Lethal Cytokine Storm.

An important advantage of the precision vaccine is its avoidance of an inappropriate immune enhancement or viral antigen-induced cytokine storm. The two most immunogenic components conducted in SARS CoV-1 research, are attributed to S protein, and next to nucleocapsid (NC) protein, followed by M, E, and replicase and NSPs processed from the replicase/transcriptase complex (all together 34 viral protein epitopes universe!) (Janice Oh, H.-L., 2012. Emerg Microbes Infect 1, 1-6). Acute respiratory distress syndrome (ARDS) in both CoV-1 and CoV-2 infected patients is the cause of death due to viral antigen specific CD4 and CDB-mediated cytokine storm (Mehta, P., 2020. Lancet 395, 1033-1034; Ramos-Casals, M., 2014. Lancet 383, 1503-1516; Huang, C., 2020. Lancet 395, 497-506; Karakike, E., 2019. Front Immuno 10, 55-55). Inactivated or attenuated CoV-2 carries the antigenic universe for CD4 and CD8 T-cell epitopes; the complete S protein as vaccines carries the immunodominant CD4 helper T cell epitopes (Grifoni, A., 2020. Cell Host Microbe 27, 671-680.e672). Herein the three firewall CoV-2 precision vaccines made solely from critical and minimal protective B cell epitopes, can hit the three Achilles' heels for attachment, maturation and fusion of S protein life cycle without causing immune enhancement and cytokine storm detrimental to the host, henceforth increases vaccine safety margin (Chen, S.-S., 2020. USPTO 62,985,792; Chen, S.-S., 2020. USPTO Priority date: Feb. 11, 2020, 62/972,847) (Aim 1, Aim 3).


CoV-n Vaccines: Selecting the Emerging CoV-2 and CoV-n Variants or Mutants.

Two Concerns of Viral Adaption are in the Continuum:


(i) Emerging zoonosis: The five highly variable amino acid residues in RBE contacting ACE2, spanning among palm civets, bats, pangolins, and humans, exhibit a positive dN/dS trend for positive selection (Gralinski, L. E., 2020. Viruses 12, 13; Fomi, D., 2017. Trends Microbiol 25, 35-48; Zhang, T., 2020. Curr Biol 30, 1346-1351.e1342). Zoonotic migration of the fittest to humans poses a continual threat without human behavior modification.


(ii) Emerging new CoV-n or ‘seasonal flu’ evading herd immunity. The next waves of viral mutants are likely to re-emerge facing a sterile anti-RBD herd immunity for example as the most current FDA-approved vaccines are directed against a single protein or DNA sequence of a single viral strain to the SARS CoV-2 genomes of the year 2019 database. Cov-2 spike protein mutants such as B.1.526, E484K, S477N, within the region of RBD can be predicted by in vitro RD. Via ribosome display, we can the select molecular Darwinian antigenic variants of the entire S protein that bind ACE2 receptors under the three-firewall neutralizing antibody we made for the first generation of CoV-2 or herd immunity (Chen, S.-S., 2014. U.S. Pat. No. 8,865,179; Chen, S.-S., 2015. U.S. Pat. No. 9,187,553; Taussig, M., 2003. USPTO U.S. Pat. No. 6,620,587 B1).


In summary, the precision vaccines using said protein scaffolds raise three firewall neutralizing antibodies to abrogate productive CoV-2 pseudotype infection in human ACE2 cells and ACE2+Tg mice. The platform technology integrated with molecular evolution permits selection of S variants of emerging newer CoV-2, which can be used as future vaccines.


Embodying CTL Vaccine:

Any conception of cytotoxic T lymphocytes (CTL) to protect from a viral infected disease is based the observation of CTL in recognizing a processed viral protein constituent, restricted to major histocompatibility complex (MHC) I in killing virus infected cells (Zinkemagel and Doherty, 1974. Nature 248: 701-702). This original observation sheds the first light on the importance of processed protein constituents and its presentation by MHC I. The two critical pieces information is that nonameric cytotoxic peptides is generated in the ER via an unfold protein response or ER stress. The second piece information stems from the purification and sequencing of the nonameric peptides eluted platform the MHC I protein (Falk et al., 1991. Nature. 351, 290-6). In summary, this includes the totality of the prior knowledge across different species that CTL recognize viral protein nonameric to decameric peptides according to the MHC I polymorphisms. The embodiment of the invention is to compute nonameric peptide CoV-2 human 10 MHC I supertypes. The MHC I supertypes are: HLA-A01.01; HLA-A02.01; HLA-A03.01 (HLA-A11.01): HLA-A24.02; HLA-A26.01; HLA-B07.02; HLA-B08.01; HLA-B40.01; HLA-B58.01; HLA-B15.01 (Sidney, 2008. BMC Immunology. 9:1-15) Any high affinity supertype binding peptides from such as the spike protein or nucleocapsid protein and ORF1a of Cov-2 should yield the protective CTL vaccine epitopes that are used as CTL vaccines, and exclusively non-mutated and mutated sequences to ORF1a, ORF-1ab, leader protein, nsp-2, nsp-3, nsp-4, 3C-like proteinase, nsp-5, nsp-6, nsp-7, nsp-8, nsp-9, nsp-10, RNA-dependent RNA polymerase, helicase, 3→5′ exonuclease, endoRNAse, 2-O′ ribose methyltransferase, Surface Glycoprotein (S), ORF-3a, E, M, ORF-6, N, ORF-7a, ORF-7b, ORF-8. The embodiment of the invention will permit a use of a single or double or up to 12 such epitopes. The multiple epitopes can be arrayed in a linear head to tail format in a multimeric peptide assembly or a polypeptide produced as recombinant. Alternatively, the peptide in single or multimers can be inserted into the loop structure, or replace the loop structure of the protein scaffolds as described in the previous section. Because the loop of a protein scaffolded by exposed toward the surface and characterized by its hydrophilicity, and accessible to ER stress-mediated processing and degradation and assembly with the MHCI binding pocket. Therefore, the embodiment of the invention is to prolong the half-life of the peptide, or a contiguous peptide multimers or a polypeptide with contiguous peptides. In another embodiment of DNA/RNA immunization, the single peptide or contiguous peptide can be cloned into a eukaryotic expression vector that transcribed and translated in the modality of DNA/RNA immunization.


Antigenic variation due to molecular shift are frequent with the host range such as affinity of attachment increase or deviation away of attachment due to neutralization or protecting antibodies. CTL determinants are cryptic to immune evolutionary pressure, for the following reasons:


(i) Especially the multiple nonameric epitopes constellated upon a given MHCI haplotype or supertypes; thus, a mutation of a given nonamer does not affect the CTL protection to other nonamer of the given protein of interest;


(ii) The polymorphisms of a particular haplotype or supertype permit multiple different heterozygote combinations with respect to the diverse nonamers, and the redundancy or heterozygosity of the nonamer-presenting MHCI permit a robust defense repertoire. (iii) Even there is a mutation within the nonameric (+1/−1) sequences, the escape mutants have to be at the second and seven amino acid positions since these two are the anchor binding sites for the MHCI; and amino acids in non-anchored position will not affect the capacity of CTL induction. Therefore, the embodiment of the invention will ensure the CTL protection using the combination of nonameric or octameric or decameric peptides to defend against the CoV-2 or CoV-n infections and mutants.


Example One
Embodying Cov-2 B Cell Epitopic Sequences Illustrated:
Hypervariable and Conserved Region of Surface or Spike Protein:

The CoV-2 S exhibits 17 receptor binding amino acids (5 hypervariable+6 conserved+6 invariant amino acids in the RBE region of RBD) over ˜80 amino acid in one contiguous sequence contacting ACE2 receptor over 1750 A2 area (Lan, J., 2020. Nature 581, 215-220).


The embodiment of the invention is to claim the five hypervariable amino acids offering flexibility of change into any of the 20 amino acids selectable by RD or any kind of protein display platform to increase the mutant S binding to ACE2; and the six amino acids can suffer a conserved amino acid substitution for increasing binding affinity to ACE2. The embodiment of the invention includes mutants showing any or a combination of mutation of spike protein to gain advantage to high affinity binding to ACE2.


Therefore, the RBE will be used for replacement in CDR1 to CDR3 of camelid VHHcov-2 as a precision vaccine. CDRs as a molecular clamp also for Covid-19: (i) defining precision B cell vaccine epitopes, of the S protein, and RBE (RBD) of CoV-2; (ii) bifunctional T cell costimulation can be delivered using GFP helping the S B cell epitopes. Using the entire S protein as vaccine (1250 amino acids) may cause unfavorable immune enhancement (Gralinski, L. E., 2020. Viruses 12, 135; Fomi, D., 2017. Trends Microbiol 25, 35-48), currently a major focus of several key vaccine projects [DNA vaccine (Inovio), RNA vaccine (Moderna), CoV-2 S pseudotype flu (J&J), S protein on vaccinia virus (Oxford U), and recombinant S proteins)].


NTD Region within the Spike Protein:


Moreover, three additional sites on the NTD regions are considered. A vaccine neutralizing and blocking the entry of 2019_nCoV via human ACE2, constrained by VHH and optimized by mRNA and ribosome display coupled to molecular evolution.


The patent invention claims a vaccine representing a combination of four antigenic sites of 2019_nCoV onto a protein constraining scaffold.


Said Four HIV-1 Like Sequences, Antigenic Sites are:

Site 1: TNGTKR 71-79 of the spike protein of 2019_nCoV


Site 2: HKNNKS 145-150
Site 3: RSYLTPGDSSSG 245-256

Site 4: QTNSPRRA 676-684 (flanking RBD)


The patent invention uses the camelid nanobody as a scaffold to constrain or encompass said four sites (site1, site 2, site 3 and site 4) of the spike protein of CoV-2, 2019_new coronavirus that segregate and assemble together in a folding strategy on a protein scaffold as a vaccine. Said vaccine elicits neutralizing antibodies to destabilize the viral monomeric and/or the trimeric spike protein in order to block the contact of the amino acid region or the protein domain(s) of the spike protein of the CoV-2 to the angiotensin converting enzyme 2 (ACE2) as entry into susceptible lungs, kidney tissues and hearts.


Said four dis-contiguous sites, from regions distinct from the alpha-helix or 3-10 helix fusion domain, are chosen for inserting into, arranging in between, or replacing the native sequences of CDR of immunoglobulin or substitute native loop sequences of other protein scaffolds. Said platform can be optimized by mRNA and ribosome display, coupled to molecular evolution via solid phase ACE2.


Said four sites of the spike protein of CoV-2, 2019_nCoV in natural history, serve as anchors for the stabilizing the beta-pleated sheet and loops of the contact regions to the ACE2 receptor.


Said four sites of the spike protein of CoV-2, 2019_nCoV, in this patent invention hence assembled substituting the native loop sequences of the aforementioned said protein scaffolds, are claimed the prophylactic and/or therapeutic vaccine for eliciting neutralizing and/or blocking antibodies that inhibit entry of CoV-2, 2019_nCoV via contacting ACE2 of human susceptible lung and kidney tissues and hearts.


Example Two
Embodying Molecular Darwinian Evolution of Fitness Antigen-Ribosome/mRNA (ARM) Display Platform

The precision vaccines can be further re-shaped for fitness as a more efficacious vaccine, using the inhouse antigen ribosome/mRNA display (RD) platform (Chen, S.-S., 2014. U.S. Pat. No. 8,865,179; Chen, S.-S., 2015. U.S. Pat. No. 9,187,553; Chen, S.-S., 2019. USPTO 62/885,110; Chen, S.-S., 2018. USPTO 62/917,408).


The evolving ARM Platform (for antigenslantagonists) by various protein scaffolds, including said protein scaffolds (FIG. 1) was established and patented in the laboratory of the small concern. FIG. 2 shows the ARM platform integrated with the Darwinian molecular evolution (DME) and DNA shuffling for selecting a high-affinity final vaccine candidate.


(i) The inserted CoV-2 loop sequences in said protein scaffolds can be enabled for transcription initiation with the T7 promoter, translation with the Kozak sequence that scans rRNA binding site of the eukaryotic 40S ribosomes, wherein the termination codon protein scaffold stuffer region (using GFP or using Cu) was deleted so that the transcribed-translated antigen-ribosome-mRNA (ARM) ternary complex was not released from the P site of the ribosomes (e.g., stalled on eukaryotic ribosomes as a ternary ARM complex) (Chen, S.-S., 2014. U.S. Pat. No. 8,865,179; Taussig, M., 2003. U.S. Pat. No. 6,620,587 B1; Taussig, M. J., 2003. U.S. Pat. No. 6,620,587; Yang, Y.-M., 2007. B.B.R.C. 359, 251-257) (left trinity).


(ii) Next, this bulky ARM is selectable and captured by huACE2 receptor, chemically coupled to beads or coated on a 96-well plate (let trinity).


(iii) Selectable mRNA can be dissociated from the 96-well plate (right trinity) and mutations can be introduced by base analogues and error-prone RT-PCR (upper trinity) (Chen, S.-S., 2014. U.S. Pat. No. 8,865,179; Chen, S.-S., 2008. Biochem BioPhys Res Comms 374, 409-414). After 3-5 reiterative rounds of selection, the clonal DNA can be prepared and undergo DNA shuffling recombination/inbreeding to generate a high-affinity product (upper trinity) upon Darwinian competition. The ARM finalists and mRNA throughputs selected in Aim 2 will be tested as the ultimate API or precision vaccines for cell-based assay and animal protection.


Example Three
Embodying the Precision Vaccines of S Protein of Covid-19 Using the Molecular Clamps

In the example: we focus on insertion of the RBD domain, which contact ACE2 into camelid CDR constructs as blocking or neutralizing antigenic epitopes using a linker sequence co-synthesized along with GFP as additional constrainer and also serve to activate CD4 helper T cells to generate an anti-RBD antibody response. In IgE and FceRI receptors system, we pioneer IgE B cell epitopic vaccine discovery using on X-ray of ligand/receptor cocrystal, and most antigenic loops are delineated by the secondary b-strand flanking structure, which coincide with the tertiary structure by X-ray. This methodology based on empirical data (other than biocomputing) has yielded productive means to define minimal B cell epitopes, including IgE receptor-binding loops of IgE as universal allergy vaccines in raising neutralizing antibodies. Recently, high resolution X ray of CoV-2 S protein is finally available, as elucidated by Veesler in FIG. 3 and FIG. 4. X ray cocrystal using CTD1 (containing RBD)/ACE2 receptor was also reported by this group (Walls, A. C., 2020. Cell 181, 281-292.e286), as well as a high resolution CoV-2 RBD/ACE2 cocrystal by Wang (Lan, J., 2020. Nature 581, 215-220) in FIG. 3 showing tertiary structure here. These pivotal studies provide critical structural based information herein for designing molecular clamped precision three-firewall vaccines. Importantly, there are five key receptor binding amino acid residues (the hypervariable tract among humans and intermediate hosts), and 12 conserved amino acid residues with eight identical residues, embracing the ACE2 over an overall total area of 1,750 A2 (Yan, R., 2020. Science 367, 1444; Lan, J., 2020. Nature 581, 215-220), encompassing 17 H bond and one salt bridge: Tyr (Y), 479, 489, 495, 505 with receptor hydroxyl group; among 12 amino acids shared with CoV-1, 6 identical sequences, 6 conserved, and five distinct different residues (Lan, J., 2020. Nature 581, 215-220).


S-ACE2 induced twist can cause exposure and furin cleavage site on S protomer/homotrimer. The precision vaccine against RBD will be tested to not only block the initial attachment, but also tested for dissociation of S from huACE2. Importantly, dissociation will bring a ‘cure’ since stage 2 and 3 enzyme catalysis and membrane fusion are summarily blocked. All the sequence information is described by Veesler and Wang in their X ray crystal/cocrystals, as illustrated in the secondary structure enables an initial roadmap for B cell epitope discovery. The precision candidates are indicated in the flexible loops (hydrophilic, open accessible), flanked by the beta strands, or by alpha helix, or alpha-beta., As an example, Table 1 showed putative B cell epitopes of the loops flanked by the beta stands (RBE/RBD and NTD sequences), and B cell epitopes are over the entire spike protein, not limited to the example sequences, moreover, can include any sequences of the secondary structure α-helix, 3-10 helix, and beta strand itself for substituting the CDR loop (s). Table 2 (furin sites and fusion peptides), delineated by the secondary and tertiary structures of FIG. 4 (Veesler, pdb deposit: 6VXX). Table 1 also showed the replacement of immunoglobulin CDR1 CDR2 and CDR3 with the RBE.


The embodiment of the invention: Coronavirus Co-V2 or Covid_19, CoV-1, initiate human infection by the attachment of coronavirus spike protein (S protein) to the angiotensin converting enzyme II (ACE2) receptors in the lung, kidneys and hearts. We invent vaccines using sequences of the entire S protein: NTD, RBD (RBM), NTD partner, furin cleavage domain, HRN-HRC for fusion domain, and TM for transmembrane anchorage, as well as translated proteins of ORF1a, ORF-1ab, leader protein, nsp-2, nsp-3, nsp-4, 3C-like proteinase, nsp-5, nsp-6, nsp-7, nsp-8, nsp-9, nsp-10, RNA-dependent RNA polymerase, helicase, 3→5′ exonuclease, endoRNAse, 2-O′ ribose methyltransferase, ORF-3a, E, M, ORF-6, N, ORF-7a, ORF-7b, ORF-8, nucleocapsid (NC) the envelope protein (E). Protective B cell vaccine epitopes will be discovered around the loop regions of the entire S protein as a paradigm and aforementioned proteins scaffolded by the beta-strands or beta-sheet or alpha-helices, including also the N- and C-terminal loops as antigenic sites. Moreover, vaccines are made against envelope protein and RdRp to perturb viral life cycle. Vaccine against the variable regions such as RBD can be catered to the infectious endemic or pandemic strains. Vaccine against the universal protective epitopes such as nonvariant amino acids on S as well as canonical nonvariant sequences on RdRp and envelope proteins are universal vaccines. This patent invention also embodies the B cell epitopes from the nonstructural proteins.


Herein, the vaccine platform extended to CoV-1, and CoV-n, the evolving variants as the close relatives and kindreds to B.1.526, E484K, S477N. The constraining scaffold will include lipocalin, protein A, green fluorescent protein (GFP), fibronectin, immunoglobulin, super beta strand of FG of human IgE, shark antibodies, and VHH of the species of the Camelidae, including camels and llamas of FIG. 1.


A Partial List of Working Examples

Wuhan's strain (WUHAN/WVIO4), shown in FIG. 5 or including and not limited to all the variable isolates from China and world-wide, amino acid sequences that can be accommodated into the CDRs of the immunoglobulin heavy chains.


1. The First Firewall:


Anti-RBE/RBD Neutralizing Antibodies. Furthermore, cryo-EM structures show that the RBD of a protomer exists in two states, the up or open state capable binding to the receptor, while the down or close position is cryptic, waiting for its kinetic moment opening up to bind to receptor next (Wapp, D., 2020. Science 367, 1260; Song, W., 2018. PLoS path 14, e1007236). Like a spinning wheel, with three sharp spikes there be in two states present in homotrimer. Noteworthily, an X ray study using the complete untruncated huACE2, chaperoned as a dimer by amino acid transporters (BoAT1) in the membrane that have the potential to form multiple cross-linking networks with multiple homotrimers (Lam, T. T.-Y., 2020. Nature 583, 282-285). Since each RBD fragment binds to truncated huACE2 (most done in this expedient way) in the Biacore assay scores at a high affinity of 4.5 nM (Wapp, D., 2020, Science 367, 1260), it is likely that homotrimeric to dimeric synergistic binding may approach picomolar affinity posing challenging situations for vaccine makers.


The advantage of the precision first firewall vaccine in Table 1A-1B rendering RBE/RBD an Achilles' heel, over the current vaccine ventures (whole S-mRNA, -DNA, -recombinant, and whole S pseudotype) is:


(i) Molecular clamp can capture the lowest free energy configuration and render immunogenic either the open or the close state depending on the replacement of the different demarcated RBE shown in FIGS. 5 and 6, into the CDR1, CDR2 and CDR3, using camelid VHH prototype.


(ii) Molecular clamping using demarcated precision peptide subregions, like the clamped IgE vaccine, is highly efficient to elicit antibodies causing dissociation of the huACE2-bound CoV-2 S, despite at high affinity. Therefore, we will assay the dissociation of biotinylated RBE/RBD and S protein from huACE2 receptors on the solid phase.


(iii) Type-specific vs. broadly neutralizing or cross-reactive antibodies Despite its immunogenicity, we anticipate that the molecularly clamped RBE vaccine including the hypervariable receptor-binding amino acids as a precision antigen, may be CoV-2 type-specific (i.e., RBE between CoV-1 and CoV-2 shares only 33% homology) (Lan, J., 2020. Nature 581, 215-220) but not for neutralizing an emerging variant as a seasonal ‘flu’. Nevertheless, molecular clamped precision vaccine using RBD outside the RBE regions, exhibits greater conservation (77% homology) (Lan, J., 2020. Nature 581, 215-220), therefore precision vaccine, constructed accordingly offers an opportunity as a broad-spectrum vaccine.


Indeed, comparative X-ray of cocrystal of CoV-1 RBD vs CoV-2 RBD with huACE2 receptors, lends support for a highly homologous contour of the overall RBD region: six out of 17 receptor contact residues of RBD are identical, and the other six exhibit conserved amino acid change, while the α-carbon chain tracing the two respective receptor-bound RBD contour shows impressive side by side overlap (Lan, J., 2020. Nature 581, 215-220). This observation lends credence that a precision vaccine targeting the evolutionarily conserved configuration of RBD can then be broadly neutralizing (BN) or cross-reactive for both SARS CoV-1 and CoV-2, which will be tested by ELISAs: S protein, RBE/RBD capture by ACE2 receptor-coated plates were established for evaluating the titers of first firewall antibodies, and biological neutralizing activities will be ascertained by neutralizing infection of CoV-2 S pseudotype in huACE2+ cells (Aim 3).


2. The Second Firewall:


Type-specific and Universal vaccines aim at blocking the furin cleavage sites to prevent cleavage of S1-S2, posing S′2 cut as well as the S′2 site, cut at S′2 for S2 maturation, poising for the critical later membrane fusion event by fusion peptides. S′2 site being cryptic and evolutionarily conserved is exempt from immune surveillance, which is exposed only following global conformational change after S1 binding to ACE2 (Wapp, D., 2020. Science 367, 1260; Song, W., 2018. PLoS path 14, e1007236). Noticeably, whole S protein-based vaccines typically miss this important vaccine component due to their crypticity. Therefore, these epitopes molecularly clamped in VHH, are ideal candidates as universal vaccines to tackle the two more Achilles' heels once exposed.


(i) Anti-Furin Site-Specific Neutralizing Antibodies: Furin (a proprotein convertase, PC) enzymatic site exposure plays a critical role for generating S2 fragment for completing the stage 2 infectious cycle post RBD binding; noteworthily, NTD the adjacent N-terminal I domain (1 aa-333 aa) is also dramatically altered in conformation following the binding event of S/ACE2 (Song, W., 2018. PLoS path 14, e1007236), raising the importance of the ‘accessory NTD vaccine’ synergizing with the three firewall vaccines, attenuating S2 cleavage.


(ii) Antibody against the global conformational change of S1 in cooperativity with S2, essential for priming (opening up) the first furin site at S1-S2 junction (TNSPRAR/SA), which then boosts further conformational change in opening up the S2′ site (SKPSKR/SF) for cleavage by furin or serine protease (TMPRSS2), or cathepsin B/L (Coutard, B., 2020. Antiviral Res 176, 104742; Hoffmann, M., 2020. Cell 181, 271-280.e278) at the plasma membrane junction of host cells.


Therefore, anti-S1-S2 vaccines are prepared using: YQTQTNSPRRAR/SVASQSIIAY/TMSLG 676-700 furin cleavage site S1-S2; or anti-S2′ vaccine: PSKPSKR/SFIEDLLFNKVTLADAGFIK 810-828 S′2 furin cleavage site (Coutard, B., 2020. Antiviral Res 176, 104742) will be prepared into CDR2 or CDR3 domains in singles or a pair onto the camelid VHH.


ELISAs: 1-S2 and S′2 with C-terminal His tag are captured by plated coated with anti-His, and anti-second and firewall antibodies are assayed via binding to the S1-S2 and S′2 fragments respectively, whereby the neutralizing activities tested by neutralizing CoV-2 S pseudotype infection in huACE2+ cells.


Noticeably, S1-S2 cleavage site unique in CoV-2 provides enhanced infection capability (an additional virulence factor) as compared to CoV-1, due its ‘priming’ then opening up S′2 being universal to realize productive infection. These furin sites can only be attached with precision vaccines but the entire S injection as vaccine lacking exposure of these critical antigenic determinants. We anticipate that S1-S2 precision vaccine is unique for CoV-2 protection, while anti-S′ precision vaccine (commonly shared among CoV-2 and CoV-1 or -n) will be employed for broader spectrum CoV-2 protection or for CoV-n as the Universal Vaccine. And a combined bivalent second firewall vaccine offers a stronger protection for the pandemic SARS CoV-2.


3. The Third Firewall:


Universal vaccine is directed at blocking the conserved fusion peptide sequence, when CoV-2 escapes or leaks through the first and the second line of defense firewall. Anti-Fusion Peptide Neutralizing Antibodies: Should CoV-2 break through the second firewall, the mature homotrimers form the extended a-helix to present the triple bullets of fusion peptide, juxtaposing the membrane for fusion and genome entry. Hence, the exposed fusion presents yet another Achilles' heel, which the precision anti fusion peptide vaccine can instantly pull apart the injecting virion from the host cells: in particular, devoid now of S1, CoV-2 is nearly ‘dangling ‘in thin air’ with no more attachment to ACE2 (Harrison, S. C., 2008. Nature Struc Mol Bio 15, 690-698). Precision vaccine against the fusion peptide: YKTPPIKDFGGFNFSQILPDPSKPSKR (a-a) 770-778 (Wapp, D., 2020. Science 367, 1260; Cosset, F.-L., 2011. Adv Genet 73, 121-183) replaced into the CDR2 or CDR3 of camelid VHH (Table 1B), is intended for the third firewall specific neutralizing antibodies to block or dissociate the mature S2 from host cell membrane as a universal vaccine.


ELISAs His-tag fusion peptides are captured by anti-His on 96 wells, and added with anti-third firewall antibodies, detected by HRP-anti-IgA vs. IgG, wherein neutralizing evaluated by abrogating CoV-2 S pseudotype infection in huACE2+ cells.


4. The Fourth Firewall


Similarly, the vaccine is directed against CoV-2 membrane antigen, envelope antigen and nucleocapsid and RdRp to neutralize viral infectivity. CTL inhibition of viral replication and eliminate the CoV-2 infected foci and reservoir. This firewall is particularly effective against mutant virus.


Impact of Vaccination: Mucosal IgA, systemic IgG and CTL induction, and protection of CoV-2 S pseudotype infection The RBD is PCR-amplified fused to GFP, used as a positive control. Immunogenicity of the above precision vaccine constructs will be tested in BALB/c and B6 mice via SC and IM immunization for circulating IgG as well as peroral and intranasal (IN) immunization with cholera toxin (CT) for IgA production, in particular, IgA in bronchoalveolar fluid (BALF) will be evaluated for neutralizing IgA abrogating binding of S protein to the receptors by ELISA and blocking CoV-2 S viral pseudotype infection, which requires the entire S protein for initial attachment RBD/S1, priming of S1-S2 and boosting S′2 cleavage and the insertion of the boosted fusion peptides, wherein the three life cycles are subjected to the attack of each respective precision vaccines or a combination thereof.


Further Considerations:

(i) Glycan Shield: The landmark discovery of ACE2 receptor for CoV-1 by Harrison, Farzan and Li (Li, F., 2005. Science 309, 1864) and later elucidation of X ray cocrystal of RBD and ACE2 (2.5A2), entirely verified also in CoV-2 cocrystal (Lan, J., 2020. Nature 581, 215-220; Walls, A. C., 2020. Cell 181, 281-292.e286) provides a gateway for B cell vaccine discovery for coronavirus. The conserved feature of CoV-1 and CoV-2 S/ACE2 interaction indicate that vaccine designed using the conserved RBE clamped in B cell epitopes enveloping the ACE2 receptor, may be applicable across coronavirus variants. However, the above major vaccine RBD paradigm, wherein most investigators concentrate the effort, does not take into account the 18 N- and 4 O-glycan shields, which can mask the target B cell epitopes in S1 as well as S2;


(ii) Synergism for 1st Firewall by NTD1 (CTD1): Since the RBE/RBD conformation is influenced by the NTD domain of S1 (Song, W., 2018. PLoS path 14, e1007236), which is a 330-amino acid domain prior to RBD (sugar-binding in MERS) (Li, F., 2015. J Virol 89, 1954-1964). Therefore, we will test whether anti-NTD may alter a covert an up (open) position for ACE2 binding into a down (close) position incapable of receptor binding, which can also play a role in dissociating the initial RBD binding to human ACE2, subsequently negatively influence the opening up or priming of the furin site at S1-S2 junction. NTD exhibits an overall 50% homology between CoV-1 and CoV-2, however there are two tracts delineated by b-strands, exhibiting 90% homology as a choice for making precision broadly neutralizing vaccines.


The embodiment of the vaccine using GFP as help: GFP offers a strong CD4 T cell help for antibody response; and its extreme thermostability (Tm˜100° C.) indirectly enhances the camelid chaperone capacity as a molecular clamp (Chen, S.-S., 2014. U.S. Pat. No. 8,865,179; Tsien, R. Y., 1998. Annu Rev Biochem 67, 509-544). Although GFP is approved for clinical imaging, it is not conventionally thought as carrier protein for helper T cells. hence, the camelid VHH displaying vaccine B cell epitopes can be conjugated to tetanus toxoid or diphtheria toxoid to elicit long-term memory helper T cells, safely taking advantage of the herd immunity.


Vaccine Purification (GLP):

Recombinant human ACE2 is purchased. Complete S gene will be prepared by gene synthesis (Biomatik, Inc.), and prepared in full and in truncation for immune specificities. Primer sets designed and executed for PCR cloning for preparing the S1, truncated S1 (removing NTD), RBD, RBE, NTD; furin primed S2 (cleaved by S1-S2 site), furin-boosted S2′ (cleaved at S2′ site) into and produced in the SOP set up for periplasmic space purification of ST2 signal peptide engineered into pET11b (Chen, S.-S., 2014. U.S. Pat. No. 8,865,179; Chen, S.-S., 2015. U.S. Pat. No. 9,187,553). The proteins or polypeptides are properly tagged. The CoV-2 S protein epitopes from the N- as well as C-terminal S1/S2 were cloned into the CDR1, CDR2 and CDR3. The Company has extensive experience in recombinant protein constructs, protein purification in inclusion body purification; moreover, by concatenated expression with GFP on the C-terminus fusion, the destination of the IgV, is re-routed to cytosol, subject to expedient straightforward purification, sequenced ascertained by MS/MS. All procedures follow GLP set up, and rodents will be vaccinated IN and IM/SC routes for comparison.


The Advantage of the Embodiment of the Invention of the Above Molecular Defined B Cell and T Cell Epitopes in Contrast to Full Length S Protein as Vaccines:

a. Cytokine storm, immune deviation or tolerance: inappropriately prepared CoV-2 vaccine can lead to immune complexes (directed against harmful B-cell epitopes)-mediated T cell activation, viral CD4 T-cell epitope activation, resulting in immune dysregulation and cytokine storm, which is the cause of death of Covid-19 or SARS-Cov-2. In SARS CoV-1, nucleocapsid was found to induce a large repertoire of CD4 and CD8 epitopes contributing to CTL defense, and paradoxically also CD4-CD8 T-cell mediated cytokine storm and death (Blanco-Melo, D. a. t., 2020. Cell in press 181, 1036-1045.e9; Janice Oh, H.-L., 2012. Emg Microbes Infect 1, e23-e23). Therefore, it is of critical importance to appreciate the intricate web of the B-cell epitopes of the typically highly complex S protein (1,255 amino acids), in a recent biocomputing analysis stated that ˜80% repertoire of Covid-19 B cell epitopes as well as abundant CD4 helper T cell epitopes are on the S protein (Grifoni, A., 2020. Cell Host Microbe 27, 671-680.e672). The cytokine storm can be inhibited with suppressive RNA vaccines accommodating the above B and T cell epitopes. Several whole S protein vaccines in contrast to that of the precision firewall approach, are currently in progress, including the mRNA vaccine, DNA vaccine, recombinant vaccines and the S pseudotype adenoviral vaccine (J&J) in the phase 1 trial sponsored by CEPI/Gates Foundation and NIH. The precision vaccines lack viral CD4/CD8 T cell epitope, since GFP carrier-specific helper T cells contribute to activation of B cells specific for molecular clamped B cell epitopes.


b. Ribosome mRNA Display (RD) to further optimize four firewall vaccines: The thermodynamic law dictates the RBE peptide folding scaffolded by the molecular clamps assumes the lowest free energy state, which is accounted for by the lowest enthalpy with one degree of freedom for a fixed configuration for the native B cell epitope. Thus, the underlying thermodynamic force govern one unique folding of the peptide sequences given in the context of highly thermostable molecular chaperone providing the super-b flanking strands (Chen, S.-S., 2014. U.S. Pat. No. 8,865,179; Chen, S.-S., 2015. U.S. Pat. No. 9,187,553). As shown in the preliminary results, the four spatial dis-contiguous IgE B-cell epitopes folding within the CDR constrained by VHH exhibits binding comparably to FceRIa, therefore properly constrained RBE by camelid will exhibit binding to ACE2 receptor, and the antigenicity.


c. Re-shaped conformation for a better fit: As shown in the Preliminary results, we have in place a proprietary ribosome display technology, which can positively select conformation of the mutated molecular clamped RBE via error prone PCR for a best fit conformation with high affinity to huACE2 receptors. Therefore, mutated Antigenic RBE B cell epitopes-mRNA-Ribosome (ARM) ternary complexes, exhibiting a better fit to the receptors can be selected onto the Dynabeads covalently coupled with lower threshold of huACE2 or in the presence of excess first generation of RBE-immunoglobulin heavy chain for competitive selection, streamlined for potency antigenicity and immunogenicity.


Example Four

Embodying Precision Vaccines that Predict and Protect Against Emerging Covid-19 Variants Using Ribosome Display and Darwinian Molecular Evolution


(i) sequence data L vs S type among 103 strains showed two signature SNP at ORF1ab and ORF8 related to replicase and ER stress ATF6 (Tang, X., 2020. Nat Sci Rev). A new observation showed that a number of stains exhibit SNP at the S protein, alarmingly varying even with one amino acid missense SNP at the spike protein causes conspicuous pathogenicity (Yao, H., 2020. medRxiv, 2020.2004.2014.20060160). Thus, given cumulatively (December/2019 to April/2020, GISAID) 3,123 viral genomes known to date, pertinent information and insight can be continually accumulated. Therefore, we hypothesize that critical and cumulative adaptive changes at the S1-ACE2 interface, S1-S2, and S2 crucial sites of mutants facing herd immunity has the likelihood of breakthrough in renewed endemic, epidemic or pandemic proportion as a ‘season’s flu. Alternatively, the source strains can continue unabated via cultural culinary habits and history repeated. Herein we proposed a safe method to predict new strains by ribosome display platform using the entire S protein/ACE2 forced Darwinian evolution applying selective pressure using herd immunity as well as three firewalls of neutralizing antibodies in Aim 2.


(ii) Host range adaptation-lesson learned There have been impressive comparative data (including 4× ray cocrystals) of the RBE binding sites of different hosts, bats, humans, palm civets, pangolins having different adaptive binding motifs to the respective host ACE2. Moreover, the overall binding motif between CoV-1 and CoV-2 S protein is highly conserved around ACE2 contour region beside the intimate contact amino acid five (CoV-2/Cov-1: L455/Y442; F486/L472; Q493/N479; N501/T487, K417/V404; hence conveying the prowess of plastic adaptive binding extraordinaire) (Lan, J., 2020. Nature 581, 215-220). The evolution force behind this molecular fact can be interpreted as follows, despite the variability at the key contact, the evolutionarily conserved 14 ‘shared’ amino acids, amongst eight identical invariable amino acids, and the rest of six conserved amino acid changes (can still make conserved amino acid changes to meet the free energy change of five hypervariable amino acids), can constrain the five highly variable amino acids into spatial arrangement such that the any evolutive combination of the 19 amino acids of S, which contact the 20 amino acids of huACE2, is at the lowest free energy state.


As a result, one anticipates that the versatility of the rule of 17: 12 (6+6): the invariant 6 amino acids accompanied by six ‘semi-variable conservative’ 6, along with the five hypervariable amino acids can derive a sizable SARS CoV-2 variants, and impressive repertoire of CoV-n ready for continual selection upon human ACE2 selection: (i) purposefully to evade existing neutralizing antibody-based herd immunity; (ii) purposefully in an intermediate host to maximize zoonotic ecological, niche expansion. The proposed Aim 2 focus on selecting the S breakthrough mutants, and the vaccines that can be preempted thereof.


A SARS-CoV-2 variant carrying the Spike protein amino acid change D614G has become the most prevalent form in the global pandemic. Dynamic tracking of variant frequencies revealed a recurrent pattern of G614 increase at multiple geographic levels: national, regional, and municipal. The shift occurred even in local epidemics where the original D614 form was well established prior to introduction of the G614 variant. The consistency of this pattern was highly statistically significant, suggesting that the G614 variant may have a fitness advantage. The G614 variant grows to a higher titer as pseudotyped virions. In infected individuals, G614 is associated with lower RT-PCR cycle thresholds, suggestive of higher upper respiratory tract viral loads, but not with increased disease severity. These findings indicate changes important for a mechanistic understanding of the virus and support continuing surveillance of Spike mutations to aid with development of immunological interventions (Korber et al., 2020, Cell 182, 812-827).


Reiterative Rounds of Darwinian Selection in the Presence of Selective Pressure

The natural selection must be based on the intact whole S gene, e.g., the prefusion S ectodomain aa 1-1208 (GenBank MN908947), piggyback on the stuff protein (deleted termination codon for ribosome attachment, forming ternary complexes) for re-iterative rounds of Darwinian selection (two US patents awarded the platform). The binding of the RBE (RBM)/RBD of the S protein (not using RBD or subunits) is influenced by NTD and S2 domain reflecting the natural binding/selection, for huACE2 receptor on the solid phase. The scenario is that re-emergence of the new CoV-2 strain or CoV-n in the presence of herd immunity to be accompanied expediently via mutating the RBE sufficiently in the total 18 amino acid residues, and since NTD (within CTD1) and S2 contribute to post-binding event, which need be included in order to bypass or evade the herd immunity.


This can be accomplished in three steps using ribosome display integrated with molecular evolution:


(i) First, selecting high affinity S-mRNA-ribosome ternary complexes (SMR) binding to the solid phase ACE2 in the presence of increasing amounts of competing ligands: S protein (non-mutated), RBD fragment, or the RBE clamped in the VHH constructs. Via RT-PCT, mRNA carried these selective mutations can be obtained as a mutated gene, which can in turn be subjected to re-iterative rounds of mutations if needed. (Aim 1, preliminary results) (Yang, Y.-M., 2007. B.B.R.C. 359, 251-257; Chen, S.-S., 2008. Biochem BioPhys Res Comms 374, 409-414; Chen, S.-S., 2008. BBRC 374, 409-414; Chen, S.-S., 2008. FASEB J 22, 1075.1011; Chen, S.-S., 2015. UPSTO Confirm #6722);


(ii) Second, by selecting high affinity SMR in plates coated with diminished levels of ACE2.


(iii) Third, and importantly, selection of SMR is performed in the presence of neutralizing antibodies (Aim 1) or convalescent sera showing sterile herd immunity. Fourth, it is inexplicable for origin of the four identical HIV sequences inserted around NTD: three identical stretches of gp120 sequences at N-terminus of NTD, and one identical Gag sequences post RBD. Selection of SMR will be performed in the presence of anti-HIV sequence antibody (made in Aim 1), which may affect orientation of open vs. close RBD, and the folding of S2, which diminish an overall RBD binding to huACE2. The escape mutants of these HIV sequences may exhibit synergy in ACE2 binding due to conformational changes at the NTD and S2 region due to negative selection by anti-HIV antibodies.


The above selection can be accomplished using the mutant library: Mutation is randomly introduced into the S protein by using error-prone, low fidelity Taq PCR under Mg2+/Mn2+ stress (0.1 U/μl in the presence of high Mg2+ at 7 mM and Mn2+ at 0.5 mM), or further aided by 6-oxo-guanidine (0.2-1 mM nucleotide), mutations will be introduced into RBE/RBD, and the supportive NTD and S2 region equally distributed and the four receptor binding IgE moieties of the DE-FG-VHH or other construct candidates. Several types of mutant libraries with mutation percentages ranging from 0.5% to 5% will be pooled (Drummond, D. A., 2005. JMB 350, 806-816). Selection of the better fit in reiterative Darwinian rounds of selection will be carried out by the capture of the ternary complexes to ACE2 receptor solid phase under negative selection pressure in the presence of (i) RBE/RBD ligands, (ii) diminishing levels of receptor on solid phase resulting in more efficient high affinity binders to gain super-infectivity, alternatively in the presence of firewall specific neutralizing antibodies against RBD, or the convalescent sera, resulting in mutants overcoming the herd immunity.


The affinity of binding (Kd) to ACE2 will be determined using plasmon resonance by Biacore in the laboratory. To further improve receptor-binding affinity, DNA shuffle, routinely performed in the laboratory will be used to mimic genetic recombination and reassortment after viral mutation and selection (Footnote: Dr. Stemmer, 1994 Nature first described Darwinian molecular evolution using error prone and DNA shuffle for evolving agriculture enzymes in his humble small business concern (Stemmer, W. P., 1995. Science 270, 1510; Stemmer, W. P., 1994. PNAS 91, 10747-10751; Stemmer, W. P., 1994. Nature 370, 389-391). Since he passed away, the Nobel 2018 Chemistry awarded to Dr. FH Arnold at CIT (Nobel citation along with two phage display scientists) for using this technique for agriculture enzyme molecular evolution). High binding DNA from the above higher binder S candidates will be digested with DNase I and 100 bp fragments will be purified, shuffled/recombined, amplified with high fidelity pfu PCR, and assembled using terminal primer-based PCR reactions (Chen, S.-S., 2014. U.S. Pat. No. 8,865,179). Hundreds of shuffled library-transformed colonies in pET11b will be selected screening for final candidates (Yang, Y.-M., 2007. B.B.R.C. 359, 251-257; Chen, S.-S., 2008. Biochem BioPhys Res Comms 374, 409-414; Chen, S.-S., 2008. BBRC 374, 409-414; Chen, S.-S., 2008. FASEB J 22, 1075.1011; Chen, S.-S., 2015. U.S. Pat. No. 9,187,553) exhibiting high-affinity binding to ACE2 coated 96 well plates. The pertinent mutated sequences for RBE/RBE, NTD, furin site sequences, will be ascertained, ready for molecular clamping in camelid preemptive for emerging viral escape mutants.


Further Considerations:

a. Advantage: X-ray cocrystals typically use RBD or CTD1 fragment and truncated ACE2 domain. In contrast, we employ the complete S protein and ACE2 protein in the modality of ‘natural selection’; the complete protein ligand/receptor set therefore takes advantage of a full fledge conformational changes, wherein the mechanical torque force (generated post-binding of multivalent trimer/rigid receptor dimers) propagates onto the covalent S2, exposing the furin sites at S1-S2, and S′, mimicking a natural selection for viral host interactions. Selection of RBD variant retaining S2, e.g, use the complete S gene in our display setting simulates the state of binding under natural infection circumstances prior to cleavage.


b. Disadvantage: One vs. three and dangling/noncleaved S2 However, the ARM complexes consisting of a ternary complex exhibiting only one protomer, not the natural homotrimer, which impose a higher avidity toward the natural dimeric huACE2 (Yan, R., 2020. Science 367, 1444); moreover, dimeric ACE2 receptor coating can be improved by using full length ACEs chaperoned by amino acid transporter as the coating reagent. Nevertheless, the weakness of the chemical display is not endowed the biological, proteolytic cleavage of S2 by host enzymes: furin, serine protease, cathepsin B/D at plasma membrane post the tectonic conformational S2-altered conformational landscape.


Example Five

Embodying Precision Vaccines Against Covid-19 Pseudotype in huACE2+Cells as Well as Human ACE2+Tg Animals


Neutralizing antibodies elicited by the precision vaccine should abrogate infectivity of SARS CoV-2 in ACE2+ cells or Tg animal expressing huACE2. Viral pseudotypes using the rVSV delta G was widely investigated for complementing influenza, rabies, Ebola, Dengue, MERS (Garbutt, M., 2004. J Virol 78, 5458-5465; Millet, J. K., 2016. Bio Protoc 6, e2035), as well as SARS CoV-1 S (Kapadia, S. U., 2008. Virology 376, 165-172) for studying inhibition of vial entry by pseudotype-specific neutralizing antibodies. The rVSV delta G retaining only 16 amino acids (the stem) 5′ to the transmembrane domain of G protein gene is competent for viral budding, wherein the stem is fused to GFP for monitoring infectivity (Dr. Whitt) (Fredericksen, B. L., 1995. J Virol 69, 1435-1443). Herein we construct CoV-2 S rVSV delta G two component systems for infecting ACE2+cell lines, and Tg. And we will evaluate whether neutralizing antibodies elicited by the precision firewall vaccines can abort/abrogate CoV-2 pseudotype infection in huACE2+ cells in vitro, as well as in huACE2 Tg animals (one round) in vivo. These two criteria will serve as the in vitro and in vivo clinical correlates of vaccine efficacies, respectively. Thus,


1. Viral Pseudotype and ACE2+Cell Cultures:


Recently, using a two-component system for CoV-2 S pseudotype was reported (Ou, X., 2020. Nat Commun 11, 1620). Thus, we will clone CoV-2 S protein truncated of 19 amino acids (S delta 19) (Ou, X., 2020. Nat Commun 11, 1620) on the C-terminus into CMV-pCAGGS-G. CoV-2 S gene (synthesized by Biomatik, Inc. modified delta 19) will be cloned into pCAGGS vector with CMV promoter, with optimized Kozak sequences. huACE2+BHK-21 or HEK 293 transfected with truncated S will then be super-infected with the high titer viral stock of rVSV delta G_G* (rVSV genome with GFP cloned in MCS, deleting G protein ectodomain, and phenotypically transiently complemented with G protein (for entry into packaging cell only without being part of the pseudotype) (a generous gift from Dr. M A Whitt). Therefore, we will expediently produce the CoV-2 pseudotype using the two-component system, e.g., CoV-2 S delta 19 pseudotype packaged in rVSV delta G_G* in the transfected HEK 293 cells at the BSL-2 levels (Whitt, M. A., 2010. J Virol Methods 169, 365-374). Next, we will then evaluate whether the three firewall neutralizing antibodies abrogate the infectivity of CoV-2 S pseudotype in susceptible huACE2+BHK-21 or HeLa cell, and TCID50 and cytopathic effect will be examined by microscopy, and percentage of infected cells by GFP based immunofluorescence using Nikon laser-based epifluorescence, and by FACS analysis. We anticipate that the hyper-immune sera from type 1 precision anti-RBE/RBD vaccine(s) will diminish the above criteria up to 90%; and we anticipate up to 70% inhibition by neutralizing antibodies to second and third firewall. IgG in the serum (IM, subcut.) vs. IgA in the BALF (IN) by ELISA will be correlated to neutralizing titers using CoV-2 S pseudotype infection. Since viral entry relies on three successive pivotal events: ACE2 attachment, furin cleavage and fusion attachment, we anticipate pseudotype infection in the presence of all three firewall antibodies, can reach 95˜99% according to GFP fluorescence and TCID50.


Furthermore, pulse-chase metabolic labeling analysis will be conducted to monitor (the one round of) diminished viral replication (pseudotype being a defective particle, capable of only one round of replication) to corroborate viral neutralization by GFP and cytopathic effect, cells will be pulsed with S35 methionine, and de novo viral protein synthesis and replication in infected cells will be monitored, analyzed on denatured and native PAGE gel using cell lysates: native gel will reveal S homotrimer; moreover, CoV-2 S protein will be immunoprecipitated with anti-S prepared in the laboratory. By preventing entry, we anticipate that triple firewall antibodies will conspicuously also attenuate all the metabolically synthesized rVSV proteins.


Moreover, for large supplies, the rVSV (AG-P/M-MCS2-2.6) and packing vectors have been obtained from the Kerafast lab (prepared by and then supplied by Dr. M A Whitt), and helper plasmids for P, L, M, N, and G with T7, and T7 constitutive cell lines. Dr, Whitt will advise and assist in validating the preparation by the small concern with provided SOPs. As described, CMV-pCAGGS-CoV2 S delta 19 vector along long with helper plasmids will be transfected into HEK293 with CMV-pCAGGS-G, providing G* protein in trans and rVSV delta G culminating in the efficient assembly of the CoV-2 S pseudotype plaques, repeatedly handpicked for high titers, concentrated by centrifugation (˜109 per ml) for the following animal experiments.


2. Animal Model: Neutralization of CoV-2 S Pseudotype in the Lung of huACE2 Tg Mice


Human ACE2 transgenic animals B6.Cg-Tg (K18-S2)2Prlmn/J (Jackson lab) will be purchased for in vivo protection of rodent lung tissue with huACE2 transgene. Each group of three will be vaccinated with the respective layer of precision firewall vaccine or in four different combination (1st+2nd; 1st+3rd; 2nd+3rd; and 1st+2nd+3rd) vs VHH control three times via the IN for mucosal IgA pulmonary protection, and the SC or IM route for the IgG comparative purpose. Routes of vaccination is the pillar for the success of vaccines because the route is sentineled by the resident classical dendritic cells, subsequent chemokine-mediated migration of cDCs and resident and migratory CD4 helper and Treg to different compartments of lymphoid tissues, in particular the mucosal pulmonary tissues for Covid-19 protection. And 107 particle CoV-2 pseudotype will be introduced IN vs. IM/SC to normal vs. huACE2 Tg mice.


Cov-2 B Cell Vaccines Offering Protection:


One week later, mice will be sacrificed, and single cell suspension of lung cells will be examined for GFP by fluorescent microscopy, FACS analysis according to viral encoded GFP marker and biomarker for lung cell types: epithelial, endothelial, CD45+lymphocytes, type I and II alveolar epithelial cells, alveolar macrophages. Individual serum and BALF will be collected for different firewall-specific IgG and IgA antibodies. Residual surviving or the compromised viral pseudotype viral particles in the BALF fluid in vaccinated vs control mice will be assayed for reinfection in huACE2+BHK-21 and HeLa cells. The accelerated degradation of the infectivity of viral particles in vaccinated animals will be attributed to the neutralizing titers elicited by the above vaccination strategy with different precision vaccines, and with respect to the protective ratios of monomeric IgA vs IgG vs mixed IgA and IgG (in the serum), as well as the ratios of monomeric IgA vs secretory dimeric IgA in BALF. “The first pass” of pseudotype in animal protection in vaccinated vs. normal is therefore ascertained. Due to the affordability of normal mice, the role of antibody isotypes, and different routes-offered protection and mucosal immunity, can be comprehensively conducted. The comparison will provide relevant insight of the molecular clamped precision vaccines in this project over the entire S protein vaccines using DNA, mRNA (RNA-1273, Moderna), recombinant proteins, or CoV-2 pseudotype (J&J) via IM.


High titers and memory: Precision vaccines building the shell of three firewalls that elicit high titers neutralizing antibodies may offer sterile immunity to Covid-19. Completing Aim 1 to Aim 3 should provide powerful antibody-mediated vaccines that break the transmission of the pandemic Covid-19. However, a precaution should be in place that any antibodies not the neutralizing antibodies or low titers offer no comfort that the population at risk is averted, or the patients recovered with (any) antibodies may be re-infected. Therefore, optimizing protective mucosal IgA in high titers as well as recalling memory IgA or IgG are important to be validated in this Aim.


CoV-2 and CoV-n CTL vaccines offering protection: Furthermore, the cell-mediated mucosal and systemic immunity non-mutated or mutated sequences of ORF1a, ORF-1ab, leader protein, nsp-2, nsp-3, nsp-4, 3C-like proteinase, nsp-5, nsp-6, nsp-7, nsp-8, nsp-9, nsp-10, RNA-dependent RNA polymerase, helicase, 3-45′ exonuclease, endoRNAse, 2-O′ ribose methyltransferase, Surface Glycoprotein (S), ORF-3a, E, M, ORF-6, N, ORF-7a, ORF-7b, ORF-8, which also play a key role in eliciting “neutralizing” or ‘surgical’ CTL to sterilize and remove early infected foci, viral load with the precaution of not causing a concomitant cytokine storm.


The delivery can be via four forms: (i) Peptide in liposomes and ionizable liposomes: vaccine peptides plus promiscuous helper peptides in said liposomes; (ii) Peptide in protein scaffolds as DNA vaccines in said liposomes: vaccine peptides are inserted into the loop regions constrained by a pair of beta strands, a pair of alpha helices or a mixed of beta strand and alpha helix of the said protein scaffolds, and additional helper peptides sequences cloned in the scaffolded regions, or at the N- or C-terminus. Recombinant DNA can be used for DNA immunization.


(iii) Peptide in protein scaffolds as RNA vaccines in said liposomes: the constructs can be arranged in the following three manners for generating RNA as RNA vaccines. In modality one: candidate genes including but not limited to such as substituted vaccine peptides/scaffold, are followed the Mary Kozak translation sequence; GM-CSF, which can be substituted with another effector or suppressive cytokine or combinations of cytokines as CTL adjuvant is one example only including but not limited to microbial or viral protection and prevention of cytokine storms; and two different UTR are included followed by optimal length of poly(A), and cloned into a conventional vector. In second motif, one of the UTR is moved prior to the candidate peptide vaccine. In the third motif, multiple CoV-2 CTL vaccine peptides or other protective epitopes are in tandem and interspersed with promiscuous helper peptides for CD4 helper T cell induction.


(iv) Bifunctionality: B cell epitopes or using truncated CoV-2 proteins, for example different portions or functional segmental domains of the S protein, inserted either in replaced loop regions scaffolded by the adjacent constraining secondary structures, attached to the N- or C-terminus of the protein scaffolds in aforementioned (i) to (iii). Noticeably, RNA vaccines-encoded proteins are produced in the cytosol, functionally assembled in 3-D conformation to elicit B cell and T cell responses. Furthermore, the RNA vaccines synthesized in the cytosol via released from endosomes, or into ER and Golgi can be degraded into peptide fragments and are utilized in the MHC pathways to be presented for induction of CoV-2 CTL or for helper T cells. Thus, the RNA vaccine embodied in the invention will yield immunogenic and/or suppressive tolerogenic RNA vaccines in not only B cell epitopic protective antibody responses but also T cells protective responses, resulting in protection from microbial infection and cytokine storms from the inserted B cell and T cell epitopes in the loop regions of the protein scaffolds.


Example Six

Embodying Three Designs for Cov-2 RNA Vaccines: First and Second Design Motifs Offer Both B Cell Epitopic as Well as CTL Nonameric (+1/−1), which are Processed Via the ER Stress Pathways.


The three prototype vaccines (SEQ ID: 1175; SEQ ID: 1177; SEQ ID: 1179) for accommodating the larger protein and peptide fragments are listed below; and the three filled-in examples (complete) with the candidate genes are chosen from the human IgE-mediated allergy therapeutic and tolerogenic sequences are listed in the Sequence Files as SEQ ID: 1176; SEQ ID: 1178; SEQ ID: 1180.














First Motif:


Candidate gene-GMCSF-mRNA I Vaccine:


Kpn I-Kozak-candidate genes-linker=GMCSF-UTR-UTR-Poly(A)-Spa I-Xba I


CTGGTACCGCCACCATGATCGATCGATCGA


GTGGCGGCGGAGGATCCatgtggctgcagaatttacttttcctgggcatt


gtggtctacagcctctcagcacccacccgctcacccatcactgtcacccg


gccttggaagcatgtagaggccatcaaagaagccctgaacctcctggatg


acatgcctgtcacgttgaatgaagaggtagaagtcgtctctaacgagttc


tccttcaagaagctaacatgtgtgcagacccgcctgaagatattcgagca


gggtctacggggcaatttcaccaaactcaagggcgccttgaacatgacag


ccagctactaccagacatactgccccccaactccggaaacggactgtgaa


acacaagttaccacctatgcggatttcatagacagccttaaaacctttct


gactgatatcccctttgaatgcaaaaaaccaggccaaaaaTAAagctcgc


tttcttgctgtccaatttctattaaaggttcctttgttccctaagtccaa


ctactaaactgggggatattatgaagggccttgagcatctggattctgcc


taataaaaaacatttattttcattgcagctcgctttcttgctgtccaatt


tctattaaaggttcctttgttccctaagtccaactactaaactgggggat


attatgaagggccttgagcatctggattctgcctaataaaaaacatttat


tttcattgcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaGcatatgactA


aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa


aaaaaaaaaaaaaaaaaaaGAAGAGCTCTAGATG (SEQ ID: 1175)





Second Motif


Candidate gene-mRNA Vaccine II


Kpn I-5UTR-Candidate gene(s)3UTR-Poly(A)-SpaI-Xba I


CTGGTACC


TCTCAACACAACATATACAAAACAAACGAATCTCAAGCAATCAAGCATTC


TACTTCTATTGCAGCAATTTAAATCATTTCTTTTAAAGCAAAAGCAATTT


TCTGAAAATTTTCACCATTTACGAACGATAGCCandidate


genesCTGGTACTGCATGCACGCAATGCTAGC


TGCCCCTTTCCCGTCCTGGGTACCCCGAGTCTCCCCCGACCTCGGGTCCC


AGGTATGCTCCCACCTCCACCTGCCCCACTCACCACCTCTGCTAGTTCCA


GACACCTCCCAAGCACGCAGCAATGCAGCTCAAAACGCTTAGCCTAGCCA


CACCCCCACGGGAAACAGCAGTGATTAACCTTTAGCAATAAACGAAAGTT


TAACTAAGCTATACTAACCCCAGGGTTGGTCAATTTCGTGCCAGCCACAC


CCTCGAGCTAGCaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaGcatatga


ctAaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa


aaaaaaaaaaaaaaaaaaaaaaGAAGAGCTCTAGATG (SEQ ID: 1177)





Third motifs


Candidate genes-mRNA Vaccine III


Kpn I-SIGNAL-monamer-Padre/helper-nonamer-Measle/helper-nonamer-P2A-


GMCSF-UTR-UTR-Poly(A)-Spa I-Xba I


CTGGTACCGCCACCATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTG


CTCCTCCTGTGGGTGCCCGGGTCCAGAGGAaacccgagaggggtgagcgc


ctacctaGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTcacc


cccacctgcccagggccctcatgATTTCGATCTCCGAGATCAAGGGAGTC


ATCGTGCACAAAATCGAGACCATCCTCTTCgtgactctgggctgcctggc


cacgggctacCATCATCATCATCACCATTGAGGAAGCGGAGCCACGAACT


TCTCTCTGTTAAAGCAAGCAGGAGATGTTGAAGAAAACCCCGGGCCTatg


tggctgcagaatttacttttcctgggcattgtggtctacagcctctcagc


acccacccgctcacccatcactgtcacccggccttggaagcatgtagagg


ccatcaaagaagccctgaacctcctggatgacatgcctgtcacgttgaat


gaagaggtagaagtcgtctctaacgagttctccttcaagaagctaacatg


tgtgcagacccgcctgaagatattcgagcagggtctacggggcaatttca


ccaaactcaagggcgccttgaacatgacagccagctactaccagacatac


tgccccccaactccggaaacggactgtgaaacacaagttaccacctatgc


ggatttcatagacagccttaaaacctttctgactgatatcccctttgaat


gcaaaaaaccaggccaaaaatgaCGGAATTCCGTAAagctcgctttcttg


ctgtccaatttctattaaaggttcctttgttccctaagtccaactactaa


actgggggatattatgaagggccttgagcatctggattctgcctaataaa


aaacatttattttcattgcagctcgctttcttgctgtccaatttctatta


aaggttcctttgttccctaagtccaactactaaactgggggatattatga


agggccttgagcatctggattctgcctaataaaaaacatttattttcatt


gcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaGcatatgactAaaaaaaa


aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa


aaaaaaaaaaaaGAAGAGCTCTAGATG (SEQ ID: 1180)









The RNA vaccines proposed are unique in taking the loop antigenic B and T cell epitopes or truncated polypeptide alone or onto the protein scaffolds (FIG. 1) according to the said bifunctionality embodiment (EXAMPLE 5 (vi)) or AR of the following translated proteins (motif 1 and 2); alternatively nonamer (+1/-1), arranged in tandem in motif3: ORF1a, ORF-1ab, leader protein, nsp-2, nsp-3, nsp-4, 3C-like proteinase, nsp-5, nsp-6, nsp-7, nsp-8, nsp-9, nsp-10, RNA-dependent RNA polymerase, helicase, 3->5′ exonuclease, endoRNAse, 2-0′ ribose methyltransferase, surface glycoprotein/spike (S), ORF-3a, E, M, ORF-6, N, ORF-7a, ORF-7b, ORF-8, nucleocapsid (NC).

Claims
  • 1. We claim the B cell antigenic epitopes of the CoV-2 surface spike protein (S), ORF1a, ORF-1ab, leader protein, nsp-2, nsp-3, nsp-4, 3C-like proteinase, nsp-5, nsp-6, nsp-7, nsp-8, nsp-9, nsp-10, RNA-dependent RNA polymerase, helicase, 3→5′ exonuclease, endoRNAse, 2-O′ ribose methyltransferase, ORF-3a, E, M, ORF-6, N, ORF-7a, ORF-7b, ORF-8, nucleocapsid, and CoV-n thereof that are conformationally constrained by, and replacing the native loop constrained by the beta strands of the scaffold proteins such as lipocalin, GFP, fibronectin, and CDR regions delineated by the beta strands of the immunoglobulin light and heavy chains.
  • 2. A claim to claim 1, wherein the B cell antigenic epitopes of the CoV-2 spike protein comprise but not limited to SEQ ID:1 to SEQ ID: 42; SEQ ID: 52 to SEQ ID: 59; SEQ ID: 282 to SEQ ID: 313; SEQ ID: 293 to SEQ ID:296; SEQ ID: 310 to SEQ ID:312.
  • 3. A claim to claim 2, wherein the mutant variants of B cell epitopes, in a molecularly mutated library and Darwinianly selected on solid phase ACE2 by via ribosome display for high affinity or high avidity binding.
  • 4. A claim to claim 1, wherein the RNA immunoregulatory vaccine design templates not limited to SEQ ID: 1175 to SEQ ID: 1180, were employed for said Covid-2 B cell epitopes, whereby immune protection and dampening cytokine storms are ensured.
  • 5. We claim the CTL antigenic epitopes of the CoV-2 surface spike protein (S), ORF1a, ORF-1ab, leader protein, nsp-2, nsp-3, nsp-4, 3C-like proteinase, nsp-5, nsp-6, nsp-7, nsp-8, nsp-9, nsp-10, RNA-dependent RNA polymerase, helicase, 3→5′ exonuclease, endoRNAse, 2-O′ ribose methyltransferase, ORF-3a, E, M, ORF-6, N, ORF-7a, ORF-7b, ORF-8, nucleocapsid, and CoV-n variants thereof, wherein these CTL epitopes are nonamer, and can be octameric (−1) or decameric (+) peptides; the nonameric, octameric and decameric peptides are used alone or in combination as peptide vaccines; or inserted into the loop constrained by the beta strands of the scaffold proteins such as such as lipocalin, GFP, fibronectin, and CDR regions delineated by the beta strands of the immunoglobulin light and heavy chains.
  • 6. A claim to claim 5, wherein the nonameric peptides sequences comprise but are limited to SEQ ID: 60 to SEQ ID: 235 of spike protein; SEQ ID: 236 to SEQ ID: 280 for NC protein; SEQ ID: 314 to SEQ ID: 395 of ORF1a, whereby the CTL epitopes can counter antibody-selected CoV-2 viral mutants, or CoV-n.
  • 7. A claim to claim 5, wherein the nonameric peptides sequences comprise but are limited to SEQ ID: 396 to SEQ ID: 571 of rdrp protein; SEQ ID: 572 to SEQ ID: 699 for helicase protein; SEQ ID: 700 to SEQ ID: 803 of Exonuclease 5>3; SEQ ID: 804 to SEQ ID: 882 of endoRNAse; SEQ ID: 883 to SEQ ID: 953 for methyltransferase protein; SEQ ID: 954 to SEQ ID: 1014 of membrane protein; SEQ ID: 1015 to SEQ ID: 1044 of envelope protein; SEQ ID: 1045 to SEQ ID: 1112 for ORF-3a protein; SEQ ID: 1113 to SEQ ID: 1125 of ORF-6 protein; SEQ ID: 1126 to SEQ ID: 1135 of ORF-7 protein; SEQ ID: 1136 to SEQ ID: 1174 for ORF-8 protein; whereby the CTL epitopes can counter antibody-selected CoV-2 viral mutants, or CoV-n.
  • 8. A claim to claim 5, wherein the RNA immunoregulatory vaccine design templates not limited to SEQ ID: 1175 to SEQ ID: 1180, were employed for said Covid-2 CTL epitopes, whereby immune protection and dampening cytokine storms are ensured.
  • 9. We claim truncated polypeptides of CoV-2 surface spike protein (S), ORF1a, ORF-1ab, leader protein, nsp-2, nsp-3, nsp-4, 3C-like proteinase, nsp-5, nsp-6, nsp-7, nsp-8, nsp-9, nsp-10, RNA-dependent RNA polymerase, helicase, 3→5′ exonuclease, endoRNAse, 2-O′ ribose methyltransferase, ORF-3a, E, M, ORF-6, N, ORF-7a, ORF-7b, ORF-8, nucleocapsid, and CoV-n variants thereof, that are recombinantly fused to the N-terminus or C-terminus of the protein scaffolds such as lipocalin, GFP, fibronectin, and CDR regions delineated by the beta strands of the immunoglobulin light and heavy chains.
  • 10. A claim to claim 9, wherein the polypeptides comprise but not limited to the following sequences: SEQ ID: 42 to SEQ ID: 51; SEQ ID: 281; SEQ ID: 282 to SEQ ID: 313, wherein the polypeptides are conformationally constrained and present B and CTL cell epitopes; wherein the protein scaffolds provide a CD4 helper costimulation; whereby the protective antibody and CTL responses to CoV-2, or CoV-n are ensured.
  • 11. A claim to claim 9, wherein the RNA immunoregulatory vaccine design templates, not limited to SEQ ID: 1175 to SEQ ID: 1180, were employed for said Cov-2 truncated polypeptides, whereby immune protection and dampening cytokine storms are ensured.
Provisional Applications (1)
Number Date Country
62985792 Mar 2020 US