Method and device for screening antigen epitope polypeptide

SEQUENCE LISTING

The present application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Sep. 13, 2022, is named “PN191892_SZTY Sequence Listing.txt” and is 24626 bytes in size, which is identical to the sequence listing filed in the corresponding International Patent Application No. PCT/CN2021/080636, filed on Mar. 12, 2021.

TECHNICAL FIELD

The present invention relates to the field of immunology, and specifically, to a method and device for screening an antigen epitope polypeptide.

BACKGROUND

Currently, Corona Virus Disease 2019 (COVID-19) caused by Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) infection is wreaking havoc around the world. As of Dec. 11, 2020, globally, there have been 70,714,214 SARS-CoV-2 infections, including 1,588,277 deaths. As the epidemic situation develops rapidly, and no effective drug has yet been found, a specific coronavirus vaccine for infection prevention is the hope of reducing infections and curbing the worsening of the epidemic situation.

Conventional vaccines include live attenuated vaccines, inactivated vaccines, and the like, and a virus strain is required during their preparation. Although their immunogenicity is high, there is a possibility of virulence reversion and a potential pathogenic risk, resulting in relatively low safety. In recent years, various novel vaccines, including DNA recombinant vaccines, synthetic peptide vaccines and the like, have emerged one after another. However, since the vector commonly used by a DNA recombinant vaccine is an adenovirus, a vaccinia virus, or an SV40 virus, there are still some doubts about the in vivo safety of such vectors, so that there is still a great need to develop a safer next-generation vaccine. A polypeptide vaccine is a vaccine that is prepared by chemical synthesis according to the amino acid sequence of a known or predicted antigen epitope in a pathogen antigen gene. Since the polypeptide vaccine is chemically synthesized, the problems of virulence reversion and incomplete inactivation do not arise. In addition, specific antigen epitopes may be selected, so the polypeptide vaccine has become a research hotspot in vaccine development. In a plurality of fields, including tumor vaccines, several studies have been published, and clinical trials are underway as well.

As described above, in view of the current global pandemic of novel coronavirus pneumonia, there is an urgent need to develop corresponding vaccines, especially the polypeptide vaccine.

SUMMARY

The present invention is mainly intended to provide a method and device for screening an antigen epitope polypeptide, so that corresponding polypeptide products can be developed against such a novel virus.

A first aspect of this application provides a method for screening an antigen epitope. The screening method includes: using all proteome sequences of a target coronavirus to perform antigen epitope prediction, to obtain a predicted epitope region; using a polypeptide chip technology to screen a polypeptide with a differential response to a positive serum sample infected by the target coronavirus and a control serum sample, and recording the polypeptide as a differential peptide fragment; aligning the differential peptide fragment with all proteome sequences of the target coronavirus to obtain a first conserved motif region; and screening regions meeting epitope screening conditions from the predicted epitope region and the first conserved motif region to obtain the antigen epitope. The epitope screening conditions include a non-phosphorylation region and/or an extracellular region of the target coronavirus.

Further, the operation of using all proteome sequences of the target coronavirus to perform antigen epitope prediction, to obtain the predicted epitope region includes: using all proteome sequences of the target coronavirus to perform antigen epitope prediction by means of various methods, and screening epitope with a length of 8 to 20, preferably 10 to 15 amino acids, to obtain candidate prediction epitope; screening the candidate prediction epitope according to epitope and/or hydrophilicity-hydrophobicity that HLA is able to present in a specific population, to obtain the predicted epitope region; and preferably, screening, from the candidate prediction epitope, the epitope that the HLA is able to present in a Chinese population, and/or removing, from the candidate prediction epitope, the epitope of which hydrophobicity is higher than a first hydrophobic threshold, to obtain the predicted epitope region. Preferably, the epitope of which hydrophobicity is higher than the first hydrophobic threshold refers to epitope that the proportion of hydrophobic amino acids is greater than 45% and a hydrophobicity score is greater than 3.
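The length and hydrophobicity filters described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the applicant's implementation: the text does not name a hydrophobicity scale, so the Kyte-Doolittle hydropathy scale is assumed here, the "hydrophobicity score" is assumed to be the mean hydropathy of the peptide, and residues with a positive Kyte-Doolittle value are assumed to count as hydrophobic. The function names are hypothetical. Under this assumed scale, the mean value for YTNDKACPL is -0.7111, which agrees with the average hydrophobicity listed for SEQ ID NO:1 in Table 1.

```python
# Kyte-Doolittle hydropathy values (assumed scale; the text does not name one).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Residues counted as hydrophobic (assumption: positive Kyte-Doolittle value).
HYDROPHOBIC = {aa for aa, value in KD.items() if value > 0}


def mean_hydropathy(peptide):
    """Mean Kyte-Doolittle value over all residues of the peptide."""
    return sum(KD[aa] for aa in peptide) / len(peptide)


def hydrophobic_fraction(peptide):
    """Fraction of residues that belong to the assumed hydrophobic set."""
    return sum(aa in HYDROPHOBIC for aa in peptide) / len(peptide)


def is_too_hydrophobic(peptide, max_fraction=0.45, max_score=3.0):
    """True when both limits stated in the text are exceeded."""
    return (hydrophobic_fraction(peptide) > max_fraction
            and mean_hydropathy(peptide) > max_score)


def filter_candidates(peptides, min_len=8, max_len=20):
    """Keep peptides of 8 to 20 residues that are not too hydrophobic."""
    return [p for p in peptides
            if min_len <= len(p) <= max_len and not is_too_hydrophobic(p)]
```

For example, `filter_candidates(["YTNDKACPL", "LLLLIIIIVV", "ACD"])` keeps only the first peptide: the second exceeds both hydrophobicity limits and the third is shorter than 8 residues.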

Further, the operation of using an immune characterization method to screen the polypeptide with the differential response to the positive serum sample infected by the target coronavirus and the control serum sample, and recording the polypeptide as the differential peptide fragment includes: selecting the positive serum sample infected by the target coronavirus, a negative control serum sample and a control serum sample of another lung disease, where the another lung disease refers to a lung disease caused by infection of a virus other than the target coronavirus; using the immune characterization method to combine the positive serum sample, the negative control serum sample and the control serum sample of the another lung disease with a polypeptide array chip, to obtain signal values responsive to combined peptide fragments; for each combined peptide fragment, calculating a p value when there is a difference between the signal value of the positive serum sample and the signal value of the negative control serum sample, recording the p value as a first p value, and simultaneously, calculating a p value when there is a difference between the signal value of the positive serum sample and the signal value of the control serum sample of the another lung disease, and recording the p value as a second p value; and retaining all combined peptide fragments of which first p values and second p values simultaneously meet a difference threshold, to obtain the differential peptide fragment. The difference threshold is preferably <0.05.

Further, log10 conversion is performed on the signal value of the combined peptide fragment, and the converted log value is used as a feature. By means of a one-tailed t-test, the p value of each feature for the difference between the positive serum sample and the negative control serum sample is calculated, and multiple hypothesis test correction is performed on the p value to obtain the first p value; the p value of the corresponding feature for the difference between the positive serum sample and the control serum sample of the another lung disease is simultaneously calculated, multiple hypothesis test correction is performed on the p value, and the corrected p value is recorded as the second p value; and all combined peptide fragments of which the first p values and the second p values are both less than the difference threshold are screened, to obtain the differential peptide fragment.
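The correction and double-threshold step can be sketched as below. This is only an illustration under assumptions: the text does not say which multiple hypothesis test correction is used, so the Benjamini-Hochberg procedure is assumed, and the raw p values are assumed to have been computed beforehand (for example by a one-tailed t-test on the log10 features). All function names are hypothetical.

```python
import math


def log10_features(signals):
    """Log10-transform raw chip signal values (values must be positive)."""
    return [math.log10(s) for s in signals]


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def differential_peptides(peptides, p_vs_negative, p_vs_other_lung,
                          threshold=0.05):
    """Keep peptides whose corrected first and second p values are both
    below the difference threshold."""
    first = benjamini_hochberg(p_vs_negative)
    second = benjamini_hochberg(p_vs_other_lung)
    return [pep for pep, p1, p2 in zip(peptides, first, second)
            if p1 < threshold and p2 < threshold]
```

Requiring both corrected p values to pass means a peptide is retained only if it separates the positive sera from the healthy controls and from the sera of the other lung disease.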

Further, the operation of aligning the differential peptide fragment with all proteome sequences of the target coronavirus to obtain the first conserved motif region includes: using a single amino acid as a unit, calculating a distribution of p1 values where the signal value of the combined peptide fragment covering the amino acid and matching the amino acid differs between the positive serum sample and the negative control serum sample and the control serum sample of the another lung disease, and simultaneously calculating a distribution of p2 values where the signal value of the combined peptide fragment covering the amino acid and not matching the amino acid differs between the positive serum sample and the negative control serum sample and the control serum sample of the another lung disease, where the distribution of p1 values is remarkably lower than the distribution of p2 values, and the amino acid is a first conserved site; and aligning the differential peptide fragment with all proteome sequences of the target coronavirus, and selecting, from matching regions, a region that has the first conserved site and has hydrophobicity lower than a second hydrophobic threshold, to obtain a first conserved motif region. Preferably, the region of which hydrophobicity is lower than the second hydrophobic threshold refers to a region that the proportion of the hydrophobic amino acids is less than or equal to 45% and the hydrophobicity score is less than or equal to 3. Preferably, the differential peptide fragment is a differential peptide fragment that is able to completely match all proteome sequences of the target coronavirus.
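One way to decide whether the distribution of p1 values is "remarkably lower" than the distribution of p2 values is a one-sided rank-sum comparison. The text does not specify the test, so the sketch below substitutes a Mann-Whitney U statistic with a normal approximation (no tie or continuity correction) purely as an assumption; the function names and the 0.05 cutoff are hypothetical.

```python
from statistics import NormalDist


def average_ranks(values):
    """Rank values ascending (1-based); tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while (end + 1 < len(order)
               and values[order[end + 1]] == values[order[start]]):
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def is_conserved_site(p1_values, p2_values, alpha=0.05):
    """True when p1 values (fragments matching the residue) tend to be lower
    than p2 values (fragments covering but not matching the residue).
    Both lists must be non-empty."""
    n1, n2 = len(p1_values), len(p2_values)
    ranks = average_ranks(list(p1_values) + list(p2_values))
    u1 = sum(ranks[:n1]) - n1 * (n1 + 1) / 2
    mean_u = n1 * n2 / 2
    sd_u = (n1 * n2 * (n1 + n2 + 1) / 12) ** 0.5
    z = (u1 - mean_u) / sd_u
    # A small U for the p1 sample gives a strongly negative z.
    return NormalDist().cdf(z) < alpha
```

The normal approximation is rough for very small samples; an exact test would be preferable when only a few fragments cover a site.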

Further, before regions meeting the epitope screening condition are screened from the predicted epitope region and the first conserved motif region, the screening method further includes: comparing the differential peptide fragment with a protein sequence of a coronavirus family to obtain a second conserved motif region. Preferably, the operation of comparing the differential peptide fragment with a protein sequence of the coronavirus family to obtain the second conserved motif region includes: comparing the differential peptide fragment with the protein sequence of the coronavirus family, and selecting, from the matching regions, a region of which amino acid site meets the following region screening condition as the second conserved motif region. In all of the differential peptide fragments covering the amino acids, the ratio of the differential peptide fragments matching the amino acids meets a matching ratio threshold; and preferably, the matching ratio threshold is greater than or equal to 75%.
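The matching ratio condition can be illustrated with a short sketch. The input layout is an assumption made for illustration: for each amino acid site, a list of booleans records whether each differential peptide fragment covering that site also matches the residue there. The function names are hypothetical.

```python
def matching_ratio(fragment_matches):
    """Share of covering fragments that match the residue at this site."""
    return sum(fragment_matches) / len(fragment_matches)


def sites_meeting_ratio(site_to_matches, ratio_threshold=0.75):
    """Return the sites whose matching ratio is at least the threshold.
    Sites with no covering fragment are skipped."""
    return [site for site, matches in sorted(site_to_matches.items())
            if matches and matching_ratio(matches) >= ratio_threshold]
```

For instance, a site covered by four fragments of which three match has a ratio of 75% and is retained, while a site with one match out of two is not.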

Further, the epitope screening conditions include at least one of the following: (a) overlapping with the second conserved motif region; (b) a comparison score with a human proteome sequence being lower than a comparison threshold; and (c) meeting a plurality of the following performance indexes: 1) the covering number of the differential peptide fragment being ≥3; 2) hydrophilicity being within a hydrophilic threshold range; and 3) an accessibility score, a Beta turn and a multi-alignment score being all in the top 100. That the comparison score is lower than the comparison threshold means that a/b≤0.8, where a is a matching score obtained when a sequence of a region to be screened is aligned with the human proteome sequence, and b is a matching score obtained when the sequence of the region to be screened is aligned with all proteome sequences of the target coronavirus. Preferably, the operation of screening regions meeting epitope screening conditions from the predicted epitope region and the first conserved motif region to obtain the antigen epitope polypeptide includes: merging the predicted epitope region and the first conserved motif region according to one of the following merging conditions: 1) there is an inclusion relation between the two regions; and 2) the two regions are predicted as antigen epitope regions by at least two different methods, to obtain a first candidate epitope region; screening a region overlapping with the second conserved motif region from the first candidate epitope region as a second candidate epitope region; screening, from the second candidate epitope region, a region of which comparison score with the human proteome sequence is lower than the comparison threshold, as a third candidate epitope region; screening and retaining the non-phosphorylation region and/or the extracellular region in the proteome sequence of the target coronavirus from the third candidate epitope region, as a fourth candidate epitope region; comprehensively sorting the fourth candidate epitope region according to accessibility, the beta turn, the hydrophilicity, the covering number of the differential peptide fragments and a multi-alignment result, and then performing optimal selection, to obtain the antigen epitope polypeptide of the target coronavirus. More preferably, after optimal selection is performed, the screening method further includes removing a region including mutations. Preferably, the target coronavirus is SARS-CoV-2.
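Three of the conditions above (overlap with the second conserved motif region, the a/b≤0.8 comparison with the human proteome, and a covering number of at least 3) can be sketched as simple interval checks. This is an illustration under assumptions: regions and fragments are represented as inclusive (start, end) positions on one protein, a fragment is taken to "cover" a region when the two intervals overlap, and the alignment scores a and b are assumed to be computed elsewhere. The function names are hypothetical, and the remaining steps (phosphorylation and extracellular filtering, comprehensive sorting) are omitted.

```python
def overlaps(region, other):
    """True when two inclusive (start, end) intervals share a position."""
    return region[0] <= other[1] and other[0] <= region[1]


def passes_human_comparison(a, b, max_ratio=0.8):
    """a: score against the human proteome; b: score against the proteome
    of the target coronavirus. The region passes when a / b <= 0.8."""
    return a / b <= max_ratio


def covering_number(region, fragments):
    """Number of differential peptide fragments overlapping the region."""
    return sum(overlaps(region, f) for f in fragments)


def screen(candidates, second_motifs, fragments, min_cover=3):
    """candidates: list of (region, a, b). Apply the three checks in turn."""
    kept = []
    for region, a, b in candidates:
        if not any(overlaps(region, m) for m in second_motifs):
            continue
        if not passes_human_comparison(a, b):
            continue
        if covering_number(region, fragments) < min_cover:
            continue
        kept.append(region)
    return kept
```

A low a/b ratio means the region resembles the viral proteome much more than the human proteome, which is the property the comparison threshold is meant to secure.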

A second aspect of this application provides a device for screening an antigen epitope polypeptide. The screening device includes: an epitope prediction module, configured to use all proteome sequences of a target coronavirus to perform antigen epitope prediction, to obtain a predicted epitope region; a differential peptide fragment screening module, configured to use a polypeptide chip technology to screen a polypeptide with a differential response to a positive serum sample infected by the target coronavirus and a control serum sample, and record the polypeptide as a differential peptide fragment; a first region screening module, configured to align the differential peptide fragment with all proteome sequences of the target coronavirus to obtain a first conserved motif region; and a third region screening module, configured to screen regions meeting epitope screening conditions from the predicted epitope region and the first conserved motif region to obtain the antigen epitope. The epitope screening conditions include a non-phosphorylation region and/or an extracellular region of the target coronavirus.

Further, the epitope prediction module includes: a first candidate epitope screening module, configured to use all proteome sequences of the target coronavirus to perform antigen epitope prediction by means of various methods, and screen epitope with a length of 8 to 20, preferably 10 to 15 amino acids, to obtain candidate prediction epitope; and a second candidate epitope screening module, configured to screen the candidate prediction epitope according to epitope and/or hydrophilicity-hydrophobicity that HLA is able to present in a specific population, to obtain the predicted epitope region.

Further, the second candidate epitope screening module includes: a population epitope screening module, configured to screen, from the candidate prediction epitope, the epitope that the HLA is able to present in a Chinese population; and/or a hydrophobicity screening module, configured to remove, from the candidate prediction epitope, the epitope of which hydrophobicity is higher than a first hydrophobic threshold, to obtain the predicted epitope region. Preferably, the epitope of which hydrophobicity is higher than the first hydrophobic threshold refers to epitope that the proportion of hydrophobic amino acids is greater than 45% and a hydrophobicity score is greater than 3.

Further, the differential peptide fragment screening module includes a first screening module. The first screening module includes: a sample selection unit, configured to select the positive serum sample infected by the target coronavirus, a negative control serum sample and a control serum sample of another lung disease, where the another lung disease refers to a lung disease caused by infection of a virus other than the target coronavirus; a signal acquisition unit, configured to use an immune characterization method to combine the positive serum sample, the negative control serum sample and the control serum sample of the another lung disease with a polypeptide array chip, to obtain signal values responsive to combined peptide fragments; and a differential peptide fragment screening unit, configured to, for each combined peptide fragment, calculate a p value when there is a difference between the signal value of the positive serum sample and the signal value of the negative control serum sample, record the p value as a first p value, and simultaneously, calculate a p value when there is a difference between the signal value of the positive serum sample and the signal value of the control serum sample of the another lung disease, and record the p value as a second p value; and retain all combined peptide fragments of which first p values and second p values simultaneously meet a difference threshold, to obtain the differential peptide fragment. The difference threshold is preferably <0.05.

Further, the differential peptide fragment screening unit includes: a signal conversion sub-unit, configured to perform log10 conversion on the signal value of the combined peptide fragment; and a differential peptide fragment screening sub-unit, configured to use the converted log value as a feature, calculate, by means of a one-tailed t-test, the p value of each feature for the difference between the positive serum sample and the negative control serum sample, and perform multiple hypothesis test correction on the p value to obtain the first p value; simultaneously calculate the p value of the corresponding feature for the difference between the positive serum sample and the control serum sample of the another lung disease, perform multiple hypothesis test correction on the p value, and record the corrected p value as the second p value; and screen all combined peptide fragments of which the first p values and the second p values are both less than the difference threshold, to obtain the differential peptide fragment.

Further, the first region screening module includes: a conserved site screening module, configured to use a single amino acid as a unit, calculate a distribution of p1 values where the signal value of the combined peptide fragment covering the amino acid and matching the amino acid differs between the positive serum sample and the negative control serum sample, simultaneously calculate a distribution of p2 values where the signal value of the combined peptide fragment covering the amino acid and not matching the amino acid differs between the positive serum sample and the negative control serum sample, and record the amino acid that the distribution of p1 values is remarkably lower than the distribution of p2 values as a first conserved site; and a first conserved motif screening module, configured to align the differential peptide fragment with all proteome sequences of the target coronavirus, and select, from matching regions, a region that has the first conserved site and has hydrophobicity lower than a second hydrophobic threshold, to obtain a first conserved motif region. Preferably, the region of which hydrophobicity is lower than the second hydrophobic threshold refers to a region that the proportion of the hydrophobic amino acids is less than or equal to 45% and the hydrophobicity score is less than or equal to 3. Preferably, the differential peptide fragment is a differential peptide fragment that is able to completely match all proteome sequences of the target coronavirus.

Further, the screening device further includes a second region screening module. Preferably, the second region screening module includes: a comparison module, configured to align the differential peptide fragment with a protein sequence of a coronavirus family; and a second conserved motif screening module, configured to select, from the matching regions, a region of which amino acid site meets the following region screening condition as the second conserved motif region. In all of the differential peptide fragments covering the amino acids, the ratio of the differential peptide fragments matching the amino acids meets a matching ratio threshold.

Further, the matching ratio threshold is greater than or equal to 75%.

Further, the epitope screening condition in the third region screening module 50 includes at least one of the following: (a) overlapping with the second conserved motif region; (b) a comparison score with a human proteome sequence being lower than a comparison threshold; and (c) meeting a plurality of the following performance indexes: 1) the covering number of the differential peptide fragment being ≥3; 2) hydrophilicity meeting a hydrophilic threshold; and 3) an accessibility score, a Beta turn and a multi-alignment score being all in the top 100. That the comparison score is lower than the comparison threshold means that a/b≤0.8, where a is a matching score that a sequence of a region to be screened is aligned with the human proteome sequence, and b is a matching score that the sequence of the region to be screened is aligned with all proteome sequences of the target coronavirus.

Further, the third region screening module includes: a merging module, configured to merge the predicted epitope region and the first conserved motif region according to one of the following merging conditions: 1) there is an inclusion relation between the two regions; and 2) the two regions are predicted as antigen epitope regions by at least two different methods, to obtain a first candidate epitope region; an overlap screening module, configured to screen a region overlapping with the second conserved motif region from the first candidate epitope region as a second candidate epitope region; a comparison screening module, configured to screen, from the second candidate epitope region, a region of which comparison score with the human proteome sequence is lower than a first threshold, as a third candidate epitope region; a non-phosphorylation and extracellular region screening module, configured to screen and retain the non-phosphorylation region and/or the extracellular region in the proteome sequence of the target coronavirus from the third candidate epitope region, as a fourth candidate epitope region; and a comprehensive sorting module, configured to comprehensively sort the fourth candidate epitope region according to accessibility, the beta turn, the hydrophilicity, the covering number of the differential peptide fragments and a multi-alignment result, and then perform optimal selection, to obtain the antigen epitope of the target coronavirus.

Further, the device further includes: a mutation removing module, configured to remove a region including mutations from regions optimally selected by the comprehensive sorting module, to obtain the antigen epitope polypeptide of the target coronavirus.

A third aspect of the present invention provides a storage medium. The storage medium includes a stored program. When the program is operated, a device where the storage medium is located is controlled to execute the method for screening a coronavirus antigen epitope described in any one of the above.

A fourth aspect of the present invention provides a processor. The processor is configured to operate a program. When the program is operated, the method for screening a coronavirus antigen epitope described in any one of the above is executed.

Through the application of the technical solution of the present invention, by innovatively combining the polypeptide chip technology, a batch of polypeptides specifically related to coronavirus infection (especially SARS-CoV-2 infection) is screened out. The polypeptides can be used to prepare related detection reagents such as antigens, antibodies and kits, as well as related vaccine products such as polypeptide vaccines, nucleic acid vaccines and protein recombinant vaccines. Therefore, a more powerful tool can be provided for the prevention and control of the infection and prevalence of such viruses.

BRIEF DESCRIPTION OF THE DRAWINGS

The drawings, which form a part of this application, are used to provide a further understanding of the present invention. The exemplary embodiments of the present invention and the description thereof are used to explain the present invention, but do not constitute improper limitations to the present invention. In the drawings:

FIG. 1 is a schematic flowchart of a method for screening a coronavirus antigen epitope according to a preferred embodiment of this application.

FIG. 2A and FIG. 2B respectively show the activity of serum obtained from mice immunized with different single-peptides against a neutralizing antibody produced by live coronavirus. FIG. 2A shows a detection result under a microscope, and FIG. 2B shows a statistical result.

FIG. 3A, FIG. 3B and FIG. 3C respectively show changes in an antibody signal corresponding to each polypeptide in mice immunized with a combination 1, a combination 2 and a combination 3 with time.

FIG. 4A and FIG. 4B respectively show the activity of serum obtained from mice immunized by a combination 1, a combination 2 and a combination 3 against a neutralizing antibody produced by live coronavirus. FIG. 4A shows a detection result under a microscope, and FIG. 4B shows a statistical result.

FIG. 5A to FIG. 5J respectively show changes, with time, in antibody signals corresponding to 4 polypeptides of each mix after mice are immunized with Mix1 to Mix10.

FIG. 6A to FIG. 6F show antibody production at different time points after 7 peptides are co-immunized with each adjuvant in mice.

FIG. 7 is a block diagram of a hardware structure of a method for screening an antigen epitope polypeptide according to an embodiment of the present invention.

FIG. 8 is a schematic structural diagram of a device for screening an antigen epitope polypeptide according to a preferred embodiment of this application.

DETAILED DESCRIPTION OF THE EMBODIMENTS

It is to be noted that the embodiments in this application and the features in the embodiments may be combined with one another without conflict. The present invention will be described below in detail with reference to the embodiments.

TERM EXPLANATION

“Corona Virus Disease 2019” or COVID-19 in this application refers to the disease that occurs in a patient after infection with the SARS-CoV-2 virus (also called the novel coronavirus in this application), that is, the novel coronavirus pneumonia.

An antigen epitope, also called an antigenic determinant, is a special chemical group with a certain composition and structure on the surface or other parts of an antigenic substance molecule, and is a structure that can specifically bind to corresponding antibodies or sensitized lymphocytes. During an immune response, the epitopes identified by the antigen receptor TCR of a T cell and the antigen receptor BCR of a B cell have different characteristics, and are respectively called a T cell epitope and a B cell epitope. The T cell epitope is generally not located on the surface of an antigen molecule, and can only be identified by the TCR after the antigen is processed by an antigen-presenting cell into small molecular polypeptides and combined with an MHC molecule. The T cell can only identify the processed epitope. The B cell epitope may exist on the surface of the antigen molecule, and may be directly identified by the B cell without being processed. In this application, the epitope refers to one or more predicted or screened peptide fragments that can specifically bind to the antibody.

Polypeptide refers to any one predicted or screened peptide fragment that can specifically bind to the antibody or the sensitized lymphocyte.

Polypeptide-carrier protein conjugate refers to an antigen that is formed by coupling the polypeptide and a carrier protein. One carrier protein may be coupled to one or more polypeptides. When a plurality of polypeptides are coupled, the plurality of polypeptides have a same amino acid sequence. According to a difference in physical and chemical properties of a specifically coupled polypeptide sequence, different types of specific carrier proteins and different coupling methods, the number of the polypeptides coupled to each carrier protein is different, and in this application, is preferably 2-50, and more preferably, 3-45, 5-40, 5-35, 5-30, 8-30, 10-30, 12-30, or 15-30; or further preferably, the number is any one of 6-36, 8-32, 10-28, 10-26, 10-24, 10-22, 10-20, 10-18, 10-16, or 10-15.

Antigen refers to all substances that can induce an immune response in an organism, that is, substances that can specifically bind to the antigen receptor (TCR/BCR) on the surface of the T/B lymphocyte, activate the T/B cell to cause the T/B cell to proliferate and differentiate, so as to produce an immune response product (sensitized lymphocyte or antibody), and can specifically bind to the corresponding product in vitro and in vivo. Therefore, the antigen has two important properties: immunogenicity and immunoreactivity. The antigen in this application refers to a complete antigen with immunogenicity that is formed after a polypeptide hapten is coupled to the carrier protein, which may be the polypeptide-carrier protein conjugate that is formed by coupling the polypeptide of a single amino acid sequence to the carrier protein, or may be a composition of the polypeptide-carrier protein conjugates that are formed by coupling the polypeptides with various different amino acid sequences and the carrier proteins.

A vaccine usually has both immunogenicity and reactogenicity. Immunogenicity refers to the ability to stimulate the organism to produce an immune response, that is, the ability to stimulate the organism to produce specific immune cells, cause the immune cells to activate, proliferate and differentiate, and finally produce an immunologic effector substance, namely a specific antibody or a sensitized lymphocyte.

Polypeptide vaccine: in order to enhance the immunogenicity of the polypeptide to stimulate the organism to produce the specific antibody or the sensitized lymphocyte, a polypeptide antigen is usually administered together with an adjuvant. The commonly used adjuvants include an aluminum hydroxide adjuvant, Corynebacterium parvum, lipopolysaccharide, cytokines, or alum. Freund's complete adjuvant and Freund's incomplete adjuvant are the most common adjuvants in animal immunization.

A polypeptide chip technology is a detection technology based on a polypeptide chip, which uses the contact between a wide variety of polypeptides on the polypeptide chip and a sample, then uses an image acquisition technology to collect characteristic signals on the polypeptide chip (which may specifically be expressed as a fluorescent image carrying the characteristic signals), and then outputs the signal intensity of each characteristic in the chip, that is, detection result data of the polypeptide chip. By means of a sample detection signal outputted based on the detection result data of the polypeptide chip, analysis of an object to be detected in the polypeptide combined sample on the polypeptide chip and the analysis of the sample can be realized.

Motif is a data-based mathematical statistical model in biology, and may typically be a sequence or a structure, which is the sequence prediction of a specific group. For example, a DNA sequence may be defined as a transcription factor binding site. That is to say, the sequence tends to be bound by a transcription factor. For protein, a sequence motif may be defined as a protein sequence belonging to a given protein family. A simple motif may be, for example, a pattern, and the pattern is shared by all members in the group.

An ROC curve refers to a curve reflecting the relationship between sensitivity and specificity. The abscissa (X-axis) is 1-specificity, also called the false positive rate; the closer the X value is to zero, the higher the accuracy. The ordinate (Y-axis) is the sensitivity, also called the true positive rate; the larger the Y value, the better the sensitivity. The curve divides the entire graph into two parts. The area of the part below the curve is called the Area Under Curve (AUC), which is used to indicate prediction accuracy. The higher the AUC value, the higher the prediction accuracy. The closer the curve is to the top left corner (the smaller the X, the larger the Y), the higher the prediction accuracy.
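As a small worked illustration of the definition above, the following sketch builds the ROC points from labelled scores and integrates them with the trapezoidal rule. The input format (1 for a positive sample, 0 for a negative sample, with a higher score meaning "more likely positive") and the function names are assumptions made for this example; both classes must be present.

```python
def roc_points(labels, scores):
    """Return (false positive rate, true positive rate) points obtained by
    sweeping the decision threshold from the highest score to the lowest."""
    positives = sum(labels)
    negatives = len(labels) - positives
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(ranked):
        threshold = ranked[i][0]
        # Samples sharing the same score are processed together.
        while i < len(ranked) and ranked[i][0] == threshold:
            if ranked[i][1]:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((fp / negatives, tp / positives))
    return points


def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))
```

With labels `[1, 1, 0, 1, 0, 0]` and scores `[0.9, 0.8, 0.7, 0.6, 0.4, 0.2]`, eight of the nine positive-negative pairs are ranked correctly, and the computed AUC is 8/9.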

As mentioned in the Background, an emerging coronavirus, for example, SARS-CoV-2, spreads rapidly around the world due to its high infectivity. In addition, there is currently no target-specific medicine, so that obtaining a corresponding vaccine as soon as possible is the key to preventing and controlling the deterioration and subsequent recurrence of the epidemic situation. Therefore, in this application, relevant research is carried out from the perspective of vaccine development, and the technical solution of this application is proposed based on the research results. This application starts with the search for novel coronavirus-specific antigen epitopes. Based on an existing antigen epitope screening method combined with the unique polypeptide chip technology, a batch of coronavirus family protein-related antigen epitopes is screened, some of which are novel coronavirus-specific antigen epitopes. Based on the polypeptide sequences corresponding to these epitopes, related products such as polypeptide antigens, detection kits, polypeptide antibodies, polypeptide vaccines and recombinant vaccines can be developed, as can related products such as genetic vaccines or recombinant protein vaccines that are further developed by using these polypeptide sequences. Therefore, more ideas and means are provided for the prevention and control of coronavirus-related diseases and/or COVID-19.

A preferred embodiment provides a polypeptide. The polypeptide is selected from any one of peptide fragments shown in SEQ ID NO:1 to SEQ ID NO:154 in Table 1.

A preferred embodiment provides an antigen epitope. The antigen epitope includes any one or more of SEQ ID NO:1 to SEQ ID NO:154 in Table 1.

TABLE 1

SEQ ID NO: | Polypeptide | Number of residues | Molecular weight | Isoelectric point | Average hydrophobicity | Name of source protein | Serial number of source protein
1 | YTNDKACPL | 9 | 1024.1483 | 5.8279 | -0.7111 | pp1ab | YP_009724389.1
2 | RGGSYTNDKAC | 11 | 1171.2411 | 8.1973 | -1.3364 | pp1ab | YP_009724389.1
3 | SVYAWNRKR | 9 | 1179.3309 | 11.0001 | -1.4889 | Surface glycoprotein | YP_009724390.1
4 | ALDPLSETKCT | 11 | 1177.3253 | 4.3703 | -0.2545 | Surface glycoprotein | YP_009724390.1
5 | GRLQSLQTY | 9 | 1065.1803 | 8.7476 | -0.7889 | Surface glycoprotein | YP_009724390.1
6 | KVFRSSVLHSTQ | 12 | 1388.571 | 11.0008 | -0.2667 | Surface glycoprotein | YP_009724390.1
7 | GVYYPDKVFR | 10 | 1243.4094 | 8.4966 | -0.5300 | Surface glycoprotein | YP_009724390.1
8 | KRISNCVADY | 10 | 1168.3232 | 8.1973 | -0.4500 | Surface glycoprotein | YP_009724390.1
9 | NSVAYSNNS | 9 | 954.9371 | 5.5244 | -0.9111 | Surface glycoprotein | YP_009724390.1
10 | ECVLGQSKR | 9 | 1019.1766 | 8.3201 | -0.6778 | Surface glycoprotein | YP_009724390.1
11 | DYNYKLPDD | 9 | 1142.1716 | 4.1697 | -2.0333 | Surface glycoprotein | YP_009724390.1
12 | KEIDRLNEV | 9 | 1115.2375 | 4.6791 | -1.1000 | Surface glycoprotein | YP_009724390.1
13 | EVFAQVKQIY | 10 | 1224.4045 | 6.0995 | 0.1800 | Surface glycoprotein | YP_009724390.1
14 | LPFNDGVYF | 9 | 1071.1812 | 3.7999 | 0.3667 | Surface glycoprotein | YP_009724390.1
15 | NLDSKVGGNYNY | 12 | 1343.3977 | 5.8343 | -1.1750 | Surface glycoprotein | YP_009724390.1
16 | MADSNGTIT | 9 | 908.9732 | 3.7994 | -0.1556 | Membrane glycoprotein | YP_009724393.1
17 | FHPLADNKF | 9 | 1088.2152 | 6.7436 | -0.5000 | ORF7a protein | YP_009724395.1
18 | YEGNSPFH | 8 | 949.962 | 5.2402 | -1.4375 | ORF7a protein | YP_009724395.1
19 | ALNTPKDH | 8 | 894.9715 | 6.7883 | -1.3500 | Nucleocapsid phosphoprotein | YP_009724397.2
20 | KLDDKDPNFK | 10 | 1219.3436 | 6.0385 | -2.0700 | Nucleocapsid phosphoprotein | YP_009724397.2
21 | YGANKDGI | 8 | 836.8889 | 5.8349 | -0.8375 | Nucleocapsid phosphoprotein | YP_009724397.2
22 | MEVTPSGTWLTY | 12 | 1384.5525 | 3.9988 | -0.0583 | Nucleocapsid phosphoprotein | YP_009724397.2
23 | HGKEDLKF | 8 | 973.0833 | 6.7512 | -1.4750 | Nucleocapsid phosphoprotein | YP_009724397.2
24 | KKPASRELKVTF | 12 | 1403.6686 | 10.2897 | -0.8500 | pp1ab | YP_009724389.1
25 | YYKKDNSYF | 9 | 1227.3206 | 8.3788 | -1.8556 | pp1ab | YP_009724389.1
26 | NVAKSEFDRDAA | 12 | 1322.3805 | 4.5582 | -0.9000 | pp1ab | YP_009724389.1
27 | VNKGEDIQLLKS | 12 | 1343.5255 | 6.0395 | -0.5583 | pp1ab | YP_009724389.1
28 | ERSEKSYEL | 9 | 1140.2007 | 4.7864 | -2.0000 | pp1ab | YP_009724389.1
29 | LQDLKWARFPKS | 12 | 1488.7315 | 9.9943 | -0.8667 | pp1ab | YP_009724389.1
30 | ETSNSFDVLKSE | 12 | 1355.4038 | 4.4267 | -0.8500 | pp1ab | YP_009724389.1
31 | DNQDLNGNWY | 10 | 1238.2191 | 3.5637 | -1.9800 | pp1ab | YP_009724389.1
32 | KLDNYYKKDNSY | 12 | 1550.6667 | 8.3362 | -2.2167 | pp1ab | YP_009724389.1
33 | DSFKEELDKY | 10 | 1273.3446 | 4.3167 | -1.7300 | Surface glycoprotein | YP_009724390.1
34 | VYDPLQPEL | 9 | 1073.1957 | 3.6660 | -0.3556 | Surface glycoprotein | YP_009724390.1
35 | RLFRKSNLK | 9 | 1161.4002 | 12.0165 | -1.1889 | Surface glycoprotein | YP_009724390.1
36 | SNLKPFER | 8 | 990.1138 | 8.4636 | -1.4000 | Surface glycoprotein | YP_009724390.1
37 | PLQPELDSFKEE | 12 | 1431.5429 | 4.2446 | -1.2500 | Surface glycoprotein | YP_009724390.1
38 | QELGKYEQY | 9 | 1157.2293 | 4.5314 | -1.9000 | Surface glycoprotein | YP_009724390.1
39 | GTITVEELKK | 10 | 1117.2932 | 6.1425 | -0.4100 | Membrane glycoprotein | YP_009724393.1
40 | IRGGDGKMKD | 10 | 1076.2279 | 8.5901 | -1.4100 | Nucleocapsid phosphoprotein | YP_009724397.2
41 | SLPGVFCGV | 9 | 878.0467 | 5.2381 | 1.5889 | pp1ab | YP_009724389.1
42 | FLAHIQWMV | 9 | 1144.3876 | 6.7411 | 1.2667 | pp1ab | YP_009724389.1
43 | QLFFSYFAV | 9 | 1121.2829 | 5.5244 | 1.4000 | pp1ab | YP_009724389.1
44 | KLRSDVLLPL | 10 | 1153.4146 | 8.7477 | 0.5100 | pp1ab | YP_009724389.1
45 | LVAEWFLAYI | 10 | 1224.4458 | 3.9997 | 1.7000 | pp1ab | YP_009724389.1
46 | VMVELVAEL | 9 | 1002.2254 | 3.7950 | 1.8778 | pp1ab | YP_009724389.1
47 | ILSPLYAFA | 9 | 994.1834 | 5.5244 | 1.6444 | pp1ab | YP_009724389.1
48 | GLNDNLLEI | 9 | 1000.1036 | 3.6660 | 0.1667 | pp1ab | YP_009724389.1
49 | YLFDESGEFKL | 11 | 1347.4676 | 4.1374 | -0.3364 | pp1ab | YP_009724389.1
50 | KLVNKFLAL | 9 | 1045.318 | 10.0027 | 0.9889 | pp1ab | YP_009724389.1
51 | FLKKDAPYI | 9 | 1094.3026 | 8.4975 | -0.1444 | pp1ab | YP_009724389.1
52 | FVSNGTHWFV | 10 | 1193.3092 | 6.7411 | 0.4500 | Surface glycoprotein | YP_009724390.1
53 | VDEPEEHV | 8 | 952.9612 | 3.9976 | -1.3000 | ORF3a protein | YP_009724391.1
54 | KWESGVKD | 8 | 948.0308 | 6.1922 | -1.5875 | ORF3a protein | YP_009724391.1
55 | TDTGVEHV | 8 | 856.8771 | 4.3513 | -0.4500 | ORF3a protein | YP_009724391.1
56 | LLYDANYFL | 9 | 1131.2762 | 3.7999 | 0.7111 | ORF3a protein | YP_009724391.1
57 | GYTEKWES | 8 | 999.0312 | 4.5314 | -1.8750 | ORF3a protein | YP_009724391.1
58 | TDHSSSSD | 8 | 834.7425 | 4.5102 | -1.7625 | Membrane glycoprotein | YP_009724393.1
59 | DHSSSSDNI | 9 | 960.8988 | 4.1967 | -1.3778 | Membrane glycoprotein | YP_009724393.1
60 | NTDHSSSS | 8 | 833.7577 | 5.0767 | -1.7625 | Membrane glycoprotein | YP_009724393.1
61 | LNTDHSSS | 8 | 859.838 | 5.0767 | -1.1875 | Membrane glycoprotein | YP_009724393.1
62 | CPDGVKHV | 8 | 853.9857 | 6.7344 | -0.2125 | ORF7a protein | YP_009724395.1
63 | CQEPKLGS | 8 | 860.9751 | 5.9943 | -0.9250 | ORF8 protein | YP_009724396.1
64 | RNPANNAA | 8 | 826.8577 | 9.7501 | -1.4000 | Nucleocapsid phosphoprotein | YP_009724397.2
65 | EERLKLFDRYF | 11 | 1515.7104 | 6.2791 | -1.0455 | pp1ab | YP_009724389.1
66 | PGTAVLRQWLP | 11 | 1237.4498 | 10.1800 | 0.0364 | pp1ab | YP_009724389.1
67 | CPAVAKHDFFK | 11 | 1262.4791 | 8.2065 | -0.0182 | pp1ab | YP_009724389.1
68 | LQDLKWARFPK | 11 | 1401.6542 | 9.9943 | -0.8727 | pp1ab | YP_009724389.1
69 | LLTKSSEYKGP | 11 | 1222.3873 | 8.4976 | -0.8455 | pp1ab | YP_009724389.1
70 | VLTLDNQDLNG | 11 | 1201.2835 | 3.5637 | -0.2727 | pp1ab | YP_009724389.1
71 | YMRSLKVPATV | 11 | 1264.5364 | 9.9943 | 0.2818 | pp1ab | YP_009724389.1
72 | SVEEVLSEARQ | 11 | 1246.3243 | 4.2519 | -0.5545 | pp1ab | YP_009724389.1
73 | KVDGVDVELFE | 11 | 1249.3661 | 4.1564 | 0.0818 | pp1ab | YP_009724389.1
74 | LTVFFDGRVDG | 11 | 1225.3495 | 4.2078 | 0.4364 | pp1ab | YP_009724389.1
75 | EYADVFHLYL | 10 | 1269.4003 | 4.3533 | 0.3600 | pp1ab | YP_009724389.1
76 | HECFVKRVDWT | 11 | 1419.6065 | 6.7429 | -0.5909 | pp1ab | YP_009724389.1
77 | STSHKLVLSVN | 11 | 1184.3425 | 8.4894 | 0.2091 | pp1ab | YP_009724389.1
78 | KDYLASGGQPI | 11 | 1148.2656 | 5.8349 | -0.4818 | pp1ab | YP_009724389.1
79 | AVLQSGFRK | 9 | 1005.1714 | 11.0010 | -0.0556 | pp1ab | YP_009724389.1
80 | MASLVLARKHT | 11 | 1226.4918 | 11.0003 | 0.3818 | pp1ab | YP_009724389.1
81 | MQNCVLKLKVD | 11 | 1290.5952 | 7.9545 | 0.1909 | pp1ab | YP_009724389.1
82 | IERYKLEGYAF | 11 | 1388.5659 | 6.1418 | -0.5000 | pp1ab | YP_009724389.1
83 | TILGSALLEDE | 11 | 1160.2715 | 3.9129 | 0.4818 | pp1ab | YP_009724389.1
84 | KLDNYYKKDNS | 11 | 1387.4935 | 8.3788 | -2.3000 | pp1ab | YP_009724389.1
85 | QLSLPVLQVRD | 11 | 1267.4741 | 6.0877 | 0.2182 | pp1ab | YP_009724389.1
86 | AWYTERSEKSY | 11 | 1419.494 | 6.1859 | -1.7636 | pp1ab | YP_009724389.1
87 | YEKLKPVLDWL | 11 | 1403.6634 | 6.0683 | -0.2727 | pp1ab | YP_009724389.1
88 | QADVEWKFYDA | 11 | 1371.4493 | 4.0280 | -0.8636 | pp1ab | YP_009724389.1
89 | NEYRLYLDAY | 10 | 1319.4177 | 4.3703 | -0.9500 | pp1ab | YP_009724389.1
90 | INVIVFDGKSK | 11 | 1219.4295 | 8.5910 | 0.3818 | pp1ab | YP_009724389.1
91 | KKPASRELKVT | 11 | 1256.4948 | 10.2897 | -1.1818 | pp1ab | YP_009724389.1
92 | KCVPQADVEW | 10 | 1174.3261 | 4.3702 | -0.4200 | pp1ab | YP_009724389.1
93 | TDVTQLYLGG | 10 | 1066.1617 | 3.7991 | 0.1300 | pp1ab | YP_009724389.1
94 | NNDYYRSLPGV | 11 | 1297.3724 | 5.8349 | -1.1273 | pp1ab | YP_009724389.1
95 | TCTERLKLFAA | 11 | 1252.4828 | 7.8871 | 0.2909 | pp1ab | YP_009724389.1
96 | NKGEDIQLLKS | 11 | 1244.3945 | 6.0690 | -0.9909 | pp1ab | YP_009724389.1
97 | ELWAKRNIKPV | 11 | 1353.6114 | 9.9959 | -0.6818 | pp1ab | YP_009724389.1
98 | EEAKTVLKKC | 10 | 1148.3735 | 8.2707 | -0.7100 | pp1ab | YP_009724389.1
99 | SFSGYLKLTDN | 11 | 1244.3496 | 5.5526 | -0.4091 | pp1ab | YP_009724389.1
100 | NVNRFNVAITR | 11 | 1303.4697 | 12.0001 | -0.2455 | pp1ab | YP_009724389.1
101 | KYFSGAMDTT | 10 | 1120.2323 | 5.8349 | -0.4800 | pp1ab | YP_009724389.1
102 | DDYFNKKDWYD | 11 | 1508.5422 | 4.3300 | -2.3636 | pp1ab | YP_009724389.1
103 | FKESPFELEDF | 11 | 1387.4885 | 4.0020 | -0.7364 | pp1ab | YP_009724389.1
104 | FAQDGNAAIS | 10 | 993.0282 | 3.7999 | 0.1000 | pp1ab | YP_009724389.1
105 | MSYLFQHANLD | 11 | 1338.4872 | 5.3151 | -0.1545 | pp1ab | YP_009724389.1
106 | AQNSVRVLQKA | 11 | 1213.387 | 11.0010 | -0.3545 | pp1ab | YP_009724389.1
107 | VDAAKAYKDYL | 11 | 1256.4034 | 5.9289 | -0.3636 | pp1ab | YP_009724389.1
108 | KGFCDLKGKYV | 11 | 1257.5007 | 9.1129 | -0.3636 | pp1ab | YP_009724389.1
109 | EDIQLLKSAY | 10 | 1179.3194 | 4.3704 | -0.2600 | pp1ab | YP_009724389.1
110 | DPAQLPAPRTL | 11 | 1178.3381 | 5.8364 | -0.5273 | pp1ab | YP_009724389.1
111 | NKHAFHTPAF | 10 | 1169.2913 | 8.7642 | -0.6900 | pp1ab | YP_009724389.1
112 | NRYLALYNKYK | 11 | 1445.6635 | 9.8232 | -1.2545 | pp1ab | YP_009724389.1
113 | NVAKSEFDRDA | 11 | 1251.3026 | 4.5582 | -1.1455 | pp1ab | YP_009724389.1
114 | KLNVGDYFV | 9 | 1054.1955 | 5.8349 | 0.2667 | pp1ab | YP_009724389.1
115 | THLSVDTKF | 9 | 1047.1618 | 6.4061 | -0.2222 | pp1ab | YP_009724389.1
116 | NGQVFGLYKNT | 11 | 1240.3641 | 8.5909 | -0.5818 | pp1ab | YP_009724389.1
117 | VWKSYVHVVD | 10 | 1231.3987 | 6.7227 | 0.3200 | pp1ab | YP_009724389.1
118 | HPNPKGFCDLK | 11 | 1255.4452 | 8.2065 | -1.1364 | pp1ab | YP_009724389.1
119 | YRKVLLRKNGN | 11 | 1360.6072 | 11.0972 | -1.2455 | pp1ab | YP_009724389.1
120 | ATVRLQAGN | 9 | 929.0324 | 9.7950 | -0.1111 | pp1ab | YP_009724389.1
121 | ETSNSFDVLKS | 11 | 1226.2898 | 4.3704 | -0.6091 | pp1ab | YP_009724389.1
122 | LLTKGTLEPEY | 11 | 1263.4359 | 4.5314 | -0.3818 | pp1ab | YP_009724389.1
123 | TVREVLSDR | 9 | 1074.1889 | 5.7352 | -0.5889 | pp1ab | YP_009724389.1
124 | QSRNLQEFKPR | 11 | 1402.5579 | 10.8350 | -2.0636 | pp1ab | YP_009724389.1
125 | DVVECLKLSHQ | 11 | 1270.4549 | 5.3203 | 0.0091 | pp1ab | YP_009724389.1
126 | RVEKKKLDGFM | 11 | 1350.629 | 9.6998 | -0.9909 | pp1ab | YP_009724389.1
127 | KLFDRYFKYW | 10 | 1465.6945 | 9.5263 | -0.9900 | pp1ab | YP_009724389.1
128 | DAQSFLNRVCG | 11 | 1209.332 | 5.8294 | -0.1000 | pp1ab | YP_009724389.1
129 | TCFANKHADFD | 11 | 1268.3545 | 5.3603 | -0.6000 | pp1ab | YP_009724389.1
130 | HPNQEYADVF | 10 | 1219.2589 | 4.3531 | -1.1300 | pp1ab | YP_009724389.1
131 | YKQARSEDKRA | 11 | 1351.4682 | 9.6966 | -2.3455 | pp1ab | YP_009724389.1
132 | TANVNALLSTD | 11 | 1118.195 | 4.2972 | 0.2455 | pp1ab | YP_009724389.1
133 | SCKRVLNVVCK | 11 | 1248.5619 | 9.4997 | 0.4364 | pp1ab | YP_009724389.1
134 | RHINAQVAKSH | 11 | 1260.4054 | 11.0009 | -0.9364 | pp1ab | YP_009724389.1
135 | KSAGFPFNKW | 10 | 1181.3417 | 10.0027 | -0.7600 | pp1ab | YP_009724389.1
136 | IMSDRDLYDKL | 11 | 1368.5548 | 4.4290 | -0.6364 | pp1ab | YP_009724389.1
137 | KLRSDVLLPL | 10 | 1153.4146 | 8.7477 | 0.5100 | pp1ab | YP_009724389.1
138 | CLYRNRDVDTD | 11 | 1369.4601 | 4.6762 | -1.3182 | pp1ab | YP_009724389.1
139 | VGQQDGSEDNQ | 11 | 1176.1052 | 3.4924 | -1.9909 | pp1ab | YP_009724389.1
140 | IVNNWLKQLIK | 11 | 1368.6656 | 10.0027 | 0.1455 | pp1ab | YP_009724389.1
141 | ALLTKSSEYK | 10 | 1139.2987 | 8.5410 | -0.5500 | pp1ab | YP_009724389.1
142 | PLQPELDSFKE | 11 | 1302.4289 | 4.4269 | -1.0455 | Surface glycoprotein | YP_009724390.1
143 | TSNQVAVLYQ | 10 | 1122.2282 | 5.1849 | 0.0700 | Surface glycoprotein | YP_009724390.1
144 | LIDLQELGKY | 10 | 1191.3731 | 4.3703 | -0.0200 | Surface glycoprotein | YP_009724390.1
145 | PFERDIS | 7 | 862.9263 | 4.3708 | -0.9429 | Surface glycoprotein | YP_009724390.1
146 | AHFPREGVFVS | 11 | 1245.3856 | 6.7944 | 0.1636 | Surface glycoprotein | YP_009724390.1
147 | TECSNLLLQYG | 11 | 1240.3825 | 3.9984 | 0.0182 | Surface glycoprotein | YP_009724390.1
148 | KIITLKKRWQL | 11 | 1426.7913 | 11.2639 | -0.4273 | ORF3a protein | YP_009724391.1
149 | TLSYYKLGASQ | 11 | 1230.3661 | 8.1651 | -0.3000 | Membrane glycoprotein | YP_009724393.1
150 | EELKKLLEQW | 10 | 1315.5138 | 4.7864 | -1.1300 | Membrane glycoprotein | YP_009724393.1
151 | CPDGVKHVYQ | 10 | 1145.2881 | 6.7336 | -0.6500 | ORF7a protein | YP_009724395.1
152 | LFIRQEEVQEL | 11 | 1403.579 | 4.2526 | -0.2636 | ORF7a protein | YP_009724395.1
153 | KMKDLSPRWY | 10 | 1323.5622 | 9.6998 | -1.4700 | Nucleocapsid phosphoprotein | YP_009724397.2
154 | DQVILLNKHID | 11 | 1307.4949 | 5.3918 | -0.0273 | Nucleocapsid phosphoprotein | YP_009724397.2
In a more preferred embodiment, the above antigen epitope includes any one or more of SEQ ID NO:25, SEQ ID NO:28, SEQ ID NO:31, SEQ ID NO:35 and SEQ ID NO:36, and SEQ ID NO:41 to SEQ ID NO:154. The polypeptides shown in SEQ ID NO:25, SEQ ID NO:28, SEQ ID NO:31, and SEQ ID NO:35 are obtained by screening the polypeptide chip at least twice, and therefore have higher potential application value as antigen epitopes.

The above polypeptides act as antigen epitopes specifically recognized by B cells or T cells, and may be prepared into polypeptide vaccines to stimulate the organism to produce specific antibodies or sensitized lymphocytes (immunogenicity). When the organism is immunized, an adjuvant is often added in order to better stimulate an immune response: the adjuvant stimulates the organism to produce helper T cells, which further induce a B cell immune response. Of course, the polypeptides may also be used on their own to stimulate the immunized organism to produce the immune response.

The above polypeptide may also be prepared into an antigen, to stimulate the organism to produce antibodies. Where the polypeptide alone does not achieve an adequate immune response (that is, its immunogenicity is very low), the use of a carrier protein with many antigen epitopes facilitates the stimulation of helper T cells, which further induce the B cell immune response.

Therefore, a preferred embodiment further provides a polypeptide-carrier protein conjugate. The polypeptide-carrier protein conjugate includes any one of the above polypeptides and the carrier protein coupled to the polypeptide. The polypeptide-carrier protein conjugate generally acts as the antigen to detect the antibody, or acts as the antigen to prepare the antibody by immunizing an animal. Since the polypeptide can specifically identify the coronavirus, especially a SARS-CoV-2 virus, the polypeptide-carrier protein conjugate can specifically identify the antibody of the coronavirus, especially the antibody of the SARS-CoV-2 virus.

According to the preparation requirements of the polypeptide-carrier protein conjugate, a specific and appropriate carrier protein may be selected to form the conjugate. The carrier protein in this application includes, but is not limited to, Bovine Serum Albumin (BSA), Ovalbumin (OVA), Keyhole Limpet Hemocyanin (KLH), or Casein (CS). Depending on the amino acid sequence composition of the different polypeptides, the polypeptides may need to be coupled to the carrier protein through a linker sequence (also called a connector or a linker) in order to facilitate the coupling. In this application, the linker sequence is preferably CGSG.

Depending on the physical and chemical properties of the amino acids of the polypeptide, the carrier protein used, and the coupling method, the number of polypeptides that can be coupled to each carrier protein differs. By comprehensively considering the efficiency of coupling and the ability of antibody recognition and binding, preferably, the number of the polypeptides coupled to each carrier protein is 2-50, and more preferably, 3-45, 5-40, 5-35, 5-30, 8-30, 10-30, 12-30, or 15-30; or further preferably, the number is any one of 6-36, 8-32, 10-28, 10-26, 10-24, 10-22, 10-20, 10-18, 10-16, or 10-15.

A preferred embodiment further provides an antigen. The antigen includes a polypeptide-carrier protein conjugate or a composition of a plurality of different polypeptide-carrier protein conjugates. The polypeptide-carrier protein conjugate is any one of the above polypeptide-carrier protein conjugates.

It is to be noted that, in the above polypeptide-carrier protein conjugate, the polypeptides coupled to the carrier protein are polypeptides having the same amino acid sequence. That is to say, one carrier protein is coupled to identical polypeptides, so that the polypeptide-carrier protein conjugate has a single antigen epitope when acting as the antigen. In certain embodiments, when acting as the antigen to detect whether there is a virus antibody in serum, the antigen may be an antigen having a single antigen epitope, or may be an antigen having a plurality of antigen epitopes. When polypeptide-carrier protein conjugates coupled to different polypeptide sequences act as the antigen in the form of a composition, a plurality of antigen epitopes is provided. For example, if an A-BSA conjugate is obtained by coupling the polypeptide of a sequence A to the BSA, a B-BSA conjugate is obtained by coupling the polypeptide of a sequence B to the BSA, and a C-OVA conjugate is obtained by coupling the polypeptide of a sequence C to the OVA, the antigen including the three polypeptide-carrier protein conjugates has the A, B and C antigen epitopes. If the antigen only includes one of the three polypeptide-carrier protein conjugates, the antigen only has one antigen epitope.

A preferred embodiment further provides a detection kit for a coronavirus antibody. The kit includes any one of the above antigens. The antigen epitopes of the antigens are from any one of the above polypeptides. Known coronavirus protein families all have the above polypeptides. Therefore, the kit including the antigen can accurately and specifically identify the coronavirus and diagnose infected patients, especially patients infected with SARS-CoV-2.

The kit may be prepared into detection kits of a plurality of different types according to specific requirements. However, for ease of detection and determination of detection results, most of the polypeptide antigens in the kit are pre-coated antigens. Preferably, the pre-coated antigen is coated on a solid phase carrier, and the specific pre-coated solid phase carrier is rationally designed according to requirements. More preferably, the solid phase carrier includes an ELISA plate (which is mostly a polystyrene material), a membrane carrier or a microsphere. Further preferably, the membrane carrier includes a nitrocellulose membrane (which is most widely used), a glass cellulose membrane or a nylon membrane. Further preferably, the membrane carrier is also coated with a positive control. The polypeptide-carrier protein conjugate and the positive control are successively arranged on the nitrocellulose membrane according to a detection order.

Depending on the specific detection method of the kit, the specific supporting reagents in the kit differ accordingly, but may be combined according to the preparation methods of known kits. Preferably, the above kit also includes one of the following: (1) an enzyme-labeled secondary antibody, more preferably, the enzyme-labeled secondary antibody being an HRP-labeled secondary antibody (corresponding to an ELISA detection kit); (2) a colloidal gold bonding pad, coated with colloidal gold-labeled specific conjugates of the polypeptide-carrier protein conjugate and of the positive control (corresponding to an immune colloidal gold detection kit); and (3) a labeling pad, coated with fluorescently labeled microspheres, the microspheres being loaded with the specific conjugate of the positive control (corresponding to an immunofluorescence detection kit).

The immune colloidal gold detection kit and the immunofluorescence detection kit are relatively convenient for detection, since they only need a C line for the positive control and a T line for the detection sample. As long as the positive control pre-coated at the C line can bind the specific conjugate carrying a detection label during chromatography of the serum sample to be detected, the specific antigen or antibody used as the positive control is not specifically limited. Preferably, the positive control is selected from murine immunoglobulin, human immunoglobulin, ovine immunoglobulin or rabbit immunoglobulin; and accordingly, the specific conjugate of the positive control is selected from anti-murine immunoglobulin, anti-human immunoglobulin, anti-ovine immunoglobulin or anti-rabbit immunoglobulin.

According to different immune objects, the anti-murine immunoglobulin may be the anti-murine immunoglobulin of goats or the anti-murine immunoglobulin of rabbits, or the anti-murine immunoglobulin of other immune animals. Likewise, according to different immune animals, the anti-human immunoglobulin, anti-ovine immunoglobulin or anti-rabbit immunoglobulin may also be immunoglobulin from different species. The immunoglobulin may be any one of IgM, IgG, IgA, IgD, or IgE. These anti-immunoglobulin antibodies may be monoclonal antibodies or polyclonal antibodies.

In the kit, the specification of the ELISA plate used differs according to the number of samples to be detected, and may be rationally selected from 12-well to 384-well ELISA plates. In the pre-coated ELISA plate, the coating amount of the polypeptide-carrier protein conjugate in each well also differs according to the different antigen epitopes in different polypeptide-carrier protein conjugates, or according to different detection objects at different onset stages. In certain embodiments of this application, the coating amount of the polypeptide-carrier protein conjugate in each well is 0.1-32 μg; preferably, 0.2-30 μg, 0.3-30 μg, 0.4-28 μg, 0.6-25 μg, 0.6-24 μg, 0.7-24 μg, 0.7-22 μg, or 0.7-20 μg; more preferably, 0.7-19 μg, 0.7-18 μg, 0.7-17 μg, 0.7-16 μg, 0.7-15 μg, 0.7-14 μg, 0.7-13 μg, or 0.7-12 μg; and further preferably, 0.8-19 μg, 0.8-18 μg, 0.8-17 μg, 0.8-16 μg, 0.8-15 μg, 0.8-14 μg, 0.8-13 μg, 0.8-12 μg, 0.8-11 μg, 0.8-10 μg, 0.8-9 μg, 0.8-8 μg, 0.8-7 μg, 0.8-6 μg, 0.8-5 μg, 0.8-4 μg, 0.8-3 μg, 0.8-2 μg, 0.8-1.8 μg, 0.8-1.7 μg, 0.8-1.6 μg, 0.8-1.5 μg, 0.8-1.4 μg, or 0.8-1.2 μg.

Similarly, the coating amount of the polypeptide-carrier protein conjugate on the membrane carrier (for example, the nitrocellulose membrane) is also different, preferably 0.8-8 μg/cm, and more preferably 0.8-7 μg/cm, 0.8-6 μg/cm, 0.8-5 μg/cm, 0.8-4 μg/cm, 0.8-3 μg/cm, 0.8-2 μg/cm, 0.8-1.8 μg/cm, 0.8-1.7 μg/cm, 0.8-1.6 μg/cm, 0.8-1.5 μg/cm, 0.8-1.4 μg/cm, or 0.8-1.2 μg/cm.

A preferred embodiment further provides applications of the polypeptide or the antigen epitope in the preparation of drugs for treating related diseases caused by a coronavirus. In some preferred embodiments, the coronavirus is SARS-CoV-2. For example, the polypeptide-carrier protein conjugate including these polypeptides or antigen epitopes is used as the antigen to immunize an animal, so as to prepare a specific antibody. Alternatively, a related polypeptide vaccine may be prepared by means of chemical synthesis according to the related antigen epitope provided in this application. Alternatively, a nucleic acid encoding the polypeptide is obtained by using a recombinant gene, so as to obtain a genetic vaccine. Therefore, the above drug may be an antibody or a vaccine. The antibody may be a monoclonal antibody or a polyclonal antibody. The vaccine may be a polypeptide vaccine or a genetic vaccine.

Correspondingly, a preferred embodiment further provides the above drug. The drug may be an antibody or a vaccine. The antibody is obtained by immunizing an animal with the above antigen. The vaccine is a polypeptide vaccine or a genetic vaccine. The polypeptide vaccine includes any one or more of the polypeptides in Table 1. The genetic vaccine includes nucleic acids encoding any one or more of the polypeptides in Table 1. Preferably, the polypeptides are selected from any one or more of SEQ ID NO:1 to SEQ ID NO:40; and more preferably, the polypeptides are selected from any one or more of SEQ ID NO:25, SEQ ID NO:28, SEQ ID NO:31, SEQ ID NO:35 and SEQ ID NO:36. These 5 polypeptides are obtained by independently screening a polypeptide chip at least twice, so that they have a higher probability of being usable as vaccines.

It is to be noted that, the antibody is obtained by using the polypeptide-carrier protein conjugate as the antigen to immunize the animal. Commonly used immune animals include mammals such as rats, mice, goats or rabbits. According to different types of the polypeptide-carrier protein conjugates included in the antigen, the obtained antibody may be a monoclonal antibody or a polyclonal antibody. The vaccine may be a polypeptide vaccine. The polypeptide vaccine may be obtained by means of chemical synthesis according to a polypeptide sequence, or may be obtained through enzymatic digestion and purification after in vitro recombinant expression by means of genetic engineering. The genetic vaccine is designed by means of genetic engineering to include a nucleic acid encoding a target polypeptide, to cause the nucleic acid to express so as to produce the polypeptide with an antigen epitope effect.

A preferred embodiment further provides a method for preventing or treating pneumonia caused by a coronavirus. The prevention method includes giving a subject a prophylactically effective amount of an anti-coronavirus drug, the drug being the vaccine in the above drug. The treatment method includes giving the subject a therapeutically effective amount of the anti-coronavirus drug, the drug being the antibody in the above drug.

Preferably, the coronavirus is SARS-CoV-2.

In this application, in order to further enhance an immune response produced due to the stimulation of the polypeptide to an organism, a preferred embodiment provides a polypeptide composition. The polypeptide composition includes at least two of peptide fragments shown in SEQ ID NO:1 to SEQ ID NO:154 in Table 1.

In certain preferred embodiments, the polypeptide composition includes at least any one of the peptide fragments shown in SEQ ID NO:1 to SEQ ID NO:40. Preferably, the polypeptide composition includes at least any one of SEQ ID NO:25, SEQ ID NO:28, SEQ ID NO:31, SEQ ID NO:35, or SEQ ID NO:36.

According to different research and development requirements such as vaccine or antibody preparation, the polypeptides may be physically mixed to form a composition, or may be connected by chemical bonds to form a composition in the form of long-chain polypeptides. The specific sequences, number and sequential order of the connected peptide fragments may be rationally adjusted according to actual requirements. Preferably, two peptide fragments are connected. The connection may specifically be implemented by using a linker arm (which may be, for example, glycine or lysine).
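As a minimal sketch of the long-chain form described above, the following Python example joins peptide fragments in a given order with a linker arm placed between neighbouring fragments. The function name is hypothetical, and placing a single glycine (or lysine) residue between the fragments is an assumption made for illustration; the application leaves the exact design to actual requirements.

```python
VALID_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY")


def join_fragments(fragments, linker="G"):
    """Concatenate peptide fragments (one-letter code) in the given order,
    inserting the linker arm between each pair of neighbouring fragments."""
    for seq in list(fragments) + [linker]:
        unknown = set(seq) - VALID_RESIDUES
        if unknown:
            raise ValueError(f"non-standard residue(s) {sorted(unknown)} in {seq!r}")
    return linker.join(fragments)


# Two fragments from Table 1 (SEQ ID NO:25 and SEQ ID NO:28),
# joined with a glycine linker arm and, alternatively, a lysine linker arm.
chain_gly = join_fragments(["YYKKDNSYF", "ERSEKSYEL"])
chain_lys = join_fragments(["YYKKDNSYF", "ERSEKSYEL"], linker="K")
```

The order of the fragments in the list determines their order in the chain, which makes it straightforward to enumerate alternative arrangements.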

In some preferred embodiments, the polypeptide composition includes one or more peptide fragments in a first peptide fragment set. The first peptide fragment set includes the peptide fragments shown in SEQ ID NO:1-4, 6-8, 11, 13-17, 20-25, 27-30, 32-33, 35-36, and 39-40. The peptide fragments in the first peptide fragment set show stronger sequence specificity to the novel coronavirus. The preparation of a vaccine on the basis of these polypeptides facilitates the obtaining of a vaccine specifically targeting the novel coronavirus.

In some other preferred embodiments, the polypeptide composition includes one or more peptide fragments in a second peptide fragment set. The second peptide fragment set includes the peptide fragments shown in SEQ ID NO:5, 9, 10, 12, 18, 19, 26, 31, 34, 37, and 38. The peptide fragments in the second peptide fragment set show stronger sequence conservation to the coronavirus. The preparation of a vaccine on the basis of these polypeptides facilitates the obtaining of a broad-spectrum vaccine for the coronavirus.

In certain embodiments, the polypeptide composition also includes, in addition to one or more peptide fragments in the first peptide fragment set, one or more peptide fragments in the second peptide fragment set. A vaccine is prepared on the basis of the peptide fragments in the above two sets, to obtain a vaccine with stronger immunogenicity against various coronaviruses.
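The two peptide fragment sets are given as lists of SEQ ID NO ranges. The following Python sketch expands the two lists as written above and compares them with SEQ ID NO:1 to SEQ ID NO:40; the helper name is hypothetical.

```python
def parse_seq_ids(spec):
    """Expand a list such as '1-4, 6-8, and 11' into a set of integers."""
    ids = set()
    for part in spec.replace("and", "").split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            low, high = part.split("-")
            ids.update(range(int(low), int(high) + 1))
        else:
            ids.add(int(part))
    return ids


# The two sets exactly as listed in the text.
FIRST_SET = parse_seq_ids("1-4, 6-8, 11, 13-17, 20-25, 27-30, 32-33, 35-36, and 39-40")
SECOND_SET = parse_seq_ids("5, 9, 10, 12, 18, 19, 26, 31, 34, 37, and 38")

overlap = FIRST_SET & SECOND_SET
covered = FIRST_SET | SECOND_SET
print(len(FIRST_SET), len(SECOND_SET), sorted(overlap), covered == set(range(1, 41)))
```

The first set expands to 29 sequences and the second to 11; the two sets do not overlap and together cover SEQ ID NO:1 to SEQ ID NO:40.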

In some embodiments, the polypeptide composition may also be formed by combining a T cell epitope and a B cell epitope, so that an immune effect can be enhanced. Specifically, whether the above 40 polypeptides are from the T cell epitope or the B cell epitope may be distinguished according to multiple epitope prediction software.

In some embodiments, the polypeptide composition includes the polypeptides derived from a same protein and/or different proteins. More preferably, there are no more than two polypeptides derived from the same protein in the polypeptide composition. Further preferably, the polypeptide composition is selected from one of the following combinations:

Combination 1: SEQ ID NO:28, SEQ ID NO:6, SEQ ID NO:13, and SEQ ID NO:18.

Combination 2: SEQ ID NO:27, SEQ ID NO:14, SEQ ID NO:5 and SEQ ID NO:17.

Combination 3: SEQ ID NO:32, SEQ ID NO:4, SEQ ID NO:10 and SEQ ID NO:23.

Combination 4: SEQ ID NO:25, SEQ ID NO:3, SEQ ID NO:34 and SEQ ID NO:40.

Combination 5: SEQ ID NO:30, SEQ ID NO:8, SEQ ID NO:37 and SEQ ID NO:21.

Combination 6: SEQ ID NO:2, SEQ ID NO:11, SEQ ID NO:33 and SEQ ID NO:19.

Combination 7: SEQ ID NO:1, SEQ ID NO:15, SEQ ID NO:12 and SEQ ID NO:29.

Combination 8: SEQ ID NO:26, SEQ ID NO:35, SEQ ID NO:38 and SEQ ID NO:22.

Combination 9: SEQ ID NO:31, SEQ ID NO:36, SEQ ID NO:16 and SEQ ID NO:20.

Combination 10: SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:39 and SEQ ID NO:24.

Combination 11: SEQ ID NO:29, SEQ ID NO:35, SEQ ID NO:40 and SEQ ID NO:20.

Combination 12: SEQ ID NO:3, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:29, SEQ ID NO:33, and SEQ ID NO:34.
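The number of polypeptides that a combination draws from each source protein can be tallied from the source protein column of Table 1. The Python sketch below does this for Combination 1 and Combination 9; the mapping covers SEQ ID NO:1 to SEQ ID NO:40 as listed in Table 1, and the function names are hypothetical.

```python
from collections import Counter


def build_source_map():
    """Map SEQ ID NO (1-40) to the name of its source protein, per Table 1."""
    groups = {
        "pp1ab": [1, 2] + list(range(24, 33)),
        "Surface glycoprotein": list(range(3, 16)) + list(range(33, 39)),
        "Membrane glycoprotein": [16, 39],
        "ORF7a protein": [17, 18],
        "Nucleocapsid phosphoprotein": list(range(19, 24)) + [40],
    }
    return {seq_id: protein for protein, ids in groups.items() for seq_id in ids}


SOURCE = build_source_map()


def per_protein_counts(combination):
    """Count how many polypeptides of a combination come from each source protein."""
    return Counter(SOURCE[seq_id] for seq_id in combination)


combination_1 = [28, 6, 13, 18]
combination_9 = [31, 36, 16, 20]
counts_1 = per_protein_counts(combination_1)
counts_9 = per_protein_counts(combination_9)
```

Combination 1 draws two polypeptides from the surface glycoprotein and one each from pp1ab and the ORF7a protein, while Combination 9 draws one polypeptide from each of four different proteins.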

In order to more effectively control coronavirus infection in humans, a preferred embodiment of this application provides a polypeptide vaccine. The polypeptide vaccine includes any one or more of the peptide fragments shown in SEQ ID NO:1 to SEQ ID NO:154 in Table 1. From these polypeptides, specific peptide fragments may be rationally selected, according to whether broad-spectrum and/or novel coronavirus-specific peptide fragments are required, to form an effective polypeptide vaccine.

In some preferred embodiments, the polypeptide vaccine includes at least any one of the peptide fragments shown in SEQ ID NO:1 to SEQ ID NO:40. Preferably, the polypeptide vaccine includes at least one of SEQ ID NO:25, SEQ ID NO:28, SEQ ID NO:31, SEQ ID NO:35, or SEQ ID NO:36.

In some preferred embodiments, the polypeptide vaccine includes one or more peptide fragments in the first peptide fragment set. The first peptide fragment set includes the peptide fragments shown in SEQ ID NO: 1-4, 6-8, 11, 13-17, 20-25, 27-30, 32-33, 35-36, and 39-40. The peptide fragments in the first peptide fragment set show stronger sequence specificity to the novel coronavirus.

In some other preferred embodiments, the polypeptide vaccine includes one or more peptide fragments in the second peptide fragment set. The second peptide fragment set includes the peptide fragments shown in SEQ ID NO: 5, 9, 10, 12, 18, 19, 26, 31, 34, 37, and 38. The peptide fragments in the second peptide fragment set show stronger sequence conservation to the coronavirus.

In certain embodiments, the polypeptide vaccine also includes, in addition to one or more peptide fragments in the first peptide fragment set, one or more peptide fragments in the second peptide fragment set. A vaccine is prepared on the basis of the peptide fragments in the above two sets, to obtain a vaccine with stronger immunogenicity against various coronaviruses.

The preparation of a vaccine by using coronavirus broad-spectrum polypeptides facilitates the development of a general vaccine for the coronavirus, so that the different coronavirus infections can be prevented. The vaccine prepared by using the novel coronavirus-specific polypeptides can specifically target the novel coronavirus.

In some embodiments, the polypeptide vaccine may also be formed by combining the epitope from the T cell and the epitope from the B cell, so that the combined polypeptide vaccine facilitates the enhancement of the immune effect. Specifically, whether the above 40 polypeptides are from the T cell epitope or the B cell epitope may be distinguished according to multiple epitope prediction software.

In some embodiments, the polypeptide vaccine includes the polypeptides derived from different proteins. More preferably, there are no more than two polypeptides derived from the same protein in the polypeptide vaccine. Further preferably, the polypeptide vaccine is selected from any one of the following combinations: