Since the genome for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) was released on Jan. 11, 2020, scientists raced to develop a vaccine.1 However, vaccine development is a long and expensive process, at least in part because of evolutionary strategies adopted by viruses to evade immune surveillance and exploit host cell systems, e.g., molecular mimicry of host proteins. Accordingly, there is a long and unmet need for other treatment modalities for viral infection. Embodiments of the present disclosure relate to such methods of treatment, including at least in part, the identification and exploitation of molecular mimicry exhibited by viruses.
The invention provided herein, relates, at least in part, on molecular mimicry of host proteins as used by viruses (.e.g., coronavirus, such as SARS-CoV-2) for evasion of immune surveillance and to exploit host cell systems. The surface of SARS-CoV-2 virions is coated with the spike (S) glycoprotein, whose proteolysis is key to the infection lifecycle. After the initial interaction of the S-protein with the ACE2 receptor', host cell entry is mediated by two key proteolytic steps. The S1 subunit of the S-protein engages ACE2, and viral entry into the host cell is facilitated by proteases that catalyze S1/S2 cleavage2,3 at Arginine-667/Serine-668 (
In some aspects of the invention, disclosed herein are methods of treating or reducing viral infection in a human subject, comprising administering to the subject a pharmaceutical composition comprising a protease inhibitor.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure pertains. Preferred methods and materials are described below, although methods and materials similar or equivalent to those described herein can also be used in the practice or testing of the presently disclosed methods and compositions. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety.
As provided herein, single cell RNA-sect from 65 studies showed significant overlap between the expression of ENaC-α, and ACE2, the putative receptor for the virus, in cell types linked to the cardiovascular-renal-pulmonary pathophysiology of COVID-19. Triangulating this cellular fingerprint with amino acid cleavage signatures of 178 human proteases showed the potential for tissue-specific proteolytic degeneracy wired into the SARS-CoV-2 lifecycle. Thus, the evolution of SARS-CoV-2 into a global pandemic may be in part due to targeted mimicry of human ENaC and hijack of the associated host proteolytic network.
Aspects of the invention disclosed herein include methods of treating or reducing viral infection in a human subject, comprising administering to the subject a pharmaceutical composition comprising a protease inhibitor. In some aspects, provided herein are methods that can be used to reduce infectivity of a viral particle, e.g., reducing propagation of the virus particle; reducing release of the virus particle from a cell; and preferably reducing entry of the virus particle into a cell.
In some embodiments, the pharmaceutical composition comprises an inhibitor of one or more proteases selected from Furin, PCSK2, PCSK 4, PCSK 5, PCSK 6, PCSK 7, PLG, or any combination thereof. In some such embodiments, the composition is administered to the subject directly to an organ selected from lung, small intestine, large intestine, pancreas, kidney, heart, or tongue. Said composition may be administered subcutaneously, intravenously, intramuscularly, intranasally, by inhalation, orally, sublingually, by buccal administration, topically, transdermally, or transmucosally, as are known in the art. For example, and without limitation, the composition is administered to the subject directly to the nasopharynx and/or the oropharynx. In preferred embodiments, the subject is suffering from SARS-CoV-2 infection.
As used herein, the expression “treating or reducing viral infection” means improving, reducing, or alleviating at least one symptom or biological consequence of virus infection in a subject, and/or reducing or decreasing virus titer, load, replication or proliferation in a subject following exposure to a virus. The expression “treating or reducing viral infection” also includes shortening the time period during which a subject exhibits at least one symptom or biological consequence of the infection. Methods for treating virus infection, according to the present invention, comprise administering a pharmaceutical composition of the present invention to a subject after the subject is infected with the virus and/or after the subject exhibits or is diagnosed with one or more symptoms or biological consequences of virus infection. Preferably, the virus is a coronavirus such as SARS-CoV, or variants thereof. More preferably the virus is SARS-CoV-2.
As will be appreciated by those of skill in the relevant art, the symptom or biological consequence of virus infection will depend on the infecting viral particle. As a non-limiting example, SARS-CoV infection may include one or more of nasal congestion, sinus congestion, runny nose, sneezing, body (muscle) ache, head ache, chills, fever, cough, sore throat, fatigue, ear ache, or a diagnostic indicator of infection, e.g., detection of SARS-CoV by viral culture, hemagglutinin agglutination inhibition (HAI) assay, immunofluorescence, or nucleic acid-based detection (e.g., RT-PCR) using an appropriate specimen (e.g., nasal swab, nasopharyngeal swab, throat swab, endotracheal aspirate, sputum, bronchial wash, etc.). Thus, a subject who tests positive for infection by a diagnostic assay is considered a subject exhibiting a “symptom or biological consequence” of said virus infection.
As used herein, a “subject” shall mean a vertebrate animal including but not limited to a human, non-human primate, mouse, rat, guinea pig, rabbit, cow, dog, cat, horse, goat, bird, reptile, or fish. In some embodiments of the invention, a subject is a mammal. In some embodiments, the subject may be a domesticated animal, a wild animal, or an agricultural animal. Thus, the invention can be used to inhibit virus particle infectivity and to treat or reduce viral infection in human and non-human subjects. For instance, methods and compositions of the invention can be used in veterinary applications (for examples in zoos, reserves, farms, in the wild, etc.) as well as in human treatment regimens. In some embodiments of the invention, the subject is a human. In some embodiments of the invention, a subject is at risk of having, or has a viral infection.
The treatment approaches disclosed herein take advantage of the molecular mimicry exhibited by viruses as a strategy to evade immune surveillance and exploit mechanisms of the host cell to maintain or increase infectivity. The investigation of targeted mimicry of host proteins lead to the discovery that viruses may selectively hijack the host proteolytic network, which may represent therapeutic targets. Thus, disclosed herein are agents that target the activity a target protease (e.g., protease inhibitors). Without being bound by any particular theory, such agents/inhibitors include chemical compounds, small molecules, mixtures of chemical compounds and/or a biological macromolecules (such nucleic acids, antibodies, antibody fragments, proteins or a peptides). Moreover, the agents contemplated herein include those known in the art, and those that may be identified by screening or validation assays known in the art.
In some embodiments, the agent is an antibody or antibody fragment that binds specifically to the protease of interest. In some embodiments, the antibody depletes, neutralizes, or inhibits one or more associated activities of said protease.
In some embodiments, the inhibitor is an interfering nucleic acid specific for an mRNA product of a target gene encoding the protease. Such interfering nucleic acids are known in the art and include, without limitation, siRNAs, shRNAs, miRNAs, peptide nucleic acids (PNAs), and the like, as are known in the art.
Without being bound by any particular theory or methodology, a combination of agents may he administered to the subject in need thereof. The combination and administration of such agents may be informed, at least in part, by the methods disclosed herein. In some such embodiments, the combination of agents can be administered either in the same formulation or in separate formulations, either concomitantly or sequentially. Thus, a subject who receives such personalized treatment can benefit from a combined effect of different therapeutic agents.
In some embodiments, the agent (e.g protease inhibitor), or combination of agents, is administered with a vaccine composition. The agent, or agents, may be administered either in the same formulation as the vaccine or in separate formulations, either concomitantly or sequentially. For example, in sonic embodiments, the protease inhibitor is administered to a vaccinated subject, or is administered prior to vaccination of a subject.
It will be appreciated by those of ordinary skill in the art that evaluation of a treatment, e.g., evaluation of the symptoms or clinical end-points of a viral infection, can be used in conjunction with methods of the invention to assess the status of a viral infection and/or the efficacy of a treatment for a viral infection.
Comparing SARS-CoV-2 with SARS-CoV shows that the former has acquired a distinctive sequence insertion at the S1/S2 site (
RRARSVAS
L
RARSVAS
W
RARSVAS
Q
RARSVAS
ENaC regulates Na+ and water homeostasis and its expression levels are controlled by aldosterone and the associated Renin-Angiotensin-Aldosterone System (RAAS)6. Similar to SARS-CoV2, ENaC-α needs to be proteolytically activated for its function7. FURIN cleaves the equivalent peptide on mouse ENaC-α between the Arginine and Serine residues in the 4th and 5th positions respectively (RSARISASS)8,9, akin to the recent report establishing FURIN cleavage at the S1/S2 site of SARS-CoV-2 (
Mimicry of human ENaC-α by the S1/S2 site suggests that SARS-CoV-2 is hijacking the protease network of ENaC-α for viral activation. An overlap between putative SARS-CoV-2 infecting cells and ENaC-α expressing cells was observed. Systematic single cell expression profiling of the ACE2 receptor and ENaC-α was performed across human and mouse samples comprising ˜1.3 million cells (
A 160-dimensional vector space (20 amino acids×8 positions on the peptide) was created for assessment of cleavage similarities between the 178 human proteases with biochemical validation from the MEROPS database15. FURIN (PCSK3) had overall proteolytic similarity to select PCSK family members, specifically PCSK5 (0.99), PCSK7 (0.99), PCSK6 (0.99), PCSK4 (0.98), and PCSK2 (0.94) (Table 2). It is also known that the protease PLG cleaves the -subunit of ENaC (ENaC-)16.
1.00
R(158)
K(34)
K(88)
R(203)
S(57)
V(46)
S(30)
S(27)
R(68)
A(23)
A(34)
G(21)
R(26)
D(22)
L(31)
R(97)
K(23)
K(59)
R(125)
S(37)
V(25)
S(15)
E(15)
S(16)
R(41)
A(24)
G(15)
L(13)
R(13)
L(22)
D(15)
E(13)
R(77)
K(18)
K(49)
R(100)
S(31)
V(25)
S(13)
E(13)
R(12)
R(32)
A(19)
G(12)
P(11)
S(11)
L(15)
D(12)
L(11)
E(11)
R(85)
K(19)
K(53)
R(102)
S(33)
V(29)
G(15)
E(13)
S(12)
S(36)
A(20)
S(14)
L(12)
R(12)
L(15)
D(13)
P(11)
E(11)
R(85)
K(23)
K(54)
R(112)
S(34)
V(25)
D(14)
E(15)
S(13)
R(38)
L(22)
S(13)
R(12)
A(20)
G(13)
E(13)
R(86)
Q(27)
K(123)
R(192)
S(43)
V(27)
G(31)
E(31)
S(22)
R(44)
Y(25)
G(23)
D(27)
L(22)
A(22)
P(18)
R(17)
L(15)
R(65)
S(23)
R(13)
S(13)
A(16)
S(13)
K(57)
A(20)
R(13)
The tissue tropism of SARS-CoV-2, in view of the cleavage similarities found within the host proteolytic network, the co-expression of these proteases concomitant with the viral receptor ACE2, and ENaC-α was assessed (
The mimicry of a cleavable host peptide central to cardiovascular, renal, and pulmonary function provides a further strategy for the treatment and/or prevention of viral infection.
The complete S-protein sequence for SARS-CoV (Uniprot ID: P59594) and SARS-CoV-2 was obtained from uniprot (ftp.uniprot.org/pub/databases/uniprot/pre_release)16. Sequence alignments using Clustal-W, and comparison of SARS-CoV-2 versus other coronavirus strains were performed using JalView17.
The number of 8-mors in Uniprot 20,350 reference sequences are 10,257,893 (10.26M). The previously identified SARS-CoV-2 8-mer ‘RRARSVAS’ was found in a Uniprot reference sequence (p-value≈10.26M/208=4E-4; chance of finding that particular 8-mer anywhere in the reference sequences).
Biochemical specificities are inferred from substrates of the proteases that have been determined using the evidence from (i) N-terminal sequencing, (ii) mass spectroscopy, (iii) mutational studies, (iv) consensus analysis or (v) liquid chromatography. The protein residues from the protease substrates spanning the scissile bond (±4 residues) is considered to define sequence specificity of the motif. The motif length thus spans across 8 residues (or positions), and frequency of all the 20 amino acids at each position is calculated from the substrates identified for each of the proteases to get a position frequency matrix (PFM, with a dimension of 8×20). This PFM was converted to a probability matrix by normalizing to the frequency distribution of all the 20 amino acids per position. The position frequency matrices were downloaded from the MEROPS database15. In total, probabilities matrices for 178 proteases were constructed. The number of distinct cleavages for each protease ranges from as few as 10 to over 3000, based on which probability matrices for the cleavage of the 8-mer polypeptide was estimated (Table 2). The probability matrix is then used to scan the peptide sequence (‘RRARSVAS’) using a window frame equal to the length of the motif. A log-likelihood ratio score is calculated for each sequence position in the scanned window with respect to background distribution of particular amino acid in the human proteome and summed up across all the positions to get a motif score. A random set of sequences are generated from the given background of amino acid distribution in the human proteome and used as a null model to evaluate the statistical significance of the scanned window. FIMO tool was used to obtain the scores and the corresponding statistical significance of the PWM match18. These were then used to estimate cleavage propensity by assuming background distribution of amino-acid frequencies from human proteome19 using MEME20 and FIMO18. A stringent cut-off of 1E-04 was used to identify the potential cleavage sites on S proteins of both SARS-CoV and SARS-CoV-2 at S1/S2 site. The hits obtained were further manually filtered to ensure that residue at the immediate vicinity of cleavage site is conserved.
The position frequency matrix (PFM) of the individual proteases obtained from the MEROPS database was converted to a probability weight matrix (PWM) (normalized and scaled). Out of 178 proteases, there were 146 proteases that had specificity information available on the 8 mer peptide spanning the cleavage site (±4). The 20 (amino acids)×8 (position) matrix defined for each of the proteases were flattened into a single vector with 160 elements. A cosine similarity calculation was performed between all pairs (X,Y) of protease specificity vector. The similarity was derived as the normalized dot product of X and Y: K(X, Y)=<X, Y>/(∥X∥*∥Y∥)).
A systematic expression profiling of the ACE2 and ENaC-α was performed across 65 published human and mouse single-cell studies comprising ˜1.3 million cells using nferX Single Cell platform (Table 3, academia.nferx.com)6. The ACE2 expression could be detected in 67 studies (59 studies of human samples and 8 studies of mouse samples) spanning across ˜50 tissues, over 450 cell-types and ˜1.05 million cells. In order to call a given cell-type to be positive for both ACE2 and a protease we applied a cutoff of 1% of the cells in the total cell-type cluster population to have a non-zero count associated with both ACE2 and the respective protease. The mean expression of the proteases, ENaC-α and ACE2 was derived for individual cell population within each of the studies. The cell-type information was obtained from the author annotations provided for each of the studies. The analysis was performed separately on the mouse and human datasets. For each protease, the mean expression of in a given cell-population (mean log[cp10k+1] counts) was Z-score normalized (to ensure the sd=1 and mean −0 for all the genes) to obtain relative expression profiles across all the samples. The same normalization was applied to ACE2 and ENaC-α and both human and mouse datasets were analyzed independently by generating heatmaps. The cell types having zero-expression values of ACE2 were also included as negative control to probe the expression of various proteases.
Mus
musculus
Mus
musculus
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Mus
musculus
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Mus
musculus
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Mus
musculus
Mus
musculus
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Mus
musculus
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
Homo
sapiens
An analysis to identify the cell types with significant overlap of ACE2 and ENaC-α expression was performed. To this end, cell types in which ENaC-α is expressed in a significantly higher proportion of ACE2-expressing cells than in the overall population of cells of that sub-type were shortlisted. The ratios of these proportions were computed, and a corresponding Fisher exact test was used to compute significance.
1. Walls, A. C. et al. Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell (2020) doi:10.1016/j.cell.2020.02.058.
2. Belouzard, S., Chu, V. C. & Whittaker, G. R. Activation of the SARS coronavirus spike protein via sequential proteolytic cleavage at two distinct sites. Proceedings of the National Academy of Sciences 106, 5871-5876 (2009).
3. Belouzard, S., Millet, J. K., Licitra, B. N. & Whittaker, G. R. Mechanisms of Coronavirus Cell Entry Mediated by the Viral Spike Protein. Viruses 4, 1011-1033 (2012).
4. Elbe, S. & Buckland-Merrett, G. Data, disease and diplomacy: GISAID's innovative contribution to global health: Data, Disease and Diplomacy. Global Challenges 1, 33-46 (2017).
5. Carrillo-Tripp, M. et al. VIPERdb2: an enhanced and web API enabled relational database for structural virology. Nucleic Acids Research 37, D436—D442 (2009).
6. Rossier, B. C., Baker, M. E. & Studer, R. A. Epithelial Sodium Transport and Its Control by Aldosterone: The Story of Our Internal Environment Revisited. Physiological Reviews 95, 297-340 (2015).
7. Noreng, S., Bharadwaj, A., Posert, R., Yoshioka, C. & Baconguis, I. Structure of the human epithelial sodium channel by cryo-electron microscopy. Elife 7, (2018).
8. Hughey, R. P., Bruns, J. B., Kinlough, C. L. & Kleyman, T. R. Distinct pools of epithelial sodium channels are expressed at the plasma membrane. J. Biol. Chem. 279, 48491-48494 (2004).
9. Hughey, R. P. et al. Epithelial sodium channels are activated by furin-dependent proteolysis. J. Biol. Chem. 279, 18111-18114 (2004).
10. Welzel, M. et al. Five novel mutations in the SCNN1A gene causing autosomal recessive pseudohypoaldosteronism type 1. European Journal of Endocrinology 168, 707-715 (2013).
11. Venkatakrishnan, A. et al. Knowledge synthesis from 100 million biomedical documents augments the deep expression profiling of coronavirus receptors. http://biorxiv.org/lookup/doi/10.1101/2020.03.24.005702 (2020) doi:10.1101/2020.03.24.005702.
12. Shweta, F. et al. Augmented Curation of Unstructured Clinical Notes from a Massive EHR System Reveals Specific Phenotypic Signature of Impending COVID-19 Diagnosis. http://medrxiv.org/lookup/doi/10.1101/2020.04.19.20067660 (2020) doi:10.1101/2020.04.19.20067660.
13. Butterworth, M. B. Regulation of the epithelial sodium channel (ENaC) by membrane trafficking. Biochimica et Biophysica Acta (BBA)—Molecular Basis of Disease 1802, 1166-1177 (2010).
14. Musante, I. et al. Peripheral localization of the epithelial sodium channel in the apical membrane of bronchial epithelial cells. Exp Physiol 104,866-875 (2019).
15. Rawlings, N. D. et al. The MEROPS database of proteolytic enzymes, their substrates and inhibitors in 2017 and a comparison with peptidases in the PANTHER database. Nucleic Acids Research 46, D624—D632 (2018).
16. Passero, C. J. et al. Plasmin Activates Epithelial Na+ Channels by Cleaving the γ Subunit. J. Biol. Chem. 283,36586-36591 (2008).
All publications and patents mentioned herein are hereby incorporated by reference in their entirety as if each individual publication or patent was specifically and individually indicated to be incorporated by reference. In case of conflict, the present application, including any definitions herein, will control.
While specific embodiments of the subject invention have been discussed, the above specification is illustrative and not restrictive. Many variations of the invention will become apparent to those skilled in the art upon review of this specification and the claims below. The full scope of the invention should be determined by reference to the claims, along with their full scope of equivalents, and the specification, along with such variations.
This application claims the benefit of U.S. Provisional Application No. 63/183,140, filed May 3, 2021, which is hereby incorporated by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
63183140 | May 2021 | US |