Exploring COVID-19: Relating the spike protein to infectivity, pathogenicity and Immunogenicity

The disease which reportedly began in Chinese city Wuhan in November-December 2019 manifesting as severe respiratory illness, soon spread to various parts of the world, and was named COVID-19, and declared a pandemic disease by the World Health Organization (WHO). The disease further spread to certain Arabian and European countries. While these countries were recovering from the epidemic, the disease took hold in the UK and USA, and the South American, Arabian, and South Asian countries, predominantly affecting Brazil, Peru, Iran, and India. Presently, most of the European countries are witnessing recurrent outbreaks and a resurgence of COVID-19, whereas the epidemic is not yet over in other countries [1]. This calls for a need for exploring further the factors affecting the disease transmission, pathogenicity, and immunogenicity, apart from rigorous preventive and control measures, and vigorous vigilance and surveillance.

glycoprotein triggered by ACE2 receptor binding. The cleavage at the S1/S2 site yields a surface subunit S1, which attaches the virus to the host cell ACE2 receptor, and a transmembrane subunit S2, which mediates the fusion of viral and host cell membranes to facilitate viral entry. Following entry, SARS-CoV-2 releases its genomic RNA into the host cell cytoplasm.
Genome RNA is irst translated into viral replicase polyproteins (pp1a and 1ab), which are further cleaved by viral proteases into some 16 non-structural proteins. A replication-transcription complex (RTC) is formed based on the non-structural proteins. In the process of genome replication and transcription mediated by RTC, the negative− sense genomic RNA is synthesized and used as a template to produce positive-sense genomic RNA and sub-genomic RNAs.
The nucleocapsid (N) structural protein and viral RNA are replicated, transcribed, and synthesized in the cytoplasm, whereas other viral structural proteins, including the S protein, membrane (M) protein and envelope (E) protein, are transcribed and then translated in the rough endoplasmic reticulum (RER) and transported to the Golgi complex.
In the RER and Golgi complex, the SARS-CoV-2 glycoprotein is subjected to co-translational and post-translational processing, including signal peptide removal, trimerization, extensive glycosylation and subunit cleavage. The N protein is subsequently associated with the positive sense genomic RNA to become a nucleoprotein complex (nucleocapsid). While S, M, and E proteins as well as other viral proteins are further assembled, followed by budding into the lumen of the ER-Golgi intermediate compartment (ERGIC) to form mature virions.
Finally, the mature virions are released from the host cell.

Exploring SARS-CoV-2 S protein
The virus and spike (S) protein: The novel coronavirus, SARS-CoV-2 is a single-stranded RNA-enveloped virus. Its RNA-based genome is 29,881 bp in length and encodes 9860 amino acids. The SARS-CoV-2 S protein, primarily facilitating the viral entry into the host cells, is a 1273 amino acid homotrimeric class I fusion protein. The third open reading frame (ORF) in the SARS-CoV-2 genome encodes the S protein, which is the largest protein in the group of four structural proteins including M, E and N proteins [2]. The SARS-CoV-2 S protein bears 76% amino acid sequence identity with SARS-CoV and 93% and 97% amino acid identity with that of the Bat-CoV RaTG13 and Pangolin-CoV, respectively [3]. Due to the conserved residues and binding to the receptors, immunity to SARS-CoV appears to confer a limited immunity to SARS-CoV-2 [4].
The SARS-CoV-2 S protein identi ies a transmembrane protein, neuropilin-1 on the surface of human cells, through which it facilitates the virus to attach, invade and infect the host cells [5]. The neuropilins are adhesion molecules responsible for cellular activities such as cell adhesion, survival, repulsion, and attraction.
The structure of S protein: The S protein has a size of 180-200 kDa and length of 1273 aa. The S protein trimers visually form a characteristic bulbous, crown-like protrusions surrounding the viral particle. The S protein is composed of various functional regions or domains. The globular head, S1 subunit contains the N-terminal domain (NTD) and receptorbinding domain (RBD), whereas the stalk, S2 subunit contains the C-terminal (CT) membrane fusion domain followed by the two heptad regions (HR1 and HR2), the transmembrane domain (TMD) and the cytosolic tail (CT). The S protein ectodomain is heavily glycosylated with heterogeneous N-linked glycans and exists in a prefusion and a post-fusion conformation. The associated oligosaccharides in luence priming by host proteases and conceal it from antibody recognition [6].
The RBD binds to the aminopeptidase N (APN) region of ACE2 receptors in the host cells [7]. APN is transmembrane glycoprotein expressed on the apical membranes of epithelial cells in the respiratory and enteric tracts, endothelial cells, and renal cells; at synaptic junctions in neural tissues; and on cells of the immune system. While the S1 RBD domain is a highly mutable region, the HR region of the S2 subunit is conserved among HCoVs. The mutations of key residues in RBD can enhance the interactions with ACE2 [8]. The S1 subunit binds with ACE2 to promote the formation of endosomes and triggers the viral fusion activity. The S2 subunit, composed successively of a FP, HR1, HR2, TM domain, and cytoplasmic tail domain (CT), is responsible for viral fusion and entry. The HR1 and HR2 are composed of a repetitive heptapeptide: HPPHCPC, where H is a hydrophobic or traditionally bulky residue, P is a polar or hydrophilic residue, and C is a charged residue ( Figure 1). The S protein exists in a metastable conformation and when the virus interacts with the host cell, extensive structural rearrangement of its components occurs, allowing the virus to fuse with the host cell membrane. The switch in conformations is triggered by RBD-ACE2 receptor binding, which exposes various regions within the S2 subunit [9].

ACE2 receptor binding and viral fusion:
To initiate cellular entry, while engaging with ACE2 receptor, the RBD moves like a hinge between two conformations ('up' or 'down') to expose or hide the residues that bind the APN region. Within the RBD, there is a receptor binding motif (RBM) which makes the primary contact with the carboxypeptidase domain of ACE2. SARS-CoV-2 binds to ACE2 with higher af inity than SARS-CoV owing to the presence of a 4-residue motif in the RBM that makes a better contact with ACE2 than the SARS-CoV S protein. ACE2 as receptors, however, have differential tissue tropism and the structural variations in ACE2 may in luence its binding with the S protein [10].
There occurs change in S2 conformation with the RBD bonding with ACE2, which exposes the cleavage sites to cellular proteases, followed by cleavage of the S protein by transmembrane protease serine 2 (TMPRSS2) and other cellular proteases. The cleavage initiates the insertion of FP into the host cell membrane, exposure of the pre-hairpin coiled-coil of the HR1 domain and its interaction with the HR2 to form the six-helical bundle (6-HB). HR1 forms a homo-trimeric assembly in which three highly conserved hydrophobic grooves on the surface binding to HR2 are exposed and bring the viral envelope and cell membrane into proximity for fusion and entry [11].
Whereas other CoVs spike proteins are cleaved at the junction between S1 and S2, the SARS-CoV2 has an additional distinct protease (furin) cleavage site (S2'). With the binding of S1 to the receptor, S2 undergoes an additional cleavage by host proteases through exposure of a second cleavage site (S2′). In the two-step process, the cleavage at the S2′ site activates the protein for membrane fusion and allows fusion of the viral membrane with the host cell membrane. Apart from TMPRSS2, the SARS-CoV-2 S protein can be proteolytically activated by a variety of other cellular proteases including cathepsin B and L (endosomal cysteine proteases), furin, elastase, factor X and trypsin. The protease induced priming and proteolysis initiate the process of cellular entry.
The second cleavage site (S2') changes the conformation of the S protein from the prefusion to the post-fusion state and expands the versatility of SARS-CoV-2 for cleavage by cellular proteases, the tropism and transmissibility owing to the wide cellular expression of furin proteases, especially in the respiratory tract [12]. This also allows the newly synthesised virions can be secreted in a 'preactivated' state ready to fuse with and infect other cells by releasing the FP to disrupt the host cell bilayer-lipid membrane lipid bilayers to prime for fusion without the need to bind to a cellular receptor, such as ACE2. The SARS-CoV-2 S protein has thus evolved further to exploit respiratory cell receptors and proteases to enable enhanced infectivity and rapid spread.
The steps in viral invasion and internalisation, thus, include binding of the virus to the cell surface, alteration of the conformation of the S protein, proteolysis of the S protein, and release of the S2 subunit followed by fusion of the virion and endocytosis. The S protein binds to the host cell by recognizing the receptor ACE2, which is a homolog of ACE and distributed in the lungs, intestines, heart and vasculature, and kidneys. The alveolar epithelial type II cells are the major expressing cells. SARS-CoV-2 S binds to human ACE2 with a dissociation constant (KD) of 14.7 nM, which for SARS-CoV S is 325.8 nM, indicating that SARS-CoV-2 S is more sensitive to ACE2 than is SARS-CoV S [13].
The S subunits exist in a noncovalent form in the uncleaved state and cleavage of S1 and S2 subunits by host proteases, initiates the fusion process. The proteases furin and neuropilin-1 (NRP1), which are abundantly expressed in the respiratory and olfactory epithelium, with highest expression in endothelial and epithelial cells, bind the substrates and potentiate the SARS-CoV-2 infectivity [14]. Trypsin is another host cell protease that can cleave the S protein. Viral fusion process refers to fusion of the viral membrane and host cell membrane, followed by internalisation and the release of the viral genome into the host cell.

Immunogenicity of the s protein
The experimental immunological studies: The S protein is considered the main antigenic component among structural proteins of SARS-CoV-2 and responsible for inducing the host immune response. The neutralising antibodies (nAbs) targeting the S protein are produced and may confer a protective immunity against the viral infection. The RBD region is a critical target for nAbs for preventive as well as blocking therapy with antibodies. Further, the role of the S protein in infectivity also makes it an important tool for diagnostic antigen-based testing and vaccine development [15]. On the other hand, several monoclonal antibodies (mAbs) have shown promising results in neutralizing SARS-CoV-2. CR3022, a SARS-CoV-speci ic human mAb, binds potently with SARS-CoV-2, and has potential as therapeutic as well as prophylactic agent alone or in combination with other nAbs, for SARS-CoV-2 infection. In addition, sera from SARS patients during convalescence or animals speci ically immunized with SARS-CoV S1 may cross-neutralize SARS-CoV-2 and reduce viral load or S protein-mediated SARS-CoV-2 entry.
Yuan, et al. have determined the structure of CR3022, the neutralizing antibody obtained from convalescent COVID-19 patients in complex with the receptor-binding domain of SARS-CoV-2 spike [16]. While Yu, et al. designed a series of prototype DNA vaccines against the SARS-CoV-2 S protein [17]. The analysis of the vaccine candidates in rhesus macaques have shown that the animals developed protective humoral as well as cellular immune responses when challenged with the virus. Neutralizing antibody titers were also observed at levels similar to those seen in humans who have recovered from SARS-CoV-2 infection [18].
In addition, the peptide fusion inhibitors such as nel inavir mesylate, suppress both SARS-CoV-2 S and SARS-CoV S-mediated the virion-cell fusion. Nel inavir may also inhibit TMPRSS2 involved in activation of the S protein. Camostat mesylate, a protease inhibitor targeting SARS-CoV-2 S cleavage sites, is a potent serine protease inhibitor of TMPRSS2 and appear to block the SARS-CoV-2 cellular entry [19].
The COVID-19 pandemic has dramatically accelerated global vaccine development efforts, most targeting the viral S protein. There are SARS-CoV-2 vaccine candidates based on types DNA-and RNA-based formulations, recombinant subunits containing viral epitopes, adenovirus-based vectors, and inactivated virus, under development. The inactivated virus vaccines have been traditionally used. Thus, inactivated virus vaccine development for SAR-CoV-2 may be a time-tested and effective way for the prevention of disease [20]. Gao, et al. developed experimentally a puri ied inactivated SARS-CoV-2 virus vaccine candidate, PiCoVacc, which was able to induce SARS-CoV-2-speci ic neutralizing antibodies in mice, rats, and nonhuman primates. These antibodies neutralized various SARS-CoV-2 strains, thus displaying a possible broader neutralizing ability. The animals were immunized with one of two vaccine doses and then inoculated with SARS-CoV-2. Those having received the lowest dose showed signs of controlling the infection, whereas those receiving the higher dose appeared more protected with undetectable viral load in the pharynx or lungs at 7 days after infection.
The immune response to S protein: In the early phase of SARS-CoV-2 infection, T-and B-cell counts are substantially decreased [21]. In a period of about 2 weeks following onset of symptoms, IgM and IgG become detectable. The T cells play an important role in supporting the development of the B cell response, while the B cells produce the antibodies that recognize SARS-CoV-2 S protein. A particular subset of T cells, called T-follicular helper (TFH) cells, is predictor of an effective immune response. The circulating TFH cells are S-speci ic and functional, and the occurrence of CXCR3+ TFH cells is positively correlated with neutralising antibody titres in COVID-19-convalescent individuals [22].
The S glycoprotein plays essential roles in virus attachment, fusion, and entry into the host cell. Simultaneously, its surface location renders it a directly accessible target for host immune response [23]. The SARS-CoV-2 RBD has a high hACE2 binding af inity than SARS-CoV to support ef icient cell entry. Further, SARS-CoV-2 RBD is less exposed than SARS-CoV RBD and helps the virus to evade immune response. Furthermore, the SARS-CoV-2 spike contains a proprotein convertase (PPC) motif at the S1/S2 boundary and its prior cleavage during viral packaging enhances the ef iciency for entry into new target cells including those with low expression of TMPRSS2 and other proteases [24].
The immune syste m responds to the S protein, both S1 and S2 subunits being highly antigenic [25]. As a result, the S-speci ic antibodies, memory B and circulating TFH cells are consistently elicited following SARS-CoV-2 infection and COVID-19 vaccine shots in the clinical trials ( Figure 2).
There evolves humoral and circulating TFH immunity against S protein in the convalescent COVID-19 patients and those with robust B cell response show a strong plasma neutralising activity [26]. Still, in the COVID-19-convalescent individuals, the titres of spike-speci ic neutralising antibodies are variable [27]. In the clinical studies, the convalescent individuals who experienced severe COVID-19 showed higher neutralising antibody titres, a faster increase in lymphocyte counts and a higher frequency of CXCR3+ T follicular help (TFH) cells compared with COVID-19-convalescent individuals who experienced non-severe disease [28]. Remarkably, those recovering from severe COVID-19 elicit and maintain higher antibody and neutralization titres than the non-severe group. Thus, the neutralising antibody titre appears to correlate with the severity of the disease [29].
These indings provide insights into neutralising antibody responses in COVID-19-convalescent individuals and may facilitate the treatment and vaccine development for SARS-CoV-2 infection. The neutralising antibodies are crucial in protecting from re-infection. They bind to the S protein as well as prevent it from getting to attached to the host cells to facilitate virus entry [30]. The presence of anti-spike or anti-nucleocapsid IgG antibodies has been associated with a substantially reduced risk of SARS-CoV-2 reinfection in the ensuing 6 months [31]. Thus, generating a strong neutralising antibody response is the primary goal for SARS-CoV-2 vaccines.

Associated immunological phenomena: Antibodies predating COVID-19 infection:
It has been documented that immunological memory after infection with seasonal human coronaviruses (hCoVs) may potentially contribute to crossprotection against severe acute SARS-CoV-2 infection [32]. It has been seen that a proportion of SARS-CoV-2-uninfected persons have circulating immunoglobulin G (IgG) antibodies that could cross-react with the S2 subunit of the SARS-CoV-2 S protein. By contrast, the COVID-19 patients generate IgA, IgG, and IgM antibodies that recognize both the S1 and S2 subunits.
Experimentally, these anti-S2 antibodies from SARS-CoV-2-uninfected patients have neutralizing activity against both SARS-CoV-2 and SARS-CoV-2 S pseudo-types. A much higher percentage of SARS-CoV-2-uninfected children and adolescents show these antibodies compared with adults, which may be because children and adolescents generally have higher hCoV infection rates and a more diverse antibody repertoire, which may explain the age distribution of COVID-19 susceptibility [33].
Similarly, the study by Demers-Mathieu et al. has evaluated the presence and the levels of antibodies reactive to SARS-CoV-2 S1 and S2 subunits in the human milk. These antibodies are reactive to S protein and nucleocapsid protein and could provide passive immunity to breastfed infants and protect from COVID-19 [34].
There exists striki ng structural similarity and sequence conservation between the SARS-CoV-2 S and SARS-CoV S glycoproteins that recognize hACE2 to enter the host cells. This resemblance is further strengthened by inding that SARS-CoV S elicits polyclonal antibody (pAb) response which can potently neutralize the SARS-CoV-2 S-mediated entry into the cells. Most of the SARS-CoV neutralising Abs target the SB domain and several of them recognize the RBM and prevent receptor engagement [35]. Similarly, all other SARS-CoV-2 vaccine candidates entering clinical trials contain or express full-length or near full-length S protein and therefore also bear risk of ADE, albeit less so [36]. Considering this phenomenon, the continued search for a safe and effective SARS-CoV-2 vaccine may appear a farther dream but should go on.
The recombinant RBD proteins of SARS-CoV and MERS-CoV have been shown to potently induce protective nAbs. Further, it has been shown that the RBD part within RBD-Fc fusion protein is responsible for induction of nAbs against SARS-CoV-2. Furthermore, the anti-SARS2-RBD sera have been found to cross-react with SARS-RBD. Therefore, the observed cross-binding with SARS-RBD and cross-neutralization of SARS-CoV is likely to be contributed by antibodies targeting the conserved SARS-CoV-2 core subdomain, which contains cross-neutralization antibody epitopes. Zang, et al. have documented that the anti-RBD sera exhibited potent neutralization effects on SARS-CoV-2. Moreover, the anti-RBD sera also inhibited SARS2-S-mediated cell-cell fusion. On this basis, it seems that the anti-RBD antibodies do not promote ADE [37].

Development of COVID-19 vaccines:
Various SARS-CoV-2 vaccine candidates based on various vaccine platforms, such as inactivated or live attenuated vaccines, DNA and mRNA vaccines, viral vector-based vaccines, and recombinant protein-based vaccines, have been developed [38]. Most of these vaccine strategies are based on the full-length S glycoprotein, which is the major SARS-CoV-2 surface antigen. Though the S protein is thought to be a promising vaccine immunogen for generating protective immunity, optimizing antigen design is critical to ensure an optimal immune response through exposing more neutralising epitopes and displaying fewer potentially weakly or non-neutralizing epitopes.
1. There is fear of erratic immune response to the S protein-based vaccines [39]. The RBM appears the most immunodominant neutralizing epitope of the whole S protein, capable of readily eliciting strong neutralizing antibody response but presents dif iculties in designing and developing vaccine because of certain issues as folloThe native trimeric SARS-CoV-2 S protein conceals its immunodominant RBMs by adopting the closed conformation. The SARS-CoV-2 evades immune surveillance also through conformational masking [40].
2. The S1 subunit spontaneously dissociates from the S glycoprotein as a trimer to assume the RBD closed conformation, leaving only the post-fusion S2 trimer. The resulting S1 and S2 subunits might expose immunodominant, non-neutralising epitopes that are utilized by SARS-CoV-2 to serve as decoys to distract the host immune system, inducing a large proportion of ineffective antibody responses. Besides the RBD, which has been shown to be a major target for human neutralising antibody responses, the NTD has recently been identi ied to be a new vulnerable site of the SARS-CoV-2 S protein for antibody mediated neutralization and therefore could also serve as a recombinant proteinbased vaccine [41]. The NTD-speci ic neutralising antibodies are likely to target the S protein in both closed and open conformations. In addition, the apparent accessibility of the FP and HR1 region in the SARS-CoV-2 S ectodomain trimer may be good immunogen candidates for epitope-focused vaccine design aimed at raising effective CoV neutralising antibodies.

Mutations and variation in SARS-CoV-2:
Understanding the nucleotide variations in the SARS-CoV-2 genome provides a useful insight for the evolution of the disease and the propagation of the pandemic [42]. The early variations in SARS-CoV-2 have made their way almost unnoticed as the virus spread around the world. Whereas most variations or mutations have no impact on the viral ability to transmit or cause disease, some mutations appear to have impact on transmissibility, infectivity, or lethality. Some of these mutations have possibly arisen because of the virus evolving from immune selection pressure in infected individuals and are more prevalent in patients with mild than those with severe disease. In general, the mutations can be interpreted as a strategy through natural selection to facilitate extensive spread of the viral infection.
Though, the SARS-CoV-2 virus has a low mutation rate by virtue of the nsp14 protein acting as a 3′-5′ exoribonuclease on both single-stranded and double-stranded RNA during the viral replication cycle [43]. Still its large genome appears to facilitate recombination, insertions, and deletions. Andrés et al found that the viral S protein accumulates deletions upstream and close to the S1/S2 cleavage site [44]. Further, SARS-CoV-2 can resort to RNA viral evolution through recombination (synthesis of chimeric RNA molecules from two different progeny genomes) and reassortment (the packaging within a single virion of genomic segments from different progeny viruses. The single nucleotide variations (SNVs) as SARS-CoV-2 Spike amino acid replacements in the receptor binding domain (RBD) occur relatively frequently [45]. There is recurrent emergence and signi icant onward transmission of a six-nucleotide deletion in the Spike gene resulting in loss of two amino acids labelled as ΔH69/ΔV70. This deletion often co-occurs with the receptor binding motif amino acid replacements N501Y, N439K and Y453F. As such, these deletions have been found in a small percentage (2.2%) of the samples [46]. Currently the major SARS-CoV-2 Lineages are A, B, B.1, B.1.1, B.1.177. The lineage -B.1.1.7, of present concern, was irst sequenced on 20 Sep 2020 and is spreading from the UK and has been discovered in Denmark, the Netherlands, Italy, Israel, Australia, Hong Kong, and Singapore. Other countries are being increasingly involved. Taking a note of the major mutations, their lineages and effects on disease transmissibility is important to understand the changing face of the pandemic (Table 1).
Using the complete sequences of 1,932 SARS-CoV-2 genomes, six types of the strains have been identi ied. The 13 signature variations in the form of SNVs in protein coding regions and one SNV in the 5′ untranslated region (UTR) provide interpretation for the six types (types I to VI). The type VI, characterized by the four signature SNVs C241T (5′UTR), C3037T (nsp3 F924F), C14408T (nsp12 P4715L), and A23403G (Spike D614G), with strong allelic associations, irst reported in China, has become the dominant type world over. Out of these, C241T is in the 5′ UTR appears to be of uncertain signi icance. The other three SNVs, 3037T-14408T-23403G characterizing the increasing frequency of the type VI, in majority of samples from various regions suggests a possible itness gain for the virus. Further, the strains missing one or two of these signature SNVs fail to persist or wiped out by the evolutionary more it variants [47].

Emergence of the D614G variant:
The genome analysis of various isolates of the SARS-CoV-2 shows several regions having an increased proportion of some variants. One such variant is the D614G mutation in the C-terminal end of the S1 domain and in proximity to the S2 subunit. This variant has increased in prevalence with rapidity, becoming the predominant variant now world-over [48]. The D614G variant is associated with the faster viral transmission and harbouring and discharge of higher viral loads [49]. The variant has been found to be associated with increased Infectivity and reduced S1 Shedding [50]. Thus, the presence of glycine at 614 appears to improve the S-Protein Stability and Increase its incorporation into virions. Further, the structural analyses have revealed that the RBD of the D614G form of the spike protein is more likely to assume an "open" conformation than the RBD of the ancestral D614S form, implying an improved ability to bind to the hACE2 receptor. Furthermore, the higher viral load with D614G is consistent with epidemiological data suggesting enhanced infectivity associated with D614G [51].
The studies in hamsters infected with D614S or D614G variants, Plante, et al. have documented that the contemporary D614G variant replicated to higher titers in nasal-wash samples early after infection and outcompeted the ancestral D614S variant. These indings suggest increased viral itness for D614G in a major upper airway compartment potentially associated with enhanced transmission [52]. The D614G variant was found to be equally sensitive to neutralizing antibodies and did not cause more severe disease than the ancestral strain in hamsters, a inding that supports current indings in humans [53].   There are multiple spike  protein mutations such as a double deletion (positions 69  and 70), deletion 144, N501Y, A570D, D614G, Y453F, P681H,  T716I, S982A, D1118H) as well as mutations in other genomic regions. The preliminary analyses in the UK suggests that the UK variant B.1.1.7 has multiple mutations on it and is signi icantly more transmissible than previously circulating variants, with an estimated potential to increase the reproductive number (R) by 0.4 or greater with an estimated increased transmissibility of up to 70percent [54].
The variant was irst detected during October 2020 in the UK from a sample taken in Sep 2020. The backwards tracing using genetic evidence suggests this new variant irst emerged in September 2020 and then circulated at low levels in the population until mid-November, and thereafter it has spread alarmingly by mid-December [55]. It has been correlated with a signi icant increase in the rate of COVID-19 infection in the UK. The cases with the new variant have been from Denmark, Netherlands, and Belgium. The VOC 202012/01 variant has not been identi ied so far in the United States [56]. • The potential impact on SARS-CoV-2 diagnostics -The UK reports that the deletion 69-70 in the spike protein of the variant causes a negative result from S-gene RT-PCR assays applied in some laboratories in the UK. This speci ic mutation has occurred many times in different countries and is geographically widespread.
• The potential impact on severity of disease in a population or groups -Potential impact on severity of disease in a population or group -The available information regarding severity of the new virus variant is limited. To date, there is no indication of increased infection severity observed related to the variant, but the assessment is challenged by the fact that the majority of cases were reported in people under 60 years old, who are less likely to develop severe symptoms.
• Rapid increase of a SARS-CoV-2 variant with multiple spike protein mutations observed in the UK. None of the previously described SARS-CoV-2 variants have been shown to cause increased infection severity; on the contrary, a clade 19B variant with lower severity was detected in Singapore in the spring and then disappeared.
• Frequency of reinfections -The mutations observed in the new variant are related to the receptor binding site and other surface structures, which may alter the antigenic properties of the virus. Based on the number and location of spike protein mutations, it seems likely that some reduction in neutralization by antibodies will be seen, but there is as yet no evidence that there is a resulting impact on increased risk for reinfection or lower vaccine effectiveness.
• The potential impact on vaccine effectiveness is a worrisome issue. It is being conjectured that the ef icacy COVID-19 vaccines will remain like the ancestral SSARS-CoV-2 virus.
Recently a distinct phylogenetic cluster (named lineage B.1.1.7) was detected within the COG-UK surveillance dataset. This cluster has been growing rapidly over the past 4 weeks and since been observed in other UK locations, indicating further spread [58]. Three of these mutations -mutation N501Y in the receptorbinding domain (RBD) leads to increased binding af inity to human ACE2; the spike deletion 69-70del has been associated with evasion to immune response; and mutation P681H is immediately adjacent to the furin cleavage site.

Other isolated mutations
P681H mutation: The P681H is S protein mutation and involves the S1/S2 furin cleavage site. It has also emerged spontaneously multiple times and has been recently reported from Nigeria. There is no evidence to indicate that the P681H variant is contributing to increased transmission of the virus in Nigeria [59]. Earlier the analysis of samples collected in Aug 2020 and sequenced at the African Centre of Excellence for Genomics of Infectious Diseases (ACEGID), Nigeria, identi ied two SARS-CoV-2 sequences that share one non-synonymous SNP in S protein in common with this lineage. The nonsynonymous SNV, S:P681H, has been observed in global data outside of the B.1.1.7 lineage.
At the moment, only about 1% of the SARS-CoV-2 genomes from Nigeria share any of the 17 protein-altering variants from the UK lineage of concern (B. 1.1.7). Other reported mutations such as the N501Y, A570D, and the HV 69 -70 deletion in S protein have not been detected in Nigeria currently. mutations have occurred in the past. In Singapore, one strain with this type of mutation emerged and disappeared. This is in line with the understanding that SARS-CoV-2 ORF8 is poorly conserved among coronaviruses. Among accessory genes, open reading frame 8 (ORF8) stands out by being highly variable and may be related with the virus ability to spread [60].

Conclusion
The pandemic and unmet challenges COVID-19 as a disease and SARS-CoV-2 as its causative organism, continue to remain an enigma. While we continue to explore the agent factors, disease transmission dynamics, pathogenesis and clinical spectrum of the disease, and therapeutic modalities, the grievous nature of the disease has led to emergency and non-emergency authorizations for COVID-19 vaccines in various countries around the world.

Challenges posed by virus mutants:
The genome data outlines that two SARS-CoV-2 virus samples collected from anywhere in the world differ by an average of just 10 RNA letters out of 29,903 [61]. Thus, there is a sluggish mutation rate. Despite this, through extensive research, researchers have catalogued more than 12,000 mutations in SARS-CoV-2 genomes. Many mutations appear not to affect the virus's ability to spread or cause disease. In fact, the mutations altering the viral structural proteins are more likely to harm a virus and make it extinct rather than improve it. Further, various SARS-CoV-2 strains have no major impact but might in future on the course of the disease and pandemic, as they accumulate. Even an insigni icant mutation can have a profound by downgrading the immune response.
The D614G mutation leads to favour open conformations and the viral entry into the host cells. Further, this mutation It is often accompanied by three mutations in other parts of the SARS-CoV-2 genome. The variation increases density of the S protein and viral infectivity [62]. There has been a rapid spread of D614G which was irst spotted in samples collected early in the course of the pandemic from China and Germany. It has become the dominant strain across the European continent, Australia, Canada, and parts of the United States, and predicted to involve rest of the world regions. It appears that D614G represents a more transmissible form of SARS-CoV-2, which has emerged as a product of natural selection [63]. There is fear that a similar situation may occur with the VUI-202012/01 or B.1.1.7 variant.

Ill-de ined and short-lived immune response:
The studies in mice, monkeys and humans that received one of the experimental RNA vaccines, produced antibodies that proved more potent at blocking G viruses than D viruses [64]. With the G strain now ubiquitous, these indings are encouraging. Though, the experience with HIV that mutates to elude various vaccines developed so far, makes the researchers wary of the potential of SARS-CoV-2 to evade immunity and antigenic responses. The historic trial with the common cold Corona vaccine in the late 1980s, should be mentioned in this context, which included 15 healthy people given nasal solution containing inactivated common cold coronavirus. The monitoring for one year followed by analyses documented that most of them became infected before developing an effective immune defense. So far, various SARS-CoV-2 strains do not stop the immune system from developing neutralizing antibodies that recognize the virus. Still, there is a possibility that the virus may acquire mutations that either evade the immune response or alter the susceptibility to antibodies and immune cells.
In the experimental studies using animals and cell cultures, along with the latest molecular techniques, and in small human clinical studies, the immune response to SARS-CoV-2 has been recognized [65]. But it is not certain whether the immunity will be effective or lasting. Whether a vaccine will provide adequate protection, whether those who have recovered from COVID-19 can return to pre-pandemic behaviours and how readily the world can reduce the threat posed by the disease. The reinfections have been shown to occur [66]. Neither the frequency nor the elements of the immune response associated with reinfection have been studied and well documented.
The future course of COVID-19 pandemic: The, virus SARS-CoV-2 is here to stay for the foreseeable future [67]. The future of the disease, of course, appear to depend on various unknown factors, which include the effects of seasonality on the disease transmission and spread, the degree and duration of effective immune response to the disease, and the life-style choices made by individuals and measures enforced by the governments. Further, the pandemic's course will also depend on the availability of a COVID-19 vaccine, it's effectively and duration of its protective immune response. Concerning the latter, the signi icance of viral mutations should also be borne in mind and highlighted. There are serious challenges posed by SARS-CoV-2 virus and COVID-19 as the disease to the humanity. Let us hope that the today's unmet challenges are solved in near future.