NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1189450745|pdb|5X5C|C]
View 

Chain C, S protein

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
MERS-CoV-like_Spike_SD1-2_S1-S2_S2 cd22379
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
591-1277 0e+00

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from Middle East respiratory syndrome coronavirus and related betacoronaviruses in the C lineage; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from betacoronaviruses in the merbecovirus subgenus (C lineage), including Middle East respiratory syndrome coronavirus (MERS-CoV). The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-KU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


:

Pssm-ID: 411966 [Multi-domain]  Cd Length: 682  Bit Score: 1362.17  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       591 LYGVSGRGVFQNCTAVGVRQQRFVYDAYQNLVGYYSDDGNYYCLRACVSVPVSVIYDKETKTHATLFGSVACEHISSTMS 670
Cdd:cd22379    1 LYGVTGRGVFQNCTAVGIRQQRFVYDSFDNLVGYHSDDGNYYCVRPCVSVPVSVIYDKSTNTHATLFGSVACEHISTMMS 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       671 QYSRSTRSMLkRRDSTYGPLQTPVGCVLGLVNSSLFVEDCKLPLGQSLCALPDTPSTltpRSVSSVPGEmRLASIAFNHP 750
Cdd:cd22379   81 QFSRSTQSML-RRRSTNGPLQTAVGCVIGLVNTSLTVEDCKLPLGQSLCAVPPTLTP---RSVSSVPGE-QLASINFNHP 155
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       751 IQVDQLNSSYFKLSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQALHGANLRQDDSVR 830
Cdd:cd22379  156 LQVDQLNSSGFKVSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFEKCEQLLREYGQFCSKINQALHGANLRQDDSVR 235
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       831 NLFASVKSSQSSPIIPGFGGDFNLTLLEPVSISTGSRSARSAIEDLLFDKVTIADPGYMQGYDDCMQQGPASARDLICAQ 910
Cdd:cd22379  236 NLFASIKTSQSQPLIAGLGGDFNLTLLEPPSISTGSRSYRSAIEDLLFDKVTIADPGYMQGYDECMKQGPPSARDLICAQ 315
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       911 YVAGYKVLPPLMDVNMEAAYTSSLLGSIAGVGWTAGLSSFAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQALGA 990
Cdd:cd22379  316 YVAGYKVLPPLYDVNMEAAYTSSLLGSIAGAGWTAGLSSFAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQALGA 395
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       991 MQTGFTTTNEAFQKVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFVAQQLV 1070
Cdd:cd22379  396 MQTGFTTTNLAFQKVQDAVNANAQALSKLASELSNTFGAISSSIGDILKRLDVLEQEAQIDRLINGRLTSLNAFVAQQLV 475
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1071 RSESAALSAQLAKDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVSAYGLCDAANPTNCIAP 1150
Cdd:cd22379  476 RSETAARSAQLAKDKVNECVKSQSKRNGFCGQGTHIVSFVINAPNGLYFFHVGYVPTNHVNVTAAYGLCDSANPTNCIAP 555
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1151 VNGYFIKTNNTRIVDEWSYTGSSFYAPEPITSLNTKYVAPQVTYQNISTNLPPPLLGNSTGIDFQDELDEFFKNVSTSIP 1230
Cdd:cd22379  556 VNGYFIKNNTTRIVDEWSYTGSSFYAPEPITSANTRYVSPDVTFQNLSNNLPPPLLSNSTDIDFKDELEEFFKNVSSQIP 635
                        650       660       670       680
                 ....*....|....*....|....*....|....*....|....*..
5X5C_C      1231 NFGSLTQINTTLLDLTYEMLSLQQVVKALNESYIDLKELGNYTYYNK 1277
Cdd:cd22379  636 NFGSISQINTTLLDLSDEMLSLQQVVKALNESYIDLKELGNYTYYQK 682
MERS-CoV-like_Spike_S1_NTD cd21626
N-terminal domain of the S1 subunit of the Spike (S) protein from Middle East respiratory ...
7-333 0e+00

N-terminal domain of the S1 subunit of the Spike (S) protein from Middle East respiratory syndrome-related coronavirus and related betacoronaviruses in the C lineage; This subfamily contains the N-terminal domain (NTD) of the S1 subunit of the Spike (S) proteins from betacoronaviruses in the merbecovirus subgenera (C lineage), including the highly pathogenic human coronavirus (CoV), Middle East respiratory syndrome (MERS)-related CoV, and related bat CoVs. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). Most CoVs, including MERS-CoV, use the C-domain to bind their receptors. Despite using the C-domain as its receptor, neutralizing antibodies targeting MERS-CoV S1-NTD have been reported, including human antibody CDC2-A2, murine antibodies G2 and 5F9, and macaque antibodies FIB-H1 and JC57-13. G2 has been shown to strongly disrupt the attachment of MERS-CoV S to its receptor, dipeptidyl peptidase-4 (DPP4). In addition, the S1 NTD contributes to the Spike trimer interface.


:

Pssm-ID: 394952  Cd Length: 328  Bit Score: 645.94  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         7 DSVKSACIEVDIQQTFF-DKTWPRPIDVSKADGIIYPQGRTYSNITITYQGLFPYQGDHGDMYVYSAGHATGTTPQKLFV 85
Cdd:cd21626    1 DSGTGACIEVDVQPSYFeDKTWPMPIDVSKADGVIYPNGRTYSNITLTYTGLFPKQGDLGKQYVYSAGHAAPNTLNKLFV 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C        86 ANYSQDVKQFANGFVVRIGAAANSTGTVIISPSTSATIRKIYPAFMLGSSVGNFSDGKMGRFFNHTLVLLPDGCGTLLRA 165
Cdd:cd21626   81 SNYSLQVEPFDNGFVVRIGAAANKTGTVIISPSTSTTIKKIYPAFMLGSSVGNFSNNKTGRYFNHTLVILPDGCGTLLHA 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       166 FYCILEPRSGNHCPAGNSYTSFATYHTPATDCSDGNYNRNASLNSFKEYFNLRNCTFMYTYNITEDEILEWFGITQTAQG 245
Cdd:cd21626  161 FYCILQPRTGNRCPGGSSYTSYALWDTPATDCSSGNYNRNASLNSFKEYFNLRNCTFFYNYNITEDERAEWFGITQDTQG 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       246 VHLFSSRYVDLYGGNMFQFATLPVYDTIKYYSIIPHSIRSIQSDRKAWAAFYVYKLQPLTFLLDFSVDGYIRRAIDCGFN 325
Cdd:cd21626  241 VHLYSSRKGDLYGGNMFLFATLPVYDKIKYYTVIPRSIRSPQNDRKAWAAFYIYKLHPLTYLLDFDVDGYIRRAIDCGYD 320

                 ....*...
5X5C_C       326 DLSQLHCS 333
Cdd:cd21626  321 DLSQLQCS 328
human_MERS-CoV_Spike_S1_RBD cd21486
receptor-binding domain of the S1 subunit of the Spike (S) protein from human Middle East ...
351-569 2.29e-152

receptor-binding domain of the S1 subunit of the Spike (S) protein from human Middle East respiratory syndrome coronavirus; This group contains the receptor-binding domain (RBD) of the S1 subunit of the spike (S) protein from the human coronavirus that causes Middle East Respiratory Syndrome (MERS-CoV). MERS-CoV causes severe pulmonary disease in humans. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including MERS-CoV use the C-domain to bind their receptors. MERS-CoV use human dipeptidyl peptidase 4 (DPP4), also called CD26, as its receptor. It binds DPP4 through the RBD of its S1 subunit and then fuses viral and host membranes through its S2 subunit. Due to the key role of the S protein RBD in viral attachment, it is the major target for antibody-mediated neutralization. This model corresponds to the S1 subunit C-domain that serves as the RBD for most CoVs including MERS-CoV.


:

Pssm-ID: 394833  Cd Length: 219  Bit Score: 457.67  E-value: 2.29e-152
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       351 AKPSGSVVEQAEGVECDFSPLLSGTPPQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFS 430
Cdd:cd21486    1 AKPSGSVVEQAEGVECDFSPLLSGTPPQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFS 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       431 YPLSMKSDLSVSSAGPISQFNYKQSFSNPTCLILATVPHNLTTITKPLKYSYINKCSRLLSDDRTEVPQLVNANQYSPCV 510
Cdd:cd21486   81 YPLSMKSDLSVSSAGPISQFNYKQSFSNPTCLILATVPHNLTTITKPLKYSYINKCSRLLSDDRTEVPQLVNANQYSPCV 160
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*....
5X5C_C       511 SIVPSTVWEDGDYYRKQLSPLEGGGWLVASGSTVAMTEQLQMGFGITVQYGTDTNSVCP 569
Cdd:cd21486  161 SIVPSTVWEDGDYYRKQLSPLEGGGWLVASGSTVAMTEQLQMGFGITVQYGTDTNSVCP 219
Fibritin_C super family cl44324
Fibritin C-terminal region; This family features sequences bearing similarity to the ...
1288-1315 8.92e-08

Fibritin C-terminal region; This family features sequences bearing similarity to the C-terminal portion of the bacteriophage T4 protein fibritin. This protein is responsible for attachment of long tail fibres to virus particle, and forms the 'whiskers' or fibres on the neck of the virion. The region seen in this family contains an N-terminal coiled-coil portion and the C-terminal globular foldon domain (residues 457-486), which is essential for fibritin trimerization and folding. This domain consists of a beta-hairpin; three such hairpins come together in a beta-propeller-like arrangement in the trimer, which is stabilized by hydrogen bonds, salt bridges and hydrophobic interactions.


The actual alignment was detected with superfamily member pfam07921:

Pssm-ID: 254516 [Multi-domain]  Cd Length: 93  Bit Score: 51.30  E-value: 8.92e-08
                           10        20
                   ....*....|....*....|....*...
5X5C_C        1288 GSGYIPEAPRDGQAYVRKDGEWVLLSTF 1315
Cdd:pfam07921   66 ESGKIDDAPDDGRWYVRKDGAWVLLSSI 93
 
Name Accession Description Interval E-value
MERS-CoV-like_Spike_SD1-2_S1-S2_S2 cd22379
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
591-1277 0e+00

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from Middle East respiratory syndrome coronavirus and related betacoronaviruses in the C lineage; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from betacoronaviruses in the merbecovirus subgenus (C lineage), including Middle East respiratory syndrome coronavirus (MERS-CoV). The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-KU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411966 [Multi-domain]  Cd Length: 682  Bit Score: 1362.17  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       591 LYGVSGRGVFQNCTAVGVRQQRFVYDAYQNLVGYYSDDGNYYCLRACVSVPVSVIYDKETKTHATLFGSVACEHISSTMS 670
Cdd:cd22379    1 LYGVTGRGVFQNCTAVGIRQQRFVYDSFDNLVGYHSDDGNYYCVRPCVSVPVSVIYDKSTNTHATLFGSVACEHISTMMS 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       671 QYSRSTRSMLkRRDSTYGPLQTPVGCVLGLVNSSLFVEDCKLPLGQSLCALPDTPSTltpRSVSSVPGEmRLASIAFNHP 750
Cdd:cd22379   81 QFSRSTQSML-RRRSTNGPLQTAVGCVIGLVNTSLTVEDCKLPLGQSLCAVPPTLTP---RSVSSVPGE-QLASINFNHP 155
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       751 IQVDQLNSSYFKLSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQALHGANLRQDDSVR 830
Cdd:cd22379  156 LQVDQLNSSGFKVSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFEKCEQLLREYGQFCSKINQALHGANLRQDDSVR 235
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       831 NLFASVKSSQSSPIIPGFGGDFNLTLLEPVSISTGSRSARSAIEDLLFDKVTIADPGYMQGYDDCMQQGPASARDLICAQ 910
Cdd:cd22379  236 NLFASIKTSQSQPLIAGLGGDFNLTLLEPPSISTGSRSYRSAIEDLLFDKVTIADPGYMQGYDECMKQGPPSARDLICAQ 315
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       911 YVAGYKVLPPLMDVNMEAAYTSSLLGSIAGVGWTAGLSSFAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQALGA 990
Cdd:cd22379  316 YVAGYKVLPPLYDVNMEAAYTSSLLGSIAGAGWTAGLSSFAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQALGA 395
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       991 MQTGFTTTNEAFQKVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFVAQQLV 1070
Cdd:cd22379  396 MQTGFTTTNLAFQKVQDAVNANAQALSKLASELSNTFGAISSSIGDILKRLDVLEQEAQIDRLINGRLTSLNAFVAQQLV 475
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1071 RSESAALSAQLAKDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVSAYGLCDAANPTNCIAP 1150
Cdd:cd22379  476 RSETAARSAQLAKDKVNECVKSQSKRNGFCGQGTHIVSFVINAPNGLYFFHVGYVPTNHVNVTAAYGLCDSANPTNCIAP 555
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1151 VNGYFIKTNNTRIVDEWSYTGSSFYAPEPITSLNTKYVAPQVTYQNISTNLPPPLLGNSTGIDFQDELDEFFKNVSTSIP 1230
Cdd:cd22379  556 VNGYFIKNNTTRIVDEWSYTGSSFYAPEPITSANTRYVSPDVTFQNLSNNLPPPLLSNSTDIDFKDELEEFFKNVSSQIP 635
                        650       660       670       680
                 ....*....|....*....|....*....|....*....|....*..
5X5C_C      1231 NFGSLTQINTTLLDLTYEMLSLQQVVKALNESYIDLKELGNYTYYNK 1277
Cdd:cd22379  636 NFGSISQINTTLLDLSDEMLSLQQVVKALNESYIDLKELGNYTYYQK 682
MERS-CoV-like_Spike_S1_NTD cd21626
N-terminal domain of the S1 subunit of the Spike (S) protein from Middle East respiratory ...
7-333 0e+00

N-terminal domain of the S1 subunit of the Spike (S) protein from Middle East respiratory syndrome-related coronavirus and related betacoronaviruses in the C lineage; This subfamily contains the N-terminal domain (NTD) of the S1 subunit of the Spike (S) proteins from betacoronaviruses in the merbecovirus subgenera (C lineage), including the highly pathogenic human coronavirus (CoV), Middle East respiratory syndrome (MERS)-related CoV, and related bat CoVs. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). Most CoVs, including MERS-CoV, use the C-domain to bind their receptors. Despite using the C-domain as its receptor, neutralizing antibodies targeting MERS-CoV S1-NTD have been reported, including human antibody CDC2-A2, murine antibodies G2 and 5F9, and macaque antibodies FIB-H1 and JC57-13. G2 has been shown to strongly disrupt the attachment of MERS-CoV S to its receptor, dipeptidyl peptidase-4 (DPP4). In addition, the S1 NTD contributes to the Spike trimer interface.


Pssm-ID: 394952  Cd Length: 328  Bit Score: 645.94  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         7 DSVKSACIEVDIQQTFF-DKTWPRPIDVSKADGIIYPQGRTYSNITITYQGLFPYQGDHGDMYVYSAGHATGTTPQKLFV 85
Cdd:cd21626    1 DSGTGACIEVDVQPSYFeDKTWPMPIDVSKADGVIYPNGRTYSNITLTYTGLFPKQGDLGKQYVYSAGHAAPNTLNKLFV 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C        86 ANYSQDVKQFANGFVVRIGAAANSTGTVIISPSTSATIRKIYPAFMLGSSVGNFSDGKMGRFFNHTLVLLPDGCGTLLRA 165
Cdd:cd21626   81 SNYSLQVEPFDNGFVVRIGAAANKTGTVIISPSTSTTIKKIYPAFMLGSSVGNFSNNKTGRYFNHTLVILPDGCGTLLHA 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       166 FYCILEPRSGNHCPAGNSYTSFATYHTPATDCSDGNYNRNASLNSFKEYFNLRNCTFMYTYNITEDEILEWFGITQTAQG 245
Cdd:cd21626  161 FYCILQPRTGNRCPGGSSYTSYALWDTPATDCSSGNYNRNASLNSFKEYFNLRNCTFFYNYNITEDERAEWFGITQDTQG 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       246 VHLFSSRYVDLYGGNMFQFATLPVYDTIKYYSIIPHSIRSIQSDRKAWAAFYVYKLQPLTFLLDFSVDGYIRRAIDCGFN 325
Cdd:cd21626  241 VHLYSSRKGDLYGGNMFLFATLPVYDKIKYYTVIPRSIRSPQNDRKAWAAFYIYKLHPLTYLLDFDVDGYIRRAIDCGYD 320

                 ....*...
5X5C_C       326 DLSQLHCS 333
Cdd:cd21626  321 DLSQLQCS 328
CoV_S2 pfam01601
Coronavirus spike glycoprotein S2; The coronavirus spike glycoprotein forms the characteriztic ...
763-1277 0e+00

Coronavirus spike glycoprotein S2; The coronavirus spike glycoprotein forms the characteriztic 'corona' after which the group is named. The Spike glycoprotein is translated as a large polypeptide that is subsequently cleaved to S1 pfam01600 and S2,. The S2 subunit normally contains multiple key components, including one or more fusion peptides (FP), a second proteolytic site (S2') and two conserved heptad repeats (HRs), driving membrane penetration and virus-cell fusion. The HRs can trimerize into a coiled-coil structure built of three HR1-HR2 helical hairpins presenting as a canonical six-helix bundle and drag the virus envelope and the host cell bilayer into close proximity, preparing for fusion to occur.


Pssm-ID: 460263  Cd Length: 502  Bit Score: 642.78  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         763 LSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQALHGANLRQDDSVRNLFASVKSSQSS 842
Cdd:pfam01601    3 ISIPTNFTISVQTEYIQTTSPKVSVDCAQYVCNGNERCLQLLVQYGSFCSTIEQALQGSARLEDVEVLSMLSISNRALTL 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         843 PIIPGFGGDFNLTLLEPVSISTgsrsaRSAIEDLLFDKVTIADPGYMQGYDDCMqqGPASARDLICAQYVAGYKVLPPLM 922
Cdd:pfam01601   83 ATISNFGSDFNFSSFLPCLNSG-----RSAIEDLLFDKVVTSGLGTVDAYKKCT--KGTSIADLVCAQYYNGIMVLPGVV 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         923 DVNMEAAYTSSLLGSIAGVGWTAglsSFAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQALGAMQTGFTTTNEAF 1002
Cdd:pfam01601  156 DAEKMAMYTASLTGGMAFGGLTG---AAAAIPFALAVQARLNYLGLQTDVLQENQKILANAFNNAVGNITDGFTTTASAL 232
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C        1003 QKVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFVAQQLVRSESAALSAQLA 1082
Cdd:pfam01601  233 SKIQDVVNANAQALNQLTQQLSNNFGAISSSIQDIYSRLDQLEADAQVDRLINGRLAALNAFVTQQLTKASEVKASRQLA 312
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C        1083 KDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVSAYGLCdaANPTNCIAPVNGYFIKTNNTR 1162
Cdd:pfam01601  313 QQKVNECVKSQSSRYGFCGNGTHLFSLPQAAPNGIMFLHTVLVPTEYITVKATPGLC--VNGTTGYAPRDGQFVLNNTSN 390
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C        1163 ivdeWSYTGSSFYAPEPITSLNTKYVAP-QVTYQNISTNLPPPLLGNStgIDFQDELDEFFKNVSTSIPNFgSLTQINTT 1241
Cdd:pfam01601  391 ----WYITPRNMYQPRPITGSDFVQISScDVNFVNITNTKLPPLIPDY--VDFNKELEDIYKNLNSTLPDL-DLDIFNAT 463
                          490       500       510
                   ....*....|....*....|....*....|....*....
5X5C_C        1242 LLDLTYEMLS---LQQVVKALNESYIDLKELGNYTYYNK 1277
Cdd:pfam01601  464 ILNLTDEIKDlerLQELIDNLNQTLVDLEWLNRYETYIK 502
human_MERS-CoV_Spike_S1_RBD cd21486
receptor-binding domain of the S1 subunit of the Spike (S) protein from human Middle East ...
351-569 2.29e-152

receptor-binding domain of the S1 subunit of the Spike (S) protein from human Middle East respiratory syndrome coronavirus; This group contains the receptor-binding domain (RBD) of the S1 subunit of the spike (S) protein from the human coronavirus that causes Middle East Respiratory Syndrome (MERS-CoV). MERS-CoV causes severe pulmonary disease in humans. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including MERS-CoV use the C-domain to bind their receptors. MERS-CoV use human dipeptidyl peptidase 4 (DPP4), also called CD26, as its receptor. It binds DPP4 through the RBD of its S1 subunit and then fuses viral and host membranes through its S2 subunit. Due to the key role of the S protein RBD in viral attachment, it is the major target for antibody-mediated neutralization. This model corresponds to the S1 subunit C-domain that serves as the RBD for most CoVs including MERS-CoV.


Pssm-ID: 394833  Cd Length: 219  Bit Score: 457.67  E-value: 2.29e-152
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       351 AKPSGSVVEQAEGVECDFSPLLSGTPPQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFS 430
Cdd:cd21486    1 AKPSGSVVEQAEGVECDFSPLLSGTPPQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFS 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       431 YPLSMKSDLSVSSAGPISQFNYKQSFSNPTCLILATVPHNLTTITKPLKYSYINKCSRLLSDDRTEVPQLVNANQYSPCV 510
Cdd:cd21486   81 YPLSMKSDLSVSSAGPISQFNYKQSFSNPTCLILATVPHNLTTITKPLKYSYINKCSRLLSDDRTEVPQLVNANQYSPCV 160
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*....
5X5C_C       511 SIVPSTVWEDGDYYRKQLSPLEGGGWLVASGSTVAMTEQLQMGFGITVQYGTDTNSVCP 569
Cdd:cd21486  161 SIVPSTVWEDGDYYRKQLSPLEGGGWLVASGSTVAMTEQLQMGFGITVQYGTDTNSVCP 219
bCoV_S1_N pfam16451
Betacoronavirus-like spike glycoprotein S1, N-terminal; This entry represents the N-terminal ...
32-367 4.19e-109

Betacoronavirus-like spike glycoprotein S1, N-terminal; This entry represents the N-terminal domain of the betacoronavirus-like trimeric spike glycoprotein. The distal S1 subunit of the coronavirus spike protein is responsible for receptor binding. S1 contains two domains; an N-terminal galectin-like domain (NTD) and a receptor-binding domain (S1 RBD) also referred to as the S1 CTD or domain B. Either the S1 NTD or S1 RBD, or occasionally both, are involved in binding to host receptors. S1 NTD is located on the side of the spike trimer and mainly recognizes sugar receptors. For many betacoronaviruses (b-CoVs), for example mouse hepatitis virus (MHV), the RBD is located in the NTD. The structure of the MHV S1 NTD showed the same fold as human galectins (galactose-binding lectin), however it does not bind any sugar; instead, it binds to the carcinoembryonic antigen cell-adhesion molecule CEACAM1) through protein-protein interactions. All three CEACAM21a-binding sites in MHV spikes can be fully occupied by CEACAM1a. It has been shown that CEACAM1a binding to the MHV spike weakens the interactions between S1 and S2 and facilitates the proteolysis of the spike protein and dissociation of S1. The homologous bovine CoV (BCov) S1 NTD also possesses a galectin fold but binds to sialic acid-containing moieties on host cell membranes, as does the NTD of three other group A b-Covs, namely human CoV (HCoV) OC43, avian b-CoV, and infectious bronchitis virus (IBV). Despite the S1 NTD of human respiratory b-CoV HKU1 being highly homologous to the NTDs of MHV and bovine CoV, it does not bind to either sugar or human carcinoembryonic antigen-related cell adhesion molecules (CEACAMs) and the RBD is found instead in the S1 RBD domain.


Pssm-ID: 465119  Cd Length: 296  Bit Score: 345.55  E-value: 4.19e-109
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C          32 DVSKADGIIYPQGRTYSNITITYQGLFPYQGDHGDMYVYSAGHATGTTpqklfvaNYSQDVKQFANGFVVRIGAAANSTG 111
Cdd:pfam16451    1 DVSKADGTYYLPDRVYSNTTTLRQGLFPKQGDNVTNYVYSPAHHCGKR-------NFSTPFLPFGDGIFVHIGNASNATG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         112 TVIISpstsatirkIYPAFMLGSSVGNFSdgkmgrffnHTLVLLPDGCGTLLRAFYCileprsgnhcpagnsytSFATYH 191
Cdd:pfam16451   74 GRIIS---------EPPAFVFGSTFGNTS---------HTLIIAPDSCQTNLTILVC-----------------NFTLCA 118
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         192 TPATDCS--DGNYNRNASLNSFKeyfNLRNCTFMYTYNITEDEILEWFGITQTAQGVHLFSSRYV---DLYGGNMFQFAT 266
Cdd:pfam16451  119 NPVTACRwgAGNYNANNSLVYFK---NAINCTFNRTYNITFDTSLIYFGFKQQDGGFHIYYSYWLpdlDSGPPTLFPFAT 195
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         267 LPVYDTIKYYSIIPHSIRSIQSDRKAWAAFYVYKLQPLTFLLDFSVDGYIRRAIDCGFNDLSQLHCSYESFDVESGVYSV 346
Cdd:pfam16451  196 LPLGINITYFQVIPSSIRSTQNCRRANAAYYVAPLKPSTFLLDFDVNGYITNAVDCSFDPLSELKCSTGSFEPDTGVYST 275
                          330       340
                   ....*....|....*....|.
5X5C_C         347 SSFEAKPSGSVVEQAEGVECD 367
Cdd:pfam16451  276 SSYRAQPVGSVYVRQPNVTCD 296
bCoV_S1_RBD pfam09408
Betacoronavirus spike glycoprotein S1, receptor binding; This entry represents the receptor ...
377-569 1.07e-37

Betacoronavirus spike glycoprotein S1, receptor binding; This entry represents the receptor binding domain (S1 RBD) of the betacoronavirus spike glycoprotein. The spike glycoprotein is arranged in trimers on the surface of the viral membrane and is essential for viral entry. The spike protein is translated as a large polypeptide that is subsequently cleaved to the distal S1, responsible for receptor binding, and the membrane-anchored S2 responsible for membrane fusion. The coronavirus (SARS-CoV) S1 subunit is composed of two distinct domains: an N-terminal domain (S1 NTD) and a receptor-binding domain (S1 RBD) also referred to as the S1 CTD or domain B. Each of these domains have been implicated in binding to host receptors. However, most coronaviruses are not known to utilize both the S1 NTD and S1 RBD for viral entry. SARS-CoV makes use of its S1 RBD to bind to the human angiotensin-converting enzyme 2 (ACE2) as its host receptor.


Pssm-ID: 462789  Cd Length: 156  Bit Score: 138.79  E-value: 1.07e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         377 PQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFSYPLSMKSDLSVSSAGPISQFNYKQSF 456
Cdd:pfam09408    1 PSPANWERKTFSNCNFDFSVLLSLLQVSSFKCYGVSPSKLADMCYGSVTIDYFAIPETHKSNLQPGSPGAISKYNYKLPD 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         457 SNPTCLILATVPhNLTTITKPLKYSYInkcsrllsddrtevpqLVNANQYSPCVSIVpSTVWEDGDYYRKQLSplegggw 536
Cdd:pfam09408   81 DFYGCVLAFNVN-ANTNYAFADNYPYR----------------YIKPGQYQPCNSFV-STVPNSPDGHYCTPS------- 135
                          170       180       190
                   ....*....|....*....|....*....|...
5X5C_C         537 lvasgstvamteQLQMGFGITVQYGTDTNSVCP 569
Cdd:pfam09408  136 ------------SFNGVVVITLKPATGSNLVCP 156
Fibritin_C pfam07921
Fibritin C-terminal region; This family features sequences bearing similarity to the ...
1288-1315 8.92e-08

Fibritin C-terminal region; This family features sequences bearing similarity to the C-terminal portion of the bacteriophage T4 protein fibritin. This protein is responsible for attachment of long tail fibres to virus particle, and forms the 'whiskers' or fibres on the neck of the virion. The region seen in this family contains an N-terminal coiled-coil portion and the C-terminal globular foldon domain (residues 457-486), which is essential for fibritin trimerization and folding. This domain consists of a beta-hairpin; three such hairpins come together in a beta-propeller-like arrangement in the trimer, which is stabilized by hydrogen bonds, salt bridges and hydrophobic interactions.


Pssm-ID: 254516 [Multi-domain]  Cd Length: 93  Bit Score: 51.30  E-value: 8.92e-08
                           10        20
                   ....*....|....*....|....*...
5X5C_C        1288 GSGYIPEAPRDGQAYVRKDGEWVLLSTF 1315
Cdd:pfam07921   66 ESGKIDDAPDDGRWYVRKDGAWVLLSSI 93
wac PHA02607
fibritin; Provisional
1289-1316 4.46e-06

fibritin; Provisional


Pssm-ID: 177432 [Multi-domain]  Cd Length: 454  Bit Score: 50.79  E-value: 4.46e-06
                          10        20
                  ....*....|....*....|....*...
5X5C_C       1289 SGYIPEAPRDGQAYVRKDGEWVLLSTFL 1316
Cdd:PHA02607  424 AGKLDDAPSDGSWYVRKNGAWVEVSTGL 451
 
Name Accession Description Interval E-value
MERS-CoV-like_Spike_SD1-2_S1-S2_S2 cd22379
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
591-1277 0e+00

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from Middle East respiratory syndrome coronavirus and related betacoronaviruses in the C lineage; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from betacoronaviruses in the merbecovirus subgenus (C lineage), including Middle East respiratory syndrome coronavirus (MERS-CoV). The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-KU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411966 [Multi-domain]  Cd Length: 682  Bit Score: 1362.17  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       591 LYGVSGRGVFQNCTAVGVRQQRFVYDAYQNLVGYYSDDGNYYCLRACVSVPVSVIYDKETKTHATLFGSVACEHISSTMS 670
Cdd:cd22379    1 LYGVTGRGVFQNCTAVGIRQQRFVYDSFDNLVGYHSDDGNYYCVRPCVSVPVSVIYDKSTNTHATLFGSVACEHISTMMS 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       671 QYSRSTRSMLkRRDSTYGPLQTPVGCVLGLVNSSLFVEDCKLPLGQSLCALPDTPSTltpRSVSSVPGEmRLASIAFNHP 750
Cdd:cd22379   81 QFSRSTQSML-RRRSTNGPLQTAVGCVIGLVNTSLTVEDCKLPLGQSLCAVPPTLTP---RSVSSVPGE-QLASINFNHP 155
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       751 IQVDQLNSSYFKLSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQALHGANLRQDDSVR 830
Cdd:cd22379  156 LQVDQLNSSGFKVSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFEKCEQLLREYGQFCSKINQALHGANLRQDDSVR 235
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       831 NLFASVKSSQSSPIIPGFGGDFNLTLLEPVSISTGSRSARSAIEDLLFDKVTIADPGYMQGYDDCMQQGPASARDLICAQ 910
Cdd:cd22379  236 NLFASIKTSQSQPLIAGLGGDFNLTLLEPPSISTGSRSYRSAIEDLLFDKVTIADPGYMQGYDECMKQGPPSARDLICAQ 315
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       911 YVAGYKVLPPLMDVNMEAAYTSSLLGSIAGVGWTAGLSSFAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQALGA 990
Cdd:cd22379  316 YVAGYKVLPPLYDVNMEAAYTSSLLGSIAGAGWTAGLSSFAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQALGA 395
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       991 MQTGFTTTNEAFQKVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFVAQQLV 1070
Cdd:cd22379  396 MQTGFTTTNLAFQKVQDAVNANAQALSKLASELSNTFGAISSSIGDILKRLDVLEQEAQIDRLINGRLTSLNAFVAQQLV 475
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1071 RSESAALSAQLAKDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVSAYGLCDAANPTNCIAP 1150
Cdd:cd22379  476 RSETAARSAQLAKDKVNECVKSQSKRNGFCGQGTHIVSFVINAPNGLYFFHVGYVPTNHVNVTAAYGLCDSANPTNCIAP 555
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1151 VNGYFIKTNNTRIVDEWSYTGSSFYAPEPITSLNTKYVAPQVTYQNISTNLPPPLLGNSTGIDFQDELDEFFKNVSTSIP 1230
Cdd:cd22379  556 VNGYFIKNNTTRIVDEWSYTGSSFYAPEPITSANTRYVSPDVTFQNLSNNLPPPLLSNSTDIDFKDELEEFFKNVSSQIP 635
                        650       660       670       680
                 ....*....|....*....|....*....|....*....|....*..
5X5C_C      1231 NFGSLTQINTTLLDLTYEMLSLQQVVKALNESYIDLKELGNYTYYNK 1277
Cdd:cd22379  636 NFGSISQINTTLLDLSDEMLSLQQVVKALNESYIDLKELGNYTYYQK 682
betaCoV_Spike_SD1-2_S1-S2_S2 cd22370
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
591-1269 0e+00

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from betacoronaviruses; This family contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from betacoronaviruses, including three highly pathogenic human coronaviruses (CoVs), Middle East respiratory syndrome coronavirus (MERS-CoV), Severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV), and SARS coronavirus 2 (SARS-CoV-2), also known as a 2019 novel coronavirus (2019-nCoV). The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411957 [Multi-domain]  Cd Length: 667  Bit Score: 1061.71  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       591 LYGVSGRGVFQNCTAVGVRQQRFVYDAYQNLVGYYSD-DGNYYCLRACVSVPVSVIYDKETK-THATLFGSVACEHISST 668
Cdd:cd22370    1 LYGYTGTGVLTETNATFLPFQNFGYDSNGNLIAFKDPqTNTIYTILPCVSGPVSVITPGNNTnEVAVLYNGLNCSEVPSA 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       669 MSQYSRSTRSmlKRRDSTYGPLQTPVGCVLGLVNSSLFVEDCKLPLGQSLCALPDTPSTLTPRSvsSVPGEMRLASIAFN 748
Cdd:cd22370   81 ISAVSLTPWW--RVYSSTSNYFDTPVGCLLGAVNSSNNSYECDLPLGAGLCASYTTQSVLRSRS--VASRSIRLTTMSFF 156
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       749 HPIQVD-QLNSSYFKLSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQALHGANLRQDD 827
Cdd:cd22370  157 AENSVDvEVAYSNFSIQIPTNFTIAVTEEFIPTTMPKVTVDCAQYVCGDSSECSNLLLQYGTFCDNINRALTGVALLQDK 236
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       828 SVRNLFASVKSSQSSPIIPGFGGDFNLTLLEPVSISTGSRSARSAIEDLLFDKVTIADPGYMQGYDDCMQQGPAsaRDLI 907
Cdd:cd22370  237 NQLEVFASVKQIVKTPAPLKDFGGFNFSSLLPCLGSNGGSSARSAIEDLLFNKVTLADVGFMKQYDDCTGGSAA--RDLI 314
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       908 CAQYVAGYKVLPPLMDVNMEAAYTSSLLGSIAGVGWTAGLSSFAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQA 987
Cdd:cd22370  315 CAQSFNGLKVLPPLLTDEMIAAYTSALLGGTATSGWTFGASSAAQIPFAMQMAYRFNGIGVTQQVLVENQKLIANKFNQA 394
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       988 LGAMQTGFTTTNEAFQKVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFVAQ 1067
Cdd:cd22370  395 LGSIQTGFTATNSALAKLQDVVNQNAQALNTLVKQLSNNFGAISSSLNDILSRLDKLEADVQIDRLINGRLQVLQTYVTQ 474
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1068 QLVRSESAALSAQLAKDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVSAYGLCDAANPTNc 1147
Cdd:cd22370  475 QLIRASEIRASAQLAAQKMSECVKGQSKRVDFCGNGTHLMSFPQSAPNGVVFLHVTYKPTSYKNVTTAPAICHNGKAYF- 553
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1148 iaPVNGYFIKTNNtrivdEWSYTGSSFYAPEPITSLNTKYVAPQ---VTYQNISTnlPPPLLGNSTgiDFQDELDEFFKN 1224
Cdd:cd22370  554 --PKEGVFVKNNN-----SWMFTGRNFYEPEIITTDNTFYSGSCdvnFTYVNNTV--YNPLQPELD--DFKAELDKFFKN 622
                        650       660       670       680
                 ....*....|....*....|....*....|....*....|....*
5X5C_C      1225 VSTSIPNFGSLTQINTTLLDLTYEMLSLQQVVKALNESYIDLKEL 1269
Cdd:cd22370  623 HTSPDPNLGDLSGINASFVDLQKEMDTLQEVVKQLNESLIDLKEL 667
MERS-CoV-like_Spike_S1_NTD cd21626
N-terminal domain of the S1 subunit of the Spike (S) protein from Middle East respiratory ...
7-333 0e+00

N-terminal domain of the S1 subunit of the Spike (S) protein from Middle East respiratory syndrome-related coronavirus and related betacoronaviruses in the C lineage; This subfamily contains the N-terminal domain (NTD) of the S1 subunit of the Spike (S) proteins from betacoronaviruses in the merbecovirus subgenera (C lineage), including the highly pathogenic human coronavirus (CoV), Middle East respiratory syndrome (MERS)-related CoV, and related bat CoVs. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). Most CoVs, including MERS-CoV, use the C-domain to bind their receptors. Despite using the C-domain as its receptor, neutralizing antibodies targeting MERS-CoV S1-NTD have been reported, including human antibody CDC2-A2, murine antibodies G2 and 5F9, and macaque antibodies FIB-H1 and JC57-13. G2 has been shown to strongly disrupt the attachment of MERS-CoV S to its receptor, dipeptidyl peptidase-4 (DPP4). In addition, the S1 NTD contributes to the Spike trimer interface.


Pssm-ID: 394952  Cd Length: 328  Bit Score: 645.94  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         7 DSVKSACIEVDIQQTFF-DKTWPRPIDVSKADGIIYPQGRTYSNITITYQGLFPYQGDHGDMYVYSAGHATGTTPQKLFV 85
Cdd:cd21626    1 DSGTGACIEVDVQPSYFeDKTWPMPIDVSKADGVIYPNGRTYSNITLTYTGLFPKQGDLGKQYVYSAGHAAPNTLNKLFV 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C        86 ANYSQDVKQFANGFVVRIGAAANSTGTVIISPSTSATIRKIYPAFMLGSSVGNFSDGKMGRFFNHTLVLLPDGCGTLLRA 165
Cdd:cd21626   81 SNYSLQVEPFDNGFVVRIGAAANKTGTVIISPSTSTTIKKIYPAFMLGSSVGNFSNNKTGRYFNHTLVILPDGCGTLLHA 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       166 FYCILEPRSGNHCPAGNSYTSFATYHTPATDCSDGNYNRNASLNSFKEYFNLRNCTFMYTYNITEDEILEWFGITQTAQG 245
Cdd:cd21626  161 FYCILQPRTGNRCPGGSSYTSYALWDTPATDCSSGNYNRNASLNSFKEYFNLRNCTFFYNYNITEDERAEWFGITQDTQG 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       246 VHLFSSRYVDLYGGNMFQFATLPVYDTIKYYSIIPHSIRSIQSDRKAWAAFYVYKLQPLTFLLDFSVDGYIRRAIDCGFN 325
Cdd:cd21626  241 VHLYSSRKGDLYGGNMFLFATLPVYDKIKYYTVIPRSIRSPQNDRKAWAAFYIYKLHPLTYLLDFDVDGYIRRAIDCGYD 320

                 ....*...
5X5C_C       326 DLSQLHCS 333
Cdd:cd21626  321 DLSQLQCS 328
CoV_S2 pfam01601
Coronavirus spike glycoprotein S2; The coronavirus spike glycoprotein forms the characteriztic ...
763-1277 0e+00

Coronavirus spike glycoprotein S2; The coronavirus spike glycoprotein forms the characteriztic 'corona' after which the group is named. The Spike glycoprotein is translated as a large polypeptide that is subsequently cleaved to S1 pfam01600 and S2,. The S2 subunit normally contains multiple key components, including one or more fusion peptides (FP), a second proteolytic site (S2') and two conserved heptad repeats (HRs), driving membrane penetration and virus-cell fusion. The HRs can trimerize into a coiled-coil structure built of three HR1-HR2 helical hairpins presenting as a canonical six-helix bundle and drag the virus envelope and the host cell bilayer into close proximity, preparing for fusion to occur.


Pssm-ID: 460263  Cd Length: 502  Bit Score: 642.78  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         763 LSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQALHGANLRQDDSVRNLFASVKSSQSS 842
Cdd:pfam01601    3 ISIPTNFTISVQTEYIQTTSPKVSVDCAQYVCNGNERCLQLLVQYGSFCSTIEQALQGSARLEDVEVLSMLSISNRALTL 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         843 PIIPGFGGDFNLTLLEPVSISTgsrsaRSAIEDLLFDKVTIADPGYMQGYDDCMqqGPASARDLICAQYVAGYKVLPPLM 922
Cdd:pfam01601   83 ATISNFGSDFNFSSFLPCLNSG-----RSAIEDLLFDKVVTSGLGTVDAYKKCT--KGTSIADLVCAQYYNGIMVLPGVV 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         923 DVNMEAAYTSSLLGSIAGVGWTAglsSFAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQALGAMQTGFTTTNEAF 1002
Cdd:pfam01601  156 DAEKMAMYTASLTGGMAFGGLTG---AAAAIPFALAVQARLNYLGLQTDVLQENQKILANAFNNAVGNITDGFTTTASAL 232
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C        1003 QKVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFVAQQLVRSESAALSAQLA 1082
Cdd:pfam01601  233 SKIQDVVNANAQALNQLTQQLSNNFGAISSSIQDIYSRLDQLEADAQVDRLINGRLAALNAFVTQQLTKASEVKASRQLA 312
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C        1083 KDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVSAYGLCdaANPTNCIAPVNGYFIKTNNTR 1162
Cdd:pfam01601  313 QQKVNECVKSQSSRYGFCGNGTHLFSLPQAAPNGIMFLHTVLVPTEYITVKATPGLC--VNGTTGYAPRDGQFVLNNTSN 390
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C        1163 ivdeWSYTGSSFYAPEPITSLNTKYVAP-QVTYQNISTNLPPPLLGNStgIDFQDELDEFFKNVSTSIPNFgSLTQINTT 1241
Cdd:pfam01601  391 ----WYITPRNMYQPRPITGSDFVQISScDVNFVNITNTKLPPLIPDY--VDFNKELEDIYKNLNSTLPDL-DLDIFNAT 463
                          490       500       510
                   ....*....|....*....|....*....|....*....
5X5C_C        1242 LLDLTYEMLS---LQQVVKALNESYIDLKELGNYTYYNK 1277
Cdd:pfam01601  464 ILNLTDEIKDlerLQELIDNLNQTLVDLEWLNRYETYIK 502
bat-HKU9-CoV-like_Spike_SD1-2_S1-S2_S2 cd22381
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
591-1274 8.14e-172

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from Rousettus bat coronavirus HKU9 and related betacoronaviruses in the D lineage; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from betacoronaviruses in the nobecovirus subgenus (D lineage), including Rousettus bat coronavirus HKU9 (Ro-BatCoV HKU9). The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-KU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Ro-BatCoV HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411968 [Multi-domain]  Cd Length: 731  Bit Score: 528.56  E-value: 8.14e-172
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       591 LYGVSGRGVFQNCTAVGVRQQRFVYDAYQNLVGYySDDGNYYCLRACVSVPVSVIYDKETKThATLFGSVACEHISSTMS 670
Cdd:cd22381    1 LYGYTGTGVLSTSNLTIPDSKVFSASSTGDIIAV-SVNGTVYSISPCVSVPISVGYDPGFER-ALLFNGLSCSERARAVS 78
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       671 QYSrSTRSMLKRRDSTYGPLQTPVGCVLGLVNS-SLFVEDCKLPLGQSLCALPDTPSTLTPRSVSsvpgemRLASIAFNh 749
Cdd:cd22381   79 EPA-SDYWRASVSDGANNTFDTPSGCVYNVINRtTITVNQCSMPLGNSLCLVNNTTAVSARGSLS------LLSLVTYD- 150
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       750 PIQVDQLN--SSYFKLSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQALHGANLRQDD 827
Cdd:cd22381  151 PLYDSSVTplTPVYWVSIPTNFTLAATTEYIQTTAPKINIDCAKYLCGDSSRCLTVLLQYGTFCDDVNKALARVSTILDA 230
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       828 SVRNLFASVKSSQSSPIIPGFGGDFNLT-LLEPVSISTGSRSARSAIEDLLFDKVTIADPGYMQGYDDCM-QQGPASARD 905
Cdd:cd22381  231 SLVSLVSELTSDVVRSENLAFDGDYNFTgLMGCLGSNCNSKSYRSALSDLLYNKVKVADPGFMQSYQKCIdSQWGGNIRD 310
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       906 LICAQYVAGYKVLPPLMDVNMEAAYTSSLLGSIAGVGWTAGLSSFAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFN 985
Cdd:cd22381  311 LICTQTFNGISVLPPIVSPGMQALYTSLLVGAVASSGYTFGITSVGVIPFATQLQFRLNGLGVTTQVLVENQKLIANSFN 390
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       986 QALGAMQTGFTTTNEAFQKVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFV 1065
Cdd:cd22381  391 KALVSIQKGFDATNQALSKMQTVINQHAQQLQTLVQQLGNSFGAISSSINEIFSRLDGLEANAEVDRLINGRMVVLNTYV 470
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1066 AQQLVRSESAALSAQLAKDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVSAYGLCdaANPT 1145
Cdd:cd22381  471 TQLLIQASEVRAQAALAKQKISECVKAQSLRNDFCGNGTHVLSIPQLAPNGVLFIHYSYQPTAYALVQTAAGLC--FNGT 548
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1146 NcIAPVNGYFIKTNNTRIvdeWSYTGSSFYAPEPITSLNTKYVAP-QVTYQNISTNLPPPLLGNStgIDFQDELDEFFKN 1224
Cdd:cd22381  549 G-YAPRGGLFVLPNNSNL---WHFTKMNFYNPVNISYSNTQVLTScSVNYTTVNYTVLNPSEPSD--FNFQEEFDKWYKN 622
                        650       660       670       680       690
                 ....*....|....*....|....*....|....*....|....*....|
5X5C_C      1225 VSTSIPNFGSLTQINTTLLDLTYEMLSLQQVVKALNESYIDLKELGNYTY 1274
Cdd:cd22381  623 QSSQFNNTFNPSDFNFSTVDVNEQLATLTDVVKQLNESFIDLKKLNVYEQ 672
HKU1-CoV-like_Spike_SD1-2_S1-S2_S2 cd22380
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
591-1269 1.44e-159

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from human HKU1 and OC43 coronaviruses and related betacoronaviruses in the A lineage; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from betacoronaviruses in the embecovirus subgenus (A lineage), including highly pathogenic human coronaviruses (CoVs), HKU1 and OC43 CoVs, as well as murine hepatitis virus (MHV). The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of MHV is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-KU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411967 [Multi-domain]  Cd Length: 663  Bit Score: 494.29  E-value: 1.44e-159
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       591 LYGVSGRGVFQNCTAVGVRQ-QRFVYDAYQNLVGYYSDDGN-YYCLRACVSVPVSVIYDKETKTHATLFGSVACEHI-SS 667
Cdd:cd22380    1 LYGITGQGIFKEVNADYYNSwQNLLYDSNGNLYGFRDYLTNkTYMIRSCYSGRVSAAFHANASEPALLYRNLKCSYVfNN 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       668 TMSQysrsTRSMLKRRDSTygplqtpVGCVLGLVNS-SLFVEDCKLPLGQSLCAlpDTPSTLtpRSVSSVPGEMRLASIa 746
Cdd:cd22380   81 TISR----EEQPLNYFDSY-------LGCVVNADNStSSAVQTCDLRMGSGYCV--DYSTSR--RSRRSISTGYRFTTF- 144
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       747 fnHPIQVDQLNSS------YFKLSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQALHG 820
Cdd:cd22380  145 --EPFTVNLVNDSvepvggLYEIQIPTNFTIGNHEEFIQTSSPKVTIDCAAFVCGDYAACRQQLVEYGSFCDNINAILNE 222
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       821 ANLRQDDSVRNLFASVKS--SQSSPIIPGFGGDFNLTLLEPVSISTGSR----SARSAIEDLLFDKVTIADPGYMQGYDD 894
Cdd:cd22380  223 VNELLDTTQLQVANSLMQgvTLSSRLKDGINFNVDDINFSPVLGCLGSDcnaaSSRSAIEDLLFDKVKLSDVGFVEAYNN 302
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       895 CmqQGPASARDLICAQYVAGYKVLPPLMDVNMEAAYTSSLLGSIAGVGWTAGlssfAAIPFAQSIFYRLNGVGITQQVLS 974
Cdd:cd22380  303 C--TGGAEIRDLLCVQSFNGIKVLPPVLSENQISGYTTAATAASLFPPWSAA----AGVPFSLNVQYRINGLGVTMDVLS 376
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       975 ENQKLIANKFNQALGAMQTGFTTTNEAFQKVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLI 1054
Cdd:cd22380  377 QNQKLIANAFNNALGAIQEGFDATNSALAKIQSVVNANAEALNNLLQQLSNRFGAISASLQEILSRLDALEAQAQIDRLI 456
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1055 NGRLTTLNAFVAQQLVRSESAALSAQLAKDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVS 1134
Cdd:cd22380  457 NGRLTALNAYVSQQLSDSTLVKFSAAQAIEKVNECVKSQSPRINFCGNGNHILSLVQNAPYGLYFIHFSYVPTSFVTAKV 536
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1135 AYGLCDAANptNCIAPVNGYFIKTNntrivDEWSYTGSSFYAPEPITSLN-----------TKyvAPQVTYQNISTNLPp 1203
Cdd:cd22380  537 SPGLCIAGD--RGIAPKSGYFVNVN-----NEWMFTGSGYYYPEPITDKNvvvmsscavnyTK--APDVMLNTSIPNLP- 606
                        650       660       670       680       690       700
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
5X5C_C      1204 pllgnstgiDFQDELDEFFKNVSTSIPNFGSLTQINTTLLDLTYEMLSLQQVVKALNESYIDLKEL 1269
Cdd:cd22380  607 ---------DFKEELDQWFKNQTSVAPDLSLDEYINVTFLDLQDEMNRIQEAIKVLNESYINLKEI 663
CoV_Spike_S1-S2_S2 cd21698
S1/S2 cleavage region and the S2 fusion subunit of coronavirus spike (S) proteins; This model ...
763-1269 3.15e-155

S1/S2 cleavage region and the S2 fusion subunit of coronavirus spike (S) proteins; This model represents the S1/S2 cleavage region and the S2 subunit of the spike (S) glycoprotein from coronavirus (CoVs), including three highly pathogenic human CoVs, Middle East respiratory syndrome coronavirus (MERS-CoV), Severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV), and SARS coronavirus 2 (SARS-CoV-2), also known as a 2019 novel coronavirus (2019-nCoV). The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect S1 and S2. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV, and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP), and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. Notably, SARS-CoV-2 has a functional polybasic (furin) cleavage site through the insertion of PRRAR*SV (* indicates the cleavage site) at the S1/S2 interface, which is absent in SARS-CoV and other SARS-related CoVs. The S1/S2 cleavage region and the S2 fusion subunit play an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411955 [Multi-domain]  Cd Length: 523  Bit Score: 477.68  E-value: 3.15e-155
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       763 LSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQALHGANLRQDDSVRNLFASVKSSQSS 842
Cdd:cd21698   33 IAIPSNFTLSVTTEYLQVTMTKVSVDCTTYVCGGSPRCKNLLLQYGSACDTIEQALRGIAVLEDSEVSNMFSTSKQALKL 112
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       843 PIIPGFGgDFNLTLLEPvsiSTGSRSARSAIEDLLFDKVTIADPGYMQGYDDCMqqGPASARDLICAQYVAGYKVLPPLM 922
Cdd:cd21698  113 AIIKSFG-GFNFSQILP---TPSRPSGRSAIEDLLFTKVVTAGLGTVDQYKNCT--KGIAIADLACAQYYNGIMVLPPVA 186
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       923 DVNMEAAYTSSLLGSIAgvgwTAGLSSFAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQALGAMQTGFTTTNEAF 1002
Cdd:cd21698  187 DAEKMAMYTGSLTAGMV----FGGITAAAAIPFSLAMQARLNYVGLQQNVLLENQKLLANSFNKAIGNISDAFSSTSSAL 262
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1003 QKVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFVAQQLVRSESAALSAQLA 1082
Cdd:cd21698  263 QKIQDVVNQQAQALNTLTSQLSNNFGAISSSIQDIYQRLDKLEADVQVDRLITGRLAALNAFVTQQLIKAAEVRQSRRLA 342
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1083 KDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVSAYGLC-DAANPTNCIAPVngYFIKTNNT 1161
Cdd:cd21698  343 QQKINECVKSQSSRYGFCGNGTHLFSIPQSAPSGIVFLHTVLVPTSYKNVTAYPGICvDGKAGSPLEGPL--VFIQNNNH 420
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1162 rivdeWSYTGSSFYAPEPITSLNTKYV---APQVTYQNISTNLPPPLlgNSTgIDFQDELDEFFKNVSTSIPNFGSLTQI 1238
Cdd:cd21698  421 -----WFVTPRNMYEPRIITTADFVQItscDANVTIVNNTVNLDPVI--PDY-VDVNEELDDYIQNLPNHTLPDLDLSGY 492
                        490       500       510
                 ....*....|....*....|....*....|.
5X5C_C      1239 NTTLLDLTYEMLSLQQVVKALNESYIDLKEL 1269
Cdd:cd21698  493 NATILNISSEIDRLNEVAKNLNQSVVELQEY 523
human_MERS-CoV_Spike_S1_RBD cd21486
receptor-binding domain of the S1 subunit of the Spike (S) protein from human Middle East ...
351-569 2.29e-152

receptor-binding domain of the S1 subunit of the Spike (S) protein from human Middle East respiratory syndrome coronavirus; This group contains the receptor-binding domain (RBD) of the S1 subunit of the spike (S) protein from the human coronavirus that causes Middle East Respiratory Syndrome (MERS-CoV). MERS-CoV causes severe pulmonary disease in humans. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including MERS-CoV use the C-domain to bind their receptors. MERS-CoV use human dipeptidyl peptidase 4 (DPP4), also called CD26, as its receptor. It binds DPP4 through the RBD of its S1 subunit and then fuses viral and host membranes through its S2 subunit. Due to the key role of the S protein RBD in viral attachment, it is the major target for antibody-mediated neutralization. This model corresponds to the S1 subunit C-domain that serves as the RBD for most CoVs including MERS-CoV.


Pssm-ID: 394833  Cd Length: 219  Bit Score: 457.67  E-value: 2.29e-152
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       351 AKPSGSVVEQAEGVECDFSPLLSGTPPQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFS 430
Cdd:cd21486    1 AKPSGSVVEQAEGVECDFSPLLSGTPPQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFS 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       431 YPLSMKSDLSVSSAGPISQFNYKQSFSNPTCLILATVPHNLTTITKPLKYSYINKCSRLLSDDRTEVPQLVNANQYSPCV 510
Cdd:cd21486   81 YPLSMKSDLSVSSAGPISQFNYKQSFSNPTCLILATVPHNLTTITKPLKYSYINKCSRLLSDDRTEVPQLVNANQYSPCV 160
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*....
5X5C_C       511 SIVPSTVWEDGDYYRKQLSPLEGGGWLVASGSTVAMTEQLQMGFGITVQYGTDTNSVCP 569
Cdd:cd21486  161 SIVPSTVWEDGDYYRKQLSPLEGGGWLVASGSTVAMTEQLQMGFGITVQYGTDTNSVCP 219
SARS-CoV-like_Spike_SD1-2_S1-S2_S2 cd22378
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
642-1272 9.14e-147

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from SARS-CoV-2 (COVID-19) and related betacoronaviruses in the B lineage; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from betacoronaviruses in the sarbecovirus subgenus (B lineage), including highly pathogenic human CoVs such as Severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV), and SARS-CoV-2 (also known as a 2019 novel coronavirus or 2019-nCoV). The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. Notably, SARS-CoV-2 has a functional polybasic (furin) cleavage site through the insertion of PRRAR*SV (* indicates the cleavage site) at the S1/S2 interface, which is absent in SARS-CoV and other SARS-related coronaviruses. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411965 [Multi-domain]  Cd Length: 662  Bit Score: 460.23  E-value: 9.14e-147
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       642 VSVIY--DKETKTHATLFGSVACEHISSTM--SQYSRSTRSMlkrrdsTYGP--LQTPVGCVLGL--VNSSLfveDCKLP 713
Cdd:cd22378   53 VSVITpgTNASSEVAVLYQDVNCTDVPTAIhaDQLTPAWRVY------STGSnvFQTQAGCLIGAehVNTSY---ECDIP 123
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       714 LGQSLCALPDTPSTLTPRSVSSVpgemrlasIAFNHPIQVDQ-LNSSYFKLSIPTNFSFGVTQEYIQTTIQKVTVDCKQY 792
Cdd:cd22378  124 IGAGICASYHTVSLLRSTSQKSI--------VAYTMSLGAENsIAYSNNSIAIPTNFSISVTTEVMPVSMAKTSVDCTMY 195
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       793 VCNGFQKCEQLLREYGQFCSKINQALHGANLRQDDSVRNLFASVKSSQSSPIIPGFGGdFNLTLLEPvsiSTGSRSARSA 872
Cdd:cd22378  196 ICGDSTECSNLLLQYGSFCTQLNRALSGIAVEQDKNTQEVFAQVKQMYKTPTIKDFGG-FNFSQILP---DPSKPTKRSF 271
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       873 IEDLLFDKVTIADPGYMQGYDDCMqqGPASARDLICAQYVAGYKVLPPLMDVNMEAAYTSSLLGSIAGVGWTAGLSSFAA 952
Cdd:cd22378  272 IEDLLFNKVTLADAGFMKQYGDCL--GDINARDLICAQKFNGLTVLPPLLTDEMIAAYTAALVSGTATAGWTFGAGAALQ 349
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       953 IPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQALGAMQTGFTTTNEAFQKVQDAVNNNAQALSKLASELSNTFGAISA 1032
Cdd:cd22378  350 IPFAMQMAYRFNGIGVTQNVLYENQKQIANQFNKAISQIQESLTTTSTALGKLQDVVNQNAQALNTLVKQLSSNFGAISS 429
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1033 SIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFVAQQLVRSESAALSAQLAKDKVNECVKAQSKRSGFCGQGTHIVSFVVN 1112
Cdd:cd22378  430 VLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQA 509
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1113 APNGLYFMHVGYYPSNHIEVVSAYGLCDAANPTnciAPVNGYFIkTNNTrivdEWSYTGSSFYAPEPITSLNTkYVAPQ- 1191
Cdd:cd22378  510 APHGVVFLHVTYVPSQERNFTTAPAICHEGKAY---FPREGVFV-SNGT----SWFITQRNFYSPQIITTDNT-FVSGNc 580
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1192 -VTYQNISTNLPPPLLGNSTgiDFQDELDEFFKNVSTSIPNFGSLTQINTTLLDLTYEMLSLQQVVKALNESYIDLKELG 1270
Cdd:cd22378  581 dVVIGIINNTVYDPLQPELD--SFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELG 658

                 ..
5X5C_C      1271 NY 1272
Cdd:cd22378  659 KY 660
gammaCoV_Spike_SD1-2_S1-S2_S2 cd22372
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
764-1269 1.40e-136

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from avian infectious bronchitis coronavirus (IBV) and related gammacoronaviruses; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from gammacoronaviruses, including avian infectious bronchitis virus, and Beluga whale coronavirus SW1 (whale-CoV SW1). The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-KU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Ro-BatCoV HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411959 [Multi-domain]  Cd Length: 661  Bit Score: 433.26  E-value: 1.40e-136
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       764 SIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQALHGANLRQDDSVRNLFASVKSSQSSP 843
Cdd:cd22372  163 LIPNSFNLTVTDEYIQTRMDKVQINCLQYVCGNSLECRKLFQQYGPVCDNILSIVNSVNQKEDMELLSFYSSTKPGGFNT 242
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       844 iiPGFG----GDFNLTLLEPvsiSTGSRSARSAIEDLLFDKVTIADPGYMQGYDDCMQQGPASARDLICAQYVAGYKVLP 919
Cdd:cd22372  243 --PVFNnvstGGFNISLLLP---PPSSPQGRSFIEDLLFTKVETVGLPTDDAYKKCTAGPLGFLKDLVCAQEYNGLLVLP 317
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       920 PLMDVNMEAAYTSSLLGSIAGVGWTAGlssfAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQALGAMQTGFTTTN 999
Cdd:cd22372  318 PIITAEMQTMYTGSLVASMAFGGITAA----GAIPFATQIQARINHLGITQSLLLKNQEKIAASFNKAIGHMQEGFRSTS 393
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1000 EAFQKVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFVAQQLVRSESAALSA 1079
Cdd:cd22372  394 LALQQIQDVVNKQSAILTETMASLNKNFGAISSVIQDIYQQLDAIQADAQVDRLITGRLSSLSVLASAKQAEYYKVSQQR 473
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1080 QLAKDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVSAYGLCdaANPTN----CIAPVN--G 1153
Cdd:cd22372  474 ELATQKINECVKSQSNRYGFCGNGRHVLTIPQNAPNGIVFIHFTYTPESFVNVTAIVGFC--VNPANgsqyAIVPANgrG 551
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1154 YFIKTNNTRIVdewsyTGSSFYAPEPITSLN-TKYVAPQVTYQNISTNLPPPLLGNSTgIDFQDELDEFFKNVSTSIPNF 1232
Cdd:cd22372  552 IFIQVNGTYYI-----TARDMYMPRDITAGDiVTLTSCQANYVSVNKTVITTFVDNDD-FDFDDELSKWWNETKHELPDF 625
                        490       500       510
                 ....*....|....*....|....*....|....*....
5X5C_C      1233 gslTQINTT--LLDLTYEMLSLQQVVKALNESYIDLKEL 1269
Cdd:cd22372  626 ---DQFNYTipILNISNEIDRIQEVIQGLNDSLIDLETL 661
MERS-like_CoV_Spike_S1_RBD cd21479
receptor-binding domain of the S1 subunit of the Spike (S) protein from Middle East ...
351-569 3.55e-120

receptor-binding domain of the S1 subunit of the Spike (S) protein from Middle East respiratory syndrome coronavirus; This family contains the receptor-binding domain (RBD) of the S1 subunit of the spike (S) protein from the human coronavirus that causes Middle East Respiratory Syndrome (MERS-CoV) and related coronaviruses from animals. MERS-CoV causes severe pulmonary disease in humans. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including MERS-CoV use the C-domain to bind their receptors. MERS-CoV use human dipeptidyl peptidase 4 (DPP4), also called CD26, as its receptor. It binds DPP4 through the RBD of its S1 subunit and then fuses viral and host membranes through its S2 subunit. Due to the key role of the S protein RBD in viral attachment, it is the major target for antibody-mediated neutralization. This model corresponds to the S1 subunit C-domain that serves as the RBD for most CoVs including MERS-CoV.


Pssm-ID: 394826  Cd Length: 216  Bit Score: 372.11  E-value: 3.55e-120
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       351 AKPSGSVVEQAEGVECDFSPLLSGTPPQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFS 430
Cdd:cd21479    1 AKPRGTFIEQAEGVECDFSPLLKGTPPQVYNFKRLVFTNCNYNLTKLLSLFQVDEFSCHQISPSALATGCYSSLTVDYFA 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       431 YPLSMKSDLSVSSAGPISQFNYKQSFSNPTCLILATVPHNLtTITKPLKYSYINKCSRLLSDDRTevPQLVNANQYSPCV 510
Cdd:cd21479   81 YPLSMKSYLQPGSAGPIVQFNYKQDFSNPTCRILATVPANL-TITKPSNYSYITKCSRLTGDGKN--PQYVNPGQYTPCL 157
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*....
5X5C_C       511 SIVPSTVWEDGDYYRKQLSPLEGGGWLVASGSTVAMTEQLQMGFGITVQYGTDTNSVCP 569
Cdd:cd21479  158 CFSPSGLAEDGFYFSYQLHGLEGGGYLTARGSYVPMTENLQMAFVITVQYGTDTNSVCP 216
alphaCoV_Spike_SD1-2_S1-S2_S2 cd22369
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
583-1269 1.43e-119

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) protein from alphacoronaviruses; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from alphacoronaviruses including human coronaviruses (HCoVs), HCoV-NL63, and HCoV-229E, and porcine coronaviruses, transmissible gastroenteritis virus (TGEV) and porcine epidemic diarrhea virus (PEDV), among others. The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP), and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HKU1 the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411956 [Multi-domain]  Cd Length: 666  Bit Score: 387.80  E-value: 1.43e-119
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       583 LGNCVEYSLYGVSGRGVFQ--NCTAV-GVrqqrFVYDAYQNLVGYY-SDDGNYYCLRAC-VSVPVSVIYDK----ETKTH 653
Cdd:cd22369    7 LNVCTDYTIYGITGRGIIRksNSTYIaGL----YYTSNSGQLLGFKnSTTGEVFSVTPCqLSSQVAVVSDNivgvMSATN 82
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       654 ATLFGSVaceHISSTMSQYSRSTrsmlkrrdsTYGPLQTPVgcvlgLVNSSLFVedcklplgqslCAlpDTPST-LTPRS 732
Cdd:cd22369   83 NVSLGFN---NTIETPSFYYHSN---------GAENCTEPV-----LTYGSIGV-----------CA--DGSITeVTPRS 132
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       733 VSSVPGEmrlasiafnhPIqvdqlnsSYFKLSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCS 812
Cdd:cd22369  133 VSPEPVS----------PI-------ITGNISIPSNFTVSVQVEYLQMYLKPVSVDCSTYVCNGNPRCLQLLTQYASACR 195
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       813 KINQALH-GANLRQDDsVRNLFASVKSSQSSPIIPGFGGDFNLTLLEPvsistGSRSARSAIEDLLFDKVTIADPGYM-Q 890
Cdd:cd22369  196 TIEEALQlSARLESVE-VNSMITVSEEALRLANISTFFDDYNLSAVLP-----AGVGGRSAIEDLLFDKVVTSGLGTVdE 269
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       891 GYDDCMQQGPASARDLICAQYVAGYKVLPPLMDVNMEAAYTSSLLGSIAGVGWTAGlssfAAIPFAQSIFYRLNGVGITQ 970
Cdd:cd22369  270 DYKACTKGLGIAAADVACAQYYNGIMVLPGVVDAEKMALYTASLTGGMVLGGFTAA----AAIPFSLAVQSRLNYVALQT 345
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       971 QVLSENQKLIANKFNQALGAMQTGFTTTNEAFQ--------------KVQDAVNNNAQALSKLASELSNTFGAISASIGD 1036
Cdd:cd22369  346 DVLQRNQQILANSFNSAMGNITVAFSEVNDAIQqtsdaintvaqalnKVQNVVNEQGQALSQLTKQLASNFQAISSSIED 425
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1037 IIQRLDVLEQDAQIDRLINGRLTTLNAFVAQQLVRSESAALSAQLAKDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNG 1116
Cdd:cd22369  426 IYNRLDGLAADAQVDRLITGRLAALNAFVTQTLTKYTEVRASRQLAQQKINECVKSQSSRYGFCGNGTHLFSIVNAAPDG 505
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1117 LYFMHVGYYPSNHIEVVSAYGLCDAANPTNCIAPVNGYFIKTNNTRIVdewsyTGSSFYAP-EPITSLNTKYVAPQVTYQ 1195
Cdd:cd22369  506 IMFLHTVLLPTEYVTVAAWAGLCVDGKAYVLRDDVVLTLFKLNDKYYV-----TPRDMFEPrVPVSSDFVQISNCNVTYV 580
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1196 NISTNLPPPLLGNStgIDFQDELDEFFKNV-STSIPNFGsLTQINTTLLDLTYEM--------------LSLQQVVKALN 1260
Cdd:cd22369  581 NITSDELPEVIPDY--IDVNKTLEEFLANLpNYTLPDLP-LDIFNATYLNLTGEIadlenksesllnttVELQELIDNIN 657

                 ....*....
5X5C_C      1261 ESYIDLKEL 1269
Cdd:cd22369  658 NTLVDLEWL 666
bCoV_S1_N pfam16451
Betacoronavirus-like spike glycoprotein S1, N-terminal; This entry represents the N-terminal ...
32-367 4.19e-109

Betacoronavirus-like spike glycoprotein S1, N-terminal; This entry represents the N-terminal domain of the betacoronavirus-like trimeric spike glycoprotein. The distal S1 subunit of the coronavirus spike protein is responsible for receptor binding. S1 contains two domains; an N-terminal galectin-like domain (NTD) and a receptor-binding domain (S1 RBD) also referred to as the S1 CTD or domain B. Either the S1 NTD or S1 RBD, or occasionally both, are involved in binding to host receptors. S1 NTD is located on the side of the spike trimer and mainly recognizes sugar receptors. For many betacoronaviruses (b-CoVs), for example mouse hepatitis virus (MHV), the RBD is located in the NTD. The structure of the MHV S1 NTD showed the same fold as human galectins (galactose-binding lectin), however it does not bind any sugar; instead, it binds to the carcinoembryonic antigen cell-adhesion molecule CEACAM1) through protein-protein interactions. All three CEACAM21a-binding sites in MHV spikes can be fully occupied by CEACAM1a. It has been shown that CEACAM1a binding to the MHV spike weakens the interactions between S1 and S2 and facilitates the proteolysis of the spike protein and dissociation of S1. The homologous bovine CoV (BCov) S1 NTD also possesses a galectin fold but binds to sialic acid-containing moieties on host cell membranes, as does the NTD of three other group A b-Covs, namely human CoV (HCoV) OC43, avian b-CoV, and infectious bronchitis virus (IBV). Despite the S1 NTD of human respiratory b-CoV HKU1 being highly homologous to the NTDs of MHV and bovine CoV, it does not bind to either sugar or human carcinoembryonic antigen-related cell adhesion molecules (CEACAMs) and the RBD is found instead in the S1 RBD domain.


Pssm-ID: 465119  Cd Length: 296  Bit Score: 345.55  E-value: 4.19e-109
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C          32 DVSKADGIIYPQGRTYSNITITYQGLFPYQGDHGDMYVYSAGHATGTTpqklfvaNYSQDVKQFANGFVVRIGAAANSTG 111
Cdd:pfam16451    1 DVSKADGTYYLPDRVYSNTTTLRQGLFPKQGDNVTNYVYSPAHHCGKR-------NFSTPFLPFGDGIFVHIGNASNATG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         112 TVIISpstsatirkIYPAFMLGSSVGNFSdgkmgrffnHTLVLLPDGCGTLLRAFYCileprsgnhcpagnsytSFATYH 191
Cdd:pfam16451   74 GRIIS---------EPPAFVFGSTFGNTS---------HTLIIAPDSCQTNLTILVC-----------------NFTLCA 118
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         192 TPATDCS--DGNYNRNASLNSFKeyfNLRNCTFMYTYNITEDEILEWFGITQTAQGVHLFSSRYV---DLYGGNMFQFAT 266
Cdd:pfam16451  119 NPVTACRwgAGNYNANNSLVYFK---NAINCTFNRTYNITFDTSLIYFGFKQQDGGFHIYYSYWLpdlDSGPPTLFPFAT 195
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         267 LPVYDTIKYYSIIPHSIRSIQSDRKAWAAFYVYKLQPLTFLLDFSVDGYIRRAIDCGFNDLSQLHCSYESFDVESGVYSV 346
Cdd:pfam16451  196 LPLGINITYFQVIPSSIRSTQNCRRANAAYYVAPLKPSTFLLDFDVNGYITNAVDCSFDPLSELKCSTGSFEPDTGVYST 275
                          330       340
                   ....*....|....*....|.
5X5C_C         347 SSFEAKPSGSVVEQAEGVECD 367
Cdd:pfam16451  276 SSYRAQPVGSVYVRQPNVTCD 296
alphaCoV-HKU2-like_Spike_SD1-2_S1-S2_S2 cd22371
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the CoV ...
600-1277 7.14e-107

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the CoV spike (S) glycoprotein from Rhinolophus bat coronavirus HKU2 and related alphacoronaviruses; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from Wencheng shrew coronavirus (WESV), Lucheng Rn rat coronavirus (LRNV), and two bat viruses (Rhinolophus bat coronavirus HKU2 and BtRf-AlphaCoV/YN2012). Members of this group form a distinct cluster that is separated from the other alphacoronaviruses. The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-KU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Ro-BatCoV HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411958 [Multi-domain]  Cd Length: 686  Bit Score: 353.71  E-value: 7.14e-107
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       600 FQNCTAVGV-RQQRFVYDAYQNLVgYYSD-------DGNYYCLRACVSVPVSVIYDKETKTHaTLFGSVACEHISSTMSQ 671
Cdd:cd22371    1 IDGVTFQGIlYETNFTFDSFYNLL-YKGSmvkyvriLGVVYEVEPCNEFSYSVLKNNSSSYG-TLYSGADCNQIDTKTFR 78
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       672 YsrstrsmlkrRDSTYGPLQTPVGCvlgLVNSSLFVED---CKLPLGQSLCAlpdtpsTLTPRSVSSVPGEMRLASIAFN 748
Cdd:cd22371   79 F----------KARSHTGTNTSLGC---LFNASYTNDTyttCLNPLGNGFCA------DVNVTSPVVGNIGIQKHDTDYV 139
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       749 HPIqvdqLNSSYfkLSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQALHGANLRQDDS 828
Cdd:cd22371  140 RPI----LTEQF--IELPLDHQLVVKEQFLQTSMPKFDVDCERYICDVSKACRELLFKYGGFCSKITADIKGSSILLDSQ 213
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       829 VRNLFASVKSSQSSPIiPGFGgDFNLTLLEPvsistgSRSARSAIEDLLFDKVTIADPGYMQGYDDCMQQgpaSARDLIC 908
Cdd:cd22371  214 ILGLYKTIAVDFSSPD-VDFG-DFNFSMFMS------EKNGRSFIEDLLFDKIVTTGPGFYQDYYDCKKM---NLQDLTC 282
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       909 AQYVAGYKVLPPLMDvnmeaAYTSSLLGSIAGVGWTAGLSSFAA--IPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQ 986
Cdd:cd22371  283 AQYYNGIMVIPPIMD-----DETIGMYGGIVAASMTAGLFGGQAgmVTWNTAMAGRLNALGVTQDALVEDVNKLANGFNN 357
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       987 ALGAMQTGFTTTNEAFQKVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFVA 1066
Cdd:cd22371  358 LTQSVSKLAKTTSQALSAIQAVVNQNAAQVEQLVQGLSENFGAISNNFEVIAERLEKLEADQQMDRLINGRMNVLQNFVT 437
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1067 QQLVRSESAALSAQLAKDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVSAYGLCDaANPTN 1146
Cdd:cd22371  438 NYKLKISELKSTQRLVQSLINECVYAQSLRNGFCGDGLHVMSLMQNAPDGIMFFHYTLKPNNTIIVKTTPGLCL-SNEVC 516
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1147 CIAPVNGYFIKTNntRIVDEWSYTGSSFYAPEPITSLNTKYVAPQVTYQNISTNLPPPLLGNSTGIDfqDELDEFFKNVS 1226
Cdd:cd22371  517 IKPIDAKFGVLVS--ANDSYWHFTPRNIYNPENITNSNIIAVSGGANYTTVNNTIDIIEPPQNPPID--EEFRELYKNVT 592
                        650       660       670       680       690
                 ....*....|....*....|....*....|....*....|....*....|.
5X5C_C      1227 TSIPNFGSLTqINTTLLDLTYEMLSLQQVVKALNESYIDLKELGNYTYYNK 1277
Cdd:cd22371  593 LELEQLKNIT-FDMSKLNLTYEIDRLNEIAENVSKLHVTVSEFNKYVQYVK 642
delta-PDCoV-like_Spike_SD1-2_S1-S2_S2 cd22373
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
763-1265 1.66e-103

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from porcine coronavirus HKU15, avian coronaviruses, and related deltacoronaviruses; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from porcine coronavirus PDCoV, and several avian coronaviruses such as quail deltacoronavirus (QdCoV) UAE-HKU30, white-eye coronavirus HKU16, common moorhen coronavirus HKU21, thrush CoV HKU12, and munia CoV HKU13, all from the Buldecovirus subgenus of deltacoronaviruses. The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-KU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Ro-BatCoV HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411960 [Multi-domain]  Cd Length: 648  Bit Score: 343.36  E-value: 1.66e-103
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       763 LSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQALHGANLRQDDSVRNLFASVKSSQSS 842
Cdd:cd22373  140 VEIPSAFTLSVQTEYLQVQAEQVVVDCPQYVCNGNSRCLQLLAQYTSACSNIESALHSSAQLDSREITNMFQTSTQSLEL 219
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       843 PIIPGFGGDFNLTllepvSISTGSRSARSAIEDLLFDKVTIADPGYM-QGYDDCmQQGPASArDLICAQYVAGYKVLPPL 921
Cdd:cd22373  220 ANITNFKGDYNFT-----SILTTKIGGRSAIEDLLFNKVVTNGLGTVdQDYKSC-SKDMAIA-DLVCSQYYNGIMVLPGV 292
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       922 MDVNMEAAYTSSLLGSIAGVGWTAGlssfAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQALGAMQTGFTTTNEA 1001
Cdd:cd22373  293 VDAEKMAMYTGSLTGAMVFGGLTAA----AAIPFSTAVQARLNYVALQTNVLQENQKILAESFNQAVGNISLALSSVNDA 368
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1002 FQ--------------KVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFVAQ 1067
Cdd:cd22373  369 IQqtsealntvanainKIQTVVNQQGEALSHLTAQLSNNFQAISTSIQDIYNRLDEVEANQQVDRLITGRLAALNAYVTQ 448
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1068 QLVRSESAALSAQLAKDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVSAYGLcdaanptnC 1147
Cdd:cd22373  449 LLNQMSQIRQSRLLAQQKINECVKSQSSRYGFCGNGTHLFSITQAAPNGIFFMHAVLVPTKFTRVNASAGI--------C 520
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1148 IAPVNGYFIKTNNT--RIVDEWSYTGSSFYAPEPITSLN-TKYVAPQVTYQNISTNLPPPLLGNStgIDFQDELDEFFKN 1224
Cdd:cd22373  521 VDNTKGYSLQPQLIlyQFNNSWRVTPRNMYEPRLPRQADfIPLTDCSVTFYNTTAADLPNIIPDY--VDVNQTVSDIIDN 598
                        490       500       510       520
                 ....*....|....*....|....*....|....*....|....*..
5X5C_C      1225 VSTSIPNFGSLTQINTTLLDLTYEMLSLQQVVKALN------ESYID 1265
Cdd:cd22373  599 LPTPTPPQLDVDIYNNTILNLTQEINDLQERSKNLSqiadrlQQYID 645
CoV_Spike_S1_NTD cd21527
N-terminal domain of the S1 subunit of coronavirus Spike (S) proteins; This family contains ...
25-333 3.68e-100

N-terminal domain of the S1 subunit of coronavirus Spike (S) proteins; This family contains the N-terminal domain (NTD) of the S1 subunit of coronavirus (CoV) Spike (S) proteins from all four (A-D) lineages of betacoronaviruses, including three highly pathogenic human CoVs (HCoV) such as Middle East respiratory syndrome (MERS)-related CoV, Severe acute respiratory syndrome (SARS) CoV, and SARS-CoV-2, also known as a 2019 novel coronavirus (2019-nCoV) or COVID-19 virus. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). Most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV, use the C-domain to bind their receptors. However, some CoVs from the A lineage, such as mouse hepatitis virus (MHV) uses the NTD to bind its receptor, mouse carcinoembryonic antigen related cell adhesion molecule 1a (mCEACAM1a). Bovine CoV and HCoV-OC43, also from the A lineage, recognize a sugar moiety, 5-N-acetyl-9-O-acetylneuraminic acid (Neu5,9Ac2), on cell-surface glycoproteins or glycolipids; this binding is also through the S1 NTD. The S1 NTD has also been the target for neutralizing antibodies, including human antibody CDC2-A2, and murine antibodies G2 and 5F9, which target MERS-CoV NTD. In addition, the S1 NTD contributes to the Spike trimer interface.


Pssm-ID: 394949  Cd Length: 268  Bit Score: 320.20  E-value: 3.68e-100
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C        25 KTWPRPIDVSKADGIIYPQGRTYSNITITYQGLFPYQGDHGDMYVYSAGHATGTtpqklfVANYSQDVKQFANGFVVRIG 104
Cdd:cd21527    1 KISTHTSDVSKGLGTYYPDDRVYSNTTLLLQGLFPYDGSNFTNYALKGSHALGT------LWFYPPFVSPFNNGIFVKVK 74
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       105 AAANSTgtviispstSATIRKIYPAFMLGSSVGnfsdgkmgrFFNHTLVLLPDGCGTLLRAFYCILEPRsgnhcpagnsy 184
Cdd:cd21527   75 NTKNST---------SATIYSEYPAIVFGSTFG---------NTSYTVVIQPDNGGTLLEASACQYEMC----------- 125
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       185 TSFATYHTPATDCSDGNYnrnaslnSFKEYFNLRNCTFMYTYNITEDEILEWFGITQTAQGVHLFSSRYVDLYGGNMFQF 264
Cdd:cd21527  126 EYNATICVPKTDGSDGNY-------SWHIDSNAFNCTFEYNFTYNVNADLLYFGFYQEDGTLYAYYSDYVDLYGGPLKFL 198
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       265 ATLPVYDTIKYYSIIPHSIRSIQS-DRKAWAAFYVYKLQPLTFLLDFSVDGYIRRAIDCGFNDLSQLHCS 333
Cdd:cd21527  199 FSLPLGDNLTNYYVIPLTCRSIQSsDRKFAAAYYVTYLTPRTFLLNFDENGVITNAVDCSSNFLSELKCS 268
bat_HKU4-like_Spike_S1_RBD cd21487
receptor-binding domain of the S1 subunit of the Spike (S) protein from Tylonycteris bat ...
351-569 5.68e-100

receptor-binding domain of the S1 subunit of the Spike (S) protein from Tylonycteris bat coronavirus HKU4 and similar proteins; This group contains the receptor-binding domain (RBD) of the S1 subunit of the spike (S) protein from Tylonycteris bat coronavirus HKU4 and other Middle East Respiratory Syndrome (MERS)-related coronaviruses. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including human MERS-CoV that is phylogenetically closely related to bat CoV HKU4 use the C-domain to bind their receptors. HKU4 is able to bind the MERS-CoV receptor, human dipeptidyl peptidase 4 (DPP4), also called CD26. Due to the key role of the S protein RBD in viral attachment, it is the major target for antibody-mediated neutralization. This model corresponds to the S1 subunit C-domain that serves as the RBD for most CoVs including MERS-CoV, and most likely, bat CoV HKU4.


Pssm-ID: 394834  Cd Length: 219  Bit Score: 317.69  E-value: 5.68e-100
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       351 AKPSGSVVEQAEGVECDFSPLLSGTPPQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFS 430
Cdd:cd21487    1 ASATGTFIEQPNATECDFSPMLTGVAPQVYNFKRLVFSNCNYNLTKLLSLFAVDEFSCNGISPDAIARGCYSTLTVDYFA 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       431 YPLSMKSDLSVSSAGPISQFNYKQSFSNPTCLILATVPHNLtTITKPLKYSYINKCSRLLSDDR-TEVPQLVNANQYSPC 509
Cdd:cd21487   81 YPLSMKSYIRPGSAGNIPLYNYKQSFANPTCRVMASVPANV-TITKPEAYGYISKCSRLTGDNQhIETPLYINPGEYSIC 159
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       510 VSIVPSTVWEDGDYYRKQLSPLEGGGWLVASGSTVAMTEQLQMGFGITVQYGTDTNSVCP 569
Cdd:cd21487  160 RDFAPNGFSEDGQVFKRTLTQFEGGGLLIGVGTRVPMTANLEMGFVISVQYGTGTDSVCP 219
PDEV-like_Spike_SD1-2_S1-2_S2 cd22376
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
763-1269 7.96e-100

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from Porcine epidemic diarrhea virus and related alphacoronavirus; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from alphacoronaviruses, including porcine epidemic diarrhea virus (PEDV), Scotophilus bat coronavirus, and swine enteric coronavirus, among others. The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HKU1 the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411963 [Multi-domain]  Cd Length: 673  Bit Score: 334.03  E-value: 7.96e-100
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       763 LSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQALH-GANLRQDDSVRNLFASVKSSQS 841
Cdd:cd22376  143 ISIPTNFTMSIRTEYLQLYNTPVSVDCAMYVCNGNSRCKQLLTQYTSACKTIESALQlSARLESVEVNSMLTISEEALQL 222
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       842 SPIIPGFGGDFNLTLLEPVSISTgsrsaRSAIEDLLFDKVTIADPGYM-QGYDDCMQQgpASARDLICAQYVAGYKVLPP 920
Cdd:cd22376  223 ATISSFNGGGYNFTNVLGASVQK-----RSFIEDLLFNKVVTNGLGTVdEDYKRCSNG--LSVADLVCAQYYSGVMVLPG 295
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       921 LMDVNMEAAYTSSLLGSIAgvgwTAGLSSFAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQALGAMQTGFTTTNE 1000
Cdd:cd22376  296 VVDAEKLHMYSASLIGGMV----LGGITAAAALPFSYAVQARLNYVALQTDVLQRNQQLLAESFNSAIGNITSAFESVKE 371
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1001 AFQ--------------KVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFVA 1066
Cdd:cd22376  372 AISqtsqglntvahaltKVQDVVNSQGAALNQLTVQLQHNFQAISSSIDDIYSRLDQLSADAQVDRLITGRLSALNAFVA 451
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1067 QQLVRSESAALSAQLAKDKVNECVKAQSKRSGFCG-QGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVSAYGLCDaaNPT 1145
Cdd:cd22376  452 QTLTKYTEVQASRKLAQQKVNECVKSQSQRYGFCGgDGEHIFSLVQAAPQGLLFLHTVLVPGDFVNVTAIAGLCV--DDE 529
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1146 NCIAPVNGYFIKTNNTRIVDEWSYTGSSFYAPEPITSLNTKYV---APQVTYQNISTNLPPPLLGNStgIDFQDELDEFF 1222
Cdd:cd22376  530 IALTLREPGVLFTHEVLTYTATEYFVSPRKMFEPRKPTVSDFVqieSCVVTYVNLTSDQLPDVIPDY--IDVNKTLDEIL 607
                        490       500       510       520
                 ....*....|....*....|....*....|....*....|....*...
5X5C_C      1223 KNV-STSIPNFgSLTQINTTLLDLTYEMLSLQQVVKALNESYIDLKEL 1269
Cdd:cd22376  608 ASLpNRTGPSL-PLDVFNATYLNLTGEIADLEQRSESLRNTTEELRSL 654
HCoV-NL63-229E-like_Spike_SD1-2_S1-S2_S2 cd22375
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
583-1269 1.23e-98

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoproteins from HCoV-NL63, HCoV-229E, and related alphacoronavirus; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from alphacoronaviruses, including human coronaviruses (HCoVs), HCoV-NL63 and HCoV-229E. The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411962 [Multi-domain]  Cd Length: 677  Bit Score: 330.71  E-value: 1.23e-98
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       583 LGNCVEYSLYGVSGRGVFQNCTAVGVRQQRFVYDAyQNLVGYYS-DDGNYYCLRACVSVPVSVIYDKetkthaTLFGSVA 661
Cdd:cd22375    7 LNNCTKYNIYDYSGTGVIRSSNDSFIGGITYTSNS-GNLLGFKDvSTGTIYSITPCNPPDQVVVYQQ------AIVGAML 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       662 CEHISSTMSQYSRSTRSMLKRRDSTYGplqtpvgCVLGLVNSSLFvedcklplgqSLCA----LPDTPSTLTPRSVSSVp 737
Cdd:cd22375   80 SENETRYGLSNVVELPNFYYASNGTYN-------CTDAVLTYSNF----------GICAdgsiIPVRPRNVSDNGVSAI- 141
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       738 gemrlasIAFNhpiqvdqlnssyfkLSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQA 817
Cdd:cd22375  142 -------VTAN--------------LSIPSNWTTSVQVEYLQITSTPIVVDCSTYVCNGNPRCVELLKQYTSACKTIEDA 200
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       818 LHGANLRQDDSVRNLFASVKSSQSSPIIPGFGgDFNLTLLEP-VSISTGSRSARSAIEDLLFDKVTIADPGYMQG-YDDC 895
Cdd:cd22375  201 LRLSARLESADVSSMLTFDSNAFTLANVSSFG-DYNLSSVLPqLPTSGSRIAGRSAIEDLLFSKVVTSGLGTVDAdYKSC 279
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       896 MQQgpASARDLICAQYVAGYKVLPPLMDVNMEAAYTSSLLGSIAgvgwTAGLSSFAAIPFAQSIFYRLNGVGITQQVLSE 975
Cdd:cd22375  280 TKG--LSIADLACAQYYNGIMVLPGVADAERMAMYTGSLIGGMA----LGGLTSAAAIPFSLALQARLNYVALQTDVLQE 353
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       976 NQKLIANKFNQALGAMQTGFTTTNEA--------------FQKVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRL 1041
Cdd:cd22375  354 NQKILAASFNKAMTNIVDAFTGVNDAitqtsqaiqtvataLNKIQDVVNQQGNALNHLTSQLRQNFQAISSSIQAIYDRL 433
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1042 DVLEQDAQIDRLINGRLTTLNAFVAQQLVRSESAALSAQLAKDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNGLYFMH 1121
Cdd:cd22375  434 DTIQADQQVDRLITGRLAALNAFVSQTLTKYTEVRASRQLAQQKVNECVKSQSNRYGFCGNGTHIFSIVNAAPEGLVFLH 513
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1122 VGYYPSNHIEVVSAYGLcdaanptnCIAPVNGYFIKTNNTRIVDEWSY---TGSSFYAPE-PITSLNTKYVAPQVTYQNI 1197
Cdd:cd22375  514 TVLLPTQYKDVEAWSGL--------CVDGVNGYVLRQPNLALYKDGGVfriTSRVMFEPRiPTMADFVQIENCNVTFVNI 585
                        650       660       670       680       690       700       710
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
5X5C_C      1198 STNLPPPLLGNStgIDFQDELDEFF-KNVSTSIPNFGsLTQINTTLLDLTYEMLSLQQVVKALNESYIDLKEL 1269
Cdd:cd22375  586 SRSELQTIVPEY--VDVNKTLQELIeKLPNYTVPDLD-LDQYNQTILNLTSEISTLENKSAELNYTVQKLQTL 655
delta-PiCoV-like_Spike_SD1-2_S1-S2_S2 cd22374
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
714-1277 2.09e-94

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from Pigeon coronavirus UAE-HKU29, and related avian deltacoronaviruses; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from Pigeon coronavirus UAE-HKU29, and related avian deltacoronaviruses including Falcon coronavirus UAE-HKU27, Magpie-robin coronavirus HKU18, Sparrow coronavirus HKU17, and Night heron coronavirus HKU19. The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the (C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-KU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Ro-BatCoV HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411961 [Multi-domain]  Cd Length: 739  Bit Score: 320.68  E-value: 2.09e-94
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       714 LGQSLCALPdtpsTLTPRSVSSVP-GEMRLASIAFNHPIQVDQLNSSyfKLSIPTNFSFGVTQEYIQTTIQKVTVDCKQY 792
Cdd:cd22374  121 IGNDTCETP----VITFGSIGVCPgGGLHFVDPTSNEFTNVVPISTQ--NISIPKNFTVSIQTEYIQIEQQPVTVDCRQY 194
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       793 VCNGFQKCEQLLREYGQFCSKINQALHGANLRQDDSVRNLFASVKSSQSSPIIPGFGGD---FNLTLLEPVSIStgsrsA 869
Cdd:cd22374  195 VCNGNPRCLQLLMQYTSACSTIEQALSLNARLEAASIQTMLTYSPETLKLANITNFQSDdvnYNLTNILPKKYQ-----G 269
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       870 RSAIEDLLFDKVTIADPGYM-QGYDDCMQQgpASARDLICAQYVAGYKVLPPLMDVNMEAAYTSSLLGSIAgvgwTAGLS 948
Cdd:cd22374  270 RSAIEDLLFDKVVTNGLGTVdQDYKACTNG--VSIADLVCAQYYNGIMVLPGVADPEKMAQYTASLTGGMV----FGGLT 343
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       949 SFAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQALGAMQTGFTTTNE--------------AFQKVQDAVNNNAQ 1014
Cdd:cd22374  344 SAAAIPFSLAVQSRLNYVALQTDVLQQNQQILADSFNNAMGNITLAFKEVSEglsqvsgaittvanALTKIQTVVNSQGQ 423
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1015 ALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLINGRLTTLNAFVAQQLVRSESAALSAQLAKDKVNECVKAQS 1094
Cdd:cd22374  424 ALATLTEQLANNFQAISASIADIYNRLNQLEADAQVDRLITGRLAALNAFVTQTLSKLAEVRQARQLALDKINECVKSQS 503
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1095 KRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVSAYGLCDAANPTNCIAPVNGYFIKTnntrivDEWSYTGSSF 1174
Cdd:cd22374  504 SRYGFCGNGTHLFSIVNAAPYGFVFFHTVLLPTQYATVQAYSGICQNGRALALKDPSLALFRGT------DKYLVTPRNM 577
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1175 YAPEPITSLNTKYVAP-QVTYQNISTNLPPPLLGNStgIDFQDELDEFFKNV-STSIPNFgSLTQINTTLLDLTYEM--- 1249
Cdd:cd22374  578 YQPRTAAQADFVYIEScTVTYLNLTDTTIDAVIPDY--VDVNKTVEDILNNLpNYTKPDL-DIGRYNNTILNLTTEIndl 654
                        570       580       590
                 ....*....|....*....|....*....|....*....
5X5C_C      1250 ------LS-----LQQVVKALNESYIDLKELGNYTYYNK 1277
Cdd:cd22374  655 ngraenLSqivenLEEYIKKINATLVDLEWLNRVETYIK 693
TGEV-like_Spike_SD1-2_S1-S2_S2 cd22377
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
763-1277 2.74e-91

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from transmissible gastroenteritis virus and related alphacoronaviruses; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from porcine transmissible gastroenteritis virus (TGEV), canine coronavirus (CCoV), and feline coronavirus (FCoV). They display greater than 96% sequence identity and have been grouped in the same species, alphacoronavirus 1, within the Alphacoronavirus genus. The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411964 [Multi-domain]  Cd Length: 751  Bit Score: 312.47  E-value: 2.74e-91
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       763 LSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFQKCEQLLREYGQFCSKINQALH-GANLRQ---DD----SVRNL-F 833
Cdd:cd22377  171 ITIPKNFTVAVQAEYIQIQVKPVVVDCAKYVCNGNRHCLKLLTQYTSACQTIENALNlGARLESlmlNDmitvSDRSLeL 250
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       834 ASVKSSQSS----PIIPGFGGDFNLTLLEPVSistgsrSARSAIEDLLFDKVTIADPGYM-QGYDDCmqQGPASARDLIC 908
Cdd:cd22377  251 ATVEKFNSTvlggEKLGGFYFDGLKDLLPPRI------GKRSAIEDLLFNKVVTSGLGTVdDDYKKC--SAGTDVADLVC 322
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       909 AQYVAGYKVLPPLMDVNMEAAYTSSLLGSIAgvgwTAGLSSFAAIPFAQSIFYRLNGVGITQQVLSENQKLIANKFNQAL 988
Cdd:cd22377  323 AQYYNGIMVLPGVVDDNKMAMYTASLIGGMA----LGSITSAVAVPFAMQVQARLNYVALQTDVLQENQKILANAFNNAI 398
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       989 G-----------AMQT---GFTTTNEAFQKVQDAVNNNAQALSKLASELSNTFGAISASIGDIIQRLDVLEQDAQIDRLI 1054
Cdd:cd22377  399 GnitlalgkvsnAITTtsdGFNTMASALTKIQSVVNQQGEALSQLTSQLQKNFQAISSSIAEIYNRLEKVEADAQVDRLI 478
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1055 NGRLTTLNAFVAQQLVRSESAALSAQLAKDKVNECVKAQSKRSGFCGQGTHIVSFVVNAPNGLYFMHVGYYPSNHIEVVS 1134
Cdd:cd22377  479 TGRLAALNAYVSQTLTQYAEVKASRQLAMEKVNECVKSQSDRYGFCGNGTHLFSLVNSAPDGLLFFHTVLLPTEWEEVTA 558
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1135 AYGLCdaANPTnciapvNGYFIKTNNTRIvdeWSYTGSSFYAP----EP----------ITS-----LNTKYvapqVTYQ 1195
Cdd:cd22377  559 WSGIC--VNDT------YAYVLKDFLTSI---FSYNGTYMVTPrnmfQPrkpqmsdfvqITScevtfLNTTY----TTFQ 623
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C      1196 NIstnLPPPLLGNSTgidFQDELDEFFKNVSTSIPNFgSLTQINTTLLDLTYEM--------------LSLQQVVKALNE 1261
Cdd:cd22377  624 EI---VIDYIDINKT---IADMLEQYNPNYTVPELDL-QLEIFNQTKLNLTAEIdqleqradnltniaHELQQYIDNLNK 696
                        570
                 ....*....|....*.
5X5C_C      1262 SYIDLKELGNYTYYNK 1277
Cdd:cd22377  697 TLVDLEWLNRIETYVK 712
CoV_Spike_S1_RBD cd21470
receptor-binding domain of the S1 subunit of coronavirus spike (S) proteins; This family ...
351-569 4.30e-62

receptor-binding domain of the S1 subunit of coronavirus spike (S) proteins; This family contains the receptor-binding domain (RBD) of the S1 subunit of coronavirus (CoV) spike (S) proteins from three highly pathogenic human coronaviruses (CoVs), including Middle East respiratory syndrome coronavirus (MERS-CoV), Severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV), and SARS coronavirus 2 (SARS-CoV-2), also known as a 2019 novel coronavirus (2019-nCoV), as well as S proteins from related coronaviruses. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. MHV uses mouse carcinoembryonic antigen related cell adhesion molecule 1a (mCEACAM1a) as the receptor, and the receptors for SARS-CoV and MERS-CoV are human angiotensin-converting enzyme 2 (ACE2) and human dipeptidyl peptidase 4 (DPP4), respectively. Recent studies found that the RBD of SARS-CoV-2 S protein binds strongly to human and bat angiotensin-converting enzyme 2 (ACE2) receptors. Moreover, SARS-CoV-2 RBD exhibited significantly higher binding affinity to the ACE2 receptor than SARS-CoV RBD. Due to the key role of the S protein RBD in viral attachment, it is the major target for antibody-mediated neutralization. This model corresponds to the S1 subunit C-domain that serves as the RBD for most CoVs.


Pssm-ID: 394823  Cd Length: 171  Bit Score: 209.26  E-value: 4.30e-62
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       351 AKPSGSVVEQAEGVE-CDFSPLLSGTP-PQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDY 428
Cdd:cd21470    1 VKPSGSVVRRPNNTPlCDFSEWLNATSvPSVYNWERKVFSNCNFNLSKLLSLFSVDSFTCNGISPAKIAGLCFSSITVDK 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       429 FSYPLSMKSDLSVSSAGPISQFNYKQSFSNPTCLILATVPHNLTTITKPLKYSYINKcsrllsddrtevpqlvnanqysp 508
Cdd:cd21470   81 FAIPLSRKSDLQPGSAGFIQDYNYKIDFDNTSCQLAYNLPANNATITKNHDYVYIQK----------------------- 137
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
5X5C_C       509 cvsivpstvwedgdyyrkqlsplegggwlVASGSTVAMT--EQLQMGFGITVQYGTDTNSVCP 569
Cdd:cd21470  138 -----------------------------FLGWSTDGCTsgDQCQIFFNISFQYGNASGTVCS 171
bCoV_S1_RBD pfam09408
Betacoronavirus spike glycoprotein S1, receptor binding; This entry represents the receptor ...
377-569 1.07e-37

Betacoronavirus spike glycoprotein S1, receptor binding; This entry represents the receptor binding domain (S1 RBD) of the betacoronavirus spike glycoprotein. The spike glycoprotein is arranged in trimers on the surface of the viral membrane and is essential for viral entry. The spike protein is translated as a large polypeptide that is subsequently cleaved to the distal S1, responsible for receptor binding, and the membrane-anchored S2 responsible for membrane fusion. The coronavirus (SARS-CoV) S1 subunit is composed of two distinct domains: an N-terminal domain (S1 NTD) and a receptor-binding domain (S1 RBD) also referred to as the S1 CTD or domain B. Each of these domains have been implicated in binding to host receptors. However, most coronaviruses are not known to utilize both the S1 NTD and S1 RBD for viral entry. SARS-CoV makes use of its S1 RBD to bind to the human angiotensin-converting enzyme 2 (ACE2) as its host receptor.


Pssm-ID: 462789  Cd Length: 156  Bit Score: 138.79  E-value: 1.07e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         377 PQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFSYPLSMKSDLSVSSAGPISQFNYKQSF 456
Cdd:pfam09408    1 PSPANWERKTFSNCNFDFSVLLSLLQVSSFKCYGVSPSKLADMCYGSVTIDYFAIPETHKSNLQPGSPGAISKYNYKLPD 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C         457 SNPTCLILATVPhNLTTITKPLKYSYInkcsrllsddrtevpqLVNANQYSPCVSIVpSTVWEDGDYYRKQLSplegggw 536
Cdd:pfam09408   81 DFYGCVLAFNVN-ANTNYAFADNYPYR----------------YIKPGQYQPCNSFV-STVPNSPDGHYCTPS------- 135
                          170       180       190
                   ....*....|....*....|....*....|...
5X5C_C         537 lvasgstvamteQLQMGFGITVQYGTDTNSVCP 569
Cdd:pfam09408  136 ------------SFNGVVVITLKPATGSNLVCP 156
HKU1-like_CoV_Spike_S1_RBD cd21478
receptor-binding domain of the S1 subunit of the Spike (S) protein from human coronavirus HKU1 ...
365-476 2.94e-17

receptor-binding domain of the S1 subunit of the Spike (S) protein from human coronavirus HKU1 and related coronaviruses; This family contains the receptor-binding domain (RBD) of the S1 subunit of the spike (S) protein from human coronavirus (CoV) HKU1, human coronavirus OC43 (HCoV-OC43), mouse hepatitis virus (MHV), porcine hemagglutinating encephalomyelitis virus (HEV), and other related coronaviruses. HKU1 is a human betacoronavirus that causes mild yet prevalent respiratory disease. HCoV-OC43 is of zoonotic origin and is endemic in the human population, causing mild respiratory tract infections and possible severe complications or fatalities in young children, the elderly, and immunocompromised individuals. MHV is the most common viral pathogen in contemporary laboratory mouse colonies manifesting as a primary infection in the upper respiratory tract. Porcine HEV is associated with acute outbreaks of wasting and encephalitis in nursing piglets from pig farms. These viruses are related to the zoonotic SARS and MERS betacoronaviruses, which have high fatality rates and pandemic potential. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBDs of MHV and HCoV-OC43 are located at the NTD, most CoVs use the C-domain to bind their receptors. Although a protein receptor has not yet been identified for HKU1, antibodies against the C-domain, but not those against the NTD, blocked HKU1 infection of cells, suggesting that the S1 C-domain is the primary HKU1 receptor-binding site. Due to the key role of the S protein RBD in viral attachment, it is the major target for antibody-mediated neutralization. This model corresponds to the S1 subunit C-domain that serves as the RBD for most CoVs.


Pssm-ID: 394825  Cd Length: 223  Bit Score: 82.12  E-value: 2.94e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       365 ECDFSPLLSG-TPPQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFSYPLSMKSDLSVSS 443
Cdd:cd21478   20 DCDIEEWLNApTVPSPLNWERKTFSNCNFNMSSLLSLVQADSFSCNNIDASKIYGMCFGSITIDKFAIPNSRKVDLQLGS 99
                         90       100       110
                 ....*....|....*....|....*....|...
5X5C_C       444 AGPISQFNYKQSFSNPTCLILATVPHNLTTITK 476
Cdd:cd21478  100 SGYLQSFNYRIDTAATSCQLYYSLPANNVTVTN 132
HKU1_N1_CoV_Spike_S1_RBD cd21483
receptor-binding domain of the S1 subunit of the Spike (S) protein from human coronavirus HKU1, ...
347-474 2.84e-15

receptor-binding domain of the S1 subunit of the Spike (S) protein from human coronavirus HKU1, isolate N1; This group contains the receptor-binding domain (RBD) of the S1 subunit of the spike (S) protein from human coronavirus (CoV) HKU1, isolate N1. HKU1 is a human betacoronavirus that causes mild yet prevalent respiratory disease, and is related to the zoonotic SARS and MERS betacoronaviruses, which have high fatality rates and pandemic potential. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs use the C-domain to bind their receptors. Although a protein receptor has not yet been identified for HKU1, antibodies against the C-domain, but not those against the NTD, blocked HKU1 infection of cells, suggesting that the S1 C-domain is the primary HKU1 receptor-binding site. Due to the key role of the S protein RBD in viral attachment, it is the major target for antibody-mediated neutralization. This model corresponds to the S1 subunit C-domain that serves as the RBD for most CoVs, and most likely, for HKU1.


Pssm-ID: 394830  Cd Length: 306  Bit Score: 78.23  E-value: 2.84e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       347 SSFEAKPSGSVVEQAEGV-ECDFSPLLSG-TPPQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSL 424
Cdd:cd21483    1 SGFTVKPVATVHRRIPDLpDCDIDKWLNNfNVPSPLNWERKIFSNCNFNLSTLLRLVHTDSFSCNNFDESKIYGSCFKSI 80
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|
5X5C_C       425 ILDYFSYPLSMKSDLSVSSAGPISQFNYKQSFSNPTCLILATVPHNLTTI 474
Cdd:cd21483   81 VLDKFAIPNSRRSDLQLGSSGFLQSSNYKIDTTSSSCQLYYSLPAINVTI 130
HEV_Spike_S1_RBD cd21508
receptor-binding domain of the S1 subunit of the Spike (S) protein from porcine ...
366-509 5.54e-15

receptor-binding domain of the S1 subunit of the Spike (S) protein from porcine hemagglutinating encephalomyelitis virus; This group contains the receptor-binding domain (RBD) of the S1 subunit of the spike (S) protein from porcine hemagglutinating encephalomyelitis virus (HEV), which is associated with acute outbreaks of wasting and encephalitis in nursing piglets from pig farms. Porcine HEV is related to the zoonotic SARS and MERS betacoronaviruses, which have high fatality rates and pandemic potential. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs use the C-domain to bind their receptors. The protein receptor for porcine HEV has not yet been identified. Due to the key role of the S protein RBD in viral attachment, it is the major target for antibody-mediated neutralization. This model corresponds to the S1 subunit C-domain that serves as the RBD for most CoVs.


Pssm-ID: 394835  Cd Length: 298  Bit Score: 77.09  E-value: 5.54e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       366 CDFSPLL-SGTPPQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFSYPLSMKSDLSVSSA 444
Cdd:cd21508   21 CDIEAWLnSKTVSSPLNWERKIFSNCNFNMGRLMSFIQADSFGCNNIDASRLYGMCFGSITIDKFAIPNSRKVDLQVGKS 100
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
5X5C_C       445 GPISQFNYKQSFSNPTCLILATVPHNLTTITK------PLKYSYINKC--SRLLSDDRTEVPQLVNANQYSPC 509
Cdd:cd21508  101 GYLQSFNYKIDTAVSSCQLYYSLPAANVSVTHynpsswNRRYGFNNQSfgSRGLHDAVYSQQCFNTPNTYCPC 173
HKU1_N5-like_CoV_Spike_S1_RBD cd21482
receptor-binding domain of the S1 subunit of the Spike (S) protein from human coronavirus HKU1, ...
347-474 3.51e-14

receptor-binding domain of the S1 subunit of the Spike (S) protein from human coronavirus HKU1, isolate N5 and isolate N2; This group contains the receptor-binding domain (RBD) of the S1 subunit of the spike (S) protein from human coronavirus (CoV) HKU1, isolates N5 and N2. HKU1 is a human betacoronavirus that causes mild yet prevalent respiratory disease, and is related to the zoonotic SARS and MERS betacoronaviruses, which have high fatality rates and pandemic potential. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs use the C-domain to bind their receptors. Although a protein receptor has not yet been identified for HKU1, antibodies against the C-domain, but not those against the NTD, blocked HKU1 infection of cells, suggesting that the S1 C-domain is the primary HKU1 receptor-binding site. Due to the key role of the S protein RBD in viral attachment, it is the major target for antibody-mediated neutralization. This model corresponds to the S1 subunit C-domain that serves as the RBD for most CoVs, and most likely, for HKU1.


Pssm-ID: 394829  Cd Length: 304  Bit Score: 75.10  E-value: 3.51e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       347 SSFEAKPSGSVVEQAEGV-ECDFSPLLSGTP-PQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSL 424
Cdd:cd21482    1 SGFTVKPVATVYRRIPNLpDCDIDNWLNNVSvPSPLNWERRIFSNCNFNLSTLLRLVHVDSFSCNNLDKSKIFGSCFNSI 80
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|
5X5C_C       425 ILDYFSYPLSMKSDLSVSSAGPISQFNYKQSFSNPTCLILATVPHNLTTI 474
Cdd:cd21482   81 TVDKFAIPNRRRDDLQLGSSGFLQSSNYKIDISSSSCQLYYSLPLLNVTI 130
HCoV-OC43-like_Spike_S1_RBD cd21485
receptor-binding domain of the S1 subunit of the Spike (S) protein from human coronavirus OC43 ...
377-468 9.85e-14

receptor-binding domain of the S1 subunit of the Spike (S) protein from human coronavirus OC43 and related proteins; This group contains the receptor-binding domain (RBD) of the S1 subunit of the spike (S) protein from several betacoronaviruses including human coronavirus OC43 (HCoV-OC43) and bovine respiratory coronavirus (BCoV), among others. HCoV-OC43 is of zoonotic origin and is endemic in the human population, causing mild respiratory tract infections and possible severe complications or fatalities in young children, the elderly, and immunocompromised individuals. These viruses are related to the zoonotic SARS and MERS betacoronaviruses, which have high fatality rates and pandemic potential. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs use the C-domain to bind their receptors. It has been reported that HCoV-OC43 uses 9-O-acetyl-sialic acid (9-O-Ac-Sia) as a receptor, which is terminally linked to oligosaccharides decorating glycoproteins and gangliosides at the host cell surface. HCoV-OC43 appears to bind 9-O-Ac-Sia at the NTD. Due to the key role of the S protein RBD in viral attachment, it is the major target for antibody-mediated neutralization. This model corresponds to the S1 subunit C-domain that serves as the RBD for most CoVs.


Pssm-ID: 394832  Cd Length: 312  Bit Score: 73.89  E-value: 9.85e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       377 PQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFSYPLSMKSDLSVSSAGPISQFNYKQSF 456
Cdd:cd21485   33 PSPLNWERKTFSNCNFNMSSLMSFIQADSFTCNNIDAAKIYGMCFSSITIDKFAIPNGRKVDLQLGNLGYLQSFNYRIDT 112
                         90
                 ....*....|..
5X5C_C       457 SNPTCLILATVP 468
Cdd:cd21485  113 TATSCQLYYNLP 124
MHV-like_Spike_S1_RBD cd21484
receptor-binding domain of the S1 subunit of the Spike (S) protein from mouse hepatitis virus ...
347-475 1.39e-12

receptor-binding domain of the S1 subunit of the Spike (S) protein from mouse hepatitis virus and other rodent coronaviruses; This group contains the receptor-binding domain (RBD) of the S1 subunit of the spike (S) protein from mouse hepatitis virus (MHV) and other rodent coronaviruses. MHV is the most common viral pathogen in contemporary laboratory mouse colonies manifesting as a primary infection in the upper respiratory tract. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). MHV uses mouse carcinoembryonic antigen related cell adhesion molecule 1a (mCEACAM1a) as the receptor; the RBD of MHV is located at the NTD. Most CoVs, such as SARS-CoV and MERS-CoV, use the C-domain to bind their receptors. Due to the key role of the S protein RBD in viral attachment, it is the major target for antibody-mediated neutralization. This model corresponds to the S1 subunit C-domain that serves as the RBD for most CoVs.


Pssm-ID: 394831  Cd Length: 264  Bit Score: 69.53  E-value: 1.39e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       347 SSFEAKPSGSVVEQAEGV-ECDFSPLLSG-TPPQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSL 424
Cdd:cd21484    1 SGYTVQPVGVVYRRVPNLpDCKIEEWLTAkSVPSPLNWERKTFQNCNFNLSSLLRYVQAESLSCNNIDASKVYGMCFGSI 80
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|.
5X5C_C       425 ILDYFSYPLSMKSDLSVSSAGPISQFNYKQSFSNPTCLILATVPHNLTTIT 475
Cdd:cd21484   81 SIDKFAIPNSRRVDLQLGNSGFLQSFNYKIDTRATSCQLYYSLAQNNVTVN 131
CoV_S1_C pfam19209
Coronavirus spike glycoprotein S1, C-terminal; This entry represents a domain found at the ...
584-640 3.52e-08

Coronavirus spike glycoprotein S1, C-terminal; This entry represents a domain found at the C-terminus of the Coronavirus S1 protein. It is found across a range of alpha, beta and gamma coronaviruses. This small all beta stranded domain is known as subdomain 2 in the structure of the porcine epidemic diarrhea virus spike protein.


Pssm-ID: 437047 [Multi-domain]  Cd Length: 57  Bit Score: 51.08  E-value: 3.52e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
5X5C_C         584 GNCVEYSLYGVSGRGVFQNcTAVGVRQQRFVYDAYQNLVGY-YSDDGNYYCLRACVSV 640
Cdd:pfam19209    1 NVCTDYTIYGITGTGVIRE-TNSTIPSGLYYTSSSGDLLGFkNSTTGTVYSVTPCVSS 57
Fibritin_C pfam07921
Fibritin C-terminal region; This family features sequences bearing similarity to the ...
1288-1315 8.92e-08

Fibritin C-terminal region; This family features sequences bearing similarity to the C-terminal portion of the bacteriophage T4 protein fibritin. This protein is responsible for attachment of long tail fibres to virus particle, and forms the 'whiskers' or fibres on the neck of the virion. The region seen in this family contains an N-terminal coiled-coil portion and the C-terminal globular foldon domain (residues 457-486), which is essential for fibritin trimerization and folding. This domain consists of a beta-hairpin; three such hairpins come together in a beta-propeller-like arrangement in the trimer, which is stabilized by hydrogen bonds, salt bridges and hydrophobic interactions.


Pssm-ID: 254516 [Multi-domain]  Cd Length: 93  Bit Score: 51.30  E-value: 8.92e-08
                           10        20
                   ....*....|....*....|....*...
5X5C_C        1288 GSGYIPEAPRDGQAYVRKDGEWVLLSTF 1315
Cdd:pfam07921   66 ESGKIDDAPDDGRWYVRKDGAWVLLSSI 93
SARS-CoV-like_Spike_S1_NTD cd21624
N-terminal domain of the S1 subunit of the Spike (S) protein from Severe acute respiratory ...
214-333 3.49e-06

N-terminal domain of the S1 subunit of the Spike (S) protein from Severe acute respiratory syndrome coronavirus and related betacoronaviruses in the B lineage; This subfamily contains the N-terminal domain (NTD) of the S1 subunit of the Spike (S) proteins from betacoronaviruses in the sarbecovirus subgenera (B lineage), including the highly pathogenic human coronavirus (CoV), Severe acute respiratory syndrome (SARS) CoV, and SARS-CoV-2, also known as a 2019 novel coronavirus (2019-nCoV) or COVID-19 virus. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). Most CoVs, including SARS-CoV-2 and SARS-CoV, use the C-domain to bind their receptors. The S1 NTD contributes to the Spike trimer interface.


Pssm-ID: 394950  Cd Length: 280  Bit Score: 50.41  E-value: 3.49e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       214 YFNLRNCTFMYTYnitEDEILEWFGITQ--TAQGVHLFSSR-----------YVDLYGGNMFQFATL------PVYDTIK 274
Cdd:cd21624  142 YSNAFNCTYEYVS---QSFQLDVSEKNGnfKHLREFVFKNVdgflkvyhgyqPINVVRGLPSGFSVLkpifklPLGINIT 218
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       275 YYSIIPHSIRSIQSDRKAW-AAFYVYKLQPLTFLLDFSVDGYIRRAIDCGFNDLSQLHCS 333
Cdd:cd21624  219 NFRVVLTMFSPAQSNWTAGnAAYYVGYLKPTTFMLKFDENGTITDAVDCSQDPLAELKCT 278
wac PHA02607
fibritin; Provisional
1289-1316 4.46e-06

fibritin; Provisional


Pssm-ID: 177432 [Multi-domain]  Cd Length: 454  Bit Score: 50.79  E-value: 4.46e-06
                          10        20
                  ....*....|....*....|....*...
5X5C_C       1289 SGYIPEAPRDGQAYVRKDGEWVLLSTFL 1316
Cdd:PHA02607  424 AGKLDDAPSDGSWYVRKNGAWVEVSTGL 451
SARS-CoV-like_Spike_S1_RBD cd21477
receptor-binding domain of the S1 subunit of severe acute respiratory syndrome-related ...
366-453 3.80e-04

receptor-binding domain of the S1 subunit of severe acute respiratory syndrome-related coronavirus Spike (S) protein and similar proteins; This subfamily contains the receptor-binding domain of the S1 subunit of coronavirus (CoV) spike (S) proteins from highly pathogenic human virus, severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV), SARS coronavirus 2 (SARS-CoV-2), also known as 2019 novel coronavirus (2019-nCoV), and other SARS-like coronaviruses. The CoV S protein is an envelope glycoprotein that plays the most important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediates attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains a hydrophobic fusion peptide and two heptad repeat regions. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2 and SARS-CoV use the C-domain to bind their receptors. Recent studies found that the receptor-binding domain (RBD) of SARS-CoV-2 S protein binds strongly to human and bat angiotensin-converting enzyme 2 (ACE2) receptors. Moreover, SARS-CoV-2 RBD exhibited significantly higher binding affinity to the ACE2 receptor than SARS-CoV RBD. Due to the key role of the S protein RBD in viral attachment, it is the major target for antibody-mediated neutralization. This model corresponds to the S1 subunit C-domain that serves as the RBD for SARS-CoV, SARS-CoV-2, and most CoVs.


Pssm-ID: 394824  Cd Length: 205  Bit Score: 43.27  E-value: 3.80e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
5X5C_C       366 CDFSPLLSGTP-PQVYNFKRLVFTNCNYNLTKLLSLFSVNDFTCSQISPAAIASNCYSSLILDYFSYPLSMKSDLSVSSA 444
Cdd:cd21477   18 CPFDKVFNATRfPSVYAWERTKISDCVADYTVLYNSTSFSTFKCYGVSPSKLIDLCFTSVYADTFLIRGSEVRQVAPGQT 97

                 ....*....
5X5C_C       445 GPISQFNYK 453
Cdd:cd21477   98 GVIADYNYK 106
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH