NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1904824917|gb|QNS29861|]
View 

Glycoprotein precursor [Kasokero virus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Hanta_G2 pfam01561
Hantavirus/Nairovirus glycoprotein G2; The medium (M) genome segment of hantaviruses (family ...
851-1353 1.97e-95

Hantavirus/Nairovirus glycoprotein G2; The medium (M) genome segment of hantaviruses (family Bunyaviridae) encodes the two virion glycoproteins G1 and G2, as a polyprotein precursor. This entry represents the polyprotein region which forms the G2 glycoprotein. The N-terminal region has a conserved CNP motif, suggested to be an integrin-binding motif.


:

Pssm-ID: 460253  Cd Length: 457  Bit Score: 315.57  E-value: 1.97e-95
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  851 SAKHISLSWESSKLVGNRVVLSGKSTAILKLNPRTSTSWEMSSPDANEKKVLTLSILDYTQIYSSRFEYITGDRKVSTW- 929
Cdd:pfam01561    1 SETPLTPVWNDNAHGVGSVPMHTDLELDFSLTSSSKYTYRRKLTNPLEEAQSIDLHIEIEEQTIGVDVHALGHWFDGRLn 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  930 --SEGSCTGPCPKN-CGCDDPSC-HTREWLNSRNWRCNPTWCWGIGTGCSCCSAQVVDLyeSWLVSIWQIEHIRTPVVAC 1005
Cdd:pfam01561   81 lkTSFHCYGACTKYeYPWHTAKChYERDYQYETSWGCNPSDCPGVGTGCTACGLYLDQL--KPVGSAYKIITIRYSRRVC 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917 1006 LEFDHEnRVCEVVEAGIEIQLGPVTVAFSDPFGEQKLLPKRIAVYHKRDSDHehvdllhnhgigGAEQYCKlQSCTHGTA 1085
Cdd:pfam01561  159 VQFGEE-NLCKIIDMNDCFVSRHVKVCIIGTVSKFSQGDTLLFFGPLEGGGL------------IFKHWCT-STCQFGDP 224
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917 1086 GDYQILNPDALVFDDitSINYFKKIDAA-NKVWMSWEGVNLGYY----CNPGDWTTCTAENVVVR-NSEAFQNRNNLERN 1159
Cdd:pfam01561  225 GDIMSPRDKGFLCPE--FPGSFRKKCNFaTTPICEYDGNMVSGYkkvmATIDSFQSFNTSTMHFTdERIEWKDPDGMLRD 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917 1160 YSVSHFFHSSRIYGAGKT-LSMDLKGRPV------QSGGNINVYITVNNLELNSKKVNLQGIKValktcsgCFGCNLGAE 1232
Cdd:pfam01561  303 HINILVTKDIDFDNLGENpCKIGLQTSSIegawgsGVGFTLTCLVSLTECPTFLTSIKACDKAI-------CYGAESVTL 375
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917 1233 CQVSLAISepeefhLHLRSRTPGVtvpDTSFlvisgeekvfriRVFSIQKTGNFCVEILESKHCPDCKGEDLVSCVglel 1312
Cdd:pfam01561  376 TRGQNTVK------VSGKGGHSGS---TFRC------------CHGEDCSQIGLHAAAPHLDKVNGISEIENSKVY---- 430
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|.
gi 1904824917 1313 edpkpvllehrsvlfskSNQTCEGGViSCWAGSAGNLFKGI 1353
Cdd:pfam01561  431 -----------------DDGAPQCGI-KCWFVKSGEWISGI 453
Nairovirus_M super family cl06817
Nairovirus M polyprotein-like; The sequences in this family are similar to the Dugbe virus M ...
424-762 2.41e-24

Nairovirus M polyprotein-like; The sequences in this family are similar to the Dugbe virus M polyprotein precursor, which includes glycoproteins G1 and G2. Both are thought to be inserted in the membrane of the Golgi complex of the infected host cell, and G1 is known to have a role in infection of vertebrate hosts.


The actual alignment was detected with superfamily member pfam07948:

Pssm-ID: 285223  Cd Length: 657  Bit Score: 110.10  E-value: 2.41e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  424 GRRRLLGLEKrkrplkrQDIRCHSGSHLVKVKKYAATNEYKSFPDAKVGFCNDSMITNLPIGHEFGCYRVGSVSTHIQCK 503
Cdd:pfam07948  286 GRRKLLAVED-------GSDDCISGTQLIEGESAEIHGDDIGGPGDKITICNGSSILDQPLDEEHGCYTINRIRSFKACE 358
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  504 PYHNAydgeKDCNVTST-EDCEQDMlCAEVKLNGQGIVTArTQLGEVQLKHCLEKCHFSFKKVNDlEVYFTCPDGKQHRL 582
Cdd:pfam07948  359 NKASG----KNCEIDKElKKCDQGK-CLRISQEGAGHIKL-SRGKEILIDACDEHCEIMIPKGKG-DILVDCSGGQQHFL 431
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  583 HSNAIDANCPFQKHLGKYALYACRATHRPIMLYTVIAWLTVGVMALSITLQVLSLLIRTYCYLVICVKAKLDRGKGKCPS 662
Cdd:pfam07948  432 EDNIIDIGCPKIKFLGGIAIYFCRMSNHPKTALAFGFWFGCGYIITCIFCFAIFHLIIFFANCGKQCKKKGELKGEICTI 511
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  663 CNDM-VNSSEEWQRHQNCKRGKCPYCGTKGSEIDLRKHANVCLQKETVLEHDANVLNIRRTPRLALRLGCLVNSLQGKPT 741
Cdd:pfam07948  512 CEQQpVNAIDAELHDLNCNFNICPYCANRLSDDGLARHVGKCPKRKEKLEEIELYLNLEECPLCLRKCLQLLESTGIALK 591
                          330       340
                   ....*....|....*....|.
gi 1904824917  742 RLTWFVVLLCLFCLLIRPVSS 762
Cdd:pfam07948  592 RSSWLIVLLVLFGLAISPVQG 612
PHA03307 super family cl33723
transcriptional regulator ICP4; Provisional
23-180 8.74e-06

transcriptional regulator ICP4; Provisional


The actual alignment was detected with superfamily member PHA03307:

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.55  E-value: 8.74e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917   23 VAADSGGNVTEVSPTQGKANSTGSAEGSSTTPSlgpKEGANSSEPVRRNSSESTTVSDSqmggnktGVSTKQPTPPAEVE 102
Cdd:PHA03307   269 IWEASGWNGPSSRPGPASSSSSPRERSPSPSPS---SPGSGPAPSSPRASSSSSSSRES-------SSSSTSSSSESSRG 338
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1904824917  103 TATPPAALQMTNGSDSgktTTSGPTTAPTRKKGGRLGIRGGSAAVRAVSRVLARVRSGERNLVISREAGGDLQQAREA 180
Cdd:PHA03307   339 AAVSPGPSPSRSPSPS---RPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPR 413
 
Name Accession Description Interval E-value
Hanta_G2 pfam01561
Hantavirus/Nairovirus glycoprotein G2; The medium (M) genome segment of hantaviruses (family ...
851-1353 1.97e-95

Hantavirus/Nairovirus glycoprotein G2; The medium (M) genome segment of hantaviruses (family Bunyaviridae) encodes the two virion glycoproteins G1 and G2, as a polyprotein precursor. This entry represents the polyprotein region which forms the G2 glycoprotein. The N-terminal region has a conserved CNP motif, suggested to be an integrin-binding motif.


Pssm-ID: 460253  Cd Length: 457  Bit Score: 315.57  E-value: 1.97e-95
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  851 SAKHISLSWESSKLVGNRVVLSGKSTAILKLNPRTSTSWEMSSPDANEKKVLTLSILDYTQIYSSRFEYITGDRKVSTW- 929
Cdd:pfam01561    1 SETPLTPVWNDNAHGVGSVPMHTDLELDFSLTSSSKYTYRRKLTNPLEEAQSIDLHIEIEEQTIGVDVHALGHWFDGRLn 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  930 --SEGSCTGPCPKN-CGCDDPSC-HTREWLNSRNWRCNPTWCWGIGTGCSCCSAQVVDLyeSWLVSIWQIEHIRTPVVAC 1005
Cdd:pfam01561   81 lkTSFHCYGACTKYeYPWHTAKChYERDYQYETSWGCNPSDCPGVGTGCTACGLYLDQL--KPVGSAYKIITIRYSRRVC 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917 1006 LEFDHEnRVCEVVEAGIEIQLGPVTVAFSDPFGEQKLLPKRIAVYHKRDSDHehvdllhnhgigGAEQYCKlQSCTHGTA 1085
Cdd:pfam01561  159 VQFGEE-NLCKIIDMNDCFVSRHVKVCIIGTVSKFSQGDTLLFFGPLEGGGL------------IFKHWCT-STCQFGDP 224
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917 1086 GDYQILNPDALVFDDitSINYFKKIDAA-NKVWMSWEGVNLGYY----CNPGDWTTCTAENVVVR-NSEAFQNRNNLERN 1159
Cdd:pfam01561  225 GDIMSPRDKGFLCPE--FPGSFRKKCNFaTTPICEYDGNMVSGYkkvmATIDSFQSFNTSTMHFTdERIEWKDPDGMLRD 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917 1160 YSVSHFFHSSRIYGAGKT-LSMDLKGRPV------QSGGNINVYITVNNLELNSKKVNLQGIKValktcsgCFGCNLGAE 1232
Cdd:pfam01561  303 HINILVTKDIDFDNLGENpCKIGLQTSSIegawgsGVGFTLTCLVSLTECPTFLTSIKACDKAI-------CYGAESVTL 375
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917 1233 CQVSLAISepeefhLHLRSRTPGVtvpDTSFlvisgeekvfriRVFSIQKTGNFCVEILESKHCPDCKGEDLVSCVglel 1312
Cdd:pfam01561  376 TRGQNTVK------VSGKGGHSGS---TFRC------------CHGEDCSQIGLHAAAPHLDKVNGISEIENSKVY---- 430
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|.
gi 1904824917 1313 edpkpvllehrsvlfskSNQTCEGGViSCWAGSAGNLFKGI 1353
Cdd:pfam01561  431 -----------------DDGAPQCGI-KCWFVKSGEWISGI 453
Nairovirus_M pfam07948
Nairovirus M polyprotein-like; The sequences in this family are similar to the Dugbe virus M ...
424-762 2.41e-24

Nairovirus M polyprotein-like; The sequences in this family are similar to the Dugbe virus M polyprotein precursor, which includes glycoproteins G1 and G2. Both are thought to be inserted in the membrane of the Golgi complex of the infected host cell, and G1 is known to have a role in infection of vertebrate hosts.


Pssm-ID: 285223  Cd Length: 657  Bit Score: 110.10  E-value: 2.41e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  424 GRRRLLGLEKrkrplkrQDIRCHSGSHLVKVKKYAATNEYKSFPDAKVGFCNDSMITNLPIGHEFGCYRVGSVSTHIQCK 503
Cdd:pfam07948  286 GRRKLLAVED-------GSDDCISGTQLIEGESAEIHGDDIGGPGDKITICNGSSILDQPLDEEHGCYTINRIRSFKACE 358
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  504 PYHNAydgeKDCNVTST-EDCEQDMlCAEVKLNGQGIVTArTQLGEVQLKHCLEKCHFSFKKVNDlEVYFTCPDGKQHRL 582
Cdd:pfam07948  359 NKASG----KNCEIDKElKKCDQGK-CLRISQEGAGHIKL-SRGKEILIDACDEHCEIMIPKGKG-DILVDCSGGQQHFL 431
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  583 HSNAIDANCPFQKHLGKYALYACRATHRPIMLYTVIAWLTVGVMALSITLQVLSLLIRTYCYLVICVKAKLDRGKGKCPS 662
Cdd:pfam07948  432 EDNIIDIGCPKIKFLGGIAIYFCRMSNHPKTALAFGFWFGCGYIITCIFCFAIFHLIIFFANCGKQCKKKGELKGEICTI 511
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  663 CNDM-VNSSEEWQRHQNCKRGKCPYCGTKGSEIDLRKHANVCLQKETVLEHDANVLNIRRTPRLALRLGCLVNSLQGKPT 741
Cdd:pfam07948  512 CEQQpVNAIDAELHDLNCNFNICPYCANRLSDDGLARHVGKCPKRKEKLEEIELYLNLEECPLCLRKCLQLLESTGIALK 591
                          330       340
                   ....*....|....*....|.
gi 1904824917  742 RLTWFVVLLCLFCLLIRPVSS 762
Cdd:pfam07948  592 RSSWLIVLLVLFGLAISPVQG 612
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
23-180 8.74e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.55  E-value: 8.74e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917   23 VAADSGGNVTEVSPTQGKANSTGSAEGSSTTPSlgpKEGANSSEPVRRNSSESTTVSDSqmggnktGVSTKQPTPPAEVE 102
Cdd:PHA03307   269 IWEASGWNGPSSRPGPASSSSSPRERSPSPSPS---SPGSGPAPSSPRASSSSSSSRES-------SSSSTSSSSESSRG 338
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1904824917  103 TATPPAALQMTNGSDSgktTTSGPTTAPTRKKGGRLGIRGGSAAVRAVSRVLARVRSGERNLVISREAGGDLQQAREA 180
Cdd:PHA03307   339 AAVSPGPSPSRSPSPS---RPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPR 413
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
22-131 5.90e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 40.89  E-value: 5.90e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917   22 TVAADSGGNVTEVSPTQGKANSTGSAegSSTTPSLGPKEGANSSEPVRRNSSESTTVSDSQMGGNKTGVSTKQPTPPAEV 101
Cdd:COG3469     98 ATSTASGANTGTSTVTTTSTGAGSVT--STTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSA 175
                           90       100       110
                   ....*....|....*....|....*....|
gi 1904824917  102 eTATPPAALQMTNGSDSGKTTTSGPTTAPT 131
Cdd:COG3469    176 -STTPSATTTATATTASGATTPSATTTATT 204
 
Name Accession Description Interval E-value
Hanta_G2 pfam01561
Hantavirus/Nairovirus glycoprotein G2; The medium (M) genome segment of hantaviruses (family ...
851-1353 1.97e-95

Hantavirus/Nairovirus glycoprotein G2; The medium (M) genome segment of hantaviruses (family Bunyaviridae) encodes the two virion glycoproteins G1 and G2, as a polyprotein precursor. This entry represents the polyprotein region which forms the G2 glycoprotein. The N-terminal region has a conserved CNP motif, suggested to be an integrin-binding motif.


Pssm-ID: 460253  Cd Length: 457  Bit Score: 315.57  E-value: 1.97e-95
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  851 SAKHISLSWESSKLVGNRVVLSGKSTAILKLNPRTSTSWEMSSPDANEKKVLTLSILDYTQIYSSRFEYITGDRKVSTW- 929
Cdd:pfam01561    1 SETPLTPVWNDNAHGVGSVPMHTDLELDFSLTSSSKYTYRRKLTNPLEEAQSIDLHIEIEEQTIGVDVHALGHWFDGRLn 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  930 --SEGSCTGPCPKN-CGCDDPSC-HTREWLNSRNWRCNPTWCWGIGTGCSCCSAQVVDLyeSWLVSIWQIEHIRTPVVAC 1005
Cdd:pfam01561   81 lkTSFHCYGACTKYeYPWHTAKChYERDYQYETSWGCNPSDCPGVGTGCTACGLYLDQL--KPVGSAYKIITIRYSRRVC 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917 1006 LEFDHEnRVCEVVEAGIEIQLGPVTVAFSDPFGEQKLLPKRIAVYHKRDSDHehvdllhnhgigGAEQYCKlQSCTHGTA 1085
Cdd:pfam01561  159 VQFGEE-NLCKIIDMNDCFVSRHVKVCIIGTVSKFSQGDTLLFFGPLEGGGL------------IFKHWCT-STCQFGDP 224
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917 1086 GDYQILNPDALVFDDitSINYFKKIDAA-NKVWMSWEGVNLGYY----CNPGDWTTCTAENVVVR-NSEAFQNRNNLERN 1159
Cdd:pfam01561  225 GDIMSPRDKGFLCPE--FPGSFRKKCNFaTTPICEYDGNMVSGYkkvmATIDSFQSFNTSTMHFTdERIEWKDPDGMLRD 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917 1160 YSVSHFFHSSRIYGAGKT-LSMDLKGRPV------QSGGNINVYITVNNLELNSKKVNLQGIKValktcsgCFGCNLGAE 1232
Cdd:pfam01561  303 HINILVTKDIDFDNLGENpCKIGLQTSSIegawgsGVGFTLTCLVSLTECPTFLTSIKACDKAI-------CYGAESVTL 375
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917 1233 CQVSLAISepeefhLHLRSRTPGVtvpDTSFlvisgeekvfriRVFSIQKTGNFCVEILESKHCPDCKGEDLVSCVglel 1312
Cdd:pfam01561  376 TRGQNTVK------VSGKGGHSGS---TFRC------------CHGEDCSQIGLHAAAPHLDKVNGISEIENSKVY---- 430
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|.
gi 1904824917 1313 edpkpvllehrsvlfskSNQTCEGGViSCWAGSAGNLFKGI 1353
Cdd:pfam01561  431 -----------------DDGAPQCGI-KCWFVKSGEWISGI 453
Nairovirus_M pfam07948
Nairovirus M polyprotein-like; The sequences in this family are similar to the Dugbe virus M ...
424-762 2.41e-24

Nairovirus M polyprotein-like; The sequences in this family are similar to the Dugbe virus M polyprotein precursor, which includes glycoproteins G1 and G2. Both are thought to be inserted in the membrane of the Golgi complex of the infected host cell, and G1 is known to have a role in infection of vertebrate hosts.


Pssm-ID: 285223  Cd Length: 657  Bit Score: 110.10  E-value: 2.41e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  424 GRRRLLGLEKrkrplkrQDIRCHSGSHLVKVKKYAATNEYKSFPDAKVGFCNDSMITNLPIGHEFGCYRVGSVSTHIQCK 503
Cdd:pfam07948  286 GRRKLLAVED-------GSDDCISGTQLIEGESAEIHGDDIGGPGDKITICNGSSILDQPLDEEHGCYTINRIRSFKACE 358
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  504 PYHNAydgeKDCNVTST-EDCEQDMlCAEVKLNGQGIVTArTQLGEVQLKHCLEKCHFSFKKVNDlEVYFTCPDGKQHRL 582
Cdd:pfam07948  359 NKASG----KNCEIDKElKKCDQGK-CLRISQEGAGHIKL-SRGKEILIDACDEHCEIMIPKGKG-DILVDCSGGQQHFL 431
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  583 HSNAIDANCPFQKHLGKYALYACRATHRPIMLYTVIAWLTVGVMALSITLQVLSLLIRTYCYLVICVKAKLDRGKGKCPS 662
Cdd:pfam07948  432 EDNIIDIGCPKIKFLGGIAIYFCRMSNHPKTALAFGFWFGCGYIITCIFCFAIFHLIIFFANCGKQCKKKGELKGEICTI 511
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  663 CNDM-VNSSEEWQRHQNCKRGKCPYCGTKGSEIDLRKHANVCLQKETVLEHDANVLNIRRTPRLALRLGCLVNSLQGKPT 741
Cdd:pfam07948  512 CEQQpVNAIDAELHDLNCNFNICPYCANRLSDDGLARHVGKCPKRKEKLEEIELYLNLEECPLCLRKCLQLLESTGIALK 591
                          330       340
                   ....*....|....*....|.
gi 1904824917  742 RLTWFVVLLCLFCLLIRPVSS 762
Cdd:pfam07948  592 RSSWLIVLLVLFGLAISPVQG 612
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
23-180 8.74e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.55  E-value: 8.74e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917   23 VAADSGGNVTEVSPTQGKANSTGSAEGSSTTPSlgpKEGANSSEPVRRNSSESTTVSDSqmggnktGVSTKQPTPPAEVE 102
Cdd:PHA03307   269 IWEASGWNGPSSRPGPASSSSSPRERSPSPSPS---SPGSGPAPSSPRASSSSSSSRES-------SSSSTSSSSESSRG 338
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1904824917  103 TATPPAALQMTNGSDSgktTTSGPTTAPTRKKGGRLGIRGGSAAVRAVSRVLARVRSGERNLVISREAGGDLQQAREA 180
Cdd:PHA03307   339 AAVSPGPSPSRSPSPS---RPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPR 413
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
22-131 5.90e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 40.89  E-value: 5.90e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917   22 TVAADSGGNVTEVSPTQGKANSTGSAegSSTTPSLGPKEGANSSEPVRRNSSESTTVSDSQMGGNKTGVSTKQPTPPAEV 101
Cdd:COG3469     98 ATSTASGANTGTSTVTTTSTGAGSVT--STTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSA 175
                           90       100       110
                   ....*....|....*....|....*....|
gi 1904824917  102 eTATPPAALQMTNGSDSGKTTTSGPTTAPT 131
Cdd:COG3469    176 -STTPSATTTATATTASGATTPSATTTATT 204
Bunya_G1 pfam03557
Bunyavirus glycoprotein G1; Bunyavirus has three genomic segments: small (S), middle-sized (M), ...
885-1049 9.61e-03

Bunyavirus glycoprotein G1; Bunyavirus has three genomic segments: small (S), middle-sized (M), and large (L). The S segment encodes the nucleocapsid and a non-structural protein. The M segment codes for two glycoproteins, G1 and G2, and another non-structural protein (NSm). The L segment codes for an RNA polymerase. This family contains the G1 glycoprotein which is the viral attachment protein.


Pssm-ID: 281546  Cd Length: 879  Bit Score: 40.39  E-value: 9.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  885 TSTSWEMSSPDAneKKVLTLSILDYTQIYSSRFE--YITGDR-KVSTWSEGSCTGPCPKNCGcddpscHTREWLN----- 956
Cdd:pfam03557  490 TSIGFKINSKDG--FHLLDLIAFVKSASIKSTYNhiYDTGPTiGINTKHDEHCTGPCPANIE------HEAGWLTfaqer 561
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1904824917  957 SRNWRCNPTWCWGIGTGCSCCSAQVVdLYESWLVSIWQIEHIrTPVVACLEF-DHE-----NRVCEVVEAGIEIQLGPVt 1030
Cdd:pfam03557  562 TSRWGCEEFGCLAVNDGCVFGSCQDI-IKEELKVYRKATDEV-TDLELCITFpDHTfcteiNAIEPIITDEFELQFKTV- 638
                          170
                   ....*....|....*....
gi 1904824917 1031 vafsdpfgEQKLLPKRIAV 1049
Cdd:pfam03557  639 --------DTKSLPRIVAV 649
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH