|
Name |
Accession |
Description |
Interval |
E-value |
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
387-550 |
2.32e-29 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation. :
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 116.73 E-value: 2.32e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 387 WTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCTYTLLQSpqLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVI 466
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD--CSSEPTFSVLLKNVPCGGGATCLKSVKVELNGDEIEL 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 467 SEDEV-ITNNGDTKLLPYKTHNITI-FRQTSTHLQMATTFGLELVfQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDD 544
Cdd:smart00216 79 KDDNGkVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGLIQV-TFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDD 157
|
....*.
gi 1907182184 545 FTTSMG 550
Cdd:smart00216 158 FRTPDG 163
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
1058-1130 |
3.53e-28 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. :
Pssm-ID: 214843 Cd Length: 76 Bit Score: 110.12 E-value: 3.53e-28
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184 1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130
Cdd:smart00832 3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
857-1019 |
3.83e-28 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation. :
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 113.27 E-value: 3.83e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 857 WTCqLSTQCPSTCVLYGEGHIITFDGQRFVFDGDCEYMLATDdcgaNSSQPTFKVLTENVICGkSGVTCSRAIKISLGGL 936
Cdd:smart00216 1 WCC-TQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGD 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 937 FITMADSN--YTVSGE-------EPLVHLKVKPSPLNLVLdidIPGRLNLTLVWNKHMSVSIKIrRATQQDALCGLCGNA 1007
Cdd:smart00216 75 EIELKDDNgkVTVNGQqvslpykTSDGSIQIRSSGGYLVV---ITSLGLIQVTFDGLTLLSVQL-PSKYRGKTCGLCGNF 150
|
170
....*....|..
gi 1907182184 1008 NGNMKDDFETRS 1019
Cdd:smart00216 151 DGEPEDDFRTPD 162
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
45-194 |
2.14e-25 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods. :
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 105.15 E-value: 2.14e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN----ISRIIMELGASVVTVNKETISVRDIG 120
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGAsgvcLKSVTVIVGDLEITLQKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 121 VVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDGK 194
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
589-661 |
1.13e-21 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. :
Pssm-ID: 214843 Cd Length: 76 Bit Score: 91.63 E-value: 1.13e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 589 AETHCSMLLKKGSVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:smart00832 4 ACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP-TFC 75
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
303-358 |
2.00e-16 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. :
Pssm-ID: 460351 Cd Length: 55 Bit Score: 75.89 E-value: 2.00e-16
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC 55
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
765-828 |
3.23e-13 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation. :
Pssm-ID: 410995 Cd Length: 55 Bit Score: 66.57 E-value: 3.23e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:cd19941 1 CPPNEVYSEC--------GSACPPTCANPNAPPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
246-298 |
9.20e-11 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826. :
Pssm-ID: 462584 Cd Length: 68 Bit Score: 60.09 E-value: 9.20e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALC 298
Cdd:pfam08742 18 VDPEPYFEACVYDM--CSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
|
|
| CT |
smart00041 |
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ... |
4164-4243 |
3.91e-10 |
|
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers. :
Pssm-ID: 214482 Cd Length: 82 Bit Score: 58.95 E-value: 3.91e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 4164 EHQITYQGCVAN-VTLTRCQGFCASSVSFNkdTLQLESSCGCCQPLSTYKKQLSLPCPDpdapGQQLTLTLQVFSSCVCS 4242
Cdd:smart00041 5 RQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPD----GSTVKKTVMHIEECGCE 78
|
.
gi 1907182184 4243 P 4243
Cdd:smart00041 79 P 79
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
3637-4059 |
1.70e-08 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.49 E-value: 1.70e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3637 TVTPTQPTP---------------IPATTNSPMTTVGLTGTPvvHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFR 3701
Cdd:PHA03247 2567 SVPPPRPAPrpsepavtsrarrpdAPPQSARPRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP 2644
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3702 TSEQSTTTFPTPSAPQTSLV--TSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPvstilqttievTTPPNTSTPVTH 3779
Cdd:PHA03247 2645 TVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-----------PPPPPTPEPAPH 2713
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3780 S-TSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAIS 3858
Cdd:PHA03247 2714 AlVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3859 RTTGISGTPFRTPMKTTITFPTPSSLQTSM-ATLFPPfstsvmssteifntPTNPHSVSSASTSRPLSTSLPT------- 3930
Cdd:PHA03247 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPP--------------PTSAQPTAPPPPPGPPPPSLPLggsvapg 2859
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3931 -TIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTAS 4009
Cdd:PHA03247 2860 gDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 4010 PPSSAPTfvSPTAASTVISSALPTI---HMTP---------TPSSRPTSSTGLLSTSKTTSH 4059
Cdd:PHA03247 2940 QPPLAPT--TDPAGAGEPSGAVPQPwlgALVPgrvavprfrVPQPAPSREAPASSTPPLTGH 2999
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
665-722 |
1.89e-07 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation. :
Pssm-ID: 410995 Cd Length: 55 Bit Score: 50.39 E-value: 1.89e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 665 CTGNRTFSYDSQACDRTCLSLsDRETEChvSPVPVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| VWC_out |
smart00215 |
von Willebrand factor (vWF) type C domain; |
360-404 |
2.89e-06 |
|
von Willebrand factor (vWF) type C domain; :
Pssm-ID: 214565 Cd Length: 67 Bit Score: 47.56 E-value: 2.89e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 1907182184 360 CMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPC-PGHCSLEGGS 404
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCgPKPCLLHNLS 46
|
|
| Chi1 super family |
cl43877 |
Chitinase [Carbohydrate transport and metabolism]; |
3181-3381 |
3.24e-06 |
|
Chitinase [Carbohydrate transport and metabolism]; The actual alignment was detected with superfamily member COG3469:
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 53.22 E-value: 3.24e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3181 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3260
Cdd:COG3469 28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3261 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3340
Cdd:COG3469 108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATT 183
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1907182184 3341 PFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 3381
Cdd:COG3469 184 TATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Chi1 super family |
cl43877 |
Chitinase [Carbohydrate transport and metabolism]; |
1657-1878 |
3.41e-06 |
|
Chitinase [Carbohydrate transport and metabolism]; The actual alignment was detected with superfamily member COG3469:
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 53.22 E-value: 3.41e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1657 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 1736
Cdd:COG3469 7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1737 HPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaITNSLMTTGGLTG 1816
Cdd:COG3469 87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 1817 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 1878
Cdd:COG3469 163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Herpes_BLLF1 super family |
cl37540 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1727-2176 |
5.11e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo. The actual alignment was detected with superfamily member pfam05109:
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 53.00 E-value: 5.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1727 TSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSThlplfstlsvtptteglntQSTPIPAitn 1806
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD-------------------VTSPTPA--- 479
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1807 slMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSS 1886
Cdd:pfam05109 480 --GTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTT 550
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1887 ----VTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTP-TSAPHLS 1961
Cdd:pfam05109 551 ptpnATSPTPAVTTPTPN------ATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvVTSPPKN 624
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1962 ETSAVTAHQSTPTAVSANSikptMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTH 2041
Cdd:pfam05109 625 ATSAVTTGQHNITSSSTSS----MSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPA 700
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2042 SGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFST 2121
Cdd:pfam05109 701 PRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTT 780
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 2122 DRTSTphlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR 2176
Cdd:pfam05109 781 DYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| Chi1 super family |
cl43877 |
Chitinase [Carbohydrate transport and metabolism]; |
2576-2766 |
1.31e-05 |
|
Chitinase [Carbohydrate transport and metabolism]; The actual alignment was detected with superfamily member COG3469:
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 51.29 E-value: 1.31e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2576 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPT 2655
Cdd:COG3469 38 TATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTST 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2656 HSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 2735
Cdd:COG3469 118 GAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGA 193
|
170 180 190
....*....|....*....|....*....|.
gi 1907182184 2736 KHTTgvsletsvqTTIASPTPSAPQTSLATH 2766
Cdd:COG3469 194 TTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Chi1 super family |
cl43877 |
Chitinase [Carbohydrate transport and metabolism]; |
2305-2516 |
1.54e-05 |
|
Chitinase [Carbohydrate transport and metabolism]; The actual alignment was detected with superfamily member COG3469:
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.91 E-value: 1.54e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2305 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHS 2384
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2385 GPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 2464
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATG 161
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 2465 TTGVSLETSVQTTIASPTPSAPQTSLATHLP--FSSTSAVTPTSEVIITPTPQH 2516
Cdd:COG3469 162 GTTTTSTTTTTTSASTTPSATTTATATTASGatTPSATTTATTTGPPTPGLPKH 215
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
2646-3101 |
5.35e-05 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.94 E-value: 5.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2646 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTS--------GTTSS 2717
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2718 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 2791
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2792 ssASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKP 2871
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRE 2796
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2872 TMSSTGTPVVHTTSGTTSSPQTPRTTHPSttvavsgtvhtTGLPSGTSVHTTTNFPTHSGPQSSLSTH---LPLFSTLSV 2948
Cdd:PHA03247 2797 SLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDVRRR 2865
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2949 TPTTEGLNTPTSPHSLSVASTSMPlmTVLPTTLEGTRPPHTSVPVTYTTTAATQTkssfSTDRTSAPHLSQPSTVTPTQS 3028
Cdd:PHA03247 2866 PPSRSPAAKPAAPARPPVRRLARP--AVSRSTESFALPPDQPERPPQPQAPPPPQ----PQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3029 TPIPATTNSLMTTGGLTGTPP-----------VHTTSGTTSSPQTPRTTHPFSTvavsNTKHTTGVSLETSVQTTIASPT 3097
Cdd:PHA03247 2940 QPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrVAVPRFRVPQPAPSREAPASST----PPLTGHSLSRVSSWASSLALHE 3015
|
....
gi 1907182184 3098 PSAP 3101
Cdd:PHA03247 3016 ETDP 3019
|
|
| Herpes_BLLF1 super family |
cl37540 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
3330-3788 |
1.25e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo. The actual alignment was detected with superfamily member pfam05109:
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 44.91 E-value: 1.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3330 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 3409
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3410 STSTTTGnilPTTIGQTGSPHTSVPVIYTTS----AITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIK 3485
Cdd:pfam05109 502 KAPDMTS---PTSAVTTPTPNATSPTPAVTTptpnATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTS 578
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3486 PTmSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTP 3565
Cdd:pfam05109 579 PT-SAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSP 657
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3566 TTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQPTP 3645
Cdd:pfam05109 658 STSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATS 737
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3646 --IPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSttlSTAPQFRTSEQSTTTFPTPSAPQTSLVTS 3723
Cdd:pfam05109 738 pqAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFT 814
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182184 3724 LPPFSTSS----VSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQ 3788
Cdd:pfam05109 815 SPPVTTAQatvpVPPTSQPRFSNLSMLVLQWASLAVLTLLLLLVMADCAFRRNLSTSHTYTTPPYDDAE 883
|
|
| ROM1 super family |
cl34999 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
1407-1625 |
4.66e-03 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms]; The actual alignment was detected with superfamily member COG5422:
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 43.34 E-value: 4.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1407 STGPPLGTSVQTTINFPTLSAPQTSLVT-----PHPGLSSSSTALTSEILKTPTSSQMVSSASPQT-IFSSIHPKTTLEA 1480
Cdd:COG5422 28 SKQLLPPRRLQRKLNPISIRNGADNDIInseskESFGKYALGHQIFSSFSSSPKLFQRRNSAGPIThSPSATSSTSSLNS 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1481 TTPQHTAPLITSITSSITQAQSSFSTDKTYTSQHS------QPSTMTaHQSRSLPTVTTSTKSTMGLTG---TPPVHTTS 1551
Cdd:COG5422 108 NDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQkrknplLPSSST-HGTHPPIVFTDNNGSHAGAPNarsRKEIPSLG 186
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182184 1552 GTTSSPQTPRTTHPFSTVAVSNT---KHTTGVSLETSvqttiaSPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPT 1625
Cdd:COG5422 187 SQSMQLPSPHFRQKFSSSDTSNGfsyPSIRKNSRHSS------NSMPSFPHSSTAVLLKRHSGSSGASLISSNITPS 257
|
|
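The TIL descriptions in the listing above give the disulfide pattern as cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. As a minimal sketch, the Python below applies that pattern to the query segment 303-358 from the pfam01826 alignment. It assumes that the ten cysteines in that segment correspond, in order, to the ten cysteines of the family; the listing does not state this, so the resulting pairs are illustrative rather than confirmed bonds. The function names are hypothetical.

```python
# Disulfide pattern quoted in the TIL domain descriptions (1-based cysteine ranks).
TIL_PAIRING = [(1, 7), (2, 6), (3, 5), (4, 10), (8, 9)]


def cysteine_positions(aligned_segment, start):
    """Return protein coordinates of every cysteine in an aligned query row.

    Gap characters ('-') are dropped; lowercase residues are kept because
    they are still residues of the query and therefore consume coordinates.
    """
    residues = aligned_segment.replace("-", "").upper()
    return [start + i for i, aa in enumerate(residues) if aa == "C"]


def disulfide_pairs(positions, pairing=TIL_PAIRING):
    """Map rank-based pairs onto coordinates (assumes exactly ten cysteines)."""
    if len(positions) != 10:
        raise ValueError(f"expected 10 cysteines, found {len(positions)}")
    return [(positions[a - 1], positions[b - 1]) for a, b in pairing]


# Query row of the pfam01826 alignment, interval 303-358.
query = "CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC"
positions = cysteine_positions(query, start=303)
pairs = disulfide_pairs(positions)

for first, second in pairs:
    print(f"Cys{first} - Cys{second}")
```

The same functions could be pointed at the two cd19941 segments (765-828 and 665-722), but only if those segments also contain exactly ten cysteines; otherwise `disulfide_pairs` raises rather than guessing.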
|
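Every hit in these listings carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", with an optional "[Multi-domain]" flag. A minimal sketch of pulling those fields out of the text with a regular expression follows. The field names and the function `parse_stats` are hypothetical, and the pattern is written only against the lines visible in this listing, not against any official format specification.

```python
import re

# Pattern for the per-hit statistics line as it appears in this listing.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Return the statistics of one hit as a dict, or None if the line does not match."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


vwd = parse_stats(
    "Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 116.73 E-value: 2.32e-29"
)
c8 = parse_stats("Pssm-ID: 214843 Cd Length: 76 Bit Score: 110.12 E-value: 3.53e-28")
print(vwd)
print(c8)
```

Sorting the parsed dicts by `e_value` would reproduce the ordering used in the listings, where the most significant hit comes first.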
|
Name |
Accession |
Description |
Interval |
E-value |
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
387-550 |
2.32e-29 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 116.73 E-value: 2.32e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 387 WTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCTYTLLQSpqLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVI 466
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD--CSSEPTFSVLLKNVPCGGGATCLKSVKVELNGDEIEL 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 467 SEDEV-ITNNGDTKLLPYKTHNITI-FRQTSTHLQMATTFGLELVfQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDD 544
Cdd:smart00216 79 KDDNGkVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGLIQV-TFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDD 157
|
....*.
gi 1907182184 545 FTTSMG 550
Cdd:smart00216 158 FRTPDG 163
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
398-550 |
4.16e-29 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 115.55 E-value: 4.16e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 398 CSLEGGSFVTTFDARPYRFHGTCTYTLLQSPQLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVISEDEVITNNGD 477
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGASGVCLKSVTVIVGDLEITLQKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 478 TKLLPYKTHNITIFRQTSTHLQMATTFGLELVFQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDDFTTSMG 550
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDG 153
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
1058-1130 |
3.53e-28 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 110.12 E-value: 3.53e-28
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184 1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130
Cdd:smart00832 3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
857-1019 |
3.83e-28 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 113.27 E-value: 3.83e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 857 WTCqLSTQCPSTCVLYGEGHIITFDGQRFVFDGDCEYMLATDdcgaNSSQPTFKVLTENVICGkSGVTCSRAIKISLGGL 936
Cdd:smart00216 1 WCC-TQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGD 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 937 FITMADSN--YTVSGE-------EPLVHLKVKPSPLNLVLdidIPGRLNLTLVWNKHMSVSIKIrRATQQDALCGLCGNA 1007
Cdd:smart00216 75 EIELKDDNgkVTVNGQqvslpykTSDGSIQIRSSGGYLVV---ITSLGLIQVTFDGLTLLSVQL-PSKYRGKTCGLCGNF 150
|
170
....*....|..
gi 1907182184 1008 NGNMKDDFETRS 1019
Cdd:smart00216 151 DGEPEDDFRTPD 162
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
1062-1129 |
1.44e-25 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 102.46 E-value: 1.44e-25
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1062 KCNIINSQT-FAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFC 1129
Cdd:pfam08742 1 KCGLLSDSGpFAPCHSVVDPEPYFEACVYDMCSC--GGDDECLCAALAAYARACQAAGVCIgDWRTPTFC 68
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
45-194 |
2.14e-25 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 105.15 E-value: 2.14e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN----ISRIIMELGASVVTVNKETISVRDIG 120
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGAsgvcLKSVTVIVGDLEITLQKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 121 VVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDGK 194
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
45-193 |
1.17e-24 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 103.25 E-value: 1.17e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDvpATFSIQLRRDMEG----NISRIIMELGASVVTVNKETISVR-DI 119
Cdd:smart00216 12 CSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSE--PTFSVLLKNVPCGggatCLKSVKVELNGDEIELKDDNGKVTvNG 89
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 120 GVVSLPYTSNGLQITPYgQSVQLVAKQLELELV-ITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDG 193
Cdd:smart00216 90 QQVSLPYKTSDGSIQIR-SSGGYLVVITSLGLIqVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDDFRTPDG 163
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
589-661 |
1.13e-21 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 91.63 E-value: 1.13e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 589 AETHCSMLLKKGSVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:smart00832 4 ACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP-TFC 75
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
592-661 |
3.10e-21 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 90.13 E-value: 3.10e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 592 HCSMLLKKGsVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:pfam08742 1 KCGLLSDSG-PFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTP-TFC 68
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
869-1019 |
1.68e-20 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 90.89 E-value: 1.68e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 869 CVLYGEGHIITFDGQRFVFDGDCEYMLAtDDCGANSSqPTFKVLTENVICGKSGVtCSRAIKISLGGLFITMADSNY-TV 947
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLA-KDCSEEPD-FSFSVTNKNCNGGASGV-CLKSVTVIVGDLEITLQKGGTvLV 77
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 948 SGEEplVHLKVKPSPLNL------VLDIDIPGRLNLTLVWNKHMSVSIKIRRaTQQDALCGLCGNANGNMKDDFETRS 1019
Cdd:pfam00094 78 NGQK--VSLPYKSDGGEVeilgsgFVVVDLSPGVGLQVDGDGRGQLFVTLSP-SYQGKTCGLCGNYNGNQEDDFMTPD 152
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
303-358 |
2.00e-16 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 75.89 E-value: 2.00e-16
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC 55
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
303-358 |
1.55e-15 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 73.51 E-value: 1.55e-15
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANPNAppPCTKQCVEGCFCPEGYVRNS---GGKCVPPSQC 55
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
765-828 |
3.23e-13 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 66.57 E-value: 3.23e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:cd19941 1 CPPNEVYSEC--------GSACPPTCANPNAPPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
765-828 |
4.68e-13 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 66.26 E-value: 4.68e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:pfam01826 1 CPANEVYSEC--------GSACPPTCANLSPPDVC-PEPCVEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
246-298 |
9.20e-11 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 60.09 E-value: 9.20e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALC 298
Cdd:pfam08742 18 VDPEPYFEACVYDM--CSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
|
|
| CT |
smart00041 |
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ... |
4164-4243 |
3.91e-10 |
|
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.
Pssm-ID: 214482 Cd Length: 82 Bit Score: 58.95 E-value: 3.91e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 4164 EHQITYQGCVAN-VTLTRCQGFCASSVSFNkdTLQLESSCGCCQPLSTYKKQLSLPCPDpdapGQQLTLTLQVFSSCVCS 4242
Cdd:smart00041 5 RQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPD----GSTVKKTVMHIEECGCE 78
|
.
gi 1907182184 4243 P 4243
Cdd:smart00041 79 P 79
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
246-299 |
5.55e-09 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 55.42 E-value: 5.55e-09
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALCP 299
Cdd:smart00832 25 VDPEPFFENCVYDT--CACGGDCECLCDALAAYAAACAEAGVCISPWRTPTFCP 76
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
3637-4059 |
1.70e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.49 E-value: 1.70e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3637 TVTPTQPTP---------------IPATTNSPMTTVGLTGTPvvHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFR 3701
Cdd:PHA03247 2567 SVPPPRPAPrpsepavtsrarrpdAPPQSARPRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP 2644
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3702 TSEQSTTTFPTPSAPQTSLV--TSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPvstilqttievTTPPNTSTPVTH 3779
Cdd:PHA03247 2645 TVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-----------PPPPPTPEPAPH 2713
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3780 S-TSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAIS 3858
Cdd:PHA03247 2714 AlVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3859 RTTGISGTPFRTPMKTTITFPTPSSLQTSM-ATLFPPfstsvmssteifntPTNPHSVSSASTSRPLSTSLPT------- 3930
Cdd:PHA03247 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPP--------------PTSAQPTAPPPPPGPPPPSLPLggsvapg 2859
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3931 -TIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTAS 4009
Cdd:PHA03247 2860 gDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 4010 PPSSAPTfvSPTAASTVISSALPTI---HMTP---------TPSSRPTSSTGLLSTSKTTSH 4059
Cdd:PHA03247 2940 QPPLAPT--TDPAGAGEPSGAVPQPwlgALVPgrvavprfrVPQPAPSREAPASSTPPLTGH 2999
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
665-722 |
1.89e-07 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 50.39 E-value: 1.89e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 665 CTGNRTFSYDSQACDRTCLSLsDRETEChvSPVPVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
3734-4050 |
7.75e-07 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 55.77 E-value: 7.75e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3734 PTDEIHITSTNPHTVSS---VSMSRPVSTIL--QTTIEVT-----TPPNTSTPVTHSTSATTEAQGSFSTERTSTSYLSH 3803
Cdd:TIGR00927 68 SNDEMMMVSSDPPKSSSemeGEMLAPQATVGrdEATPSIAmentpSPPRRTAKITPTTPKNNYSPTAAGTERVKEDTPAT 147
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3804 PSSTTVH-QSTAG-PVITSIKSTM--GVTGTPPVHTTSGT---TSSP----------------QTPHSTHPISTAAISRT 3860
Cdd:TIGR00927 148 PSRALNHyISTSGrQRVKSYTPKPrgEVKSSSPTQTREKVrkyTPSPlgrmvnsyapstfmtmPRSHGITPRTTVKDSEI 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3861 TGISGTPFRTPMKTTITFPTPSSLQtSMATLFPPFSTSVMSsTEIFNTPTN---------PHSVSSASTSRP---LSTSL 3928
Cdd:TIGR00927 228 TATYKMLETNPSKRTAGKTTPTPLK-GMTDNTPTFLTREVE-TDLLTSPRSvvekntlttPRRVESNSSTNHwglVGKNN 305
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3929 PTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTL--QYTPTPSSVSHSPLLTTP 4006
Cdd:TIGR00927 306 LTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASATfrGLEKNPSTAPSTPATPRV 385
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 1907182184 4007 TASPPSSA-------PTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGL 4050
Cdd:TIGR00927 386 RAVLTTQVhhcvvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPDL 436
|
|
| VWC_out |
smart00215 |
von Willebrand factor (vWF) type C domain; |
360-404 |
2.89e-06 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214565 Cd Length: 67 Bit Score: 47.56 E-value: 2.89e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 1907182184 360 CMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPC-PGHCSLEGGS 404
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCgPKPCLLHNLS 46
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3181-3381 |
3.24e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 53.22 E-value: 3.24e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3181 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3260
Cdd:COG3469 28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3261 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3340
Cdd:COG3469 108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATT 183
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1907182184 3341 PFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 3381
Cdd:COG3469 184 TATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1657-1878 |
3.41e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 53.22 E-value: 3.41e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1657 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 1736
Cdd:COG3469 7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1737 HPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaITNSLMTTGGLTG 1816
Cdd:COG3469 87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 1817 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 1878
Cdd:COG3469 163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1727-2176 |
5.11e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 53.00 E-value: 5.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1727 TSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSThlplfstlsvtptteglntQSTPIPAitn 1806
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD-------------------VTSPTPA--- 479
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1807 slMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSS 1886
Cdd:pfam05109 480 --GTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTT 550
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1887 ----VTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTP-TSAPHLS 1961
Cdd:pfam05109 551 ptpnATSPTPAVTTPTPN------ATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvVTSPPKN 624
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1962 ETSAVTAHQSTPTAVSANSikptMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTH 2041
Cdd:pfam05109 625 ATSAVTTGQHNITSSSTSS----MSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPA 700
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2042 SGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFST 2121
Cdd:pfam05109 701 PRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTT 780
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 2122 DRTSTphlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR 2176
Cdd:pfam05109 781 DYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
665-722 |
1.27e-05 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 45.07 E-value: 1.27e-05
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182184 665 CTGNRTFSYDSQACDRTCLSLSDR---ETEChvspvpVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPPdvcPEPC------VEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2576-2766 |
1.31e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 51.29 E-value: 1.31e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2576 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPT 2655
Cdd:COG3469 38 TATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTST 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2656 HSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 2735
Cdd:COG3469 118 GAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGA 193
|
170 180 190
....*....|....*....|....*....|.
gi 1907182184 2736 KHTTgvsletsvqTTIASPTPSAPQTSLATH 2766
Cdd:COG3469 194 TTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2305-2516 |
1.54e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.91 E-value: 1.54e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2305 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHS 2384
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2385 GPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 2464
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATG 161
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 2465 TTGVSLETSVQTTIASPTPSAPQTSLATHLP--FSSTSAVTPTSEVIITPTPQH 2516
Cdd:COG3469 162 GTTTTSTTTTTTSASTTPSATTTATATTASGatTPSATTTATTTGPPTPGLPKH 215
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2646-3101 |
5.35e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.94 E-value: 5.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2646 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTS--------GTTSS 2717
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2718 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 2791
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2792 ssASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKP 2871
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRE 2796
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2872 TMSSTGTPVVHTTSGTTSSPQTPRTTHPSttvavsgtvhtTGLPSGTSVHTTTNFPTHSGPQSSLSTH---LPLFSTLSV 2948
Cdd:PHA03247 2797 SLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDVRRR 2865
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2949 TPTTEGLNTPTSPHSLSVASTSMPlmTVLPTTLEGTRPPHTSVPVTYTTTAATQTkssfSTDRTSAPHLSQPSTVTPTQS 3028
Cdd:PHA03247 2866 PPSRSPAAKPAAPARPPVRRLARP--AVSRSTESFALPPDQPERPPQPQAPPPPQ----PQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3029 TPIPATTNSLMTTGGLTGTPP-----------VHTTSGTTSSPQTPRTTHPFSTvavsNTKHTTGVSLETSVQTTIASPT 3097
Cdd:PHA03247 2940 QPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrVAVPRFRVPQPAPSREAPASST----PPLTGHSLSRVSSWASSLALHE 3015
|
....
gi 1907182184 3098 PSAP 3101
Cdd:PHA03247 3016 ETDP 3019
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
3802-4036 |
8.92e-05 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 49.12 E-value: 8.92e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3802 SHPSS---TTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGT--TSSPQTPHSTHpISTAAISrttgISGTPFRTPMKTTI 3876
Cdd:COG5422 59 SKESFgkyALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATssTSSLNSNDGDQ-FSPASDS----LSFNPSSTQSRKDS 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3877 TFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLStslPTTIKGTGTPQTPVSDINTTSATTQAHS 3956
Cdd:COG5422 134 GPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEI---PSLGSQSMQLPSPHFRQKFSSSDTSNGF 210
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3957 SFPTTRTSTSHlslpSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPT-----ASPPSSAPTFVSPTAASTVISSAL 4031
Cdd:COG5422 211 SYPSIRKNSRH----SSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSssnseAMSTSSKRPYIYPALLSRVAVEFK 286
|
....*
gi 1907182184 4032 PTIHM 4036
Cdd:COG5422 287 MRLQL 291
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
2432-2763 |
1.32e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 48.22 E-value: 1.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2432 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSAVTPTSEVIIT 2511
Cdd:pfam03154 186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2512 PTPQhtfssaSTSTTTGNILPTTIgQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAV 2591
Cdd:pfam03154 265 PLPQ------PSLHGQMPPMPHSL-QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2592 SANSIKPtMSSTGTPVVHT----TSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPqsslSTH- 2666
Cdd:pfam03154 338 QPPREQP-LPPAPLSMPHIkpppTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHp 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2667 -----LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGV 2741
Cdd:pfam03154 413 pplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGI 492
|
330 340
....*....|....*....|..
gi 1907182184 2742 SLETSVQTTIASPTPSAPQTSL 2763
Cdd:pfam03154 493 QPPSSASVSSSGPVPAAVSCPL 514
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
3473-3846 |
4.59e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 46.11 E-value: 4.59e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3473 QSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLS 3552
Cdd:pfam17823 45 DAVPRADNKSSEQ*NFCAATAAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPS 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3553 THLPLFSTLSVTPTTEGLNTPTS--PHSLSAASTSMPLMTVLPTTlEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTP 3630
Cdd:pfam17823 125 SAAQSLPAAIAALPSEAFSAPRAaaCRANASAAPRAAIAAASAPH-AASPAPRTAASSTTAASSTTAASSAPTTAASSAP 203
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3631 hlsqsSTVTPTQPTPIPAT-TNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQSTTT 3709
Cdd:pfam17823 204 -----ATLTPARGISTAATaTGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHAR 278
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3710 FPTPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVS--MSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEA 3787
Cdd:pfam17823 279 RLSPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGepTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEP 358
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182184 3788 QG--------SFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTP 3846
Cdd:pfam17823 359 SAspvpvlhtSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAPEQVATEATAGTASAGPTPRSSGDP 425
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1492-1875 |
5.66e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 46.30 E-value: 5.66e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1492 SITSSITQAQSSFSTDKTYTSQHSQPSTMTAHQSRSLPTVTTSTKSTMGLTGTPPVHTTSGTTSSPQ-TPRTTHPFSTVA 1570
Cdd:pfam03154 143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQgSPATSQPPNQTQ 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1571 VSNTKHTTgvsletsvqttIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIgQT 1650
Cdd:pfam03154 223 STAAPHTL-----------IQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ------PSLHGQMPPMPHSL-QT 284
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1651 GSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPtMSSTGTPVVHTT-SGTTSS 1729
Cdd:pfam03154 285 GPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQP-LPPAPLSMPHIKpPPTTPI 363
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1730 PQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTH------LPLFSTLSVTPTTEGLNTQSTP 1800
Cdd:pfam03154 364 PQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHppplqlMPQSQQLPPPPAQPPVLTQSQS 439
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 1801 IPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 1875
Cdd:pfam03154 440 LPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
1918-3538 |
6.14e-04 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 46.30 E-value: 6.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1918 TTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTS 1997
Cdd:COG3210 80 GIGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAG 159
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1998 GTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLS 2077
Cdd:COG3210 160 NNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAG 239
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2078 VASTSMplmTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQSTPIPATTNSLMTTGGLT 2157
Cdd:COG3210 240 VISTGG---TDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGT 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2158 GTPPVHTNSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSKVII 2237
Cdd:COG3210 317 AAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASS 396
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2238 TPTPQHTLSSASTSTTTGNILPTTIGKTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTA 2317
Cdd:COG3210 397 TTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSA 476
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2318 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLF 2397
Cdd:COG3210 477 GNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTT 556
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2398 STLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTT 2477
Cdd:COG3210 557 AASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTG 636
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2478 IASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQT 2557
Cdd:COG3210 637 SAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVT 716
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2558 KTSFSTDRTS-------TSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRT--THPST 2628
Cdd:COG3210 717 GQIGALANANgdtvtfgNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNagAEISI 796
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2629 TVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPV 2708
Cdd:COG3210 797 DITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAAS 876
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2709 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQ 2788
Cdd:COG3210 877 ITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSA 956
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2789 HTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANS 2868
Cdd:COG3210 957 ASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGT 1036
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2869 IKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSV 2948
Cdd:COG3210 1037 AATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGV 1116
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2949 TPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSAPHLSQPSTVTPTQS 3028
Cdd:COG3210 1117 TASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGT 1196
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3029 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATH 3108
Cdd:COG3210 1197 DLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGN 1276
|
1210 1220 1230 1240 1250 1260 1270 1280
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3109 LPFSSTSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSTITQTKTSFFTDRTSTSTSAP 3188
Cdd:COG3210 1277 AGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGN 1356
|
1290 1300 1310 1320 1330 1340 1350 1360
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3189 HLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF 3268
Cdd:COG3210 1357 GATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGG 1436
|
1370 1380 1390 1400 1410 1420 1430 1440
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3269 PTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVS 3348
Cdd:COG3210 1437 TGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTT 1516
|
1450 1460 1470 1480 1490 1500 1510 1520
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3349 NTKHTTGVSLETSVQTTIASPTPSAPQTSLAThlpfsstsSVTPTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTGS 3428
Cdd:COG3210 1517 AEVAKASLEGGEGTYGGSSVAEAGTGGGILGA--------VSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQA 1588
|
1530 1540 1550 1560 1570 1580 1590 1600
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3429 PHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 3508
Cdd:COG3210 1589 PTAGNTATLTLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSGAVNGAGNGW 1668
|
1610 1620 1630
....*....|....*....|....*....|
gi 1907182184 3509 PRTTHPSTTVAVSGTVHTTGLPSGTSVHTT 3538
Cdd:COG3210 1669 AVDLTDATLAGLGGATTAAAGNVATGDTAP 1698
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1758-2222 |
6.45e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.08 E-value: 6.45e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1758 SVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTS--------GTTSS 1829
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1830 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 1903
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1904 ssASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIkp 1983
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-- 2798
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1984 tmsstgtpvvhttsgttSSPQTPrTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSlsthlPLFSTLsvtpT 2063
Cdd:PHA03247 2799 -----------------PSPWDP-ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG-----PPPPSL----P 2851
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2064 TEGLNTPTSPHSLSVASTSMPLMTVLPttlegTRPPHTSVPVTYTttaatqtksSFSTDRTSTPHLSQSSTVTPTQSTPi 2143
Cdd:PHA03247 2852 LGGSVAPGGDVRRRPPSRSPAAKPAAP-----ARPPVRRLARPAV---------SRSTESFALPPDQPERPPQPQAPPP- 2916
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2144 PATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQTSLATHLP 2222
Cdd:PHA03247 2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREAPASSTPPL 2996
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
3330-3788 |
1.25e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 44.91 E-value: 1.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3330 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 3409
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3410 STSTTTGnilPTTIGQTGSPHTSVPVIYTTS----AITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIK 3485
Cdd:pfam05109 502 KAPDMTS---PTSAVTTPTPNATSPTPAVTTptpnATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTS 578
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3486 PTmSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTP 3565
Cdd:pfam05109 579 PT-SAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSP 657
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3566 TTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQPTP 3645
Cdd:pfam05109 658 STSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATS 737
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3646 --IPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSttlSTAPQFRTSEQSTTTFPTPSAPQTSLVTS 3723
Cdd:pfam05109 738 pqAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFT 814
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182184 3724 LPPFSTSS----VSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQ 3788
Cdd:pfam05109 815 SPPVTTAQatvpVPPTSQPRFSNLSMLVLQWASLAVLTLLLLLVMADCAFRRNLSTSHTYTTPPYDDAE 883
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3464-3684 |
1.30e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.74 E-value: 1.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3464 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3543
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3544 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 3623
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182184 3624 TDRTSTPhlSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSL 3684
Cdd:COG3469 159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKHVL 217
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2849-3063 |
1.64e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 1.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2849 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2928
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2929 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVTYTTTAATQTKSSF 3007
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184 3008 STDRTSAPHLSQPSTVTPTQSTPIPATTnslmTTGGLTGTPPVHTTSGTTSSPQTP 3063
Cdd:COG3469 159 ATGGTTTTSTTTTTTSASTTPSATTTAT----ATTASGATTPSATTTATTTGPPTP 210
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1961-2179 |
1.67e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 1.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1961 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2040
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2041 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 2120
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182184 2121 TDRTSTPhlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTH 2179
Cdd:COG3469 159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| VWC |
pfam00093 |
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ... |
360-395 |
1.96e-03 |
|
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.
Pssm-ID: 278520 Cd Length: 57 Bit Score: 39.33 E-value: 1.96e-03
10 20 30
....*....|....*....|....*....|....*..
gi 1907182184 360 CMLNGMVYGPGEITKTA-CQTCQCTMGRWTCTKQPCP 395
Cdd:pfam00093 1 CVQNGVVYENGETWKPDlCTICTCDDGKVLCDKIICP 37
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
2705-3068 |
2.06e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 44.37 E-value: 2.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2705 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIIT 2784
Cdd:pfam03154 186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2785 PTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAV 2864
Cdd:pfam03154 265 PLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2865 SANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTHL 2940
Cdd:pfam03154 338 QPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHP 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2941 PlfsTLSVTPTTEGLNTP-------TSPHSLSVASTSMPLMTVL-PTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTdrT 3012
Cdd:pfam03154 413 P---PLQLMPQSQQLPPPpaqppvlTQSQSLPPPAASHPPTSGLhQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTS--S 487
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184 3013 SAPHLSQPSTVTPTQSTPIPATTNSLMttggltgtPPVHTTSGTTSSPQTPRTTHP 3068
Cdd:pfam03154 488 AMPGIQPPSSASVSSSGPVPAAVSCPL--------PPVQIKEEALDEAEEPESPPP 535
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
2575-2899 |
2.13e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 44.22 E-value: 2.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2575 LSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTSSSPQTPRT------THPSTTVAVSGTVHTTGLPSGTSVQ 2648
Cdd:TIGR00927 91 LAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSRALNHYISTSGRQRVKSYT 168
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2649 TTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipATTNSLMTTGGLTGTPPVHTTSGT 2714
Cdd:TIGR00927 169 PKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEITATYKMLETNPSKRTAGK 245
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2715 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 2794
Cdd:TIGR00927 246 TTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQV 325
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2795 STSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLSETSAVTAHQST---PTAV 2864
Cdd:TIGR00927 326 TISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVRAVLTTQVHHCVvvkPAPA 404
|
330 340 350
....*....|....*....|....*....|....*
gi 1907182184 2865 SANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2899
Cdd:TIGR00927 405 VPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
3018-3378 |
2.27e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 44.37 E-value: 2.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3018 SQPSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSgtTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqttIASPT 3097
Cdd:pfam03154 169 TQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPS--VPPQGSPATSQPPNQTQSTAAPHTL-----------IQQTP 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3098 PSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPViyttstitqTKTSFF 3177
Cdd:pfam03154 236 TLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPV---------PPQPFP 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3178 TDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP 3257
Cdd:pfam03154 300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3258 SGTSVQTTTNFPTHSG--PQSSLSTH------------LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPV 3323
Cdd:pfam03154 380 GPSPFQMNSNLPPPPAlkPLSSLSTHhppsahppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQ 459
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 3324 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 3378
Cdd:pfam03154 460 SPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
3772-4000 |
3.40e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 43.84 E-value: 3.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3772 NTSTPVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHP 3851
Cdd:NF033849 250 STSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSS 329
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3852 ISTAAISRTTGISGT-----PFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLST 3926
Cdd:NF033849 330 SYNVSSGTGVSSSHSdgtsqSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 3927 SLPTTiKGTGTpQTPVSDINTTSATTQAHSSFPTTRTSTSHlSLPSSMTSTLTpASRSASTLQYTPTPSSVSHS 4000
Cdd:NF033849 410 SQGGS-EGWGS-GDSVQSVSQSYGSSSSTGTSSGHSDSSSH-STSSGQADSVS-QGTSWSEGTGTSQGQSVGTS 479
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
2127-2490 |
3.84e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.60 E-value: 3.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2127 PHLSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVhtnSGTTSSPQTPRTTHPFSTVAVsntkhttgvsletsvQTTIA 2206
Cdd:pfam03154 171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV---PPQGSPATSQPPNQTQSTAAP---------------HTLIQ 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2207 SPTPSAPQTSLATHLPFSSTSSVTPTSKVIITPTPQhtlssaSTSTTTGNILPTTIgKTGSPHTSVPVIYTTSAITQTKT 2286
Cdd:pfam03154 233 QTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ------PSLHGQMPPMPHSL-QTGPSHMQHPVPPQPFPLTPQSS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2287 SFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGT 2362
Cdd:pfam03154 306 QSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPF 384
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2363 VHTTGLPSGTSVHTTTNFPTHSGPqsslSTH------LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVH 2436
Cdd:pfam03154 385 QMNSNLPPPPALKPLSSLSTHHPP----SAHppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQS 460
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 2437 TTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 2490
Cdd:pfam03154 461 PFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
1669-2011 |
3.84e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 43.45 E-value: 3.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1669 TKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTTSSPQTPRT------THPSTTV 1742
Cdd:TIGR00927 73 MMVSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSR 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1743 AVSGTVHTTGLPSGTSVHTTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipAITNSL 1808
Cdd:TIGR00927 151 ALNHYISTSGRQRVKSYTPKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEI 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1809 MTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVT 1888
Cdd:TIGR00927 228 TATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLT 307
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1889 PTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLS 1961
Cdd:TIGR00927 308 TPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVR 386
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 1962 ETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2011
Cdd:TIGR00927 387 AVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2299-2768 |
4.66e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 4.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2299 APHLSEtSAVTAHQSTPTAvsansiKPTMSSTGTPVVHTTSGTTSSPQT--PRTTHPSTTVAVSGTVHTTGLPSGTSVHT 2376
Cdd:PHA03247 2574 APRPSE-PAVTSRARRPDA------PPQSARPRAPVDDRGDPRGPAPPSplPPDTHAPDPPPPSPSPAANEPDPHPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2377 TTNFPTHSGPQSSlSTHLPLFSTLSVTPTTEGLNTQSTPIPATTnslmttggltgtPPVHTTSGTTSSPQTPRTTHPFST 2456
Cdd:PHA03247 2647 PPPERPRDDPAPG-RVSRPRRARRLGRAAQASSPPQRPRRRAAR------------PTVGSLTSLADPPPPPPTPEPAPH 2713
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2457 VAVSntkhttgvsletsvqttiASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSAststttgnilPTTig 2536
Cdd:PHA03247 2714 ALVS------------------ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP----------PTT-- 2763
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2537 qTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSS 2616
Cdd:PHA03247 2764 -AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2617 SPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTnfPTHSgPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPI----PAT 2692
Cdd:PHA03247 2843 PGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA--PARP-PVRRLARPAVSRSTESFALPPDQPERPPQPQapppPQP 2919
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182184 2693 TNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQTSLATHLP 2768
Cdd:PHA03247 2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREAPASSTPPL 2996
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
1407-1625 |
4.66e-03 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 43.34 E-value: 4.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1407 STGPPLGTSVQTTINFPTLSAPQTSLVT-----PHPGLSSSSTALTSEILKTPTSSQMVSSASPQT-IFSSIHPKTTLEA 1480
Cdd:COG5422 28 SKQLLPPRRLQRKLNPISIRNGADNDIInseskESFGKYALGHQIFSSFSSSPKLFQRRNSAGPIThSPSATSSTSSLNS 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1481 TTPQHTAPLITSITSSITQAQSSFSTDKTYTSQHS------QPSTMTaHQSRSLPTVTTSTKSTMGLTG---TPPVHTTS 1551
Cdd:COG5422 108 NDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQkrknplLPSSST-HGTHPPIVFTDNNGSHAGAPNarsRKEIPSLG 186
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182184 1552 GTTSSPQTPRTTHPFSTVAVSNT---KHTTGVSLETSvqttiaSPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPT 1625
Cdd:COG5422 187 SQSMQLPSPHFRQKFSSSDTSNGfsyPSIRKNSRHSS------NSMPSFPHSSTAVLLKRHSGSSGASLISSNITPS 257
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
387-550 |
2.32e-29 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 116.73 E-value: 2.32e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 387 WTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCTYTLLQSpqLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVI 466
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD--CSSEPTFSVLLKNVPCGGGATCLKSVKVELNGDEIEL 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 467 SEDEV-ITNNGDTKLLPYKTHNITI-FRQTSTHLQMATTFGLELVfQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDD 544
Cdd:smart00216 79 KDDNGkVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGLIQV-TFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDD 157
|
....*.
gi 1907182184 545 FTTSMG 550
Cdd:smart00216 158 FRTPDG 163
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
398-550 |
4.16e-29 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 115.55 E-value: 4.16e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 398 CSLEGGSFVTTFDARPYRFHGTCTYTLLQSPQLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVISEDEVITNNGD 477
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGASGVCLKSVTVIVGDLEITLQKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 478 TKLLPYKTHNITIFRQTSTHLQMATTFGLELVFQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDDFTTSMG 550
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDG 153
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
1058-1130 |
3.53e-28 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 110.12 E-value: 3.53e-28
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184 1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130
Cdd:smart00832 3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
|
|
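Each alignment row above carries a start coordinate, an aligned sequence, and an end coordinate, so the stated interval can be checked against the residue count. The following is a minimal sketch of that check, assuming rows are whitespace-separated with the coordinates on either side of the sequence and that `-` marks a gap; the function names are illustrative and not part of any NCBI tool.

```python
def parse_alignment_row(line):
    """Split a row into (label, start, aligned_sequence, end).

    Assumes the last three whitespace-separated fields are
    start, sequence, end; everything before them is the label.
    """
    *label, start, seq, end = line.split()
    return " ".join(label), int(start), seq, int(end)


def residue_count(aligned_seq):
    """Count residues, ignoring gap characters ('-')."""
    return sum(1 for ch in aligned_seq if ch != "-")


def is_consistent(line):
    """True if the residue count matches end - start + 1."""
    _, start, seq, end = parse_alignment_row(line)
    return residue_count(seq) == end - start + 1


# Rows copied from the C8 (smart00832) hit at interval 1058-1130.
query_row = ("gi 1907182184 1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGG"
             "DCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130")
model_row = ("Cdd:smart00832 3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GG"
             "DCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76")

for row in (query_row, model_row):
    label, start, seq, end = parse_alignment_row(row)
    print(label, start, end, residue_count(seq), is_consistent(row))
```

Lowercase letters are counted as residues here, on the assumption that they mark unaligned positions rather than gaps.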
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
857-1019 |
3.83e-28 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 113.27 E-value: 3.83e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 857 WTCqLSTQCPSTCVLYGEGHIITFDGQRFVFDGDCEYMLATDdcgaNSSQPTFKVLTENVICGkSGVTCSRAIKISLGGL 936
Cdd:smart00216 1 WCC-TQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGD 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 937 FITMADSN--YTVSGE-------EPLVHLKVKPSPLNLVLdidIPGRLNLTLVWNKHMSVSIKIrRATQQDALCGLCGNA 1007
Cdd:smart00216 75 EIELKDDNgkVTVNGQqvslpykTSDGSIQIRSSGGYLVV---ITSLGLIQVTFDGLTLLSVQL-PSKYRGKTCGLCGNF 150
|
170
....*....|..
gi 1907182184 1008 NGNMKDDFETRS 1019
Cdd:smart00216 151 DGEPEDDFRTPD 162
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
1062-1129 |
1.44e-25 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to avoid overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found in proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 102.46 E-value: 1.44e-25
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1062 KCNIINSQT-FAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFC 1129
Cdd:pfam08742 1 KCGLLSDSGpFAPCHSVVDPEPYFEACVYDMCSC--GGDDECLCAALAAYARACQAAGVCIgDWRTPTFC 68
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
45-194 |
2.14e-25 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 105.15 E-value: 2.14e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN----ISRIIMELGASVVTVNKETISVRDIG 120
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGAsgvcLKSVTVIVGDLEITLQKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 121 VVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDGK 194
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
45-193 |
1.17e-24 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 103.25 E-value: 1.17e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDvpATFSIQLRRDMEG----NISRIIMELGASVVTVNKETISVR-DI 119
Cdd:smart00216 12 CSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSE--PTFSVLLKNVPCGggatCLKSVKVELNGDEIELKDDNGKVTvNG 89
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 120 GVVSLPYTSNGLQITPYgQSVQLVAKQLELELV-ITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDG 193
Cdd:smart00216 90 QQVSLPYKTSDGSIQIR-SSGGYLVVITSLGLIqVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDDFRTPDG 163
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
589-661 |
1.13e-21 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 91.63 E-value: 1.13e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 589 AETHCSMLLKKGSVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:smart00832 4 ACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP-TFC 75
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
592-661 |
3.10e-21 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to avoid overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found in proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 90.13 E-value: 3.10e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 592 HCSMLLKKGsVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:pfam08742 1 KCGLLSDSG-PFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTP-TFC 68
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
869-1019 |
1.68e-20 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 90.89 E-value: 1.68e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 869 CVLYGEGHIITFDGQRFVFDGDCEYMLAtDDCGANSSqPTFKVLTENVICGKSGVtCSRAIKISLGGLFITMADSNY-TV 947
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLA-KDCSEEPD-FSFSVTNKNCNGGASGV-CLKSVTVIVGDLEITLQKGGTvLV 77
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 948 SGEEplVHLKVKPSPLNL------VLDIDIPGRLNLTLVWNKHMSVSIKIRRaTQQDALCGLCGNANGNMKDDFETRS 1019
Cdd:pfam00094 78 NGQK--VSLPYKSDGGEVeilgsgFVVVDLSPGVGLQVDGDGRGQLFVTLSP-SYQGKTCGLCGNYNGNQEDDFMTPD 152
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
303-358 |
2.00e-16 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 75.89 E-value: 2.00e-16
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC 55
|
|
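The TIL description above gives the disulphide pattern as cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. The sketch below maps that pattern onto residue positions in the pfam01826 model sequence from the alignment above. It assumes the cysteines are numbered in order of appearance and that the ungapped sequence contains exactly ten of them; the function names are hypothetical.

```python
# Cysteine pairing pattern quoted in the TIL domain description.
TIL_PAIRING = [(1, 7), (2, 6), (3, 5), (4, 10), (8, 9)]


def cysteine_positions(aligned_seq):
    """Return 1-based positions of cysteines in the ungapped sequence."""
    ungapped = aligned_seq.replace("-", "").upper()
    return [i for i, ch in enumerate(ungapped, start=1) if ch == "C"]


def disulfide_pairs(aligned_seq, pairing=TIL_PAIRING):
    """Translate cysteine ordinals (1st, 2nd, ...) into residue positions."""
    cys = cysteine_positions(aligned_seq)
    if len(cys) != 10:
        raise ValueError(f"expected 10 cysteines, found {len(cys)}")
    return [(cys[a - 1], cys[b - 1]) for a, b in pairing]


# Model (Cdd:pfam01826) row from the 303-358 hit, gaps included.
model_seq = "CPANEVYSECGSACPPTCANLSPPDVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC"

print(cysteine_positions(model_seq))
# [1, 10, 14, 18, 27, 31, 35, 37, 49, 55]
print(disulfide_pairs(model_seq))
# [(1, 35), (10, 31), (14, 27), (18, 55), (37, 49)]
```

Positions are in model coordinates (1 to 55). A query sequence with a different number of cysteines raises an error rather than guessing a pairing.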
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
303-358 |
1.55e-15 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 73.51 E-value: 1.55e-15
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANPNAppPCTKQCVEGCFCPEGYVRNS---GGKCVPPSQC 55
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
765-828 |
3.23e-13 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 66.57 E-value: 3.23e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:cd19941 1 CPPNEVYSEC--------GSACPPTCANPNAPPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
765-828 |
4.68e-13 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 66.26 E-value: 4.68e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:pfam01826 1 CPANEVYSEC--------GSACPPTCANLSPPDVC-PEPCVEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
246-298 |
9.20e-11 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to avoid overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found in proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 60.09 E-value: 9.20e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALC 298
Cdd:pfam08742 18 VDPEPYFEACVYDM--CSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
|
|
| CT |
smart00041 |
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ... |
4164-4243 |
3.91e-10 |
|
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.
Pssm-ID: 214482 Cd Length: 82 Bit Score: 58.95 E-value: 3.91e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 4164 EHQITYQGCVAN-VTLTRCQGFCASSVSFNkdTLQLESSCGCCQPLSTYKKQLSLPCPDpdapGQQLTLTLQVFSSCVCS 4242
Cdd:smart00041 5 RQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPD----GSTVKKTVMHIEECGCE 78
|
.
gi 1907182184 4243 P 4243
Cdd:smart00041 79 P 79
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
246-299 |
5.55e-09 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 55.42 E-value: 5.55e-09
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALCP 299
Cdd:smart00832 25 VDPEPFFENCVYDT--CACGGDCECLCDALAAYAAACAEAGVCISPWRTPTFCP 76
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
3637-4059 |
1.70e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.49 E-value: 1.70e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3637 TVTPTQPTP---------------IPATTNSPMTTVGLTGTPvvHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFR 3701
Cdd:PHA03247 2567 SVPPPRPAPrpsepavtsrarrpdAPPQSARPRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP 2644
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3702 TSEQSTTTFPTPSAPQTSLV--TSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPvstilqttievTTPPNTSTPVTH 3779
Cdd:PHA03247 2645 TVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-----------PPPPPTPEPAPH 2713
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3780 S-TSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAIS 3858
Cdd:PHA03247 2714 AlVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3859 RTTGISGTPFRTPMKTTITFPTPSSLQTSM-ATLFPPfstsvmssteifntPTNPHSVSSASTSRPLSTSLPT------- 3930
Cdd:PHA03247 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPP--------------PTSAQPTAPPPPPGPPPPSLPLggsvapg 2859
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3931 -TIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTAS 4009
Cdd:PHA03247 2860 gDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 4010 PPSSAPTfvSPTAASTVISSALPTI---HMTP---------TPSSRPTSSTGLLSTSKTTSH 4059
Cdd:PHA03247 2940 QPPLAPT--TDPAGAGEPSGAVPQPwlgALVPgrvavprfrVPQPAPSREAPASSTPPLTGH 2999
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
3556-4015 |
2.30e-08 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 60.96 E-value: 2.30e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3556 PLFSTLSVTPTTEGLNTPT-SPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPvtytttaatqtkssfsTDRTSTPHLSQ 3634
Cdd:PHA03307 54 TVVAGAAACDRFEPPTGPPpGPGTEAPANESRSTPTWSLSTLAPASPAREGSP----------------TPPGPSSPDPP 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3635 SSTVTPTQPTPIPATTNSPMTTvglTGTPVVHTPSGTSSIAHTPHTthslPTAASSSTTLSTAPQFRTSEQSTttfPTPS 3714
Cdd:PHA03307 118 PPTPPPASPPPSPAPDLSEMLR---PVGSPGPPPAASPPAAGASPA----AVASDAASSRQAALPLSSPEETA---RAPS 187
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3715 APQTSLVTSLPPFSTSSVSPTdeihitstnPHTVSSVSMSRPVSTILQTtievttpPNTSTPVTHSTSATTEAQGSFSTE 3794
Cdd:PHA03307 188 SPPAEPPPSTPPAAASPRPPR---------RSSPISASASSPAPAPGRS-------AADDAGASSSDSSSSESSGCGWGP 251
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3795 RTSTSyLSHPssttvhqstaGPVITSIKSTMGVTGTPPvhttsGTTSSPQTPHSTHPISTAAISRttGISGTPFRTPMKT 3874
Cdd:PHA03307 252 ENECP-LPRP----------APITLPTRIWEASGWNGP-----SSRPGPASSSSSPRERSPSPSP--SSPGSGPAPSSPR 313
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3875 TITFPTPSSLQTSMATLfppfSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGTGTPQTPvsdintTSATTQA 3954
Cdd:PHA03307 314 ASSSSSSSRESSSSSTS----SSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPS------SPAASAG 383
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 3955 HSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTASP-PSSAP 4015
Cdd:PHA03307 384 RPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGEPwPGSPP 445
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
665-722 |
1.89e-07 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 50.39 E-value: 1.89e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 665 CTGNRTFSYDSQACDRTCLSLsDRETEChvSPVPVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
3734-4050 |
7.75e-07 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 55.77 E-value: 7.75e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3734 PTDEIHITSTNPHTVSS---VSMSRPVSTIL--QTTIEVT-----TPPNTSTPVTHSTSATTEAQGSFSTERTSTSYLSH 3803
Cdd:TIGR00927 68 SNDEMMMVSSDPPKSSSemeGEMLAPQATVGrdEATPSIAmentpSPPRRTAKITPTTPKNNYSPTAAGTERVKEDTPAT 147
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3804 PSSTTVH-QSTAG-PVITSIKSTM--GVTGTPPVHTTSGT---TSSP----------------QTPHSTHPISTAAISRT 3860
Cdd:TIGR00927 148 PSRALNHyISTSGrQRVKSYTPKPrgEVKSSSPTQTREKVrkyTPSPlgrmvnsyapstfmtmPRSHGITPRTTVKDSEI 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3861 TGISGTPFRTPMKTTITFPTPSSLQtSMATLFPPFSTSVMSsTEIFNTPTN---------PHSVSSASTSRP---LSTSL 3928
Cdd:TIGR00927 228 TATYKMLETNPSKRTAGKTTPTPLK-GMTDNTPTFLTREVE-TDLLTSPRSvvekntlttPRRVESNSSTNHwglVGKNN 305
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3929 PTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTL--QYTPTPSSVSHSPLLTTP 4006
Cdd:TIGR00927 306 LTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASATfrGLEKNPSTAPSTPATPRV 385
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 1907182184 4007 TASPPSSA-------PTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGL 4050
Cdd:TIGR00927 386 RAVLTTQVhhcvvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPDL 436
|
|
| VWC_out |
smart00215 |
von Willebrand factor (vWF) type C domain; |
360-404 |
2.89e-06 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214565 Cd Length: 67 Bit Score: 47.56 E-value: 2.89e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 1907182184 360 CMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPC-PGHCSLEGGS 404
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCgPKPCLLHNLS 46
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3181-3381 |
3.24e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 53.22 E-value: 3.24e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3181 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3260
Cdd:COG3469 28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3261 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3340
Cdd:COG3469 108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATT 183
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1907182184 3341 PFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 3381
Cdd:COG3469 184 TATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1657-1878 |
3.41e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 53.22 E-value: 3.41e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1657 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 1736
Cdd:COG3469 7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1737 HPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaITNSLMTTGGLTG 1816
Cdd:COG3469 87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 1817 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 1878
Cdd:COG3469 163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1727-2176 |
5.11e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 53.00 E-value: 5.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1727 TSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSThlplfstlsvtptteglntQSTPIPAitn 1806
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD-------------------VTSPTPA--- 479
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1807 slMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSS 1886
Cdd:pfam05109 480 --GTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTT 550
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1887 ----VTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTP-TSAPHLS 1961
Cdd:pfam05109 551 ptpnATSPTPAVTTPTPN------ATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvVTSPPKN 624
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1962 ETSAVTAHQSTPTAVSANSikptMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTH 2041
Cdd:pfam05109 625 ATSAVTTGQHNITSSSTSS----MSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPA 700
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2042 SGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFST 2121
Cdd:pfam05109 701 PRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTT 780
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 2122 DRTSTphlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR 2176
Cdd:pfam05109 781 DYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
665-722 |
1.27e-05 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 45.07 E-value: 1.27e-05
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182184 665 CTGNRTFSYDSQACDRTCLSLSDR---ETEChvspvpVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPPdvcPEPC------VEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2576-2766 |
1.31e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 51.29 E-value: 1.31e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2576 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPT 2655
Cdd:COG3469 38 TATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTST 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2656 HSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 2735
Cdd:COG3469 118 GAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGA 193
|
170 180 190
....*....|....*....|....*....|.
gi 1907182184 2736 KHTTgvsletsvqTTIASPTPSAPQTSLATH 2766
Cdd:COG3469 194 TTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2305-2516 |
1.54e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.91 E-value: 1.54e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2305 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHS 2384
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2385 GPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 2464
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATG 161
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 2465 TTGVSLETSVQTTIASPTPSAPQTSLATHLP--FSSTSAVTPTSEVIITPTPQH 2516
Cdd:COG3469 162 GTTTTSTTTTTTSASTTPSATTTATATTASGatTPSATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1644-1833 |
3.54e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 49.75 E-value: 3.54e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1644 PTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTT 1723
Cdd:COG3469 26 AATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGA 105
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1724 SGTTSSPQTPrtthPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPA 1803
Cdd:COG3469 106 NTGTSTVTTT----STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
|
170 180 190
....*....|....*....|....*....|
gi 1907182184 1804 ITNSLmTTGGLTGTPPVHTTSGTTSSPQTP 1833
Cdd:COG3469 182 TTTAT-ATTASGATTPSATTTATTTGPPTP 210
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3153-3340 |
5.19e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 49.37 E-value: 5.19e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3153 TGSPHTSVPVIYTTSTITQTKTSFFTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSS 3232
Cdd:COG3469 28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3233 PQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLM 3312
Cdd:COG3469 108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATA 187
|
170 180
....*....|....*....|....*...
gi 1907182184 3313 TTVGLTGTPPVHTTSGTTSSPQTPRTTH 3340
Cdd:COG3469 188 TTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2646-3101 |
5.35e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.94 E-value: 5.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2646 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTS--------GTTSS 2717
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2718 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 2791
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2792 ssASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKP 2871
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRE 2796
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2872 TMSSTGTPVVHTTSGTTSSPQTPRTTHPSttvavsgtvhtTGLPSGTSVHTTTNFPTHSGPQSSLSTH---LPLFSTLSV 2948
Cdd:PHA03247 2797 SLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDVRRR 2865
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2949 TPTTEGLNTPTSPHSLSVASTSMPlmTVLPTTLEGTRPPHTSVPVTYTTTAATQTkssfSTDRTSAPHLSQPSTVTPTQS 3028
Cdd:PHA03247 2866 PPSRSPAAKPAAPARPPVRRLARP--AVSRSTESFALPPDQPERPPQPQAPPPPQ----PQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3029 TPIPATTNSLMTTGGLTGTPP-----------VHTTSGTTSSPQTPRTTHPFSTvavsNTKHTTGVSLETSVQTTIASPT 3097
Cdd:PHA03247 2940 QPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrVAVPRFRVPQPAPSREAPASST----PPLTGHSLSRVSSWASSLALHE 3015
|
....
gi 1907182184 3098 PSAP 3101
Cdd:PHA03247 3016 ETDP 3019
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2318-2514 |
7.18e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 48.98 E-value: 7.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2318 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNF-PTHSGPQSSLSTHLPL 2396
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2397 FSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 2474
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 1907182184 2475 QTTIASPTPSAPqTSLATHLPFSSTSAVTPTSEVIITPTP 2514
Cdd:COG3469 161 GGTTTTSTTTTT-TSASTTPSATTTATATTASGATTPSAT 199
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
3802-4036 |
8.92e-05 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 49.12 E-value: 8.92e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3802 SHPSS---TTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGT--TSSPQTPHSTHpISTAAISrttgISGTPFRTPMKTTI 3876
Cdd:COG5422 59 SKESFgkyALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATssTSSLNSNDGDQ-FSPASDS----LSFNPSSTQSRKDS 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3877 TFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLStslPTTIKGTGTPQTPVSDINTTSATTQAHS 3956
Cdd:COG5422 134 GPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEI---PSLGSQSMQLPSPHFRQKFSSSDTSNGF 210
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3957 SFPTTRTSTSHlslpSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPT-----ASPPSSAPTFVSPTAASTVISSAL 4031
Cdd:COG5422 211 SYPSIRKNSRH----SSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSssnseAMSTSSKRPYIYPALLSRVAVEFK 286
|
....*
gi 1907182184 4032 PTIHM 4036
Cdd:COG5422 287 MRLQL 291
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1623-1821 |
1.26e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 48.21 E-value: 1.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1623 TPTPQHTLSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTA 1702
Cdd:COG3469 12 AGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATS 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1703 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTN-----FPTHSGPQSSLST 1777
Cdd:COG3469 92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVsgtetATGGTTTTSTTTT 171
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 1907182184 1778 HLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVH 1821
Cdd:COG3469 172 TTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
2432-2763 |
1.32e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 48.22 E-value: 1.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2432 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSAVTPTSEVIIT 2511
Cdd:pfam03154 186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2512 PTPQhtfssaSTSTTTGNILPTTIgQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAV 2591
Cdd:pfam03154 265 PLPQ------PSLHGQMPPMPHSL-QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2592 SANSIKPtMSSTGTPVVHT----TSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPqsslSTH- 2666
Cdd:pfam03154 338 QPPREQP-LPPAPLSMPHIkpppTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHp 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2667 -----LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGV 2741
Cdd:pfam03154 413 pplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGI 492
|
330 340
....*....|....*....|..
gi 1907182184 2742 SLETSVQTTIASPTPSAPQTSL 2763
Cdd:pfam03154 493 QPPSSASVSSSGPVPAAVSCPL 514
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2531-2725 |
2.91e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.05 E-value: 2.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2531 LPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSiKPTMSSTGTPVVHT 2610
Cdd:COG3469 22 LLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA-AAATSTSATLVATS 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2611 TSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIP 2690
Cdd:COG3469 101 TASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPS 180
|
170 180 190
....*....|....*....|....*....|....*
gi 1907182184 2691 ATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2725
Cdd:COG3469 181 ATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3823-4060 |
3.06e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 46.67 E-value: 3.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3823 STMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSS 3902
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3903 TEIFNTPtnphsvsSASTSRPLSTSLPTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPAS 3982
Cdd:COG3469 82 ATAAAAA-------ATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVS 154
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 3983 RSASTLQYTPTPSSVShspllTTPTASPPSSAPTFVSPTAASTvissalptihmTPTPSSRPTSSTGLLSTSKTTSHV 4060
Cdd:COG3469 155 GTETATGGTTTTSTTT-----TTTSASTTPSATTTATATTASG-----------ATTPSATTTATTTGPPTPGLPKHV 216
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
3476-4049 |
3.14e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 47.24 E-value: 3.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3476 PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqSSLSTHL 3555
Cdd:PHA03247 2560 PPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSP-SPAANEP 2638
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3556 PLFSTLSVTPTTEGLNTPTSPH-SLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTT------------AATQTKSSF 3622
Cdd:PHA03247 2639 DPHPPPTVPPPERPRDDPAPGRvSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpppppptpePAPHALVSA 2718
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3623 STDRTSTPHLSQSSTVTPTQPTPiPATTNSPMTTVGLTGTPVVHTPSGTSSI------AHTPHTTHSLPTAASSSTTLST 3696
Cdd:PHA03247 2719 TPLPPGPAAARQASPALPAAPAP-PAVPAGPATPGGPARPARPPTTAGPPAPappaapAAGPPRRLTRPAVASLSESRES 2797
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3697 APQFRTSEQSTT--TFPTPSAPQTSLVTSL--PPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPVStilqttievTTPPN 3772
Cdd:PHA03247 2798 LPSPWDPADPPAavLAPAAALPPAASPAGPlpPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVR---------RRPPS 2868
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3773 TSTPVTHSTSAtteaqgsfsteRTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHT-TSGTTSSPQTPHSTHP 3851
Cdd:PHA03247 2869 RSPAAKPAAPA-----------RPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQpQPPPPPQPQPPPPPPP 2937
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3852 ISTAAISRTTGISGTPfrtpmkttitFPTPSSLQTSMATLFPPfstsvmssteifNTPTNPHSVSSASTSRPLSTSLPTT 3931
Cdd:PHA03247 2938 RPQPPLAPTTDPAGAG----------EPSGAVPQPWLGALVPG------------RVAVPRFRVPQPAPSREAPASSTPP 2995
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3932 IKGTGTPQTpvsdinTTSATTQAHSSFPTTRtstshlslPSSMTSTLTPAS----RSASTLQYTPTPSSVSHSPLLTTPT 4007
Cdd:PHA03247 2996 LTGHSLSRV------SSWASSLALHEETDPP--------PVSLKQTLWPPDdtedSDADSLFDSDSERSDLEALDPLPPE 3061
|
570 580 590 600
....*....|....*....|....*....|....*....|....*....
gi 1907182184 4008 ASPPSSAPTFVSPTAAStviSSALPTIHMTPTP-------SSRPTSSTG 4049
Cdd:PHA03247 3062 PHDPFAHEPDPATPEAG---ARESPSSQFGPPPlsanaalSRRYVRSTG 3107
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1827-2242 |
3.31e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.83 E-value: 3.31e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1827 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 1906
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1907 STSTTTGnilPTTIGQTGSPHTSVPviyTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMS 1986
Cdd:pfam05109 502 KAPDMTS---PTSAVTTPTPNATSP---TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLG 575
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1987 STGTpvvhTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTnfPTHSGPQSSLSTH-LPLFSTLSVTPTTE 2065
Cdd:pfam05109 576 KTSP----TSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSP--PKNATSAVTTGQHnITSSSTSSMSLRPS 649
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2066 GLNTPTSPHSLSVASTSMPLMT-VLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSST----VTPTQS 2140
Cdd:pfam05109 650 SISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTkpgeVNVTKG 729
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2141 TPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR-TTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLAT 2219
Cdd:pfam05109 730 TPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKhTTGHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRP 809
|
410 420
....*....|....*....|...
gi 1907182184 2220 HLPFSSTSSVTPTSKVIITPTPQ 2242
Cdd:pfam05109 810 RWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2258-2452 |
3.42e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 46.67 E-value: 3.42e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2258 LPTTIGKTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSiKPTMSSTGTPVVHT 2337
Cdd:COG3469 22 LLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA-AAATSTSATLVATS 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2338 TSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIP 2417
Cdd:COG3469 101 TASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPS 180
|
170 180 190
....*....|....*....|....*....|....*
gi 1907182184 2418 ATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2452
Cdd:COG3469 181 ATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
3700-3978 |
3.43e-04 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 47.19 E-value: 3.43e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3700 FRTSEQSTTTFPTPSAPQTSLVTSLPPfsTSSVSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTH 3779
Cdd:COG5422 17 FGAPRKSDAFVSKQLLPPRRLQRKLNP--ISIRNGADNDIINSESKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITH 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3780 STSAT--TEAQGSFS---TERTSTSYLSHPSSTTVHQSTAGPvitsikstmgvTGTPpvhttSGTTSSPQTPHSTHPIST 3854
Cdd:COG5422 95 SPSATssTSSLNSNDgdqFSPASDSLSFNPSSTQSRKDSGPG-----------DGSP-----VQKRKNPLLPSSSTHGTH 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3855 AAISrTTGISGTPFRTPMKTTiTFPTPSSLQTSMATLFPPF--STSVMSSTEIFNTP---TNPHSVSSASTSRPLSTSLP 3929
Cdd:COG5422 159 PPIV-FTDNNGSHAGAPNARS-RKEIPSLGSQSMQLPSPHFrqKFSSSDTSNGFSYPsirKNSRHSSNSMPSFPHSSTAV 236
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 1907182184 3930 TTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTL 3978
Cdd:COG5422 237 LLKRHSGSSGASLISSNITPSSSNSEAMSTSSKRPYIYPALLSRVAVEF 285
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata. |
3473-3846 |
4.59e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 46.11 E-value: 4.59e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3473 QSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLS 3552
Cdd:pfam17823 45 DAVPRADNKSSEQ*NFCAATAAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPS 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3553 THLPLFSTLSVTPTTEGLNTPTS--PHSLSAASTSMPLMTVLPTTlEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTP 3630
Cdd:pfam17823 125 SAAQSLPAAIAALPSEAFSAPRAaaCRANASAAPRAAIAAASAPH-AASPAPRTAASSTTAASSTTAASSAPTTAASSAP 203
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3631 hlsqsSTVTPTQPTPIPAT-TNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQSTTT 3709
Cdd:pfam17823 204 -----ATLTPARGISTAATaTGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHAR 278
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3710 FPTPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVS--MSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEA 3787
Cdd:pfam17823 279 RLSPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGepTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEP 358
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182184 3788 QG--------SFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTP 3846
Cdd:pfam17823 359 SAspvpvlhtSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAPEQVATEATAGTASAGPTPRSSGDP 425
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
3823-4121 |
4.89e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.47 E-value: 4.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3823 STMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTPfRTPMKTTITfPTPSSLqTSMATLFPPFST---SV 3899
Cdd:PHA03247 2636 NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPP-QRPRRRAAR-PTVGSL-TSLADPPPPPPTpepAP 2712
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3900 MSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLT 3979
Cdd:PHA03247 2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLS 2792
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3980 PASRSASTLQYTPTPSSVSHSPLLTTPTASPPssAPTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGLLSTSKTTSH 4059
Cdd:PHA03247 2793 ESRESLPSPWDPADPPAAVLAPAAALPPAASP--AGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 4060 VPTFSSFSSKSTTAHLTSLTTQAATSGLLSSTMGMTNLPSSGSPDINHTTRPPGSSPLPTSA 4121
Cdd:PHA03247 2871 PAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP 2932
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3181-3404 |
5.10e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.90 E-value: 5.10e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3181 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3260
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3261 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTeglntqSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3340
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTT------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSG 155
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 3341 PFSTVAVSNTKHTTGVSLeTSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEviiTPTPQH 3404
Cdd:COG3469 156 TETATGGTTTTSTTTTTT-SASTTPSATTTATATTASGATTPSATTTATTTGPPT---PGLPKH 215
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1492-1875 |
5.66e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 46.30 E-value: 5.66e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1492 SITSSITQAQSSFSTDKTYTSQHSQPSTMTAHQSRSLPTVTTSTKSTMGLTGTPPVHTTSGTTSSPQ-TPRTTHPFSTVA 1570
Cdd:pfam03154 143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQgSPATSQPPNQTQ 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1571 VSNTKHTTgvsletsvqttIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIgQT 1650
Cdd:pfam03154 223 STAAPHTL-----------IQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ------PSLHGQMPPMPHSL-QT 284
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1651 GSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPtMSSTGTPVVHTT-SGTTSS 1729
Cdd:pfam03154 285 GPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQP-LPPAPLSMPHIKpPPTTPI 363
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1730 PQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTH------LPLFSTLSVTPTTEGLNTQSTP 1800
Cdd:pfam03154 364 PQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHppplqlMPQSQQLPPPPAQPPVLTQSQS 439
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 1801 IPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 1875
Cdd:pfam03154 440 LPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
3839-4118 |
6.14e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.06 E-value: 6.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3839 TTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMatlfPPFSTSVMSSTEIFNTPTNPHSVSSA 3918
Cdd:pfam05109 392 TVSGLGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFAA----PNTTTGLPSSTHVPTNLTAPASTGPT 467
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3919 STSRPLSTSLPTTIKGTGTPQTPVSDI----------NTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTL 3988
Cdd:pfam05109 468 VSTADVTSPTPAGTTSGASPVTPSPSPrdngteskapDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSA 547
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3989 QYTPTPSSVSHSPLLTTPTASPPSSAPTFVSPTAASTvissalptihmTPTPSSRPTSSTGLLSTSKTTSHV--PTFSSF 4066
Cdd:pfam05109 548 VTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVT-----------TPTPNATSPTVGETSPQANTTNHTlgGTSSTP 616
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 4067 SSKSTTAHLTSLTTQAATSGLLSSTMGMTNLPSSGSPDINHTTRPPGSSPLP 4118
Cdd:pfam05109 617 VVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMP 668
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
1918-3538 |
6.14e-04 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 46.30 E-value: 6.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1918 TTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTS 1997
Cdd:COG3210 80 GIGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAG 159
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1998 GTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLS 2077
Cdd:COG3210 160 NNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAG 239
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2078 VASTSMplmTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQSTPIPATTNSLMTTGGLT 2157
Cdd:COG3210 240 VISTGG---TDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGT 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2158 GTPPVHTNSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSKVII 2237
Cdd:COG3210 317 AAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASS 396
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2238 TPTPQHTLSSASTSTTTGNILPTTIGKTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTA 2317
Cdd:COG3210 397 TTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSA 476
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2318 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLF 2397
Cdd:COG3210 477 GNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTT 556
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2398 STLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTT 2477
Cdd:COG3210 557 AASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTG 636
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2478 IASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQT 2557
Cdd:COG3210 637 SAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVT 716
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2558 KTSFSTDRTS-------TSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRT--THPST 2628
Cdd:COG3210 717 GQIGALANANgdtvtfgNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNagAEISI 796
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2629 TVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPV 2708
Cdd:COG3210 797 DITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAAS 876
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2709 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQ 2788
Cdd:COG3210 877 ITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSA 956
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2789 HTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANS 2868
Cdd:COG3210 957 ASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGT 1036
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2869 IKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSV 2948
Cdd:COG3210 1037 AATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGV 1116
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2949 TPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSAPHLSQPSTVTPTQS 3028
Cdd:COG3210 1117 TASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGT 1196
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3029 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATH 3108
Cdd:COG3210 1197 DLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGN 1276
|
1210 1220 1230 1240 1250 1260 1270 1280
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3109 LPFSSTSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSTITQTKTSFFTDRTSTSTSAP 3188
Cdd:COG3210 1277 AGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGN 1356
|
1290 1300 1310 1320 1330 1340 1350 1360
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3189 HLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF 3268
Cdd:COG3210 1357 GATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGG 1436
|
1370 1380 1390 1400 1410 1420 1430 1440
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3269 PTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVS 3348
Cdd:COG3210 1437 TGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTT 1516
|
1450 1460 1470 1480 1490 1500 1510 1520
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3349 NTKHTTGVSLETSVQTTIASPTPSAPQTSLAThlpfsstsSVTPTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTGS 3428
Cdd:COG3210 1517 AEVAKASLEGGEGTYGGSSVAEAGTGGGILGA--------VSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQA 1588
|
1530 1540 1550 1560 1570 1580 1590 1600
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3429 PHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 3508
Cdd:COG3210 1589 PTAGNTATLTLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSGAVNGAGNGW 1668
|
1610 1620 1630
....*....|....*....|....*....|
gi 1907182184 3509 PRTTHPSTTVAVSGTVHTTGLPSGTSVHTT 3538
Cdd:COG3210 1669 AVDLTDATLAGLGGATTAAAGNVATGDTAP 1698
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1758-2222 |
6.45e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.08 E-value: 6.45e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1758 SVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTS--------GTTSS 1829
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1830 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 1903
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1904 ssASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIkp 1983
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-- 2798
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1984 tmsstgtpvvhttsgttSSPQTPrTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSlsthlPLFSTLsvtpT 2063
Cdd:PHA03247 2799 -----------------PSPWDP-ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG-----PPPPSL----P 2851
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2064 TEGLNTPTSPHSLSVASTSMPLMTVLPttlegTRPPHTSVPVTYTttaatqtksSFSTDRTSTPHLSQSSTVTPTQSTPi 2143
Cdd:PHA03247 2852 LGGSVAPGGDVRRRPPSRSPAAKPAAP-----ARPPVRRLARPAV---------SRSTESFALPPDQPERPPQPQAPPP- 2916
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2144 PATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQTSLATHLP 2222
Cdd:PHA03247 2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREAPASSTPPL 2996
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2578-2789 |
8.23e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.51 E-value: 8.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2578 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2657
Cdd:COG3469 14 GASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTS 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2658 GPQSSLSTHLPLFSTLSVTPTTeglntqSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 2737
Cdd:COG3469 94 ATLVATSTASGANTGTSTVTTT------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTS 167
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 2738 TTGVSLeTSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEviiTPTPQH 2789
Cdd:COG3469 168 TTTTTT-SASTTPSATTTATATTASGATTPSATTTATTTGPPT---PGLPKH 215
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
3626-4024 |
8.63e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 45.53 E-value: 8.63e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3626 RTSTPHLSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQ 3705
Cdd:pfam03154 168 QTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHP 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3706 STTTFPTPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPV-THSTSAT 3784
Cdd:pfam03154 248 PLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGqSQQRIHT 327
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3785 TEAQGSFSTERTSTSYLSHPSSTTVhqstagPVItsikstmgvtgTPPVHTTSGTTSSPQT-PHSTHPISTAAISRTTGI 3863
Cdd:pfam03154 328 PPSQSQLQSQQPPREQPLPPAPLSM------PHI-----------KPPPTTPIPQLPNPQShKHPPHLSGPSPFQMNSNL 390
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3864 SGTPFRTPMKTTITFPTPSS-------LQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTT--IKG 3934
Cdd:pfam03154 391 PPPPALKPLSSLSTHHPPSAhppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHpfVPG 470
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3935 TGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSvshspllTTPTASPPSSA 4014
Cdd:pfam03154 471 GPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPES-------PPPPPRSPSPE 543
|
410
....*....|.
gi 1907182184 4015 PTFV-SPTAAS 4024
Cdd:pfam03154 544 PTVVnTPSHAS 554
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3635-3882 |
9.76e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.13 E-value: 9.76e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3635 SSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTThSLPTAASSSTTLSTAPQFRTSEQSTTTFPTPS 3714
Cdd:COG3469 3 SVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSV-VVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3715 APQTSLVTSLPPFSTSSVSPTDeihitstnphtvssvsmsrPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQGSFSTE 3794
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTAS-------------------GANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATS 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3795 RTSTSYLSHPSSTTVHqstagpvitsikstmGVTGTPPVHTTSGTTSSPQTPhSTHPISTAAISRTTGISGTPFRTPMKT 3874
Cdd:COG3469 143 SAGSTTTTTTVSGTET---------------ATGGTTTTSTTTTTTSASTTP-SATTTATATTASGATTPSATTTATTTG 206
|
....*...
gi 1907182184 3875 TITFPTPS 3882
Cdd:COG3469 207 PPTPGLPK 214
|
|
| Hamartin |
pfam04388 |
Hamartin protein; This family includes the hamartin protein which is thought to function as a ... |
3767-4050 |
1.11e-03 |
|
Hamartin protein; This family includes the hamartin protein, which is thought to function as a tumour suppressor. The hamartin protein interacts with the tuberin protein (pfam03542). Tuberous sclerosis complex (TSC) is an autosomal dominant disorder and is characterized by the presence of hamartomas in many organs, such as brain, skin, heart, lung, and kidney. It is caused by mutation of either the TSC1 or the TSC2 tumour suppressor gene. TSC1 encodes a protein, hamartin, containing two coiled-coil regions, which have been shown to mediate binding to tuberin. The TSC2 gene codes for tuberin (pfam03542). These two proteins function within the same pathway(s) regulating cell cycle, cell growth, adhesion, and vesicular trafficking.
Pssm-ID: 461287 [Multi-domain] Cd Length: 730 Bit Score: 45.05 E-value: 1.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3767 VTTPPNTSTpvTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTG--TPPvhTTSGT--TSS 3842
Cdd:pfam04388 276 PTASPYTDQ--QSSYGSSTSTPSSTPRLQLSSSSGTSPPYLSPPSIRLKTDSFPLWSPSSVCGmtTPP--TSPGMvpTTP 351
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3843 PQTPHST-HPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTS 3921
Cdd:pfam04388 352 SELSPSSsHLSSRGSSPPEAAGEATPETTPAKDSPYLKQPPPLSDSHVHRALPASSQPSSPPRKDGRSQSSFPPLSKQAP 431
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3922 RPLSTSLPTTIKGTGTP--QTPVSDINTT----------------------SATTQAHSSFPTTR------TSTSHLSLP 3971
Cdd:pfam04388 432 TNPNSRGLLEPPGDKSSvtLSELPDFIKDlalssedsvegaeeeaaisqelSEITTEKNETDCSRggldmpFSRTMESLA 511
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182184 3972 SSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTASPPSSAPTFvSPTAASTVISSalPTIHMTPTPSSRPTSSTGL 4050
Cdd:pfam04388 512 GSQRSRNRIASYCSSTSQSDSHGPATTPESKPSALAEDGLRRTKSC-SFKQSFTPIEQ--PIESSDDCPTDEQDGENGL 587
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
3330-3788 |
1.25e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 44.91 E-value: 1.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3330 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 3409
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3410 STSTTTGnilPTTIGQTGSPHTSVPVIYTTS----AITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIK 3485
Cdd:pfam05109 502 KAPDMTS---PTSAVTTPTPNATSPTPAVTTptpnATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTS 578
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3486 PTmSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTP 3565
Cdd:pfam05109 579 PT-SAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSP 657
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3566 TTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQPTP 3645
Cdd:pfam05109 658 STSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATS 737
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3646 --IPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSttlSTAPQFRTSEQSTTTFPTPSAPQTSLVTS 3723
Cdd:pfam05109 738 pqAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFT 814
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182184 3724 LPPFSTSS----VSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQ 3788
Cdd:pfam05109 815 SPPVTTAQatvpVPPTSQPRFSNLSMLVLQWASLAVLTLLLLLVMADCAFRRNLSTSHTYTTPPYDDAE 883
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3464-3684 |
1.30e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.74 E-value: 1.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3464 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3543
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3544 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 3623
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182184 3624 TDRTSTPhlSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSL 3684
Cdd:COG3469 159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKHVL 217
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3805-4032 |
1.40e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.74 E-value: 1.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3805 SSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTpfrtpmkttiTFPTPSSL 3884
Cdd:COG3469 4 VSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTA----------ASSTAATS 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3885 QTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTS 3964
Cdd:COG3469 74 STTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTV 153
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 3965 TSHLSLPSSMTSTLTPASRSASTLQYTPTPSsvshspllTTPTASPPSSAPTFVSPTAASTVISSALP 4032
Cdd:COG3469 154 SGTETATGGTTTTSTTTTTTSASTTPSATTT--------ATATTASGATTPSATTTATTTGPPTPGLP 213
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2591-2765 |
1.43e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.74 E-value: 1.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2591 VSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF-PTHSGPQSSLSTHLPL 2669
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2670 FSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 2747
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
|
170
....*....|....*...
gi 1907182184 2748 QTTIASPTPSAPQTSLAT 2765
Cdd:COG3469 161 GGTTTTSTTTTTTSASTT 178
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3206-3380 |
1.64e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 1.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3206 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF-PTHSGPQSSLSTHLPL 3284
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3285 FSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 3362
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
|
170
....*....|....*...
gi 1907182184 3363 QTTIASPTPSAPQTSLAT 3380
Cdd:COG3469 161 GGTTTTSTTTTTTSASTT 178
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2849-3063 |
1.64e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 1.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2849 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2928
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2929 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVTYTTTAATQTKSSF 3007
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184 3008 STDRTSAPHLSQPSTVTPTQSTPIPATTnslmTTGGLTGTPPVHTTSGTTSSPQTP 3063
Cdd:COG3469 159 ATGGTTTTSTTTTTTSASTTPSATTTAT----ATTASGATTPSATTTATTTGPPTP 210
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1961-2179 |
1.67e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 1.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1961 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2040
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2041 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 2120
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182184 2121 TDRTSTPhlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTH 2179
Cdd:COG3469 159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1913-2104 |
1.86e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 1.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1913 GNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPhlSETSAVTAHQSTPTAVSANSIKPTMSSTGTPV 1992
Cdd:COG3469 24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASST--AATSSTTSTTATATAAAAAATSTSATLVATST 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1993 VHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP--SGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTP 2070
Cdd:COG3469 102 ASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSgaSATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
|
170 180 190
....*....|....*....|....*....|....
gi 1907182184 2071 TSPHSLSVASTSMPLMTVLPTTleGTRPPHTSVP 2104
Cdd:COG3469 182 TTTATATTASGATTPSATTTAT--TTGPPTPGLP 213
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2801-2992 |
1.86e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 1.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2801 GNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPhlSETSAVTAHQSTPTAVSANSIKPTMSSTGTPV 2880
Cdd:COG3469 24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASST--AATSSTTSTTATATAAAAAATSTSATLVATST 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2881 VHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP--SGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTP 2958
Cdd:COG3469 102 ASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSgaSATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
|
170 180 190
....*....|....*....|....*....|....
gi 1907182184 2959 TSPHSLSVASTSMPLMTVLPTTleGTRPPHTSVP 2992
Cdd:COG3469 182 TTTATATTASGATTPSATTTAT--TTGPPTPGLP 213
|
|
| VWC |
pfam00093 |
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ... |
360-395 |
1.96e-03 |
|
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.
Pssm-ID: 278520 Cd Length: 57 Bit Score: 39.33 E-value: 1.96e-03
10 20 30
....*....|....*....|....*....|....*..
gi 1907182184 360 CMLNGMVYGPGEITKTA-CQTCQCTMGRWTCTKQPCP 395
Cdd:pfam00093 1 CVQNGVVYENGETWKPDlCTICTCDDGKVLCDKIICP 37
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
3573-3944 |
2.03e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 44.18 E-value: 2.03e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3573 PTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQPTPIPATTns 3652
Cdd:pfam17823 66 APAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFS-- 143
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3653 pmttvgltgTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQSTTTFPTPSAPQTSLVTSLPPFSTSSV 3732
Cdd:pfam17823 144 ---------APRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGIST 214
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3733 SPTDEIHITSTNphTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAqGSFSTERTSTSYLSHPSSTTVHQS 3812
Cdd:pfam17823 215 AATATGHPAAGT--ALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAA-GTINMGDPHARRLSPAKHMPSDTM 291
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3813 TAGPVITSIKSTMG----VTGTPPVHTTSG--------TTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPT 3880
Cdd:pfam17823 292 ARNPAAPMGAQAQGpiiqVSTDQPVHNTAGeptpspsnTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMI 371
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 3881 PSSLQTSMATLFPPFSTSVMSSTEifNTPTNPHSVSSASTsrPLSTSLPTTIKGTGTPQTPVSD 3944
Cdd:pfam17823 372 PEVEATSPTTQPSPLLPTQGAAGP--GILLAPEQVATEAT--AGTASAGPTPRSSGDPKTLAMA 431
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
2705-3068 |
2.06e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 44.37 E-value: 2.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2705 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIIT 2784
Cdd:pfam03154 186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2785 PTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAV 2864
Cdd:pfam03154 265 PLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2865 SANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTHL 2940
Cdd:pfam03154 338 QPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHP 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2941 PlfsTLSVTPTTEGLNTP-------TSPHSLSVASTSMPLMTVL-PTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTdrT 3012
Cdd:pfam03154 413 P---PLQLMPQSQQLPPPpaqppvlTQSQSLPPPAASHPPTSGLhQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTS--S 487
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184 3013 SAPHLSQPSTVTPTQSTPIPATTNSLMttggltgtPPVHTTSGTTSSPQTPRTTHP 3068
Cdd:pfam03154 488 AMPGIQPPSSASVSSSGPVPAAVSCPL--------PPVQIKEEALDEAEEPESPPP 535
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2442-2758 |
2.09e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 44.52 E-value: 2.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2442 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLA---THLPFSSTSAVTPT------------- 2505
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAdvtSPTPAGTTSGASPVtpspsprdngtes 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2506 --------SEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSphTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSE 2577
Cdd:pfam05109 502 kapdmtspTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSP--TSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSP 579
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2578 TSAVTAHQSTPTAVSANSIKPTMSSTGtpvvHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2657
Cdd:pfam05109 580 TSAVTTPTPNATSPTVGETSPQANTTN----HTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETL 655
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2658 GPQSS--LSTHLPLFStlSVTPTTEGLNTQSTPIPATTNSLMTTgglTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 2735
Cdd:pfam05109 656 SPSTSdnSTSHMPLLT--SAHPTGGENITQVTPASTSTHHVSTS---SPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGT 730
|
330 340
....*....|....*....|...
gi 1907182184 2736 KHTTGVSLETSVQTTIASPTPSA 2758
Cdd:pfam05109 731 PPKNATSPQAPSGQKTAVPTVTS 753
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
2575-2899 |
2.13e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 44.22 E-value: 2.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2575 LSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTSSSPQTPRT------THPSTTVAVSGTVHTTGLPSGTSVQ 2648
Cdd:TIGR00927 91 LAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSRALNHYISTSGRQRVKSYT 168
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2649 TTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipATTNSLMTTGGLTGTPPVHTTSGT 2714
Cdd:TIGR00927 169 PKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEITATYKMLETNPSKRTAGK 245
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2715 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 2794
Cdd:TIGR00927 246 TTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQV 325
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2795 STSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLSETSAVTAHQST---PTAV 2864
Cdd:TIGR00927 326 TISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVRAVLTTQVHHCVvvkPAPA 404
|
330 340 350
....*....|....*....|....*....|....*
gi 1907182184 2865 SANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2899
Cdd:TIGR00927 405 VPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3559-3726 |
2.25e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3559 STLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTV 3638
Cdd:COG3469 33 TLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTV 112
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3639 TPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQF---RTSEQSTTTFPTPSA 3715
Cdd:COG3469 113 TTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSastTPSATTTATATTASG 192
|
170
....*....|.
gi 1907182184 3716 PQTSLVTSLPP 3726
Cdd:COG3469 193 ATTPSATTTAT 203
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
3018-3378 |
2.27e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 44.37 E-value: 2.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3018 SQPSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSgtTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqttIASPT 3097
Cdd:pfam03154 169 TQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPS--VPPQGSPATSQPPNQTQSTAAPHTL-----------IQQTP 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3098 PSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPViyttstitqTKTSFF 3177
Cdd:pfam03154 236 TLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPV---------PPQPFP 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3178 TDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP 3257
Cdd:pfam03154 300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3258 SGTSVQTTTNFPTHSG--PQSSLSTH------------LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPV 3323
Cdd:pfam03154 380 GPSPFQMNSNLPPPPAlkPLSSLSTHhppsahppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQ 459
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 3324 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 3378
Cdd:pfam03154 460 SPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
|
|
| VWC_out |
smart00215 |
von Willebrand factor (vWF) type C domain; |
830-887 |
2.34e-03 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214565 Cd Length: 67 Bit Score: 39.08 E-value: 2.34e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 830 CDYAGVSYPGGFELHTDCKTCTCSQGRWTCQlSTQCPSTCVLYGEGHIITFDGQRFVF 887
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCT-KVWCGPKPCLLHNLSGECPLGQGCVP 57
|
|
| VWC |
smart00214 |
von Willebrand factor (vWF) type C domain; |
360-395 |
2.60e-03 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214564 Cd Length: 59 Bit Score: 39.04 E-value: 2.60e-03
10 20 30
....*....|....*....|....*....|....*...
gi 1907182184 360 CMLNGMVYGPGEITKT-ACQTCQCTMGRW-TCTKQPCP 395
Cdd:smart00214 1 CVHNGRVYNDGETWKPdPCQICTCLDGTTvLCDPVECP 38
|
|
| COG5099 |
COG5099 |
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal ... |
3621-4000 |
2.75e-03 |
|
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis];
Pssm-ID: 227430 [Multi-domain] Cd Length: 777 Bit Score: 43.97 E-value: 2.75e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3621 SFSTDRTSTPHLSQSSTVTpTQPTPIPATTNSPMTTVGlTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTapqf 3700
Cdd:COG5099 38 STPNSFSPIPSKASSSATF-TLNLPINNSVNHKITSSS-SSRRKPSGSWSVAISSSTSGSQSLLMELPSSSFNPST---- 111
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3701 rtseqSTTTFPTPSAPQTSLVTSLPPFSTSSVSPTDEIHI-TSTNPHTVSSVSMSRPVSTILQttiEVTTPPNTSTPVTH 3779
Cdd:COG5099 112 -----SSRNKSNSALSSTQQGNANSSVTLSSSTASSMFNSnKLPLPNPNHSNSATTNQSGSSF---INTPASSSSQPLTN 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3780 STSATTEAQGSFSTERTSTSYLSHPSSTTVhqSTAGPVITSIkstmGVTGTPPVHTTSGTTSSPQTPHsTHPISTAAISR 3859
Cdd:COG5099 184 LVVSSIKRFPYLTSLSPFFNYLIDPSSDSA--TASADTSPSF----NPPPNLSPNNLFSTSDLSPLPD-TQSVENNIILN 256
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3860 TTGISGTPFRTPMKTTI--TFPTPSSLQTSMATLFPP-FSTSVMSSTEIFNT----PTNPHSVSSASTSRPLSTSLPTTI 3932
Cdd:COG5099 257 SSSSINELTSIYGSVPSirNLRGLNSALVSFLNVSSSsLAFSALNGKEVSPTgspsTRSFARVLPKSSPNNLLTEILTTG 336
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 3933 KGTGTPQTPVSDINTTSATTQAHSsfpttrTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHS 4000
Cdd:COG5099 337 VNPPQSLPSLLNPVFLSTSTGFSL------TNLSGYLNPNKNLKKNTLSSLSNLGYSSNVPSPSSSES 398
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1703-1877 |
3.24e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.59 E-value: 3.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1703 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNF-PTHSGPQSSLSTHLPL 1781
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1782 FSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 1859
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
|
170
....*....|....*...
gi 1907182184 1860 QTTIASPTPSAPQTSLAT 1877
Cdd:COG3469 161 GGTTTTSTTTTTTSASTT 178
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
3772-4000 |
3.40e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 43.84 E-value: 3.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3772 NTSTPVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHP 3851
Cdd:NF033849 250 STSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSS 329
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3852 ISTAAISRTTGISGT-----PFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLST 3926
Cdd:NF033849 330 SYNVSSGTGVSSSHSdgtsqSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 3927 SLPTTiKGTGTpQTPVSDINTTSATTQAHSSFPTTRTSTSHlSLPSSMTSTLTpASRSASTLQYTPTPSSVSHS 4000
Cdd:NF033849 410 SQGGS-EGWGS-GDSVQSVSQSYGSSSSTGTSSGHSDSSSH-STSSGQADSVS-QGTSWSEGTGTSQGQSVGTS 479
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1690-1901 |
3.41e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.59 E-value: 3.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1690 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHS 1769
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1770 GPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 1849
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATG 161
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 1850 TTGVSLETSVQTTIASPTPSAPQTSLATHLP--FSSTSSVTPTSEVIITPTPQH 1901
Cdd:COG3469 162 GTTTTSTTTTTTSASTTPSATTTATATTASGatTPSATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1896-2073 |
3.56e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.20 E-value: 3.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1896 TPTPQHTLSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTA 1975
Cdd:COG3469 29 AASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTG 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1976 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLF 2055
Cdd:COG3469 109 TSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATAT 188
|
170
....*....|....*...
gi 1907182184 2056 STLSVTPTTEGLNTPTSP 2073
Cdd:COG3469 189 TASGATTPSATTTATTTG 206
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2615-3064 |
3.83e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 43.37 E-value: 3.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2615 SSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSThlplfstlsvtptteglntQSTPIPATTn 2694
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD-------------------VTSPTPAGT- 481
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2695 slmtTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPfsstss 2774
Cdd:pfam05109 482 ----TSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKTSP------ 544
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2775 vtptSEVIITPTPQHTLssaststtmgnilPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAV 2854
Cdd:pfam05109 545 ----TSAVTTPTPNATS-------------PTPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNH 607
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2855 TAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQS 2934
Cdd:pfam05109 608 TLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPA 687
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2935 SLSTHLPLFSTLSVTPTTEGL-------NTPTSPHSLSVASTSMPLMTVLPTTLEGTRpphTSVP-VTYTTTAATQTKSS 3006
Cdd:pfam05109 688 STSTHHVSTSSPAPRPGTTSQasgpgnsSTSTKPGEVNVTKGTPPKNATSPQAPSGQK---TAVPtVTSTGGKANSTTGG 764
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 3007 FSTDRTSAPHLSQPST------VTP----TQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPR 3064
Cdd:pfam05109 765 KHTTGHGARTSTEPTTdyggdsTTPrtryNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
2127-2490 |
3.84e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.60 E-value: 3.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2127 PHLSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVhtnSGTTSSPQTPRTTHPFSTVAVsntkhttgvsletsvQTTIA 2206
Cdd:pfam03154 171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV---PPQGSPATSQPPNQTQSTAAP---------------HTLIQ 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2207 SPTPSAPQTSLATHLPFSSTSSVTPTSKVIITPTPQhtlssaSTSTTTGNILPTTIgKTGSPHTSVPVIYTTSAITQTKT 2286
Cdd:pfam03154 233 QTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ------PSLHGQMPPMPHSL-QTGPSHMQHPVPPQPFPLTPQSS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2287 SFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGT 2362
Cdd:pfam03154 306 QSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPF 384
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2363 VHTTGLPSGTSVHTTTNFPTHSGPqsslSTH------LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVH 2436
Cdd:pfam03154 385 QMNSNLPPPPALKPLSSLSTHHPP----SAHppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQS 460
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 2437 TTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 2490
Cdd:pfam03154 461 PFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
1669-2011 |
3.84e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 43.45 E-value: 3.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1669 TKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTTSSPQTPRT------THPSTTV 1742
Cdd:TIGR00927 73 MMVSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSR 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1743 AVSGTVHTTGLPSGTSVHTTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipAITNSL 1808
Cdd:TIGR00927 151 ALNHYISTSGRQRVKSYTPKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEI 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1809 MTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVT 1888
Cdd:TIGR00927 228 TATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLT 307
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1889 PTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLS 1961
Cdd:TIGR00927 308 TPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVR 386
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 1962 ETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2011
Cdd:TIGR00927 387 AVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2299-2768 |
4.66e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 4.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2299 APHLSEtSAVTAHQSTPTAvsansiKPTMSSTGTPVVHTTSGTTSSPQT--PRTTHPSTTVAVSGTVHTTGLPSGTSVHT 2376
Cdd:PHA03247 2574 APRPSE-PAVTSRARRPDA------PPQSARPRAPVDDRGDPRGPAPPSplPPDTHAPDPPPPSPSPAANEPDPHPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2377 TTNFPTHSGPQSSlSTHLPLFSTLSVTPTTEGLNTQSTPIPATTnslmttggltgtPPVHTTSGTTSSPQTPRTTHPFST 2456
Cdd:PHA03247 2647 PPPERPRDDPAPG-RVSRPRRARRLGRAAQASSPPQRPRRRAAR------------PTVGSLTSLADPPPPPPTPEPAPH 2713
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2457 VAVSntkhttgvsletsvqttiASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSAststttgnilPTTig 2536
Cdd:PHA03247 2714 ALVS------------------ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP----------PTT-- 2763
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2537 qTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSS 2616
Cdd:PHA03247 2764 -AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2617 SPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTnfPTHSgPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPI----PAT 2692
Cdd:PHA03247 2843 PGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA--PARP-PVRRLARPAVSRSTESFALPPDQPERPPQPQapppPQP 2919
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182184 2693 TNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQTSLATHLP 2768
Cdd:PHA03247 2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREAPASSTPPL 2996
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
1407-1625 |
4.66e-03 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 43.34 E-value: 4.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1407 STGPPLGTSVQTTINFPTLSAPQTSLVT-----PHPGLSSSSTALTSEILKTPTSSQMVSSASPQT-IFSSIHPKTTLEA 1480
Cdd:COG5422 28 SKQLLPPRRLQRKLNPISIRNGADNDIInseskESFGKYALGHQIFSSFSSSPKLFQRRNSAGPIThSPSATSSTSSLNS 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1481 TTPQHTAPLITSITSSITQAQSSFSTDKTYTSQHS------QPSTMTaHQSRSLPTVTTSTKSTMGLTG---TPPVHTTS 1551
Cdd:COG5422 108 NDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQkrknplLPSSST-HGTHPPIVFTDNNGSHAGAPNarsRKEIPSLG 186
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182184 1552 GTTSSPQTPRTTHPFSTVAVSNT---KHTTGVSLETSvqttiaSPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPT 1625
Cdd:COG5422 187 SQSMQLPSPHFRQKFSSSDTSNGfsyPSIRKNSRHSS------NSMPSFPHSSTAVLLKRHSGSSGASLISSNITPS 257
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
3016-3365 |
5.11e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.98 E-value: 5.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3016 HLSQPSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIAS 3095
Cdd:pfam05109 457 NLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTT 529
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3096 PTPSAPQTSLATHLPfsstssvtptSEVIITPTPQHTLssaststtmgnilPTTIGQTGSPHTSVPVIYTTSTITQTKTS 3175
Cdd:pfam05109 530 PTPNATSPTLGKTSP----------TSAVTTPTPNATS-------------PTPAVTTPTPNATIPTLGKTSPTSAVTTP 586
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3176 FFTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTG 3255
Cdd:pfam05109 587 TPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSH 666
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3256 LPSGTSVQTTTNFPTHSGPQSSLSTHlpLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLT-GTPPVHTTSGTTSSPQ 3334
Cdd:pfam05109 667 MPLLTSAHPTGGENITQVTPASTSTH--HVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTkGTPPKNATSPQAPSGQ 744
|
330 340 350
....*....|....*....|....*....|....*
gi 1907182184 3335 TPRTTHPFSTVAVSNT----KHTTGVSLETSVQTT 3365
Cdd:pfam05109 745 KTAVPTVTSTGGKANSttggKHTTGHGARTSTEPT 779
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3707-3929 |
5.18e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.82 E-value: 5.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3707 TTTFPTPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTE 3786
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3787 AQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSpqtphsthpiSTAAISRTTGISGT 3866
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTS----------GASATSSAGSTTTT 150
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 3867 PFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLP 3929
Cdd:COG3469 151 TTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLP 213
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
3841-4118 |
5.28e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 5.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3841 SSPQTPHSTHPISTAAISRTTGISGTPF----RTPMKTTITFP---TPSSLQTSMATLFPPFSTSVMSSTEIF--NTPTN 3911
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQsarpRAPVDDRGDPRgpaPPSPLPPDTHAPDPPPPSPSPAANEPDphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3912 PHSVSSASTSRPLSTSLPTTIKGTGTPQTPvsdinttSATTQA--HSSFPTTRTSTSHLSLPSSMTSTltPASRSASTLQ 3989
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQA-------SSPPQRprRRAARPTVGSLTSLADPPPPPPT--PEPAPHALVS 2717
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3990 YTPTP----SSVSHSPLLTTPTASPPSSAPTFV--------SPTAASTVISSALPTIHMTPTPSSRPTSSTGLLSTSKTT 4057
Cdd:PHA03247 2718 ATPLPpgpaAARQASPALPAAPAPPAVPAGPATpggparpaRPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182184 4058 SHVPTFSSFSSKSTTAHLTSLTTQAATSGLLSSTMGMTNLPSSGSPDINHTTRPPGSSPLP 4118
Cdd:PHA03247 2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAP 2858
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
3671-4061 |
5.56e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 42.83 E-value: 5.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3671 TSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQSTTTFPTPSAP-QTSLVTSLPPFSTSSVSPTDEihiTSTNPHTVS 3749
Cdd:pfam03154 144 TSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPgTTQAATAGPTPSAPSVPPQGS---PATSQPPNQ 220
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3750 SVSMSRPVSTILQT-TIEVTTPPNTSTPVTHSTSATTEAQgsFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVT 3828
Cdd:pfam03154 221 TQSTAAPHTLIQQTpTLHPQRLPSPHPPLQPMTQPPPPSQ--VSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPF 298
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3829 GTPPVHTTSGTTSSPQT--PHSTHPISTAAISRTTGISGTPFR------TPMKTTITFPTPSSLQTSMATLFPPFSTSVM 3900
Cdd:pfam03154 299 PLTPQSSQSQVPPGPSPaaPGQSQQRIHTPPSQSQLQSQQPPReqplppAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHL 378
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3901 SSTEIFNTPTNphsVSSASTSRPLStSLPTTIKGTGTPqtPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTP 3980
Cdd:pfam03154 379 SGPSPFQMNSN---LPPPPALKPLS-SLSTHHPPSAHP--PPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSG 452
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3981 ASRSASTLQYTPTPSSVSHSPLLTtptasPPSSAPTFVSPtaastvissALPTIHmtpTPSSRPTSSTGLLSTSKTTSHV 4060
Cdd:pfam03154 453 LHQVPSQSPFPQHPFVPGGPPPIT-----PPSGPPTSTSS---------AMPGIQ---PPSSASVSSSGPVPAAVSCPLP 515
|
.
gi 1907182184 4061 P 4061
Cdd:pfam03154 516 P 516
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2458-2661 |
5.60e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.82 E-value: 5.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2458 AVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQ 2537
Cdd:COG3469 12 AGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATS 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2538 TGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHlSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSS 2617
Cdd:COG3469 92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSS-TAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTT 170
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 1907182184 2618 PQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQS 2661
Cdd:COG3469 171 TTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPK 214
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3756-3967 |
5.99e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.43 E-value: 5.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3756 PVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPpvhT 3835
Cdd:COG3469 11 TAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA---A 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3836 TSGTTSSPQTPHSTHPiSTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSV 3915
Cdd:COG3469 88 AATSTSATLVATSTAS-GANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTT 166
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 3916 SSASTSRPLSTSLPTTIKGTGTPQTPVSdinTTSATTQAHSSFPTTRTSTSH 3967
Cdd:COG3469 167 STTTTTTSASTTPSATTTATATTASGAT---TPSATTTATTTGPPTPGLPKH 215
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
3756-4054 |
6.10e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.98 E-value: 6.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3756 PVSTILQTTIEVT--TPPNTSTPVTHSTSATTEAQGSFSTERT-STSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPP 3832
Cdd:pfam05109 425 PESTTTSPTLNTTgfAAPNTTTGLPSSTHVPTNLTAPASTGPTvSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAP 504
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3833 VHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSV-----MSSTEIFN 3907
Cdd:pfam05109 505 DMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIptlgkTSPTSAVT 584
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3908 TPTNPHSVSSASTSRPLSTSLPTTIKGTG-TPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSlPSSMTSTLTPASRSAS 3986
Cdd:pfam05109 585 TPTPNATSPTVGETSPQANTTNHTLGGTSsTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLR-PSSISETLSPSTSDNS 663
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 3987 TlqytptpssvSHSPLLTtptasppssaptfvsptaastvisSALPTIHMTPTPSSRPTSSTGLLSTS 4054
Cdd:pfam05109 664 T----------SHMPLLT------------------------SAHPTGGENITQVTPASTSTHHVSTS 697
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
3930-4060 |
6.78e-03 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 42.38 E-value: 6.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3930 TTIKGTGTPQTP--VSDINTTSATTQAHSSFPTTRTSTShlslpSSMTSTLTPAsrsastlqyTPTPSSVSHSPLLTTPT 4007
Cdd:PLN02217 548 AWIPGKGVPYIPglFAGNPGSTNSTPTGSAASSNTTFSS-----DSPSTVVAPS---------TSPPAGHLGSPPATPSK 613
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 4008 ASPPSSAPTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGLLSTSKTTSHV 4060
Cdd:PLN02217 614 IVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSVSMV 666
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2028-2489 |
8.40e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.21 E-value: 8.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2028 SGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGlNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTY 2107
Cdd:pfam05109 394 SGLGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSP-TLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD 472
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2108 TTTAATQTKSSFSTDRTSTPHLSQSST-------VTPTQSTPIPattnslmTTGGLTGTPPVHTNSGTTSSPQTPRTThP 2180
Cdd:pfam05109 473 VTSPTPAGTTSGASPVTPSPSPRDNGTeskapdmTSPTSAVTTP-------TPNATSPTPAVTTPTPNATSPTLGKTS-P 544
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2181 FSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPfsstssvtptSKVIITPTPQHTLSSASTSTTTGNILPT 2260
Cdd:pfam05109 545 TSAV-------TTPTPNATSPTPAVTTPTPNATIPTLGKTSP----------TSAVTTPTPNATSPTVGETSPQANTTNH 607
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2261 TIGKTGS-PHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAP-HLSET-SAVTAHQSTPTAVSANSIKPTMSSTGTPVVHT 2337
Cdd:pfam05109 608 TLGGTSStPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPsSISETlSPSTSDNSTSHMPLLTSAHPTGGENITQVTPA 687
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2338 TSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPthsgPQSSLSTHLPLFSTLSV-TPTTEGLNTQSTPI 2416
Cdd:pfam05109 688 STSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTP----PKNATSPQAPSGQKTAVpTVTSTGGKANSTTG 763
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184 2417 PATTNSlmtTGGLTGTPPVHTTSGTTSSPQT---PRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTS 2489
Cdd:pfam05109 764 GKHTTG---HGARTSTEPTTDYGGDSTTPRTrynATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQPRFS 836
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1944-2163 |
8.71e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.05 E-value: 8.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1944 TSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHT 2023
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2024 TGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSV 2103
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATG 161
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2104 PVTYTTTaatqtkSSFSTDRTSTPHLSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVH 2163
Cdd:COG3469 162 GTTTTST------TTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3776-3982 |
8.79e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.05 E-value: 8.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3776 PVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTA 3855
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3856 AISRTTGISGTPFRTPMKTTitfpTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGT 3935
Cdd:COG3469 81 TATAAAAAATSTSATLVATS----TASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGT 156
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 1907182184 3936 GTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPAS 3982
Cdd:COG3469 157 ETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTAT 203
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1515-1965 |
8.80e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.62 E-value: 8.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1515 SQPSTMTAHQSRSLPTVTTSTKSTMGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSntkHTTGVSLETSVQTTIASPT 1594
Cdd:PHA03247 2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLG---RAAQASSPPQRPRRRAARP 2690
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1595 PSAPQTSLATHLPFSSTSSVTPTSEVIITPTPqhtlssASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFS 1674
Cdd:PHA03247 2691 TVGSLTSLADPPPPPPTPEPAPHALVSATPLP------PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA 2764
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1675 TDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGlP 1754
Cdd:PHA03247 2765 GPPAPAPPAAPAAGPPRRLTR----PAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTA-P 2839
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1755 SGTSVHTTTNFPTHSG--PQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAITNSlmtTGGLTGTPPVHTTSGTTSSPQT 1832
Cdd:PHA03247 2840 PPPPGPPPPSLPLGGSvaPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRS---TESFALPPDQPERPPQPQAPPP 2916
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1833 PRTTHPfstvavsntkhttgvsLETSVQTTIASPTPSAPQTSLAthlpfsstssvtPTSEVIITPTPQHTLSSASTSTTT 1912
Cdd:PHA03247 2917 PQPQPQ----------------PPPPPQPQPPPPPPPRPQPPLA------------PTTDPAGAGEPSGAVPQPWLGALV 2968
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 1913 GNILPTTIGQTGSPHTSVPviyTTSAITQTKTSFSTDRTSTPTSAPHLSETSA 1965
Cdd:PHA03247 2969 PGRVAVPRFRVPQPAPSRE---APASSTPPLTGHSLSRVSSWASSLALHEETD 3018
|
|
|