|
Name |
Accession |
Description |
Interval |
E-value |
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
888-1047 |
1.15e-36 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation. :
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 137.92 E-value: 1.15e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 888 WECTNKPCMGACVAYGDGHFVTFDGERYIFEGSCEYTLAQDyCrgntSTNGTFRIVTENVPCGTTGTtCSKTIKIFVESY 967
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD-C----SSEPTFSVLLKNVPCGGGAT-CLKSVKVELNGD 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 968 ELILHEGNFKVVEQGPSGDPPYK-------IRFMGIFLVIEIRSGIV-VSWDRKTSVFVRLQQHYKGRVCGLCGNFDDNA 1039
Cdd:smart00216 75 EIELKDDNGKVTVNGQQVSLPYKtsdgsiqIRSSGGYLVVITSLGLIqVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEP 154
|
....*...
gi 147905740 1040 INDFTTRS 1047
Cdd:smart00216 155 EDDFRTPD 162
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
429-583 |
9.09e-35 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods. :
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 132.11 E-value: 9.09e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 429 CSVQGGSHIFTFDEKLYNVHGDCSYILTKKC-ADSSFTVLVDLRKCGVTDTENCLKAVTLNLnsGDTVIRIQVNGAVFLN 507
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCsEEPDFSFSVTNKNCNGGASGVCLKSVTVIV--GDLEITLQKGGTVLVN 78
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 508 SIFTQMPLSAGNITIFKPSSFFIIVQTGMGLQLQVQLVPLMQVFVRLDRSYQGQMCGLCGNFNQNQADDFTALSGV 583
Cdd:pfam00094 79 GQKVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
2185-2273 |
7.36e-30 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues. :
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 115.51 E-value: 7.36e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2185 WTKWIDVDYPSSSiNGGDIETYENIRANGeKICEKPQDIRCMAQNHPGVNVDELGQKVKCDVSSGLVCYNKDQGGtfQMC 2264
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQP--DGC 76
|
....*....
gi 147905740 2265 YNYLIKVLC 2273
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
2499-2587 |
7.36e-30 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues. :
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 115.51 E-value: 7.36e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2499 WTKWIDVDYPSSSiNGGDIETYENIRANGeKICEKPQDIRCMAQNHPGVNVDELGQKVKCDVSSGLVCYNKDQGGtfQMC 2578
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQP--DGC 76
|
....*....
gi 147905740 2579 YNYLIKVLC 2587
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
1575-1663 |
7.36e-30 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues. :
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 115.51 E-value: 7.36e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1575 WTKWIDVDYPSSSiNGGDIETYENIRANGeKICEKPQDIRCMAQNHPGVNVDELGQKVKCDVSSGLVCYNKDQGGtfQMC 1654
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQP--DGC 76
|
....*....
gi 147905740 1655 YNYLIKVLC 1663
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
3067-3155 |
1.01e-29 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues. :
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 115.12 E-value: 1.01e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3067 WTRWINVDYPSSSiNGGDIETYENIRANGeKICEKPQDIRCMAQNYPGVNVDELGQKVKCDVNSGLVCYNKDQGGTfkMC 3146
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPD--GC 76
|
....*....
gi 147905740 3147 YNYLIKVLC 3155
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
1871-1959 |
1.47e-29 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues. :
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 114.35 E-value: 1.47e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1871 WTKWIDVDYPSSSiNGGDIETYENIRANGeKICEKPQDIRCMAQNYPGVNVDKLGQKVKCDVSSGLVCYNKDQGGtfPMC 1950
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQP--DGC 76
|
....*....
gi 147905740 1951 YNYLIKVLC 1959
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
3625-3713 |
1.95e-29 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues. :
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 113.97 E-value: 1.95e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3625 WTEWFDVDSPtSGSVKGDMETYENIRASGKtLCQAPEKIECRAENYPAVSIDKVGQVVSCNLETGLVCKNEDQKDDfkMC 3704
Cdd:pfam13330 1 WTPWFDVDNP-SGSGGGDFETLENLRAYGK-FCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPD--GC 76
|
....*....
gi 147905740 3705 FNYNIRVLC 3713
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
3381-3469 |
2.53e-29 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues. :
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 113.97 E-value: 2.53e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3381 WTKWIDVDYPSSSiNGGDIETYENIRTNGeKICEKPQDIRCMAQNHPGVNVDELGQKVKCDVNSGLVCYNKDQGGtfQMC 3460
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQP--DGC 76
|
....*....
gi 147905740 3461 YNYLIKVLC 3469
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
1346-1432 |
4.03e-29 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues. :
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 113.20 E-value: 4.03e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1346 WSIWYDESHPEAGMsGGDFETFENLRKKGhQVCQTPVGIECRAVKFPSMDLEKLGQKVNCDPSSGLSCFNSEQSPPLCHD 1425
Cdd:pfam13330 1 WTPWFDVDNPSGSG-GGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPDGCLD 78
|
....*..
gi 147905740 1426 YELRVLC 1432
Cdd:pfam13330 79 YEVRFLC 85
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
4111-4283 |
1.39e-28 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation. :
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 114.81 E-value: 1.39e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 4111 PCHFHYECECFCSGWGHSHYLTFDGTSYTFLDNCTSVLMREIHPRHgNLSILAHSYYCGATtnvTSCPRALSVYYNSMEI 4190
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSEP-TFSVLLKNVPCGGG---ATCLKSVKVELNGDEI 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 4191 ILTisttesgKDESLIIWDQMWIRSGFSKNGVTVSLSGATTMSVNISTIG-TIITFDGNI-FQIWLPYRYFsSNTEGQCG 4268
Cdd:smart00216 77 ELK-------DDNGKVTVNGQQVSLPYKTSDGSIQIRSSGGYLVVITSLGlIQVTFDGLTlLSVQLPSKYR-GKTCGLCG 148
|
170
....*....|....*
gi 147905740 4269 TCTNSQIDDCRRPDG 4283
Cdd:smart00216 149 NFDGEPEDDFRTPDG 163
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
3779-3867 |
4.63e-27 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues. :
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 107.42 E-value: 4.63e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3779 WTEWFDADYPNpGPRGGDFEVYAVFREvGYIFCDQPKDIECRSEKEPDRPLETLEQVVQCDVRFGLICKNINQSGPLqyC 3858
Cdd:pfam13330 1 WTPWFDVDNPS-GSGGGDFETLENLRA-YGKFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPDG--C 76
|
....*....
gi 147905740 3859 DNYHVRLLC 3867
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
80-227 |
1.70e-26 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation. :
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 108.64 E-value: 1.70e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 80 VCSTWGDFHYKTFDGDVFRFPGLCNYVFSSHCGaTYEDFNIQLRRGLEGSRPTVTY-VLLRAQGLVIELS--NGSVLVNG 156
Cdd:smart00216 11 TCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCS-SEPTFSVLLKNVPCGGGATCLKsVKVELNGDEIELKddNGKVTVNG 89
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 147905740 157 HREKLPYSRAGLLME---KSSGYVKISIRLVLTFLWNEEDSALLELDSKYINQTCGLCGDFNGLPAvSEFYTHN 227
Cdd:smart00216 90 QQVSLPYKTSDGSIQirsSGGYLVVITSLGLIQVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPE-DDFRTPD 162
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
2688-2776 |
1.59e-25 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues. :
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 102.80 E-value: 1.59e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2688 WTDWLDSDKPQpGQFEGDIETYYHIQNKTgiKICKKPVDIECEAVLFPNISFQKLGQEVVCNVDFGLICRNSKQSDNqIC 2767
Cdd:pfam13330 1 WTPWFDVDNPS-GSGGGDFETLENLRAYG--KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPD-GC 76
|
....*....
gi 147905740 2768 FNYHIRVLC 2776
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
619-700 |
5.17e-25 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. :
Pssm-ID: 214843 Cd Length: 76 Bit Score: 101.26 E-value: 5.17e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 619 ENFAREWCSMLTESSGVFSACHATVSPVPFYSNCLFDTCNCENSEDCMCAALSSYVFACAAKGVLLSGWRDkvcykyTNN 698
Cdd:smart00832 1 KYYACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRT------PTF 74
|
..
gi 147905740 699 CP 700
Cdd:smart00832 75 CP 76
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
1084-1158 |
1.54e-24 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. :
Pssm-ID: 214843 Cd Length: 76 Bit Score: 99.72 E-value: 1.54e-24
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 147905740 1084 KSWAQKKCSIINSA--TFAACHSQVDSTKYYEACVHDVCACdsGGDCECFCTAVAAYAQACRDVGVCLS-WRTPDICP 1158
Cdd:smart00832 1 KYYACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCISpWRTPTFCP 76
|
|
| CT |
smart00041 |
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ... |
4708-4790 |
2.34e-15 |
|
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers. :
Pssm-ID: 214482 Cd Length: 82 Bit Score: 73.98 E-value: 2.34e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 4708 VHVNATVLRYKGCETE-VNITFCEGSCSGISKYSmeAQAMERQCTCCQESKVHDVAVTMQCPDGTVIQHTYTHIDECNCA 4786
Cdd:smart00041 1 KSPVRQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPDGSTVKKTVMHIEECGCE 78
|
....
gi 147905740 4787 PACS 4790
Cdd:smart00041 79 PNCP 82
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
265-329 |
1.33e-14 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826. :
Pssm-ID: 462584 Cd Length: 68 Bit Score: 71.26 E-value: 1.33e-14
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 147905740 265 ICRRTLLGPAFDKCTALVDVSMYLDACVQDLCRCP---TCPCATFAEYSRQCAHAGGYPQNWRHPDLC 329
Cdd:pfam08742 1 KCGLLSDSGPFAPCHSVVDPEPYFEACVYDMCSCGgddECLCAALAAYARACQAAGVCIGDWRTPTFC 68
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
4336-4398 |
3.82e-14 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826. :
Pssm-ID: 462584 Cd Length: 68 Bit Score: 70.10 E-value: 3.82e-14
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 4336 LCELMLSQ-VFAECHRLIPPDAFFRSCVSDHCN-ANITDMLCQSLEAYAALCRAQGVC-TNWRNAT 4398
Cdd:pfam08742 1 KCGLLSDSgPFAPCHSVVDPEPYFEACVYDMCScGGDDECLCAALAAYARACQAAGVCiGDWRTPT 66
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
333-389 |
3.41e-13 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation. :
Pssm-ID: 410995 Cd Length: 55 Bit Score: 66.96 E-value: 3.41e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*..
gi 147905740 333 CPFNMEYRECSSPCVDTCSNPQRSQLCEDHCMDGCFCPPGTVLDDVRHlgCLPLEQC 389
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANPNAPPPCTKQCVEGCFCPEGYVRNSGGK--CVPPSQC 55
|
|
| Herpes_BLLF1 super family |
cl37540 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1968-2181 |
2.46e-08 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo. The actual alignment was detected with superfamily member pfam05109:
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 60.70 E-value: 2.46e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1968 STTPEGSVSTTETttatpqtLSSKPLPNTSSIETWSP--SGPTTSMEA------SEMPTFWTLRTSVSSP------PTTP 2033
Cdd:pfam05109 475 SPTPAGTTSGASP-------VTPSPSPRDNGTESKAPdmTSPTSAVTTptpnatSPTPAVTTPTPNATSPtlgktsPTSA 547
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2034 LTVTSVSTQHPKSLESSGSP--SVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTT 2111
Cdd:pfam05109 548 VTTPTPNATSPTPAVTTPTPnaTIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATS 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2112 T-----HPVTTASTT------RAVTETLSSRPHTKTTSELPTVSSVSSTAG----------PGTQTGSTS----RPNTVT 2166
Cdd:pfam05109 628 AvttgqHNITSSSTSsmslrpSSISETLSPSTSDNSTSHMPLLTSAHPTGGenitqvtpasTSTHHVSTSspapRPGTTS 707
|
250
....*....|....*
gi 147905740 2167 QpASSPETSSTTCQP 2181
Cdd:pfam05109 708 Q-ASGPGNSSTSTKP 721
|
|
| Herpes_BLLF1 super family |
cl37540 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2282-2495 |
2.46e-08 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo. The actual alignment was detected with superfamily member pfam05109:
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 60.70 E-value: 2.46e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2282 STTPEGSVSTTETttatpqtLSSKPLPNTSSIETWSP--SGPTTSMEA------SEMPTFWTLRTSVSSP------PTTP 2347
Cdd:pfam05109 475 SPTPAGTTSGASP-------VTPSPSPRDNGTESKAPdmTSPTSAVTTptpnatSPTPAVTTPTPNATSPtlgktsPTSA 547
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2348 LTVTSVSTQHPKSLESSGSP--SVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTT 2425
Cdd:pfam05109 548 VTTPTPNATSPTPAVTTPTPnaTIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATS 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2426 T-----HPVTTASTT------RAVTETLSSRPHTKTTSELPTVSSVSSTAG----------PGTQTGSTS----RPNTVT 2480
Cdd:pfam05109 628 AvttgqHNITSSSTSsmslrpSSISETLSPSTSDNSTSHMPLLTSAHPTGGenitqvtpasTSTHHVSTSspapRPGTTS 707
|
250
....*....|....*
gi 147905740 2481 QpASSPETSSTTCQP 2495
Cdd:pfam05109 708 Q-ASGPGNSSTSTKP 721
|
|
| DUF5585 super family |
cl39316 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
2827-3082 |
8.97e-08 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. The actual alignment was detected with superfamily member pfam17823:
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 58.43 E-value: 8.97e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2827 SRPSTPRTEKIVSIATARTTGSFPVSSLSTFPASPTSLLPTSEEPHSGKPSTSSWTASSNVPSTAKPTTSTGRQfPRTKT 2906
Cdd:pfam17823 84 TEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAA-PRAAI 162
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2907 TLWNSLPSSTPTMATRSIHPISTTMTTKNPQTLVTSDISKSNSMLPSRPHITASSVTSTSgsqisksySSTSLTTAGPSS 2986
Cdd:pfam17823 163 AAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHP--------AAGTALAAVGNS 234
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2987 PPHGGTTTHPVTTasTTRAVTETLssrphtktTSELPTVSSVSSTAGPGTQTGSTSRPSIVTQPVSSPETSSTTCQPQCQ 3066
Cdd:pfam17823 235 SPAAGTVTAAVGT--VTPAALATL--------AAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQAQ 304
|
250
....*....|....*..
gi 147905740 3067 W-TRWINVDYPSSSING 3082
Cdd:pfam17823 305 GpIIQVSTDQPVHNTAG 321
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
699-756 |
4.59e-07 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation. :
Pssm-ID: 410995 Cd Length: 55 Bit Score: 49.62 E-value: 4.59e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 147905740 699 CPQTKGYFYSVSSCQPTCRSLsEADVTCsvPFVTVDGCTCPEGTFLNDKDHCVPVEEC 756
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| PLN02217 super family |
cl33436 |
probable pectinesterase/pectinesterase inhibitor |
1687-1796 |
6.66e-07 |
|
probable pectinesterase/pectinesterase inhibitor The actual alignment was detected with superfamily member PLN02217:
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 55.87 E-value: 6.66e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1687 STPQTLSSKPLPNTSSFETWSPS---GPTTSMEASEMptfwtlrtsvSSPPTTPLTVTSVSTQHPKSleSSGSPSVVTA- 1762
Cdd:PLN02217 568 STNSTPTGSAASSNTTFSSDSPStvvAPSTSPPAGHL----------GSPPATPSKIVSPSTSPPAS--HLGSPSTTPSs 635
|
90 100 110
....*....|....*....|....*....|....*
gi 147905740 1763 -RSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTT 1796
Cdd:PLN02217 636 pESSIKVASTETASPESSIKVASTESSVSMVSMST 670
|
|
| Metaviral_G super family |
cl26626 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
1742-1864 |
4.16e-06 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated. The actual alignment was detected with superfamily member pfam09595:
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 50.34 E-value: 4.16e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1742 SVSTQHPKSlESSGSPSVVTARSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPV-- 1819
Cdd:pfam09595 54 NINKQHPEQ-EHHENPPLNEAAKEAPSESEDAPDIDPNNQHPSQDRSEAPPLEPAAKTKPSEHEPANPPDASNRLSPPda 132
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 147905740 1820 TTASTTRAVTETLSsrphtkTTSELPTVSSASSTAGPGTQTFSTT 1864
Cdd:pfam09595 133 STAAIREARTFRKP------STGKRNNPSSAQSDQSPPRANHEAI 171
|
|
| Herpes_BLLF1 super family |
cl37540 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2909-3626 |
4.29e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo. The actual alignment was detected with superfamily member pfam05109:
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 53.38 E-value: 4.29e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2909 WNSLPSSTPTMATRSiHPISTTMTTKNPQTLVTSDISKSNSMLPSRPHITASsVTSTSGSQISKSYSSTSLTTAG-PSSP 2987
Cdd:pfam05109 162 WDNCNSTNITAVVRA-QGLDVTLPLSLPTSAQDSNFSVKTEMLGNEIDIECI-MEDGEISQVLPGDNKFNITCSGyESHV 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2988 PHGG--TTTHPVTTA--STTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTQTGSTSrpsIVTQPVSSPETSSTTCQP 3063
Cdd:pfam05109 240 PSGGilTSTSPVATPipGTGYAYSLRLTPRPVSRFLGNNSILYVFYSGNGPKASGGDYC---IQSNIVFSDEIPASQDMP 316
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3064 qcqwTRWINVDYPsssingGDIETYENIRANGEKiCEKPQDIRCMAQNYPgvNVDELGQKVKCDVNSGLVCYNKDQGGTF 3143
Cdd:pfam05109 317 ----TNTTDITYV------GDNATYSVPMVTSED-ANSPNVTVTAFWAWP--NNTETDFKCKWTLTSGTPSGCENISGAF 383
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3144 KMCYNYLIKVLCCSVSHCQGSTTPEGSVSTTETA----SSIPQTLSSKPLPNTSSFetwspSGPTTSmeaSEMPTFWTLR 3219
Cdd:pfam05109 384 ASNRTFDITVSGLGTAPKTLIITRTATNATTTTHkvifSKAPESTTTSPTLNTTGF-----AAPNTT---TGLPSSTHVP 455
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3220 TSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARSTSQGHSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTSEPS 3299
Cdd:pfam05109 456 TNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNAT 535
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3300 SPPHGGTTthPVTAASTtravtetlssrPHTKTTSELP--TVPSASSTAGPGTQTGSTSRPSTVTQPASSPETSSTTcqP 3377
Cdd:pfam05109 536 SPTLGKTS--PTSAVTT-----------PTPNATSPTPavTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETS--P 600
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3378 QCQWTKWIDVDYPSSSINGGDIETYENIRTNGEkicekpQDIRCMAQNHPGVNVDELGQKVKCDVNSGLVCYNKDQGGTF 3457
Cdd:pfam05109 601 QANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQ------HNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAH 674
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3458 QMCYNYLIKVLCCSVSHCQGSTTTPGSAPMTGSATTAtqtlssmplPNTSLTVTSPHSASTTAATSITETSSPRSVSSIH 3537
Cdd:pfam05109 675 PTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASG---------PGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQK 745
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3538 TTPKIFPET---LQSTTGltTQQTSSTGIWTSmltgtagtvsstslstslesttgiTEVSTSMPMTSTLPVTVTNATT-- 3612
Cdd:pfam05109 746 TAVPTVTSTggkANSTTG--GKHTTGHGARTS------------------------TEPTTDYGGDSTTPRTRYNATTyl 799
|
730
....*....|....*
gi 147905740 3613 -PQGTTLCQPKCKWT 3626
Cdd:pfam05109 800 pPSTSSKLRPRWTFT 814
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
798-859 |
4.47e-06 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation. :
Pssm-ID: 410995 Cd Length: 55 Bit Score: 46.54 E-value: 4.47e-06
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 147905740 798 CQAPMVYLDCNNAsvghhgaeCLKSCHTLD--VECfSTQCVSGCVCPSGLVADGNGGCIAEEDC 859
Cdd:cd19941 1 CPPNEVYSECGSA--------CPPTCANPNapPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| VWC |
smart00214 |
von Willebrand factor (vWF) type C domain; |
4570-4631 |
6.68e-05 |
|
von Willebrand factor (vWF) type C domain; :
Pssm-ID: 214564 Cd Length: 59 Bit Score: 43.66 E-value: 6.68e-05
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 147905740 4570 CTYNDTIYGVGTTFPGGPCHTCTCLSdgdqEPKVECKETNC--TTSCPQGFKyTLVPEQCCGEC 4631
Cdd:smart00214 1 CVHNGRVYNDGETWKPDPCQICTCLD----GTTVLCDPVECppPPDCPNPER-VKPPGECCPRC 59
|
|
| VWC_out |
smart00215 |
von Willebrand factor (vWF) type C domain; |
391-436 |
2.27e-04 |
|
von Willebrand factor (vWF) type C domain; :
Pssm-ID: 214565 Cd Length: 67 Bit Score: 42.16 E-value: 2.27e-04
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 147905740 391 CTHGGRIYAPGESFNTSCRSCTCFGGLWKCKDLPCPGTCSVQGGSH 436
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCGPKPCLLHNLS 46
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
888-1047 |
1.15e-36 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 137.92 E-value: 1.15e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 888 WECTNKPCMGACVAYGDGHFVTFDGERYIFEGSCEYTLAQDyCrgntSTNGTFRIVTENVPCGTTGTtCSKTIKIFVESY 967
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD-C----SSEPTFSVLLKNVPCGGGAT-CLKSVKVELNGD 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 968 ELILHEGNFKVVEQGPSGDPPYK-------IRFMGIFLVIEIRSGIV-VSWDRKTSVFVRLQQHYKGRVCGLCGNFDDNA 1039
Cdd:smart00216 75 EIELKDDNGKVTVNGQQVSLPYKtsdgsiqIRSSGGYLVVITSLGLIqVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEP 154
|
....*...
gi 147905740 1040 INDFTTRS 1047
Cdd:smart00216 155 EDDFRTPD 162
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
429-583 |
9.09e-35 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 132.11 E-value: 9.09e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 429 CSVQGGSHIFTFDEKLYNVHGDCSYILTKKC-ADSSFTVLVDLRKCGVTDTENCLKAVTLNLnsGDTVIRIQVNGAVFLN 507
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCsEEPDFSFSVTNKNCNGGASGVCLKSVTVIV--GDLEITLQKGGTVLVN 78
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 508 SIFTQMPLSAGNITIFKPSSFFIIVQTGMGLQLQVQLVPLMQVFVRLDRSYQGQMCGLCGNFNQNQADDFTALSGV 583
Cdd:pfam00094 79 GQKVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
418-582 |
6.59e-34 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 129.83 E-value: 6.59e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 418 WKCKDLPCPGTCSVQGGSHIFTFDEKLYNVHGDCSYILTKKCADS-SFTVLVDLRKCGVTDTenCLKAVTLNLNSGDTVI 496
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSEpTFSVLLKNVPCGGGAT--CLKSVKVELNGDEIEL 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 497 rIQVNGAVFLNSIFTQMPLSAGNITI-FKPSSFFIIVQTGMGLqLQVQLVPLMQVFVRLDRSYQGQMCGLCGNFNQNQAD 575
Cdd:smart00216 79 -KDDNGKVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGL-IQVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPED 156
|
....*..
gi 147905740 576 DFTALSG 582
Cdd:smart00216 157 DFRTPDG 163
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
2185-2273 |
7.36e-30 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 115.51 E-value: 7.36e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2185 WTKWIDVDYPSSSiNGGDIETYENIRANGeKICEKPQDIRCMAQNHPGVNVDELGQKVKCDVSSGLVCYNKDQGGtfQMC 2264
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQP--DGC 76
|
....*....
gi 147905740 2265 YNYLIKVLC 2273
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
2499-2587 |
7.36e-30 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 115.51 E-value: 7.36e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2499 WTKWIDVDYPSSSiNGGDIETYENIRANGeKICEKPQDIRCMAQNHPGVNVDELGQKVKCDVSSGLVCYNKDQGGtfQMC 2578
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQP--DGC 76
|
....*....
gi 147905740 2579 YNYLIKVLC 2587
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
1575-1663 |
7.36e-30 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 115.51 E-value: 7.36e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1575 WTKWIDVDYPSSSiNGGDIETYENIRANGeKICEKPQDIRCMAQNHPGVNVDELGQKVKCDVSSGLVCYNKDQGGtfQMC 1654
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQP--DGC 76
|
....*....
gi 147905740 1655 YNYLIKVLC 1663
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
3067-3155 |
1.01e-29 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 115.12 E-value: 1.01e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3067 WTRWINVDYPSSSiNGGDIETYENIRANGeKICEKPQDIRCMAQNYPGVNVDELGQKVKCDVNSGLVCYNKDQGGTfkMC 3146
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPD--GC 76
|
....*....
gi 147905740 3147 YNYLIKVLC 3155
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
1871-1959 |
1.47e-29 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 114.35 E-value: 1.47e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1871 WTKWIDVDYPSSSiNGGDIETYENIRANGeKICEKPQDIRCMAQNYPGVNVDKLGQKVKCDVSSGLVCYNKDQGGtfPMC 1950
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQP--DGC 76
|
....*....
gi 147905740 1951 YNYLIKVLC 1959
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
3625-3713 |
1.95e-29 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 113.97 E-value: 1.95e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3625 WTEWFDVDSPtSGSVKGDMETYENIRASGKtLCQAPEKIECRAENYPAVSIDKVGQVVSCNLETGLVCKNEDQKDDfkMC 3704
Cdd:pfam13330 1 WTPWFDVDNP-SGSGGGDFETLENLRAYGK-FCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPD--GC 76
|
....*....
gi 147905740 3705 FNYNIRVLC 3713
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
3381-3469 |
2.53e-29 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 113.97 E-value: 2.53e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3381 WTKWIDVDYPSSSiNGGDIETYENIRTNGeKICEKPQDIRCMAQNHPGVNVDELGQKVKCDVNSGLVCYNKDQGGtfQMC 3460
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQP--DGC 76
|
....*....
gi 147905740 3461 YNYLIKVLC 3469
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
1346-1432 |
4.03e-29 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 113.20 E-value: 4.03e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1346 WSIWYDESHPEAGMsGGDFETFENLRKKGhQVCQTPVGIECRAVKFPSMDLEKLGQKVNCDPSSGLSCFNSEQSPPLCHD 1425
Cdd:pfam13330 1 WTPWFDVDNPSGSG-GGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPDGCLD 78
|
....*..
gi 147905740 1426 YELRVLC 1432
Cdd:pfam13330 79 YEVRFLC 85
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
4111-4283 |
1.39e-28 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 114.81 E-value: 1.39e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 4111 PCHFHYECECFCSGWGHSHYLTFDGTSYTFLDNCTSVLMREIHPRHgNLSILAHSYYCGATtnvTSCPRALSVYYNSMEI 4190
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSEP-TFSVLLKNVPCGGG---ATCLKSVKVELNGDEI 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 4191 ILTisttesgKDESLIIWDQMWIRSGFSKNGVTVSLSGATTMSVNISTIG-TIITFDGNI-FQIWLPYRYFsSNTEGQCG 4268
Cdd:smart00216 77 ELK-------DDNGKVTVNGQQVSLPYKTSDGSIQIRSSGGYLVVITSLGlIQVTFDGLTlLSVQLPSKYR-GKTCGLCG 148
|
170
....*....|....*
gi 147905740 4269 TCTNSQIDDCRRPDG 4283
Cdd:smart00216 149 NFDGEPEDDFRTPDG 163
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
899-1047 |
2.64e-28 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 113.62 E-value: 2.64e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 899 CVAYGDGHFVTFDGERYIFEGSCEYTLAQDyCRGNTstNGTFRIVTENVPCGTTGTtCSKTIKIFVESYELILHEGNFKV 978
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKD-CSEEP--DFSFSVTNKNCNGGASGV-CLKSVTVIVGDLEITLQKGGTVL 76
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 979 VEQGPSGDPPYK----IRFMG---IFLVIEIRSGIVVSWDRKTSVFVRLQQHYKGRVCGLCGNFDDNAINDFTTRS 1047
Cdd:pfam00094 77 VNGQKVSLPYKSdggeVEILGsgfVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPD 152
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
3779-3867 |
4.63e-27 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 107.42 E-value: 4.63e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3779 WTEWFDADYPNpGPRGGDFEVYAVFREvGYIFCDQPKDIECRSEKEPDRPLETLEQVVQCDVRFGLICKNINQSGPLqyC 3858
Cdd:pfam13330 1 WTPWFDVDNPS-GSGGGDFETLENLRA-YGKFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPDG--C 76
|
....*....
gi 147905740 3859 DNYHVRLLC 3867
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
80-227 |
1.70e-26 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 108.64 E-value: 1.70e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 80 VCSTWGDFHYKTFDGDVFRFPGLCNYVFSSHCGaTYEDFNIQLRRGLEGSRPTVTY-VLLRAQGLVIELS--NGSVLVNG 156
Cdd:smart00216 11 TCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCS-SEPTFSVLLKNVPCGGGATCLKsVKVELNGDEIELKddNGKVTVNG 89
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 147905740 157 HREKLPYSRAGLLME---KSSGYVKISIRLVLTFLWNEEDSALLELDSKYINQTCGLCGDFNGLPAvSEFYTHN 227
Cdd:smart00216 90 QQVSLPYKTSDGSIQirsSGGYLVVITSLGLIQVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPE-DDFRTPD 162
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
81-225 |
4.86e-26 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 107.07 E-value: 4.86e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 81 CSTWGDFHYKTFDGDVFRFPGLCNYVFSSHCGATYED-FNIQLRRGLEGSRPTVTY-VLLRAQGLVIEL-SNGSVLVNGH 157
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFsFSVTNKNCNGGASGVCLKsVTVIVGDLEITLqKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 147905740 158 REKLPYSRAGLLMEKSSGY---VKISIRLVLTFLWNEEDSALLELDSKYINQTCGLCGDFNGLPAvSEFYT 225
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGfvvVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQE-DDFMT 150
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
2688-2776 |
1.59e-25 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 102.80 E-value: 1.59e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2688 WTDWLDSDKPQpGQFEGDIETYYHIQNKTgiKICKKPVDIECEAVLFPNISFQKLGQEVVCNVDFGLICRNSKQSDNqIC 2767
Cdd:pfam13330 1 WTPWFDVDNPS-GSGGGDFETLENLRAYG--KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPD-GC 76
|
....*....
gi 147905740 2768 FNYHIRVLC 2776
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
619-700 |
5.17e-25 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 101.26 E-value: 5.17e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 619 ENFAREWCSMLTESSGVFSACHATVSPVPFYSNCLFDTCNCENSEDCMCAALSSYVFACAAKGVLLSGWRDkvcykyTNN 698
Cdd:smart00832 1 KYYACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRT------PTF 74
|
..
gi 147905740 699 CP 700
Cdd:smart00832 75 CP 76
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
1084-1158 |
1.54e-24 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 99.72 E-value: 1.54e-24
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 147905740 1084 KSWAQKKCSIINSA--TFAACHSQVDSTKYYEACVHDVCACdsGGDCECFCTAVAAYAQACRDVGVCLS-WRTPDICP 1158
Cdd:smart00832 1 KYYACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCISpWRTPTFCP 76
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
4122-4284 |
2.33e-23 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 99.37 E-value: 2.33e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 4122 CSGWGHSHYLTFDGTSYTFLDNCTSVLMREIHPRHgNLSILAHSYYCGATTNVTsCPRALSVYYNSMEIILTisttesgk 4201
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEP-DFSFSVTNKNCNGGASGV-CLKSVTVIVGDLEITLQ-------- 70
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 4202 DESLIIWDQMWIRSGFSKNGVTVSLSGATTMSVNISTIGTI-ITFDGNIF-QIWLPYRYFSSnTEGQCGTCTNSQIDDCR 4279
Cdd:pfam00094 71 KGGTVLVNGQKVSLPYKSDGGEVEILGSGFVVVDLSPGVGLqVDGDGRGQlFVTLSPSYQGK-TCGLCGNYNGNQEDDFM 149
|
....*
gi 147905740 4280 RPDGT 4284
Cdd:pfam00094 150 TPDGT 154
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
1090-1157 |
6.46e-21 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 89.36 E-value: 6.46e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1090 KCSIINSAT-FAACHSQVDSTKYYEACVHDVCACdsGGDCECFCTAVAAYAQACRDVGVCL-SWRTPDIC 1157
Cdd:pfam08742 1 KCGLLSDSGpFAPCHSVVDPEPYFEACVYDMCSC--GGDDECLCAALAAYARACQAAGVCIgDWRTPTFC 68
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
625-692 |
2.70e-20 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 87.44 E-value: 2.70e-20
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 147905740 625 WCSMLTeSSGVFSACHATVSPVPFYSNCLFDTCNCENSEDCMCAALSSYVFACAAKGVLLSGWRDKVC 692
Cdd:pfam08742 1 KCGLLS-DSGPFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTF 67
|
|
| CT |
smart00041 |
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ... |
4708-4790 |
2.34e-15 |
|
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.
Pssm-ID: 214482 Cd Length: 82 Bit Score: 73.98 E-value: 2.34e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 4708 VHVNATVLRYKGCETE-VNITFCEGSCSGISKYSmeAQAMERQCTCCQESKVHDVAVTMQCPDGTVIQHTYTHIDECNCA 4786
Cdd:smart00041 1 KSPVRQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPDGSTVKKTVMHIEECGCE 78
|
....
gi 147905740 4787 PACS 4790
Cdd:smart00041 79 PNCP 82
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
265-329 |
1.33e-14 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 71.26 E-value: 1.33e-14
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 147905740 265 ICRRTLLGPAFDKCTALVDVSMYLDACVQDLCRCP---TCPCATFAEYSRQCAHAGGYPQNWRHPDLC 329
Cdd:pfam08742 1 KCGLLSDSGPFAPCHSVVDPEPYFEACVYDMCSCGgddECLCAALAAYARACQAAGVCIGDWRTPTFC 68
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
4336-4398 |
3.82e-14 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 70.10 E-value: 3.82e-14
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 4336 LCELMLSQ-VFAECHRLIPPDAFFRSCVSDHCN-ANITDMLCQSLEAYAALCRAQGVC-TNWRNAT 4398
Cdd:pfam08742 1 KCGLLSDSgPFAPCHSVVDPEPYFEACVYDMCScGGDDECLCAALAAYARACQAAGVCiGDWRTPT 66
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
333-389 |
3.41e-13 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 66.96 E-value: 3.41e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*..
gi 147905740 333 CPFNMEYRECSSPCVDTCSNPQRSQLCEDHCMDGCFCPPGTVLDDVRHlgCLPLEQC 389
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANPNAPPPCTKQCVEGCFCPEGYVRNSGGK--CVPPSQC 55
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
4337-4398 |
9.47e-13 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 66.21 E-value: 9.47e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 4337 CELMLSQ--VFAECHRLIPPDAFFRSCVSDHCNAN-ITDMLCQSLEAYAALCRAQGVC-TNWRNAT 4398
Cdd:smart00832 8 CGILLSPrgPFAACHSVVDPEPFFENCVYDTCACGgDCECLCDALAAYAAACAEAGVCiSPWRTPT 73
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
333-389 |
1.56e-12 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 65.10 E-value: 1.56e-12
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*..
gi 147905740 333 CPFNMEYRECSSPCVDTCSNPQRSQLCEDHCMDGCFCPPGTVLDDvrHLGCLPLEQC 389
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPPDVCPEPCVEGCVCPPGFVRNS--GGKCVPPSDC 55
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
275-329 |
1.46e-09 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 57.35 E-value: 1.46e-09
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 147905740 275 FDKCTALVDVSMYLDACVQDLCRCP---TCPCATFAEYSRQCAHAGGYPQNWRHPDLC 329
Cdd:smart00832 18 FAACHSVVDPEPFFENCVYDTCACGgdcECLCDALAAYAAACAEAGVCISPWRTPTFC 75
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1968-2181 |
2.46e-08 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 60.70 E-value: 2.46e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1968 STTPEGSVSTTETttatpqtLSSKPLPNTSSIETWSP--SGPTTSMEA------SEMPTFWTLRTSVSSP------PTTP 2033
Cdd:pfam05109 475 SPTPAGTTSGASP-------VTPSPSPRDNGTESKAPdmTSPTSAVTTptpnatSPTPAVTTPTPNATSPtlgktsPTSA 547
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2034 LTVTSVSTQHPKSLESSGSP--SVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTT 2111
Cdd:pfam05109 548 VTTPTPNATSPTPAVTTPTPnaTIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATS 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2112 T-----HPVTTASTT------RAVTETLSSRPHTKTTSELPTVSSVSSTAG----------PGTQTGSTS----RPNTVT 2166
Cdd:pfam05109 628 AvttgqHNITSSSTSsmslrpSSISETLSPSTSDNSTSHMPLLTSAHPTGGenitqvtpasTSTHHVSTSspapRPGTTS 707
|
250
....*....|....*
gi 147905740 2167 QpASSPETSSTTCQP 2181
Cdd:pfam05109 708 Q-ASGPGNSSTSTKP 721
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2282-2495 |
2.46e-08 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 60.70 E-value: 2.46e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2282 STTPEGSVSTTETttatpqtLSSKPLPNTSSIETWSP--SGPTTSMEA------SEMPTFWTLRTSVSSP------PTTP 2347
Cdd:pfam05109 475 SPTPAGTTSGASP-------VTPSPSPRDNGTESKAPdmTSPTSAVTTptpnatSPTPAVTTPTPNATSPtlgktsPTSA 547
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2348 LTVTSVSTQHPKSLESSGSP--SVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTT 2425
Cdd:pfam05109 548 VTTPTPNATSPTPAVTTPTPnaTIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATS 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2426 T-----HPVTTASTT------RAVTETLSSRPHTKTTSELPTVSSVSSTAG----------PGTQTGSTS----RPNTVT 2480
Cdd:pfam05109 628 AvttgqHNITSSSTSsmslrpSSISETLSPSTSDNSTSHMPLLTSAHPTGGenitqvtpasTSTHHVSTSspapRPGTTS 707
|
250
....*....|....*
gi 147905740 2481 QpASSPETSSTTCQP 2495
Cdd:pfam05109 708 Q-ASGPGNSSTSTKP 721
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
2827-3082 |
8.97e-08 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 58.43 E-value: 8.97e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2827 SRPSTPRTEKIVSIATARTTGSFPVSSLSTFPASPTSLLPTSEEPHSGKPSTSSWTASSNVPSTAKPTTSTGRQfPRTKT 2906
Cdd:pfam17823 84 TEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAA-PRAAI 162
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2907 TLWNSLPSSTPTMATRSIHPISTTMTTKNPQTLVTSDISKSNSMLPSRPHITASSVTSTSgsqisksySSTSLTTAGPSS 2986
Cdd:pfam17823 163 AAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHP--------AAGTALAAVGNS 234
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2987 PPHGGTTTHPVTTasTTRAVTETLssrphtktTSELPTVSSVSSTAGPGTQTGSTSRPSIVTQPVSSPETSSTTCQPQCQ 3066
Cdd:pfam17823 235 SPAAGTVTAAVGT--VTPAALATL--------AAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQAQ 304
|
250
....*....|....*..
gi 147905740 3067 W-TRWINVDYPSSSING 3082
Cdd:pfam17823 305 GpIIQVSTDQPVHNTAG 321
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2003-2181 |
4.53e-07 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 56.30 E-value: 4.53e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2003 SPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARSTSPGHSQPPSSPYATNSLSATT 2082
Cdd:COG3469 30 ASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGT 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2083 SKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTQTGSTSRP 2162
Cdd:COG3469 110 STVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATT 189
|
170
....*....|....*....
gi 147905740 2163 NTVTQPASSPETSSTTCQP 2181
Cdd:COG3469 190 ASGATTPSATTTATTTGPP 208
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2317-2495 |
4.53e-07 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 56.30 E-value: 4.53e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2317 SPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARSTSPGHSQPPSSPYATNSLSATT 2396
Cdd:COG3469 30 ASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGT 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2397 SKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTQTGSTSRP 2476
Cdd:COG3469 110 STVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATT 189
|
170
....*....|....*....
gi 147905740 2477 NTVTQPASSPETSSTTCQP 2495
Cdd:COG3469 190 ASGATTPSATTTATTTGPP 208
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
699-756 |
4.59e-07 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 49.62 E-value: 4.59e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 147905740 699 CPQTKGYFYSVSSCQPTCRSLsEADVTCsvPFVTVDGCTCPEGTFLNDKDHCVPVEEC 756
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
1687-1796 |
6.66e-07 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 55.87 E-value: 6.66e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1687 STPQTLSSKPLPNTSSFETWSPS---GPTTSMEASEMptfwtlrtsvSSPPTTPLTVTSVSTQHPKSleSSGSPSVVTA- 1762
Cdd:PLN02217 568 STNSTPTGSAASSNTTFSSDSPStvvAPSTSPPAGHL----------GSPPATPSKIVSPSTSPPAS--HLGSPSTTPSs 635
|
90 100 110
....*....|....*....|....*....|....*
gi 147905740 1763 -RSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTT 1796
Cdd:PLN02217 636 pESSIKVASTETASPESSIKVASTESSVSMVSMST 670
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1671-1862 |
1.10e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 54.76 E-value: 1.10e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1671 GSTTPEGSVSTTETATSTPQTLSSKPLPNTSSFETWSPSGPTTSMEASEMPTFWTLRTSVS--SPPTTPLTVTSVSTQHP 1748
Cdd:COG3469 21 TLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATatAAAAAATSTSATLVATS 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1749 KSLESSGSPSVVTARSTSQGYSLPPSSPHTTNSLSATTSKASLSS----MTTISTSLTTPGPSSPPHGGTTThpvTTAST 1824
Cdd:COG3469 101 TASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGstttTTTVSGTETATGGTTTTSTTTTT---TSAST 177
|
170 180 190
....*....|....*....|....*....|....*...
gi 147905740 1825 TRAVTETLSSRPHTKTTSelPTVSSASSTAGPGTQTFS 1862
Cdd:COG3469 178 TPSATTTATATTASGATT--PSATTTATTTGPPTPGLP 213
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1672-1855 |
1.53e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 54.92 E-value: 1.53e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1672 STTPEGSvstteTATSTPQTLSSKPLPNTSSFETWSPSGPTTSMEA------SEMPTFWTLRTSVSSP------PTTPLT 1739
Cdd:pfam05109 475 SPTPAGT-----TSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTptpnatSPTPAVTTPTPNATSPtlgktsPTSAVT 549
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1740 VTSVSTQHPKSLESSGSP--SVVTARSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTT- 1816
Cdd:pfam05109 550 TPTPNATSPTPAVTTPTPnaTIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAv 629
|
170 180 190 200
....*....|....*....|....*....|....*....|....*....
gi 147905740 1817 ----HPVTTASTT------RAVTETLSSRPHTKTTSELPTVSSASSTAG 1855
Cdd:pfam05109 630 ttgqHNITSSSTSsmslrpSSISETLSPSTSDNSTSHMPLLTSAHPTGG 678
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
2316-2485 |
1.65e-06 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 52.60 E-value: 1.65e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2316 WSPSGPTT-SMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTArsTSPGHSQPPSSpyatnslsa 2394
Cdd:PHA03255 24 WTSSGSSTaSAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTV--TSTGTTVTPVP--------- 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2395 TTSKASLSSMTTISTSLTTPGPSSpphGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTqTGSTS 2474
Cdd:PHA03255 93 TTSNASTINVTTKVTAQNITATEA---GTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKT-TAELP 168
|
170
....*....|.
gi 147905740 2475 RPNTVTQPASS 2485
Cdd:PHA03255 169 TVPDERQPSLS 179
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
2002-2171 |
1.65e-06 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 52.60 E-value: 1.65e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2002 WSPSGPTT-SMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTArsTSPGHSQPPSSpyatnslsa 2080
Cdd:PHA03255 24 WTSSGSSTaSAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTV--TSTGTTVTPVP--------- 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2081 TTSKASLSSMTTISTSLTTPGPSSpphGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTqTGSTS 2160
Cdd:PHA03255 93 TTSNASTINVTTKVTAQNITATEA---GTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKT-TAELP 168
|
170
....*....|.
gi 147905740 2161 RPNTVTQPASS 2171
Cdd:PHA03255 169 TVPDERQPSLS 179
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
2881-3053 |
3.35e-06 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 51.83 E-value: 3.35e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2881 WTASSNVPSTAKPTTSTgrqfprtkttlwNSLPSSTPTMATRSIHPISTTMTTKNPQTlvTSDISKSNSMLPSRPHITAS 2960
Cdd:PHA03255 24 WTSSGSSTASAGNVTGT------------TAVTTPSPSASGPSTNQSTTLTTTSAPIT--TTAILSTNTTTVTSTGTTVT 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2961 SVTSTS-GSQISKSYSSTSLTTAGPSSpphGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTqTG 3039
Cdd:PHA03255 90 PVPTTSnASTINVTTKVTAQNITATEA---GTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKT-TA 165
|
170
....*....|....
gi 147905740 3040 STSRPSIVTQPVSS 3053
Cdd:PHA03255 166 ELPTVPDERQPSLS 179
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
1742-1864 |
4.16e-06 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 50.34 E-value: 4.16e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1742 SVSTQHPKSlESSGSPSVVTARSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPV-- 1819
Cdd:pfam09595 54 NINKQHPEQ-EHHENPPLNEAAKEAPSESEDAPDIDPNNQHPSQDRSEAPPLEPAAKTKPSEHEPANPPDASNRLSPPda 132
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 147905740 1820 TTASTTRAVTETLSsrphtkTTSELPTVSSASSTAGPGTQTFSTT 1864
Cdd:pfam09595 133 STAAIREARTFRKP------STGKRNNPSSAQSDQSPPRANHEAI 171
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2909-3626 |
4.29e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 53.38 E-value: 4.29e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2909 WNSLPSSTPTMATRSiHPISTTMTTKNPQTLVTSDISKSNSMLPSRPHITASsVTSTSGSQISKSYSSTSLTTAG-PSSP 2987
Cdd:pfam05109 162 WDNCNSTNITAVVRA-QGLDVTLPLSLPTSAQDSNFSVKTEMLGNEIDIECI-MEDGEISQVLPGDNKFNITCSGyESHV 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2988 PHGG--TTTHPVTTA--STTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTQTGSTSrpsIVTQPVSSPETSSTTCQP 3063
Cdd:pfam05109 240 PSGGilTSTSPVATPipGTGYAYSLRLTPRPVSRFLGNNSILYVFYSGNGPKASGGDYC---IQSNIVFSDEIPASQDMP 316
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3064 qcqwTRWINVDYPsssingGDIETYENIRANGEKiCEKPQDIRCMAQNYPgvNVDELGQKVKCDVNSGLVCYNKDQGGTF 3143
Cdd:pfam05109 317 ----TNTTDITYV------GDNATYSVPMVTSED-ANSPNVTVTAFWAWP--NNTETDFKCKWTLTSGTPSGCENISGAF 383
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3144 KMCYNYLIKVLCCSVSHCQGSTTPEGSVSTTETA----SSIPQTLSSKPLPNTSSFetwspSGPTTSmeaSEMPTFWTLR 3219
Cdd:pfam05109 384 ASNRTFDITVSGLGTAPKTLIITRTATNATTTTHkvifSKAPESTTTSPTLNTTGF-----AAPNTT---TGLPSSTHVP 455
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3220 TSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARSTSQGHSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTSEPS 3299
Cdd:pfam05109 456 TNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNAT 535
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3300 SPPHGGTTthPVTAASTtravtetlssrPHTKTTSELP--TVPSASSTAGPGTQTGSTSRPSTVTQPASSPETSSTTcqP 3377
Cdd:pfam05109 536 SPTLGKTS--PTSAVTT-----------PTPNATSPTPavTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETS--P 600
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3378 QCQWTKWIDVDYPSSSINGGDIETYENIRTNGEkicekpQDIRCMAQNHPGVNVDELGQKVKCDVNSGLVCYNKDQGGTF 3457
Cdd:pfam05109 601 QANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQ------HNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAH 674
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3458 QMCYNYLIKVLCCSVSHCQGSTTTPGSAPMTGSATTAtqtlssmplPNTSLTVTSPHSASTTAATSITETSSPRSVSSIH 3537
Cdd:pfam05109 675 PTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASG---------PGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQK 745
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3538 TTPKIFPET---LQSTTGltTQQTSSTGIWTSmltgtagtvsstslstslesttgiTEVSTSMPMTSTLPVTVTNATT-- 3612
Cdd:pfam05109 746 TAVPTVTSTggkANSTTG--GKHTTGHGARTS------------------------TEPTTDYGGDSTTPRTRYNATTyl 799
|
730
....*....|....*
gi 147905740 3613 -PQGTTLCQPKCKWT 3626
Cdd:pfam05109 800 pPSTSSKLRPRWTFT 814
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
798-859 |
4.47e-06 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 46.54 E-value: 4.47e-06
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 147905740 798 CQAPMVYLDCNNAsvghhgaeCLKSCHTLD--VECfSTQCVSGCVCPSGLVADGNGGCIAEEDC 859
Cdd:cd19941 1 CPPNEVYSECGSA--------CPPTCANPNapPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3164-3377 |
4.48e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.83 E-value: 4.48e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3164 STTPEGSVSTTETASSIPQTLSSKPLPNTSSFETWSPSGPTTSmeasempTFWTLRTSVSSPPTTPLTVTSVSTQHPKSL 3243
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVS-------TTGSVVVAASGSAGSGTGTTAASSTAATSS 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3244 ESSGSPSVVTARSTSQGHSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTSEPSSPPHGGTTTHPVTAASTTRAVTET 3323
Cdd:COG3469 75 TTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVS 154
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 147905740 3324 LSSRPHTKTTSELPTVPSASSTAGPGTQTGSTSRPSTVTQPASSPETSSTTCQP 3377
Cdd:COG3469 155 GTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPP 208
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
2814-3056 |
5.11e-06 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 53.36 E-value: 5.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2814 SEEPIPSTSFTTQSRPSTPRTEKIVSIATARTTGSFPVSSLS--TFPASPTSLLPTSEEPHSGkPSTSSWTASSnvpSTA 2891
Cdd:COG5422 28 SKQLLPPRRLQRKLNPISIRNGADNDIINSESKESFGKYALGhqIFSSFSSSPKLFQRRNSAG-PITHSPSATS---STS 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2892 KPTTSTGRQF-PRTKTTLWNSL----------PSSTPTMATRS-IHPISTTMTTKNPqtLVTSDISKSNSMLP---SRPH 2956
Cdd:COG5422 104 SLNSNDGDQFsPASDSLSFNPSstqsrkdsgpGDGSPVQKRKNpLLPSSSTHGTHPP--IVFTDNNGSHAGAPnarSRKE 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2957 ITASSVTSTSGSQISKSYSSTSLTTAGPSSPPHggTTTHpvttastTRAVTETLSSRPHTKTTSELPTVSSVSSTagpgT 3036
Cdd:COG5422 182 IPSLGSQSMQLPSPHFRQKFSSSDTSNGFSYPS--IRKN-------SRHSSNSMPSFPHSSTAVLLKRHSGSSGA----S 248
|
250 260
....*....|....*....|
gi 147905740 3037 QTGSTSRPSIVTQPVSSPET 3056
Cdd:COG5422 249 LISSNITPSSSNSEAMSTSS 268
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1665-1864 |
5.37e-06 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 53.09 E-value: 5.37e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1665 SVSHCQGSTTPEGSVSTTETATSTPQTLSSKPLPNTSSFETWSPSGPTTSMEASEMPTFWTLRTSVSSPpttpltvTSVS 1744
Cdd:NF033849 318 STTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSS-------SGVS 390
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1745 TQHPKSLESSG--SPSVVTARSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTA 1822
Cdd:NF033849 391 GGFSGGIAGGGvtSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGT 470
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 147905740 1823 STTRAVTEtlsSRPHTKTTSELPTVSSASSTAGPGTQTFSTT 1864
Cdd:NF033849 471 SQGQSVGT---SESWSTSQSETDSVGDSTGTSESVSQGDGRS 509
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
709-756 |
3.06e-05 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 44.30 E-value: 3.06e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 147905740 709 VSSCQPTCRSLSEADVtCSVPfvTVDGCTCPEGTFLNDKDHCVPVEEC 756
Cdd:pfam01826 11 GSACPPTCANLSPPDV-CPEP--CVEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
798-859 |
4.85e-05 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 43.92 E-value: 4.85e-05
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 147905740 798 CQAPMVYLDCNNAsvghhgaeCLKSCHTLDVECF-STQCVSGCVCPSGLVADGNGGCIAEEDC 859
Cdd:pfam01826 1 CPANEVYSECGSA--------CPPTCANLSPPDVcPEPCVEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| VWC |
smart00214 |
von Willebrand factor (vWF) type C domain; |
4570-4631 |
6.68e-05 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214564 Cd Length: 59 Bit Score: 43.66 E-value: 6.68e-05
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 147905740 4570 CTYNDTIYGVGTTFPGGPCHTCTCLSdgdqEPKVECKETNC--TTSCPQGFKyTLVPEQCCGEC 4631
Cdd:smart00214 1 CVHNGRVYNDGETWKPDPCQICTCLD----GTTVLCDPVECppPPDCPNPER-VKPPGECCPRC 59
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
3163-3372 |
1.37e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.63 E-value: 1.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3163 GSTTPEGSvSTTETASSIPQTLSSKPLPNTSSFETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKS 3242
Cdd:PHA03307 104 GSPTPPGP-SSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEET 182
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3243 LESSGSPSvVTARSTSQGHSLPPSSPHTTNSLSATTS-------KASLSSMTTISTSLTTSEPSSPPHGGTTTHPVTAAS 3315
Cdd:PHA03307 183 ARAPSSPP-AEPPPSTPPAAASPRPPRRSSPISASASspapapgRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPA 261
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 147905740 3316 TTRAVTETLSSRPHTKTTSELPTVPSASSTAGPG--TQTGSTSRPSTVTQPASSPETSS 3372
Cdd:PHA03307 262 PITLPTRIWEASGWNGPSSRPGPASSSSSPRERSpsPSPSSPGSGPAPSSPRASSSSSS 320
|
|
| VWC_out |
smart00215 |
von Willebrand factor (vWF) type C domain; |
391-436 |
2.27e-04 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214565 Cd Length: 67 Bit Score: 42.16 E-value: 2.27e-04
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 147905740 391 CTHGGRIYAPGESFNTSCRSCTCFGGLWKCKDLPCPGTCSVQGGSH 436
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCGPKPCLLHNLS 46
|
|
| Cys_knot |
pfam00007 |
Cystine-knot domain; The family comprises glycoprotein hormones and the C-terminal domain of ... |
4710-4787 |
4.72e-04 |
|
Cystine-knot domain; The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerization.
Pssm-ID: 394966 Cd Length: 105 Bit Score: 42.39 E-value: 4.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 4710 VNATV-LRYKGCET--EVNITFCEGSCSGISKYSMEAQAMERQCTCCQESKVHDVAVTMQCPDGTVIQHTYTHIDECNCA 4786
Cdd:pfam00007 7 TNYTIsVEKEGCTSckTINTTICAGYCYTRDPVYKDGRRAVSQRVCTYRDVTYETVVLPGCPPGVDPTVTYPVALSCHCG 86
|
.
gi 147905740 4787 P 4787
Cdd:pfam00007 87 N 87
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
1736-1868 |
5.32e-04 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 44.89 E-value: 5.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1736 TPLTVTSVSTQHPKSLESSGSPSVVTARSTSQGYSLPPSSPHTTNSLSATTSkASLSSMTTISTSLTTPGPSSPPHGGTT 1815
Cdd:PHA03255 20 TSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTT-AILSTNTTTVTSTGTTVTPVPTTSNAS 98
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 147905740 1816 THPVTTASTTRAVTETLSSRPHTKTTSELPTVSSASSTAGPGTQTFSTTCQPQ 1868
Cdd:PHA03255 99 TINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPT 151
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
2822-3060 |
2.48e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 44.22 E-value: 2.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2822 SFTTQSRPSTPRteKIVSIATARTTGSFPVSSLST------FPASPtSLLPTSEEPHSGKPSTSSWT-ASSNVPSTAKPT 2894
Cdd:TIGR00927 105 SIAMENTPSPPR--RTAKITPTTPKNNYSPTAAGTervkedTPATP-SRALNHYISTSGRQRVKSYTpKPRGEVKSSSPT 181
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2895 TSTGRQFPRTKTTL---WNSLPSSTPTMATRSiHPISTTMTTKNPQTLVTSDISKSNS---------------MLPSRP- 2955
Cdd:TIGR00927 182 QTREKVRKYTPSPLgrmVNSYAPSTFMTMPRS-HGITPRTTVKDSEITATYKMLETNPskrtagkttptplkgMTDNTPt 260
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2956 ----HITASSVTSTSGSQISKSYSSTSLTTAGPSSPPHGGTTTHPVTTASttRAVTEtlssrpHTKTTSELPTVSSVSST 3031
Cdd:TIGR00927 261 fltrEVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLTTPQ--GTVLE------HTPATSEGQVTISIMTG 332
|
250 260
....*....|....*....|....*....
gi 147905740 3032 AGPGTQTGSTSRPSIVTqpvSSPETSSTT 3060
Cdd:TIGR00927 333 SSPAETKASTAAWKIRN---PLSRTSAPA 358
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
888-1047 |
1.15e-36 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 137.92 E-value: 1.15e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 888 WECTNKPCMGACVAYGDGHFVTFDGERYIFEGSCEYTLAQDyCrgntSTNGTFRIVTENVPCGTTGTtCSKTIKIFVESY 967
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD-C----SSEPTFSVLLKNVPCGGGAT-CLKSVKVELNGD 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 968 ELILHEGNFKVVEQGPSGDPPYK-------IRFMGIFLVIEIRSGIV-VSWDRKTSVFVRLQQHYKGRVCGLCGNFDDNA 1039
Cdd:smart00216 75 EIELKDDNGKVTVNGQQVSLPYKtsdgsiqIRSSGGYLVVITSLGLIqVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEP 154
|
....*...
gi 147905740 1040 INDFTTRS 1047
Cdd:smart00216 155 EDDFRTPD 162
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
429-583 |
9.09e-35 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 132.11 E-value: 9.09e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 429 CSVQGGSHIFTFDEKLYNVHGDCSYILTKKC-ADSSFTVLVDLRKCGVTDTENCLKAVTLNLnsGDTVIRIQVNGAVFLN 507
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCsEEPDFSFSVTNKNCNGGASGVCLKSVTVIV--GDLEITLQKGGTVLVN 78
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 508 SIFTQMPLSAGNITIFKPSSFFIIVQTGMGLQLQVQLVPLMQVFVRLDRSYQGQMCGLCGNFNQNQADDFTALSGV 583
Cdd:pfam00094 79 GQKVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
418-582 |
6.59e-34 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 129.83 E-value: 6.59e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 418 WKCKDLPCPGTCSVQGGSHIFTFDEKLYNVHGDCSYILTKKCADS-SFTVLVDLRKCGVTDTenCLKAVTLNLNSGDTVI 496
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSEpTFSVLLKNVPCGGGAT--CLKSVKVELNGDEIEL 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 497 rIQVNGAVFLNSIFTQMPLSAGNITI-FKPSSFFIIVQTGMGLqLQVQLVPLMQVFVRLDRSYQGQMCGLCGNFNQNQAD 575
Cdd:smart00216 79 -KDDNGKVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGL-IQVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPED 156
|
....*..
gi 147905740 576 DFTALSG 582
Cdd:smart00216 157 DFRTPDG 163
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
2185-2273 |
7.36e-30 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 115.51 E-value: 7.36e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2185 WTKWIDVDYPSSSiNGGDIETYENIRANGeKICEKPQDIRCMAQNHPGVNVDELGQKVKCDVSSGLVCYNKDQGGtfQMC 2264
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQP--DGC 76
|
....*....
gi 147905740 2265 YNYLIKVLC 2273
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
2499-2587 |
7.36e-30 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 115.51 E-value: 7.36e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2499 WTKWIDVDYPSSSiNGGDIETYENIRANGeKICEKPQDIRCMAQNHPGVNVDELGQKVKCDVSSGLVCYNKDQGGtfQMC 2578
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQP--DGC 76
|
....*....
gi 147905740 2579 YNYLIKVLC 2587
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
1575-1663 |
7.36e-30 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 115.51 E-value: 7.36e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1575 WTKWIDVDYPSSSiNGGDIETYENIRANGeKICEKPQDIRCMAQNHPGVNVDELGQKVKCDVSSGLVCYNKDQGGtfQMC 1654
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQP--DGC 76
|
....*....
gi 147905740 1655 YNYLIKVLC 1663
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
3067-3155 |
1.01e-29 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 115.12 E-value: 1.01e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3067 WTRWINVDYPSSSiNGGDIETYENIRANGeKICEKPQDIRCMAQNYPGVNVDELGQKVKCDVNSGLVCYNKDQGGTfkMC 3146
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPD--GC 76
|
....*....
gi 147905740 3147 YNYLIKVLC 3155
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
1871-1959 |
1.47e-29 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 114.35 E-value: 1.47e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1871 WTKWIDVDYPSSSiNGGDIETYENIRANGeKICEKPQDIRCMAQNYPGVNVDKLGQKVKCDVSSGLVCYNKDQGGtfPMC 1950
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQP--DGC 76
|
....*....
gi 147905740 1951 YNYLIKVLC 1959
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
3625-3713 |
1.95e-29 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 113.97 E-value: 1.95e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3625 WTEWFDVDSPtSGSVKGDMETYENIRASGKtLCQAPEKIECRAENYPAVSIDKVGQVVSCNLETGLVCKNEDQKDDfkMC 3704
Cdd:pfam13330 1 WTPWFDVDNP-SGSGGGDFETLENLRAYGK-FCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPD--GC 76
|
....*....
gi 147905740 3705 FNYNIRVLC 3713
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
3381-3469 |
2.53e-29 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 113.97 E-value: 2.53e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3381 WTKWIDVDYPSSSiNGGDIETYENIRTNGeKICEKPQDIRCMAQNHPGVNVDELGQKVKCDVNSGLVCYNKDQGGtfQMC 3460
Cdd:pfam13330 1 WTPWFDVDNPSGS-GGGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQP--DGC 76
|
....*....
gi 147905740 3461 YNYLIKVLC 3469
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
1346-1432 |
4.03e-29 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 113.20 E-value: 4.03e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1346 WSIWYDESHPEAGMsGGDFETFENLRKKGhQVCQTPVGIECRAVKFPSMDLEKLGQKVNCDPSSGLSCFNSEQSPPLCHD 1425
Cdd:pfam13330 1 WTPWFDVDNPSGSG-GGDFETLENLRAYG-KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPDGCLD 78
|
....*..
gi 147905740 1426 YELRVLC 1432
Cdd:pfam13330 79 YEVRFLC 85
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
4111-4283 |
1.39e-28 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 114.81 E-value: 1.39e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 4111 PCHFHYECECFCSGWGHSHYLTFDGTSYTFLDNCTSVLMREIHPRHgNLSILAHSYYCGATtnvTSCPRALSVYYNSMEI 4190
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSEP-TFSVLLKNVPCGGG---ATCLKSVKVELNGDEI 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 4191 ILTisttesgKDESLIIWDQMWIRSGFSKNGVTVSLSGATTMSVNISTIG-TIITFDGNI-FQIWLPYRYFsSNTEGQCG 4268
Cdd:smart00216 77 ELK-------DDNGKVTVNGQQVSLPYKTSDGSIQIRSSGGYLVVITSLGlIQVTFDGLTlLSVQLPSKYR-GKTCGLCG 148
|
170
....*....|....*
gi 147905740 4269 TCTNSQIDDCRRPDG 4283
Cdd:smart00216 149 NFDGEPEDDFRTPDG 163
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
899-1047 |
2.64e-28 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 113.62 E-value: 2.64e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 899 CVAYGDGHFVTFDGERYIFEGSCEYTLAQDyCRGNTstNGTFRIVTENVPCGTTGTtCSKTIKIFVESYELILHEGNFKV 978
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKD-CSEEP--DFSFSVTNKNCNGGASGV-CLKSVTVIVGDLEITLQKGGTVL 76
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 979 VEQGPSGDPPYK----IRFMG---IFLVIEIRSGIVVSWDRKTSVFVRLQQHYKGRVCGLCGNFDDNAINDFTTRS 1047
Cdd:pfam00094 77 VNGQKVSLPYKSdggeVEILGsgfVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPD 152
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
3779-3867 |
4.63e-27 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 107.42 E-value: 4.63e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3779 WTEWFDADYPNpGPRGGDFEVYAVFREvGYIFCDQPKDIECRSEKEPDRPLETLEQVVQCDVRFGLICKNINQSGPLqyC 3858
Cdd:pfam13330 1 WTPWFDVDNPS-GSGGGDFETLENLRA-YGKFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPDG--C 76
|
....*....
gi 147905740 3859 DNYHVRLLC 3867
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
80-227 |
1.70e-26 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 108.64 E-value: 1.70e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 80 VCSTWGDFHYKTFDGDVFRFPGLCNYVFSSHCGaTYEDFNIQLRRGLEGSRPTVTY-VLLRAQGLVIELS--NGSVLVNG 156
Cdd:smart00216 11 TCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCS-SEPTFSVLLKNVPCGGGATCLKsVKVELNGDEIELKddNGKVTVNG 89
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 147905740 157 HREKLPYSRAGLLME---KSSGYVKISIRLVLTFLWNEEDSALLELDSKYINQTCGLCGDFNGLPAvSEFYTHN 227
Cdd:smart00216 90 QQVSLPYKTSDGSIQirsSGGYLVVITSLGLIQVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPE-DDFRTPD 162
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
81-225 |
4.86e-26 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 107.07 E-value: 4.86e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 81 CSTWGDFHYKTFDGDVFRFPGLCNYVFSSHCGATYED-FNIQLRRGLEGSRPTVTY-VLLRAQGLVIEL-SNGSVLVNGH 157
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFsFSVTNKNCNGGASGVCLKsVTVIVGDLEITLqKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 147905740 158 REKLPYSRAGLLMEKSSGY---VKISIRLVLTFLWNEEDSALLELDSKYINQTCGLCGDFNGLPAvSEFYT 225
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGfvvVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQE-DDFMT 150
|
|
| Mucin2_WxxW |
pfam13330 |
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ... |
2688-2776 |
1.59e-25 |
|
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.
Pssm-ID: 463846 [Multi-domain] Cd Length: 85 Bit Score: 102.80 E-value: 1.59e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2688 WTDWLDSDKPQpGQFEGDIETYYHIQNKTgiKICKKPVDIECEAVLFPNISFQKLGQEVVCNVDFGLICRNSKQSDNqIC 2767
Cdd:pfam13330 1 WTPWFDVDNPS-GSGGGDFETLENLRAYG--KFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPD-GC 76
|
....*....
gi 147905740 2768 FNYHIRVLC 2776
Cdd:pfam13330 77 LDYEVRFLC 85
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
619-700 |
5.17e-25 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 101.26 E-value: 5.17e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 619 ENFAREWCSMLTESSGVFSACHATVSPVPFYSNCLFDTCNCENSEDCMCAALSSYVFACAAKGVLLSGWRDkvcykyTNN 698
Cdd:smart00832 1 KYYACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRT------PTF 74
|
..
gi 147905740 699 CP 700
Cdd:smart00832 75 CP 76
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
1084-1158 |
1.54e-24 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 99.72 E-value: 1.54e-24
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 147905740 1084 KSWAQKKCSIINSA--TFAACHSQVDSTKYYEACVHDVCACdsGGDCECFCTAVAAYAQACRDVGVCLS-WRTPDICP 1158
Cdd:smart00832 1 KYYACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCISpWRTPTFCP 76
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
4122-4284 |
2.33e-23 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 99.37 E-value: 2.33e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 4122 CSGWGHSHYLTFDGTSYTFLDNCTSVLMREIHPRHgNLSILAHSYYCGATTNVTsCPRALSVYYNSMEIILTisttesgk 4201
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEP-DFSFSVTNKNCNGGASGV-CLKSVTVIVGDLEITLQ-------- 70
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 4202 DESLIIWDQMWIRSGFSKNGVTVSLSGATTMSVNISTIGTI-ITFDGNIF-QIWLPYRYFSSnTEGQCGTCTNSQIDDCR 4279
Cdd:pfam00094 71 KGGTVLVNGQKVSLPYKSDGGEVEILGSGFVVVDLSPGVGLqVDGDGRGQlFVTLSPSYQGK-TCGLCGNYNGNQEDDFM 149
|
....*
gi 147905740 4280 RPDGT 4284
Cdd:pfam00094 150 TPDGT 154
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
1090-1157 |
6.46e-21 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 89.36 E-value: 6.46e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1090 KCSIINSAT-FAACHSQVDSTKYYEACVHDVCACdsGGDCECFCTAVAAYAQACRDVGVCL-SWRTPDIC 1157
Cdd:pfam08742 1 KCGLLSDSGpFAPCHSVVDPEPYFEACVYDMCSC--GGDDECLCAALAAYARACQAAGVCIgDWRTPTFC 68
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
625-692 |
2.70e-20 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 87.44 E-value: 2.70e-20
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 147905740 625 WCSMLTeSSGVFSACHATVSPVPFYSNCLFDTCNCENSEDCMCAALSSYVFACAAKGVLLSGWRDKVC 692
Cdd:pfam08742 1 KCGLLS-DSGPFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTF 67
|
|
| CT |
smart00041 |
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ... |
4708-4790 |
2.34e-15 |
|
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.
Pssm-ID: 214482 Cd Length: 82 Bit Score: 73.98 E-value: 2.34e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 4708 VHVNATVLRYKGCETE-VNITFCEGSCSGISKYSmeAQAMERQCTCCQESKVHDVAVTMQCPDGTVIQHTYTHIDECNCA 4786
Cdd:smart00041 1 KSPVRQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPDGSTVKKTVMHIEECGCE 78
|
....
gi 147905740 4787 PACS 4790
Cdd:smart00041 79 PNCP 82
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
265-329 |
1.33e-14 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 71.26 E-value: 1.33e-14
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 147905740 265 ICRRTLLGPAFDKCTALVDVSMYLDACVQDLCRCP---TCPCATFAEYSRQCAHAGGYPQNWRHPDLC 329
Cdd:pfam08742 1 KCGLLSDSGPFAPCHSVVDPEPYFEACVYDMCSCGgddECLCAALAAYARACQAAGVCIGDWRTPTFC 68
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
4336-4398 |
3.82e-14 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 70.10 E-value: 3.82e-14
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 4336 LCELMLSQ-VFAECHRLIPPDAFFRSCVSDHCN-ANITDMLCQSLEAYAALCRAQGVC-TNWRNAT 4398
Cdd:pfam08742 1 KCGLLSDSgPFAPCHSVVDPEPYFEACVYDMCScGGDDECLCAALAAYARACQAAGVCiGDWRTPT 66
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
333-389 |
3.41e-13 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 66.96 E-value: 3.41e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*..
gi 147905740 333 CPFNMEYRECSSPCVDTCSNPQRSQLCEDHCMDGCFCPPGTVLDDVRHlgCLPLEQC 389
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANPNAPPPCTKQCVEGCFCPEGYVRNSGGK--CVPPSQC 55
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
4337-4398 |
9.47e-13 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 66.21 E-value: 9.47e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 4337 CELMLSQ--VFAECHRLIPPDAFFRSCVSDHCNAN-ITDMLCQSLEAYAALCRAQGVC-TNWRNAT 4398
Cdd:smart00832 8 CGILLSPrgPFAACHSVVDPEPFFENCVYDTCACGgDCECLCDALAAYAAACAEAGVCiSPWRTPT 73
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
333-389 |
1.56e-12 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 65.10 E-value: 1.56e-12
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*..
gi 147905740 333 CPFNMEYRECSSPCVDTCSNPQRSQLCEDHCMDGCFCPPGTVLDDvrHLGCLPLEQC 389
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPPDVCPEPCVEGCVCPPGFVRNS--GGKCVPPSDC 55
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
275-329 |
1.46e-09 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 57.35 E-value: 1.46e-09
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 147905740 275 FDKCTALVDVSMYLDACVQDLCRCP---TCPCATFAEYSRQCAHAGGYPQNWRHPDLC 329
Cdd:smart00832 18 FAACHSVVDPEPFFENCVYDTCACGgdcECLCDALAAYAAACAEAGVCISPWRTPTFC 75
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1968-2181 |
2.46e-08 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 60.70 E-value: 2.46e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1968 STTPEGSVSTTETttatpqtLSSKPLPNTSSIETWSP--SGPTTSMEA------SEMPTFWTLRTSVSSP------PTTP 2033
Cdd:pfam05109 475 SPTPAGTTSGASP-------VTPSPSPRDNGTESKAPdmTSPTSAVTTptpnatSPTPAVTTPTPNATSPtlgktsPTSA 547
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2034 LTVTSVSTQHPKSLESSGSP--SVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTT 2111
Cdd:pfam05109 548 VTTPTPNATSPTPAVTTPTPnaTIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATS 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2112 T-----HPVTTASTT------RAVTETLSSRPHTKTTSELPTVSSVSSTAG----------PGTQTGSTS----RPNTVT 2166
Cdd:pfam05109 628 AvttgqHNITSSSTSsmslrpSSISETLSPSTSDNSTSHMPLLTSAHPTGGenitqvtpasTSTHHVSTSspapRPGTTS 707
|
250
....*....|....*
gi 147905740 2167 QpASSPETSSTTCQP 2181
Cdd:pfam05109 708 Q-ASGPGNSSTSTKP 721
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2282-2495 |
2.46e-08 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 60.70 E-value: 2.46e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2282 STTPEGSVSTTETttatpqtLSSKPLPNTSSIETWSP--SGPTTSMEA------SEMPTFWTLRTSVSSP------PTTP 2347
Cdd:pfam05109 475 SPTPAGTTSGASP-------VTPSPSPRDNGTESKAPdmTSPTSAVTTptpnatSPTPAVTTPTPNATSPtlgktsPTSA 547
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2348 LTVTSVSTQHPKSLESSGSP--SVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTT 2425
Cdd:pfam05109 548 VTTPTPNATSPTPAVTTPTPnaTIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATS 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2426 T-----HPVTTASTT------RAVTETLSSRPHTKTTSELPTVSSVSSTAG----------PGTQTGSTS----RPNTVT 2480
Cdd:pfam05109 628 AvttgqHNITSSSTSsmslrpSSISETLSPSTSDNSTSHMPLLTSAHPTGGenitqvtpasTSTHHVSTSspapRPGTTS 707
|
250
....*....|....*
gi 147905740 2481 QpASSPETSSTTCQP 2495
Cdd:pfam05109 708 Q-ASGPGNSSTSTKP 721
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
2827-3082 |
8.97e-08 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 58.43 E-value: 8.97e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2827 SRPSTPRTEKIVSIATARTTGSFPVSSLSTFPASPTSLLPTSEEPHSGKPSTSSWTASSNVPSTAKPTTSTGRQfPRTKT 2906
Cdd:pfam17823 84 TEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAA-PRAAI 162
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2907 TLWNSLPSSTPTMATRSIHPISTTMTTKNPQTLVTSDISKSNSMLPSRPHITASSVTSTSgsqisksySSTSLTTAGPSS 2986
Cdd:pfam17823 163 AAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHP--------AAGTALAAVGNS 234
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2987 PPHGGTTTHPVTTasTTRAVTETLssrphtktTSELPTVSSVSSTAGPGTQTGSTSRPSIVTQPVSSPETSSTTCQPQCQ 3066
Cdd:pfam17823 235 SPAAGTVTAAVGT--VTPAALATL--------AAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQAQ 304
|
250
....*....|....*..
gi 147905740 3067 W-TRWINVDYPSSSING 3082
Cdd:pfam17823 305 GpIIQVSTDQPVHNTAG 321
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
2825-3063 |
1.01e-07 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 58.05 E-value: 1.01e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2825 TQSRPSTPRTEKIVSIA-TARTTGSFPVSSLSTFPASPTSLLPTSEEPHSGKPSTSSWTASSNVPSTAKPTTSTGrqfpr 2903
Cdd:pfam17823 111 ASRALAAAASSSPSSAAqSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAA----- 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2904 TKTTLWNSLPSSTPTMATRSIHPISTTMTtknpqtlvtsdiSKSNSMLPSRPHITASSVTSTSGSQISKSYSSTSLTTAG 2983
Cdd:pfam17823 186 SSTTAASSAPTTAASSAPATLTPARGIST------------AATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAAL 253
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2984 PSSPPHGGTTTHPVTTASTTRAVTETLSSRPH--TKTTSELPTVSSVSSTAGPGTQTgSTSRPSIVTQPVSSPETSSTTC 3061
Cdd:pfam17823 254 ATLAAAAGTVASAAGTINMGDPHARRLSPAKHmpSDTMARNPAAPMGAQAQGPIIQV-STDQPVHNTAGEPTPSPSNTTL 332
|
..
gi 147905740 3062 QP 3063
Cdd:pfam17823 333 EP 334
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2920-3410 |
2.60e-07 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 57.23 E-value: 2.60e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2920 ATRSIhPISTTMTTKNPQTLVTSDIS-KSNSMLPSRPHITASSVTSTSGSQISKSYSST---SLTTAGPSSPPHGGTTTH 2995
Cdd:pfam05109 329 ATYSV-PMVTSEDANSPNVTVTAFWAwPNNTETDFKCKWTLTSGTPSGCENISGAFASNrtfDITVSGLGTAPKTLIITR 407
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2996 PVTTASTTRAVTeTLSSRPHTKTTSelPTVSSvSSTAGPGTQTG---STSRPSIVTQPVSSPETSSTTCqpqcqwtrwIN 3072
Cdd:pfam05109 408 TATNATTTTHKV-IFSKAPESTTTS--PTLNT-TGFAAPNTTTGlpsSTHVPTNLTAPASTGPTVSTAD---------VT 474
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3073 VDYPSSSINGGD-IETYENIRANGEKICEKPQDIRCMAQNYPGVNVDELGQKVKCDVNSglvcynkdqggtfkmcynyli 3151
Cdd:pfam05109 475 SPTPAGTTSGASpVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPN--------------------- 533
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3152 kvlccSVSHCQGSTTPEGSVSTTETASSIPQTLSSKPLPNtSSFETWSPSGPTTSMeasemptfwTLRTSVSSPPTTPLT 3231
Cdd:pfam05109 534 -----ATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPN-ATIPTLGKTSPTSAV---------TTPTPNATSPTVGET 598
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3232 VTSVSTQHPKSLESSGSPSVV------TARSTSQGHSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTSEPSSPPHGG 3305
Cdd:pfam05109 599 SPQANTTNHTLGGTSSTPVVTsppknaTSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGG 678
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3306 TTTHPVTAASTTRAVTETLSSRPHTKTTSElptvpsassTAGPGTQTGSTsRPSTVTQPASSPETSSTTCQ-PQCQWTKW 3384
Cdd:pfam05109 679 ENITQVTPASTSTHHVSTSSPAPRPGTTSQ---------ASGPGNSSTST-KPGEVNVTKGTPPKNATSPQaPSGQKTAV 748
|
490 500
....*....|....*....|....*....
gi 147905740 3385 IDVDY---PSSSINGGDIETYENIRTNGE 3410
Cdd:pfam05109 749 PTVTStggKANSTTGGKHTTGHGARTSTE 777
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2003-2181 |
4.53e-07 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 56.30 E-value: 4.53e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2003 SPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARSTSPGHSQPPSSPYATNSLSATT 2082
Cdd:COG3469 30 ASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGT 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2083 SKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTQTGSTSRP 2162
Cdd:COG3469 110 STVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATT 189
|
170
....*....|....*....
gi 147905740 2163 NTVTQPASSPETSSTTCQP 2181
Cdd:COG3469 190 ASGATTPSATTTATTTGPP 208
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2317-2495 |
4.53e-07 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 56.30 E-value: 4.53e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2317 SPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARSTSPGHSQPPSSPYATNSLSATT 2396
Cdd:COG3469 30 ASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGT 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2397 SKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTQTGSTSRP 2476
Cdd:COG3469 110 STVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATT 189
|
170
....*....|....*....
gi 147905740 2477 NTVTQPASSPETSSTTCQP 2495
Cdd:COG3469 190 ASGATTPSATTTATTTGPP 208
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
699-756 |
4.59e-07 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 49.62 E-value: 4.59e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 147905740 699 CPQTKGYFYSVSSCQPTCRSLsEADVTCsvPFVTVDGCTCPEGTFLNDKDHCVPVEEC 756
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
1687-1796 |
6.66e-07 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 55.87 E-value: 6.66e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1687 STPQTLSSKPLPNTSSFETWSPS---GPTTSMEASEMptfwtlrtsvSSPPTTPLTVTSVSTQHPKSleSSGSPSVVTA- 1762
Cdd:PLN02217 568 STNSTPTGSAASSNTTFSSDSPStvvAPSTSPPAGHL----------GSPPATPSKIVSPSTSPPAS--HLGSPSTTPSs 635
|
90 100 110
....*....|....*....|....*....|....*
gi 147905740 1763 -RSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTT 1796
Cdd:PLN02217 636 pESSIKVASTETASPESSIKVASTESSVSMVSMST 670
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1680-2131 |
7.10e-07 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 56.08 E-value: 7.10e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1680 STTETATSTPQTLSSKPLPNTSSFETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSV 1759
Cdd:pfam05109 408 TATNATTTTHKVIFSKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASP 487
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1760 VTARSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTK 1839
Cdd:pfam05109 488 VTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTP 567
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1840 TTSeLPTVSSASSTAGPGTQTFSTT------CQPQCQWTKWIDVDYPSSSINGGDIETYENIRANGEkicekpQDIRCMA 1913
Cdd:pfam05109 568 NAT-IPTLGKTSPTSAVTTPTPNATsptvgeTSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQ------HNITSSS 640
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1914 QNYPGVNVDKLGQKVKCDVSSGLVCYNKDQGGTFPMCYNYLIKVLCCSVSHCQGSTTpegsvsttetttatpqtlSSKPL 1993
Cdd:pfam05109 641 TSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTS------------------SPAPR 702
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1994 PNTSSiETWSPSGPTTSMEASEmptfwtLRTSVSSPP---TTPLTVTSVSTQHPKSLESSGSPSVVTARSTSPGH----S 2066
Cdd:pfam05109 703 PGTTS-QASGPGNSSTSTKPGE------VNVTKGTPPknaTSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHgartS 775
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 147905740 2067 QPPSSPYATNSlsaTTSKASLSSMTTISTSLTTpgpSSPPHGGTTTHPVTTASTTRAVTETLSSR 2131
Cdd:pfam05109 776 TEPTTDYGGDS---TTPRTRYNATTYLPPSTSS---KLRPRWTFTSPPVTTAQATVPVPPTSQPR 834
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
2352-2485 |
1.04e-06 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 52.26 E-value: 1.04e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2352 SVSTQHPKSlESSGSPSVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPV-- 2429
Cdd:pfam09595 54 NINKQHPEQ-EHHENPPLNEAAKEAPSESEDAPDIDPNNQHPSQDRSEAPPLEPAAKTKPSEHEPANPPDASNRLSPPda 132
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 2430 TTASTTRAVTETLSsrphtkTTSELPTVSSVSSTAGPGTQTGSTSRPNTVTQPASS 2485
Cdd:pfam09595 133 STAAIREARTFRKP------STGKRNNPSSAQSDQSPPRANHEAIGRANPFAMSST 182
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
2038-2171 |
1.04e-06 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 52.26 E-value: 1.04e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2038 SVSTQHPKSlESSGSPSVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPV-- 2115
Cdd:pfam09595 54 NINKQHPEQ-EHHENPPLNEAAKEAPSESEDAPDIDPNNQHPSQDRSEAPPLEPAAKTKPSEHEPANPPDASNRLSPPda 132
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 2116 TTASTTRAVTETLSsrphtkTTSELPTVSSVSSTAGPGTQTGSTSRPNTVTQPASS 2171
Cdd:pfam09595 133 STAAIREARTFRKP------STGKRNNPSSAQSDQSPPRANHEAIGRANPFAMSST 182
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1671-1862 |
1.10e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 54.76 E-value: 1.10e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1671 GSTTPEGSVSTTETATSTPQTLSSKPLPNTSSFETWSPSGPTTSMEASEMPTFWTLRTSVS--SPPTTPLTVTSVSTQHP 1748
Cdd:COG3469 21 TLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATatAAAAAATSTSATLVATS 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1749 KSLESSGSPSVVTARSTSQGYSLPPSSPHTTNSLSATTSKASLSS----MTTISTSLTTPGPSSPPHGGTTThpvTTAST 1824
Cdd:COG3469 101 TASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGstttTTTVSGTETATGGTTTTSTTTTT---TSAST 177
|
170 180 190
....*....|....*....|....*....|....*...
gi 147905740 1825 TRAVTETLSSRPHTKTTSelPTVSSASSTAGPGTQTFS 1862
Cdd:COG3469 178 TPSATTTATATTASGATT--PSATTTATTTGPPTPGLP 213
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
2306-2492 |
1.38e-06 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 54.58 E-value: 1.38e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2306 PLPNTSSIETWSPSGPTTSMEASEMPTfwtlRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVvtARSTSPGHSQPPSS 2385
Cdd:pfam17823 67 PAPVTLTKGTSAAHLNSTEVTAEHTPH----GTDLSEPATREGAADGAASRALAAAASSSPSSA--AQSLPAAIAALPSE 140
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2386 PYATNSLSATTSKASLSSMTTISTSlttpgpSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAG 2465
Cdd:pfam17823 141 AFSAPRAAACRANASAAPRAAIAAA------SAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGIST 214
|
170 180
....*....|....*....|....*...
gi 147905740 2466 PGTQTGSTSRPNTVTQ-PASSPETSSTT 2492
Cdd:pfam17823 215 AATATGHPAAGTALAAvGNSSPAAGTVT 242
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
1992-2178 |
1.38e-06 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 54.58 E-value: 1.38e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1992 PLPNTSSIETWSPSGPTTSMEASEMPTfwtlRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVvtARSTSPGHSQPPSS 2071
Cdd:pfam17823 67 PAPVTLTKGTSAAHLNSTEVTAEHTPH----GTDLSEPATREGAADGAASRALAAAASSSPSSA--AQSLPAAIAALPSE 140
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2072 PYATNSLSATTSKASLSSMTTISTSlttpgpSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAG 2151
Cdd:pfam17823 141 AFSAPRAAACRANASAAPRAAIAAA------SAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGIST 214
|
170 180
....*....|....*....|....*...
gi 147905740 2152 PGTQTGSTSRPNTVTQ-PASSPETSSTT 2178
Cdd:pfam17823 215 AATATGHPAAGTALAAvGNSSPAAGTVT 242
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1672-1855 |
1.53e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 54.92 E-value: 1.53e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1672 STTPEGSvstteTATSTPQTLSSKPLPNTSSFETWSPSGPTTSMEA------SEMPTFWTLRTSVSSP------PTTPLT 1739
Cdd:pfam05109 475 SPTPAGT-----TSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTptpnatSPTPAVTTPTPNATSPtlgktsPTSAVT 549
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1740 VTSVSTQHPKSLESSGSP--SVVTARSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTT- 1816
Cdd:pfam05109 550 TPTPNATSPTPAVTTPTPnaTIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAv 629
|
170 180 190 200
....*....|....*....|....*....|....*....|....*....
gi 147905740 1817 ----HPVTTASTT------RAVTETLSSRPHTKTTSELPTVSSASSTAG 1855
Cdd:pfam05109 630 ttgqHNITSSSTSsmslrpSSISETLSPSTSDNSTSHMPLLTSAHPTGG 678
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
2316-2485 |
1.65e-06 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 52.60 E-value: 1.65e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2316 WSPSGPTT-SMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTArsTSPGHSQPPSSpyatnslsa 2394
Cdd:PHA03255 24 WTSSGSSTaSAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTV--TSTGTTVTPVP--------- 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2395 TTSKASLSSMTTISTSLTTPGPSSpphGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTqTGSTS 2474
Cdd:PHA03255 93 TTSNASTINVTTKVTAQNITATEA---GTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKT-TAELP 168
|
170
....*....|.
gi 147905740 2475 RPNTVTQPASS 2485
Cdd:PHA03255 169 TVPDERQPSLS 179
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
2002-2171 |
1.65e-06 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 52.60 E-value: 1.65e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2002 WSPSGPTT-SMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTArsTSPGHSQPPSSpyatnslsa 2080
Cdd:PHA03255 24 WTSSGSSTaSAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTV--TSTGTTVTPVP--------- 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2081 TTSKASLSSMTTISTSLTTPGPSSpphGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTqTGSTS 2160
Cdd:PHA03255 93 TTSNASTINVTTKVTAQNITATEA---GTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKT-TAELP 168
|
170
....*....|.
gi 147905740 2161 RPNTVTQPASS 2171
Cdd:PHA03255 169 TVPDERQPSLS 179
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2814-3068 |
1.69e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 54.54 E-value: 1.69e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2814 SEEPIPSTSFTTQSRPS-TPRTEKIVSIATARTTGSFPVSSLSTFPASPTSLL----PTSEEPHSGKPSTSSwTASSNVP 2888
Cdd:pfam05109 475 SPTPAGTTSGASPVTPSpSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVttptPNATSPTLGKTSPTS-AVTTPTP 553
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2889 STAKPTTSTGRQFPRT------KTTLWNSLPSSTPTMATRSIHPISTTMTTKN-------PQTLVTSDISKSNSMLPSRP 2955
Cdd:pfam05109 554 NATSPTPAVTTPTPNAtiptlgKTSPTSAVTTPTPNATSPTVGETSPQANTTNhtlggtsSTPVVTSPPKNATSAVTTGQ 633
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2956 H-ITASSVTSTS--GSQISKSYS------STSLTTAGPSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSElptvs 3026
Cdd:pfam05109 634 HnITSSSTSSMSlrPSSISETLSpstsdnSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQ----- 708
|
250 260 270 280
....*....|....*....|....*....|....*....|..
gi 147905740 3027 svssTAGPGTQTGSTSRPSIVTQPVSSPETSSTTCQPQCQWT 3068
Cdd:pfam05109 709 ----ASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKT 746
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1671-1829 |
2.20e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 53.99 E-value: 2.20e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1671 GSTTPEGSVSTTETATSTPQTLSSKPLPNTSSFETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKS 1750
Cdd:COG3469 61 TGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASA 140
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 147905740 1751 LESSGSPSVVTARSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTPGPSSPPhGGTTTHPVTTASTTRAVT 1829
Cdd:COG3469 141 TSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATT-TATTTGPPTPGLPKHVLV 218
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2812-3077 |
2.50e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 54.15 E-value: 2.50e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2812 LPSEEPIPSTSFTTQSRPSTPRTEKIVSIATARTT-GSFPVsslsTFPASPTSLLPTSEEPHSGKPSTSSWTASsnvPST 2890
Cdd:pfam05109 448 LPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTsGASPV----TPSPSPRDNGTESKAPDMTSPTSAVTTPT---PNA 520
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2891 AKPTTSTGRQFPR-TKTTLWNSLPSSTPTMATRSIHPISTTMTTKNPqtlvtsdisksNSMLPSRPHITASSVTSTSGSQ 2969
Cdd:pfam05109 521 TSPTPAVTTPTPNaTSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTP-----------NATIPTLGKTSPTSAVTTPTPN 589
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2970 ISKSYSSTSLTTAGPSSPPHGGTTTHPVTTA---STTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTQTGSTSRPSI 3046
Cdd:pfam05109 590 ATSPTVGETSPQANTTNHTLGGTSSTPVVTSppkNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPL 669
|
250 260 270
....*....|....*....|....*....|.
gi 147905740 3047 VTQPVSSPETSSTTCQPQCQWTRWINVDYPS 3077
Cdd:pfam05109 670 LTSAHPTGGENITQVTPASTSTHHVSTSSPA 700
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
1667-1866 |
2.74e-06 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 53.43 E-value: 2.74e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1667 SHCQGSTTPEGSVSTTETATST-PQTLSSKPLPNTSSFETWSPSGPTTsmEASEMPTFWTLRTSVSSPPTTPLTVTS--- 1742
Cdd:pfam17823 92 PHGTDLSEPATREGAADGAASRaLAAAASSSPSSAAQSLPAAIAALPS--EAFSAPRAAACRANASAAPRAAIAAASaph 169
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1743 VSTQHPKSLESSGSPSVVTARSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTTISTSL-----TTPGPSSPPHGGTTTH 1817
Cdd:pfam17823 170 AASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALaavgnSSPAAGTVTAAVGTVT 249
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1818 PVTTASTTRAVTETLS-------SRPHTKTTS---ELPTVSSASSTAGP-GTQTFSTTCQ 1866
Cdd:pfam17823 250 PAALATLAAAAGTVASaagtinmGDPHARRLSpakHMPSDTMARNPAAPmGAQAQGPIIQ 309
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
1967-2184 |
2.79e-06 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 53.43 E-value: 2.79e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1967 GSTTPEGSVSTTETTTATPQTLSSKPLPNTSSIETWSPsgptTSMEASEMPTFWTLRTSVSSPPTTPlTVTSVSTQHP-- 2044
Cdd:pfam17823 136 ALPSEAFSAPRAAACRANASAAPRAAIAAASAPHAASP----APRTAASSTTAASSTTAASSAPTTA-ASSAPATLTPar 210
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2045 ---KSLESSGSPSVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTT 2121
Cdd:pfam17823 211 gisTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDT 290
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 2122 RAVTETLSSRPHTK-TTSELPTVSSVSSTAGPGTQTGS--TSRPNTVTQPASSPETSSTTCQPQCQ 2184
Cdd:pfam17823 291 MARNPAAPMGAQAQgPIIQVSTDQPVHNTAGEPTPSPSntTLEPNTPKSVASTNLAVVTTTKAQAK 356
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
2281-2498 |
2.79e-06 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 53.43 E-value: 2.79e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2281 GSTTPEGSVSTTETTTATPQTLSSKPLPNTSSIETWSPsgptTSMEASEMPTFWTLRTSVSSPPTTPlTVTSVSTQHP-- 2358
Cdd:pfam17823 136 ALPSEAFSAPRAAACRANASAAPRAAIAAASAPHAASP----APRTAASSTTAASSTTAASSAPTTA-ASSAPATLTPar 210
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2359 ---KSLESSGSPSVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTT 2435
Cdd:pfam17823 211 gisTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDT 290
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 2436 RAVTETLSSRPHTK-TTSELPTVSSVSSTAGPGTQTGS--TSRPNTVTQPASSPETSSTTCQPQCQ 2498
Cdd:pfam17823 291 MARNPAAPMGAQAQgPIIQVSTDQPVHNTAGEPTPSPSntTLEPNTPKSVASTNLAVVTTTKAQAK 356
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
2881-3053 |
3.35e-06 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 51.83 E-value: 3.35e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2881 WTASSNVPSTAKPTTSTgrqfprtkttlwNSLPSSTPTMATRSIHPISTTMTTKNPQTlvTSDISKSNSMLPSRPHITAS 2960
Cdd:PHA03255 24 WTSSGSSTASAGNVTGT------------TAVTTPSPSASGPSTNQSTTLTTTSAPIT--TTAILSTNTTTVTSTGTTVT 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2961 SVTSTS-GSQISKSYSSTSLTTAGPSSpphGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTqTG 3039
Cdd:PHA03255 90 PVPTTSnASTINVTTKVTAQNITATEA---GTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKT-TA 165
|
170
....*....|....
gi 147905740 3040 STSRPSIVTQPVSS 3053
Cdd:PHA03255 166 ELPTVPDERQPSLS 179
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
1742-1864 |
4.16e-06 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 50.34 E-value: 4.16e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1742 SVSTQHPKSlESSGSPSVVTARSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPV-- 1819
Cdd:pfam09595 54 NINKQHPEQ-EHHENPPLNEAAKEAPSESEDAPDIDPNNQHPSQDRSEAPPLEPAAKTKPSEHEPANPPDASNRLSPPda 132
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 147905740 1820 TTASTTRAVTETLSsrphtkTTSELPTVSSASSTAGPGTQTFSTT 1864
Cdd:pfam09595 133 STAAIREARTFRKP------STGKRNNPSSAQSDQSPPRANHEAI 171
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
1706-1847 |
4.17e-06 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 51.44 E-value: 4.17e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1706 WSPSGPTT-SMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTArsTSQGYSLPPSSPHTTNSLSA 1784
Cdd:PHA03255 24 WTSSGSSTaSAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTV--TSTGTTVTPVPTTSNASTIN 101
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 147905740 1785 TTSKASLSSMTTISTSLTTPGPSSPphGGTTTHPVTTASTTRAVTETLS--------SRPHTKTTSELPTV 1847
Cdd:PHA03255 102 VTTKVTAQNITATEAGTGTSTGVTS--NVTTRSSSTTSATTRITNATTLaptlsskgTSNATKTTAELPTV 170
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2097-2494 |
4.18e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 53.38 E-value: 4.18e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2097 LTTPGPSSPPHGGTTTHPVTTASTTRAVTeTLSSRPHTKTTSelPTVSSvSSTAGPGTQTG---STSRPNTVTQPASSPE 2173
Cdd:pfam05109 391 ITVSGLGTAPKTLIITRTATNATTTTHKV-IFSKAPESTTTS--PTLNT-TGFAAPNTTTGlpsSTHVPTNLTAPASTGP 466
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2174 TSSTTcqpqcqwtkwiDVDYPS---SSINGGDIETYENIRANGEKICEKPQDIRCMAQNHPGVNVDELGQKVKCDVSSgl 2250
Cdd:pfam05109 467 TVSTA-----------DVTSPTpagTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPN-- 533
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2251 vcynkdqggtfqmcynylikvlccSVSHCQGSTTPEGSVSTTETTTATPQTLSSKPLPNtSSIETWSPSGPTTSMeASEM 2330
Cdd:pfam05109 534 ------------------------ATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPN-ATIPTLGKTSPTSAV-TTPT 587
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2331 PTFWTLRTSVSSPP--TTPLTV-----TSVSTQHPKSLESSGSPSV--VTARSTSPGHSQPPSspyATNSLSATTSKASL 2401
Cdd:pfam05109 588 PNATSPTVGETSPQanTTNHTLggtssTPVVTSPPKNATSAVTTGQhnITSSSTSSMSLRPSS---ISETLSPSTSDNST 664
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2402 SSMTTIStslttpgpSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSElptvssvssTAGPGTQTGSTsRPNTVTQ 2481
Cdd:pfam05109 665 SHMPLLT--------SAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQ---------ASGPGNSSTST-KPGEVNV 726
|
410
....*....|...
gi 147905740 2482 PASSPETSSTTCQ 2494
Cdd:pfam05109 727 TKGTPPKNATSPQ 739
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2909-3626 |
4.29e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 53.38 E-value: 4.29e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2909 WNSLPSSTPTMATRSiHPISTTMTTKNPQTLVTSDISKSNSMLPSRPHITASsVTSTSGSQISKSYSSTSLTTAG-PSSP 2987
Cdd:pfam05109 162 WDNCNSTNITAVVRA-QGLDVTLPLSLPTSAQDSNFSVKTEMLGNEIDIECI-MEDGEISQVLPGDNKFNITCSGyESHV 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2988 PHGG--TTTHPVTTA--STTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTQTGSTSrpsIVTQPVSSPETSSTTCQP 3063
Cdd:pfam05109 240 PSGGilTSTSPVATPipGTGYAYSLRLTPRPVSRFLGNNSILYVFYSGNGPKASGGDYC---IQSNIVFSDEIPASQDMP 316
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3064 qcqwTRWINVDYPsssingGDIETYENIRANGEKiCEKPQDIRCMAQNYPgvNVDELGQKVKCDVNSGLVCYNKDQGGTF 3143
Cdd:pfam05109 317 ----TNTTDITYV------GDNATYSVPMVTSED-ANSPNVTVTAFWAWP--NNTETDFKCKWTLTSGTPSGCENISGAF 383
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3144 KMCYNYLIKVLCCSVSHCQGSTTPEGSVSTTETA----SSIPQTLSSKPLPNTSSFetwspSGPTTSmeaSEMPTFWTLR 3219
Cdd:pfam05109 384 ASNRTFDITVSGLGTAPKTLIITRTATNATTTTHkvifSKAPESTTTSPTLNTTGF-----AAPNTT---TGLPSSTHVP 455
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3220 TSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARSTSQGHSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTSEPS 3299
Cdd:pfam05109 456 TNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNAT 535
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3300 SPPHGGTTthPVTAASTtravtetlssrPHTKTTSELP--TVPSASSTAGPGTQTGSTSRPSTVTQPASSPETSSTTcqP 3377
Cdd:pfam05109 536 SPTLGKTS--PTSAVTT-----------PTPNATSPTPavTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETS--P 600
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3378 QCQWTKWIDVDYPSSSINGGDIETYENIRTNGEkicekpQDIRCMAQNHPGVNVDELGQKVKCDVNSGLVCYNKDQGGTF 3457
Cdd:pfam05109 601 QANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQ------HNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAH 674
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3458 QMCYNYLIKVLCCSVSHCQGSTTTPGSAPMTGSATTAtqtlssmplPNTSLTVTSPHSASTTAATSITETSSPRSVSSIH 3537
Cdd:pfam05109 675 PTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASG---------PGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQK 745
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3538 TTPKIFPET---LQSTTGltTQQTSSTGIWTSmltgtagtvsstslstslesttgiTEVSTSMPMTSTLPVTVTNATT-- 3612
Cdd:pfam05109 746 TAVPTVTSTggkANSTTG--GKHTTGHGARTS------------------------TEPTTDYGGDSTTPRTRYNATTyl 799
|
730
....*....|....*
gi 147905740 3613 -PQGTTLCQPKCKWT 3626
Cdd:pfam05109 800 pPSTSSKLRPRWTFT 814
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
798-859 |
4.47e-06 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 46.54 E-value: 4.47e-06
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 147905740 798 CQAPMVYLDCNNAsvghhgaeCLKSCHTLD--VECfSTQCVSGCVCPSGLVADGNGGCIAEEDC 859
Cdd:cd19941 1 CPPNEVYSECGSA--------CPPTCANPNapPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3164-3377 |
4.48e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.83 E-value: 4.48e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3164 STTPEGSVSTTETASSIPQTLSSKPLPNTSSFETWSPSGPTTSmeasempTFWTLRTSVSSPPTTPLTVTSVSTQHPKSL 3243
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVS-------TTGSVVVAASGSAGSGTGTTAASSTAATSS 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3244 ESSGSPSVVTARSTSQGHSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTSEPSSPPHGGTTTHPVTAASTTRAVTET 3323
Cdd:COG3469 75 TTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVS 154
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 147905740 3324 LSSRPHTKTTSELPTVPSASSTAGPGTQTGSTSRPSTVTQPASSPETSSTTCQP 3377
Cdd:COG3469 155 GTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPP 208
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
1666-1810 |
4.58e-06 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 51.44 E-value: 4.58e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1666 VSHCQGSTTPEGSVSTTETATSTPQTLSSKPLPNTSSFETWSPSGPTTSMEASEMPTfwtlrtsvSSPPTTPLTVTSVST 1745
Cdd:PHA03255 37 VTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPT--------TSNASTINVTTKVTA 108
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 1746 QHPKSLES-SGSPSVVTARSTSQGYSLPPSSPHTTNslsATTSKASLSSMTTISTSLTTPGPSSPP 1810
Cdd:PHA03255 109 QNITATEAgTGTSTGVTSNVTTRSSSTTSATTRITN---ATTLAPTLSSKGTSNATKTTAELPTVP 171
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
2814-3056 |
5.11e-06 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 53.36 E-value: 5.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2814 SEEPIPSTSFTTQSRPSTPRTEKIVSIATARTTGSFPVSSLS--TFPASPTSLLPTSEEPHSGkPSTSSWTASSnvpSTA 2891
Cdd:COG5422 28 SKQLLPPRRLQRKLNPISIRNGADNDIINSESKESFGKYALGhqIFSSFSSSPKLFQRRNSAG-PITHSPSATS---STS 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2892 KPTTSTGRQF-PRTKTTLWNSL----------PSSTPTMATRS-IHPISTTMTTKNPqtLVTSDISKSNSMLP---SRPH 2956
Cdd:COG5422 104 SLNSNDGDQFsPASDSLSFNPSstqsrkdsgpGDGSPVQKRKNpLLPSSSTHGTHPP--IVFTDNNGSHAGAPnarSRKE 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2957 ITASSVTSTSGSQISKSYSSTSLTTAGPSSPPHggTTTHpvttastTRAVTETLSSRPHTKTTSELPTVSSVSSTagpgT 3036
Cdd:COG5422 182 IPSLGSQSMQLPSPHFRQKFSSSDTSNGFSYPS--IRKN-------SRHSSNSMPSFPHSSTAVLLKRHSGSSGA----S 248
|
250 260
....*....|....*....|
gi 147905740 3037 QTGSTSRPSIVTQPVSSPET 3056
Cdd:COG5422 249 LISSNITPSSSNSEAMSTSS 268
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1665-1864 |
5.37e-06 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 53.09 E-value: 5.37e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1665 SVSHCQGSTTPEGSVSTTETATSTPQTLSSKPLPNTSSFETWSPSGPTTSMEASEMPTFWTLRTSVSSPpttpltvTSVS 1744
Cdd:NF033849 318 STTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSS-------SGVS 390
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1745 TQHPKSLESSG--SPSVVTARSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTA 1822
Cdd:NF033849 391 GGFSGGIAGGGvtSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGT 470
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 147905740 1823 STTRAVTEtlsSRPHTKTTSELPTVSSASSTAGPGTQTFSTT 1864
Cdd:NF033849 471 SQGQSVGT---SESWSTSQSETDSVGDSTGTSESVSQGDGRS 509
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1671-1868 |
6.38e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 52.61 E-value: 6.38e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1671 GSTTPEGSVSTTETATSTPQTLSSKPLPNtSSFETWSPSGPTTSMeASEMPTFWTLRTSVSSPP--TTPLTV-----TSV 1743
Cdd:pfam05109 540 GKTSPTSAVTTPTPNATSPTPAVTTPTPN-ATIPTLGKTSPTSAV-TTPTPNATSPTVGETSPQanTTNHTLggtssTPV 617
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1744 STQHPKSLESSGSPSV--VTARSTSQgYSLPPSSphTTNSLSATTSKASLSSMTTIStslttpgpSSPPHGGTTTHPVTT 1821
Cdd:pfam05109 618 VTSPPKNATSAVTTGQhnITSSSTSS-MSLRPSS--ISETLSPSTSDNSTSHMPLLT--------SAHPTGGENITQVTP 686
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 147905740 1822 ASTTRAVTETLSSRPHTKTTSELPTVSSASSTAGPGTQTFSTTCQPQ 1868
Cdd:pfam05109 687 ASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPK 733
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1990-2177 |
1.02e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 52.23 E-value: 1.02e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1990 SKPLPNTSSIETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARSTSPGHSQPP 2069
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2070 SSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSeLPTVSSVSST 2149
Cdd:pfam05109 502 KAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNAT-IPTLGKTSPT 580
|
170 180
....*....|....*....|....*...
gi 147905740 2150 AGPGTQTGSTSRPntvTQPASSPETSST 2177
Cdd:pfam05109 581 SAVTTPTPNATSP---TVGETSPQANTT 605
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
1671-1860 |
1.11e-05 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 51.50 E-value: 1.11e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1671 GSTTPEGSVSTTETATSTPQTLSSKPLPNTSSfeTWSPSGPTTSMEASEMPTfwtlRTSVSSPPTTPLTVTSVSTQHPKS 1750
Cdd:pfam17823 44 GDAVPRADNKSSEQ*NFCAATAAPAPVTLTKG--TSAAHLNSTEVTAEHTPH----GTDLSEPATREGAADGAASRALAA 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1751 LESSGSPSVvtARSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTTIST--SLTTPGPSSPPHGGTTTHPVTTASTTRAV 1828
Cdd:pfam17823 118 AASSSPSSA--AQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAasAPHAASPAPRTAASSTTAASSTTAASSAP 195
|
170 180 190
....*....|....*....|....*....|..
gi 147905740 1829 TETLSSRPHTKTTSElPTVSSASSTAGPGTQT 1860
Cdd:pfam17823 196 TTAASSAPATLTPAR-GISTAATATGHPAAGT 226
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
1680-1864 |
1.24e-05 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 51.50 E-value: 1.24e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1680 STTETATSTPQ-TLSSKPLPNTSSFETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTtpLTVTSVSTQHPKSLESSGSPS 1758
Cdd:pfam17823 83 STEVTAEHTPHgTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPS--EAFSAPRAAACRANASAAPRA 160
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1759 VVTARSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTTRAVTETLSSRPHT 1838
Cdd:pfam17823 161 AIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGT 240
|
170 180
....*....|....*....|....*.
gi 147905740 1839 KTTSELPTVSSASSTAGPGTQTFSTT 1864
Cdd:pfam17823 241 VTAAVGTVTPAALATLAAAAGTVASA 266
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2002-2176 |
1.53e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.86 E-value: 1.53e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2002 WSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARSTSPGHSQPPSSPYATNSLSAT 2081
Cdd:PHA03247 2715 LVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSES 2794
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2082 TSKASLSSMTTISTSLTTPGPSSPPhggTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSstagPGtqtGSTSR 2161
Cdd:PHA03247 2795 RESLPSPWDPADPPAAVLAPAAALP---PAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA----PG---GDVRR 2864
|
170
....*....|....*
gi 147905740 2162 PNTVTQPASSPETSS 2176
Cdd:PHA03247 2865 RPPSRSPAAKPAAPA 2879
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2316-2490 |
1.53e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.86 E-value: 1.53e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2316 WSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARSTSPGHSQPPSSPYATNSLSAT 2395
Cdd:PHA03247 2715 LVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSES 2794
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2396 TSKASLSSMTTISTSLTTPGPSSPPhggTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSstagPGtqtGSTSR 2475
Cdd:PHA03247 2795 RESLPSPWDPADPPAAVLAPAAALP---PAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA----PG---GDVRR 2864
|
170
....*....|....*
gi 147905740 2476 PNTVTQPASSPETSS 2490
Cdd:PHA03247 2865 RPPSRSPAAKPAAPA 2879
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
1672-1864 |
1.64e-05 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 51.11 E-value: 1.64e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1672 STTPEGSVSTTETATSTPQT-LSSKPLPNTSSFETWSPSGPTTS----MEASEMPTFWTLRTSVSSPPTTPlTVTSVSTQ 1746
Cdd:pfam17823 128 QSLPAAIAALPSEAFSAPRAaACRANASAAPRAAIAAASAPHAAspapRTAASSTTAASSTTAASSAPTTA-ASSAPATL 206
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1747 HPKSLESSGSPSVVTARSTSQGYSLPPSS--PHTTNSLSATTSKASLSSMTTISTSLTTPGPS---SPPHGGTTTHPVTT 1821
Cdd:pfam17823 207 TPARGISTAATATGHPAAGTALAAVGNSSpaAGTVTAAVGTVTPAALATLAAAAGTVASAAGTinmGDPHARRLSPAKHM 286
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 147905740 1822 ASTTRAVTETLSSRPHTK-TTSELPTVSSASSTAGPGTQTFSTT 1864
Cdd:pfam17823 287 PSDTMARNPAAPMGAQAQgPIIQVSTDQPVHNTAGEPTPSPSNT 330
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
3164-3377 |
2.01e-05 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 50.73 E-value: 2.01e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3164 STTPEGSVSTTETASSIPQTLS---------SKPLPNTSSFETWSPSGPTTS----MEASEMPTFWTLRTSVSSPPTTPl 3230
Cdd:pfam17823 120 SSSPSSAAQSLPAAIAALPSEAfsapraaacRANASAAPRAAIAAASAPHAAspapRTAASSTTAASSTTAASSAPTTA- 198
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3231 TVTSVSTQHPKSLESSGSPSVVTARSTSQGHSLPPSS--PHTTNSLSATTSKASLSSMTTistslttsepssppHGGTTT 3308
Cdd:pfam17823 199 ASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSpaAGTVTAAVGTVTPAALATLAA--------------AAGTVA 264
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 147905740 3309 HPVTAASTTRAVTETLSSRPH--TKTTSELPTVPSASSTAGPGTQTgSTSRPSTVTQPASSPETSSTTCQP 3377
Cdd:pfam17823 265 SAAGTINMGDPHARRLSPAKHmpSDTMARNPAAPMGAQAQGPIIQV-STDQPVHNTAGEPTPSPSNTTLEP 334
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
3157-3397 |
2.47e-05 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 50.34 E-value: 2.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3157 SVSHCQGSTTPEGSVSTTETASSIPQTLSSKPLPNTSSFETWSPSGPT-TSMEASEMPTFWTLRTSVSSPPTTPLTVTSV 3235
Cdd:pfam17823 165 ASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGIsTAATATGHPAAGTALAAVGNSSPAAGTVTAA 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3236 S-----------TQHPKSLESSG-------------SPSVVTARSTSQGHSLPPSSPHTTNSLSATTSKaslssmttIST 3291
Cdd:pfam17823 245 VgtvtpaalatlAAAAGTVASAAgtinmgdpharrlSPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTD--------QPV 316
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3292 SLTTSEPSSPPHGGT--TTHPVTAASTTRAVTETLSSRPHTKTTSELPTVPSASSTAGPGTQTGSTSRPSTVTQPASSPE 3369
Cdd:pfam17823 317 HNTAGEPTPSPSNTTlePNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMIPEVEATSPTTQPSPLLPTQGAAGPG 396
|
250 260
....*....|....*....|....*...
gi 147905740 3370 TSSTTCQPQCQWTKWIDVDYPSSSINGG 3397
Cdd:pfam17823 397 ILLAPEQVATEATAGTASAGPTPRSSGD 424
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
1996-2092 |
3.03e-05 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 50.47 E-value: 3.03e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1996 TSSIETWSPSGPTTSMEASEMPTfwtlRTSVSSPPTTPLTVTSVSTQHPKSleSSGSPSVVTARSTSP--GHSQPPSSPY 2073
Cdd:PLN02217 578 ASSNTTFSSDSPSTVVAPSTSPP----AGHLGSPPATPSKIVSPSTSPPAS--HLGSPSTTPSSPESSikVASTETASPE 651
|
90
....*....|....*....
gi 147905740 2074 ATNSLSATTSKASLSSMTT 2092
Cdd:PLN02217 652 SSIKVASTESSVSMVSMST 670
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
2310-2406 |
3.03e-05 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 50.47 E-value: 3.03e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2310 TSSIETWSPSGPTTSMEASEMPTfwtlRTSVSSPPTTPLTVTSVSTQHPKSleSSGSPSVVTARSTSP--GHSQPPSSPY 2387
Cdd:PLN02217 578 ASSNTTFSSDSPSTVVAPSTSPP----AGHLGSPPATPSKIVSPSTSPPAS--HLGSPSTTPSSPESSikVASTETASPE 651
|
90
....*....|....*....
gi 147905740 2388 ATNSLSATTSKASLSSMTT 2406
Cdd:PLN02217 652 SSIKVASTESSVSMVSMST 670
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
709-756 |
3.06e-05 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 44.30 E-value: 3.06e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 147905740 709 VSSCQPTCRSLSEADVtCSVPfvTVDGCTCPEGTFLNDKDHCVPVEEC 756
Cdd:pfam01826 11 GSACPPTCANLSPPDV-CPEP--CVEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2800-3063 |
3.25e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 50.30 E-value: 3.25e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2800 FPSTVSTSTTSK---------------LPSEEPIPSTSFTTQSRPSTPRTEKIVSIATARTT-GSFPVsslsTFPASPTS 2863
Cdd:pfam05109 421 FSKAPESTTTSPtlnttgfaapntttgLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTsGASPV----TPSPSPRD 496
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2864 LLPTSEEPHSGKPSTSSWTASsnvPSTAKPTTSTGRQFPR-TKTTLWNSLPSSTPTMATRSIHPISTTMTTKNPQTLVTS 2942
Cdd:pfam05109 497 NGTESKAPDMTSPTSAVTTPT---PNATSPTPAVTTPTPNaTSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPT 573
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2943 --DISKSNSMLPSRPHITASSVTSTSGSQISKSY-----SSTSLTTAGPSSPPHGGTT-THPVTTASTT------RAVTE 3008
Cdd:pfam05109 574 lgKTSPTSAVTTPTPNATSPTVGETSPQANTTNHtlggtSSTPVVTSPPKNATSAVTTgQHNITSSSTSsmslrpSSISE 653
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 147905740 3009 TLSSRPHTKTTSELPTVSSVSSTAG----------PGTQTGSTSRPSI---VTQPVSSPETSSTTCQP 3063
Cdd:pfam05109 654 TLSPSTSDNSTSHMPLLTSAHPTGGenitqvtpasTSTHHVSTSSPAPrpgTTSQASGPGNSSTSTKP 721
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
3170-3645 |
3.63e-05 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 49.96 E-value: 3.63e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3170 SVSTTETASSIPQTL-SSKPLPNTSSFETWSPSGPTTSMEASEMPTFwtlrtSVSSPPTTPLTVTS-VSTQHPKSLEssg 3247
Cdd:pfam17823 14 PLSESHAAPADPRHFvLNKMWNGAGKQNASGDAVPRADNKSSEQ*NF-----CAATAAPAPVTLTKgTSAAHLNSTE--- 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3248 spsvVTARSTsqghslppssPHTTNSLSATTSKASLSSMTtistslttsepSSPPHGGTTTHPVTAASTTRAVTETLSSR 3327
Cdd:pfam17823 86 ----VTAEHT----------PHGTDLSEPATREGAADGAA-----------SRALAAAASSSPSSAAQSLPAAIAALPSE 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3328 PHTKTTSElptVPSASSTAGPGTQTGSTSRPSTVTqPASSPETSSTTCqpqcqwtkwidvdyPSSSINGGDIETYENIRT 3407
Cdd:pfam17823 141 AFSAPRAA---ACRANASAAPRAAIAAASAPHAAS-PAPRTAASSTTA--------------ASSTTAASSAPTTAASSA 202
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3408 NGEKICEKPQDIRCMAQNHPGVNVdelgqkvkcdVNSGLVCYNKDQGGTFQMCYNYLIKVLCCSVSHCQGSTTTPGSAPM 3487
Cdd:pfam17823 203 PATLTPARGISTAATATGHPAAGT----------ALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINM 272
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3488 TGSATTATQTLSSMPLPNTSLTVTSPHSASTTAATSITETSSPRSVSSIHTTPKIFPETLQSTTGLTTQQTSSTGIWTSm 3567
Cdd:pfam17823 273 GDPHARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVVTTT- 351
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3568 ltgTAGTVSSTSLSTSLESTTGITEVSTSMPMTSTLPVTVTNATTPQGTTLC--QPKCKWTEWFDVDSPTSGSvKGDMET 3645
Cdd:pfam17823 352 ---KAQAKEPSASPVPVLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLApeQVATEATAGTASAGPTPRS-SGDPKT 427
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
1995-2130 |
4.49e-05 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 47.26 E-value: 4.49e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1995 NTSSIETWSPSGPTTSMEASEMPTFwTLRTSVSSPPTTPLTVTSVSTQHPkSLESSGSPSVVTARSTSPGHSQPPSSPYA 2074
Cdd:pfam09595 46 IITDIIDININKQHPEQEHHENPPL-NEAAKEAPSESEDAPDIDPNNQHP-SQDRSEAPPLEPAAKTKPSEHEPANPPDA 123
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 147905740 2075 TNSLS---ATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHpVTTASTTRAVTETLSS 2130
Cdd:pfam09595 124 SNRLSppdASTAAIREARTFRKPSTGKRNNPSSAQSDQSPPR-ANHEAIGRANPFAMSS 181
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
2309-2444 |
4.49e-05 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 47.26 E-value: 4.49e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2309 NTSSIETWSPSGPTTSMEASEMPTFwTLRTSVSSPPTTPLTVTSVSTQHPkSLESSGSPSVVTARSTSPGHSQPPSSPYA 2388
Cdd:pfam09595 46 IITDIIDININKQHPEQEHHENPPL-NEAAKEAPSESEDAPDIDPNNQHP-SQDRSEAPPLEPAAKTKPSEHEPANPPDA 123
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 147905740 2389 TNSLS---ATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHpVTTASTTRAVTETLSS 2444
Cdd:pfam09595 124 SNRLSppdASTAAIREARTFRKPSTGKRNNPSSAQSDQSPPR-ANHEAIGRANPFAMSS 181
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2827-3043 |
4.64e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 49.75 E-value: 4.64e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2827 SRPSTPRTEKIVSIATARTTGSFPVSSLSTFPASPTSLLPTSEEPHSGKPSTSSWTASSNVPSTAKPTTSTgrqfprTKT 2906
Cdd:COG3469 5 STAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSST------TST 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2907 TLWNSLPSSTPTMATRSIHPISTTMTTKNPQTLVTSDISKSNSMLPSRPHITASSVTSTSGSQISKSYSSTSLTTAGPSS 2986
Cdd:COG3469 79 TATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2987 PPHGGTTTHPVT---TASTTRAVTetlSSRPHTKTTSELPTVSSVSSTAGPGTQTGSTSR 3043
Cdd:COG3469 159 ATGGTTTTSTTTtttSASTTPSAT---TTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
2301-2495 |
4.65e-05 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 49.57 E-value: 4.65e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2301 TLSSKPLPNTSSIETWSPSGPTTSMEASEMPTFWTLRTSVSSPPT--TPLTVTSVSTQHPKSLESSGSPSVVTARSTSPG 2378
Cdd:pfam17823 125 SAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAPHaaSPAPRTAASSTTAASSTTAASSAPTTAASSAPA 204
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2379 HSQPPS----SPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPP--------HGGTTTHPVTTASTTRAVTETLSSRP 2446
Cdd:pfam17823 205 TLTPARgistAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPaalatlaaAAGTVASAAGTINMGDPHARRLSPAK 284
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 147905740 2447 H--TKTTSELPTVSSVSSTAGPGTQTgSTSRPNTVTQPASSPETSSTTCQP 2495
Cdd:pfam17823 285 HmpSDTMARNPAAPMGAQAQGPIIQV-STDQPVHNTAGEPTPSPSNTTLEP 334
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
1987-2181 |
4.65e-05 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 49.57 E-value: 4.65e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1987 TLSSKPLPNTSSIETWSPSGPTTSMEASEMPTFWTLRTSVSSPPT--TPLTVTSVSTQHPKSLESSGSPSVVTARSTSPG 2064
Cdd:pfam17823 125 SAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAPHaaSPAPRTAASSTTAASSTTAASSAPTTAASSAPA 204
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2065 HSQPPS----SPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPP--------HGGTTTHPVTTASTTRAVTETLSSRP 2132
Cdd:pfam17823 205 TLTPARgistAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPaalatlaaAAGTVASAAGTINMGDPHARRLSPAK 284
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 147905740 2133 H--TKTTSELPTVSSVSSTAGPGTQTgSTSRPNTVTQPASSPETSSTTCQP 2181
Cdd:pfam17823 285 HmpSDTMARNPAAPMGAQAQGPIIQV-STDQPVHNTAGEPTPSPSNTTLEP 334
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
2346-2511 |
4.81e-05 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 47.98 E-value: 4.81e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2346 TPLTVTSvSTQHPKSLESSGSPSVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPphggtT 2425
Cdd:PHA03255 20 TSLIWTS-SGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVP-----T 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2426 THPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTQTGSTSRPNTVTQPASSPETSSTTCQPQCQWTKWIDV 2505
Cdd:PHA03255 94 TSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDE 173
|
....*.
gi 147905740 2506 DYPSSS 2511
Cdd:PHA03255 174 RQPSLS 179
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
2032-2197 |
4.81e-05 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 47.98 E-value: 4.81e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2032 TPLTVTSvSTQHPKSLESSGSPSVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPphggtT 2111
Cdd:PHA03255 20 TSLIWTS-SGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVP-----T 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2112 THPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTQTGSTSRPNTVTQPASSPETSSTTCQPQCQWTKWIDV 2191
Cdd:PHA03255 94 TSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDE 173
|
....*.
gi 147905740 2192 DYPSSS 2197
Cdd:PHA03255 174 RQPSLS 179
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
798-859 |
4.85e-05 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 43.92 E-value: 4.85e-05
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 147905740 798 CQAPMVYLDCNNAsvghhgaeCLKSCHTLDVECF-STQCVSGCVCPSGLVADGNGGCIAEEDC 859
Cdd:pfam01826 1 CPANEVYSECGSA--------CPPTCANLSPPDVcPEPCVEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2281-2483 |
5.28e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 49.37 E-value: 5.28e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2281 GSTTPEGSVSTTETTTATPQTLSSKPLPNTSSIETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKS 2360
Cdd:COG3469 21 TLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATS 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2361 LESSGSPSVVTARSTSPGhSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTTravte 2440
Cdd:COG3469 101 TASGANTGTSTVTTTSTG-AGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTS----- 174
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 147905740 2441 TLSSRPHTKTTSelpTVSSVSSTAGPGTQTGSTSRPNTVTQPA 2483
Cdd:COG3469 175 ASTTPSATTTAT---ATTASGATTPSATTTATTTGPPTPGLPK 214
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1967-2169 |
5.28e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 49.37 E-value: 5.28e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1967 GSTTPEGSVSTTETTTATPQTLSSKPLPNTSSIETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKS 2046
Cdd:COG3469 21 TLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATS 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2047 LESSGSPSVVTARSTSPGhSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTTravte 2126
Cdd:COG3469 101 TASGANTGTSTVTTTSTG-AGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTS----- 174
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 147905740 2127 TLSSRPHTKTTSelpTVSSVSSTAGPGTQTGSTSRPNTVTQPA 2169
Cdd:COG3469 175 ASTTPSATTTAT---ATTASGATTPSATTTATTTGPPTPGLPK 214
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1989-2181 |
5.30e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 49.77 E-value: 5.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1989 SSKPLPNTSSIETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARSTSPGHSQP 2068
Cdd:pfam03154 144 TSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQS 223
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2069 PSSPYATNSLSATTSKASLSSMTTISTSLTTPGPssPPHGGTTTHPVTTASTT-----RAVTETLSSRPHTKTTSELPTV 2143
Cdd:pfam03154 224 TAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPP--PSQVSPQPLPQPSLHGQmppmpHSLQTGPSHMQHPVPPQPFPLT 301
|
170 180 190
....*....|....*....|....*....|....*...
gi 147905740 2144 SSVSSTAGPGTQTGSTSRPNTVTQPASSPETSSTTCQP 2181
Cdd:pfam03154 302 PQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP 339
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
2303-2495 |
5.30e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 49.77 E-value: 5.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2303 SSKPLPNTSSIETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARSTSPGHSQP 2382
Cdd:pfam03154 144 TSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQS 223
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2383 PSSPYATNSLSATTSKASLSSMTTISTSLTTPGPssPPHGGTTTHPVTTASTT-----RAVTETLSSRPHTKTTSELPTV 2457
Cdd:pfam03154 224 TAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPP--PSQVSPQPLPQPSLHGQmppmpHSLQTGPSHMQHPVPPQPFPLT 301
|
170 180 190
....*....|....*....|....*....|....*...
gi 147905740 2458 SSVSSTAGPGTQTGSTSRPNTVTQPASSPETSSTTCQP 2495
Cdd:pfam03154 302 PQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQP 339
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
2304-2495 |
5.47e-05 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 49.57 E-value: 5.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2304 SKPLPNTSSIETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTtpLTVTSVSTQHPKSLESSGSPSVVTARSTSPGHSQPP 2383
Cdd:pfam17823 98 SEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPS--EAFSAPRAAACRANASAAPRAAIAAASAPHAASPAP 175
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2384 SSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSP----------PHGGTTTHPVTTASTtraVTETLSSRPHTKTTSE 2453
Cdd:pfam17823 176 RTAASSTTAASSTTAASSAPTTAASSAPATLTPARGistaatatghPAAGTALAAVGNSSP---AAGTVTAAVGTVTPAA 252
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 147905740 2454 LPTVSSVSSTAGPGTQTGSTSRPNTVT-QPASSPETSSTTCQP 2495
Cdd:pfam17823 253 LATLAAAAGTVASAAGTINMGDPHARRlSPAKHMPSDTMARNP 295
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
1990-2181 |
5.47e-05 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 49.57 E-value: 5.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1990 SKPLPNTSSIETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTtpLTVTSVSTQHPKSLESSGSPSVVTARSTSPGHSQPP 2069
Cdd:pfam17823 98 SEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPS--EAFSAPRAAACRANASAAPRAAIAAASAPHAASPAP 175
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2070 SSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSP----------PHGGTTTHPVTTASTtraVTETLSSRPHTKTTSE 2139
Cdd:pfam17823 176 RTAASSTTAASSTTAASSAPTTAASSAPATLTPARGistaatatghPAAGTALAAVGNSSP---AAGTVTAAVGTVTPAA 252
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 147905740 2140 LPTVSSVSSTAGPGTQTGSTSRPNTVT-QPASSPETSSTTCQP 2181
Cdd:pfam17823 253 LATLAAAAGTVASAAGTINMGDPHARRlSPAKHMPSDTMARNP 295
|
|
| Pneumo_att_G |
pfam05539 |
Pneumovirinae attachment membrane glycoprotein G; |
1661-1864 |
6.68e-05 |
|
Pneumovirinae attachment membrane glycoprotein G;
Pssm-ID: 114270 [Multi-domain] Cd Length: 408 Bit Score: 48.89 E-value: 6.68e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1661 VLCCSVSHCQGSTTPEGSVSTTETATSTPQTLSSKPlpntssfetwsPSGPTTSMEASE-----MPTFWTLRTSVSSPPT 1735
Cdd:pfam05539 133 ALCESGPICQRDYNPRDRPKCRCTLRGKDVSCCKEP-----------KTAVTTSKTTSWptevsHPTYPSQVTPQSQPAT 201
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1736 TPLTvTSVSTQHPKSLESSGSPSVVTARSTSQGYSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTT 1815
Cdd:pfam05539 202 QGHQ-TATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNR 280
|
170 180 190 200
....*....|....*....|....*....|....*....|....*....
gi 147905740 1816 THPVTTASTTRAVTETLSSRPHTKTTSELPTVSSASSTAGPGTQTFSTT 1864
Cdd:pfam05539 281 RSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTT 329
|
|
| VWC |
smart00214 |
von Willebrand factor (vWF) type C domain; |
4570-4631 |
6.68e-05 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214564 Cd Length: 59 Bit Score: 43.66 E-value: 6.68e-05
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 147905740 4570 CTYNDTIYGVGTTFPGGPCHTCTCLSdgdqEPKVECKETNC--TTSCPQGFKyTLVPEQCCGEC 4631
Cdd:smart00214 1 CVHNGRVYNDGETWKPDPCQICTCLD----GTTVLCDPVECppPPDCPNPER-VKPPGECCPRC 59
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
2817-3063 |
7.66e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 49.40 E-value: 7.66e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2817 PIPSTSFTTQSrPSTPRTEKIVSIATARTTGSFPV--SSLSTFPASPTSLLPTSEEphSGKPSTSSWTASSNVP-STAKP 2893
Cdd:PHA03307 175 PLSSPEETARA-PSSPPAEPPPSTPPAAASPRPPRrsSPISASASSPAPAPGRSAA--DDAGASSSDSSSSESSgCGWGP 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2894 TTSTGRQFPRtkttlwnslPSSTPTMATRSIHPISTTmTTKNPQTLVTSDISKSNSMLPSRPHITASSVTSTSGSQISKS 2973
Cdd:PHA03307 252 ENECPLPRPA---------PITLPTRIWEASGWNGPS-SRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSS 321
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2974 ySSTSLTTAGPSSPPHGGTTTHPvtTASTTRAVTETL---SSRPHTKTTSELPTVSSVSSTAGPGTQTGSTSRPSIVTQP 3050
Cdd:PHA03307 322 -RESSSSSTSSSSESSRGAAVSP--GPSPSRSPSPSRpppPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRA 398
|
250
....*....|...
gi 147905740 3051 VSSPETSSTTCQP 3063
Cdd:PHA03307 399 RRRDATGRFPAGR 411
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1967-2180 |
8.03e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 49.14 E-value: 8.03e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1967 GSTTPEGSVSTTETTTATPQTLSSKPLPNtSSIETWSPSGPTTSMeASEMPTFWTLRTSVSSPP--TTPLTV-----TSV 2039
Cdd:pfam05109 540 GKTSPTSAVTTPTPNATSPTPAVTTPTPN-ATIPTLGKTSPTSAV-TTPTPNATSPTVGETSPQanTTNHTLggtssTPV 617
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2040 STQHPKSLESSGSPSV--VTARSTSPGHSQPPSspyATNSLSATTSKASLSSMTTIStslttpgpSSPPHGGTTTHPVTT 2117
Cdd:pfam05109 618 VTSPPKNATSAVTTGQhnITSSSTSSMSLRPSS---ISETLSPSTSDNSTSHMPLLT--------SAHPTGGENITQVTP 686
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 147905740 2118 ASTTRAVTETLSSRPHTKTTSElptvssvssTAGPGTQTGSTsRPNTVTQPASSPETSSTTCQ 2180
Cdd:pfam05109 687 ASTSTHHVSTSSPAPRPGTTSQ---------ASGPGNSSTST-KPGEVNVTKGTPPKNATSPQ 739
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
1680-1834 |
8.04e-05 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 46.87 E-value: 8.04e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1680 STTETATSTPQTLSSKPLP-NTSSFETWSPSGPTTSMEASEMPTFwTLRTSVSSPPTTPLTVTSVSTQHPkSLESSGSPS 1758
Cdd:pfam09595 26 KCFEHASLILIGESNKEAAlIITDIIDININKQHPEQEHHENPPL-NEAAKEAPSESEDAPDIDPNNQHP-SQDRSEAPP 103
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 147905740 1759 VVTARSTSQGYSLPPSSPHTTNSLS---ATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHpVTTASTTRAVTETLSS 1834
Cdd:pfam09595 104 LEPAAKTKPSEHEPANPPDASNRLSppdASTAAIREARTFRKPSTGKRNNPSSAQSDQSPPR-ANHEAIGRANPFAMSS 181
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
3234-3367 |
9.70e-05 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 46.49 E-value: 9.70e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3234 SVSTQHPKSlESSGSPSVVTARSTSQGHSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTSEPSSPPHGGTTTHPVTA 3313
Cdd:pfam09595 54 NINKQHPEQ-EHHENPPLNEAAKEAPSESEDAPDIDPNNQHPSQDRSEAPPLEPAAKTKPSEHEPANPPDASNRLSPPDA 132
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 147905740 3314 asTTRAVTETLSSRphTKTTSELPTVPSASSTAGPGTQTGSTSRPSTVTQPASS 3367
Cdd:pfam09595 133 --STAAIREARTFR--KPSTGKRNNPSSAQSDQSPPRANHEAIGRANPFAMSST 182
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
1989-2182 |
1.09e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 48.91 E-value: 1.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1989 SSKPLPNTSSietwSPSGPTTS--MEASEMPTFWTLrtSVSSPPTTPLTVTSVSTQHPkSLESSGSPSVVTARSTSPGHS 2066
Cdd:PHA03378 595 TPWPVPHPSQ----TPEPPTTQshIPETSAPRQWPM--PLRPIPMRPLRMQPITFNVL-VFPTPHQPPQVEITPYKPTWT 667
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2067 QPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTTR----AVTETLSSRPHTKTTSELPt 2142
Cdd:PHA03378 668 QIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRarppAAAPGRARPPAAAPGRARP- 746
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 147905740 2143 vssvsSTAGPGTQTGSTSRPNTVTQPASSPETSSTTCQPQ 2182
Cdd:PHA03378 747 -----PAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQ 781
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
2303-2496 |
1.09e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 48.91 E-value: 1.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2303 SSKPLPNTSSietwSPSGPTTS--MEASEMPTFWTLrtSVSSPPTTPLTVTSVSTQHPkSLESSGSPSVVTARSTSPGHS 2380
Cdd:PHA03378 595 TPWPVPHPSQ----TPEPPTTQshIPETSAPRQWPM--PLRPIPMRPLRMQPITFNVL-VFPTPHQPPQVEITPYKPTWT 667
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2381 QPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTTR----AVTETLSSRPHTKTTSELPt 2456
Cdd:PHA03378 668 QIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRarppAAAPGRARPPAAAPGRARP- 746
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 147905740 2457 vssvsSTAGPGTQTGSTSRPNTVTQPASSPETSSTTCQPQ 2496
Cdd:PHA03378 747 -----PAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQ 781
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2881-3345 |
1.11e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 48.76 E-value: 1.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2881 WTASSNVPSTAKPTT---STGRQFPRTKTTLWNS----LPSSTPTMATRSIHPI----STTMTTKNPqTLVTSDISKSNS 2949
Cdd:pfam05109 366 WTLTSGTPSGCENISgafASNRTFDITVSGLGTApktlIITRTATNATTTTHKVifskAPESTTTSP-TLNTTGFAAPNT 444
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2950 M--LPSRPHI----TASSVTSTSGSQISKSYSSTSLTTAGPS------SPPHGGTTTHPVTTASTTRAVTEtlssrPHTK 3017
Cdd:pfam05109 445 TtgLPSSTHVptnlTAPASTGPTVSTADVTSPTPAGTTSGASpvtpspSPRDNGTESKAPDMTSPTSAVTT-----PTPN 519
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3018 TTSELPTVSSVSSTAGPGTqTGSTSRPSIVTQPV---SSPETSSTTCQPQCQWTRwINVDYPSSSInggdieTYENIRAN 3094
Cdd:pfam05109 520 ATSPTPAVTTPTPNATSPT-LGKTSPTSAVTTPTpnaTSPTPAVTTPTPNATIPT-LGKTSPTSAV------TTPTPNAT 591
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3095 GEKICEKPQDIRCMAQNYPGVNvdelgqkvkcdvnSGLVCYNKDQGGTfkmcynylikvlcCSVSHCQGSTTPEGSVSTT 3174
Cdd:pfam05109 592 SPTVGETSPQANTTNHTLGGTS-------------STPVVTSPPKNAT-------------SAVTTGQHNITSSSTSSMS 645
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3175 ETASSIPQTLSSKPLPNTSS----FETWSPSGPTTSMEASEMPTFwTLRTSVSSPPTTPLTVTSVSTqhPKSLESSGSPS 3250
Cdd:pfam05109 646 LRPSSISETLSPSTSDNSTShmplLTSAHPTGGENITQVTPASTS-THHVSTSSPAPRPGTTSQASG--PGNSSTSTKPG 722
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3251 VVT-ARSTSQGHSLPPSSPH-------TTNSLSATTSKASLSSMTTISTSLTTSEPSSpPHGGTTTHPVTAASTTRAVTE 3322
Cdd:pfam05109 723 EVNvTKGTPPKNATSPQAPSgqktavpTVTSTGGKANSTTGGKHTTGHGARTSTEPTT-DYGGDSTTPRTRYNATTYLPP 801
|
490 500
....*....|....*....|...
gi 147905740 3323 TLSSRPHTKTTSELPTVPSASST 3345
Cdd:pfam05109 802 STSSKLRPRWTFTSPPVTTAQAT 824
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
3163-3374 |
1.18e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 48.42 E-value: 1.18e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3163 GSTTPEGSVSTTETASSIPQTLSSKPLPNTSSfeTWSPSGPTTSMEASEMPTfwtlRTSVSSPPTTPLTVTSVSTQHPKS 3242
Cdd:pfam17823 44 GDAVPRADNKSSEQ*NFCAATAAPAPVTLTKG--TSAAHLNSTEVTAEHTPH----GTDLSEPATREGAADGAASRALAA 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3243 LESSGSPSVvtARSTSQGHSLPPSSPHTTNSLSATTSKASLSSMTTISTslttsepSSPPHGGTTTHPVTAASTTRAVTE 3322
Cdd:pfam17823 118 AASSSPSSA--AQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAA-------ASAPHAASPAPRTAASSTTAASST 188
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 147905740 3323 TL-SSRPHTKTTSELPTVPSASSTAGPGTQTGSTSRPSTVTQ-PASSPETSSTT 3374
Cdd:pfam17823 189 TAaSSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAvGNSSPAAGTVT 242
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
3163-3372 |
1.37e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.63 E-value: 1.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3163 GSTTPEGSvSTTETASSIPQTLSSKPLPNTSSFETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKS 3242
Cdd:PHA03307 104 GSPTPPGP-SSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEET 182
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3243 LESSGSPSvVTARSTSQGHSLPPSSPHTTNSLSATTS-------KASLSSMTTISTSLTTSEPSSPPHGGTTTHPVTAAS 3315
Cdd:PHA03307 183 ARAPSSPP-AEPPPSTPPAAASPRPPRRSSPISASASspapapgRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPA 261
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 147905740 3316 TTRAVTETLSSRPHTKTTSELPTVPSASSTAGPG--TQTGSTSRPSTVTQPASSPETSS 3372
Cdd:PHA03307 262 PITLPTRIWEASGWNGPSSRPGPASSSSSPRERSpsPSPSSPGSGPAPSSPRASSSSSS 320
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1989-2182 |
1.58e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.24 E-value: 1.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1989 SSKPLPNTSSIETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVS------TQHPKSLESSGSPSVVTARSTS 2062
Cdd:PHA03307 74 GPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASpppspaPDLSEMLRPVGSPGPPPAASPP 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2063 PGHSQPPSSPYATNS--LSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSEL 2140
Cdd:PHA03307 154 AAGASPAAVASDAASsrQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAG 233
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 147905740 2141 PTVSSVSSTAGPGTQTGSTS-----RPNTVTQP-----ASSPETSSTTCQPQ 2182
Cdd:PHA03307 234 ASSSDSSSSESSGCGWGPENecplpRPAPITLPtriweASGWNGPSSRPGPA 285
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
2303-2496 |
1.58e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.24 E-value: 1.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2303 SSKPLPNTSSIETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVS------TQHPKSLESSGSPSVVTARSTS 2376
Cdd:PHA03307 74 GPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASpppspaPDLSEMLRPVGSPGPPPAASPP 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2377 PGHSQPPSSPYATNS--LSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSEL 2454
Cdd:PHA03307 154 AAGASPAAVASDAASsrQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAG 233
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 147905740 2455 PTVSSVSSTAGPGTQTGSTS-----RPNTVTQP-----ASSPETSSTTCQPQ 2496
Cdd:PHA03307 234 ASSSDSSSSESSGCGWGPENecplpRPAPITLPtriweASGWNGPSSRPGPA 285
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
1455-1870 |
1.72e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 47.65 E-value: 1.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1455 TSRRPSSPTLTTHTASSESQSWTREEKKTSVDKSYSSRQTVPFTSNSRLTSTSMSSASSLPLSNPTWTTTLKATDENTML 1534
Cdd:pfam17823 63 ATAAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAF 142
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1535 SSRTTAVftSQTRSTSRPSTVTQPISSPETSSTTCQPQCQWTKwidvdYPSSSINGGDIETYENIRANGEKICEKPQDIR 1614
Cdd:pfam17823 143 SAPRAAA--CRANASAAPRAAIAAASAPHAASPAPRTAASSTT-----AASSTTAASSAPTTAASSAPATLTPARGISTA 215
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1615 CMAQNHPGVNVdelgqkvkcdVSSGLVCYNKDQGGTFQMCYNYLIKVLCCSVSHCQGSTTPEGSVSTTETATSTPQTLSS 1694
Cdd:pfam17823 216 ATATGHPAAGT----------ALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKH 285
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1695 KPlPNTSSFETWSPSGPTTSMEASEMPTFWTLrTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARSTSQGYSLPP- 1773
Cdd:pfam17823 286 MP-SDTMARNPAAPMGAQAQGPIIQVSTDQPV-HNTAGEPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPv 363
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1774 SSPHTTNSLSATtskaslssmTTISTSLTTPGPSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSASST 1853
Cdd:pfam17823 364 PVLHTSMIPEVE---------ATSPTTQPSPLLPTQGAAGPGILLAPEQVATEATAGTASAGPTPRSSGDPKTLAMASCQ 434
|
410
....*....|....*..
gi 147905740 1854 AGPGTQTFSTTCQPQCQ 1870
Cdd:pfam17823 435 LSTQGQYLVVTTDPLTP 451
|
|
| SOG2 |
pfam10428 |
RAM signalling pathway protein; SOG2 proteins in Saccharomyces cerevisiae are involved in cell ... |
2862-3045 |
1.75e-04 |
|
RAM signalling pathway protein; SOG2 proteins in Saccharomyces cerevisiae are involved in cell separation and cytokinesis.
Pssm-ID: 431280 Cd Length: 476 Bit Score: 47.79 E-value: 1.75e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2862 TSLLPTSEEPHSGKPSTSSWTASSNVPSTAKPTTSTGRQFPRTKTTLWnslpSSTPTMATRSIHPISTTMT-----TKNP 2936
Cdd:pfam10428 150 ASLGPLLEAVRPPSPKKRAGRTKQPSPSITSGGSPSSPAESSTRPSSS----SVTPTRRRRHAGSFSSKLPplrsdTTIP 225
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2937 QTLVTSDISKSNSMLPSRPHITASSVTSTSGSQISKSYSSTSLTTAGPSSPPHGGTTTHPVTTASTTRAVTETLSSRPHT 3016
Cdd:pfam10428 226 HPGGNLSSPAPNGAQTPTPPRSATSPGVPSSAPTLGTGSTGAISRSNHSTSGSQSSLTSSSRSRSSSRSNTLLSTSGPSS 305
|
170 180
....*....|....*....|....*....
gi 147905740 3017 KTTselpTVSSVSSTAGPGTQTGSTSRPS 3045
Cdd:pfam10428 306 LAT----TPRPSSGESFAPTSTGSRINPL 330
|
|
| PRK08581 |
PRK08581 |
amidase domain-containing protein; |
2335-2512 |
2.04e-04 |
|
amidase domain-containing protein;
Pssm-ID: 236304 [Multi-domain] Cd Length: 619 Bit Score: 47.48 E-value: 2.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2335 TLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARST-----SPGHSQPPSSPYATNSLSATTSKASLSSMTTIST 2409
Cdd:PRK08581 55 TSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIyknlpQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDIS 134
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2410 SLTTPGPSSpphGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPtvSSVSSTAGPGTQTGSTSRPNTVTQPASSPETS 2489
Cdd:PRK08581 135 DYEQPRNSE---KSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAP--SSNNTKPSTSNKQPNSPKPTQPNQSNSQPASD 209
|
170 180
....*....|....*....|...
gi 147905740 2490 STTCQPQCQWTKWIDVDYPSSSI 2512
Cdd:PRK08581 210 DTANQKSSSKDNQSMSDSALDSI 232
|
|
| PRK08581 |
PRK08581 |
amidase domain-containing protein; |
2021-2198 |
2.04e-04 |
|
amidase domain-containing protein;
Pssm-ID: 236304 [Multi-domain] Cd Length: 619 Bit Score: 47.48 E-value: 2.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2021 TLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARST-----SPGHSQPPSSPYATNSLSATTSKASLSSMTTIST 2095
Cdd:PRK08581 55 TSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIyknlpQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDIS 134
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2096 SLTTPGPSSpphGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPtvSSVSSTAGPGTQTGSTSRPNTVTQPASSPETS 2175
Cdd:PRK08581 135 DYEQPRNSE---KSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAP--SSNNTKPSTSNKQPNSPKPTQPNQSNSQPASD 209
|
170 180
....*....|....*....|...
gi 147905740 2176 STTCQPQCQWTKWIDVDYPSSSI 2198
Cdd:PRK08581 210 DTANQKSSSKDNQSMSDSALDSI 232
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
2819-3083 |
2.07e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 47.65 E-value: 2.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2819 PSTSFTTQSRPSTPRTEKIVSIA------TARTTGSFPVSSLSTFPASPTSLLPTSEEPHSGKPSTSSWTASSNV----P 2888
Cdd:pfam17823 145 PRAAACRANASAAPRAAIAAASAphaaspAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATghpaA 224
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2889 STAKPTTSTGRQFPRTKTTLWNSLPSSTPTMATRSIHPISTTMTTKNPQTLVTSDISKSNSMlPSrpHITASSVTSTSGS 2968
Cdd:pfam17823 225 GTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHM-PS--DTMARNPAAPMGA 301
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2969 QISKSYSSTSL-----TTAGPSSPPHGGTTTHPVT---TASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTQTGS 3040
Cdd:pfam17823 302 QAQGPIIQVSTdqpvhNTAGEPTPSPSNTTLEPNTpksVASTNLAVVTTTKAQAKEPSASPVPVLHTSMIPEVEATSPTT 381
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 147905740 3041 TSRPSIVTQPVSSPETSSTTCQPQCQWTRWINVDYPSSSINGG 3083
Cdd:pfam17823 382 QPSPLLPTQGAAGPGILLAPEQVATEATAGTASAGPTPRSSGD 424
|
|
| VWC_out |
smart00215 |
von Willebrand factor (vWF) type C domain; |
391-436 |
2.27e-04 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214565 Cd Length: 67 Bit Score: 42.16 E-value: 2.27e-04
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 147905740 391 CTHGGRIYAPGESFNTSCRSCTCFGGLWKCKDLPCPGTCSVQGGSH 436
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCGPKPCLLHNLS 46
|
|
| Pneumo_att_G |
pfam05539 |
Pneumovirinae attachment membrane glycoprotein G; |
2271-2493 |
2.29e-04 |
|
Pneumovirinae attachment membrane glycoprotein G;
Pssm-ID: 114270 [Multi-domain] Cd Length: 408 Bit Score: 46.96 E-value: 2.29e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2271 VLCCSVSHCQGSTTPE-GSVSTTETTTATPQTLSSKPLPNTSSIETWSPSGPTTsmeasemPTFWTLRTSVSSPPTTPLT 2349
Cdd:pfam05539 133 ALCESGPICQRDYNPRdRPKCRCTLRGKDVSCCKEPKTAVTTSKTTSWPTEVSH-------PTYPSQVTPQSQPATQGHQ 205
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2350 vTSVSTQHPKSLESSGSPSVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPV 2429
Cdd:pfam05539 206 -TATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPH 284
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 147905740 2430 TTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTQTGSTSRpNTVTQPASSPETSSTTC 2493
Cdd:pfam05539 285 STATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQ-NLVDCKELDPPKPNSIC 347
|
|
| Pneumo_att_G |
pfam05539 |
Pneumovirinae attachment membrane glycoprotein G; |
1957-2179 |
2.29e-04 |
|
Pneumovirinae attachment membrane glycoprotein G;
Pssm-ID: 114270 [Multi-domain] Cd Length: 408 Bit Score: 46.96 E-value: 2.29e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1957 VLCCSVSHCQGSTTPE-GSVSTTETTTATPQTLSSKPLPNTSSIETWSPSGPTTsmeasemPTFWTLRTSVSSPPTTPLT 2035
Cdd:pfam05539 133 ALCESGPICQRDYNPRdRPKCRCTLRGKDVSCCKEPKTAVTTSKTTSWPTEVSH-------PTYPSQVTPQSQPATQGHQ 205
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2036 vTSVSTQHPKSLESSGSPSVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPV 2115
Cdd:pfam05539 206 -TATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPH 284
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 147905740 2116 TTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGTQTGSTSRpNTVTQPASSPETSSTTC 2179
Cdd:pfam05539 285 STATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQ-NLVDCKELDPPKPNSIC 347
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3163-3357 |
2.81e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.05 E-value: 2.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3163 GSTTPEGSVSTTETASSIPQTLSSKPLPNTSSFETWSPSGPTTSMEASEMPTFWTLRTSVS--SPPTTPLTVTSVSTQHP 3240
Cdd:COG3469 21 TLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATatAAAAAATSTSATLVATS 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3241 KSLESSGSPSVVTARSTSQGHSLPPSSPHTTNSLSATTSKASLSSMTTISTSLTTSEPSSPPHGGT-TTHPVTAASTTRA 3319
Cdd:COG3469 101 TASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTsTTTTTTSASTTPS 180
|
170 180 190
....*....|....*....|....*....|....*...
gi 147905740 3320 VTetlSSRPHTKTTSELPTVPSASSTAGPGTQTGSTSR 3357
Cdd:COG3469 181 AT---TTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
3169-3279 |
2.85e-04 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 47.39 E-value: 2.85e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3169 GSVSTTETASSIPqtlsskplpntsSFETWSPSGPTTSMEASEMPTfwtlRTSVSSPPTTPLTVTSVSTQHPKSleSSGS 3248
Cdd:PLN02217 567 GSTNSTPTGSAAS------------SNTTFSSDSPSTVVAPSTSPP----AGHLGSPPATPSKIVSPSTSPPAS--HLGS 628
|
90 100 110
....*....|....*....|....*....|...
gi 147905740 3249 PSVVTA--RSTSQGHSLPPSSPHTTNSLSATTS 3279
Cdd:PLN02217 629 PSTTPSspESSIKVASTETASPESSIKVASTES 661
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2797-3031 |
3.65e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 47.24 E-value: 3.65e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2797 APGFPSTVSTSTTSKLPSEePIPSTSFTTQSRPSTPRTEKIVSIATARTTGSFPVSSLSTFPASPTSLLPTSEEPHSGKP 2876
Cdd:PHA03247 2768 APAPPAAPAAGPPRRLTRP-AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPP 2846
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2877 STSSWTASSNVP---------STAKPTTSTGRQFPRTKTTLWNSLPSSTPTMAT--RSIHPISTTMTTKNPQTLVTSDIS 2945
Cdd:PHA03247 2847 PPSLPLGGSVAPggdvrrrppSRSPAAKPAAPARPPVRRLARPAVSRSTESFALppDQPERPPQPQAPPPPQPQPQPPPP 2926
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2946 KSNSMLPSRPHITASSVTSTSGSQisksysSTSLTTAGPSSPPHG----GTTTHPVTTASTTRAVTETLSSRPHTKTTSE 3021
Cdd:PHA03247 2927 PQPQPPPPPPPRPQPPLAPTTDPA------GAGEPSGAVPQPWLGalvpGRVAVPRFRVPQPAPSREAPASSTPPLTGHS 3000
|
250
....*....|
gi 147905740 3022 LPTVSSVSST 3031
Cdd:PHA03247 3001 LSRVSSWASS 3010
|
|
| Not5 |
COG5665 |
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; |
1997-2187 |
3.81e-04 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 46.96 E-value: 3.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1997 SSIETWSPSGPTTSMEASEMPTFWTlrTSVSSPPTTPLTVTSVSTQHPKSLESS-------GSPSVVTARSTSPGHSQPP 2069
Cdd:COG5665 240 PSLLATPPATPATEEKSSQQPKSQP--TSPSGGTTPPSTNQLTTSNTPTSTAKAqpqpptkKQPAKEPPSDTASGNPSAP 317
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2070 SSPYATNSLSATTSKASLSSMTTISTSLTTPGP-SSPPHGGTTTHpvttastTRAVTETLSSRPHTKTT-SELPTVSSVS 2147
Cdd:COG5665 318 SVLINSDSPTSEDPATASVPTTEETTAFTTPSSvPSTPAEKDTPA-------TDLATPVSPTPPETSVDkKVSPDSATSS 390
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 147905740 2148 STAGPGTQTGSTSRPNTVTQPASSPETSSTTCQPQCQWTK 2187
Cdd:COG5665 391 TKSEKEGGTASSPMPPNIAIGAKDDVDATDPSQEAKEYTK 430
|
|
| Not5 |
COG5665 |
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; |
2311-2501 |
3.81e-04 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 46.96 E-value: 3.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2311 SSIETWSPSGPTTSMEASEMPTFWTlrTSVSSPPTTPLTVTSVSTQHPKSLESS-------GSPSVVTARSTSPGHSQPP 2383
Cdd:COG5665 240 PSLLATPPATPATEEKSSQQPKSQP--TSPSGGTTPPSTNQLTTSNTPTSTAKAqpqpptkKQPAKEPPSDTASGNPSAP 317
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2384 SSPYATNSLSATTSKASLSSMTTISTSLTTPGP-SSPPHGGTTTHpvttastTRAVTETLSSRPHTKTT-SELPTVSSVS 2461
Cdd:COG5665 318 SVLINSDSPTSEDPATASVPTTEETTAFTTPSSvPSTPAEKDTPA-------TDLATPVSPTPPETSVDkKVSPDSATSS 390
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 147905740 2462 STAGPGTQTGSTSRPNTVTQPASSPETSSTTCQPQCQWTK 2501
Cdd:COG5665 391 TKSEKEGGTASSPMPPNIAIGAKDDVDATDPSQEAKEYTK 430
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2008-2498 |
3.86e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.83 E-value: 3.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2008 TTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTARSTS--PGHSQPPSSPYATNSLSATTSKA 2085
Cdd:pfam05109 392 TVSGLGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFAAPNTTTglPSSTHVPTNLTAPASTGPTVSTA 471
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2086 SLSSMT----TISTSLTTPGPSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSElPTVSSVSSTAGpgtQTGSTSR 2161
Cdd:pfam05109 472 DVTSPTpagtTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTT-PTPNATSPTLG---KTSPTSA 547
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2162 PNTVTQPASSPETSSTTCQPQCQwTKWIDVDYPSSSInggdieTYENIRANGEKICEkpqdircmaqNHPGVNVDelgqk 2241
Cdd:pfam05109 548 VTTPTPNATSPTPAVTTPTPNAT-IPTLGKTSPTSAV------TTPTPNATSPTVGE----------TSPQANTT----- 605
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2242 vkcdvssglvcyNKDQGGTfqmcynylikvlccsvSHCQGSTTPEGSVSTTETTTA---TPQTLSSKPLPNTSSIETWSP 2318
Cdd:pfam05109 606 ------------NHTLGGT----------------SSTPVVTSPPKNATSAVTTGQhniTSSSTSSMSLRPSSISETLSP 657
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2319 SgpTTSMEASEMPTFwTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVvTARSTSPGHSQPPSSPYATNSLSATTSK 2398
Cdd:pfam05109 658 S--TSDNSTSHMPLL-TSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGT-TSQASGPGNSSTSTKPGEVNVTKGTPPK 733
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2399 ASLSSMTtistslttpgpsspPHGGTTTHPVTTASTTRAVTET--LSSRPHTKTTSELPTVSSVSSTAGPGTQ------- 2469
Cdd:pfam05109 734 NATSPQA--------------PSGQKTAVPTVTSTGGKANSTTggKHTTGHGARTSTEPTTDYGGDSTTPRTRynattyl 799
|
490 500 510
....*....|....*....|....*....|...
gi 147905740 2470 ---TGSTSRPN-TVTQPASSPETSSTTCQPQCQ 2498
Cdd:pfam05109 800 ppsTSSKLRPRwTFTSPPVTTAQATVPVPPTSQ 832
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2835-3062 |
3.96e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.83 E-value: 3.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2835 EKIVSIATARTTGSFPVSSLSTFP-------ASPTSLLPTSEEPHSGKPSTSSWTASSNVPSTAKPTTSTGRQFPRTKTT 2907
Cdd:pfam05109 377 ENISGAFASNRTFDITVSGLGTAPktliitrTATNATTTTHKVIFSKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPT 456
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2908 LWNSLPSSTPTMATRSIHPISTTMTTKNPQTLVTSDISKSNSMLPSRPHITA--SSVTSTSGSQISKSYSSTSLT----- 2980
Cdd:pfam05109 457 NLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSptSAVTTPTPNATSPTPAVTTPTpnats 536
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2981 -TAGPSSPPHGGTTTHPVTT----ASTTRAVTETLSSRPHTKTTSEL---------PTVSSVSSTAGPGTQT-GSTSRPS 3045
Cdd:pfam05109 537 pTLGKTSPTSAVTTPTPNATsptpAVTTPTPNATIPTLGKTSPTSAVttptpnatsPTVGETSPQANTTNHTlGGTSSTP 616
|
250
....*....|....*..
gi 147905740 3046 IVTQPVSSPETSSTTCQ 3062
Cdd:pfam05109 617 VVTSPPKNATSAVTTGQ 633
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
2811-3066 |
4.61e-04 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 46.46 E-value: 4.61e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2811 KLPSEEPIPSTSFTTQSRPSTPrTEKIVSIATARTTGSFPVS--SLSTFPASPTSLL-----PTSEEPhsgKPSTSSWTA 2883
Cdd:PLN03209 322 KIPSQRVPPKESDAADGPKPVP-TKPVTPEAPSPPIEEEPPQpkAVVPRPLSPYTAYedlkpPTSPIP---TPPSSSPAS 397
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2884 SSNVPSTAKPTTSTGRQFPRTKTTLWNSLPSSTPTMATRSIHPISTTMTTKNPqtlvtsdisksnsmlpsrphitaSSVT 2963
Cdd:PLN03209 398 SKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYARYEDLKPP-----------------------TSPS 454
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2964 STSGSQISKSYSSTSLTTAGPSSPPhggttthpvTTASTTRAVTETLSSRPHTKTTSELPTVSSVSST-AGPGTQTGSTS 3042
Cdd:PLN03209 455 PTAPTGVSPSVSSTSSVPAVPDTAP---------ATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSpAAPVGKVAPSS 525
|
250 260
....*....|....*....|....
gi 147905740 3043 RPSIVTQPVSSPETSSTTCQPQCQ 3066
Cdd:PLN03209 526 TNEVVKVGNSAPPTALADEQHHAQ 549
|
|
| Cys_knot |
pfam00007 |
Cystine-knot domain; The family comprises glycoprotein hormones and the C-terminal domain of ... |
4710-4787 |
4.72e-04 |
|
Cystine-knot domain; The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerization.
Pssm-ID: 394966 Cd Length: 105 Bit Score: 42.39 E-value: 4.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 4710 VNATV-LRYKGCET--EVNITFCEGSCSGISKYSMEAQAMERQCTCCQESKVHDVAVTMQCPDGTVIQHTYTHIDECNCA 4786
Cdd:pfam00007 7 TNYTIsVEKEGCTSckTINTTICAGYCYTRDPVYKDGRRAVSQRVCTYRDVTYETVVLPGCPPGVDPTVTYPVALSCHCG 86
|
.
gi 147905740 4787 P 4787
Cdd:pfam00007 87 N 87
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
1736-1868 |
5.32e-04 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 44.89 E-value: 5.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1736 TPLTVTSVSTQHPKSLESSGSPSVVTARSTSQGYSLPPSSPHTTNSLSATTSkASLSSMTTISTSLTTPGPSSPPHGGTT 1815
Cdd:PHA03255 20 TSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTT-AILSTNTTTVTSTGTTVTPVPTTSNAS 98
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 147905740 1816 THPVTTASTTRAVTETLSSRPHTKTTSELPTVSSASSTAGPGTQTFSTTCQPQ 1868
Cdd:PHA03255 99 TINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPT 151
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
3162-3372 |
6.24e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.47 E-value: 6.24e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3162 QGSTTPEGSVSTTETASSIPQTLSSKPLPNTSSFE----TWSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVST 3237
Cdd:PHA03247 2675 QASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEpaphALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP 2754
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3238 QHPKSLESSGSPSVVTarstsqghslPPSSPHTTNSlSATTSKASLSSMTTISTSLTTSEPSSPPhggttthpvtAASTT 3317
Cdd:PHA03247 2755 ARPARPPTTAGPPAPA----------PPAAPAAGPP-RRLTRPAVASLSESRESLPSPWDPADPP----------AAVLA 2813
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 3318 RAVTETLSSRPHT---KTTSELPTVPSASSTAGPGTQT--------GSTSRPSTVTQPASSPETSS 3372
Cdd:PHA03247 2814 PAAALPPAASPAGplpPPTSAQPTAPPPPPGPPPPSLPlggsvapgGDVRRRPPSRSPAAKPAAPA 2879
|
|
| VWC_out |
smart00215 |
von Willebrand factor (vWF) type C domain; |
861-918 |
7.58e-04 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214565 Cd Length: 67 Bit Score: 40.62 E-value: 7.58e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 147905740 861 CIHNEVTYRPGEIIRVDCNNCTCKNRRWECTNKPCMGACVAYGDGHFVTFDGERYIFE 918
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCGPKPCLLHNLSGECPLGQGCVPS 58
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2302-2512 |
7.89e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.08 E-value: 7.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2302 LSSKPLPNTSSIETWSPSGPTTSMEASEMPTfwtlrtSVSSPPTTPLTVTSVSTQHP----KSLESSGSPSVVTARSTSP 2377
Cdd:PHA03247 2795 RESLPSPWDPADPPAAVLAPAAALPPAASPA------GPLPPPTSAQPTAPPPPPGPpppsLPLGGSVAPGGDVRRRPPS 2868
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2378 GhsQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPtv 2457
Cdd:PHA03247 2869 R--SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLA-- 2944
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2458 sSVSSTAGPGTQTGSTSRP-------------------NTVTQPASSPETSSTTCQPQCQWTKW-------IDVDYPSSS 2511
Cdd:PHA03247 2945 -PTTDPAGAGEPSGAVPQPwlgalvpgrvavprfrvpqPAPSREAPASSTPPLTGHSLSRVSSWasslalhEETDPPPVS 3023
|
.
gi 147905740 2512 I 2512
Cdd:PHA03247 3024 L 3024
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1988-2198 |
7.89e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.08 E-value: 7.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1988 LSSKPLPNTSSIETWSPSGPTTSMEASEMPTfwtlrtSVSSPPTTPLTVTSVSTQHP----KSLESSGSPSVVTARSTSP 2063
Cdd:PHA03247 2795 RESLPSPWDPADPPAAVLAPAAALPPAASPA------GPLPPPTSAQPTAPPPPPGPpppsLPLGGSVAPGGDVRRRPPS 2868
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2064 GhsQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPtv 2143
Cdd:PHA03247 2869 R--SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLA-- 2944
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2144 sSVSSTAGPGTQTGSTSRP-------------------NTVTQPASSPETSSTTCQPQCQWTKW-------IDVDYPSSS 2197
Cdd:PHA03247 2945 -PTTDPAGAGEPSGAVPQPwlgalvpgrvavprfrvpqPAPSREAPASSTPPLTGHSLSRVSSWasslalhEETDPPPVS 3023
|
.
gi 147905740 2198 I 2198
Cdd:PHA03247 3024 L 3024
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1671-1857 |
9.62e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 45.93 E-value: 9.62e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1671 GSTTPEGSvSTTETATSTPQTLSSKPLPNTSSFETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKS 1750
Cdd:PHA03307 104 GSPTPPGP-SSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEET 182
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1751 LESSGSPS----VVTARSTSQGYSLPPSSP--HTTNSLSATTSKASLSSMTTISTSLTTPGPSSPPHGGTTTHPVTTAST 1824
Cdd:PHA03307 183 ARAPSSPPaeppPSTPPAAASPRPPRRSSPisASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAP 262
|
170 180 190
....*....|....*....|....*....|...
gi 147905740 1825 TRAVTETLSSRPHTKTTSELPTVSSASSTAGPG 1857
Cdd:PHA03307 263 ITLPTRIWEASGWNGPSSRPGPASSSSSPRERS 295
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
2958-3060 |
1.00e-03 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 44.12 E-value: 1.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2958 TASSVTSTSGSQISKSYSSTSLTTAGP-SSPPHGGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSVSSTAGPGT 3036
Cdd:PHA03255 41 TAVTTPSPSASGPSTNQSTTLTTTSAPiTTTAILSTNTTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGT 120
|
90 100
....*....|....*....|....
gi 147905740 3037 QTGSTSrpSIVTQpvSSPETSSTT 3060
Cdd:PHA03255 121 STGVTS--NVTTR--SSSTTSATT 140
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
3198-3340 |
1.12e-03 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 44.12 E-value: 1.12e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3198 WSPSGPTT-SMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTArsTSQGHSLPPSSPHTTNSLSA 3276
Cdd:PHA03255 24 WTSSGSSTaSAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTV--TSTGTTVTPVPTTSNASTIN 101
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 3277 TTSKASLSSMTTISTSLTTSEPSspphggTTTHPVTAASTTRAVTE-----TLSSRPH-------TKTTSELPTVP 3340
Cdd:PHA03255 102 VTTKVTAQNITATEAGTGTSTGV------TSNVTTRSSSTTSATTRitnatTLAPTLSskgtsnaTKTTAELPTVP 171
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
2862-3025 |
1.40e-03 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 43.74 E-value: 1.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2862 TSLLPTSEEPH-------SGKPSTSSWTASSNVPSTAKPTTSTGRQFPRTkTTLWNSLPSSTPTMATRSIHPISTTMTTK 2934
Cdd:PHA03255 20 TSLIWTSSGSStasagnvTGTTAVTTPSPSASGPSTNQSTTLTTTSAPIT-TTAILSTNTTTVTSTGTTVTPVPTTSNAS 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2935 NPQTlvTSDISKSNSMLPSRPHITASSVTSTSGSQISKSYSSTSLTTagpsspphGGTTTHPVTTASTTRAVTEtlssrp 3014
Cdd:PHA03255 99 TINV--TTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRIT--------NATTLAPTLSSKGTSNATK------ 162
|
170
....*....|.
gi 147905740 3015 htkTTSELPTV 3025
Cdd:PHA03255 163 ---TTAELPTV 170
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
2028-2125 |
1.43e-03 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 45.08 E-value: 1.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2028 SPPTTPLTVTSvSTQHPKSLESSGSPSVVTARSTSP--GH-SQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSS 2104
Cdd:PLN02217 565 NPGSTNSTPTG-SAASSNTTFSSDSPSTVVAPSTSPpaGHlGSPPATPSKIVSPSTSPPASHLGSPSTTPSSPESSIKVA 643
|
90 100
....*....|....*....|.
gi 147905740 2105 PPHGGTTTHPVTTASTTRAVT 2125
Cdd:PLN02217 644 STETASPESSIKVASTESSVS 664
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
2342-2439 |
1.43e-03 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 45.08 E-value: 1.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2342 SPPTTPLTVTSvSTQHPKSLESSGSPSVVTARSTSP--GH-SQPPSSPYATNSLSATTSKASLSSMTTISTSLTTPGPSS 2418
Cdd:PLN02217 565 NPGSTNSTPTG-SAASSNTTFSSDSPSTVVAPSTSPpaGHlGSPPATPSKIVSPSTSPPASHLGSPSTTPSSPESSIKVA 643
|
90 100
....*....|....*....|.
gi 147905740 2419 PPHGGTTTHPVTTASTTRAVT 2439
Cdd:PLN02217 644 STETASPESSIKVASTESSVS 664
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1967-2131 |
1.60e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.74 E-value: 1.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1967 GSTTPEGSVSTTETTTATPQTLSSKPLPNTSSIETWSPSGPTTSmeaSEMPTFWTLRTSVSSPPTTPLTVTSVSTQhpkS 2046
Cdd:COG3469 61 TGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASG---ANTGTSTVTTTSTGAGSVTSTTSSTAGST---T 134
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2047 LESSGSPSVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSmttiSTSLTTPGPSSPPHGGTTTHPVTTASTTRAVTE 2126
Cdd:COG3469 135 TSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSAST----TPSATTTATATTASGATTPSATTTATTTGPPTP 210
|
....*
gi 147905740 2127 TLSSR 2131
Cdd:COG3469 211 GLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2281-2445 |
1.60e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.74 E-value: 1.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2281 GSTTPEGSVSTTETTTATPQTLSSKPLPNTSSIETWSPSGPTTSmeaSEMPTFWTLRTSVSSPPTTPLTVTSVSTQhpkS 2360
Cdd:COG3469 61 TGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASG---ANTGTSTVTTTSTGAGSVTSTTSSTAGST---T 134
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2361 LESSGSPSVVTARSTSPGHSQPPSSPYATNSLSATTSKASLSSmttiSTSLTTPGPSSPPHGGTTTHPVTTASTTRAVTE 2440
Cdd:COG3469 135 TSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSAST----TPSATTTATATTASGATTPSATTTATTTGPPTP 210
|
....*
gi 147905740 2441 TLSSR 2445
Cdd:COG3469 211 GLPKH 215
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
1672-1882 |
1.69e-03 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 44.53 E-value: 1.69e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1672 STTPEGSVSTTETATSTPQTLSSKPLPNTSSFETWSPSG-----PTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQ 1746
Cdd:PLN03209 346 PVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYEDLKPPTspiptPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEV 425
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1747 HPKSLESSgspsvvTARSTSQgYSL-----PPSSPHTTNSLSATTSKASLSSMT--------TISTSLTTPGPSSP-PHG 1812
Cdd:PLN03209 426 EPAQVEAK------KTRPLSP-YARyedlkPPTSPSPTAPTGVSPSVSSTSSVPavpdtapaTAATDAAAPPPANMrPLS 498
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 147905740 1813 GTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSASSTAGPGTQTFSTTCQ-PQCQWTKWIDVDYPSS 1882
Cdd:PLN03209 499 PYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPrPLSPYTMYEDLKPPTS 569
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1989-2176 |
1.73e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 44.78 E-value: 1.73e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1989 SSKPLPNTSSIETWSPsGPTTSMEASEMPTfwTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTarSTSPGHSQP 2068
Cdd:PHA03307 111 PSSPDPPPPTPPPASP-PPSPAPDLSEMLR--PVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPL--SSPEETARA 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2069 PSSPYATNSLSATTSKASLSSM---TTISTSLTTPGPSSPPH----------------------GGTTTHPVTTASTTRA 2123
Cdd:PHA03307 186 PSSPPAEPPPSTPPAAASPRPPrrsSPISASASSPAPAPGRSaaddagasssdssssessgcgwGPENECPLPRPAPITL 265
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 147905740 2124 VTETLSSRPHTKTTSELPTVSSVSSTAGPG--TQTGSTSRPNTVTQPASSPETSS 2176
Cdd:PHA03307 266 PTRIWEASGWNGPSSRPGPASSSSSPRERSpsPSPSSPGSGPAPSSPRASSSSSS 320
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
2303-2490 |
1.73e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 44.78 E-value: 1.73e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2303 SSKPLPNTSSIETWSPsGPTTSMEASEMPTfwTLRTSVSSPPTTPLTVTSVSTQHPKSLESSGSPSVVTarSTSPGHSQP 2382
Cdd:PHA03307 111 PSSPDPPPPTPPPASP-PPSPAPDLSEMLR--PVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPL--SSPEETARA 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2383 PSSPYATNSLSATTSKASLSSM---TTISTSLTTPGPSSPPH----------------------GGTTTHPVTTASTTRA 2437
Cdd:PHA03307 186 PSSPPAEPPPSTPPAAASPRPPrrsSPISASASSPAPAPGRSaaddagasssdssssessgcgwGPENECPLPRPAPITL 265
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 147905740 2438 VTETLSSRPHTKTTSELPTVSSVSSTAGPG--TQTGSTSRPNTVTQPASSPETSS 2490
Cdd:PHA03307 266 PTRIWEASGWNGPSSRPGPASSSSSPRERSpsPSPSSPGSGPAPSSPRASSSSSS 320
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
2822-3060 |
2.48e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 44.22 E-value: 2.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2822 SFTTQSRPSTPRteKIVSIATARTTGSFPVSSLST------FPASPtSLLPTSEEPHSGKPSTSSWT-ASSNVPSTAKPT 2894
Cdd:TIGR00927 105 SIAMENTPSPPR--RTAKITPTTPKNNYSPTAAGTervkedTPATP-SRALNHYISTSGRQRVKSYTpKPRGEVKSSSPT 181
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2895 TSTGRQFPRTKTTL---WNSLPSSTPTMATRSiHPISTTMTTKNPQTLVTSDISKSNS---------------MLPSRP- 2955
Cdd:TIGR00927 182 QTREKVRKYTPSPLgrmVNSYAPSTFMTMPRS-HGITPRTTVKDSEITATYKMLETNPskrtagkttptplkgMTDNTPt 260
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 2956 ----HITASSVTSTSGSQISKSYSSTSLTTAGPSSPPHGGTTTHPVTTASttRAVTEtlssrpHTKTTSELPTVSSVSST 3031
Cdd:TIGR00927 261 fltrEVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLTTPQ--GTVLE------HTPATSEGQVTISIMTG 332
|
250 260
....*....|....*....|....*....
gi 147905740 3032 AGPGTQTGSTSRPSIVTqpvSSPETSSTT 3060
Cdd:TIGR00927 333 SSPAETKASTAAWKIRN---PLSRTSAPA 358
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1674-1867 |
5.06e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 43.37 E-value: 5.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1674 TPEGSVSTTETATSTPQTLSSKPlpNTSSFETWSPSGPTTSMEASEMPTFWTLRTSVSSPPTTPLTVTSVSTQHPKSLES 1753
Cdd:pfam05109 444 TTTGLPSSTHVPTNLTAPASTGP--TVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNAT 521
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 1754 SGSPSVVT----ARSTSQGYSLPPS-----SPHTTNSLSATTSKASLSSMTTI-----STSLTTPGP--SSP------PH 1811
Cdd:pfam05109 522 SPTPAVTTptpnATSPTLGKTSPTSavttpTPNATSPTPAVTTPTPNATIPTLgktspTSAVTTPTPnaTSPtvgetsPQ 601
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 147905740 1812 GGTTTHPVTTASTTRAVTETLSSRPHTKTTSELPTVSSASSTAGPGTQTFSTTCQP 1867
Cdd:pfam05109 602 ANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSP 657
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
3228-3378 |
6.82e-03 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 41.43 E-value: 6.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 147905740 3228 TPLTVTSVSTQHPKSLESSGSPSVVTARSTSQGHSLPPSSPHTTNSLSATTSKAslssmttistslttsepsspphGGTT 3307
Cdd:PHA03255 20 TSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAI----------------------LSTN 77
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 147905740 3308 THPVTAASTTRAVTETLSSRPHTKTTSELPTVPSASSTAGPGTQTGSTSR---PSTVTQPASSPETSSTTCQPQ 3378
Cdd:PHA03255 78 TTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTSNvttRSSSTTSATTRITNATTLAPT 151
|
|
|