Conserved domains on  [gi|1907182184|ref|XP_036009081|]

mucin-6 isoform X4 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
387-550 2.32e-29

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.



Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 116.73  E-value: 2.32e-29
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184   387 WTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCTYTLLQSpqLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVI 466
Cdd:smart00216    1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD--CSSEPTFSVLLKNVPCGGGATCLKSVKVELNGDEIEL 78
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184   467 SEDEV-ITNNGDTKLLPYKTHNITI-FRQTSTHLQMATTFGLELVfQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDD 544
Cdd:smart00216   79 KDDNGkVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGLIQV-TFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDD 157

                    ....*.
gi 1907182184   545 FTTSMG 550
Cdd:smart00216  158 FRTPDG 163
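Each hit in this listing carries a one-line summary with the Pssm-ID, Cd Length, Bit Score and E-value. The following is a minimal sketch of how that line could be parsed into a record, assuming it always keeps the labelled layout seen here. The function name `parse_summary` and the dictionary keys are illustrative choices, not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The bracketed tag (e.g. "[Multi-domain]") is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one summary line into a dict (hypothetical helper)."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError("not a recognised summary line: %r" % line)
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("tag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


line = ("Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  "
        "Bit Score: 116.73  E-value: 2.32e-29")
hit = parse_summary(line)
print(hit)
```

Keeping the E-value as a float makes it straightforward to sort or filter hits by significance once every summary line has been parsed.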
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
1058-1130 3.53e-28

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.



Pssm-ID: 214843  Cd Length: 76  Bit Score: 110.12  E-value: 3.53e-28
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184  1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130
Cdd:smart00832    3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
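The pairwise alignments in this listing can be summarised numerically. The sketch below computes the fraction of identical residues over aligned columns for the C8 hit above. It assumes the usual reading of this display, namely that `-` marks a gap and lowercase letters mark residues outside the aligned region, so only columns with uppercase letters in both rows are counted. The function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query, subject):
    """Fraction of identical residues over columns aligned in both rows.

    Assumption: uppercase letters are aligned residues; gaps ('-') and
    lowercase (unaligned) residues are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same number of columns")
    aligned = 0
    matches = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                matches += 1
    return matches / aligned if aligned else 0.0


# Rows copied from the C8 (smart00832) alignment, residues 1058-1130.
query = "WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP"
model = "YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP"

c8_identity = aligned_identity(query, model)
print("identity over aligned columns: %.2f" % c8_identity)
```

Percent identity is only a rough companion to the bit score and E-value reported by the search; it does not replace them.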
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
857-1019 3.83e-28

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.



Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 113.27  E-value: 3.83e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184   857 WTCqLSTQCPSTCVLYGEGHIITFDGQRFVFDGDCEYMLATDdcgaNSSQPTFKVLTENVICGkSGVTCSRAIKISLGGL 936
Cdd:smart00216    1 WCC-TQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGD 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184   937 FITMADSN--YTVSGE-------EPLVHLKVKPSPLNLVLdidIPGRLNLTLVWNKHMSVSIKIrRATQQDALCGLCGNA 1007
Cdd:smart00216   75 EIELKDDNgkVTVNGQqvslpykTSDGSIQIRSSGGYLVV---ITSLGLIQVTFDGLTLLSVQL-PSKYRGKTCGLCGNF 150
                           170
                    ....*....|..
gi 1907182184  1008 NGNMKDDFETRS 1019
Cdd:smart00216  151 DGEPEDDFRTPD 162
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
45-194 2.14e-25

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.



Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 105.15  E-value: 2.14e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184   45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN----ISRIIMELGASVVTVNKETISVRDIG 120
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGAsgvcLKSVTVIVGDLEITLQKGGTVLVNGQ 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184  121 VVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDGK 194
Cdd:pfam00094   81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
589-661 1.13e-21

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.



Pssm-ID: 214843  Cd Length: 76  Bit Score: 91.63  E-value: 1.13e-21
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182184   589 AETHCSMLLKKGSVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:smart00832    4 ACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP-TFC 75
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
303-358 2.00e-16

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.



Pssm-ID: 460351  Cd Length: 55  Bit Score: 75.89  E-value: 2.00e-16
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184  303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:pfam01826    1 CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC 55
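The TIL description above gives a disulphide pairing pattern (1-7, 2-6, 3-5, 4-10 and 8-9). As a hedged illustration, the sketch below locates the cysteines in the query row of this alignment and applies that pattern to obtain candidate bonded residue pairs. It assumes that the cysteines are numbered in sequence order within the aligned segment and that this segment follows the family pattern; neither is verified here, and the helper names are hypothetical.

```python
# Pairing pattern quoted in the TIL domain description (1-based cysteine ranks).
TIL_PAIRING = [(1, 7), (2, 6), (3, 5), (4, 10), (8, 9)]


def cysteine_positions(aligned_query, start):
    """Return protein residue numbers of cysteines in an aligned query row.

    Gaps ('-') do not consume a residue number; lowercase (unaligned)
    residues do, because they are still part of the query sequence.
    """
    positions = []
    residue = start
    for ch in aligned_query:
        if ch == "-":
            continue
        if ch.upper() == "C":
            positions.append(residue)
        residue += 1
    return positions


def predicted_pairs(positions, pairing=TIL_PAIRING):
    """Map cysteine ranks to residue numbers (assumes exactly ten cysteines)."""
    if len(positions) != 10:
        raise ValueError("expected 10 cysteines, found %d" % len(positions))
    return [(positions[a - 1], positions[b - 1]) for a, b in pairing]


# Query row of the pfam01826 alignment, which starts at residue 303.
query = "CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC"
cys = cysteine_positions(query, 303)
pairs = predicted_pairs(cys)
print(cys)
print(pairs)
```

The resulting pairs are predictions from the family-level pattern only and would need structural or experimental support before being treated as actual bonds in this protein.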
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
765-828 3.23e-13

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.



Pssm-ID: 410995  Cd Length: 55  Bit Score: 66.57  E-value: 3.23e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184  765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:cd19941      1 CPPNEVYSEC--------GSACPPTCANPNAPPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
246-298 9.20e-11

C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.



Pssm-ID: 462584  Cd Length: 68  Bit Score: 60.09  E-value: 9.20e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907182184  246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALC 298
Cdd:pfam08742   18 VDPEPYFEACVYDM--CSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
CT smart00041
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ...
4164-4243 3.91e-10

C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.



Pssm-ID: 214482  Cd Length: 82  Bit Score: 58.95  E-value: 3.91e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184  4164 EHQITYQGCVAN-VTLTRCQGFCASSVSFNkdTLQLESSCGCCQPLSTYKKQLSLPCPDpdapGQQLTLTLQVFSSCVCS 4242
Cdd:smart00041    5 RQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPD----GSTVKKTVMHIEECGCE 78

                    .
gi 1907182184  4243 P 4243
Cdd:smart00041   79 P 79
PHA03247 super family cl33720
large tegument protein UL36; Provisional
3637-4059 1.70e-08

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 1.70e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3637 TVTPTQPTP---------------IPATTNSPMTTVGLTGTPvvHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFR 3701
Cdd:PHA03247  2567 SVPPPRPAPrpsepavtsrarrpdAPPQSARPRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP 2644
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3702 TSEQSTTTFPTPSAPQTSLV--TSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPvstilqttievTTPPNTSTPVTH 3779
Cdd:PHA03247  2645 TVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-----------PPPPPTPEPAPH 2713
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3780 S-TSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAIS 3858
Cdd:PHA03247  2714 AlVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3859 RTTGISGTPFRTPMKTTITFPTPSSLQTSM-ATLFPPfstsvmssteifntPTNPHSVSSASTSRPLSTSLPT------- 3930
Cdd:PHA03247  2794 SRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPP--------------PTSAQPTAPPPPPGPPPPSLPLggsvapg 2859
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3931 -TIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTAS 4009
Cdd:PHA03247  2860 gDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 4010 PPSSAPTfvSPTAASTVISSALPTI---HMTP---------TPSSRPTSSTGLLSTSKTTSH 4059
Cdd:PHA03247  2940 QPPLAPT--TDPAGAGEPSGAVPQPwlgALVPgrvavprfrVPQPAPSREAPASSTPPLTGH 2999
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
665-722 1.89e-07

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.



Pssm-ID: 410995  Cd Length: 55  Bit Score: 50.39  E-value: 1.89e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184  665 CTGNRTFSYDSQACDRTCLSLsDRETEChvSPVPVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
VWC_out smart00215
von Willebrand factor (vWF) type C domain;
360-404 2.89e-06

von Willebrand factor (vWF) type C domain;



Pssm-ID: 214565  Cd Length: 67  Bit Score: 47.56  E-value: 2.89e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*.
gi 1907182184   360 CMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPC-PGHCSLEGGS 404
Cdd:smart00215    1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCgPKPCLLHNLS 46
Chi1 super family cl43877
Chitinase [Carbohydrate transport and metabolism];
3181-3381 3.24e-06

Chitinase [Carbohydrate transport and metabolism];


The actual alignment was detected with superfamily member COG3469:

Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 53.22  E-value: 3.24e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3181 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3260
Cdd:COG3469     28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3261 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3340
Cdd:COG3469    108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATT 183
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1907182184 3341 PFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 3381
Cdd:COG3469    184 TATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
Chi1 super family cl43877
Chitinase [Carbohydrate transport and metabolism];
1657-1878 3.41e-06

Chitinase [Carbohydrate transport and metabolism];


The actual alignment was detected with superfamily member COG3469:

Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 53.22  E-value: 3.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1657 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 1736
Cdd:COG3469      7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1737 HPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaITNSLMTTGGLTG 1816
Cdd:COG3469     87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 1817 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 1878
Cdd:COG3469    163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1727-2176 5.11e-06

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 53.00  E-value: 5.11e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1727 TSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSThlplfstlsvtptteglntQSTPIPAitn 1806
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD-------------------VTSPTPA--- 479
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1807 slMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSS 1886
Cdd:pfam05109  480 --GTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTT 550
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1887 ----VTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTP-TSAPHLS 1961
Cdd:pfam05109  551 ptpnATSPTPAVTTPTPN------ATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvVTSPPKN 624
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1962 ETSAVTAHQSTPTAVSANSikptMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTH 2041
Cdd:pfam05109  625 ATSAVTTGQHNITSSSTSS----MSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPA 700
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2042 SGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFST 2121
Cdd:pfam05109  701 PRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTT 780
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 2122 DRTSTphlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR 2176
Cdd:pfam05109  781 DYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQ 832
Chi1 super family cl43877
Chitinase [Carbohydrate transport and metabolism];
2576-2766 1.31e-05

Chitinase [Carbohydrate transport and metabolism];


The actual alignment was detected with superfamily member COG3469:

Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 51.29  E-value: 1.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2576 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPT 2655
Cdd:COG3469     38 TATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTST 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2656 HSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 2735
Cdd:COG3469    118 GAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGA 193
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1907182184 2736 KHTTgvsletsvqTTIASPTPSAPQTSLATH 2766
Cdd:COG3469    194 TTPS---------ATTTATTTGPPTPGLPKH 215
Chi1 super family cl43877
Chitinase [Carbohydrate transport and metabolism];
2305-2516 1.54e-05

Chitinase [Carbohydrate transport and metabolism];


The actual alignment was detected with superfamily member COG3469:

Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.91  E-value: 1.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2305 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHS 2384
Cdd:COG3469      2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2385 GPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 2464
Cdd:COG3469     82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATG 161
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 2465 TTGVSLETSVQTTIASPTPSAPQTSLATHLP--FSSTSAVTPTSEVIITPTPQH 2516
Cdd:COG3469    162 GTTTTSTTTTTTSASTTPSATTTATATTASGatTPSATTTATTTGPPTPGLPKH 215
PHA03247 super family cl33720
large tegument protein UL36; Provisional
2646-3101 5.35e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.94  E-value: 5.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2646 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTS--------GTTSS 2717
Cdd:PHA03247  2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2718 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 2791
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2792 ssASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKP 2871
Cdd:PHA03247  2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRE 2796
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2872 TMSSTGTPVVHTTSGTTSSPQTPRTTHPSttvavsgtvhtTGLPSGTSVHTTTNFPTHSGPQSSLSTH---LPLFSTLSV 2948
Cdd:PHA03247  2797 SLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDVRRR 2865
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2949 TPTTEGLNTPTSPHSLSVASTSMPlmTVLPTTLEGTRPPHTSVPVTYTTTAATQTkssfSTDRTSAPHLSQPSTVTPTQS 3028
Cdd:PHA03247  2866 PPSRSPAAKPAAPARPPVRRLARP--AVSRSTESFALPPDQPERPPQPQAPPPPQ----PQPQPPPPPQPQPPPPPPPRP 2939
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3029 TPIPATTNSLMTTGGLTGTPP-----------VHTTSGTTSSPQTPRTTHPFSTvavsNTKHTTGVSLETSVQTTIASPT 3097
Cdd:PHA03247  2940 QPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrVAVPRFRVPQPAPSREAPASST----PPLTGHSLSRVSSWASSLALHE 3015

                   ....
gi 1907182184 3098 PSAP 3101
Cdd:PHA03247  3016 ETDP 3019
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
3330-3788 1.25e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.91  E-value: 1.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3330 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 3409
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3410 STSTTTGnilPTTIGQTGSPHTSVPVIYTTS----AITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIK 3485
Cdd:pfam05109  502 KAPDMTS---PTSAVTTPTPNATSPTPAVTTptpnATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTS 578
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3486 PTmSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTP 3565
Cdd:pfam05109  579 PT-SAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSP 657
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3566 TTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQPTP 3645
Cdd:pfam05109  658 STSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATS 737
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3646 --IPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSttlSTAPQFRTSEQSTTTFPTPSAPQTSLVTS 3723
Cdd:pfam05109  738 pqAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFT 814
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182184 3724 LPPFSTSS----VSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQ 3788
Cdd:pfam05109  815 SPPVTTAQatvpVPPTSQPRFSNLSMLVLQWASLAVLTLLLLLVMADCAFRRNLSTSHTYTTPPYDDAE 883
ROM1 super family cl34999
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ...
1407-1625 4.66e-03

RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];


The actual alignment was detected with superfamily member COG5422:

Pssm-ID: 227709 [Multi-domain]  Cd Length: 1175  Bit Score: 43.34  E-value: 4.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1407 STGPPLGTSVQTTINFPTLSAPQTSLVT-----PHPGLSSSSTALTSEILKTPTSSQMVSSASPQT-IFSSIHPKTTLEA 1480
Cdd:COG5422     28 SKQLLPPRRLQRKLNPISIRNGADNDIInseskESFGKYALGHQIFSSFSSSPKLFQRRNSAGPIThSPSATSSTSSLNS 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1481 TTPQHTAPLITSITSSITQAQSSFSTDKTYTSQHS------QPSTMTaHQSRSLPTVTTSTKSTMGLTG---TPPVHTTS 1551
Cdd:COG5422    108 NDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQkrknplLPSSST-HGTHPPIVFTDNNGSHAGAPNarsRKEIPSLG 186
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182184 1552 GTTSSPQTPRTTHPFSTVAVSNT---KHTTGVSLETSvqttiaSPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPT 1625
Cdd:COG5422    187 SQSMQLPSPHFRQKFSSSDTSNGfsyPSIRKNSRHSS------NSMPSFPHSSTAVLLKRHSGSSGASLISSNITPS 257
 
Name Accession Description Interval E-value
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
387-550 2.32e-29

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 116.73  E-value: 2.32e-29
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184   387 WTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCTYTLLQSpqLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVI 466
Cdd:smart00216    1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD--CSSEPTFSVLLKNVPCGGGATCLKSVKVELNGDEIEL 78
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184   467 SEDEV-ITNNGDTKLLPYKTHNITI-FRQTSTHLQMATTFGLELVfQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDD 544
Cdd:smart00216   79 KDDNGkVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGLIQV-TFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDD 157

                    ....*.
gi 1907182184   545 FTTSMG 550
Cdd:smart00216  158 FRTPDG 163
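
Each hit in this listing carries a one-line statistics summary (Pssm-ID, Cd Length, Bit Score, E-value). The following is a minimal sketch of how such a line could be read into typed fields, assuming the layout seen in this listing; the names `HitStats` and `parse_stats` are hypothetical helpers introduced for illustration and are not part of any NCBI tool.

```python
import re
from typing import NamedTuple, Optional


class HitStats(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Pattern assumed from the lines in this listing; the "[Multi-domain]" tag is optional.
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s+Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_stats(line: str) -> Optional[HitStats]:
    """Return the parsed statistics, or None if the line does not match."""
    m = _STATS_RE.search(line)
    if m is None:
        return None
    return HitStats(
        pssm_id=int(m.group(1)),
        multi_domain=m.group(2) is not None,
        cd_length=int(m.group(3)),
        bit_score=float(m.group(4)),
        e_value=float(m.group(5)),
    )


vwd = parse_stats(
    "Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 116.73  E-value: 2.32e-29"
)
c8 = parse_stats("Pssm-ID: 214843  Cd Length: 76  Bit Score: 110.12  E-value: 3.53e-28")
```

Parsing the E-value as a float makes it straightforward to sort or filter hits by significance, for example to separate the strong VWD, C8, and TIL hits from the weak, low-complexity matches in the repeat region.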
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
398-550 4.16e-29

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 115.55  E-value: 4.16e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184  398 CSLEGGSFVTTFDARPYRFHGTCTYTLLQSPQLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVISEDEVITNNGD 477
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGASGVCLKSVTVIVGDLEITLQKGGTVLVNGQ 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182184  478 TKLLPYKTHNITIFRQTSTHLQMATTFGLELVFQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDDFTTSMG 550
Cdd:pfam00094   81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDG 153
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
1058-1130 3.53e-28

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 110.12  E-value: 3.53e-28
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184  1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130
Cdd:smart00832    3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
857-1019 3.83e-28

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 113.27  E-value: 3.83e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184   857 WTCqLSTQCPSTCVLYGEGHIITFDGQRFVFDGDCEYMLATDdcgaNSSQPTFKVLTENVICGkSGVTCSRAIKISLGGL 936
Cdd:smart00216    1 WCC-TQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGD 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184   937 FITMADSN--YTVSGE-------EPLVHLKVKPSPLNLVLdidIPGRLNLTLVWNKHMSVSIKIrRATQQDALCGLCGNA 1007
Cdd:smart00216   75 EIELKDDNgkVTVNGQqvslpykTSDGSIQIRSSGGYLVV---ITSLGLIQVTFDGLTLLSVQL-PSKYRGKTCGLCGNF 150
                           170
                    ....*....|..
gi 1907182184  1008 NGNMKDDFETRS 1019
Cdd:smart00216  151 DGEPEDDFRTPD 162
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
1062-1129 1.44e-25

C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them, to avoid overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 102.46  E-value: 1.44e-25
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1062 KCNIINSQT-FAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFC 1129
Cdd:pfam08742    1 KCGLLSDSGpFAPCHSVVDPEPYFEACVYDMCSC--GGDDECLCAALAAYARACQAAGVCIgDWRTPTFC 68
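
The query and model rows of an alignment like the one above can be compared column by column. The sketch below counts identical residues over the columns where neither row has a gap. It assumes that `-` marks a gap and that lowercase letters are ordinary residues to be compared case-insensitively; `alignment_identity` is a hypothetical helper, and the result is a simple illustration rather than the score reported by CD-Search.

```python
from typing import Tuple


def alignment_identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for two equal-length aligned rows.

    Columns where either row has a gap ('-') are skipped.
    Comparison is case-insensitive.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue  # gap column: not counted as aligned
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    return identical, aligned


# Rows copied from the pfam08742 alignment of residues 1062-1129.
query_row = "KCNIINSQT-FAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFC"
model_row = "KCGLLSDSGpFAPCHSVVDPEPYFEACVYDMCSC--GGDDECLCAALAAYARACQAAGVCIgDWRTPTFC"

identical, aligned = alignment_identity(query_row, model_row)
fraction = identical / aligned
```

Here 41 of the 66 ungapped columns are identical, roughly 62%.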
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
45-194 2.14e-25

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 105.15  E-value: 2.14e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184   45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN----ISRIIMELGASVVTVNKETISVRDIG 120
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGAsgvcLKSVTVIVGDLEITLQKGGTVLVNGQ 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184  121 VVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDGK 194
Cdd:pfam00094   81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
45-193 1.17e-24

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 103.25  E-value: 1.17e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184    45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDvpATFSIQLRRDMEG----NISRIIMELGASVVTVNKETISVR-DI 119
Cdd:smart00216   12 CSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSE--PTFSVLLKNVPCGggatCLKSVKVELNGDEIELKDDNGKVTvNG 89
                            90       100       110       120       130       140       150
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184   120 GVVSLPYTSNGLQITPYgQSVQLVAKQLELELV-ITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDG 193
Cdd:smart00216   90 QQVSLPYKTSDGSIQIR-SSGGYLVVITSLGLIqVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDDFRTPDG 163
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
589-661 1.13e-21

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 91.63  E-value: 1.13e-21
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182184   589 AETHCSMLLKKGSVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:smart00832    4 ACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP-TFC 75
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
592-661 3.10e-21

C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them, to avoid overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 90.13  E-value: 3.10e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184  592 HCSMLLKKGsVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:pfam08742    1 KCGLLSDSG-PFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTP-TFC 68
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
869-1019 1.68e-20

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 90.89  E-value: 1.68e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184  869 CVLYGEGHIITFDGQRFVFDGDCEYMLAtDDCGANSSqPTFKVLTENVICGKSGVtCSRAIKISLGGLFITMADSNY-TV 947
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLA-KDCSEEPD-FSFSVTNKNCNGGASGV-CLKSVTVIVGDLEITLQKGGTvLV 77
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184  948 SGEEplVHLKVKPSPLNL------VLDIDIPGRLNLTLVWNKHMSVSIKIRRaTQQDALCGLCGNANGNMKDDFETRS 1019
Cdd:pfam00094   78 NGQK--VSLPYKSDGGEVeilgsgFVVVDLSPGVGLQVDGDGRGQLFVTLSP-SYQGKTCGLCGNYNGNQEDDFMTPD 152
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
303-358 2.00e-16

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.


Pssm-ID: 460351  Cd Length: 55  Bit Score: 75.89  E-value: 2.00e-16
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184  303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:pfam01826    1 CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC 55
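
The disulphide pattern given in the description (cysteines 1-7, 2-6, 3-5, 4-10 and 8-9) can be mapped onto the query residues of this hit. The sketch below numbers the cysteines in the aligned query row and applies the pattern purely by ordinal position. It assumes that the ten cysteines of the segment correspond one-to-one with the ten of the domain model, so the output is a list of candidate pairs, not a structural prediction; `cysteine_positions` and `candidate_pairs` are hypothetical helpers.

```python
from typing import List, Tuple

# Pairing pattern quoted in the pfam01826 description (1-based cysteine ordinals).
TIL_PAIRS = [(1, 7), (2, 6), (3, 5), (4, 10), (8, 9)]


def cysteine_positions(aligned_row: str, start: int) -> List[int]:
    """Return the sequence positions of cysteines in an aligned row.

    `start` is the residue number of the first residue; '-' gaps consume no residue.
    """
    positions = []
    residue = start
    for ch in aligned_row:
        if ch == "-":
            continue
        if ch.upper() == "C":
            positions.append(residue)
        residue += 1
    return positions


def candidate_pairs(positions: List[int]) -> List[Tuple[int, int]]:
    """Apply the 1-7, 2-6, 3-5, 4-10, 8-9 pattern to exactly ten cysteines."""
    if len(positions) != 10:
        raise ValueError("the TIL pattern assumes exactly ten cysteines")
    return [(positions[i - 1], positions[j - 1]) for i, j in TIL_PAIRS]


# Query row of the pfam01826 hit covering residues 303-358.
row = "CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC"
cys = cysteine_positions(row, 303)
pairs = candidate_pairs(cys)
```

For this segment the cysteines fall at residues 303, 312, 316, 320, 327, 331, 335, 337, 352 and 358, so the pattern suggests the candidate pairs 303-335, 312-331, 316-327, 320-358 and 337-352.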
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
303-358 1.55e-15

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 73.51  E-value: 1.55e-15
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184  303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANPNAppPCTKQCVEGCFCPEGYVRNS---GGKCVPPSQC 55
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
765-828 3.23e-13

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 66.57  E-value: 3.23e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184  765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:cd19941      1 CPPNEVYSEC--------GSACPPTCANPNAPPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
765-828 4.68e-13

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.


Pssm-ID: 460351  Cd Length: 55  Bit Score: 66.26  E-value: 4.68e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184  765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:pfam01826    1 CPANEVYSEC--------GSACPPTCANLSPPDVC-PEPCVEGCVCPPGFVRNSGGKCVPPSDC 55
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
246-298 9.20e-11

C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them, to avoid overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 60.09  E-value: 9.20e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907182184  246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALC 298
Cdd:pfam08742   18 VDPEPYFEACVYDM--CSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
CT smart00041
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ...
4164-4243 3.91e-10

C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.


Pssm-ID: 214482  Cd Length: 82  Bit Score: 58.95  E-value: 3.91e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184  4164 EHQITYQGCVAN-VTLTRCQGFCASSVSFNkdTLQLESSCGCCQPLSTYKKQLSLPCPDpdapGQQLTLTLQVFSSCVCS 4242
Cdd:smart00041    5 RQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPD----GSTVKKTVMHIEECGCE 78

                    .
gi 1907182184  4243 P 4243
Cdd:smart00041   79 P 79
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
246-299 5.55e-09

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 55.42  E-value: 5.55e-09
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....
gi 1907182184   246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALCP 299
Cdd:smart00832   25 VDPEPFFENCVYDT--CACGGDCECLCDALAAYAAACAEAGVCISPWRTPTFCP 76
PHA03247 PHA03247
large tegument protein UL36; Provisional
3637-4059 1.70e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 1.70e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3637 TVTPTQPTP---------------IPATTNSPMTTVGLTGTPvvHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFR 3701
Cdd:PHA03247  2567 SVPPPRPAPrpsepavtsrarrpdAPPQSARPRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP 2644
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3702 TSEQSTTTFPTPSAPQTSLV--TSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPvstilqttievTTPPNTSTPVTH 3779
Cdd:PHA03247  2645 TVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-----------PPPPPTPEPAPH 2713
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3780 S-TSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAIS 3858
Cdd:PHA03247  2714 AlVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3859 RTTGISGTPFRTPMKTTITFPTPSSLQTSM-ATLFPPfstsvmssteifntPTNPHSVSSASTSRPLSTSLPT------- 3930
Cdd:PHA03247  2794 SRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPP--------------PTSAQPTAPPPPPGPPPPSLPLggsvapg 2859
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3931 -TIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTAS 4009
Cdd:PHA03247  2860 gDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 4010 PPSSAPTfvSPTAASTVISSALPTI---HMTP---------TPSSRPTSSTGLLSTSKTTSH 4059
Cdd:PHA03247  2940 QPPLAPT--TDPAGAGEPSGAVPQPwlgALVPgrvavprfrVPQPAPSREAPASSTPPLTGH 2999
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
665-722 1.89e-07

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 50.39  E-value: 1.89e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184  665 CTGNRTFSYDSQACDRTCLSLsDRETEChvSPVPVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
3734-4050 7.75e-07

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 55.77  E-value: 7.75e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3734 PTDEIHITSTNPHTVSS---VSMSRPVSTIL--QTTIEVT-----TPPNTSTPVTHSTSATTEAQGSFSTERTSTSYLSH 3803
Cdd:TIGR00927   68 SNDEMMMVSSDPPKSSSemeGEMLAPQATVGrdEATPSIAmentpSPPRRTAKITPTTPKNNYSPTAAGTERVKEDTPAT 147
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3804 PSSTTVH-QSTAG-PVITSIKSTM--GVTGTPPVHTTSGT---TSSP----------------QTPHSTHPISTAAISRT 3860
Cdd:TIGR00927  148 PSRALNHyISTSGrQRVKSYTPKPrgEVKSSSPTQTREKVrkyTPSPlgrmvnsyapstfmtmPRSHGITPRTTVKDSEI 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3861 TGISGTPFRTPMKTTITFPTPSSLQtSMATLFPPFSTSVMSsTEIFNTPTN---------PHSVSSASTSRP---LSTSL 3928
Cdd:TIGR00927  228 TATYKMLETNPSKRTAGKTTPTPLK-GMTDNTPTFLTREVE-TDLLTSPRSvvekntlttPRRVESNSSTNHwglVGKNN 305
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3929 PTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTL--QYTPTPSSVSHSPLLTTP 4006
Cdd:TIGR00927  306 LTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASATfrGLEKNPSTAPSTPATPRV 385
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907182184 4007 TASPPSSA-------PTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGL 4050
Cdd:TIGR00927  386 RAVLTTQVhhcvvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPDL 436
VWC_out smart00215
von Willebrand factor (vWF) type C domain;
360-404 2.89e-06

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214565  Cd Length: 67  Bit Score: 47.56  E-value: 2.89e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*.
gi 1907182184   360 CMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPC-PGHCSLEGGS 404
Cdd:smart00215    1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCgPKPCLLHNLS 46
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3181-3381 3.24e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 53.22  E-value: 3.24e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3181 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3260
Cdd:COG3469     28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3261 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3340
Cdd:COG3469    108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATT 183
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1907182184 3341 PFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 3381
Cdd:COG3469    184 TATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1657-1878 3.41e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 53.22  E-value: 3.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1657 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 1736
Cdd:COG3469      7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1737 HPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaITNSLMTTGGLTG 1816
Cdd:COG3469     87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 1817 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 1878
Cdd:COG3469    163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1727-2176 5.11e-06

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 53.00  E-value: 5.11e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1727 TSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSThlplfstlsvtptteglntQSTPIPAitn 1806
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD-------------------VTSPTPA--- 479
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1807 slMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSS 1886
Cdd:pfam05109  480 --GTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTT 550
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1887 ----VTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTP-TSAPHLS 1961
Cdd:pfam05109  551 ptpnATSPTPAVTTPTPN------ATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvVTSPPKN 624
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1962 ETSAVTAHQSTPTAVSANSikptMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTH 2041
Cdd:pfam05109  625 ATSAVTTGQHNITSSSTSS----MSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPA 700
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2042 SGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFST 2121
Cdd:pfam05109  701 PRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTT 780
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 2122 DRTSTphlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR 2176
Cdd:pfam05109  781 DYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQ 832
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
665-722 1.27e-05

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.


Pssm-ID: 460351  Cd Length: 55  Bit Score: 45.07  E-value: 1.27e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182184  665 CTGNRTFSYDSQACDRTCLSLSDR---ETEChvspvpVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:pfam01826    1 CPANEVYSECGSACPPTCANLSPPdvcPEPC------VEGCVCPPGFVRNSGGKCVPPSDC 55
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2576-2766 1.31e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 51.29  E-value: 1.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2576 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPT 2655
Cdd:COG3469     38 TATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTST 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2656 HSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 2735
Cdd:COG3469    118 GAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGA 193
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1907182184 2736 KHTTgvsletsvqTTIASPTPSAPQTSLATH 2766
Cdd:COG3469    194 TTPS---------ATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2305-2516 1.54e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.91  E-value: 1.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2305 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHS 2384
Cdd:COG3469      2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2385 GPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 2464
Cdd:COG3469     82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATG 161
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 2465 TTGVSLETSVQTTIASPTPSAPQTSLATHLP--FSSTSAVTPTSEVIITPTPQH 2516
Cdd:COG3469    162 GTTTTSTTTTTTSASTTPSATTTATATTASGatTPSATTTATTTGPPTPGLPKH 215
PHA03247 PHA03247
large tegument protein UL36; Provisional
2646-3101 5.35e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.94  E-value: 5.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2646 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTS--------GTTSS 2717
Cdd:PHA03247  2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2718 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 2791
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2792 ssASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKP 2871
Cdd:PHA03247  2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRE 2796
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2872 TMSSTGTPVVHTTSGTTSSPQTPRTTHPSttvavsgtvhtTGLPSGTSVHTTTNFPTHSGPQSSLSTH---LPLFSTLSV 2948
Cdd:PHA03247  2797 SLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDVRRR 2865
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2949 TPTTEGLNTPTSPHSLSVASTSMPlmTVLPTTLEGTRPPHTSVPVTYTTTAATQTkssfSTDRTSAPHLSQPSTVTPTQS 3028
Cdd:PHA03247  2866 PPSRSPAAKPAAPARPPVRRLARP--AVSRSTESFALPPDQPERPPQPQAPPPPQ----PQPQPPPPPQPQPPPPPPPRP 2939
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3029 TPIPATTNSLMTTGGLTGTPP-----------VHTTSGTTSSPQTPRTTHPFSTvavsNTKHTTGVSLETSVQTTIASPT 3097
Cdd:PHA03247  2940 QPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrVAVPRFRVPQPAPSREAPASST----PPLTGHSLSRVSSWASSLALHE 3015

                   ....
gi 1907182184 3098 PSAP 3101
Cdd:PHA03247  3016 ETDP 3019
ROM1 COG5422
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ...
3802-4036 8.92e-05

RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];


Pssm-ID: 227709 [Multi-domain]  Cd Length: 1175  Bit Score: 49.12  E-value: 8.92e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3802 SHPSS---TTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGT--TSSPQTPHSTHpISTAAISrttgISGTPFRTPMKTTI 3876
Cdd:COG5422     59 SKESFgkyALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATssTSSLNSNDGDQ-FSPASDS----LSFNPSSTQSRKDS 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3877 TFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLStslPTTIKGTGTPQTPVSDINTTSATTQAHS 3956
Cdd:COG5422    134 GPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEI---PSLGSQSMQLPSPHFRQKFSSSDTSNGF 210
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3957 SFPTTRTSTSHlslpSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPT-----ASPPSSAPTFVSPTAASTVISSAL 4031
Cdd:COG5422    211 SYPSIRKNSRH----SSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSssnseAMSTSSKRPYIYPALLSRVAVEFK 286

                   ....*
gi 1907182184 4032 PTIHM 4036
Cdd:COG5422    287 MRLQL 291
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
2432-2763 1.32e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 48.22  E-value: 1.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2432 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSAVTPTSEVIIT 2511
Cdd:pfam03154  186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2512 PTPQhtfssaSTSTTTGNILPTTIgQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAV 2591
Cdd:pfam03154  265 PLPQ------PSLHGQMPPMPHSL-QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2592 SANSIKPtMSSTGTPVVHT----TSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPqsslSTH- 2666
Cdd:pfam03154  338 QPPREQP-LPPAPLSMPHIkpppTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHp 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2667 -----LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGV 2741
Cdd:pfam03154  413 pplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGI 492
                          330       340
                   ....*....|....*....|..
gi 1907182184 2742 SLETSVQTTIASPTPSAPQTSL 2763
Cdd:pfam03154  493 QPPSSASVSSSGPVPAAVSCPL 514
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
3473-3846 4.59e-04

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 46.11  E-value: 4.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3473 QSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLS 3552
Cdd:pfam17823   45 DAVPRADNKSSEQ*NFCAATAAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPS 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3553 THLPLFSTLSVTPTTEGLNTPTS--PHSLSAASTSMPLMTVLPTTlEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTP 3630
Cdd:pfam17823  125 SAAQSLPAAIAALPSEAFSAPRAaaCRANASAAPRAAIAAASAPH-AASPAPRTAASSTTAASSTTAASSAPTTAASSAP 203
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3631 hlsqsSTVTPTQPTPIPAT-TNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQSTTT 3709
Cdd:pfam17823  204 -----ATLTPARGISTAATaTGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHAR 278
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3710 FPTPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVS--MSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEA 3787
Cdd:pfam17823  279 RLSPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGepTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEP 358
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182184 3788 QG--------SFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTP 3846
Cdd:pfam17823  359 SAspvpvlhtSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAPEQVATEATAGTASAGPTPRSSGDP 425
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1492-1875 5.66e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 46.30  E-value: 5.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1492 SITSSITQAQSSFSTDKTYTSQHSQPSTMTAHQSRSLPTVTTSTKSTMGLTGTPPVHTTSGTTSSPQ-TPRTTHPFSTVA 1570
Cdd:pfam03154  143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQgSPATSQPPNQTQ 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1571 VSNTKHTTgvsletsvqttIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIgQT 1650
Cdd:pfam03154  223 STAAPHTL-----------IQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ------PSLHGQMPPMPHSL-QT 284
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1651 GSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPtMSSTGTPVVHTT-SGTTSS 1729
Cdd:pfam03154  285 GPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQP-LPPAPLSMPHIKpPPTTPI 363
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1730 PQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTH------LPLFSTLSVTPTTEGLNTQSTP 1800
Cdd:pfam03154  364 PQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHppplqlMPQSQQLPPPPAQPPVLTQSQS 439
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 1801 IPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 1875
Cdd:pfam03154  440 LPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
1918-3538 6.14e-04

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 46.30  E-value: 6.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1918 TTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTS 1997
Cdd:COG3210     80 GIGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAG 159
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1998 GTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLS 2077
Cdd:COG3210    160 NNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAG 239
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2078 VASTSMplmTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQSTPIPATTNSLMTTGGLT 2157
Cdd:COG3210    240 VISTGG---TDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGT 316
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2158 GTPPVHTNSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSKVII 2237
Cdd:COG3210    317 AAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASS 396
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2238 TPTPQHTLSSASTSTTTGNILPTTIGKTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTA 2317
Cdd:COG3210    397 TTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSA 476
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2318 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLF 2397
Cdd:COG3210    477 GNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTT 556
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2398 STLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTT 2477
Cdd:COG3210    557 AASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTG 636
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2478 IASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQT 2557
Cdd:COG3210    637 SAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVT 716
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2558 KTSFSTDRTS-------TSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRT--THPST 2628
Cdd:COG3210    717 GQIGALANANgdtvtfgNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNagAEISI 796
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2629 TVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPV 2708
Cdd:COG3210    797 DITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAAS 876
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2709 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQ 2788
Cdd:COG3210    877 ITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSA 956
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2789 HTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANS 2868
Cdd:COG3210    957 ASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGT 1036
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2869 IKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSV 2948
Cdd:COG3210   1037 AATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGV 1116
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2949 TPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSAPHLSQPSTVTPTQS 3028
Cdd:COG3210   1117 TASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGT 1196
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3029 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATH 3108
Cdd:COG3210   1197 DLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGN 1276
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3109 LPFSSTSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSTITQTKTSFFTDRTSTSTSAP 3188
Cdd:COG3210   1277 AGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGN 1356
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3189 HLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF 3268
Cdd:COG3210   1357 GATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGG 1436
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3269 PTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVS 3348
Cdd:COG3210   1437 TGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTT 1516
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3349 NTKHTTGVSLETSVQTTIASPTPSAPQTSLAThlpfsstsSVTPTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTGS 3428
Cdd:COG3210   1517 AEVAKASLEGGEGTYGGSSVAEAGTGGGILGA--------VSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQA 1588
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3429 PHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 3508
Cdd:COG3210   1589 PTAGNTATLTLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSGAVNGAGNGW 1668
                         1610      1620      1630
                   ....*....|....*....|....*....|
gi 1907182184 3509 PRTTHPSTTVAVSGTVHTTGLPSGTSVHTT 3538
Cdd:COG3210   1669 AVDLTDATLAGLGGATTAAAGNVATGDTAP 1698
PHA03247 PHA03247
large tegument protein UL36; Provisional
1758-2222 6.45e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 6.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1758 SVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTS--------GTTSS 1829
Cdd:PHA03247  2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1830 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 1903
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1904 ssASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIkp 1983
Cdd:PHA03247  2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-- 2798
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1984 tmsstgtpvvhttsgttSSPQTPrTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSlsthlPLFSTLsvtpT 2063
Cdd:PHA03247  2799 -----------------PSPWDP-ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG-----PPPPSL----P 2851
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2064 TEGLNTPTSPHSLSVASTSMPLMTVLPttlegTRPPHTSVPVTYTttaatqtksSFSTDRTSTPHLSQSSTVTPTQSTPi 2143
Cdd:PHA03247  2852 LGGSVAPGGDVRRRPPSRSPAAKPAAP-----ARPPVRRLARPAV---------SRSTESFALPPDQPERPPQPQAPPP- 2916
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2144 PATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQTSLATHLP 2222
Cdd:PHA03247  2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREAPASSTPPL 2996
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
3330-3788 1.25e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.91  E-value: 1.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3330 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 3409
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3410 STSTTTGnilPTTIGQTGSPHTSVPVIYTTS----AITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIK 3485
Cdd:pfam05109  502 KAPDMTS---PTSAVTTPTPNATSPTPAVTTptpnATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTS 578
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3486 PTmSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTP 3565
Cdd:pfam05109  579 PT-SAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSP 657
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3566 TTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQPTP 3645
Cdd:pfam05109  658 STSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATS 737
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3646 --IPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSttlSTAPQFRTSEQSTTTFPTPSAPQTSLVTS 3723
Cdd:pfam05109  738 pqAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFT 814
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182184 3724 LPPFSTSS----VSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQ 3788
Cdd:pfam05109  815 SPPVTTAQatvpVPPTSQPRFSNLSMLVLQWASLAVLTLLLLLVMADCAFRRNLSTSHTYTTPPYDDAE 883
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3464-3684 1.30e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.74  E-value: 1.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3464 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3543
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3544 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 3623
Cdd:COG3469     81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182184 3624 TDRTSTPhlSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSL 3684
Cdd:COG3469    159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKHVL 217
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2849-3063 1.64e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.36  E-value: 1.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2849 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2928
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2929 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVTYTTTAATQTKSSF 3007
Cdd:COG3469     81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184 3008 STDRTSAPHLSQPSTVTPTQSTPIPATTnslmTTGGLTGTPPVHTTSGTTSSPQTP 3063
Cdd:COG3469    159 ATGGTTTTSTTTTTTSASTTPSATTTAT----ATTASGATTPSATTTATTTGPPTP 210
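Each hit in this listing carries a one-line summary of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The following is a minimal sketch of how such a line could be parsed. The function name `parse_summary` and the field names are hypothetical, and the field layout is inferred only from the summary lines shown in this listing, not from any documented format.

```python
import re

# Hypothetical pattern: the layout is inferred from the summary lines in this
# listing; the bracketed flag (e.g. "[Multi-domain]") is treated as optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line):
    """Return the fields of one hit summary line as a dict."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


record = parse_summary(
    "Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.36  E-value: 1.64e-03"
)
print(record)
```

Parsing the E-value with `float` keeps the scientific notation usable for sorting or thresholding hits.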
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1961-2179 1.67e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.36  E-value: 1.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1961 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2040
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2041 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 2120
Cdd:COG3469     81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182184 2121 TDRTSTPhlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTH 2179
Cdd:COG3469    159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
VWC pfam00093
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ...
360-395 1.96e-03

von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.


Pssm-ID: 278520  Cd Length: 57  Bit Score: 39.33  E-value: 1.96e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907182184  360 CMLNGMVYGPGEITKTA-CQTCQCTMGRWTCTKQPCP 395
Cdd:pfam00093    1 CVQNGVVYENGETWKPDlCTICTCDDGKVLCDKIICP 37
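The two rows of an alignment block like the one above can be compared column by column. The sketch below counts identical residues over aligned columns. The function name `alignment_identity` is hypothetical, and the rule that `-` and lowercase letters mark unaligned positions is an assumption about the display convention used in this listing.

```python
def alignment_identity(query, subject):
    """Count (identical, aligned) columns for two equal-length alignment rows.

    Assumption: '-' marks a gap and lowercase letters mark unaligned residues,
    so columns containing either are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # skip gap or unaligned column
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the VWC pfam00093 hit (query residues 360-395).
query_row = "CMLNGMVYGPGEITKTA-CQTCQCTMGRWTCTKQPCP"
model_row = "CVQNGVVYENGETWKPDlCTICTCDDGKVLCDKIICP"
identical, aligned = alignment_identity(query_row, model_row)
print(f"{identical}/{aligned} identical aligned columns")
```

For this hit the count is 16 identical residues over 36 aligned columns, which fits the weak E-value reported for it.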
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
2705-3068 2.06e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 2.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2705 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIIT 2784
Cdd:pfam03154  186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2785 PTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAV 2864
Cdd:pfam03154  265 PLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2865 SANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTHL 2940
Cdd:pfam03154  338 QPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHP 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2941 PlfsTLSVTPTTEGLNTP-------TSPHSLSVASTSMPLMTVL-PTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTdrT 3012
Cdd:pfam03154  413 P---PLQLMPQSQQLPPPpaqppvlTQSQSLPPPAASHPPTSGLhQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTS--S 487
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184 3013 SAPHLSQPSTVTPTQSTPIPATTNSLMttggltgtPPVHTTSGTTSSPQTPRTTHP 3068
Cdd:pfam03154  488 AMPGIQPPSSASVSSSGPVPAAVSCPL--------PPVQIKEEALDEAEEPESPPP 535
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
2575-2899 2.13e-03

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 44.22  E-value: 2.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2575 LSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTSSSPQTPRT------THPSTTVAVSGTVHTTGLPSGTSVQ 2648
Cdd:TIGR00927   91 LAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSRALNHYISTSGRQRVKSYT 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2649 TTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipATTNSLMTTGGLTGTPPVHTTSGT 2714
Cdd:TIGR00927  169 PKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEITATYKMLETNPSKRTAGK 245
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2715 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 2794
Cdd:TIGR00927  246 TTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQV 325
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2795 STSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLSETSAVTAHQST---PTAV 2864
Cdd:TIGR00927  326 TISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVRAVLTTQVHHCVvvkPAPA 404
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1907182184 2865 SANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2899
Cdd:TIGR00927  405 VPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
3018-3378 2.27e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 2.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3018 SQPSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSgtTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqttIASPT 3097
Cdd:pfam03154  169 TQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPS--VPPQGSPATSQPPNQTQSTAAPHTL-----------IQQTP 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3098 PSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPViyttstitqTKTSFF 3177
Cdd:pfam03154  236 TLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPV---------PPQPFP 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3178 TDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP 3257
Cdd:pfam03154  300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3258 SGTSVQTTTNFPTHSG--PQSSLSTH------------LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPV 3323
Cdd:pfam03154  380 GPSPFQMNSNLPPPPAlkPLSSLSTHhppsahppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQ 459
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 3324 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 3378
Cdd:pfam03154  460 SPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
3772-4000 3.40e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.84  E-value: 3.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3772 NTSTPVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHP 3851
Cdd:NF033849   250 STSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSS 329
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3852 ISTAAISRTTGISGT-----PFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLST 3926
Cdd:NF033849   330 SYNVSSGTGVSSSHSdgtsqSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 3927 SLPTTiKGTGTpQTPVSDINTTSATTQAHSSFPTTRTSTSHlSLPSSMTSTLTpASRSASTLQYTPTPSSVSHS 4000
Cdd:NF033849   410 SQGGS-EGWGS-GDSVQSVSQSYGSSSSTGTSSGHSDSSSH-STSSGQADSVS-QGTSWSEGTGTSQGQSVGTS 479
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
2127-2490 3.84e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.60  E-value: 3.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2127 PHLSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVhtnSGTTSSPQTPRTTHPFSTVAVsntkhttgvsletsvQTTIA 2206
Cdd:pfam03154  171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV---PPQGSPATSQPPNQTQSTAAP---------------HTLIQ 232
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2207 SPTPSAPQTSLATHLPFSSTSSVTPTSKVIITPTPQhtlssaSTSTTTGNILPTTIgKTGSPHTSVPVIYTTSAITQTKT 2286
Cdd:pfam03154  233 QTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ------PSLHGQMPPMPHSL-QTGPSHMQHPVPPQPFPLTPQSS 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2287 SFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGT 2362
Cdd:pfam03154  306 QSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPF 384
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2363 VHTTGLPSGTSVHTTTNFPTHSGPqsslSTH------LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVH 2436
Cdd:pfam03154  385 QMNSNLPPPPALKPLSSLSTHHPP----SAHppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQS 460
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 2437 TTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 2490
Cdd:pfam03154  461 PFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
1669-2011 3.84e-03

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.45  E-value: 3.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1669 TKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTTSSPQTPRT------THPSTTV 1742
Cdd:TIGR00927   73 MMVSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSR 150
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1743 AVSGTVHTTGLPSGTSVHTTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipAITNSL 1808
Cdd:TIGR00927  151 ALNHYISTSGRQRVKSYTPKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEI 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1809 MTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVT 1888
Cdd:TIGR00927  228 TATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLT 307
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1889 PTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLS 1961
Cdd:TIGR00927  308 TPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVR 386
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 1962 ETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2011
Cdd:TIGR00927  387 AVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
PHA03247 PHA03247
large tegument protein UL36; Provisional
2299-2768 4.66e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 4.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2299 APHLSEtSAVTAHQSTPTAvsansiKPTMSSTGTPVVHTTSGTTSSPQT--PRTTHPSTTVAVSGTVHTTGLPSGTSVHT 2376
Cdd:PHA03247  2574 APRPSE-PAVTSRARRPDA------PPQSARPRAPVDDRGDPRGPAPPSplPPDTHAPDPPPPSPSPAANEPDPHPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2377 TTNFPTHSGPQSSlSTHLPLFSTLSVTPTTEGLNTQSTPIPATTnslmttggltgtPPVHTTSGTTSSPQTPRTTHPFST 2456
Cdd:PHA03247  2647 PPPERPRDDPAPG-RVSRPRRARRLGRAAQASSPPQRPRRRAAR------------PTVGSLTSLADPPPPPPTPEPAPH 2713
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2457 VAVSntkhttgvsletsvqttiASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSAststttgnilPTTig 2536
Cdd:PHA03247  2714 ALVS------------------ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP----------PTT-- 2763
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2537 qTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSS 2616
Cdd:PHA03247  2764 -AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2617 SPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTnfPTHSgPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPI----PAT 2692
Cdd:PHA03247  2843 PGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA--PARP-PVRRLARPAVSRSTESFALPPDQPERPPQPQapppPQP 2919
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182184 2693 TNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQTSLATHLP 2768
Cdd:PHA03247  2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREAPASSTPPL 2996
ROM1 COG5422
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ...
1407-1625 4.66e-03

RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];


Pssm-ID: 227709 [Multi-domain]  Cd Length: 1175  Bit Score: 43.34  E-value: 4.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1407 STGPPLGTSVQTTINFPTLSAPQTSLVT-----PHPGLSSSSTALTSEILKTPTSSQMVSSASPQT-IFSSIHPKTTLEA 1480
Cdd:COG5422     28 SKQLLPPRRLQRKLNPISIRNGADNDIInseskESFGKYALGHQIFSSFSSSPKLFQRRNSAGPIThSPSATSSTSSLNS 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1481 TTPQHTAPLITSITSSITQAQSSFSTDKTYTSQHS------QPSTMTaHQSRSLPTVTTSTKSTMGLTG---TPPVHTTS 1551
Cdd:COG5422    108 NDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQkrknplLPSSST-HGTHPPIVFTDNNGSHAGAPNarsRKEIPSLG 186
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182184 1552 GTTSSPQTPRTTHPFSTVAVSNT---KHTTGVSLETSvqttiaSPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPT 1625
Cdd:COG5422    187 SQSMQLPSPHFRQKFSSSDTSNGfsyPSIRKNSRHSS------NSMPSFPHSSTAVLLKRHSGSSGASLISSNITPS 257
 
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
398-550 4.16e-29

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 115.55  E-value: 4.16e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184  398 CSLEGGSFVTTFDARPYRFHGTCTYTLLQSPQLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVISEDEVITNNGD 477
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGASGVCLKSVTVIVGDLEITLQKGGTVLVNGQ 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182184  478 TKLLPYKTHNITIFRQTSTHLQMATTFGLELVFQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDDFTTSMG 550
Cdd:pfam00094   81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDG 153
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
1058-1130 3.53e-28

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 110.12  E-value: 3.53e-28
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184  1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130
Cdd:smart00832    3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
857-1019 3.83e-28

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 113.27  E-value: 3.83e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184   857 WTCqLSTQCPSTCVLYGEGHIITFDGQRFVFDGDCEYMLATDdcgaNSSQPTFKVLTENVICGkSGVTCSRAIKISLGGL 936
Cdd:smart00216    1 WCC-TQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGD 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184   937 FITMADSN--YTVSGE-------EPLVHLKVKPSPLNLVLdidIPGRLNLTLVWNKHMSVSIKIrRATQQDALCGLCGNA 1007
Cdd:smart00216   75 EIELKDDNgkVTVNGQqvslpykTSDGSIQIRSSGGYLVV---ITSLGLIQVTFDGLTLLSVQL-PSKYRGKTCGLCGNF 150
                           170
                    ....*....|..
gi 1907182184  1008 NGNMKDDFETRS 1019
Cdd:smart00216  151 DGEPEDDFRTPD 162
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
1062-1129 1.44e-25

C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to avoid overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 102.46  E-value: 1.44e-25
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1062 KCNIINSQT-FAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFC 1129
Cdd:pfam08742    1 KCGLLSDSGpFAPCHSVVDPEPYFEACVYDMCSC--GGDDECLCAALAAYARACQAAGVCIgDWRTPTFC 68
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
45-194 2.14e-25

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 105.15  E-value: 2.14e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184   45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN----ISRIIMELGASVVTVNKETISVRDIG 120
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGAsgvcLKSVTVIVGDLEITLQKGGTVLVNGQ 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184  121 VVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDGK 194
Cdd:pfam00094   81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
45-193 1.17e-24

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 103.25  E-value: 1.17e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184    45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDvpATFSIQLRRDMEG----NISRIIMELGASVVTVNKETISVR-DI 119
Cdd:smart00216   12 CSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSE--PTFSVLLKNVPCGggatCLKSVKVELNGDEIELKDDNGKVTvNG 89
                            90       100       110       120       130       140       150
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184   120 GVVSLPYTSNGLQITPYgQSVQLVAKQLELELV-ITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDG 193
Cdd:smart00216   90 QQVSLPYKTSDGSIQIR-SSGGYLVVITSLGLIqVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDDFRTPDG 163
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
589-661 1.13e-21

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 91.63  E-value: 1.13e-21
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182184   589 AETHCSMLLKKGSVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:smart00832    4 ACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP-TFC 75
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
592-661 3.10e-21

C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to avoid overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 90.13  E-value: 3.10e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184  592 HCSMLLKKGsVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:pfam08742    1 KCGLLSDSG-PFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTP-TFC 68
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
869-1019 1.68e-20

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 90.89  E-value: 1.68e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184  869 CVLYGEGHIITFDGQRFVFDGDCEYMLAtDDCGANSSqPTFKVLTENVICGKSGVtCSRAIKISLGGLFITMADSNY-TV 947
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLA-KDCSEEPD-FSFSVTNKNCNGGASGV-CLKSVTVIVGDLEITLQKGGTvLV 77
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184  948 SGEEplVHLKVKPSPLNL------VLDIDIPGRLNLTLVWNKHMSVSIKIRRaTQQDALCGLCGNANGNMKDDFETRS 1019
Cdd:pfam00094   78 NGQK--VSLPYKSDGGEVeilgsgFVVVDLSPGVGLQVDGDGRGQLFVTLSP-SYQGKTCGLCGNYNGNQEDDFMTPD 152
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
303-358 2.00e-16

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.


Pssm-ID: 460351  Cd Length: 55  Bit Score: 75.89  E-value: 2.00e-16
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184  303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:pfam01826    1 CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC 55
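
The family description above gives the disulfide pattern as cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. As an illustration only, the sketch below numbers the cysteines in the query row of this hit and applies that pattern by rank. It assumes the ten cysteines in the query segment correspond one-to-one to the ten cysteines of the family, which the alignment suggests but does not prove, so the output is a naive positional mapping and not a structural prediction. The names `TIL_PAIRS`, `cysteine_positions` and `paired_cysteines` are hypothetical.

```python
# Disulfide pattern quoted in the family description, as 1-based cysteine ranks.
TIL_PAIRS = [(1, 7), (2, 6), (3, 5), (4, 10), (8, 9)]


def cysteine_positions(query_row, start):
    """Residue numbers of the cysteines in an alignment row starting at `start`."""
    positions = []
    residue = start
    for ch in query_row:
        if ch == "-":
            continue  # a gap consumes no query residue
        if ch.upper() == "C":
            positions.append(residue)
        residue += 1
    return positions


def paired_cysteines(query_row, start, pattern=TIL_PAIRS):
    """Map the rank-based pattern onto residue numbers, or None if not 10 cysteines."""
    cys = cysteine_positions(query_row, start)
    if len(cys) != 10:
        return None
    return [(cys[a - 1], cys[b - 1]) for a, b in pattern]


# Query row copied from the pfam01826 hit above (residues 303-358).
row = "CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC"
for first, second in paired_cysteines(row, 303):
    print(f"Cys{first} - Cys{second}")
```

Returning None when the segment does not contain exactly ten cysteines keeps the sketch from applying the pattern to hits, such as partial matches, where the rank-based mapping would be meaningless.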
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
303-358 1.55e-15

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 73.51  E-value: 1.55e-15
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184  303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANPNAppPCTKQCVEGCFCPEGYVRNS---GGKCVPPSQC 55
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
765-828 3.23e-13

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 66.57  E-value: 3.23e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184  765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:cd19941      1 CPPNEVYSEC--------GSACPPTCANPNAPPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
765-828 4.68e-13

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.


Pssm-ID: 460351  Cd Length: 55  Bit Score: 66.26  E-value: 4.68e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184  765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:pfam01826    1 CPANEVYSEC--------GSACPPTCANLSPPDVC-PEPCVEGCVCPPGFVRNSGGKCVPPSDC 55
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
246-298 9.20e-11

C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them because of overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 60.09  E-value: 9.20e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907182184  246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALC 298
Cdd:pfam08742   18 VDPEPYFEACVYDM--CSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
CT smart00041
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ...
4164-4243 3.91e-10

C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.


Pssm-ID: 214482  Cd Length: 82  Bit Score: 58.95  E-value: 3.91e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184  4164 EHQITYQGCVAN-VTLTRCQGFCASSVSFNkdTLQLESSCGCCQPLSTYKKQLSLPCPDpdapGQQLTLTLQVFSSCVCS 4242
Cdd:smart00041    5 RQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPD----GSTVKKTVMHIEECGCE 78

                    .
gi 1907182184  4243 P 4243
Cdd:smart00041   79 P 79
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
246-299 5.55e-09

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 55.42  E-value: 5.55e-09
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....
gi 1907182184   246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALCP 299
Cdd:smart00832   25 VDPEPFFENCVYDT--CACGGDCECLCDALAAYAAACAEAGVCISPWRTPTFCP 76
PHA03247 PHA03247
large tegument protein UL36; Provisional
3637-4059 1.70e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 1.70e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3637 TVTPTQPTP---------------IPATTNSPMTTVGLTGTPvvHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFR 3701
Cdd:PHA03247  2567 SVPPPRPAPrpsepavtsrarrpdAPPQSARPRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP 2644
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3702 TSEQSTTTFPTPSAPQTSLV--TSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPvstilqttievTTPPNTSTPVTH 3779
Cdd:PHA03247  2645 TVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-----------PPPPPTPEPAPH 2713
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3780 S-TSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAIS 3858
Cdd:PHA03247  2714 AlVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3859 RTTGISGTPFRTPMKTTITFPTPSSLQTSM-ATLFPPfstsvmssteifntPTNPHSVSSASTSRPLSTSLPT------- 3930
Cdd:PHA03247  2794 SRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPP--------------PTSAQPTAPPPPPGPPPPSLPLggsvapg 2859
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3931 -TIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTAS 4009
Cdd:PHA03247  2860 gDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 4010 PPSSAPTfvSPTAASTVISSALPTI---HMTP---------TPSSRPTSSTGLLSTSKTTSH 4059
Cdd:PHA03247  2940 QPPLAPT--TDPAGAGEPSGAVPQPwlgALVPgrvavprfrVPQPAPSREAPASSTPPLTGH 2999
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3556-4015 2.30e-08

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 60.96  E-value: 2.30e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3556 PLFSTLSVTPTTEGLNTPT-SPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPvtytttaatqtkssfsTDRTSTPHLSQ 3634
Cdd:PHA03307    54 TVVAGAAACDRFEPPTGPPpGPGTEAPANESRSTPTWSLSTLAPASPAREGSP----------------TPPGPSSPDPP 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3635 SSTVTPTQPTPIPATTNSPMTTvglTGTPVVHTPSGTSSIAHTPHTthslPTAASSSTTLSTAPQFRTSEQSTttfPTPS 3714
Cdd:PHA03307   118 PPTPPPASPPPSPAPDLSEMLR---PVGSPGPPPAASPPAAGASPA----AVASDAASSRQAALPLSSPEETA---RAPS 187
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3715 APQTSLVTSLPPFSTSSVSPTdeihitstnPHTVSSVSMSRPVSTILQTtievttpPNTSTPVTHSTSATTEAQGSFSTE 3794
Cdd:PHA03307   188 SPPAEPPPSTPPAAASPRPPR---------RSSPISASASSPAPAPGRS-------AADDAGASSSDSSSSESSGCGWGP 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3795 RTSTSyLSHPssttvhqstaGPVITSIKSTMGVTGTPPvhttsGTTSSPQTPHSTHPISTAAISRttGISGTPFRTPMKT 3874
Cdd:PHA03307   252 ENECP-LPRP----------APITLPTRIWEASGWNGP-----SSRPGPASSSSSPRERSPSPSP--SSPGSGPAPSSPR 313
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3875 TITFPTPSSLQTSMATLfppfSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGTGTPQTPvsdintTSATTQA 3954
Cdd:PHA03307   314 ASSSSSSSRESSSSSTS----SSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPS------SPAASAG 383
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 3955 HSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTASP-PSSAP 4015
Cdd:PHA03307   384 RPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGEPwPGSPP 445
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
665-722 1.89e-07

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 50.39  E-value: 1.89e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184  665 CTGNRTFSYDSQACDRTCLSLsDRETEChvSPVPVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
3734-4050 7.75e-07

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 55.77  E-value: 7.75e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3734 PTDEIHITSTNPHTVSS---VSMSRPVSTIL--QTTIEVT-----TPPNTSTPVTHSTSATTEAQGSFSTERTSTSYLSH 3803
Cdd:TIGR00927   68 SNDEMMMVSSDPPKSSSemeGEMLAPQATVGrdEATPSIAmentpSPPRRTAKITPTTPKNNYSPTAAGTERVKEDTPAT 147
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3804 PSSTTVH-QSTAG-PVITSIKSTM--GVTGTPPVHTTSGT---TSSP----------------QTPHSTHPISTAAISRT 3860
Cdd:TIGR00927  148 PSRALNHyISTSGrQRVKSYTPKPrgEVKSSSPTQTREKVrkyTPSPlgrmvnsyapstfmtmPRSHGITPRTTVKDSEI 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3861 TGISGTPFRTPMKTTITFPTPSSLQtSMATLFPPFSTSVMSsTEIFNTPTN---------PHSVSSASTSRP---LSTSL 3928
Cdd:TIGR00927  228 TATYKMLETNPSKRTAGKTTPTPLK-GMTDNTPTFLTREVE-TDLLTSPRSvvekntlttPRRVESNSSTNHwglVGKNN 305
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3929 PTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTL--QYTPTPSSVSHSPLLTTP 4006
Cdd:TIGR00927  306 LTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASATfrGLEKNPSTAPSTPATPRV 385
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907182184 4007 TASPPSSA-------PTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGL 4050
Cdd:TIGR00927  386 RAVLTTQVhhcvvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPDL 436
VWC_out smart00215
von Willebrand factor (vWF) type C domain;
360-404 2.89e-06

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214565  Cd Length: 67  Bit Score: 47.56  E-value: 2.89e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*.
gi 1907182184   360 CMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPC-PGHCSLEGGS 404
Cdd:smart00215    1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCgPKPCLLHNLS 46
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3181-3381 3.24e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 53.22  E-value: 3.24e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3181 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3260
Cdd:COG3469     28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3261 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3340
Cdd:COG3469    108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATT 183
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1907182184 3341 PFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 3381
Cdd:COG3469    184 TATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1657-1878 3.41e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 53.22  E-value: 3.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1657 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 1736
Cdd:COG3469      7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1737 HPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaITNSLMTTGGLTG 1816
Cdd:COG3469     87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 1817 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 1878
Cdd:COG3469    163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1727-2176 5.11e-06

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 53.00  E-value: 5.11e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1727 TSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSThlplfstlsvtptteglntQSTPIPAitn 1806
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD-------------------VTSPTPA--- 479
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1807 slMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSS 1886
Cdd:pfam05109  480 --GTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTT 550
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1887 ----VTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTP-TSAPHLS 1961
Cdd:pfam05109  551 ptpnATSPTPAVTTPTPN------ATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvVTSPPKN 624
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1962 ETSAVTAHQSTPTAVSANSikptMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTH 2041
Cdd:pfam05109  625 ATSAVTTGQHNITSSSTSS----MSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPA 700
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2042 SGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFST 2121
Cdd:pfam05109  701 PRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTT 780
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 2122 DRTSTphlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR 2176
Cdd:pfam05109  781 DYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQ 832
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
665-722 1.27e-05

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.


Pssm-ID: 460351  Cd Length: 55  Bit Score: 45.07  E-value: 1.27e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182184  665 CTGNRTFSYDSQACDRTCLSLSDR---ETEChvspvpVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:pfam01826    1 CPANEVYSECGSACPPTCANLSPPdvcPEPC------VEGCVCPPGFVRNSGGKCVPPSDC 55
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2576-2766 1.31e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 51.29  E-value: 1.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2576 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPT 2655
Cdd:COG3469     38 TATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTST 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2656 HSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 2735
Cdd:COG3469    118 GAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGA 193
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1907182184 2736 KHTTgvsletsvqTTIASPTPSAPQTSLATH 2766
Cdd:COG3469    194 TTPS---------ATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2305-2516 1.54e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.91  E-value: 1.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2305 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHS 2384
Cdd:COG3469      2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2385 GPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 2464
Cdd:COG3469     82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATG 161
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 2465 TTGVSLETSVQTTIASPTPSAPQTSLATHLP--FSSTSAVTPTSEVIITPTPQH 2516
Cdd:COG3469    162 GTTTTSTTTTTTSASTTPSATTTATATTASGatTPSATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1644-1833 3.54e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 49.75  E-value: 3.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1644 PTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTT 1723
Cdd:COG3469     26 AATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGA 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1724 SGTTSSPQTPrtthPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPA 1803
Cdd:COG3469    106 NTGTSTVTTT----STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
                          170       180       190
                   ....*....|....*....|....*....|
gi 1907182184 1804 ITNSLmTTGGLTGTPPVHTTSGTTSSPQTP 1833
Cdd:COG3469    182 TTTAT-ATTASGATTPSATTTATTTGPPTP 210
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3153-3340 5.19e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 49.37  E-value: 5.19e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3153 TGSPHTSVPVIYTTSTITQTKTSFFTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSS 3232
Cdd:COG3469     28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3233 PQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLM 3312
Cdd:COG3469    108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATA 187
                          170       180
                   ....*....|....*....|....*...
gi 1907182184 3313 TTVGLTGTPPVHTTSGTTSSPQTPRTTH 3340
Cdd:COG3469    188 TTASGATTPSATTTATTTGPPTPGLPKH 215
PHA03247 PHA03247
large tegument protein UL36; Provisional
2646-3101 5.35e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.94  E-value: 5.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2646 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTS--------GTTSS 2717
Cdd:PHA03247  2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2718 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 2791
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2792 ssASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKP 2871
Cdd:PHA03247  2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRE 2796
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2872 TMSSTGTPVVHTTSGTTSSPQTPRTTHPSttvavsgtvhtTGLPSGTSVHTTTNFPTHSGPQSSLSTH---LPLFSTLSV 2948
Cdd:PHA03247  2797 SLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDVRRR 2865
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2949 TPTTEGLNTPTSPHSLSVASTSMPlmTVLPTTLEGTRPPHTSVPVTYTTTAATQTkssfSTDRTSAPHLSQPSTVTPTQS 3028
Cdd:PHA03247  2866 PPSRSPAAKPAAPARPPVRRLARP--AVSRSTESFALPPDQPERPPQPQAPPPPQ----PQPQPPPPPQPQPPPPPPPRP 2939
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3029 TPIPATTNSLMTTGGLTGTPP-----------VHTTSGTTSSPQTPRTTHPFSTvavsNTKHTTGVSLETSVQTTIASPT 3097
Cdd:PHA03247  2940 QPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrVAVPRFRVPQPAPSREAPASST----PPLTGHSLSRVSSWASSLALHE 3015

                   ....
gi 1907182184 3098 PSAP 3101
Cdd:PHA03247  3016 ETDP 3019
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2318-2514 7.18e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 48.98  E-value: 7.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2318 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNF-PTHSGPQSSLSTHLPL 2396
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2397 FSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 2474
Cdd:COG3469     81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 1907182184 2475 QTTIASPTPSAPqTSLATHLPFSSTSAVTPTSEVIITPTP 2514
Cdd:COG3469    161 GGTTTTSTTTTT-TSASTTPSATTTATATTASGATTPSAT 199
ROM1 COG5422
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ...
3802-4036 8.92e-05

RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];


Pssm-ID: 227709 [Multi-domain]  Cd Length: 1175  Bit Score: 49.12  E-value: 8.92e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3802 SHPSS---TTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGT--TSSPQTPHSTHpISTAAISrttgISGTPFRTPMKTTI 3876
Cdd:COG5422     59 SKESFgkyALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATssTSSLNSNDGDQ-FSPASDS----LSFNPSSTQSRKDS 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3877 TFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLStslPTTIKGTGTPQTPVSDINTTSATTQAHS 3956
Cdd:COG5422    134 GPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEI---PSLGSQSMQLPSPHFRQKFSSSDTSNGF 210
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3957 SFPTTRTSTSHlslpSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPT-----ASPPSSAPTFVSPTAASTVISSAL 4031
Cdd:COG5422    211 SYPSIRKNSRH----SSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSssnseAMSTSSKRPYIYPALLSRVAVEFK 286

                   ....*
gi 1907182184 4032 PTIHM 4036
Cdd:COG5422    287 MRLQL 291
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1623-1821 1.26e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 48.21  E-value: 1.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1623 TPTPQHTLSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTA 1702
Cdd:COG3469     12 AGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATS 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1703 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTN-----FPTHSGPQSSLST 1777
Cdd:COG3469     92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVsgtetATGGTTTTSTTTT 171
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....
gi 1907182184 1778 HLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVH 1821
Cdd:COG3469    172 TTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
2432-2763 1.32e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 48.22  E-value: 1.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2432 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSAVTPTSEVIIT 2511
Cdd:pfam03154  186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2512 PTPQhtfssaSTSTTTGNILPTTIgQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAV 2591
Cdd:pfam03154  265 PLPQ------PSLHGQMPPMPHSL-QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2592 SANSIKPtMSSTGTPVVHT----TSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPqsslSTH- 2666
Cdd:pfam03154  338 QPPREQP-LPPAPLSMPHIkpppTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHp 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2667 -----LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGV 2741
Cdd:pfam03154  413 pplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGI 492
                          330       340
                   ....*....|....*....|..
gi 1907182184 2742 SLETSVQTTIASPTPSAPQTSL 2763
Cdd:pfam03154  493 QPPSSASVSSSGPVPAAVSCPL 514
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2531-2725 2.91e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 47.05  E-value: 2.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2531 LPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSiKPTMSSTGTPVVHT 2610
Cdd:COG3469     22 LLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA-AAATSTSATLVATS 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2611 TSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIP 2690
Cdd:COG3469    101 TASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPS 180
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1907182184 2691 ATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2725
Cdd:COG3469    181 ATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3823-4060 3.06e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 46.67  E-value: 3.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3823 STMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSS 3902
Cdd:COG3469      2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3903 TEIFNTPtnphsvsSASTSRPLSTSLPTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPAS 3982
Cdd:COG3469     82 ATAAAAA-------ATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVS 154
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 3983 RSASTLQYTPTPSSVShspllTTPTASPPSSAPTFVSPTAASTvissalptihmTPTPSSRPTSSTGLLSTSKTTSHV 4060
Cdd:COG3469    155 GTETATGGTTTTSTTT-----TTTSASTTPSATTTATATTASG-----------ATTPSATTTATTTGPPTPGLPKHV 216
PHA03247 PHA03247
large tegument protein UL36; Provisional
3476-4049 3.14e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 3.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3476 PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqSSLSTHL 3555
Cdd:PHA03247  2560 PPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSP-SPAANEP 2638
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3556 PLFSTLSVTPTTEGLNTPTSPH-SLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTT------------AATQTKSSF 3622
Cdd:PHA03247  2639 DPHPPPTVPPPERPRDDPAPGRvSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpppppptpePAPHALVSA 2718
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3623 STDRTSTPHLSQSSTVTPTQPTPiPATTNSPMTTVGLTGTPVVHTPSGTSSI------AHTPHTTHSLPTAASSSTTLST 3696
Cdd:PHA03247  2719 TPLPPGPAAARQASPALPAAPAP-PAVPAGPATPGGPARPARPPTTAGPPAPappaapAAGPPRRLTRPAVASLSESRES 2797
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3697 APQFRTSEQSTT--TFPTPSAPQTSLVTSL--PPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPVStilqttievTTPPN 3772
Cdd:PHA03247  2798 LPSPWDPADPPAavLAPAAALPPAASPAGPlpPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVR---------RRPPS 2868
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3773 TSTPVTHSTSAtteaqgsfsteRTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHT-TSGTTSSPQTPHSTHP 3851
Cdd:PHA03247  2869 RSPAAKPAAPA-----------RPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQpQPPPPPQPQPPPPPPP 2937
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3852 ISTAAISRTTGISGTPfrtpmkttitFPTPSSLQTSMATLFPPfstsvmssteifNTPTNPHSVSSASTSRPLSTSLPTT 3931
Cdd:PHA03247  2938 RPQPPLAPTTDPAGAG----------EPSGAVPQPWLGALVPG------------RVAVPRFRVPQPAPSREAPASSTPP 2995
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3932 IKGTGTPQTpvsdinTTSATTQAHSSFPTTRtstshlslPSSMTSTLTPAS----RSASTLQYTPTPSSVSHSPLLTTPT 4007
Cdd:PHA03247  2996 LTGHSLSRV------SSWASSLALHEETDPP--------PVSLKQTLWPPDdtedSDADSLFDSDSERSDLEALDPLPPE 3061
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|....*....
gi 1907182184 4008 ASPPSSAPTFVSPTAAStviSSALPTIHMTPTP-------SSRPTSSTG 4049
Cdd:PHA03247  3062 PHDPFAHEPDPATPEAG---ARESPSSQFGPPPlsanaalSRRYVRSTG 3107
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1827-2242 3.31e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.83  E-value: 3.31e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1827 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 1906
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1907 STSTTTGnilPTTIGQTGSPHTSVPviyTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMS 1986
Cdd:pfam05109  502 KAPDMTS---PTSAVTTPTPNATSP---TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLG 575
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1987 STGTpvvhTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTnfPTHSGPQSSLSTH-LPLFSTLSVTPTTE 2065
Cdd:pfam05109  576 KTSP----TSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSP--PKNATSAVTTGQHnITSSSTSSMSLRPS 649
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2066 GLNTPTSPHSLSVASTSMPLMT-VLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSST----VTPTQS 2140
Cdd:pfam05109  650 SISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTkpgeVNVTKG 729
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2141 TPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR-TTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLAT 2219
Cdd:pfam05109  730 TPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKhTTGHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRP 809
                          410       420
                   ....*....|....*....|...
gi 1907182184 2220 HLPFSSTSSVTPTSKVIITPTPQ 2242
Cdd:pfam05109  810 RWTFTSPPVTTAQATVPVPPTSQ 832
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2258-2452 3.42e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 46.67  E-value: 3.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2258 LPTTIGKTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSiKPTMSSTGTPVVHT 2337
Cdd:COG3469     22 LLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA-AAATSTSATLVATS 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2338 TSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIP 2417
Cdd:COG3469    101 TASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPS 180
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1907182184 2418 ATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2452
Cdd:COG3469    181 ATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
ROM1 COG5422
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ...
3700-3978 3.43e-04

RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];


Pssm-ID: 227709 [Multi-domain]  Cd Length: 1175  Bit Score: 47.19  E-value: 3.43e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3700 FRTSEQSTTTFPTPSAPQTSLVTSLPPfsTSSVSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTH 3779
Cdd:COG5422     17 FGAPRKSDAFVSKQLLPPRRLQRKLNP--ISIRNGADNDIINSESKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITH 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3780 STSAT--TEAQGSFS---TERTSTSYLSHPSSTTVHQSTAGPvitsikstmgvTGTPpvhttSGTTSSPQTPHSTHPIST 3854
Cdd:COG5422     95 SPSATssTSSLNSNDgdqFSPASDSLSFNPSSTQSRKDSGPG-----------DGSP-----VQKRKNPLLPSSSTHGTH 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3855 AAISrTTGISGTPFRTPMKTTiTFPTPSSLQTSMATLFPPF--STSVMSSTEIFNTP---TNPHSVSSASTSRPLSTSLP 3929
Cdd:COG5422    159 PPIV-FTDNNGSHAGAPNARS-RKEIPSLGSQSMQLPSPHFrqKFSSSDTSNGFSYPsirKNSRHSSNSMPSFPHSSTAV 236
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*....
gi 1907182184 3930 TTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTL 3978
Cdd:COG5422    237 LLKRHSGSSGASLISSNITPSSSNSEAMSTSSKRPYIYPALLSRVAVEF 285
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
3473-3846 4.59e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 46.11  E-value: 4.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3473 QSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLS 3552
Cdd:pfam17823   45 DAVPRADNKSSEQ*NFCAATAAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPS 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3553 THLPLFSTLSVTPTTEGLNTPTS--PHSLSAASTSMPLMTVLPTTlEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTP 3630
Cdd:pfam17823  125 SAAQSLPAAIAALPSEAFSAPRAaaCRANASAAPRAAIAAASAPH-AASPAPRTAASSTTAASSTTAASSAPTTAASSAP 203
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3631 hlsqsSTVTPTQPTPIPAT-TNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQSTTT 3709
Cdd:pfam17823  204 -----ATLTPARGISTAATaTGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHAR 278
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3710 FPTPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVS--MSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEA 3787
Cdd:pfam17823  279 RLSPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGepTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEP 358
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182184 3788 QG--------SFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTP 3846
Cdd:pfam17823  359 SAspvpvlhtSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAPEQVATEATAGTASAGPTPRSSGDP 425
PHA03247 PHA03247
large tegument protein UL36; Provisional
3823-4121 4.89e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 4.89e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3823 STMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTPfRTPMKTTITfPTPSSLqTSMATLFPPFST---SV 3899
Cdd:PHA03247  2636 NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPP-QRPRRRAAR-PTVGSL-TSLADPPPPPPTpepAP 2712
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3900 MSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLT 3979
Cdd:PHA03247  2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLS 2792
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3980 PASRSASTLQYTPTPSSVSHSPLLTTPTASPPssAPTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGLLSTSKTTSH 4059
Cdd:PHA03247  2793 ESRESLPSPWDPADPPAAVLAPAAALPPAASP--AGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 4060 VPTFSSFSSKSTTAHLTSLTTQAATSGLLSSTMGMTNLPSSGSPDINHTTRPPGSSPLPTSA 4121
Cdd:PHA03247  2871 PAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP 2932
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3181-3404 5.10e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 45.90  E-value: 5.10e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3181 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3260
Cdd:COG3469      2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3261 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTeglntqSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3340
Cdd:COG3469     82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTT------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSG 155
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 3341 PFSTVAVSNTKHTTGVSLeTSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEviiTPTPQH 3404
Cdd:COG3469    156 TETATGGTTTTSTTTTTT-SASTTPSATTTATATTASGATTPSATTTATTTGPPT---PGLPKH 215
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1492-1875 5.66e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 46.30  E-value: 5.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1492 SITSSITQAQSSFSTDKTYTSQHSQPSTMTAHQSRSLPTVTTSTKSTMGLTGTPPVHTTSGTTSSPQ-TPRTTHPFSTVA 1570
Cdd:pfam03154  143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQgSPATSQPPNQTQ 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1571 VSNTKHTTgvsletsvqttIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIgQT 1650
Cdd:pfam03154  223 STAAPHTL-----------IQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ------PSLHGQMPPMPHSL-QT 284
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1651 GSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPtMSSTGTPVVHTT-SGTTSS 1729
Cdd:pfam03154  285 GPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQP-LPPAPLSMPHIKpPPTTPI 363
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1730 PQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTH------LPLFSTLSVTPTTEGLNTQSTP 1800
Cdd:pfam03154  364 PQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHppplqlMPQSQQLPPPPAQPPVLTQSQS 439
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 1801 IPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 1875
Cdd:pfam03154  440 LPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
3839-4118 6.14e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 6.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3839 TTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMatlfPPFSTSVMSSTEIFNTPTNPHSVSSA 3918
Cdd:pfam05109  392 TVSGLGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFAA----PNTTTGLPSSTHVPTNLTAPASTGPT 467
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3919 STSRPLSTSLPTTIKGTGTPQTPVSDI----------NTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTL 3988
Cdd:pfam05109  468 VSTADVTSPTPAGTTSGASPVTPSPSPrdngteskapDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSA 547
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3989 QYTPTPSSVSHSPLLTTPTASPPSSAPTFVSPTAASTvissalptihmTPTPSSRPTSSTGLLSTSKTTSHV--PTFSSF 4066
Cdd:pfam05109  548 VTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVT-----------TPTPNATSPTVGETSPQANTTNHTlgGTSSTP 616
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 4067 SSKSTTAHLTSLTTQAATSGLLSSTMGMTNLPSSGSPDINHTTRPPGSSPLP 4118
Cdd:pfam05109  617 VVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMP 668
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
1918-3538 6.14e-04

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 46.30  E-value: 6.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1918 TTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTS 1997
Cdd:COG3210     80 GIGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAG 159
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1998 GTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLS 2077
Cdd:COG3210    160 NNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAG 239
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2078 VASTSMplmTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQSTPIPATTNSLMTTGGLT 2157
Cdd:COG3210    240 VISTGG---TDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGT 316
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2158 GTPPVHTNSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSKVII 2237
Cdd:COG3210    317 AAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASS 396
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2238 TPTPQHTLSSASTSTTTGNILPTTIGKTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTA 2317
Cdd:COG3210    397 TTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSA 476
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2318 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLF 2397
Cdd:COG3210    477 GNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTT 556
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2398 STLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTT 2477
Cdd:COG3210    557 AASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTG 636
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2478 IASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQT 2557
Cdd:COG3210    637 SAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVT 716
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2558 KTSFSTDRTS-------TSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRT--THPST 2628
Cdd:COG3210    717 GQIGALANANgdtvtfgNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNagAEISI 796
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2629 TVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPV 2708
Cdd:COG3210    797 DITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAAS 876
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2709 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQ 2788
Cdd:COG3210    877 ITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSA 956
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2789 HTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANS 2868
Cdd:COG3210    957 ASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGT 1036
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2869 IKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSV 2948
Cdd:COG3210   1037 AATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGV 1116
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2949 TPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSAPHLSQPSTVTPTQS 3028
Cdd:COG3210   1117 TASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGT 1196
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3029 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATH 3108
Cdd:COG3210   1197 DLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGN 1276
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3109 LPFSSTSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSTITQTKTSFFTDRTSTSTSAP 3188
Cdd:COG3210   1277 AGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGN 1356
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3189 HLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF 3268
Cdd:COG3210   1357 GATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGG 1436
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3269 PTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVS 3348
Cdd:COG3210   1437 TGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTT 1516
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3349 NTKHTTGVSLETSVQTTIASPTPSAPQTSLAThlpfsstsSVTPTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTGS 3428
Cdd:COG3210   1517 AEVAKASLEGGEGTYGGSSVAEAGTGGGILGA--------VSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQA 1588
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3429 PHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 3508
Cdd:COG3210   1589 PTAGNTATLTLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSGAVNGAGNGW 1668
                         1610      1620      1630
                   ....*....|....*....|....*....|
gi 1907182184 3509 PRTTHPSTTVAVSGTVHTTGLPSGTSVHTT 3538
Cdd:COG3210   1669 AVDLTDATLAGLGGATTAAAGNVATGDTAP 1698
PHA03247 PHA03247
large tegument protein UL36; Provisional
1758-2222 6.45e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 6.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1758 SVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTS--------GTTSS 1829
Cdd:PHA03247  2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1830 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 1903
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1904 ssASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIkp 1983
Cdd:PHA03247  2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-- 2798
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1984 tmsstgtpvvhttsgttSSPQTPrTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSlsthlPLFSTLsvtpT 2063
Cdd:PHA03247  2799 -----------------PSPWDP-ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG-----PPPPSL----P 2851
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2064 TEGLNTPTSPHSLSVASTSMPLMTVLPttlegTRPPHTSVPVTYTttaatqtksSFSTDRTSTPHLSQSSTVTPTQSTPi 2143
Cdd:PHA03247  2852 LGGSVAPGGDVRRRPPSRSPAAKPAAP-----ARPPVRRLARPAV---------SRSTESFALPPDQPERPPQPQAPPP- 2916
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2144 PATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQTSLATHLP 2222
Cdd:PHA03247  2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREAPASSTPPL 2996
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2578-2789 8.23e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 45.51  E-value: 8.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2578 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2657
Cdd:COG3469     14 GASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTS 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2658 GPQSSLSTHLPLFSTLSVTPTTeglntqSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 2737
Cdd:COG3469     94 ATLVATSTASGANTGTSTVTTT------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTS 167
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 2738 TTGVSLeTSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEviiTPTPQH 2789
Cdd:COG3469    168 TTTTTT-SASTTPSATTTATATTASGATTPSATTTATTTGPPT---PGLPKH 215
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
3626-4024 8.63e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 45.53  E-value: 8.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3626 RTSTPHLSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQ 3705
Cdd:pfam03154  168 QTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHP 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3706 STTTFPTPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPV-THSTSAT 3784
Cdd:pfam03154  248 PLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGqSQQRIHT 327
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3785 TEAQGSFSTERTSTSYLSHPSSTTVhqstagPVItsikstmgvtgTPPVHTTSGTTSSPQT-PHSTHPISTAAISRTTGI 3863
Cdd:pfam03154  328 PPSQSQLQSQQPPREQPLPPAPLSM------PHI-----------KPPPTTPIPQLPNPQShKHPPHLSGPSPFQMNSNL 390
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3864 SGTPFRTPMKTTITFPTPSS-------LQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTT--IKG 3934
Cdd:pfam03154  391 PPPPALKPLSSLSTHHPPSAhppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHpfVPG 470
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3935 TGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSvshspllTTPTASPPSSA 4014
Cdd:pfam03154  471 GPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPES-------PPPPPRSPSPE 543
                          410
                   ....*....|.
gi 1907182184 4015 PTFV-SPTAAS 4024
Cdd:pfam03154  544 PTVVnTPSHAS 554
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3635-3882 9.76e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 45.13  E-value: 9.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3635 SSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTThSLPTAASSSTTLSTAPQFRTSEQSTTTFPTPS 3714
Cdd:COG3469      3 SVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSV-VVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3715 APQTSLVTSLPPFSTSSVSPTDeihitstnphtvssvsmsrPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQGSFSTE 3794
Cdd:COG3469     82 ATAAAAAATSTSATLVATSTAS-------------------GANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATS 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3795 RTSTSYLSHPSSTTVHqstagpvitsikstmGVTGTPPVHTTSGTTSSPQTPhSTHPISTAAISRTTGISGTPFRTPMKT 3874
Cdd:COG3469    143 SAGSTTTTTTVSGTET---------------ATGGTTTTSTTTTTTSASTTP-SATTTATATTASGATTPSATTTATTTG 206

                   ....*...
gi 1907182184 3875 TITFPTPS 3882
Cdd:COG3469    207 PPTPGLPK 214
Hamartin pfam04388
Hamartin protein; This family includes the hamartin protein which is thought to function as a ...
3767-4050 1.11e-03

Hamartin protein; This family includes the hamartin protein, which is thought to function as a tumour suppressor. The hamartin protein interacts with the tuberin protein (pfam03542). Tuberous sclerosis complex (TSC) is an autosomal dominant disorder characterized by the presence of hamartomas in many organs, such as brain, skin, heart, lung, and kidney. It is caused by mutation of either the TSC1 or the TSC2 tumour suppressor gene. TSC1 encodes a protein, hamartin, containing two coiled-coil regions, which have been shown to mediate binding to tuberin. The TSC2 gene codes for tuberin (pfam03542). These two proteins function within the same pathway(s) regulating cell cycle, cell growth, adhesion, and vesicular trafficking.


Pssm-ID: 461287 [Multi-domain]  Cd Length: 730  Bit Score: 45.05  E-value: 1.11e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3767 VTTPPNTSTpvTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTG--TPPvhTTSGT--TSS 3842
Cdd:pfam04388  276 PTASPYTDQ--QSSYGSSTSTPSSTPRLQLSSSSGTSPPYLSPPSIRLKTDSFPLWSPSSVCGmtTPP--TSPGMvpTTP 351
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3843 PQTPHST-HPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTS 3921
Cdd:pfam04388  352 SELSPSSsHLSSRGSSPPEAAGEATPETTPAKDSPYLKQPPPLSDSHVHRALPASSQPSSPPRKDGRSQSSFPPLSKQAP 431
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3922 RPLSTSLPTTIKGTGTP--QTPVSDINTT----------------------SATTQAHSSFPTTR------TSTSHLSLP 3971
Cdd:pfam04388  432 TNPNSRGLLEPPGDKSSvtLSELPDFIKDlalssedsvegaeeeaaisqelSEITTEKNETDCSRggldmpFSRTMESLA 511
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182184 3972 SSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTASPPSSAPTFvSPTAASTVISSalPTIHMTPTPSSRPTSSTGL 4050
Cdd:pfam04388  512 GSQRSRNRIASYCSSTSQSDSHGPATTPESKPSALAEDGLRRTKSC-SFKQSFTPIEQ--PIESSDDCPTDEQDGENGL 587
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
3330-3788 1.25e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.91  E-value: 1.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3330 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 3409
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3410 STSTTTGnilPTTIGQTGSPHTSVPVIYTTS----AITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIK 3485
Cdd:pfam05109  502 KAPDMTS---PTSAVTTPTPNATSPTPAVTTptpnATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTS 578
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3486 PTmSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTP 3565
Cdd:pfam05109  579 PT-SAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSP 657
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3566 TTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQPTP 3645
Cdd:pfam05109  658 STSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATS 737
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3646 --IPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSttlSTAPQFRTSEQSTTTFPTPSAPQTSLVTS 3723
Cdd:pfam05109  738 pqAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFT 814
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182184 3724 LPPFSTSS----VSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQ 3788
Cdd:pfam05109  815 SPPVTTAQatvpVPPTSQPRFSNLSMLVLQWASLAVLTLLLLLVMADCAFRRNLSTSHTYTTPPYDDAE 883
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3464-3684 1.30e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.74  E-value: 1.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3464 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3543
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3544 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 3623
Cdd:COG3469     81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182184 3624 TDRTSTPhlSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSL 3684
Cdd:COG3469    159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKHVL 217
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3805-4032 1.40e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.74  E-value: 1.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3805 SSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTpfrtpmkttiTFPTPSSL 3884
Cdd:COG3469      4 VSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTA----------ASSTAATS 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3885 QTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTS 3964
Cdd:COG3469     74 STTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTV 153
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 3965 TSHLSLPSSMTSTLTPASRSASTLQYTPTPSsvshspllTTPTASPPSSAPTFVSPTAASTVISSALP 4032
Cdd:COG3469    154 SGTETATGGTTTTSTTTTTTSASTTPSATTT--------ATATTASGATTPSATTTATTTGPPTPGLP 213
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2591-2765 1.43e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.74  E-value: 1.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2591 VSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF-PTHSGPQSSLSTHLPL 2669
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2670 FSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 2747
Cdd:COG3469     81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
                          170
                   ....*....|....*...
gi 1907182184 2748 QTTIASPTPSAPQTSLAT 2765
Cdd:COG3469    161 GGTTTTSTTTTTTSASTT 178
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3206-3380 1.64e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.36  E-value: 1.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3206 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF-PTHSGPQSSLSTHLPL 3284
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3285 FSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 3362
Cdd:COG3469     81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
                          170
                   ....*....|....*...
gi 1907182184 3363 QTTIASPTPSAPQTSLAT 3380
Cdd:COG3469    161 GGTTTTSTTTTTTSASTT 178
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2849-3063 1.64e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.36  E-value: 1.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2849 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2928
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2929 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVTYTTTAATQTKSSF 3007
Cdd:COG3469     81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184 3008 STDRTSAPHLSQPSTVTPTQSTPIPATTnslmTTGGLTGTPPVHTTSGTTSSPQTP 3063
Cdd:COG3469    159 ATGGTTTTSTTTTTTSASTTPSATTTAT----ATTASGATTPSATTTATTTGPPTP 210
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1961-2179 1.67e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.36  E-value: 1.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1961 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2040
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2041 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 2120
Cdd:COG3469     81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182184 2121 TDRTSTPhlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTH 2179
Cdd:COG3469    159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1913-2104 1.86e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.36  E-value: 1.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1913 GNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPhlSETSAVTAHQSTPTAVSANSIKPTMSSTGTPV 1992
Cdd:COG3469     24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASST--AATSSTTSTTATATAAAAAATSTSATLVATST 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1993 VHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP--SGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTP 2070
Cdd:COG3469    102 ASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSgaSATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1907182184 2071 TSPHSLSVASTSMPLMTVLPTTleGTRPPHTSVP 2104
Cdd:COG3469    182 TTTATATTASGATTPSATTTAT--TTGPPTPGLP 213
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2801-2992 1.86e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 44.36  E-value: 1.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2801 GNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPhlSETSAVTAHQSTPTAVSANSIKPTMSSTGTPV 2880
Cdd:COG3469     24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASST--AATSSTTSTTATATAAAAAATSTSATLVATST 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2881 VHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP--SGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTP 2958
Cdd:COG3469    102 ASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSgaSATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1907182184 2959 TSPHSLSVASTSMPLMTVLPTTleGTRPPHTSVP 2992
Cdd:COG3469    182 TTTATATTASGATTPSATTTAT--TTGPPTPGLP 213
VWC pfam00093
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ...
360-395 1.96e-03

von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.


Pssm-ID: 278520  Cd Length: 57  Bit Score: 39.33  E-value: 1.96e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907182184  360 CMLNGMVYGPGEITKTA-CQTCQCTMGRWTCTKQPCP 395
Cdd:pfam00093    1 CVQNGVVYENGETWKPDlCTICTCDDGKVLCDKIICP 37
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
3573-3944 2.03e-03

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 44.18  E-value: 2.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3573 PTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQPTPIPATTns 3652
Cdd:pfam17823   66 APAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFS-- 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3653 pmttvgltgTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQSTTTFPTPSAPQTSLVTSLPPFSTSSV 3732
Cdd:pfam17823  144 ---------APRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGIST 214
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3733 SPTDEIHITSTNphTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAqGSFSTERTSTSYLSHPSSTTVHQS 3812
Cdd:pfam17823  215 AATATGHPAAGT--ALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAA-GTINMGDPHARRLSPAKHMPSDTM 291
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3813 TAGPVITSIKSTMG----VTGTPPVHTTSG--------TTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPT 3880
Cdd:pfam17823  292 ARNPAAPMGAQAQGpiiqVSTDQPVHNTAGeptpspsnTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMI 371
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 3881 PSSLQTSMATLFPPFSTSVMSSTEifNTPTNPHSVSSASTsrPLSTSLPTTIKGTGTPQTPVSD 3944
Cdd:pfam17823  372 PEVEATSPTTQPSPLLPTQGAAGP--GILLAPEQVATEAT--AGTASAGPTPRSSGDPKTLAMA 431
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
2705-3068 2.06e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 2.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2705 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIIT 2784
Cdd:pfam03154  186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2785 PTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAV 2864
Cdd:pfam03154  265 PLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2865 SANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTHL 2940
Cdd:pfam03154  338 QPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHP 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2941 PlfsTLSVTPTTEGLNTP-------TSPHSLSVASTSMPLMTVL-PTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTdrT 3012
Cdd:pfam03154  413 P---PLQLMPQSQQLPPPpaqppvlTQSQSLPPPAASHPPTSGLhQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTS--S 487
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184 3013 SAPHLSQPSTVTPTQSTPIPATTNSLMttggltgtPPVHTTSGTTSSPQTPRTTHP 3068
Cdd:pfam03154  488 AMPGIQPPSSASVSSSGPVPAAVSCPL--------PPVQIKEEALDEAEEPESPPP 535
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
2442-2758 2.09e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.52  E-value: 2.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2442 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLA---THLPFSSTSAVTPT------------- 2505
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAdvtSPTPAGTTSGASPVtpspsprdngtes 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2506 --------SEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSphTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSE 2577
Cdd:pfam05109  502 kapdmtspTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSP--TSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSP 579
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2578 TSAVTAHQSTPTAVSANSIKPTMSSTGtpvvHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2657
Cdd:pfam05109  580 TSAVTTPTPNATSPTVGETSPQANTTN----HTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETL 655
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2658 GPQSS--LSTHLPLFStlSVTPTTEGLNTQSTPIPATTNSLMTTgglTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 2735
Cdd:pfam05109  656 SPSTSdnSTSHMPLLT--SAHPTGGENITQVTPASTSTHHVSTS---SPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGT 730
                          330       340
                   ....*....|....*....|...
gi 1907182184 2736 KHTTGVSLETSVQTTIASPTPSA 2758
Cdd:pfam05109  731 PPKNATSPQAPSGQKTAVPTVTS 753
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
2575-2899 2.13e-03

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 44.22  E-value: 2.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2575 LSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTSSSPQTPRT------THPSTTVAVSGTVHTTGLPSGTSVQ 2648
Cdd:TIGR00927   91 LAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSRALNHYISTSGRQRVKSYT 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2649 TTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipATTNSLMTTGGLTGTPPVHTTSGT 2714
Cdd:TIGR00927  169 PKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEITATYKMLETNPSKRTAGK 245
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2715 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 2794
Cdd:TIGR00927  246 TTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQV 325
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2795 STSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLSETSAVTAHQST---PTAV 2864
Cdd:TIGR00927  326 TISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVRAVLTTQVHHCVvvkPAPA 404
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1907182184 2865 SANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2899
Cdd:TIGR00927  405 VPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3559-3726 2.25e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.97  E-value: 2.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3559 STLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTV 3638
Cdd:COG3469     33 TLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTV 112
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3639 TPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQF---RTSEQSTTTFPTPSA 3715
Cdd:COG3469    113 TTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSastTPSATTTATATTASG 192
                          170
                   ....*....|.
gi 1907182184 3716 PQTSLVTSLPP 3726
Cdd:COG3469    193 ATTPSATTTAT 203
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
3018-3378 2.27e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 2.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3018 SQPSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSgtTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqttIASPT 3097
Cdd:pfam03154  169 TQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPS--VPPQGSPATSQPPNQTQSTAAPHTL-----------IQQTP 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3098 PSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPViyttstitqTKTSFF 3177
Cdd:pfam03154  236 TLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPV---------PPQPFP 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3178 TDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP 3257
Cdd:pfam03154  300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3258 SGTSVQTTTNFPTHSG--PQSSLSTH------------LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPV 3323
Cdd:pfam03154  380 GPSPFQMNSNLPPPPAlkPLSSLSTHhppsahppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQ 459
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907182184 3324 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 3378
Cdd:pfam03154  460 SPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
VWC_out smart00215
von Willebrand factor (vWF) type C domain;
830-887 2.34e-03

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214565  Cd Length: 67  Bit Score: 39.08  E-value: 2.34e-03
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184   830 CDYAGVSYPGGFELHTDCKTCTCSQGRWTCQlSTQCPSTCVLYGEGHIITFDGQRFVF 887
Cdd:smart00215    1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCT-KVWCGPKPCLLHNLSGECPLGQGCVP 57
VWC smart00214
von Willebrand factor (vWF) type C domain;
360-395 2.60e-03

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214564  Cd Length: 59  Bit Score: 39.04  E-value: 2.60e-03
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1907182184   360 CMLNGMVYGPGEITKT-ACQTCQCTMGRW-TCTKQPCP 395
Cdd:smart00214    1 CVHNGRVYNDGETWKPdPCQICTCLDGTTvLCDPVECP 38
COG5099 COG5099
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal ...
3621-4000 2.75e-03

RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis];


Pssm-ID: 227430 [Multi-domain]  Cd Length: 777  Bit Score: 43.97  E-value: 2.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3621 SFSTDRTSTPHLSQSSTVTpTQPTPIPATTNSPMTTVGlTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTapqf 3700
Cdd:COG5099     38 STPNSFSPIPSKASSSATF-TLNLPINNSVNHKITSSS-SSRRKPSGSWSVAISSSTSGSQSLLMELPSSSFNPST---- 111
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3701 rtseqSTTTFPTPSAPQTSLVTSLPPFSTSSVSPTDEIHI-TSTNPHTVSSVSMSRPVSTILQttiEVTTPPNTSTPVTH 3779
Cdd:COG5099    112 -----SSRNKSNSALSSTQQGNANSSVTLSSSTASSMFNSnKLPLPNPNHSNSATTNQSGSSF---INTPASSSSQPLTN 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3780 STSATTEAQGSFSTERTSTSYLSHPSSTTVhqSTAGPVITSIkstmGVTGTPPVHTTSGTTSSPQTPHsTHPISTAAISR 3859
Cdd:COG5099    184 LVVSSIKRFPYLTSLSPFFNYLIDPSSDSA--TASADTSPSF----NPPPNLSPNNLFSTSDLSPLPD-TQSVENNIILN 256
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3860 TTGISGTPFRTPMKTTI--TFPTPSSLQTSMATLFPP-FSTSVMSSTEIFNT----PTNPHSVSSASTSRPLSTSLPTTI 3932
Cdd:COG5099    257 SSSSINELTSIYGSVPSirNLRGLNSALVSFLNVSSSsLAFSALNGKEVSPTgspsTRSFARVLPKSSPNNLLTEILTTG 336
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 3933 KGTGTPQTPVSDINTTSATTQAHSsfpttrTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHS 4000
Cdd:COG5099    337 VNPPQSLPSLLNPVFLSTSTGFSL------TNLSGYLNPNKNLKKNTLSSLSNLGYSSNVPSPSSSES 398
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1703-1877 3.24e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.59  E-value: 3.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1703 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNF-PTHSGPQSSLSTHLPL 1781
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1782 FSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 1859
Cdd:COG3469     81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
                          170
                   ....*....|....*...
gi 1907182184 1860 QTTIASPTPSAPQTSLAT 1877
Cdd:COG3469    161 GGTTTTSTTTTTTSASTT 178
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
3772-4000 3.40e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.84  E-value: 3.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3772 NTSTPVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHP 3851
Cdd:NF033849   250 STSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSS 329
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3852 ISTAAISRTTGISGT-----PFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLST 3926
Cdd:NF033849   330 SYNVSSGTGVSSSHSdgtsqSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 3927 SLPTTiKGTGTpQTPVSDINTTSATTQAHSSFPTTRTSTSHlSLPSSMTSTLTpASRSASTLQYTPTPSSVSHS 4000
Cdd:NF033849   410 SQGGS-EGWGS-GDSVQSVSQSYGSSSSTGTSSGHSDSSSH-STSSGQADSVS-QGTSWSEGTGTSQGQSVGTS 479
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1690-1901 3.41e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.59  E-value: 3.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1690 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHS 1769
Cdd:COG3469      2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1770 GPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAITNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 1849
Cdd:COG3469     82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATG 161
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 1850 TTGVSLETSVQTTIASPTPSAPQTSLATHLP--FSSTSSVTPTSEVIITPTPQH 1901
Cdd:COG3469    162 GTTTTSTTTTTTSASTTPSATTTATATTASGatTPSATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1896-2073 3.56e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.20  E-value: 3.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1896 TPTPQHTLSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTA 1975
Cdd:COG3469     29 AASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTG 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1976 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLF 2055
Cdd:COG3469    109 TSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATAT 188
                          170
                   ....*....|....*...
gi 1907182184 2056 STLSVTPTTEGLNTPTSP 2073
Cdd:COG3469    189 TASGATTPSATTTATTTG 206
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
2615-3064 3.83e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 43.37  E-value: 3.83e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2615 SSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSThlplfstlsvtptteglntQSTPIPATTn 2694
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD-------------------VTSPTPAGT- 481
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2695 slmtTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPfsstss 2774
Cdd:pfam05109  482 ----TSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKTSP------ 544
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2775 vtptSEVIITPTPQHTLssaststtmgnilPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAV 2854
Cdd:pfam05109  545 ----TSAVTTPTPNATS-------------PTPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNH 607
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2855 TAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQS 2934
Cdd:pfam05109  608 TLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPA 687
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2935 SLSTHLPLFSTLSVTPTTEGL-------NTPTSPHSLSVASTSMPLMTVLPTTLEGTRpphTSVP-VTYTTTAATQTKSS 3006
Cdd:pfam05109  688 STSTHHVSTSSPAPRPGTTSQasgpgnsSTSTKPGEVNVTKGTPPKNATSPQAPSGQK---TAVPtVTSTGGKANSTTGG 764
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 3007 FSTDRTSAPHLSQPST------VTP----TQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPR 3064
Cdd:pfam05109  765 KHTTGHGARTSTEPTTdyggdsTTPrtryNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQ 832
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
2127-2490 3.84e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.60  E-value: 3.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2127 PHLSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVhtnSGTTSSPQTPRTTHPFSTVAVsntkhttgvsletsvQTTIA 2206
Cdd:pfam03154  171 PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV---PPQGSPATSQPPNQTQSTAAP---------------HTLIQ 232
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2207 SPTPSAPQTSLATHLPFSSTSSVTPTSKVIITPTPQhtlssaSTSTTTGNILPTTIgKTGSPHTSVPVIYTTSAITQTKT 2286
Cdd:pfam03154  233 QTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ------PSLHGQMPPMPHSL-QTGPSHMQHPVPPQPFPLTPQSS 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2287 SFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGT 2362
Cdd:pfam03154  306 QSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPF 384
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2363 VHTTGLPSGTSVHTTTNFPTHSGPqsslSTH------LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVH 2436
Cdd:pfam03154  385 QMNSNLPPPPALKPLSSLSTHHPP----SAHppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQS 460
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907182184 2437 TTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 2490
Cdd:pfam03154  461 PFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
1669-2011 3.84e-03

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.45  E-value: 3.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1669 TKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTTSSPQTPRT------THPSTTV 1742
Cdd:TIGR00927   73 MMVSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSR 150
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1743 AVSGTVHTTGLPSGTSVHTTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipAITNSL 1808
Cdd:TIGR00927  151 ALNHYISTSGRQRVKSYTPKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEI 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1809 MTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVT 1888
Cdd:TIGR00927  228 TATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLT 307
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1889 PTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLS 1961
Cdd:TIGR00927  308 TPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVR 386
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 1962 ETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2011
Cdd:TIGR00927  387 AVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
PHA03247 PHA03247
large tegument protein UL36; Provisional
2299-2768 4.66e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 4.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2299 APHLSEtSAVTAHQSTPTAvsansiKPTMSSTGTPVVHTTSGTTSSPQT--PRTTHPSTTVAVSGTVHTTGLPSGTSVHT 2376
Cdd:PHA03247  2574 APRPSE-PAVTSRARRPDA------PPQSARPRAPVDDRGDPRGPAPPSplPPDTHAPDPPPPSPSPAANEPDPHPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2377 TTNFPTHSGPQSSlSTHLPLFSTLSVTPTTEGLNTQSTPIPATTnslmttggltgtPPVHTTSGTTSSPQTPRTTHPFST 2456
Cdd:PHA03247  2647 PPPERPRDDPAPG-RVSRPRRARRLGRAAQASSPPQRPRRRAAR------------PTVGSLTSLADPPPPPPTPEPAPH 2713
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2457 VAVSntkhttgvsletsvqttiASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSAststttgnilPTTig 2536
Cdd:PHA03247  2714 ALVS------------------ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP----------PTT-- 2763
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2537 qTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSS 2616
Cdd:PHA03247  2764 -AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP 2842
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2617 SPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTnfPTHSgPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPI----PAT 2692
Cdd:PHA03247  2843 PGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA--PARP-PVRRLARPAVSRSTESFALPPDQPERPPQPQapppPQP 2919
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182184 2693 TNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQTSLATHLP 2768
Cdd:PHA03247  2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREAPASSTPPL 2996
ROM1 COG5422
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ...
1407-1625 4.66e-03

RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];


Pssm-ID: 227709 [Multi-domain]  Cd Length: 1175  Bit Score: 43.34  E-value: 4.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1407 STGPPLGTSVQTTINFPTLSAPQTSLVT-----PHPGLSSSSTALTSEILKTPTSSQMVSSASPQT-IFSSIHPKTTLEA 1480
Cdd:COG5422     28 SKQLLPPRRLQRKLNPISIRNGADNDIInseskESFGKYALGHQIFSSFSSSPKLFQRRNSAGPIThSPSATSSTSSLNS 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1481 TTPQHTAPLITSITSSITQAQSSFSTDKTYTSQHS------QPSTMTaHQSRSLPTVTTSTKSTMGLTG---TPPVHTTS 1551
Cdd:COG5422    108 NDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQkrknplLPSSST-HGTHPPIVFTDNNGSHAGAPNarsRKEIPSLG 186
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182184 1552 GTTSSPQTPRTTHPFSTVAVSNT---KHTTGVSLETSvqttiaSPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPT 1625
Cdd:COG5422    187 SQSMQLPSPHFRQKFSSSDTSNGfsyPSIRKNSRHSS------NSMPSFPHSSTAVLLKRHSGSSGASLISSNITPS 257
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
3016-3365 5.11e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.98  E-value: 5.11e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3016 HLSQPSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIAS 3095
Cdd:pfam05109  457 NLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTT 529
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3096 PTPSAPQTSLATHLPfsstssvtptSEVIITPTPQHTLssaststtmgnilPTTIGQTGSPHTSVPVIYTTSTITQTKTS 3175
Cdd:pfam05109  530 PTPNATSPTLGKTSP----------TSAVTTPTPNATS-------------PTPAVTTPTPNATIPTLGKTSPTSAVTTP 586
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3176 FFTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTG 3255
Cdd:pfam05109  587 TPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSH 666
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3256 LPSGTSVQTTTNFPTHSGPQSSLSTHlpLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLT-GTPPVHTTSGTTSSPQ 3334
Cdd:pfam05109  667 MPLLTSAHPTGGENITQVTPASTSTH--HVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTkGTPPKNATSPQAPSGQ 744
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1907182184 3335 TPRTTHPFSTVAVSNT----KHTTGVSLETSVQTT 3365
Cdd:pfam05109  745 KTAVPTVTSTGGKANSttggKHTTGHGARTSTEPT 779
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3707-3929 5.18e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.82  E-value: 5.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3707 TTTFPTPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTE 3786
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3787 AQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSpqtphsthpiSTAAISRTTGISGT 3866
Cdd:COG3469     81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTS----------GASATSSAGSTTTT 150
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 3867 PFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLP 3929
Cdd:COG3469    151 TTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLP 213
PHA03247 PHA03247
large tegument protein UL36; Provisional
3841-4118 5.28e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 5.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3841 SSPQTPHSTHPISTAAISRTTGISGTPF----RTPMKTTITFP---TPSSLQTSMATLFPPFSTSVMSSTEIF--NTPTN 3911
Cdd:PHA03247  2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQsarpRAPVDDRGDPRgpaPPSPLPPDTHAPDPPPPSPSPAANEPDphPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3912 PHSVSSASTSRPLSTSLPTTIKGTGTPQTPvsdinttSATTQA--HSSFPTTRTSTSHLSLPSSMTSTltPASRSASTLQ 3989
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQA-------SSPPQRprRRAARPTVGSLTSLADPPPPPPT--PEPAPHALVS 2717
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3990 YTPTP----SSVSHSPLLTTPTASPPSSAPTFV--------SPTAASTVISSALPTIHMTPTPSSRPTSSTGLLSTSKTT 4057
Cdd:PHA03247  2718 ATPLPpgpaAARQASPALPAAPAPPAVPAGPATpggparpaRPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182184 4058 SHVPTFSSFSSKSTTAHLTSLTTQAATSGLLSSTMGMTNLPSSGSPDINHTTRPPGSSPLP 4118
Cdd:PHA03247  2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAP 2858
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
3671-4061 5.56e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.83  E-value: 5.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3671 TSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQSTTTFPTPSAP-QTSLVTSLPPFSTSSVSPTDEihiTSTNPHTVS 3749
Cdd:pfam03154  144 TSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPgTTQAATAGPTPSAPSVPPQGS---PATSQPPNQ 220
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3750 SVSMSRPVSTILQT-TIEVTTPPNTSTPVTHSTSATTEAQgsFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVT 3828
Cdd:pfam03154  221 TQSTAAPHTLIQQTpTLHPQRLPSPHPPLQPMTQPPPPSQ--VSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPF 298
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3829 GTPPVHTTSGTTSSPQT--PHSTHPISTAAISRTTGISGTPFR------TPMKTTITFPTPSSLQTSMATLFPPFSTSVM 3900
Cdd:pfam03154  299 PLTPQSSQSQVPPGPSPaaPGQSQQRIHTPPSQSQLQSQQPPReqplppAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHL 378
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3901 SSTEIFNTPTNphsVSSASTSRPLStSLPTTIKGTGTPqtPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTP 3980
Cdd:pfam03154  379 SGPSPFQMNSN---LPPPPALKPLS-SLSTHHPPSAHP--PPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSG 452
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3981 ASRSASTLQYTPTPSSVSHSPLLTtptasPPSSAPTFVSPtaastvissALPTIHmtpTPSSRPTSSTGLLSTSKTTSHV 4060
Cdd:pfam03154  453 LHQVPSQSPFPQHPFVPGGPPPIT-----PPSGPPTSTSS---------AMPGIQ---PPSSASVSSSGPVPAAVSCPLP 515

                   .
gi 1907182184 4061 P 4061
Cdd:pfam03154  516 P 516
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
2458-2661 5.60e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.82  E-value: 5.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2458 AVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQ 2537
Cdd:COG3469     12 AGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATS 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2538 TGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHlSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSS 2617
Cdd:COG3469     92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSS-TAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTT 170
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....
gi 1907182184 2618 PQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQS 2661
Cdd:COG3469    171 TTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPK 214
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3756-3967 5.99e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.43  E-value: 5.99e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3756 PVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPpvhT 3835
Cdd:COG3469     11 TAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA---A 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3836 TSGTTSSPQTPHSTHPiSTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSV 3915
Cdd:COG3469     88 AATSTSATLVATSTAS-GANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTT 166
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907182184 3916 SSASTSRPLSTSLPTTIKGTGTPQTPVSdinTTSATTQAHSSFPTTRTSTSH 3967
Cdd:COG3469    167 STTTTTTSASTTPSATTTATATTASGAT---TPSATTTATTTGPPTPGLPKH 215
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
3756-4054 6.10e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.98  E-value: 6.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3756 PVSTILQTTIEVT--TPPNTSTPVTHSTSATTEAQGSFSTERT-STSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPP 3832
Cdd:pfam05109  425 PESTTTSPTLNTTgfAAPNTTTGLPSSTHVPTNLTAPASTGPTvSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAP 504
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3833 VHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSV-----MSSTEIFN 3907
Cdd:pfam05109  505 DMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIptlgkTSPTSAVT 584
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3908 TPTNPHSVSSASTSRPLSTSLPTTIKGTG-TPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSlPSSMTSTLTPASRSAS 3986
Cdd:pfam05109  585 TPTPNATSPTVGETSPQANTTNHTLGGTSsTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLR-PSSISETLSPSTSDNS 663
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182184 3987 TlqytptpssvSHSPLLTtptasppssaptfvsptaastvisSALPTIHMTPTPSSRPTSSTGLLSTS 4054
Cdd:pfam05109  664 T----------SHMPLLT------------------------SAHPTGGENITQVTPASTSTHHVSTS 697
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
3930-4060 6.78e-03

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 42.38  E-value: 6.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3930 TTIKGTGTPQTP--VSDINTTSATTQAHSSFPTTRTSTShlslpSSMTSTLTPAsrsastlqyTPTPSSVSHSPLLTTPT 4007
Cdd:PLN02217   548 AWIPGKGVPYIPglFAGNPGSTNSTPTGSAASSNTTFSS-----DSPSTVVAPS---------TSPPAGHLGSPPATPSK 613
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 4008 ASPPSSAPTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGLLSTSKTTSHV 4060
Cdd:PLN02217   614 IVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSVSMV 666
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
2028-2489 8.40e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.21  E-value: 8.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2028 SGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGlNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTY 2107
Cdd:pfam05109  394 SGLGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSP-TLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD 472
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2108 TTTAATQTKSSFSTDRTSTPHLSQSST-------VTPTQSTPIPattnslmTTGGLTGTPPVHTNSGTTSSPQTPRTThP 2180
Cdd:pfam05109  473 VTSPTPAGTTSGASPVTPSPSPRDNGTeskapdmTSPTSAVTTP-------TPNATSPTPAVTTPTPNATSPTLGKTS-P 544
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2181 FSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPfsstssvtptSKVIITPTPQHTLSSASTSTTTGNILPT 2260
Cdd:pfam05109  545 TSAV-------TTPTPNATSPTPAVTTPTPNATIPTLGKTSP----------TSAVTTPTPNATSPTVGETSPQANTTNH 607
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2261 TIGKTGS-PHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAP-HLSET-SAVTAHQSTPTAVSANSIKPTMSSTGTPVVHT 2337
Cdd:pfam05109  608 TLGGTSStPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPsSISETlSPSTSDNSTSHMPLLTSAHPTGGENITQVTPA 687
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2338 TSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPthsgPQSSLSTHLPLFSTLSV-TPTTEGLNTQSTPI 2416
Cdd:pfam05109  688 STSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTP----PKNATSPQAPSGQKTAVpTVTSTGGKANSTTG 763
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182184 2417 PATTNSlmtTGGLTGTPPVHTTSGTTSSPQT---PRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTS 2489
Cdd:pfam05109  764 GKHTTG---HGARTSTEPTTDYGGDSTTPRTrynATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQPRFS 836
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1944-2163 8.71e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.05  E-value: 8.71e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1944 TSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHT 2023
Cdd:COG3469      2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2024 TGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSV 2103
Cdd:COG3469     82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATG 161
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 2104 PVTYTTTaatqtkSSFSTDRTSTPHLSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVH 2163
Cdd:COG3469    162 GTTTTST------TTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
3776-3982 8.79e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.05  E-value: 8.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3776 PVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTA 3855
Cdd:COG3469      1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 3856 AISRTTGISGTPFRTPMKTTitfpTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGT 3935
Cdd:COG3469     81 TATAAAAAATSTSATLVATS----TASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGT 156
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1907182184 3936 GTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPAS 3982
Cdd:COG3469    157 ETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTAT 203
PHA03247 PHA03247
large tegument protein UL36; Provisional
1515-1965 8.80e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 8.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1515 SQPSTMTAHQSRSLPTVTTSTKSTMGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSntkHTTGVSLETSVQTTIASPT 1594
Cdd:PHA03247  2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLG---RAAQASSPPQRPRRRAARP 2690
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1595 PSAPQTSLATHLPFSSTSSVTPTSEVIITPTPqhtlssASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFS 1674
Cdd:PHA03247  2691 TVGSLTSLADPPPPPPTPEPAPHALVSATPLP------PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA 2764
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1675 TDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGlP 1754
Cdd:PHA03247  2765 GPPAPAPPAAPAAGPPRRLTR----PAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTA-P 2839
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1755 SGTSVHTTTNFPTHSG--PQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPAITNSlmtTGGLTGTPPVHTTSGTTSSPQT 1832
Cdd:PHA03247  2840 PPPPGPPPPSLPLGGSvaPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRS---TESFALPPDQPERPPQPQAPPP 2916
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182184 1833 PRTTHPfstvavsntkhttgvsLETSVQTTIASPTPSAPQTSLAthlpfsstssvtPTSEVIITPTPQHTLSSASTSTTT 1912
Cdd:PHA03247  2917 PQPQPQ----------------PPPPPQPQPPPPPPPRPQPPLA------------PTTDPAGAGEPSGAVPQPWLGALV 2968
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907182184 1913 GNILPTTIGQTGSPHTSVPviyTTSAITQTKTSFSTDRTSTPTSAPHLSETSA 1965
Cdd:PHA03247  2969 PGRVAVPRFRVPQPAPSRE---APASSTPPLTGHSLSRVSSWASSLALHEETD 3018
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  Database: CDSEARCH/cdd
  Low complexity filter: no
  Composition Based Adjustment: yes
  E-value threshold: 0.01
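
The statistics line printed under each hit (Pssm-ID, Cd Length, Bit Score, E-value) can be checked against the E-value threshold listed above. The following is a minimal Python sketch, not part of the NCBI output: the names `STATS_RE`, `parse_stats_line`, and `passes_threshold` are hypothetical, and treating the threshold as inclusive is an assumption, since the page does not say whether the cutoff is strict.

```python
import re

# Hypothetical pattern for the per-hit statistics line, e.g.
# "Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.45  E-value: 3.84e-03"
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<hit_type>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats_line(line):
    """Return the fields of one statistics line as a dict, or None if absent."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "hit_type": match.group("hit_type"),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is reported when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


# Two statistics lines copied from the hit list above.
lines = [
    "Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.45  E-value: 3.84e-03",
    "Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.05  E-value: 8.79e-03",
]
hits = [parse_stats_line(line) for line in lines]

for hit in hits:
    print(hit["pssm_id"], hit["e_value"], passes_threshold(hit))
```

Both sample hits fall under 0.01, which is consistent with their appearing in the list at all. The regular expression only covers the line layout seen on this page and may need adjusting for other CD-Search output formats.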

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.