Conserved domains on  [gi|167004347|ref|NP_001107794|]

cadherin-related family member 5 isoform 1 precursor [Mus musculus]

Protein Classification

cadherin repeat domain-containing protein (domain architecture ID 10182011)

cadherin repeat domain-containing protein similar to Homo sapiens desmoglein-2, which is involved in the interaction of plaque proteins and intermediate filaments mediating cell-cell adhesion; cadherins are calcium-dependent cell adhesion proteins that preferentially interact with themselves in connecting cells

CATH:  2.60.40.60
Gene Ontology:  GO:0007156|GO:0005509
SCOP:  4007535


List of domain hits

Name Accession Description Interval E-value
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
133-229 2.84e-11

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.



Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 60.79  E-value: 2.84e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347 133 TFNVSEDTKVNTTVIpetQLKATDADI--NDILVYTLqeVTPNASKFFSLEGVNYpALKLDQTLDYFKNQNMTFMLLARD 210
Cdd:cd11304    3 EVSVPENAPPGTVVL---TVSATDPDSgeNGEVTYSI--VSGNEDGLFSIDPSTG-EITTAKPLDREEQSSYTLTVTATD 76
                         90
                 ....*....|....*....
gi 167004347 211 tweeNVEPSHTATATLVLN 229
Cdd:cd11304   77 ----GGGPPLSSTATVTIT 91
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
255-346 1.73e-08

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.



Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 52.70  E-value: 1.73e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347 255 QYSAVVPTGHKLPSPLIMspgpIYAVDGDQAINQSIIYSIIAGNTDGTFIINAHDGNLTMTKSIP--SPMKFTLLIRA-D 331
Cdd:cd11304    1 SYEVSVPENAPPGTVVLT----VSATDPDSGENGEVTYSIVSGNEDGLFSIDPSTGEITTAKPLDreEQSSYTLTVTAtD 76
                         90
                 ....*....|....*
gi 167004347 332 QEDMAQYSVTQAIVE 346
Cdd:cd11304   77 GGGPPLSSTATVTIT 91
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
437-619 1.94e-06

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 51.84  E-value: 1.94e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  437 ASNTVTKDTATAVVEIQVSERELPStefPTPpeAGGTTGPSSNTTMEAPLTSGT-SQRPATTSSGGSVGPFPPGGTTlrp 515
Cdd:pfam05109 451 STHVPTNLTAPASTGPTVSTADVTS---PTP--AGTTSGASPVTPSPSPRDNGTeSKAPDMTSPTSAVTTPTPNATS--- 522
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  516 PTPASSIP---GGSPTLGTSTSPQTTTPGGDSAQTPKPGTshpTAPTSRTSTSLMTTSSRSDSTQTPKPGTSQPMV--PI 590
Cdd:pfam05109 523 PTPAVTTPtpnATSPTLGKTSPTSAVTTPTPNATSPTPAV---TTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVgeTS 599
                         170       180
                  ....*....|....*....|....*....
gi 167004347  591 PGASTSSQPATPSGSSPQTPKPGTSQSTA 619
Cdd:pfam05109 600 PQANTTNHTLGGTSSTPVVTSPPKNATSA 628
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
40-123 8.27e-03

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.



Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 36.52  E-value: 8.27e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  40 FRVEENTTVSEPLVNIFV--PDG-----LHVTLGPLSTPYAFRIEGK--DLFLNVTPDYEENSLLQADVECKrgDAVVVR 110
Cdd:cd11304    4 VSVPENAPPGTVVLTVSAtdPDSgengeVTYSIVSGNEDGLFSIDPStgEITTAKPLDREEQSSYTLTVTAT--DGGGPP 81
                         90
                 ....*....|....*..
gi 167004347 111 LE----VFVAVLDINDN 123
Cdd:cd11304   82 LSstatVTITVLDVNDN 98
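Each hit above carries a summary line with the same fields (Pssm-ID, Cd Length, Bit Score, E-value). As a minimal sketch, assuming that layout holds for every hit in this dump, the fields can be pulled out with a regular expression and the hits filtered by E-value. The names `SUMMARY_RE` and `parse_summary` are hypothetical, and the `1e-5` cutoff is an arbitrary illustrative threshold, not a value taken from this page.

```python
import re

# Hypothetical pattern for the per-hit summary line as it appears in this dump.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?:\[(?P<flag>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "flag": m.group("flag"),  # e.g. "Multi-domain"; None if absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


# Summary lines copied from the four hits listed above.
lines = [
    "Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 60.79  E-value: 2.84e-11",
    "Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 52.70  E-value: 1.73e-08",
    "Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 51.84  E-value: 1.94e-06",
    "Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 36.52  E-value: 8.27e-03",
]

hits = [parse_summary(line) for line in lines]

# Illustrative cutoff only (an assumption): keep hits with E-value below 1e-5.
strong = [h for h in hits if h["e_value"] < 1e-5]
```

With this cutoff the weakest cadherin repeat hit (interval 40-123, E-value 8.27e-03) is excluded while the other three are kept; a different threshold would change that selection.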
 
Cadherin pfam00028
Cadherin domain;
256-346 8.82e-07

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 47.68  E-value: 8.82e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  256 YSAVVPTGhklpSPLIMSPGPIYAVDGDQAINQSIIYSIIAGNTDGTFIINAHDGNLTMTKSI--PSPMKFTLLIRA-DQ 332
Cdd:pfam00028   1 YSASVPEN----APVGTEVLTVTATDPDLGPNGRIFYSILGGGPGGNFRIDPDTGDISTTKPLdrESIGEYELTVEAtDS 76
                          90
                  ....*....|....
gi 167004347  333 EDMAQYSVTQAIVE 346
Cdd:pfam00028  77 GGPPLSSTATVTIT 90
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
279-355 1.62e-06

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 46.57  E-value: 1.62e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 167004347   279 AVDGDQAINQSIIYSIIAGNTDGTFIINAHDGNLTMTKSIpspmkftlliraDQEDMAQYSVTqaiVEARSVTGNPL 355
Cdd:smart00112   2 ATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKPL------------DREEQPEYTLT---VEATDGGGPPL 63
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
460-711 4.26e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.47  E-value: 4.26e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  460 PSTEFPTPPEAGGTTGPSSNTTMEAPLTSGTSQRPATTSSGGSVGPFPPGGTtlrPPTPASSIPGGSPTLGtstspqtTT 539
Cdd:PHA03307   75 PGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASP---PPSPAPDLSEMLRPVG-------SP 144
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  540 PGGDSAQTPKPGTSHPTAPTSRTSTSLMTTSSRSDSTQTPKPGTSQPMVPIpgastsSQPATPSGSSPQTPKPGTSQSTA 619
Cdd:PHA03307  145 GPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPP------STPPAAASPRPPRRSSPISASAS 218
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  620 TGPISGAGEQGDGQRFSTVDMAVLGGVLGALLLLALICLVIlvhkHYRHRLACCSGKASEPQPSGydnltFLPDHKAKWS 699
Cdd:PHA03307  219 SPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPR----PAPITLPTRIWEASGWNGPS-----SRPGPASSSS 289
                         250
                  ....*....|..
gi 167004347  700 PTPNRKPEPSPK 711
Cdd:PHA03307  290 SPRERSPSPSPS 301
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
466-622 2.51e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 44.64  E-value: 2.51e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347 466 TPPEAGGTTGPSSNTTMEAPLTSGTSQRPAttsSGGSVGPFPPGGTTlrPPTPASSIPGGSPtlGTSTSPQTTTPGGDSA 545
Cdd:COG5164  100 TPAGDGGATGPPDDGGATGPPDDGGSTTPP---SGGSTTPPGDGGST--PPGPGSTGPGGST--TPPGDGGSTTPPGPGG 172
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 167004347 546 QTPKPGTSHPTAPtsrtsTSLMTTSSRSDSTQTPKPGTSQPMVPI-PGASTSSQPATPSGSSPQTPKPGTSQSTATGP 622
Cdd:COG5164  173 STTPPDDGGSTTP-----PNKGETGTDIPTGGTPRQGPDGPVKKDdKNGKGNPPDDRGGKTGPKDQRPKTNPIERRGP 245
 
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
460-775 1.03e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 46.32  E-value: 1.03e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  460 PSTEFPTPPEAGGTTGPSSNTTMEAPLTSGTSQRPATTSSGGSVGPFPPGGTTLRPPTPASSIPGGSPTLGTSTSPQTTT 539
Cdd:PHA03307   63 DRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVG 142
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  540 PGGDSAQTPKPGTSHPTAPTSRTSTSLMTTSSRSDSTQTPKPGTSQPMVPIPGastsSQPATPSGSSPQTPKPGTSQSTA 619
Cdd:PHA03307  143 SPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPP----STPPAAASPRPPRRSSPISASAS 218
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  620 TGPISGAGEQGDGQRFSTVDMAVLGGVLGALLLLALICLVIlvhkHYRHRLACCSGKASEPQPSGydnltFLPDHKAKWS 699
Cdd:PHA03307  219 SPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPR----PAPITLPTRIWEASGWNGPS-----SRPGPASSSS 289
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  700 PTPNRKPEPSPK--------------LAQPPLRPPSPMSSSPTPPSSTPPSPQPKASGSPKTVQAGDSPSAVRSILTKER 765
Cdd:PHA03307  290 SPRERSPSPSPSspgsgpapssprasSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRP 369
                         330
                  ....*....|
gi 167004347  766 RPEGEGGYKA 775
Cdd:PHA03307  370 RPSRAPSSPA 379
PHA03247 PHA03247
large tegument protein UL36; Provisional
462-711 1.86e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 1.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  462 TEFPTPPEAGGTTGPSSNTTMEAPLTSGTSQRPATTSSGGSVGPFPPGGTTLRPPTPASSIPGGSPTlGTSTSPQTTTPG 541
Cdd:PHA03247 2680 PQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA-GPATPGGPARPA 2758
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  542 GDSAQTPKPGTSHPTAPTSRTSTSLMTTSSRSDSTQTPKPGTSQPMVPIPGASTSSQPATPSGSSPQTPKPGTSQSTATG 621
Cdd:PHA03247 2759 RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTA 2838
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  622 PISGAGEQGDGQRFSTVDMAVLGGVLGALLLLALICLVILVHKHYRHRLACCSGKASEPQPsgydnltfLPDHKAKWSPT 701
Cdd:PHA03247 2839 PPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA--------LPPDQPERPPQ 2910
                         250
                  ....*....|
gi 167004347  702 PNRKPEPSPK 711
Cdd:PHA03247 2911 PQAPPPPQPQ 2920
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
460-612 3.58e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 3.58e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  460 PSTEFPTPPEAGGTTGPSSNTTMEAPLTSGTSQRPATTSSGGSVGPFPPGGTTLRP---PTPASSIPGGSP--------- 527
Cdd:pfam03154 184 PSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPqrlPSPHPPLQPMTQppppsqvsp 263
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  528 -TLGTSTSPQTTTPGGDSAQTPKPGTSHPTAPTSRTSTSLMTTSSRSDSTQTPKPGTSQPMVPIPGASTSSQPATPSGSS 606
Cdd:pfam03154 264 qPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQ 343

                  ....*.
gi 167004347  607 PQTPKP 612
Cdd:pfam03154 344 PLPPAP 349
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
415-622 3.87e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.21  E-value: 3.87e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347 415 NKDIMLTAVPMEEARTIRVEVEASNTVTKDTATAVVEIQVSERELPSTEFPTPPEAGGTTGPSSNTTMEAPLTSGTSQRP 494
Cdd:PRK07764 566 NAEVLVTALAEELGGDWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAP 645
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347 495 ATTSSGGSVGPFPPGGTTLRPPTPASSIPGGSPTLGTSTSPQTTTPGGDSAQTPKPGTSHPTAPTSRTSTSLMTTSSRSD 574
Cdd:PRK07764 646 GVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAA 725
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*....
gi 167004347 575 STQTPKPGTSQPMVPIPGA-STSSQPATPSGSSPQTPKPGTSQSTATGP 622
Cdd:PRK07764 726 QGASAPSPAADDPVPLPPEpDDPPDPAGAPAQPPPPPAPAPAAAPAAAP 774
PHA03247 PHA03247
large tegument protein UL36; Provisional
460-622 6.19e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 6.19e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  460 PSTEFPTPPEAGGTTGPSSNttmeaPLTSGTSQRPATTSSGGSVGPFPPGGTTlrPPTPASSIPGGSPTlgtsTSPQTTT 539
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPH-----ALVSATPLPPGPAAARQASPALPAAPAP--PAVPAGPATPGGPA----RPARPPT 2762
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  540 PGGDSAQTPkpgtshPTAPTSRTSTSLMTTSSRSDSTQTPKPGTSQPMVPIPGASTSSQPATPSGSSPQTPKPGTSQSTA 619
Cdd:PHA03247 2763 TAGPPAPAP------PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQP 2836

                  ...
gi 167004347  620 TGP 622
Cdd:PHA03247 2837 TAP 2839
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
500-621 7.14e-04

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.44  E-value: 7.14e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347 500 GGSVGPFPPGGTTLRPPTPASSIPGGSPTlGTSTSPQTTTPGGDSAQTPKPGTSHPTAPTSRTSTSLMTTSSRSDSTQTP 579
Cdd:PRK07764 388 AGGAGAPAAAAPSAAAAAPAAAPAPAAAA-PAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQP 466
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|..
gi 167004347 580 KPGTSQPMVPIPGASTSSQPATPSGSSPQTPKPGTSQSTATG 621
Cdd:PRK07764 467 APAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADD 508
PHA03247 PHA03247
large tegument protein UL36; Provisional
460-622 8.18e-04

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 8.18e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  460 PSTEFPTPPEAGGTTGPSsnttmeAPLTSGTSQRPATTSSGGSVGPFPPGGTTLRPPTPASsiPGGSPTLGTSTSPQTTT 539
Cdd:PHA03247 2717 SATPLPPGPAAARQASPA------LPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAP--PAAPAAGPPRRLTRPAV 2788
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  540 PGGDSAQTPKPGTSHPT-APTSRTSTSLMTTSSRSDSTQTPKPGTSQPMVPIPGASTSSQPATPSGS-SPQTP--KPGTS 615
Cdd:PHA03247 2789 ASLSESRESLPSPWDPAdPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSvAPGGDvrRRPPS 2868

                  ....*..
gi 167004347  616 QSTATGP 622
Cdd:PHA03247 2869 RSPAAKP 2875
PHA03247 PHA03247
large tegument protein UL36; Provisional
460-622 9.13e-04

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 9.13e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  460 PSTEFPTPPEAGGTTGPSSNTTME--APLTSGTSQRPATTSSGGSVGPF-------PPGGTTLRPPTPASSipgGSPTLG 530
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLgrAAQASSPPQRPRRRAARPTVGSLtsladppPPPPTPEPAPHALVS---ATPLPP 2723
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  531 TSTSPQTTTPGGDSAQTPKPGTSHPTAPTSRTSTSLMTTSSRSDSTQTPK--PGTSQPMVPIPGASTSSqPATPSGSSPQ 608
Cdd:PHA03247 2724 GPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAapAAGPPRRLTRPAVASLS-ESRESLPSPW 2802
                         170
                  ....*....|....
gi 167004347  609 TPKPGTSQSTATGP 622
Cdd:PHA03247 2803 DPADPPAAVLAPAA 2816
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
479-620 1.24e-03

Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 42.36  E-value: 1.24e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347 479 NTTMEAPLTSGTSQRPattsSGGSVGPfpPGGTTLrpPTPASSIPGGSPTLGTSTSPQTTTPGGDSAQTPKPGTSHPTAP 558
Cdd:PRK14959 358 NLAMLPRLMPVESLRP----SGGGASA--PSGSAA--EGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAA 429
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 167004347 559 TSRTSTSLMTTSSRSDSTQTPKP----GTSQPMVPIPGASTSSQPATPSGSSPQTPKPGTSQSTAT 620
Cdd:PRK14959 430 PSPRVPWDDAPPAPPRSGIPPRPaprmPEASPVPGAPDSVASASDAPPTLGDPSDTAEHTPSGPRT 495
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
460-634 2.52e-03

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.70  E-value: 2.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  460 PSTEFPTPPEAGGTTGPSSNTTMEapltSGTSQRPATTSSGGSVGPFPPGGTTLRPPTPASSIPGGSPTLGTSTSPQTtt 539
Cdd:PHA03307  215 ASASSPAPAPGRSAADDAGASSSD----SSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSS-- 288
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  540 pGGDSAQTPKPGTSHPTAPTSRTSTSLMTTSSRSDSTQTPK-----PGTSQPMVPiPGASTSSQPATPSGSSPQTPK-PG 613
Cdd:PHA03307  289 -SSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSStssssESSRGAAVS-PGPSPSRSPSPSRPPPPADPSsPR 366
                         170       180
                  ....*....|....*....|.
gi 167004347  614 TSQSTATGPISGAGEQGDGQR 634
Cdd:PHA03307  367 KRPRPSRAPSSPAASAGRPTR 387
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
437-623 2.65e-03

Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.10  E-value: 2.65e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  437 ASNTVTKDTATAVVEIQVSERELPSTEFPTPPEAGGTTGPSSNTTMEAPLTSGTSQRPATTSSGGSVGPFPPGGTTLRPP 516
Cdd:pfam17823 127 AQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATL 206
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  517 TPASSI-----PGGSPTLGTSTSPQTTTPGGDSAQTPKPGTSHPTA-PTSRTSTSLMTTSSRSDSTQTPKPGTSQPMVPI 590
Cdd:pfam17823 207 TPARGIstaatATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAAlATLAAAAGTVASAAGTINMGDPHARRLSPAKHM 286
                         170       180       190
                  ....*....|....*....|....*....|...
gi 167004347  591 PGASTSSQPATPSGSSPQTPkpgTSQSTATGPI 623
Cdd:pfam17823 287 PSDTMARNPAAPMGAQAQGP---IIQVSTDQPV 316
PHA03247 PHA03247
large tegument protein UL36; Provisional
460-637 4.36e-03

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.08  E-value: 4.36e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  460 PSTEFPTPPEAGGTTGPSSNTTMEAPLTSGTSQRPAT--TSSGGSVG-----PFPPGGTTLRPPTP-----ASSIPGGSP 527
Cdd:PHA03247 2564 PDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRApvDDRGDPRGpappsPLPPDTHAPDPPPPspspaANEPDPHPP 2643
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  528 TLGTSTSPQTTTPGGDSAQTPKPGTSHPTAPTSRTST----------SLMTTSSRSDSTQTPKPGTSQPMVPIPGASTSS 597
Cdd:PHA03247 2644 PTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPqrprrraarpTVGSLTSLADPPPPPPTPEPAPHALVSATPLPP 2723
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|
gi 167004347  598 QPATPSGSSPQTPKPGTSQSTATGPISGAGEQGDGQRFST 637
Cdd:PHA03247 2724 GPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT 2763
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
461-611 7.12e-03

Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 39.74  E-value: 7.12e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347 461 STEFPTPPEAGGTTGPSSNTTMEAPLTSGTSQRPAT-TSSGGSVGPFPPGGTTLRPPTPASSIPGGSPTLGTSTSPQTTT 539
Cdd:COG3469   64 TAASSTAATSSTTSTTATATAAAAAATSTSATLVATsTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSS 143
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 167004347 540 PGGDSAQTPKPGTSHPTAPTSRTSTSLMTTSSRSDSTQTPKPGTSQPmVPIPGASTSSQPATPSGSSPQTPK 611
Cdd:COG3469  144 AGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTA-SGATTPSATTTATTTGPPTPGLPK 214
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
40-123 8.27e-03

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.


Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 36.52  E-value: 8.27e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 167004347  40 FRVEENTTVSEPLVNIFV--PDG-----LHVTLGPLSTPYAFRIEGK--DLFLNVTPDYEENSLLQADVECKrgDAVVVR 110
Cdd:cd11304    4 VSVPENAPPGTVVLTVSAtdPDSgengeVTYSIVSGNEDGLFSIDPStgEITTAKPLDREEQSSYTLTVTAT--DGGGPP 81
                         90
                 ....*....|....*..
gi 167004347 111 LE----VFVAVLDINDN 123
Cdd:cd11304   82 LSstatVTITVLDVNDN 98
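
The paired alignment rows above (a `gi` query row over a `Cdd:` subject row) can be compared column by column. The following is a minimal sketch, not part of the CDD output: the function name `aligned_identity` is hypothetical, and it assumes that `-` marks a gap and that lowercase letters mark residues outside the aligned block, so both are skipped when counting.

```python
def aligned_identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Assumptions (not stated on the page itself): '-' is a gap and
    lowercase letters are unaligned residues; such columns are ignored.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")

    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        # Skip gap columns in either row.
        if q == "-" or s == "-":
            continue
        # Skip columns where either residue is lowercase (treated as unaligned).
        if q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Last block of the Cadherin_repeat (cd11304) alignment shown above.
query_row = "LE----VFVAVLDINDN"
subject_row = "LSstatVTITVLDVNDN"
identical, aligned = aligned_identity(query_row, subject_row)
print(f"{identical} identical of {aligned} aligned columns")
```

Only the residue strings are passed in; the row labels and the start and end coordinates are stripped beforehand.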
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
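
Every hit listed on this page carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`, and the search was run with an E-value threshold of 0.01. The sketch below shows one way to parse those summary lines and check them against that threshold. The helper names (`parse_summary`, `passes_threshold`) are hypothetical, and treating the cutoff as inclusive (`<=`) is an assumption rather than documented CD-Search behaviour.

```python
import re

# Matches the per-hit summary line as it is printed on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<kind>[^\]]*)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str):
    """Parse one summary line into a dict, or return None if it does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "kind": match["kind"],
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def passes_threshold(hit: dict, threshold: float = 0.01) -> bool:
    """True if the hit's E-value is at or below the threshold (inclusive by assumption)."""
    return hit["e_value"] <= threshold


# Two summary lines copied from the hit list above.
summary_lines = [
    "Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.21  E-value: 3.87e-04",
    "Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 36.52  E-value: 8.27e-03",
]
hits = [parse_summary(line) for line in summary_lines]
kept = [hit for hit in hits if hit is not None and passes_threshold(hit)]
for hit in kept:
    print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

Both sample hits fall under the 0.01 cutoff, which is consistent with their appearing in the list.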

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.