NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1908832871|ref|NP_001374145|]
View 

protein ENTREP2 isoform 4 precursor [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
175-382 1.26e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 1.26e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871  175 APSPFGTLYDVAINSPGLLYPAELPPPYEAVVGQPPASQVtsiGQQVAESSSGDPNTSAGFSTPV-PADSTSLLVSEGTA 253
Cdd:PHA03247  2688 ARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAA---ARQASPALPAAPAPPAVPAGPAtPGGPARPARPPTTA 2764
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871  254 TPGSSPSPDGPVGAPAPSEPALPPGHVSPEDPGMGSQVQPGPGRVSRSTSDPTLCTSSMAGDASSHRPSCSQDLEAGLSE 333
Cdd:PHA03247  2765 GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG 2844
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*....
gi 1908832871  334 AVPGSASMSRSataacraqLSPAGDpdtwkTDQRPTPEPFPATSKERPR 382
Cdd:PHA03247  2845 PPPPSLPLGGS--------VAPGGD-----VRRRPPSRSPAAKPAAPAR 2880
CD20 super family cl04401
CD20-like family; This family includes the CD20 protein and the beta subunit of the high ...
1-103 2.89e-03

CD20-like family; This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm. This family also includes LR8 like proteins from humans, mice and rats. The function of the human LR8 protein is unknown although it is known to be strongly expressed in the lung fibroblasts. This family also includes sarcospan is a transmembrane component of dystrophin-associated glycoprotein. Loss of the sarcoglycan complex and sarcospan alone is sufficient to cause muscular dystrophy. The role of the sarcoglycan complex and sarcospan is thought to be to strengthen the dystrophin axis connecting the basement membrane with the cytoskeleton.


The actual alignment was detected with superfamily member pfam04103:

Pssm-ID: 461174  Cd Length: 155  Bit Score: 38.40  E-value: 2.89e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871   1 MLLSAVCVMLNLAGSILSCQN-AQLVNSLEGCQLIK--FDSVEVCVCCELQHQSSGCsnlgetlklnplqENCNAVRLTL 77
Cdd:pfam04103  63 LLLNLLSLFTAVAGIILLSLSlALLTSAHECCMSESdlTPSTSTCSCKSSSEDPECR-------------AYCSSLRGLF 129
                          90       100
                  ....*....|....*....|....*.
gi 1908832871  78 KDLLFSVCALNVLSTIVCALATAMCC 103
Cdd:pfam04103 130 TGILSMLLILTVLELLVSLLSAILGC 155
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
175-382 1.26e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 1.26e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871  175 APSPFGTLYDVAINSPGLLYPAELPPPYEAVVGQPPASQVtsiGQQVAESSSGDPNTSAGFSTPV-PADSTSLLVSEGTA 253
Cdd:PHA03247  2688 ARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAA---ARQASPALPAAPAPPAVPAGPAtPGGPARPARPPTTA 2764
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871  254 TPGSSPSPDGPVGAPAPSEPALPPGHVSPEDPGMGSQVQPGPGRVSRSTSDPTLCTSSMAGDASSHRPSCSQDLEAGLSE 333
Cdd:PHA03247  2765 GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG 2844
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*....
gi 1908832871  334 AVPGSASMSRSataacraqLSPAGDpdtwkTDQRPTPEPFPATSKERPR 382
Cdd:PHA03247  2845 PPPPSLPLGGS--------VAPGGD-----VRRRPPSRSPAAKPAAPAR 2880
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
191-372 7.41e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.06  E-value: 7.41e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871 191 GLLYPAELPPPYEAVVGQP-PASQVTSIGQQVAESSSGDPNTSAGFSTPVpadstSLLVSEGTATPGSSPSPDGPVgAPA 269
Cdd:pfam03154 179 GAASPPSPPPPGTTQAATAgPTPSAPSVPPQGSPATSQPPNQTQSTAAPH-----TLIQQTPTLHPQRLPSPHPPL-QPM 252
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871 270 PSEPalPPGHVSPE---DPGMGSQVQPGPGRVSrstSDPTLCTSSMAGDASSHRPSCSQ-DLEAGLSEAVPGSASmSRSA 345
Cdd:pfam03154 253 TQPP--PPSQVSPQplpQPSLHGQMPPMPHSLQ---TGPSHMQHPVPPQPFPLTPQSSQsQVPPGPSPAAPGQSQ-QRIH 326
                         170       180
                  ....*....|....*....|....*..
gi 1908832871 346 TAACRAQLSPAGDPDTWKTDQRPTPEP 372
Cdd:pfam03154 327 TPPSQSQLQSQQPPREQPLPPAPLSMP 353
CD20 pfam04103
CD20-like family; This family includes the CD20 protein and the beta subunit of the high ...
1-103 2.89e-03

CD20-like family; This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm. This family also includes LR8 like proteins from humans, mice and rats. The function of the human LR8 protein is unknown although it is known to be strongly expressed in the lung fibroblasts. This family also includes sarcospan is a transmembrane component of dystrophin-associated glycoprotein. Loss of the sarcoglycan complex and sarcospan alone is sufficient to cause muscular dystrophy. The role of the sarcoglycan complex and sarcospan is thought to be to strengthen the dystrophin axis connecting the basement membrane with the cytoskeleton.


Pssm-ID: 461174  Cd Length: 155  Bit Score: 38.40  E-value: 2.89e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871   1 MLLSAVCVMLNLAGSILSCQN-AQLVNSLEGCQLIK--FDSVEVCVCCELQHQSSGCsnlgetlklnplqENCNAVRLTL 77
Cdd:pfam04103  63 LLLNLLSLFTAVAGIILLSLSlALLTSAHECCMSESdlTPSTSTCSCKSSSEDPECR-------------AYCSSLRGLF 129
                          90       100
                  ....*....|....*....|....*.
gi 1908832871  78 KDLLFSVCALNVLSTIVCALATAMCC 103
Cdd:pfam04103 130 TGILSMLLILTVLELLVSLLSAILGC 155
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
175-382 1.26e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 1.26e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871  175 APSPFGTLYDVAINSPGLLYPAELPPPYEAVVGQPPASQVtsiGQQVAESSSGDPNTSAGFSTPV-PADSTSLLVSEGTA 253
Cdd:PHA03247  2688 ARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAA---ARQASPALPAAPAPPAVPAGPAtPGGPARPARPPTTA 2764
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871  254 TPGSSPSPDGPVGAPAPSEPALPPGHVSPEDPGMGSQVQPGPGRVSRSTSDPTLCTSSMAGDASSHRPSCSQDLEAGLSE 333
Cdd:PHA03247  2765 GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG 2844
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*....
gi 1908832871  334 AVPGSASMSRSataacraqLSPAGDpdtwkTDQRPTPEPFPATSKERPR 382
Cdd:PHA03247  2845 PPPPSLPLGGS--------VAPGGD-----VRRRPPSRSPAAKPAAPAR 2880
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
227-394 1.61e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.47  E-value: 1.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871  227 GDPNTSAGFSTPVPADSTSLLVSEGTATPGSS--------PSPDGPVGAPAPSEPALPPGHVSPEDPGMGSQVQPGPGRV 298
Cdd:PHA03307    69 TGPPPGPGTEAPANESRSTPTWSLSTLAPASParegsptpPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPP 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871  299 SRSTSDPTLCTSSMAGDASShrPSCSQDLEAGLSEAVPGSASMSRSATAACRAQLSPAGDPDTWKTDQRPTPEPFPATSK 378
Cdd:PHA03307   149 AASPPAAGASPAAVASDAAS--SRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGR 226
                          170       180
                   ....*....|....*....|
gi 1908832871  379 E----RPRSLVDSKAYADAR 394
Cdd:PHA03307   227 SaaddAGASSSDSSSSESSG 246
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
195-383 6.54e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 6.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871  195 PAELPPPYEAVVGQPPASQVTSIGQQVAESSSGDPNTSAGFSTPVPADSTSLLVSEGTA------TPGSSPSPDGPVGAP 268
Cdd:PHA03307   190 PAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGwgpeneCPLPRPAPITLPTRI 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871  269 APSEPALP----PGHVSPEDPGMGSQVQPGPGR-VSRSTSDPTLCTSSMAGDASSHRPSCSQDLEAGLSEAVPGSASMSR 343
Cdd:PHA03307   270 WEASGWNGpssrPGPASSSSSPRERSPSPSPSSpGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSR 349
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1908832871  344 SATAAcraqlSPAGDPDTWKTDQRPTPE---PFPATSKERPRS 383
Cdd:PHA03307   350 SPSPS-----RPPPPADPSSPRKRPRPSrapSSPAASAGRPTR 387
PHA03247 PHA03247
large tegument protein UL36; Provisional
184-374 1.86e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871  184 DVAINSPGLLYPAELPPPYEAVVGQP-PASQVTSIGQQVAESSSGDPNTSAGFSTPVPADSTSLLVSEGTATPGSSPSPD 262
Cdd:PHA03247  2546 DDAGDPPPPLPPAAPPAAPDRSVPPPrPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871  263 GPVGAPAPSEPALPPGHVSPEDPGMGSQVQPGPGRVSRstsdptlctssmagdassHRPSCSQDLEAGLSEAVPGSasmS 342
Cdd:PHA03247  2626 PPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSR------------------PRRARRLGRAAQASSPPQRP---R 2684
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1908832871  343 RSATAACRAQLSPAGDPdtwkTDQRPTPEPFP 374
Cdd:PHA03247  2685 RRAARPTVGSLTSLADP----PPPPPTPEPAP 2712
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
191-372 7.41e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.06  E-value: 7.41e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871 191 GLLYPAELPPPYEAVVGQP-PASQVTSIGQQVAESSSGDPNTSAGFSTPVpadstSLLVSEGTATPGSSPSPDGPVgAPA 269
Cdd:pfam03154 179 GAASPPSPPPPGTTQAATAgPTPSAPSVPPQGSPATSQPPNQTQSTAAPH-----TLIQQTPTLHPQRLPSPHPPL-QPM 252
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871 270 PSEPalPPGHVSPE---DPGMGSQVQPGPGRVSrstSDPTLCTSSMAGDASSHRPSCSQ-DLEAGLSEAVPGSASmSRSA 345
Cdd:pfam03154 253 TQPP--PPSQVSPQplpQPSLHGQMPPMPHSLQ---TGPSHMQHPVPPQPFPLTPQSSQsQVPPGPSPAAPGQSQ-QRIH 326
                         170       180
                  ....*....|....*....|....*..
gi 1908832871 346 TAACRAQLSPAGDPDTWKTDQRPTPEP 372
Cdd:pfam03154 327 TPPSQSQLQSQQPPREQPLPPAPLSMP 353
PHA03378 PHA03378
EBNA-3B; Provisional
157-395 9.34e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.59  E-value: 9.34e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871 157 EYTCTPSTEAQRGlHLDFAPSPFGTLYDVAIN-SPGLLY-----PAELPPPYEA-VVGQPPASQVTSIGQQVAESSSGDP 229
Cdd:PHA03378  658 EITPYKPTWTQIG-HIPYQPSPTGANTMLPIQwAPGTMQpppraPTPMRPPAAPpGRAQRPAAATGRARPPAAAPGRARP 736
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871 230 NTSAGFSTPVPAdSTSLLVSEGTATPGSSPSPDGPVGAPAPSEPALPPghvspedPGMGSQVQPGPGRVSRSTSDPT--- 306
Cdd:PHA03378  737 PAAAPGRARPPA-AAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAP-------PAPQQRPRGAPTPQPPPQAGPTsmq 808
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871 307 LCTSSMAGDASSHRPSCSQDLEAGLSEAVPGSASMSRSATAACRAQLSPAGDPDTWKTDQRPTPEPFPATSKERPRSLVD 386
Cdd:PHA03378  809 LMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQAAAGPTPSPGSGTSDKIVQAPVFYPPVLQPIQVMRQLGS 888

                  ....*....
gi 1908832871 387 SKAYADARV 395
Cdd:PHA03378  889 VRAAAASTV 897
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
173-382 9.80e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.40  E-value: 9.80e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871 173 DFAPSPFGTLYDVAINSPGLLYPAELPPPYEAVVGQPPASQVTSIGQQVAESSSGDPntsagfstPVPADSTSLLVSEGT 252
Cdd:PRK12323  371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRS--------PAPEALAAARQASAR 442
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871 253 ATPGSSPSPDGPVGAPAPSEPALPPGHVSPEDPGMGSQVQPGPGRVSRSTSDPTLCTSSMAGDASSHRPScsqDLEAGLS 332
Cdd:PRK12323  443 GPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPA---QPDAAPA 519
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 1908832871 333 EAVpgSASMSRSATAACRAQLSPAGDPDTWKTDQRPTPEPFPATSKERPR 382
Cdd:PRK12323  520 GWV--AESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPR 567
PRK13729 PRK13729
conjugal transfer pilus assembly protein TraB; Provisional
222-302 1.41e-03

conjugal transfer pilus assembly protein TraB; Provisional


Pssm-ID: 184281 [Multi-domain]  Cd Length: 475  Bit Score: 40.96  E-value: 1.41e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871 222 AESSSGDPNTSAGfsTPVPADSTSLLVSEGTATPGSSPSPDGPVGAPAPSEP-ALPPGHVSPEDPGMGSQVQPGPGRVSR 300
Cdd:PRK13729  120 VKALGANPVTATG--EPVPQMPASPPGPEGEPQPGNTPVSFPPQGSVAVPPPtAFYPGNGVTPPPQVTYQSVPVPNRIQR 197

                  ..
gi 1908832871 301 ST 302
Cdd:PRK13729  198 KT 199
PRK12495 PRK12495
hypothetical protein; Provisional
219-359 1.77e-03

hypothetical protein; Provisional


Pssm-ID: 183558 [Multi-domain]  Cd Length: 226  Bit Score: 39.85  E-value: 1.77e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871 219 QQVAESSSGDPNTSAGFSTPVPADSTSLLVSEGTATPGSSPSPDGPVGAPAPSEPALPPGHVSPEDPGMGSQVQPGPGRV 298
Cdd:PRK12495   66 QPVTEDGAAGDDAGDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAAARDGPTPDPT 145
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1908832871 299 SRSTSDPTLCTSSMAGDASSHRPSCSQDLEAGLSEAVPGSASMSRSATAACRaQLSPAGDP 359
Cdd:PRK12495  146 AQPATPDERRSPRQRPPVSGEPPTPSTPDAHVAGTLQAARESLVETLARFAR-RAAATDDP 205
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
189-383 1.89e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.92  E-value: 1.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871  189 SPGLLYPAELPPPYEAVVGQPPASQVTSIGQQVAESSSGDPNTSAG---FSTPVPADSTSLLVSEGTATPGSSPSPDGPV 265
Cdd:PHA03307   125 SPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRqaaLPLSSPEETARAPSSPPAEPPPSTPPAAASP 204
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871  266 GAPAPSEPALPPGHVSPEDPGMGSQVQPGPGRVSRSTSDPTLCTSSMAGDASSHRPScSQDLEAGLSEAVPGSASMSRSA 345
Cdd:PHA03307   205 RPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPA-PITLPTRIWEASGWNGPSSRPG 283
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1908832871  346 TAACRAQLSPAGDPDTWKTDQRPTPEPFPATSKERPRS 383
Cdd:PHA03307   284 PASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSS 321
CD20 pfam04103
CD20-like family; This family includes the CD20 protein and the beta subunit of the high ...
1-103 2.89e-03

CD20-like family; This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm. This family also includes LR8 like proteins from humans, mice and rats. The function of the human LR8 protein is unknown although it is known to be strongly expressed in the lung fibroblasts. This family also includes sarcospan is a transmembrane component of dystrophin-associated glycoprotein. Loss of the sarcoglycan complex and sarcospan alone is sufficient to cause muscular dystrophy. The role of the sarcoglycan complex and sarcospan is thought to be to strengthen the dystrophin axis connecting the basement membrane with the cytoskeleton.


Pssm-ID: 461174  Cd Length: 155  Bit Score: 38.40  E-value: 2.89e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871   1 MLLSAVCVMLNLAGSILSCQN-AQLVNSLEGCQLIK--FDSVEVCVCCELQHQSSGCsnlgetlklnplqENCNAVRLTL 77
Cdd:pfam04103  63 LLLNLLSLFTAVAGIILLSLSlALLTSAHECCMSESdlTPSTSTCSCKSSSEDPECR-------------AYCSSLRGLF 129
                          90       100
                  ....*....|....*....|....*.
gi 1908832871  78 KDLLFSVCALNVLSTIVCALATAMCC 103
Cdd:pfam04103 130 TGILSMLLILTVLELLVSLLSAILGC 155
motB PRK12799
flagellar motor protein MotB; Reviewed
211-322 3.19e-03

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 39.70  E-value: 3.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871 211 ASQVTSIGQQVAESSSGDPNTSAGFSTPVPADSTSLLVSEGTAT--PGSSPSPDGPVGAPAPSEPALPPGHVSPEDPGMG 288
Cdd:PRK12799  303 AVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAValSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTT 382
                          90       100       110
                  ....*....|....*....|....*....|....
gi 1908832871 289 SQVQPGPGRVSRSTSDPTlcTSSMAGDASSHRPS 322
Cdd:PRK12799  383 ETQQSSTGNITSTANGPT--TSLPAAPASNIPVS 414
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
199-403 3.60e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 39.97  E-value: 3.60e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871 199 PPPYEAVVGQPPASQVTSIGQQVAESSSGDPNTSAGFSTPVPADSTSLLVSEGTATPGSSPSPDGPvGAPAPSEPALPPG 278
Cdd:PRK07764  610 EEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGG-AAPAAPPPAPAPA 688
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871 279 HVSPEDPGMGSQVQPGP------GRVSRSTSDPTLCTSSMAGDASSHRPSCSQDLEAGLSEAVPGSASMSRSATAACRAq 352
Cdd:PRK07764  689 APAAPAGAAPAQPAPAPaatppaGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPA- 767
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1908832871 353 lSPAGDPDTWKTDQRPTPEPFPATSKERPRSLVDSKAYADA---RVLVAKFLEH 403
Cdd:PRK07764  768 -AAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVAMElleEELGAKKIEE 820
DUF4641 pfam15483
Domain of unknown function (DUF4641); This family of proteins is found in eukaryotes. Proteins ...
222-278 4.32e-03

Domain of unknown function (DUF4641); This family of proteins is found in eukaryotes. Proteins in this family are typically between 201 and 519 amino acids in length.


Pssm-ID: 464741  Cd Length: 443  Bit Score: 39.34  E-value: 4.32e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1908832871 222 AESSSGDPNTSAGfstPVPADSTSLLVSEGTATP-GSSPSPD--GPVGAPAPSE-PALPPG 278
Cdd:pfam15483 360 GEFSSGDPNIRAP---QVPGNSQPSALSQGGVRPrGPAPSGDqePPVRPPRPERqQQPPPG 417
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
207-359 6.02e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 39.00  E-value: 6.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908832871  207 GQPPASQVTSIGQQVAESSSGDPNTSAGFSTPVPADSTsllvsegtatPGSSPSPDGPVGAPAPSEPALPPGHVSPEDPG 286
Cdd:PHA03307   305 SGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVS----------PGPSPSRSPSPSRPPPPADPSSPRKRPRPSRA 374
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1908832871  287 MGSQVQPgPGRVSRSTSDPTLCTSSMAGDASSHRPscsqdleAGLSEAVPGSASMSRSATAACRAQLSPAGDP 359
Cdd:PHA03307   375 PSSPAAS-AGRPTRRRARAAVAGRARRRDATGRFP-------AGRPRPSPLDAGAASGAFYARYPLLTPSGEP 439
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH