Conserved domains on  [gi|2462515030|ref|XP_054195411|]

period circadian protein homolog 3 isoform X9 [Homo sapiens]

Protein Classification

PAS and Period_C domain-containing protein (domain architecture ID 12888871)

protein containing domains PAS, Herpes_BLLF1, and Period_C

List of domain hits

Name Accession Description Interval E-value
Period_C super family cl13540
Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is ...
1075-1177 1.17e-25

Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is typically between 164 and 200 amino acids in length. This domain is found associated with pfam08447.


The actual alignment was detected with superfamily member pfam12114:

Pssm-ID: 463464  Cd Length: 171  Bit Score: 104.79  E-value: 1.17e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030 1075 SVYSSKISQNGQQSQDVQKKETF-PNVAEEPIWRMIRQTPERILMTYQVPERVKEVVLKEDLEKLESMRQQQPQFSHGQK 1153
Cdd:pfam12114   68 SIDSSENNHKAKKTAEVGEEEHFiKCVLQDPIWLLMANTDDSVMMTYQIPSRDLETVLKEDREKLKAMQKMQPRFTEDQK 147
                           90       100
                   ....*....|....*....|....
gi 2462515030 1154 EELAKVYNWIQSQTVTQEIDIQAC 1177
Cdd:pfam12114  148 GELAEVHPWIQKGGLPAALDLSEC 171
PAS cd00130
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
284-376 1.79e-12

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.


Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.79e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  284 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEhspIRFCTQNGDYIILDSSWSSFVNPW 363
Cdd:cd00130     14 ILYANPAAEQLLGYSPEELIGKSLLDLIHPEDREELRERLENLLSGGEPVTLE---VRLRRKDGSVIWVLVSLTPIRDEG 90
                           90
                   ....*....|...
gi 2462515030  364 SRKISFIIGRHKV 376
Cdd:cd00130     91 GEVIGLLGVVRDI 103
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
744-1055 3.05e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly by altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is with the short glutamine repeat in the transcriptional coactivator CREB-binding protein (CBP). This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 58.24  E-value: 3.05e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  744 LPEPPDSSSSNTGSgprrgAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPylvPAFPLPAATSPGREYAAPGTAPEG 823
Cdd:pfam03154  148 IPSPQDNESDSDSS-----AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAA---TAGPTPSAPSVPPQGSPATSQPPN 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  824 LHGLPLSEG--LQPYPAFPFPYLDTfmtvflPDPPVCPLLSPSflpcpflgatassaispsmssamsptldPPPSVTSQR 901
Cdd:pfam03154  220 QTQSTAAPHtlIQQTPTLHPQRLPS------PHPPLQPMTQPP----------------------------PPSQVSPQP 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  902 REEEKWEAQSE--GHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQCVTGNNGSES-SPATTGALSTGSP 978
Cdd:pfam03154  266 LPQPSLHGQMPpmPHSLQTGPSHMQHPVP------PQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQSQLQSQQP 339
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462515030  979 PRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmkNPSHPTASTLSMGLPPSrtPSHPTATVLSTGSPPSESP 1055
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPP--HLSGPSPFQMNSNLPPP--PALKPLSSLSTHHPPSAHP 412
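
The per-hit summary lines above (Pssm-ID, Cd Length, Bit Score, E-value) and the model coordinates at the two ends of each Cdd: row can be read programmatically. The following is a minimal Python sketch, not an NCBI tool: the function names parse_summary and model_coverage are hypothetical, and the regular expression assumes the summary line keeps exactly the layout shown on this page.

```python
import re

# Assumed layout of a summary line, copied from this page:
#   Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.79e-12
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Parse one 'Pssm-ID: ...' summary line into typed fields."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a recognised summary line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def model_coverage(model_start: int, model_end: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the alignment (1-based, inclusive)."""
    if not 1 <= model_start <= model_end <= cd_length:
        raise ValueError("model coordinates must lie within 1..cd_length")
    return (model_end - model_start + 1) / cd_length
```

For the Period_C hit, the alignment spans model positions 68 to 171 of a 171-residue model, so model_coverage(68, 171, 171) is 104/171, or about 0.61: the N-terminal part of the model is not matched by this query.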
 
Name Accession Description Interval E-value
Period_C pfam12114
Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is ...
1075-1177 1.17e-25

Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is typically between 164 and 200 amino acids in length. This domain is found associated with pfam08447.


Pssm-ID: 463464  Cd Length: 171  Bit Score: 104.79  E-value: 1.17e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030 1075 SVYSSKISQNGQQSQDVQKKETF-PNVAEEPIWRMIRQTPERILMTYQVPERVKEVVLKEDLEKLESMRQQQPQFSHGQK 1153
Cdd:pfam12114   68 SIDSSENNHKAKKTAEVGEEEHFiKCVLQDPIWLLMANTDDSVMMTYQIPSRDLETVLKEDREKLKAMQKMQPRFTEDQK 147
                           90       100
                   ....*....|....*....|....
gi 2462515030 1154 EELAKVYNWIQSQTVTQEIDIQAC 1177
Cdd:pfam12114  148 GELAEVHPWIQKGGLPAALDLSEC 171
PAS cd00130
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
284-376 1.79e-12

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.


Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.79e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  284 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEhspIRFCTQNGDYIILDSSWSSFVNPW 363
Cdd:cd00130     14 ILYANPAAEQLLGYSPEELIGKSLLDLIHPEDREELRERLENLLSGGEPVTLE---VRLRRKDGSVIWVLVSLTPIRDEG 90
                           90
                   ....*....|...
gi 2462515030  364 SRKISFIIGRHKV 376
Cdd:cd00130     91 GEVIGLLGVVRDI 103
PAS_3 pfam08447
PAS fold; The PAS fold corresponds to the structural domain that has previously been defined ...
284-372 6.89e-12

PAS fold; The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.


Pssm-ID: 430001 [Multi-domain]  Cd Length: 89  Bit Score: 62.74  E-value: 6.89e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  284 FLEVDEKAVPLLGYLPQDLIGT--SILSYLHPEDRSLMVAIHQKVLKYAGhpPFEHsPIRFCTQNGDYIILDSSWSSFVN 361
Cdd:pfam08447    1 IIYWSPRFEEILGYTPEELLGKgeSWLDLVHPDDRERVREALWEALKGGE--PYSG-EYRIRRKDGEYRWVEARARPIRD 77
                           90
                   ....*....|.
gi 2462515030  362 pWSRKISFIIG 372
Cdd:pfam08447   78 -ENGKPVRVIG 87
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
744-1055 3.05e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly by altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is with the short glutamine repeat in the transcriptional coactivator CREB-binding protein (CBP). This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 58.24  E-value: 3.05e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  744 LPEPPDSSSSNTGSgprrgAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPylvPAFPLPAATSPGREYAAPGTAPEG 823
Cdd:pfam03154  148 IPSPQDNESDSDSS-----AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAA---TAGPTPSAPSVPPQGSPATSQPPN 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  824 LHGLPLSEG--LQPYPAFPFPYLDTfmtvflPDPPVCPLLSPSflpcpflgatassaispsmssamsptldPPPSVTSQR 901
Cdd:pfam03154  220 QTQSTAAPHtlIQQTPTLHPQRLPS------PHPPLQPMTQPP----------------------------PPSQVSPQP 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  902 REEEKWEAQSE--GHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQCVTGNNGSES-SPATTGALSTGSP 978
Cdd:pfam03154  266 LPQPSLHGQMPpmPHSLQTGPSHMQHPVP------PQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQSQLQSQQP 339
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462515030  979 PRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmkNPSHPTASTLSMGLPPSrtPSHPTATVLSTGSPPSESP 1055
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPP--HLSGPSPFQMNSNLPPP--PALKPLSSLSTHHPPSAHP 412
PAS smart00091
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
284-328 1.24e-07

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels.


Pssm-ID: 214512  Cd Length: 67  Bit Score: 49.71  E-value: 1.24e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 2462515030   284 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLK 328
Cdd:smart00091   23 ILYANPAAEELLGYSPEELIGKSLLELIHPEDRERVQEALQRLLS 67
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
741-1056 6.64e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 54.02  E-value: 6.64e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  741 RKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSP-------------TFPPAAMVPSQAPYLVPAFPL-PA 806
Cdd:PHA03307    64 RFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPtppgpsspdppppTPPPASPPPSPAPDLSEMLRPvGS 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  807 ATSPGREYAAPGTAPEGLHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSA 886
Cdd:PHA03307   144 PGPPPAASPPAAGASPAAVASDAASSRQAALPLS-------SPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS 216
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  887 MsptldPPPSVTSQRREEEKWEAQSEGhpfiTSRSSSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNN-GSES 965
Cdd:PHA03307   217 A-----SSPAPAPGRSAADDAGASSSD----SSSSESSGCGWGPENECPLPRPAP-----ITLPTRIWEASGWNGpSSRP 282
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  966 SPATTGALSTGSPPRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmKNPSHPTASTLSMGLPPSRTPSHPTATvl 1045
Cdd:PHA03307   283 GPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS-SESSRGAAVSPGPSPSRSPSPSRPPPP-- 359
                          330
                   ....*....|.
gi 2462515030 1046 STGSPPSESPS 1056
Cdd:PHA03307   360 ADPSSPRKRPR 370
KinA COG5805
Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle ...
273-373 3.23e-03

Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle control, cell division, chromosome partitioning, Signal transduction mechanisms];


Pssm-ID: 444507 [Multi-domain]  Cd Length: 496  Bit Score: 41.64  E-value: 3.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  273 IFTTTHTPGcVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEHSPIrfcTQNGDYIIL 352
Cdd:COG5805    169 LICVIDTDG-RILFINESIERLFGAPREELIGKNLLELLHPCDKEEFKERIESITEVWQEFIIEREII---TKDGRIRYF 244
                           90       100
                   ....*....|....*....|..
gi 2462515030  353 DSSWSSFVNP-WSRKISFIIGR 373
Cdd:COG5805    245 EAVIVPLIDTdGSVKGILVILR 266
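
The paired gi / Cdd: rows in the alignments above can be compared column by column to estimate how similar the query is to each domain model. The sketch below is only an illustration under stated assumptions: the function name percent_identity is hypothetical, '-' is assumed to mark a gap, and lowercase letters are assumed to mark residues that are not aligned to the model, so both are skipped. Rows that wrap over several lines would need to be concatenated first.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of aligned columns in which query and subject carry the same residue.

    Assumption: columns with a gap ('-') or a lowercase (unaligned) residue
    in either row do not count as aligned columns.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # gap or unaligned residue: skip this column
        aligned += 1
        if q == s:
            identical += 1
    if aligned == 0:
        return 0.0
    return 100.0 * identical / aligned


if __name__ == "__main__":
    # Single-row alignment of the smart00091 (PAS) hit, copied from this page.
    query_row = "FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLK"
    model_row = "ILYANPAAEELLGYSPEELIGKSLLELIHPEDRERVQEALQRLLS"
    print(f"{percent_identity(query_row, model_row):.1f}% identity")
```

Percent identity is a rough guide only; the bit score and E-value reported for each hit come from the position-specific scoring of the domain model and remain the values to rely on.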
 
Name Accession Description Interval E-value
Period_C pfam12114
Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is ...
1075-1177 1.17e-25

Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is typically between 164 and 200 amino acids in length. This domain is found associated with pfam08447.


Pssm-ID: 463464  Cd Length: 171  Bit Score: 104.79  E-value: 1.17e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030 1075 SVYSSKISQNGQQSQDVQKKETF-PNVAEEPIWRMIRQTPERILMTYQVPERVKEVVLKEDLEKLESMRQQQPQFSHGQK 1153
Cdd:pfam12114   68 SIDSSENNHKAKKTAEVGEEEHFiKCVLQDPIWLLMANTDDSVMMTYQIPSRDLETVLKEDREKLKAMQKMQPRFTEDQK 147
                           90       100
                   ....*....|....*....|....
gi 2462515030 1154 EELAKVYNWIQSQTVTQEIDIQAC 1177
Cdd:pfam12114  148 GELAEVHPWIQKGGLPAALDLSEC 171
PAS cd00130
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
284-376 1.79e-12

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.


Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.79e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  284 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEhspIRFCTQNGDYIILDSSWSSFVNPW 363
Cdd:cd00130     14 ILYANPAAEQLLGYSPEELIGKSLLDLIHPEDREELRERLENLLSGGEPVTLE---VRLRRKDGSVIWVLVSLTPIRDEG 90
                           90
                   ....*....|...
gi 2462515030  364 SRKISFIIGRHKV 376
Cdd:cd00130     91 GEVIGLLGVVRDI 103
PAS_3 pfam08447
PAS fold; The PAS fold corresponds to the structural domain that has previously been defined ...
284-372 6.89e-12

PAS fold; The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.


Pssm-ID: 430001 [Multi-domain]  Cd Length: 89  Bit Score: 62.74  E-value: 6.89e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  284 FLEVDEKAVPLLGYLPQDLIGT--SILSYLHPEDRSLMVAIHQKVLKYAGhpPFEHsPIRFCTQNGDYIILDSSWSSFVN 361
Cdd:pfam08447    1 IIYWSPRFEEILGYTPEELLGKgeSWLDLVHPDDRERVREALWEALKGGE--PYSG-EYRIRRKDGEYRWVEARARPIRD 77
                           90
                   ....*....|.
gi 2462515030  362 pWSRKISFIIG 372
Cdd:pfam08447   78 -ENGKPVRVIG 87
PAS_11 pfam14598
PAS domain; This family includes the PAS-B domain of NCOA1 (Nuclear receptor coactivator 1), ...
274-376 2.99e-09

PAS domain; This family includes the PAS-B domain of NCOA1 (Nuclear receptor coactivator 1), which binds to an LXXLL motif in the C-terminal region of STAT6 (Signal transducer and activator of transcription 6).


Pssm-ID: 464214 [Multi-domain]  Cd Length: 110  Bit Score: 55.76  E-value: 2.99e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  274 FTTTHTPGCVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHppfEHSPI-RFCTQNGDYIIL 352
Cdd:pfam14598    4 FTTRHDIDGKIISCDTRAPFSLGYEKDELVGRSIYDLVHPQDLRTAKSHLREIIQTRGR---ATSPSyRLRLRDGDFLSV 80
                           90       100
                   ....*....|....*....|....
gi 2462515030  353 DSSWSSFVNPWSRKISFIIGRHKV 376
Cdd:pfam14598   81 HTKSKLFLNQNSNQQPFIMCTHTI 104
PAS pfam00989
PAS fold; The PAS fold corresponds to the structural domain that has previously been defined ...
271-370 3.85e-09

PAS fold; The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya. This domain can bind gases (O2, CO and NO), FAD, 4-hydroxycinnamic acid and NAD+ (Matilla et al., FEMS Microbiology Reviews, fuab043, 45, 2021, 1. https://doi.org/10.1093/femsre/fuab043).


Pssm-ID: 395786 [Multi-domain]  Cd Length: 113  Bit Score: 55.50  E-value: 3.85e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  271 KRIFTTTHTPGCV------FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKyAGHPPFEHSpIRFCT 344
Cdd:pfam00989    4 RAILESLPDGIFVvdedgrILYVNAAAEELLGLSREEVIGKSLLDLIPEEDDAEVAELLRQALL-QGEESRGFE-VSFRV 81
                           90       100
                   ....*....|....*....|....*.
gi 2462515030  345 QNGDYIILDSSWSSFVNPWSRKISFI 370
Cdd:pfam00989   82 PDGRPRHVEVRASPVRDAGGEILGFL 107
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
744-1055 3.05e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly by altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is with the short glutamine repeat in the transcriptional coactivator CREB-binding protein (CBP). This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 58.24  E-value: 3.05e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  744 LPEPPDSSSSNTGSgprrgAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPylvPAFPLPAATSPGREYAAPGTAPEG 823
Cdd:pfam03154  148 IPSPQDNESDSDSS-----AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAA---TAGPTPSAPSVPPQGSPATSQPPN 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  824 LHGLPLSEG--LQPYPAFPFPYLDTfmtvflPDPPVCPLLSPSflpcpflgatassaispsmssamsptldPPPSVTSQR 901
Cdd:pfam03154  220 QTQSTAAPHtlIQQTPTLHPQRLPS------PHPPLQPMTQPP----------------------------PPSQVSPQP 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  902 REEEKWEAQSE--GHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQCVTGNNGSES-SPATTGALSTGSP 978
Cdd:pfam03154  266 LPQPSLHGQMPpmPHSLQTGPSHMQHPVP------PQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQSQLQSQQP 339
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462515030  979 PRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmkNPSHPTASTLSMGLPPSrtPSHPTATVLSTGSPPSESP 1055
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPP--HLSGPSPFQMNSNLPPP--PALKPLSSLSTHHPPSAHP 412
PAS smart00091
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
284-328 1.24e-07

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels.


Pssm-ID: 214512  Cd Length: 67  Bit Score: 49.71  E-value: 1.24e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 2462515030   284 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLK 328
Cdd:smart00091   23 ILYANPAAEELLGYSPEELIGKSLLELIHPEDRERVQEALQRLLS 67
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
748-1058 2.12e-07

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 55.31  E-value: 2.12e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  748 PDSSSSNTGSGPRRGAHQNAQPCCPSAA--------SSPHTSSPTFPPAAMVP-SQAPYLVPAFPLPAATSPGREYAAPG 818
Cdd:pfam05109  466 PTVSTADVTSPTPAGTTSGASPVTPSPSprdngtesKAPDMTSPTSAVTTPTPnATSPTPAVTTPTPNATSPTLGKTSPT 545
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  819 TAPEglhgLPLSEGLQPYPAFPFPYLDTFMTVFLPDPPVCPLLSPS-FLPCPFLGATASSAISPSMS----SAMSPTLDP 893
Cdd:pfam05109  546 SAVT----TPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTpNATSPTVGETSPQANTTNHTlggtSSTPVVTSP 621
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  894 PPSVTSqrreeekweAQSEGHPFITSRSSSPLQLNLLQ-EEMPRPSESpDQMRRNTCPQTEYQCVTGNNGSESSPATTGA 972
Cdd:pfam05109  622 PKNATS---------AVTTGQHNITSSSTSSMSLRPSSiSETLSPSTS-DNSTSHMPLLTSAHPTGGENITQVTPASTST 691
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  973 --LSTGSP-PRenpshPTASALSTGSPPMKNPSHPTASALSTGSPPmKNPSHPTASTLSMGLPPSRTPSHPTATVLSTGS 1049
Cdd:pfam05109  692 hhVSTSSPaPR-----PGTTSQASGPGNSSTSTKPGEVNVTKGTPP-KNATSPQAPSGQKTAVPTVTSTGGKANSTTGGK 765

                   ....*....
gi 2462515030 1050 PPSESPSRT 1058
Cdd:pfam05109  766 HTTGHGART 774
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
741-1056 6.64e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 54.02  E-value: 6.64e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  741 RKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSP-------------TFPPAAMVPSQAPYLVPAFPL-PA 806
Cdd:PHA03307    64 RFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPtppgpsspdppppTPPPASPPPSPAPDLSEMLRPvGS 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  807 ATSPGREYAAPGTAPEGLHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSA 886
Cdd:PHA03307   144 PGPPPAASPPAAGASPAAVASDAASSRQAALPLS-------SPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS 216
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  887 MsptldPPPSVTSQRREEEKWEAQSEGhpfiTSRSSSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNN-GSES 965
Cdd:PHA03307   217 A-----SSPAPAPGRSAADDAGASSSD----SSSSESSGCGWGPENECPLPRPAP-----ITLPTRIWEASGWNGpSSRP 282
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  966 SPATTGALSTGSPPRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmKNPSHPTASTLSMGLPPSRTPSHPTATvl 1045
Cdd:PHA03307   283 GPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS-SESSRGAAVSPGPSPSRSPSPSRPPPP-- 359
                          330
                   ....*....|.
gi 2462515030 1046 STGSPPSESPS 1056
Cdd:PHA03307   360 ADPSSPRKRPR 370
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
745-1057 3.17e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 51.71  E-value: 3.17e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  745 PEPPDSSSSNTGSGPRRGAHQNAQPCC-PSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPL-PAATSPGREYAAPGTAPE 822
Cdd:PHA03307    80 PANESRSTPTWSLSTLAPASPAREGSPtPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPvGSPGPPPAASPPAAGASP 159
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  823 GLHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSAMsptldPPPSVTSQRR 902
Cdd:PHA03307   160 AAVASDAASSRQAALPLS-------SPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASA-----SSPAPAPGRS 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  903 EEEKWEAQSEGhpfiTSRSSSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNN-GSESSPATTGALSTGSPPRE 981
Cdd:PHA03307   228 AADDAGASSSD----SSSSESSGCGWGPENECPLPRPAP-----ITLPTRIWEASGWNGpSSRPGPASSSSSPRERSPSP 298
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462515030  982 NPSHPTASALSTGSPPMKNPSHPTASALSTGSPpmknpSHPTAStlSMGLPPSRTPSHPTAtvLSTGSPPSESPSR 1057
Cdd:PHA03307   299 SPSSPGSGPAPSSPRASSSSSSSRESSSSSTSS-----SSESSR--GAAVSPGPSPSRSPS--PSRPPPPADPSSP 365
PHA03247 PHA03247
large tegument protein UL36; Provisional
731-1056 3.42e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.86  E-value: 3.42e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  731 SAGCRKGKHKRKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLPAATSP 810
Cdd:PHA03247  2656 PAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL 2735
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  811 GREYAAPGTaPEGlHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPS-FLPCPflgATASSAISPSMSSAMSP 889
Cdd:PHA03247  2736 PAAPAPPAV-PAG-PATPGGPARPARPPTT-------AGPPAPAPPAAPAAGPPrRLTRP---AVASLSESRESLPSPWD 2803
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  890 TLDPPPSVTSQRREEEKWEAQSEGHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQcvtgNNGSESSPAT 969
Cdd:PHA03247  2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP------PGPPPPSLPLGGSVAPGGDVR----RRPPSRSPAA 2873
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  970 TGALSTGSP----PRENPSHPTASaLSTGSPPMKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSrtPSHPTATVL 1045
Cdd:PHA03247  2874 KPAAPARPPvrrlARPAVSRSTES-FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP--PLAPTTDPA 2950
                          330
                   ....*....|.
gi 2462515030 1046 STGSPPSESPS 1056
Cdd:PHA03247  2951 GAGEPSGAVPQ 2961
PHA03247 PHA03247
large tegument protein UL36; Provisional
747-1055 4.05e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 4.05e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  747 PPDSSSSNTGSGPRRGAHQNAQPccpsAASSPHTSSPTFPPAAmvPSQAPYLVPAFPLPAATSPGREYAAPGTAPEGLHG 826
Cdd:PHA03247  2592 PPQSARPRAPVDDRGDPRGPAPP----SPLPPDTHAPDPPPPS--PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPR 2665
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  827 LPLSEGLQPYPAFPfpyldtfmtvflPDPPVCPLLSPSFLPCPFLGatassaispsmssamsptlDPPPSvtsQRREEEK 906
Cdd:PHA03247  2666 RARRLGRAAQASSP------------PQRPRRRAARPTVGSLTSLA-------------------DPPPP---PPTPEPA 2711
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  907 WEAQSEGHPfitsrssSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNNGSESSPATTGALSTGSPPRENPSHP 986
Cdd:PHA03247  2712 PHALVSATP-------LPPGPAAARQASPALPAAP-----APPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462515030  987 ----TASALSTGSPPMKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPShPTATVLSTGSPPSESP 1055
Cdd:PHA03247  2780 prrlTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ-PTAPPPPPGPPPPSLP 2851
PHA03247 PHA03247
large tegument protein UL36; Provisional
747-1053 7.61e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 7.61e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  747 PPDSSSSNTGSGPRRGAhQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLPAATSPgreyaAPGTAPEGLHG 826
Cdd:PHA03247  2742 PAVPAGPATPGGPARPA-RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDP-----ADPPAAVLAPA 2815
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  827 LPLSEGLQPYPAFPFPyldtfmTVFLPDPPVCPllsPSFLPCPFlgATASSAISPSMSSAMSPTLDPPPSVTSQRREEEK 906
Cdd:PHA03247  2816 AALPPAASPAGPLPPP------TSAQPTAPPPP---PGPPPPSL--PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVR 2884
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  907 WEAQSEghpfiTSRSSSPLQLNLLQEEMPRPSESPDQMRRNTCPQTEYQCVTGNNG---SESSPATTGALSTGSPPRENP 983
Cdd:PHA03247  2885 RLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPpprPQPPLAPTTDPAGAGEPSGAV 2959
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462515030  984 SHPTASALSTGSPPMKN----PSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPSHPTATVLSTGSPPSE 1053
Cdd:PHA03247  2960 PQPWLGALVPGRVAVPRfrvpQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPDD 3033
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
745-1058 8.04e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.55  E-value: 8.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  745 PEPPDSSSSNTGSGPRRG-AHQNAQPCCPSAASSPHTSSPtFPPAAMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEG 823
Cdd:PHA03307   114 PDPPPPTPPPASPPPSPApDLSEMLRPVGSPGPPPAASPP-AAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAE 192
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  824 LHGLPLSEGLQPYPafpfPYLDTFMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSAMSPTLDPPPS---VTSQ 900
Cdd:PHA03307   193 PPPSTPPAAASPRP----PRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPApitLPTR 268
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  901 RREEEKWEAQSEGHPFITSRSSSPlqlnllqEEMPRPSESPDQMRRNTCPQTeyqcVTGNNGSESSPATTGALSTGSPPR 980
Cdd:PHA03307   269 IWEASGWNGPSSRPGPASSSSSPR-------ERSPSPSPSSPGSGPAPSSPR----ASSSSSSSRESSSSSTSSSSESSR 337
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462515030  981 ENPSHPtasalstGSPPMKNPSHPTASALSTGSPPMKN-PSHPTASTLSMGlPPSRTPSHPTATVLSTGSPPSESPSRT 1058
Cdd:PHA03307   338 GAAVSP-------GPSPSRSPSPSRPPPPADPSSPRKRpRPSRAPSSPAAS-AGRPTRRRARAAVAGRARRRDATGRFP 408
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
744-1056 3.27e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 48.22  E-value: 3.27e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  744 LPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSP-HTSSPTFP-PAAMVPSQAPYLVPAFPLPAATSPGREYA-APGTA 820
Cdd:pfam03154  252 MTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPqPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQ 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  821 PEGLHGLPLSEglQPYPAFPFPyldtfMTVFLPDP--PVCPLLSPSFLPCPflgatasSAISPSMSSAMSPTLDPPPSVt 898
Cdd:pfam03154  332 SQLQSQQPPRE--QPLPPAPLS-----MPHIKPPPttPIPQLPNPQSHKHP-------PHLSGPSPFQMNSNLPPPPAL- 396
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  899 sqrreeEKWEAQSEGHPfiTSRSSSPLQLNLLQEEMPRPSESPDQMrrntcpqTEYQCVTGnngSESSPATTGALSTGSP 978
Cdd:pfam03154  397 ------KPLSSLSTHHP--PSAHPPPLQLMPQSQQLPPPPAQPPVL-------TQSQSLPP---PAASHPPTSGLHQVPS 458
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  979 preNPSHPTASALSTGSPPMKNPSHPTASALSTGS---PPMKNP---SHPTASTLSMGLPPSRTPSHPTATVLSTGS--P 1050
Cdd:pfam03154  459 ---QSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPgiqPPSSASvssSGPVPAAVSCPLPPVQIKEEALDEAEEPESppP 535

                   ....*.
gi 2462515030 1051 PSESPS 1056
Cdd:pfam03154  536 PPRSPS 541
PHA03247 PHA03247
large tegument protein UL36; Provisional
744-1056 5.94e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 5.94e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  744 LPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSP-------HTSSPTFPPAAMVPSQAPYLVPAFPLPAATSPGREYAA 816
Cdd:PHA03247  2624 PDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPgrvsrprRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  817 PGTAPE-----GLHGLPLSEGLQPYPAFPFPYLDTFMTVFLPDPPVCPlLSPSFLPCPFLGATASSAISPSMSSAMSPTL 891
Cdd:PHA03247  2704 PPPTPEpaphaLVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP-GGPARPARPPTTAGPPAPAPPAAPAAGPPRR 2782
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  892 DPPPSVTSQrrEEEKWEAQSEGHPFITSRSSSPLQLNLLQEEMPRPSESPDQMRRNTCPQTEYQCVTGNNGSESSPATTG 971
Cdd:PHA03247  2783 LTRPAVASL--SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG 2860
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  972 ALSTGSPPRENPSHPTAsalstgsppmknPSHPTASALStgSPPMKNPSHPTASTlSMGLPPSRTPSHPTATVLSTGSPP 1051
Cdd:PHA03247  2861 DVRRRPPSRSPAAKPAA------------PARPPVRRLA--RPAVSRSTESFALP-PDQPERPPQPQAPPPPQPQPQPPP 2925

                   ....*
gi 2462515030 1052 SESPS 1056
Cdd:PHA03247  2926 PPQPQ 2930
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
745-1056 4.29e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.76  E-value: 4.29e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  745 PEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLP-------AATSPGREYAAP 817
Cdd:pfam03154  185 SPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplqpmTQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  818 GTAPEGLHGL--PLSEGLQ--------PYPAFPFPYLDTFMTVFLPDPPVCPLLSPSflpcpflgatassaispsmssAM 887
Cdd:pfam03154  265 PLPQPSLHGQmpPMPHSLQtgpshmqhPVPPQPFPLTPQSSQSQVPPGPSPAAPGQS---------------------QQ 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  888 SPTLDPPPSVTSQRR--EEEKWEAQSEGHPFITSRSSSPLQLNLLQEEMPRPSE----SPDQMRRNTCPQTEYQ---CVT 958
Cdd:pfam03154  324 RIHTPPSQSQLQSQQppREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHlsgpSPFQMNSNLPPPPALKplsSLS 403
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  959 GNNGSESSPATTGALSTGSPPRENPSHPTASALSTGSPPmKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPS 1038
Cdd:pfam03154  404 THHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPP-PAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPP 482
                          330
                   ....*....|....*...
gi 2462515030 1039 HPTATVLSTGSPPSESPS 1056
Cdd:pfam03154  483 TSTSSAMPGIQPPSSASV 500
KinA COG5805
Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle ...
273-373 3.23e-03

Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle control, cell division, chromosome partitioning, Signal transduction mechanisms];


Pssm-ID: 444507 [Multi-domain]  Cd Length: 496  Bit Score: 41.64  E-value: 3.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  273 IFTTTHTPGcVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEHSPIrfcTQNGDYIIL 352
Cdd:COG5805    169 LICVIDTDG-RILFINESIERLFGAPREELIGKNLLELLHPCDKEEFKERIESITEVWQEFIIEREII---TKDGRIRYF 244
                           90       100
                   ....*....|....*....|..
gi 2462515030  353 DSSWSSFVNP-WSRKISFIIGR 373
Cdd:COG5805    245 EAVIVPLIDTdGSVKGILVILR 266
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
745-1055 3.33e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.70  E-value: 3.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  745 PEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEGL 824
Cdd:PHA03307   129 SPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPR 208
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  825 HGLPLSEG-LQPYPAFP----FPYLDTFMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSAMSPTLDPPP--SV 897
Cdd:PHA03307   209 RSSPISASaSSPAPAPGrsaaDDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPasSS 288
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  898 TSQRREEEKWEAQSEGHPFITSRSSSPLQLNLLQEEMPrPSESPDQMRRNTCPQTeyqcvTGNNGSES-SPATTGALSTG 976
Cdd:PHA03307   289 SSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSS-SSTSSSSESSRGAAVS-----PGPSPSRSpSPSRPPPPADP 362
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462515030  977 SPPRENPshPTASALSTGSPPMKNPSHPTASALSTGSPPMKNPSHPtastlsmgLPPSRTPSHPTATVLSTGSPPSESP 1055
Cdd:PHA03307   363 SSPRKRP--RPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGR--------FPAGRPRPSPLDAGAASGAFYARYP 431
PHA03379 PHA03379
EBNA-3A; Provisional
745-1055 9.32e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 40.43  E-value: 9.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  745 PEPPDSSSSNTGSGPRRGAHQN-AQPCCPSAASSPHTSSPTfPPAAMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEG 823
Cdd:PHA03379   425 PEVPQSLETATSHGSAQVPEPPpVHDLEPGPLHDQHSMAPC-PVAQLPPGPLQDLEPGDQLPGVVQDGRPACAPVPAPAG 503
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  824 LHGLPLSEGLQPYPAFPF-PYLDTFMTV-FLPDP------PVCPLLSPSFLPCPflGATASSAISPSMSSAMSPTLDPPP 895
Cdd:PHA03379   504 PIVRPWEASLSQVPGVAFaPVMPQPMPVePVPVPtvalerPVCPAPPLIAMQGP--GETSGIVRVRERWRPAPWTPNPPR 581
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  896 SVTSQRREEEKWEAQSEGHPFITSRSSSPLQLNLL--QEEMPRPSEsPDQMRRNTCPQTEYQCVTGNNG----------- 962
Cdd:PHA03379   582 SPSQMSVRDRLARLRAEAQPYQASVEVQPPQLTQVspQQPMEYPLE-PEQQMFPGSPFSQVADVMRAGGvpamqpqyfdl 660
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462515030  963 SESSPATTGALST-------GSPPR--ENPSH---PTASALSTGSP--------PMKNPSHPtASALSTGSPPMKNPSHP 1022
Cdd:PHA03379   661 PLQQPISQGAPLAplrasmgPVPPVpaTQPQYfdiPLTEPINQGASaahflpqqPMEGPLVP-ERWMFQGATLSQSVRPG 739
                          330       340       350
                   ....*....|....*....|....*....|...
gi 2462515030 1023 TASTLSMGLPPSRTPSHPTATVLSTGSPPSESP 1055
Cdd:PHA03379   740 VAQSQYFDLPLTQPINHGAPAAHFLHQPPMEGP 772
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
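
The per-hit summary lines above ("Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...") can be read programmatically and checked against the E-value threshold of 0.01 listed in the preset options. The following is a minimal sketch, not part of the CD-Search output: the function names parse_hits and passes_threshold are hypothetical, the regular expression assumes the summary-line layout seen on this page, and treating the threshold as inclusive (<=) is an assumption.

```python
import re

# Assumed layout of a per-hit summary line, as it appears on this page, e.g.
# "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 4.05e-06"
HIT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<kind>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hits(text):
    """Return one dict per summary line found in `text` (hypothetical helper)."""
    hits = []
    for m in HIT_LINE.finditer(text):
        hits.append({
            "pssm_id": int(m["pssm_id"]),
            "multi_domain": m["kind"] == "Multi-domain",
            "cd_length": int(m["cd_length"]),
            "bit_score": float(m["bit_score"]),
            "e_value": float(m["e_value"]),
        })
    return hits


def passes_threshold(hit, threshold=0.01):
    """True if the hit's E-value is at or below the threshold.

    Assumption: the cutoff is inclusive; the page does not say.
    """
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    # Summary lines copied from this page.
    sample = """
Pssm-ID: 463464  Cd Length: 171  Bit Score: 104.79  E-value: 1.17e-25
Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 4.05e-06
Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 40.43  E-value: 9.32e-03
"""
    for hit in parse_hits(sample):
        print(hit["pssm_id"], hit["e_value"], passes_threshold(hit))
```

Every hit listed on this page has an E-value below 0.01, so each one passes this check, which is consistent with the threshold having been applied when the results were generated.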

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.