| Name | Accession | Description | Interval | E-value |
| GPS2_interact | pfam15784 | G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain ... | 141-229 | 4.22e-41 |
|
G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain of two co-repressor protein-families found in vertebrates. The domain is found in NCoR and SMRT proteins; N-CoR (nuclear receptor co-repressor) and SMRT (silencing mediator for retinoid and thyroid receptors) are related corepressors that mediate transcriptional repression by unliganded nuclear receptors and other classes of transcriptional repressors. GPS2 is a stoichiometric subunit of the N-CoR-HDAC3 complex. GPS2 links the complex to membrane receptor-related intracellular JNK (c-Jun amino-terminal kinase) signalling pathways.
Pssm-ID: 464868 [Multi-domain] Cd Length: 89 Bit Score: 146.93 E-value: 4.22e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 141 LTGKLEPVSPPSPPHTDPELELVPPRLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKPPEPEKPVSPPPIE 220
Cdd:pfam15784 1 YYPQVEAISPTLPSPEGQDQELSPFRSSKDELLQNIDKVDREIAKVEQQISKLKKKQQQLEEEAAKPPEPEEPVSPPPSE 80
|
....*....
gi 331284176 221 SKHRSLVQI 229
Cdd:pfam15784 81 SKHRSLAQI 89
|
|
| Myb_DNA-binding | pfam00249 | Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... | 613-656 | 7.61e-13 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 459731 [Multi-domain] Cd Length: 46 Bit Score: 64.83 E-value: 7.61e-13
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 331284176 613 RWTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNYK 656
Cdd:pfam00249 3 PWTPEEDELLLEAVEKLGNRWKKIAKLLPGRTDNQCKNRWQNYL 46
|
|
| SANT super family | cl21498 | 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ... | 431-474 | 1.07e-06 |
|
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA. The actual alignment was detected with superfamily member cd11661:
Pssm-ID: 473887 [Multi-domain] Cd Length: 46 Bit Score: 47.22 E-value: 1.07e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 331284176 431 WSEQEKETFREKFMQHPKNFGLI-ASFLERKTVAECVLYYYLTKK 474
Cdd:cd11661 2 WSESEAKLFEEGLRKYGKDFHDIrQDFLPWKSVGELVEFYYMWKK 46
|
|
| PHA03247 super family | cl33720 | large tegument protein UL36; Provisional | 740-1209 | 1.62e-06 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 1.62e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 740 AAKDTGqnGPKPPatLGADGPPPGPPTPPPEDIPAPTEPTPASEATGAPTPPPAPPSPSAPPPVVPKEEKEEETAAAPPV 819
Cdd:PHA03247 2544 ASDDAG--DPPPP--LPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPP 2619
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 820 EEGEEQKPPAAEELAVDTGKAEEPVKSECTEEAEEGPAKGKdaeaaeataegaLKAEKKEGGSGRATTAKSS------GA 893
Cdd:PHA03247 2620 DTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGR------------VSRPRRARRLGRAAQASSPpqrprrRA 2687
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 894 PQDSDSSATCSADevdeaeggdknrllsPRPSLLTPTGDPRAnASPQKPLDLKQLKQRAAAiPPIQVTKVHEPPREDAAP 973
Cdd:PHA03247 2688 ARPTVGSLTSLAD---------------PPPPPPTPEPAPHA-LVSATPLPPGPAAARQAS-PALPAAPAPPAVPAGPAT 2750
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 974 TKPAPPAPPPPQNLQPESDAPqqpgssPRGKSRSPAPPADKEAEKPVFFPAFAAEAQKLPGDPPCWTSGlPFPVPPREVI 1053
Cdd:PHA03247 2751 PGGPARPARPPTTAGPPAPAP------PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA-PAAALPPAAS 2823
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1054 KASPHAPDPSAFSYAPPGHPLPL-------GLHDTARPVLPRPPTISnPPPLISSAKHPSVLERQIGAISQGMSvqlhvP 1126
Cdd:PHA03247 2824 PAGPLPPPTSAQPTAPPPPPGPPppslplgGSVAPGGDVRRRPPSRS-PAAKPAAPARPPVRRLARPAVSRSTE-----S 2897
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1127 YSEHAKAPVGPVTMGLPLPMDPKKLAPFSGVKQEQLSPRGQAGPPESlgvPTAQEASVlrGTALGSVPG---GSITKG-- 1201
Cdd:PHA03247 2898 FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLA---PTTDPAGA--GEPSGAVPQpwlGALVPGrv 2972
|
....*....
gi 331284176 1202 -IPSTRVPS 1209
Cdd:PHA03247 2973 aVPRFRVPQ 2981
|
|
| SMC_N super family | cl47134 | RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... | 100-457 | 3.24e-06 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination. The actual alignment was detected with superfamily member TIGR02169:
Pssm-ID: 481474 [Multi-domain] Cd Length: 1164 Bit Score: 52.76 E-value: 3.24e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 100 MEFIESKRP-----RLELLPDPLLRPSPLLATGQPAGSEDLTKDRSLTGKLEPVSppsppHTDPELELVPPRLSKE--EL 172
Cdd:TIGR02169 626 VEDIEAARRlmgkyRMVTLEGELFEKSGAMTGGSRAPRGGILFSRSEPAELQRLR-----ERLEGLKRELSSLQSElrRI 700
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 173 IQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKPPEPEKPvspppIESKHRSLVQIIYDENRKKAEAAHRI--LEGLGP 250
Cdd:TIGR02169 701 ENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEE-----LEEDLSSLEQEIENVKSELKELEARIeeLEEDLH 775
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 251 QVELPLyNQPSDtRQYHENIKINQAMRKKLILYFKR-------------RNHARKQWEQKFCQRYDQLMEAWEKKV---- 313
Cdd:TIGR02169 776 KLEEAL-NDLEA-RLSHSRIPEIQAELSKLEEEVSRiearlreieqklnRLTLEKEYLEKEIQELQEQRIDLKEQIksie 853
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 314 ERIEN-NPRRRAKESKVREY------YEKQFPEIRKQR-ELQERMQRVGQRGSGLSMSAARSEHEVSEIIDGLSEQEN-- 383
Cdd:TIGR02169 854 KEIENlNGKKEELEEELEELeaalrdLESRLGDLKKERdELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEel 933
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 384 --LEKQMRQLAVIPPMLYDADQ------------QRIKFINM-------------NGLMADPMKVYKDR----QVMNMWS 432
Cdd:TIGR02169 934 seIEDPKGEDEEIPEEELSLEDvqaelqrveeeiRALEPVNMlaiqeyeevlkrlDELKEKRAKLEEERkailERIEEYE 1013
|
410 420
....*....|....*....|....*
gi 331284176 433 EQEKETFREKFMQHPKNFGLIASFL 457
Cdd:TIGR02169 1014 KKKREVFMEAFEAINENFNEIFAEL 1038
|
|
| PHA03247 super family | cl33720 | large tegument protein UL36; Provisional | 1875-2252 | 1.87e-05 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.32 E-value: 1.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1875 TAVEPSTPTVLRSTSTSSPVRPAATFPPA-THCPLGGTLDGVYPTLMEPVLLPKEAPRVARPERPRAdtghaflAKPPAR 1953
Cdd:PHA03247 2600 APVDDRGDPRGPAPPSPLPPDTHAPDPPPpSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR-------ARRLGR 2672
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1954 SGlePASSPSKGSEPRPLVPPVSGHATIARTPAKNLAPHHASPDPPAPPASASDPHREKTQSKPFSIQEL---------- 2023
Cdd:PHA03247 2673 AA--QASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAppavpagpat 2750
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 2024 ---ELRSLGYHGSSYSPEGVEPVSPVSSPSLTHDKGLPKHLEELDKShLEGELRPKQPGPVKLGGEAAHLPHLRPLPESQ 2100
Cdd:PHA03247 2751 pggPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES-LPSPWDPADPPAAVLAPAAALPPAASPAGPLP 2829
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 2101 PSSSPLlQTAPGVKGHQRVVTLAQHISEVITQDYTRHHPQQLSAPLPAPLYSFPGASCPVLDLRRPPSDLYLPPPDHGAP 2180
Cdd:PHA03247 2830 PPTSAQ-PTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP 2908
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 2181 ARGSPHSEGGKRSPEPNKTSVLGGGEDGIEPVSPPEGMTEP---------------GHSRSAVYPLLYRDGEQTEPSRMG 2245
Cdd:PHA03247 2909 PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPagagepsgavpqpwlGALVPGRVAVPRFRVPQPAPSREA 2988
|
....*..
gi 331284176 2246 SKSPGNT 2252
Cdd:PHA03247 2989 PASSTPP 2995
|
|
| RSC8 super family | cl34960 | RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / ... | 540-648 | 6.60e-04 |
|
RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / Transcription]; The actual alignment was detected with superfamily member COG5259:
Pssm-ID: 227584 [Multi-domain] Cd Length: 531 Bit Score: 44.88 E-value: 6.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 540 DKEDLLKEKTDDTSGEDNDEKEAVASKGRKTANSQGRrKGRITRSMANEANS--------EEAITPQ--QSAELASMELN 609
Cdd:COG5259 196 ENYSPSLKSPKKESQGKVDELKDHSEKHPSSCSCCGN-KSFNTRYHNLRAEKynscsecyDQGRFPSefTSSDFKPVTIS 274
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 331284176 610 ESSR---WTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQC 648
Cdd:COG5259 275 LLIRdknWSRQELLLLLEGIEMYGDDWDKVARHVGTKTKEQC 316
|
|
|
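Each hit above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal Python sketch for pulling those fields out of such a line, assuming the layout stays exactly as it appears in this listing. The names `SUMMARY_RE`, `parse_summary`, and `interval_length` are hypothetical helpers introduced for illustration; they are not part of any NCBI tool or API.

```python
import re

# Assumed layout of the per-hit summary line, taken from this listing only.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<tag>[^\]]+)\]\s*)?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "tag": m["tag"],  # e.g. "Multi-domain"; None when the bracket is absent
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def interval_length(interval):
    """Number of query residues covered by an inclusive 'start-end' interval."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1


hit = parse_summary(
    "Pssm-ID: 464868 [Multi-domain] Cd Length: 89 Bit Score: 146.93 E-value: 4.22e-41"
)
covered = interval_length("141-229")
```

For the GPS2_interact hit, the query interval 141-229 spans 89 residues, the same as the reported Cd Length of 89.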
| Name | Accession | Description | Interval | E-value |
| GPS2_interact | pfam15784 | G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain ... | 141-229 | 4.22e-41 |
|
G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain of two co-repressor protein-families found in vertebrates. The domain is found in NCoR and SMRT proteins; N-CoR (nuclear receptor co-repressor) and SMRT (silencing mediator for retinoid and thyroid receptors) are related corepressors that mediate transcriptional repression by unliganded nuclear receptors and other classes of transcriptional repressors. GPS2 is a stoichiometric subunit of the N-CoR-HDAC3 complex. GPS2 links the complex to membrane receptor-related intracellular JNK (c-Jun amino-terminal kinase) signalling pathways.
Pssm-ID: 464868 [Multi-domain] Cd Length: 89 Bit Score: 146.93 E-value: 4.22e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 141 LTGKLEPVSPPSPPHTDPELELVPPRLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKPPEPEKPVSPPPIE 220
Cdd:pfam15784 1 YYPQVEAISPTLPSPEGQDQELSPFRSSKDELLQNIDKVDREIAKVEQQISKLKKKQQQLEEEAAKPPEPEEPVSPPPSE 80
|
....*....
gi 331284176 221 SKHRSLVQI 229
Cdd:pfam15784 81 SKHRSLAQI 89
|
|
| Myb_DNA-binding | pfam00249 | Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... | 613-656 | 7.61e-13 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 459731 [Multi-domain] Cd Length: 46 Bit Score: 64.83 E-value: 7.61e-13
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 331284176 613 RWTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNYK 656
Cdd:pfam00249 3 PWTPEEDELLLEAVEKLGNRWKKIAKLLPGRTDNQCKNRWQNYL 46
|
|
| SANT | smart00717 | SANT: 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains | 613-658 | 1.32e-10 |
|
SANT: 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains
Pssm-ID: 197842 [Multi-domain] Cd Length: 49 Bit Score: 58.39 E-value: 1.32e-10
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 331284176 613 RWTEEEMETAKKGLLEHG-RNWSAIARMVGSKTVSQCKNFYFNYKKR 658
Cdd:smart00717 3 EWTEEEDELLIELVKKYGkNNWEKIAKELPGRTAEQCRERWRNLLKP 49
|
|
| SANT | cd00167 | 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ... | 613-656 | 1.87e-10 |
|
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Pssm-ID: 238096 [Multi-domain] Cd Length: 45 Bit Score: 57.97 E-value: 1.87e-10
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 331284176 613 RWTEEEMETAKKGLLEHG-RNWSAIARMVGSKTVSQCKNFYFNYK 656
Cdd:cd00167 1 PWTEEEDELLLEAVKKYGkNNWEKIAKELPGRTPKQCRERWRNLL 45
|
|
| SANT_MTA3_like | cd11661 | Myb-Like DNA-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ... | 431-474 | 1.07e-06 |
|
Myb-Like DNA-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.
Pssm-ID: 212559 [Multi-domain] Cd Length: 46 Bit Score: 47.22 E-value: 1.07e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 331284176 431 WSEQEKETFREKFMQHPKNFGLI-ASFLERKTVAECVLYYYLTKK 474
Cdd:cd11661 2 WSESEAKLFEEGLRKYGKDFHDIrQDFLPWKSVGELVEFYYMWKK 46
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 740-1209 | 1.62e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 1.62e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 740 AAKDTGqnGPKPPatLGADGPPPGPPTPPPEDIPAPTEPTPASEATGAPTPPPAPPSPSAPPPVVPKEEKEEETAAAPPV 819
Cdd:PHA03247 2544 ASDDAG--DPPPP--LPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPP 2619
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 820 EEGEEQKPPAAEELAVDTGKAEEPVKSECTEEAEEGPAKGKdaeaaeataegaLKAEKKEGGSGRATTAKSS------GA 893
Cdd:PHA03247 2620 DTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGR------------VSRPRRARRLGRAAQASSPpqrprrRA 2687
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 894 PQDSDSSATCSADevdeaeggdknrllsPRPSLLTPTGDPRAnASPQKPLDLKQLKQRAAAiPPIQVTKVHEPPREDAAP 973
Cdd:PHA03247 2688 ARPTVGSLTSLAD---------------PPPPPPTPEPAPHA-LVSATPLPPGPAAARQAS-PALPAAPAPPAVPAGPAT 2750
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 974 TKPAPPAPPPPQNLQPESDAPqqpgssPRGKSRSPAPPADKEAEKPVFFPAFAAEAQKLPGDPPCWTSGlPFPVPPREVI 1053
Cdd:PHA03247 2751 PGGPARPARPPTTAGPPAPAP------PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA-PAAALPPAAS 2823
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1054 KASPHAPDPSAFSYAPPGHPLPL-------GLHDTARPVLPRPPTISnPPPLISSAKHPSVLERQIGAISQGMSvqlhvP 1126
Cdd:PHA03247 2824 PAGPLPPPTSAQPTAPPPPPGPPppslplgGSVAPGGDVRRRPPSRS-PAAKPAAPARPPVRRLARPAVSRSTE-----S 2897
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1127 YSEHAKAPVGPVTMGLPLPMDPKKLAPFSGVKQEQLSPRGQAGPPESlgvPTAQEASVlrGTALGSVPG---GSITKG-- 1201
Cdd:PHA03247 2898 FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLA---PTTDPAGA--GEPSGAVPQpwlGALVPGrv 2972
|
....*....
gi 331284176 1202 -IPSTRVPS 1209
Cdd:PHA03247 2973 aVPRFRVPQ 2981
|
|
| Myb_DNA-binding | pfam00249 | Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... | 431-470 | 2.77e-06 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 459731 [Multi-domain] Cd Length: 46 Bit Score: 45.96 E-value: 2.77e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 331284176 431 WSEQEKETFREKFMQHPKNFGLIASFLERKTVAECVLYYY 470
Cdd:pfam00249 4 WTPEEDELLLEAVEKLGNRWKKIAKLLPGRTDNQCKNRWQ 43
|
|
| SMC_prok_A | TIGR02169 | chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... | 100-457 | 3.24e-06 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 52.76 E-value: 3.24e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 100 MEFIESKRP-----RLELLPDPLLRPSPLLATGQPAGSEDLTKDRSLTGKLEPVSppsppHTDPELELVPPRLSKE--EL 172
Cdd:TIGR02169 626 VEDIEAARRlmgkyRMVTLEGELFEKSGAMTGGSRAPRGGILFSRSEPAELQRLR-----ERLEGLKRELSSLQSElrRI 700
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 173 IQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKPPEPEKPvspppIESKHRSLVQIIYDENRKKAEAAHRI--LEGLGP 250
Cdd:TIGR02169 701 ENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEE-----LEEDLSSLEQEIENVKSELKELEARIeeLEEDLH 775
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 251 QVELPLyNQPSDtRQYHENIKINQAMRKKLILYFKR-------------RNHARKQWEQKFCQRYDQLMEAWEKKV---- 313
Cdd:TIGR02169 776 KLEEAL-NDLEA-RLSHSRIPEIQAELSKLEEEVSRiearlreieqklnRLTLEKEYLEKEIQELQEQRIDLKEQIksie 853
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 314 ERIEN-NPRRRAKESKVREY------YEKQFPEIRKQR-ELQERMQRVGQRGSGLSMSAARSEHEVSEIIDGLSEQEN-- 383
Cdd:TIGR02169 854 KEIENlNGKKEELEEELEELeaalrdLESRLGDLKKERdELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEel 933
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 384 --LEKQMRQLAVIPPMLYDADQ------------QRIKFINM-------------NGLMADPMKVYKDR----QVMNMWS 432
Cdd:TIGR02169 934 seIEDPKGEDEEIPEEELSLEDvqaelqrveeeiRALEPVNMlaiqeyeevlkrlDELKEKRAKLEEERkailERIEEYE 1013
|
410 420
....*....|....*....|....*
gi 331284176 433 EQEKETFREKFMQHPKNFGLIASFL 457
Cdd:TIGR02169 1014 KKKREVFMEAFEAINENFNEIFAEL 1038
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 1875-2252 | 1.87e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.32 E-value: 1.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1875 TAVEPSTPTVLRSTSTSSPVRPAATFPPA-THCPLGGTLDGVYPTLMEPVLLPKEAPRVARPERPRAdtghaflAKPPAR 1953
Cdd:PHA03247 2600 APVDDRGDPRGPAPPSPLPPDTHAPDPPPpSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR-------ARRLGR 2672
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1954 SGlePASSPSKGSEPRPLVPPVSGHATIARTPAKNLAPHHASPDPPAPPASASDPHREKTQSKPFSIQEL---------- 2023
Cdd:PHA03247 2673 AA--QASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAppavpagpat 2750
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 2024 ---ELRSLGYHGSSYSPEGVEPVSPVSSPSLTHDKGLPKHLEELDKShLEGELRPKQPGPVKLGGEAAHLPHLRPLPESQ 2100
Cdd:PHA03247 2751 pggPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES-LPSPWDPADPPAAVLAPAAALPPAASPAGPLP 2829
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 2101 PSSSPLlQTAPGVKGHQRVVTLAQHISEVITQDYTRHHPQQLSAPLPAPLYSFPGASCPVLDLRRPPSDLYLPPPDHGAP 2180
Cdd:PHA03247 2830 PPTSAQ-PTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP 2908
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 2181 ARGSPHSEGGKRSPEPNKTSVLGGGEDGIEPVSPPEGMTEP---------------GHSRSAVYPLLYRDGEQTEPSRMG 2245
Cdd:PHA03247 2909 PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPagagepsgavpqpwlGALVPGRVAVPRFRVPQPAPSREA 2988
|
....*..
gi 331284176 2246 SKSPGNT 2252
Cdd:PHA03247 2989 PASSTPP 2995
|
|
| SANT | smart00717 | SANT: 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains | 431-474 | 7.98e-05 |
|
SANT: 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains
Pssm-ID: 197842 [Multi-domain] Cd Length: 49 Bit Score: 42.21 E-value: 7.98e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 331284176 431 WSEQEKETFREKFMQHP-KNFGLIASFLERKTVAECVLYYYLTKK 474
Cdd:smart00717 4 WTEEEDELLIELVKKYGkNNWEKIAKELPGRTAEQCRERWRNLLK 48
|
|
| DUF4455 | pfam14643 | Domain of unknown function (DUF4455); This domain family is found in bacteria and eukaryotes, ... | 132-391 | 2.79e-04 |
|
Domain of unknown function (DUF4455); This domain family is found in bacteria and eukaryotes, and is approximately 480 amino acids in length. There are two completely conserved residues (W and P) that may be functionally important.
Pssm-ID: 464231 [Multi-domain] Cd Length: 469 Bit Score: 46.12 E-value: 2.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 132 SEDLTKDRSLTGKLEpvsppspphTDPELElvppRLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAkppEPE 211
Cdd:pfam14643 42 AESDEEINALFKKLE---------DDDALE----DYTIQQLEELWDIVAQHSLLRKSWIKELDETLEKLEKERA---DKL 105
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 212 KPVspppIESKHRSLVQIIYdenrkkaeaahrILEglgPQVELPLYNqpsdtrqyhENIKINQAM---RKKLILYFKRRN 288
Cdd:pfam14643 106 KSV----LKKYVEILEDIAH------------LLP---PDVYRLIDK---------EAMEINQALlenRRAYAKLFANLM 157
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 289 HARKQWEQKFCQRYDQLMEAWEK--------------KVERIENNPRRRakeskvREYYEKqfpeIRKQRELQERMQRVG 354
Cdd:pfam14643 158 EAELKQELSFRLRWQDRVDRWKAlktehliqefkefiASEEIQNPPERK------KELEEM----LKEQKKLQQKRLELL 227
|
250 260 270
....*....|....*....|....*....|....*..
gi 331284176 355 QRGSGLSMSAARSEhEVSEIIDGLseqENLEKQMRQL 391
Cdd:pfam14643 228 QKISDLLPPAYSKS-KVEEWWASL---EALNEQLDQY 260
|
|
| RSC8 | COG5259 | RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / ... | 540-648 | 6.60e-04 |
|
RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / Transcription];
Pssm-ID: 227584 [Multi-domain] Cd Length: 531 Bit Score: 44.88 E-value: 6.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 540 DKEDLLKEKTDDTSGEDNDEKEAVASKGRKTANSQGRrKGRITRSMANEANS--------EEAITPQ--QSAELASMELN 609
Cdd:COG5259 196 ENYSPSLKSPKKESQGKVDELKDHSEKHPSSCSCCGN-KSFNTRYHNLRAEKynscsecyDQGRFPSefTSSDFKPVTIS 274
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 331284176 610 ESSR---WTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQC 648
Cdd:COG5259 275 LLIRdknWSRQELLLLLEGIEMYGDDWDKVARHVGTKTKEQC 316
|
|
|
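The alignment blocks above pair a query row (gi 331284176) with a domain-model row (Cdd:...). As a rough way to compare such a pair, the sketch below computes a simple column identity over positions where neither row has a gap. This is only an illustrative measure under stated assumptions: the function name `percent_identity` is hypothetical, lowercase residues are compared case-insensitively, and the result is not a score reported by the search itself.

```python
def percent_identity(query, subject):
    """Percent of gap-free columns where the two aligned rows carry the same residue.

    Both rows must have equal length; '-' marks a gap. Lowercase residues are
    compared case-insensitively (an assumption made for this sketch).
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    # Keep only columns where both rows have a residue.
    columns = [(q, s) for q, s in zip(query, subject) if q != "-" and s != "-"]
    if not columns:
        return 0.0
    matches = sum(q.upper() == s.upper() for q, s in columns)
    return 100.0 * matches / len(columns)


# Final block of the GPS2_interact alignment (query 221-229 vs. model 81-89).
tail_identity = percent_identity("SKHRSLVQI", "SKHRSLAQI")
```

In that nine-residue block, eight columns match and one differs (V versus A), so the value is 8/9 of 100.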
|
Name |
Accession |
Description |
Interval |
E-value |
| GPS2_interact |
pfam15784 |
G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain ... |
141-229 |
4.22e-41 |
|
G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain of two co-repressor protein-families found in vertebrates. The domain is found in NCoR and SMRT proteins; N-CoR (nuclear receptor co-repressor) and SMRT (silencing mediator for retinoid and thyroid receptors) are related corepressors that mediate transcriptional repression by unliganded nuclear receptors and other classes of transcriptional repressors. GPS2 is a stoichiometric subunit of the N-CoR-HDAC3 complex. GPS2 links the complex to membrane receptor-related intracellular JNK (c-Jun amino-terminal kinase) signalling pathways.
Pssm-ID: 464868 [Multi-domain] Cd Length: 89 Bit Score: 146.93 E-value: 4.22e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 141 LTGKLEPVSPPSPPHTDPELELVPPRLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKPPEPEKPVSPPPIE 220
Cdd:pfam15784 1 YYPQVEAISPTLPSPEGQDQELSPFRSSKDELLQNIDKVDREIAKVEQQISKLKKKQQQLEEEAAKPPEPEEPVSPPPSE 80
|
....*....
gi 331284176 221 SKHRSLVQI 229
Cdd:pfam15784 81 SKHRSLAQI 89
|
|
| Myb_DNA-binding |
pfam00249 |
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... |
613-656 |
7.61e-13 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 459731 [Multi-domain] Cd Length: 46 Bit Score: 64.83 E-value: 7.61e-13
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 331284176 613 RWTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNYK 656
Cdd:pfam00249 3 PWTPEEDELLLEAVEKLGNRWKKIAKLLPGRTDNQCKNRWQNYL 46
|
|
| SANT |
smart00717 |
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains; |
613-658 |
1.32e-10 |
|
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
Pssm-ID: 197842 [Multi-domain] Cd Length: 49 Bit Score: 58.39 E-value: 1.32e-10
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 331284176 613 RWTEEEMETAKKGLLEHG-RNWSAIARMVGSKTVSQCKNFYFNYKKR 658
Cdd:smart00717 3 EWTEEEDELLIELVKKYGkNNWEKIAKELPGRTAEQCRERWRNLLKP 49
|
|
| SANT |
cd00167 |
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ... |
613-656 |
1.87e-10 |
|
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Pssm-ID: 238096 [Multi-domain] Cd Length: 45 Bit Score: 57.97 E-value: 1.87e-10
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 331284176 613 RWTEEEMETAKKGLLEHG-RNWSAIARMVGSKTVSQCKNFYFNYK 656
Cdd:cd00167 1 PWTEEEDELLLEAVKKYGkNNWEKIAKELPGRTPKQCRERWRNLL 45
|
|
| SANT_MTA3_like |
cd11661 |
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ... |
431-474 |
1.07e-06 |
|
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.
Pssm-ID: 212559 [Multi-domain] Cd Length: 46 Bit Score: 47.22 E-value: 1.07e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 331284176 431 WSEQEKETFREKFMQHPKNFGLI-ASFLERKTVAECVLYYYLTKK 474
Cdd:cd11661 2 WSESEAKLFEEGLRKYGKDFHDIrQDFLPWKSVGELVEFYYMWKK 46
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
740-1209 |
1.62e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 1.62e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 740 AAKDTGqnGPKPPatLGADGPPPGPPTPPPEDIPAPTEPTPASEATGAPTPPPAPPSPSAPPPVVPKEEKEEETAAAPPV 819
Cdd:PHA03247 2544 ASDDAG--DPPPP--LPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPP 2619
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 820 EEGEEQKPPAAEELAVDTGKAEEPVKSECTEEAEEGPAKGKdaeaaeataegaLKAEKKEGGSGRATTAKSS------GA 893
Cdd:PHA03247 2620 DTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGR------------VSRPRRARRLGRAAQASSPpqrprrRA 2687
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 894 PQDSDSSATCSADevdeaeggdknrllsPRPSLLTPTGDPRAnASPQKPLDLKQLKQRAAAiPPIQVTKVHEPPREDAAP 973
Cdd:PHA03247 2688 ARPTVGSLTSLAD---------------PPPPPPTPEPAPHA-LVSATPLPPGPAAARQAS-PALPAAPAPPAVPAGPAT 2750
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 974 TKPAPPAPPPPQNLQPESDAPqqpgssPRGKSRSPAPPADKEAEKPVFFPAFAAEAQKLPGDPPCWTSGlPFPVPPREVI 1053
Cdd:PHA03247 2751 PGGPARPARPPTTAGPPAPAP------PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA-PAAALPPAAS 2823
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1054 KASPHAPDPSAFSYAPPGHPLPL-------GLHDTARPVLPRPPTISnPPPLISSAKHPSVLERQIGAISQGMSvqlhvP 1126
Cdd:PHA03247 2824 PAGPLPPPTSAQPTAPPPPPGPPppslplgGSVAPGGDVRRRPPSRS-PAAKPAAPARPPVRRLARPAVSRSTE-----S 2897
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1127 YSEHAKAPVGPVTMGLPLPMDPKKLAPFSGVKQEQLSPRGQAGPPESlgvPTAQEASVlrGTALGSVPG---GSITKG-- 1201
Cdd:PHA03247 2898 FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLA---PTTDPAGA--GEPSGAVPQpwlGALVPGrv 2972
|
....*....
gi 331284176 1202 -IPSTRVPS 1209
Cdd:PHA03247 2973 aVPRFRVPQ 2981
|
|
| Myb_DNA-binding |
pfam00249 |
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... |
431-470 |
2.77e-06 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 459731 [Multi-domain] Cd Length: 46 Bit Score: 45.96 E-value: 2.77e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 331284176 431 WSEQEKETFREKFMQHPKNFGLIASFLERKTVAECVLYYY 470
Cdd:pfam00249 4 WTPEEDELLLEAVEKLGNRWKKIAKLLPGRTDNQCKNRWQ 43
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
100-457 |
3.24e-06 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 52.76 E-value: 3.24e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 100 MEFIESKRP-----RLELLPDPLLRPSPLLATGQPAGSEDLTKDRSLTGKLEPVSppsppHTDPELELVPPRLSKE--EL 172
Cdd:TIGR02169 626 VEDIEAARRlmgkyRMVTLEGELFEKSGAMTGGSRAPRGGILFSRSEPAELQRLR-----ERLEGLKRELSSLQSElrRI 700
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 173 IQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKPPEPEKPvspppIESKHRSLVQIIYDENRKKAEAAHRI--LEGLGP 250
Cdd:TIGR02169 701 ENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEE-----LEEDLSSLEQEIENVKSELKELEARIeeLEEDLH 775
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 251 QVELPLyNQPSDtRQYHENIKINQAMRKKLILYFKR-------------RNHARKQWEQKFCQRYDQLMEAWEKKV---- 313
Cdd:TIGR02169 776 KLEEAL-NDLEA-RLSHSRIPEIQAELSKLEEEVSRiearlreieqklnRLTLEKEYLEKEIQELQEQRIDLKEQIksie 853
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 314 ERIEN-NPRRRAKESKVREY------YEKQFPEIRKQR-ELQERMQRVGQRGSGLSMSAARSEHEVSEIIDGLSEQEN-- 383
Cdd:TIGR02169 854 KEIENlNGKKEELEEELEELeaalrdLESRLGDLKKERdELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEel 933
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 384 --LEKQMRQLAVIPPMLYDADQ------------QRIKFINM-------------NGLMADPMKVYKDR----QVMNMWS 432
Cdd:TIGR02169 934 seIEDPKGEDEEIPEEELSLEDvqaelqrveeeiRALEPVNMlaiqeyeevlkrlDELKEKRAKLEEERkailERIEEYE 1013
|
410 420
....*....|....*....|....*
gi 331284176 433 EQEKETFREKFMQHPKNFGLIASFL 457
Cdd:TIGR02169 1014 KKKREVFMEAFEAINENFNEIFAEL 1038
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1875-2252 |
1.87e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.32 E-value: 1.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1875 TAVEPSTPTVLRSTSTSSPVRPAATFPPA-THCPLGGTLDGVYPTLMEPVLLPKEAPRVARPERPRAdtghaflAKPPAR 1953
Cdd:PHA03247 2600 APVDDRGDPRGPAPPSPLPPDTHAPDPPPpSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR-------ARRLGR 2672
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1954 SGlePASSPSKGSEPRPLVPPVSGHATIARTPAKNLAPHHASPDPPAPPASASDPHREKTQSKPFSIQEL---------- 2023
Cdd:PHA03247 2673 AA--QASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAppavpagpat 2750
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 2024 ---ELRSLGYHGSSYSPEGVEPVSPVSSPSLTHDKGLPKHLEELDKShLEGELRPKQPGPVKLGGEAAHLPHLRPLPESQ 2100
Cdd:PHA03247 2751 pggPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES-LPSPWDPADPPAAVLAPAAALPPAASPAGPLP 2829
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 2101 PSSSPLlQTAPGVKGHQRVVTLAQHISEVITQDYTRHHPQQLSAPLPAPLYSFPGASCPVLDLRRPPSDLYLPPPDHGAP 2180
Cdd:PHA03247 2830 PPTSAQ-PTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP 2908
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 2181 ARGSPHSEGGKRSPEPNKTSVLGGGEDGIEPVSPPEGMTEP---------------GHSRSAVYPLLYRDGEQTEPSRMG 2245
Cdd:PHA03247 2909 PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPagagepsgavpqpwlGALVPGRVAVPRFRVPQPAPSREA 2988
|
....*..
gi 331284176 2246 SKSPGNT 2252
Cdd:PHA03247 2989 PASSTPP 2995
|
|
| SANT |
cd00167 |
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ... |
431-473 |
2.41e-05 |
|
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Pssm-ID: 238096 [Multi-domain] Cd Length: 45 Bit Score: 43.33 E-value: 2.41e-05
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 331284176 431 WSEQEKETFREKFMQHP-KNFGLIASFLERKTVAECVLYYYLTK 473
Cdd:cd00167 2 WTEEEDELLLEAVKKYGkNNWEKIAKELPGRTPKQCRERWRNLL 45
|
|
| SANT |
smart00717 |
SANT 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains; |
431-474 |
7.98e-05 |
|
SANT 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains;
Pssm-ID: 197842 [Multi-domain] Cd Length: 49 Bit Score: 42.21 E-value: 7.98e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 331284176 431 WSEQEKETFREKFMQHP-KNFGLIASFLERKTVAECVLYYYLTKK 474
Cdd:smart00717 4 WTEEEDELLIELVKKYGkNNWEKIAKELPGRTAEQCRERWRNLLK 48
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
819-1117 |
8.94e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 48.15 E-value: 8.94e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 819 VEEGEEQKPPAAEElavDTGKAEEPVKSEcteEAEEGPAKGKDaeaaeataegalkaeKKEGGSGRATTAKSSGAPQDSD 898
Cdd:PTZ00449 489 IKKSKKKLAPIEEE---DSDKHDEPPEGP---EASGLPPKAPG---------------DKEGEEGEHEDSKESDEPKEGG 547
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 899 SSATCSADEVDEAEGGDKNRLLSPRPSLLTPTGDPRANASPQKPLDLKQLKQRAAAIPPIQVTKVHEPPREDAAPTKPAP 978
Cdd:PTZ00449 548 KPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRP 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 979 PAPPPPQNLQPesdaPQQPGSSPRGKS-RSPAPPADKEAEKPVFFPAFAAE--------AQKLPGDPPCWTSGLPFPVPP 1049
Cdd:PTZ00449 628 ESPKSPKRPPP----PQRPSSPERPEGpKIIKSPKPPKSPKPPFDPKFKEKfyddyldaAAKSKETKTTVVLDESFESIL 703
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 331284176 1050 REVIKASPHAPDPSAfsyappgHPLPlglhdtarPVLPRPPTISNPPPLISSAKHPSVLERQIGAISQ 1117
Cdd:PTZ00449 704 KETLPETPGTPFTTP-------RPLP--------PKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEE 756
|
|
| DUF4455 |
pfam14643 |
Domain of unknown function (DUF4455); This domain family is found in bacteria and eukaryotes, ... |
132-391 |
2.79e-04 |
|
Domain of unknown function (DUF4455); This domain family is found in bacteria and eukaryotes, and is approximately 480 amino acids in length. There are two completely conserved residues (W and P) that may be functionally important.
Pssm-ID: 464231 [Multi-domain] Cd Length: 469 Bit Score: 46.12 E-value: 2.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 132 SEDLTKDRSLTGKLEpvsppspphTDPELElvppRLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAkppEPE 211
Cdd:pfam14643 42 AESDEEINALFKKLE---------DDDALE----DYTIQQLEELWDIVAQHSLLRKSWIKELDETLEKLEKERA---DKL 105
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 212 KPVspppIESKHRSLVQIIYdenrkkaeaahrILEglgPQVELPLYNqpsdtrqyhENIKINQAM---RKKLILYFKRRN 288
Cdd:pfam14643 106 KSV----LKKYVEILEDIAH------------LLP---PDVYRLIDK---------EAMEINQALlenRRAYAKLFANLM 157
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 289 HARKQWEQKFCQRYDQLMEAWEK--------------KVERIENNPRRRakeskvREYYEKqfpeIRKQRELQERMQRVG 354
Cdd:pfam14643 158 EAELKQELSFRLRWQDRVDRWKAlktehliqefkefiASEEIQNPPERK------KELEEM----LKEQKKLQQKRLELL 227
|
250 260 270
....*....|....*....|....*....|....*..
gi 331284176 355 QRGSGLSMSAARSEhEVSEIIDGLseqENLEKQMRQL 391
Cdd:pfam14643 228 QKISDLLPPAYSKS-KVEEWWASL---EALNEQLDQY 260
|
|
| Myb_DNA-bind_6 |
pfam13921 |
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... |
614-655 |
5.52e-04 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 372817 [Multi-domain] Cd Length: 60 Bit Score: 39.99 E-value: 5.52e-04
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 331284176 614 WTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNY 655
Cdd:pfam13921 1 WTEEEDEKLLKLVEKYGNDWKQIAKELGRRTPKQCFDRWRRK 42
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
166-410 |
5.75e-04 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 45.43 E-value: 5.75e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 166 RLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKppepekpvspppIESKHRSLVQIIYDENRKKAEAAHRIl 245
Cdd:TIGR02168 245 QEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEE------------LQKELYALANEISRLEQQKQILRERL- 311
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 246 eglgpqvelplynqpsdtRQYHENIKINQAMRKKLilyFKRRNHARK---QWEQKFCQ---RYDQLMEAWEKKVERIENN 319
Cdd:TIGR02168 312 ------------------ANLERQLEELEAQLEEL---ESKLDELAEelaELEEKLEElkeELESLEAELEELEAELEEL 370
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 320 PRRRAKESKVREYYEKQFPEIRKQRE-LQERMQRVGQRGSGLSMSAARSEHEVSEIIDGLSEQEnLEKQMRQLAVIPPML 398
Cdd:TIGR02168 371 ESRLEELEEQLETLRSKVAQLELQIAsLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAE-LKELQAELEELEEEL 449
|
250
....*....|..
gi 331284176 399 YDADQQRIKFIN 410
Cdd:TIGR02168 450 EELQEELERLEE 461
|
|
| RSC8 |
COG5259 |
RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / ... |
540-648 |
6.60e-04 |
|
RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / Transcription];
Pssm-ID: 227584 [Multi-domain] Cd Length: 531 Bit Score: 44.88 E-value: 6.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 540 DKEDLLKEKTDDTSGEDNDEKEAVASKGRKTANSQGRrKGRITRSMANEANS--------EEAITPQ--QSAELASMELN 609
Cdd:COG5259 196 ENYSPSLKSPKKESQGKVDELKDHSEKHPSSCSCCGN-KSFNTRYHNLRAEKynscsecyDQGRFPSefTSSDFKPVTIS 274
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 331284176 610 ESSR---WTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQC 648
Cdd:COG5259 275 LLIRdknWSRQELLLLLEGIEMYGDDWDKVARHVGTKTKEQC 316
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
734-1125 |
2.42e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 2.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 734 PSPHTEAAKDTGQNGPKPPAT-LGADGPPPGPPTPPPEDIPAPTEPTPASEATGAPTPPPAPPSPSAPPPVVPKEEKEEE 812
Cdd:PHA03247 2616 PLPPDTHAPDPPPPSPSPAANePDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 813 TAAA---PPVEEGEEQKPPAAEELAVDTGKAEEPVKSECTEEAEEGPAKGKDAEAAEATAEGALKAEKKEGGSGRATTAK 889
Cdd:PHA03247 2696 TSLAdppPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 890 SSGAPQDSDSSATCSADEVDEAEGGDKNRLLSPRPsLLTPTGDPRANASPQKPLDLKQLKQRAAAIPPiqvtkvhEPPRE 969
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA-VLAPAAALPPAASPAGPLPPPTSAQPTAPPPP-------PGPPP 2847
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 970 DAAPTKPAPPAPPPPQNLQPESDAPQQPGSSPRGKSRSPAPPADKEAEKPVFFPAFAAEAQKLPGDPPCWTSGLPFPVPP 1049
Cdd:PHA03247 2848 PSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPP 2927
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1050 REVIKASPHA---------PDPSAFSYAPPGHPLP-LGLHDTARPVLPRPPTISNPPPLISSAKHPSVLERQI--GAISQ 1117
Cdd:PHA03247 2928 QPQPPPPPPPrpqpplaptTDPAGAGEPSGAVPQPwLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSlsRVSSW 3007
|
....*...
gi 331284176 1118 GMSVQLHV 1125
Cdd:PHA03247 3008 ASSLALHE 3015
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
748-1171 |
2.51e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 2.51e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 748 GPKPPA---------TLGADGPPPGPPTPPPEDIPAPTEPTPASEATGAPTPPPAPPSPSAPPPVVPKEEKEEETAAAPP 818
Cdd:PHA03247 2682 RPRRRAarptvgsltSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 819 VEEGEEQKPPAAEELAVDTGKAEEPVKSECTEEAEEGPA---KGKDAEAAEATAEGALKAEKKEGGSGRATTAKSSGAPQ 895
Cdd:PHA03247 2762 TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSpwdPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP 2841
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 896 DSDSSATCSADEVDEAEGGDKNRLLSPRPSLLTPTGDPRANAS----PQKPLDLKQLKQRAAAIPPIQVTKVHEPPREda 971
Cdd:PHA03247 2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRrlarPAVSRSTESFALPPDQPERPPQPQAPPPPQP-- 2919
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 972 aptkpappappppqnlQPESDAPQQPGSSPRGKSRSPAPPAdkeaekPVFFPAFAAEAQklPGDPPCWTSGL---PFPVP 1048
Cdd:PHA03247 2920 ----------------QPQPPPPPQPQPPPPPPPRPQPPLA------PTTDPAGAGEPS--GAVPQPWLGALvpgRVAVP 2975
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1049 PREVIKASPHAPDPSAFSYAPPGHPLP--------LGLHDTARPvlprpptisNPPPLISSAKHPSVLERqigaiSQGMS 1120
Cdd:PHA03247 2976 RFRVPQPAPSREAPASSTPPLTGHSLSrvsswassLALHEETDP---------PPVSLKQTLWPPDDTED-----SDADS 3041
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|...
gi 331284176 1121 VQLHVPYSEHAKAPvGPVTmglPLPMDPKKLAPFSGVKQ--EQLSPRGQAGPP 1171
Cdd:PHA03247 3042 LFDSDSERSDLEAL-DPLP---PEPHDPFAHEPDPATPEagARESPSSQFGPP 3090
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
811-1216 |
4.04e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.62 E-value: 4.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 811 EETAAAPPVEEGEEQKPP-------AAEELAVDTGKAEEPvksecteeaeegPAKGKDAEAAEATAEGALKAEKKEGGSG 883
Cdd:PHA03247 2514 RLAPAILPDEPVGEPVHPrmltwirGLEELASDDAGDPPP------------PLPPAAPPAAPDRSVPPPRPAPRPSEPA 2581
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 884 RATTAKSSGAPQDSDSSATCSADEvdeaegGDKNRLLSPRPSlltPTGDPRANASPQKPldlkqlkqRAAAIPPIQVTKV 963
Cdd:PHA03247 2582 VTSRARRPDAPPQSARPRAPVDDR------GDPRGPAPPSPL---PPDTHAPDPPPPSP--------SPAANEPDPHPPP 2644
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 964 HEPPREDAAPTKPAPPAPPPPQNLQPESdaPQQPGSSPRGksrsPAPPADKEAEKPVFFPAFAAEAQKLPGD-PPCWTSG 1042
Cdd:PHA03247 2645 TVPPPERPRDDPAPGRVSRPRRARRLGR--AAQASSPPQR----PRRRAARPTVGSLTSLADPPPPPPTPEPaPHALVSA 2718
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1043 LPFPVPPREVIKASPHAPDPSAFSYAPPGHPLPLGLHDTARPVLPRPPTISNPPPLISSAKHPSVLERQIGAISQGMSVQ 1122
Cdd:PHA03247 2719 TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL 2798
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1123 LHVPYSEHAKAPVGPVTMGLPLPMDPKKLAPFSGVKQeQLSPRGQAGPPESlgvPTAQEASVlrgtalgsVPGGSITKGI 1202
Cdd:PHA03247 2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ-PTAPPPPPGPPPP---SLPLGGSV--------APGGDVRRRP 2866
|
410
....*....|....
gi 331284176 1203 PSTRVPSDSAITYR 1216
Cdd:PHA03247 2867 PSRSPAAKPAAPAR 2880
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
921-1095 |
4.19e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 42.56 E-value: 4.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 921 SPRPSLLTPTGDPRANASPQKPLDLKQLKQRAAAIPPIQVTKVHEPPREDAAPTKPAPPAPPPPQNLQ---------PES 991
Cdd:PRK12323 373 GPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASargpggapaPAP 452
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 992 DAPQQPGSSPRGKSRSPAPPADKEAEKPVfFPAFAAEAQKLPGDPPCWTSGLP-FPVPPREVIKASP------------H 1058
Cdd:PRK12323 453 APAAAPAAAARPAAAGPRPVAAAAAAAPA-RAAPAAAPAPADDDPPPWEELPPeFASPAPAQPDAAPagwvaesipdpaT 531
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 331284176 1059 APDPSAFSY---APPGHPLPLGLHDTARPVLPRPPTISNP 1095
Cdd:PRK12323 532 ADPDDAFETlapAPAAAPAPRAAAATEPVVAPRPPRASAS 571
|
|
| PHA03264 |
PHA03264 |
envelope glycoprotein D; Provisional |
1010-1162 |
5.47e-03 |
|
envelope glycoprotein D; Provisional
Pssm-ID: 223029 [Multi-domain] Cd Length: 416 Bit Score: 41.91 E-value: 5.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1010 PPADKEAEKPVFFPAFAAEAQKLPGDPPCWTSGLPFPV---PPREVIKASPHAPDPSAFSYAPPGHPLP---LGLHDTAR 1083
Cdd:PHA03264 255 PPYFEESKGYEPPPAPSGGSPAPPGDDRPEAKPEPGPVedgAPGRETGGEGEGPEPAGRDGAAGGEPKPgppRPAPDADR 334
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 1084 PV-LPRPPTISNPPPLISSAKHPSVLERQIGaISQGMSVQLHVPYSEHAKAPVGPVTMGL-PLPMDPKKLAPFSGVKQEQ 1161
Cdd:PHA03264 335 PEgWPSLEAITFPPPTPATPAVPRARPVIVG-TGIAAAAIACVAAAGAVAYFVYTRRRGAgPLPTKEKKLLAFGNVNYSA 413
|
.
gi 331284176 1162 L 1162
Cdd:PHA03264 414 L 414
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixture of two different families of ... |
169-390 |
5.81e-03 |
|
Trichohyalin-plectin-homology domain; This family is a mixture of two different families of eukaryotic proteins. Trichoplein, or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 41.44 E-value: 5.81e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 169 KEELIQNMDRVdREITMVEQQISKLKKKQQQLEEEAAKppepekpvspppiESKH-RSLVQIIYDENRKKAEAAHRILEG 247
Cdd:pfam13868 62 EKEEERKEERK-RYRQELEEQIEEREQKRQEEYEEKLQ-------------EREQmDEIVERIQEEDQAEAEEKLEKQRQ 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 248 LgpQVELPLYNQpsDTRQY---------HENIKINQAMRKKLILYF-----KRRNHARKQWEQkfcQRYDQLMEAWEKKV 313
Cdd:pfam13868 128 L--REEIDEFNE--EQAEWkelekeeerEEDERILEYLKEKAEREEereaeREEIEEEKEREI---ARLRAQQEKAQDEK 200
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 331284176 314 ERIENNPRRRAKESKVREYYEKQFPEIRKQRELQERMQR-----VGQRGSGLSMSAARSEHEVSEIIDGLSEQENLEKQM 388
Cdd:pfam13868 201 AERDELRAKLYQEEQERKERQKEREEAEKKARQRQELQQareeqIELKERRLAEEAEREEEEFERMLRKQAEDEEIEQEE 280
|
..
gi 331284176 389 RQ 390
Cdd:pfam13868 281 AE 282
|
|
|