Conserved domains on [gi|2462541457|ref|XP_054232628|]

telomerase protein component 1 isoform X1 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
TROVE  pfam05731  TROVE domain  226-676  9.27e-152

TROVE domain; This presumed domain is found in TEP1 and Ro60 proteins, which are RNA-binding components of Telomerase, Ro and Vault RNPs. The domain is named TROVE (after Telomerase, Ro and Vault) and is probably RNA-binding.

Pssm-ID: 461724  Cd Length: 361  Bit Score: 475.34  E-value: 9.27e-152
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457  226 TSGDSESHPEPTDHVLQEKKMALLSLLCST---LVSEVNMNNTS------DPTLAAIFEICREL-----ALLEPEFILKA 291
Cdd:pfam05731    1 VSNDSGGYPEPTDDVLQEKRFLLLGLLCGTyytLASEVTMDNAQaikiieDGTGASILETLRELsaagrAPKEPEFILKL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457  292 SLYARQQLNVRNVANNILAIAAFLPACRphlrryfcaivqLPSDWIQVAELYQSLAEGDKNKLVPLPACLRTAMTD---- 367
Cdd:pfam05731   81 ALYARQQLNIRDVANHVLAIAAVLPVCR------------LPTDLFEVAEYCEELAEGDEKKLTGWGRCLRRAMTDwyts 148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457  368 KFAQFDEYQLAKYNPRKHRAKRHPRRPPRspgmePPFSHR---CFPRYIGFLREEQRKFEKAGDTVSEKKNPPRFTLKKL 444
Cdd:pfam05731  149 KFAEFLAYQLTKYNTRKHWSHKDPFRLPH-----PPKFSEtslELKGLFRYATKEQRKFEKAYGAVPEKKESKRLTLKKL 223
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457  445 VQRLHIHKPAQHVQALLGYRYpsnlqlfsrsRLpgpwdssragkrmklsrpeTWERELSLRGNKASVWEELIENgKLPFM 524
Cdd:pfam05731  224 VQRLHISEPAEHVQALIGKRY----------RL-------------------TWEREPSLRGNSAEVWEELIDS-KLPMM 273
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457  525 AMLRNLCNLLRVGISSRHHE-LILQRLQHAKSVIHSRQFPFRFLNAHdaidaleaqlrnqalpfpsnitlmrriltrnek 603
Cdd:pfam05731  274 AMLRNLCNLLRVGVSARHHEdLVLQRLQNPKSVIHSRQHPFRFLNAH--------------------------------- 320
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462541457  604 nrprrrflchlsrqqlrmamripVLYEQLKREKLRvhkaRQWKYDGEMlnryRQALETAVNLSVKhSLPLLPG 676
Cdd:pfam05731  321 -----------------------VVYEQGKGEKGK----LQWKPDPEI----SQALEAAFYLAVK-NLPPTPG 361
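
Each hit on this page carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal Python sketch for reading such a line into typed fields. The `HitStats` type, the `parse_hit_stats` function, and the regular expression are illustrative names written against the lines shown here; they are not part of any NCBI tool, and other CD-Search output layouts may need a different pattern.

```python
import re
from typing import NamedTuple


class HitStats(NamedTuple):
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Pattern written against the summary lines shown on this page (assumed layout).
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>\d+(?:\.\d+)?)\s*"
    r"E-value:\s*(?P<evalue>\d+(?:\.\d+)?(?:[eE][+-]?\d+)?)"
)


def parse_hit_stats(line: str) -> HitStats:
    """Parse one 'Pssm-ID: ...' summary line into typed fields."""
    match = _STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return HitStats(
        pssm_id=int(match["pssm"]),
        multi_domain=match["multi"] is not None,
        cd_length=int(match["length"]),
        bit_score=float(match["bits"]),
        e_value=float(match["evalue"]),
    )


# Summary line of the TROVE hit, copied from this page.
trove_stats = parse_hit_stats(
    "Pssm-ID: 461724  Cd Length: 361  Bit Score: 475.34  E-value: 9.27e-152"
)
```

Keeping the E-value as a float makes it easy to rank or filter hits by significance.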
DUF5920  pfam19334  Domain of unknown function (DUF5920)  687-889  6.87e-138

Domain of unknown function (DUF5920); This domain is found in the Telomerase protein component 1 (TEP1) and contains a region homologous to the telomerase-associated protein p80 from Tetrahymena. TEP1 is a component of the telomerase ribonucleoprotein complex and is thought to be responsible for catalysing the addition of new telomeres to chromosomes. TEP1 is also a component of the vault particle, a cytoplasmic ribonucleoprotein complex, in which it is required for vault RNA stability and its association with the vault particle. This domain is located between the TROVE (pfam05731) and DUF4062 (pfam13271) domains.

Pssm-ID: 466045  Cd Length: 203  Bit Score: 428.81  E-value: 6.87e-138
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457  687 NADRLCPKSNPQGPPLNYALLLIGMMITRAEQVDVVLCGGDTLKTAVLKAEEGILKTAIKLQAQVQEFDENDGWSLNTFG 766
Cdd:pfam19334    1 NADRLCPKSNPQGPPLNYVLLLIGMMIARAEQVDLLLCGRGTLKTAVLKAEEGILKTAIKLQAQVQELEENDEWPLTTFG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457  767 KYLLSLAGQRVPVDRVILLGQSMDDGMINVAKQLYWQRVNSKCLFVGILLRRVQYLSTDLNPNDVTLSGCTDAILKFIAE 846
Cdd:pfam19334   81 KYLLSLAVQRVPVDRVILFGQTMNERLINVAKQLFWQHVNSKCLFVGVLLRKTQYISPDLNPNDVTLSGCTDGILKFIAE 160
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 2462541457  847 HGASHLLEHVGQMDKIFKIPPPPGKTGVQSLRPLEEDTPSPLA 889
Cdd:pfam19334  161 RGASRLLEHVGQMDKIFKIPPPPGKTGVLSLRPLEEDTPSPLA 203
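
The paired rows above (query on the `gi` line, domain model on the `Cdd:` line) can be compared column by column. The sketch below counts identical residues in one pair of rows. It assumes that `-` marks a gap and that lowercase letters mark residues outside the aligned region, which is how these blocks appear to be laid out on this page; `alignment_identity` is a hypothetical helper, not an NCBI function.

```python
from typing import Tuple


def alignment_identity(query_row: str, subject_row: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for one pair of alignment rows.

    Columns containing a gap ('-') or a lowercase (assumed unaligned) residue
    in either row are skipped.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Last row pair of the DUF5920 alignment (query 847-889 vs pfam19334 161-203).
query = "HGASHLLEHVGQMDKIFKIPPPPGKTGVQSLRPLEEDTPSPLA"
model = "RGASRLLEHVGQMDKIFKIPPPPGKTGVLSLRPLEEDTPSPLA"
identical, aligned = alignment_identity(query, model)
print(f"{identical}/{aligned} identical columns")
```

For this row pair the count is 40 identical columns out of 43 aligned. Summing the counts over all row pairs of a hit gives the identity for the whole interval.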
WD40  COG2319  WD40 repeat [General function prediction only]  1849-2268  8.20e-70


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 241.35  E-value: 8.20e-70
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1849 AFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRPRGHLGSLSlS 1928
Cdd:COG2319      1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHT-A 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1929 PALSVALSPDGDRVAVGYRADGIRIYKISSGSQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWALKecSLQSLW 2006
Cdd:COG2319     80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAF-SPdgKTLASGSADGTVRLWDLA--TGKLLR 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2007 LLSRFQKPVLGLATSQ--ELLASASEDFTVQLWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRS 2084
Cdd:COG2319    157 TLTGHSGAVTSVAFSPdgKLLASGSDDGTVRLW---------DLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGT 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2085 LLCWDVRTPKtpvLIHSFPAcHRDWVTGCAWTKDN-LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVSAVA--AVEEHV 2161
Cdd:COG2319    228 VRLWDLATGK---LLRTLTG-HSGSVRSVAFSPDGrLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAfsPDGKLL 303
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2162 VSVSRDGTLKVWD-HQGVELTSIPAHSGPISHCAAAmeprAAGQpgselLVVTVGLDGATRLWHPLLVCQTHTLLGHSGP 2240
Cdd:COG2319    304 ASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFS----PDGK-----TLASGSDDGTVRLWDLATGELLRTLTGHTGA 374
                          410       420
                   ....*....|....*....|....*...
gi 2462541457 2241 VRAAAVSETSGLMLTASEDGSVRLWQVP 2268
Cdd:COG2319    375 VTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40  COG2319  WD40 repeat [General function prediction only]  1680-1958  1.84e-48


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 179.34  E-value: 1.84e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1680 TAVAFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDDTLFLTA-FDGLLELWDLQHGCRVLQTKAH 1758
Cdd:COG2319    124 RSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGsDDGTVRLWDLATGKLLRTLTGH 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1759 QYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQ-LAFQHTYPKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKD 1837
Cdd:COG2319    204 TGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKlLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRT 283
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1838 LGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGR 1917
Cdd:COG2319    284 LTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGE 363
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 2462541457 1918 P----RGHLGslslsPALSVALSPDGDRVAVGYRADGIRIYKISS 1958
Cdd:COG2319    364 LlrtlTGHTG-----AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
NACHT  pfam05729  NACHT domain  1162-1337  2.19e-32

NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.

Pssm-ID: 428606 [Multi-domain]  Cd Length: 166  Bit Score: 124.72  E-value: 2.19e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1162 RLSLVTGQSGQGKTAFLASLVSALQAPDGAKVASLVFFHFSGARPDQGLAltllRRLCTYLRGQLKEPGALPSTYRSLVW 1241
Cdd:pfam05729    1 RTVILQGEAGSGKTTLLQKLALLWAQGKLPQGFDFVFFLPCRELSRSGNA----RSLADLLFSQWPEPAAPVSEVWAVIL 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1242 ELQQRLLpksaeslhpgqtqvLIIDGADRLVDQNGQ---------LISDWIPKKLPRCVHLVLSVSSDAG--LGETLEQS 1310
Cdd:pfam05729   77 ELPERLL--------------LILDGLDELVSDLGQldgpcpvltLLSSLLRKKLLPGASLLLTVRPDALrdLRRGLEEP 142
                          170       180
                   ....*....|....*....|....*..
gi 2462541457 1311 QgahVLALGPLEASARARLVREELALY 1337
Cdd:pfam05729  143 R---YLEVRGFSESDRKQYVRKYFSDE 166
DUF4062  pfam13271  Domain of unknown function (DUF4062)  900-1008  1.04e-15

Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. This domain family is found in bacteria, archaea and eukaryotes, and is approximately 80 amino acids in length. There is a conserved SST sequence motif.

Pssm-ID: 463823  Cd Length: 78  Bit Score: 74.16  E-value: 1.04e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457  900 RLFISSTFRDMHGERDLLLRSVlpalqaRAAPHrislhgidLRWGVTEEETRRNRQLEVCLGEVENAQLFVGILGSRYGY 979
Cdd:pfam13271    1 KVFISSTFYDLKEEREALIEAL------LELGH--------IPVGMEEFPASDESPLDVCLREVDECDIYILILGGRYGS 66
                           90       100
                   ....*....|....*....|....*....
gi 2462541457  980 IPpsynlpdhphfhwaqqyPSGRSVTEME 1008
Cdd:pfam13271   67 ID-----------------PDGISYTELE 78
TEP1_N  pfam05386  TEP1 N-terminal domain  1-29  8.83e-15

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However, the conservation of two histidines and a cysteine suggests it is a potential zinc-binding domain.

Pssm-ID: 428450  Cd Length: 29  Bit Score: 69.74  E-value: 8.83e-15
                           10        20
                   ....*....|....*....|....*....
gi 2462541457    1 MEKLHGHVSAHPDILSLENRCLAMLPDLQ 29
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N  pfam05386  TEP1 N-terminal domain  91-119  3.67e-14


Pssm-ID: 428450  Cd Length: 29  Bit Score: 68.20  E-value: 3.67e-14
                           10        20
                   ....*....|....*....|....*....
gi 2462541457   91 MEKPHGHVSAHPDILSLENRCLATLSSLK 119
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N  pfam05386  TEP1 N-terminal domain  61-89  1.51e-13


Pssm-ID: 428450  Cd Length: 29  Bit Score: 66.27  E-value: 1.51e-13
                           10        20
                   ....*....|....*....|....*....
gi 2462541457   61 MEKPHGYVSAHPDILSLENQCLATLSDLK 89
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N  pfam05386  TEP1 N-terminal domain  31-59  6.92e-12


Pssm-ID: 428450  Cd Length: 29  Bit Score: 61.65  E-value: 6.92e-12
                           10        20
                   ....*....|....*....|....*....
gi 2462541457   31 LEKLHQHVSTHSDILSLKNQCLATLPDLK 59
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
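
The TEP1_N description notes that two histidines and a cysteine are conserved across the four copies. As a rough check, the four query segments reported above can be compared position by position. This is a minimal sketch: it relies on the four segments being ungapped and of equal length (29 residues each, as in the alignments above), and `conserved_columns` is a hypothetical helper introduced only for illustration.

```python
from typing import List, Sequence, Tuple


def conserved_columns(seqs: Sequence[str]) -> List[Tuple[int, str]]:
    """Return (1-based position, residue) for columns identical in every sequence."""
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must all have the same length")
    conserved = []
    for position, column in enumerate(zip(*seqs), start=1):
        if len(set(column)) == 1:
            conserved.append((position, column[0]))
    return conserved


# Query segments of the four TEP1_N hits, copied from the alignments above.
REPEATS = [
    "MEKLHGHVSAHPDILSLENRCLAMLPDLQ",  # residues 1-29
    "LEKLHQHVSTHSDILSLKNQCLATLPDLK",  # residues 31-59
    "MEKPHGYVSAHPDILSLENQCLATLSDLK",  # residues 61-89
    "MEKPHGHVSAHPDILSLENRCLATLSSLK",  # residues 91-119
]

conserved = conserved_columns(REPEATS)
# Keep only histidine and cysteine, the residues named in the description.
his_cys = [(pos, aa) for pos, aa in conserved if aa in "HC"]
print(his_cys)
```

Within each repeat, the histidines at positions 5 and 11 and the cysteine at position 21 are identical in all four copies, which matches the description's remark about two conserved histidines and one conserved cysteine.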
WD40 super family  cl29593  WD40 domain  2231-2391  2.55e-11

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.

The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 66.97  E-value: 2.55e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2231 THTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKE-------------------ADDTCIPRSS----------- 2280
Cdd:cd00200      2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellrtlkghtgpvrdvaasADGTYLASGSsdktirlwdle 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2281 ------------AAVTAVAWAPDGSMAVSGNQAGELILWQEAKAVATAQAPGHIG---ALIWSSAHTfFVLSA--DEKIS 2343
Cdd:cd00200     82 tgecvrtltghtSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDwvnSVAFSPDGT-FVASSsqDGTIK 160
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 2462541457 2344 EWQvkLRKGSAPGNLSLHLNRIlqedlgvlTSLDWAPDGHFLILAKAD 2391
Cdd:cd00200    161 LWD--LRTGKCVATLTGHTGEV--------NSVAFSPDGEKLLSSSSD 198
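
The Interval column of the hit list above gives each domain's position on the 2,000-plus-residue query, so the hits can be put in sequence order and checked for overlap. The sketch below does this for the intervals listed above. The `Hit` type and the `overlap` helper are illustrative names, and the hit data is copied by hand from this page rather than fetched from NCBI.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    name: str
    accession: str
    start: int
    end: int


def overlap(a: Hit, b: Hit) -> int:
    """Number of query residues shared by two hits (0 if they are disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


# Intervals copied from the domain hit list above, in page order.
HITS: List[Hit] = [
    Hit("TROVE", "pfam05731", 226, 676),
    Hit("DUF5920", "pfam19334", 687, 889),
    Hit("WD40", "COG2319", 1849, 2268),
    Hit("WD40", "COG2319", 1680, 1958),
    Hit("NACHT", "pfam05729", 1162, 1337),
    Hit("DUF4062", "pfam13271", 900, 1008),
    Hit("TEP1_N", "pfam05386", 1, 29),
    Hit("TEP1_N", "pfam05386", 91, 119),
    Hit("TEP1_N", "pfam05386", 61, 89),
    Hit("TEP1_N", "pfam05386", 31, 59),
    Hit("WD40 super family", "cl29593", 2231, 2391),
]

# Order hits from N-terminus to C-terminus.
ordered = sorted(HITS, key=lambda hit: hit.start)
for hit in ordered:
    print(f"{hit.start:>5}-{hit.end:<5} {hit.name} ({hit.accession})")

# Overlap between the two COG2319 hits (1680-1958 and 1849-2268).
wd40_overlap = overlap(HITS[2], HITS[3])
```

Sorted this way, DUF5920 (687-889) falls between TROVE (226-676) and DUF4062 (900-1008), as its description states, and the two COG2319 hits share residues 1849-1958.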
 
Name Accession Description Interval E-value
WD40  cd00200  WD40 domain  1972-2308  1.70e-55

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 196.02  E-value: 1.70e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1972 VSALAWL-SPKVLVSGAEDGSLQGWalkecSLQSLWLLSRFQ---KPVLGLATS--QELLASASEDFTVQLWPrqlltrp 2045
Cdd:cd00200     12 VTCVAFSpDGKLLATGSGDGTIKVW-----DLETGELLRTLKghtGPVRDVAASadGTYLASGSSDKTIRLWD------- 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2046 hkAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVRTPKtpvLIHSFPaCHRDWVTGCAWTKDNLLISCS 2125
Cdd:cd00200     80 --LETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGK---CLTTLR-GHTDWVNSVAFSPDGTFVASS 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2126 S-DGSVGLWDPESGQRLGQFLGHQSAVSAVAAV--EEHVVSVSRDGTLKVWDHQgveltsipahsgpishcaaamepraA 2202
Cdd:cd00200    154 SqDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSpdGEKLLSSSSDGTIKLWDLS-------------------------T 208
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2203 GQpgsellvvtvgldgatrlwhpllvcQTHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKEADDTCIPRSSAA 2282
Cdd:cd00200    209 GK-------------------------CLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS 263
                          330       340
                   ....*....|....*....|....*.
gi 2462541457 2283 VTAVAWAPDGSMAVSGNQAGELILWQ 2308
Cdd:cd00200    264 VTSLAWSPDGKRLASGSADGTIRIWD 289
WD40  cd00200  WD40 domain  1680-1955  1.96e-36

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 140.93  E-value: 1.96e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1680 TAVAFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDDTLFLTA-FDGLLELWDLQHGCRVLQTKAH 1758
Cdd:cd00200     13 TCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGsSDKTIRLWDLETGECVRTLTGH 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1759 QYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLAF---QHTypKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVT 1835
Cdd:cd00200     93 TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTtlrGHT--DWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCV 170
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1836 KDLGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSL 1915
Cdd:cd00200    171 ATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRT 250
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 2462541457 1916 GRPRGHLGSLSlSPALSVALSPDGDRVAVGYrADG-IRIYK 1955
Cdd:cd00200    251 GECVQTLSGHT-NSVTSLAWSPDGKRLASGS-ADGtIRIWD 289
NACHT pfam05729
NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in ...
1162-1337 2.19e-32

NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.


Pssm-ID: 428606 [Multi-domain]  Cd Length: 166  Bit Score: 124.72  E-value: 2.19e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1162 RLSLVTGQSGQGKTAFLASLVSALQAPDGAKVASLVFFHFSGARPDQGLAltllRRLCTYLRGQLKEPGALPSTYRSLVW 1241
Cdd:pfam05729    1 RTVILQGEAGSGKTTLLQKLALLWAQGKLPQGFDFVFFLPCRELSRSGNA----RSLADLLFSQWPEPAAPVSEVWAVIL 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1242 ELQQRLLpksaeslhpgqtqvLIIDGADRLVDQNGQ---------LISDWIPKKLPRCVHLVLSVSSDAG--LGETLEQS 1310
Cdd:pfam05729   77 ELPERLL--------------LILDGLDELVSDLGQldgpcpvltLLSSLLRKKLLPGASLLLTVRPDALrdLRRGLEEP 142
                          170       180
                   ....*....|....*....|....*..
gi 2462541457 1311 QgahVLALGPLEASARARLVREELALY 1337
Cdd:pfam05729  143 R---YLEVRGFSESDRKQYVRKYFSDE 166
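Each hit in this list carries an interval line and a "Pssm-ID" statistics line with a regular shape, so the values can be pulled into a structured record. The following is a minimal sketch, assuming the line layout shown in the NACHT hit above; the names `HitStats`, `parse_stats` and `parse_interval` are hypothetical helpers introduced for illustration and are not part of any NCBI tool.

```python
import re
from typing import NamedTuple, Tuple


class HitStats(NamedTuple):
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    evalue: float


# Pattern for lines such as:
#   Pssm-ID: 428606 [Multi-domain]  Cd Length: 166  Bit Score: 124.72  E-value: 2.19e-32
# The "[Multi-domain]" tag is optional (assumption based on the hits in this listing).
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<score>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_stats(line: str) -> HitStats:
    """Parse one 'Pssm-ID: ...' statistics line into a HitStats record."""
    m = _STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not a Pssm-ID statistics line: {line!r}")
    return HitStats(
        pssm_id=int(m["pssm"]),
        multi_domain=m["multi"] is not None,
        cd_length=int(m["length"]),
        bit_score=float(m["score"]),
        evalue=float(m["evalue"]),
    )


def parse_interval(line: str) -> Tuple[int, int, float]:
    """Parse an interval line such as '1162-1337 2.19e-32' into (start, end, evalue)."""
    span, evalue = line.split()
    start, end = (int(part) for part in span.split("-"))
    return start, end, float(evalue)


stats = parse_stats(
    "Pssm-ID: 428606 [Multi-domain]  Cd Length: 166  Bit Score: 124.72  E-value: 2.19e-32"
)
start, end, _ = parse_interval("1162-1337 2.19e-32")
query_span = end - start + 1  # residues of the query covered by the hit
print(stats, query_span)
```

For the NACHT hit, the query interval spans 176 residues against a model length of 166, which is consistent with the gapped alignment shown above.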
DUF4062 pfam13271
Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. ...
900-1008 1.04e-15

Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. This domain family is found in bacteria, archaea and eukaryotes, and is approximately 80 amino acids in length. There is a conserved SST sequence motif.


Pssm-ID: 463823  Cd Length: 78  Bit Score: 74.16  E-value: 1.04e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457  900 RLFISSTFRDMHGERDLLLRSVlpalqaRAAPHrislhgidLRWGVTEEETRRNRQLEVCLGEVENAQLFVGILGSRYGY 979
Cdd:pfam13271    1 KVFISSTFYDLKEEREALIEAL------LELGH--------IPVGMEEFPASDESPLDVCLREVDECDIYILILGGRYGS 66
                           90       100
                   ....*....|....*....|....*....
gi 2462541457  980 IPpsynlpdhphfhwaqqyPSGRSVTEME 1008
Cdd:pfam13271   67 ID-----------------PDGISYTELE 78
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
1-29 8.83e-15

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 69.74  E-value: 8.83e-15
                           10        20
                   ....*....|....*....|....*....
gi 2462541457    1 MEKLHGHVSAHPDILSLENRCLAMLPDLQ 29
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
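The paired rows of each alignment block (query on top, domain model below) can be compared column by column to get a rough identity figure. The sketch below assumes the usual reading of this display, in which uppercase letters are aligned residues while lowercase letters and "-" mark unaligned or gapped columns; `aligned_identity` is a hypothetical helper, not an NCBI function.

```python
from typing import Tuple


def aligned_identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        # Only columns where both rows show an uppercase residue are counted
        # as aligned; lowercase residues and '-' gaps are skipped (assumption).
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Rows copied from the TEP1_N hit at residues 1-29.
query_row = "MEKLHGHVSAHPDILSLENRCLAMLPDLQ"
model_row = "MEKPHGHVSAHPDILSLENRCLATLPDLK"
identical, aligned = aligned_identity(query_row, model_row)
print(f"{identical}/{aligned} identical")  # 26/29 identical
```

For multi-block alignments, the rows of each block would need to be concatenated first (in the same order for query and model) before calling the function.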
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
91-119 3.67e-14

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 68.20  E-value: 3.67e-14
                           10        20
                   ....*....|....*....|....*....
gi 2462541457   91 MEKPHGHVSAHPDILSLENRCLATLSSLK 119
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
61-89 1.51e-13

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 66.27  E-value: 1.51e-13
                           10        20
                   ....*....|....*....|....*....
gi 2462541457   61 MEKPHGYVSAHPDILSLENQCLATLSDLK 89
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
31-59 6.92e-12

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 61.65  E-value: 6.92e-12
                           10        20
                   ....*....|....*....|....*....
gi 2462541457   31 LEKLHQHVSTHSDILSLKNQCLATLPDLK 59
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
2231-2391 2.55e-11

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 66.97  E-value: 2.55e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2231 THTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKE-------------------ADDTCIPRSS----------- 2280
Cdd:cd00200      2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellrtlkghtgpvrdvaasADGTYLASGSsdktirlwdle 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2281 ------------AAVTAVAWAPDGSMAVSGNQAGELILWQEAKAVATAQAPGHIG---ALIWSSAHTfFVLSA--DEKIS 2343
Cdd:cd00200     82 tgecvrtltghtSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDwvnSVAFSPDGT-FVASSsqDGTIK 160
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 2462541457 2344 EWQvkLRKGSAPGNLSLHLNRIlqedlgvlTSLDWAPDGHFLILAKAD 2391
Cdd:cd00200    161 LWD--LRTGKCVATLTGHTGEV--------NSVAFSPDGEKLLSSSSD 198
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
2050-2089 1.79e-07

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 49.23  E-value: 1.79e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 2462541457  2050 DFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWD 2089
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
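The WD40 descriptions in this list mention a GH dipeptide near the N-terminus of each repeat, a WD dipeptide at its C-terminus, and a length of about 40 residues. A small sketch can report these features for a single repeat segment. It only reports them and does not enforce any position range, since the description says "typically"; `describe_wd40_repeat` is a hypothetical helper introduced here for illustration.

```python
from typing import Dict, Optional, Union


def describe_wd40_repeat(segment: str) -> Dict[str, Union[int, bool, Optional[int]]]:
    """Report length, 1-based offset of the first GH dipeptide, and WD terminus."""
    seq = segment.upper()
    gh_index = seq.find("GH")  # -1 when no GH dipeptide is present
    return {
        "length": len(seq),
        "gh_offset": gh_index + 1 if gh_index >= 0 else None,
        "ends_with_wd": seq.endswith("WD"),
    }


# Query row of the smart00320 hit at residues 2050-2089.
repeat = "DFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWD"
print(describe_wd40_repeat(repeat))
# {'length': 40, 'gh_offset': 10, 'ends_with_wd': True}
```

This is a plain string check, not a substitute for the profile-based scoring behind the E-values in this list.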
PTZ00421 PTZ00421
coronin; Provisional
1962-2199 4.25e-07

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 55.28  E-value: 4.25e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1962 GAQGQALDVAVSALawlSPKVLVSGAEDGSLQGWALKECSLQSlwllsrfqkpvlglATSQELLasasedftvqlwprql 2041
Cdd:PTZ00421    73 GQEGPIIDVAFNPF---DPQKLFTASEDGTIMGWGIPEEGLTQ--------------NISDPIV---------------- 119
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2042 ltrphkaedfpcgtELRGHEGPVSCCSFSTDG-GSLATGGRDRSLLCWDVRTPKTPVLIhsfpACHRDWVTGCAWTKD-N 2119
Cdd:PTZ00421   120 --------------HLQGHTKKVGIVSFHPSAmNVLASAGADMVVNVWDVERGKAVEVI----KCHSDQITSLEWNLDgS 181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2120 LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVS--AVAAVEEHVV-----SVSRDGTLKVWDHQGVEltsIPAHSGPISH 2192
Cdd:PTZ00421   182 LLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSqrCLWAKRKDLIitlgcSKSQQRQIMLWDTRKMA---SPYSTVDLDQ 258

                   ....*..
gi 2462541457 2193 CAAAMEP 2199
Cdd:PTZ00421   259 SSALFIP 265
WD40 pfam00400
WD domain, G-beta repeat;
2053-2089 6.13e-07

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 47.73  E-value: 6.13e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2462541457 2053 CGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWD 2089
Cdd:pfam00400    3 LLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1756-1787 2.06e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 40.76  E-value: 2.06e-04
                            10        20        30
                    ....*....|....*....|....*....|..
gi 2462541457  1756 KAHQYQITGCCLSPDCRLLATVCLGGCLKLWD 1787
Cdd:smart00320    9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
YcjX COG3106
Ras-like GTP-binding stress-induced protein YcjX, DUF463 family [Signal transduction ...
1149-1199 1.01e-03

Ras-like GTP-binding stress-induced protein YcjX, DUF463 family [Signal transduction mechanisms];


Pssm-ID: 442340  Cd Length: 467  Bit Score: 44.41  E-value: 1.01e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462541457 1149 LQDTVQRLMLPHGRLSlVTGQSGQGKTAFLASLVSALQApdGAKVASLVFF 1199
Cdd:COG3106     11 LADLANRLLDRHLRLA-VTGLSRSGKTAFITSLVNQLLH--GGSGARLPLF 58
WD40 pfam00400
WD domain, G-beta repeat;
1756-1787 6.22e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 36.55  E-value: 6.22e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 2462541457 1756 KAHQYQITGCCLSPDCRLLATVCLGGCLKLWD 1787
Cdd:pfam00400    8 EGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
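Several hits in this list cover the same stretch of the query (for example the smart00320 and pfam00400 WD40 hits). The sketch below shows one way to find such overlaps from the interval column; the `Hit` tuple layout and the function names are assumptions made for illustration, and the intervals are copied from the hits listed above.

```python
from typing import List, Tuple

Hit = Tuple[str, int, int]  # (accession, start, end), 1-based inclusive


def overlap(a: Hit, b: Hit) -> int:
    """Number of query residues shared by two hits (0 if they are disjoint)."""
    return max(0, min(a[2], b[2]) - max(a[1], b[1]) + 1)


def overlapping_pairs(hits: List[Hit]) -> List[Tuple[str, str, int]]:
    """Return (accession_a, accession_b, shared_residues) for every overlapping pair."""
    pairs = []
    for i, a in enumerate(hits):
        for b in hits[i + 1:]:
            shared = overlap(a, b)
            if shared:
                pairs.append((a[0], b[0], shared))
    return pairs


hits = [
    ("smart00320", 2050, 2089),
    ("pfam00400", 2053, 2089),
    ("smart00320", 1756, 1787),
    ("pfam00400", 1756, 1787),
    ("pfam05729", 1162, 1337),
    ("COG3106", 1149, 1199),
]
for acc_a, acc_b, shared in overlapping_pairs(hits):
    print(acc_a, acc_b, shared)
```

The pairwise loop is quadratic in the number of hits, which is fine for a listing of this size.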
 
DUF5920 pfam19334
Domain of unknown function (DUF5920); This domain is found in the Telomerase protein component ...
687-889 6.87e-138

Domain of unknown function (DUF5920); This domain is found in the Telomerase protein component 1 (TEP1) and it contains an homology region to the telomerase associated protein from Tetrahymena p80. TEP1 is a component of the telomerase ribonucleoprotein complex and is thought to be responsible for catalysing the addition of new telomeres to chromosomes. TEP1 is also a component of the vault particle, a cytoplasmic ribonucleoprotein complex, in which it is required for vault RNA stability and its association with the vault particle. This domain is localized between the TROVE (pfam05731) and DUF4062 (pfam13271) domains.


Pssm-ID: 466045  Cd Length: 203  Bit Score: 428.81  E-value: 6.87e-138
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457  687 NADRLCPKSNPQGPPLNYALLLIGMMITRAEQVDVVLCGGDTLKTAVLKAEEGILKTAIKLQAQVQEFDENDGWSLNTFG 766
Cdd:pfam19334    1 NADRLCPKSNPQGPPLNYVLLLIGMMIARAEQVDLLLCGRGTLKTAVLKAEEGILKTAIKLQAQVQELEENDEWPLTTFG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457  767 KYLLSLAGQRVPVDRVILLGQSMDDGMINVAKQLYWQRVNSKCLFVGILLRRVQYLSTDLNPNDVTLSGCTDAILKFIAE 846
Cdd:pfam19334   81 KYLLSLAVQRVPVDRVILFGQTMNERLINVAKQLFWQHVNSKCLFVGVLLRKTQYISPDLNPNDVTLSGCTDGILKFIAE 160
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 2462541457  847 HGASHLLEHVGQMDKIFKIPPPPGKTGVQSLRPLEEDTPSPLA 889
Cdd:pfam19334  161 RGASRLLEHVGQMDKIFKIPPPPGKTGVLSLRPLEEDTPSPLA 203
WD40 COG2319
WD40 repeat [General function prediction only];
1849-2268 8.20e-70

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 241.35  E-value: 8.20e-70
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1849 AFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRPRGHLGSLSlS 1928
Cdd:COG2319      1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHT-A 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1929 PALSVALSPDGDRVAVGYRADGIRIYKISSGSQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWALKecSLQSLW 2006
Cdd:COG2319     80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAF-SPdgKTLASGSADGTVRLWDLA--TGKLLR 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2007 LLSRFQKPVLGLATSQ--ELLASASEDFTVQLWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRS 2084
Cdd:COG2319    157 TLTGHSGAVTSVAFSPdgKLLASGSDDGTVRLW---------DLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGT 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2085 LLCWDVRTPKtpvLIHSFPAcHRDWVTGCAWTKDN-LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVSAVA--AVEEHV 2161
Cdd:COG2319    228 VRLWDLATGK---LLRTLTG-HSGSVRSVAFSPDGrLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAfsPDGKLL 303
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2162 VSVSRDGTLKVWD-HQGVELTSIPAHSGPISHCAAAmeprAAGQpgselLVVTVGLDGATRLWHPLLVCQTHTLLGHSGP 2240
Cdd:COG2319    304 ASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFS----PDGK-----TLASGSDDGTVRLWDLATGELLRTLTGHTGA 374
                          410       420
                   ....*....|....*....|....*...
gi 2462541457 2241 VRAAAVSETSGLMLTASEDGSVRLWQVP 2268
Cdd:COG2319    375 VTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 COG2319
WD40 repeat [General function prediction only];
1683-2092 1.03e-60

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 215.16  E-value: 1.03e-60
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1683 AFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDD-TLFLTAFDGLLELWDLQHGCRVLQTKAHQYQ 1761
Cdd:COG2319      1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGaRLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1762 ITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLAFQHT-YPKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKDLGA 1840
Cdd:COG2319     81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTgHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1841 PGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRPRG 1920
Cdd:COG2319    161 HSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLR 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1921 HLGSLSlSPALSVALSPDGDRVAVGYRADGIRIYKISSGSQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWALK 1998
Cdd:COG2319    241 TLTGHS-GSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAF-SPdgKLLASGSDDGTVRLWDLA 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1999 ecSLQSLWLLSRFQKPVLGLATSQ--ELLASASEDFTVQLWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSL 2076
Cdd:COG2319    319 --TGKLLRTLTGHTGAVRSVAFSPdgKTLASGSDDGTVRLW---------DLATGELLRTLTGHTGAVTSVAFSPDGRTL 387
                          410
                   ....*....|....*.
gi 2462541457 2077 ATGGRDRSLLCWDVRT 2092
Cdd:COG2319    388 ASGSADGTVRLWDLAT 403
WD40 COG2319
WD40 repeat [General function prediction only];
1725-2137 2.37e-60

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 214.00  E-value: 2.37e-60
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1725 LFLSDDTLFLTAFDGLLELWDLQHGCRVLQTKAHQYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLAFQHTYPKS-L 1803
Cdd:COG2319      2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAaV 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1804 NCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKDLGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAH 1883
Cdd:COG2319     82 LSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGH 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1884 HGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRP----RGHLGSLSlspalSVALSPDGDRVAVGYRADGIRIYKISSG 1959
Cdd:COG2319    162 SGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLlrtlTGHTGAVR-----SVAFSPDGKLLASGSADGTVRLWDLATG 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1960 SQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWALKecSLQSLWLLSRFQKPVLGLATSQ--ELLASASEDFTVQ 2035
Cdd:COG2319    237 KLLRTLTGHSGSVRSVAF-SPdgRLLASGSADGTVRLWDLA--TGELLRTLTGHSGGVNSVAFSPdgKLLASGSDDGTVR 313
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2036 LWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVRTPKtpvLIHSFPAcHRDWVTGCAW 2115
Cdd:COG2319    314 LW---------DLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGE---LLRTLTG-HTGAVTSVAF 380
                          410       420
                   ....*....|....*....|...
gi 2462541457 2116 TKD-NLLISCSSDGSVGLWDPES 2137
Cdd:COG2319    381 SPDgRTLASGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1972-2308 1.70e-55

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 196.02  E-value: 1.70e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1972 VSALAWL-SPKVLVSGAEDGSLQGWalkecSLQSLWLLSRFQ---KPVLGLATS--QELLASASEDFTVQLWPrqlltrp 2045
Cdd:cd00200     12 VTCVAFSpDGKLLATGSGDGTIKVW-----DLETGELLRTLKghtGPVRDVAASadGTYLASGSSDKTIRLWD------- 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2046 hkAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVRTPKtpvLIHSFPaCHRDWVTGCAWTKDNLLISCS 2125
Cdd:cd00200     80 --LETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGK---CLTTLR-GHTDWVNSVAFSPDGTFVASS 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2126 S-DGSVGLWDPESGQRLGQFLGHQSAVSAVAAV--EEHVVSVSRDGTLKVWDHQgveltsipahsgpishcaaamepraA 2202
Cdd:cd00200    154 SqDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSpdGEKLLSSSSDGTIKLWDLS-------------------------T 208
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2203 GQpgsellvvtvgldgatrlwhpllvcQTHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKEADDTCIPRSSAA 2282
Cdd:cd00200    209 GK-------------------------CLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS 263
                          330       340
                   ....*....|....*....|....*.
gi 2462541457 2283 VTAVAWAPDGSMAVSGNQAGELILWQ 2308
Cdd:cd00200    264 VTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
1975-2391 2.94e-53

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 193.59  E-value: 2.94e-53
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1975 LAWLSPKVLVSGAEDGSLQGWALKECSLQSLWLLSRFQKPVLGLATSQELLASASEDFTVQLWPRQLLTRPHkaedfpcg 2054
Cdd:COG2319      1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLA-------- 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2055 tELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVRTPKTPVLIHSfpacHRDWVTGCAWTKD-NLLISCSSDGSVGLW 2133
Cdd:COG2319     73 -TLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG----HTGAVRSVAFSPDgKTLASGSADGTVRLW 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2134 DPESGQRLGQFLGHQSAVSAVA--AVEEHVVSVSRDGTLKVWD-HQGVELTSIPAHSGPISHCAAAmepraagqPGSELL 2210
Cdd:COG2319    148 DLATGKLLRTLTGHSGAVTSVAfsPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFS--------PDGKLL 219
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2211 VvTVGLDGATRLWHPLLVCQTHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKEADDTCIPRSSAAVTAVAWAP 2290
Cdd:COG2319    220 A-SGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSP 298
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2291 DGSMAVSGNQAGELILWQEAKAVATAQAPGHIG---ALIWSSA-HTFFVLSADEKISEWQvkLRKGSAPGNLSLHLNRIl 2366
Cdd:COG2319    299 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGavrSVAFSPDgKTLASGSDDGTVRLWD--LATGELLRTLTGHTGAV- 375
                          410       420
                   ....*....|....*....|....*
gi 2462541457 2367 qedlgvlTSLDWAPDGHFLILAKAD 2391
Cdd:COG2319    376 -------TSVAFSPDGRTLASGSAD 393
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
2056-2346 5.72e-53

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 188.70  E-value: 5.72e-53
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2056 ELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVrtpKTPVLIHSFPAcHRDWVTGCAWTKD-NLLISCSSDGSVGLWD 2134
Cdd:cd00200      4 TLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDL---ETGELLRTLKG-HTGPVRDVAASADgTYLASGSSDKTIRLWD 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2135 PESGQRLGQFLGHQSAVSAVAAVEEH--VVSVSRDGTLKVWD-HQGVELTSIPAHSGPISHCAAamepraagqPGSELLV 2211
Cdd:cd00200     80 LETGECVRTLTGHTSYVSSVAFSPDGriLSSSSRDKTIKVWDvETGKCLTTLRGHTDWVNSVAF---------SPDGTFV 150
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2212 VTVGLDGATRLWHP-LLVCQtHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKEADDTCIPRSSAAVTAVAWAP 2290
Cdd:cd00200    151 ASSSQDGTIKLWDLrTGKCV-ATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP 229
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2291 DGSMAVSGNQAGELILWQEAKAVATAQAPGH---IGALIWS-SAHTFFVLSADEKISEWQ 2346
Cdd:cd00200    230 DGYLLASGSEDGTIRVWDLRTGECVQTLSGHtnsVTSLAWSpDGKRLASGSADGTIRIWD 289
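
The GH and WD dipeptide landmarks named in the cd00200 description can be illustrated with a naive scan. This is only a sketch: CD-Search scores position-specific matrices rather than dipeptides, and the function name `find_gh_wd_candidates` and the gap window (`min_gap`, `max_gap`) are assumptions chosen for illustration, not values taken from CDD.

```python
def find_gh_wd_candidates(seq, min_gap=12, max_gap=30):
    """Return (gh_start, wd_end) pairs, 1-based, for GH...WD candidates.

    A candidate is a 'GH' dipeptide followed by a 'WD' dipeptide with
    min_gap..max_gap residues in between. The window is a hypothetical,
    tunable heuristic; real WD40 repeats vary and many lack either dipeptide.
    """
    seq = seq.upper()
    hits = []
    for gh in range(len(seq) - 1):
        if seq[gh:gh + 2] != "GH":
            continue
        # Take the nearest WD that falls inside the allowed gap window.
        for gap in range(min_gap, max_gap + 1):
            wd = gh + 2 + gap
            if seq[wd:wd + 2] == "WD":
                hits.append((gh + 1, wd + 2))
                break
    return hits


# Query residues 2050-2089, as listed for one smart00320 hit on this page.
segment = "DFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWD"
print(find_gh_wd_candidates(segment))
```

The returned coordinates are relative to the segment passed in; adding the segment's start position minus one converts them to query coordinates.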
WD40 COG2319
WD40 repeat [General function prediction only];
1680-1958 1.84e-48

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 179.34  E-value: 1.84e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1680 TAVAFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDDTLFLTA-FDGLLELWDLQHGCRVLQTKAH 1758
Cdd:COG2319    124 RSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGsDDGTVRLWDLATGKLLRTLTGH 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1759 QYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQ-LAFQHTYPKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKD 1837
Cdd:COG2319    204 TGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKlLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRT 283
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1838 LGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGR 1917
Cdd:COG2319    284 LTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGE 363
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 2462541457 1918 P----RGHLGslslsPALSVALSPDGDRVAVGYRADGIRIYKISS 1958
Cdd:COG2319    364 LlrtlTGHTG-----AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
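
Each hit on this page carries a statistics line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. A minimal sketch for pulling those fields out of the plain-text report follows; the regular expression and the name `parse_stats` are assumptions fitted to the lines shown here, not an official NCBI format specification.

```python
import re

# Pattern fitted to the statistics lines as they appear in this report.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional tag such as [Multi-domain]
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Parse one statistics line into a dict, or return None if it does not match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


print(parse_stats("Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 179.34  E-value: 1.84e-48"))
```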
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1799-2174 5.08e-44

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 162.89  E-value: 5.08e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1799 YPKSLNCVAFHPEGQVIATGSWagsisffqvdglkvtkdlgapgasirtlafnvpggvvavgrlDSMVELWAWREGARLA 1878
Cdd:cd00200      8 HTGGVTCVAFSPDGKLLATGSG------------------------------------------DGTIKVWDLETGELLR 45
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1879 AFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRP----RGHLGSLSlspalSVALSPDGdRVAVGYRADG-IRI 1953
Cdd:cd00200     46 TLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECvrtlTGHTSYVS-----SVAFSPDG-RILSSSSRDKtIKV 119
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1954 YKISSGS-----QGAQGQALDVAVSalawlspkvlvsgaedgslqgwalkecslqslwllsrfqkpvlglaTSQELLASA 2028
Cdd:cd00200    120 WDVETGKclttlRGHTDWVNSVAFS----------------------------------------------PDGTFVASS 153
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2029 SEDFTVQLWprqlltrphKAEDFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWDVRTPKtpvLIHSFPAcHRD 2108
Cdd:cd00200    154 SQDGTIKLW---------DLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGK---CLGTLRG-HEN 220
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462541457 2109 WVTGCAWTKDN-LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVSAVAAVEE--HVVSVSRDGTLKVWD 2174
Cdd:cd00200    221 GVNSVAFSPDGyLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDgkRLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1680-1955 1.96e-36

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 140.93  E-value: 1.96e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1680 TAVAFSTNGQRAAVGTANGTVYLLDLRTWQEEKSVVSGCDGISACLFLSDDTLFLTA-FDGLLELWDLQHGCRVLQTKAH 1758
Cdd:cd00200     13 TCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGsSDKTIRLWDLETGECVRTLTGH 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1759 QYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLAF---QHTypKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVT 1835
Cdd:cd00200     93 TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTtlrGHT--DWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCV 170
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1836 KDLGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREGARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSL 1915
Cdd:cd00200    171 ATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRT 250
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 2462541457 1916 GRPRGHLGSLSlSPALSVALSPDGDRVAVGYrADG-IRIYK 1955
Cdd:cd00200    251 GECVQTLSGHT-NSVTSLAWSPDGKRLASGS-ADGtIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1719-1996 1.98e-36

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 140.93  E-value: 1.98e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1719 DGISACLFLSDDTLFLTAF-DGLLELWDLQHGCRVLQTKAHQYQITGCCLSPDCRLLATVCLGGCLKLWDTVRGQLA--- 1794
Cdd:cd00200     10 GGVTCVAFSPDGKLLATGSgDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVrtl 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1795 FQHTypKSLNCVAFHPEGQVIATGSWAGSISFFQVDGLKVTKDLGAPGASIRTLAFNVPGGVVAVGRLDSMVELWAWREG 1874
Cdd:cd00200     90 TGHT--SYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTG 167
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1875 ARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWSGSLGRP----RGHLGSLslspaLSVALSPDGDRVAVGYRADG 1950
Cdd:cd00200    168 KCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKClgtlRGHENGV-----NSVAFSPDGYLLASGSEDGT 242
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 2462541457 1951 IRIYKISSGSQGAQGQALDVAVSALAWlSP--KVLVSGAEDGSLQGWA 1996
Cdd:cd00200    243 IRVWDLRTGECVQTLSGHTNSVTSLAW-SPdgKRLASGSADGTIRIWD 289
NACHT pfam05729
NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in ...
1162-1337 2.19e-32

NACHT domain; This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.


Pssm-ID: 428606 [Multi-domain]  Cd Length: 166  Bit Score: 124.72  E-value: 2.19e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1162 RLSLVTGQSGQGKTAFLASLVSALQAPDGAKVASLVFFHFSGARPDQGLAltllRRLCTYLRGQLKEPGALPSTYRSLVW 1241
Cdd:pfam05729    1 RTVILQGEAGSGKTTLLQKLALLWAQGKLPQGFDFVFFLPCRELSRSGNA----RSLADLLFSQWPEPAAPVSEVWAVIL 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1242 ELQQRLLpksaeslhpgqtqvLIIDGADRLVDQNGQ---------LISDWIPKKLPRCVHLVLSVSSDAG--LGETLEQS 1310
Cdd:pfam05729   77 ELPERLL--------------LILDGLDELVSDLGQldgpcpvltLLSSLLRKKLLPGASLLLTVRPDALrdLRRGLEEP 142
                          170       180
                   ....*....|....*....|....*..
gi 2462541457 1311 QgahVLALGPLEASARARLVREELALY 1337
Cdd:pfam05729  143 R---YLEVRGFSESDRKQYVRKYFSDE 166
DUF4062 pfam13271
Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. ...
900-1008 1.04e-15

Domain of unknown function (DUF4062); This presumed domain is functionally uncharacterized. This domain family is found in bacteria, archaea and eukaryotes, and is approximately 80 amino acids in length. There is a conserved SST sequence motif.


Pssm-ID: 463823  Cd Length: 78  Bit Score: 74.16  E-value: 1.04e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457  900 RLFISSTFRDMHGERDLLLRSVlpalqaRAAPHrislhgidLRWGVTEEETRRNRQLEVCLGEVENAQLFVGILGSRYGY 979
Cdd:pfam13271    1 KVFISSTFYDLKEEREALIEAL------LELGH--------IPVGMEEFPASDESPLDVCLREVDECDIYILILGGRYGS 66
                           90       100
                   ....*....|....*....|....*....
gi 2462541457  980 IPpsynlpdhphfhwaqqyPSGRSVTEME 1008
Cdd:pfam13271   67 ID-----------------PDGISYTELE 78
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
1-29 8.83e-15

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 69.74  E-value: 8.83e-15
                           10        20
                   ....*....|....*....|....*....
gi 2462541457    1 MEKLHGHVSAHPDILSLENRCLAMLPDLQ 29
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
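
The paired `gi` and `Cdd:` rows can be compared column by column to get a rough identity count. The sketch below treats lowercase letters and `-` as unaligned columns, which is an assumption about the display convention, and the name `aligned_identity` is hypothetical. It expects the two residue strings of one hit already joined across alignment blocks, with coordinates and labels stripped.

```python
def aligned_identity(query_row, cdd_row):
    """Count (identical, aligned) columns for two equal-length alignment rows.

    A column counts as aligned only when both rows show an uppercase residue;
    lowercase residues and '-' gaps are skipped (assumed to be unaligned).
    """
    if len(query_row) != len(cdd_row):
        raise ValueError("alignment rows must have the same length")
    identical = aligned = 0
    for q, c in zip(query_row, cdd_row):
        if q.isupper() and c.isupper():
            aligned += 1
            if q == c:
                identical += 1
    return identical, aligned


# First TEP1_N hit (query residues 1-29) against the pfam05386 row.
ident, total = aligned_identity(
    "MEKLHGHVSAHPDILSLENRCLAMLPDLQ",
    "MEKPHGHVSAHPDILSLENRCLATLPDLK",
)
print(f"{ident}/{total} identical columns")
```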
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
91-119 3.67e-14

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 68.20  E-value: 3.67e-14
                           10        20
                   ....*....|....*....|....*....
gi 2462541457   91 MEKPHGHVSAHPDILSLENRCLATLSSLK 119
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
61-89 1.51e-13

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 66.27  E-value: 1.51e-13
                           10        20
                   ....*....|....*....|....*....
gi 2462541457   61 MEKPHGYVSAHPDILSLENQCLATLSDLK 89
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
TEP1_N pfam05386
TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus ...
31-59 6.92e-12

TEP1 N-terminal domain; This short sequence region is found in four copies at the N-terminus of the TEP1 telomerase component. The functional significance of the region is uncertain. However the conservation of two histidines and a cysteine suggests it is a potential zinc binding domain.


Pssm-ID: 428450  Cd Length: 29  Bit Score: 61.65  E-value: 6.92e-12
                           10        20
                   ....*....|....*....|....*....
gi 2462541457   31 LEKLHQHVSTHSDILSLKNQCLATLPDLK 59
Cdd:pfam05386    1 MEKPHGHVSAHPDILSLENRCLATLPDLK 29
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
2231-2391 2.55e-11

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 66.97  E-value: 2.55e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2231 THTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLWQVPKE-------------------ADDTCIPRSS----------- 2280
Cdd:cd00200      2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellrtlkghtgpvrdvaasADGTYLASGSsdktirlwdle 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2281 ------------AAVTAVAWAPDGSMAVSGNQAGELILWQEAKAVATAQAPGHIG---ALIWSSAHTfFVLSA--DEKIS 2343
Cdd:cd00200     82 tgecvrtltghtSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDwvnSVAFSPDGT-FVASSsqDGTIK 160
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 2462541457 2344 EWQvkLRKGSAPGNLSLHLNRIlqedlgvlTSLDWAPDGHFLILAKAD 2391
Cdd:cd00200    161 LWD--LRTGKCVATLTGHTGEV--------NSVAFSPDGEKLLSSSSD 198
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
2050-2089 1.79e-07

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 49.23  E-value: 1.79e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 2462541457  2050 DFPCGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWD 2089
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
PTZ00421 PTZ00421
coronin; Provisional
1962-2199 4.25e-07

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 55.28  E-value: 4.25e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1962 GAQGQALDVAVSALawlSPKVLVSGAEDGSLQGWALKECSLQSlwllsrfqkpvlglATSQELLasasedftvqlwprql 2041
Cdd:PTZ00421    73 GQEGPIIDVAFNPF---DPQKLFTASEDGTIMGWGIPEEGLTQ--------------NISDPIV---------------- 119
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2042 ltrphkaedfpcgtELRGHEGPVSCCSFSTDG-GSLATGGRDRSLLCWDVRTPKTPVLIhsfpACHRDWVTGCAWTKD-N 2119
Cdd:PTZ00421   120 --------------HLQGHTKKVGIVSFHPSAmNVLASAGADMVVNVWDVERGKAVEVI----KCHSDQITSLEWNLDgS 181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2120 LLISCSSDGSVGLWDPESGQRLGQFLGHQSAVS--AVAAVEEHVV-----SVSRDGTLKVWDHQGVEltsIPAHSGPISH 2192
Cdd:PTZ00421   182 LLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSqrCLWAKRKDLIitlgcSKSQQRQIMLWDTRKMA---SPYSTVDLDQ 258

                   ....*..
gi 2462541457 2193 CAAAMEP 2199
Cdd:PTZ00421   259 SSALFIP 265
WD40 pfam00400
WD domain, G-beta repeat;
2053-2089 6.13e-07

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 47.73  E-value: 6.13e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2462541457 2053 CGTELRGHEGPVSCCSFSTDGGSLATGGRDRSLLCWD 2089
Cdd:pfam00400    3 LLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
2232-2265 4.25e-06

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 45.38  E-value: 4.25e-06
                            10        20        30
                    ....*....|....*....|....*....|....
gi 2462541457  2232 HTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLW 2265
Cdd:smart00320    6 KTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
2095-2134 6.55e-06

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 45.00  E-value: 6.55e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 2462541457  2095 TPVLIHSFPAcHRDWVTGCAWTKD-NLLISCSSDGSVGLWD 2134
Cdd:smart00320    1 SGELLKTLKG-HTGPVTSVAFSPDgKYLASGSDDGTIKLWD 40
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
1972-2147 8.48e-06

protein SPA1-RELATED; Provisional


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 51.24  E-value: 8.48e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1972 VSALAWLS--PKVLVSGAEDGSLQGWALKECSLQSLwlLSRFQKPVLGLATSQE---LLASASEDFTVQLWPRQlltrph 2046
Cdd:PLN00181   535 LSGICWNSyiKSQVASSNFEGVVQVWDVARSQLVTE--MKEHEKRVWSIDYSSAdptLLASGSDDGSVKLWSIN------ 606
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 2047 kaEDFPCGTelRGHEGPVSCCSFSTDGG-SLATGGRDRSLLCWDVRTPKTPVlihsfpaC----HRDWVTGCAWTKDNLL 2121
Cdd:PLN00181   607 --QGVSIGT--IKTKANICCVQFPSESGrSLAFGSADHKVYYYDLRNPKLPL-------CtmigHSKTVSYVRFVDSSTL 675
                          170       180       190
                   ....*....|....*....|....*....|..
gi 2462541457 2122 ISCSSDGSVGLWD---PESG---QRLGQFLGH 2147
Cdd:PLN00181   676 VSSSTDNTLKLWDlsmSISGineTPLHSFMGH 707
WD40 pfam00400
WD domain, G-beta repeat;
2230-2265 1.17e-05

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 44.26  E-value: 1.17e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 2462541457 2230 QTHTLLGHSGPVRAAAVSETSGLMLTASEDGSVRLW 2265
Cdd:pfam00400    3 LLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
WD40 pfam00400
WD domain, G-beta repeat;
2098-2134 1.45e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 41.18  E-value: 1.45e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 2462541457 2098 LIHSFPAcHRDWVTGCAWTKD-NLLISCSSDGSVGLWD 2134
Cdd:pfam00400    3 LLKTLEG-HTGSVTSLAFSPDgKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1756-1787 2.06e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 40.76  E-value: 2.06e-04
                            10        20        30
                    ....*....|....*....|....*....|..
gi 2462541457  1756 KAHQYQITGCCLSPDCRLLATVCLGGCLKLWD 1787
Cdd:smart00320    9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
2137-2174 4.47e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.99  E-value: 4.47e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 2462541457  2137 SGQRLGQFLGHQSAVSAVA--AVEEHVVSVSRDGTLKVWD 2174
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAfsPDGKYLASGSDDGTIKLWD 40
AAA_16 pfam13191
AAA ATPase domain; This family of domains contain a P-loop motif that is characteristic of the ...
1147-1280 6.90e-04

AAA ATPase domain; This family of domains contains a P-loop motif that is characteristic of the AAA superfamily.


Pssm-ID: 433025 [Multi-domain]  Cd Length: 167  Bit Score: 42.88  E-value: 6.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1147 RLLQDTVQRLMLPHGRLSLVTGQSGQGKTAFLASLVSALqAPDGAKVASLVFFHFSGARP--DQGLALTLLRRLCT---- 1220
Cdd:pfam13191   10 EQLLDALDRVRSGRPPSVLLTGEAGTGKTTLLRELLRAL-ERDGGYFLRGKCDENLPYSPllEALTREGLLRQLLDeles 88
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462541457 1221 --------YLRGQLKEPGALPSTYRSLVWELQQRLLPKSAESLHPgqtQVLIIDGADRLVDQNGQLIS 1280
Cdd:pfam13191   89 slleawraALLEALAPVPELPGDLAERLLDLLLRLLDLLARGERP---LVLVLDDLQWADEASLQLLA 153
WD40 pfam00400
WD domain, G-beta repeat;
2138-2174 8.18e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 39.25  E-value: 8.18e-04
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2462541457 2138 GQRLGQFLGHQSAVSAVA--AVEEHVVSVSRDGTLKVWD 2174
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAfsPDGKLLASGSDDGTVKVWD 39
YcjX COG3106
Ras-like GTP-binding stress-induced protein YcjX, DUF463 family [Signal transduction ...
1149-1199 1.01e-03

Ras-like GTP-binding stress-induced protein YcjX, DUF463 family [Signal transduction mechanisms];


Pssm-ID: 442340  Cd Length: 467  Bit Score: 44.41  E-value: 1.01e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462541457 1149 LQDTVQRLMLPHGRLSlVTGQSGQGKTAFLASLVSALQApdGAKVASLVFF 1199
Cdd:COG3106     11 LADLANRLLDRHLRLA-VTGLSRSGKTAFITSLVNQLLH--GGSGARLPLF 58
AAA_22 pfam13401
AAA domain;
1160-1271 1.81e-03

AAA domain;


Pssm-ID: 379165 [Multi-domain]  Cd Length: 129  Bit Score: 40.79  E-value: 1.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1160 HGRLSLVTGQSGQGKTAFLASLVSALQAPDgakvASLVFFHFSGarpdqglaLTLLRRLCTYLRGQLKEPGALPSTYRSL 1239
Cdd:pfam13401    4 GAGILVLTGESGTGKTTLLRRLLEQLPEVR----DSVVFVDLPS--------GTSPKDLLRALLRALGLPLSGRLSKEEL 71
                           90       100       110
                   ....*....|....*....|....*....|..
gi 2462541457 1240 VWELQQRLlpksaesLHPGQTQVLIIDGADRL 1271
Cdd:pfam13401   72 LAALQQLL-------LALAVAVVLIIDEAQHL 96
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1874-1912 6.08e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 36.52  E-value: 6.08e-03
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 2462541457  1874 GARLAAFPAHHGFVAAALFLHAGCQLLTAGEDGKVQVWS 1912
Cdd:smart00320    2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
1756-1787 6.22e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 36.55  E-value: 6.22e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 2462541457 1756 KAHQYQITGCCLSPDCRLLATVCLGGCLKLWD 1787
Cdd:pfam00400    8 EGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
ExeA COG3267
Type II secretory pathway ATPase component GspA/ExeA/MshM [Intracellular trafficking, ...
1160-1271 9.35e-03

Type II secretory pathway ATPase component GspA/ExeA/MshM [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 442498 [Multi-domain]  Cd Length: 261  Bit Score: 40.54  E-value: 9.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462541457 1160 HGRLSLVTGQSGQGKTAFLASLVSALqaPDGAKVASLVFFHFSgarpdqglALTLLRRLCTYLRGQLKepgalPSTYRSL 1239
Cdd:COG3267     42 GGGFVVLTGEVGTGKTTLLRRLLERL--PDDVKVAYIPNPQLS--------PAELLRAIADELGLEPK-----GASKADL 106
                           90       100       110
                   ....*....|....*....|....*....|..
gi 2462541457 1240 VWELQQRLLPKSAESLHPgqtqVLIIDGADRL 1271
Cdd:COG3267    107 LRQLQEFLLELAAAGRRV----VLIIDEAQNL 134
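
Several hits in this list cover overlapping stretches of the query; for example, the NACHT, AAA_16, YcjX, AAA_22 and ExeA hits all fall between residues 1147 and 1337. A small sketch for collapsing such hit intervals into distinct query regions is given below. The helper name `merge_intervals` is hypothetical, and merging by simple overlap is an illustrative choice, not how CD-Search selects specific or superfamily hits.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) residue intervals; ends are inclusive."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Intervals copied from the hit list on this page.
hits = [
    (1162, 1337),  # NACHT, pfam05729
    (1147, 1280),  # AAA_16, pfam13191
    (1149, 1199),  # YcjX, COG3106
    (1160, 1271),  # AAA_22, pfam13401
    (1160, 1271),  # ExeA, COG3267
    (900, 1008),   # DUF4062, pfam13271
]
print(merge_intervals(hits))
```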
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
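
The E-value threshold of 0.01 listed above explains why the weakest hit on this page (ExeA, 9.35e-03) is still reported. As a rough sketch, the same cutoff can be applied to parsed hits as shown below. The tuple layout, the name `filter_hits`, and the inclusive comparison are assumptions, and the last sample entry is invented purely to show a hit being rejected.

```python
def filter_hits(hits, threshold=0.01):
    """Keep hits with E-value <= threshold, sorted from most to least significant.

    `hits` is a list of (name, accession, e_value) tuples. Whether the real
    cutoff is inclusive is not stated in the report; <= is assumed here.
    """
    kept = [h for h in hits if h[2] <= threshold]
    return sorted(kept, key=lambda h: h[2])


sample = [
    ("ExeA", "COG3267", 9.35e-03),
    ("WD40", "cd00200", 5.72e-53),
    ("NACHT", "pfam05729", 2.19e-32),
    ("made-up", "none", 0.5),  # hypothetical entry, not from this page
]
for name, accession, e_value in filter_hits(sample):
    print(f"{name:8s} {accession:10s} {e_value:.2e}")
```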

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.