Conserved domains on  [gi|2462615106|ref|XP_054214531|]
cytoplasmic dynein 2 intermediate chain 1 isoform X2 [Homo sapiens]

Protein Classification

WD40 repeat domain-containing protein (domain architecture ID 1000017)

WD40 repeat domain-containing protein folds into a beta-propeller structure and functions as a scaffold, providing a platform for the interaction and assembly of several proteins into a signalosome; similar to a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly

CATH:  2.130.10.10
Gene Ontology:  GO:0005515
SCOP:  4002744

List of domain hits

Name Accession Description Interval E-value
PTZ00121 super family cl31754
MAEBL; Provisional
5-289 3.24e-07

MAEBL; Provisional


The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 54.76  E-value: 3.24e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106    5 EHKEPRCRDPDQDARSRDRVAEVHTAKESPRGerDRDRQRERRRDAKDREKEKLKEKHREAEKSHSRGKDREKEKDRRAR 84
Cdd:PTZ00121  1493 EEAKKKADEAKKAAEAKKKADEAKKAEEAKKA--DEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAK 1570
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106   85 KEELRQTVAhhnllgqeTRDRQLLERAERK-GRSVSKVRSEEKDEDSERGDEDRERRYRERKLQYGDSKDNPLKYWLYKE 163
Cdd:PTZ00121  1571 KAEEDKNMA--------LRKAEEAKKAEEArIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKE 1642
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  164 EGERRH----RKPREPDRDKKHREKSSTRE---------KREKYSKEKSNSFSDKGEERHK----------EKRHKEGFH 220
Cdd:PTZ00121  1643 AEEKKKaeelKKAEEENKIKAAEEAKKAEEdkkkaeeakKAEEDEKKAAEALKKEAEEAKKaeelkkkeaeEKKKAEELK 1722
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462615106  221 FDDERHQSNVDRKEKSAKDEPRKRES--QNGEHRNRGASSKRDGTSS-----QHAENLVRNHGKDKDSRRKHGHEE 289
Cdd:PTZ00121  1723 KAEEENKIKAEEAKKEAEEDKKKAEEakKDEEEKKKIAHLKKEEEKKaeeirKEKEAVIEEELDEEDEKRRMEVDK 1798
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
866-938 1.16e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 41.94  E-value: 1.16e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462615106  866 VNVIDFSPFGEPiFLAGCSDGSIRLHQLSSAFPLLQWDSSTDshAVTGLQWSPTRpAVFLVQDDTSNIYIWDL 938
Cdd:cd00200    180 VNSVAFSPDGEK-LLSSSSDGTIKLWDLSTGKCLGTLRGHEN--GVNSVAFSPDG-YLLASGSEDGTIRVWDL 248
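Each hit above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and its alignment rows give start and end coordinates on both the query and the CDD model. As a rough sketch (the regular expression and the function names are illustrative assumptions, not part of any NCBI tool), the fields can be extracted and used to estimate how much of each model the alignment spans:

```python
import re

# Summary lines copied from the two hits listed above.
SUMMARY_LINES = {
    "PTZ00121": "Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 54.76  E-value: 3.24e-07",
    "cd00200": "Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 41.94  E-value: 1.16e-03",
}

# First and last aligned model (Cdd) positions, read from the alignment rows.
MODEL_SPANS = {"PTZ00121": (1493, 1798), "cd00200": (180, 248)}

# Hypothetical pattern for the summary line; field names follow the page text.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the numeric fields of one hit summary line as a dict."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"unrecognised summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(span, cd_length):
    """Fraction of the model lying between the first and last aligned position."""
    start, end = span
    return (end - start + 1) / cd_length


for name, line in SUMMARY_LINES.items():
    fields = parse_summary(line)
    coverage = model_coverage(MODEL_SPANS[name], fields["cd_length"])
    print(f"{name}: E-value {fields['e_value']:.2e}, spans {coverage:.1%} of the model")
```

With the coordinates shown, the WD40 alignment spans 69 of the 289 model positions (about 24%), and the PTZ00121 alignment spans 306 of 2084 (about 15%).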
 
Name Accession Description Interval E-value
PTZ00121 PTZ00121
MAEBL; Provisional
5-289 3.24e-07

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 54.76  E-value: 3.24e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106    5 EHKEPRCRDPDQDARSRDRVAEVHTAKESPRGerDRDRQRERRRDAKDREKEKLKEKHREAEKSHSRGKDREKEKDRRAR 84
Cdd:PTZ00121  1493 EEAKKKADEAKKAAEAKKKADEAKKAEEAKKA--DEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAK 1570
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106   85 KEELRQTVAhhnllgqeTRDRQLLERAERK-GRSVSKVRSEEKDEDSERGDEDRERRYRERKLQYGDSKDNPLKYWLYKE 163
Cdd:PTZ00121  1571 KAEEDKNMA--------LRKAEEAKKAEEArIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKE 1642
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  164 EGERRH----RKPREPDRDKKHREKSSTRE---------KREKYSKEKSNSFSDKGEERHK----------EKRHKEGFH 220
Cdd:PTZ00121  1643 AEEKKKaeelKKAEEENKIKAAEEAKKAEEdkkkaeeakKAEEDEKKAAEALKKEAEEAKKaeelkkkeaeEKKKAEELK 1722
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462615106  221 FDDERHQSNVDRKEKSAKDEPRKRES--QNGEHRNRGASSKRDGTSS-----QHAENLVRNHGKDKDSRRKHGHEE 289
Cdd:PTZ00121  1723 KAEEENKIKAEEAKKEAEEDKKKAEEakKDEEEKKKIAHLKKEEEKKaeeirKEKEAVIEEELDEEDEKRRMEVDK 1798
PTZ00121 PTZ00121
MAEBL; Provisional
5-403 1.42e-06

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 52.84  E-value: 1.42e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106    5 EHKEPRCRDPDQDARsrdRVAEVHTAKESPRgerdrDRQRERRRDAKDREKEKLKEKHREAEKSHSRGKDREKEKDRRA- 83
Cdd:PTZ00121  1114 ARKAEEAKKKAEDAR---KAEEARKAEDARK-----AEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAe 1185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106   84 ---RKEELRQtvAHHNLLGQETRDRQLLERAE--------RKGRSVSKVRSEEKDEDSERGDEDRERRYRERKLQYGDSK 152
Cdd:PTZ00121  1186 evrKAEELRK--AEDARKAEAARKAEEERKAEearkaedaKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMA 1263
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  153 DNPLKYWLYKEEgerrhrKPREPDRDKKHREKSSTREKREKYSKEKSNSFSDKGEERHKEKRHKEgfhfDDERHQSNVDR 232
Cdd:PTZ00121  1264 HFARRQAAIKAE------EARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKK----KAEEAKKKADA 1333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  233 KEKSAKDEPRKRESQNGEHRNRGASSKRDGTSSQHAENLVRNHGKDKDSRRKHGHEEGSSVwwKLDQRPGGEETVEIEKE 312
Cdd:PTZ00121  1334 AKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKAD--EAKKKAEEDKKKADELK 1411
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  313 ETDLENARADAYTASCEDDFEDyeddfevcdgddDESSNEPESREKLEELplaqKKEIQEIQRAINA-----ENERIGEL 387
Cdd:PTZ00121  1412 KAAAAKKKADEAKKKAEEKKKA------------DEAKKKAEEAKKADEA----KKKAEEAKKAEEAkkkaeEAKKADEA 1475
                          410
                   ....*....|....*.
gi 2462615106  388 SLKLFQKRGRTEFEKE 403
Cdd:PTZ00121  1476 KKKAEEAKKADEAKKK 1491
PTZ00121 PTZ00121
MAEBL; Provisional
2-407 1.18e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 49.75  E-value: 1.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106    2 DLPEHKEPRCRDPDQDARSRDRVAEVHTAKESPRGERDRDRQRERRRDAKDREKEKLKEKHREAEKShsrgKDREKEKDR 81
Cdd:PTZ00121  1282 ELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAA----KAEAEAAAD 1357
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106   82 RARKEELRQTVAHHnllgQETRDRQLLERAERKGRSVSKVRSEEK--DEDSERGDEDRERRYRERKLQYGDSKDNPLKyw 159
Cdd:PTZ00121  1358 EAEAAEEKAEAAEK----KKEEAKKKADAAKKKAEEKKKADEAKKkaEEDKKKADELKKAAAAKKKADEAKKKAEEKK-- 1431
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  160 lYKEEGERRHRKPREPDRDKKHREKSSTREKREKYSKE--KSNSFSDKGEERHK-----------EKRHKEGFHFDDERH 226
Cdd:PTZ00121  1432 -KADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEakKADEAKKKAEEAKKadeakkkaeeaKKKADEAKKAAEAKK 1510
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  227 QSNVDRK--EKSAKDEPRKRESQNGEHRNRGASSKRDGTSSQHAENLVRNHGKDKDSRRKHGHEEGSSVWWKLDQRPGGE 304
Cdd:PTZ00121  1511 KADEAKKaeEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAE 1590
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  305 ETVEIEKEETDLENARADAYTASCEDDFEDYEddfevcdgddDESSNEPESREKLEELPLAQKKEIQEIQRAINAENE-- 382
Cdd:PTZ00121  1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKA----------EELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEEnk 1660
                          410       420
                   ....*....|....*....|....*.
gi 2462615106  383 -RIGELSLKLFQKRGRTEFEKEPRTD 407
Cdd:PTZ00121  1661 iKAAEEAKKAEEDKKKAEEAKKAEED 1686
PTZ00121 PTZ00121
MAEBL; Provisional
21-279 1.84e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 48.98  E-value: 1.84e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106   21 RDRVAEVHTAKESPRGERDRDRQRERRRDAKDREK-EKLKEKHREAEKS--HSRGKDREKEKDRRARK-------EELRQ 90
Cdd:PTZ00121  1450 KKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKaDEAKKKAEEAKKKadEAKKAAEAKKKADEAKKaeeakkaDEAKK 1529
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106   91 T----VAHHNLLGQETRDRQLLERAERKGRSVSKVRSEEKDEDSERGDEDRERRYRERKLQygdSKDNPLKYWLYKEEGE 166
Cdd:PTZ00121  1530 AeeakKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAE---EARIEEVMKLYEEEKK 1606
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  167 RRHRKPREPDRDKKHREKSstreKREKYSKEKSNSFSDKGEErhkEKRHKEGFHFDDERHQSNVDRKEKSAKDEPRKRES 246
Cdd:PTZ00121  1607 MKAEEAKKAEEAKIKAEEL----KKAEEEKKKVEQLKKKEAE---EKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEE 1679
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 2462615106  247 ---QNGEHRNRGASSKRDGTSSQHAENLVRNHGKDK 279
Cdd:PTZ00121  1680 akkAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEK 1715
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
51-193 4.13e-05

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 47.20  E-value: 4.13e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106   51 KDREKEKLKEKHREaekshsRGKDREKEKDRRARKEELRQTVAHHnllgqETRDRQLLERAERKGRSVSKVRSEEKDEDS 130
Cdd:TIGR01642    1 RDEEPDREREKSRG------RDRDRSSERPRRRSRDRSRFRDRHR-----RSRERSYREDSRPRDRRRYDSRSPRSLRYS 69
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462615106  131 ERGDedrerryrerklqygdSKDNPLK--YWLYKEEGERRHRKPREPDRDKKHREKSSTREKREK 193
Cdd:TIGR01642   70 SVRR----------------SRDRPRRrsRSVRSIEQHRRRLRDRSPSNQWRKDDKKRSLWDIKP 118
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
54-254 1.04e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 46.27  E-value: 1.04e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106   54 EKEKLK-EKHREAEKSHSRGKDREKEkdrRARKEELRQTVAHHNLLGQETRDRQLL-ERAERKGRSVSKVRSEEkdEDSE 131
Cdd:pfam17380  338 EQERMAmERERELERIRQEERKRELE---RIRQEEIAMEISRMRELERLQMERQQKnERVRQELEAARKVKILE--EERQ 412
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  132 RGDEDRERRYRERKLQYGDSKDNPLKYW----------LYKEEGERRHRKPR----EPDRDKKHREKSSTREKREKYSKE 197
Cdd:pfam17380  413 RKIQQQKVEMEQIRAEQEEARQREVRRLeeeraremerVRLEEQERQQQVERlrqqEEERKRKKLELEKEKRDRKRAEEQ 492
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462615106  198 KSNSFSDKGEERHK----EKRHKEGFHFDDERHQSNVDRKEKSAKDEPRKRESQNGEHRNR 254
Cdd:pfam17380  493 RRKILEKELEERKQamieEERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRR 553
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
171-260 3.56e-04

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 44.50  E-value: 3.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  171 KPREPDRDK-----KHREKSSTREKREKYSKEKSNSFSDKGEER--HKEKRHKEGFHFDDERHQSNVDRKEKSAKDEPRK 243
Cdd:TIGR01642    1 RDEEPDREReksrgRDRDRSSERPRRRSRDRSRFRDRHRRSRERsyREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRR 80
                           90       100
                   ....*....|....*....|
gi 2462615106  244 RE---SQNGEHRNRGASSKR 260
Cdd:TIGR01642   81 RSrsvRSIEQHRRRLRDRSP 100
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
59-216 8.19e-04

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 43.34  E-value: 8.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106   59 KEKHREAEKSHSRGKDREKEKDRRARKEELRQtvahhnllgqETRDRQLLERAERKG-RSVSKVRSEEKDEDSERgdedr 137
Cdd:TIGR01642    3 EEPDREREKSRGRDRDRSSERPRRRSRDRSRF----------RDRHRRSRERSYREDsRPRDRRRYDSRSPRSLR----- 67
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462615106  138 erryrerklqygdskdnplkywlykeegERRHRKPREPDRdkkHREKSSTREKREkysKEKSNSFSDKGEERHKEKRHK 216
Cdd:TIGR01642   68 ----------------------------YSSVRRSRDRPR---RRSRSVRSIEQH---RRRLRDRSPSNQWRKDDKKRS 112
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
161-280 9.49e-04

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H. sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00076). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 42.98  E-value: 9.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  161 YKEEGERRHRKPREPDRDKKHREKSSTREKREKYSKEKSnsfSDKGEERHKEKrhkegfhfDDERhqsnvDRKEKSAKDE 240
Cdd:TIGR01622    2 YRDRERERLRDSSSAGDRDRRRDKGRERSRDRSRDRERS---RSRRRDRHRDR--------DYYR-----GRERRSRSRR 65
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 2462615106  241 PRKRESQNGEHRNRGASSKRDGTSSQHAENLVRNHGKDKD 280
Cdd:TIGR01622   66 PNRRYRPREKRRRRGDSYRRRRDDRRSRREKPRARDGTPE 105
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
866-938 1.16e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 41.94  E-value: 1.16e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462615106  866 VNVIDFSPFGEPiFLAGCSDGSIRLHQLSSAFPLLQWDSSTDshAVTGLQWSPTRpAVFLVQDDTSNIYIWDL 938
Cdd:cd00200    180 VNSVAFSPDGEK-LLSSSSDGTIKLWDLSTGKCLGTLRGHEN--GVNSVAFSPDG-YLLASGSEDGTIRVWDL 248
Caldesmon pfam02029
Caldesmon;
13-271 1.30e-03

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 42.55  E-value: 1.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106   13 DPDQDARSRDRVAevhtAKESPRGERDRDRQRERRRDAKDREKEKLKEKHREAEKSHSRGKDREKEKDRRARKEELRQTV 92
Cdd:pfam02029    3 DEEEAARERRRRA----REERRRQKEEEEPSGQVTESVEPNEHNSYEEDSELKPSGQGGLDEEEAFLDRTAKREERRQKR 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106   93 AHHNLLGQETRDRQLLER----AERKGR----SVSKVRSEEKDEDSERGDEDRERRYRERKLQYGDSKDNPLKYwlyKEE 164
Cdd:pfam02029   79 LQEALERQKEFDPTIADEkesvAERKENneeeENSSWEKEEKRDSRLGRYKEEETEIREKEYQENKWSTEVRQA---EEE 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  165 GERRHRKPRE----PDRDKKHREKSSTREKREKYSKEKSNSFSDKGEERHKEK------RHKEGFHFDDERHQSNVDRKE 234
Cdd:pfam02029  156 GEEEEDKSEEaeevPTENFAKEEVKDEKIKKEKKVKYESKVFLDQKRGHPEVKsqngeeEVTKLKVTTKRRQGGLSQSQE 235
                          250       260       270
                   ....*....|....*....|....*....|....*..
gi 2462615106  235 KSAKDEPRKRESQNGEHRNRgassKRDGTSSQHAENL 271
Cdd:pfam02029  236 REEEAEVFLEAEQKLEELRR----RRQEKESEEFEKL 268
PTZ00121 PTZ00121
MAEBL; Provisional
53-403 2.94e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 41.67  E-value: 2.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106   53 REKEKLKEKHREAEKSHSRGKDREKEKDRRARKEELRQTVAHHNLLGQETRDRQLLERAE--RKGRSVSKVRSEEKDEDS 130
Cdd:PTZ00121  1084 KEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEeaRKAEDAKRVEIARKAEDA 1163
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  131 ERGDEDRERRYRERKLQYGDSKDNPLKYWLYKEEGERRHRKPREPDRDKKHREKSSTREKREKYSKEKSNSFSDKGEE-- 208
Cdd:PTZ00121  1164 RKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEak 1243
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  209 RHKEKRHKEGFHFDDE-------RHQSNVDRKEKSAKDEPRK-------RESQNGEHRNRGASSKRDGTSSQHAENLVRN 274
Cdd:PTZ00121  1244 KAEEERNNEEIRKFEEarmahfaRRQAAIKAEEARKADELKKaeekkkaDEAKKAEEKKKADEAKKKAEEAKKADEAKKK 1323
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462615106  275 --HGKDKDSRRKHGHEEGSsvwwKLDQRPGGEETVEIEKEETDLENARADAYTASCEDDFEDYEDDFEvcdgddDESSNE 352
Cdd:PTZ00121  1324 aeEAKKKADAAKKKAEEAK----KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKA------EEKKKA 1393
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462615106  353 PESREKLEElplaQKKEIQEIQRAiNAENERIGELSLKLFQKRGRTEFEKE 403
Cdd:PTZ00121  1394 DEAKKKAEE----DKKKADELKKA-AAAKKKADEAKKKAEEKKKADEAKKK 1439
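
The hits in this list overlap heavily on the query. A minimal sketch, using the query intervals exactly as listed and an illustrative helper (`merge_intervals` is a name chosen here, not an NCBI function), merges the overlapping ranges to show which stretches of the protein are covered by at least one hit:

```python
# (name, query start, query end) for every hit in the list, in page order.
HITS = [
    ("PTZ00121", 5, 289),
    ("PTZ00121", 5, 403),
    ("PTZ00121", 2, 407),
    ("PTZ00121", 21, 279),
    ("U2AF_lg", 51, 193),
    ("DUF5401", 54, 254),
    ("U2AF_lg", 171, 260),
    ("U2AF_lg", 59, 216),
    ("SF-CC1", 161, 280),
    ("WD40", 866, 938),
    ("Caldesmon", 13, 271),
    ("PTZ00121", 53, 403),
]


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the current block when this interval touches or overlaps it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


covered = merge_intervals([(start, end) for _, start, end in HITS])
print(covered)  # [(2, 407), (866, 938)]
```

On these intervals every hit except WD40 falls inside a single N-terminal block, residues 2-407, while the WD40 hit at 866-938 stands alone.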
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
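
The E-value threshold above acts as a cutoff on which hits are reported. As a small sketch, assuming the cutoff is inclusive (the page does not say whether it is) and using a hypothetical helper `passes_threshold`, the E-values from the hit list can be checked against it and against ascending order:

```python
# E-values of the listed hits, in the order they appear on the page.
E_VALUES = [
    3.24e-07, 1.42e-06, 1.18e-05, 1.84e-05, 4.13e-05, 1.04e-04,
    3.56e-04, 8.19e-04, 9.49e-04, 1.16e-03, 1.30e-03, 2.94e-03,
]

E_VALUE_THRESHOLD = 0.01  # taken from "E-value threshold: 0.01"


def passes_threshold(e_value, threshold=E_VALUE_THRESHOLD):
    """Return True when a hit is at least as significant as the cutoff (assumed inclusive)."""
    return e_value <= threshold


all_pass = all(passes_threshold(e) for e in E_VALUES)
is_sorted = E_VALUES == sorted(E_VALUES)
print(f"all hits within threshold: {all_pass}; listed in ascending E-value order: {is_sorted}")
```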
