Conserved domains on  [gi|578814386|ref|XP_006716104|]

cytoplasmic dynein 2 intermediate chain 1 isoform X3 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
PTZ00121 super family cl31754
MAEBL; Provisional
5-456 3.92e-09

MAEBL; Provisional


The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 61.31  E-value: 3.92e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386    5 KRRTKDDTWKADDLRKHLWAIQSGGSKEERKHREKKLRKESEM----DLPEHKEPRCRDPDQDARSRDRVAE--VHTAKE 78
Cdd:PTZ00121 1247 EERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKkkadEAKKAEEKKKADEAKKKAEEAKKADeaKKKAEE 1326
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386   79 SPRGERDRDRQRERRRDAKDREKEKLKEKHREAEKSHSRGKDREKEKDRRARKEELRQTVAHHNLLGQETRDRQllERAE 158
Cdd:PTZ00121 1327 AKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKA--EEDK 1404
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  159 RKGRSVSKVRSEEKDEDSERGDEDRERRYRERKLQYGDSKDnplkywlyKEEGERRHRKPREPDRDNKHREKSSTREKRE 238
Cdd:PTZ00121 1405 KKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKK--------ADEAKKKAEEAKKAEEAKKKAEEAKKADEAK 1476
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  239 KYSKEKSNSFSDKGEERHKEKRHKEGFHFDDERHQSNVDRK--EKSAKDEPRKRESQNGEHRNRGASSKRDGTSSQHAEN 316
Cdd:PTZ00121 1477 KKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKaeEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE 1556
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  317 LVRNHGKDKDSRRKHGHEEGSSVWWKLDQRPGGEEtvvRREIEKEETDLENARADAYTASCEDDFEDYEddfevcdgddD 396
Cdd:PTZ00121 1557 LKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEE---ARIEEVMKLYEEEKKMKAEEAKKAEEAKIKA----------E 1623
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578814386  397 ESSNEPESREKLEELPLAQKKEIQEIQRAINAENE---RIGELSLKLFQKRGRTEFEKEPRTD 456
Cdd:PTZ00121 1624 ELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEEnkiKAAEEAKKAEEDKKKAEEAKKAEED 1686
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
915-987 1.13e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 42.32  E-value: 1.13e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578814386  915 VNVIDFSPFGEPiFLAGCSDGSIRLHQLSSAFPLLQWDSSTDshAVTGLQWSPTRpAVFLVQDDTSNIYIWDL 987
Cdd:cd00200   180 VNSVAFSPDGEK-LLSSSSDGTIKLWDLSTGKCLGTLRGHEN--GVNSVAFSPDG-YLLASGSEDGTIRVWDL 248
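
Each hit in this listing carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch of how such a line could be read into typed fields. It assumes the four labelled fields always appear in the order shown here; the names `STATS_RE` and `parse_hit_stats` are hypothetical and are not part of any NCBI tool.

```python
import re

# Pattern for the per-hit statistics line as it is printed in this listing.
# Assumption: the labelled fields always occur in this order, and the
# bracketed tag (e.g. "[Multi-domain]") is optional.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<kind>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.]+(?:[eE][+-]?\d+)?)"
)


def parse_hit_stats(line):
    """Return the statistics fields of one hit as a dict, or None if the
    line is not a statistics line."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "kind": match.group("kind"),          # None when no [tag] is present
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "evalue": float(match.group("evalue")),
    }


if __name__ == "__main__":
    example = ("Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  "
               "Bit Score: 61.31  E-value: 3.92e-09")
    print(parse_hit_stats(example))
```

Parsed this way, hits can be sorted or filtered by E-value without re-reading the alignment blocks.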
 
 
Name Accession Description Interval E-value
PTZ00121 PTZ00121
MAEBL; Provisional
5-456 3.92e-09

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 61.31  E-value: 3.92e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386    5 KRRTKDDTWKADDLRKHLWAIQSGGSKEERKHREKKLRKESEM----DLPEHKEPRCRDPDQDARSRDRVAE--VHTAKE 78
Cdd:PTZ00121 1247 EERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKkkadEAKKAEEKKKADEAKKKAEEAKKADeaKKKAEE 1326
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386   79 SPRGERDRDRQRERRRDAKDREKEKLKEKHREAEKSHSRGKDREKEKDRRARKEELRQTVAHHNLLGQETRDRQllERAE 158
Cdd:PTZ00121 1327 AKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKA--EEDK 1404
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  159 RKGRSVSKVRSEEKDEDSERGDEDRERRYRERKLQYGDSKDnplkywlyKEEGERRHRKPREPDRDNKHREKSSTREKRE 238
Cdd:PTZ00121 1405 KKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKK--------ADEAKKKAEEAKKAEEAKKKAEEAKKADEAK 1476
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  239 KYSKEKSNSFSDKGEERHKEKRHKEGFHFDDERHQSNVDRK--EKSAKDEPRKRESQNGEHRNRGASSKRDGTSSQHAEN 316
Cdd:PTZ00121 1477 KKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKaeEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE 1556
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  317 LVRNHGKDKDSRRKHGHEEGSSVWWKLDQRPGGEEtvvRREIEKEETDLENARADAYTASCEDDFEDYEddfevcdgddD 396
Cdd:PTZ00121 1557 LKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEE---ARIEEVMKLYEEEKKMKAEEAKKAEEAKIKA----------E 1623
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578814386  397 ESSNEPESREKLEELPLAQKKEIQEIQRAINAENE---RIGELSLKLFQKRGRTEFEKEPRTD 456
Cdd:PTZ00121 1624 ELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEEnkiKAAEEAKKAEEDKKKAEEAKKAEED 1686
PTZ00121 PTZ00121
MAEBL; Provisional
8-452 6.45e-09

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 60.54  E-value: 6.45e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386    8 TKDDTWKADDLRKhlwaIQSGGSKEERKHREKKLRKESEMDLPEHKEPrcrdpdQDARsrdRVAEVHTAKESPRGERDRD 87
Cdd:PTZ00121 1093 TEEAFGKAEEAKK----TETGKAEEARKAEEAKKKAEDARKAEEARKA------EDAR---KAEEARKAEDAKRVEIARK 1159
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386   88 RQRERR-RDAKDREKEKLKEKHREAEKSHSRGKDREKEKDRRArkEELRQTvahhnllgQETRDRQLLERAE--RKGRSV 164
Cdd:PTZ00121 1160 AEDARKaEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKA--EAARKA--------EEERKAEEARKAEdaKKAEAV 1229
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  165 SKVRSEEKDEDSERGDEDRERRYRERKLQYGDSKDNPLKYWLYKEEgerrhrKPREPDRDNKHREKSSTREKREKYSKEK 244
Cdd:PTZ00121 1230 KKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAE------EARKADELKKAEEKKKADEAKKAEEKKK 1303
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  245 SNSFSDKGEERHK-EKRHKEGFHFDDERHQSNVDRKEKSAKDEPRKRESQNGEHRNRGASSKRDGTSSQHAENLVRNHGK 323
Cdd:PTZ00121 1304 ADEAKKKAEEAKKaDEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAA 1383
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  324 DKDSRRKHGHEEGSSvwwKLDQRPGGEETVVRREIEKEETDLENARADAYTASceddfedyeddfevcdgddDESSNEPE 403
Cdd:PTZ00121 1384 KKKAEEKKKADEAKK---KAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKA-------------------DEAKKKAE 1441
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....
gi 578814386  404 SREKLEELplaqKKEIQEIQRAINA-----ENERIGELSLKLFQKRGRTEFEKE 452
Cdd:PTZ00121 1442 EAKKADEA----KKKAEEAKKAEEAkkkaeEAKKADEAKKKAEEAKKADEAKKK 1491
PTZ00121 PTZ00121
MAEBL; Provisional
2-422 7.02e-09

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 60.15  E-value: 7.02e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386    2 EPGKRRTKDDTWKADDLRKhlwAIQSGGSKEERKHREKKLRKESEMDLPEHKEPRCRDPDQDARSRDRVAEVHTAKESPR 81
Cdd:PTZ00121 1394 DEAKKKAEEDKKKADELKK---AAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAK 1470
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386   82 GERDRDRQRERRRDAKDREK--EKLKEKHREAEK-SHSRGKDREKEKDRRARK-EELRQT----VAHHNLLGQETRDRQL 153
Cdd:PTZ00121 1471 KADEAKKKAEEAKKADEAKKkaEEAKKKADEAKKaAEAKKKADEAKKAEEAKKaDEAKKAeeakKADEAKKAEEKKKADE 1550
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  154 LERAERKGRSVSKVRSEEKDEDSERGDEDRERRYRERKLQygdSKDNPLKYWLYKEEgerRHRKPREPDRDNKHREKSST 233
Cdd:PTZ00121 1551 LKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAE---EARIEEVMKLYEEE---KKMKAEEAKKAEEAKIKAEE 1624
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  234 REKREKySKEKSNSFSDKGEErhkEKRHKEGFHFDDERHQSNVDRKEKSAKDEPRKRES---QNGEHRNRGASSKRDGTS 310
Cdd:PTZ00121 1625 LKKAEE-EKKKVEQLKKKEAE---EKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEakkAEEDEKKAAEALKKEAEE 1700
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  311 SQHAENLVRNHGKDKDSRRKHGHEEgssvwwklDQRPGGEETvVRREIEKEETDLENARADAYTASCEDDFEDYEDDFEV 390
Cdd:PTZ00121 1701 AKKAEELKKKEAEEKKKAEELKKAE--------EENKIKAEE-AKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAE 1771
                         410       420       430
                  ....*....|....*....|....*....|..
gi 578814386  391 CDGDDDESSNEPESREKLEELPLAQKKEIQEI 422
Cdd:PTZ00121 1772 EIRKEKEAVIEEELDEEDEKRRMEVDKKIKDI 1803
PTZ00121 PTZ00121
MAEBL; Provisional
6-452 2.79e-07

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 55.15  E-value: 2.79e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386    6 RRTKDDTWKADDLRKHLWAIQSGGSKEERKHREKKLRKESEmdlpEHKEPRCRDPDQDARSRDRVAEVHTAKESPRGERD 85
Cdd:PTZ00121 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAA----KAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAA 1383
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386   86 RDRQRERRRDakDREKEKLKEKHREAEKSHSRGKDREKEKDRRARKEELRQTVAHHNLLGQETRDRQLLERAERKGRSVS 165
Cdd:PTZ00121 1384 KKKAEEKKKA--DEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEE 1461
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  166 -KVRSEEK---DEDSERGDEDRERRYRERKLQYGDSKDNPLKYWL---YKEEGERRHRKPREPDRDNKHREKSSTREKRE 238
Cdd:PTZ00121 1462 aKKKAEEAkkaDEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAeakKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  239 KYSKEKSNSFSdKGEERHK--EKRHKEGFHFDDERHQSNVDRKEKSAKDEPRKRES----QNGEHRNRGASSKRDGTSSQ 312
Cdd:PTZ00121 1542 AEEKKKADELK-KAEELKKaeEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEvmklYEEEKKMKAEEAKKAEEAKI 1620
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  313 HAENlVRNHGKDKDSRRKHGHEEGSSVWWKLDQRPGGEETVVRREIEKEETDLENARADAYTASCEDDFEDyeddfevcd 392
Cdd:PTZ00121 1621 KAEE-LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA--------- 1690
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  393 gdDDESSNEPESREKLEELPLAQKKEIQEIQRAINAENERigelSLKLFQKRGRTEFEKE 452
Cdd:PTZ00121 1691 --AEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEN----KIKAEEAKKEAEEDKK 1744
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
18-300 9.20e-06

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 49.74  E-value: 9.20e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386    18 LRKHLWAIQSGGSKEERKHREKKLRKESEMdLPEHKEPRCRDPDQdarsRDRVAEVHTAKESprgerdrdrqRERRRDAK 97
Cdd:pfam17380  271 LNQLLHIVQHQKAVSERQQQEKFEKMEQER-LRQEKEEKAREVER----RRKLEEAEKARQA----------EMDRQAAI 335
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386    98 DREKEKLK-EKHREAEKSHSRGKDREKEkdrRARKEELRQTVAHHNLLGQETRDRQLL-ERAERKGRSVSKVRSEEkdED 175
Cdd:pfam17380  336 YAEQERMAmERERELERIRQEERKRELE---RIRQEEIAMEISRMRELERLQMERQQKnERVRQELEAARKVKILE--EE 410
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386   176 SERGDEDRERRYRERKLQYGDSKDNPLKYW----------LYKEEGERRHRKPR----EPDRDNKHREKSSTREKREKYS 241
Cdd:pfam17380  411 RQRKIQQQKVEMEQIRAEQEEARQREVRRLeeeraremerVRLEEQERQQQVERlrqqEEERKRKKLELEKEKRDRKRAE 490
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578814386   242 KEKSNSFSDKGEERHK----EKRHKEGFHFDDERHQSNVDRKEKSAKDEPRKRESQNGEHRNR 300
Cdd:pfam17380  491 EQRRKILEKELEERKQamieEERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRR 553
PTZ00121 PTZ00121
MAEBL; Provisional
1-294 7.72e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 47.06  E-value: 7.72e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386    1 MEPGKRRTKDDTWKADDLRKHLWAIQSGGSKEERKHREKKLRKESEMDLPEHKEPRCRDPDQDARSRDRVAEVHTAKESP 80
Cdd:PTZ00121 1635 VEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEE 1714
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386   81 RgerdrdrqrerrrdakdREKEKLKEKHREAEKSHSRGKDREKEKDRRArkEELRQTVAHHNLLGQETRDRQllERAERK 160
Cdd:PTZ00121 1715 K-----------------KKAEELKKAEEENKIKAEEAKKEAEEDKKKA--EEAKKDEEEKKKIAHLKKEEE--KKAEEI 1773
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  161 GRSVSKVRSEEKDEDSERGDEDRERRYRERK-----LQYGDSKDNPlkYWLYKEEGERRHRKpREPDRDNKHREKSSTRE 235
Cdd:PTZ00121 1774 RKEKEAVIEEELDEEDEKRRMEVDKKIKDIFdnfanIIEGGKEGNL--VINDSKEMEDSAIK-EVADSKNMQLEEADAFE 1850
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 578814386  236 KREKYSKEKSNSFSDKGEERHKEKRHKEgfhfDDERHQSNVDRKEKSAKDEPRKRESQN 294
Cdd:PTZ00121 1851 KHKFNKNNENGEDGNKEADFNKEKDLKE----DDEEEIEEADEIEKIDKDDIEREIPNN 1905
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
97-248 1.03e-04

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 46.04  E-value: 1.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386    97 KDREKEKLKEKHREaekshsRGKDREKEKDRRARKEELRQTVAHHnllgqETRDRQLLERAERKGRSVSKVRSEEKDEDS 176
Cdd:TIGR01642    1 RDEEPDREREKSRG------RDRDRSSERPRRRSRDRSRFRDRHR-----RSRERSYREDSRPRDRRRYDSRSPRSLRYS 69
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 578814386   177 ERGDedrerryrerklqygdSKDNPlkywlykeegeRRHRKPREPDRDNKHREKSSTREKREKYSKEKSNSF 248
Cdd:TIGR01642   70 SVRR----------------SRDRP-----------RRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRSLW 114
PTZ00121 PTZ00121
MAEBL; Provisional
99-452 2.52e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 45.52  E-value: 2.52e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386   99 REKEKLKEKHREAEKSHSRGKDREKEKDRRARKEELRQTVAHHNLLGQETRDRQLLERAE--RKGRSVSKVRSEEKDEDS 176
Cdd:PTZ00121 1084 KEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEeaRKAEDAKRVEIARKAEDA 1163
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  177 ERGDEDRERRYRERKLQYGDSKDNPLKYWLYKEEGERRHRKPREPDRDNKHREKSSTREKREKYSKEKSNSFSDKGEE-- 254
Cdd:PTZ00121 1164 RKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEak 1243
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  255 RHKEKRHKEGFHFDDE-------RHQSNVDRKEKSAKDEPRK-------RESQNGEHRNRGASSKRDGTSSQHAENLVRN 320
Cdd:PTZ00121 1244 KAEEERNNEEIRKFEEarmahfaRRQAAIKAEEARKADELKKaeekkkaDEAKKAEEKKKADEAKKKAEEAKKADEAKKK 1323
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  321 --HGKDKDSRRKHGHEEGSsvwwKLDQRPGGEETVVRREIEKEEtdlENARADAYTASCEDDFEDYEDDFEvcdgddDES 398
Cdd:PTZ00121 1324 aeEAKKKADAAKKKAEEAK----KAAEAAKAEAEAAADEAEAAE---EKAEAAEKKKEEAKKKADAAKKKA------EEK 1390
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....
gi 578814386  399 SNEPESREKLEElplaQKKEIQEIQRAiNAENERIGELSLKLFQKRGRTEFEKE 452
Cdd:PTZ00121 1391 KKADEAKKKAEE----DKKKADELKKA-AAAKKKADEAKKKAEEKKKADEAKKK 1439
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
217-306 4.63e-04

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 44.11  E-value: 4.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386   217 KPREPDRDN-----KHREKSSTREKREKYSKEKSNSFSDKGEER--HKEKRHKEGFHFDDERHQSNVDRKEKSAKDEPRK 289
Cdd:TIGR01642    1 RDEEPDREReksrgRDRDRSSERPRRRSRDRSRFRDRHRRSRERsyREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRR 80
                           90       100
                   ....*....|....*....|
gi 578814386   290 RE---SQNGEHRNRGASSKR 306
Cdd:TIGR01642   81 RSrsvRSIEQHRRRLRDRSP 100
PTZ00121 PTZ00121
MAEBL; Provisional
2-307 4.66e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 44.36  E-value: 4.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386    2 EPGKRRTKDDTWKADDLRKhlwaiqsggsKEERKHREKKLRKESEMdlpehKEPRCRDPDQDARSRDRVAEVHTAKESPR 81
Cdd:PTZ00121 1596 EVMKLYEEEKKMKAEEAKK----------AEEAKIKAEELKKAEEE-----KKKVEQLKKKEAEEKKKAEELKKAEEENK 1660
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386   82 gerdrdrqRERRRDAKDREKEKLK-EKHREAEKSHSRGKDREKEKDRRARK-EELRQTVAhhnllgQETRDRQLLERAER 159
Cdd:PTZ00121 1661 --------IKAAEEAKKAEEDKKKaEEAKKAEEDEKKAAEALKKEAEEAKKaEELKKKEA------EEKKKAEELKKAEE 1726
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386  160 KGRSVSKVRSEEKDEDSERGDEDRERRYRERKLQYgdskdnpLKYWLYKEEGERRHRKPR--EPDRDNKHREKSSTREKR 237
Cdd:PTZ00121 1727 ENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAH-------LKKEEEKKAEEIRKEKEAviEEELDEEDEKRRMEVDKK 1799
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 578814386  238 EKYSKEKSNSFSDKGEERH---KEKRHKEGFHFDDERHQSNVDRKE-KSAKDEPRKRESQNGEHRNRGASSKRD 307
Cdd:PTZ00121 1800 IKDIFDNFANIIEGGKEGNlviNDSKEMEDSAIKEVADSKNMQLEEaDAFEKHKFNKNNENGEDGNKEADFNKE 1873
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
915-987 1.13e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 42.32  E-value: 1.13e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578814386  915 VNVIDFSPFGEPiFLAGCSDGSIRLHQLSSAFPLLQWDSSTDshAVTGLQWSPTRpAVFLVQDDTSNIYIWDL 987
Cdd:cd00200   180 VNSVAFSPDGEK-LLSSSSDGTIKLWDLSTGKCLGTLRGHEN--GVNSVAFSPDG-YLLASGSEDGTIRVWDL 248
U2AF_lg TIGR01642
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...
105-262 1.20e-03

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 42.57  E-value: 1.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386   105 KEKHREAEKSHSRGKDREKEKDRRARKEELRQtvahhnllgqETRDRQLLERAERKG-RSVSKVRSEEKDEDSERgdedr 183
Cdd:TIGR01642    3 EEPDREREKSRGRDRDRSSERPRRRSRDRSRF----------RDRHRRSRERSYREDsRPRDRRRYDSRSPRSLR----- 67
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 578814386   184 erryrerklqygdskdnplkywlykeegERRHRKPREPDRdnkHREKSSTREKREkysKEKSNSFSDKGEERHKEKRHK 262
Cdd:TIGR01642   68 ----------------------------YSSVRRSRDRPR---RRSRSVRSIEQH---RRRLRDRSPSNQWRKDDKKRS 112
PRK12678 PRK12678
transcription termination factor Rho; Provisional
2-238 8.21e-03

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 40.27  E-value: 8.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386    2 EPGKRRTKDDTWKADDLRKHLWAIQSGGSKEERKHREKKLRKESEMDLPEHKEPRCRDPDQDARSRDRVAEVHTAKESPR 81
Cdd:PRK12678   74 AAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPA 153
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578814386   82 GERDRDRQRERRRDAKDREKEklkEKHREAEKSHSRGKDREKEKDRRARKEELRQTVAHHNllGQETRDRQLLERAERKG 161
Cdd:PRK12678  154 TEARADAAERTEEEERDERRR---RGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQ--GDRREERGRRDGGDRRG 228
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 578814386  162 RSVSKVRSEEKDEDSERGDedrerryrerklQYGDSKDNplkywlyKEEGERRHRKPRepDRDNKHREKSSTREKRE 238
Cdd:PRK12678  229 RRRRRDRRDARGDDNREDR------------GDRDGDDG-------EGRGGRRGRRFR--DRDRRGRRGGDGGNERE 284
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
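
To make the E-value threshold concrete, here is a minimal sketch that checks the last four hits in the list above against the 0.01 cutoff. The hit tuples are copied by hand from this page; the function names, the dictionary layout, and the inclusive comparison (E-value <= threshold) are assumptions made for illustration, not part of CD-Search itself.

```python
# Hits copied by hand from the listing above: (name, accession, interval, E-value).
HITS = [
    ("PTZ00121", "PTZ00121", "2-307", "4.66e-04"),
    ("WD40", "cd00200", "915-987", "1.13e-03"),
    ("U2AF_lg", "TIGR01642", "105-262", "1.20e-03"),
    ("PRK12678", "PRK12678", "2-238", "8.21e-03"),
]

# "E-value threshold" from the preset options on this page.
E_VALUE_THRESHOLD = 0.01


def parse_hit(name, accession, interval, evalue):
    """Turn one hand-copied hit into a dict with numeric fields (hypothetical layout)."""
    start, end = (int(part) for part in interval.split("-"))
    return {
        "name": name,
        "accession": accession,
        "start": start,
        "end": end,
        "length": end - start + 1,  # query residues spanned, both ends included
        "evalue": float(evalue),
    }


def passing_hits(hits, threshold=E_VALUE_THRESHOLD):
    """Return hits at or below the threshold, most significant first.

    Treating the cutoff as inclusive is an assumption of this sketch.
    """
    parsed = [parse_hit(*hit) for hit in hits]
    kept = [hit for hit in parsed if hit["evalue"] <= threshold]
    return sorted(kept, key=lambda hit: hit["evalue"])


if __name__ == "__main__":
    for hit in passing_hits(HITS):
        print(hit["name"], hit["accession"], hit["start"], hit["end"], hit["evalue"])
```

All four hits pass the 0.01 cutoff, which is consistent with their appearing in the list; a stricter cutoff such as 1e-3 would keep only the PTZ00121 hit at 2-307.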

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.