NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|767902634|ref|XP_011539099|]
View 

cilia- and flagella-associated protein 57 isoform X3 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
382-646 2.39e-34

WD40 repeat [General function prediction only];


:

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 136.96  E-value: 2.39e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  382 LIATCSLDRSIRLWNYETNT-LELFKEYQEEAYSISLHPSGHFIVVGFADK-LRLMNLLIDD-IRSFK--EYSVRGcgeC 456
Cdd:COG2319   134 TLASGSADGTVRLWDLATGKlLRTLTGHSGAVTSVAFSPDGKLLASGSDDGtVRLWDLATGKlLRTLTghTGAVRS---V 210
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  457 SFS-NGGHLFAAVNGNVIHVYTTTSLENISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWNLSTGKRETECVLKSCS 535
Cdd:COG2319   211 AFSpDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGG 290
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  536 YNCVTVSPDAKIIFAVGSDHTLK--EIADSLILREISAFDVTYTAIVISHSGRMMFVGTSVGTIRAmkYPLPLQKEFNEY 613
Cdd:COG2319   291 VNSVAFSPDGKLLASGSDDGTVRlwDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRL--WDLATGELLRTL 368
                         250       260       270
                  ....*....|....*....|....*....|...
gi 767902634  614 QAHAGPITKMLLTFDDQFLLTAAEDGCLFTWKV 646
Cdd:COG2319   369 TGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
Mplasa_alph_rch super family cl37461
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
682-1200 2.23e-21

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


The actual alignment was detected with superfamily member TIGR04523:

Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 100.87  E-value: 2.23e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   682 ELKTRVEELKMENEYQLRLKD-MNYSEKIKELTDKFI-----------QEMESLKTKNQVLRTEKEKQDVYHHEHIEDLL 749
Cdd:TIGR04523  104 DLSKINSEIKNDKEQKNKLEVeLNKLEKQKKENKKNIdkflteikkkeKELEKLNNKYNDLKKQKEELENELNLLEKEKL 183
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   750 DKQSrelqDMECCNNQKLLLEY---------EKYQELQLKSQRMQEEyEKQLRDNDETKSQALEELTEFYEAKLQEKTTL 820
Cdd:TIGR04523  184 NIQK----NIDKIKNKLLKLELllsnlkkkiQKNKSLESQISELKKQ-NNQLKDNIEKKQQEINEKTTEISNTQTQLNQL 258
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   821 LEEAQEDVRQ---QLREFEETKKQIEEDEDReIQDIKTKYEkKLRDEKESNL--RLKGETGIMRKKFSSLQKE------- 888
Cdd:TIGR04523  259 KDEQNKIKKQlseKQKELEQNNKKIKELEKQ-LNQLKSEIS-DLNNQKEQDWnkELKSELKNQEKKLEEIQNQisqnnki 336
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   889 IEERTNDIETLKGEQM-------KLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKE 961
Cdd:TIGR04523  337 ISQLNEQISQLKKELTnsesensEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKK 416
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   962 LKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITEL-------WQKLRATDQEMRRERQkerDLEALVKRFKT 1034
Cdd:TIGR04523  417 LQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLdntreslETQLKVLSRSINKIKQ---NLEQKQKELKS 493
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1035 DLHNCVAYIQEPRLLKEKVRGLFEKyvqradmveIAGLNTDlQQEYTRQREHLERNLATLKKKVVK-EGELHRTDYVRIM 1113
Cdd:TIGR04523  494 KEKELKKLNEEKKELEEKVKDLTKK---------ISSLKEK-IEKLESEKKEKESKISDLEDELNKdDFELKKENLEKEI 563
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1114 QENVSLIKEINELRRELKFTRSQVYDLEAALKLTKKVRPQEVSETEpsrdMLSTAPTARLNEQEETGRIIEMQRLEIQRL 1193
Cdd:TIGR04523  564 DEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKE----KKISSLEKELEKAKKENEKLSSIIKNIKSK 639

                   ....*..
gi 767902634  1194 RDQIQEQ 1200
Cdd:TIGR04523  640 KNKLKQE 646
WD40 super family cl43672
WD40 repeat [General function prediction only];
41-399 1.05e-05

WD40 repeat [General function prediction only];


The actual alignment was detected with superfamily member COG2319:

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 49.52  E-value: 1.05e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   41 ALSISPNRRYLAiseTVQEKPAITIYELSSIPCRKrkVLNNFDFQVQkfiSMAFSPDSKYLLAqTSppESNLVYwLWEKQ 120
Cdd:COG2319   125 SVAFSPDGKTLA---SGSADGTVRLWDLATGKLLR--TLTGHSGAVT---SVAFSPDGKLLAS-GS--DDGTVR-LWDLA 192
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  121 KVMAIVRIDTQNNPVYQVSFSPqDNTQVcVTG--NGMFKLLRFAEGTLKQTSfqRGEPQNYLAHTWVADDK-IVVGTDTG 197
Cdd:COG2319   193 TGKLLRTLTGHTGAVRSVAFSP-DGKLL-ASGsaDGTVRLWDLATGKLLRTL--TGHSGSVRSVAFSPDGRlLASGSADG 268
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  198 KLFLfesgdqrWETsimvkeptNGSKSLDVIQESESliefpPVSSplpsyeqmVAASSHSQMsmpqvfaiaayskgFACS 277
Cdd:COG2319   269 TVRL-------WDL--------ATGELLRTLTGHSG-----GVNS--------VAFSPDGKL--------------LASG 306
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  278 AGPGRVLLFEkmeekdfYRESREIRIPVDPQSndpsqsdkqDVLCLCFSPSEETLVASTSKNqlysiTMSLTEISKGEPA 357
Cdd:COG2319   307 SDDGTVRLWD-------LATGKLLRTLTGHTG---------AVRSVAFSPDGKTLASGSDDG-----TVRLWDLATGELL 365
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|..
gi 767902634  358 HFeylmYPLHSAPITGLATCIRKPLIATCSLDRSIRLWNYET 399
Cdd:COG2319   366 RT----LTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
382-646 2.39e-34

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 136.96  E-value: 2.39e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  382 LIATCSLDRSIRLWNYETNT-LELFKEYQEEAYSISLHPSGHFIVVGFADK-LRLMNLLIDD-IRSFK--EYSVRGcgeC 456
Cdd:COG2319   134 TLASGSADGTVRLWDLATGKlLRTLTGHSGAVTSVAFSPDGKLLASGSDDGtVRLWDLATGKlLRTLTghTGAVRS---V 210
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  457 SFS-NGGHLFAAVNGNVIHVYTTTSLENISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWNLSTGKRETECVLKSCS 535
Cdd:COG2319   211 AFSpDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGG 290
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  536 YNCVTVSPDAKIIFAVGSDHTLK--EIADSLILREISAFDVTYTAIVISHSGRMMFVGTSVGTIRAmkYPLPLQKEFNEY 613
Cdd:COG2319   291 VNSVAFSPDGKLLASGSDDGTVRlwDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRL--WDLATGELLRTL 368
                         250       260       270
                  ....*....|....*....|....*....|...
gi 767902634  614 QAHAGPITKMLLTFDDQFLLTAAEDGCLFTWKV 646
Cdd:COG2319   369 TGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
367-645 2.41e-31

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 125.14  E-value: 2.41e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  367 HSAPITGLATCIRKPLIATCSLDRSIRLWNYETNTLEL-FKEYQEEAYSISLHPSGHFIVVGFADK-LRLMNL----LID 440
Cdd:cd00200     8 HTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRtLKGHTGPVRDVAASADGTYLASGSSDKtIRLWDLetgeCVR 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  441 DIRSFKEYsVRGcgeCSFSNGGHLFAAVNG-NVIHVYTTTSLENISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWN 519
Cdd:cd00200    88 TLTGHTSY-VSS---VAFSPDGRILSSSSRdKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWD 163
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  520 LSTGKRETECVLKSCSYNCVTVSPDAKIIFAVGSDHTLK--EIADSLILREISAFDVTYTAIVISHSGRMMFVGTSVGTI 597
Cdd:cd00200   164 LRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKlwDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTI 243
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*...
gi 767902634  598 RAmkYPLPLQKEFNEYQAHAGPITKMLLTFDDQFLLTAAEDGCLFTWK 645
Cdd:cd00200   244 RV--WDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
682-1200 2.23e-21

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 100.87  E-value: 2.23e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   682 ELKTRVEELKMENEYQLRLKD-MNYSEKIKELTDKFI-----------QEMESLKTKNQVLRTEKEKQDVYHHEHIEDLL 749
Cdd:TIGR04523  104 DLSKINSEIKNDKEQKNKLEVeLNKLEKQKKENKKNIdkflteikkkeKELEKLNNKYNDLKKQKEELENELNLLEKEKL 183
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   750 DKQSrelqDMECCNNQKLLLEY---------EKYQELQLKSQRMQEEyEKQLRDNDETKSQALEELTEFYEAKLQEKTTL 820
Cdd:TIGR04523  184 NIQK----NIDKIKNKLLKLELllsnlkkkiQKNKSLESQISELKKQ-NNQLKDNIEKKQQEINEKTTEISNTQTQLNQL 258
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   821 LEEAQEDVRQ---QLREFEETKKQIEEDEDReIQDIKTKYEkKLRDEKESNL--RLKGETGIMRKKFSSLQKE------- 888
Cdd:TIGR04523  259 KDEQNKIKKQlseKQKELEQNNKKIKELEKQ-LNQLKSEIS-DLNNQKEQDWnkELKSELKNQEKKLEEIQNQisqnnki 336
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   889 IEERTNDIETLKGEQM-------KLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKE 961
Cdd:TIGR04523  337 ISQLNEQISQLKKELTnsesensEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKK 416
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   962 LKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITEL-------WQKLRATDQEMRRERQkerDLEALVKRFKT 1034
Cdd:TIGR04523  417 LQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLdntreslETQLKVLSRSINKIKQ---NLEQKQKELKS 493
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1035 DLHNCVAYIQEPRLLKEKVRGLFEKyvqradmveIAGLNTDlQQEYTRQREHLERNLATLKKKVVK-EGELHRTDYVRIM 1113
Cdd:TIGR04523  494 KEKELKKLNEEKKELEEKVKDLTKK---------ISSLKEK-IEKLESEKKEKESKISDLEDELNKdDFELKKENLEKEI 563
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1114 QENVSLIKEINELRRELKFTRSQVYDLEAALKLTKKVRPQEVSETEpsrdMLSTAPTARLNEQEETGRIIEMQRLEIQRL 1193
Cdd:TIGR04523  564 DEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKE----KKISSLEKELEKAKKENEKLSSIIKNIKSK 639

                   ....*..
gi 767902634  1194 RDQIQEQ 1200
Cdd:TIGR04523  640 KNKLKQE 646
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
691-1201 1.73e-14

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 78.57  E-value: 1.73e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  691 KMENEYQLRLKDMNYSEKIKELTDKFIQEMESLKtkNQVLRTEKEKQDVYHH-EHIEDLLDKQSRELQDMEccnnqKLLL 769
Cdd:PRK03918  159 DYENAYKNLGEVIKEIKRRIERLEKFIKRTENIE--ELIKEKEKELEEVLREiNEISSELPELREELEKLE-----KEVK 231
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  770 EYEKYQELQLKSQRMQEEYEKQLRdndetksqaleeltefyeaKLQEKTTLLEEAQEDVRQQLREFEETKKQIEE----- 844
Cdd:PRK03918  232 ELEELKEEIEELEKELESLEGSKR-------------------KLEEKIRELEERIEELKKEIEELEEKVKELKElkeka 292
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  845 DEDREIQDIKTKYEKKLRDEKESNLRLKGE-TGIMR--KKFSSLQKEIEERTNDIETLKGEQMKLQG------VIKSLEK 915
Cdd:PRK03918  293 EEYIKLSEFYEEYLDELREIEKRLSRLEEEiNGIEEriKELEEKEERLEELKKKLKELEKRLEELEErhelyeEAKAKKE 372
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  916 DIQGLK-----REIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQI-------------------EPREN 971
Cdd:PRK03918  373 ELERLKkrltgLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIeelkkakgkcpvcgrelteEHRKE 452
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  972 EIRVMKEQIQEMEAELENFHKQNTQLELNITELwQKLRATDQEMRRERQKERDLEALVKRFKT-DLHNCVAYIQEPRLLK 1050
Cdd:PRK03918  453 LLEEYTAELKRIEKELKEIEEKERKLRKELREL-EKVLKKESELIKLKELAEQLKELEEKLKKyNLEELEKKAEEYEKLK 531
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1051 EKVRGLFEkyvqradmvEIAGLNTDLQ--QEYTRQREHLERNLATLKKKVvkeGELHRtdyvRIMQENVSLIKEINELRR 1128
Cdd:PRK03918  532 EKLIKLKG---------EIKSLKKELEklEELKKKLAELEKKLDELEEEL---AELLK----ELEELGFESVEELEERLK 595
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767902634 1129 ELKFTRSQVYDLEAAlkltkkvrPQEVSETEPSRDMLSTAPTARLNEQEETGRIIEMQRLEIQRLRDQIQEQE 1201
Cdd:PRK03918  596 ELEPFYNEYLELKDA--------EKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEE 660
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
671-1201 1.59e-13

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 75.54  E-value: 1.59e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   671 TDMEEK-AQVMLELK--TRVEELKMEN-EYQLRLKDMNYSEKIKElTDKFIQEMESLKTKNQVLRTekekqDVYHHEHIE 746
Cdd:pfam15921  320 SDLESTvSQLRSELReaKRMYEDKIEElEKQLVLANSELTEARTE-RDQFSQESGNLDDQLQKLLA-----DLHKREKEL 393
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   747 DLLDKQSRELQDMECCNN-------QKLLLEYEKYQELQLKSQRMQEEYEKQLrdndETKSQALEELTEFYEaKLQEKTT 819
Cdd:pfam15921  394 SLEKEQNKRLWDRDTGNSitidhlrRELDDRNMEVQRLEALLKAMKSECQGQM----ERQMAAIQGKNESLE-KVSSLTA 468
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   820 LLEEAQEDVRQQLREFEETKKQIEEDEdREIQDIKTKYEKKLRDEKESNlrlkgetgimrKKFSSLQKEIEERTNDIETL 899
Cdd:pfam15921  469 QLESTKEMLRKVVEELTAKKMTLESSE-RTVSDLTASLQEKERAIEATN-----------AEITKLRSRVDLKLQELQHL 536
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   900 KGEQMKLQGVikslEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQ 979
Cdd:pfam15921  537 KNEGDHLRNV----QTECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQLEKEINDRRLELQEFKIL 612
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   980 IQEMEAELENFHKQNTQLELNITELwqkLRATDQEMRRERQKERDLEALVKRFKTDLHNCVAYIQEPRLLKEKVRGLFEk 1059
Cdd:pfam15921  613 KDKKDAKIRELEARVSDLELEKVKL---VNAGSERLRAVKDIKQERDQLLNEVKTSRNELNSLSEDYEVLKRNFRNKSE- 688
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1060 yvqradmvEIAGLNTDLQQEYTRQREHLERNLATLKKKVVKEGElhrtdyvrIMQENVSLIKEINELRRELKFTRSQVYD 1139
Cdd:pfam15921  689 --------EMETTTNKLKMQLKSAQSELEQTRNTLKSMEGSDGH--------AMKVAMGMQKQITAKRGQIDALQSKIQF 752
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767902634  1140 LEAALKLTKKVRPQEVSETEPSRDMLSTAPTarlnEQEETGRIIEMQRLEIQRLRDQIQEQE 1201
Cdd:pfam15921  753 LEEAMTNANKEKHFLKEEKNKLSQELSTVAT----EKNKMAGELEVLRSQERRLKEKVANME 810
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
767-1027 2.67e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 74.97  E-value: 2.67e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  767 LLLEYEKYQElQLKSQRMQEEYEKQLRDNDETKSQALEELTEFYEAKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDE 846
Cdd:COG1196   230 LLLKLRELEA-ELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLE 308
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  847 DReiqdiktkyekkLRDEKESNLRLKGETgimrkkfSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQE 926
Cdd:COG1196   309 ER------------RRELEERLEELEEEL-------AELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLE 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  927 RDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQ 1006
Cdd:COG1196   370 AEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAE 449
                         250       260
                  ....*....|....*....|.
gi 767902634 1007 KLRATDQEMRRERQKERDLEA 1027
Cdd:COG1196   450 EEAELEEEEEALLELLAELLE 470
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
795-992 1.56e-07

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 54.68  E-value: 1.56e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  795 NDETKSQALEELTEFYEaKLQEKTTLLEEAQEDVRQQLREFEetkKQIEEDEDrEIQDIKTKYEKKLRDEKESnlrlkge 874
Cdd:cd22656   108 DDEELEEAKKTIKALLD-DLLKEAKKYQDKAAKVVDKLTDFE---NQTEKDQT-ALETLEKALKDLLTDEGGA------- 175
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  875 tgIMRKKFSSLQKEIEERTNDIetlkgeqmklqgvIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKknqelgkfkfv 954
Cdd:cd22656   176 --IARKEIKDLQKELEKLNEEY-------------AAKLKAKIDELKALIADDEAKLAAALRLIADLTA----------- 229
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 767902634  955 LDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHK 992
Cdd:cd22656   230 ADTDLDNLLALIGPAIPALEKLQGAWQAIATDLDSLKD 267
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
480-519 1.56e-06

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 45.77  E-value: 1.56e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 767902634    480 SLENISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWN 519
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 COG2319
WD40 repeat [General function prediction only];
41-399 1.05e-05

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 49.52  E-value: 1.05e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   41 ALSISPNRRYLAiseTVQEKPAITIYELSSIPCRKrkVLNNFDFQVQkfiSMAFSPDSKYLLAqTSppESNLVYwLWEKQ 120
Cdd:COG2319   125 SVAFSPDGKTLA---SGSADGTVRLWDLATGKLLR--TLTGHSGAVT---SVAFSPDGKLLAS-GS--DDGTVR-LWDLA 192
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  121 KVMAIVRIDTQNNPVYQVSFSPqDNTQVcVTG--NGMFKLLRFAEGTLKQTSfqRGEPQNYLAHTWVADDK-IVVGTDTG 197
Cdd:COG2319   193 TGKLLRTLTGHTGAVRSVAFSP-DGKLL-ASGsaDGTVRLWDLATGKLLRTL--TGHSGSVRSVAFSPDGRlLASGSADG 268
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  198 KLFLfesgdqrWETsimvkeptNGSKSLDVIQESESliefpPVSSplpsyeqmVAASSHSQMsmpqvfaiaayskgFACS 277
Cdd:COG2319   269 TVRL-------WDL--------ATGELLRTLTGHSG-----GVNS--------VAFSPDGKL--------------LASG 306
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  278 AGPGRVLLFEkmeekdfYRESREIRIPVDPQSndpsqsdkqDVLCLCFSPSEETLVASTSKNqlysiTMSLTEISKGEPA 357
Cdd:COG2319   307 SDDGTVRLWD-------LATGKLLRTLTGHTG---------AVRSVAFSPDGKTLASGSDDG-----TVRLWDLATGELL 365
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|..
gi 767902634  358 HFeylmYPLHSAPITGLATCIRKPLIATCSLDRSIRLWNYET 399
Cdd:COG2319   366 RT----LTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 pfam00400
WD domain, G-beta repeat;
482-519 2.30e-05

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 42.33  E-value: 2.30e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 767902634   482 ENISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWN 519
Cdd:pfam00400    2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 pfam00400
WD domain, G-beta repeat;
367-396 4.14e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 38.87  E-value: 4.14e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 767902634   367 HSAPITGLATCIRKPLIATCSLDRSIRLWN 396
Cdd:pfam00400   10 HTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
364-396 5.23e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 38.83  E-value: 5.23e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 767902634    364 YPLHSAPITGLATCIRKPLIATCSLDRSIRLWN 396
Cdd:smart00320    8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
796-1202 9.63e-04

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 43.46  E-value: 9.63e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  796 DETKSQALEELTEFYEAKLQEKTTLLEEaqedvRQQLREFEETKKqieededreIQDIKTKY-------EKKLRDEKESN 868
Cdd:NF033838   53 NESQKEHAKEVESHLEKILSEIQKSLDK-----RKHTQNVALNKK---------LSDIKTEYlyelnvlKEKSEAELTSK 118
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  869 LR---------LKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVI--KSLEKDIQGLKREIQERD-ETIQDKEK 936
Cdd:NF033838  119 TKkeldaafeqFKKDTLEPGKKVAEATKKVEEAEKKAKDQKEEDRRNYPTNtyKTLELEIAESDVEVKKAElELVKEEAK 198
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  937 RIYDLKKKNQELGKFkfvldykikELKKQIEPRENEIRVMKEQIQE-----MEAELENFHKQNTQLELNITELWQKLRAT 1011
Cdd:NF033838  199 EPRDEEKIKQAKAKV---------ESKKAEATRLEKIKTDREKAEEeakrrADAKLKEAVEKNVATSEQDKPKRRAKRGV 269
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1012 DQEMRRERQKERDLEAlvkrfkTDLHNCVAYIQEPRLLKEKVRGLFEKYVQRAdmveiaglntdlQQEYTRQREHLERNL 1091
Cdd:NF033838  270 LGEPATPDKKENDAKS------SDSSVGEETLPSPSLKPEKKVAEAEKKVEEA------------KKKAKDQKEEDRRNY 331
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1092 ATLKKKVVkEGELHRTDyVRIMQENVSLIKEI-NELRRELKFTRSQvydleaalkltKKVrpqEVSETEPSRdmLSTAPT 1170
Cdd:NF033838  332 PTNTYKTL-ELEIAESD-VKVKEAELELVKEEaKEPRNEEKIKQAK-----------AKV---ESKKAEATR--LEKIKT 393
                         410       420       430
                  ....*....|....*....|....*....|..
gi 767902634 1171 ARLNEQEETGRIIEmqrlEIQRLRDQIQEQEQ 1202
Cdd:NF033838  394 DRKKAEEEAKRKAA----EEDKVKEKPAEQPQ 421
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
779-953 2.56e-03

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 41.54  E-value: 2.56e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634    779 LKSQRMQEEYEKQLRDNDEtksqalEELTEFYEAKLQEKTTLLEEAQE--DVRQQLRE-FEETKKQIEEdedreiqdikt 855
Cdd:smart00787  128 LEAKKMWYEWRMKLLEGLK------EGLDENLEGLKEDYKLLMKELELlnSIKPKLRDrKDALEEELRQ----------- 190
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634    856 kyEKKLRDEKESNLrlKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIqdKE 935
Cdd:smart00787  191 --LKQLEDELEDCD--PTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKL--EQ 264
                           170
                    ....*....|....*...
gi 767902634    936 KRIYDLKKKNQELGKFKF 953
Cdd:smart00787  265 CRGFTFKEIEKLKEQLKL 282
WDR74 cd22857
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and ...
322-399 4.86e-03

WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage of the pre-rRNA processing pathway. NVL2 is a type II double ring, AAA-ATPase, that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance and cell aggressiveness, by inducing nuclear beta-catenin accumulation and driving downstream Wnt-responsive genes expression. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain, and is required for nucleolar localization.


Pssm-ID: 439303 [Multi-domain]  Cd Length: 325  Bit Score: 40.67  E-value: 4.86e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  322 CLCFSPSEETLVASTSKNQLYSI---TMSLTEISKGepahfeylmypLHSAPITGLATCIRKPLIATCSLDRSIRLWNYE 398
Cdd:cd22857   228 AVAEDPDGHTVYVGDTSGDLASIdlrTGKLLGCFKG-----------KCGGSIRSIARHPELPLIASCGLDRYLRIWDTE 296

                  .
gi 767902634  399 T 399
Cdd:cd22857   297 T 297
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
382-646 2.39e-34

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 136.96  E-value: 2.39e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  382 LIATCSLDRSIRLWNYETNT-LELFKEYQEEAYSISLHPSGHFIVVGFADK-LRLMNLLIDD-IRSFK--EYSVRGcgeC 456
Cdd:COG2319   134 TLASGSADGTVRLWDLATGKlLRTLTGHSGAVTSVAFSPDGKLLASGSDDGtVRLWDLATGKlLRTLTghTGAVRS---V 210
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  457 SFS-NGGHLFAAVNGNVIHVYTTTSLENISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWNLSTGKRETECVLKSCS 535
Cdd:COG2319   211 AFSpDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGG 290
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  536 YNCVTVSPDAKIIFAVGSDHTLK--EIADSLILREISAFDVTYTAIVISHSGRMMFVGTSVGTIRAmkYPLPLQKEFNEY 613
Cdd:COG2319   291 VNSVAFSPDGKLLASGSDDGTVRlwDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRL--WDLATGELLRTL 368
                         250       260       270
                  ....*....|....*....|....*....|...
gi 767902634  614 QAHAGPITKMLLTFDDQFLLTAAEDGCLFTWKV 646
Cdd:COG2319   369 TGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
WD40 COG2319
WD40 repeat [General function prediction only];
365-657 1.80e-31

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 128.49  E-value: 1.80e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  365 PLHSAPITGLATCIRKPLIATCSLDRSIRLWNYETN-TLELFKEYQEEAYSISLHPSGHFIVVGFADK-LRLMNL----L 438
Cdd:COG2319    75 LGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGlLLRTLTGHTGAVRSVAFSPDGKTLASGSADGtVRLWDLatgkL 154
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  439 IDDIRSFKE--YSVrgcgecSFSNGGHLFAAVNG-NVIHVYTTTSLENISSLKGHTGKIRSIVWNADDSKLISGGTDGAV 515
Cdd:COG2319   155 LRTLTGHSGavTSV------AFSPDGKLLASGSDdGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTV 228
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  516 YEWNLSTGKRETECVLKSCSYNCVTVSPDAKIIFAVGSDHTLK--EIADSLILREISAFDVTYTAIVISHSGRMMFVGTS 593
Cdd:COG2319   229 RLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRlwDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSD 308
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 767902634  594 VGTIRAmkYPLPLQKEFNEYQAHAGPITKMLLTFDDQFLLTAAEDGclfTWKVFDKDGRGIKRE 657
Cdd:COG2319   309 DGTVRL--WDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDG---TVRLWDLATGELLRT 367
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
367-645 2.41e-31

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 125.14  E-value: 2.41e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  367 HSAPITGLATCIRKPLIATCSLDRSIRLWNYETNTLEL-FKEYQEEAYSISLHPSGHFIVVGFADK-LRLMNL----LID 440
Cdd:cd00200     8 HTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRtLKGHTGPVRDVAASADGTYLASGSSDKtIRLWDLetgeCVR 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  441 DIRSFKEYsVRGcgeCSFSNGGHLFAAVNG-NVIHVYTTTSLENISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWN 519
Cdd:cd00200    88 TLTGHTSY-VSS---VAFSPDGRILSSSSRdKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWD 163
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  520 LSTGKRETECVLKSCSYNCVTVSPDAKIIFAVGSDHTLK--EIADSLILREISAFDVTYTAIVISHSGRMMFVGTSVGTI 597
Cdd:cd00200   164 LRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKlwDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTI 243
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*...
gi 767902634  598 RAmkYPLPLQKEFNEYQAHAGPITKMLLTFDDQFLLTAAEDGCLFTWK 645
Cdd:cd00200   244 RV--WDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
320-558 6.08e-26

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 111.93  E-value: 6.08e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  320 VLCLCFSPSEETLvASTSKNQlysiTMSLTEISKGEPAHFeylmYPLHSAPITGLATCIRKPLIATCSLDRSIRLWNYET 399
Cdd:COG2319   165 VTSVAFSPDGKLL-ASGSDDG----TVRLWDLATGKLLRT----LTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLAT 235
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  400 NT-LELFKEYQEEAYSISLHPSGHFIVVGFADK-LRLMNLLIDDIRSFKEYSVRGCGECSFS-NGGHLFAAVNGNVIHVY 476
Cdd:COG2319   236 GKlLRTLTGHSGSVRSVAFSPDGRLLASGSADGtVRLWDLATGELLRTLTGHSGGVNSVAFSpDGKLLASGSDDGTVRLW 315
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  477 TTTSLENISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWNLSTGKRETECVLKSCSYNCVTVSPDAKIIFAVGSDHT 556
Cdd:COG2319   316 DLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGT 395

                  ..
gi 767902634  557 LK 558
Cdd:COG2319   396 VR 397
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
317-519 1.29e-21

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 96.64  E-value: 1.29e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  317 KQDVLCLCFSPSEeTLVASTSKNQlysiTMSLTEISKGEPAHFeylmYPLHSAPITGLATCIRKPLIATCSLDRSIRLWN 396
Cdd:cd00200    93 TSYVSSVAFSPDG-RILSSSSRDK----TIKVWDVETGKCLTT----LRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWD 163
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  397 YETNTL-ELFKEYQEEAYSISLHPSGHFIVVGFADKlrlmNLLIDDIRSFKE-YSVRGCGE----CSFSNGGHLFAAVNG 470
Cdd:cd00200   164 LRTGKCvATLTGHTGEVNSVAFSPDGEKLLSSSSDG----TIKLWDLSTGKClGTLRGHENgvnsVAFSPDGYLLASGSE 239
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 767902634  471 -NVIHVYTTTSLENISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWN 519
Cdd:cd00200   240 dGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
682-1200 2.23e-21

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 100.87  E-value: 2.23e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   682 ELKTRVEELKMENEYQLRLKD-MNYSEKIKELTDKFI-----------QEMESLKTKNQVLRTEKEKQDVYHHEHIEDLL 749
Cdd:TIGR04523  104 DLSKINSEIKNDKEQKNKLEVeLNKLEKQKKENKKNIdkflteikkkeKELEKLNNKYNDLKKQKEELENELNLLEKEKL 183
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   750 DKQSrelqDMECCNNQKLLLEY---------EKYQELQLKSQRMQEEyEKQLRDNDETKSQALEELTEFYEAKLQEKTTL 820
Cdd:TIGR04523  184 NIQK----NIDKIKNKLLKLELllsnlkkkiQKNKSLESQISELKKQ-NNQLKDNIEKKQQEINEKTTEISNTQTQLNQL 258
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   821 LEEAQEDVRQ---QLREFEETKKQIEEDEDReIQDIKTKYEkKLRDEKESNL--RLKGETGIMRKKFSSLQKE------- 888
Cdd:TIGR04523  259 KDEQNKIKKQlseKQKELEQNNKKIKELEKQ-LNQLKSEIS-DLNNQKEQDWnkELKSELKNQEKKLEEIQNQisqnnki 336
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   889 IEERTNDIETLKGEQM-------KLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKE 961
Cdd:TIGR04523  337 ISQLNEQISQLKKELTnsesensEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKK 416
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   962 LKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITEL-------WQKLRATDQEMRRERQkerDLEALVKRFKT 1034
Cdd:TIGR04523  417 LQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLdntreslETQLKVLSRSINKIKQ---NLEQKQKELKS 493
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1035 DLHNCVAYIQEPRLLKEKVRGLFEKyvqradmveIAGLNTDlQQEYTRQREHLERNLATLKKKVVK-EGELHRTDYVRIM 1113
Cdd:TIGR04523  494 KEKELKKLNEEKKELEEKVKDLTKK---------ISSLKEK-IEKLESEKKEKESKISDLEDELNKdDFELKKENLEKEI 563
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1114 QENVSLIKEINELRRELKFTRSQVYDLEAALKLTKKVRPQEVSETEpsrdMLSTAPTARLNEQEETGRIIEMQRLEIQRL 1193
Cdd:TIGR04523  564 DEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKE----KKISSLEKELEKAKKENEKLSSIIKNIKSK 639

                   ....*..
gi 767902634  1194 RDQIQEQ 1200
Cdd:TIGR04523  640 KNKLKQE 646
WD40 COG2319
WD40 repeat [General function prediction only];
320-522 2.38e-20

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 94.98  E-value: 2.38e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  320 VLCLCFSPSEETLvASTSKNQlysiTMSLTEISKGEPAHFeylmYPLHSAPITGLATCirkP---LIATCSLDRSIRLWN 396
Cdd:COG2319   207 VRSVAFSPDGKLL-ASGSADG----TVRLWDLATGKLLRT----LTGHSGSVRSVAFS---PdgrLLASGSADGTVRLWD 274
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  397 YETNT-LELFKEYQEEAYSISLHPSGHFIVVGFADK-LRLMNLLIDD-IRSFKEYSVRGCGeCSFS-NGGHLFAAVNGNV 472
Cdd:COG2319   275 LATGElLRTLTGHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLATGKlLRTLTGHTGAVRS-VAFSpDGKTLASGSDDGT 353
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 767902634  473 IHVYTTTSLENISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWNLST 522
Cdd:COG2319   354 VRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
799-1145 7.92e-17

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 86.26  E-value: 7.92e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   799 KSQALEELTEFYEaKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKYEKKLRDEKESNlRLKGETGIM 878
Cdd:TIGR02168  675 RRREIEELEEKIE-ELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVE-QLEERIAQL 752
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   879 RKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYK 958
Cdd:TIGR02168  753 SKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERR 832
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   959 IKELKKQIEPRENEIRVMKEQIQ--------------EMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQKERD 1024
Cdd:TIGR02168  833 IAATERRLEDLEEQIEELSEDIEslaaeieeleelieELESELEALLNERASLEEALALLRSELEELSEELRELESKRSE 912
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1025 LEALVKRFKTDLHNCVAYIQEPRL----LKEKVRglfEKYVQRADMVEIAGLNTDLQQEYTRQRehlernLATLKKKVVK 1100
Cdd:TIGR02168  913 LRRELEELREKLAQLELRLEGLEVridnLQERLS---EEYSLTLEEAEALENKIEDDEEEARRR------LKRLENKIKE 983
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 767902634  1101 EGELhrtdyvrimqeNVSLIKEINELRRELKFTRSQVYDLEAALK 1145
Cdd:TIGR02168  984 LGPV-----------NLAAIEEYEELKERYDFLTAQKEDLTEAKE 1017
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
483-644 3.36e-16

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 80.46  E-value: 3.36e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  483 NISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWNLSTGKRETECVLKSCSYNCVTVSPDAKIIFAVGSDHTLK--EI 560
Cdd:cd00200     1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRlwDL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  561 ADSLILREISAFDVTYTAIVISHSGRMMFVGTSVGTIRAmkYPLPLQKEFNEYQAHAGPITKMLLTFDDQFLLTAAEDGC 640
Cdd:cd00200    81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKV--WDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGT 158

                  ....
gi 767902634  641 LFTW 644
Cdd:cd00200   159 IKLW 162
WD40 COG2319
WD40 repeat [General function prediction only];
457-648 4.94e-16

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 81.88  E-value: 4.94e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  457 SFSNGGHLFAAVNGNVIHVYTTTSLENISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWNLSTGKRETECVLKSCSY 536
Cdd:COG2319    44 ASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAV 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  537 NCVTVSPDAKIIFAVGSDHTLK--EIADSLILREISAFDVTYTAIVISHSGRMMFVGTSVGTIRAmkYPLPLQKEFNEYQ 614
Cdd:COG2319   124 RSVAFSPDGKTLASGSADGTVRlwDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRL--WDLATGKLLRTLT 201
                         170       180       190
                  ....*....|....*....|....*....|....
gi 767902634  615 AHAGPITKMLLTFDDQFLLTAAEDGclfTWKVFD 648
Cdd:COG2319   202 GHTGAVRSVAFSPDGKLLASGSADG---TVRLWD 232
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
822-1181 7.03e-16

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 83.19  E-value: 7.03e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   822 EEAQEDVRQQLREFEETKKQIEEDEdrEIQDIKTKYEKKLRDEKESNLRLKGETGIMRK-KFSSLQKEIEERTNDIETLK 900
Cdd:TIGR02169  166 AEFDRKKEKALEELEEVEENIERLD--LIIDEKRQQLERLRREREKAERYQALLKEKREyEGYELLKEKEALERQKEAIE 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   901 GEqmklqgvIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGkfkfvlDYKIKELKKQIEPRENEIRVMKEQI 980
Cdd:TIGR02169  244 RQ-------LASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLG------EEEQLRVKEKIGELEAEIASLERSI 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   981 QEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQKERDLEALVKRFKTDLHNCVAYIQEprlLKEKVRGLFEKY 1060
Cdd:TIGR02169  311 AEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEE---VDKEFAETRDEL 387
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1061 VQRADMVEiaglntdlqqEYTRQREHLERNLATL---KKKVVKEGELHRTDYVRIMQ-------ENVSLIKEINELRREL 1130
Cdd:TIGR02169  388 KDYREKLE----------KLKREINELKRELDRLqeeLQRLSEELADLNAAIAGIEAkineleeEKEDKALEIKKQEWKL 457
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 767902634  1131 KFTRSQVYDLEAAL-KLTKKVRPQEVSETEPSRDMLSTAPTARLNEQEETGR 1181
Cdd:TIGR02169  458 EQLAADLSKYEQELyDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGG 509
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
772-1026 1.26e-15

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 82.42  E-value: 1.26e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   772 EKYQELQLKSQRMqEEYE--KQLRDNDETKSQALEELTEFyEAKLQEKTTLLEEAQEDVRQQLREFEETKKQIE---EDE 846
Cdd:TIGR02169  211 ERYQALLKEKREY-EGYEllKEKEALERQKEAIERQLASL-EEELEKLTEEISELEKRLEEIEQLLEELNKKIKdlgEEE 288
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   847 DREIQ----DIKTKYEKKLRDEKESNLRLKGETGIMRKKFS---SLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQG 919
Cdd:TIGR02169  289 QLRVKekigELEAEIASLERSIAEKERELEDAEERLAKLEAeidKLLAEIEELEREIEEERKRRDKLTEEYAELKEELED 368
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   920 LKREIQERDETIQ-------DKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHK 992
Cdd:TIGR02169  369 LRAELEEVDKEFAetrdelkDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKAL 448
                          250       260       270
                   ....*....|....*....|....*....|....
gi 767902634   993 QNTQLELNITELWQKLRATDQEMRRERQKERDLE 1026
Cdd:TIGR02169  449 EIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVE 482
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
675-998 3.55e-15

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 80.88  E-value: 3.55e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   675 EKAQVMLELKTRVEELkmenEYQLRLKDMNYSEKIKELTDKFIQEMESLKTKNQVLRTEKEKQdvyhHEHIEDLLDKQSR 754
Cdd:TIGR02169  208 EKAERYQALLKEKREY----EGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKR----LEEIEQLLEELNK 279
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   755 ELQDM---ECCNNQKLLLEYEKYQElqlKSQRMQEEYEKQLRDNDETKSQALEELTefyeaKLQEKTTLLEEAQEDVRQQ 831
Cdd:TIGR02169  280 KIKDLgeeEQLRVKEKIGELEAEIA---SLERSIAEKERELEDAEERLAKLEAEID-----KLLAEIEELEREIEEERKR 351
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   832 LREFEEtkkqieededrEIQDIKTKYEK---KLRDEKESNLRLKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQG 908
Cdd:TIGR02169  352 RDKLTE-----------EYAELKEELEDlraELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSE 420
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   909 VIKSLEKDIQGLKREIQERDETIQDKEKRIydlKKKNQELgkfkfvldykiKELKKQIEPRENEIRVMKEQIQEMEAELE 988
Cdd:TIGR02169  421 ELADLNAAIAGIEAKINELEEEKEDKALEI---KKQEWKL-----------EQLAADLSKYEQELYDLKEEYDRVEKELS 486
                          330
                   ....*....|
gi 767902634   989 NFHKQNTQLE 998
Cdd:TIGR02169  487 KLQRELAEAE 496
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
691-1201 1.73e-14

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 78.57  E-value: 1.73e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  691 KMENEYQLRLKDMNYSEKIKELTDKFIQEMESLKtkNQVLRTEKEKQDVYHH-EHIEDLLDKQSRELQDMEccnnqKLLL 769
Cdd:PRK03918  159 DYENAYKNLGEVIKEIKRRIERLEKFIKRTENIE--ELIKEKEKELEEVLREiNEISSELPELREELEKLE-----KEVK 231
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  770 EYEKYQELQLKSQRMQEEYEKQLRdndetksqaleeltefyeaKLQEKTTLLEEAQEDVRQQLREFEETKKQIEE----- 844
Cdd:PRK03918  232 ELEELKEEIEELEKELESLEGSKR-------------------KLEEKIRELEERIEELKKEIEELEEKVKELKElkeka 292
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  845 DEDREIQDIKTKYEKKLRDEKESNLRLKGE-TGIMR--KKFSSLQKEIEERTNDIETLKGEQMKLQG------VIKSLEK 915
Cdd:PRK03918  293 EEYIKLSEFYEEYLDELREIEKRLSRLEEEiNGIEEriKELEEKEERLEELKKKLKELEKRLEELEErhelyeEAKAKKE 372
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  916 DIQGLK-----REIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQI-------------------EPREN 971
Cdd:PRK03918  373 ELERLKkrltgLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIeelkkakgkcpvcgrelteEHRKE 452
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  972 EIRVMKEQIQEMEAELENFHKQNTQLELNITELwQKLRATDQEMRRERQKERDLEALVKRFKT-DLHNCVAYIQEPRLLK 1050
Cdd:PRK03918  453 LLEEYTAELKRIEKELKEIEEKERKLRKELREL-EKVLKKESELIKLKELAEQLKELEEKLKKyNLEELEKKAEEYEKLK 531
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1051 EKVRGLFEkyvqradmvEIAGLNTDLQ--QEYTRQREHLERNLATLKKKVvkeGELHRtdyvRIMQENVSLIKEINELRR 1128
Cdd:PRK03918  532 EKLIKLKG---------EIKSLKKELEklEELKKKLAELEKKLDELEEEL---AELLK----ELEELGFESVEELEERLK 595
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767902634 1129 ELKFTRSQVYDLEAAlkltkkvrPQEVSETEPSRDMLSTAPTARLNEQEETGRIIEMQRLEIQRLRDQIQEQE 1201
Cdd:PRK03918  596 ELEPFYNEYLELKDA--------EKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEE 660
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
808-1149 4.07e-14

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 77.40  E-value: 4.07e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   808 EFYEAKLQEKTTLLEEA------QEDVRQQLREFEETKKQIEEDED--REI---------QDIKTKYEKKLRDEKEsNLR 870
Cdd:TIGR02168  148 EIIEAKPEERRAIFEEAagiskyKERRKETERKLERTRENLDRLEDilNELerqlkslerQAEKAERYKELKAELR-ELE 226
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   871 LkgetGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGK 950
Cdd:TIGR02168  227 L----ALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQ 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   951 FKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQKERDLEALVK 1030
Cdd:TIGR02168  303 QKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLE 382
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1031 RFKTDLHncvayiqeprLLKEKVrglfekYVQRADMVEIAGLNTDLQQEYTRQREHLERNLATLKKKVVKEgelHRTDYV 1110
Cdd:TIGR02168  383 TLRSKVA----------QLELQI------ASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKE---LQAELE 443
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 767902634  1111 RIMQENVSLIKEINELRRELKFTRSQVYDLEAALKLTKK 1149
Cdd:TIGR02168  444 ELEEELEELQEELERLEEALEELREELEEAEQALDAAER 482
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
705-1028 1.48e-13

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 75.87  E-value: 1.48e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   705 YSEKIKELTDkfiqEMESLKTKNQVLRTEKEkqdvyhheHIEDLLDKQSRELQDMEccnnqKLLLEYEKYQELQLKSQRM 784
Cdd:TIGR02169  672 EPAELQRLRE----RLEGLKRELSSLQSELR--------RIENRLDELSQELSDAS-----RKIGEIEKEIEQLEQEEEK 734
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   785 QEEYEKQLRDNDETKSQALEEltefYEAKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKYEKKLRDE 864
Cdd:TIGR02169  735 LKERLEELEEDLSSLEQEIEN----VKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRI 810
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   865 KESNLRLKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKK 944
Cdd:TIGR02169  811 EARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKE 890
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   945 NQElgkfkfvLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITEL----WQKLRATDQEMRRERq 1020
Cdd:TIGR02169  891 RDE-------LEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDeeipEEELSLEDVQAELQR- 962

                   ....*...
gi 767902634  1021 KERDLEAL 1028
Cdd:TIGR02169  963 VEEEIRAL 970
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
671-1201 1.59e-13

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 75.54  E-value: 1.59e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   671 TDMEEK-AQVMLELK--TRVEELKMEN-EYQLRLKDMNYSEKIKElTDKFIQEMESLKTKNQVLRTekekqDVYHHEHIE 746
Cdd:pfam15921  320 SDLESTvSQLRSELReaKRMYEDKIEElEKQLVLANSELTEARTE-RDQFSQESGNLDDQLQKLLA-----DLHKREKEL 393
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   747 DLLDKQSRELQDMECCNN-------QKLLLEYEKYQELQLKSQRMQEEYEKQLrdndETKSQALEELTEFYEaKLQEKTT 819
Cdd:pfam15921  394 SLEKEQNKRLWDRDTGNSitidhlrRELDDRNMEVQRLEALLKAMKSECQGQM----ERQMAAIQGKNESLE-KVSSLTA 468
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   820 LLEEAQEDVRQQLREFEETKKQIEEDEdREIQDIKTKYEKKLRDEKESNlrlkgetgimrKKFSSLQKEIEERTNDIETL 899
Cdd:pfam15921  469 QLESTKEMLRKVVEELTAKKMTLESSE-RTVSDLTASLQEKERAIEATN-----------AEITKLRSRVDLKLQELQHL 536
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   900 KGEQMKLQGVikslEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQ 979
Cdd:pfam15921  537 KNEGDHLRNV----QTECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQLEKEINDRRLELQEFKIL 612
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   980 IQEMEAELENFHKQNTQLELNITELwqkLRATDQEMRRERQKERDLEALVKRFKTDLHNCVAYIQEPRLLKEKVRGLFEk 1059
Cdd:pfam15921  613 KDKKDAKIRELEARVSDLELEKVKL---VNAGSERLRAVKDIKQERDQLLNEVKTSRNELNSLSEDYEVLKRNFRNKSE- 688
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1060 yvqradmvEIAGLNTDLQQEYTRQREHLERNLATLKKKVVKEGElhrtdyvrIMQENVSLIKEINELRRELKFTRSQVYD 1139
Cdd:pfam15921  689 --------EMETTTNKLKMQLKSAQSELEQTRNTLKSMEGSDGH--------AMKVAMGMQKQITAKRGQIDALQSKIQF 752
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767902634  1140 LEAALKLTKKVRPQEVSETEPSRDMLSTAPTarlnEQEETGRIIEMQRLEIQRLRDQIQEQE 1201
Cdd:pfam15921  753 LEEAMTNANKEKHFLKEEKNKLSQELSTVAT----EKNKMAGELEVLRSQERRLKEKVANME 810
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
664-1034 2.01e-13

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 75.10  E-value: 2.01e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  664 EEVLVTKTDMEEKAQVMLELKTRVEELKmENEYQLRLKDMNYSEKIKELTDKfIQEMESLKTKNQVLRTEKE--KQDVYH 741
Cdd:PRK03918  279 EEKVKELKELKEKAEEYIKLSEFYEEYL-DELREIEKRLSRLEEEINGIEER-IKELEEKEERLEELKKKLKelEKRLEE 356
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  742 HEHIEDLLDKQSRELQDMECCNNQKLLLEYEKYQELQLKSQRMQEEYEKQLRDNDETKSQ----------ALEEL----- 806
Cdd:PRK03918  357 LEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGElkkeikelkkAIEELkkakg 436
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  807 ---------TEFYEAKLQEKTTLleeaqedvrqQLREFEETKKQIEEDEdREIQDIKTKYEKKLRDEKE--SNLRLKGET 875
Cdd:PRK03918  437 kcpvcgrelTEEHRKELLEEYTA----------ELKRIEKELKEIEEKE-RKLRKELRELEKVLKKESEliKLKELAEQL 505
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  876 GIMRKKFSSLQKE-IEERTNDIETLKGEQMKLQGVIKSLEKDI---QGLKREIQERDETIQDKEKRIYDLKKKNQELGkF 951
Cdd:PRK03918  506 KELEEKLKKYNLEeLEKKAEEYEKLKEKLIKLKGEIKSLKKELeklEELKKKLAELEKKLDELEEELAELLKELEELG-F 584
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  952 KFV--LDYKIKEL-------------KKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMR 1016
Cdd:PRK03918  585 ESVeeLEERLKELepfyneylelkdaEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEEL 664
                         410       420
                  ....*....|....*....|
gi 767902634 1017 RER--QKERDLEALVKRFKT 1034
Cdd:PRK03918  665 REEylELSRELAGLRAELEE 684
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
669-1038 2.62e-13

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 74.67  E-value: 2.62e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   669 TKTDMEEKAQVMLELKTRVEELKME-NEYQLRLKDMNySEKIKELTDKFIQEMESLKTKNQVLRTE---KEKQDVYHHEH 744
Cdd:TIGR04523  265 IKKQLSEKQKELEQNNKKIKELEKQlNQLKSEISDLN-NQKEQDWNKELKSELKNQEKKLEEIQNQisqNNKIISQLNEQ 343
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   745 IEDLldkqSRELQDMECCNNQKLLLEYEKYQELQlKSQRMQEEYEKQLRdNDETKSQALEELTEFYEAKLQEKTTLLEEA 824
Cdd:TIGR04523  344 ISQL----KKELTNSESENSEKQRELEEKQNEIE-KLKKENQSYKQEIK-NLESQINDLESKIQNQEKLNQQKDEQIKKL 417
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   825 QEDVRQQLREFEETKKQIEEDED------REIQDIKTKYE--KKLRDEKESNLR-LKGETGIMRKKFSSLQKEIEERTND 895
Cdd:TIGR04523  418 QQEKELLEKEIERLKETIIKNNSeikdltNQDSVKELIIKnlDNTRESLETQLKvLSRSINKIKQNLEQKQKELKSKEKE 497
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   896 IETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKK---------KNQELGKFKFVLDYKIKELKKQI 966
Cdd:TIGR04523  498 LKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDelnkddfelKKENLEKEIDEKNKEIEELKQTQ 577
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767902634   967 EPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQKERDLEALVKRFKTDLHN 1038
Cdd:TIGR04523  578 KSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQ 649
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
767-1027 2.67e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 74.97  E-value: 2.67e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  767 LLLEYEKYQElQLKSQRMQEEYEKQLRDNDETKSQALEELTEFYEAKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDE 846
Cdd:COG1196   230 LLLKLRELEA-ELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLE 308
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  847 DReiqdiktkyekkLRDEKESNLRLKGETgimrkkfSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQE 926
Cdd:COG1196   309 ER------------RRELEERLEELEEEL-------AELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLE 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  927 RDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQ 1006
Cdd:COG1196   370 AEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAE 449
                         250       260
                  ....*....|....*....|.
gi 767902634 1007 KLRATDQEMRRERQKERDLEA 1027
Cdd:COG1196   450 EEAELEEEEEALLELLAELLE 470
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
649-1149 2.99e-13

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 74.37  E-value: 2.99e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   649 KDGRGIKREReVGFAEEVLVTKTDMEEKAQVMLELKTRVEELKMENEyQLRLKdmnYSEKIKELTDkFIQEMESLKTKNQ 728
Cdd:pfam05483   85 KEAEKIKKWK-VSIEAELKQKENKLQENRKIIEAQRKAIQELQFENE-KVSLK---LEEEIQENKD-LIKENNATRHLCN 158
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   729 VLR------TEKEKQDVYHHEHIEDL-------LDKQSRELQDMECCNNQKLLLEYEKYQELQLKSQRMQEEYEKQLRDN 795
Cdd:pfam05483  159 LLKetcarsAEKTKKYEYEREETRQVymdlnnnIEKMILAFEELRVQAENARLEMHFKLKEDHEKIQHLEEEYKKEINDK 238
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   796 DETKSQALEELTEfYEAKLQEKTTLLEEAQEDVRQ---QLREFEETKKQIEEDED---REIQDIK--------------- 854
Cdd:pfam05483  239 EKQVSLLLIQITE-KENKMKDLTFLLEESRDKANQleeKTKLQDENLKELIEKKDhltKELEDIKmslqrsmstqkalee 317
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   855 -----TKYEKKLRDEKESNLRlkgETGIMRKKFSSLQKEIEERTNDIETL-KGEQMKLQG---VIKSLEKDIQGLKREIQ 925
Cdd:pfam05483  318 dlqiaTKTICQLTEEKEAQME---ELNKAKAAHSFVVTEFEATTCSLEELlRTEQQRLEKnedQLKIITMELQKKSSELE 394
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   926 ERDETIQDKEKRIYDLKK----------KNQELGKFKFVLDYKIKELKKQIEPRENEIRVMK--------------EQIQ 981
Cdd:pfam05483  395 EMTKFKNNKEVELEELKKilaedeklldEKKQFEKIAEELKGKEQELIFLLQAREKEIHDLEiqltaiktseehylKEVE 474
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   982 EMEAELENFHKQNTQLELNITELwqklratDQEMRRERQKERDLEALVKRFKTDLHNCVAyiQEPRLLKEkVRGLFEKYV 1061
Cdd:pfam05483  475 DLKTELEKEKLKNIELTAHCDKL-------LLENKELTQEASDMTLELKKHQEDIINCKK--QEERMLKQ-IENLEEKEM 544
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1062 QRADMVEiaglntDLQQEYTRQREHLERNLatlkKKVVKEGELHRTDYVRIMQENVSLIKEINELRRELKFTRSQVYDLE 1141
Cdd:pfam05483  545 NLRDELE------SVREEFIQKGDEVKCKL----DKSEENARSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEELH 614

                   ....*...
gi 767902634  1142 AALKLTKK 1149
Cdd:pfam05483  615 QENKALKK 622
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
704-1032 1.61e-12

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 72.40  E-value: 1.61e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   704 NYSEKIKELTDKfIQEMESLKTKNQVLRTEKEKQdvyhHEHIEDLLDKQSRELQDMEccnnQKLLLEYEKYQELQLKSQR 783
Cdd:TIGR02168  674 ERRREIEELEEK-IEELEEKIAELEKALAELRKE----LEELEEELEQLRKELEELS----RQISALRKDLARLEAEVEQ 744
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   784 mqeeYEKQLRDNDETKSQALEELTEfYEAKLQEKTTLLEEAqEDVRQQLREfeetkkQIEEDEDR--EIQDIKTKYEKKL 861
Cdd:TIGR02168  745 ----LEERIAQLSKELTELEAEIEE-LEERLEEAEEELAEA-EAEIEELEA------QIEQLKEElkALREALDELRAEL 812
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   862 RDEKESNLRLKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDL 941
Cdd:TIGR02168  813 TLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALL 892
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   942 KKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQ-NTQLELNITELWQKLRATDQEMRRERQ 1020
Cdd:TIGR02168  893 RSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERlSEEYSLTLEEAEALENKIEDDEEEARR 972
                          330
                   ....*....|..
gi 767902634  1021 KERDLEALVKRF 1032
Cdd:TIGR02168  973 RLKRLENKIKEL 984
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
770-1033 2.31e-12

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 71.64  E-value: 2.31e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   770 EYEKYQELQLKSQRMQEEYEKQLRDNDETKSQALEeltefYEAKLQEKTTLLEEAQEDvRQQLREFEETKKQIEEDEDRE 849
Cdd:TIGR02169  672 EPAELQRLRERLEGLKRELSSLQSELRRIENRLDE-----LSQELSDASRKIGEIEKE-IEQLEQEEEKLKERLEELEED 745
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   850 IQDIktkyEKKLRDEKESNLRLKGETGIMRKKFSSLQKEIEERtndietlkgEQMKLQGVIKSLEKDIQGLKREIQERDE 929
Cdd:TIGR02169  746 LSSL----EQEIENVKSELKELEARIEELEEDLHKLEEALNDL---------EARLSHSRIPEIQAELSKLEEEVSRIEA 812
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   930 TIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLR 1009
Cdd:TIGR02169  813 RLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERD 892
                          250       260
                   ....*....|....*....|....
gi 767902634  1010 ATDQEMRRERQKERDLEALVKRFK 1033
Cdd:TIGR02169  893 ELEAQLRELERKIEELEAQIEKKR 916
PTZ00121 PTZ00121
MAEBL; Provisional
655-1045 2.72e-12

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 71.71  E-value: 2.72e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  655 KREREVGFAEEvLVTKTDMEEKAQvmlELKTRVEELKMENEYQLRLKDMNYSEKIKELTDKFIQEMESLKTKNQVLRTEK 734
Cdd:PTZ00121 1438 KKAEEAKKADE-AKKKAEEAKKAE---EAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKAD 1513
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  735 EKQDVYHHEHIEDLLD-KQSRELQDMECCNNQKLLLEYEKYQELQLKSQRMQEEYEKQLRDNDETKSQALEELTEFYEAK 813
Cdd:PTZ00121 1514 EAKKAEEAKKADEAKKaEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEAR 1593
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  814 LQEKTTLLEEAQEDVRQQLREFEETKKQIEE-DEDREIQDIKTKYEKKLRDEKESNLRLKGETGIMRKKFSSLQKEIEER 892
Cdd:PTZ00121 1594 IEEVMKLYEEEKKMKAEEAKKAEEAKIKAEElKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEED 1673
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  893 TNDIETLKGEQmklqgviKSLEKDIQGLKREIQERDETIQDKeKRIYDLKKKNQELGKFKFVLDYKIKELKKQieprENE 972
Cdd:PTZ00121 1674 KKKAEEAKKAE-------EDEKKAAEALKKEAEEAKKAEELK-KKEAEEKKKAEELKKAEEENKIKAEEAKKE----AEE 1741
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767902634  973 IRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQKERDLEalVKRFKTDLHNCVAYIQE 1045
Cdd:PTZ00121 1742 DKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRME--VDKKIKDIFDNFANIIE 1812
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
671-989 4.62e-12

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 70.43  E-value: 4.62e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   671 TDMEEKAQVMLELKTRVEELKMENE-YQLRLKDMNYSEKIKELTDKFIQEMESLKTKN-QVLRTEKEKQDvyhhEHIEDL 748
Cdd:TIGR04523  356 SENSEKQRELEEKQNEIEKLKKENQsYKQEIKNLESQINDLESKIQNQEKLNQQKDEQiKKLQQEKELLE----KEIERL 431
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   749 LDKQSRELQDMECCNNQKLLLE--------YEKYQELQLKSqrMQEEYEKQLRDNDETKSQALEELTEFyeAKLQEKTTL 820
Cdd:TIGR04523  432 KETIIKNNSEIKDLTNQDSVKEliiknldnTRESLETQLKV--LSRSINKIKQNLEQKQKELKSKEKEL--KKLNEEKKE 507
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   821 LEEAQEDVRQQLREFEETKKQIEED---EDREIQDIKTKYEKKLRDEKESNLRlkgetgimrKKFSSLQKEIEERTNDIE 897
Cdd:TIGR04523  508 LEEKVKDLTKKISSLKEKIEKLESEkkeKESKISDLEDELNKDDFELKKENLE---------KEIDEKNKEIEELKQTQK 578
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   898 TLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMK 977
Cdd:TIGR04523  579 SLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIKETIKEIR 658
                          330
                   ....*....|..
gi 767902634   978 EQIQEMEAELEN 989
Cdd:TIGR04523  659 NKWPEIIKKIKE 670
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
786-1118 3.90e-11

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 67.77  E-value: 3.90e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   786 EEYEKQLRDNDETKSQ---ALEELTEFYEaKLQEKTTLLEEAQEdVRQQLREFEET----KKQIEEDEDREIQDIKTKYE 858
Cdd:TIGR02168  175 KETERKLERTRENLDRledILNELERQLK-SLERQAEKAERYKE-LKAELRELELAllvlRLEELREELEELQEELKEAE 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   859 KKlRDEKESNLRLKGEtgimrkKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRI 938
Cdd:TIGR02168  253 EE-LEELTAELQELEE------KLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQL 325
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   939 YDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRE 1018
Cdd:TIGR02168  326 EELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERL 405
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1019 RQKERDLEALVKRFKTDLHNCVAYIQEPRlLKEKVRGLFEKyvqRADMVEIAGLNTDLQQEYTRQREHLERNLATLKKKV 1098
Cdd:TIGR02168  406 EARLERLEDRRERLQQEIEELLKKLEEAE-LKELQAELEEL---EEELEELQEELERLEEALEELREELEEAEQALDAAE 481
                          330       340
                   ....*....|....*....|.
gi 767902634  1099 VKEGELH-RTDYVRIMQENVS 1118
Cdd:TIGR02168  482 RELAQLQaRLDSLERLQENLE 502
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
670-1204 4.57e-11

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 67.69  E-value: 4.57e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   670 KTDMEEKAQVMLELKTRVEELK--MENEYQLRLKDMNYSEKIKELTDKFIQEMESLKTKNQVLRTEKEKQDV------YH 741
Cdd:pfam02463  155 RLEIEEEAAGSRLKRKKKEALKklIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLlyldylKL 234
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   742 HEHIEDLLDKQSRELQDMECCNNQKLLLEYEKyQELQLKSQRMQEEYEKQLRDNDETKSQALEELTEFYEAKLQEKTTLL 821
Cdd:pfam02463  235 NEERIDLLQELLRDEQEEIESSKQEIEKEEEK-LAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDE 313
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   822 EEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKYEKKLRDEKESNLRLKgetgimrkkfssLQKEIEERTNDIETLKG 901
Cdd:pfam02463  314 EKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKL------------QEKLEQLEEELLAKKKL 381
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   902 EQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVM--KEQ 979
Cdd:pfam02463  382 ESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQelKLL 461
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   980 IQEMEAELENFHKQNTQLELNITELWQKLRATDQEMRR--ERQKERDLEALVKRFKTDLHNCVAYIQEPRLLKEKVRGLF 1057
Cdd:pfam02463  462 KDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSqkESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENY 541
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1058 EKYVQRADMVEIAGLNTDLQQEYTRQREHLERNLATLKKKVVKEGELHRTDYVRImQENVSLIKEINELRRELKFTRSQV 1137
Cdd:pfam02463  542 KVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAV-LEIDPILNLAQLDKATLEADEDDK 620
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 767902634  1138 YDLEAA--LKLTKKVRPQEVSETEPSRDMLSTAPTARLNEQEETGRIIEMQRLEIQRLRDQIQEQEQVT 1204
Cdd:pfam02463  621 RAKVVEgiLKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESEL 689
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
772-1101 4.74e-11

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 67.27  E-value: 4.74e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  772 EKYQELQLKsqrmQEEYEKQLRdndetkSQALEELtefyEAKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEdedreiq 851
Cdd:COG1196   213 ERYRELKEE----LKELEAELL------LLKLREL----EAELEELEAELEELEAELEELEAELAELEAELEE------- 271
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  852 diktkyekklrdekesnlrlkgetgiMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETI 931
Cdd:COG1196   272 --------------------------LRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEEL 325
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  932 QDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITEL-WQKLRA 1010
Cdd:COG1196   326 AELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELaAQLEEL 405
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1011 TDQEMRRERQKERDLEALVKRFKTDLHNCVAYIQEPRLLKEKVRGLFEKYVQRAdmvEIAGLNTDLQQEYTRQREHLERN 1090
Cdd:COG1196   406 EEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEE---ALLELLAELLEEAALLEAALAEL 482
                         330
                  ....*....|.
gi 767902634 1091 LATLKKKVVKE 1101
Cdd:COG1196   483 LEELAEAAARL 493
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
773-1212 9.83e-11

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 66.30  E-value: 9.83e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   773 KYQELQLKSQRMQEEYE-KQLRDNDETKSQALEELTEFYEAKLQEKTTlleeaqedvRQQLREFEETKKQIEEDEDREIQ 851
Cdd:pfam05557    8 KARLSQLQNEKKQMELEhKRARIELEKKASALKRQLDRESDRNQELQK---------RIRLLEKREAEAEEALREQAELN 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   852 DIKTKYEK---KLRDEKESNLRLKGETgimrkkFSSLQKEIEErtndietLKGEQMKLQGVIKSLEKDIQGLKREIQERD 928
Cdd:pfam05557   79 RLKKKYLEalnKKLNEKESQLADAREV------ISCLKNELSE-------LRRQIQRAELELQSTNSELEELQERLDLLK 145
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   929 ETIQDKEKRIYDLKKKNQELGkfkfVLDYKIKELKKQIEPRENEIRVMK------EQIQEMEAELENFHKQNTQL-ELNI 1001
Cdd:pfam05557  146 AKASEAEQLRQNLEKQQSSLA----EAEQRIKELEFEIQSQEQDSEIVKnskselARIPELEKELERLREHNKHLnENIE 221
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1002 TELWQKLRATDQEMRRERQKER-----DLEALVKRFKTDLHNCVAYIQEPRL---LKEKVRGLFEKYVQR--ADMVEIAG 1071
Cdd:pfam05557  222 NKLLLKEEVEDLKRKLEREEKYreeaaTLELEKEKLEQELQSWVKLAQDTGLnlrSPEDLSRRIEQLQQReiVLKEENSS 301
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1072 LNTDLQQEYTRQREhLERNLATLKKKVVKEG-ELHRTD-YVRIMQENVSLI-KEINELRRELKFTRSQVYDLEAALKLTK 1148
Cdd:pfam05557  302 LTSSARQLEKARRE-LEQELAQYLKKIEDLNkKLKRHKaLVRRLQRRVLLLtKERDGYRAILESYDKELTMSNYSPQLLE 380
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1149 KVRP-----QEV---------------SETEPSRDMLSTA---------------PTARLNEQEETGRIIEMQRLEIQRL 1193
Cdd:pfam05557  381 RIEEaedmtQKMqahneemeaqlsvaeEELGGYKQQAQTLerelqalrqqesladPSYSKEEVDSLRRKLETLELERQRL 460
                          490
                   ....*....|....*....
gi 767902634  1194 RDQIQEQEQVTGFHTLAGV 1212
Cdd:pfam05557  461 REQKNELEMELERRCLQGD 479
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
773-1096 5.44e-10

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 64.04  E-value: 5.44e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   773 KYQELQlKSQRMQEEYEKQLRDNDETKSQALEELTEFYEaKLQEKTTLLEEAQEdvrqqLREFEETKKQieededrEIQD 852
Cdd:pfam01576   10 KEEELQ-KVKERQQKAESELKELEKKHQQLCEEKNALQE-QLQAETELCAEAEE-----MRARLAARKQ-------ELEE 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   853 IKTKYEKKLRDEKESNLRLKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGL----------KR 922
Cdd:pfam01576   76 ILHELESRLEEEEERSQQLQNEKKKMQQHIQDLEEQLDEEEAARQKLQLEKVTTEAKIKKLEEDILLLedqnsklskeRK 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   923 EIQER-------------------------DETIQDKEKRIYDLKKKNQELGKFKFVLD--------------YKIKELK 963
Cdd:pfam01576  156 LLEERiseftsnlaeeeekakslsklknkhEAMISDLEERLKKEEKGRQELEKAKRKLEgestdlqeqiaelqAQIAELR 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   964 KQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQKERDLEALVKRFKTDLHNCV--- 1040
Cdd:pfam01576  236 AQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISELQEDLESERAARNKAEKQRRDLGEELEALKTELEDTLdtt 315
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767902634  1041 AYIQEPRLLKEKVRGLF------EKYVQRADMVEIAGLNTDLQQEYTRQREHLERNLATLKK 1096
Cdd:pfam01576  316 AAQQELRSKREQEVTELkkaleeETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEK 377
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
707-1045 1.33e-09

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 62.28  E-value: 1.33e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  707 EKIKELTDKFIQEMESLKTKN-QVLRTEKEKQDVYHHEHIEDLLDKQSRELQDMECCNNQKLLLEYEKYQELQLKSQRMQ 785
Cdd:COG5185   159 GIIKDIFGKLTQELNQNLKKLeIFGLTLGLLKGISELKKAEPSGTVNSIKESETGNLGSESTLLEKAKEIINIEEALKGF 238
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  786 EEYEKQLRDNDETK---SQALEELTEFYEAKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKYEKKLR 862
Cdd:COG5185   239 QDPESELEDLAQTSdklEKLVEQNTDLRLEKLGENAESSKRLNENANNLIKQFENTKEKIAEYTKSIDIKKATESLEEQL 318
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  863 DEKESN-----LRLKGETGIMR------KKFSSLQKEIEERTNDIETLKGEQM--KLQGVIKSLEKDIQGLKREIQERDE 929
Cdd:COG5185   319 AAAEAEqeleeSKRETETGIQNltaeieQGQESLTENLEAIKEEIENIVGEVElsKSSEELDSFKDTIESTKESLDEIPQ 398
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  930 TIQDKEKRIydlkkkNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELE-NFHKQNTQLELNITELWQKL 1008
Cdd:COG5185   399 NQRGYAQEI------LATLEDTLKAADRQIEELQRQIEQATSSNEEVSKLLNELISELNkVMREADEESQSRLEEAYDEI 472
                         330       340       350
                  ....*....|....*....|....*....|....*...
gi 767902634 1009 RATDQEMRRERQKER-DLEALVKRFKTDLHNCVAYIQE 1045
Cdd:COG5185   473 NRSVRSKKEDLNEELtQIESRVSTLKATLEKLRAKLER 510
PTZ00121 PTZ00121
MAEBL; Provisional
654-1130 1.38e-09

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 62.85  E-value: 1.38e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  654 IKREREVGFAEEVLVTKTDMEEKAQVMLELKtRVEELKMENEYQLRLKDMNYSEKIKELTDKFIQEMESLKTKNQVLRTE 733
Cdd:PTZ00121 1551 LKKAEELKKAEEKKKAEEAKKAEEDKNMALR-KAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAE 1629
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  734 KEKQDVYHHEHIEDLLDKQSRELQDMECCNNQKLLLEYEKYQELQLKSQRMQEEYEKQLRDNDETKSQAleeltefYEAK 813
Cdd:PTZ00121 1630 EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEA-------EEAK 1702
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  814 LQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKYEKKLRDEKEsnlrlkgetgimRKKFSSLQKEIEERT 893
Cdd:PTZ00121 1703 KAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE------------KKKIAHLKKEEEKKA 1770
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  894 NDIETLKgeqmklQGVIKslekdiQGLKREIQERDETIQDKEKRIYDLKKKNQELGKfkfvldykikelkkqieprENEI 973
Cdd:PTZ00121 1771 EEIRKEK------EAVIE------EELDEEDEKRRMEVDKKIKDIFDNFANIIEGGK-------------------EGNL 1819
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  974 RVMKEQIQEMEAELENFHKQNTQLElNITELWQKLRATDQEMRRERQKERDLEALVKRFKTDLHNcvayIQEPRLLKEKV 1053
Cdd:PTZ00121 1820 VINDSKEMEDSAIKEVADSKNMQLE-EADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEE----IEEADEIEKID 1894
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1054 RGLFEKYVQRADMveiAGLNTDLQQEYTRQREHLERNLATLKKKVVKEGELH------RTDYVRIMQENVSLIKEINELR 1127
Cdd:PTZ00121 1895 KDDIEREIPNNNM---AGKNNDIIDDKLDKDEYIKRDAEETREEIIKISKKDmcindfSSKFCDYMKDNISSGNCSDEER 1971

                  ...
gi 767902634 1128 REL 1130
Cdd:PTZ00121 1972 KEL 1974
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
678-1205 1.50e-09

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 62.44  E-value: 1.50e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   678 QVMLELKTRVEELKMENE--YQLRLKDMNYSEKIKELTDKFIQEMESLK-----------TKNQVLRT------------ 732
Cdd:pfam15921  110 QSVIDLQTKLQEMQMERDamADIRRRESQSQEDLRNQLQNTVHELEAAKclkedmledsnTQIEQLRKmmlshegvlqei 189
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   733 --------EKEKQDVYHHEHIEDL--------LDKQSRELQ--------------------DMECCNNQKLLLEYEKYQE 776
Cdd:pfam15921  190 rsilvdfeEASGKKIYEHDSMSTMhfrslgsaISKILRELDteisylkgrifpvedqlealKSESQNKIELLLQQHQDRI 269
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   777 LQLKSQR---------------------------MQEE-------YEKQLRDNDETKSQALEELTE---FYEAKLQ--EK 817
Cdd:pfam15921  270 EQLISEHeveitgltekassarsqansiqsqleiIQEQarnqnsmYMRQLSDLESTVSQLRSELREakrMYEDKIEelEK 349
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   818 TTLLEEAQ-EDVRQQLREFEETKKQIEEDEDREIQDIKtKYEKKLRDEKESNLRLKGETGIMRKKFSSLQKEIEERTNDI 896
Cdd:pfam15921  350 QLVLANSElTEARTERDQFSQESGNLDDQLQKLLADLH-KREKELSLEKEQNKRLWDRDTGNSITIDHLRRELDDRNMEV 428
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   897 EtlkgeqmKLQGVIKSLEKDIQGlkrEIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKEL---KKQIEPRENEI 973
Cdd:pfam15921  429 Q-------RLEALLKAMKSECQG---QMERQMAAIQGKNESLEKVSSLTAQLESTKEMLRKVVEELtakKMTLESSERTV 498
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   974 RVMKEQIQEMEAELENFHKQNTQLELNITELWQKLratdQEMRRERQKERDLEALVKRFKTDLHNCVAYIQeprLLKEKV 1053
Cdd:pfam15921  499 SDLTASLQEKERAIEATNAEITKLRSRVDLKLQEL----QHLKNEGDHLRNVQTECEALKLQMAEKDKVIE---ILRQQI 571
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1054 RGLFEKYVQR-----ADMVEIAGLNTDLQ------QEYTRQREHLERNLATLKKKvVKEGELHRTDYVRIMQENVSLIKE 1122
Cdd:pfam15921  572 ENMTQLVGQHgrtagAMQVEKAQLEKEINdrrlelQEFKILKDKKDAKIRELEAR-VSDLELEKVKLVNAGSERLRAVKD 650
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1123 I----NELRRELKFTRSQVYDLEAALKLTKKVRPQEVSETEPSRDMLSTAPTARLNEQEETGRIIE-MQRLEIQRLRDQI 1197
Cdd:pfam15921  651 IkqerDQLLNEVKTSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQTRNTLKsMEGSDGHAMKVAM 730

                   ....*...
gi 767902634  1198 QEQEQVTG 1205
Cdd:pfam15921  731 GMQKQITA 738
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
664-988 1.99e-09

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 62.06  E-value: 1.99e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   664 EEVLVTKTDMEEKAQVMLELKTRVEElkmeneyQLRLKDMNYSE--KIKELTDKFIQEMESLKTKNQVLR---TEKE--K 736
Cdd:pfam15921  482 EELTAKKMTLESSERTVSDLTASLQE-------KERAIEATNAEitKLRSRVDLKLQELQHLKNEGDHLRnvqTECEalK 554
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   737 QDVYHHEHIEDLLDKQSRELQDMECCNNQKL-LLEYEKYQ-ELQLKSQRMQEEYEKQLRDNDETKSQALEELTEFYEAkl 814
Cdd:pfam15921  555 LQMAEKDKVIEILRQQIENMTQLVGQHGRTAgAMQVEKAQlEKEINDRRLELQEFKILKDKKDAKIRELEARVSDLEL-- 632
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   815 qEKTTLLEEAQEDVRQqLREFEETKKQI--EEDEDR-EIQDIKTKYEKKLRDEKESNLRLKGETGIMRKKFSSLQKEIEE 891
Cdd:pfam15921  633 -EKVKLVNAGSERLRA-VKDIKQERDQLlnEVKTSRnELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQ 710
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   892 RTNDIETLKGEQ-------MKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKK 964
Cdd:pfam15921  711 TRNTLKSMEGSDghamkvaMGMQKQITAKRGQIDALQSKIQFLEEAMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAG 790
                          330       340
                   ....*....|....*....|....
gi 767902634   965 QIEPRENEIRVMKEQIQEMEAELE 988
Cdd:pfam15921  791 ELEVLRSQERRLKEKVANMEVALD 814
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
715-1202 2.26e-09

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 61.71  E-value: 2.26e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  715 KFIQEM--ESLKTKNQVLRTEKEKQDVYHHEHIEDLlDKQSRELQDmeccnnqklllEYEKYQELQLKSQRMQEEYE--K 790
Cdd:COG4717    41 AFIRAMllERLEKEADELFKPQGRKPELNLKELKEL-EEELKEAEE-----------KEEEYAELQEELEELEEELEelE 108
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  791 QLRDNDETKSQALEELTEFYEA-----KLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDrEIQDIKTKYEKKLRDek 865
Cdd:COG4717   109 AELEELREELEKLEKLLQLLPLyqeleALEAELAELPERLEELEERLEELRELEEELEELEA-ELAELQEELEELLEQ-- 185
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  866 eSNLRLKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKD--IQGLKREIQERDET---------IQDK 934
Cdd:COG4717   186 -LSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENEleAAALEERLKEARLLlliaaallaLLGL 264
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  935 EKRIYDLKKKNQELGK--------FKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQ 1006
Cdd:COG4717   265 GGSLLSLILTIAGVLFlvlgllalLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLD 344
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1007 KLRATDQEMR--RERQKERDLEALVKRFKTDLHNCVAyiqeprllkekvrGLFEKYVQRADMVEIaglntdlQQEYTRQR 1084
Cdd:COG4717   345 RIEELQELLReaEELEEELQLEELEQEIAALLAEAGV-------------EDEEELRAALEQAEE-------YQELKEEL 404
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1085 EHLERNLATLKKKVvkEGELHRTDYVRIMQENVSLIKEINELRRELKFTRSQVYDLEAALkltkkvrpqevsetepsrdm 1164
Cdd:COG4717   405 EELEEQLEELLGEL--EELLEALDEEELEEELEELEEELEELEEELEELREELAELEAEL-------------------- 462
                         490       500       510
                  ....*....|....*....|....*....|....*...
gi 767902634 1165 lstaptarlnEQEETGRIIEMQRLEIQRLRDQIQEQEQ 1202
Cdd:COG4717   463 ----------EQLEEDGELAELLQELEELKAELRELAE 490
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
842-1210 2.43e-09

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 61.62  E-value: 2.43e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  842 IEEDEDRE--------IQDIKTKYEKKLRDEKESNLRLKG-ETGIMRKkfSSLQKEIEERTNDIETLKGEQMKLQGVIKS 912
Cdd:PRK03918  141 LESDESREkvvrqilgLDDYENAYKNLGEVIKEIKRRIERlEKFIKRT--ENIEELIKEKEKELEEVLREINEISSELPE 218
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  913 LEKDIQGLKREIQERDETiqdKEKrIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEaELENFHK 992
Cdd:PRK03918  219 LREELEKLEKEVKELEEL---KEE-IEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELK-ELKEKAE 293
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  993 QNTQLELNITELWQKLRATDQEMRRERQKERDLEALVKrfktDLHNCVAYIQEprlLKEKVRGLFEKYvqrADMVEIAGL 1072
Cdd:PRK03918  294 EYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIK----ELEEKEERLEE---LKKKLKELEKRL---EELEERHEL 363
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1073 NTDLQQEYTRQREHLERNLATLKKKVVKEGELHRTDYVRIMQENVSLIKEINELRRELKFTRSQVYDLEAAlKLTKKVRP 1152
Cdd:PRK03918  364 YEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKA-KGKCPVCG 442
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1153 QEVSETEPSRDMLS-TAPTARL-NEQEETGRIIEMQRLEIQRLRDQIQEQEQVTGFHTLA 1210
Cdd:PRK03918  443 RELTEEHRKELLEEyTAELKRIeKELKEIEEKERKLRKELRELEKVLKKESELIKLKELA 502
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
715-1202 2.48e-09

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 61.71  E-value: 2.48e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  715 KFIQEMESLKTKNQVLRTEKEKQDVYHHehIEDLLDKQSRELQDMEccnnqkllleyEKYQELQLKSQRMqeEYEKQLRD 794
Cdd:COG4717    65 KPELNLKELKELEEELKEAEEKEEEYAE--LQEELEELEEELEELE-----------AELEELREELEKL--EKLLQLLP 129
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  795 NDETKSQALEELTEFYE--AKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEdedrEIQDIKTKYEKKLRDEKESNLRLK 872
Cdd:COG4717   130 LYQELEALEAELAELPErlEELEERLEELRELEEELEELEAELAELQEELEE----LLEQLSLATEEELQDLAEELEELQ 205
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  873 GETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQgVIKSLEK------------DIQGLKREIQERDETIQDKEKRIYD 940
Cdd:COG4717   206 QRLAELEEELEEAQEELEELEEELEQLENELEAAA-LEERLKEarlllliaaallALLGLGGSLLSLILTIAGVLFLVLG 284
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  941 LkkknqeLGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMR--RE 1018
Cdd:COG4717   285 L------LALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLReaEE 358
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1019 RQKERDLEALVKRFKTDLHNCVAyiqeprllkekvrGLFEKYVQRADMVEIaglntdlQQEYTRQREHLERNLATLKKKV 1098
Cdd:COG4717   359 LEEELQLEELEQEIAALLAEAGV-------------EDEEELRAALEQAEE-------YQELKEELEELEEQLEELLGEL 418
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1099 vkEGELHRTDYVRIMQENVSLIKEINELRRELKFTRSQVYDLEAALKLTKKVRpqEVSETEPSRDMLStaptARLNEQEE 1178
Cdd:COG4717   419 --EELLEALDEEELEEELEELEEELEELEEELEELREELAELEAELEQLEEDG--ELAELLQELEELK----AELRELAE 490
                         490       500
                  ....*....|....*....|....
gi 767902634 1179 TGRIIEMQRLEIQRLRDQIQEQEQ 1202
Cdd:COG4717   491 EWAALKLALELLEEAREEYREERL 514
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
792-1034 4.50e-09

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 60.80  E-value: 4.50e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  792 LRDNDETKSQA-LEELTEFY-EAKLQEKTTLLEEAQEDVRQQLrefEETKKQIEEDEdREIQDIKTKYekklrdekeSNL 869
Cdd:COG3206   142 YTSPDPELAAAvANALAEAYlEQNLELRREEARKALEFLEEQL---PELRKELEEAE-AALEEFRQKN---------GLV 208
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  870 RLKGETGIMRKKFSSLQKEIEErtndietLKGEQMKLQGVIKSLEKDIQGLKREIQE--RDETIQDKEKRIYDLKKKNQE 947
Cdd:COG3206   209 DLSEEAKLLLQQLSELESQLAE-------ARAELAEAEARLAALRAQLGSGPDALPEllQSPVIQQLRAQLAELEAELAE 281
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  948 LGKfKFVLDY-KIKELKKQIEPRENEIRVMKEQI-QEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQKERDL 1025
Cdd:COG3206   282 LSA-RYTPNHpDVIALRAQIAALRAQLQQEAQRIlASLEAELEALQAREASLQAQLAQLEARLAELPELEAELRRLEREV 360

                  ....*....
gi 767902634 1026 EALVKRFKT 1034
Cdd:COG3206   361 EVARELYES 369
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
776-1027 6.22e-09

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 58.77  E-value: 6.22e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  776 ELQLKSQRMQEEYEkQLRDNDETKSQALEELTEFYEAKLQEKTTLLEEAQEdVRQQLREFEETKKQIEEDEDREIQDIKT 855
Cdd:COG1340    12 ELEEKIEELREEIE-ELKEKRDELNEELKELAEKRDELNAQVKELREEAQE-LREKRDELNEKVKELKEERDELNEKLNE 89
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  856 KYEK--KLRDEKESNLRLKGETGIMRKKFSSLQKEIEERTNDIEtlkgEQMKLQGVIKSLEKDIQGLKREIQERDEtIQD 933
Cdd:COG1340    90 LREEldELRKELAELNKAGGSIDKLRKEIERLEWRQQTEVLSPE----EEKELVEKIKELEKELEKAKKALEKNEK-LKE 164
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  934 KEKRIYDLKKKNQELGKfkfvldyKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQ 1013
Cdd:COG1340   165 LRAELKELRKEAEEIHK-------KIKELAEEAQELHEEMIELYKEADELRKEADELHKEIVEAQEKADELHEEIIELQK 237
                         250
                  ....*....|....
gi 767902634 1014 EMRRERQKERDLEA 1027
Cdd:COG1340   238 ELRELRKELKKLRK 251
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
786-1199 7.83e-09

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 60.05  E-value: 7.83e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  786 EEYEKQLRDNDETKSQALEELTEfYEAKLQEKTTLlEEAQEDVRQQLREFEETKkqieEDEDREIQDIKTKYEKkLRDEK 865
Cdd:PRK02224  223 ERYEEQREQARETRDEADEVLEE-HEERREELETL-EAEIEDLRETIAETERER----EELAEEVRDLRERLEE-LEEER 295
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  866 ESnlrLKGETGimrkkFSSLqkeieertnDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKN 945
Cdd:PRK02224  296 DD---LLAEAG-----LDDA---------DAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERA 358
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  946 QELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENfhkQNTQLElNITELWQKLRATDQEMrreRQKERDL 1025
Cdd:PRK02224  359 EELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGD---APVDLG-NAEDFLEELREERDEL---REREAEL 431
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1026 EAlvkrfktDLHNCVAYIQEPR-LLKEKVRGLFEKYVQRADMVEIAGLNTDLQQEYTRQREHLERNLATLKKKVVKEGEL 1104
Cdd:PRK02224  432 EA-------TLRTARERVEEAEaLLEAGKCPECGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDL 504
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1105 HRT-DYVRIMQENVSLIKE--------INELRRELKFTRSQVYDLEAALKLTKKVRPQEVSETEPSRDMLSTAPTAR--L 1173
Cdd:PRK02224  505 VEAeDRIERLEERREDLEEliaerretIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLaeL 584
                         410       420
                  ....*....|....*....|....*.
gi 767902634 1174 NEQEETGRIIEMQRLEIQRLRDQIQE 1199
Cdd:PRK02224  585 KERIESLERIRTLLAAIADAEDEIER 610
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
775-972 1.12e-08

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 58.62  E-value: 1.12e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  775 QELQLKSQRMQEEyEKQLRDNDETKSQALEELTEFyEAKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEdREIQDIK 854
Cdd:COG4942    27 AELEQLQQEIAEL-EKELAALKKEEKALLKQLAAL-ERRIAALARRIRALEQELAALEAELAELEKEIAELR-AELEAQK 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  855 TKYEKKLR--------------------DEKESNLR-LKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSL 913
Cdd:COG4942   104 EELAELLRalyrlgrqpplalllspedfLDAVRRLQyLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAEL 183
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 767902634  914 EKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENE 972
Cdd:COG4942   184 EEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAER 242
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
879-1036 1.15e-08

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 58.62  E-value: 1.15e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  879 RKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKfkfVLDYK 958
Cdd:COG4942    26 EAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRA---ELEAQ 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  959 IKELKKQI-------------------EPRENEIRVM---------KEQIQEMEAELENFHKQNTQLELNITELWQKLRA 1010
Cdd:COG4942   103 KEELAELLralyrlgrqpplalllspeDFLDAVRRLQylkylaparREQAEELRADLAELAALRAELEAERAELEALLAE 182
                         170       180
                  ....*....|....*....|....*.
gi 767902634 1011 TDQEMRRERQKERDLEALVKRFKTDL 1036
Cdd:COG4942   183 LEEERAALEALKAERQKLLARLEKEL 208
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
654-1164 1.45e-08

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 59.29  E-value: 1.45e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   654 IKREREVGFAEEVLVTKTDMEEKAQVMLELKTRVEELKMENEYQLRLKDMNYSEKIKELTDKFIQ------EMESLKTKN 727
Cdd:TIGR00606  398 LVIERQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKElqqlegSSDRILELD 477
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   728 QVLRTE---------------KEKQDVYHHEHIEDLLDKQSRELQDMECCNNQKLLLEyekyQELQLKSQRMQEEyeKQL 792
Cdd:TIGR00606  478 QELRKAerelskaeknsltetLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRT----QMEMLTKDKMDKD--EQI 551
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   793 RDNDETKSQALEELTEFYEAKLQEKTTL---------LEEAQEDVRQQLREFEETKKQIEEDEDREIQDIkTKYEKKLRD 863
Cdd:TIGR00606  552 RKIKSRHSDELTSLLGYFPNKKQLEDWLhskskeinqTRDRLAKLNKELASLEQNKNHINNELESKEEQL-SSYEDKLFD 630
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   864 -----EKESNL-RLKGETGIMRKKFSSLQKEIEERTNDIETLKGE--------------QMKLQGVIKSLEKDIQGLKRE 923
Cdd:TIGR00606  631 vcgsqDEESDLeRLKEEIEKSSKQRAMLAGATAVYSQFITQLTDEnqsccpvcqrvfqtEAELQEFISDLQSKLRLAPDK 710
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   924 IQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITE 1003
Cdd:TIGR00606  711 LKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTD 790
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1004 LwqklRATDQEMRRERQKERDLEALVKrfKTDLHNCVAYIQEPRLLKEKVRGLFEKYVQRADmveiagLNTDLQQEYTRQ 1083
Cdd:TIGR00606  791 V----TIMERFQMELKDVERKIAQQAA--KLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIE------LNRKLIQDQQEQ 858
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1084 REHLERNLATLKKKVVKEGE-LHRTDyvRIMQENVSLIKEINELRRELKFTRSQVYDLEAALKLTKKVRPQEVSETEPSR 1162
Cdd:TIGR00606  859 IQHLKSKTNELKSEKLQIGTnLQRRQ--QFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSN 936

                   ..
gi 767902634  1163 DM 1164
Cdd:TIGR00606  937 KK 938
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
663-1142 1.51e-08

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 59.28  E-value: 1.51e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  663 AEEVLvtkTDMEEKAQVMLELKTRVEELKMENEYQLRLKDmNYSEKIKELTDkfiqEMESLKTKNQVLRTEKEKQDVyHH 742
Cdd:PRK02224  239 ADEVL---EEHEERREELETLEAEIEDLRETIAETERERE-ELAEEVRDLRE----RLEELEEERDDLLAEAGLDDA-DA 309
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  743 EHIEDLLDKQSRELQDMEccnnqkllleyEKYQELQLKSQRMQEEYEKQLRDNDETKSQAlEELTEfyEAKLQEKTtlLE 822
Cdd:PRK02224  310 EAVEARREELEDRDEELR-----------DRLEECRVAAQAHNEEAESLREDADDLEERA-EELRE--EAAELESE--LE 373
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  823 EAQEDVRQQLREFEETKKQIEEDEDReIQDIKTKYEK--KLRDEKESNL-RLKGETGIMRKKFSSLQK------------ 887
Cdd:PRK02224  374 EAREAVEDRREEIEELEEEIEELRER-FGDAPVDLGNaeDFLEELREERdELREREAELEATLRTARErveeaealleag 452
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  888 ----------------EIEERTNDIETLKGEQMKLQGVIKSLEKDIqglkreiqERDETIQDKEKRIYDLKKKNQELGKf 951
Cdd:PRK02224  453 kcpecgqpvegsphveTIEEDRERVEELEAELEDLEEEVEEVEERL--------ERAEDLVEAEDRIERLEERREDLEE- 523
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  952 kfvldyKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQemRRERQKERdLEALvKR 1031
Cdd:PRK02224  524 ------LIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNS--KLAELKER-IESL-ER 593
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1032 FKTDLHNCVAYIQEPRLLKEKVRGLFEKYVQRADMVE-----IAGLN--------TDLQQEYTRQREHLERNLATLKKKV 1098
Cdd:PRK02224  594 IRTLLAAIADAEDEIERLREKREALAELNDERRERLAekrerKRELEaefdeariEEAREDKERAEEYLEQVEEKLDELR 673
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|....*
gi 767902634 1099 VKEGELHRtdyvRI-MQENVslIKEINELRRELKFTRSQVYDLEA 1142
Cdd:PRK02224  674 EERDDLQA----EIgAVENE--LEELEELRERREALENRVEALEA 712
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
879-1036 1.79e-08

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 56.47  E-value: 1.79e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  879 RKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIyDLKKKNQELGkfkfVLDYK 958
Cdd:COG1579    23 EHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQL-GNVRNNKEYE----ALQKE 97
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 767902634  959 IKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQK-ERDLEALVKRFKTDL 1036
Cdd:COG1579    98 IESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEElEAEREELAAKIPPEL 176
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
656-988 3.18e-08

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 58.19  E-value: 3.18e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   656 REREVGFAE-EVLVTKTDMEEKAQVMLELKTRVEELKMENeyqlrLKDMNYSEKIKELTDKFIQEmeslkTKNQVLRTEK 734
Cdd:pfam05483  448 REKEIHDLEiQLTAIKTSEEHYLKEVEDLKTELEKEKLKN-----IELTAHCDKLLLENKELTQE-----ASDMTLELKK 517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   735 EKQDVYHHEHIEDLLDKQSRELQdmeccnnqkllleyEKYQELQLKSQRMQEEYeKQLRDNDETKSQALEELTEFYEAKL 814
Cdd:pfam05483  518 HQEDIINCKKQEERMLKQIENLE--------------EKEMNLRDELESVREEF-IQKGDEVKCKLDKSEENARSIEYEV 582
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   815 QEKTTLLEEAQEDVRQQLREFEETKKQIEE--DEDREIQDIKTKYEKKLRDEKESNLRLKGETGIMRKKFSSL----QKE 888
Cdd:pfam05483  583 LKKEKQMKILENKCNNLKKQIENKNKNIEElhQENKALKKKGSAENKQLNAYEIKVNKLELELASAKQKFEEIidnyQKE 662
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   889 IEERTNDIETLKGEQMKLQGVIKS---LEKDIQG------------LKREIQERDETIQDKEKRIYDLKKKNQELGKFKF 953
Cdd:pfam05483  663 IEDKKISEEKLLEEVEKAKAIADEavkLQKEIDKrcqhkiaemvalMEKHKHQYDKIIEERDSELGLYKNKEQEQSSAKA 742
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 767902634   954 VLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELE 988
Cdd:pfam05483  743 ALEIELSNIKAELLSLKKQLEIEKEEKEKLKMEAK 777
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
915-1145 3.34e-08

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 57.08  E-value: 3.34e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  915 KDIQGLKREIQERDETIQDKEKRIYDLKKKnqelgkfkfvldykIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQN 994
Cdd:COG4942    20 DAAAEAEAELEQLQQEIAELEKELAALKKE--------------EKALLKQLAALERRIAALARRIRALEQELAALEAEL 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  995 TQLELNITELWQKLratdqemrrERQKERDLEALVKRFKTDLHNCVAYIQEPRLLKEKVRGL-FEKYVQRADMVEIAGLN 1073
Cdd:COG4942    86 AELEKEIAELRAEL---------EAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLqYLKYLAPARREQAEELR 156
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767902634 1074 TDlQQEYTRQREHLERNLATLkKKVVKEGELHRTDYVRIMQENVSLI----KEINELRRELKFTRSQVYDLEAALK 1145
Cdd:COG4942   157 AD-LAELAALRAELEAERAEL-EALLAELEEERAALEALKAERQKLLarleKELAELAAELAELQQEAEELEALIA 230
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
674-1000 7.38e-08

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 57.00  E-value: 7.38e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  674 EEKAQVMLELKTRVEELkMENEYQLRLKDMNYSEKIKELtDKFIQEMESLKTKNQVLRTEKEKqdvyhhEHIEDLLDKQS 753
Cdd:PRK03918  387 EKLEKELEELEKAKEEI-EEEISKITARIGELKKEIKEL-KKAIEELKKAKGKCPVCGRELTE------EHRKELLEEYT 458
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  754 RELQDMEccNNQKLLLEYEKYQELQLKSQRMQEEYEKQLRDNDETKSQ---ALEELTEFYEAKLQEKTTLLEEAQEDVRQ 830
Cdd:PRK03918  459 AELKRIE--KELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQlkeLEEKLKKYNLEELEKKAEEYEKLKEKLIK 536
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  831 QLREFEETKKQIEEDEdrEIQDIKTKYEKKLRDEKESNLRLKGEtgIMRKKFSSLqKEIEERTNDIETLKGEQMKLQGV- 909
Cdd:PRK03918  537 LKGEIKSLKKELEKLE--ELKKKLAELEKKLDELEEELAELLKE--LEELGFESV-EELEERLKELEPFYNEYLELKDAe 611
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  910 --IKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDY------------------------------ 957
Cdd:PRK03918  612 keLEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYeelreeylelsrelaglraeleelekrree 691
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*....
gi 767902634  958 ------KIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELN 1000
Cdd:PRK03918  692 ikktleKLKEELEEREKAKKELEKLEKALERVEELREKVKKYKALLKER 740
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
811-1201 1.01e-07

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 56.46  E-value: 1.01e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  811 EAKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDReiqdiKTKYEkKLRDEKESNLRLKgetgimrkkfsSLQKEIE 890
Cdd:COG4913   609 RAKLAALEAELAELEEELAEAEERLEALEAELDALQER-----REALQ-RLAEYSWDEIDVA-----------SAEREIA 671
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  891 ERTNDIETLKGEQmklqgviksleKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELgkfkfvldykikelkkqieprE 970
Cdd:COG4913   672 ELEAELERLDASS-----------DDLAALEEQLEELEAELEELEEELDELKGEIGRL---------------------E 719
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  971 NEIRVMKEQIQEMEAELENF-HKQNTQLELNITELWQKLRATdqemRRERQKERDLEALVKRFKTDLHNcvayiqeprlL 1049
Cdd:COG4913   720 KELEQAEEELDELQDRLEAAeDLARLELRALLEERFAAALGD----AVERELRENLEERIDALRARLNR----------A 785
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1050 KEKVRGLFEKYVQRADMvEIAGLNTDL------QQEYTR-QREHLERNLATLKKKVVKEGELHRTDYVRIMQENVSLIKE 1122
Cdd:COG4913   786 EEELERAMRAFNREWPA-ETADLDADLeslpeyLALLDRlEEDGLPEYEERFKELLNENSIEFVADLLSKLRRAIREIKE 864
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1123 ----INELRRELKFTRSQVYDLEAalkltKKVRPQEVSETepsRDMLStapTARLNEQEETGRIIEMQRLEIQRLRDQIQ 1198
Cdd:COG4913   865 ridpLNDSLKRIPFGPGRYLRLEA-----RPRPDPEVREF---RQELR---AVTSGASLFDEELSEARFAALKRLIERLR 933

                  ...
gi 767902634 1199 EQE 1201
Cdd:COG4913   934 SEE 936
WD40 COG2319
WD40 repeat [General function prediction only];
461-648 1.22e-07

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 55.69  E-value: 1.22e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  461 GGHLFAAVNGNVIHVYTTTSLENISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWNLSTGKRETECVLKSCSYNCVT 540
Cdd:COG2319     6 GAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVA 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  541 VSPDAKIIFAVGSDHTLK--EIADSLILREISAFDVTYTAIVISHSGRMMFVGTSVGTIRAmkYPLPLQKEFNEYQAHAG 618
Cdd:COG2319    86 FSPDGRLLASASADGTVRlwDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRL--WDLATGKLLRTLTGHSG 163
                         170       180       190
                  ....*....|....*....|....*....|..
gi 767902634  619 PITkmLLTF--DDQFLLTAAEDGclfTWKVFD 648
Cdd:COG2319   164 AVT--SVAFspDGKLLASGSDDG---TVRLWD 190
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
686-1202 1.47e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 56.22  E-value: 1.47e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   686 RVEELKMENEYQLRLKDMNysEKIKELTDKFIQEMESLKTKNQVLRTEKEKQDVYHhEHIEDLLDKQSRELQDMECCNNQ 765
Cdd:TIGR02168  325 LEELESKLDELAEELAELE--EKLEELKEELESLEAELEELEAELEELESRLEELE-EQLETLRSKVAQLELQIASLNNE 401
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   766 KLLLEYEKyQELQLKSQRMQEEYEKQLRDNDETKSQALE---ELTEFYEAKLQEKTTLLEEAQEDVRQQLREFEETKKQI 842
Cdd:TIGR02168  402 IERLEARL-ERLEDRRERLQQEIEELLKKLEEAELKELQaelEELEEELEELQEELERLEEALEELREELEEAEQALDAA 480
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   843 EEDEDR------EIQDIKTKYEKKLRDEKE---SNLRLKGETGIMRKKFS---SLQKEIE----ERTNDIETlKGEQMKL 906
Cdd:TIGR02168  481 ERELAQlqarldSLERLQENLEGFSEGVKAllkNQSGLSGILGVLSELISvdeGYEAAIEaalgGRLQAVVV-ENLNAAK 559
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   907 QGvIKSLEK-----------------DIQGLKREIQERDETIQDKEKRIY---------------------------DLK 942
Cdd:TIGR02168  560 KA-IAFLKQnelgrvtflpldsikgtEIQGNDREILKNIEGFLGVAKDLVkfdpklrkalsyllggvlvvddldnalELA 638
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   943 KKNQELGKFkFVLD------------------YKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNIT-- 1002
Cdd:TIGR02168  639 KKLRPGYRI-VTLDgdlvrpggvitggsaktnSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEql 717
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1003 -----ELWQKLRATDQEMRRERQKERDLEALVKRFKTDLHNCVAYIQEPRLLKEKVRGLFEkyvqrADMVEIAGLNTDLQ 1077
Cdd:TIGR02168  718 rkeleELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELA-----EAEAEIEELEAQIE 792
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1078 QeYTRQREHLERNLATLKKKVvkegELHRTDYVRIMQENVSLIKEINELRRELKFTRSQVYDLEAALKLTKKvrpqEVSE 1157
Cdd:TIGR02168  793 Q-LKEELKALREALDELRAEL----TLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAA----EIEE 863
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|....*
gi 767902634  1158 TEPSRDMLSTAPTARLNEQEETGRIIEMQRLEIQRLRDQIQEQEQ 1202
Cdd:TIGR02168  864 LEELIEELESELEALLNERASLEEALALLRSELEELSEELRELES 908
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
782-1033 1.49e-07

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 55.79  E-value: 1.49e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  782 QRMQEEYekqLRDNDETKSQALEELTEFYEAKLQEKTTLLEEAQedvrQQLREFEETKKQIEEDEDREIqdiktkYEKKL 861
Cdd:COG3206   155 NALAEAY---LEQNLELRREEARKALEFLEEQLPELRKELEEAE----AALEEFRQKNGLVDLSEEAKL------LLQQL 221
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  862 RDEKESNLRLKGETGIMRKKFSSLQKEIEERTNDIETLKGEQmklqgVIKSLEKDIQGLKREIQERDETIQDKEKRIYDL 941
Cdd:COG3206   222 SELESQLAEARAELAEAEARLAALRAQLGSGPDALPELLQSP-----VIQQLRAQLAELEAELAELSARYTPNHPDVIAL 296
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  942 KKKNQELGKFKFvldykiKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLelnitelwQKLRATDQEMRRERQK 1021
Cdd:COG3206   297 RAQIAALRAQLQ------QEAQRILASLEAELEALQAREASLQAQLAQLEARLAEL--------PELEAELRRLEREVEV 362
                         250
                  ....*....|...
gi 767902634 1022 ERDL-EALVKRFK 1033
Cdd:COG3206   363 ARELyESLLQRLE 375
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
795-992 1.56e-07

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 54.68  E-value: 1.56e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  795 NDETKSQALEELTEFYEaKLQEKTTLLEEAQEDVRQQLREFEetkKQIEEDEDrEIQDIKTKYEKKLRDEKESnlrlkge 874
Cdd:cd22656   108 DDEELEEAKKTIKALLD-DLLKEAKKYQDKAAKVVDKLTDFE---NQTEKDQT-ALETLEKALKDLLTDEGGA------- 175
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  875 tgIMRKKFSSLQKEIEERTNDIetlkgeqmklqgvIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKknqelgkfkfv 954
Cdd:cd22656   176 --IARKEIKDLQKELEKLNEEY-------------AAKLKAKIDELKALIADDEAKLAAALRLIADLTA----------- 229
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 767902634  955 LDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHK 992
Cdd:cd22656   230 ADTDLDNLLALIGPAIPALEKLQGAWQAIATDLDSLKD 267
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
682-1157 2.23e-07

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 55.16  E-value: 2.23e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  682 ELKTRVEELKmeneyQLRLKDMNYSEKIKELtDKFIQEMESLKTKNQVLRTEKEK--QDVYHHEHIEDLLDKQsRELQDM 759
Cdd:COG4717    72 ELKELEEELK-----EAEEKEEEYAELQEEL-EELEEELEELEAELEELREELEKleKLLQLLPLYQELEALE-AELAEL 144
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  760 EccnnqkllLEYEKYQELQLKSQRMQEEYEKQLRDNDETKSQALEELTEFYEAKLQEKTTLLEEAQEdVRQQLREFEETK 839
Cdd:COG4717   145 P--------ERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEE-LQQRLAELEEEL 215
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  840 KQIEEDEDREIQDIKTKYEKKLRDEKESNLRLKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVI-KSLEKDIQ 918
Cdd:COG4717   216 EEAQEELEELEEELEQLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGLLaLLFLLLAR 295
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  919 GLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEpRENEIRVMKEQIQEMEAELenfhkQNTQLE 998
Cdd:COG4717   296 EKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLD-RIEELQELLREAEELEEEL-----QLEELE 369
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  999 LNITELWQKLRATDQEMRRERQKE-RDLEALVKRFktdlhncvayiqepRLLKEKVRGLFEKYVQRADMVEIAGLNTDLQ 1077
Cdd:COG4717   370 QEIAALLAEAGVEDEEELRAALEQaEEYQELKEEL--------------EELEEQLEELLGELEELLEALDEEELEEELE 435
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1078 Q------EYTRQREHLERNLATLKKKVvkEGELHRTDYVRIMQENVSLIKEINELRRELKFTRSQVYDLEAALKLTKKVR 1151
Cdd:COG4717   436 EleeeleELEEELEELREELAELEAEL--EQLEEDGELAELLQELEELKAELRELAEEWAALKLALELLEEAREEYREER 513

                  ....*.
gi 767902634 1152 PQEVSE 1157
Cdd:COG4717   514 LPPVLE 519
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
812-938 2.71e-07

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 53.00  E-value: 2.71e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  812 AKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKtKYEKKL---RDEKESNlRLKGETGIMRKKFSSLQKE 888
Cdd:COG1579    34 AELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIK-KYEEQLgnvRNNKEYE-ALQKEIESLKRRISDLEDE 111
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 767902634  889 IEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRI 938
Cdd:COG1579   112 ILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEEL 161
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
800-1031 2.92e-07

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 54.38  E-value: 2.92e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  800 SQALEELTEFYEAKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEdREIQDIktkyEKKLRDekesnlrLKGETGIMR 879
Cdd:COG4942    15 AAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALE-RRIAAL----ARRIRA-------LEQELAALE 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  880 KKFSSLQKEIEERTNDIETLKGEqmkLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKI 959
Cdd:COG4942    83 AELAELEKEIAELRAELEAQKEE---LAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADL 159
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767902634  960 KELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQKERDLEALVKR 1031
Cdd:COG4942   160 AELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIAR 231
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
843-1199 3.26e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 55.07  E-value: 3.26e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   843 EEDEDREIQDIKTKYEKKLRDEKESNLRLKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKR 922
Cdd:TIGR02169  672 EPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQ 751
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   923 EIQERDETIQDKEKRIYDLKKKnqelgkfkfvldykIKELKKQIEPRENEIRvmKEQIQEMEAELENFHKQNTQLELNIT 1002
Cdd:TIGR02169  752 EIENVKSELKELEARIEELEED--------------LHKLEEALNDLEARLS--HSRIPEIQAELSKLEEEVSRIEARLR 815
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1003 ELWQKLRATDQEMRRERQKERDLEALVkrfktdlhncvayiqeprllkekvrglfekyvqradmveiagLNTDLQQEYTR 1082
Cdd:TIGR02169  816 EIEQKLNRLTLEKEYLEKEIQELQEQR------------------------------------------IDLKEQIKSIE 853
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1083 QREHLernlatlkkkvvkegelhrtdyvrimqenvsLIKEINELRRELKFTRSQVYDLEAALKLTKKVRPQ---EVSETE 1159
Cdd:TIGR02169  854 KEIEN-------------------------------LNGKKEELEEELEELEAALRDLESRLGDLKKERDEleaQLRELE 902
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 767902634  1160 PSRDMLSTA---PTARLNEQEETGRIIEMQRLEIQRLRDQIQE 1199
Cdd:TIGR02169  903 RKIEELEAQiekKRKRLSELKAKLEALEEELSEIEDPKGEDEE 945
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
665-1097 1.19e-06

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 53.26  E-value: 1.19e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   665 EVLVTKTDMEEKAQVMLELKTRVEelKMENEYQLRLKdmnysekiKEltDKFIQEMESLKTKNQVLRTEKEKQDVYHHEH 744
Cdd:pfam01576  163 EFTSNLAEEEEKAKSLSKLKNKHE--AMISDLEERLK--------KE--EKGRQELEKAKRKLEGESTDLQEQIAELQAQ 230
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   745 IEDL---LDKQSRELQ------DMECCNNQKLLleyEKYQELQLKSQRMQE--EYEKQLRDNDETKSQALEELTEFYEAK 813
Cdd:pfam01576  231 IAELraqLAKKEEELQaalarlEEETAQKNNAL---KKIRELEAQISELQEdlESERAARNKAEKQRRDLGEELEALKTE 307
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   814 LqEKTTLLEEAQEDVR-QQLREFEETKKQIEED---EDREIQDIKTKYEKKLRDEKESNLRLKGETGIMRKKFSSLQKEI 889
Cdd:pfam01576  308 L-EDTLDTTAAQQELRsKREQEVTELKKALEEEtrsHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESEN 386
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   890 EERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPR 969
Cdd:pfam01576  387 AELQAELRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSL 466
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   970 ENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLratDQEMRRERQKERDLEALV-------KRFKTDLHNCVAY 1042
Cdd:pfam01576  467 ESQLQDTQELLQEETRQKLNLSTRLRQLEDERNSLQEQL---EEEEEAKRNVERQLSTLQaqlsdmkKKLEEDAGTLEAL 543
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 767902634  1043 IQEPRLLKEKVRGLFEKYVQRADMVE-IAGLNTDLQQEY---TRQREHLERNLATLKKK 1097
Cdd:pfam01576  544 EEGKKRLQRELEALTQQLEEKAAAYDkLEKTKNRLQQELddlLVDLDHQRQLVSNLEKK 602
HMMR_N pfam15905
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of ...
768-1007 1.23e-06

Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of eukaryotic hyaluronan-mediated motility receptor proteins. The protein is functionally associated with BRCA1 and thus predicted to be a common, low-penetrance breast cancer candidate.


Pssm-ID: 464932 [Multi-domain]  Cd Length: 329  Bit Score: 52.12  E-value: 1.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   768 LLEYEKYQELQLKSQRMQEEYEKQLRD------NDETKSQALEELTEFYEAKLQ----EKTTLLEEAQEdVRQQLREF-- 835
Cdd:pfam15905   58 SLELKKKSQKNLKESKDQKELEKEIRAlvqergEQDKRLQALEEELEKVEAKLNaavrEKTSLSASVAS-LEKQLLELtr 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   836 -----------EETKKQIE-------------EDEDREIQDIKTKYEKKLRDEKESNLRLKGETGIMRKKFSSLQKEIEE 891
Cdd:pfam15905  137 vnellkakfseDGTQKKMSslsmelmklrnklEAKMKEVMAKQEGMEGKLQVTQKNLEHSKGKVAQLEEKLVSTEKEKIE 216
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   892 RTNDIETLKGEQMKLQGVIKSLEKdiqgLKREIQERDETiqdkekriydLKKKNQELGKFKFVLDYKIKELKKQIEPR-- 969
Cdd:pfam15905  217 EKSETEKLLEYITELSCVSEQVEK----YKLDIAQLEEL----------LKEKNDEIESLKQSLEEKEQELSKQIKDLne 282
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 767902634   970 ---------ENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQK 1007
Cdd:pfam15905  283 kckllesekEELLREYEEKEQTLNAELEELKEKLTLEEQEHQKLQQK 329
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
797-1033 1.24e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 53.00  E-value: 1.24e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  797 ETKSQALEELTEFYeAKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDED-----REIQDIktkyEKKLRDEKESNLRL 871
Cdd:COG4913   613 AALEAELAELEEEL-AEAEERLEALEAELDALQERREALQRLAEYSWDEIDvasaeREIAEL----EAELERLDASSDDL 687
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  872 KGetgimrkkfssLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRI-----YDLKKKNQ 946
Cdd:COG4913   688 AA-----------LEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLArlelrALLEERFA 756
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  947 ELGKFKFVldykiKELKKQIeprENEIRVMKEQIQEMEAELEN-FHKQNTQLELNITELWQKLRATDQ-EMRRERQKERD 1024
Cdd:COG4913   757 AALGDAVE-----RELRENL---EERIDALRARLNRAEEELERaMRAFNREWPAETADLDADLESLPEyLALLDRLEEDG 828

                  ....*....
gi 767902634 1025 LEALVKRFK 1033
Cdd:COG4913   829 LPEYEERFK 837
PRK01156 PRK01156
chromosome segregation protein; Provisional
745-1203 1.24e-06

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 52.98  E-value: 1.24e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  745 IEDLLDKQSRELQDMEccNNQKLLLEYEKYQELQLKS-QRMQEEYEKQLRDNDETKSQ-----ALEELTEFYEAKLQEKT 818
Cdd:PRK01156  185 IDYLEEKLKSSNLELE--NIKKQIADDEKSHSITLKEiERLSIEYNNAMDDYNNLKSAlnelsSLEDMKNRYESEIKTAE 262
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  819 TLLEEAQEDVrQQLREFEETKKQIEEDE---DREIQDIKTKYEKKLRDEKESNLRLKGETG----IMRK---------KF 882
Cdd:PRK01156  263 SDLSMELEKN-NYYKELEERHMKIINDPvykNRNYINDYFKYKNDIENKKQILSNIDAEINkyhaIIKKlsvlqkdynDY 341
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  883 SSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKdiqgLKREIQERDETIQDKEKRIYDLKKKN----QELGKFKFVLDYK 958
Cdd:PRK01156  342 IKKKSRYDDLNNQILELEGYEMDYNSYLKSIES----LKKKIEEYSKNIERMSAFISEILKIQeidpDAIKKELNEINVK 417
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  959 IKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNT----QLELNITELWQKLRATDQEMRRERQKERDLEALVKRFKT 1034
Cdd:PRK01156  418 LQDISSKVSSLNQRIRALRENLDELSRNMEMLNGQSVcpvcGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDE 497
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1035 DLHNCVAyiQEPRLLKEKVRGLFEKYVQRADMVeiAGLNTDLQQEYTRQREHLERNLATLKKKVVKEGELH--RTDYVRI 1112
Cdd:PRK01156  498 KIVDLKK--RKEYLESEEINKSINEYNKIESAR--ADLEDIKIKINELKDKHDKYEEIKNRYKSLKLEDLDskRTSWLNA 573
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1113 MQEnVSLI------KEINELRRELKFTRSQVYDLEAALKLTKKVRPQEVSETEPSRDMLSTaptaRLNEQEETGRIIEMQ 1186
Cdd:PRK01156  574 LAV-ISLIdietnrSRSNEIKKQLNDLESRLQEIEIGFPDDKSYIDKSIREIENEANNLNN----KYNEIQENKILIEKL 648
                         490
                  ....*....|....*..
gi 767902634 1187 RLEIQRLRDQIQEQEQV 1203
Cdd:PRK01156  649 RGKIDNYKKQIAEIDSI 665
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
870-1144 1.30e-06

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 52.21  E-value: 1.30e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  870 RLKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELG 949
Cdd:COG4372    49 QLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLE 128
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  950 KFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFhkQNTQLELNITELWQKLRATDQEMRRERQKERDLEALV 1029
Cdd:COG4372   129 QQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAAL--EQELQALSEAEAEQALDELLKEANRNAEKEEELAEAE 206
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1030 KRFKTDLHNCVAYIQEPRLLKEKVRGLFEKYVQRADMVEIAGLNTDLQQEYTRQREHLERNLATLKKKVVKEGELHRTDY 1109
Cdd:COG4372   207 KLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELE 286
                         250       260       270
                  ....*....|....*....|....*....|....*
gi 767902634 1110 VRIMQENVSLIKEINELRRELKFTRSQVYDLEAAL 1144
Cdd:COG4372   287 ALEEAALELKLLALLLNLAALSLIGALEDALLAAL 321
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
480-519 1.56e-06

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 45.77  E-value: 1.56e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 767902634    480 SLENISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWN 519
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
851-1072 2.10e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 51.30  E-value: 2.10e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  851 QDIKTKYEKKLRDEKESNLRLKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDET 930
Cdd:COG4942    19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAE 98
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  931 IQDKEKRIYDLKKKNQELGK---FKFVLD-------YKIKELKKQIEP-RENEIRVMKEQIQEMEAELENFHKQNTQLEL 999
Cdd:COG4942    99 LEAQKEELAELLRALYRLGRqppLALLLSpedfldaVRRLQYLKYLAPaRREQAEELRADLAELAALRAELEAERAELEA 178
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767902634 1000 NITELWQKLRATDQEMRRERQKERDLEALVKRFKTDLhncVAYIQEPRLLKEKVRGLFEKYVQRADMVEIAGL 1072
Cdd:COG4942   179 LLAELEEERAALEALKAERQKLLARLEKELAELAAEL---AELQQEAEELEALIARLEAEAAAAAERTPAAGF 248
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
785-1131 2.80e-06

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 51.97  E-value: 2.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   785 QEEYEKQLRDNDETKSQALEeLTEFYEAKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEdreiqdiktKYEKKLRDE 864
Cdd:TIGR00606  711 LKSTESELKKKEKRRDEMLG-LAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQE---------TLLGTIMPE 780
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   865 KESNLRLKGETGIMRKkfssLQKEIEERTNDIETLKGeqmKLQGVikSLEKDIQGLKREIQERDET-------------- 930
Cdd:TIGR00606  781 EESAKVCLTDVTIMER----FQMELKDVERKIAQQAA---KLQGS--DLDRTVQQVNQEKQEKQHEldtvvskielnrkl 851
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   931 IQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRA 1010
Cdd:TIGR00606  852 IQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISS 931
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1011 TDQEMRRERQKERDLEALVKRFKTDLHNCVAYIQEPRllkekvrglfEKYVQRADmVEIAGLNTDLqQEYTRQREHLERN 1090
Cdd:TIGR00606  932 KETSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQDGK----------DDYLKQKE-TELNTVNAQL-EECEKHQEKINED 999
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 767902634  1091 LATLKKKVVKEGELHrtdyvRIMQENVSLIK---EINELRRELK 1131
Cdd:TIGR00606 1000 MRLMRQDIDTQKIQE-----RWLQDNLTLRKrenELKEVEEELK 1038
PRK12704 PRK12704
phosphodiesterase; Provisional
829-989 3.03e-06

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 51.32  E-value: 3.03e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  829 RQQLREFEETKKQIEEDEDREIQDIKtkyEKKLRDEKESNLRLKGEtgimrkkfssLQKEIEERtndietlkgeqmklqg 908
Cdd:PRK12704   30 EAKIKEAEEEAKRILEEAKKEAEAIK---KEALLEAKEEIHKLRNE----------FEKELRER---------------- 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  909 vikslEKDIQGLKREIQERDETIqdkEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEpreneirVMKEQIQEMEAELE 988
Cdd:PRK12704   81 -----RNELQKLEKRLLQKEENL---DRKLELLEKREEELEKKEKELEQKQQELEKKEE-------ELEELIEEQLQELE 145

                  .
gi 767902634  989 N 989
Cdd:PRK12704  146 R 146
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
821-985 3.14e-06

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 51.40  E-value: 3.14e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  821 LEEAQEDVRQQLREFEETKKQIEEDEDREIQdikTKYEKKLRdekesnlrlkgetgimrkkfsslqkeieertndietlk 900
Cdd:COG2433   378 IEEALEELIEKELPEEEPEAEREKEHEEREL---TEEEEEIR-------------------------------------- 416
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  901 geqmKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKK-NQELGKFKFV--LDYKIKELKKQIEPRENEIRVMK 977
Cdd:COG2433   417 ----RLEEQVERLEAEVEELEAELEEKDERIERLERELSEARSEeRREIRKDREIsrLDREIERLERELEEERERIEELK 492

                  ....*...
gi 767902634  978 EQIQEMEA 985
Cdd:COG2433   493 RKLERLKE 500
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
776-1054 3.32e-06

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 51.05  E-value: 3.32e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   776 ELQLKSQRMQEEYeKQLRDNDETKSQALEELTEFYEAKLQEKTTLLEEaQEDVRQQLREFEEtkkqieededreiqDIKT 855
Cdd:pfam07888   77 ELESRVAELKEEL-RQSREKHEELEEKYKELSASSEELSEEKDALLAQ-RAAHEARIRELEE--------------DIKT 140
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   856 KYEKKLrdEKESNL-RLKGETgimrKKFSSLQKEIEErtnDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDK 934
Cdd:pfam07888  141 LTQRVL--ERETELeRMKERA----KKAGAQRKEEEA---ERKQLQAKLQQTEEELRSLSKEFQELRNSLAQRDTQVLQL 211
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   935 EKRIYDLKKKNQELGKFKFVLDYKIKELK----------KQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITEL 1004
Cdd:pfam07888  212 QDTITTLTQKLTTAHRKEAENEALLEELRslqerlnaseRKVEGLGEELSSMAAQRDRTQAELHQARLQAAQLTLQLADA 291
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1005 ----------WQKLRATdqeMRRERQKERDleaLVKRFKTDLHNCVAYIQEPRLLKEKVR 1054
Cdd:pfam07888  292 slalregrarWAQERET---LQQSAEADKD---RIEKLSAELQRLEERLQEERMEREKLE 345
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
879-1027 3.74e-06

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 50.60  E-value: 3.74e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  879 RKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIqdkEKRIYDLKKKNQELGKFKFVL--- 955
Cdd:COG3883    36 QAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREEL---GERARALYRSGGSVSYLDVLLgse 112
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  956 ---DY-------------------KIKELKKQIEPRENEIRVMKEQIQEMEAELEnfhKQNTQLELNITELWQKLRATDQ 1013
Cdd:COG3883   113 sfsDFldrlsalskiadadadlleELKADKAELEAKKAELEAKLAELEALKAELE---AAKAELEAQQAEQEALLAQLSA 189
                         170
                  ....*....|....
gi 767902634 1014 EMRRERQKERDLEA 1027
Cdd:COG3883   190 EEAAAEAQLAELEA 203
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
757-1020 4.05e-06

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 51.28  E-value: 4.05e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   757 QDMECCNNQKLLLEYEKYQELQ-LKSQRMQEEYEKQLRDNDetKSQALEELTEFYEAKLQEKTTLLEEAQEDVRQQLREF 835
Cdd:pfam17380  273 QLLHIVQHQKAVSERQQQEKFEkMEQERLRQEKEEKAREVE--RRRKLEEAEKARQAEMDRQAAIYAEQERMAMEREREL 350
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   836 EETKkqiEEDEDREIQDIKTK-----------YEKKLRDEKESNLRLKGETGIMRKKF---SSLQKEIEERTNDIETLKG 901
Cdd:pfam17380  351 ERIR---QEERKRELERIRQEeiameisrmreLERLQMERQQKNERVRQELEAARKVKileEERQRKIQQQKVEMEQIRA 427
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   902 EQ-----MKLQGVIKSLEKDIQGLKREIQERDETI----QDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENE 972
Cdd:pfam17380  428 EQeearqREVRRLEEERAREMERVRLEEQERQQQVerlrQQEEERKRKKLELEKEKRDRKRAEEQRRKILEKELEERKQA 507
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 767902634   973 IRVMKEQIQEMEAELENfhKQNTQLELNITELWQKLRATDQEMRRERQ 1020
Cdd:pfam17380  508 MIEEERKRKLLEKEMEE--RQKAIYEEERRREAEEERRKQQEMEERRR 553
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
811-1054 4.90e-06

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 49.91  E-value: 4.90e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  811 EAKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKYEK--KLRDEKESNLRLKGEtgiMRKKFSSLQKE 888
Cdd:COG1340    10 LEELEEKIEELREEIEELKEKRDELNEELKELAEKRDELNAQVKELREEaqELREKRDELNEKVKE---LKEERDELNEK 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  889 IEERTNDIETLKGEQMKLQGV---IKSLEKDIQGLKREIQERDETIqDKEKRIYD-LKKKNQELGKFKFVLDY--KIKEL 962
Cdd:COG1340    87 LNELREELDELRKELAELNKAggsIDKLRKEIERLEWRQQTEVLSP-EEEKELVEkIKELEKELEKAKKALEKneKLKEL 165
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  963 KKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQKERDLEALVKRFKTDLHNcvaY 1042
Cdd:COG1340   166 RAELKELRKEAEEIHKKIKELAEEAQELHEEMIELYKEADELRKEADELHKEIVEAQEKADELHEEIIELQKELRE---L 242
                         250
                  ....*....|..
gi 767902634 1043 IQEPRLLKEKVR 1054
Cdd:COG1340   243 RKELKKLRKKQR 254
PTZ00121 PTZ00121
MAEBL; Provisional
657-1202 9.43e-06

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 50.14  E-value: 9.43e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  657 EREVGFAEEVLVTKTDMEEKAQVMLELKTRVEELKMENEYQlRLKDMNYSEKIKELTDKFIQEM----------ESLKTK 726
Cdd:PTZ00121 1094 EEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEAR-KAEDARKAEEARKAEDAKRVEIarkaedarkaEEARKA 1172
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  727 NQVLRTEKEKQDVYHHEHIEDLLDKQSRELQDMECCNNQKLLLEYEKYQELQL-----KSQRMQEEYEKQLRDNDETKSQ 801
Cdd:PTZ00121 1173 EDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKaeavkKAEEAKKDAEEAKKAEEERNNE 1252
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  802 ALEELTEFYEAKLQEKTTLLEEAQEDVRQQLREFEETKKQIE---EDEDREIQDIKTKYEKKlrdEKESNLRLKGETGim 878
Cdd:PTZ00121 1253 EIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEakkAEEKKKADEAKKKAEEA---KKADEAKKKAEEA-- 1327
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  879 RKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQErdetiqdKEKRIYDLKKKNQELgkfkfvldYK 958
Cdd:PTZ00121 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE-------AKKKADAAKKKAEEK--------KK 1392
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  959 IKELKKQIEprenEIRVMKEQIQEMEAELENFHKQNTQLElnitelwQKLRAtdQEMRRERQKERDLEALVKRF--KTDL 1036
Cdd:PTZ00121 1393 ADEAKKKAE----EDKKKADELKKAAAAKKKADEAKKKAE-------EKKKA--DEAKKKAEEAKKADEAKKKAeeAKKA 1459
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1037 HNCVAYIQEPRLLKEKVRGLFEKyvQRADMVEIAGLNTDLQQEYTRQREHLERNLATLKKKvvkeGELHRTDYVRIMQEN 1116
Cdd:PTZ00121 1460 EEAKKKAEEAKKADEAKKKAEEA--KKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKA----EEAKKADEAKKAEEA 1533
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1117 vsliKEINELRR-ELKFTRSQVYDLEAALKLTKKVRPQEVSETEPSRDM-LSTAPTARLNEQEETGRIIEMQRLEIQRLR 1194
Cdd:PTZ00121 1534 ----KKADEAKKaEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMaLRKAEEAKKAEEARIEEVMKLYEEEKKMKA 1609

                  ....*...
gi 767902634 1195 DQIQEQEQ 1202
Cdd:PTZ00121 1610 EEAKKAEE 1617
WD40 COG2319
WD40 repeat [General function prediction only];
41-399 1.05e-05

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 49.52  E-value: 1.05e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   41 ALSISPNRRYLAiseTVQEKPAITIYELSSIPCRKrkVLNNFDFQVQkfiSMAFSPDSKYLLAqTSppESNLVYwLWEKQ 120
Cdd:COG2319   125 SVAFSPDGKTLA---SGSADGTVRLWDLATGKLLR--TLTGHSGAVT---SVAFSPDGKLLAS-GS--DDGTVR-LWDLA 192
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  121 KVMAIVRIDTQNNPVYQVSFSPqDNTQVcVTG--NGMFKLLRFAEGTLKQTSfqRGEPQNYLAHTWVADDK-IVVGTDTG 197
Cdd:COG2319   193 TGKLLRTLTGHTGAVRSVAFSP-DGKLL-ASGsaDGTVRLWDLATGKLLRTL--TGHSGSVRSVAFSPDGRlLASGSADG 268
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  198 KLFLfesgdqrWETsimvkeptNGSKSLDVIQESESliefpPVSSplpsyeqmVAASSHSQMsmpqvfaiaayskgFACS 277
Cdd:COG2319   269 TVRL-------WDL--------ATGELLRTLTGHSG-----GVNS--------VAFSPDGKL--------------LASG 306
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  278 AGPGRVLLFEkmeekdfYRESREIRIPVDPQSndpsqsdkqDVLCLCFSPSEETLVASTSKNqlysiTMSLTEISKGEPA 357
Cdd:COG2319   307 SDDGTVRLWD-------LATGKLLRTLTGHTG---------AVRSVAFSPDGKTLASGSDDG-----TVRLWDLATGELL 365
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|..
gi 767902634  358 HFeylmYPLHSAPITGLATCIRKPLIATCSLDRSIRLWNYET 399
Cdd:COG2319   366 RT----LTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
MPS2 pfam17060
Monopolar spindle protein 2; Is a fungal transmembrane protein which is part of the component ...
861-1001 1.14e-05

Monopolar spindle protein 2; Is a fungal transmembrane protein which is part of the component of the spindle pole body (SPB) required for the insertion of the nascent SPB into the nuclear envelope and for the proper execution of spindle pole body (SPB) duplication. It seems that Mps2-Spc24 interaction may contribute to the localization of Spc24 and other kinetochore components to the inner plaque of the SPB.


Pssm-ID: 407228 [Multi-domain]  Cd Length: 340  Bit Score: 48.82  E-value: 1.14e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   861 LRDEKESNLRLKGEtgIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGV------IKSLEKDIQGLKREIQERDETIQDK 934
Cdd:pfam17060   96 IPASFISALELKED--VKSSPRSEADSLGTPIKVDLLRNLKPQESPETPrrinrkYKSLELRVESMKDELEFKDETIMEK 173
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   935 E-------KRIYDLKKKNQELGK-FKFVLDYK-----------------IKELKKQIEPRENEIRVMKEQIQEMEAELEN 989
Cdd:pfam17060  174 DrelteltSTISKLKDKYDFLSReFEFYKQHHehggnnsiktatkhefiISELKRKLQEQNRLIRILQEQIQFDPGALHD 253
                          170
                   ....*....|..
gi 767902634   990 FHKQNTQLELNI 1001
Cdd:pfam17060  254 NGPKNLVLNGAI 265
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
743-1031 1.22e-05

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 49.97  E-value: 1.22e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   743 EHIEDLLDKQSRElQDMECCNNQKLLLeyekyqelqlKSQRMQEEYEKQLRDNDETKSQALEELtefyeAKLQEKTTLLE 822
Cdd:TIGR00618  594 VRLQDLTEKLSEA-EDMLACEQHALLR----------KLQPEQDLQDVRLHLQQCSQELALKLT-----ALHALQLTLTQ 657
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   823 EAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKY----------EKKLRDEKESNLRLKGETGIMRKKFSSLQKEIEER 892
Cdd:TIGR00618  658 ERVREHALSIRVLPKELLASRQLALQKMQSEKEQLtywkemlaqcQTLLRELETHIEEYDREFNEIENASSSLGSDLAAR 737
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   893 TNDIETLKGEQMKLQG-VIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPREN 971
Cdd:TIGR00618  738 EDALNQSLKELMHQARtVLKARTEAHFNNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDED 817
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   972 EIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQKERDLEALVKR 1031
Cdd:TIGR00618  818 ILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAKIIQLSDK 877
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
670-1091 1.94e-05

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 49.05  E-value: 1.94e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   670 KTDMEEKAQVMLELKTRVEELKMENEyqlrlkdmNYSEKIKELTdkfiqemESLKTKNQ---VLRTEKEKQDVYHHEHiE 746
Cdd:pfam10174  295 KQELSKKESELLALQTKLETLTNQNS--------DCKQHIEVLK-------ESLTAKEQraaILQTEVDALRLRLEEK-E 358
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   747 DLLDKQSRELQDMeccNNQKLLLEYE-------------KYQELQLKSQRMQEeyekQLRDNDetksQALEELTEFYEAk 813
Cdd:pfam10174  359 SFLNKKTKQLQDL---TEEKSTLAGEirdlkdmldvkerKINVLQKKIENLQE----QLRDKD----KQLAGLKERVKS- 426
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   814 LQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDREiqdiktkyEKKLRDEKESnlrLKGETGIMRKKFSSLQKEIEERT 893
Cdd:pfam10174  427 LQTDSSNTDTALTTLEEALSEKERIIERLKEQRERE--------DRERLEELES---LKKENKDLKEKVSALQPELTEKE 495
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   894 NDIETLKGEQ-------MKLQGVIKSLEKDIQgLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQI 966
Cdd:pfam10174  496 SSLIDLKEHAsslassgLKKDSKLKSLEIAVE-QKKEECSKLENQLKKAHNAEEAVRTNPEINDRIRLLEQEVARYKEES 574
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   967 EPRENEIRVMKEQIQEMEAELENFHKQNTQLELNI-------TELWQKLRATDQEMRRERQKERDleaLVKRFKTDLHNC 1039
Cdd:pfam10174  575 GKAQAEVERLLGILREVENEKNDKDKKIAELESLTlrqmkeqNKKVANIKHGQQEMKKKGAQLLE---EARRREDNLADN 651
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 767902634  1040 VAYIQEPRLLK--EKVRGLFEKYVQRADMVEIA-----GLNTDLQQEytrQREHLERNL 1091
Cdd:pfam10174  652 SQQLQLEELMGalEKTRQELDATKARLSSTQQSlaekdGHLTNLRAE---RRKQLEEIL 707
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
707-1078 2.13e-05

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 48.68  E-value: 2.13e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  707 EKIKELTDKFIQEMESLKTKNQVLRTEkekqdvyhhehIEDLLDKQsRELQdmeccnnQKLLLEYEKYQELqlksqrmQE 786
Cdd:PRK04778  115 DLIEEDIEQILEELQELLESEEKNREE-----------VEQLKDLY-RELR-------KSLLANRFSFGPA-------LD 168
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  787 EYEKQLrDNDETKSQALEELTE---FYEA-----KLQEKTTLLEEAQEDVRQQLREFEETKKQieededrEIQDIKTKYe 858
Cdd:PRK04778  169 ELEKQL-ENLEEEFSQFVELTEsgdYVEAreildQLEEELAALEQIMEEIPELLKELQTELPD-------QLQELKAGY- 239
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  859 kklRDEKESNLRLKgETGIMrKKFSSLQKEIEERTNDIETLKGEQMKLQgvIKSLEKDIQGL----------KREIQERD 928
Cdd:PRK04778  240 ---RELVEEGYHLD-HLDIE-KEIQDLKEQIDENLALLEELDLDEAEEK--NEEIQERIDQLydilerevkaRKYVEKNS 312
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  929 ETIQDKEKRiydLKKKNQELG------KFKFVLDYKIKELKKQIEPRENEIRVMKEQIQ-----------EMEAELENFH 991
Cdd:PRK04778  313 DTLPDFLEH---AKEQNKELKeeidrvKQSYTLNESELESVRQLEKQLESLEKQYDEITeriaeqeiaysELQEELEEIL 389
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  992 KQNTQLELNITELWQKLRATdqemrreRQKERDLEALVKRFKTDLHNCVAYIQeprllKEKVRGLFEKYVQRADMV--EI 1069
Cdd:PRK04778  390 KQLEEIEKEQEKLSEMLQGL-------RKDELEAREKLERYRNKLHEIKRYLE-----KSNLPGLPEDYLEMFFEVsdEI 457

                  ....*....
gi 767902634 1070 AGLNTDLQQ 1078
Cdd:PRK04778  458 EALAEELEE 466
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
742-1199 2.22e-05

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 48.81  E-value: 2.22e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   742 HEHIEDLLDKQSRELQDMECCNNQKLLLEYEKYQELQ------LKSQRMQEEYEKQLRDNDETKSQA--LEELTEFYEAK 813
Cdd:TIGR00618  189 KKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEkelkhlREALQQTQQSHAYLTQKREAQEEQlkKQQLLKQLRAR 268
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   814 LQEKTTL---LEEAQEDV---RQQLREFEETKK--QIEEDEDREIQDIKTKYEK--KLRDEKESNLRLKGETGIMRKKFS 883
Cdd:TIGR00618  269 IEELRAQeavLEETQERInraRKAAPLAAHIKAvtQIEQQAQRIHTELQSKMRSraKLLMKRAAHVKQQSSIEEQRRLLQ 348
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   884 SLQKEIE--ERTNDIETLKGEQMKLQgviKSLEKDIqglkREIQERDETIQDKEKRIYDLKKKNQelgkfkfvldykikE 961
Cdd:TIGR00618  349 TLHSQEIhiRDAHEVATSIREISCQQ---HTLTQHI----HTLQQQKTTLTQKLQSLCKELDILQ--------------R 407
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   962 LKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQKERDLEALVKRFKT------- 1034
Cdd:TIGR00618  408 EQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQihlqetr 487
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1035 ----DLHNCVAYIQEPRLLKEKVRglfeKYVQRADMVEIAGLNTDLQQEYTRQREHLERNLATLKKKVVKEGElHRTDY- 1109
Cdd:TIGR00618  488 kkavVLARLLELQEEPCPLCGSCI----HPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERK-QRASLk 562
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1110 ---VRIMQENVSLIKEINELRRELKFTRSQVYDL----EAALKLTKKVRPQ---EVSETEPSRDMLSTAPTARLNEQEET 1179
Cdd:TIGR00618  563 eqmQEIQQSFSILTQCDNRSKEDIPNLQNITVRLqdltEKLSEAEDMLACEqhaLLRKLQPEQDLQDVRLHLQQCSQELA 642
                          490       500
                   ....*....|....*....|
gi 767902634  1180 GRIIEMQRLEIQRLRDQIQE 1199
Cdd:TIGR00618  643 LKLTALHALQLTLTQERVRE 662
WD40 pfam00400
WD domain, G-beta repeat;
482-519 2.30e-05

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 42.33  E-value: 2.30e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 767902634   482 ENISSLKGHTGKIRSIVWNADDSKLISGGTDGAVYEWN 519
Cdd:pfam00400    2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
46 PHA02562
endonuclease subunit; Provisional
688-928 2.33e-05

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 48.47  E-value: 2.33e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  688 EELKMENEYQLRLKDMNySEKIKELTDKF---IQEMESLKT-----KNQVLRTEKEKQDVyhHEHIEDLLDKQSRELQDM 759
Cdd:PHA02562  195 QQIKTYNKNIEEQRKKN-GENIARKQNKYdelVEEAKTIKAeieelTDELLNLVMDIEDP--SAALNKLNTAAAKIKSKI 271
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  760 ECCnnQKLLLEYEKYQELQLKSQrmqeeyekQLRDNDETKSQALEELTEfyeakLQEKTTLLEEAQEDVRQQLREFEETK 839
Cdd:PHA02562  272 EQF--QKVIKMYEKGGVCPTCTQ--------QISEGPDRITKIKDKLKE-----LQHSLEKLDTAIDELEEIMDEFNEQS 336
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  840 KqieededrEIQDIKTKYEKKLRDekesnlrLKGEtgimRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQG 919
Cdd:PHA02562  337 K--------KLLELKNKISTNKQS-------LITL----VDKAKKVKAAIEELQAEFVDNAEELAKLQDELDKIVKTKSE 397

                  ....*....
gi 767902634  920 LKREIQERD 928
Cdd:PHA02562  398 LVKEKYHRG 406
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
965-1202 2.54e-05

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 48.47  E-value: 2.54e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  965 QIEPRENEIRVmkEQIQEMEAELENFhkqNTQLEL-----NITELWQKLRATDQEMRRERQKERDLEALVKRFK------ 1033
Cdd:COG3206    60 LVEPQSSDVLL--SGLSSLSASDSPL---ETQIEIlksrpVLERVVDKLNLDEDPLGEEASREAAIERLRKNLTvepvkg 134
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1034 TDLHNcVAYI-QEPRLLKEKVRGLFEKYVQRadmveiaglNTDLQQEYTRQ-REHLERNLATLKKKVVK-EGELH----R 1106
Cdd:COG3206   135 SNVIE-ISYTsPDPELAAAVANALAEAYLEQ---------NLELRREEARKaLEFLEEQLPELRKELEEaEAALEefrqK 204
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1107 TDYVRIMQENVSLIKEINELRRELKFTRSQVYDLEAALK-LTKKVRPQEVSETEPSRDMLSTAPTARLNEQEetGRIIEM 1185
Cdd:COG3206   205 NGLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAaLRAQLGSGPDALPELLQSPVIQQLRAQLAELE--AELAEL 282
                         250       260
                  ....*....|....*....|...
gi 767902634 1186 QRL------EIQRLRDQIQEQEQ 1202
Cdd:COG3206   283 SARytpnhpDVIALRAQIAALRA 305
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
785-1011 3.02e-05

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 47.90  E-value: 3.02e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  785 QEEYEKQLRDNDETKSQALEELtefyeAKLQEKttlLEEAQEDVRQQLREFEETKKQIEEDEdREIQDIKTKYEKKlRDE 864
Cdd:COG3883    18 IQAKQKELSELQAELEAAQAEL-----DALQAE---LEELNEEYNELQAELEALQAEIDKLQ-AEIAEAEAEIEER-REE 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  865 KESNLRLKGETGIMRKKFSSL--QKEIEERTNDIETLKgeqmKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLK 942
Cdd:COG3883    88 LGERARALYRSGGSVSYLDVLlgSESFSDFLDRLSALS----KIADADADLLEELKADKAELEAKKAELEAKLAELEALK 163
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 767902634  943 KKNQElgkfkfvldyKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRAT 1011
Cdd:COG3883   164 AELEA----------AKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAA 222
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
652-952 3.07e-05

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 48.20  E-value: 3.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   652 RGIKREREVGFAEEVlvTKTDMEEKAQVMLELktrvEELKMENEYQLrlkdmnysEKIKEltdkfiqemESLKTKNQVLR 731
Cdd:pfam17380  310 REVERRRKLEEAEKA--RQAEMDRQAAIYAEQ----ERMAMEREREL--------ERIRQ---------EERKRELERIR 366
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   732 TEKEKQDVYHHEHIEDLL----DKQSRELQDMECCNNQKLLLEyekyqELQLKSQRMQEEYEKQLRDNDETKSQALEELT 807
Cdd:pfam17380  367 QEEIAMEISRMRELERLQmerqQKNERVRQELEAARKVKILEE-----ERQRKIQQQKVEMEQIRAEQEEARQREVRRLE 441
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   808 EFYEAKLQEKTTLLEEAQEDVRQQLREFEETKKQ--IEEDEDREIQDIKTKYEKKLRDEKESNLRLKGETGIMRKkfsSL 885
Cdd:pfam17380  442 EERAREMERVRLEEQERQQQVERLRQQEEERKRKklELEKEKRDRKRAEEQRRKILEKELEERKQAMIEEERKRK---LL 518
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767902634   886 QKEIEERTNDIetLKGEQMKLQGVIKSLEKDIQGlKREIQERDETIQDKEKRIYDLKKKNQELGKFK 952
Cdd:pfam17380  519 EKEMEERQKAI--YEEERRREAEEERRKQQEMEE-RRRIQEQMRKATEERSRLEAMEREREMMRQIV 582
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
879-1034 3.15e-05

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 48.32  E-value: 3.15e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  879 RKKFSSLQKEIEERTnDIETLKGEQMK---LQGVIKSLEKDIQGLK-----REIQERDETIQDKEKRIYDLKKKNQELGK 950
Cdd:COG2433   349 KNKFERVEKKVPPDV-DRDEVKARVIRglsIEEALEELIEKELPEEepeaeREKEHEERELTEEEEEIRRLEEQVERLEA 427
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  951 FKFVLDYKIKELKKQIEPRENEIRVMKEQIQE---MEAELENFHKQNTQLElnitelwqklratdQEMRRERQKERDLEA 1027
Cdd:COG2433   428 EVEELEAELEEKDERIERLERELSEARSEERReirKDREISRLDREIERLE--------------RELEEERERIEELKR 493

                  ....*..
gi 767902634 1028 LVKRFKT 1034
Cdd:COG2433   494 KLERLKE 500
EzrA pfam06160
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ...
812-1038 4.13e-05

Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.


Pssm-ID: 428797 [Multi-domain]  Cd Length: 542  Bit Score: 47.54  E-value: 4.13e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   812 AKLQEKTTLLEEAQEDVRQQLREFEETKKQiEEDEDREIQDIKTKYEKkLRDEKESNlrlkgetgimRKKFSSLQKEIEE 891
Cdd:pfam06160   86 KALDEIEELLDDIEEDIKQILEELDELLES-EEKNREEVEELKDKYRE-LRKTLLAN----------RFSYGPAIDELEK 153
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   892 RTNDIE---------TLKGEQMKLQGVIKSLEKDIQGLKR---EIQERDETIQDK-EKRIYDLKKKNQELGKFKFVLDY- 957
Cdd:pfam06160  154 QLAEIEeefsqfeelTESGDYLEAREVLEKLEEETDALEElmeDIPPLYEELKTElPDQLEELKEGYREMEEEGYALEHl 233
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   958 ----KIKELKKQIEPR------------ENEIRVMKEQIQE----MEAELENFH-----------------KQNTQLELN 1000
Cdd:pfam06160  234 nvdkEIQQLEEQLEENlallenleldeaEEALEEIEERIDQlydlLEKEVDAKKyveknlpeiedylehaeEQNKELKEE 313
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 767902634  1001 ITELWQKLRATDQEMRRERQKERDLEALVKRFKTDLHN 1038
Cdd:pfam06160  314 LERVQQSYTLNENELERVRGLEKQLEELEKRYDEIVER 351
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
682-1205 4.18e-05

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 48.12  E-value: 4.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   682 ELKTRVEELKmeneyQLRLKDMNYSEKIKELtDKFIQEMESLKTKNQVLR------TEKEKQDVYHHEHIE--DLLDKQS 753
Cdd:TIGR00606  249 PLKNRLKEIE-----HNLSKIMKLDNEIKAL-KSRKKQMEKDNSELELKMekvfqgTDEQLNDLYHNHQRTvrEKERELV 322
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   754 RELQDMECCNNQKLLLEYEKYQ------ELQLKSQRMQEEYEKqlRDNDETKSQALEELTEFyeaklqekttlleEAQED 827
Cdd:TIGR00606  323 DCQRELEKLNKERRLLNQEKTEllveqgRLQLQADRHQEHIRA--RDSLIQSLATRLELDGF-------------ERGPF 387
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   828 VRQQLREFEETKKQIEEDEDREIQDIKTKYEKKLRDEKES----NLRLKGETGIMRKKFSSLQKEIEERTNDIETLKGEQ 903
Cdd:TIGR00606  388 SERQIKNFHTLVIERQEDEAKTAAQLCADLQSKERLKQEQadeiRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLE 467
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   904 MKLQGVIKsLEKDIQGLKREIQERDE--TIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQ 981
Cdd:TIGR00606  468 GSSDRILE-LDQELRKAERELSKAEKnsLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMD 546
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   982 EMEAELENFHKQNTQLE------LNITELWQKLRATDQEMRRERQKERDLEALVKRFKTDLHNcvaYIQEPRLLKEKVRG 1055
Cdd:TIGR00606  547 KDEQIRKIKSRHSDELTsllgyfPNKKQLEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNH---INNELESKEEQLSS 623
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1056 LFEKyvqradMVEIAGlNTDLQQEYTRQREHLER---NLATLKKKV------VKEGELHRTDYVRIMQENVSLIKEINEL 1126
Cdd:TIGR00606  624 YEDK------LFDVCG-SQDEESDLERLKEEIEKsskQRAMLAGATavysqfITQLTDENQSCCPVCQRVFQTEAELQEF 696
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1127 RRELK-FTRSQVYDLEAALKLTKKV--RPQEVSETEPSRDMLSTAPTARLNEQEETGRIIEMqrlEIQRLRDQIQEQEQV 1203
Cdd:TIGR00606  697 ISDLQsKLRLAPDKLKSTESELKKKekRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNR---DIQRLKNDIEEQETL 773

                   ..
gi 767902634  1204 TG 1205
Cdd:TIGR00606  774 LG 775
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
837-1010 4.95e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 47.71  E-value: 4.95e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   837 ETKKQIEEDEDREIQ-DIKTKyEKKLRD-EKESNLRlkgetgimRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLE 914
Cdd:TIGR04523   32 DTEEKQLEKKLKTIKnELKNK-EKELKNlDKNLNKD--------EEKINNSNNKIKILEQQIKDLNDKLKKNKDKINKLN 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   915 KDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQN 994
Cdd:TIGR04523  103 SDLSKINSEIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEK 182
                          170
                   ....*....|....*.
gi 767902634   995 TQLELNITELWQKLRA 1010
Cdd:TIGR04523  183 LNIQKNIDKIKNKLLK 198
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
828-1204 5.48e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 47.71  E-value: 5.48e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   828 VRQQLREFEETKKQIEEDEDrEIQDIKTKYEKKLRDEKESNLRLKgetgIMRKKFSSLQKEIEERTNDIETLKGEQMKLQ 907
Cdd:TIGR04523   35 EKQLEKKLKTIKNELKNKEK-ELKNLDKNLNKDEEKINNSNNKIK----ILEQQIKDLNDKLKKNKDKINKLNSDLSKIN 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   908 GVIKS-------LEKDIQGLKREIQERDETIQD-------KEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEI 973
Cdd:TIGR04523  110 SEIKNdkeqknkLEVELNKLEKQKKENKKNIDKflteikkKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNI 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   974 RVMKEQI-----------------QEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQKERDLEALVKRFKTDL 1036
Cdd:TIGR04523  190 DKIKNKLlklelllsnlkkkiqknKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQL 269
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1037 HNCVAYI-QEPRLLKEKvrglfEKYVQRADMvEIAGLNTDLQQEYTRQrehLERNLATLKKKVvkegELHRTDYVRIMQE 1115
Cdd:TIGR04523  270 SEKQKELeQNNKKIKEL-----EKQLNQLKS-EISDLNNQKEQDWNKE---LKSELKNQEKKL----EEIQNQISQNNKI 336
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1116 NVSLIKEINELRRELKFTRSQvyDLEAALKLTKKVRPQEVSEtepsrdmlstaptarlNEQEETGRIIEMQRLEIQRLRD 1195
Cdd:TIGR04523  337 ISQLNEQISQLKKELTNSESE--NSEKQRELEEKQNEIEKLK----------------KENQSYKQEIKNLESQINDLES 398

                   ....*....
gi 767902634  1196 QIQEQEQVT 1204
Cdd:TIGR04523  399 KIQNQEKLN 407
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
816-1101 5.50e-05

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 47.90  E-value: 5.50e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  816 EKTTLLEEAQEDVRQQLREFEETKKQIEE---DEDREIQDIKTKYEKKLRDEKESNLRLKGETGIM-------------- 878
Cdd:PTZ00440  404 KYTNIISLSEHTLKAAEDVLKENSQKIADyalYSNLEIIEIKKKYDEKINELKKSINQLKTLISIMksfydliisekdsm 483
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  879 ------RKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLE---KDIQGLKREIQERDETIQDKEKRIYDLKKKNQELG 949
Cdd:PTZ00440  484 dskekkESSDSNYQEKVDELLQIINSIKEKNNIVNNNFKNIEdyyITIEGLKNEIEGLIELIKYYLQSIETLIKDEKLKR 563
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  950 KFKFVLDYKIKELKKQIEpRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQKeRDLEALV 1029
Cdd:PTZ00440  564 SMKNDIKNKIKYIEENVD-HIKDIISLNDEIDNIIQQIEELINEALFNKEKFINEKNDLQEKVKYILNKFYK-GDLQELL 641
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767902634 1030 KRFKTDLHNCVAYIQEPRlLKEKVRGLFEKYVQRADmvEIAGLNTDLQQEYTRQREHLERNLATLKKKVVKE 1101
Cdd:PTZ00440  642 DELSHFLDDHKYLYHEAK-SKEDLQTLLNTSKNEYE--KLEFMKSDNIDNIIKNLKKELQNLLSLKENIIKK 710
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
856-1140 5.77e-05

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 47.53  E-value: 5.77e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   856 KYEKKLRDEKESNLRLKGETGIMRKKFSSLQKEIeerTNDIETLKGEQMKLQgvikSLEKDIQGLKREIQE-RDETIQDK 934
Cdd:pfam12128  608 KAEEALQSAREKQAAAEEQLVQANGELEKASREE---TFARTALKNARLDLR----RLFDEKQSEKDKKNKaLAERKDSA 680
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   935 EKRIYDLKKKnqelgkfKFVLDYKIKELKKQIEPRENEIRV-MKEQIQEMEAELENFHKQ-NTQLELNITELWQKLRATD 1012
Cdd:pfam12128  681 NERLNSLEAQ-------LKQLDKKHQAWLEEQKEQKREARTeKQAYWQVVEGALDAQLALlKAAIAARRSGAKAELKALE 753
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1013 QEMRRERQKERDLEALVKRFKTDLHNCVAYIQEPRLLKEKVRGLF----EKYVQRADMVEIAGLNT-----DLQQEYTRQ 1083
Cdd:pfam12128  754 TWYKRDLASLGVDPDVIAKLKREIRTLERKIERIAVRRQEVLRYFdwyqETWLQRRPRLATQLSNIeraisELQQQLARL 833
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767902634  1084 -------REHLERNLATLKKKVVKEGELHRTdyVRIMQENVSLIKE---INELRRELKFTRSQVYDL 1140
Cdd:pfam12128  834 iadtklrRAKLEMERKASEKQQVRLSENLRG--LRCEMSKLATLKEdanSEQAQGSIGERLAQLEDL 898
46 PHA02562
endonuclease subunit; Provisional
743-939 6.45e-05

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 47.32  E-value: 6.45e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  743 EHIEDLLDKQS------------REL-QDMECCNNQKLLLE-----YEKYQELQLKS-----QRMQEEYEKQL---RDND 796
Cdd:PHA02562  154 KLVEDLLDISVlsemdklnkdkiRELnQQIQTLDMKIDHIQqqiktYNKNIEEQRKKngeniARKQNKYDELVeeaKTIK 233
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  797 ETKSQALEELTEF------YEAKLQEKTTLLEEAQEDVRQQLRE---FEE-----TKKQIEEDEDREIQDIKTK---YEK 859
Cdd:PHA02562  234 AEIEELTDELLNLvmdiedPSAALNKLNTAAAKIKSKIEQFQKVikmYEKggvcpTCTQQISEGPDRITKIKDKlkeLQH 313
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  860 KLRDEKESNLRLKG---ETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKD-------IQGLKREIQERDE 929
Cdd:PHA02562  314 SLEKLDTAIDELEEimdEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEfvdnaeeLAKLQDELDKIVK 393
                         250
                  ....*....|
gi 767902634  930 TIQDKEKRIY 939
Cdd:PHA02562  394 TKSELVKEKY 403
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
704-931 6.78e-05

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 46.21  E-value: 6.78e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  704 NYSEKIKELTDKFIQEMESLKTKNQVLRTEKEKQDvyHHEHIEDLLDKQSRELQdmeccnnqkllleyekyqELQLKSQR 783
Cdd:cd22656    80 NYAQNAGGTIDSYYAEILELIDDLADATDDEELEE--AKKTIKALLDDLLKEAK------------------KYQDKAAK 139
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  784 MQEEYeKQLRDNDETKSQALEELTEFYEAKLQEKttLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKYEKkLRD 863
Cdd:cd22656   140 VVDKL-TDFENQTEKDQTALETLEKALKDLLTDE--GGAIARKEIKDLQKELEKLNEEYAAKLKAKIDELKALIAD-DEA 215
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767902634  864 EKESNLRLKGETGIMRKKFSSLQKEIEERTNDIEtlkgeqmKLQGVIKSLEKDIQGLKREIQERDETI 931
Cdd:cd22656   216 KLAAALRLIADLTAADTDLDNLLALIGPAIPALE-------KLQGAWQAIATDLDSLKDLLEDDISKI 276
CCDC73 pfam15818
Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil ...
680-1106 7.39e-05

Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil containing proteins. The function is not known. The alternative name is sarcoma antigen NY-SAR-79.


Pssm-ID: 464893 [Multi-domain]  Cd Length: 1048  Bit Score: 47.25  E-value: 7.39e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   680 MLELKTR----VEELKMENEyqlrlKDMNYSEKIkeltDKFIQEMESLKTKNQVLRTEKEKqdvyhhehiedlLDKQSRE 755
Cdd:pfam15818    2 LLDFKTSlleaLEELRMRRE-----AETQYEEQI----GKIIVETQELKWQKETLQNQKET------------LAKQHKE 60
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   756 LqdMECCNNQkllleyekyqeLQLKSQRMQEEYEK-QLRDndETKSQALEELTEFYEAKLQEKTTL---LEEAQEDVRQQ 831
Cdd:pfam15818   61 A--MAVFKKQ-----------LQMKMCALEEEKGKyQLAT--EIKEKEIEGLKETLKALQVSKYSLqkkVSEMEQKLQLH 125
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   832 LREFEETKKQIEEDEdreiqdiktKYEKKLrdekesnlrlKGETGIMRKKFSSLQKEIEERtndIETLKgeqmKLQGVIK 911
Cdd:pfam15818  126 LLAKEDHHKQLNEIE---------KYYATI----------TGQFGLVKENHGKLEQNVQEA---IQLNK----RLSALNK 179
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   912 SLEKDIQGLKREiqerdetiqdkekriydLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQeMEAEL---- 987
Cdd:pfam15818  180 KQESEICSLKKE-----------------LKKVTSDLIKSKVTCQYKMGEENINLTIKEQKFQELQERLN-MELELnkki 241
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   988 --ENFHKQNTQLELNITelwqkLRATDQEMRRERQKERDLEALVKRFKtdlHNCVAYIQEPRLLKEKVRGLFEKYVQRAD 1065
Cdd:pfam15818  242 neEITHIQEEKQDIIIS-----FQHMQQLLQQQTQANTEMEAELKALK---ENNQTLERDNELQREKVKENEEKFLNLQN 313
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|.
gi 767902634  1066 MVEIAglntdlQQEYTRQREHLERNLATLKKKVVKEGELHR 1106
Cdd:pfam15818  314 EHEKA------LGTWKKHVEELNGEINEIKNELSSLKETHI 348
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
770-1036 8.07e-05

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 46.75  E-value: 8.07e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  770 EYEKYQELQLKSQRmQEEYEKQLRDNDETKSQALEELTE-------------FYEAKLQEKTT--LLEEAQEDVRQQLRE 834
Cdd:PRK04778   49 ELEKVKKLNLTGQS-EEKFEEWRQKWDEIVTNSLPDIEEqlfeaeelndkfrFRKAKHEINEIesLLDLIEEDIEQILEE 127
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  835 FEETKKQiEEDEDREIQDIKTKYEKkLRDEKESNlrlkgetgimRKKFSSLQKEIEERTNDIETL---------KGEQMK 905
Cdd:PRK04778  128 LQELLES-EEKNREEVEQLKDLYRE-LRKSLLAN----------RFSFGPALDELEKQLENLEEEfsqfvelteSGDYVE 195
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  906 LQGVIKSLEKDIQGLKR---EIQERDETIQDK---------------------------EKRIYDLKKKNQE----LGKF 951
Cdd:PRK04778  196 AREILDQLEEELAALEQimeEIPELLKELQTElpdqlqelkagyrelveegyhldhldiEKEIQDLKEQIDEnlalLEEL 275
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  952 KF-VLDYKIKELKKQI----EPRENEI---RVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQKER 1023
Cdd:PRK04778  276 DLdEAEEKNEEIQERIdqlyDILEREVkarKYVEKNSDTLPDFLEHAKEQNKELKEEIDRVKQSYTLNESELESVRQLEK 355
                         330
                  ....*....|...
gi 767902634 1024 DLEALVKRFKTDL 1036
Cdd:PRK04778  356 QLESLEKQYDEIT 368
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
707-1167 8.22e-05

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 46.96  E-value: 8.22e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   707 EKIKELTDKFIQEMESLKTKNQVLRTEKEKQdvyhhEHIEDLLdkQSRELQdMECcnnqkllleyekyqelqlkSQRMQE 786
Cdd:TIGR00606  189 ETLRQVRQTQGQKVQEHQMELKYLKQYKEKA-----CEIRDQI--TSKEAQ-LES-------------------SREIVK 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   787 EYEKQLRDNDETKSQALEELTEFYeaKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKYEKKLRDEKE 866
Cdd:TIGR00606  242 SYENELDPLKNRLKEIEHNLSKIM--KLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLNDLYHNHQRTVREKER 319
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   867 SNLRLKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQ----ERDETIQDKEKRIYDLK 942
Cdd:TIGR00606  320 ELVDCQRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQEHIRARDSLIQSLATRLEldgfERGPFSERQIKNFHTLV 399
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   943 KKNQElGKFKFV----LDYKIKELKKQIEPRENEIRvMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRE 1018
Cdd:TIGR00606  400 IERQE-DEAKTAaqlcADLQSKERLKQEQADEIRDE-KKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRILELD 477
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1019 ---RQKERDLEALVKRFKTD-LHNCVAYIQEPRL-LKEKVRGLFEKYVQRadmveiaglntDLQQEYTRQREHLERNLAT 1093
Cdd:TIGR00606  478 qelRKAERELSKAEKNSLTEtLKKEVKSLQNEKAdLDRKLRKLDQEMEQL-----------NHHTTTRTQMEMLTKDKMD 546
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767902634  1094 LKKKVVKEGELHRTDYVRIMQE---NVSLIKEINELRRELKFTRSQVYDLEAALKLTKKVRPQEVSETEPSRDMLST 1167
Cdd:TIGR00606  547 KDEQIRKIKSRHSDELTSLLGYfpnKKQLEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSS 623
COG5022 COG5022
Myosin heavy chain [General function prediction only];
674-1131 8.73e-05

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 46.99  E-value: 8.73e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  674 EEKAQVMLELK-----TRVEELKMENEYQLRLKDMNYSEKIKELTDKfiqEMESLKTKNQVLRTEKEKQDVyhHEHIEDL 748
Cdd:COG5022   902 LELESEIIELKkslssDLIENLEFKTELIARLKKLLNNIDLEEGPSI---EYVKLPELNKLHEVESKLKET--SEEYEDL 976
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  749 LDKQS---RELQDMeccnNQKLLLEYEKYQELQLKSQRMQEEyEKQLRDNDeTKSQALEELTEFY---------EAKLQE 816
Cdd:COG5022   977 LKKSTilvREGNKA----NSELKNFKKELAELSKQYGALQES-TKQLKELP-VEVAELQSASKIIssestelsiLKPLQK 1050
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  817 KTTLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKYEKK---LRDEKESNLRLKGETGIMR------KKFSSLQK 887
Cdd:COG5022  1051 LKGLLLLENNQLQARYKALKLRRENSLLDDKQLYQLESTENLLKtinVKDLEVTNRNLVKPANVLQfivaqmIKLNLLQE 1130
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  888 EIEERTNDIETLKGEQMKLqgviKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFvldykiKELKKQIE 967
Cdd:COG5022  1131 ISKFLSQLVNTLEPVFQKL----SVLQLELDGLFWEANLEALPSPPPFAALSEKRLYQSALYDEKS------KLSSSEVN 1200
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  968 PRENEIRVMKEQIQEmEAELENFHKQNTQLELNITELWQKLRAT-DQEMRRERQKERDLEALVKRFKTDLHNCVAYIQEP 1046
Cdd:COG5022  1201 DLKNELIALFSKIFS-GWPRGDKLKKLISEGWVPTEYSTSLKGFnNLNKKFDTPASMSNEKLLSLLNSIDNLLSSYKLEE 1279
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1047 RLLKEKVRGLFEKYVQRADMVEIAGLNT-------------------DLQQEYTRQREHLERNLATLKKKVVKEGELHRT 1107
Cdd:COG5022  1280 EVLPATINSLLQYINVGLFNALRTKASSlrwksatevnynseelddwCREFEISDVDEELEELIQAVKVLQLLKDDLNKL 1359
                         490       500
                  ....*....|....*....|....
gi 767902634 1108 DYVRIMQENVSLIkEINELRRELK 1131
Cdd:COG5022  1360 DELLDACYSLNPA-EIQNLKSRYD 1382
Taxilin pfam09728
Myosin-like coiled-coil protein; Taxilin contains an extraordinarily long coiled-coil domain ...
718-979 1.09e-04

Myosin-like coiled-coil protein; Taxilin contains an extraordinarily long coiled-coil domain in its C-terminal half and is ubiquitously expressed. It is a novel binding partner of several syntaxin family members and is possibly involved in Ca2+-dependent exocytosis in neuroendocrine cells. Gamma-taxilin, described as leucine zipper protein Factor Inhibiting ATF4-mediated Transcription (FIAT), localizes to the nucleus in osteoblasts and dimerizes with ATF4 to form inactive dimers, thus inhibiting ATF4-mediated transcription.


Pssm-ID: 462861 [Multi-domain]  Cd Length: 302  Bit Score: 45.71  E-value: 1.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   718 QEMESLKTKNQVLRTEKEKQDVYHHEHIE--DLLDKQSRELQ----DMECCNNQKLLLEYEKYQELQLKSQRMQEEYEKQ 791
Cdd:pfam09728   39 KDLKKLKKKQDQLQKEKDQLQSELSKAILakSKLEKLCRELQkqnkKLKEESKKLAKEEEEKRKELSEKFQSTLKDIQDK 118
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   792 LRDNDETKSQALEELTEFYE--AKLQEKTTLLEE------AQEDVRQQLRE------FEETKKQIEEDEDREIQDIKTKY 857
Cdd:pfam09728  119 MEEKSEKNNKLREENEELREklKSLIEQYELRELhfekllKTKELEVQLAEaklqqaTEEEEKKAQEKEVAKARELKAQV 198
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   858 EKKLRDEKEsnlrLKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDetiqdkeKR 937
Cdd:pfam09728  199 QTLSETEKE----LREQLNLYVEKFEEFQDTLNKSNEVFTTFKKEMEKMSKKIKKLEKENLTWKRKWEKSN-------KA 267
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|..
gi 767902634   938 IYDLKKKNQELGKfkfvldyKIKELKKQIEPRENEIRVMKEQ 979
Cdd:pfam09728  268 LLEMAEERQKLKE-------ELEKLQKKLEKLENLCRALQAE 302
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
905-1202 1.49e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 46.20  E-value: 1.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   905 KLQGVIKSLEKDIQGLKREiqerdetiqdKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEME 984
Cdd:TIGR02168  190 RLEDILNELERQLKSLERQ----------AEKAERYKELKAELRELELALLVLRLEELREELEELQEELKEAEEELEELT 259
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   985 AELENFHKQNTQLELNITELWQKLRATDQEMRRERQKERDLEalvkrfktdlhncvayiQEPRLLKEKVRGLFEKYVQra 1064
Cdd:TIGR02168  260 AELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLE-----------------QQKQILRERLANLERQLEE-- 320
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1065 dmveiagLNTDLQQEyTRQREHLERNLATLKKKVVkegelhrtdyvrimqenvSLIKEINELRRELKFTRSQVYDLEAAL 1144
Cdd:TIGR02168  321 -------LEAQLEEL-ESKLDELAEELAELEEKLE------------------ELKEELESLEAELEELEAELEELESRL 374
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 767902634  1145 KLTKKVRPQEVSETepsrdmlstaptARLNEQEETgriiemQRLEIQRLRDQIQEQEQ 1202
Cdd:TIGR02168  375 EELEEQLETLRSKV------------AQLELQIAS------LNNEIERLEARLERLED 414
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
728-1187 1.77e-04

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 46.20  E-value: 1.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   728 QVLRTEKEKQDV-----YHHEHIEDLLDKQSRELQdmECCNNQKLLLEYekyqelqLKSQRMQEEYEKQLRdndETKSQA 802
Cdd:TIGR01612  328 KILESEGEQGHIinkliFLEKEFEDTIHKSDIYKD--ECLSNHLFMEDY-------LKDDKISPYYYEFLE---EIKKIA 395
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   803 LEELTEFYEAKLQEKTTLLEEAQEDVrqqLREFEETKKQ--IEEDEDREIQDIKTKYEKKLRDEKESNLRL--KGETGIM 878
Cdd:TIGR01612  396 KQRAIFFYNAKKLKHLEILYKHQEDI---LNNFHKTIERliFEKPDPNNNNIFKDDFDEFNKPIPKSKLKAleKRFFEIF 472
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   879 RKKFSS--LQKEIEERTNDIETLKG--EQMK-LQGVIKSLEkdiQGLKREIQERDETIQDKEKRIYDLKKKNQELGkfkf 953
Cdd:TIGR01612  473 EEEWGSydIKKDIDENSKQDNTVKLilMRMKdFKDIIDFME---LYKPDEVPSKNIIGFDIDQNIKAKLYKEIEAG---- 545
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   954 vldykikeLKKQIEPRENeirvMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEmrrerqkerdlealvkrfk 1033
Cdd:TIGR01612  546 --------LKESYELAKN----WKKLIHEIKKELEEENEDSIHLEKEIKDLFDKYLEIDDE------------------- 594
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1034 tdlhncVAYIQEPRL-LKEKVRGLFEK--YVQRA-DMVEIAGLNTDLQQEYTRQR-----EHLERN---LATLKKKVVKe 1101
Cdd:TIGR01612  595 ------IIYINKLKLeLKEKIKNISDKneYIKKAiDLKKIIENNNAYIDELAKISpyqvpEHLKNKdkiYSTIKSELSK- 667
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1102 geLHRTDYVRIMQENVSLIKEINELRRELKftrSQVYDLEAalKLTKKVRPQEVSETEPSRDMLSTAPTarlNEQEETGR 1181
Cdd:TIGR01612  668 --IYEDDIDALYNELSSIVKENAIDNTEDK---AKLDDLKS--KIDKEYDKIQNMETATVELHLSNIEN---KKNELLDI 737

                   ....*.
gi 767902634  1182 IIEMQR 1187
Cdd:TIGR01612  738 IVEIKK 743
COG5022 COG5022
Myosin heavy chain [General function prediction only];
770-1189 1.87e-04

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 45.84  E-value: 1.87e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  770 EYEKYQELQLKSQRMQEEYEK-QLRDNDETKSQALEELTEFYEAKLQEKTTLLEEAQEDVRQQLREFEETKKQIEE--DE 846
Cdd:COG5022   811 EYRSYLACIIKLQKTIKREKKlRETEEVEFSLKAEVLIQKFGRSLKAKKRFSLLKKETIYLQSAQRVELAERQLQElkID 890
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  847 DREIQDIKTKYEK------KLRDEKESNLRLKGEtgIMRKKFSSLQKEIEERtnDIETLKGEQMKLQGVIKSLEKDIQGL 920
Cdd:COG5022   891 VKSISSLKLVNLEleseiiELKKSLSSDLIENLE--FKTELIARLKKLLNNI--DLEEGPSIEYVKLPELNKLHEVESKL 966
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  921 KREIQERDETIQDKEKRIYDLKKKNQELGKFKfvldYKIKELKKQIEPRENEIRVMKEQIQEMEaelenFHKQNTQLELN 1000
Cdd:COG5022   967 KETSEEYEDLLKKSTILVREGNKANSELKNFK----KELAELSKQYGALQESTKQLKELPVEVA-----ELQSASKIISS 1037
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1001 ITELWQKLRATDQEMRRERQKERDLEALVKRFKTDLHNCVAYiqeprllkekvrgLFEKYVQRADMVEIAGLNTDLQQEY 1080
Cdd:COG5022  1038 ESTELSILKPLQKLKGLLLLENNQLQARYKALKLRRENSLLD-------------DKQLYQLESTENLLKTINVKDLEVT 1104
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1081 TRQREHLERNLATLKKKVVKEGELHRTDyvRIMQENVSLIKEINELRRELKFTRSQVYDLEAALKLTKKVRPQEVSETEP 1160
Cdd:COG5022  1105 NRNLVKPANVLQFIVAQMIKLNLLQEIS--KFLSQLVNTLEPVFQKLSVLQLELDGLFWEANLEALPSPPPFAALSEKRL 1182
                         410       420       430
                  ....*....|....*....|....*....|.
gi 767902634 1161 SRD-MLSTAPTARLNEQEE-TGRIIEMQRLE 1189
Cdd:COG5022  1183 YQSaLYDEKSKLSSSEVNDlKNELIALFSKI 1213
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
662-937 1.89e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 46.06  E-value: 1.89e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  662 FAEEVLvTKTDMEEKAQVMLELktrVEELkmeNEYQLRLKDMnySEKIKELTD--KFIQEMESLKTKNQVLRTEKEKQDV 739
Cdd:COG4913   213 VREYML-EEPDTFEAADALVEH---FDDL---ERAHEALEDA--REQIELLEPirELAERYAAARERLAELEYLRAALRL 283
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  740 YHHEHIEDLLDKQSRELQdmeccnnqkllleyEKYQELqlksqrmqeeyEKQLRDNDETKSQALEELTEFYEAKLQEKTT 819
Cdd:COG4913   284 WFAQRRLELLEAELEELR--------------AELARL-----------EAELERLEARLDALREELDELEAQIRGNGGD 338
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  820 LLEEAQEDVRQQlrefEETKKQIEEDEDReiqdiktkYEKKLRDekesnLRLKGETGimRKKFSSLQKEIEERtndIETL 899
Cdd:COG4913   339 RLEQLEREIERL----ERELEERERRRAR--------LEALLAA-----LGLPLPAS--AEEFAALRAEAAAL---LEAL 396
                         250       260       270
                  ....*....|....*....|....*....|....*...
gi 767902634  900 KGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKR 937
Cdd:COG4913   397 EEELEALEEALAEAEAALRDLRRELRELEAEIASLERR 434
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
955-1139 1.99e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 45.78  E-value: 1.99e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  955 LDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQN--TQLELNITELWQKLRATDQEMRRERQKERDLEALVKRF 1032
Cdd:COG3206   166 LELRREEARKALEFLEEQLPELRKELEEAEAALEEFRQKNglVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAAL 245
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1033 KTDL---HNCVAYIQEPRLLKEKVRGLFEKYVQRADMV---------------EIAGLNTDLQQEYTRQREHLERNLATL 1094
Cdd:COG3206   246 RAQLgsgPDALPELLQSPVIQQLRAQLAELEAELAELSarytpnhpdvialraQIAALRAQLQQEAQRILASLEAELEAL 325
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 767902634 1095 KKkvvKEGELHRT--DYVRIMQENVSLIKEINELRRELKFTRsQVYD 1139
Cdd:COG3206   326 QA---REASLQAQlaQLEARLAELPELEAELRRLEREVEVAR-ELYE 368
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
707-1035 2.22e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 45.68  E-value: 2.22e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  707 EKIKELTDKFIQEMESLKTKNQVLRTEKEKQDVYHH--EHIEDLLDKQS--RELQDMEccnnqkllleyEKYQELQLKSQ 782
Cdd:COG4913   617 AELAELEEELAEAEERLEALEAELDALQERREALQRlaEYSWDEIDVASaeREIAELE-----------AELERLDASSD 685
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  783 RMQEeyekqLRDNDETKSQALEELTEFYEAkLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDRE-IQDIKTKYEKKL 861
Cdd:COG4913   686 DLAA-----LEEQLEELEAELEELEEELDE-LKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLElRALLEERFAAAL 759
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  862 RDEKESNLR--LKGETGIMRKKFSSLQKEIEERTND-IETLKGEQMKLQGVIKSLEkDIQGLKREIQERDetIQDKEKRI 938
Cdd:COG4913   760 GDAVERELRenLEERIDALRARLNRAEEELERAMRAfNREWPAETADLDADLESLP-EYLALLDRLEEDG--LPEYEERF 836
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  939 YDLKKKNQElgkfkfvldYKIKELKKQIeprENEIRVMKEQIQEMEAELEN----------FHKQNTQLElNITELWQKL 1008
Cdd:COG4913   837 KELLNENSI---------EFVADLLSKL---RRAIREIKERIDPLNDSLKRipfgpgrylrLEARPRPDP-EVREFRQEL 903
                         330       340       350
                  ....*....|....*....|....*....|...
gi 767902634 1009 RA------TDQEMRRERQKERdLEALVKRFKTD 1035
Cdd:COG4913   904 RAvtsgasLFDEELSEARFAA-LKRLIERLRSE 935
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
928-1180 2.31e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 45.21  E-value: 2.31e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  928 DETIQDKEKRIYDLKKKNQELGKfkfvldyKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQK 1007
Cdd:COG3883    15 DPQIQAKQKELSELQAELEAAQA-------ELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREE 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1008 LRATDQEMRRERQKERDLEALVKrfKTDlhncvayiqeprllkekvrglFEKYVQRADMVE-IAGLNTDLQQEYTRQREH 1086
Cdd:COG3883    88 LGERARALYRSGGSVSYLDVLLG--SES---------------------FSDFLDRLSALSkIADADADLLEELKADKAE 144
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1087 LERNLATLKKKvVKEGELHRTDYVRIMQENVSLIKE----INELRRELKFTRSQVYDLEAALKLTKKVRPQEVSETEPSR 1162
Cdd:COG3883   145 LEAKKAELEAK-LAELEALKAELEAAKAELEAQQAEqealLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAA 223
                         250
                  ....*....|....*...
gi 767902634 1163 DMLSTAPTARLNEQEETG 1180
Cdd:COG3883   224 AAAAAAAAAAAAAAAAAA 241
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
947-1149 2.79e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 45.14  E-value: 2.79e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  947 ELGK---FKFVLDYKIKELKKQIE----PRENEIRVMKEQIQEMEAELENFHKQNTQLElnitELWQKLRATDQEMRRER 1019
Cdd:COG4717    33 EAGKstlLAFIRAMLLERLEKEADelfkPQGRKPELNLKELKELEEELKEAEEKEEEYA----ELQEELEELEEELEELE 108
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1020 QKERDLEALVKRFKtDLHNCVAYIQEPRLLKEKVRGLFEKYVQ-RADMVEIAglntDLQQEYTRQREHLERNLATLKKKV 1098
Cdd:COG4717   109 AELEELREELEKLE-KLLQLLPLYQELEALEAELAELPERLEElEERLEELR----ELEEELEELEAELAELQEELEELL 183
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767902634 1099 VKEGELHRTDYVRIMQENVSLIKEINELRRELKFTRSQVYDLEAALKLTKK 1149
Cdd:COG4717   184 EQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLEN 234
OmpH pfam03938
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
763-863 3.35e-04

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 461098 [Multi-domain]  Cd Length: 140  Bit Score: 42.18  E-value: 3.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   763 NNQKLLLEYEKYQELQLKSQRMQEEYEKQLrdndETKSQALEELTEFYEAKLQEKTTLLEEAQEDVRQQLREFEETKKQI 842
Cdd:pfam03938    6 DMQKILEESPEGKAAQAQLEKKFKKRQAEL----EAKQKELQKLYEELQKDGALLEEEREEKEQELQKKEQELQQLQQKA 81
                           90       100
                   ....*....|....*....|.
gi 767902634   843 EEDEDREIQDIKTKYEKKLRD 863
Cdd:pfam03938   82 QQELQKKQQELLQPIQDKINK 102
CCDC22 pfam05667
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 ...
822-993 3.80e-04

Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 (CCDC22) is involved in regulation of NF-kappa-B signalling; the function may involve association with COMMD8 and a CUL1-dependent E3 ubiquitin ligase complex. It is part of the OMMD/CCDC22/CCDC93 (CCC) complex, which interacts with the multisubunit WASH complex required for endosomal deposition of F-actin and cargo trafficking in conjunction with the retromer. This entry also includes CCDC22 homologs from animals and plants.


Pssm-ID: 461708 [Multi-domain]  Cd Length: 600  Bit Score: 44.63  E-value: 3.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   822 EEAQEDVRQQLREFEETKKQIEEDEDREIQDIKtKYEKKLRDEKESNLRLKGETGIMRKKFsslqKEIEERTNDIEtlkg 901
Cdd:pfam05667  334 EEELEELQEQLEDLESSIQELEKEIKKLESSIK-QVEEELEELKEQNEELEKQYKVKKKTL----DLLPDAEENIA---- 404
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   902 eqmKLQGVIKSLEKDIQGLKREIQERDETIQdKEKRIYDLKKKNQELgKFKFVLDyKIKELKKQIEPRENEIRVMKEQIQ 981
Cdd:pfam05667  405 ---KLQALVDASAQRLVELAGQWEKHRVPLI-EEYRALKEAKSNKED-ESQRKLE-EIKELREKIKEVAEEAKQKEELYK 478
                          170
                   ....*....|..
gi 767902634   982 EMEAELENFHKQ 993
Cdd:pfam05667  479 QLVAEYERLPKD 490
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
664-997 4.06e-04

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 44.44  E-value: 4.06e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  664 EEVLVTKTDMEE------KAQVML-----ELKTRVEELKMENEYqlrLKDMNYSEKIKELTDKfIQEMESLKTKNQVLRT 732
Cdd:PRK04778  205 EELAALEQIMEEipellkELQTELpdqlqELKAGYRELVEEGYH---LDHLDIEKEIQDLKEQ-IDENLALLEELDLDEA 280
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  733 EKEKQDVyhHEHIEDLLDKqsrelqdMEccnnqkllLEYEKYQELQLKSQRMQEEYEKQLRDNDETKsqalEELTEfyea 812
Cdd:PRK04778  281 EEKNEEI--QERIDQLYDI-------LE--------REVKARKYVEKNSDTLPDFLEHAKEQNKELK----EEIDR---- 335
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  813 kLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTkyekklrdekesnlrlkgetgimrkkFSSLQKEIEER 892
Cdd:PRK04778  336 -VKQSYTLNESELESVRQLEKQLESLEKQYDEITERIAEQEIA--------------------------YSELQEELEEI 388
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  893 TNDIETLKGEQMKLQGVIKSLEKDiqglkrEIQERdETIQDKEKRIYDLK----KKN-----QELGKFKFVLDYKIKELK 963
Cdd:PRK04778  389 LKQLEEIEKEQEKLSEMLQGLRKD------ELEAR-EKLERYRNKLHEIKryleKSNlpglpEDYLEMFFEVSDEIEALA 461
                         330       340       350
                  ....*....|....*....|....*....|....
gi 767902634  964 KQIEPRENEIRVMKEQIQEMEAELENFHKQNTQL 997
Cdd:PRK04778  462 EELEEKPINMEAVNRLLEEATEDVETLEEETEEL 495
WD40 pfam00400
WD domain, G-beta repeat;
367-396 4.14e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 38.87  E-value: 4.14e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 767902634   367 HSAPITGLATCIRKPLIATCSLDRSIRLWN 396
Cdd:pfam00400   10 HTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
364-396 5.23e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 38.83  E-value: 5.23e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 767902634    364 YPLHSAPITGLATCIRKPLIATCSLDRSIRLWN 396
Cdd:smart00320    8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Caldesmon pfam02029
Caldesmon;
768-1053 5.42e-04

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 44.09  E-value: 5.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   768 LLEYEKYQELQlKSQRMQEEYEKQlRDNDETKSQALEELTEfyeaklQEKTTLLEEAQEDVRQQLREFEETKKQIEEDED 847
Cdd:pfam02029   64 FLDRTAKREER-RQKRLQEALERQ-KEFDPTIADEKESVAE------RKENNEEEENSSWEKEEKRDSRLGRYKEEETEI 135
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   848 REIQDIKTKYEKKLRDEKEsnlrlKGEtgiMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQER 927
Cdd:pfam02029  136 REKEYQENKWSTEVRQAEE-----EGE---EEEDKSEEAEEVPTENFAKEEVKDEKIKKEKKVKYESKVFLDQKRGHPEV 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   928 deTIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPrENEIRVMKEQIQEME-AELENFHKQNTQLELNITELwQ 1006
Cdd:pfam02029  208 --KSQNGEEEVTKLKVTTKRRQGGLSQSQEREEEAEVFLEA-EQKLEELRRRRQEKEsEEFEKLRQKQQEAELELEEL-K 283
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 767902634  1007 KLRATDQEMRRERQKERDLEALVKRFKTDlhncvayiQEPRLLKEKV 1053
Cdd:pfam02029  284 KKREERRKLLEEEEQRRKQEEAERKLREE--------EEKRRMKEEI 322
WD40 COG2319
WD40 repeat [General function prediction only];
36-158 5.42e-04

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 43.75  E-value: 5.42e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   36 SQGMLALSISPNRRYLAiseTVQEKPAITIYELSSipcrkRKVLNNFDFQVQKFISMAFSPDSKYLLAQTSppeSNLVYw 115
Cdd:COG2319   288 SGGVNSVAFSPDGKLLA---SGSDDGTVRLWDLAT-----GKLLRTLTGHTGAVRSVAFSPDGKTLASGSD---DGTVR- 355
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 767902634  116 LWEKQKVMAIVRIDTQNNPVYQVSFSPQDNTQVCVTGNGMFKL 158
Cdd:COG2319   356 LWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRL 398
Macoilin pfam09726
Macoilin family; The Macoilin proteins has an N-terminal portion that is composed of 5 ...
670-948 5.55e-04

Macoilin family; The Macoilin proteins has an N-terminal portion that is composed of 5 trasnmembrane helices, followed by a C-terminal coiled-coil region. Macoilin is a highly conserved protein present in eukaryotes. Macoilin appears to be found in the ER and be involved in the function of neurons.


Pssm-ID: 462859 [Multi-domain]  Cd Length: 670  Bit Score: 44.07  E-value: 5.55e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   670 KTDMEEKAQVMLELKTRVEELkmeneyqlrlkdMNYSEKIKELTDKFIQEMESLKTKNQVLRTEKEKqdvyhhehiedll 749
Cdd:pfam09726  408 KAELQASRQTEQELRSQISSL------------TSLERSLKSELGQLRQENDLLQTKLHNAVSAKQK------------- 462
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   750 DKQSreLQDMEccnnQKLLLEyekyqelqlksQRMQEEYEKQLrdNDETKSQALEELTEFYEAKLQEKTTllEEAQEDVR 829
Cdd:pfam09726  463 DKQT--VQQLE----KRLKAE-----------QEARASAEKQL--AEEKKRKKEEEATAARAVALAAASR--GECTESLK 521
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   830 QQLREFEETKKQIEED---EDREIQDIKTKyEKKLRDEKESnlrlKGETGIMRKKFSSLQKEIEERTNDI--ETlkgeQM 904
Cdd:pfam09726  522 QRKRELESEIKKLTHDiklKEEQIRELEIK-VQELRKYKES----EKDTEVLMSALSAMQDKNQHLENSLsaET----RI 592
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 767902634   905 KLqgvikSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQEL 948
Cdd:pfam09726  593 KL-----DLFSALGDAKRQLEIAQGQIYQKDQEIKDLKQKIAEV 631
PRK12704 PRK12704
phosphodiesterase; Provisional
660-868 7.32e-04

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 43.61  E-value: 7.32e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  660 VGFAEEVLVTKTDMEEKAQVMLEL-KTRVEELKMENEYQLRLKDMNYSEKIKELTDKFiqEMESLKTKNQVLRTEKEKQD 738
Cdd:PRK12704   16 VGAVIGYFVRKKIAEAKIKEAEEEaKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEF--EKELRERRNELQKLEKRLLQ 93
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  739 VyhhehiEDLLDKQSRELQDMEccnnqkllleyekyQELQLKsqrmQEEYEKQLRDndetksqaLEELTEFYEAKLQEKT 818
Cdd:PRK12704   94 K------EENLDRKLELLEKRE--------------EELEKK----EKELEQKQQE--------LEKKEEELEELIEEQL 141
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....
gi 767902634  819 TLLEEA----QEDVRQQLreFEETKKQIEEDEDREIQDIKTKYekKLRDEKESN 868
Cdd:PRK12704  142 QELERIsgltAEEAKEIL--LEKVEEEARHEAAVLIKEIEEEA--KEEADKKAK 191
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
796-1202 9.63e-04

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 43.46  E-value: 9.63e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  796 DETKSQALEELTEFYEAKLQEKTTLLEEaqedvRQQLREFEETKKqieededreIQDIKTKY-------EKKLRDEKESN 868
Cdd:NF033838   53 NESQKEHAKEVESHLEKILSEIQKSLDK-----RKHTQNVALNKK---------LSDIKTEYlyelnvlKEKSEAELTSK 118
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  869 LR---------LKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVI--KSLEKDIQGLKREIQERD-ETIQDKEK 936
Cdd:NF033838  119 TKkeldaafeqFKKDTLEPGKKVAEATKKVEEAEKKAKDQKEEDRRNYPTNtyKTLELEIAESDVEVKKAElELVKEEAK 198
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  937 RIYDLKKKNQELGKFkfvldykikELKKQIEPRENEIRVMKEQIQE-----MEAELENFHKQNTQLELNITELWQKLRAT 1011
Cdd:NF033838  199 EPRDEEKIKQAKAKV---------ESKKAEATRLEKIKTDREKAEEeakrrADAKLKEAVEKNVATSEQDKPKRRAKRGV 269
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1012 DQEMRRERQKERDLEAlvkrfkTDLHNCVAYIQEPRLLKEKVRGLFEKYVQRAdmveiaglntdlQQEYTRQREHLERNL 1091
Cdd:NF033838  270 LGEPATPDKKENDAKS------SDSSVGEETLPSPSLKPEKKVAEAEKKVEEA------------KKKAKDQKEEDRRNY 331
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1092 ATLKKKVVkEGELHRTDyVRIMQENVSLIKEI-NELRRELKFTRSQvydleaalkltKKVrpqEVSETEPSRdmLSTAPT 1170
Cdd:NF033838  332 PTNTYKTL-ELEIAESD-VKVKEAELELVKEEaKEPRNEEKIKQAK-----------AKV---ESKKAEATR--LEKIKT 393
                         410       420       430
                  ....*....|....*....|....*....|..
gi 767902634 1171 ARLNEQEETGRIIEmqrlEIQRLRDQIQEQEQ 1202
Cdd:NF033838  394 DRKKAEEEAKRKAA----EEDKVKEKPAEQPQ 421
PRK05771 PRK05771
V-type ATP synthase subunit I; Validated
815-1004 1.20e-03

V-type ATP synthase subunit I; Validated


Pssm-ID: 235600 [Multi-domain]  Cd Length: 646  Bit Score: 42.99  E-value: 1.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  815 QEKTTLLEEAQEdvrQQLREFEETKKQIEEDEDREIQDIKTKYEK---KLRDEKESNLRLKGETGIMRKK---------- 881
Cdd:PRK05771   16 SYKDEVLEALHE---LGVVHIEDLKEELSNERLRKLRSLLTKLSEaldKLRSYLPKLNPLREEKKKVSVKsleelikdve 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  882 --FSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEK------DIQ-------------GLKREIQERDETI--------- 931
Cdd:PRK05771   93 eeLEKIEKEIKELEEEISELENEIKELEQEIERLEPwgnfdlDLSlllgfkyvsvfvgTVPEDKLEELKLEsdvenveyi 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  932 -QDKEKRIY-------DLKKKNQELGKF---KFVLDYK--IKELKKQIEPRENEIrvmKEQIQEMEAELENFHKQNTQLE 998
Cdd:PRK05771  173 sTDKGYVYVvvvvlkeLSDEVEEELKKLgfeRLELEEEgtPSELIREIKEELEEI---EKERESLLEELKELAKKYLEEL 249

                  ....*.
gi 767902634  999 LNITEL 1004
Cdd:PRK05771  250 LALYEY 255
ATP-synt_Fo_b cd06503
F-type ATP synthase, membrane subunit b; Membrane subunit b is a component of the Fo complex ...
775-855 1.21e-03

F-type ATP synthase, membrane subunit b; Membrane subunit b is a component of the Fo complex of FoF1-ATP synthase. The F-type ATP synthases (FoF1-ATPase) consist of two structural domains: the F1 (assembly factor one) complex containing the soluble catalytic core, and the Fo (oligomycin sensitive factor) complex containing the membrane proton channel, linked together by a central stalk and a peripheral stalk. F1 is composed of alpha (or A), beta (B), gamma (C), delta (D) and epsilon (E) subunits with a stoichiometry of 3:3:1:1:1, while Fo consists of the three subunits a, b, and c (1:2:10-14). An oligomeric ring of 10-14 c subunits (c-ring) make up the Fo rotor. The flux of protons through the ATPase channel (Fo) drives the rotation of the c-ring, which in turn is coupled to the rotation of the F1 complex gamma subunit rotor due to the permanent binding between the gamma and epsilon subunits of F1 and the c-ring of Fo. The F-ATP synthases are primarily found in the inner membranes of eukaryotic mitochondria, in the thylakoid membranes of chloroplasts or in the plasma membranes of bacteria. The F-ATP synthases are the primary producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). Alternatively, under conditions of low driving force, ATP synthases function as ATPases, thus generating a transmembrane proton or Na(+) gradient at the expense of energy derived from ATP hydrolysis. This group also includes F-ATP synthase that has also been found in the archaea Candidatus Methanoperedens.


Pssm-ID: 349951 [Multi-domain]  Cd Length: 132  Bit Score: 40.11  E-value: 1.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  775 QELQLKSQRMQEEYEKQLRdndETKSQALEELTEFYEAKLQEKTTLLEEAQEDVRQQLrefEETKKQIEEDEDREIQDIK 854
Cdd:cd06503    43 EKAKEEAEELLAEYEEKLA---EARAEAQEIIEEARKEAEKIKEEILAEAKEEAERIL---EQAKAEIEQEKEKALAELR 116

                  .
gi 767902634  855 T 855
Cdd:cd06503   117 K 117
ATG16 pfam08614
Autophagy protein 16 (ATG16); Autophagy is a ubiquitous intracellular degradation system for ...
804-942 1.22e-03

Autophagy protein 16 (ATG16); Autophagy is a ubiquitous intracellular degradation system for eukaryotic cells. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. ATG16 (also known as Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate in the yeast autophagy pathway.


Pssm-ID: 462536 [Multi-domain]  Cd Length: 176  Bit Score: 41.07  E-value: 1.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   804 EELTEFYeAKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKYE---KKLRDEKESNLRLKGEtgimrk 880
Cdd:pfam08614    3 LELIDAY-NRLLDRTALLEAENAKLQSEPESVLPSTSSSKLSKASPQSASIQSLEqllAQLREELAELYRSRGE------ 75
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767902634   881 kfssLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIYDLK 942
Cdd:pfam08614   76 ----LAQRLVDLNEELQELEKKLREDERRLAALEAERAQLEEKLKDREEELREKRKLNQDLQ 133
46 PHA02562
endonuclease subunit; Provisional
828-1024 1.41e-03

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 42.69  E-value: 1.41e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  828 VRQQLREFEETKKQIEEDEDREIQDIKTKYEKKLRDEKEsnlrLKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQ 907
Cdd:PHA02562  193 IQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKT----IKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIK 268
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  908 GVIKSLEKDI-------------QGLKREiQERDETIQDK----EKRIYDLKKKNQELGKFKFVLD---YKIKELKKQIE 967
Cdd:PHA02562  269 SKIEQFQKVIkmyekggvcptctQQISEG-PDRITKIKDKlkelQHSLEKLDTAIDELEEIMDEFNeqsKKLLELKNKIS 347
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 767902634  968 PRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEmRRERQKERD 1024
Cdd:PHA02562  348 TNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDELDKIVKT-KSELVKEKY 403
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
655-933 1.50e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 43.12  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   655 KREREVGFAEEVLVTKTDMEEKAQVMLE----LKTRVEELKME-----NEYQLRLKDMNYSEKIKELTDKFIQEMESLKT 725
Cdd:TIGR02168  769 RLEEAEEELAEAEAEIEELEAQIEQLKEelkaLREALDELRAEltllnEEAANLRERLESLERRIAATERRLEDLEEQIE 848
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   726 KNQVLRTEKEKQdvyhHEHIEDLLDKQSRELQdmeccnnqkllLEYEKYQELQLKSQRMQEEYEKQLRDNDETKSQALEE 805
Cdd:TIGR02168  849 ELSEDIESLAAE----IEELEELIEELESELE-----------ALLNERASLEEALALLRSELEELSEELRELESKRSEL 913
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   806 ltefyEAKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKYEKKLRDEKESNLRLKGetgiMRKKFSSL 885
Cdd:TIGR02168  914 -----RRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKR----LENKIKEL 984
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 767902634   886 Q----------KEIEERtndIETLKGEQMKLQGVIKSLEKDIQGLKREIQER-DETIQD 933
Cdd:TIGR02168  985 GpvnlaaieeyEELKER---YDFLTAQKEDLTEAKETLEEAIEEIDREARERfKDTFDQ 1040
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
682-948 2.00e-03

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 41.82  E-value: 2.00e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  682 ELKTRVEELKMENEYQLRLKDMNYsEKIKELTDKFIQEMESLKTKNQVLRTEKEKQDVYHHEHIEdlLDKQSRELQDME- 760
Cdd:COG1340    47 ELNAQVKELREEAQELREKRDELN-EKVKELKEERDELNEKLNELREELDELRKELAELNKAGGS--IDKLRKEIERLEw 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  761 CCNNQKLLLEYEKyqELQLKSQRMQEEYEKqLRDNDETKSQALEELTEFYEAKLQekttlleeaQEDVRQQLREFEETKK 840
Cdd:COG1340   124 RQQTEVLSPEEEK--ELVEKIKELEKELEK-AKKALEKNEKLKELRAELKELRKE---------AEEIHKKIKELAEEAQ 191
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  841 QIEEdedrEIQDIKTKYEKkLRDEKESnlrlkgetgiMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGL 920
Cdd:COG1340   192 ELHE----EMIELYKEADE-LRKEADE----------LHKEIVEAQEKADELHEEIIELQKELRELRKELKKLRKKQRAL 256
                         250       260
                  ....*....|....*....|....*...
gi 767902634  921 KREiqERDETIQDKEKRIYDLKKKNQEL 948
Cdd:COG1340   257 KRE--KEKEELEEKAEEIFEKLKKGEKL 282
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
806-1037 2.02e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 42.59  E-value: 2.02e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  806 LTEFYEAKLQEKTTLLEEAqEDVRQQLREFEETKKQIEEDEDR-----EIQDIKTKYEKKLRDEKESN-LRLKGETGIMR 879
Cdd:COG4913   209 LDDFVREYMLEEPDTFEAA-DALVEHFDDLERAHEALEDAREQiellePIRELAERYAAARERLAELEyLRAALRLWFAQ 287
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  880 KKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDetiqdkekriydlkkknqelGKfkfvldyki 959
Cdd:COG4913   288 RRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRGNG--------------------GD--------- 338
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  960 kelkkQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLEL-----------NITELWQKLRATDQEMRRERQKERDLEAL 1028
Cdd:COG4913   339 -----RLEQLEREIERLERELEERERRRARLEALLAALGLplpasaeefaaLRAEAAALLEALEEELEALEEALAEAEAA 413

                  ....*....
gi 767902634 1029 VKRFKTDLH 1037
Cdd:COG4913   414 LRDLRRELR 422
PTZ00332 PTZ00332
paraflagellar rod protein; Provisional
754-1151 2.15e-03

paraflagellar rod protein; Provisional


Pssm-ID: 240364 [Multi-domain]  Cd Length: 589  Bit Score: 42.26  E-value: 2.15e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  754 RELQDMECCNNQKLLLEYEKYQELQLKSQRMQEEYEKQLRD-------------NDETKSQALEELTEFyeakLQEKTTL 820
Cdd:PTZ00332  162 ATLKNIEDIMNVTQIQNALASTDDQIKTQLAQLEKTNEIQNvamhdgemqvaeeQMWTKVQLQERLIEL----VADKFRL 237
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  821 LEEAqEDVRQQLREFEETKKQIEEdEDREIQDIKtkyeKKLRDEKESNLRlkgetgimrkkfsSLQKEIEERtnDIEtlK 900
Cdd:PTZ00332  238 IGKC-EEENKSFSKIHEVQKQANQ-ETSQMKDAK----RRLKQRCETDLK-------------HIHDAIQKA--DLE--D 294
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  901 GEQMKLQGVIKslEKDiqglKREIQERDETIQDKEKRIYDLKKKNQELGKFKFvldykiKELKKQIEP--RENEIRVMKE 978
Cdd:PTZ00332  295 AEAMKRYATNK--EKS----ERFIRENEDRQEEAWNKIQDLERQLQRLGTERF------EEVKRRIEEndREEKRRVEYQ 362
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  979 QIQEMEAE---LENFHKQNTQLELNITELWQKLRA------------TDQEM---RRERQKERdLEALVKRFKTdlHNCV 1040
Cdd:PTZ00332  363 QFLEVAGQhkkLLELTVYNCDLALRCTGLVEELVSegcaavkarhdkTNQDLaalRLQVHKEH-LEYFRMLYLT--LGSL 439
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1041 AYIQEPRLlKEKVRGLFEKYVQRADMVEI----AGLNTDLQQEYTRQREHLERNLATLKKKVVKEGELHRTDYVRIMQEN 1116
Cdd:PTZ00332  440 IYKKEKRL-EEIDRNIRTTHIQLEFCVETfdpnAKKHADMKKELYKLRQGVEEELAMLKEKQAQALEMFKESEEALDAAG 518
                         410       420       430
                  ....*....|....*....|....*....|....*
gi 767902634 1117 VSLIKEINELRRELKFTRSQVYDLEAALKLTKKVR 1151
Cdd:PTZ00332  519 IEFVHPVDENNEEVLTRRSKMVEYRSHLAKQEEVK 553
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
727-1024 2.23e-03

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 41.83  E-value: 2.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   727 NQVLRTEKEKQDVYHHEHIEDLLDKQSRELQDMECCNNQKLLLEYEKYQ----ELQLKSQRMQEEYEKQLRDNDETKSQA 802
Cdd:pfam13868   31 KKRIKAEEKEEERRLDEMMEEERERALEEEEEKEEERKEERKRYRQELEeqieEREQKRQEEYEEKLQEREQMDEIVERI 110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   803 LEELTEFYEAKLQEKTTLLEEAQEDVRQQlREFEETKKQIEEDEDREIQDIKTKYEKKLRDEKESNLRLKGETGIMRKKF 882
Cdd:pfam13868  111 QEEDQAEAEEKLEKQRQLREEIDEFNEEQ-AEWKELEKEEEREEDERILEYLKEKAEREEEREAEREEIEEEKEREIARL 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   883 SSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIydlkkknqelgkfkfvldyKIKEL 962
Cdd:pfam13868  190 RAQQEKAQDEKAERDELRAKLYQEEQERKERQKEREEAEKKARQRQELQQAREEQI-------------------ELKER 250
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 767902634   963 KKQIEPRENE---IRVMKEQIQEMEAELENFHKQNTQLELNITELWQKLRATDQEMRRERQKERD 1024
Cdd:pfam13868  251 RLAEEAEREEeefERMLRKQAEDEEIEQEEAEKRRMKRLEHRRELEKQIEEREEQRAAEREEELE 315
Golgin_A5 pfam09787
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ...
931-1089 2.32e-03

Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.


Pssm-ID: 462900 [Multi-domain]  Cd Length: 305  Bit Score: 41.67  E-value: 2.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   931 IQDKEKRIYDLKK-KNQELGKFKFVLDYKIKELKK-------QIEPRENEIRVMKEQIQEMEA----ELENFHKQNTQLE 998
Cdd:pfam09787   20 LQSKEKLIASLKEgSGVEGLDSSTALTLELEELRQerdllreEIQKLRGQIQQLRTELQELEAqqqeEAESSREQLQELE 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   999 LNITELWQKLRATDQEMRRERQKERDLEALVKRFKTDLHNCV----AYIQEPRL--------------LKEKVRGLFEKY 1060
Cdd:pfam09787  100 EQLATERSARREAEAELERLQEELRYLEEELRRSKATLQSRIkdreAEIEKLRNqltsksqssssqseLENRLHQLTETL 179
                          170       180
                   ....*....|....*....|....*....
gi 767902634  1061 VQRADMVEiaglntDLQQEYTRQREHLER 1089
Cdd:pfam09787  180 IQKQTMLE------ALSTEKNSLVLQLER 202
AAA_13 pfam13166
AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA ...
772-1000 2.33e-03

AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA superfamily. Many of the proteins in this family are conjugative transfer proteins. This family includes the PrrC protein that is thought to be the active component of the anticodon nuclease.


Pssm-ID: 463796 [Multi-domain]  Cd Length: 712  Bit Score: 42.36  E-value: 2.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   772 EKYQELQLKSQRMQEEYEKQlrdndetKSQALEELTEfyeakLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDReiq 851
Cdd:pfam13166  279 DEFTEFQNRLQKLIEKVESA-------ISSLLAQLPA-----VSDLASLLSAFELDVEDIESEAEVLNSQLDGLRRA--- 343
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   852 diktKYEKKLRDEKESNL-RLKGETGIMRKKFSSLQKEIE---ERTNDIETLKGEQMKL--QGVIKSLEKDIQGLKREIQ 925
Cdd:pfam13166  344 ----LEAKRKDPFKSIELdSVDAKIESINDLVASINELIAkhnEITDNFEEEKNKAKKKlrLHLVEEFKSEIDEYKDKYA 419
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 767902634   926 ERDETIQDKEKRIYDLKKKNQELGKfkfvldyKIKELkkqieprENEIRVMKEQIQEMEAELENFHKQNTQLELN 1000
Cdd:pfam13166  420 GLEKAINSLEKEIKNLEAEIKKLRE-------EIKEL-------EAQLRDHKPGADEINKLLKAFGFGELELSFN 480
46 PHA02562
endonuclease subunit; Provisional
881-1035 2.38e-03

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 41.92  E-value: 2.38e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  881 KFSSLQKEIEERTNDIETL----KGEQMKLQGVIKSLEKDIQGLKREIQERDETI----QDKEKRIYDLKKKNQELGKFK 952
Cdd:PHA02562  189 KIDHIQQQIKTYNKNIEEQrkknGENIARKQNKYDELVEEAKTIKAEIEELTDELlnlvMDIEDPSAALNKLNTAAAKIK 268
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  953 FVLDYKIKELK------------KQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLEL----------NITELWQKLRA 1010
Cdd:PHA02562  269 SKIEQFQKVIKmyekggvcptctQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEimdefneqskKLLELKNKIST 348
                         170       180
                  ....*....|....*....|....*
gi 767902634 1011 TDQEMRRERQKERDLEALVKRFKTD 1035
Cdd:PHA02562  349 NKQSLITLVDKAKKVKAAIEELQAE 373
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
779-953 2.56e-03

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 41.54  E-value: 2.56e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634    779 LKSQRMQEEYEKQLRDNDEtksqalEELTEFYEAKLQEKTTLLEEAQE--DVRQQLRE-FEETKKQIEEdedreiqdikt 855
Cdd:smart00787  128 LEAKKMWYEWRMKLLEGLK------EGLDENLEGLKEDYKLLMKELELlnSIKPKLRDrKDALEEELRQ----------- 190
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634    856 kyEKKLRDEKESNLrlKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIqdKE 935
Cdd:smart00787  191 --LKQLEDELEDCD--PTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKL--EQ 264
                           170
                    ....*....|....*...
gi 767902634    936 KRIYDLKKKNQELGKFKF 953
Cdd:smart00787  265 CRGFTFKEIEKLKEQLKL 282
PRK12704 PRK12704
phosphodiesterase; Provisional
674-853 2.88e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 41.69  E-value: 2.88e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  674 EEKAQVMLELKTRVEELKMENEYQLRLKDmnysEKIKELTDKFIQEMESLKTKnqvlrtekekqdvyhhehiEDLLDKQS 753
Cdd:PRK12704   53 AIKKEALLEAKEEIHKLRNEFEKELRERR----NELQKLEKRLLQKEENLDRK-------------------LELLEKRE 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  754 RELQDMECCNNQKLlleyekyQELQLKSQRMQEEYEKQLrdndetksQALEE---LTEFyEAKLQEKTTLLEEAQEDVRQ 830
Cdd:PRK12704  110 EELEKKEKELEQKQ-------QELEKKEEELEELIEEQL--------QELERisgLTAE-EAKEILLEKVEEEARHEAAV 173
                         170       180
                  ....*....|....*....|...
gi 767902634  831 QLREFEEtkkQIEEDEDREIQDI 853
Cdd:PRK12704  174 LIKEIEE---EAKEEADKKAKEI 193
PLN02939 PLN02939
transferase, transferring glycosyl groups
847-1131 3.20e-03

transferase, transferring glycosyl groups


Pssm-ID: 215507 [Multi-domain]  Cd Length: 977  Bit Score: 41.81  E-value: 3.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  847 DREIQDIKTKYEKKLRD-EKESNLRLKGETGIMR---KKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLEKDIQ---- 918
Cdd:PLN02939  105 DEAIAAIDNEQQTNSKDgEQLSDFQLEDLVGMIQnaeKNILLLNQARLQALEDLEKILTEKEALQGKINILEMRLSetda 184
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  919 --GLKREIQERDETIQDK-EKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQ---IQEMEAELENFHK 992
Cdd:PLN02939  185 riKLAAQEKIHVEILEEQlEKLRNELLIRGATEGLCVHSLSKELDVLKEENMLLKDDIQFLKAElieVAETEERVFKLEK 264
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  993 QNTQLELNITELWQKLRATDQEMRRER--------QKERDLEALVKRFKTDLHNCVAYIQEPRLLKEKVRGLfEKYVQRA 1064
Cdd:PLN02939  265 ERSLLDASLRELESKFIVAQEDVSKLSplqydcwwEKVENLQDLLDRATNQVEKAALVLDQNQDLRDKVDKL-EASLKEA 343
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767902634 1065 DMVEIAGLNTDL-QQEYTRQREHLERNlatlkkkvvkEGELHrtDYVRIMQEnvsLIKEINELRRELK 1131
Cdd:PLN02939  344 NVSKFSSYKVELlQQKLKLLEERLQAS----------DHEIH--SYIQLYQE---SIKEFQDTLSKLK 396
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
768-1027 3.32e-03

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 41.70  E-value: 3.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   768 LLEYEKYQELQLKSQRMQEEYEK-----QLRDNDETKSQALEELTEF------YEAKLQEKTTLLEEAQEDVRQQLREFE 836
Cdd:pfam01576  476 LLQEETRQKLNLSTRLRQLEDERnslqeQLEEEEEAKRNVERQLSTLqaqlsdMKKKLEEDAGTLEALEEGKKRLQRELE 555
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   837 ETKKQIEEDEDR--EIQDIKTKYEKKLRDEKESNLRLKGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIKSLE 914
Cdd:pfam01576  556 ALTQQLEEKAAAydKLEKTKNRLQQELDDLLVDLDHQRQLVSNLEKKQKKFDQMLAEEKAISARYAEERDRAEAEAREKE 635
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   915 KDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQN 994
Cdd:pfam01576  636 TRALSLARALEEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNVHELERSKRALEQQVEEMKTQLEELEDELQATEDAK 715
                          250       260       270
                   ....*....|....*....|....*....|...
gi 767902634   995 TQLELNItelwQKLRAtdqemrrerQKERDLEA 1027
Cdd:pfam01576  716 LRLEVNM----QALKA---------QFERDLQA 735
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
681-866 3.44e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 41.68  E-value: 3.44e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  681 LELKTRVEELKMENEYQLRLKDMNYSEKIKELTDKFIQEMESLKTKNQVLRT-EKEKQDVYHHEHIEDLLDK-QSRELQD 758
Cdd:COG4717   307 LQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEElEEELQLEELEQEIAALLAEaGVEDEEE 386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  759 MEccnnqKLLLEYEKYQELQLKsqrmQEEYEKQLRDNDETKSQALEELT-EFYEAKLQEKTTLLEEAQEDVRQQLREFEE 837
Cdd:COG4717   387 LR-----AALEQAEEYQELKEE----LEELEEQLEELLGELEELLEALDeEELEEELEELEEELEELEEELEELREELAE 457
                         170       180       190
                  ....*....|....*....|....*....|
gi 767902634  838 TKKQIEE-DEDREIQDIKTKYEKKLRDEKE 866
Cdd:COG4717   458 LEAELEQlEEDGELAELLQELEELKAELRE 487
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
703-876 3.45e-03

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 41.16  E-value: 3.45e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634    703 MNYSEKIKELTDKfiqEMESLKTKNQVLRTEKEKQDVYHHEhIEDLLDKQSRELQDMeccnnQKLLLEYEKYQELQLKSQ 782
Cdd:smart00787  139 MKLLEGLKEGLDE---NLEGLKEDYKLLMKELELLNSIKPK-LRDRKDALEEELRQL-----KQLEDELEDCDPTELDRA 209
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634    783 RmqEEYEKQLRDNDEtKSQALEELTEfyeaKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKYEKKLR 862
Cdd:smart00787  210 K--EKLKKLLQEIMI-KVKKLEELEE----ELQELESKIEDLTNKKSELNTEIAEAEKKLEQCRGFTFKEIEKLKEQLKL 282
                           170
                    ....*....|....
gi 767902634    863 DEKESNLRLKGETG 876
Cdd:smart00787  283 LQSLTGWKITKLSG 296
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
789-1203 3.73e-03

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 41.73  E-value: 3.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   789 EKQLRDNDETKSQALEELTEFYEAKLQEKTTLLEEAQEDVRQQlREFEETKKQIEEDEDREIQDIKTKYEKklrdEKESN 868
Cdd:pfam10174   44 ERALRKEEAARISVLKEQYRVTQEENQHLQLTIQALQDELRAQ-RDLNQLLQQDFTTSPVDGEDKFSTPEL----TEENF 118
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   869 LRLKGETGIMRKKFSSLQKEIEERTNDIETLKG-------------EQMKLQGVIKSLEKDIQGLKREIQERDETIQDKE 935
Cdd:pfam10174  119 RRLQSEHERQAKELFLLRKTLEEMELRIETQKQtlgardesikkllEMLQSKGLPKKSGEEDWERTRRIAEAEMQLGHLE 198
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   936 KRiydLKKKNQELGKFKFVLDY---------KIKELKKQIEPRENEIRVMKEQIQEMEAELEN------FHKQNTQLELN 1000
Cdd:pfam10174  199 VL---LDQKEKENIHLREELHRrnqlqpdpaKTKALQTVIEMKDTKISSLERNIRDLEDEVQMlktnglLHTEDREEEIK 275
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1001 ITELWQ--------KLRATDQEMRRerqKERDLEALVKRFKTdLHN----CVAYIQeprLLKEKvrgLFEKYvQRADMve 1068
Cdd:pfam10174  276 QMEVYKshskfmknKIDQLKQELSK---KESELLALQTKLET-LTNqnsdCKQHIE---VLKES---LTAKE-QRAAI-- 342
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1069 iaglntdLQQEYTRQREHLERNLATLKKKvvkegelhrTDYVRIMQENVSL-------------IKE--INELRRELKFT 1133
Cdd:pfam10174  343 -------LQTEVDALRLRLEEKESFLNKK---------TKQLQDLTEEKSTlageirdlkdmldVKErkINVLQKKIENL 406
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1134 RSQVYDLEAALKlTKKVRPQEVSETEPSRDMLSTAPTARLNEQEetgRIIEmqRLEIQRLRDQIQEQEQV 1203
Cdd:pfam10174  407 QEQLRDKDKQLA-GLKERVKSLQTDSSNTDTALTTLEEALSEKE---RIIE--RLKEQREREDRERLEEL 470
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
888-1153 3.89e-03

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 41.75  E-value: 3.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   888 EIEERTNDIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQDKEKRIY----DLKKKNQELGKFKFVLDYKIKELK 963
Cdd:pfam12128  242 EFTKLQQEFNTLESAELRLSHLHFGYKSDETLIASRQEERQETSAELNQLLRtlddQWKEKRDELNGELSAADAAVAKDR 321
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   964 KQIEPRENEIRV-MKEQIQEMEAELENFHKQNTQLEL---NITELWQKLRATDQEMRRERQKeRDLEalVKRFKTDLHNC 1039
Cdd:pfam12128  322 SELEALEDQHGAfLDADIETAAADQEQLPSWQSELENleeRLKALTGKHQDVTAKYNRRRSK-IKEQ--NNRDIAGIKDK 398
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1040 VAYIQEprllkEKVRGLfekyvqradmveiAGLNTDLQQEYTRQREHLERNLATLK--KKVVKE--GELH-RTDYVRIMQ 1114
Cdd:pfam12128  399 LAKIRE-----ARDRQL-------------AVAEDDLQALESELREQLEAGKLEFNeeEYRLKSrlGELKlRLNQATATP 460
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 767902634  1115 ENVSLIK----EINELRRELKFTRSQVYDLEAALKLTKKVRPQ 1153
Cdd:pfam12128  461 ELLLQLEnfdeRIERAREEQEAANAEVERLQSELRQARKRRDQ 503
WD40 COG2319
WD40 repeat [General function prediction only];
40-169 4.09e-03

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 41.05  E-value: 4.09e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   40 LALSISPNRRYLAiseTVQEKPAITIYELSSipcrkRKVLNNFDFQVQKFISMAFSPDSKYLLAQTsppESNLVyWLWEK 119
Cdd:COG2319   250 RSVAFSPDGRLLA---SGSADGTVRLWDLAT-----GELLRTLTGHSGGVNSVAFSPDGKLLASGS---DDGTV-RLWDL 317
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 767902634  120 QKVMAIVRIDTQNNPVYQVSFSPQDNTQVCVTGNGMFKLLRFAEGTLKQT 169
Cdd:COG2319   318 ATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRT 367
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
821-1168 4.33e-03

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 41.57  E-value: 4.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   821 LEEAQEDVRQQLREFEETKKQIEEDEDR--EIQDIKTKYEKKLRDEKESNLRLKGETGIMRKKfsslQKEIEERTNDIET 898
Cdd:TIGR00606  191 LRQVRQTQGQKVQEHQMELKYLKQYKEKacEIRDQITSKEAQLESSREIVKSYENELDPLKNR----LKEIEHNLSKIMK 266
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   899 LKGEQMKLQGVIKSLEKDIQGLK----------------------REIQERDETIQDKEKRIYDLKKKNQELGKFKFVLD 956
Cdd:TIGR00606  267 LDNEIKALKSRKKQMEKDNSELElkmekvfqgtdeqlndlyhnhqRTVREKERELVDCQRELEKLNKERRLLNQEKTELL 346
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   957 YKIKELKKQIEPRENEIRVMKEQIQEMEaelenfhkqnTQLELNITElwqklRATDQEMRRERQKERDLEALVKRFKTDL 1036
Cdd:TIGR00606  347 VEQGRLQLQADRHQEHIRARDSLIQSLA----------TRLELDGFE-----RGPFSERQIKNFHTLVIERQEDEAKTAA 411
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  1037 HNCVAYIQEPRLLKEKVRGLfekyvqradMVEIAGLNTDLQQEytrqREHLERNLATLKKKvVKEGELHRTDYVRIMQEN 1116
Cdd:TIGR00606  412 QLCADLQSKERLKQEQADEI---------RDEKKGLGRTIELK----KEILEKKQEELKFV-IKELQQLEGSSDRILELD 477
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 767902634  1117 VSLIKEINEL------------RRELKFTRSQVYDLEAALKLTKKVRPQEVSETEPSRDMLSTA 1168
Cdd:TIGR00606  478 QELRKAERELskaeknsltetlKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLT 541
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
687-1014 4.58e-03

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 41.58  E-value: 4.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   687 VEELKMENEYQLRLKDMNYS-EKIKELTDKFIQEMESLKT--KNQvlrTEKEKQDVYHHEHIEDLLDKQSRELQDMECCN 763
Cdd:TIGR01612 1401 LEECKSKIESTLDDKDIDECiKKIKELKNHILSEESNIDTyfKNA---DENNENVLLLFKNIEMADNKSQHILKIKKDNA 1477
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   764 NQKLLLEYEKYQELQLKSQRMQEEYEK---QLRDNDETKSQALEELTE----FYEAKLQEKttlLEEAQEDVRQQLREFE 836
Cdd:TIGR01612 1478 TNDHDFNINELKEHIDKSKGCKDEADKnakAIEKNKELFEQYKKDVTEllnkYSALAIKNK---FAKTKKDSEIIIKEIK 1554
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   837 ETKKQIEEDEDREIQDIKTKYEKKLRDEKE---SNLRLKGETGI------MRKKFSSLQkEIEERTNDIETlkgEQMKLQ 907
Cdd:TIGR01612 1555 DAHKKFILEAEKSEQKIKEIKKEKFRIEDDaakNDKSNKAAIDIqlslenFENKFLKIS-DIKKKINDCLK---ETESIE 1630
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634   908 GVIKSLEKDIQGLK--------REIQERDETIQDKEKRIYDLKKKNQElgkfkfvLDYKIKELKKQIEPREN--EIRVMk 977
Cdd:TIGR01612 1631 KKISSFSIDSQDTElkengdnlNSLQEFLESLKDQKKNIEDKKKELDE-------LDSEIEKIEIDVDQHKKnyEIGII- 1702
                          330       340       350
                   ....*....|....*....|....*....|....*...
gi 767902634   978 EQIQEM-EAELENFHKQNTQLELNITELWQKLRATDQE 1014
Cdd:TIGR01612 1703 EKIKEIaIANKEEIESIKELIEPTIENLISSFNTNDLE 1740
PRK01156 PRK01156
chromosome segregation protein; Provisional
780-1053 4.72e-03

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 41.43  E-value: 4.72e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  780 KSQRMQEEYEKQLRDNDETKSQALEELTEFYEAKLQEKTTLLEEAQEDVRQQL---REFEETKKQIEEDEDR--EIQDIK 854
Cdd:PRK01156  466 KSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSIneyNKIESARADLEDIKIKinELKDKH 545
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  855 TKYEKKlrDEKESNLRLkgetGIMRKKFSSLQKEIEERTN-DIETLKGEQMKLQGVIKSLEKDIQGLKREIQERDETIQD 933
Cdd:PRK01156  546 DKYEEI--KNRYKSLKL----EDLDSKRTSWLNALAVISLiDIETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYIDK 619
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  934 KEKRIYDlkKKNQELGKFKfvldyKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLeLNITELWQKLRATDQ 1013
Cdd:PRK01156  620 SIREIEN--EANNLNNKYN-----EIQENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEITSRI-NDIEDNLKKSRKALD 691
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 767902634 1014 EMRRERQKERDLEALVKRFKTDLHNCVAYIQEPRLLKEKV 1053
Cdd:PRK01156  692 DAKANRARLESTIEILRTRINELSDRINDINETLESMKKI 731
WDR74 cd22857
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and ...
322-399 4.86e-03

WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage of the pre-rRNA processing pathway. NVL2 is a type II double ring, AAA-ATPase, that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance and cell aggressiveness, by inducing nuclear beta-catenin accumulation and driving downstream Wnt-responsive genes expression. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain, and is required for nucleolar localization.


Pssm-ID: 439303 [Multi-domain]  Cd Length: 325  Bit Score: 40.67  E-value: 4.86e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  322 CLCFSPSEETLVASTSKNQLYSI---TMSLTEISKGepahfeylmypLHSAPITGLATCIRKPLIATCSLDRSIRLWNYE 398
Cdd:cd22857   228 AVAEDPDGHTVYVGDTSGDLASIdlrTGKLLGCFKG-----------KCGGSIRSIARHPELPLIASCGLDRYLRIWDTE 296

                  .
gi 767902634  399 T 399
Cdd:cd22857   297 T 297
mukB PRK04863
chromosome partition protein MukB;
707-924 5.16e-03

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 41.10  E-value: 5.16e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  707 EKIKELTD--KFIQE----MESLKTKNQVLRTEKE-----KQDVYHHEHIEDLLDKQSRELQDMeccNNQKLLLEYEKYQ 775
Cdd:PRK04863  901 EQLDEAEEakRFVQQhgnaLAQLEPIVSVLQSDPEqfeqlKQDYQQAQQTQRDAKQQAFALTEV---VQRRAHFSYEDAA 977
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  776 ELQLKSQRMQEEYEKQLRDNDETKSQALEELTEfYEAKLQEKTTLLEEAQE--DV-RQQLREFEetkkqieededREIQD 852
Cdd:PRK04863  978 EMLAKNSDLNEKLRQRLEQAEQERTRAREQLRQ-AQAQLAQYNQVLASLKSsyDAkRQMLQELK-----------QELQD 1045
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767902634  853 IKTKY----EKKLRDEKEsnlRLKGETGIMRKKFSSLQKEIEERTNDIETLkgeqmklQGVIKSLEKDIQGLKREI 924
Cdd:PRK04863 1046 LGVPAdsgaEERARARRD---ELHARLSANRSRRNQLEKQLTFCEAEMDNL-------TKKLRKLERDYHEMREQV 1111
PRK01156 PRK01156
chromosome segregation protein; Provisional
737-977 6.57e-03

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 40.66  E-value: 6.57e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  737 QDVYHHEHIEDLLDKQSRELQDMECCNNQKLLLEYEKYQElQLKSQRMQ----EEYEKQLRDNDETKSQALEELTEFYEA 812
Cdd:PRK01156  483 EKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSINEYN-KIESARADlediKIKINELKDKHDKYEEIKNRYKSLKLE 561
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  813 KLQEKTTLLEEAQE-----DVRQQLREFEETKKQIEEDEDR------EIQDIKTKYEKKLR--DEKESNLRLKgetgimR 879
Cdd:PRK01156  562 DLDSKRTSWLNALAvisliDIETNRSRSNEIKKQLNDLESRlqeieiGFPDDKSYIDKSIReiENEANNLNNK------Y 635
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  880 KKFSSLQKEIEERTNDIETLKGEQMKlqgvIKSLEKDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKFKFVLDYKI 959
Cdd:PRK01156  636 NEIQENKILIEKLRGKIDNYKKQIAE----IDSIIPDLKEITSRINDIEDNLKKSRKALDDAKANRARLESTIEILRTRI 711
                         250
                  ....*....|....*...
gi 767902634  960 KELKKQIEPRENEIRVMK 977
Cdd:PRK01156  712 NELSDRINDINETLESMK 729
PRK00409 PRK00409
recombination and DNA strand exchange inhibitor protein; Reviewed
801-911 6.92e-03

recombination and DNA strand exchange inhibitor protein; Reviewed


Pssm-ID: 234750 [Multi-domain]  Cd Length: 782  Bit Score: 40.58  E-value: 6.92e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  801 QALEELtefyEAKLQEKTTLLEEAQEDVRQQLREFEETKKQIEEDEDREIQDIKTKYEKKLRDEKE---------SNLRL 871
Cdd:PRK00409  523 ASLEEL----ERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKeadeiikelRQLQK 598
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 767902634  872 KGETGIMRKKFSSLQKEIEERTNDIETLKGEQMKLQGVIK 911
Cdd:PRK00409  599 GGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638
PRK01156 PRK01156
chromosome segregation protein; Provisional
886-1198 7.59e-03

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 40.66  E-value: 7.59e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  886 QKEIEERTNDIETLKGEQMKLQGVIKSLE---KDIQGLKREIQERDETIQDKEKRIYDLKKKNQELGKfkfvldyKIKEL 962
Cdd:PRK01156  151 RKKILDEILEINSLERNYDKLKDVIDMLRaeiSNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLK-------EIERL 223
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  963 KKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQlelnitelwqkLRATDQEMRRERQKERDLEALVKRFKtDLHNCVAY 1042
Cdd:PRK01156  224 SIEYNNAMDDYNNLKSALNELSSLEDMKNRYESE-----------IKTAESDLSMELEKNNYYKELEERHM-KIINDPVY 291
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634 1043 IQeprllKEKVRGLFEKYVQRADMVEI-AGLNTDLQqeytrQREHLERNLATLKKkvvkegelHRTDYVRIMQENVSLIK 1121
Cdd:PRK01156  292 KN-----RNYINDYFKYKNDIENKKQIlSNIDAEIN-----KYHAIIKKLSVLQK--------DYNDYIKKKSRYDDLNN 353
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767902634 1122 EINELRRELKFTRSQVYDLEaalKLTKKVRpqevsETEPSRDMLSTAPTARLNEQEETGRIIEMQRLEIQRLRDQIQ 1198
Cdd:PRK01156  354 QILELEGYEMDYNSYLKSIE---SLKKKIE-----EYSKNIERMSAFISEILKIQEIDPDAIKKELNEINVKLQDIS 422
PHA01351 PHA01351
putative minor structural protein
700-1006 9.49e-03

putative minor structural protein


Pssm-ID: 107029  Cd Length: 1070  Bit Score: 40.31  E-value: 9.49e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  700 LKDMNYSEKIKELTDKFIqemeslkTKNQVLRTEKekqdvyhhehiedLLDKQSRELQDMECCNNQKLLLEyekyqelqL 779
Cdd:PHA01351  455 FKDLGYPEEVRTVFDTMI-------TQSQLIQTNQ-------------LLLRQLQQIVSLGIFDQKKIKEE--------L 506
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  780 KSQRMQEEYEKQLRDNDETKSQALEELTEfYEAKLqeKTTLLeeAQEDVRQQLREFEETKKQIEededreiQDIKTKYEK 859
Cdd:PHA01351  507 KANKFNEQVALQILESELQFAQLQNQLKE-YQFKL--NNFLI--SPQDLEKDLKHLGFDSAIIS-------ALIYENQVE 574
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  860 KLRDEKESNLRlkgetGIMRKKFSSLqKEIEERTNDIETLKGEQMKLQGVIkSLEKDIQGLKREIQERDETIQ-DKEKRI 938
Cdd:PHA01351  575 QLIKFQLNNIE-----SLAKKGYLSL-DEIKKQFKAIGIIKEYEDAFINFY-NQELQISAFLTILKSQLRQFQiDPKEAE 647
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  939 YDLKKK--NQELGKFKFVLDYKIKELKKQIEPRENEIRVMKEQIQEMEAELENFHKQNTQLELNITELWQ 1006
Cdd:PHA01351  648 TELKKLniNEYLANQIIQEEYNINIAKLQLSVLETIAKTLYYDQQQLSGELKKIHKDKTALELYITKFYY 717
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
664-874 9.95e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 40.05  E-value: 9.95e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  664 EEVLVTKTDMEEKaqvMLELKTRVEELKMENEYQLrlkdmnySEKIKELtDKFIQEMESLKTKNQVLRTEKEKQDVYHHE 743
Cdd:PRK03918  559 AELEKKLDELEEE---LAELLKELEELGFESVEEL-------EERLKEL-EPFYNEYLELKDAEKELEREEKELKKLEEE 627
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767902634  744 hIEDLLDKQSRELQDMECCNNQ--KLLLEY--EKYQELQLKSQRMQEEYE------KQLRDNDETKSQALEELTEFYEA- 812
Cdd:PRK03918  628 -LDKAFEELAETEKRLEELRKEleELEKKYseEEYEELREEYLELSRELAglraelEELEKRREEIKKTLEKLKEELEEr 706
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 767902634  813 -KLQEKTTLLEEAQEDVrQQLRE-FEETKKQIEEDEDREIQDIKTKYEKKLRDEKESNLRLKGE 874
Cdd:PRK03918  707 eKAKKELEKLEKALERV-EELREkVKKYKALLKERALSKVGEIASEIFEELTEGKYSGVRVKAE 769
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH