|
Name |
Accession |
Description |
Interval |
E-value |
| BRE1 |
pfam08647 |
BRE1 E3 ubiquitin ligase; BRE1 is an E3 ubiquitin ligase that has been shown to act as a ... |
780-878 |
1.72e-21 |
|
BRE1 E3 ubiquitin ligase; BRE1 is an E3 ubiquitin ligase that has been shown to act as a transcriptional activator through direct activator interactions.
Pssm-ID: 462547 [Multi-domain] Cd Length: 95 Bit Score: 89.95 E-value: 1.72e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 780 LQQEMVKVTQEQEEQLASLDASKKELHIMEQKKLRVESKIEQEKKEQKEIEHHMKDLDNDLKKLNMLMNKNRCSSEELEQ 859
Cdd:pfam08647 1 LQTELVKLEQAFEELSEQLDKKVKDLTILEEKKLRLEAEKAKADQKYFAAMRSKDALENENKKLNTLLSKSSELIEQLKE 80
|
90
....*....|....*....
gi 148664197 860 nnrvTENEFVRSLKASERE 878
Cdd:pfam08647 81 ----TEKEFVRKLKNLEKE 95
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
299-1128 |
2.15e-14 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 78.17 E-value: 2.15e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 299 LNRQIEKLKL------DLQELVVATKQsrAQRQELGVNLYEVQQHLVHLQKLLEKSHDRHAMASSERRQKEEELQAARAL 372
Cdd:TIGR02168 198 LERQLKSLERqaekaeRYKELKAELRE--LELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLE 275
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 373 YTKTCAAANEERKKLAALQTEMENLALHL-FYMQNIDQDMRDDIRVMTQVV----KKAETERIRAEIEKKKQDLYVDQLT 447
Cdd:TIGR02168 276 VSELEEEIEELQKELYALANEISRLEQQKqILRERLANLERQLEELEAQLEelesKLDELAEELAELEEKLEELKEELES 355
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 448 TRAQQ--LEEDIALFEAQYLAQAEDTRILRKAVSEACTEIDAISVEKRRIMQQwasslvgmKHRDEAHRAVLEAlRGCQH 525
Cdd:TIGR02168 356 LEAELeeLEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEAR--------LERLEDRRERLQQ-EIEEL 426
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 526 QAKSTDGEIEAYKKSIMKEEEKNEKLASILNRTETEATLLQKLTTQCLTKQVALQSQFNTYRLTLQDTEDALSQDQ---- 601
Cdd:TIGR02168 427 LKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEgfse 506
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 602 ------LEQMILTEELQAIRQAIQGELELRRKTDAAIREKLQEHMTSNKTTkyFNQLILRLQKEKTNMMTHLskINGDIA 675
Cdd:TIGR02168 507 gvkallKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRLQAVVVENLNA--AKKAIAFLKQNELGRVTFL--PLDSIK 582
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 676 QTTLDITHTSSRldAHQKTLVELDQDVKKVNELITNSQSEISRRTILIERKQGLINFLNK--QLERMVSeLGGEEVGP-- 751
Cdd:TIGR02168 583 GTEIQGNDREIL--KNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKlrPGYRIVT-LDGDLVRPgg 659
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 752 ---------------LELEIKRLSKLIDEHDGKAVQAQVTWLRLQQEMVKVTQEQEEQLASLDASKKELHIMEQKKLRVE 816
Cdd:TIGR02168 660 vitggsaktnssileRRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLE 739
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 817 SKIEQEKKEQKEIEHHMKDLDNDLKKLNMLMNKNRCSSEELEQnNRVTENEFV----RSLKASERETIKMQDKLNQLSEE 892
Cdd:TIGR02168 740 AEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEA-EIEELEAQIeqlkEELKALREALDELRAELTLLNEE 818
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 893 KATLLNQLVEAEHQIMLWEKKIQLAKEM--RSSVDSEIGQTEIRAMKGEIHRMKVRLGQLLKQQEKMIRAMELAVARRET 970
Cdd:TIGR02168 819 AANLRERLESLERRIAATERRLEDLEEQieELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEE 898
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 971 VTTQaegQRKMDRKaltRTDFHHKQLELRRKIRDVRKATDECTKTVLELEETQRNVSSSLLEKQEKLSV-IQADFDTLEA 1049
Cdd:TIGR02168 899 LSEE---LRELESK---RSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENkIEDDEEEARR 972
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 1050 DLTRLGAlKRQNLSEI--VALQtrlkHLQAVKEgRYVFLFRSKQSLVLERQRLDKRLALI-ATILDRVRDEYPQFQEALH 1126
Cdd:TIGR02168 973 RLKRLEN-KIKELGPVnlAAIE----EYEELKE-RYDFLTAQKEDLTEAKETLEEAIEEIdREARERFKDTFDQVNENFQ 1046
|
..
gi 148664197 1127 KV 1128
Cdd:TIGR02168 1047 RV 1048
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
321-1054 |
1.47e-11 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 68.93 E-value: 1.47e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 321 RAQRQELGVNLYEVQQHLVHLQKLLE------KSHDRHAMASSE-RRQKEEELQAARALYTKTCAAANEERKKL------ 387
Cdd:TIGR02168 171 KERRKETERKLERTRENLDRLEDILNelerqlKSLERQAEKAERyKELKAELRELELALLVLRLEELREELEELqeelke 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 388 -----AALQTEMENLALHLFYMQNIDQDMRDDIRVMTQVVKKAETERIRAEIEKKKQDLYVDQLTTRAQQLEEDIALFEA 462
Cdd:TIGR02168 251 aeeelEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELES 330
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 463 QYLAQAEDTRILRKAVSEACTEIDAISVEKRRIMQQWASSLVGMKHRDEAHRAVLEALRGCQHQAKSTDGEIEAYKKSIM 542
Cdd:TIGR02168 331 KLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLE 410
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 543 KEEEKNEKLASILNRTETEATLLQKLttqcltkqvALQSQFNTYRLTLQDTEDALSQDQLEQMILTEELQAIRQAIQgel 622
Cdd:TIGR02168 411 RLEDRRERLQQEIEELLKKLEEAELK---------ELQAELEELEEELEELQEELERLEEALEELREELEEAEQALD--- 478
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 623 elrrktdaAIREKLQEHMTSNKTTKYFNQLILRLQKEKTNMMTHLSKINGdIAQTTLDITHTSSRLDAH---------QK 693
Cdd:TIGR02168 479 --------AAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSG-ILGVLSELISVDEGYEAAieaalggrlQA 549
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 694 TLVELDQDVKKVNELITnsQSEISRRTILIERKQGLINFLNKQLERMVSELGGEEVG-PLELEIKRLSKLID-------- 764
Cdd:TIGR02168 550 VVVENLNAAKKAIAFLK--QNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAkDLVKFDPKLRKALSyllggvlv 627
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 765 -------EHDGKAVQAQVTWLRLQQEMVK----VTQEQEEQLASLDASKKELHIMEQKKLRVESKIEQEKKEQKEIEHHM 833
Cdd:TIGR02168 628 vddldnaLELAKKLRPGYRIVTLDGDLVRpggvITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKEL 707
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 834 KDLDNDLKKLNMLMNKNRCSSEELEQNNRVTENEFVRSLKASER---ETIKMQDKLNQLSEEKATLLNQLVEAEHQIMLW 910
Cdd:TIGR02168 708 EELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQlskELTELEAEIEELEERLEEAEEELAEAEAEIEEL 787
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 911 EKKIQLAKEMRSSVDSEIG--QTEIRAMKGEIHRMKVRLGQLLKQQEKMIRAMELAVARRETVTTQAEGQRKMDRKALTR 988
Cdd:TIGR02168 788 EAQIEQLKEELKALREALDelRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEEL 867
|
730 740 750 760 770 780 790
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 148664197 989 TDFHHKQLELRRKIRDV--------RKATDECTKTVLELEETQRNVSSSLLEKQEKLSVIQADFDTLEADLTRL 1054
Cdd:TIGR02168 868 IEELESELEALLNERASleealallRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNL 941
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
538-928 |
1.48e-09 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 62.35 E-value: 1.48e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 538 KKSIMKEEEKNEKLASILNRTETEATLLQKLTTQCLTKQVALQSQFNTYRLTLQDTEDALSQDQLEQMILTEELQA-IRQ 616
Cdd:TIGR04523 231 KDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNNKKIKELEKQLNQLKSEISDLNNQKEQdWNK 310
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 617 AIQGELELRRKTDAAIREKLQEhmtSNKTTKYFNQLILRLQKEKTNMMTHLSKINGDIAQTTLDITHTSSRLDAHQKTLV 696
Cdd:TIGR04523 311 ELKSELKNQEKKLEEIQNQISQ---NNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIK 387
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 697 ELDQDVKKVNELITNSQSEISRRTILIERKQGLINFLNKQLERMVSELG---------GEEVGPLELEIKRLSKLIDEHD 767
Cdd:TIGR04523 388 NLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIknnseikdlTNQDSVKELIIKNLDNTRESLE 467
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 768 GKAVQAQVTWLRLQQEMVKVTQE---QEEQLASLDASKKELH----IMEQKKLRVESKIEQEKKEQKEIEHHMKDLDNDL 840
Cdd:TIGR04523 468 TQLKVLSRSINKIKQNLEQKQKElksKEKELKKLNEEKKELEekvkDLTKKISSLKEKIEKLESEKKEKESKISDLEDEL 547
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 841 KKLNMLMNKNRCSSEELEQNNRVTE-NEFVRSLKASEREtikMQDKLNQLSEEKATLLNQLVEAEHQIMLWEKKIQLAKE 919
Cdd:TIGR04523 548 NKDDFELKKENLEKEIDEKNKEIEElKQTQKSLKKKQEE---KQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKK 624
|
....*....
gi 148664197 920 MRSSVDSEI 928
Cdd:TIGR04523 625 ENEKLSSII 633
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
299-1014 |
4.53e-09 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 60.85 E-value: 4.53e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 299 LNRQIEKLKLDLQELVVATKQSRAQRQELGVNLYEVQQHLVHLQKLLEKSHDRHAMASSERRQKEEELQAARALYTKTCA 378
Cdd:TIGR02169 306 LERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRD 385
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 379 AANEERKKLAALQTEMENLALHLFYMQNIDQDMRDDIRVMTQVVKKAETERIRAEIEKKKQDLYVDQLTTRAQQLEEDIA 458
Cdd:TIGR02169 386 ELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLS 465
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 459 LFEAQYLAQAEDTRILRKAVSEACTEIDAisVEKRRimQQWASSLVGMKHR---------------------DEAHRAVL 517
Cdd:TIGR02169 466 KYEQELYDLKEEYDRVEKELSKLQRELAE--AEAQA--RASEERVRGGRAVeevlkasiqgvhgtvaqlgsvGERYATAI 541
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 518 EALRGCQHQAKSTDGEIEAyKKSIMKEEEKNEKLASI--LNRTETEATLLQKLTTQ----CLTKQVALQSQF-NTYRLTL 590
Cdd:TIGR02169 542 EVAAGNRLNNVVVEDDAVA-KEAIELLKRRKAGRATFlpLNKMRDERRDLSILSEDgvigFAVDLVEFDPKYePAFKYVF 620
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 591 QDT------EDALSQ-DQLEQMILTEELQAIRQAIQ-GELELRRKTDAAIREKLQEHMTSNKttkyfnqlILRLQKEKTN 662
Cdd:TIGR02169 621 GDTlvvediEAARRLmGKYRMVTLEGELFEKSGAMTgGSRAPRGGILFSRSEPAELQRLRER--------LEGLKRELSS 692
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 663 MMTHLSKINGDIAQTTLDITHTSSRLDAHQKTLVELDQDVKKVNELITNSQSEISRRTILIERKQGLINFLNKQLERMvs 742
Cdd:TIGR02169 693 LQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEEL-- 770
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 743 elgGEEVGPLELEIKRLSKLIDEHDGKAVQAQVtwlrlqQEMVKVTQEQEEQLASLDASKKELHIMEQkklrvesKIEQE 822
Cdd:TIGR02169 771 ---EEDLHKLEEALNDLEARLSHSRIPEIQAEL------SKLEEEVSRIEARLREIEQKLNRLTLEKE-------YLEKE 834
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 823 KKEQKEIEHHMKDLDNDLKKLNMLMNKNRCSSEELEQNNRVTENEFVRSLKASERETIKMQDKLNQLSEEKATLLNQLVE 902
Cdd:TIGR02169 835 IQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEK 914
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 903 AEHQIMLWEKKIQLAKEMRSSVDSEIGQTEiramkgEIHRMKVRLGQLlkqqEKMIRAMELAVARRETVTTQAEGQrkMD 982
Cdd:TIGR02169 915 KRKRLSELKAKLEALEEELSEIEDPKGEDE------EIPEEELSLEDV----QAELQRVEEEIRALEPVNMLAIQE--YE 982
|
730 740 750
....*....|....*....|....*....|..
gi 148664197 983 RKALTRTDFHHKQLELRRKIRDVRKATDECTK 1014
Cdd:TIGR02169 983 EVLKRLDELKEKRAKLEEERKAILERIEEYEK 1014
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
332-969 |
1.11e-08 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 59.75 E-value: 1.11e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 332 YEVQQHLVHLQ-KLLEKSHDRHAMASSERRQKE-------------EELQAARALYTKTCAAANEERKKLAALQTEMENL 397
Cdd:pfam15921 106 FYLRQSVIDLQtKLQEMQMERDAMADIRRRESQsqedlrnqlqntvHELEAAKCLKEDMLEDSNTQIEQLRKMMLSHEGV 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 398 aLHLFYMQNID-----------QDMRDDI--RVMTQVVKKAETErIRAEIEKKKQDLYV--DQLTTRAQQLEEDIALFEA 462
Cdd:pfam15921 186 -LQEIRSILVDfeeasgkkiyeHDSMSTMhfRSLGSAISKILRE-LDTEISYLKGRIFPveDQLEALKSESQNKIELLLQ 263
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 463 QYLAQAE--------DTRILRKAVSEACTEIDAISvEKRRIMQQWASSLVGMKHRDEAH-RAVLEALRGCQHQAKST--D 531
Cdd:pfam15921 264 QHQDRIEqlisehevEITGLTEKASSARSQANSIQ-SQLEIIQEQARNQNSMYMRQLSDlESTVSQLRSELREAKRMyeD 342
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 532 GEIEAYKKSIMKEEEKNEKLASILNRTETEATL---LQKLTTQC--LTKQVALQSQFNTyRLTLQDTEDALSQDQLEQMI 606
Cdd:pfam15921 343 KIEELEKQLVLANSELTEARTERDQFSQESGNLddqLQKLLADLhkREKELSLEKEQNK-RLWDRDTGNSITIDHLRREL 421
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 607 ---------LTEELQAIRQAIQGELELRRKTDAAIREKLQEHMTSNKTTKYFNQLILRLQKEKT--------------NM 663
Cdd:pfam15921 422 ddrnmevqrLEALLKAMKSECQGQMERQMAAIQGKNESLEKVSSLTAQLESTKEMLRKVVEELTakkmtlessertvsDL 501
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 664 MTHLSKINGDIAQTTLDITHTSSRLDAHQKTLveldQDVKKVNELITNSQSEISRRTILIERKQGLINFLNKQLERMVSE 743
Cdd:pfam15921 502 TASLQEKERAIEATNAEITKLRSRVDLKLQEL----QHLKNEGDHLRNVQTECEALKLQMAEKDKVIEILRQQIENMTQL 577
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 744 LG--GEEVGPLELEIKRLSKLIDEHDGKAVQAQVtwlrLQQEMVKVTQEQEEQLASLDASKKELHIMEQKKLRVESKIEQ 821
Cdd:pfam15921 578 VGqhGRTAGAMQVEKAQLEKEINDRRLELQEFKI----LKDKKDAKIRELEARVSDLELEKVKLVNAGSERLRAVKDIKQ 653
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 822 EKKE-QKEIEHHMKDLDNDLKKLNMLMNKNRCSSEELEqnnrVTENEFVRSLKASERETIKMQDKLNQLSEEKATLL--- 897
Cdd:pfam15921 654 ERDQlLNEVKTSRNELNSLSEDYEVLKRNFRNKSEEME----TTTNKLKMQLKSAQSELEQTRNTLKSMEGSDGHAMkva 729
|
650 660 670 680 690 700 710
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 148664197 898 ----NQLVEAEHQIMLWEKKIQLAKEMRSSVDSeigqtEIRAMKGEIHRMKVRLGQLLKQQEKMirAMELAVARRE 969
Cdd:pfam15921 730 mgmqKQITAKRGQIDALQSKIQFLEEAMTNANK-----EKHFLKEEKNKLSQELSTVATEKNKM--AGELEVLRSQ 798
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
299-950 |
4.00e-08 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 57.64 E-value: 4.00e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 299 LNRQIEKLKLDlqelvvATKQSRAQRQELGVNLYEVQQHLVHLQKLleksHDRHAMASSERRQKEEELQAARALYTKTCA 378
Cdd:COG1196 198 LERQLEPLERQ------AEKAERYRELKEELKELEAELLLLKLREL----EAELEELEAELEELEAELEELEAELAELEA 267
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 379 AANEERKKLAALQTEMENLalhlfymQNIDQDMRDDIRVMTQVVKKAETERIRAEIEKKKQDLYVDQLTTRAQQLEEDIA 458
Cdd:COG1196 268 ELEELRLELEELELELEEA-------QAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELE 340
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 459 LFEAQYLAQAEDTRILRKAVSEACTEIDAISVEKRRIMQQWasslvgmkhrDEAHRAVLEALRgcqhqakstdgEIEAYK 538
Cdd:COG1196 341 ELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEEL----------EELAEELLEALR-----------AAAELA 399
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 539 KSIMKEEEKNEKLASILNRTETEATLLQKLTTQCLTKQVALQSQFNTYRLTLQDTEDAlsQDQLEQMILTEELQAIRQAI 618
Cdd:COG1196 400 AQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEE--EEALLELLAELLEEAALLEA 477
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 619 QGELELRRKTDAAIREKLQEHMTSNKTTKYFNQLILRLQKEKTNMMTHLSKINGDIAQTTLDITHTSSRLDAHQktLVEL 698
Cdd:COG1196 478 ALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNI--VVED 555
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 699 DQDVKKVNELItnSQSEISRRTILIERKQGLINFLNKQLERMVSELGGEEVGPLELEIKRLSKLIDEHDGKAVQAQVTWL 778
Cdd:COG1196 556 DEVAAAAIEYL--KAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLE 633
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 779 RLQQEMVKVTQEQEEQLASLDASKKELHIMEQKKLRVESKIEQEKKEQKEIEhhmkdldndlkklnmlmnknrcsSEELE 858
Cdd:COG1196 634 AALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELA-----------------------ERLAE 690
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 859 QNNRVTENEFVRSLKASERETIKMQDKLNQLSEEKATLLNQLVEAEHQIMLWEKKIQLAKEMRSSVDSEIGQTEIRAmkg 938
Cdd:COG1196 691 EELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELER--- 767
|
650
....*....|..
gi 148664197 939 EIHRMKVRLGQL 950
Cdd:COG1196 768 ELERLEREIEAL 779
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
870-1073 |
1.50e-07 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 55.16 E-value: 1.50e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 870 RSLKASERETIKMQDKLNQLSEEKATLLNQLVEAEHQIMLWEKKIQLAKEMRSSVDSEIG--QTEIRAMKGEIHRMKVRL 947
Cdd:COG4942 27 AELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAelEKEIAELRAELEAQKEEL 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 948 GQLLKQQEKMIRAMELAV----------ARRETVTTQAEGQRKMDRKALTRTdfhhkQLELRRKIRDVRKATDECTKTVL 1017
Cdd:COG4942 107 AELLRALYRLGRQPPLALllspedfldaVRRLQYLKYLAPARREQAEELRAD-----LAELAALRAELEAERAELEALLA 181
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 148664197 1018 ELEETQRNVSSSLLEKQEKLSVIQADFDTLEADLTRLGALKRQNLSEIVALQTRLK 1073
Cdd:COG4942 182 ELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAA 237
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
239-988 |
4.30e-07 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 54.29 E-value: 4.30e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 239 GDLPVFQDQIQQPSTEEGAMAERVESEGSDEEAEDEgsQLVVL-DPDHPLMVRFQAALKNY--LNRQIEKLKLDLQELVV 315
Cdd:TIGR02168 232 LRLEELREELEELQEELKEAEEELEELTAELQELEE--KLEELrLEVSELEEEIEELQKELyaLANEISRLEQQKQILRE 309
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 316 ATKQSRAQRQELGVNLYEVQQHLVHLQKLLEKSHDR-------HAMASSERRQKEEELQAARALYTKTCAAANEERKKLA 388
Cdd:TIGR02168 310 RLANLERQLEELEAQLEELESKLDELAEELAELEEKleelkeeLESLEAELEELEAELEELESRLEELEEQLETLRSKVA 389
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 389 ALQTEMENLALHLFYM---QNIDQDMRD----DIRVMTQVVKKAETERIRAEIEKKKQDLyvDQLTTRAQQLEEDIALFE 461
Cdd:TIGR02168 390 QLELQIASLNNEIERLearLERLEDRRErlqqEIEELLKKLEEAELKELQAELEELEEEL--EELQEELERLEEALEELR 467
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 462 AQYLAQAEDTRILRKAVSEACTEIDAISVEKRRIMQ---------QWASSLVGMKHR-------DEAHRAVLEALRGCQH 525
Cdd:TIGR02168 468 EELEEAEQALDAAERELAQLQARLDSLERLQENLEGfsegvkallKNQSGLSGILGVlselisvDEGYEAAIEAALGGRL 547
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 526 QAKSTDGE------IEAYKK-----------SIMKEEEKNEKLASILNRTETEATLLQKLTTQCLTKQVALQSQFNTYRL 588
Cdd:TIGR02168 548 QAVVVENLnaakkaIAFLKQnelgrvtflplDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLV 627
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 589 --TLQDTEDALSQDQLEQMILTEELQAIR----------QAIQGELELRRKTD----------AAIREKLQEHMTSNKTT 646
Cdd:TIGR02168 628 vdDLDNALELAKKLRPGYRIVTLDGDLVRpggvitggsaKTNSSILERRREIEeleekieeleEKIAELEKALAELRKEL 707
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 647 KYFNQLILRLQKEKTNMMTHLSKINGDIAQTTLDITHTSSRLDAHQKTLVELDQDVKKVNELITNSQSEISRRTILIERK 726
Cdd:TIGR02168 708 EELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEEL 787
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 727 QGLINFLN---KQLERMVSELGGE------EVGPLELEIKRLSKLIDEHDGKAVQAQVTWLRLQQEMVKVTQEQEEQLAS 797
Cdd:TIGR02168 788 EAQIEQLKeelKALREALDELRAEltllneEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEEL 867
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 798 LDASKKELHI-------MEQKKLRVESKIEQEKKEQKEIEHHMKDLDNDLKKLNMLMNKNRCSSEELEQN-NRVTEnefv 869
Cdd:TIGR02168 868 IEELESELEAllnerasLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRiDNLQE---- 943
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 870 rslKASERETIKMQDklnqLSEEKATLLNQLVEAEHQIMLWEKKIQlakemrssvdsEIGQTEIRAMKgEIHRMKVRLGQ 949
Cdd:TIGR02168 944 ---RLSEEYSLTLEE----AEALENKIEDDEEEARRRLKRLENKIK-----------ELGPVNLAAIE-EYEELKERYDF 1004
|
810 820 830
....*....|....*....|....*....|....*....
gi 148664197 950 LLKQQEKMIRAmelavarretVTTQAEGQRKMDRKALTR 988
Cdd:TIGR02168 1005 LTAQKEDLTEA----------KETLEEAIEEIDREARER 1033
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
423-1128 |
5.56e-07 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 54.17 E-value: 5.56e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 423 KKAETER--------------IRAEIEKKKQDLYVD-QLTTRAQQLEEDIALFEAQYLAQAEDTRILRKAVSEActEIDA 487
Cdd:COG1196 173 RKEEAERkleateenlerledILGELERQLEPLERQaEKAERYRELKEELKELEAELLLLKLRELEAELEELEA--ELEE 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 488 ISVEKRRIMQQWASSLVGMKHRDEAHRAVLEALRGCQHQAKSTDGEIEAYKKSIMKEEEKNEKLASILNRTETEATLLQK 567
Cdd:COG1196 251 LEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEE 330
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 568 LTTQCLTKQVALQSQFNTYRLTLQDTEDALSQDQLEQMILTEELQAIRQAIQGELELRRKTDAAIREKLQEHMTSNKTTK 647
Cdd:COG1196 331 ELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEE 410
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 648 YFNQLILRLQKEKTNMMTHLSKINGDIAQTTLDITHTSSRLDAHQKTLVELDQDVKKVNELITNSQSEISRRTILIERKQ 727
Cdd:COG1196 411 ALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAA 490
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 728 GLINFLNKQLERMVSELGGEEVGPLELEIKRLSKLIDE--HDGKAVQAQVTwLRLQQEMVKVTQEQEEQLASLDASKKEl 805
Cdd:COG1196 491 ARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVliGVEAAYEAALE-AALAAALQNIVVEDDEVAAAAIEYLKA- 568
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 806 himeQKKLRVE----SKIEQEKKEQKEIEHHMKDLDNDLkklnmlmnknrcsseelEQNNRVTENEFVRSLKASERETIK 881
Cdd:COG1196 569 ----AKAGRATflplDKIRARAALAAALARGAIGAAVDL-----------------VASDLREADARYYVLGDTLLGRTL 627
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 882 MQDKLNQLSEEKATLLNQLVEAEhqimlwekkiqLAKEMRSSVDSEIGQTEIRAMKGEIHRMKVRLGQLLKQQEKMIRAM 961
Cdd:COG1196 628 VAARLEAALRRAVTLAGRLREVT-----------LEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELE 696
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 962 ELAVARRETVTTQAEGQRKMDRKALTRTDFHHKQLELRRKIRDVRkatdectktvLELEETQRNVSSSLLEKQEKLSVIQ 1041
Cdd:COG1196 697 EALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEEL----------LEEEELLEEEALEELPEPPDLEELE 766
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 1042 ADFDTLEADLTRLGALkrqNLseiVALQTrlkhLQAVKEgRYVFLFRSKQSLVLERQRLdkrLALIATILDRVRDeypQF 1121
Cdd:COG1196 767 RELERLEREIEALGPV---NL---LAIEE----YEELEE-RYDFLSEQREDLEEARETL---EEAIEEIDRETRE---RF 829
|
....*..
gi 148664197 1122 QEALHKV 1128
Cdd:COG1196 830 LETFDAV 836
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
603-1053 |
6.97e-06 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 50.40 E-value: 6.97e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 603 EQMILTE-ELQAIRQAIQGELELRRKTDAAIREKLQEHMTSNKTTKYFNQLILRLQKEKTNMMTHLSKINGDIAQTTLDI 681
Cdd:TIGR04523 117 EQKNKLEvELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKL 196
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 682 T---HTSSRLDAH-------QKTLVELDQDVKKVNELITNSQSEISRRTILIERKQGLINFLNKQLERMVSELGGEEvgp 751
Cdd:TIGR04523 197 LkleLLLSNLKKKiqknkslESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQ--- 273
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 752 leLEIKRLSKLID--EHDGKAVQAQVTWLRLQQE---MVKVTQEQEEQLASLDASKKELHIMEQKKLRVESKIEQEKKEQ 826
Cdd:TIGR04523 274 --KELEQNNKKIKelEKQLNQLKSEISDLNNQKEqdwNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKEL 351
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 827 KEIEHHMKDLDNDLKKlnmlmNKNRCSSEELEQNNRVTENEFVRSLKASERETIKMQDKLNQLSEEKAtllnQLVEAEHQ 906
Cdd:TIGR04523 352 TNSESENSEKQRELEE-----KQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQI----KKLQQEKE 422
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 907 IMlwEKKIQLAKEMRSSVDSEIgqteiRAMKGEIHRMKV---RLGQLLKQQEKMIRAMELAVarRETVTTQAEGQRKMDR 983
Cdd:TIGR04523 423 LL--EKEIERLKETIIKNNSEI-----KDLTNQDSVKELiikNLDNTRESLETQLKVLSRSI--NKIKQNLEQKQKELKS 493
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 984 KAltrtdfhHKQLELRRKIRDVRKATDECTKTVLELEETQRNVSSSLLEKQEKLSVIQADFDTLEADLTR 1053
Cdd:TIGR04523 494 KE-------KELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELKK 556
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
298-1064 |
4.87e-05 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 47.76 E-value: 4.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 298 YLNRQIEKLKLDLQELVVATKQSRAQRQELGVNLYEVQQHLVHLQKLLEKSHDRHAMASSERRQKEEELQAaralytKTC 377
Cdd:TIGR02169 213 YQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKD------LGE 286
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 378 AAANEERKKLAALQTEMENLALHLFYMQNIDQDMRDDIRVMtqvvkKAETERIRAEIEKKKQDL-----YVDQLTTRAQQ 452
Cdd:TIGR02169 287 EEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKL-----EAEIDKLLAEIEELEREIeeerkRRDKLTEEYAE 361
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 453 LEEDIALFEAQYLAQAEDTRILRKAVSEACTEIDAISVEKRRIMQQWASSLVGMKHRDEAHRAVLEALRGCQHQAKSTDG 532
Cdd:TIGR02169 362 LKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEE 441
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 533 EIEAYKKSIMKEEEKNEKLASILNRTETEATLLQKLTTQCLTKQVALQSQFNTYRLTLQDTEDALSQDQLEQMILTEELQ 612
Cdd:TIGR02169 442 EKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVEEVLKASIQ 521
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 613 AI----RQAIQGELELRRKTDAAIREKLQEHMTSNKTTKyfNQLILRLQKEKTNMMTHLSKINGDIAQTTLDITHTSSRL 688
Cdd:TIGR02169 522 GVhgtvAQLGSVGERYATAIEVAAGNRLNNVVVEDDAVA--KEAIELLKRRKAGRATFLPLNKMRDERRDLSILSEDGVI 599
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 689 DaHQKTLVELDQDVKKVnelitnsQSEISRRTILIER----KQGLINFlnkqleRMVSelggeevgpLELEIKRLSKLID 764
Cdd:TIGR02169 600 G-FAVDLVEFDPKYEPA-------FKYVFGDTLVVEDieaaRRLMGKY------RMVT---------LEGELFEKSGAMT 656
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 765 EHDGKAVQAQVTWLRLQQEMVKVTQEQEEQLASLDASKKELHIMEQKKLRVESKIEQEKKEQKEIEHHMKDLDNDLKKLN 844
Cdd:TIGR02169 657 GGSRAPRGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLK 736
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 845 MLMNKNRCSSEELEQnnRVTENEfvRSLKASERETIKMQDKLNQLSEEKATLLNQLveAEHQIMLWEKKIQLAKEMRSSV 924
Cdd:TIGR02169 737 ERLEELEEDLSSLEQ--EIENVK--SELKELEARIEELEEDLHKLEEALNDLEARL--SHSRIPEIQAELSKLEEEVSRI 810
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 925 DSEIGQTEiramkGEIHRMKVRLGQLLKQQEKMIRAMELAVARRETVTTQAEGQRKMDRKALTRTDFHHKQL-ELRRKIR 1003
Cdd:TIGR02169 811 EARLREIE-----QKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALrDLESRLG 885
|
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 148664197 1004 DVRKATDECTKTVLELEETQRNVSSSLLEKQEKLSVIQADFDTLEADLTRLGALKRQNLSE 1064
Cdd:TIGR02169 886 DLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEI 946
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
723-1048 |
7.31e-05 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 46.98 E-value: 7.31e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 723 IERKQGLINFLNKQLERMVSELGG--------EEVGPLELEIKRLSKLIDEHDGKAVQAQVTWLRLQQEMVKV-TQEQEE 793
Cdd:TIGR02169 186 IERLDLIIDEKRQQLERLRREREKaeryqallKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEeISELEK 265
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 794 QLASLDASKKELH--IME---------QKKLR-VESKIEQEKKEQKEIEHHMKDLDNDLKKLNMLMNKNRCSSEELEqnn 861
Cdd:TIGR02169 266 RLEEIEQLLEELNkkIKDlgeeeqlrvKEKIGeLEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELE--- 342
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 862 rvtenefvRSLKASERETIKMQDKLNQLSEEKATLLNQLVEAEHQIMLWEKKIQLAKEMRSSVDSEIgqteiramkGEIH 941
Cdd:TIGR02169 343 --------REIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREI---------NELK 405
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 942 RMKVRLGQLLKQQEKMIRAMELAVARRETVTTQAEGQRKMDRKALTrtdfhhkqlELRRKIRDVRKATDECTKTVLELEE 1021
Cdd:TIGR02169 406 RELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIK---------KQEWKLEQLAADLSKYEQELYDLKE 476
|
330 340
....*....|....*....|....*..
gi 148664197 1022 TQRNVSSSLLEKQEKLSVIQADFDTLE 1048
Cdd:TIGR02169 477 EYDRVEKELSKLQRELAEAEAQARASE 503
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
727-1108 |
7.46e-05 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 47.04 E-value: 7.46e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 727 QGLINFLNKQLERMVSELGGEEVGPLELEIKRLSKLIDEHdgkavqaqvtwlrlQQEMVKVTQEQEEQLASLDASKKELH 806
Cdd:pfam15921 237 KGRIFPVEDQLEALKSESQNKIELLLQQHQDRIEQLISEH--------------EVEITGLTEKASSARSQANSIQSQLE 302
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 807 IMEqkklrveskiEQEKKEQKEIEHHMKDLDNDLKKLNmlmnknrcssEELEQNNRVTEN---EFVRSLKASERETIKMQ 883
Cdd:pfam15921 303 IIQ----------EQARNQNSMYMRQLSDLESTVSQLR----------SELREAKRMYEDkieELEKQLVLANSELTEAR 362
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 884 DKLNQLSEEKATLLNQLVEAEHQIMLWEKKIQLAKEMRSSV-DSEIGQT---------------EIRAMKGEIHRMKVRL 947
Cdd:pfam15921 363 TERDQFSQESGNLDDQLQKLLADLHKREKELSLEKEQNKRLwDRDTGNSitidhlrrelddrnmEVQRLEALLKAMKSEC 442
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 948 GQLLKQQEKMIRAMELAVARRETVTTQAEGQRKMdrkaltrtdfhhkqlelrrkirdVRKATDECTKTVLELEETQRNVS 1027
Cdd:pfam15921 443 QGQMERQMAAIQGKNESLEKVSSLTAQLESTKEM-----------------------LRKVVEELTAKKMTLESSERTVS 499
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 1028 SSLLEKQEKLSVIQADfdtlEADLTRLGALKRQNLSEIVALQTRLKHLQAVK-EGRYVFLFRSKQSLVLE--RQRLDKRL 1104
Cdd:pfam15921 500 DLTASLQEKERAIEAT----NAEITKLRSRVDLKLQELQHLKNEGDHLRNVQtECEALKLQMAEKDKVIEilRQQIENMT 575
|
....
gi 148664197 1105 ALIA 1108
Cdd:pfam15921 576 QLVG 579
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
528-1048 |
8.10e-05 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 46.98 E-value: 8.10e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 528 KSTDGEIEAYKKSIMKEEEKNEKLASI---LNRTETEATLLQKLTTQCLTKQVALQSQFntyrLTLQDTEDALSQDQLEQ 604
Cdd:PRK03918 172 KEIKRRIERLEKFIKRTENIEELIKEKekeLEEVLREINEISSELPELREELEKLEKEV----KELEELKEEIEELEKEL 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 605 MILTEELQAIRQAIQGELELRRKTDAAIREkLQEHMTSNKTTKYFNQLILRLQKEKTNMMTHLSKINgdiaqttLDITHT 684
Cdd:PRK03918 248 ESLEGSKRKLEEKIRELEERIEELKKEIEE-LEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIE-------KRLSRL 319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 685 SSRLDAHQKTLVELDQDVKKVNELiTNSQSEISRRTILIERKQGL---INFLNKQLERMVSELGGEEVGPLELEIKRLSK 761
Cdd:PRK03918 320 EEEINGIEERIKELEEKEERLEEL-KKKLKELEKRLEELEERHELyeeAKAKKEELERLKKRLTGLTPEKLEKELEELEK 398
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 762 LIDEhdgkavqaqvtwlrLQQEMVKVTQEqeeqLASLDASKKELH--IMEQKKLRVE---------------------SK 818
Cdd:PRK03918 399 AKEE--------------IEEEISKITAR----IGELKKEIKELKkaIEELKKAKGKcpvcgrelteehrkelleeytAE 460
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 819 IEQEKKEQKEIEHHMKDLDNDLKKLNMLMN------KNRCSSEELEQNNRVTENEFVRSLKASERETIKMQDKLNQLSEE 892
Cdd:PRK03918 461 LKRIEKELKEIEEKERKLRKELRELEKVLKkeseliKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGE 540
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 893 KATL---LNQLVEAEHQIMLWEKKIQLAKEMRSSVDSEIGQTEIramkGEIHRMKVRLGQLLKQQEKMIramELAVARRE 969
Cdd:PRK03918 541 IKSLkkeLEKLEELKKKLAELEKKLDELEEELAELLKELEELGF----ESVEELEERLKELEPFYNEYL---ELKDAEKE 613
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 970 TvttqaegQRKMDRKALTRTDFHhkqlELRRKIRDVRKATDECTKTVLEL-----EETQRNVSSSLLEKQEKLSVIQADF 1044
Cdd:PRK03918 614 L-------EREEKELKKLEEELD----KAFEELAETEKRLEELRKELEELekkysEEEYEELREEYLELSRELAGLRAEL 682
|
....
gi 148664197 1045 DTLE 1048
Cdd:PRK03918 683 EELE 686
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
411-1134 |
1.89e-04 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 45.81 E-value: 1.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 411 MRDDIRVMTQVVKKaeteRIRAEIE-KKKQDLYVDQLTTRAQQLEE--DIALFEaqylaqaEDTRILRKAVSEACTEIDa 487
Cdd:TIGR01612 1116 IKDDIKNLDQKIDH----HIKALEEiKKKSENYIDEIKAQINDLEDvaDKAISN-------DDPEEIEKKIENIVTKID- 1183
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 488 isveKRRIMQQWASSLVGMKHRDEAHRAVLEALRGCQ----------------HQAKSTDGEIEAYKKSIMKEEEKNEKL 551
Cdd:TIGR01612 1184 ----KKKNIYDEIKKLLNEIAEIEKDKTSLEEVKGINlsygknlgklflekidEEKKKSEHMIKAMEAYIEDLDEIKEKS 1259
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 552 ASILNRTETEATLLQKLTTQCLTkqvalQSQFNTYRLTLQDTEDALSQDQLEQMILTE------ELQAIRQAIQGELELR 625
Cdd:TIGR01612 1260 PEIENEMGIEMDIKAEMETFNIS-----HDDDKDHHIISKKHDENISDIREKSLKIIEdfseesDINDIKKELQKNLLDA 1334
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 626 RKTDAAIREKLqehmtsNKTTKYFNqlILRLQ---------KEKTNMMTHLSK-INGDIAQTTLDITHTSSRLDAHQ-KT 694
Cdd:TIGR01612 1335 QKHNSDINLYL------NEIANIYN--ILKLNkikkiidevKEYTKEIEENNKnIKDELDKSEKLIKKIKDDINLEEcKS 1406
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 695 LVELDQDVKKVNELITNSQseISRRTILIER--------------KQGLINFLNKQL-----ERMVSELGGEEVGPLELE 755
Cdd:TIGR01612 1407 KIESTLDDKDIDECIKKIK--ELKNHILSEEsnidtyfknadennENVLLLFKNIEMadnksQHILKIKKDNATNDHDFN 1484
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 756 IKRLSKLIDEHDGKAVQAQvtwlrlqqemvKVTQEQEEQLASLDASKKELHIMEQK--KLRVESKIEQEKKEQKEIEHHM 833
Cdd:TIGR01612 1485 INELKEHIDKSKGCKDEAD-----------KNAKAIEKNKELFEQYKKDVTELLNKysALAIKNKFAKTKKDSEIIIKEI 1553
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 834 KDLDNDL--------KKLNMLMNKNRCSSEELEQNNRvtENEFVRSLKASERetiKMQDKLNQLSEEKATLLNQLVEAEH 905
Cdd:TIGR01612 1554 KDAHKKFileaekseQKIKEIKKEKFRIEDDAAKNDK--SNKAAIDIQLSLE---NFENKFLKISDIKKKINDCLKETES 1628
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 906 qimlWEKKIQlakemRSSVDSEigQTEIRAMKGEIHRMKVRLgQLLKQQEKMIramelavarrETVTTQAEGQRKMDRKA 985
Cdd:TIGR01612 1629 ----IEKKIS-----SFSIDSQ--DTELKENGDNLNSLQEFL-ESLKDQKKNI----------EDKKKELDELDSEIEKI 1686
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 986 LTRTDFHHKQLELR--RKIRDVRKATDECTKTVLEL-EETQRNVSSSL-------LEKQEKLSVIQADFDTLEADLTRL- 1054
Cdd:TIGR01612 1687 EIDVDQHKKNYEIGiiEKIKEIAIANKEEIESIKELiEPTIENLISSFntndlegIDPNEKLEEYNTEIGDIYEEFIELy 1766
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 1055 ----GALKRQNLSEIVALQTRLKHLQAVKEGRYVFLFRSKQSLVL---ERQRLDKRLALIATILDRVRD----EYPQFQE 1123
Cdd:TIGR01612 1767 niiaGCLETVSKEPITYDEIKNTRINAQNEFLKIIEIEKKSKSYLddiEAKEFDRIINHFKKKLDHVNDkftkEYSKINE 1846
|
810
....*....|.
gi 148664197 1124 ALHKVSQMIAN 1134
Cdd:TIGR01612 1847 GFDDISKSIEN 1857
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
294-1065 |
2.84e-04 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 45.34 E-value: 2.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 294 ALKNYLNRQIEKLKLDLQELVVATKQSRAQRQELGVNLYEVQQHLVHLQKLLEKSHD---RHAMASSERRQKEEELQAAR 370
Cdd:TIGR00618 187 AKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAyltQKREAQEEQLKKQQLLKQLR 266
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 371 ALYTK--TCAAANEERKKLAALQTEMENLALHLFYMQNIDQDMRddiRVMTQVvkkAETERIRAEIEKKKQDLYVDQLTT 448
Cdd:TIGR00618 267 ARIEElrAQEAVLEETQERINRARKAAPLAAHIKAVTQIEQQAQ---RIHTEL---QSKMRSRAKLLMKRAAHVKQQSSI 340
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 449 RAQQLEEDIALFEAQYLAQAEDTRILRKAVSEACTEIdaisvekRRIMQQWASSLVGMKHRDEAHRAVLEALRGCQHQAK 528
Cdd:TIGR00618 341 EEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTL-------TQHIHTLQQQKTTLTQKLQSLCKELDILQREQATID 413
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 529 STDGEIEAYKKSIMKeEEKNEKLAsiLNRTETEATLLQKLTTQCLTKQVALQSQFNTYRLTLQDTEDALSQDQLEQMILT 608
Cdd:TIGR00618 414 TRTSAFRDLQGQLAH-AKKQQELQ--QRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKKA 490
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 609 EELQaIRQAIQGELELRRKTDAAIREKLQEHMTSNKTTKYFNQLILRLQKEKTnmmthlskingDIAQTTLDITHTSSRL 688
Cdd:TIGR00618 491 VVLA-RLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQLET-----------SEEDVYHQLTSERKQR 558
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 689 DAHQKTLVELDQDVKKVNELITNSQSEISRRTILIERKQGLINfLNKQLERMVSELGGEEVgpLELEIKRLSKLIDEHDG 768
Cdd:TIGR00618 559 ASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTE-KLSEAEDMLACEQHALL--RKLQPEQDLQDVRLHLQ 635
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 769 KAVQaqvtwlRLQQEMVKVTQEQEEqLASLDASKKELHIMEQKKLRVESKIEQEKKEQKEIEHhmkdLDNDLKKLNMLMN 848
Cdd:TIGR00618 636 QCSQ------ELALKLTALHALQLT-LTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQ----LTYWKEMLAQCQT 704
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 849 KNRCSSEELEQNNR-VTENEFVRSLKASE--RETIKMQDKLNQLSEEKATLLNQLVEAEHQIMLWE--KKIQLAKEMRSS 923
Cdd:TIGR00618 705 LLRELETHIEEYDReFNEIENASSSLGSDlaAREDALNQSLKELMHQARTVLKARTEAHFNNNEEVtaALQTGAELSHLA 784
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 924 VDSEIGQTEIRAMKGEIHRMKVRLGQLLKQQEKMIRA-MELAVARRETVTTQAEgqrkmdrkaltrtDFHHKQLELRRKI 1002
Cdd:TIGR00618 785 AEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLqCETLVQEEEQFLSRLE-------------EKSATLGEITHQL 851
|
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 148664197 1003 rdvrKATDECTKTVLELEETQRNVsSSLLEKQEKLSVIQADFD-----TLEADLTRLGALKRQNLSEI 1065
Cdd:TIGR00618 852 ----LKYEECSKQLAQLTQEQAKI-IQLSDKLNGINQIKIQFDgdaliKFLHEITLYANVRLANQSEG 914
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
533-1108 |
5.88e-04 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 44.19 E-value: 5.88e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 533 EIEAYKKSIMKEEEKNEKLASILNRTETEATLLQKLTTQCLTKQVALQSQFNTYRLTLQDTEDALSQDQLEQMILTEELQ 612
Cdd:pfam02463 252 EIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELK 331
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 613 AIRQAIqgELELRRKTDAAIREKLQEHMTSNKTTKyfnQLILRLQKEKtnmmthlskingDIAQTTLDITHTSSRLDAHQ 692
Cdd:pfam02463 332 KEKEEI--EELEKELKELEIKREAEEEEEEELEKL---QEKLEQLEEE------------LLAKKKLESERLSSAAKLKE 394
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 693 KTLVELDQDVKKVNELITNSQSEISrrtILIERKQGLINFLNKQLERMVSELGGEEVGPLELEIKRLSKLIDEHDGKAVQ 772
Cdd:pfam02463 395 EELELKSEEEKEAQLLLELARQLED---LLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSE 471
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 773 aqvtwlRLQQEMVKVTQEQEEQLASLDASKKELHIMEQKKLRVESKIEQEKKEQKEIEHHMKDLDNDLKKLNMLMNKNRC 852
Cdd:pfam02463 472 ------DLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAI 545
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 853 SSEELEQNNRVTENEFVR-SLKASERETIKMQDKLNQLSEEKATLLNQLVEAEHQIMLWEKKIQLAKEMRSS----VDSE 927
Cdd:pfam02463 546 STAVIVEVSATADEVEERqKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEddkrAKVV 625
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 928 IGQTEIRAMKGEIHRMKVRLGQLLKQQEKMI-------RAMELAVARRETVTTQAEGQRK---MDRKALTRTDFHHKQLE 997
Cdd:pfam02463 626 EGILKDTELTKLKESAKAKESGLRKGVSLEEglaekseVKASLSELTKELLEIQELQEKAeseLAKEEILRRQLEIKKKE 705
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 998 LRRKIRDVRKATDECTKTVLELEETQRNVSSSLLEKQEKLSVIQADFDTLEADLTRLGALKRQNLSEIVALQTRLKHLQA 1077
Cdd:pfam02463 706 QREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEK 785
|
570 580 590
....*....|....*....|....*....|.
gi 148664197 1078 VKEGRYVFLFRSKQSLVLERQRLDKRLALIA 1108
Cdd:pfam02463 786 LKVEEEKEEKLKAQEEELRALEEELKEEAEL 816
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
301-962 |
5.94e-04 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 44.27 E-value: 5.94e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 301 RQIEKLKLDLQElvvatKQSRAQRQelgvnLYEVQQHLVHLQKLLEKSHDRHAMASSERRQKEEELQAARALYTKTCAAA 380
Cdd:TIGR00606 408 KTAAQLCADLQS-----KERLKQEQ-----ADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRILELD 477
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 381 NEERKKLAALQTEMEN-----LALHLFYMQN----IDQDMRDDIRVMTQVVKKAETERIRAEIEKKKQDLY--------- 442
Cdd:TIGR00606 478 QELRKAERELSKAEKNsltetLKKEVKSLQNekadLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDeqirkiksr 557
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 443 -VDQLTTRA------QQLEEDIALFEAQYLAQAEDTRILRKAVSEACTEIDAISVEKRRIMQQWAS------SLVGMKHR 509
Cdd:TIGR00606 558 hSDELTSLLgyfpnkKQLEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSyedklfDVCGSQDE 637
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 510 DEAHRAVLEALRGCQHQAKSTDGEIEAYKKSIMKEEEKNEKLASILNRTETEATLLQKLTTQcltkqvaLQSQFNTYRLT 589
Cdd:TIGR00606 638 ESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQSCCPVCQRVFQTEAELQEFISD-------LQSKLRLAPDK 710
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 590 LQDTEDALSQDQLEQ---MILTEELQAIRQAIQGEL-ELRRKTDAAIREKLQEHMTSNKTTKYFNQLILRLQKEKTNM-- 663
Cdd:TIGR00606 711 LKSTESELKKKEKRRdemLGLAPGRQSIIDLKEKEIpELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLtd 790
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 664 MTHLSKINGDIAQTTLDITHTSSRLDAhqktlVELDQDVKKVNELITNSQSEISRRTILIERKQGLINFLNKQLERMVSE 743
Cdd:TIGR00606 791 VTIMERFQMELKDVERKIAQQAAKLQG-----SDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSK 865
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 744 LGgeevgplELEIKRLSKLIDEHDGKAVQAQ-VTWLRLQQEMVKVTQEQEEQLASLDASKKELhiMEQKKLRVESKIEQE 822
Cdd:TIGR00606 866 TN-------ELKSEKLQIGTNLQRRQQFEEQlVELSTEVQSLIREIKDAKEQDSPLETFLEKD--QQEKEELISSKETSN 936
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 823 KKEQKEIEHHMKDLDNDLKKLNMLMNKNRCSSEELEQNNRVTENEFVRSLKASERETIKMQDKLNQLSEEKATLLNQ--L 900
Cdd:TIGR00606 937 KKAQDKVNDIKEKVKNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQDIDTQKIQerW 1016
|
650 660 670 680 690 700
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 148664197 901 VEAEHQIMLWEKKIQLAKEMRSSVDSEIGQTEIRAMKGEIHRMKVRLGQLLKQQEKMIRAME 962
Cdd:TIGR00606 1017 LQDNLTLRKRENELKEVEEELKQHLKEMGQMQVLQMKQEHQKLEENIDLIKRNHVLALGRQK 1078
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
866-1071 |
1.69e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.59 E-value: 1.69e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 866 NEFVRSLKASERETikmQDKLNQLSEEKATLL---NQLVEAEHQIM-------LWEK------KIQLAKEMRSSVDSEIG 929
Cdd:COG4913 210 DDFVREYMLEEPDT---FEAADALVEHFDDLErahEALEDAREQIEllepireLAERyaaareRLAELEYLRAALRLWFA 286
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 930 QTEIRAMKGEIHRMKVRLGQLLKQQEKMIRAMELAVARRETVTTQAEGQrKMDRKALTRTDFHHKQLELRRKIRDVRKAT 1009
Cdd:COG4913 287 QRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRGN-GGDRLEQLEREIERLERELEERERRRARLE 365
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 148664197 1010 DECTK-------TVLELEETQRNVSSSLLEKQEKLSVIQADFDTLEADLTRLGALKRQNLSEIVALQTR 1071
Cdd:COG4913 366 ALLAAlglplpaSAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERR 434
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
299-835 |
1.75e-03 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 42.40 E-value: 1.75e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 299 LNRQIEKLKLDLQELVVATKQSRAQRQELGVNLYEVQQHLVH-LQKLLEKSHDRHAMASSERRQKEEELQAARALYTKTC 377
Cdd:pfam05483 188 LNNNIEKMILAFEELRVQAENARLEMHFKLKEDHEKIQHLEEeYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESR 267
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 378 AAAN--EERKKLAA------------LQTEMENLALHLFYMQNIDQDMRDDIRVMT----QVVKKAETERIRAEIEKKKQ 439
Cdd:pfam05483 268 DKANqlEEKTKLQDenlkeliekkdhLTKELEDIKMSLQRSMSTQKALEEDLQIATkticQLTEEKEAQMEELNKAKAAH 347
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 440 DLYVDQLTTRAQQLEEdIALFEAQYLAQAED-----TRILRKAVSE-------------ACTEIDAISVEKRRIM---QQ 498
Cdd:pfam05483 348 SFVVTEFEATTCSLEE-LLRTEQQRLEKNEDqlkiiTMELQKKSSEleemtkfknnkevELEELKKILAEDEKLLdekKQ 426
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 499 WASSLVGMKHRDEAHRAVLEALRGCQH----QAKSTDGEIEAYKKSI--MKEEEKNEKLASILNRTETEATLLQ--KLTT 570
Cdd:pfam05483 427 FEKIAEELKGKEQELIFLLQAREKEIHdleiQLTAIKTSEEHYLKEVedLKTELEKEKLKNIELTAHCDKLLLEnkELTQ 506
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 571 QCLTKQVALQSQFNTYRLTLQDTEDALSQD---QLEQMILTEELQAIRQA-IQGELELRRKTDAA---IREKLQEHMTSN 643
Cdd:pfam05483 507 EASDMTLELKKHQEDIINCKKQEERMLKQIenlEEKEMNLRDELESVREEfIQKGDEVKCKLDKSeenARSIEYEVLKKE 586
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 644 KTTKYFNQLILRLQKEKTNMMTHLSKINGDIAQTTLDITHTSSRLDAHQ----KTLVELDQDVKKVNELITNSQSEISRR 719
Cdd:pfam05483 587 KQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAENKQLNAYEikvnKLELELASAKQKFEEIIDNYQKEIEDK 666
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 720 TILIERKQGLIN----------FLNKQLERMVSELGGEEVGPLELEIKRLSKLIDEHDgkavqaqvtwlrlqQEMVKVTQ 789
Cdd:pfam05483 667 KISEEKLLEEVEkakaiadeavKLQKEIDKRCQHKIAEMVALMEKHKHQYDKIIEERD--------------SELGLYKN 732
|
570 580 590 600
....*....|....*....|....*....|....*....|....*.
gi 148664197 790 EQEEQLASLDASKKELHIMEQKKLRVESKIEQEKKEQKEIEHHMKD 835
Cdd:pfam05483 733 KEQEQSSAKAALEIELSNIKAELLSLKKQLEIEKEEKEKLKMEAKE 778
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
542-1037 |
1.89e-03 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 42.73 E-value: 1.89e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 542 MKEEEKNEKLASILNRTETEATLLQKlTTQCLTKQ---VALQSQFNTYRLTLQDTEDALSQDQLEQMILTEEL----QAI 614
Cdd:TIGR01612 876 IKAEISDDKLNDYEKKFNDSKSLINE-INKSIEEEyqnINTLKKVDEYIKICENTKESIEKFHNKQNILKEILnkniDTI 954
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 615 RQAIQGELELRRKTDAAIREKLQE-------------HMTSNKTTKYFNQLILRLQKEKTNMMTHL----SKINGDIAQT 677
Cdd:TIGR01612 955 KESNLIEKSYKDKFDNTLIDKINEldkafkdaslndyEAKNNELIKYFNDLKANLGKNKENMLYHQfdekEKATNDIEQK 1034
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 678 TLDITHTSSRLDAHQKTLV-----ELDQDVKKVNELITnsqseisrRTILIERKQGLINFlNKQLERM----VSELGGE- 747
Cdd:TIGR01612 1035 IEDANKNIPNIEIAIHTSIyniidEIEKEIGKNIELLN--------KEILEEAEINITNF-NEIKEKLkhynFDDFGKEe 1105
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 748 ------EVGPLELEIKRLSKLIDEHdgkavqaqvtwlrlQQEMVKVTQEQEEQLASLDASKKELHIMEQKKLRVESKIEQ 821
Cdd:TIGR01612 1106 nikyadEINKIKDDIKNLDQKIDHH--------------IKALEEIKKKSENYIDEIKAQINDLEDVADKAISNDDPEEI 1171
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 822 EKKeqkeIEHHMKDLDndlKKLNMLMNKNRCSSEELEQNNRVTENEFVRSLKASERETIKmQDKLNQLSEEKAtllnqlv 901
Cdd:TIGR01612 1172 EKK----IENIVTKID---KKKNIYDEIKKLLNEIAEIEKDKTSLEEVKGINLSYGKNLG-KLFLEKIDEEKK------- 1236
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 902 EAEHQIMLWEKKIQLAKEMR-SSVDSEIGQTEIRAMKGEIHRMKVR---------LGQLLKQQEKMIRAMELAVARRETV 971
Cdd:TIGR01612 1237 KSEHMIKAMEAYIEDLDEIKeKSPEIENEMGIEMDIKAEMETFNIShdddkdhhiISKKHDENISDIREKSLKIIEDFSE 1316
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 148664197 972 TTQAEGQRKMDRKALTRTDFHHKQLELR----------RKIRDVRKATDECTKTVLELEETQRNVSSSlLEKQEKL 1037
Cdd:TIGR01612 1317 ESDINDIKKELQKNLLDAQKHNSDINLYlneianiyniLKLNKIKKIIDEVKEYTKEIEENNKNIKDE-LDKSEKL 1391
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
650-849 |
1.96e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 42.06 E-value: 1.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 650 NQLILRLQKEKTNMMTHLSKINGDIAQTTLDITHTSSRLDAHQKTLVELDQDVKKVNELITNSQSEISRRTILIERkqgl 729
Cdd:COG4942 40 EKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAELLRALYR---- 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 730 inflNKQLERMVSELGGEEVGPLELEIKRLSKLIDEHDGKAVQAQVTWLRLQQEMVKVTQEQEEQLASLDASKKELHIME 809
Cdd:COG4942 116 ----LGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALE 191
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 148664197 810 QKKLRVESKIEQEKKEQKEIEHHMKDLDNDLKKLNMLMNK 849
Cdd:COG4942 192 ALKAERQKLLARLEKELAELAAELAELQQEAEELEALIAR 231
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
299-640 |
1.98e-03 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 42.36 E-value: 1.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 299 LNRQIEKLKLDLQELVVATKQSRAQRQELGVNLYEVQQHLVHLQKLLEKSHDRHAMASSERRQKEEELQAARALYTKTCA 378
Cdd:TIGR02169 686 LKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEA 765
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 379 AANEERKKLAALQTEMENLALHLfyMQNIDQDMRDDIRVMTQVVKKAETERIRAEIEKKKQDLYVDQLTTRAQQLEEDIA 458
Cdd:TIGR02169 766 RIEELEEDLHKLEEALNDLEARL--SHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRI 843
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 459 LFEAQY--LAQAEDTRILRKAvseactEIDAISVEKRRIMQQWASSLVGMKHRDEAHRAVLEALRGCQHQAKStdgEIEA 536
Cdd:TIGR02169 844 DLKEQIksIEKEIENLNGKKE------ELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEA---QIEK 914
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 537 yKKSIMKE-----EEKNEKLASILNRT-----ETEATL-LQKLTTQCLTKQVALQSQFNTYRLTLQDTEDALS-QDQLE- 603
Cdd:TIGR02169 915 -KRKRLSElkaklEALEEELSEIEDPKgedeeIPEEELsLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKrLDELKe 993
|
330 340 350
....*....|....*....|....*....|....*...
gi 148664197 604 -QMILTEELQAIRQAIQGELELRRKTDAAIREKLQEHM 640
Cdd:TIGR02169 994 kRAKLEEERKAILERIEEYEKKKREVFMEAFEAINENF 1031
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
788-1009 |
2.27e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 41.67 E-value: 2.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 788 TQEQEEQLASLDASKKELHIMEQKKLRVESKIEQEKKEQKEIEHHMKDLDNDLKKLNMLMNKNRCSSEELEQNNRVTENE 867
Cdd:COG4942 19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAE 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 868 FVRsLKASERETIKMQDKLNQLSEEKAtLLNQ--LVEAEHQIMLWEkkiQLAKEMRSSVDS-EIGQTEIRAMKGEIHRMK 944
Cdd:COG4942 99 LEA-QKEELAELLRALYRLGRQPPLAL-LLSPedFLDAVRRLQYLK---YLAPARREQAEElRADLAELAALRAELEAER 173
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 945 VRLGQLLKQQEKMIRAMELAVARRETVTTQAEGQRKMDRKALTRTDFHHKQLE-----LRRKIRDVRKAT 1009
Cdd:COG4942 174 AELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEaliarLEAEAAAAAERT 243
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
778-1041 |
2.68e-03 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 42.05 E-value: 2.68e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 778 LRLQQEMVKVTQEQEEQLASLDASKKELHIMEQKKLRVESKIEQEKKEQKEIEH-HMKDLDNDLKKLNMLmnKNRCSSEE 856
Cdd:PTZ00121 1488 AKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKaEEKKKADELKKAEEL--KKAEEKKK 1565
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 857 LEQNNRVTENEFVRSLKASERETI---KMQDKLNQLSEEKATLLNQLVEAEHQIMLWEKkIQLAKEMRSSVDSEIGQTEI 933
Cdd:PTZ00121 1566 AEEAKKAEEDKNMALRKAEEAKKAeeaRIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEE-LKKAEEEKKKVEQLKKKEAE 1644
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 934 RAMKGEIHR-----MKVRLGQLLKQQEKMIRAMELAVARRETVTTQAEGQRKMDRKALTRTDFHHKQLELRRKIRDVRKA 1008
Cdd:PTZ00121 1645 EKKKAEELKkaeeeNKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKA 1724
|
250 260 270
....*....|....*....|....*....|....*..
gi 148664197 1009 TDECTKTVLEL----EETQRNVSSSLLEKQEKLSVIQ 1041
Cdd:PTZ00121 1725 EEENKIKAEEAkkeaEEDKKKAEEAKKDEEEKKKIAH 1761
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
667-859 |
3.24e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 41.29 E-value: 3.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 667 LSKINGDIAQTTLDITHTSSRLDAHQKTLVELDQDVKKVNELITNSQSEISRRTILIERKQGLINFLNKQLERMVSELGG 746
Cdd:COG4942 29 LEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAE 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 747 --------EEVGPLELEIKRLSKLIDEHDGKAVQAQVTWLRLQ-QEMVKVTQEQEEQLASLDASKKELHIMEQKKLRVES 817
Cdd:COG4942 109 llralyrlGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQaEELRADLAELAALRAELEAERAELEALLAELEEERA 188
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 148664197 818 KIEQEKKEQKEIehhMKDLDNDLKKLNMLMNKNRCSSEELEQ 859
Cdd:COG4942 189 ALEALKAERQKL---LARLEKELAELAAELAELQQEAEELEA 227
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
753-989 |
4.11e-03 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 40.97 E-value: 4.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 753 ELEIKRLSKLIDEHDGKAVQAQVTWLRLQQEMVKVTQEQEEQLASLDASKKELHIMEQKKLRVESKIEQEKKEQKEIEHH 832
Cdd:COG3883 15 DPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARA 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 833 MKDLDNDLKKLNMLMNknrcsSEELEqnnrvtenEFVRSLKASERETIKMQDKLNQLSEEKATLLNQLVEAEHQIMLWEK 912
Cdd:COG3883 95 LYRSGGSVSYLDVLLG-----SESFS--------DFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEA 161
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 148664197 913 KIQLAKEMRSSVDSEIGQTE--IRAMKGEIHRMKVRLGQLLKQQEKMIRAMELAVARRETVTTQAEGQRKMDRKALTRT 989
Cdd:COG3883 162 LKAELEAAKAELEAQQAEQEalLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 240
|
|
| Nup88 |
pfam10168 |
Nuclear pore component; Nup88 can be divided into two structural domains; the N-terminal ... |
755-900 |
4.78e-03 |
|
Nuclear pore component; Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein has no obvious structural motifs but is the region for binding to Nup98, one of the components of the nuclear pore. the C-terminal end is a predicted coiled-coil domain. Nup88 is overexpressed in tumour cells.
Pssm-ID: 462975 [Multi-domain] Cd Length: 713 Bit Score: 41.18 E-value: 4.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148664197 755 EIKRLSKLIdehdGKAVQAQVTWLRLQQEMVKVTQEQEEQLASL--DASKKELHIME---------QKKLRVESKIEQE- 822
Cdd:pfam10168 558 EIQKRVKLL----KLQKEQQLQELQSLEEERKSLSERAEKLAEKyeEIKDKQEKLMRrckkvlqrlNSQLPVLSDAEREm 633
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 148664197 823 KKEQKEIEHHMKDLDNDLKKLNMLMNKNRCSSEELEQNNRVTEnefvRSLKASERETIKmqDKLNQLSEEKATLLNQL 900
Cdd:pfam10168 634 KKELETINEQLKHLANAIKQAKKKMNYQRYQIAKSQSIRKKSS----LSLSEKQRKTIK--EILKQLGSEIDELIKQV 705
|
|
|