NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1833563309|ref|NP_001368929|]
View 

coiled-coil domain-containing protein 144A isoform b [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
CCDC144C pfam14915
CCDC144C protein coiled-coil region; This family includes the human protein CCDC144C and the ...
821-1123 3.62e-138

CCDC144C protein coiled-coil region; This family includes the human protein CCDC144C and the ankyrin repeat domain-containing protein 26-like 1 found in eukaryotes. Its function remains unknown, however, it is known to contain a coiled-coil domain which corresponds to this region. The ankyrin repeat which features in this protein is a common amino acid motif.


:

Pssm-ID: 464371 [Multi-domain]  Cd Length: 304  Bit Score: 425.56  E-value: 3.62e-138
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  821 DCMLQEEIALLRLEIDTIKNQNKQKEKKYFEDIEAVKEKNDNLQKIIKLNEETLTETILQYSGQLNNLTAENKILNSELE 900
Cdd:pfam14915    1 NCMLQDEIAMLRLEIDTIKNQNQEKEKKYLEDIEILKEKNDDLQKTLKLNEETLTKTVFQYNGQLNVLKAENTMLNSKLE 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  901 NGKQNQERLEIEMESYRCRLAAAVRDCDQSQTA-RDLKLDFQRTRQEWVRLHDKMKVDMSGLQAKNEILSEKLSNAESKI 979
Cdd:pfam14915   81 NEKQNKERLETEVESYRSRLAAAIQDHEQSQTSkRDLELAFQRERDEWLRLQDKMNFDVSNLRDENEILSQQLSKAESKA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  980 NSLQIQLHNTRDALGRESLILERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQQLDDA 1059
Cdd:pfam14915  161 NSLENELHRTRDALREKTLLLESVQRDLSQAQCQKKELEHMYQNEQDKVNKYIGKQESLEERLAQLQSENMLLRQQLEDA 240
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1833563309 1060 HKKANSQEKTSSTIQDQFHSAAKNLQAESEKQILSLQEKNKELMDEYNHLKERMDQCEKEKAGR 1123
Cdd:pfam14915  241 QNKADAKEKTVIDIQDQFQDIVKKLQAESEKQVLLLEERNKELINECNHLKERLYQYEKEKAER 304
DUF3496 pfam12001
Domain of unknown function (DUF3496); This presumed domain is functionally uncharacterized. ...
1244-1354 2.61e-47

Domain of unknown function (DUF3496); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is about 110 amino acids in length.


:

Pssm-ID: 463425 [Multi-domain]  Cd Length: 109  Bit Score: 164.45  E-value: 2.61e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1244 KTQMELTIKDLESEISRIKTSQADFNKTELERYKELYLEEVKVRESLSNELSRTNEMIAEVSTQLTVEKEQtrSRSLFTA 1323
Cdd:pfam12001    1 RSQMELRIKDLESELSKMKTSQEDSNKIELEKYKQLYLEELKVRKSLSNKLNKTNERLAEVSTKLLVEKQQ--NRSLLST 78
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1833563309 1324 YATRPVLESPCVGNLNDSEGLNRKHIPRKKR 1354
Cdd:pfam12001   79 LTTRPVLESPCVGNLNNSLVLNRNFIPRENL 109
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
466-1279 2.69e-12

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 72.01  E-value: 2.69e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  466 YKSLKPKLENLsslppdsDRTSEVYLHEELQQDMQKFKNEVNTLEEEFLALKKEDVQLHKDVEEEMEKHRSNSTELSGTL 545
Cdd:TIGR02168  215 YKELKAELREL-------ELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQ 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  546 TDGTTVGNDDDGLNQQI----PRKENGEHDRPADKTSNEKNEVKNQIYpEADFADSMEPSEIASEDCElshsvyeNFMLL 621
Cdd:TIGR02168  288 KELYALANEISRLEQQKqilrERLANLERQLEELEAQLEELESKLDEL-AEELAELEEKLEELKEELE-------SLEAE 359
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  622 IEQLRMEYKDSASLPRIQDtfclcEHLLKLKNnhcdqltvKLKQMENMVSVLQNELSETKKTKLQLELQkiewekelydl 701
Cdd:TIGR02168  360 LEELEAELEELESRLEELE-----EQLETLRS--------KVAQLELQIASLNNEIERLEARLERLEDR----------- 415
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  702 rlalkQENEEKRNADMLYNKDSEQLRIKEEECGKVVETKQQLKWNLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDA 781
Cdd:TIGR02168  416 -----RERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQAR 490
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  782 RILQDQILTSKQKELEMARKKMN--SEISHRHQKEKDLFHEDCMLQEEI-ALLRLEIDTIknqNKQKEKKYFEDIEAVKE 858
Cdd:TIGR02168  491 LDSLERLQENLEGFSEGVKALLKnqSGLSGILGVLSELISVDEGYEAAIeAALGGRLQAV---VVENLNAAKKAIAFLKQ 567
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  859 KNDNLQKIIKLNeeTLTETILQYSGqLNNLTAENKILNSELENGKQNqERLEIEMESyrcrLAAAVRDCDQSQTARDLK- 937
Cdd:TIGR02168  568 NELGRVTFLPLD--SIKGTEIQGND-REILKNIEGFLGVAKDLVKFD-PKLRKALSY----LLGGVLVVDDLDNALELAk 639
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  938 ----------LDFQRTRQEWVRLHDKMKVDMSGLQAKNEI--LSEKLSNAESKINSLQIQLHNTRDALGRESLILERVQR 1005
Cdd:TIGR02168  640 klrpgyrivtLDGDLVRPGGVITGGSAKTNSSILERRREIeeLEEKIEELEEKIAELEKALAELRKELEELEEELEQLRK 719
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1006 DLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNLQ 1085
Cdd:TIGR02168  720 ELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELK 799
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1086 AeSEKQILSLQEKNKELMDEYNHLKERMDQCEKEKAGRKIDLTEAQETvpSRCLHLDAENEVLQLQQTLFSMKAIQKQCE 1165
Cdd:TIGR02168  800 A-LREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQ--IEELSEDIESLAAEIEELEELIEELESELE 876
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1166 TLQKNKKQLKQEVVNLKSYMERNMLERGKAEwHKLLIEERARKEIEEKLNEAILTLQKQAAVSHEQLVQLREDNTTSIKT 1245
Cdd:TIGR02168  877 ALLNERASLEEALALLRSELEELSEELRELE-SKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEE 955
                          810       820       830       840
                   ....*....|....*....|....*....|....*....|...
gi 1833563309 1246 QMELT------IKDLESEISRIKTSQADF---NKTELERYKEL 1279
Cdd:TIGR02168  956 AEALEnkieddEEEARRRLKRLENKIKELgpvNLAAIEEYEEL 998
 
Name Accession Description Interval E-value
CCDC144C pfam14915
CCDC144C protein coiled-coil region; This family includes the human protein CCDC144C and the ...
821-1123 3.62e-138

CCDC144C protein coiled-coil region; This family includes the human protein CCDC144C and the ankyrin repeat domain-containing protein 26-like 1 found in eukaryotes. Its function remains unknown, however, it is known to contain a coiled-coil domain which corresponds to this region. The ankyrin repeat which features in this protein is a common amino acid motif.


Pssm-ID: 464371 [Multi-domain]  Cd Length: 304  Bit Score: 425.56  E-value: 3.62e-138
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  821 DCMLQEEIALLRLEIDTIKNQNKQKEKKYFEDIEAVKEKNDNLQKIIKLNEETLTETILQYSGQLNNLTAENKILNSELE 900
Cdd:pfam14915    1 NCMLQDEIAMLRLEIDTIKNQNQEKEKKYLEDIEILKEKNDDLQKTLKLNEETLTKTVFQYNGQLNVLKAENTMLNSKLE 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  901 NGKQNQERLEIEMESYRCRLAAAVRDCDQSQTA-RDLKLDFQRTRQEWVRLHDKMKVDMSGLQAKNEILSEKLSNAESKI 979
Cdd:pfam14915   81 NEKQNKERLETEVESYRSRLAAAIQDHEQSQTSkRDLELAFQRERDEWLRLQDKMNFDVSNLRDENEILSQQLSKAESKA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  980 NSLQIQLHNTRDALGRESLILERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQQLDDA 1059
Cdd:pfam14915  161 NSLENELHRTRDALREKTLLLESVQRDLSQAQCQKKELEHMYQNEQDKVNKYIGKQESLEERLAQLQSENMLLRQQLEDA 240
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1833563309 1060 HKKANSQEKTSSTIQDQFHSAAKNLQAESEKQILSLQEKNKELMDEYNHLKERMDQCEKEKAGR 1123
Cdd:pfam14915  241 QNKADAKEKTVIDIQDQFQDIVKKLQAESEKQVLLLEERNKELINECNHLKERLYQYEKEKAER 304
DUF3496 pfam12001
Domain of unknown function (DUF3496); This presumed domain is functionally uncharacterized. ...
1244-1354 2.61e-47

Domain of unknown function (DUF3496); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is about 110 amino acids in length.


Pssm-ID: 463425 [Multi-domain]  Cd Length: 109  Bit Score: 164.45  E-value: 2.61e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1244 KTQMELTIKDLESEISRIKTSQADFNKTELERYKELYLEEVKVRESLSNELSRTNEMIAEVSTQLTVEKEQtrSRSLFTA 1323
Cdd:pfam12001    1 RSQMELRIKDLESELSKMKTSQEDSNKIELEKYKQLYLEELKVRKSLSNKLNKTNERLAEVSTKLLVEKQQ--NRSLLST 78
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1833563309 1324 YATRPVLESPCVGNLNDSEGLNRKHIPRKKR 1354
Cdd:pfam12001   79 LTTRPVLESPCVGNLNNSLVLNRNFIPRENL 109
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
658-1303 5.67e-13

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 73.94  E-value: 5.67e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  658 QLTVKLKQMENMVSVLQNELSETKKTKLQLELQKIEWEKELYDLRLALKQENEEKrnADMLYNKDSEQLRIKEEEcgkvv 737
Cdd:TIGR02168  292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKL--EELKEELESLEAELEELE----- 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  738 ETKQQLKWNLRRLVKELRTVRNNLDLVVQERNDAQKQLSE--------EQDARILQDQILTSKQKELEMARKKMNSEISH 809
Cdd:TIGR02168  365 AELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERlearlerlEDRRERLQQEIEELLKKLEEAELKELQAELEE 444
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  810 RHQKEKDLFHEDCMLQEEIALLRLEIDTIKNQNKQKEKKY------FEDIEAVKEKNDNLQKIIK--LNEETLTETILQY 881
Cdd:TIGR02168  445 LEEELEELQEELERLEEALEELREELEEAEQALDAAERELaqlqarLDSLERLQENLEGFSEGVKalLKNQSGLSGILGV 524
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  882 SGQLNNLTAE-------------NKILNSELENGKQNQERLEiEMESYRCRLAAAVRDCDQSQTARDLKL-----DFQRT 943
Cdd:TIGR02168  525 LSELISVDEGyeaaieaalggrlQAVVVENLNAAKKAIAFLK-QNELGRVTFLPLDSIKGTEIQGNDREIlknieGFLGV 603
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  944 RQEWVRLHDKMKVDMSG----------LQAKNEILseKLSNAESKINSLQIQLHNTRDAL-----GRESLILERvQRDLS 1008
Cdd:TIGR02168  604 AKDLVKFDPKLRKALSYllggvlvvddLDNALELA--KKLRPGYRIVTLDGDLVRPGGVItggsaKTNSSILER-RREIE 680
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1009 QTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNLqAES 1088
Cdd:TIGR02168  681 ELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKEL-TEL 759
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1089 EKQILSLQEKNKELMDEYNHLKERMDQCEKEKAGRKIDLTEAQETVPS-----RCLHLDAENEVLQLQQTLFSMKAIQKQ 1163
Cdd:TIGR02168  760 EAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDElraelTLLNEEAANLRERLESLERRIAATERR 839
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1164 CETLQKNKKQLKQEVVNLKSYMERnmLERGKAEWHK----LLIEERARKEIEEKLNEAILTLQKQAAVSHEQLVQLRE-- 1237
Cdd:TIGR02168  840 LEDLEEQIEELSEDIESLAAEIEE--LEELIEELESeleaLLNERASLEEALALLRSELEELSEELRELESKRSELRRel 917
                          650       660       670       680       690       700
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1833563309 1238 DNTTSIKTQMELTIKDLESEISRIKTSQADFNKTELERYKELYLEEVKVRESLSNELSRTNEMIAE 1303
Cdd:TIGR02168  918 EELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKE 983
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
466-1279 2.69e-12

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 72.01  E-value: 2.69e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  466 YKSLKPKLENLsslppdsDRTSEVYLHEELQQDMQKFKNEVNTLEEEFLALKKEDVQLHKDVEEEMEKHRSNSTELSGTL 545
Cdd:TIGR02168  215 YKELKAELREL-------ELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQ 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  546 TDGTTVGNDDDGLNQQI----PRKENGEHDRPADKTSNEKNEVKNQIYpEADFADSMEPSEIASEDCElshsvyeNFMLL 621
Cdd:TIGR02168  288 KELYALANEISRLEQQKqilrERLANLERQLEELEAQLEELESKLDEL-AEELAELEEKLEELKEELE-------SLEAE 359
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  622 IEQLRMEYKDSASLPRIQDtfclcEHLLKLKNnhcdqltvKLKQMENMVSVLQNELSETKKTKLQLELQkiewekelydl 701
Cdd:TIGR02168  360 LEELEAELEELESRLEELE-----EQLETLRS--------KVAQLELQIASLNNEIERLEARLERLEDR----------- 415
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  702 rlalkQENEEKRNADMLYNKDSEQLRIKEEECGKVVETKQQLKWNLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDA 781
Cdd:TIGR02168  416 -----RERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQAR 490
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  782 RILQDQILTSKQKELEMARKKMN--SEISHRHQKEKDLFHEDCMLQEEI-ALLRLEIDTIknqNKQKEKKYFEDIEAVKE 858
Cdd:TIGR02168  491 LDSLERLQENLEGFSEGVKALLKnqSGLSGILGVLSELISVDEGYEAAIeAALGGRLQAV---VVENLNAAKKAIAFLKQ 567
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  859 KNDNLQKIIKLNeeTLTETILQYSGqLNNLTAENKILNSELENGKQNqERLEIEMESyrcrLAAAVRDCDQSQTARDLK- 937
Cdd:TIGR02168  568 NELGRVTFLPLD--SIKGTEIQGND-REILKNIEGFLGVAKDLVKFD-PKLRKALSY----LLGGVLVVDDLDNALELAk 639
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  938 ----------LDFQRTRQEWVRLHDKMKVDMSGLQAKNEI--LSEKLSNAESKINSLQIQLHNTRDALGRESLILERVQR 1005
Cdd:TIGR02168  640 klrpgyrivtLDGDLVRPGGVITGGSAKTNSSILERRREIeeLEEKIEELEEKIAELEKALAELRKELEELEEELEQLRK 719
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1006 DLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNLQ 1085
Cdd:TIGR02168  720 ELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELK 799
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1086 AeSEKQILSLQEKNKELMDEYNHLKERMDQCEKEKAGRKIDLTEAQETvpSRCLHLDAENEVLQLQQTLFSMKAIQKQCE 1165
Cdd:TIGR02168  800 A-LREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQ--IEELSEDIESLAAEIEELEELIEELESELE 876
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1166 TLQKNKKQLKQEVVNLKSYMERNMLERGKAEwHKLLIEERARKEIEEKLNEAILTLQKQAAVSHEQLVQLREDNTTSIKT 1245
Cdd:TIGR02168  877 ALLNERASLEEALALLRSELEELSEELRELE-SKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEE 955
                          810       820       830       840
                   ....*....|....*....|....*....|....*....|...
gi 1833563309 1246 QMELT------IKDLESEISRIKTSQADF---NKTELERYKEL 1279
Cdd:TIGR02168  956 AEALEnkieddEEEARRRLKRLENKIKELgpvNLAAIEEYEEL 998
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
890-1234 2.22e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 55.71  E-value: 2.22e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  890 AENKilnseLENGKQNQERLEI---EMESYRCRLAaavRDCDQSQTARDLKLDFQRTRQEWVRLHDK--------MKVDM 958
Cdd:COG1196    177 AERK-----LEATEENLERLEDilgELERQLEPLE---RQAEKAERYRELKEELKELEAELLLLKLReleaeleeLEAEL 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  959 SGLQAKNEILSEKLSNAESKINSLQIQLHNTRDAL----GRESLI---LERVQRDLSQTQCQKKETEQMYQIEQSKLKKY 1031
Cdd:COG1196    249 EELEAELEELEAELAELEAELEELRLELEELELELeeaqAEEYELlaeLARLEQDIARLEERRRELEERLEELEEELAEL 328
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1032 IAKQESVEERLSQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNLQAEsEKQILSLQEKNKELMDEYNHLKE 1111
Cdd:COG1196    329 EEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEEL-AEELLEALRAAAELAAQLEELEE 407
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1112 RMDQCEKEKAGRKIDLTEAQETVpsrclhLDAENEVLQLQQTLfsmKAIQKQCETLQKNKKQLKQEVVNLKsymernmle 1191
Cdd:COG1196    408 AEEALLERLERLEEELEELEEAL------AELEEEEEEEEEAL---EEAAEEEAELEEEEEALLELLAELL--------- 469
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1833563309 1192 rgkaEWHKLLIEERARKEIEEKLNEAILTLQKQAAVSHEQLVQ 1234
Cdd:COG1196    470 ----EEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLE 508
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
649-1314 5.69e-06

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 51.13  E-value: 5.69e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  649 LKLKNNHCDQLTVKLKQMENMVSVLQNELSETKKTKLQLELQKIEWEKELYDLRLALKQENEEKR---NADMLYNKDSEQ 725
Cdd:pfam02463  215 LKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKekkLQEEELKLLAKE 294
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  726 LRIKEEECGKVVETKQQLKWNLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDARILQDQILTSKQKELEMARKKMNS 805
Cdd:pfam02463  295 EEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEE 374
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  806 EISHRHQKEKDLFHEDCMLQEEIALLRLEIDTIKNQNKQKekkyfediEAVKEKNDNLQKIIKLNEETLTETILQYSGQL 885
Cdd:pfam02463  375 LLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELA--------RQLEDLLKEEKKEELEILEEEEESIELKQGKL 446
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  886 NNLTAENKILNSELENGKQNQERLEIEMESYRCRLAAAVRDCDQSqtarDLKLDFQRTRQEWVRLHDKMKVDMSGLQAKN 965
Cdd:pfam02463  447 TEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLS----RQKLEERSQKESKARSGLKVLLALIKDGVGG 522
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  966 EILSEKLSNAE-SKINSLQIQLHNTRDALGRESLILERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQ 1044
Cdd:pfam02463  523 RIISAHGRLGDlGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPI 602
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1045 LQSENMLLRQQLDDAHKKANS--QEKTSSTIQDQFHSAAKNLQAESEKQILSLQEKNKELMDEYNHLKERMDQCEKEKAG 1122
Cdd:pfam02463  603 LNLAQLDKATLEADEDDKRAKvvEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQ 682
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1123 RKIDLTEAQETVPSRCLHLDAENEVLQLQQTLFSMKAIQKQCETLQKNKKQLKQEVVNLKSYMERNMLERGKAEWHKLLI 1202
Cdd:pfam02463  683 EKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEK 762
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1203 EERARKEIE-EKLNEAILTLQKQAAVSHEQLVQLREDNTTSIKTQMELTIKDLESEISRIKTSQADFNKTELERYKELYL 1281
Cdd:pfam02463  763 EEEKSELSLkEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALEL 842
                          650       660       670
                   ....*....|....*....|....*....|...
gi 1833563309 1282 EEVKVRESLSNELSRTNEMIAEVSTQLTVEKEQ 1314
Cdd:pfam02463  843 KEEQKLEKLAEEELERLEEEITKEELLQELLLK 875
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
746-1319 4.06e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 48.14  E-value: 4.06e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  746 NLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDARILQDQILTSKQKELEMARKKMnSEISHRHQKEKDLFHEDCMLQ 825
Cdd:PRK03918   166 NLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREEL-EKLEKEVKELEELKEEIEELE 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  826 EEIALLRLEIDTIKNQNKQKEkkyfEDIEAVKEKNDNLQKIIKLNEEtltetilqysgqLNNLTAENKILNSELENGKQN 905
Cdd:PRK03918   245 KELESLEGSKRKLEEKIRELE----ERIEELKKEIEELEEKVKELKE------------LKEKAEEYIKLSEFYEEYLDE 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  906 QERLEIEMESYRCRLAAAVRDCDqsqtardlKLDFQRTRQEWvrlhdkMKVDMSGLQAKNEILSEKLSNAEsKINSLQIQ 985
Cdd:PRK03918   309 LREIEKRLSRLEEEINGIEERIK--------ELEEKEERLEE------LKKKLKELEKRLEELEERHELYE-EAKAKKEE 373
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  986 LHNTRDALGRESLilERVQRDLSQTQCQKKE-TEQMYQIEQSK--LKKYIAKQESVEERLSQLQSENMLLRQQLDDAHKK 1062
Cdd:PRK03918   374 LERLKKRLTGLTP--EKLEKELEELEKAKEEiEEEISKITARIgeLKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRK 451
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1063 A---------NSQEKTSSTIQDQFHSAAKNLQaESEKQILSLQE--KNKELMDEYNHLKERMDQCEKEKAGRKidlTEAQ 1131
Cdd:PRK03918   452 ElleeytaelKRIEKELKEIEEKERKLRKELR-ELEKVLKKESEliKLKELAEQLKELEEKLKKYNLEELEKK---AEEY 527
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1132 ETVPSRCLHLdaENEVLQLQQTLFSMKAIQKQCETLQKNKKQLKQEVVNLksymERNMLERGKAEWHKLlieERARKEIE 1211
Cdd:PRK03918   528 EKLKEKLIKL--KGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAEL----LKELEELGFESVEEL---EERLKELE 598
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1212 EKLNEAI-LTLQKQAAVSHEQLVQLREDNTTSIKTQMELTIKDLESEISRIKTSQADFNKTELERYKELYLEEVKVRESL 1290
Cdd:PRK03918   599 PFYNEYLeLKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGL 678
                          570       580
                   ....*....|....*....|....*....
gi 1833563309 1291 SNELSRTNEMIAEVSTQLTVEKEQTRSRS 1319
Cdd:PRK03918   679 RAELEELEKRREEIKKTLEKLKEELEERE 707
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
968-1313 5.05e-04

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 44.12  E-value: 5.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  968 LSEKLSNAESKINSLQIQLHNTRDALGRESLILERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQS 1047
Cdd:COG4372     29 LSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQE 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1048 ENMLLRQQLDDAHKKANSQEKTSSTIQDQfhsaaknlQAESEKQILSLQEKNKELMDEYNHLKERMDQCEKEKAGRKIDL 1127
Cdd:COG4372    109 EAEELQEELEELQKERQDLEQQRKQLEAQ--------IAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAE 180
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1128 TEAQ--ETVPSRCLHLDAENEVLQLQQTLFSMKAIQKQCETLQKNKKQLKQEVVNLKSYMERNMLERG-KAEWHKLLIEE 1204
Cdd:COG4372    181 AEQAldELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKeELLEEVILKEI 260
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1205 RARKEIEEKLNEAILTLQKQAAVSHEQLVQLREDNTTSIKTQMELTIKDLESEISRIKTSQADFNKTELERYKELYLEEV 1284
Cdd:COG4372    261 EELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAELA 340
                          330       340
                   ....*....|....*....|....*....
gi 1833563309 1285 KVRESLSNELSRTNEMIAEVSTQLTVEKE 1313
Cdd:COG4372    341 DLLQLLLVGLLDNDVLELLSKGAEAGVAD 369
PRK11281 PRK11281
mechanosensitive channel MscK;
1140-1309 7.10e-03

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 41.05  E-value: 7.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1140 HLDAENEVLQ--LQQTLFSMKAIQKQCETLQKNKKQLKQEVVNLKSYMERnmLERGKAEWHKLLIEERAR---KEIEEKL 1214
Cdd:PRK11281    53 LLEAEDKLVQqdLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAE--LEALKDDNDEETRETLSTlslRQLESRL 130
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1215 NE---AILTLQKQAAVSHEQLVQLRednTTSIKTQMELT-----IKDLESEISRIKTSQADFNKTELERYK-ELYLEEvk 1285
Cdd:PRK11281   131 AQtldQLQNAQNDLAEYNSQLVSLQ---TQPERAQAALYansqrLQQIRNLLKGGKVGGKALRPSQRVLLQaEQALLN-- 205
                          170       180
                   ....*....|....*....|....
gi 1833563309 1286 vresLSNELSRTNemiAEVSTQLT 1309
Cdd:PRK11281   206 ----AQNDLQRKS---LEGNTQLQ 222
 
Name Accession Description Interval E-value
CCDC144C pfam14915
CCDC144C protein coiled-coil region; This family includes the human protein CCDC144C and the ...
821-1123 3.62e-138

CCDC144C protein coiled-coil region; This family includes the human protein CCDC144C and the ankyrin repeat domain-containing protein 26-like 1 found in eukaryotes. Its function remains unknown, however, it is known to contain a coiled-coil domain which corresponds to this region. The ankyrin repeat which features in this protein is a common amino acid motif.


Pssm-ID: 464371 [Multi-domain]  Cd Length: 304  Bit Score: 425.56  E-value: 3.62e-138
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  821 DCMLQEEIALLRLEIDTIKNQNKQKEKKYFEDIEAVKEKNDNLQKIIKLNEETLTETILQYSGQLNNLTAENKILNSELE 900
Cdd:pfam14915    1 NCMLQDEIAMLRLEIDTIKNQNQEKEKKYLEDIEILKEKNDDLQKTLKLNEETLTKTVFQYNGQLNVLKAENTMLNSKLE 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  901 NGKQNQERLEIEMESYRCRLAAAVRDCDQSQTA-RDLKLDFQRTRQEWVRLHDKMKVDMSGLQAKNEILSEKLSNAESKI 979
Cdd:pfam14915   81 NEKQNKERLETEVESYRSRLAAAIQDHEQSQTSkRDLELAFQRERDEWLRLQDKMNFDVSNLRDENEILSQQLSKAESKA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  980 NSLQIQLHNTRDALGRESLILERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQQLDDA 1059
Cdd:pfam14915  161 NSLENELHRTRDALREKTLLLESVQRDLSQAQCQKKELEHMYQNEQDKVNKYIGKQESLEERLAQLQSENMLLRQQLEDA 240
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1833563309 1060 HKKANSQEKTSSTIQDQFHSAAKNLQAESEKQILSLQEKNKELMDEYNHLKERMDQCEKEKAGR 1123
Cdd:pfam14915  241 QNKADAKEKTVIDIQDQFQDIVKKLQAESEKQVLLLEERNKELINECNHLKERLYQYEKEKAER 304
DUF3496 pfam12001
Domain of unknown function (DUF3496); This presumed domain is functionally uncharacterized. ...
1244-1354 2.61e-47

Domain of unknown function (DUF3496); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is about 110 amino acids in length.


Pssm-ID: 463425 [Multi-domain]  Cd Length: 109  Bit Score: 164.45  E-value: 2.61e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1244 KTQMELTIKDLESEISRIKTSQADFNKTELERYKELYLEEVKVRESLSNELSRTNEMIAEVSTQLTVEKEQtrSRSLFTA 1323
Cdd:pfam12001    1 RSQMELRIKDLESELSKMKTSQEDSNKIELEKYKQLYLEELKVRKSLSNKLNKTNERLAEVSTKLLVEKQQ--NRSLLST 78
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1833563309 1324 YATRPVLESPCVGNLNDSEGLNRKHIPRKKR 1354
Cdd:pfam12001   79 LTTRPVLESPCVGNLNNSLVLNRNFIPRENL 109
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
658-1303 5.67e-13

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 73.94  E-value: 5.67e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  658 QLTVKLKQMENMVSVLQNELSETKKTKLQLELQKIEWEKELYDLRLALKQENEEKrnADMLYNKDSEQLRIKEEEcgkvv 737
Cdd:TIGR02168  292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKL--EELKEELESLEAELEELE----- 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  738 ETKQQLKWNLRRLVKELRTVRNNLDLVVQERNDAQKQLSE--------EQDARILQDQILTSKQKELEMARKKMNSEISH 809
Cdd:TIGR02168  365 AELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERlearlerlEDRRERLQQEIEELLKKLEEAELKELQAELEE 444
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  810 RHQKEKDLFHEDCMLQEEIALLRLEIDTIKNQNKQKEKKY------FEDIEAVKEKNDNLQKIIK--LNEETLTETILQY 881
Cdd:TIGR02168  445 LEEELEELQEELERLEEALEELREELEEAEQALDAAERELaqlqarLDSLERLQENLEGFSEGVKalLKNQSGLSGILGV 524
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  882 SGQLNNLTAE-------------NKILNSELENGKQNQERLEiEMESYRCRLAAAVRDCDQSQTARDLKL-----DFQRT 943
Cdd:TIGR02168  525 LSELISVDEGyeaaieaalggrlQAVVVENLNAAKKAIAFLK-QNELGRVTFLPLDSIKGTEIQGNDREIlknieGFLGV 603
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  944 RQEWVRLHDKMKVDMSG----------LQAKNEILseKLSNAESKINSLQIQLHNTRDAL-----GRESLILERvQRDLS 1008
Cdd:TIGR02168  604 AKDLVKFDPKLRKALSYllggvlvvddLDNALELA--KKLRPGYRIVTLDGDLVRPGGVItggsaKTNSSILER-RREIE 680
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1009 QTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNLqAES 1088
Cdd:TIGR02168  681 ELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKEL-TEL 759
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1089 EKQILSLQEKNKELMDEYNHLKERMDQCEKEKAGRKIDLTEAQETVPS-----RCLHLDAENEVLQLQQTLFSMKAIQKQ 1163
Cdd:TIGR02168  760 EAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDElraelTLLNEEAANLRERLESLERRIAATERR 839
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1164 CETLQKNKKQLKQEVVNLKSYMERnmLERGKAEWHK----LLIEERARKEIEEKLNEAILTLQKQAAVSHEQLVQLRE-- 1237
Cdd:TIGR02168  840 LEDLEEQIEELSEDIESLAAEIEE--LEELIEELESeleaLLNERASLEEALALLRSELEELSEELRELESKRSELRRel 917
                          650       660       670       680       690       700
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1833563309 1238 DNTTSIKTQMELTIKDLESEISRIKTSQADFNKTELERYKELYLEEVKVRESLSNELSRTNEMIAE 1303
Cdd:TIGR02168  918 EELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKE 983
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
649-1318 2.50e-12

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 72.01  E-value: 2.50e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  649 LKLKNNHCDQLTVKLKQMENMV-------SVLQNELSETKKTKLQLELQKIEWEKELYDLRLALKQENEEKRNADMLYNK 721
Cdd:TIGR02168  234 LEELREELEELQEELKEAEEELeeltaelQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLAN 313
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  722 DSEQLRIKEEECGKVVETKQQLKWNLRRLVKELRTVRNNLDLvVQERNDAQKQLSEEQDARI--LQDQILT--SKQKELE 797
Cdd:TIGR02168  314 LERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELES-LEAELEELEAELEELESRLeeLEEQLETlrSKVAQLE 392
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  798 MARKKMNSEISHRHQKEKDLFHEDCMLQEEI-----ALLRLEIDTIKNQNKQKEKKYFEDIEAVKEKNDNLQKIIKLNEE 872
Cdd:TIGR02168  393 LQIASLNNEIERLEARLERLEDRRERLQQEIeellkKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEE 472
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  873 tLTETILQYSGQLNNLTAENKILNSELENG----------KQNQERLE---------IEMES-YRCRLAAAVRD------ 926
Cdd:TIGR02168  473 -AEQALDAAERELAQLQARLDSLERLQENLegfsegvkalLKNQSGLSgilgvlselISVDEgYEAAIEAALGGrlqavv 551
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  927 CDQSQTARD---------------LKLD--------------------FQRTRQEWVRLHDKMKVDMSG----------L 961
Cdd:TIGR02168  552 VENLNAAKKaiaflkqnelgrvtfLPLDsikgteiqgndreilkniegFLGVAKDLVKFDPKLRKALSYllggvlvvddL 631
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  962 QAKNEILseKLSNAESKINSLQIQLHNTRDAL-----GRESLILERvQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQE 1036
Cdd:TIGR02168  632 DNALELA--KKLRPGYRIVTLDGDLVRPGGVItggsaKTNSSILER-RREIEELEEKIEELEEKIAELEKALAELRKELE 708
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1037 SVEERLSQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNLqAESEKQILSLQEKNKELMDEYNHLKERMDQC 1116
Cdd:TIGR02168  709 ELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKEL-TELEAEIEELEERLEEAEEELAEAEAEIEEL 787
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1117 EKEKAGRKIDLTEAQETVPS-----RCLHLDAENEVLQLQQTLFSMKAIQKQCETLQKNKKQLKQEVVNLKSYMERnmle 1191
Cdd:TIGR02168  788 EAQIEQLKEELKALREALDElraelTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEE---- 863
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1192 rgkaewhklliEERARKEIEEKLNEailtLQKQAAVSHEQLVQLREDNTTSIKTQ--MELTIKDLESEISRIKTSQADFN 1269
Cdd:TIGR02168  864 -----------LEELIEELESELEA----LLNERASLEEALALLRSELEELSEELreLESKRSELRRELEELREKLAQLE 928
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*....
gi 1833563309 1270 ktelERYKELYLEEVKVRESLSNELSRTNEMIAEVSTQLTVEKEQTRSR 1318
Cdd:TIGR02168  929 ----LRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRR 973
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
466-1279 2.69e-12

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 72.01  E-value: 2.69e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  466 YKSLKPKLENLsslppdsDRTSEVYLHEELQQDMQKFKNEVNTLEEEFLALKKEDVQLHKDVEEEMEKHRSNSTELSGTL 545
Cdd:TIGR02168  215 YKELKAELREL-------ELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQ 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  546 TDGTTVGNDDDGLNQQI----PRKENGEHDRPADKTSNEKNEVKNQIYpEADFADSMEPSEIASEDCElshsvyeNFMLL 621
Cdd:TIGR02168  288 KELYALANEISRLEQQKqilrERLANLERQLEELEAQLEELESKLDEL-AEELAELEEKLEELKEELE-------SLEAE 359
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  622 IEQLRMEYKDSASLPRIQDtfclcEHLLKLKNnhcdqltvKLKQMENMVSVLQNELSETKKTKLQLELQkiewekelydl 701
Cdd:TIGR02168  360 LEELEAELEELESRLEELE-----EQLETLRS--------KVAQLELQIASLNNEIERLEARLERLEDR----------- 415
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  702 rlalkQENEEKRNADMLYNKDSEQLRIKEEECGKVVETKQQLKWNLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDA 781
Cdd:TIGR02168  416 -----RERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQAR 490
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  782 RILQDQILTSKQKELEMARKKMN--SEISHRHQKEKDLFHEDCMLQEEI-ALLRLEIDTIknqNKQKEKKYFEDIEAVKE 858
Cdd:TIGR02168  491 LDSLERLQENLEGFSEGVKALLKnqSGLSGILGVLSELISVDEGYEAAIeAALGGRLQAV---VVENLNAAKKAIAFLKQ 567
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  859 KNDNLQKIIKLNeeTLTETILQYSGqLNNLTAENKILNSELENGKQNqERLEIEMESyrcrLAAAVRDCDQSQTARDLK- 937
Cdd:TIGR02168  568 NELGRVTFLPLD--SIKGTEIQGND-REILKNIEGFLGVAKDLVKFD-PKLRKALSY----LLGGVLVVDDLDNALELAk 639
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  938 ----------LDFQRTRQEWVRLHDKMKVDMSGLQAKNEI--LSEKLSNAESKINSLQIQLHNTRDALGRESLILERVQR 1005
Cdd:TIGR02168  640 klrpgyrivtLDGDLVRPGGVITGGSAKTNSSILERRREIeeLEEKIEELEEKIAELEKALAELRKELEELEEELEQLRK 719
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1006 DLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNLQ 1085
Cdd:TIGR02168  720 ELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELK 799
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1086 AeSEKQILSLQEKNKELMDEYNHLKERMDQCEKEKAGRKIDLTEAQETvpSRCLHLDAENEVLQLQQTLFSMKAIQKQCE 1165
Cdd:TIGR02168  800 A-LREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQ--IEELSEDIESLAAEIEELEELIEELESELE 876
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1166 TLQKNKKQLKQEVVNLKSYMERNMLERGKAEwHKLLIEERARKEIEEKLNEAILTLQKQAAVSHEQLVQLREDNTTSIKT 1245
Cdd:TIGR02168  877 ALLNERASLEEALALLRSELEELSEELRELE-SKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEE 955
                          810       820       830       840
                   ....*....|....*....|....*....|....*....|...
gi 1833563309 1246 QMELT------IKDLESEISRIKTSQADF---NKTELERYKEL 1279
Cdd:TIGR02168  956 AEALEnkieddEEEARRRLKRLENKIKELgpvNLAAIEEYEEL 998
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
730-1120 3.73e-12

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 71.25  E-value: 3.73e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  730 EEECGKVVETKQQLKWNLRRLVKELRTVRNNLDLVVQERNDAQKQLSE-EQDARILQDQILTSKQ--KELEMARKKMNSE 806
Cdd:TIGR02169  673 PAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEiEKEIEQLEQEEEKLKErlEELEEDLSSLEQE 752
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  807 ISHRHQKEKDLFHEDCMLQEEIALLRLEIDTIKNQNKQkekkyfEDIEAVKEKNDNLQKIIKLNEETLTETilqySGQLN 886
Cdd:TIGR02169  753 IENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH------SRIPEIQAELSKLEEEVSRIEARLREI----EQKLN 822
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  887 NLTAENKILNSELENGKQNQERLEIEMESYRCRLAAAVRDcdqsqtARDLKLDFQRTRQEWVRLHDKMKvdmsGLQAKNE 966
Cdd:TIGR02169  823 RLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGK------KEELEEELEELEAALRDLESRLG----DLKKERD 892
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  967 ILSEKLSNAESKINSLQIQLHNTRDALGRESLILERVQRDLSQTQCQKKEtEQMYQIEQSKLKKYIAKQESVEERLSQLQ 1046
Cdd:TIGR02169  893 ELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGE-DEEIPEEELSLEDVQAELQRVEEEIRALE 971
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1833563309 1047 SENMLLRQQLDdahkkansqektsstiqdqfhsaaknlqaESEKQILSLQEKNKELMDEYNHLKERMDQCEKEK 1120
Cdd:TIGR02169  972 PVNMLAIQEYE-----------------------------EVLKRLDELKEKRAKLEEERKAILERIEEYEKKK 1016
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
762-1134 2.23e-11

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 68.93  E-value: 2.23e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  762 DLVVQERNDAQKQLSEEQDARILQDQILTSKQKELEMARKKMNSEISHRHQKEKDLFHEDCMLQEEIALLRLEIDTIKNQ 841
Cdd:TIGR02168  669 NSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEER 748
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  842 NKQKEkkyFEDIEAVKEKNDNLQKIIKLNE---------ETLTETILQYSGQLNNLTAENKILNSELENGKQNQERLEIE 912
Cdd:TIGR02168  749 IAQLS---KELTELEAEIEELEERLEEAEEelaeaeaeiEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRER 825
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  913 MESYRCRLAAAvrdcdqsqtardlkldfQRTRQEWVRLHDKMKVDMSGLQAKNEILSEKLSNAESKINSLQIQLHNTRDA 992
Cdd:TIGR02168  826 LESLERRIAAT-----------------ERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEA 888
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  993 LGRESLILERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSenmllrqqlddahkKANSQEKTSST 1072
Cdd:TIGR02168  889 LALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQE--------------RLSEEYSLTLE 954
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1833563309 1073 IQDQFHSAAKNLQAESEKQILSLQEKNKEL-------MDEYNHLKERMDQCEKEKAgrkiDLTEAQETV 1134
Cdd:TIGR02168  955 EAEALENKIEDDEEEARRRLKRLENKIKELgpvnlaaIEEYEELKERYDFLTAQKE----DLTEAKETL 1019
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
746-1065 3.19e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 58.53  E-value: 3.19e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  746 NLRRL---VKELRTVRNNLDLVVQ--ERNDAQKQLSEEQDARILQDQI--LTSKQKELEMARKKMNSEIsHRHQKEKDlf 818
Cdd:TIGR02168  187 NLDRLediLNELERQLKSLERQAEkaERYKELKAELRELELALLVLRLeeLREELEELQEELKEAEEEL-EELTAELQ-- 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  819 hedcMLQEEIALLRLEIDTIKNQNKQKEKKYFEDIEAVKEKNDNLQkIIKLNEETLTETILQYSGQLNNLTAENKILNSE 898
Cdd:TIGR02168  264 ----ELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQ-ILRERLANLERQLEELEAQLEELESKLDELAEE 338
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  899 LENGKQNQERLEIEMESYRCRLAAavrdcdqsqtARDLKLDFQRTRQEWVRLHDKMKVDMSGLQAKNEILSEKLSNAESK 978
Cdd:TIGR02168  339 LAELEEKLEELKEELESLEAELEE----------LEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEAR 408
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  979 INSLQIQLHNTRDALGRESLILERVQRDLSQTQCQKKETEqmyqieqskLKKYIAKQESVEERLSQLQSENMLLRQQLDD 1058
Cdd:TIGR02168  409 LERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEE---------LEELQEELERLEEALEELREELEEAEQALDA 479

                   ....*..
gi 1833563309 1059 AHKKANS 1065
Cdd:TIGR02168  480 AERELAQ 486
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
649-1285 4.61e-08

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 57.72  E-value: 4.61e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  649 LKLKNNHCDQLTVKLKQMENMVSVLQNELSETKKTKLQLELQKIEWEKELYDLRLALKQENEEKRNADML------YNKD 722
Cdd:TIGR04523  133 KKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKLLKLELLlsnlkkKIQK 212
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  723 SEQLRIK----EEECGKVVETKQQLKWNLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDARILQDQILTSKQKELem 798
Cdd:TIGR04523  213 NKSLESQiselKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNNKKIKELEKQL-- 290
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  799 arKKMNSEISH-RHQKEKDLFHEdcmlqeeiallrleidtiknqnkqkekkYFEDIEAVKEKNDNLQKIIKLNEET---L 874
Cdd:TIGR04523  291 --NQLKSEISDlNNQKEQDWNKE----------------------------LKSELKNQEKKLEEIQNQISQNNKIisqL 340
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  875 TETILQYSGQLNNLTAENKILNSELENGKQNQERLEIEMESYRCRLAaavrdcDQSQTARDLKLDFQRTRQEWVRLHDKM 954
Cdd:TIGR04523  341 NEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIK------NLESQINDLESKIQNQEKLNQQKDEQI 414
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  955 KVdmsgLQAKNEILSEKLSNAESKINSLQIQLHNTRDALGRESLILERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAK 1034
Cdd:TIGR04523  415 KK----LQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKE 490
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1035 QESVEERLSQLQSENMLLRQQLDDAHKKAnsqektsstiqdqfhSAAKNLQAESEKQILSLQEKNKELMDEYNHLKERMD 1114
Cdd:TIGR04523  491 LKSKEKELKKLNEEKKELEEKVKDLTKKI---------------SSLKEKIEKLESEKKEKESKISDLEDELNKDDFELK 555
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1115 QCEKEKAGRKIDlteaqetvpsrclhldaeNEVLQLQQTLFSMKAIQKQcetLQKNKKQLKQEVVNLKSYMERNMLERGK 1194
Cdd:TIGR04523  556 KENLEKEIDEKN------------------KEIEELKQTQKSLKKKQEE---KQELIDQKEKEKKDLIKEIEEKEKKISS 614
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1195 AEWHKLLIEERARK--EIEEKLNEAILTLQKQAAVSHEQLvqlreDNTTSIKTQMELTIKDLESEISRIKTSQADFNKTE 1272
Cdd:TIGR04523  615 LEKELEKAKKENEKlsSIIKNIKSKKNKLKQEVKQIKETI-----KEIRNKWPEIIKKIKESKTKIDDIIELMKDWLKEL 689
                          650
                   ....*....|...
gi 1833563309 1273 LERYKELYLEEVK 1285
Cdd:TIGR04523  690 SLHYKKYITRMIR 702
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
976-1313 9.75e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 56.99  E-value: 9.75e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  976 ESKINSLQIQ---------LHNTRDALGRESLI--LERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQ 1044
Cdd:TIGR02168  199 ERQLKSLERQaekaerykeLKAELRELELALLVlrLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSE 278
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1045 LQSENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNlQAESEKQILSLQEKNKELMDEYNHLKERMDQCEKEKAGRK 1124
Cdd:TIGR02168  279 LEEEIEELQKELYALANEISRLEQQKQILRERLANLERQ-LEELEAQLEELESKLDELAEELAELEEKLEELKEELESLE 357
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1125 IDLTEAQETVPSRCLHLDAENEVLQ-----LQQTLFSMKAIQKQCETLQKNKKQLKQEVVNLKSYMERNMLERGKAEWHK 1199
Cdd:TIGR02168  358 AELEELEAELEELESRLEELEEQLEtlrskVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKE 437
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1200 LLIEERARKEIEEKLNEAILTLQKQAAVSHEQLVQLREDnttsiktqmeltIKDLESEISRIKtSQADFNKTELERYKEL 1279
Cdd:TIGR02168  438 LQAELEELEEELEELQEELERLEEALEELREELEEAEQA------------LDAAERELAQLQ-ARLDSLERLQENLEGF 504
                          330       340       350
                   ....*....|....*....|....*....|....
gi 1833563309 1280 YlEEVKvreSLSNELSRTNEMIAEVSTQLTVEKE 1313
Cdd:TIGR02168  505 S-EGVK---ALLKNQSGLSGILGVLSELISVDEG 534
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
765-1308 1.05e-07

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 56.67  E-value: 1.05e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  765 VQERNDAQKQLSEEQDARILQDQI-LTSKQKELEMARKKMNSEISHRHQKEKDLFHEdcmLQEEIallrleidtiknqnk 843
Cdd:pfam15921   90 LQRRLNESNELHEKQKFYLRQSVIdLQTKLQEMQMERDAMADIRRRESQSQEDLRNQ---LQNTV--------------- 151
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  844 qkekkyfEDIEAVKEKNDNLQKIIKLNEETLTETILQYSGQLNNLtaENKILNSELENGKQNQERLEIEMESYRcRLAAA 923
Cdd:pfam15921  152 -------HELEAAKCLKEDMLEDSNTQIEQLRKMMLSHEGVLQEI--RSILVDFEEASGKKIYEHDSMSTMHFR-SLGSA 221
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  924 VrdcdqSQTARDLKLDFQRTRQEWVRLHDKMKVDMSGLQAKNEIL----------------------SEKLSNAESKINS 981
Cdd:pfam15921  222 I-----SKILRELDTEISYLKGRIFPVEDQLEALKSESQNKIELLlqqhqdrieqliseheveitglTEKASSARSQANS 296
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  982 LQIQLHntrdalgresLILERVQRDLSQTQCQKKETEQMYQIEQSKLKKyiaKQESVEERLSQLQSENMLlrqqlddahk 1061
Cdd:pfam15921  297 IQSQLE----------IIQEQARNQNSMYMRQLSDLESTVSQLRSELRE---AKRMYEDKIEELEKQLVL---------- 353
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1062 kANSQEKTSSTIQDQFHSAAKNLQAESEKQILSLQEKNKELMDEynhlKERMDQCEKEKAGRKIDLTEAQETVPSRCLhl 1141
Cdd:pfam15921  354 -ANSELTEARTERDQFSQESGNLDDQLQKLLADLHKREKELSLE----KEQNKRLWDRDTGNSITIDHLRRELDDRNM-- 426
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1142 daenEVLQLQQTLFSMKA-----IQKQCETLQKNKKQLkQEVVNLKSYMERNmlergkaewhklliEERARKEIEEklne 1216
Cdd:pfam15921  427 ----EVQRLEALLKAMKSecqgqMERQMAAIQGKNESL-EKVSSLTAQLEST--------------KEMLRKVVEE---- 483
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1217 aiLTLQKQAAVSHEQLVQlreDNTTSIKtQMELTIKDLESEISRIKtSQADFNKTELERYK--ELYLEEVKVR-ESLSNE 1293
Cdd:pfam15921  484 --LTAKKMTLESSERTVS---DLTASLQ-EKERAIEATNAEITKLR-SRVDLKLQELQHLKneGDHLRNVQTEcEALKLQ 556
                          570
                   ....*....|....*
gi 1833563309 1294 LSRTNEMIAEVSTQL 1308
Cdd:pfam15921  557 MAEKDKVIEILRQQI 571
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
890-1234 2.22e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 55.71  E-value: 2.22e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  890 AENKilnseLENGKQNQERLEI---EMESYRCRLAaavRDCDQSQTARDLKLDFQRTRQEWVRLHDK--------MKVDM 958
Cdd:COG1196    177 AERK-----LEATEENLERLEDilgELERQLEPLE---RQAEKAERYRELKEELKELEAELLLLKLReleaeleeLEAEL 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  959 SGLQAKNEILSEKLSNAESKINSLQIQLHNTRDAL----GRESLI---LERVQRDLSQTQCQKKETEQMYQIEQSKLKKY 1031
Cdd:COG1196    249 EELEAELEELEAELAELEAELEELRLELEELELELeeaqAEEYELlaeLARLEQDIARLEERRRELEERLEELEEELAEL 328
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1032 IAKQESVEERLSQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNLQAEsEKQILSLQEKNKELMDEYNHLKE 1111
Cdd:COG1196    329 EEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEEL-AEELLEALRAAAELAAQLEELEE 407
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1112 RMDQCEKEKAGRKIDLTEAQETVpsrclhLDAENEVLQLQQTLfsmKAIQKQCETLQKNKKQLKQEVVNLKsymernmle 1191
Cdd:COG1196    408 AEEALLERLERLEEELEELEEAL------AELEEEEEEEEEAL---EEAAEEEAELEEEEEALLELLAELL--------- 469
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1833563309 1192 rgkaEWHKLLIEERARKEIEEKLNEAILTLQKQAAVSHEQLVQ 1234
Cdd:COG1196    470 ----EEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLE 508
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
658-1318 2.95e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 55.45  E-value: 2.95e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  658 QLTVKLKQMENMVSVLQNELSETKKTKLQLELQKIEWEKELYDLRLALKQEN-EEKRNADmlyNKDSEQLRIKEEECGKV 736
Cdd:TIGR02168  217 ELKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKlEELRLEV---SELEEEIEELQKELYAL 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  737 VETKQQLKWNLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDARILQDQILTSKQKELEMAR---KKMNSEISHRHQK 813
Cdd:TIGR02168  294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEaelEELEAELEELESR 373
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  814 EKDLFHEDCMLQEEIALLRLEIDTIKNQNKQKEkkyfEDIEAVKEKNDNLQKIIKLNEETLTETILQYSG--------QL 885
Cdd:TIGR02168  374 LEELEEQLETLRSKVAQLELQIASLNNEIERLE----ARLERLEDRRERLQQEIEELLKKLEEAELKELQaeleeleeEL 449
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  886 NNLTAENKILNSELENGKQNQERLEIEMESYRCRLAAAVRDCDQSQTARDLKLDFQRTRQEWV-----------RLHDKM 954
Cdd:TIGR02168  450 EELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLknqsglsgilgVLSELI 529
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  955 KVDMSGLQAKNEILSEKLSNAESKINSLQIQL--HNTRDALGRESLILERVQRD--LSQTQCQKKETEQMYQIEQSKLKK 1030
Cdd:TIGR02168  530 SVDEGYEAAIEAALGGRLQAVVVENLNAAKKAiaFLKQNELGRVTFLPLDSIKGteIQGNDREILKNIEGFLGVAKDLVK 609
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1031 YIAK-QESVEERLSQLqsenmLLRQQLDDAHKKANSQEKTSS--TIQDQFHSAAKNLQAESEKQILSLQEKNKELmdeyN 1107
Cdd:TIGR02168  610 FDPKlRKALSYLLGGV-----LVVDDLDNALELAKKLRPGYRivTLDGDLVRPGGVITGGSAKTNSSILERRREI----E 680
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1108 HLKERMDQCEKEKAGRKIDLTEAQETvpsrclHLDAENEVLQLQQTLFSM-----------KAIQKQCETLQKNKKQLKQ 1176
Cdd:TIGR02168  681 ELEEKIEELEEKIAELEKALAELRKE------LEELEEELEQLRKELEELsrqisalrkdlARLEAEVEQLEERIAQLSK 754
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1177 EVVNLKSYMERNMLERGKAEWHKLLIEE-------------RARKEIEEKLNE--AILTLQKQAAVSHEQLVQLREDNTT 1241
Cdd:TIGR02168  755 ELTELEAEIEELEERLEEAEEELAEAEAeieeleaqieqlkEELKALREALDElrAELTLLNEEAANLRERLESLERRIA 834
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1242 SIKTQME---LTIKDLESEISRIKTSQADF--NKTELERYKELYLEEVKVRESLSNELSRtnEMIAEVSTQLTVEKEQTR 1316
Cdd:TIGR02168  835 ATERRLEdleEQIEELSEDIESLAAEIEELeeLIEELESELEALLNERASLEEALALLRS--ELEELSEELRELESKRSE 912

                   ..
gi 1833563309 1317 SR 1318
Cdd:TIGR02168  913 LR 914
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
747-1238 7.63e-07

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 54.15  E-value: 7.63e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  747 LRRLVKELRTVRNNLDLVVQERNDAQKQLSE-EQDARILQDQILTSKQKELEMARKkmnsEISHRHQKEKDLFHEDCMLQ 825
Cdd:COG4913    290 LELLEAELEELRAELARLEAELERLEARLDAlREELDELEAQIRGNGGDRLEQLER----EIERLERELEERERRRARLE 365
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  826 EEIALLRLEIDTIKNQnkqkekkyFEDI-EAVKEKNDNLQKIIKLNEETLTETIlqysGQLNNLTAENKILNSELENGKQ 904
Cdd:COG4913    366 ALLAALGLPLPASAEE--------FAALrAEAAALLEALEEELEALEEALAEAE----AALRDLRRELRELEAEIASLER 433
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  905 NQERLEIEMESYRCRLAAA----------------VRDCDQS-QTA-----RDLKLD-------FQRTRqEWV-RLHDKM 954
Cdd:COG4913    434 RKSNIPARLLALRDALAEAlgldeaelpfvgelieVRPEEERwRGAiervlGGFALTllvppehYAAAL-RWVnRLHLRG 512
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  955 KVDMSGLQAKNEI----------LSEKLSNAESKI-NSLQIQLHNTRDALGRESL-ILERVQRDLSQTqCQKKETEQMYQ 1022
Cdd:COG4913    513 RLVYERVRTGLPDperprldpdsLAGKLDFKPHPFrAWLEAELGRRFDYVCVDSPeELRRHPRAITRA-GQVKGNGTRHE 591
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1023 IeqsKLKKYIAKQ----ESVEERLSQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNLQ-AESEKQILSLQE 1097
Cdd:COG4913    592 K---DDRRRIRSRyvlgFDNRAKLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEySWDEIDVASAER 668
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1098 KNKELMDEYNHLKERMDqcekekagrkiDLTEAQETVpsrclhLDAENEVLQLQQTLfsmKAIQKQCETLQKNKKQLKQE 1177
Cdd:COG4913    669 EIAELEAELERLDASSD-----------DLAALEEQL------EELEAELEELEEEL---DELKGEIGRLEKELEQAEEE 728
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1833563309 1178 VVNLKSYMERnmLERGKAEWHKLLIEER--------ARKEIEEKLNEAILTLQKQAAVSHEQLVQLRED 1238
Cdd:COG4913    729 LDELQDRLEA--AEDLARLELRALLEERfaaalgdaVERELRENLEERIDALRARLNRAEEELERAMRA 795
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
667-1205 1.07e-06

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 53.69  E-value: 1.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  667 ENMVSVLQNELSEtkktklQLELQKIEWEKELYDLRLALkqENEEKRNADMLYNKDS-EQLRIKEEECGKVVETKQQLKW 745
Cdd:pfam12128  414 EDDLQALESELRE------QLEAGKLEFNEEEYRLKSRL--GELKLRLNQATATPELlLQLENFDERIERAREEQEAANA 485
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  746 NLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDA-RILQDQILTSKQKELEMARKK-----------MNSEISHRHQK 813
Cdd:pfam12128  486 EVERLQSELRQARKRRDQASEALRQASRRLEERQSAlDELELQLFPQAGTLLHFLRKEapdweqsigkvISPELLHRTDL 565
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  814 EKDLFHEDCMLQEEIALLRLEIDTIKNQNkqkekkYFEDIEAVKEKNDNLQKIIKLNEET---LTETILQYSGQLNNLTA 890
Cdd:pfam12128  566 DPEVWDGSVGGELNLYGVKLDLKRIDVPE------WAASEEELRERLDKAEEALQSAREKqaaAEEQLVQANGELEKASR 639
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  891 ENKILNSELENGKQNQERLEIEMESYRCRLaaavrdcdQSQTARDLKLDFQRTRQewvrLHDKMKVDMSGLQA-KNEILS 969
Cdd:pfam12128  640 EETFARTALKNARLDLRRLFDEKQSEKDKK--------NKALAERKDSANERLNS----LEAQLKQLDKKHQAwLEEQKE 707
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  970 EKLSNAESKINSLQIQLHNTRDALGR--ESLILERVQRDLSQTQCQKketeqmyqieqsKLKKYIAKQESVEERLSQLQS 1047
Cdd:pfam12128  708 QKREARTEKQAYWQVVEGALDAQLALlkAAIAARRSGAKAELKALET------------WYKRDLASLGVDPDVIAKLKR 775
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1048 ENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNLQAESEKQILSLQEKNKELMDEYNHLKERMDQCEKE-KAGRK-- 1124
Cdd:pfam12128  776 EIRTLERKIERIAVRRQEVLRYFDWYQETWLQRRPRLATQLSNIERAISELQQQLARLIADTKLRRAKLEMErKASEKqq 855
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1125 IDLTEAQETVPSRC-----LHLDAENEVLQLQQTlFSMKAIQKQCETLQKNKKQLKQEVVNLKSYMERNMLERGKAEWHK 1199
Cdd:pfam12128  856 VRLSENLRGLRCEMsklatLKEDANSEQAQGSIG-ERLAQLEDLKLKRDYLSESVKKYVEHFKNVIADHSGSGLAETWES 934

                   ....*.
gi 1833563309 1200 LLIEER 1205
Cdd:pfam12128  935 LREEDH 940
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
721-1314 1.08e-06

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 53.19  E-value: 1.08e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  721 KDSEQLRIKEEECGKVVETKQQLKWNLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDARILQDQILTSKQ----KEL 796
Cdd:pfam05483  219 EDHEKIQHLEEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIEKKdhltKEL 298
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  797 EMARKKMNSEISHRHQKEKDLfhedCMLQEEIALLRLEIDTIKNQNKQKEKKYFEDIEAVKEKNDNLQKIIKLNEETLTE 876
Cdd:pfam05483  299 EDIKMSLQRSMSTQKALEEDL----QIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEK 374
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  877 TilqySGQLNNLTAENKILNSELENGKQNQERLEIEMESYRCRLAaavrdcdQSQTARDLKLDFQRTRQEwvrlhdkmkv 956
Cdd:pfam05483  375 N----EDQLKIITMELQKKSSELEEMTKFKNNKEVELEELKKILA-------EDEKLLDEKKQFEKIAEE---------- 433
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  957 dmsgLQAKNEILSEKLSNAESKINSLQIQLHNTRdalgreslilervqrdlSQTQCQKKETEQMyqieQSKLKKYIAKQE 1036
Cdd:pfam05483  434 ----LKGKEQELIFLLQAREKEIHDLEIQLTAIK-----------------TSEEHYLKEVEDL----KTELEKEKLKNI 488
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1037 SVEERLSQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQfhsaaknlQAESEKQILSLQEKNKELMDEYNHLKERMDQc 1116
Cdd:pfam05483  489 ELTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQ--------EERMLKQIENLEEKEMNLRDELESVREEFIQ- 559
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1117 EKEKAGRKIDLTEAQetvpSRCLHLDAENEVLQLQQTLFSMKAIQKQCETLQKNKKQLKQE--VVNLKSYMERNMLERGK 1194
Cdd:pfam05483  560 KGDEVKCKLDKSEEN----ARSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEELHQEnkALKKKGSAENKQLNAYE 635
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1195 AEWHKLLIEERARKEieeKLNEAILTLQKQ---AAVSHEQLVQLREDNTT----SIKTQMELTIK---DLESEISRIKTS 1264
Cdd:pfam05483  636 IKVNKLELELASAKQ---KFEEIIDNYQKEiedKKISEEKLLEEVEKAKAiadeAVKLQKEIDKRcqhKIAEMVALMEKH 712
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1833563309 1265 QADFNKTELERYKELYL------EEVKVRESLSNELSRTNEMIAEVSTQLTVEKEQ 1314
Cdd:pfam05483  713 KHQYDKIIEERDSELGLyknkeqEQSSAKAALEIELSNIKAELLSLKKQLEIEKEE 768
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
676-1262 2.00e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 52.63  E-value: 2.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  676 ELSETKKtKLQLELQKIEWEkelyDLRLALKQENEEKRNADMLYNKDSEQLRIKEEEcgkvvetkqqlkwnlrrlVKELR 755
Cdd:COG1196    217 ELKEELK-ELEAELLLLKLR----ELEAELEELEAELEELEAELEELEAELAELEAE------------------LEELR 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  756 TVRNNLDLVVQERNDAQKQLSEEQDARILQDQILTSKQKELEMARKKMNSEISHRHQKEKDlfhedcmLQEEIALLRLEI 835
Cdd:COG1196    274 LELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEE-------LEEELEELEEEL 346
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  836 DTIKNQNKQKEKKYFEDIEAVKEKNDNLQKIIKLnEETLTETILQYSGQLNNLTAENKILNSELENGKQNQERLEIEMES 915
Cdd:COG1196    347 EEAEEELEEAEAELAEAEEALLEAEAELAEAEEE-LEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEE 425
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  916 yrcRLAAAVRDCDQSQTARDLKLDFQRTRQEWVRLHDKMKVDMSGLQAKNEILSEKLSNAESKINSLQIQLHNTRDALGR 995
Cdd:COG1196    426 ---LEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEAD 502
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  996 ESLILERVQRdLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQQLDDAHKKANSQEKT---SST 1072
Cdd:COG1196    503 YEGFLEGVKA-ALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATflpLDK 581
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1073 IQDQFHSAAKNLQAESEKQILSLQEKNKELMDEY----NHLKERMDQCEKEKAGRKIDLTEAQETVPSRclhLDAENEVL 1148
Cdd:COG1196    582 IRARAALAAALARGAIGAAVDLVASDLREADARYyvlgDTLLGRTLVAARLEAALRRAVTLAGRLREVT---LEGEGGSA 658
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1149 QLQQTLFSMKAIQKQCETLQKNKKQLKQEVVNLKSYMERNMLERGKAEwhkLLIEERARKEIEEKLNEAILTLQKQAAVS 1228
Cdd:COG1196    659 GGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEE---RELAEAEEERLEEELEEEALEEQLEAERE 735
                          570       580       590
                   ....*....|....*....|....*....|....*...
gi 1833563309 1229 HEQLVQLREDNTTSI----KTQMELTIKDLESEISRIK 1262
Cdd:COG1196    736 ELLEELLEEEELLEEealeELPEPPDLEELERELERLE 773
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
657-1181 5.54e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 51.09  E-value: 5.54e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  657 DQLTVKLKQMENMVSVLQNELSETKKTKLQLELQKIEWEKELYDLRLALKQENEEKRNADMLYNKDSEQLRIKEEECGKV 736
Cdd:COG1196    249 EELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAEL 328
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  737 VETKQQLKWNLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDARI-LQDQILTSKQKELEMARKKMN--SEISHRHQK 813
Cdd:COG1196    329 EEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAeAEEELEELAEELLEALRAAAElaAQLEELEEA 408
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  814 EKDLFHEDCMLQEEIALLRLEIDTIKNQNKQKEKKYFEDIEAVKEKNDNLQKIIKLNEETLTETILQYSgQLNNLTAENK 893
Cdd:COG1196    409 EEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEA-ALAELLEELA 487
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  894 ILNSELENGKQNQERLEIEMESYRCRLAAAVRDCDQSQTARDLKLDF-----QRTRQEWVRLHDKMKVDMSGLQAKNEIL 968
Cdd:COG1196    488 EAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAayeaaLEAALAAALQNIVVEDDEVAAAAIEYLK 567
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  969 SEKLS-------NAESKINSLQIQLHNTRDALGRESLILERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEER 1041
Cdd:COG1196    568 AAKAGratflplDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLR 647
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1042 LSQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQfhsaakNLQAESEKQILSLQEKNKELMDEYNHLKERMDQCEKEKA 1121
Cdd:COG1196    648 EVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELE------ELAERLAEEELELEEALLAEEEEERELAEAEEERLEEEL 721
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1833563309 1122 gRKIDLTEAQETVPSRCLHLDAENEVLQLQQTLFSMKAIQKQcETLQKNKKQLKQEV-----VNL 1181
Cdd:COG1196    722 -EEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDL-EELERELERLEREIealgpVNL 784
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
649-1314 5.69e-06

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 51.13  E-value: 5.69e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  649 LKLKNNHCDQLTVKLKQMENMVSVLQNELSETKKTKLQLELQKIEWEKELYDLRLALKQENEEKR---NADMLYNKDSEQ 725
Cdd:pfam02463  215 LKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKekkLQEEELKLLAKE 294
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  726 LRIKEEECGKVVETKQQLKWNLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDARILQDQILTSKQKELEMARKKMNS 805
Cdd:pfam02463  295 EEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEE 374
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  806 EISHRHQKEKDLFHEDCMLQEEIALLRLEIDTIKNQNKQKekkyfediEAVKEKNDNLQKIIKLNEETLTETILQYSGQL 885
Cdd:pfam02463  375 LLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELA--------RQLEDLLKEEKKEELEILEEEEESIELKQGKL 446
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  886 NNLTAENKILNSELENGKQNQERLEIEMESYRCRLAAAVRDCDQSqtarDLKLDFQRTRQEWVRLHDKMKVDMSGLQAKN 965
Cdd:pfam02463  447 TEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLS----RQKLEERSQKESKARSGLKVLLALIKDGVGG 522
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  966 EILSEKLSNAE-SKINSLQIQLHNTRDALGRESLILERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQ 1044
Cdd:pfam02463  523 RIISAHGRLGDlGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPI 602
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1045 LQSENMLLRQQLDDAHKKANS--QEKTSSTIQDQFHSAAKNLQAESEKQILSLQEKNKELMDEYNHLKERMDQCEKEKAG 1122
Cdd:pfam02463  603 LNLAQLDKATLEADEDDKRAKvvEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQ 682
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1123 RKIDLTEAQETVPSRCLHLDAENEVLQLQQTLFSMKAIQKQCETLQKNKKQLKQEVVNLKSYMERNMLERGKAEWHKLLI 1202
Cdd:pfam02463  683 EKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEK 762
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1203 EERARKEIE-EKLNEAILTLQKQAAVSHEQLVQLREDNTTSIKTQMELTIKDLESEISRIKTSQADFNKTELERYKELYL 1281
Cdd:pfam02463  763 EEEKSELSLkEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALEL 842
                          650       660       670
                   ....*....|....*....|....*....|...
gi 1833563309 1282 EEVKVRESLSNELSRTNEMIAEVSTQLTVEKEQ 1314
Cdd:pfam02463  843 KEEQKLEKLAEEELERLEEEITKEELLQELLLK 875
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
662-1314 1.31e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 49.63  E-value: 1.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  662 KLKQMENMVSVLQNELsetkKTKlqlelqkiewEKELYDLRLALKQENEEKRNAdmlyNKDSEQLRIKEEEC-GKVVETK 740
Cdd:TIGR04523   34 EEKQLEKKLKTIKNEL----KNK----------EKELKNLDKNLNKDEEKINNS----NNKIKILEQQIKDLnDKLKKNK 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  741 QQLK---WNLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDARILQDQILTSKQKELEMARKKMNSEISHRHQKEKDL 817
Cdd:TIGR04523   96 DKINklnSDLSKINSEIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENEL 175
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  818 FhedcMLQEEIALLRLEIDTIKNQNKQKEKKYFeDIEAVKEKNDNLQ-KIIKLNEE--TLTETILQYSGQLNNLTAENKI 894
Cdd:TIGR04523  176 N----LLEKEKLNIQKNIDKIKNKLLKLELLLS-NLKKKIQKNKSLEsQISELKKQnnQLKDNIEKKQQEINEKTTEISN 250
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  895 LNSELENGKQNQERLEIEMESYRCRLAAAvrdcdqSQTARDLKLDFQRTRQEWVRLHDKMKVDMsglqakNEILSEKLSN 974
Cdd:TIGR04523  251 TQTQLNQLKDEQNKIKKQLSEKQKELEQN------NKKIKELEKQLNQLKSEISDLNNQKEQDW------NKELKSELKN 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  975 AESKINSLQIQLHNTRDALGRESLILERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQ 1054
Cdd:TIGR04523  319 QEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLES 398
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1055 QLDDahkkansQEKTSSTIQDQFHSAAKNLQaESEKQILSLQEKNKELMDEYNHLKErmDQCEKEKAGRKIDLTEAQETV 1134
Cdd:TIGR04523  399 KIQN-------QEKLNQQKDEQIKKLQQEKE-LLEKEIERLKETIIKNNSEIKDLTN--QDSVKELIIKNLDNTRESLET 468
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1135 PSRCLHLDAENEVLQLQQTLFSMKAIQKQCETLQKNKKQLKQEVVNLKSYMERNMLERGKAEWHKLLIEERArKEIEEKL 1214
Cdd:TIGR04523  469 QLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKI-SDLEDEL 547
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1215 NE-----AILTLQKQAAVSHEQLVQLREDNTTSIKTQMELT--IKDLESEISRIKTSQADFNKTELERYKELYLEEvKVR 1287
Cdd:TIGR04523  548 NKddfelKKENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQelIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAK-KEN 626
                          650       660
                   ....*....|....*....|....*..
gi 1833563309 1288 ESLSNELSRTNEMIAEVSTQLTVEKEQ 1314
Cdd:TIGR04523  627 EKLSSIIKNIKSKKNKLKQEVKQIKET 653
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
658-1297 2.63e-05

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 49.02  E-value: 2.63e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  658 QLTVKLKQMENMVSVLQNEL-------SETKKTKLQLELQKIEWEKELYDLRLALkqENEEKRNADMLYNKDSEQLRIKE 730
Cdd:pfam01576   30 ELEKKHQQLCEEKNALQEQLqaetelcAEAEEMRARLAARKQELEEILHELESRL--EEEEERSQQLQNEKKKMQQHIQD 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  731 EECGKVVETKQQLKWNLRRLVKELRTVRNNLDLVVQErnDAQKQLSEEqdARILQDQI--LTS-------KQKELEMARK 801
Cdd:pfam01576  108 LEEQLDEEEAARQKLQLEKVTTEAKIKKLEEDILLLE--DQNSKLSKE--RKLLEERIseFTSnlaeeeeKAKSLSKLKN 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  802 K---MNSEISHRHQKE-----------KDLFHEDCMLQEEIALLRLEIDTIKNqnkqkekkyfedieAVKEKNDNLQKII 867
Cdd:pfam01576  184 KheaMISDLEERLKKEekgrqelekakRKLEGESTDLQEQIAELQAQIAELRA--------------QLAKKEEELQAAL 249
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  868 -KLNEETLTETILQYSgqLNNLTAENKILNSELENGKQNQER-------LEIEMESYRCRLAAAVRDCDQSQTARDLK-- 937
Cdd:pfam01576  250 aRLEEETAQKNNALKK--IRELEAQISELQEDLESERAARNKaekqrrdLGEELEALKTELEDTLDTTAAQQELRSKReq 327
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  938 --LDFQRTRQEWVRLHDKMKVDMSGLQAKN-EILSEKLSNAE-SKINslqiqLHNTRDALGRESLILERVQRDLSQ---- 1009
Cdd:pfam01576  328 evTELKKALEEETRSHEAQLQEMRQKHTQAlEELTEQLEQAKrNKAN-----LEKAKQALESENAELQAELRTLQQakqd 402
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1010 TQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNLQAESe 1089
Cdd:pfam01576  403 SEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQLQDTQELLQEET- 481
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1090 KQILSLQEKNKELMDEYNHLKERMDQCE--KEKAGRKIDLTEAQETVPSRCLHLDAEN-EVLQ---------LQQTLFSM 1157
Cdd:pfam01576  482 RQKLNLSTRLRQLEDERNSLQEQLEEEEeaKRNVERQLSTLQAQLSDMKKKLEEDAGTlEALEegkkrlqreLEALTQQL 561
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1158 KAIQKQCETLQKNKKQLKQEVVNLKSYMERNM-----LERGKAEWHKLLIEERA----------RKEIEEKLNEAiltlq 1222
Cdd:pfam01576  562 EEKAAAYDKLEKTKNRLQQELDDLLVDLDHQRqlvsnLEKKQKKFDQMLAEEKAisaryaeerdRAEAEAREKET----- 636
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1223 KQAAVSHEQlvqlreDNTTSIKTQMELTIKDLESEISRIKTSQADFNKT--ELERYK---ELYLEEVKVR-ESLSNELSR 1296
Cdd:pfam01576  637 RALSLARAL------EEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNvhELERSKralEQQVEEMKTQlEELEDELQA 710

                   .
gi 1833563309 1297 T 1297
Cdd:pfam01576  711 T 711
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
746-1319 4.06e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 48.14  E-value: 4.06e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  746 NLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDARILQDQILTSKQKELEMARKKMnSEISHRHQKEKDLFHEDCMLQ 825
Cdd:PRK03918   166 NLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREEL-EKLEKEVKELEELKEEIEELE 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  826 EEIALLRLEIDTIKNQNKQKEkkyfEDIEAVKEKNDNLQKIIKLNEEtltetilqysgqLNNLTAENKILNSELENGKQN 905
Cdd:PRK03918   245 KELESLEGSKRKLEEKIRELE----ERIEELKKEIEELEEKVKELKE------------LKEKAEEYIKLSEFYEEYLDE 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  906 QERLEIEMESYRCRLAAAVRDCDqsqtardlKLDFQRTRQEWvrlhdkMKVDMSGLQAKNEILSEKLSNAEsKINSLQIQ 985
Cdd:PRK03918   309 LREIEKRLSRLEEEINGIEERIK--------ELEEKEERLEE------LKKKLKELEKRLEELEERHELYE-EAKAKKEE 373
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  986 LHNTRDALGRESLilERVQRDLSQTQCQKKE-TEQMYQIEQSK--LKKYIAKQESVEERLSQLQSENMLLRQQLDDAHKK 1062
Cdd:PRK03918   374 LERLKKRLTGLTP--EKLEKELEELEKAKEEiEEEISKITARIgeLKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRK 451
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1063 A---------NSQEKTSSTIQDQFHSAAKNLQaESEKQILSLQE--KNKELMDEYNHLKERMDQCEKEKAGRKidlTEAQ 1131
Cdd:PRK03918   452 ElleeytaelKRIEKELKEIEEKERKLRKELR-ELEKVLKKESEliKLKELAEQLKELEEKLKKYNLEELEKK---AEEY 527
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1132 ETVPSRCLHLdaENEVLQLQQTLFSMKAIQKQCETLQKNKKQLKQEVVNLksymERNMLERGKAEWHKLlieERARKEIE 1211
Cdd:PRK03918   528 EKLKEKLIKL--KGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAEL----LKELEELGFESVEEL---EERLKELE 598
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1212 EKLNEAI-LTLQKQAAVSHEQLVQLREDNTTSIKTQMELTIKDLESEISRIKTSQADFNKTELERYKELYLEEVKVRESL 1290
Cdd:PRK03918   599 PFYNEYLeLKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGL 678
                          570       580
                   ....*....|....*....|....*....
gi 1833563309 1291 SNELSRTNEMIAEVSTQLTVEKEQTRSRS 1319
Cdd:PRK03918   679 RAELEELEKRREEIKKTLEKLKEELEERE 707
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1026-1328 6.87e-05

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 47.76  E-value: 6.87e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1026 SKLKKYIAKQESVEERLSQLQSENMLLRQQLDDAHKKANSQEK----TSSTIQDQFHSAAKNLQAeSEKQILSLQEKNKE 1101
Cdd:TIGR02169  170 RKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERyqalLKEKREYEGYELLKEKEA-LERQKEAIERQLAS 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1102 LMDEYNHLKERMDQCEKEKAGRKIDLTEAQETVPSRclhldAENEVLQLQQTLFSMKAIQKQC-----------ETLQKN 1170
Cdd:TIGR02169  249 LEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDL-----GEEEQLRVKEKIGELEAEIASLersiaekerelEDAEER 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1171 KKQLKQEVVNLKSYME--RNMLERGKAEWHKLLIEERARKEIEEKLNEAILTLQKQAAVSHEQLVQLRE----------- 1237
Cdd:TIGR02169  324 LAKLEAEIDKLLAEIEelEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREkleklkreine 403
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1238 -----DNTTSIKTQMELTIKDLESEISRIKTSQADFNKTELERYKELYLEE------VKVRESLSNELSRTNEMIAEVST 1306
Cdd:TIGR02169  404 lkrelDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEwkleqlAADLSKYEQELYDLKEEYDRVEK 483
                          330       340
                   ....*....|....*....|..
gi 1833563309 1307 QLTvEKEQTRSRSLFTAYATRP 1328
Cdd:TIGR02169  484 ELS-KLQRELAEAEAQARASEE 504
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
658-1261 9.36e-05

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 47.24  E-value: 9.36e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  658 QLTVKLKQMENMVSVLQNELSETKKTKLQLELQKIEWEKELYDLRLALKQ-ENEEKRNADMLYNKDSEQLRIKEEECGKV 736
Cdd:COG1196    217 ELKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEaELEELRLELEELELELEEAQAEEYELLAE 296
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  737 VETKQQlkwNLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDARILQDQILTSKQKELEMARKKMNSEISHRHQKEKD 816
Cdd:COG1196    297 LARLEQ---DIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAE 373
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  817 LfhedcmLQEEIALLRLEIdtiknqnkqkekkyfEDIEAVKEKNDNLQKIiklneETLTETILQYSGQLNNLTAENKILN 896
Cdd:COG1196    374 L------AEAEEELEELAE---------------ELLEALRAAAELAAQL-----EELEEAEEALLERLERLEEELEELE 427
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  897 SELENGKQNQERLEIEMESyrcRLAAAVRDCDQSQTARDLKLDFQRTRQEWVRLHDKMKVDMSGLQAKNEILSEKLSNAE 976
Cdd:COG1196    428 EALAELEEEEEEEEEALEE---AAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYE 504
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  977 SKINSLQIQLHNTRDALGRESLILERvqrdlsqtQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQQL 1056
Cdd:COG1196    505 GFLEGVKAALLLAGLRGLAGAVAVLI--------GVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATF 576
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1057 DDAHkKANSQEKTSSTIQDQFHSAAKNLQAESEKQILSLQEKNKELMDEYNHLKERMDQCEKEKAGRKIDLTEAQETVPS 1136
Cdd:COG1196    577 LPLD-KIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEG 655
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1137 RCLHLDAENEVLQLQQTLFSMKAIQKQCETLQKNKKQLKQEVVNLKSYMERNMLERGKAEWHKLLIEERARKEIEEKLNE 1216
Cdd:COG1196    656 GSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAERE 735
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|....*
gi 1833563309 1217 AILTLQKQAAVSHEQLVQLREDNTTSIKTQmELTIKDLESEISRI 1261
Cdd:COG1196    736 ELLEELLEEEELLEEEALEELPEPPDLEEL-ERELERLEREIEAL 779
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
664-1303 1.16e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 46.89  E-value: 1.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  664 KQMENMVSVLQNELSETKKTKLQLELQKIEWEKELYDLRLALKQENEEKRNADMLYNKDSEQLRIKEEECGKVVETKQQL 743
Cdd:TIGR00618  141 KTFTRVVLLPQGEFAQFLKAKSKEKKELLMNLFPLDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHER 220
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  744 KWNLRRLVKELRTV----RNNLDLVVQERNDAQKQLSEEQDARILQDQI--LTSKQKELEMARKKMN---------SEIS 808
Cdd:TIGR00618  221 KQVLEKELKHLREAlqqtQQSHAYLTQKREAQEEQLKKQQLLKQLRARIeeLRAQEAVLEETQERINrarkaaplaAHIK 300
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  809 HRHQKEKDLFHEDCMLQEEIALL-----------RLEIDTIKNQNKQKEKKYFEDIEAVKEKNDNLQKIIKLNEETLTET 877
Cdd:TIGR00618  301 AVTQIEQQAQRIHTELQSKMRSRakllmkraahvKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQH 380
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  878 ILQYSGQLNNLTAENKILNSELENGKQNQERLEIEMESYRC---RLAAAVRDCDQSQTARDLKLDF-QRTRQ-------E 946
Cdd:TIGR00618  381 IHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDlqgQLAHAKKQQELQQRYAELCAAAiTCTAQceklekiH 460
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  947 WVRLHDKMKVDMSGLQAKNEILSEKLSNAESKINSLQIQLHNTRDALGRES-LILERVQRDLSQ-TQCQKKETEQMYQIE 1024
Cdd:TIGR00618  461 LQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIhPNPARQDIDNPGpLTRRMQRGEQTYAQL 540
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1025 QSKLKKYIAKQESVEERLSQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNLQAESEKQIlSLQEKNKELMD 1104
Cdd:TIGR00618  541 ETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAED-MLACEQHALLR 619
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1105 EYNHLKERMD------QCEKEKAGRKIDLTEAQETVPSR----CLHLDAENEVLQLQQTLFSMKAIQKQCETLQKNKKQL 1174
Cdd:TIGR00618  620 KLQPEQDLQDvrlhlqQCSQELALKLTALHALQLTLTQErvreHALSIRVLPKELLASRQLALQKMQSEKEQLTYWKEML 699
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1175 KQEVVNLKSYMErnmlergkaewhKLLIEERARKEIEEKLNEAILTLQKQAAVSHEQLVQLREDNTTSIKTQMELTIKDL 1254
Cdd:TIGR00618  700 AQCQTLLRELET------------HIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMHQARTVLKARTEAHFNNN 767
                          650       660       670       680
                   ....*....|....*....|....*....|....*....|....*....
gi 1833563309 1255 ESEISRIKTSQadfnktELERYKELYLEEVKVRESLSNELSRTNEMIAE 1303
Cdd:TIGR00618  768 EEVTAALQTGA------ELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQ 810
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
646-963 1.16e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 47.04  E-value: 1.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  646 EHLLKLKNnHCDQLTVKLKQMENMVSVLQNELS-------ETKKTKLQLELQKIEWEKELYDLRLALKQ----------- 707
Cdd:pfam15921  541 DHLRNVQT-ECEALKLQMAEKDKVIEILRQQIEnmtqlvgQHGRTAGAMQVEKAQLEKEINDRRLELQEfkilkdkkdak 619
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  708 -ENEEKRNADM------LYNKDSEQLRikeeecgKVVETKQQLKwnlrRLVKELRTVRNNL-----DLVVQERNDAQKQL 775
Cdd:pfam15921  620 iRELEARVSDLelekvkLVNAGSERLR-------AVKDIKQERD----QLLNEVKTSRNELnslseDYEVLKRNFRNKSE 688
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  776 SEEQDARILQDQiLTSKQKELEMARKKMNS-EISHRHQKEKDLfhedcMLQEEIALLRLEIDTIKNQnkqkekkyfedIE 854
Cdd:pfam15921  689 EMETTTNKLKMQ-LKSAQSELEQTRNTLKSmEGSDGHAMKVAM-----GMQKQITAKRGQIDALQSK-----------IQ 751
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  855 AVKEKNDNLQKiiklNEETLTETILQYSGQLNNLTAENKILNSELENGKQNQERLEIEMESYRCRLAAAVRDCDQSQTAr 934
Cdd:pfam15921  752 FLEEAMTNANK----EKHFLKEEKNKLSQELSTVATEKNKMAGELEVLRSQERRLKEKVANMEVALDKASLQFAECQDI- 826
                          330       340
                   ....*....|....*....|....*....
gi 1833563309  935 dlkldFQRTRQEWVRLHDKMKVDMSGLQA 963
Cdd:pfam15921  827 -----IQRQEQESVRLKLQHTLDVKELQG 850
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
962-1181 1.19e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 46.30  E-value: 1.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  962 QAKNEILSEKLSNAESKINSLQIQLHNTRDALGRESLILERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEER 1041
Cdd:COG4942     19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAE 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1042 LSQLQSEnmlLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNLQ------AESEKQILSLQEKNKELMDEYNHLKERMDQ 1115
Cdd:COG4942     99 LEAQKEE---LAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQylkylaPARREQAEELRADLAELAALRAELEAERAE 175
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1833563309 1116 CEKEKAGRKIDLTEAQETVPSRclhldaENEVLQLQQTLfsmKAIQKQCETLQKNKKQLKQEVVNL 1181
Cdd:COG4942    176 LEALLAELEEERAALEALKAER------QKLLARLEKEL---AELAAELAELQQEAEELEALIARL 232
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
730-1094 1.22e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 46.60  E-value: 1.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  730 EEECGKVVETKQQLKWNLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDARILQDQILTSKQKELEMARKKMNSEISH 809
Cdd:TIGR02169  169 DRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLAS 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  810 RHQKEKDLFHEDCMLQEEIALLRLEIDT----IKNQNKQKEKKYFEDIEAVKEKNDNLQKIIKLNE---ETLTETILQYS 882
Cdd:TIGR02169  249 LEEELEKLTEEISELEKRLEEIEQLLEElnkkIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKErelEDAEERLAKLE 328
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  883 GQLNNLTAENKILNSELENGKQNQERLEIEMESYRCRLAAAVRDcdqsqtARDLKLDFQRTRQEWVRLHdkmkvdmsglq 962
Cdd:TIGR02169  329 AEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAE------LEEVDKEFAETRDELKDYR----------- 391
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  963 akneilsEKLSNAESKINSLQiqlhNTRDALGRESLILERVQRDLSQTQCQKKEteqmyqieqsKLKKYIAKQESVEERL 1042
Cdd:TIGR02169  392 -------EKLEKLKREINELK----RELDRLQEELQRLSEELADLNAAIAGIEA----------KINELEEEKEDKALEI 450
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1833563309 1043 SQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNL-QAESEKQILS 1094
Cdd:TIGR02169  451 KKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELaEAEAQARASE 503
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
850-1320 2.17e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 46.12  E-value: 2.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  850 FEDIEAVKEKNDNLQKIIKLNEETLTETILQYSGQLNNLTAENKILNSELENGKQNQERLEIEMESYRCR------LAAA 923
Cdd:TIGR00618  186 FAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQlkkqqlLKQL 265
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  924 VRDCDQSQTA------RDLKLDFQR--------------TRQEWVRLHDKMKVDM----SGLQAKNEILSEKLSNAESK- 978
Cdd:TIGR00618  266 RARIEELRAQeavleeTQERINRARkaaplaahikavtqIEQQAQRIHTELQSKMrsraKLLMKRAAHVKQQSSIEEQRr 345
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  979 -INSLQIQLHNTRDALGRESLILERVQRDLSQTQCQKKETEQ---MYQIEQSKLKKYIAKQESVEERLSQLQSENMLlRQ 1054
Cdd:TIGR00618  346 lLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQQkttLTQKLQSLCKELDILQREQATIDTRTSAFRDL-QG 424
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1055 QLDDAHKKANSQEKTSStIQDQFhsAAKNLQAESEKQILsLQEKNKELMDEYNHLKERMDQCEKEKAGRKIDLTEAQETV 1134
Cdd:TIGR00618  425 QLAHAKKQQELQQRYAE-LCAAA--ITCTAQCEKLEKIH-LQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQ 500
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1135 PSRCLHLDAENEVLQLQQTLFSMKAIQKQCETLQKNKKQLKQEVVNLKSYM--ERNMLERGKAE-------WHKLLIEER 1205
Cdd:TIGR00618  501 EEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLtsERKQRASLKEQmqeiqqsFSILTQCDN 580
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1206 ARKEIEEKLNEAILTLQKQAAVSHEQLVQLREDNTTS-IKTQMELTIKDLESEISRIKTSQAdFNKTELERYKELYLEEv 1284
Cdd:TIGR00618  581 RSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALlRKLQPEQDLQDVRLHLQQCSQELA-LKLTALHALQLTLTQE- 658
                          490       500       510
                   ....*....|....*....|....*....|....*.
gi 1833563309 1285 KVRESLSneLSRTNEMIAEVSTQLTVEKEQTRSRSL 1320
Cdd:TIGR00618  659 RVREHAL--SIRVLPKELLASRQLALQKMQSEKEQL 692
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
662-1212 2.45e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 45.83  E-value: 2.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  662 KLKQMENMVSVLQNELSETKKTKLQLELQKIEWEKELYDLRLALKqENEEKRnadmlynKDSEQLRIKEEECGKVVETKQ 741
Cdd:PRK03918   232 ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIE-ELEEKV-------KELKELKEKAEEYIKLSEFYE 303
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  742 QLKWNLRRLVKELRTVR---NNLDLVVQERNDAQKQLSE--EQDARILQD-QILTSKQKELEMARKKMNSEISHRhQKEK 815
Cdd:PRK03918   304 EYLDELREIEKRLSRLEeeiNGIEERIKELEEKEERLEElkKKLKELEKRlEELEERHELYEEAKAKKEELERLK-KRLT 382
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  816 DLFHEDcmLQEEIALLRLEIDTIKNQNKQKEKKYFEDIEAVKEKNDNLQKIIK-----------LNEETLTETILQYSGQ 884
Cdd:PRK03918   383 GLTPEK--LEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKakgkcpvcgreLTEEHRKELLEEYTAE 460
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  885 LNNLTAENKILNSELENGKQNQERLEIEMESYRcRLAAAVRDCDQSQTARDL--KLDFQRTRQEWvRLHDKMKVDMSGLQ 962
Cdd:PRK03918   461 LKRIEKELKEIEEKERKLRKELRELEKVLKKES-ELIKLKELAEQLKELEEKlkKYNLEELEKKA-EEYEKLKEKLIKLK 538
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  963 AKNEILSEKLSNAESKINSLQIQLHNTRDALGRESLILERVQR----DLSQTQCQKKETEQMYQiEQSKLKKYIAKQESV 1038
Cdd:PRK03918   539 GEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEElgfeSVEELEERLKELEPFYN-EYLELKDAEKELERE 617
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1039 EERLSQLQSENMLLRQQLDDAHKKANsqektsstiqdqfhsaaknlqaESEKQILSLQEKNKElmDEYNHLKERMDQCEK 1118
Cdd:PRK03918   618 EKELKKLEEELDKAFEELAETEKRLE----------------------ELRKELEELEKKYSE--EEYEELREEYLELSR 673
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1119 EKAGRKIDLTEAQEtvpsrclhldaenevlQLQQTLFSMKAIQKQCETLQKNKKQLKqevvNLKSYMERNMLERGKAEWH 1198
Cdd:PRK03918   674 ELAGLRAELEELEK----------------RREEIKKTLEKLKEELEEREKAKKELE----KLEKALERVEELREKVKKY 733
                          570
                   ....*....|....
gi 1833563309 1199 KLLIEERARKEIEE 1212
Cdd:PRK03918   734 KALLKERALSKVGE 747
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
850-1131 2.48e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 45.83  E-value: 2.48e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  850 FEDIEAVK-----------EKNDNLQKIIKLNEETLTETILQ----------YSGQLNNLTAENKILNSELENGKQNQER 908
Cdd:TIGR02169  176 LEELEEVEenierldliidEKRQQLERLRREREKAERYQALLkekreyegyeLLKEKEALERQKEAIERQLASLEEELEK 255
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  909 LEIEMESYRCRLAAAVRDCDQ-SQTARDLKLDFQRTRQEWVRlhdKMKVDMSGLQAKNEILSEKLSNAESKINSLQIQLH 987
Cdd:TIGR02169  256 LTEEISELEKRLEEIEQLLEElNKKIKDLGEEEQLRVKEKIG---ELEAEIASLERSIAEKERELEDAEERLAKLEAEID 332
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  988 NTRDAlgresliLERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQQLDDAHKKANSQE 1067
Cdd:TIGR02169  333 KLLAE-------IEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELK 405
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1068 KTSSTIQD------------------------QFHSAAKNLQAE---SEKQILSLQEKNKELMDEYNHLKERMDQCEKE- 1119
Cdd:TIGR02169  406 RELDRLQEelqrlseeladlnaaiagieakinELEEEKEDKALEikkQEWKLEQLAADLSKYEQELYDLKEEYDRVEKEl 485
                          330
                   ....*....|...
gi 1833563309 1120 -KAGRKIDLTEAQ 1131
Cdd:TIGR02169  486 sKLQRELAEAEAQ 498
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1026-1320 3.53e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 45.11  E-value: 3.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1026 SKLKKYIAKQESVEERLSQLQSENMLLRQQLDDAHKKANSQEKtsstiQDQFHSAAKNLQAESEKQILSLQEKNKELMDE 1105
Cdd:pfam17380  272 NQLLHIVQHQKAVSERQQQEKFEKMEQERLRQEKEEKAREVER-----RRKLEEAEKARQAEMDRQAAIYAEQERMAMER 346
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1106 YNHLkERMDQCEKEKAGRKIDLTE-AQETVPSR---CLHLDAENEVLQLQQTLFSMKAIQKQCETLQKNKKQLKQEVVNL 1181
Cdd:pfam17380  347 EREL-ERIRQEERKRELERIRQEEiAMEISRMReleRLQMERQQKNERVRQELEAARKVKILEEERQRKIQQQKVEMEQI 425
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1182 ksymeRNMLERGKAEWHKLLIEERARKEIEEKLNEailtLQKQaavshEQLVQLREDNTTSIKTQMELTIKdlESEISRI 1261
Cdd:pfam17380  426 -----RAEQEEARQREVRRLEEERAREMERVRLEE----QERQ-----QQVERLRQQEEERKRKKLELEKE--KRDRKRA 489
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1833563309 1262 KTSQADFNKTELERYKELYLEEVKVRESLSNELSRTNEMIAEVSTQLTVEKEQTRSRSL 1320
Cdd:pfam17380  490 EEQRRKILEKELEERKQAMIEEERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEM 548
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
662-1255 4.57e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 44.72  E-value: 4.57e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  662 KLKQMENMVSVLQNELSETKKT---KL-QLELQKIEWEKELYDLRLALKQENEEKRNADMLYNKDSEQLRIKEEECGKVV 737
Cdd:pfam15921  318 QLSDLESTVSQLRSELREAKRMyedKIeELEKQLVLANSELTEARTERDQFSQESGNLDDQLQKLLADLHKREKELSLEK 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  738 ETKQQLkWNlrrlvkelRTVRNNLDLvvqerndaqKQLSEEQDARILQDQiltskqkELEMARKKMNSEishrhqkekdl 817
Cdd:pfam15921  398 EQNKRL-WD--------RDTGNSITI---------DHLRRELDDRNMEVQ-------RLEALLKAMKSE----------- 441
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  818 fhedCMLQEEiallrleidtiknqnkqkekkyfEDIEAVKEKNDNLQKIiklneETLTETILQYSGQLNNLTAENKILNS 897
Cdd:pfam15921  442 ----CQGQME-----------------------RQMAAIQGKNESLEKV-----SSLTAQLESTKEMLRKVVEELTAKKM 489
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  898 ELENGKQNQERLEIEMESYRCRLAAAVRDCDQSQTARDLKL-DFQRTRQEWVRLHDkmkvdmsgLQAKNEILSEKLSNAE 976
Cdd:pfam15921  490 TLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVDLKLqELQHLKNEGDHLRN--------VQTECEALKLQMAEKD 561
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  977 SKINSLQIQLHNTRDALGRE-----SLILERVQ--RDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVE---------- 1039
Cdd:pfam15921  562 KVIEILRQQIENMTQLVGQHgrtagAMQVEKAQleKEINDRRLELQEFKILKDKKDAKIRELEARVSDLElekvklvnag 641
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1040 -ERL---SQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNLQAESEKQILSLQEKNKELMDEYNHLKErMDQ 1115
Cdd:pfam15921  642 sERLravKDIKQERDQLLNEVKTSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQTRNTLKS-MEG 720
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1116 CEKEKAGRKIDLteaQETVPSRCLHLDAENEVLQLQQTlfSMKAIQKQCETLQKNKKQLKQEVVNLKSymERNMLergKA 1195
Cdd:pfam15921  721 SDGHAMKVAMGM---QKQITAKRGQIDALQSKIQFLEE--AMTNANKEKHFLKEEKNKLSQELSTVAT--EKNKM---AG 790
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1196 EWHKLLIEERARKEIEEKLNEAILTLQKQAAVSHEqLVQLREDNTTSIKTQMELTIKDLE 1255
Cdd:pfam15921  791 ELEVLRSQERRLKEKVANMEVALDKASLQFAECQD-IIQRQEQESVRLKLQHTLDVKELQ 849
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
968-1313 5.05e-04

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 44.12  E-value: 5.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  968 LSEKLSNAESKINSLQIQLHNTRDALGRESLILERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQS 1047
Cdd:COG4372     29 LSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQE 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1048 ENMLLRQQLDDAHKKANSQEKTSSTIQDQfhsaaknlQAESEKQILSLQEKNKELMDEYNHLKERMDQCEKEKAGRKIDL 1127
Cdd:COG4372    109 EAEELQEELEELQKERQDLEQQRKQLEAQ--------IAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAE 180
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1128 TEAQ--ETVPSRCLHLDAENEVLQLQQTLFSMKAIQKQCETLQKNKKQLKQEVVNLKSYMERNMLERG-KAEWHKLLIEE 1204
Cdd:COG4372    181 AEQAldELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKeELLEEVILKEI 260
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1205 RARKEIEEKLNEAILTLQKQAAVSHEQLVQLREDNTTSIKTQMELTIKDLESEISRIKTSQADFNKTELERYKELYLEEV 1284
Cdd:COG4372    261 EELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAELA 340
                          330       340
                   ....*....|....*....|....*....
gi 1833563309 1285 KVRESLSNELSRTNEMIAEVSTQLTVEKE 1313
Cdd:COG4372    341 DLLQLLLVGLLDNDVLELLSKGAEAGVAD 369
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
890-1279 5.86e-04

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 44.35  E-value: 5.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  890 AENKILNSELENGKQNQERLEIEMESYRCRLAAAvrdcdqsqtARDLKLDFQRTRQEWVRLHDKMKVDMSGLQAKNEILS 969
Cdd:pfam05557    2 AELIESKARLSQLQNEKKQMELEHKRARIELEKK---------ASALKRQLDRESDRNQELQKRIRLLEKREAEAEEALR 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  970 EKLSNAESKINSLQIqlhntrdalgresliLERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESveeRLSQLQSEN 1049
Cdd:pfam05557   73 EQAELNRLKKKYLEA---------------LNKKLNEKESQLADAREVISCLKNELSELRRQIQRAEL---ELQSTNSEL 134
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1050 MLLRQQLDDAHKKANSQEKTSSTIQDQFHS-AAKNLQAESEKQILSLQEKNKELMDEYNHLKERMDQCEKEKAgrkidlt 1128
Cdd:pfam05557  135 EELQERLDLLKAKASEAEQLRQNLEKQQSSlAEAEQRIKELEFEIQSQEQDSEIVKNSKSELARIPELEKELE------- 207
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1129 eaqetvpsrclHLDAENEVLQlqqtlfsmkaiqkqceTLQKNKKQLKQEVVNLKSYMERnmLERGKAEWHKLLIEeraRK 1208
Cdd:pfam05557  208 -----------RLREHNKHLN----------------ENIENKLLLKEEVEDLKRKLER--EEKYREEAATLELE---KE 255
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1209 EIEEKLNE-------AILTLQKQAAVShEQLVQLREDN---------TTSIKTQMELTIKDLESEISRIKTSQADFNKtE 1272
Cdd:pfam05557  256 KLEQELQSwvklaqdTGLNLRSPEDLS-RRIEQLQQREivlkeenssLTSSARQLEKARRELEQELAQYLKKIEDLNK-K 333

                   ....*..
gi 1833563309 1273 LERYKEL 1279
Cdd:pfam05557  334 LKRHKAL 340
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
819-1312 6.07e-04

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 44.24  E-value: 6.07e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  819 HEDCMLQEEIALLRLEIDTIKNQNKQKEKKYFEDIEAVKEKNDNLqKIIKLNEETLTETILQYSGQLNNLTAENKILNSE 898
Cdd:TIGR04523   33 TEEKQLEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKI-KILEQQIKDLNDKLKKNKDKINKLNSDLSKINSE 111
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  899 LENGKQNQERLEIEMESyrcrlaaavrdcdqsqtardlkldFQRTRQEWVRLHDKMKVDMSGLQAKNEILSEKLSNAESK 978
Cdd:TIGR04523  112 IKNDKEQKNKLEVELNK------------------------LEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQ 167
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  979 INSLQIQLHNTRDAlgresliLERVQRDLSQTQCQKKETEQMYqieqSKLKKYIAKQESVEERLSQLQSENMLLRQQLDD 1058
Cdd:TIGR04523  168 KEELENELNLLEKE-------KLNIQKNIDKIKNKLLKLELLL----SNLKKKIQKNKSLESQISELKKQNNQLKDNIEK 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1059 AHKKANSQEKTSSTIQDQFhsaaKNLQAESEKQILSLQEKNKELMDEYNHLKERMDQCEKEKAGRKiDLTEAQETVPSRC 1138
Cdd:TIGR04523  237 KQQEINEKTTEISNTQTQL----NQLKDEQNKIKKQLSEKQKELEQNNKKIKELEKQLNQLKSEIS-DLNNQKEQDWNKE 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1139 LHLDAENEVLQLQQtlfsmkaIQKQCETLQKNKKQLKQEVVNLKSymERNMLERGKAEwhkllieerARKEIEEKLNEaI 1218
Cdd:TIGR04523  312 LKSELKNQEKKLEE-------IQNQISQNNKIISQLNEQISQLKK--ELTNSESENSE---------KQRELEEKQNE-I 372
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1219 LTLQKQAAVSHEQLVQLrednttsiktqmELTIKDLESEISRIK--TSQADFNKTELERYKELYLEEVkvrESLSNELSR 1296
Cdd:TIGR04523  373 EKLKKENQSYKQEIKNL------------ESQINDLESKIQNQEklNQQKDEQIKKLQQEKELLEKEI---ERLKETIIK 437
                          490
                   ....*....|....*.
gi 1833563309 1297 TNEMIAEVSTQLTVEK 1312
Cdd:TIGR04523  438 NNSEIKDLTNQDSVKE 453
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
851-1072 1.93e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 42.44  E-value: 1.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  851 EDIEAVKEKNDNLQKIIKLNE----------ETLTETILQYSGQLNNLTAENKILNSELENGKQNQERLEIEMESYRCRL 920
Cdd:COG4942     27 AELEQLQQEIAELEKELAALKkeekallkqlAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEEL 106
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  921 AAAVRDCDQSQTARDLKLDF-QRTRQEWVRLHDKMKVDMSGLQAKNEILSEKLSNAESKINSLQIQLhntrdalgresli 999
Cdd:COG4942    107 AELLRALYRLGRQPPLALLLsPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAER------------- 173
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1833563309 1000 lERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQQLDDAHKKANSQEKTSST 1072
Cdd:COG4942    174 -AELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPA 245
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
662-910 2.16e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 42.75  E-value: 2.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  662 KLKQMENMVSVLQNELSETKKTKLQLELQKIEWEKELYDLRLALKQENEEKRNADmlynkDSEQLRIK------------ 729
Cdd:TIGR02169  231 EKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLG-----EEEQLRVKekigeleaeias 305
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  730 ----------------------EEECGKVVETKQQLKWNLRRLVKELRTVRNNLDLVVQERNDAQKQLSE-EQDARILQD 786
Cdd:TIGR02169  306 lersiaekereledaeerlaklEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEvDKEFAETRD 385
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  787 QiLTSKQKELEMARKKMNSeishrHQKEKD-LFHEDCMLQEEIALLRLEIDTIKNQNKQKEKKYFEDIEAVKEKNDNLQK 865
Cdd:TIGR02169  386 E-LKDYREKLEKLKREINE-----LKRELDrLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ 459
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1833563309  866 IIKLNEEtltetilqYSGQLNNLTAENKILNSELENGKQNQERLE 910
Cdd:TIGR02169  460 LAADLSK--------YEQELYDLKEEYDRVEKELSKLQRELAEAE 496
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
646-1321 2.16e-03

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 42.73  E-value: 2.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  646 EHLLKLKNNHCDQLTVKLKQMENMV-SVLQNELSETKKTKLQLELQKIEWEKELYDLRLALKQEN----------EEKRN 714
Cdd:TIGR01612 1135 EEIKKKSENYIDEIKAQINDLEDVAdKAISNDDPEEIEKKIENIVTKIDKKKNIYDEIKKLLNEIaeiekdktslEEVKG 1214
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  715 ADMLYNKDSEQLRIKeeecgKVVETKQQLKwnlrRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDARILQD--QILTSK 792
Cdd:TIGR01612 1215 INLSYGKNLGKLFLE-----KIDEEKKKSE----HMIKAMEAYIEDLDEIKEKSPEIENEMGIEMDIKAEMEtfNISHDD 1285
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  793 QKELEMARKKMNSEISHRHQKEKDLFhEDCMLQEEIALLRLEIDTiknqNKQKEKKYFEDIEAVKEKNDNLQKIIKLNE- 871
Cdd:TIGR01612 1286 DKDHHIISKKHDENISDIREKSLKII-EDFSEESDINDIKKELQK----NLLDAQKHNSDINLYLNEIANIYNILKLNKi 1360
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  872 ETLTETILQYSgqlNNLTAENKILNSELENGKQNQERLE--IEMESYRCRLAAAVRDCDQSQTARDLKLDFQRTRQEWVR 949
Cdd:TIGR01612 1361 KKIIDEVKEYT---KEIEENNKNIKDELDKSEKLIKKIKddINLEECKSKIESTLDDKDIDECIKKIKELKNHILSEESN 1437
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  950 LHDKMKvdmsGLQAKNEILSEKLSNAESKINSLQIQLHNTRD-ALGRESLILERVQRDLSQTQCQKKETEQ-MYQIEQSK 1027
Cdd:TIGR01612 1438 IDTYFK----NADENNENVLLLFKNIEMADNKSQHILKIKKDnATNDHDFNINELKEHIDKSKGCKDEADKnAKAIEKNK 1513
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1028 LKKYIAKQE-----------SVEERLSQLQSENMLLRQQLDDAHKK-----ANSQEKTSSTIQDQFH---SAAKNlqAES 1088
Cdd:TIGR01612 1514 ELFEQYKKDvtellnkysalAIKNKFAKTKKDSEIIIKEIKDAHKKfileaEKSEQKIKEIKKEKFRiedDAAKN--DKS 1591
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1089 EKQILSLQEKNKELMDEY---NHLKERMDQCEKEKagrkidlteaqETVPSRCLHLDAENEVLQLQQTLFSMKAIQKQCE 1165
Cdd:TIGR01612 1592 NKAAIDIQLSLENFENKFlkiSDIKKKINDCLKET-----------ESIEKKISSFSIDSQDTELKENGDNLNSLQEFLE 1660
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1166 TLQKNKKQL---KQEVVNLKSYME--RNMLERGKAEWHKLLIEE-----RARKEIEEKLNEAILTLQKQAAVS------- 1228
Cdd:TIGR01612 1661 SLKDQKKNIedkKKELDELDSEIEkiEIDVDQHKKNYEIGIIEKikeiaIANKEEIESIKELIEPTIENLISSfntndle 1740
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1229 ------------------HEQLVQLREDNTTSIKTQMELTIKDLESEISRIkTSQADFNKT-ELERYKELYLEEVKVRE- 1288
Cdd:TIGR01612 1741 gidpnekleeynteigdiYEEFIELYNIIAGCLETVSKEPITYDEIKNTRI-NAQNEFLKIiEIEKKSKSYLDDIEAKEf 1819
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*....
gi 1833563309 1289 ----------------SLSNELSRTNEMIAEVSTQLTVEKEQTRSRSLF 1321
Cdd:TIGR01612 1820 driinhfkkkldhvndKFTKEYSKINEGFDDISKSIENVKNSTDENLLF 1868
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
649-1068 2.70e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 42.06  E-value: 2.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  649 LKLKNNHCDQLTVKLKQMENMVSVLQNELSEtkktkLQLELQKIEWEKELYDLRLALKQENEEKRNADMLYnkdsEQLRI 728
Cdd:COG4717     83 AEEKEEEYAELQEELEELEEELEELEAELEE-----LREELEKLEKLLQLLPLYQELEALEAELAELPERL----EELEE 153
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  729 KEEECGKVVETKQQLKWNLRRLVKELRTVRNNLDLVVQER-NDAQKQLSEEQDARILQDQILTSKQKELEMARKKMNS-E 806
Cdd:COG4717    154 RLEELRELEEELEELEAELAELQEELEELLEQLSLATEEElQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQlE 233
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  807 ISHRHQKEKDLFHEDCMLQEEIALLrLEIDTIKNQNKQKEKKYFEDIEAVKEkndnLQKIIKLNEETLTETILQYSGQLN 886
Cdd:COG4717    234 NELEAAALEERLKEARLLLLIAAAL-LALLGLGGSLLSLILTIAGVLFLVLG----LLALLFLLLAREKASLGKEAEELQ 308
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  887 NLTAENKILNSELENGKQNQ---ERLEIEMESYRCRLAAAVRDCDQSQTARDLKLDFQRTRQEWVRLHDKMKVDMSGLQA 963
Cdd:COG4717    309 ALPALEELEEEELEELLAALglpPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAEAGVEDEEELR 388
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  964 KNEILSEKLSNAESKINSLQIQLHNTRDALGRESLIL--ERVQRDLSQTQCQKKETEQmyqieqsKLKKYIAKQESVEER 1041
Cdd:COG4717    389 AALEQAEEYQELKEELEELEEQLEELLGELEELLEALdeEELEEELEELEEELEELEE-------ELEELREELAELEAE 461
                          410       420
                   ....*....|....*....|....*....
gi 1833563309 1042 LSQLQSENML--LRQQLDDAHKKANSQEK 1068
Cdd:COG4717    462 LEQLEEDGELaeLLQELEELKAELRELAE 490
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
867-1211 3.74e-03

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 41.65  E-value: 3.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  867 IKLNEETLTETilQYSGQLNNLTAENKILNSELENGK---QNQERLEIEMESyRCRLAAAVRDCDQSQTARDLKLDFQRT 943
Cdd:pfam17380  258 VRYNGQTMTEN--EFLNQLLHIVQHQKAVSERQQQEKfekMEQERLRQEKEE-KAREVERRRKLEEAEKARQAEMDRQAA 334
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  944 ---RQEWVRLHDKMKVDMSGLQAKN----EILSEKLSNAESKINSLQ-IQLHNTR-DALGRESLILERVQRDLSQTQcQK 1014
Cdd:pfam17380  335 iyaEQERMAMERERELERIRQEERKreleRIRQEEIAMEISRMRELErLQMERQQkNERVRQELEAARKVKILEEER-QR 413
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1015 KETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQQlddahkKANSQEKTSSTIQDQFHSAAKNLQAESEKQILS 1094
Cdd:pfam17380  414 KIQQQKVEMEQIRAEQEEARQREVRRLEEERAREMERVRLE------EQERQQQVERLRQQEEERKRKKLELEKEKRDRK 487
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1095 LQEKNKELMDEyNHLKERMDQCEKEKAGRKI---DLTEAQETVPSRCLHLDAENEvLQLQQTLFSMKAIQKQCETLQKNK 1171
Cdd:pfam17380  488 RAEEQRRKILE-KELEERKQAMIEEERKRKLlekEMEERQKAIYEEERRREAEEE-RRKQQEMEERRRIQEQMRKATEER 565
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|
gi 1833563309 1172 KQLKQevvnlksymernmLERGKAEWHKLLIEERARKEIE 1211
Cdd:pfam17380  566 SRLEA-------------MEREREMMRQIVESEKARAEYE 592
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
754-1308 3.84e-03

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 41.95  E-value: 3.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  754 LRTVRNNLDLVvqernDAQKQLSEEQDarilqdqiLTSKQKELEMARKKMNSEISH----RHQKEKDLFHEDCML----- 824
Cdd:PRK02224   182 LSDQRGSLDQL-----KAQIEEKEEKD--------LHERLNGLESELAELDEEIERyeeqREQARETRDEADEVLeehee 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  825 -QEEIALLRLEIDTIKNQNKQKEKKYFEDIEAVKEKNDNLQKIIKLNEETLTETILQ------YSGQLNNLTAENKILNS 897
Cdd:PRK02224   249 rREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDdadaeaVEARREELEDRDEELRD 328
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  898 ELENGKQNQERLEIEMESYRCRLA-----------AAVRDCDQSQTARDlKLDFQRTRQEWVR--LHDKMK------VDM 958
Cdd:PRK02224   329 RLEECRVAAQAHNEEAESLREDADdleeraeelreEAAELESELEEARE-AVEDRREEIEELEeeIEELRErfgdapVDL 407
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  959 SGLQAKNEILSEKLSNAESKINSLQIQLHNTRDAlgresliLERVQRDLSQTQCqkKETEQmyQIEQS----KLKKYIAK 1034
Cdd:PRK02224   408 GNAEDFLEELREERDELREREAELEATLRTARER-------VEEAEALLEAGKC--PECGQ--PVEGSphveTIEEDRER 476
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1035 QESVEERLSQLQSENMLLRQQLDDAhKKANSQEKTSSTIQDQfHSAAKNLQAESEKQILSLQEKNKELMDEYNHLKERMD 1114
Cdd:PRK02224   477 VEELEAELEDLEEEVEEVEERLERA-EDLVEAEDRIERLEER-REDLEELIAERRETIEEKRERAEELRERAAELEAEAE 554
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1115 QCEKEKAGRKIDLTEAQETVPsrclhlDAENEVLQLQQTLFSMKAIQKQCETLQKNKKQLKQEVVNLKSYMERNMLERGK 1194
Cdd:PRK02224   555 EKREAAAEAEEEAEEAREEVA------ELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKREALAELNDERRER 628
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1195 aewhklLIEERARK-EIEEKLNEA-ILTLQKQAAVSHEQLVQLRE--DNTTSIKTQMELTIKDLESEISRIKTSQadfnk 1270
Cdd:PRK02224   629 ------LAEKRERKrELEAEFDEArIEEAREDKERAEEYLEQVEEklDELREERDDLQAEIGAVENELEELEELR----- 697
                          570       580       590
                   ....*....|....*....|....*....|....*...
gi 1833563309 1271 telERYKELYlEEVKVRESLSNELSRTNEMIAEVSTQL 1308
Cdd:PRK02224   698 ---ERREALE-NRVEALEALYDEAEELESMYGDLRAEL 731
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
659-996 4.33e-03

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 41.64  E-value: 4.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  659 LTVKLKQMENMVSVLQNELSETKkTKLQLELQKIEwekelydlrlALKQENEEKRNAdmlyNKDSEQLRIKEEECGKVVE 738
Cdd:pfam15921  501 LTASLQEKERAIEATNAEITKLR-SRVDLKLQELQ----------HLKNEGDHLRNV----QTECEALKLQMAEKDKVIE 565
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  739 TKQQLKWNLRRLVKEL-RTVRNNLDLVVQERNDAQKQLSEEQDARILQDQI------LTSKQKELEMARKKMNSEISHRH 811
Cdd:pfam15921  566 ILRQQIENMTQLVGQHgRTAGAMQVEKAQLEKEINDRRLELQEFKILKDKKdakireLEARVSDLELEKVKLVNAGSERL 645
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  812 QKEKDLFHEDCMLQEEIALLRLEIDTIKNQNKQKEKKYFEDIEAVKEKNDNLQKIIKLNEETLTET-------------- 877
Cdd:pfam15921  646 RAVKDIKQERDQLLNEVKTSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQTrntlksmegsdgha 725
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  878 ----------ILQYSGQLNNLTAENKILNSELENGKQNQERLEIEMESYRCRLAAAVRDcdqsqtardlkldfqrtrqew 947
Cdd:pfam15921  726 mkvamgmqkqITAKRGQIDALQSKIQFLEEAMTNANKEKHFLKEEKNKLSQELSTVATE--------------------- 784
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*....
gi 1833563309  948 vrlHDKMKVDMSGLQAKNEILSEKLSNAESKINSLQIQLHNTRDALGRE 996
Cdd:pfam15921  785 ---KNKMAGELEVLRSQERRLKEKVANMEVALDKASLQFAECQDIIQRQ 830
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
657-835 4.35e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 41.29  E-value: 4.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  657 DQLTVKLKQMENMVSVLQNELSETKKTKLQLELQKIEWEKELYDLRLALKQENeekRNADMLYNKDSEQLRIKEEECGKV 736
Cdd:COG4942     58 AALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAELL---RALYRLGRQPPLALLLSPEDFLDA 134
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  737 VETKQQLKW---NLRRLVKELRTVRNNLDLVVQERNDAQKQLSEEQDARILQDQILTSKQKELEMARKKMNSEISHRHQK 813
Cdd:COG4942    135 VRRLQYLKYlapARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAE 214
                          170       180
                   ....*....|....*....|..
gi 1833563309  814 EKDLFHEDCMLQEEIALLRLEI 835
Cdd:COG4942    215 LAELQQEAEELEALIARLEAEA 236
CCDC14 pfam15254
Coiled-coil domain-containing protein 14; This protein family, Coiled-coil domain-containing ...
1042-1152 4.68e-03

Coiled-coil domain-containing protein 14; This protein family, Coiled-coil domain-containing protein 14 (CCDC14) is a domain of unknown function. This family of proteins is found in eukaryotes. Proteins in this family are typically between 301 and 912 amino acids in length.


Pssm-ID: 464594  Cd Length: 857  Bit Score: 41.32  E-value: 4.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1042 LSQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQ--------FHSAAKNLQA---ESEKQILSLQEKNKELM------- 1103
Cdd:pfam15254  389 LQPLRSENAQLRRQLRILNQQLREQEKTEKTSGSGdcnlelfsLQSLNMSLQNqlqESLKSQELLQSKNEELLkvienqk 468
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1833563309 1104 DEYNH-----------LKERMDQCEKEKAGRKIDLTEAQETVPSRCLHLDA---ENEVL--QLQQ 1152
Cdd:pfam15254  469 EENKKltkifkekeqtLLENKQQFDIETTRVKIELEEALVNMKSFQFKLEAaekENQILgiTLRQ 533
PRK12704 PRK12704
phosphodiesterase; Provisional
961-1095 5.69e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 40.92  E-value: 5.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  961 LQAKNEILSEK------LSNAESKINSLQIQLHNTRDALGRESLILERVQRDLSQTQCQKKETEQMYQIEQSKLKKYIAK 1034
Cdd:PRK12704    60 LEAKEEIHKLRnefekeLRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEE 139
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1833563309 1035 QESVEERLSQLQSE---NMLLRQQlddahkKANSQEKTSSTIQdQFHSAAKnLQAESE-KQILSL 1095
Cdd:PRK12704   140 QLQELERISGLTAEeakEILLEKV------EEEARHEAAVLIK-EIEEEAK-EEADKKaKEILAQ 196
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
665-1191 6.07e-03

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 41.26  E-value: 6.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  665 QMENMVSVLQNelsetkKTKLQLELQKIEWEKELYDLRLALKQENEEKRNADMLYNKDSEQLRIKEEEC----GKVVETK 740
Cdd:pfam15921  246 QLEALKSESQN------KIELLLQQHQDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQArnqnSMYMRQL 319
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  741 QQLKWNLRRLVKELRTVRNNLDLVVQErndAQKQL----SEEQDARILQDQI------LTSKQKELEMARKKMNSEISHR 810
Cdd:pfam15921  320 SDLESTVSQLRSELREAKRMYEDKIEE---LEKQLvlanSELTEARTERDQFsqesgnLDDQLQKLLADLHKREKELSLE 396
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  811 HQKEKDLFHEDCMLQEEIALLRLEIDtiknqnkqkekkyfedieavkEKNDNLQKIIKLNEETLTETILQYSGQLNNLTA 890
Cdd:pfam15921  397 KEQNKRLWDRDTGNSITIDHLRRELD---------------------DRNMEVQRLEALLKAMKSECQGQMERQMAAIQG 455
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  891 ENKIL------NSELENGKQNQERLEIEMESYRCRLAAAVRD-CDQSQTARDLKLDFQRTRQEWVRLHDKMKVDMSGLQ- 962
Cdd:pfam15921  456 KNESLekvsslTAQLESTKEMLRKVVEELTAKKMTLESSERTvSDLTASLQEKERAIEATNAEITKLRSRVDLKLQELQh 535
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  963 AKNEilSEKLSNAESKINSLQIQLHNTRDALGreslILERVQRDLSQTQCQKKETEQMYQIEQSKLKKYI---------- 1032
Cdd:pfam15921  536 LKNE--GDHLRNVQTECEALKLQMAEKDKVIE----ILRQQIENMTQLVGQHGRTAGAMQVEKAQLEKEIndrrlelqef 609
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1033 --------AKQESVEERLSQLQSENMLL-----------------RQQLDDAHKKA----NSQEKTSSTIQDQFHSAAKN 1083
Cdd:pfam15921  610 kilkdkkdAKIRELEARVSDLELEKVKLvnagserlravkdikqeRDQLLNEVKTSrnelNSLSEDYEVLKRNFRNKSEE 689
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1084 LQAESEKQILSLQEKNKELMDEYNHLKER-------------MDQCEKEKAGR------KIDLTEAQETVPSRCLHLDAE 1144
Cdd:pfam15921  690 METTTNKLKMQLKSAQSELEQTRNTLKSMegsdghamkvamgMQKQITAKRGQidalqsKIQFLEEAMTNANKEKHFLKE 769
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1833563309 1145 nEVLQLQQTLfSMKAIQK-----QCETLQKNKKQLKQEVVNLKSYMERNMLE 1191
Cdd:pfam15921  770 -EKNKLSQEL-STVATEKnkmagELEVLRSQERRLKEKVANMEVALDKASLQ 819
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
854-1101 6.10e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 40.58  E-value: 6.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  854 EAVKEKNDNLQKIIKLNEEtltetILQYSGQLNNLTAENKILNSELENGKQNQERLEIEMESYRCRLAAAVRDcdqsqta 933
Cdd:COG3883     27 ELQAELEAAQAELDALQAE-----LEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARA------- 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309  934 rdlkldfqrtrqewvrlhdkMKVDMSGLQAKNEIL-SEKLSNAESKINSLQIQLHNTRDalgreslILERVQRDLSQTQC 1012
Cdd:COG3883     95 --------------------LYRSGGSVSYLDVLLgSESFSDFLDRLSALSKIADADAD-------LLEELKADKAELEA 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1013 QKKETEQMYQIEQSKLKKYIAKQESVEERLSQLQSENMLLRQQLDDAHKKANSQEKTSSTIQDQFHSAAKNLQAESEKQI 1092
Cdd:COG3883    148 KKAELEAKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAA 227

                   ....*....
gi 1833563309 1093 LSLQEKNKE 1101
Cdd:COG3883    228 AAAAAAAAA 236
PRK11281 PRK11281
mechanosensitive channel MscK;
1140-1309 7.10e-03

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 41.05  E-value: 7.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1140 HLDAENEVLQ--LQQTLFSMKAIQKQCETLQKNKKQLKQEVVNLKSYMERnmLERGKAEWHKLLIEERAR---KEIEEKL 1214
Cdd:PRK11281    53 LLEAEDKLVQqdLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAE--LEALKDDNDEETRETLSTlslRQLESRL 130
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1215 NE---AILTLQKQAAVSHEQLVQLRednTTSIKTQMELT-----IKDLESEISRIKTSQADFNKTELERYK-ELYLEEvk 1285
Cdd:PRK11281   131 AQtldQLQNAQNDLAEYNSQLVSLQ---TQPERAQAALYansqrLQQIRNLLKGGKVGGKALRPSQRVLLQaEQALLN-- 205
                          170       180
                   ....*....|....*....|....
gi 1833563309 1286 vresLSNELSRTNemiAEVSTQLT 1309
Cdd:PRK11281   206 ----AQNDLQRKS---LEGNTQLQ 222
PRK12704 PRK12704
phosphodiesterase; Provisional
1043-1220 7.74e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 40.53  E-value: 7.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1043 SQLQSENMLLRQQLDDAHKKANSQEKTS-STIQDQFHSAAKNLQAESEKQILSLQEKNKELMDEYNHLKERMDQCEKEKa 1121
Cdd:PRK12704    31 AKIKEAEEEAKRILEEAKKEAEAIKKEAlLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKRE- 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1122 gRKIDlteaqetvpsrclhlDAENEVLQLQQTLfsmKAIQKQCETLQKNKKQLKQEVVNL-----KSYMERNMLERGKAE 1196
Cdd:PRK12704   110 -EELE---------------KKEKELEQKQQEL---EKKEEELEELIEEQLQELERISGLtaeeaKEILLEKVEEEARHE 170
                          170       180
                   ....*....|....*....|....*.
gi 1833563309 1197 WHKLL--IEERARKEIEEKLNEAILT 1220
Cdd:PRK12704   171 AAVLIkeIEEEAKEEADKKAKEILAQ 196
PRK12704 PRK12704
phosphodiesterase; Provisional
1088-1257 8.22e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 40.53  E-value: 8.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1088 SEKQILSLQEKNKELMDEYnhlKERMDQCEKEKagrkidLTEAQETVPSRCLHLDAE-----NEVLQLQQTLfsmkaIQK 1162
Cdd:PRK12704    29 AEAKIKEAEEEAKRILEEA---KKEAEAIKKEA------LLEAKEEIHKLRNEFEKElrerrNELQKLEKRL-----LQK 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1833563309 1163 QcETLQKNKKQLKQEvvnlksymERNMLERGKAEWHKLLIEERARKEIEEKLNEAILTLQKQAAVSHEQ-----LVQLRE 1237
Cdd:PRK12704    95 E-ENLDRKLELLEKR--------EEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEakeilLEKVEE 165
                          170       180
                   ....*....|....*....|
gi 1833563309 1238 DnttsIKTQMELTIKDLESE 1257
Cdd:PRK12704   166 E----ARHEAAVLIKEIEEE 181
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH