NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907082233|ref|XP_036012699|]
View 

unconventional myosin-XVIIIa isoform X14 [Mus musculus]

Protein Classification

kinesin/myosin motor domain-containing protein; myosin heavy chain( domain architecture ID 13414207)

kinesin/myosin motor domain-containing protein may have ATPase activity and function as a molecular motor, such as kinesins and myosins| myosin heavy chain of class II myosin (or conventional myosin), which contains two heavy chains made up of the motor/head (N-terminal) and coiled-coil tail (C-terminal) domains; the head ATPase activity and functions as a molecular motor, utilizing ATP hydrolysis to generate directed movement toward the plus end along actin filaments

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
MYSc_Myo18 cd01386
class XVIII myosin, motor domain; Many members of this class contain a N-terminal PDZ domain ...
394-1151 0e+00

class XVIII myosin, motor domain; Many members of this class contain a N-terminal PDZ domain which is commonly found in proteins establishing molecular complexes. The motor domain itself does not exhibit ATPase activity, suggesting that it functions as an actin tether protein. It also has two IQ domains that probably bind light chains or related calmodulins and a C-terminal tail with two sections of coiled-coil domains, which are thought to mediate homodimerization. The function of these myosins are largely unknown. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


:

Pssm-ID: 276837 [Multi-domain]  Cd Length: 689  Bit Score: 1131.26  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd01386      1 SSVLHTLRQRYGANLIHTYAGPSLIVINPRHPLAVYSEKVAKMFKGCRREDMPPHIYASAQSAYRAMLMSRRDQSIVLLG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGTSGtKVFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTM 553
Cdd:cd01386     81 RSGSGKTTNCRHILEYLVTAAGSVG-GVLSVEKLNAALTVLEAFGNVRTALNGNATRFSQLFSLDFDQAGQLASASIQTL 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  554 LLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPEE 633
Cdd:cd01386    160 LLERSRVARRPEGESNFNVFYYLLAGADAALRTELHLNQLAESNSFGIVPLQKPEDKQKAAAAFSKLQAAMKTLGISEEE 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  634 QKTCWLILASIYHLGAAGATKEPleeqdeaaEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGTLQRSTSFRQ 713
Cdd:cd01386    240 QRAIWSILAAIYHLGAAGATKAA--------SAGRKQFARPEWAQRAAYLLGCTLEELSSAIFKHHLSGGPQQSTTSSGQ 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  714 GPEESGLGEGTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGGSARGASFEELCHN 793
Cdd:cd01386    312 ESPARSSSGGPKLTGVEALEGFAAGLYSELFAAVVSLINRSLSSSHHSTSSITIVDTPGFQNPAHSGSQRGATFEDLCHN 391
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  794 YAQDRLQRLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASH--LVRSLAHADEARGLLWLLEEEALVPGAT 871
Cdd:cd01386    392 YAQERLQLLFHERTFVAPLERYKQENVEVDFDLPELSPGALVALIDQAPQqaLVRSDLRDEDRRGLLWLLDEEALYPGSS 471
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  872 EDALLDRLFSYYGPQEGDkKGQSPLLRSSKPRHFLLGHSHGTNWVEYNVAGWLNYTKQNPATQNAPRLLQDSQKKiisnl 951
Cdd:cd01386    472 DDTFLERLFSHYGDKEGG-KGHSLLRRSEGPLQFVLGHLLGTNPVEYDVSGWLKAAKENPSAQNATQLLQESQKE----- 545
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  952 flgragsatvlsgsiagleggsqlalrratsmrktfttgMAAVKKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEG 1031
Cdd:cd01386    546 ---------------------------------------TAAVKRKSPCLQIKFQVDALIDTLRRTGLHFVHCLLPQHNA 586
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1032 WPGEPRsassrrvsssseldlppGDPCEAGLLQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLT 1111
Cdd:cd01386    587 GKDERS-----------------TSSPAAGDELLDVPLLRSQLRGSQLLDALRLYRQGFPDHMPLGEFRRRFQVLAPPLT 649
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|
gi 1907082233 1112 KKHGRNYIVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd01386    650 KKLGLNSEVADERKAVEELLEELDLEKSSYRIGLSQVFFR 689
COG5022 super family cl34868
Myosin heavy chain [General function prediction only];
356-1877 3.16e-109

Myosin heavy chain [General function prediction only];


The actual alignment was detected with superfamily member COG5022:

Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 384.81  E-value: 3.16e-109
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  356 KLDHDGAILDVDEDDIEKANAPSCDRLEDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMH 435
Cdd:COG5022     42 KEDGESVSVKKKVLGNDRIKLPKFDGVDDLTELSYLNEPAVLHNLEKRYNNGQIYTYSGLVLIAVNPYRDLGIYTDDIIQ 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  436 MFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKwQALST--L 513
Cdd:COG5022    122 SYSGKNRLELEPHVFAIAEEAYRNLLSEKENQTIIISGESGAGKTENAKRIMQYLASVTSSSTVEISSIEK-QILATnpI 200
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  514 LEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL--- 590
Cdd:COG5022    201 LEAFGNAKTVRNDNSSRFGKYIKIEFDENGEICGAKIETYLLEKSRVVHQNKNERNYHIFYQLLAGDPEELKKLLLLqnp 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  591 ---NHLAENNVFGIVPLSKPEEkqkaaqqFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAgatkEPLEEQDEAAeag 667
Cdd:COG5022    281 kdyIYLSQGGCDKIDGIDDAKE-------FKITLDALKTIGIDEEEQDQIFKILAAILHIGNI----EFKEDRNGAA--- 346
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  668 rkQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGG--TLQRSTSFRQgpeesglgegtklsALECLEGMASGLYSELFT 745
Cdd:COG5022    347 --IFSDNSVLDKACYLLGIDPSLFVKWLVKRQIKTGgeWIVVPLNLEQ--------------ALAIRDSLAKALYSNLFD 410
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  746 LLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---L 822
Cdd:COG5022    411 WIVDRINKSLDHSAAASNFIGVLDIYGFEIFEKN------SFEQLCINYTNEKLQQFFNQHMFKLEQEEYVKEGIEwsfI 484
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  823 AFDDLEPVADDSvaavdQASHLVRSLAHADEArgllwlleeeALVPGATEDALLDRLFSYYgPQEGDKKGQSPLLRSSKp 902
Cdd:COG5022    485 DYFDNQPCIDLI-----EKKNPLGILSLLDEE----------CVMPHATDESFTSKLAQRL-NKNSNPKFKKSRFRDNK- 547
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  903 rhFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKKIISNLFLGRAgsatvlsgsiaglEGGSQLALRRATS 982
Cdd:COG5022    548 --FVVKHYAGD--VEYDVEGFLDKNK-DPLNDDLLELLKASTNEFVSTLFDDEE-------------NIESKGRFPTLGS 609
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  983 MrktfttgmaavkkkslciqIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagl 1062
Cdd:COG5022    610 R-------------------FKESLNSLMSTLNSTQPHYIRCIKPNEEKSPW---------------------------- 642
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1063 lQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHlTKKHGRNYIVVDEKRAVEELLESLDLEKSSCC 1142
Cdd:COG5022    643 -TFDNQMVLSQLRCCGVLETIRISRAGFPSRWTFDEFVQRYRILSPS-KSWTGEYTWKEDTKNAVKSILEELVIDSSKYQ 720
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1143 LGLSRVFFRAGTLARLEEQRDEQTSRHLTLFQAACRGYLARQHF-KKRKIQDlAIRCVQKNIKKNKGVKDWPWWKLFTTV 1221
Cdd:COG5022    721 IGNTKVFFKAGVLAALEDMRDAKLDNIATRIQRAIRGRYLRRRYlQALKRIK-KIQVIQHGFRLRRLVDYELKWRLFIKL 799
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1222 RPLIQVQLSEEQIRNKDEEIQQLRSKLEKV----EKERNELRLSSDRLETRISEltSELTDERNTGESASQLLDAETAER 1297
Cdd:COG5022    800 QPLLSLLGSRKEYRSYLACIIKLQKTIKREkklrETEEVEFSLKAEVLIQKFGR--SLKAKKRFSLLKKETIYLQSAQRV 877
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1298 LRTEKEMKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDF----TKKRLQQELED 1373
Cdd:COG5022    878 ELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLeegpSIEYVKLPELN 957
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1374 KMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRrfdsELSQAHEETQ 1453
Cdd:COG5022    958 KLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPV----EVAELQSASK 1033
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1454 REKLQREKLQREKDMLLAEAFSLKQQME-EKDLdiagftqKVVSLEAELQDIssqESKDEASLAKVKKQLRDLEAK---V 1529
Cdd:COG5022   1034 IISSESTELSILKPLQKLKGLLLLENNQlQARY-------KALKLRRENSLL---DDKQLYQLESTENLLKTINVKdleV 1103
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1530 KDQEEELDEQAGSIQMLEQAKLRLEMEMERM--RQTHSKEMESRDEEVEEARQSCQKKLKQMEvqlEEEYEDKQKALREK 1607
Cdd:COG5022   1104 TNRNLVKPANVLQFIVAQMIKLNLLQEISKFlsQLVNTLEPVFQKLSVLQLELDGLFWEANLE---ALPSPPPFAALSEK 1180
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1608 RELESKL----STLSDQVNQ---RDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLknqleESEFTCAAA 1680
Cdd:COG5022   1181 RLYQSALydekSKLSSSEVNdlkNELIALFSKIFSGWPRGDKLKKLISEGWVPTEYSTSLKGFNNL-----NKKFDTPAS 1255
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1681 VKarkameveMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLeeDQEDMNEL-MKKHKAAVAQASRDmaqmnDLQA 1759
Cdd:COG5022   1256 MS--------NEKLLSLLNSIDNLLSSYKLEEEVLPATINSLLQYI--NVGLFNALrTKASSLRWKSATEV-----NYNS 1320
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1760 QIEESNKEKQELQEklqaLQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKT-QVKRLENlasrlketmeklteeRD 1838
Cdd:COG5022   1321 EELDDWCREFEISD----VDEELEELIQAVKVLQLLKDDLNKLDELLDACYSLNPaEIQNLKS---------------RY 1381
                         1530      1540      1550      1560
                   ....*....|....*....|....*....|....*....|
gi 1907082233 1839 QRAAAENR-EKEQNKRLQRQLRDTKEEMSELARKEAEASR 1877
Cdd:COG5022   1382 DPADKENNlPKEILKKIEALLIKQELQLSLEGKDETEVHL 1421
PHA03307 super family cl33723
transcriptional regulator ICP4; Provisional
22-305 2.53e-05

transcriptional regulator ICP4; Provisional


The actual alignment was detected with superfamily member PHA03307:

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 49.78  E-value: 2.53e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   22 PKPGPPGKEEGAPEASSKDGSAPPRSP--QPATSPVPSETSQTAKSPEPTltmnglGAASAEGPNEEAQGLSRKRVANAV 99
Cdd:PHA03307   106 PTPPGPSSPDPPPPTPPPASPPPSPAPdlSEMLRPVGSPGPPPAASPPAA------GASPAAVASDAASSRQAALPLSSP 179
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  100 RKVV-SKVLPGEEPGTAKEPPGRGARSPEH--PARGKKGEKAATSPKPPPPPPPPPAPPKPETK-KEAAKDELSVGLRSL 175
Cdd:PHA03307   180 EETArAPSSPPAEPPPSTPPAAASPRPPRRssPISASASSPAPAPGRSAADDAGASSSDSSSSEsSGCGWGPENECPLPR 259
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  176 MSRGRGKDHKSRGKQPPGKGEKAPSQEPASTGKSGSPDMvdSPKKAGSPAKPETPNKRCSPAPAQE------LADPNLAG 249
Cdd:PHA03307   260 PAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSP--SPSSPGSGPAPSSPRASSSSSSSREssssstSSSSESSR 337
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082233  250 PKLNPSGKQQAKSPASDvqqeTGADPEVRLSPAEEAHQRLERIFTASLDPEAASPA 305
Cdd:PHA03307   338 GAAVSPGPSPSRSPSPS----RPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRR 389
PRK03918 super family cl35229
DNA double-strand break repair ATPase Rad50;
1802-1967 1.63e-04

DNA double-strand break repair ATPase Rad50;


The actual alignment was detected with superfamily member PRK03918:

Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 46.98  E-value: 1.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1802 IRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLqRQLRDTKEEMSELARKEAEASRKKHE 1881
Cdd:PRK03918   171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREEL-EKLEKEVKELEELKEEIEELEKELES 249
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1882 LEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEDLINSLQDMVTKYQKKKNKLEGDSdvdSELEDRVDGV 1961
Cdd:PRK03918   250 LEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRL---SRLEEEINGI 326

                   ....*.
gi 1907082233 1962 KSWLSK 1967
Cdd:PRK03918   327 EERIKE 332
 
Name Accession Description Interval E-value
MYSc_Myo18 cd01386
class XVIII myosin, motor domain; Many members of this class contain a N-terminal PDZ domain ...
394-1151 0e+00

class XVIII myosin, motor domain; Many members of this class contain a N-terminal PDZ domain which is commonly found in proteins establishing molecular complexes. The motor domain itself does not exhibit ATPase activity, suggesting that it functions as an actin tether protein. It also has two IQ domains that probably bind light chains or related calmodulins and a C-terminal tail with two sections of coiled-coil domains, which are thought to mediate homodimerization. The function of these myosins are largely unknown. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276837 [Multi-domain]  Cd Length: 689  Bit Score: 1131.26  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd01386      1 SSVLHTLRQRYGANLIHTYAGPSLIVINPRHPLAVYSEKVAKMFKGCRREDMPPHIYASAQSAYRAMLMSRRDQSIVLLG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGTSGtKVFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTM 553
Cdd:cd01386     81 RSGSGKTTNCRHILEYLVTAAGSVG-GVLSVEKLNAALTVLEAFGNVRTALNGNATRFSQLFSLDFDQAGQLASASIQTL 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  554 LLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPEE 633
Cdd:cd01386    160 LLERSRVARRPEGESNFNVFYYLLAGADAALRTELHLNQLAESNSFGIVPLQKPEDKQKAAAAFSKLQAAMKTLGISEEE 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  634 QKTCWLILASIYHLGAAGATKEPleeqdeaaEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGTLQRSTSFRQ 713
Cdd:cd01386    240 QRAIWSILAAIYHLGAAGATKAA--------SAGRKQFARPEWAQRAAYLLGCTLEELSSAIFKHHLSGGPQQSTTSSGQ 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  714 GPEESGLGEGTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGGSARGASFEELCHN 793
Cdd:cd01386    312 ESPARSSSGGPKLTGVEALEGFAAGLYSELFAAVVSLINRSLSSSHHSTSSITIVDTPGFQNPAHSGSQRGATFEDLCHN 391
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  794 YAQDRLQRLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASH--LVRSLAHADEARGLLWLLEEEALVPGAT 871
Cdd:cd01386    392 YAQERLQLLFHERTFVAPLERYKQENVEVDFDLPELSPGALVALIDQAPQqaLVRSDLRDEDRRGLLWLLDEEALYPGSS 471
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  872 EDALLDRLFSYYGPQEGDkKGQSPLLRSSKPRHFLLGHSHGTNWVEYNVAGWLNYTKQNPATQNAPRLLQDSQKKiisnl 951
Cdd:cd01386    472 DDTFLERLFSHYGDKEGG-KGHSLLRRSEGPLQFVLGHLLGTNPVEYDVSGWLKAAKENPSAQNATQLLQESQKE----- 545
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  952 flgragsatvlsgsiagleggsqlalrratsmrktfttgMAAVKKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEG 1031
Cdd:cd01386    546 ---------------------------------------TAAVKRKSPCLQIKFQVDALIDTLRRTGLHFVHCLLPQHNA 586
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1032 WPGEPRsassrrvsssseldlppGDPCEAGLLQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLT 1111
Cdd:cd01386    587 GKDERS-----------------TSSPAAGDELLDVPLLRSQLRGSQLLDALRLYRQGFPDHMPLGEFRRRFQVLAPPLT 649
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|
gi 1907082233 1112 KKHGRNYIVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd01386    650 KKLGLNSEVADERKAVEELLEELDLEKSSYRIGLSQVFFR 689
MYSc smart00242
Myosin. Large ATPases; ATPase; molecular motor. Muscle contraction consists of a cyclical ...
375-1163 2.32e-113

Myosin. Large ATPases; ATPase; molecular motor. Muscle contraction consists of a cyclical interaction between myosin and actin. The core of the myosin structure is similar in fold to that of kinesin.


Pssm-ID: 214580 [Multi-domain]  Cd Length: 677  Bit Score: 377.27  E-value: 2.32e-113
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   375 NAPSCDRLEDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQ 454
Cdd:smart00242    1 NPPKFEGVEDLVLLTYLNEPAVLHNLKKRYLKDLIYTYIGLVLVAVNPYKQLPIYTDEVIKKYRGKSRGELPPHVFAIAD 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   455 TAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGtSGTKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFS 532
Cdd:smart00242   81 NAYRNMLNDKENQSIIISGESGAGKTENTKKIMQYLASVSG-SNTEVGSVED-QILESnpILEAFGNAKTLRNNNSSRFG 158
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   533 QILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHlAENNVFgivpLSKPEEKQK 612
Cdd:smart00242  159 KFIEIHFDAKGKIIGAKIETYLLEKSRVVSQAKGERNYHIFYQLLAGASEELKKELGLKS-PEDYRY----LNQGGCLTV 233
                           250       260       270       280       290       300       310       320
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   613 A----AQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAagatkepLEEQDEAAEAGRKQFARHEWAQKAAYLLGCSL 688
Cdd:smart00242  234 DgiddAEEFKETLNAMRVLGFSEEEQESIFKILAAILHLGN-------IEFEEGRNDNAASTVKDKEELSNAAELLGVDP 306
                           330       340       350       360       370       380       390       400
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   689 EELSSAIfkhqlkggtLQRSTSFRQGPEESGLgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIV 768
Cdd:smart00242  307 EELEKAL---------TKRKIKTGGEVITKPL---NVEQALDARDALAKALYSRLFDWLVKRINQSLSFKDGSTYFIGVL 374
                           410       420       430       440       450       460       470       480
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   769 DTPGFQ----NpewggsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADdsvaavdqa 841
Cdd:smart00242  375 DIYGFEifevN----------SFEQLCINYANEKLQQFFNQHVFKLEQEEYEREGIDwtfIDFFDNQDCID--------- 435
                           490       500       510       520       530       540       550       560
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   842 shLVRS-----LAHADEArgllwlleeeALVPGATEDALLDRLFSYYgpqegdkkGQSPLLRSSKPR---HFLLGHSHGT 913
Cdd:smart00242  436 --LIEKkppgiLSLLDEE----------CRFPKGTDQTFLEKLNQHH--------KKHPHFSKPKKKgrtEFIIKHYAGD 495
                           570       580       590       600       610       620       630       640
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   914 nwVEYNVAGWLnytKQN--PATQNAPRLLQDSQKKIISNLFlgragsatvlsGSIAGleggsqlalrRATSMRKTFTTGM 991
Cdd:smart00242  496 --VTYDVTGFL---EKNkdTLSDDLIELLQSSKNPLIASLF-----------PSGVS----------NAGSKKRFQTVGS 549
                           650       660       670       680       690       700       710       720
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   992 aavkkkslciQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLR 1071
Cdd:smart00242  550 ----------QFKEQLNELMDTLNSTNPHFIRCIKPNEEKKPG-----------------------------DFDSSLVL 590
                           730       740       750       760       770       780       790       800
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  1072 AQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHlTKKHGRNyivvDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:smart00242  591 HQLRYLGVLENIRIRRAGFPYRLPFDEFLQRYRVLLPD-TWPPWGG----DAKKACEALLQSLGLDEDEYQLGKTKVFLR 665
                           810
                    ....*....|..
gi 1907082233  1152 AGTLARLEEQRD 1163
Cdd:smart00242  666 PGQLAELEELRE 677
COG5022 COG5022
Myosin heavy chain [General function prediction only];
356-1877 3.16e-109

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 384.81  E-value: 3.16e-109
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  356 KLDHDGAILDVDEDDIEKANAPSCDRLEDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMH 435
Cdd:COG5022     42 KEDGESVSVKKKVLGNDRIKLPKFDGVDDLTELSYLNEPAVLHNLEKRYNNGQIYTYSGLVLIAVNPYRDLGIYTDDIIQ 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  436 MFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKwQALST--L 513
Cdd:COG5022    122 SYSGKNRLELEPHVFAIAEEAYRNLLSEKENQTIIISGESGAGKTENAKRIMQYLASVTSSSTVEISSIEK-QILATnpI 200
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  514 LEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL--- 590
Cdd:COG5022    201 LEAFGNAKTVRNDNSSRFGKYIKIEFDENGEICGAKIETYLLEKSRVVHQNKNERNYHIFYQLLAGDPEELKKLLLLqnp 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  591 ---NHLAENNVFGIVPLSKPEEkqkaaqqFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAgatkEPLEEQDEAAeag 667
Cdd:COG5022    281 kdyIYLSQGGCDKIDGIDDAKE-------FKITLDALKTIGIDEEEQDQIFKILAAILHIGNI----EFKEDRNGAA--- 346
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  668 rkQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGG--TLQRSTSFRQgpeesglgegtklsALECLEGMASGLYSELFT 745
Cdd:COG5022    347 --IFSDNSVLDKACYLLGIDPSLFVKWLVKRQIKTGgeWIVVPLNLEQ--------------ALAIRDSLAKALYSNLFD 410
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  746 LLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---L 822
Cdd:COG5022    411 WIVDRINKSLDHSAAASNFIGVLDIYGFEIFEKN------SFEQLCINYTNEKLQQFFNQHMFKLEQEEYVKEGIEwsfI 484
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  823 AFDDLEPVADDSvaavdQASHLVRSLAHADEArgllwlleeeALVPGATEDALLDRLFSYYgPQEGDKKGQSPLLRSSKp 902
Cdd:COG5022    485 DYFDNQPCIDLI-----EKKNPLGILSLLDEE----------CVMPHATDESFTSKLAQRL-NKNSNPKFKKSRFRDNK- 547
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  903 rhFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKKIISNLFLGRAgsatvlsgsiaglEGGSQLALRRATS 982
Cdd:COG5022    548 --FVVKHYAGD--VEYDVEGFLDKNK-DPLNDDLLELLKASTNEFVSTLFDDEE-------------NIESKGRFPTLGS 609
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  983 MrktfttgmaavkkkslciqIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagl 1062
Cdd:COG5022    610 R-------------------FKESLNSLMSTLNSTQPHYIRCIKPNEEKSPW---------------------------- 642
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1063 lQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHlTKKHGRNYIVVDEKRAVEELLESLDLEKSSCC 1142
Cdd:COG5022    643 -TFDNQMVLSQLRCCGVLETIRISRAGFPSRWTFDEFVQRYRILSPS-KSWTGEYTWKEDTKNAVKSILEELVIDSSKYQ 720
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1143 LGLSRVFFRAGTLARLEEQRDEQTSRHLTLFQAACRGYLARQHF-KKRKIQDlAIRCVQKNIKKNKGVKDWPWWKLFTTV 1221
Cdd:COG5022    721 IGNTKVFFKAGVLAALEDMRDAKLDNIATRIQRAIRGRYLRRRYlQALKRIK-KIQVIQHGFRLRRLVDYELKWRLFIKL 799
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1222 RPLIQVQLSEEQIRNKDEEIQQLRSKLEKV----EKERNELRLSSDRLETRISEltSELTDERNTGESASQLLDAETAER 1297
Cdd:COG5022    800 QPLLSLLGSRKEYRSYLACIIKLQKTIKREkklrETEEVEFSLKAEVLIQKFGR--SLKAKKRFSLLKKETIYLQSAQRV 877
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1298 LRTEKEMKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDF----TKKRLQQELED 1373
Cdd:COG5022    878 ELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLeegpSIEYVKLPELN 957
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1374 KMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRrfdsELSQAHEETQ 1453
Cdd:COG5022    958 KLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPV----EVAELQSASK 1033
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1454 REKLQREKLQREKDMLLAEAFSLKQQME-EKDLdiagftqKVVSLEAELQDIssqESKDEASLAKVKKQLRDLEAK---V 1529
Cdd:COG5022   1034 IISSESTELSILKPLQKLKGLLLLENNQlQARY-------KALKLRRENSLL---DDKQLYQLESTENLLKTINVKdleV 1103
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1530 KDQEEELDEQAGSIQMLEQAKLRLEMEMERM--RQTHSKEMESRDEEVEEARQSCQKKLKQMEvqlEEEYEDKQKALREK 1607
Cdd:COG5022   1104 TNRNLVKPANVLQFIVAQMIKLNLLQEISKFlsQLVNTLEPVFQKLSVLQLELDGLFWEANLE---ALPSPPPFAALSEK 1180
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1608 RELESKL----STLSDQVNQ---RDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLknqleESEFTCAAA 1680
Cdd:COG5022   1181 RLYQSALydekSKLSSSEVNdlkNELIALFSKIFSGWPRGDKLKKLISEGWVPTEYSTSLKGFNNL-----NKKFDTPAS 1255
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1681 VKarkameveMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLeeDQEDMNEL-MKKHKAAVAQASRDmaqmnDLQA 1759
Cdd:COG5022   1256 MS--------NEKLLSLLNSIDNLLSSYKLEEEVLPATINSLLQYI--NVGLFNALrTKASSLRWKSATEV-----NYNS 1320
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1760 QIEESNKEKQELQEklqaLQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKT-QVKRLENlasrlketmeklteeRD 1838
Cdd:COG5022   1321 EELDDWCREFEISD----VDEELEELIQAVKVLQLLKDDLNKLDELLDACYSLNPaEIQNLKS---------------RY 1381
                         1530      1540      1550      1560
                   ....*....|....*....|....*....|....*....|
gi 1907082233 1839 QRAAAENR-EKEQNKRLQRQLRDTKEEMSELARKEAEASR 1877
Cdd:COG5022   1382 DPADKENNlPKEILKKIEALLIKQELQLSLEGKDETEVHL 1421
Myosin_head pfam00063
Myosin head (motor domain);
383-1151 2.89e-108

Myosin head (motor domain);


Pssm-ID: 395017 [Multi-domain]  Cd Length: 674  Bit Score: 362.37  E-value: 2.89e-108
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  383 EDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLM 462
Cdd:pfam00063    2 EDMVELSYLNEPSVLHNLKKRYKSDLIYTYSGLVLVAVNPYKQLPIYSEDMIKAYRGKRRGELPPHIFAIADEAYRSMLQ 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  463 SRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGT-SGTKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDF 539
Cdd:pfam00063   82 DKENQSILISGESGAGKTENTKKIMQYLASVSGSgSAGNVGRLEE-QILQSnpILEAFGNAKTVRNNNSSRFGKYIEIQF 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  540 DQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL---------NHLAENNVFGIvplskpeek 610
Cdd:pfam00063  161 DAKGDIVGGKIETYLLEKSRVVYQAEGERNYHIFYQLLAGASAQLKKELRLtnpkdyhylSQSGCYTIDGI--------- 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  611 qKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEaaeagrkqFARHEWAQKAAYLLGCSLEE 690
Cdd:pfam00063  232 -DDSEEFKITDKAMDILGFSDEEQMGIFRIVAAILHLGNIEFKKERNDEQAV--------PDDTENLQKAASLLGIDSTE 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  691 LSSAIFKHQLKGGTLQRSTSFrqgpeesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVD 769
Cdd:pfam00063  303 LEKALCKRRIKTGRETVSKPQ------------NVEQANYARDALAKAIYSRLFDWLVDRINKSLDVKTIEKASFIgVLD 370
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  770 TPGFQNPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADdsvaavdqashLVR 846
Cdd:pfam00063  371 IYGFEIFEKN------SFEQLCINYVNEKLQQFFNHHMFKLEQEEYVREGIEwtfIDFGDNQPCID-----------LIE 433
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  847 S-----LAHADEArgllwlleeeALVPGATEDALLDRLFSYYgpqegdkkGQSPLLRSSKPR---HFLLGHSHGTnwVEY 918
Cdd:pfam00063  434 KkplgiLSLLDEE----------CLFPKATDQTFLDKLYSTF--------SKHPHFQKPRLQgetHFIIKHYAGD--VEY 493
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  919 NVAGWLNYTKqNPATQNAPRLLQDSQKKIISNLFLGRAgsatvLSGSIAGLEGGSQLALRRATSMRKtfTTGMaavkkks 998
Cdd:pfam00063  494 NVEGFLEKNK-DPLNDDLVSLLKSSSDPLLAELFPDYE-----TAESAAANESGKSTPKRTKKKRFI--TVGS------- 558
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  999 lciQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSR 1078
Cdd:pfam00063  559 ---QFKESLGELMKTLNSTNPHYIRCIKPNEKKRAG-----------------------------VFDNSLVLHQLRCNG 606
                          730       740       750       760       770       780       790
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082233 1079 LLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLTKKhgrnyIVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:pfam00063  607 VLEGIRIRRAGFPNRITFQEFVQRYRILAPKTWPK-----WKGDAKKGCEAILQSLNLDKEEYQFGKTKIFFR 674
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1202-1896 1.58e-42

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 171.51  E-value: 1.58e-42
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1202 NIKKNKGVKDWPWWKLFTTVRPLIQVQLSEEQIRNKDE-EIQQLRSKLEKVEKERNELrlssdrlETRISELTSELtder 1280
Cdd:pfam01576  374 NLEKAKQALESENAELQAELRTLQQAKQDSEHKRKKLEgQLQELQARLSESERQRAEL-------AEKLSKLQSEL---- 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1281 ntgESASQLLDAETAERLRTEKEMKELQTQ-YDALKKQMEVMEMEVMEARLIRA--AEINGEVDDDDAGGEWRLKYERAV 1357
Cdd:pfam01576  443 ---ESVSSLLNEAEGKNIKLSKDVSSLESQlQDTQELLQEETRQKLNLSTRLRQleDERNSLQEQLEEEEEAKRNVERQL 519
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1358 REVDF----TKKRLQQELEdKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHE 1433
Cdd:pfam01576  520 STLQAqlsdMKKKLEEDAG-TLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSN 598
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1434 LEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSqeSKDEA 1513
Cdd:pfam01576  599 LEKKQKKFDQMLAEEKAISARYAEERDRAEAEAREKETRALSLARALEEALEAKEELERTNKQLRAEMEDLVS--SKDDV 676
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1514 --SLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEV 1591
Cdd:pfam01576  677 gkNVHELERSKRALEQQVEEMKTQLEELEDELQATEDAKLRLEVNMQALKAQFERDLQARDEQGEEKRRQLVKQVRELEA 756
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1592 QLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLrKDLKRTKALLADAQIMLDHLKNN-----APSKREIAQL 1666
Cdd:pfam01576  757 ELEDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAV-KQLKKLQAQMKDLQRELEEARASrdeilAQSKESEKKL 835
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1667 KN------QLEESeftCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKH 1740
Cdd:pfam01576  836 KNleaellQLQED---LAASERARRQAQQERDELADEIASGASGKSALQDEKRRLEARIAQLEEELEEEQSNTELLNDRL 912
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1741 KAAVAQ---------ASRDMAQMND-LQAQIEESNKekqELQEKLQALQSQVEFLEqsmvdKSLVSRQEAKIRELETRLE 1810
Cdd:pfam01576  913 RKSTLQveqlttelaAERSTSQKSEsARQQLERQNK---ELKAKLQEMEGTVKSKF-----KSSIAALEAKIAQLEEQLE 984
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1811 FEKTQ----VKRLENLASRLKETMEKLTEER---DQRAAAENREKEQNKRLQRQLRDTKEEMSelarkEAEASRKKheLE 1883
Cdd:pfam01576  985 QESRErqaaNKLVRRTEKKLKEVLLQVEDERrhaDQYKDQAEKGNSRMKQLKRQLEEAEEEAS-----RANAARRK--LQ 1057
                          730
                   ....*....|...
gi 1907082233 1884 MDLESLEAANQSL 1896
Cdd:pfam01576 1058 RELDDATESNESM 1070
PTZ00014 PTZ00014
myosin-A; Provisional
329-1207 1.87e-36

myosin-A; Provisional


Pssm-ID: 240229 [Multi-domain]  Cd Length: 821  Bit Score: 150.57  E-value: 1.87e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  329 LVHRDgfSLASQLKSEELSLPEGKarvkldhdgaILDVDEDDIEKANAP-SCDRLEDLASLVYLNESSVLHTLRQRYGAS 407
Cdd:PTZ00014    56 LVLPG--STGEKLTLKQIDPPTNS----------TFEVKPEHAFNANSQiDPMTYGDIGLLPHTNIPCVLDFLKHRYLKN 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  408 LLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRR-EDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHL 486
Cdd:PTZ00014   124 QIYTTADPLLVAINPFKDLGNTTNDWIRRYRDAKDsDKLPPHVFTTARRALENLHGVKKSQTIIVSGESGAGKTEATKQI 203
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  487 VQYLAtiAGTSGTKVFSVEK--WQAlSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTMLLEKLRVARRP 564
Cdd:PTZ00014   204 MRYFA--SSKSGNMDLKIQNaiMAA-NPVLEAFGNAKTIRNNNSSRFGRFMQLQLGEEGGIRYGSIVAFLLEKSRVVTQE 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  565 ASEATFNVFYYLLACGDATLRTELHLNHLAENNVfgIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASI 644
Cdd:PTZ00014   281 DDERSYHIFYQLLKGANDEMKEKYKLKSLEEYKY--INPKCLDVPGIDDVKDFEEVMESFDSMGLSESQIEDIFSILSGV 358
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  645 YHLGAAGATKEPLEEQDEAAEAGRKQFarhEWAQKAAYLLGCSLEELS-SAIFKHQLKGGTLQRStsfRQGPEESglgEG 723
Cdd:PTZ00014   359 LLLGNVEIEGKEEGGLTDAAAISDESL---EVFNEACELLFLDYESLKkELTVKVTYAGNQKIEG---PWSKDES---EM 429
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  724 TKLSaleclegMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQ----NpewggsargaSFEELCHNYAQDRL 799
Cdd:PTZ00014   430 LKDS-------LSKAVYEKLFLWIIRNLNATIEPPGGFKVFIGMLDIFGFEvfknN----------SLEQLFINITNEML 492
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  800 QRLFHERTFLQELERYKEDNI---ELAFDDLEPVAD------DSVAAV--DQashlvrSLAhadeargllwlleeealvP 868
Cdd:PTZ00014   493 QKNFVDIVFERESKLYKDEGIsteELEYTSNESVIDllcgkgKSVLSIleDQ------CLA------------------P 548
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  869 GATEDALLDRLFSYYGPQEGDKKGqspllRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKKII 948
Cdd:PTZ00014   549 GGTDEKFVSSCNTNLKNNPKYKPA-----KVDSNKNFVIKHTIGD--IQYCASGFLFKNK-DVLRPELVEVVKASPNPLV 620
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  949 SNLFLGragsATVLSGSIA-GLEGGSQLalrratsMRktfttgmaavkkkslciqiklQVDALIDTIKRSKMHFVHCFLP 1027
Cdd:PTZ00014   621 RDLFEG----VEVEKGKLAkGQLIGSQF-------LN---------------------QLDSLMSLINSTEPHFIRCIKP 668
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1028 vaegwpgeprsassrrvssssELDLPPGDPCEAGLLqldvsllrAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLA 1107
Cdd:PTZ00014   669 ---------------------NENKKPLDWNSSKVL--------IQLHSLSILEALQLRQLGFSYRRTFAEFLSQFKYLD 719
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1108 PHLTKKHGrnyivVDEKRAVEELLESLDLEKSSCCLGLSRVFFR---AGTLARLEEQRDEQTSRHLTLFQAACRGYLARQ 1184
Cdd:PTZ00014   720 LAVSNDSS-----LDPKEKAEKLLERSGLPKDSYAIGKTMVFLKkdaAKELTQIQREKLAAWEPLVSVLEALILKIKKKR 794
                          890       900
                   ....*....|....*....|...
gi 1907082233 1185 HFKKRkIQDLAIrcVQKNIKKNK 1207
Cdd:PTZ00014   795 KVRKN-IKSLVR--IQAHLRRHL 814
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1241-1959 1.13e-31

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 136.35  E-value: 1.13e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1241 IQQLRSKLEKVEKERNELRLSSDRLETRISELTSE---LTDERNTGESASQLL----DAETAERLR----TEKEMKELQT 1309
Cdd:TIGR02169  165 VAEFDRKKEKALEELEEVEENIERLDLIIDEKRQQlerLRREREKAERYQALLkekrEYEGYELLKekeaLERQKEAIER 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1310 QYDALKKQMEVMEMEVM------EARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTK-KRLQQELEDKMEVEQQSR 1382
Cdd:TIGR02169  245 QLASLEEELEKLTEEISelekrlEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASlERSIAEKERELEDAEERL 324
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1383 RQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKL 1462
Cdd:TIGR02169  325 AKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINEL 404
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1463 QREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDE-------ASLAKVKKQLRDLEAKVKDQEEE 1535
Cdd:TIGR02169  405 KRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEwkleqlaADLSKYEQELYDLKEEYDRVEKE 484
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1536 LDEQAGSIQMLEqAKLRLEMEMERMRQTHSKEMESRDEEV---------------------------------EEARQSC 1582
Cdd:TIGR02169  485 LSKLQRELAEAE-AQARASEERVRGGRAVEEVLKASIQGVhgtvaqlgsvgeryataievaagnrlnnvvvedDAVAKEA 563
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1583 QKKLKQME----------------------------------VQLEEEYEDKQK----------ALREKREL--ESKLST 1616
Cdd:TIGR02169  564 IELLKRRKagratflplnkmrderrdlsilsedgvigfavdlVEFDPKYEPAFKyvfgdtlvveDIEAARRLmgKYRMVT 643
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1617 L-------------------SDQVNQR-DFESEKRLRKDLKRTKALLADAQIMLDHLKNNA--------PSKREIAQLKN 1668
Cdd:TIGR02169  644 LegelfeksgamtggsraprGGILFSRsEPAELQRLRERLEGLKRELSSLQSELRRIENRLdelsqelsDASRKIGEIEK 723
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1669 QLEESEFTcAAAVKAR--------KAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEI-----QNRLEEDQEDMNE 1735
Cdd:TIGR02169  724 EIEQLEQE-EEKLKERleeleedlSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLearlsHSRIPEIQAELSK 802
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1736 LMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSmvdkslVSRQEAKIRELETRLE----F 1811
Cdd:TIGR02169  803 LEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKE------IENLNGKKEELEEELEeleaA 876
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1812 EKTQVKRLENLAS---RLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHElEMDLES 1888
Cdd:TIGR02169  877 LRDLESRLGDLKKerdELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEE-ELSLED 955
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1889 LEAANQSLQADLklafKRIGDLQAAIEDEMEsdENEDLINSLQDmvtkyqkKKNKLEGDSDvdsELEDRVD 1959
Cdd:TIGR02169  956 VQAELQRVEEEI----RALEPVNMLAIQEYE--EVLKRLDELKE-------KRAKLEEERK---AILERIE 1010
PTZ00121 PTZ00121
MAEBL; Provisional
1229-1924 8.78e-25

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 114.08  E-value: 8.78e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1229 LSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRL--ETRISELTSELTDERNTgESASQLLDAETAERLRTEKEMKE 1306
Cdd:PTZ00121  1099 KAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKaeDARKAEEARKAEDAKRV-EIARKAEDARKAEEARKAEDAKK 1177
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1307 LQTQYDALKKQMEVMEMEVMEARLIRAAEingevddddaggewRLKYERAVREVdftkkrlqQELEDKMEVEQqSRRQLE 1386
Cdd:PTZ00121  1178 AEAARKAEEVRKAEELRKAEDARKAEAAR--------------KAEEERKAEEA--------RKAEDAKKAEA-VKKAEE 1234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1387 RRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKK--QRRFDSELSQAHEETQREKLQREKLQR 1464
Cdd:PTZ00121  1235 AKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKaeEKKKADEAKKAEEKKKADEAKKKAEEA 1314
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1465 EKdmllAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLE-AKVKDQEEELDEQAGSI 1543
Cdd:PTZ00121  1315 KK----ADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKkEEAKKKADAAKKKAEEK 1390
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1544 QMLEQAKLRLEMEMERMRQTHSKEMESRdeEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQ 1623
Cdd:PTZ00121  1391 KKADEAKKKAEEDKKKADELKKAAAAKK--KADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEE 1468
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1624 RDFESEKRLRKDLKR----TKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCA-AAVKARKAMEVEMEDLHLQI 1698
Cdd:PTZ00121  1469 AKKADEAKKKAEEAKkadeAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAkKAEEAKKADEAKKAEEKKKA 1548
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1699 DDIAKA---KTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKL 1775
Cdd:PTZ00121  1549 DELKKAeelKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKA 1628
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1776 QALQSQVEFLEQSMVDKslvSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQ 1855
Cdd:PTZ00121  1629 EEEKKKVEQLKKKEAEE---KKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1856 R-------------QLRDTKEE----MSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEM 1918
Cdd:PTZ00121  1706 ElkkkeaeekkkaeELKKAEEEnkikAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEEL 1785

                   ....*.
gi 1907082233 1919 ESDENE 1924
Cdd:PTZ00121  1786 DEEDEK 1791
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
22-305 2.53e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 49.78  E-value: 2.53e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   22 PKPGPPGKEEGAPEASSKDGSAPPRSP--QPATSPVPSETSQTAKSPEPTltmnglGAASAEGPNEEAQGLSRKRVANAV 99
Cdd:PHA03307   106 PTPPGPSSPDPPPPTPPPASPPPSPAPdlSEMLRPVGSPGPPPAASPPAA------GASPAAVASDAASSRQAALPLSSP 179
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  100 RKVV-SKVLPGEEPGTAKEPPGRGARSPEH--PARGKKGEKAATSPKPPPPPPPPPAPPKPETK-KEAAKDELSVGLRSL 175
Cdd:PHA03307   180 EETArAPSSPPAEPPPSTPPAAASPRPPRRssPISASASSPAPAPGRSAADDAGASSSDSSSSEsSGCGWGPENECPLPR 259
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  176 MSRGRGKDHKSRGKQPPGKGEKAPSQEPASTGKSGSPDMvdSPKKAGSPAKPETPNKRCSPAPAQE------LADPNLAG 249
Cdd:PHA03307   260 PAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSP--SPSSPGSGPAPSSPRASSSSSSSREssssstSSSSESSR 337
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082233  250 PKLNPSGKQQAKSPASDvqqeTGADPEVRLSPAEEAHQRLERIFTASLDPEAASPA 305
Cdd:PHA03307   338 GAAVSPGPSPSRSPSPS----RPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRR 389
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1802-1967 1.63e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 46.98  E-value: 1.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1802 IRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLqRQLRDTKEEMSELARKEAEASRKKHE 1881
Cdd:PRK03918   171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREEL-EKLEKEVKELEELKEEIEELEKELES 249
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1882 LEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEDLINSLQDMVTKYQKKKNKLEGDSdvdSELEDRVDGV 1961
Cdd:PRK03918   250 LEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRL---SRLEEEINGI 326

                   ....*.
gi 1907082233 1962 KSWLSK 1967
Cdd:PRK03918   327 EERIKE 332
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
1607-1781 3.24e-04

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 45.39  E-value: 3.24e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  1607 KRELESKLSTLSdqvnqrdfESEKRLRKDLKRTKALLADAQIMLDHLKnnapskREIAQLKNQleeseftcaaavkarka 1686
Cdd:smart00787  146 KEGLDENLEGLK--------EDYKLLMKELELLNSIKPKLRDRKDALE------EELRQLKQL----------------- 194
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  1687 mEVEMEDLHLQIDDIAKAKtaleeqLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEE--- 1763
Cdd:smart00787  195 -EDELEDCDPTELDRAKEK------LKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQcrg 267
                           170
                    ....*....|....*....
gi 1907082233  1764 -SNKEKQELQEKLQALQSQ 1781
Cdd:smart00787  268 fTFKEIEKLKEQLKLLQSL 286
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
1733-1949 6.49e-04

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 44.28  E-value: 6.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1733 MNELMKKHKAAVAQASRDMAqMNDLQAQIEESNKEKQELQEKLQALQSQV-EFLEQSMVDKSlvsrqeaKIRELETRLEf 1811
Cdd:cd22656     96 ILELIDDLADATDDEELEEA-KKTIKALLDDLLKEAKKYQDKAAKVVDKLtDFENQTEKDQT-------ALETLEKALK- 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1812 ektqvkrlenlasrlketmEKLTEErdqraaAENREKEQNKRLQRQLrdtKEEMSELARKEAEASRKKHELEMDLESLEA 1891
Cdd:cd22656    167 -------------------DLLTDE------GGAIARKEIKDLQKEL---EKLNEEYAAKLKAKIDELKALIADDEAKLA 218
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082233 1892 ANQSLQADLKLAFKRIGDLQAAIEDEMESDENedLINSLQDMVTKYQKKKNKLEGDSD 1949
Cdd:cd22656    219 AALRLIADLTAADTDLDNLLALIGPAIPALEK--LQGAWQAIATDLDSLKDLLEDDIS 274
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
1372-1776 7.55e-04

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 45.00  E-value: 7.55e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1372 EDKMEVEQQSRRQLERRLGDLQADSDESQ--------RALQQLKKKCQRLTAELQDTKlhlegqqvrNHELEKKQRrfdS 1443
Cdd:NF033838    54 ESQKEHAKEVESHLEKILSEIQKSLDKRKhtqnvalnKKLSDIKTEYLYELNVLKEKS---------EAELTSKTK---K 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1444 ELSQAHEETQREKLQREKLQREKDMLLAEA-FSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESkdEASLAKVKKQL 1522
Cdd:NF033838   122 ELDAAFEQFKKDTLEPGKKVAEATKKVEEAeKKAKDQKEEDRRNYPTNTYKTLELEIAESDVEVKKA--ELELVKEEAKE 199
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1523 RDLEAKVKDQEEELDEQAGSIQMLEQAKlrlememermrqthskemESRDEEVEEARQSCQKKLKQ-MEVQLEEEYEDKQ 1601
Cdd:NF033838   200 PRDEEKIKQAKAKVESKKAEATRLEKIK------------------TDREKAEEEAKRRADAKLKEaVEKNVATSEQDKP 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1602 KAlREKRELESKLSTlsdqvnqrdfesekrlrKDLKRTKALLADAQIMLDHLKNnaPSKREiaqlKNQLEESEFTCAAAV 1681
Cdd:NF033838   262 KR-RAKRGVLGEPAT-----------------PDKKENDAKSSDSSVGEETLPS--PSLKP----EKKVAEAEKKVEEAK 317
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1682 KARKAMEVE---------MEDLHLQiddIAKAKTALEE-QLSRLQREKNEIQNrleedQEDMNELMKKHKAAVAQASRdM 1751
Cdd:NF033838   318 KKAKDQKEEdrrnyptntYKTLELE---IAESDVKVKEaELELVKEEAKEPRN-----EEKIKQAKAKVESKKAEATR-L 388
                          410       420
                   ....*....|....*....|....*
gi 1907082233 1752 AQMNDLQAQIEESNKEKQELQEKLQ 1776
Cdd:NF033838   389 EKIKTDRKKAEEEAKRKAAEEDKVK 413
OmpH pfam03938
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
1852-1946 1.50e-03

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 461098 [Multi-domain]  Cd Length: 140  Bit Score: 41.02  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1852 KRLQRQLRDTKEEM-SELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEdLINSL 1930
Cdd:pfam03938   18 KAAQAQLEKKFKKRqAELEAKQKELQKLYEELQKDGALLEEEREEKEQELQKKEQELQQLQQKAQQELQKKQQE-LLQPI 96
                           90
                   ....*....|....*.
gi 1907082233 1931 QDMVTKYQKKKNKLEG 1946
Cdd:pfam03938   97 QDKINKAIKEVAKEKG 112
growth_prot_Scy NF041483
polarized growth protein Scy;
1427-1901 1.53e-03

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 44.05  E-value: 1.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1427 QQVRNHELEKKQRRFDSE--LSQAHEETQR----EKLQREKLQREkdmLLAEAFSLKQQMEEK------------DLDIA 1488
Cdd:NF041483    76 QLLRNAQIQADQLRADAEreLRDARAQTQRilqeHAEHQARLQAE---LHTEAVQRRQQLDQElaerrqtveshvNENVA 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1489 GFTQKVVSLEAE---LQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEM-------- 1557
Cdd:NF041483   153 WAEQLRARTESQarrLLDESRAEAEQALAAARAEAERLAEEARQRLGSEAESARAEAEAILRRARKDAERLLnaastqaq 232
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1558 ------ERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNqrdfesEKR 1631
Cdd:NF041483   233 eatdhaEQLRSSTAAESDQARRQAAELSRAAEQRMQEAEEALREARAEAEKVVAEAKEAAAKQLASAESAN------EQR 306
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1632 LRKdlkrtkalladaqimldhlknnapSKREIAQLKNQ-LEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEE 1710
Cdd:NF041483   307 TRT------------------------AKEEIARLVGEaTKEAEALKAEAEQALADARAEAEKLVAEAAEKARTVAAEDT 362
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1711 --QLSRLQREKNEIQNRLEEDQEDM-----NELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQ--------ELQEKL 1775
Cdd:NF041483   363 aaQLAKAARTAEEVLTKASEDAKATtraaaEEAERIRREAEAEADRLRGEAADQAEQLKGAAKDDTkeyraktvELQEEA 442
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1776 QALQSQVEFLEQSMVDKSLVSRQEAKireletrlefeKTQVKRLENLASRLKETMEKLTEERDQ--RAAAENREKEQNKR 1853
Cdd:NF041483   443 RRLRGEAEQLRAEAVAEGERIRGEAR-----------REAVQQIEEAARTAEELLTKAKADADElrSTATAESERVRTEA 511
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1854 LQR--QLRDTKEEMSELARKEAEasRKKHELEMDLESLEAANQSLQADLK 1901
Cdd:NF041483   512 IERatTLRRQAEETLERTRAEAE--RLRAEAEEQAEEVRAAAERAARELR 559
KLF9_13_N-like cd21975
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ...
20-130 2.63e-03

Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.


Pssm-ID: 409240 [Multi-domain]  Cd Length: 163  Bit Score: 40.83  E-value: 2.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   20 GVPKPGPPGKEEGAPEASSKDGSAPPRSPQPATSPVPSETSQTAKSPEPTLTMNGLGAAS----AEGPNEEAQGLSrKRV 95
Cdd:cd21975     15 GAVVHGVRPDPEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGADSPGLVTAAPHLLAANvlapLRGPSVEGSSLE-SGD 93
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 1907082233   96 ANAVRKVVSKVLPGEEPGTAKEPPGRGARSPEHPA 130
Cdd:cd21975     94 ADMGSDSDVAPASGAAASTSPESSSDAASSPSPLS 128
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
15-305 7.91e-03

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 41.59  E-value: 7.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   15 KEEDKGVPKPGPPGKEEGAPEASSKDGSAPPRSPQPATSPVPSETSQTAKSPEPTLTMnglgAASAEGPNEEAQGLSRKR 94
Cdd:COG5180    225 RPEAASSPKVDPPSTSEARSRPATVDAQPEMRPPADAKERRRAAIGDTPAAEPPGLPV----LEAGSEPQSDAPEAETAR 300
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   95 VANavRKVVSKVLPGEEPGtakEPPGrGARSPEHPARGKKGEKAATSPKPPPPPPPPPAPPK--------PETKKEAAKD 166
Cdd:COG5180    301 PID--VKGVASAPPATRPV---RPPG-GARDPGTPRPGQPTERPAGVPEAASDAGQPPSAYPpaeeavpgKPLEQGAPRP 374
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  167 ELSVGLRSLMSRGRGKD-HKSRGKQPPGKGEKAPSQEPASTGKSGSPDMVDSPKKAG---SPAKPETPNKRCSP-APAQE 241
Cdd:COG5180    375 GSSGGDGAPFQPPNGAPqPGLGRRGAPGPPMGAGDLVQAALDGGGRETASLGGAAGGagqGPKADFVPGDAESVsGPAGL 454
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907082233  242 LADP-NLAGPKLNPSGKQQAKSPASDVQQETGADPEVRLSPAEEAHQRLERIFTASLDPEAASPA 305
Cdd:COG5180    455 ADQAgAAASTAMADFVAPVTDATPVDVADVLGVRPDAILGGNVAPASGLDAETRIIEAEGAPATE 519
 
Name Accession Description Interval E-value
MYSc_Myo18 cd01386
class XVIII myosin, motor domain; Many members of this class contain a N-terminal PDZ domain ...
394-1151 0e+00

class XVIII myosin, motor domain; Many members of this class contain a N-terminal PDZ domain which is commonly found in proteins establishing molecular complexes. The motor domain itself does not exhibit ATPase activity, suggesting that it functions as an actin tether protein. It also has two IQ domains that probably bind light chains or related calmodulins and a C-terminal tail with two sections of coiled-coil domains, which are thought to mediate homodimerization. The function of these myosins are largely unknown. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276837 [Multi-domain]  Cd Length: 689  Bit Score: 1131.26  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd01386      1 SSVLHTLRQRYGANLIHTYAGPSLIVINPRHPLAVYSEKVAKMFKGCRREDMPPHIYASAQSAYRAMLMSRRDQSIVLLG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGTSGtKVFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTM 553
Cdd:cd01386     81 RSGSGKTTNCRHILEYLVTAAGSVG-GVLSVEKLNAALTVLEAFGNVRTALNGNATRFSQLFSLDFDQAGQLASASIQTL 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  554 LLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPEE 633
Cdd:cd01386    160 LLERSRVARRPEGESNFNVFYYLLAGADAALRTELHLNQLAESNSFGIVPLQKPEDKQKAAAAFSKLQAAMKTLGISEEE 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  634 QKTCWLILASIYHLGAAGATKEPleeqdeaaEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGTLQRSTSFRQ 713
Cdd:cd01386    240 QRAIWSILAAIYHLGAAGATKAA--------SAGRKQFARPEWAQRAAYLLGCTLEELSSAIFKHHLSGGPQQSTTSSGQ 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  714 GPEESGLGEGTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGGSARGASFEELCHN 793
Cdd:cd01386    312 ESPARSSSGGPKLTGVEALEGFAAGLYSELFAAVVSLINRSLSSSHHSTSSITIVDTPGFQNPAHSGSQRGATFEDLCHN 391
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  794 YAQDRLQRLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASH--LVRSLAHADEARGLLWLLEEEALVPGAT 871
Cdd:cd01386    392 YAQERLQLLFHERTFVAPLERYKQENVEVDFDLPELSPGALVALIDQAPQqaLVRSDLRDEDRRGLLWLLDEEALYPGSS 471
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  872 EDALLDRLFSYYGPQEGDkKGQSPLLRSSKPRHFLLGHSHGTNWVEYNVAGWLNYTKQNPATQNAPRLLQDSQKKiisnl 951
Cdd:cd01386    472 DDTFLERLFSHYGDKEGG-KGHSLLRRSEGPLQFVLGHLLGTNPVEYDVSGWLKAAKENPSAQNATQLLQESQKE----- 545
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  952 flgragsatvlsgsiagleggsqlalrratsmrktfttgMAAVKKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEG 1031
Cdd:cd01386    546 ---------------------------------------TAAVKRKSPCLQIKFQVDALIDTLRRTGLHFVHCLLPQHNA 586
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1032 WPGEPRsassrrvsssseldlppGDPCEAGLLQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLT 1111
Cdd:cd01386    587 GKDERS-----------------TSSPAAGDELLDVPLLRSQLRGSQLLDALRLYRQGFPDHMPLGEFRRRFQVLAPPLT 649
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|
gi 1907082233 1112 KKHGRNYIVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd01386    650 KKLGLNSEVADERKAVEELLEELDLEKSSYRIGLSQVFFR 689
MYSc cd00124
Myosin motor domain superfamily; Myosin motor domain. The catalytic (head) domain has ATPase ...
395-1151 1.45e-144

Myosin motor domain superfamily; Myosin motor domain. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276950 [Multi-domain]  Cd Length: 633  Bit Score: 465.91  E-value: 1.45e-144
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRR-EDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd00124      2 AILHNLRERYARDLIYTYVGDILVAVNPFKWLPLYSEEVMEKYRGKGRsADLPPHVFAVADAAYRAMLRDGQNQSILISG 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGTSGTKVFS-----VEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASA 548
Cdd:cd00124     82 ESGAGKTETTKLVLKYLAALSGSGSSKSSSsassiEQQILQSNPILEAFGNAKTVRNDNSSRFGKFIELQFDPTGRLVGA 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  549 SIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAEN---NVFGIVPLSKPEEKQKAAQQFSKLQAAMK 625
Cdd:cd00124    162 SIETYLLEKSRVVSQAPGERNFHIFYQLLAGLSDGAREELKLELLLSYyylNDYLNSSGCDRIDGVDDAEEFQELLDALD 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  626 VLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAGrkqfarHEWAQKAAYLLGCSLEELSSAIFKHQLKGGTl 705
Cdd:cd00124    242 VLGFSDEEQDSIFRILAAILHLGNIEFEEDEEDEDSSAEVAD------DESLKAAAKLLGVDAEDLEEALTTRTIKVGG- 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  706 qrstsfrqgpeESGLGEGTKLSALECLEGMASGLYSELFTLLISLVNRALKSS--QHSLCSMMIVDTPGFQNPEWggsar 783
Cdd:cd00124    315 -----------ETITKPLTVEQAEDARDALAKALYSRLFDWLVNRINAALSPTdaAESTSFIGILDIFGFENFEV----- 378
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  784 gASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELA---FDDLEPVAD------DSV-AAVDQASHLvrslahade 853
Cdd:cd00124    379 -NSFEQLCINYANEKLQQFFNQHVFKLEQEEYEEEGIDWSfidFPDNQDCLDliegkpLGIlSLLDEECLF--------- 448
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  854 argllwlleeealvPGATEDALLDRLFSYYGPQEGDKKGqspllRSSKPRHFllGHSHGTNWVEYNVAGWLNYTKQNpat 933
Cdd:cd00124    449 --------------PKGTDATFLEKLYSAHGSHPRFFSK-----KRKAKLEF--GIKHYAGDVTYDADGFLEKNKDT--- 504
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  934 qnaprllqdsqkkiisnlflgragsatvLSGSIAGLeggsqlaLRRATSMRKtfttgmaavkkkslciqiklQVDALIDT 1013
Cdd:cd00124    505 ----------------------------LPPDLVDL-------LRSGSQFRS--------------------QLDALMDT 529
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1014 IKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDH 1093
Cdd:cd00124    530 LNSTQPHFVRCIKPNDEKKPG-----------------------------LFDPELVLEQLRCAGVLEAVRIRRAGYPVR 580
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082233 1094 MVFSEFRRRFDVLAPHLTKKHGRNYIVVDEKRAveellESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd00124    581 LPFDEFLKRYRILAPGATEKASDSKKAAVLALL-----LLLKLDSSGYQLGKTKVFLR 633
MYSc smart00242
Myosin. Large ATPases; ATPase; molecular motor. Muscle contraction consists of a cyclical ...
375-1163 2.32e-113

Myosin. Large ATPases; ATPase; molecular motor. Muscle contraction consists of a cyclical interaction between myosin and actin. The core of the myosin structure is similar in fold to that of kinesin.


Pssm-ID: 214580 [Multi-domain]  Cd Length: 677  Bit Score: 377.27  E-value: 2.32e-113
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   375 NAPSCDRLEDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQ 454
Cdd:smart00242    1 NPPKFEGVEDLVLLTYLNEPAVLHNLKKRYLKDLIYTYIGLVLVAVNPYKQLPIYTDEVIKKYRGKSRGELPPHVFAIAD 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   455 TAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGtSGTKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFS 532
Cdd:smart00242   81 NAYRNMLNDKENQSIIISGESGAGKTENTKKIMQYLASVSG-SNTEVGSVED-QILESnpILEAFGNAKTLRNNNSSRFG 158
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   533 QILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHlAENNVFgivpLSKPEEKQK 612
Cdd:smart00242  159 KFIEIHFDAKGKIIGAKIETYLLEKSRVVSQAKGERNYHIFYQLLAGASEELKKELGLKS-PEDYRY----LNQGGCLTV 233
                           250       260       270       280       290       300       310       320
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   613 A----AQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAagatkepLEEQDEAAEAGRKQFARHEWAQKAAYLLGCSL 688
Cdd:smart00242  234 DgiddAEEFKETLNAMRVLGFSEEEQESIFKILAAILHLGN-------IEFEEGRNDNAASTVKDKEELSNAAELLGVDP 306
                           330       340       350       360       370       380       390       400
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   689 EELSSAIfkhqlkggtLQRSTSFRQGPEESGLgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIV 768
Cdd:smart00242  307 EELEKAL---------TKRKIKTGGEVITKPL---NVEQALDARDALAKALYSRLFDWLVKRINQSLSFKDGSTYFIGVL 374
                           410       420       430       440       450       460       470       480
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   769 DTPGFQ----NpewggsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADdsvaavdqa 841
Cdd:smart00242  375 DIYGFEifevN----------SFEQLCINYANEKLQQFFNQHVFKLEQEEYEREGIDwtfIDFFDNQDCID--------- 435
                           490       500       510       520       530       540       550       560
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   842 shLVRS-----LAHADEArgllwlleeeALVPGATEDALLDRLFSYYgpqegdkkGQSPLLRSSKPR---HFLLGHSHGT 913
Cdd:smart00242  436 --LIEKkppgiLSLLDEE----------CRFPKGTDQTFLEKLNQHH--------KKHPHFSKPKKKgrtEFIIKHYAGD 495
                           570       580       590       600       610       620       630       640
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   914 nwVEYNVAGWLnytKQN--PATQNAPRLLQDSQKKIISNLFlgragsatvlsGSIAGleggsqlalrRATSMRKTFTTGM 991
Cdd:smart00242  496 --VTYDVTGFL---EKNkdTLSDDLIELLQSSKNPLIASLF-----------PSGVS----------NAGSKKRFQTVGS 549
                           650       660       670       680       690       700       710       720
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   992 aavkkkslciQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLR 1071
Cdd:smart00242  550 ----------QFKEQLNELMDTLNSTNPHFIRCIKPNEEKKPG-----------------------------DFDSSLVL 590
                           730       740       750       760       770       780       790       800
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  1072 AQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHlTKKHGRNyivvDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:smart00242  591 HQLRYLGVLENIRIRRAGFPYRLPFDEFLQRYRVLLPD-TWPPWGG----DAKKACEALLQSLGLDEDEYQLGKTKVFLR 665
                           810
                    ....*....|..
gi 1907082233  1152 AGTLARLEEQRD 1163
Cdd:smart00242  666 PGQLAELEELRE 677
COG5022 COG5022
Myosin heavy chain [General function prediction only];
356-1877 3.16e-109

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 384.81  E-value: 3.16e-109
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  356 KLDHDGAILDVDEDDIEKANAPSCDRLEDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMH 435
Cdd:COG5022     42 KEDGESVSVKKKVLGNDRIKLPKFDGVDDLTELSYLNEPAVLHNLEKRYNNGQIYTYSGLVLIAVNPYRDLGIYTDDIIQ 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  436 MFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKwQALST--L 513
Cdd:COG5022    122 SYSGKNRLELEPHVFAIAEEAYRNLLSEKENQTIIISGESGAGKTENAKRIMQYLASVTSSSTVEISSIEK-QILATnpI 200
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  514 LEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL--- 590
Cdd:COG5022    201 LEAFGNAKTVRNDNSSRFGKYIKIEFDENGEICGAKIETYLLEKSRVVHQNKNERNYHIFYQLLAGDPEELKKLLLLqnp 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  591 ---NHLAENNVFGIVPLSKPEEkqkaaqqFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAgatkEPLEEQDEAAeag 667
Cdd:COG5022    281 kdyIYLSQGGCDKIDGIDDAKE-------FKITLDALKTIGIDEEEQDQIFKILAAILHIGNI----EFKEDRNGAA--- 346
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  668 rkQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGG--TLQRSTSFRQgpeesglgegtklsALECLEGMASGLYSELFT 745
Cdd:COG5022    347 --IFSDNSVLDKACYLLGIDPSLFVKWLVKRQIKTGgeWIVVPLNLEQ--------------ALAIRDSLAKALYSNLFD 410
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  746 LLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---L 822
Cdd:COG5022    411 WIVDRINKSLDHSAAASNFIGVLDIYGFEIFEKN------SFEQLCINYTNEKLQQFFNQHMFKLEQEEYVKEGIEwsfI 484
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  823 AFDDLEPVADDSvaavdQASHLVRSLAHADEArgllwlleeeALVPGATEDALLDRLFSYYgPQEGDKKGQSPLLRSSKp 902
Cdd:COG5022    485 DYFDNQPCIDLI-----EKKNPLGILSLLDEE----------CVMPHATDESFTSKLAQRL-NKNSNPKFKKSRFRDNK- 547
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  903 rhFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKKIISNLFLGRAgsatvlsgsiaglEGGSQLALRRATS 982
Cdd:COG5022    548 --FVVKHYAGD--VEYDVEGFLDKNK-DPLNDDLLELLKASTNEFVSTLFDDEE-------------NIESKGRFPTLGS 609
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  983 MrktfttgmaavkkkslciqIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagl 1062
Cdd:COG5022    610 R-------------------FKESLNSLMSTLNSTQPHYIRCIKPNEEKSPW---------------------------- 642
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1063 lQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHlTKKHGRNYIVVDEKRAVEELLESLDLEKSSCC 1142
Cdd:COG5022    643 -TFDNQMVLSQLRCCGVLETIRISRAGFPSRWTFDEFVQRYRILSPS-KSWTGEYTWKEDTKNAVKSILEELVIDSSKYQ 720
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1143 LGLSRVFFRAGTLARLEEQRDEQTSRHLTLFQAACRGYLARQHF-KKRKIQDlAIRCVQKNIKKNKGVKDWPWWKLFTTV 1221
Cdd:COG5022    721 IGNTKVFFKAGVLAALEDMRDAKLDNIATRIQRAIRGRYLRRRYlQALKRIK-KIQVIQHGFRLRRLVDYELKWRLFIKL 799
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1222 RPLIQVQLSEEQIRNKDEEIQQLRSKLEKV----EKERNELRLSSDRLETRISEltSELTDERNTGESASQLLDAETAER 1297
Cdd:COG5022    800 QPLLSLLGSRKEYRSYLACIIKLQKTIKREkklrETEEVEFSLKAEVLIQKFGR--SLKAKKRFSLLKKETIYLQSAQRV 877
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1298 LRTEKEMKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDF----TKKRLQQELED 1373
Cdd:COG5022    878 ELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLeegpSIEYVKLPELN 957
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1374 KMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRrfdsELSQAHEETQ 1453
Cdd:COG5022    958 KLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPV----EVAELQSASK 1033
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1454 REKLQREKLQREKDMLLAEAFSLKQQME-EKDLdiagftqKVVSLEAELQDIssqESKDEASLAKVKKQLRDLEAK---V 1529
Cdd:COG5022   1034 IISSESTELSILKPLQKLKGLLLLENNQlQARY-------KALKLRRENSLL---DDKQLYQLESTENLLKTINVKdleV 1103
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1530 KDQEEELDEQAGSIQMLEQAKLRLEMEMERM--RQTHSKEMESRDEEVEEARQSCQKKLKQMEvqlEEEYEDKQKALREK 1607
Cdd:COG5022   1104 TNRNLVKPANVLQFIVAQMIKLNLLQEISKFlsQLVNTLEPVFQKLSVLQLELDGLFWEANLE---ALPSPPPFAALSEK 1180
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1608 RELESKL----STLSDQVNQ---RDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLknqleESEFTCAAA 1680
Cdd:COG5022   1181 RLYQSALydekSKLSSSEVNdlkNELIALFSKIFSGWPRGDKLKKLISEGWVPTEYSTSLKGFNNL-----NKKFDTPAS 1255
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1681 VKarkameveMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLeeDQEDMNEL-MKKHKAAVAQASRDmaqmnDLQA 1759
Cdd:COG5022   1256 MS--------NEKLLSLLNSIDNLLSSYKLEEEVLPATINSLLQYI--NVGLFNALrTKASSLRWKSATEV-----NYNS 1320
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1760 QIEESNKEKQELQEklqaLQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKT-QVKRLENlasrlketmeklteeRD 1838
Cdd:COG5022   1321 EELDDWCREFEISD----VDEELEELIQAVKVLQLLKDDLNKLDELLDACYSLNPaEIQNLKS---------------RY 1381
                         1530      1540      1550      1560
                   ....*....|....*....|....*....|....*....|
gi 1907082233 1839 QRAAAENR-EKEQNKRLQRQLRDTKEEMSELARKEAEASR 1877
Cdd:COG5022   1382 DPADKENNlPKEILKKIEALLIKQELQLSLEGKDETEVHL 1421
Myosin_head pfam00063
Myosin head (motor domain);
383-1151 2.89e-108

Myosin head (motor domain);


Pssm-ID: 395017 [Multi-domain]  Cd Length: 674  Bit Score: 362.37  E-value: 2.89e-108
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  383 EDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLM 462
Cdd:pfam00063    2 EDMVELSYLNEPSVLHNLKKRYKSDLIYTYSGLVLVAVNPYKQLPIYSEDMIKAYRGKRRGELPPHIFAIADEAYRSMLQ 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  463 SRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGT-SGTKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDF 539
Cdd:pfam00063   82 DKENQSILISGESGAGKTENTKKIMQYLASVSGSgSAGNVGRLEE-QILQSnpILEAFGNAKTVRNNNSSRFGKYIEIQF 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  540 DQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL---------NHLAENNVFGIvplskpeek 610
Cdd:pfam00063  161 DAKGDIVGGKIETYLLEKSRVVYQAEGERNYHIFYQLLAGASAQLKKELRLtnpkdyhylSQSGCYTIDGI--------- 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  611 qKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEaaeagrkqFARHEWAQKAAYLLGCSLEE 690
Cdd:pfam00063  232 -DDSEEFKITDKAMDILGFSDEEQMGIFRIVAAILHLGNIEFKKERNDEQAV--------PDDTENLQKAASLLGIDSTE 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  691 LSSAIFKHQLKGGTLQRSTSFrqgpeesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVD 769
Cdd:pfam00063  303 LEKALCKRRIKTGRETVSKPQ------------NVEQANYARDALAKAIYSRLFDWLVDRINKSLDVKTIEKASFIgVLD 370
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  770 TPGFQNPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADdsvaavdqashLVR 846
Cdd:pfam00063  371 IYGFEIFEKN------SFEQLCINYVNEKLQQFFNHHMFKLEQEEYVREGIEwtfIDFGDNQPCID-----------LIE 433
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  847 S-----LAHADEArgllwlleeeALVPGATEDALLDRLFSYYgpqegdkkGQSPLLRSSKPR---HFLLGHSHGTnwVEY 918
Cdd:pfam00063  434 KkplgiLSLLDEE----------CLFPKATDQTFLDKLYSTF--------SKHPHFQKPRLQgetHFIIKHYAGD--VEY 493
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  919 NVAGWLNYTKqNPATQNAPRLLQDSQKKIISNLFLGRAgsatvLSGSIAGLEGGSQLALRRATSMRKtfTTGMaavkkks 998
Cdd:pfam00063  494 NVEGFLEKNK-DPLNDDLVSLLKSSSDPLLAELFPDYE-----TAESAAANESGKSTPKRTKKKRFI--TVGS------- 558
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  999 lciQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSR 1078
Cdd:pfam00063  559 ---QFKESLGELMKTLNSTNPHYIRCIKPNEKKRAG-----------------------------VFDNSLVLHQLRCNG 606
                          730       740       750       760       770       780       790
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082233 1079 LLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLTKKhgrnyIVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:pfam00063  607 VLEGIRIRRAGFPNRITFQEFVQRYRILAPKTWPK-----WKGDAKKGCEAILQSLNLDKEEYQFGKTKIFFR 674
MYSc_class_II cd01377
class II myosins, motor domain; Myosin motor domain in class II myosins. Class II myosins, ...
395-1151 2.46e-100

class II myosins, motor domain; Myosin motor domain in class II myosins. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. Thus, myosin II has two heads. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276951 [Multi-domain]  Cd Length: 662  Bit Score: 339.05  E-value: 2.46e-100
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPaVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd01377      2 SVLHNLRERYYSDLIYTYSGLFCVAVNPyKRLP-IYTEEVIDKYKGKRREEMPPHIFAIADNAYRNMLQDRENQSILITG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKW-----QALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVA 546
Cdd:cd01377     81 ESGAGKTENTKKVIQYLASVAASSKKKKESGKKKgtledQILQAnpILEAFGNAKTVRNNNSSRFGKFIRIHFGSTGKIA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  547 SASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFgIVPLSKPEEKQKAAQQFSKLQAAMKV 626
Cdd:cd01377    161 GADIETYLLEKSRVVRQAKGERNYHIFYQLLSGADPELKEKLLLTGDPSYYFF-LSQGELTIDGVDDAEEFKLTDEAFDI 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  627 LAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEagrkqfarHEWAQKAAYLLGCSLEELSSAIFKHQLKGGT-- 704
Cdd:cd01377    240 LGFSEEEKMSIFKIVAAILHLGNIKFKQRRREEQAELDG--------TEEADKAAHLLGVNSSDLLKALLKPRIKVGRew 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  705 LQRSTSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQ----Npewgg 780
Cdd:cd01377    312 VTKGQNKEQ--------------VVFSVGALAKALYERLFLWLVKRINKTLDTKSKRQYFIGVLDIAGFEifefN----- 372
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  781 sargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAF----DDLEPVADdsvaavdqashLVRS-----LAHA 851
Cdd:cd01377    373 -----SFEQLCINYTNEKLQQFFNHHMFVLEQEEYKKEGIEWTFidfgLDLQPTID-----------LIEKpnmgiLSIL 436
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  852 DEargllwlleeEALVPGATEDALLDRLFSYygpQEGDKKGQSPLLRSSKPRHFLLGHSHGTnwVEYNVAGWLnyTK-QN 930
Cdd:cd01377    437 DE----------ECVFPKATDKTFVEKLYSN---HLGKSKNFKKPKPKKSEAHFILKHYAGD--VEYNIDGWL--EKnKD 499
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  931 PATQNAPRLLQDSQKKIISNLFlgragsatvlsgSIAGLEGGSQLALRRATSMRKTfttgMAAVKKKSLciqiklqvDAL 1010
Cdd:cd01377    500 PLNENVVALLKKSSDPLVASLF------------KDYEESGGGGGKKKKKGGSFRT----VSQLHKEQL--------NKL 555
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1011 IDTIKRSKMHFVHCFLPVAEGWPGEprsassrrvsssseldlppgdpceagllqLDVSLLRAQLRGSRLLDAMRMYRQGY 1090
Cdd:cd01377    556 MTTLRSTHPHFVRCIIPNEEKKPGK-----------------------------IDAPLVLHQLRCNGVLEGIRICRKGF 606
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1091 PDHMVFSEFRRRFDVLAPHLTKKhgrnyIVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd01377    607 PNRIIFAEFKQRYSILAPNAIPK-----GFDDGKAACEKILKALQLDPELYRIGNTKVFFK 662
MYSc_Myh10 cd14920
class II myosin heavy chain 10, motor domain; Myosin motor domain of non-muscle myosin heavy ...
394-1151 4.07e-90

class II myosin heavy chain 10, motor domain; Myosin motor domain of non-muscle myosin heavy chain 10 (also called NMMHCB). Mutations in this gene have been associated with May-Hegglin anomaly and developmental defects in brain and heart. Multiple transcript variants encoding different isoforms have been found for this gene. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276952 [Multi-domain]  Cd Length: 673  Bit Score: 309.63  E-value: 4.07e-90
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14920      1 ASVLHNLKDRYYSGLIYTYSGLFCVVINPYKNLPIYSENIIEMYRGKKRHEMPPHIYAISESAYRCMLQDREDQSILCTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIA-----GTSGTKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVA 546
Cdd:cd14920     81 ESGAGKTENTKKVIQYLAHVAsshkgRKDHNIPGELER-QLLQAnpILESFGNAKTVKNDNSSRFGKFIRINFDVTGYIV 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  547 SASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVF--GIVPLSkpeeKQKAAQQFSKLQAAM 624
Cdd:cd14920    160 GANIETYLLEKSRAVRQAKDERTFHIFYQLLSGAGEHLKSDLLLEGFNNYRFLsnGYIPIP----GQQDKDNFQETMEAM 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  625 KVLAISPEEQKTCWLILASIYHLGAAGATKEplEEQDEAAeagrkqFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGT 704
Cdd:cd14920    236 HIMGFSHEEILSMLKVVSSVLQFGNISFKKE--RNTDQAS------MPENTVAQKLCHLLGMNVMEFTRAILTPRIKVGR 307
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  705 --LQRSTsfrqgpeesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQNPEWGgs 781
Cdd:cd14920    308 dyVQKAQ--------------TKEQADFAVEALAKATYERLFRWLVHRINKALDRTKRQGASFIgILDIAGFEIFELN-- 371
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  782 argaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFD----DLEPVADdsvaAVDQASHLVRSLAHADEargl 857
Cdd:cd14920    372 ----SFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWNFIdfglDLQPCID----LIERPANPPGVLALLDE---- 439
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  858 lwlleeEALVPGATEDALLDRLfsyygpqEGDKKGQSPLLRSSKPR---HFLLGHSHGTnwVEYNVAGWLnYTKQNPATQ 934
Cdd:cd14920    440 ------ECWFPKATDKTFVEKL-------VQEQGSHSKFQKPRQLKdkaDFCIIHYAGK--VDYKADEWL-MKNMDPLND 503
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  935 NAPRLLQDSQKKIISNLFlgRAGSATVlsgSIAGLEGGSQLALRRATSMRKTFTTGMAAVKKKSLciqiklqvDALIDTI 1014
Cdd:cd14920    504 NVATLLHQSSDRFVAELW--KDVDRIV---GLDQVTGMTETAFGSAYKTKKGMFRTVGQLYKESL--------TKLMATL 570
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1015 KRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHM 1094
Cdd:cd14920    571 RNTNPNFVRCIIPNHEKRAG-----------------------------KLDPHLVLDQLRCNGVLEGIRICRQGFPNRI 621
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082233 1095 VFSEFRRRFDVLAPHLTKKhgrnyIVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14920    622 VFQEFRQRYEILTPNAIPK-----GFMDGKQACERMIRALELDPNLYRIGQSKIFFR 673
MYSc_Myh11 cd14921
class II myosin heavy chain 11, motor domain; Myosin motor domain of smooth muscle myosin ...
394-1151 7.14e-84

class II myosin heavy chain 11, motor domain; Myosin motor domain of smooth muscle myosin heavy chain 11 (also called SMMHC, SMHC). The gene product is a subunit of a hexameric protein that consists of two heavy chain subunits and two pairs of non-identical light chain subunits. It functions as a major contractile protein, converting chemical energy into mechanical energy through the hydrolysis of ATP. The gene encoding a human ortholog of rat NUDE1 is transcribed from the reverse strand of this gene, and its 3' end overlaps with that of the latter. Inversion of the MYH11 locus is one of the most frequent chromosomal aberrations found in acute myeloid leukemia. Alternative splicing generates isoforms that are differentially expressed, with ratios changing during muscle cell maturation. Mutations in MYH11 have been described in individuals with thoracic aortic aneurysms leading to acute aortic dissections with patent ductus arteriosus. MYH11 mutations are also thought to contribute to human colorectal cancer and are also associated with Peutz-Jeghers syndrome. The mutations found in human intestinal neoplasia result in unregulated proteins with constitutive motor activity, similar to the mutant myh11 zebrafish. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276885 [Multi-domain]  Cd Length: 673  Bit Score: 291.15  E-value: 7.14e-84
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14921      1 ASVLHNLRERYFSGLIYTYSGLFCVVVNPYKHLPIYSEKIVDMYKGKKRHEMPPHIYAIADTAYRSMLQDREDQSILCTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIA----GTSGTKVFSVEKWQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVAS 547
Cdd:cd14921     81 ESGAGKTENTKKVIQYLAVVAsshkGKKDTSITGELEKQLLQAnpILEAFGNAKTVKNDNSSRFGKFIRINFDVTGYIVG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  548 ASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNV--FGIVPLSkpeeKQKAAQQFSKLQAAMK 625
Cdd:cd14921    161 ANIETYLLEKSRAIRQARDERTFHIFYYLIAGAKEKMRSDLLLEGFNNYTFlsNGFVPIP----AAQDDEMFQETLEAMS 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  626 VLAISPEEQKTCWLILASIYHLGAAGATKEplEEQDEAAeagrkqFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGtl 705
Cdd:cd14921    237 IMGFSEEEQLSILKVVSSVLQLGNIVFKKE--RNTDQAS------MPDNTAAQKVCHLMGINVTDFTRSILTPRIKVG-- 306
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  706 qrsTSFRQGPEesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQNPEWGgsarg 784
Cdd:cd14921    307 ---RDVVQKAQ-------TKEQADFAIEALAKATYERLFRWILTRVNKALDKTHRQGASFLgILDIAGFEIFEVN----- 371
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  785 aSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASHLVRSLAHADEargllwlleeE 864
Cdd:cd14921    372 -SFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWNFIDFGLDLQPCIELIERPNNPPGVLALLDE----------E 440
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  865 ALVPGATEDALLDRLFSyygPQEGDKKGQSPLLRSSKPRHFLLghsHGTNWVEYNVAGWLNyTKQNPATQNAPRLLQDSQ 944
Cdd:cd14921    441 CWFPKATDKSFVEKLCT---EQGNHPKFQKPKQLKDKTEFSII---HYAGKVDYNASAWLT-KNMDPLNDNVTSLLNASS 513
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  945 KKIISNLFLGragsatvlSGSIAGLEGGSQLALRRATSMRKTfTTGMAavkkKSLCIQIKLQVDALIDTIKRSKMHFVHC 1024
Cdd:cd14921    514 DKFVADLWKD--------VDRIVGLDQMAKMTESSLPSASKT-KKGMF----RTVGQLYKEQLGKLMTTLRNTTPNFVRC 580
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1025 FLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFD 1104
Cdd:cd14921    581 IIPNHEKRSG-----------------------------KLDAFLVLEQLRCNGVLEGIRICRQGFPNRIVFQEFRQRYE 631
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*..
gi 1907082233 1105 VLAPHLTKKHgrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14921    632 ILAANAIPKG-----FMDGKQACILMIKALELDPNLYRIGQSKIFFR 673
MYSc_Myh2_insects_mollusks cd14911
class II myosin heavy chain 2, motor domain; Myosin motor domain of type IIa skeletal muscle ...
394-1151 3.27e-83

class II myosin heavy chain 2, motor domain; Myosin motor domain of type IIa skeletal muscle myosin heavy chain 2 (also called MYH2A, MYHSA2, MyHC-IIa, MYHas8, MyHC-2A) in insects and mollusks. This gene encodes a member of the class II or conventional myosin heavy chains, and functions in skeletal muscle contraction. Mutations in this gene results in inclusion body myopathy-3 and familial congenital myopathy. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276876 [Multi-domain]  Cd Length: 674  Bit Score: 289.19  E-value: 3.27e-83
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14911      1 ASVLHNIKDRYYSGLIYTYSGLFCVVVNPYKKLPIYTEKIMERYKGIKRHEVPPHVFAITDSAYRNMLGDREDQSILCTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIA-----GTSGTKVFSV-------EKWQAL---STLLEAFGNSPTIMNGSATRFSQILSLD 538
Cdd:cd14911     81 ESGAGKTENTKKVIQFLAYVAaskpkGSGAVPHPAVnpavligELEQQLlqaNPILEAFGNAKTVKNDNSSRFGKFIRIN 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  539 FDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHlAENNVF---GIVPLSKPEEkqkaAQ 615
Cdd:cd14911    161 FDASGFISGANIETYLLEKSRAIRQAKDERTFHIFYQLLAGATPEQREKFILDD-VKSYAFlsnGSLPVPGVDD----YA 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  616 QFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEplEEQDEAAeagrkqFARHEWAQKAAYLLGCSLEELSSAI 695
Cdd:cd14911    236 EFQATVKSMNIMGMTSEDFNSIFRIVSAVLLFGSMKFRQE--RNNDQAT------LPDNTVAQKIAHLLGLSVTDMTRAF 307
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  696 FKHQLKGGtlqrstsfRQGPEESGLGEGTKLSalecLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQ 774
Cdd:cd14911    308 LTPRIKVG--------RDFVTKAQTKEQVEFA----VEAIAKACYERMFKWLVNRINRSLDRTKRQGASFIgILDMAGFE 375
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  775 NPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFD----DLEPVADdsvaavdqashLVrslah 850
Cdd:cd14911    376 IFELN------SFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWKFIdfglDLQPTID-----------LI----- 433
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  851 aDEARGLLWLLEEEALVPGATEDALLDRLFSYYgpqegdkkGQSPLLRSSKPR---HFLLGHSHGTnwVEYNVAGWLnYT 927
Cdd:cd14911    434 -DKPGGIMALLDEECWFPKATDKTFVDKLVSAH--------SMHPKFMKTDFRgvaDFAIVHYAGR--VDYSAAKWL-MK 501
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  928 KQNPATQNAPRLLQDSQKKIISNLFlgraGSATVLSGSIAGLeGGSQLALRRATSMRKTFTTgmaavkkkslciQIKLQV 1007
Cdd:cd14911    502 NMDPLNENIVSLLQGSQDPFVVNIW----KDAEIVGMAQQAL-TDTQFGARTRKGMFRTVSH------------LYKEQL 564
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1008 DALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYR 1087
Cdd:cd14911    565 AKLMDTLRNTNPNFVRCIIPNHEKRAG-----------------------------KIDAPLVLDQLRCNGVLEGIRICR 615
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1088 QGYPDHMVFSEFRRRFDVLAPHLTKKHgrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14911    616 QGFPNRIPFQEFRQRYELLTPNVIPKG-----FMDGKKACEKMIQALELDSNLYRVGQSKIFFR 674
MYSc_Myh18 cd14932
class II myosin heavy chain 18, motor domain; Myosin motor domain of muscle myosin heavy chain ...
394-1151 1.50e-81

class II myosin heavy chain 18, motor domain; Myosin motor domain of muscle myosin heavy chain 18. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276895 [Multi-domain]  Cd Length: 676  Bit Score: 284.61  E-value: 1.50e-81
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14932      1 ASVLHNLKERYYSGLIYTYSGLFCVVINPYKYLPIYSEEIVNMYKGKKRHEMPPHIYAITDTAYRSMMQDREDQSILCTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGTSGTK------VFS---VEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQA 542
Cdd:cd14932     81 ESGAGKTENTKKVIQYLAYVASSFKTKkdqssiALShgeLEK-QLLQAnpILEAFGNAKTVKNDNSSRFGKFIRINFDVN 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  543 GQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKaaQQFSKLQA 622
Cdd:cd14932    160 GYIVGANIETYLLEKSRAIRQAKDERAFHIFYYLLTGAGDKLRSELCLEDYSKYRFLSNGNVTIPGQQDK--ELFAETME 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  623 AMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQ----DEAAeagrkqfarhewAQKAAYLLGCSLEELSSAIFKH 698
Cdd:cd14932    238 AFRIMSIPEEEQTGLLKVVSAVLQLGNMSFKKERNSDQasmpDDTA------------AQKVCHLLGMNVTDFTRAILSP 305
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  699 QLKGGtlqrsTSFRQGPEesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQNPE 777
Cdd:cd14932    306 RIKVG-----RDYVQKAQ-------TQEQAEFAVEALAKASYERMFRWLVMRINKALDKTKRQGASFIgILDIAGFEIFE 373
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  778 WGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASHLVRSLAHADEargl 857
Cdd:cd14932    374 LN------SFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWSFIDFGLDLQPCIELIEKPNGPPGILALLDE---- 443
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  858 lwlleeEALVPGATEDALLDRLFSYYGpqeGDKKGQSPlLRSSKPRHFLLGHSHGTnwVEYNVAGWLnYTKQNPATQNAP 937
Cdd:cd14932    444 ------ECWFPKATDKSFVEKVVQEQG---NNPKFQKP-KKLKDDADFCIIHYAGK--VDYKANEWL-MKNMDPLNENVA 510
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  938 RLLQDSQKKIISNLFlgRAGSATVLSGSIAGLEGGSQLALRRATSMRKTFTTgmaavkkkslciQIKLQVDALIDTIKRS 1017
Cdd:cd14932    511 TLLNQSTDKFVSELW--KDVDRIVGLDKVAGMGESLHGAFKTRKGMFRTVGQ------------LYKEQLMNLMTTLRNT 576
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1018 KMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFS 1097
Cdd:cd14932    577 NPNFVRCIIPNHEKKAG-----------------------------KLAHHLVLDQLRCNGVLEGIRICRQGFPNRIVFQ 627
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1098 EFRRRFDVLAPHLTKKHgrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14932    628 EFRQRYEILTPNAIPKG-----FMDGKQACVLMVKALELDPNLYRIGQSKVFFR 676
MYSc_Myh15_mammals cd14929
class II myosin heavy chain 15, motor domain; Myosin motor domain of sarcomeric myosin heavy ...
394-1151 1.01e-80

class II myosin heavy chain 15, motor domain; Myosin motor domain of sarcomeric myosin heavy chain 15 in mammals (also called KIAA1000) . MYH15 is a slow-twitch myosin. Myh15 is a ventricular myosin heavy chain. Myh15 is absent in embryonic and fetal muscles and is found in orbital layer of extraocular muscles at birth. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276892 [Multi-domain]  Cd Length: 662  Bit Score: 281.86  E-value: 1.01e-80
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14929      1 ASVLHTLRRRYDHWMIYTYSGLFCVTINPYKWLPVYQKEVMAAYKGKRRSEAPPHIFAVANNAFQDMLHNRENQSILFTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGTSGTKvfsvEKWQAL-------STLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVA 546
Cdd:cd14929     81 ESGAGKTVNTKHIIQYFATIAAMIESK----KKLGALedqimqaNPVLEAFGNAKTLRNDNSSRFGKFIRMHFGARGMLS 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  547 SASIQTMLLEKLRVARRPASEATFNVFYYLLAcGDATLRTELHLN------HLAENNVFGIvplskpeEKQKAAQQFSKL 620
Cdd:cd14929    157 SADIDIYLLEKSRVIFQQPGERNYHIFYQILS-GKKELRDLLLVSanpsdfHFCSCGAVAV-------ESLDDAEELLAT 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  621 QAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAgrkqfarhEWAQKAAYLLGCSLEELSSAIFKHQL 700
Cdd:cd14929    229 EQAMDILGFLPDEKYGCYKLTGAIMHFGNMKFKQKPREEQLEADGT--------ENADKAAFLMGINSSELVKGLIHPRI 300
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  701 KGGT--LQRSTSFRQGPEESGlgegtklsalecleGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEW 778
Cdd:cd14929    301 KVGNeyVTRSQNIEQVTYAVG--------------ALSKSIYERMFKWLVARINRVLDAKLSRQFFIGILDITGFEILDY 366
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  779 GgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELafddlepvaddsvAAVDQASHLVRSLAHADEARGLL 858
Cdd:cd14929    367 N------SLEQLCINFTNEKLQQFFNQHMFVLEQEEYRKEGIDW-------------VSIDFGLDLQACIDLIEKPMGIF 427
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  859 WLLEEEALVPGATEDALLDRLFSYYgpqegdkKGQSPLLRSSKPR------HFLLGHSHGTnwVEYNVAGWLNYTKqNPA 932
Cdd:cd14929    428 SILEEECMFPKATDLTFKTKLFDNH-------FGKSVHFQKPKPDkkkfeaHFELVHYAGV--VPYNISGWLEKNK-DLL 497
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  933 TQNAPRLLQDSQKKIISNLFlgragSATVLSGSiaGLEGGSQlALRRATSMRKtfttgMAAVKKKSLciqiklqvDALID 1012
Cdd:cd14929    498 NETVVAVFQKSSNRLLASLF-----ENYISTDS--AIQFGEK-KRKKGASFQT-----VASLHKENL--------NKLMT 556
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1013 TIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPD 1092
Cdd:cd14929    557 NLKSTAPHFVRCINPNVNKIPG-----------------------------VLDPYLVLQQLRCNGVLEGIRICREGFPN 607
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082233 1093 HMVFSEFRRRFDVLAPHLTKKHGrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14929    608 RLLYADFKQRYCILNPRTFPKSK----FVSSRKAAEELLGSLEIDHTQYRFGITKVFFK 662
MYSc_Myh9 cd14919
class II myosin heavy chain 9, motor domain; Myosin motor domain of non-muscle myosin heavy ...
394-1151 1.91e-79

class II myosin heavy chain 9, motor domain; Myosin motor domain of non-muscle myosin heavy chain 9 (also called NMMHCA, NMHC-II-A, MHA, FTNS, EPSTS, and DFNA17). Myosin is a hexameric protein composed of a pair of myosin heavy chains (MYH) and two pairs of nonidentical light chains. The encoded protein is a myosin IIA heavy chain that contains an IQ domain and a myosin head-like domain which is involved in several important functions, including cytokinesis, cell motility and maintenance of cell shape. Defects in this gene have been associated with non-syndromic sensorineural deafness autosomal dominant type 17, Epstein syndrome, Alport syndrome with macrothrombocytopenia, Sebastian syndrome, Fechtner syndrome and macrothrombocytopenia with progressive sensorineural deafness. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276883 [Multi-domain]  Cd Length: 670  Bit Score: 278.13  E-value: 1.91e-79
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14919      1 ASVLHNLKERYYSGLIYTYSGLFCVVINPYKNLPIYSEEIVEMYKGKKRHEMPPHIYAITDTAYRSMMQDREDQSILCTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQAL---STLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASI 550
Cdd:cd14919     81 ESGAGKTENTKKVIQYLAHVASSHKSKKDQGELERQLlqaNPILEAFGNAKTVKNDNSSRFGKFIRINFDVNGYIVGANI 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  551 QTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKaaQQFSKLQAAMKVLAIS 630
Cdd:cd14919    161 ETYLLEKSRAIRQAKEERTFHIFYYLLSGAGEHLKTDLLLEPYNKYRFLSNGHVTIPGQQDK--DMFQETMEAMRIMGIP 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  631 PEEQKTCWLILASIYHLGAAGATKEplEEQDEAAeagrkqFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGtlqrsTS 710
Cdd:cd14919    239 EEEQMGLLRVISGVLQLGNIVFKKE--RNTDQAS------MPDNTAAQKVSHLLGINVTDFTRGILTPRIKVG-----RD 305
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  711 FRQGPEesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQNPEWGgsargaSFEE 789
Cdd:cd14919    306 YVQKAQ-------TKEQADFAIEALAKATYERMFRWLVLRINKALDKTKRQGASFIgILDIAGFEIFDLN------SFEQ 372
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  790 LCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASHLVRSLAHADEargllwlleeEALVPG 869
Cdd:cd14919    373 LCINYTNEKLQQLFNHTMFILEQEEYQREGIEWNFIDFGLDLQPCIDLIEKPAGPPGILALLDE----------ECWFPK 442
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  870 ATEDALLDRLFSYYGPQegdKKGQSPLLRSSKPrHFLLGHSHGTnwVEYNVAGWLnYTKQNPATQNAPRLLQDSQKKIIS 949
Cdd:cd14919    443 ATDKSFVEKVVQEQGTH---PKFQKPKQLKDKA-DFCIIHYAGK--VDYKADEWL-MKNMDPLNDNIATLLHQSSDKFVS 515
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  950 NLFlgRAGSATVLSGSIAGLeggSQLALRRATSMRKTFTTGMAAVKKKslciqiklQVDALIDTIKRSKMHFVHCFLPVA 1029
Cdd:cd14919    516 ELW--KDVDRIIGLDQVAGM---SETALPGAFKTRKGMFRTVGQLYKE--------QLAKLMATLRNTNPNFVRCIIPNH 582
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1030 EGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPH 1109
Cdd:cd14919    583 EKKAG-----------------------------KLDPHLVLDQLRCNGVLEGIRICRQGFPNRVVFQEFRQRYEILTPN 633
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|..
gi 1907082233 1110 LTKKHgrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14919    634 SIPKG-----FMDGKQACVLMIKALELDSNLYRIGQSKVFFR 670
MYSc_Myh19 cd15896
class II myosin heavy chain19, motor domain; Myosin motor domain of muscle myosin heavy chain ...
394-1151 8.75e-76

class II myosin heavy chain19, motor domain; Myosin motor domain of muscle myosin heavy chain 19. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276899 [Multi-domain]  Cd Length: 675  Bit Score: 267.70  E-value: 8.75e-76
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd15896      1 ASVLHNLKERYYSGLIYTYSGLFCVVINPYKNLPIYSEEIVEMYKGKKRHEMPPHIYAITDTAYRSMMQDREDQSILCTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQALSTL----------LEAFGNSPTIMNGSATRFSQILSLDFDQAG 543
Cdd:cd15896     81 ESGAGKTENTKKVIQYLAHVASSHKTKKDQNSLALSHGELekqllqanpiLEAFGNAKTVKNDNSSRFGKFIRINFDVNG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  544 QVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKaaQQFSKLQAA 623
Cdd:cd15896    161 YIVGANIETYLLEKSRAIRQAKEERTFHIFYYLLTGAGDKLRSELLLENYNNYRFLSNGNVTIPGQQDK--DLFTETMEA 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  624 MKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEagrkqfarHEWAQKAAYLLGCSLEELSSAIFKHQLKGG 703
Cdd:cd15896    239 FRIMGIPEDEQIGMLKVVASVLQLGNMSFKKERHTDQASMPD--------NTAAQKVCHLMGMNVTDFTRAILSPRIKVG 310
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  704 T--LQRSTSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQNPEWGg 780
Cdd:cd15896    311 RdyVQKAQTQEQ--------------AEFAVEALAKATYERMFRWLVMRINKALDKTKRQGASFIgILDIAGFEIFELN- 375
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  781 sargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASHLVRSLAHADEargllwl 860
Cdd:cd15896    376 -----SFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWSFIDFGLDLQPCIDLIEKPASPPGILALLDE------- 443
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  861 leeEALVPGATEDALLDRLFSYYGPQEGDKKGQspllRSSKPRHFLLGHSHGTnwVEYNVAGWLnYTKQNPATQNAPRLL 940
Cdd:cd15896    444 ---ECWFPKATDKSFVEKVLQEQGTHPKFFKPK----KLKDEADFCIIHYAGK--VDYKADEWL-MKNMDPLNDNVATLL 513
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  941 QDSQKKIISNLFLGragsatvlSGSIAGLEGGSQLA-LRRATSMRKTFTTGMAAVKKKslciqiklQVDALIDTIKRSKM 1019
Cdd:cd15896    514 NQSTDKFVSELWKD--------VDRIVGLDKVSGMSeMPGAFKTRKGMFRTVGQLYKE--------QLSKLMATLRNTNP 577
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1020 HFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEF 1099
Cdd:cd15896    578 NFVRCIIPNHEKKAG-----------------------------KLDPHLVLDQLRCNGVLEGIRICRQGFPNRIVFQEF 628
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082233 1100 RRRFDVLAPHLTKKHgrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd15896    629 RQRYEILTPNAIPKG-----FMDGKQACVLMIKSLELDPNLYRIGQSKVFFR 675
MYSc_Myh7b cd14927
class II myosin heavy chain 7b, motor domain; Myosin motor domain of cardiac muscle, beta ...
394-1151 8.17e-74

class II myosin heavy chain 7b, motor domain; Myosin motor domain of cardiac muscle, beta myosin heavy chain 7b (also called KIAA1512, dJ756N5.1, MYH14, MHC14). MYH7B is a slow-twitch myosin. Mutations in this gene result in one form of autosomal dominant hearing impairment. Multiple transcript variants encoding different isoforms have been found for this gene. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276953 [Multi-domain]  Cd Length: 676  Bit Score: 261.81  E-value: 8.17e-74
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14927      1 ASVLHNLRRRYSRWMIYTYSGLFCVTVNPYKWLPVYTAPVVAAYKGKRRSEAPPHIYAIADNAYNDMLRNRENQSMLITG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIA--GTSGTKVFSVEKWQALSTL----------LEAFGNSPTIMNGSATRFSQILSLDFDQ 541
Cdd:cd14927     81 ESGAGKTVNTKRVIQYFAIVAalGDGPGKKAQFLATKTGGTLedqiieanpaMEAFGNAKTLRNDNSSRFGKFIRIHFGP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  542 AGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLN------HLAENNVFGIVPLSKPEEkqkaaq 615
Cdd:cd14927    161 TGKLASADIDIYLLEKSRVIFQQPGERSYHIYYQILSGKKPELQDMLLVSmnpydyHFCSQGVTTVDNMDDGEE------ 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  616 qFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAgrkqfarhEWAQKAAYLLGCSLEELSSAI 695
Cdd:cd14927    235 -LMATDHAMDILGFSPDEKYGCYKIVGAIMHFGNMKFKQKQREEQAEADGT--------ESADKAAYLMGVSSADLLKGL 305
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  696 FKHQLKGGtlQRSTSFRQGPEESGLGEGtklsalecleGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQN 775
Cdd:cd14927    306 LHPRVKVG--NEYVTKGQSVEQVVYAVG----------ALAKATYDRMFKWLVSRINQTLDTKLPRQFFIGVLDIAGFEI 373
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  776 PEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLepvADDSVAAVDQASHLVRSLAHADEar 855
Cdd:cd14927    374 FEFN------SFEQLCINFTNEKLQQFFNHHMFILEQEEYKREGIEWVFIDF---GLDLQACIDLIEKPLGILSILEE-- 442
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  856 gllwlleeEALVPGATEDALLDRLFSYYgpqegdkKGQSPLLRSSKP-------RHFLLGHSHGTnwVEYNVAGWLNYTK 928
Cdd:cd14927    443 --------ECMFPKASDASFKAKLYDNH-------LGKSPNFQKPRPdkkrkyeAHFEVVHYAGV--VPYNIVGWLDKNK 505
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  929 qNPATQNAPRLLQDSQKKIISNLFLGRAGSAtvlsgSIAGLEGGSQLALRRATSmrktFTTgMAAVKKKSLciqiklqvD 1008
Cdd:cd14927    506 -DPLNETVVAIFQKSQNKLLATLYENYVGSD-----STEDPKSGVKEKRKKAAS----FQT-VSQLHKENL--------N 566
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1009 ALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQ 1088
Cdd:cd14927    567 KLMTNLRATQPHFVRCIIPNETKTPG-----------------------------VMDPFLVLHQLRCNGVLEGIRICRK 617
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082233 1089 GYPDHMVFSEFRRRFDVLAPHLTKKHGrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14927    618 GFPNRILYADFKQRYRILNPSAIPDDK----FVDSRKATEKLLGSLDIDHTQYQFGHTKVFFK 676
MYSc_Myh1_insects_crustaceans cd14909
class II myosin heavy chain 1, motor domain; Myosin motor domain of type IIx skeletal muscle ...
394-1151 2.24e-71

class II myosin heavy chain 1, motor domain; Myosin motor domain of type IIx skeletal muscle myosin heavy chain 1 (also called MYHSA1, MYHa, MyHC-2X/D, MGC133384) in insects and crustaceans. Myh1 is a type I skeletal muscle myosin that in Humans is encoded by the MYH1 gene. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276874  Cd Length: 666  Bit Score: 254.38  E-value: 2.24e-71
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14909      1 ASVLHNLRQRYYAKLIYTYSGLFCVAINPYKRYPVYTNRCAKMYRGKRRNEVPPHIFAISDGAYVDMLTNHVNQSMLITG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKW----QALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVAS 547
Cdd:cd14909     81 ESGAGKTENTKKVIAYFATVGASKKTDEAAKSKGsledQVVQTnpVLEAFGNAKTVRNDNSSRFGKFIRIHFGPTGKLAG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  548 ASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNhlaeNNVFGIVPLSKPE---EKQKAAQQFSKLQAAM 624
Cdd:cd14909    161 ADIETYLLEKARVISQQSLERSYHIFYQIMSGSVPGVKEMCLLS----DNIYDYYIVSQGKvtvPNVDDGEEFSLTDQAF 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  625 KVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAgrkqfarhEWAQKAAYLLGCSLEELSSAIFKHQLKGGT 704
Cdd:cd14909    237 DILGFTKQEKEDVYRITAAVMHMGGMKFKQRGREEQAEQDGE--------EEGGRVSKLFGCDTAELYKNLLKPRIKVGN 308
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  705 lQRSTSFRQgpeesglgegtKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGGsarg 784
Cdd:cd14909    309 -EFVTQGRN-----------VQQVTNSIGALCKGVFDRLFKWLVKKCNETLDTQQKRQHFIGVLDIAGFEIFEYNG---- 372
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  785 asFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLepvADDSVAAVDQASHLVRSLAHADEargllwlleeE 864
Cdd:cd14909    373 --FEQLCINFTNEKLQQFFNHHMFVLEQEEYKREGIDWAFIDF---GMDLLACIDLIEKPMGILSILEE----------E 437
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  865 ALVPGATEDALLDRLFSyygpqegDKKGQSPLLRSSKP-------RHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAP 937
Cdd:cd14909    438 SMFPKATDQTFSEKLTN-------THLGKSAPFQKPKPpkpgqqaAHFAIAHYAGC--VSYNITGWLEKNK-DPLNDTVV 507
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  938 RLLQDSQKKIISNLFLGRAGSatvlSGSIAGLEGGsqlalrratsmRKTFTTGMAAVKKkslciQIKLQVDALIDTIKRS 1017
Cdd:cd14909    508 DQFKKSQNKLLIEIFADHAGQ----SGGGEQAKGG-----------RGKKGGGFATVSS-----AYKEQLNSLMTTLRST 567
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1018 KMHFVHCFLPvaegwpgeprsassrrvsssSELDLPpgdpceaGLlqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFS 1097
Cdd:cd14909    568 QPHFVRCIIP--------------------NEMKQP-------GV--VDAHLVMHQLTCNGVLEGIRICRKGFPNRMMYP 618
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1098 EFRRRFDVLAPHLTKKHgrnyivVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14909    619 DFKMRYKILNPAGIQGE------EDPKKAAEIILESIALDPDQYRLGHTKVFFR 666
MYSc_Myh3 cd14913
class II myosin heavy chain 3, motor domain; Myosin motor domain of fetal skeletal muscle ...
395-1151 4.48e-71

class II myosin heavy chain 3, motor domain; Myosin motor domain of fetal skeletal muscle myosin heavy chain 3 (MYHC-EMB, MYHSE1, HEMHC, SMHCE) in tetrapods including mammals, lizards, and frogs. This gene is a member of the MYH family and encodes a protein with an IQ domain and a myosin head-like domain. Mutations in this gene have been associated with two congenital contracture (arthrogryposis) syndromes, Freeman-Sheldon syndrome and Sheldon-Hall syndrome. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276878 [Multi-domain]  Cd Length: 668  Bit Score: 253.43  E-value: 4.48e-71
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 474
Cdd:cd14913      2 AVLYNLKDRYTSWMIYTYSGLFCVTVNPYKWLPVYNPEVVEGYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  475 SGSGKTTSFQHLVQYLATIAGT------SGTKVFSVEKWQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVA 546
Cdd:cd14913     82 SGAGKTVNTKRVIQYFATIAATgdlakkKDSKMKGTLEDQIISAnpLLEAFGNAKTVRNDNSSRFGKFIRIHFGTTGKLA 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  547 SASIQTMLLEKLRVARRPASEATFNVFYYLLAcgdaTLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQA---A 623
Cdd:cd14913    162 SADIETYLLEKSRVTFQLKAERSYHIFYQILS----NKKPELIELLLITTNPYDYPFISQGEILVASIDDAEELLAtdsA 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  624 MKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAgrkqfarhEWAQKAAYLLGCSLEELSSAIFKHQLKGG 703
Cdd:cd14913    238 IDILGFTPEEKSGLYKLTGAVMHYGNMKFKQKQREEQAEPDGT--------EVADKTAYLMGLNSSDLLKALCFPRVKVG 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  704 TlqrstsfrqgpEESGLGEgTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsar 783
Cdd:cd14913    310 N-----------EYVTKGQ-TVDQVHHAVNALSKSVYEKLFLWMVTRINQQLDTKLPRQHFIGVLDIAGFEIFEYN---- 373
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  784 gaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFddlepvaddsvaaVDQASHLVRSLAHADEARGLLWLLEE 863
Cdd:cd14913    374 --SLEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWTF-------------IDFGMDLAACIELIEKPMGIFSILEE 438
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  864 EALVPGATEDALLDRLFSYYGPQEGDKkgQSPLLRSSKPR-HFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQD 942
Cdd:cd14913    439 ECMFPKATDTSFKNKLYDQHLGKSNNF--QKPKVVKGRAEaHFSLIHYAGT--VDYSVSGWLEKNK-DPLNETVVGLYQK 513
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  943 SQKKIISNLFLGRAGS-ATVLSGSIAGLEGGSqlalrratsmrktFTTgMAAVKKKSLciqiklqvDALIDTIKRSKMHF 1021
Cdd:cd14913    514 SSNRLLAHLYATFATAdADSGKKKVAKKKGSS-------------FQT-VSALFRENL--------NKLMSNLRTTHPHF 571
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1022 VHCFLPVAEGWPGEprsassrrvsssseldlppgdpceagllqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRR 1101
Cdd:cd14913    572 VRCIIPNETKTPGA-----------------------------MEHSLVLHQLRCNGVLEGIRICRKGFPNRILYGDFKQ 622
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1102 RFDVLAPHLTKKhGRnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14913    623 RYRVLNASAIPE-GQ---FIDSKKACEKLLASIDIDHTQYKFGHTKVFFK 668
MYSc_Myh14_mammals cd14930
class II myosin heavy chain 14 motor domain; Myosin motor domain of non-muscle myosin heavy ...
394-1151 1.10e-69

class II myosin heavy chain 14 motor domain; Myosin motor domain of non-muscle myosin heavy chain 14 (also called FLJ13881, KIAA2034, MHC16, MYH17). Its members include mammals, chickens, and turtles. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. Some of the data used for this classification were produced by the CyMoBase team at the Max-Planck-Institute for Biophysical Chemistry. The sequence names are composed of the species abbreviation followed by the protein abbreviation and optional protein classifier and variant designations.


Pssm-ID: 276893 [Multi-domain]  Cd Length: 670  Bit Score: 249.24  E-value: 1.10e-69
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14930      1 ASVLHNLRERYYSGLIYTYSGLFCVVINPYKQLPIYTEAIVEMYRGKKRHEVPPHVYAVTEGAYRSMLQDREDQSILCTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGT-SGTKVFSV-----EKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVAS 547
Cdd:cd14930     81 ESGAGKTENTKKVIQYLAHVASSpKGRKEPGVpgeleRQLLQANPILEAFGNAKTVKNDNSSRFGKFIRINFDVAGYIVG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  548 ASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQkaaQQFSKLQAAMKVL 627
Cdd:cd14930    161 ANIETYLLEKSRAIRQAKDECSFHIFYQLLGGAGEQLKADLLLEPCSHYRFLTNGPSSSPGQER---ELFQETLESLRVL 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  628 AISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEagrkqfarHEWAQKAAYLLGCSLEELSSAIFKHQLKGGtlqr 707
Cdd:cd14930    238 GFSHEEITSMLRMVSAVLQFGNIVLKRERNTDQATMPD--------NTAAQKLCRLLGLGVTDFSRALLTPRIKVG---- 305
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  708 sTSFRQGPEesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQNPEWGgsargaS 786
Cdd:cd14930    306 -RDYVQKAQ-------TKEQADFALEALAKATYERLFRWLVLRLNRALDRSPRQGASFLgILDIAGFEIFQLN------S 371
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  787 FEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASHLVRSLAHADEargllwlleeEAL 866
Cdd:cd14930    372 FEQLCINYTNEKLQQLFNHTMFVLEQEEYQREGIPWTFLDFGLDLQPCIDLIERPANPPGLLALLDE----------ECW 441
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  867 VPGATEDALLDRLfsyygpqeGDKKGQSPllRSSKPRH------FLLGHSHGTnwVEYNVAGWLnYTKQNPATQNAPRLL 940
Cdd:cd14930    442 FPKATDKSFVEKV--------AQEQGGHP--KFQRPRHlrdqadFSVLHYAGK--VDYKANEWL-MKNMDPLNDNVAALL 508
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  941 QDSQKKIISNLFLGRAGsatvlsgsIAGLEGGSQL--ALRRATSMRKTFTTgMAAVKKKSLciqiklqvDALIDTIKRSK 1018
Cdd:cd14930    509 HQSTDRLTAEIWKDVEG--------IVGLEQVSSLgdGPPGGRPRRGMFRT-VGQLYKESL--------SRLMATLSNTN 571
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1019 MHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSE 1098
Cdd:cd14930    572 PSFVRCIVPNHEKRAG-----------------------------KLEPRLVLDQLRCNGVLEGIRICRQGFPNRILFQE 622
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907082233 1099 FRRRFDVLAPHLTKKHgrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14930    623 FRQRYEILTPNAIPKG-----FMDGKQACEKMIQALELDPNLYRVGQSKIFFR 670
MYSc_Myo8 cd01383
class VIII myosin, motor domain; These plant-specific type VIII myosins has been associated ...
395-1108 1.58e-68

class VIII myosin, motor domain; These plant-specific type VIII myosins has been associated with endocytosis, cytokinesis, cell-to-cell coupling and gating at plasmodesmata. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. It also contains IQ domains Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276834  Cd Length: 647  Bit Score: 245.30  E-value: 1.58e-68
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKgcRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 474
Cdd:cd01383      2 SVLHNLEYRYSQDIIYTKAGPVLIAVNPFKDVPLYGNEFITAYR--QKLLDSPHVYAVADTAYREMMRDEINQSIIISGE 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  475 SGSGKTTSFQHLVQYLATIAGTSGtkvfSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQT 552
Cdd:cd01383     80 SGAGKTETAKIAMQYLAALGGGSS----GIEN-EILQTnpILEAFGNAKTLRNDNSSRFGKLIDIHFDAAGKICGAKIQT 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  553 MLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL------NHLAENNVFGIVPLSKpeekqkaAQQFSKLQAAMKV 626
Cdd:cd01383    155 YLLEKSRVVQLANGERSYHIFYQLCAGASPALREKLNLksaseyKYLNQSNCLTIDGVDD-------AKKFHELKEALDT 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  627 LAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAgrkqfarhEWAQKAAYLLGCSLEELSSAIFKHQLKGGTLQ 706
Cdd:cd01383    228 VGISKEDQEHIFQMLAAVLWLGNISFQVIDNENHVEVVAD--------EAVSTAASLLGCNANDLMLALSTRKIQAGGDK 299
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  707 RSTSFrqgpeesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQH-SLCSMMIVDTPGFQnpewggSARGA 785
Cdd:cd01383    300 IVKKL------------TLQQAIDARDALAKAIYASLFDWLVEQINKSLEVGKRrTGRSISILDIYGFE------SFQKN 361
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  786 SFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLEpvadDSVAAVDqashLVRS-----LAHADEargllwl 860
Cdd:cd01383    362 SFEQLCINYANERLQQHFNRHLFKLEQEEYELDGIDWTKVDFE----DNQECLD----LIEKkplglISLLDE------- 426
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  861 leeEALVPGATEDALLDRLFSYYGPQEGDKKGQSpllrsskpRHFLLGHSHGTnwVEYNVAGWLNytkqnpatQNAPRLL 940
Cdd:cd01383    427 ---ESNFPKATDLTFANKLKQHLKSNSCFKGERG--------GAFTIRHYAGE--VTYDTSGFLE--------KNRDLLH 485
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  941 QDsqkkiISNLFLGRAGSATVLSGSIAGLEGGSQLALRRATSMRKTfttgmaavkKKSLCIQIKLQVDALIDTIKRSKMH 1020
Cdd:cd01383    486 SD-----LIQLLSSCSCQLPQLFASKMLDASRKALPLTKASGSDSQ---------KQSVATKFKGQLFKLMQRLENTTPH 551
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1021 FVHCFLPVAEGWPGEprsassrrvsssseldlppgdpceagllqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFR 1100
Cdd:cd01383    552 FIRCIKPNNKQLPGV-----------------------------FDQDLVLQQLRCCGVLEVVRISRSGYPTRMTHQEFA 602

                   ....*...
gi 1907082233 1101 RRFDVLAP 1108
Cdd:cd01383    603 RRYGFLLP 610
MYSc_Myh16 cd14934
class II myosin heavy chain 16, motor domain; Myosin motor domain of myosin heavy chain 16 ...
394-1151 1.37e-67

class II myosin heavy chain 16, motor domain; Myosin motor domain of myosin heavy chain 16 pseudogene (also called MHC20, MYH16, and myh5), encoding a sarcomeric myosin heavy chain expressed in nonhuman primate masticatory muscles, is inactivated in humans. This cd contains Myh16 in mammals. MYH16 has intermediate fibres between that of slow type 1 and fast 2B fibres, but exert more force than any other fibre type examined. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. Some of the data used for this classification were produced by the CyMoBase team at the Max-Planck-Institute for Biophysical Chemistry. The sequence names are composed of the species abbreviation followed by the protein abbreviation and optional protein classifier and variant designations.


Pssm-ID: 276896 [Multi-domain]  Cd Length: 659  Bit Score: 243.01  E-value: 1.37e-67
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14934      1 ASVLDNLRQRYTNMRIYTYSGLFCVTVNPYKWLPIYGARVANMYKGKKRTEMPPHLFSISDNAYHDMLMDRENQSMLITG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGTS-----GTKVFSVEKWQAlSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASA 548
Cdd:cd14934     81 ESGAGKTENTKKVIQYFANIGGTGkqssdGKGSLEDQIIQA-NPVLEAFGNAKTTRNNNSSRFGKFIRIHFGTTGKLAGA 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  549 SIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL------NHLAENNVFGIVPLSKPEEKQKAaqqfsklQA 622
Cdd:cd14934    160 DIESYLLEKSRVISQQAAERGYHIFYQILSNKKPELIESLLLvpnpkeYHWVSQGVTVVDNMDDGEELQIT-------DV 232
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  623 AMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAgrkqfarhEWAQKAAYLLGCSLEELSSAIFKHQLKG 702
Cdd:cd14934    233 AFDVLGFSAEEKIGVYKLTGGIMHFGNMKFKQKPREEQAEVDTT--------EVADKVAHLMGLNSGELQKGITRPRVKV 304
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  703 GT--LQRSTSFRQGPEESGlgegtklsalecleGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGg 780
Cdd:cd14934    305 GNefVQKGQNMEQCNNSIG--------------ALGKAVYDKMFKWLVVRINKTLDTKMQRQFFIGVLDIAGFEIFEFN- 369
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  781 sargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLepvADDSVAAVDQashlvrslahADEARGLLWL 860
Cdd:cd14934    370 -----SFEQLCINFTNEKLQQFFNHHMFVLEQEEYKREGIEWVFIDF---GLDLQACIDL----------LEKPMGIFSI 431
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  861 LEEEALVPGATE--------DALLDRLFSYYGPQEGDKKGQSPllrsskprHFLLGHSHGTnwVEYNVAGWLNYTKqNPA 932
Cdd:cd14934    432 LEEQCVFPKATDatfkaalyDNHLGKSSNFLKPKGGKGKGPEA--------HFELVHYAGT--VGYNITGWLEKNK-DPL 500
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  933 TQNAPRLLQDSQkKIISNLFLGRAGSAtvlsgsiagleGGSQLALRRATSMrktfttgmaavkkkSLCIQIKLQVDALID 1012
Cdd:cd14934    501 NETVVGLFQKSS-LGLLALLFKEEEAP-----------AGSKKQKRGSSFM--------------TVSNFYREQLNKLMT 554
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1013 TIKRSKMHFVHCFLpvaegwpgeprsassrrvsssseldlpPGDPCEAGLlqLDVSLLRAQLRGSRLLDAMRMYRQGYPD 1092
Cdd:cd14934    555 TLHSTAPHFVRCIV---------------------------PNEFKQSGV--VDAHLIMHQLACNGVLEGIRICRKGFPN 605
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082233 1093 HMVFSEFRRRFDVLAPHLTKKHgrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14934    606 RLQYPEFKQRYQVLNPNVIPQG-----FVDNKKASELLLGSIDLDVNEYKIGHTKVFFR 659
MYSc_Myh7 cd14917
class II myosin heavy chain 7, motor domain; Myosin motor domain of beta (or slow) type I ...
395-1151 4.40e-67

class II myosin heavy chain 7, motor domain; Myosin motor domain of beta (or slow) type I cardiac muscle myosin heavy chain 7 (also called CMH1, MPD1, and CMD1S). Muscle myosin is a hexameric protein containing 2 heavy chain subunits, 2 alkali light chain subunits, and 2 regulatory light chain subunits. It is expressed predominantly in normal human ventrical and in skeletal muscle tissues rich in slow-twitch type I muscle fibers. Changes in the relative abundance of this protein and the alpha (or fast) heavy subunit of cardiac myosin correlate with the contractile velocity of cardiac muscle. Its expression is also altered during thyroid hormone depletion and hemodynamic overloading. Mutations in this gene are associated with familial hypertrophic cardiomyopathy, myosin storage myopathy, dilated cardiomyopathy, and Laing early-onset distal myopathy. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276881 [Multi-domain]  Cd Length: 668  Bit Score: 241.55  E-value: 4.40e-67
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 474
Cdd:cd14917      2 AVLYNLKERYASWMIYTYSGLFCVTVNPYKWLPVYNAEVVAAYRGKKRSEAPPHIFSISDNAYQYMLTDRENQSILITGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  475 SGSGKTTSFQHLVQYLATIAG---------TSGTKVFSVEKWQAlSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQV 545
Cdd:cd14917     82 SGAGKTVNTKRVIQYFAVIAAigdrskkdqTPGKGTLEDQIIQA-NPALEAFGNAKTVRNDNSSRFGKFIRIHFGATGKL 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  546 ASASIQTMLLEKLRVARRPASEATFNVFYYLLAcgdaTLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQA--- 622
Cdd:cd14917    161 ASADIETYLLEKSRVIFQLKAERDYHIFYQILS----NKKPELLDMLLITNNPYDYAFISQGETTVASIDDAEELMAtdn 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  623 AMKVLAISPEEQKTCWLILASIYHLGAagaTKEPLEEQDEAAEAGRKqfarhEWAQKAAYLLGCSLEELSSAIFKHQLKG 702
Cdd:cd14917    237 AFDVLGFTSEEKNSMYKLTGAIMHFGN---MKFKQKQREEQAEPDGT-----EEADKSAYLMGLNSADLLKGLCHPRVKV 308
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  703 GT--LQRSTSFRQGPEESGlgegtklsalecleGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGg 780
Cdd:cd14917    309 GNeyVTKGQNVQQVIYATG--------------ALAKAVYEKMFNWMVTRINATLETKQPRQYFIGVLDIAGFEIFDFN- 373
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  781 sargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLepvADDSVAAVDQAshlvrslahaDEARGLLWL 860
Cdd:cd14917    374 -----SFEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWTFIDF---GMDLQACIDLI----------EKPMGIMSI 435
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  861 LEEEALVPGATEDALLDRLFSYYGPQEGDKkgQSPLLRSSKPR-HFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRL 939
Cdd:cd14917    436 LEEECMFPKATDMTFKAKLFDNHLGKSNNF--QKPRNIKGKPEaHFSLIHYAGT--VDYNIIGWLQKNK-DPLNETVVGL 510
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  940 LQDSQKKIISNLFLGRAGSATVLSGSIAGLEGGSqlalrratsmrkTFTTgMAAVKKKSLciqiklqvDALIDTIKRSKM 1019
Cdd:cd14917    511 YQKSSLKLLSNLFANYAGADAPIEKGKGKAKKGS------------SFQT-VSALHRENL--------NKLMTNLRSTHP 569
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1020 HFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEF 1099
Cdd:cd14917    570 HFVRCIIPNETKSPG-----------------------------VMDNPLVMHQLRCNGVLEGIRICRKGFPNRILYGDF 620
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082233 1100 RRRFDVLAPHLTKKhGRnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14917    621 RQRYRILNPAAIPE-GQ---FIDSRKGAEKLLSSLDIDHNQYKFGHTKVFFK 668
MYSc_Myo22 cd14883
class XXII myosin, motor domain; These myosins possess an extended neck with multiple IQ ...
394-1151 2.91e-66

class XXII myosin, motor domain; These myosins possess an extended neck with multiple IQ motifs such as found in class V, VIII, XI, and XIII myosins. These myosins are defined by two tandem MyTH4 and FERM domains. The apicomplexan, but not diatom myosins contain 4-6 WD40 repeats near the end of the C-terminal tail which suggests a possible function of these myosins in signal transduction and transcriptional regulation. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276849 [Multi-domain]  Cd Length: 661  Bit Score: 239.15  E-value: 2.91e-66
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGpSLLVlstrgapAV--------YSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQ 465
Cdd:cd14883      1 EGINTNLKVRYKKDLIYTYTG-SILV-------AVnpykelpiYTQDIVKQYFGKRMGALPPHIFALAEAAYTNMQEDGK 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  466 DQSIVLLGSSGSGKTTSFQHLVQYLAtiAGTSGTkvFSVEKwQAL--STLLEAFGNSPTIMNGSATRFSQILSLDFDQAG 543
Cdd:cd14883     73 NQSVIISGESGAGKTETTKLILQYLC--AVTNNH--SWVEQ-QILeaNTILEAFGNAKTVRNDNSSRFGKFIEVCFDASG 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  544 QVASASIQTMLLEKLRVARRPASEATFNVFYYLLACG--DATLRTELHL---NHLAENNVFGIVPLSKPEEKQKaaqqFS 618
Cdd:cd14883    148 HIKGAIIQDYLLEQSRITFQAPGERNYHVFYQLLAGAkhSKELKEKLKLgepEDYHYLNQSGCIRIDNINDKKD----FD 223
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  619 KLQAAMKVLAISPEEQKTCWLILASIYHLGAagatkepLEEQDEAAEAGRKQFARHEWAQKAAYLLGCSLEELSSA-IFK 697
Cdd:cd14883    224 HLRLAMNVLGIPEEMQEGIFSVLSAILHLGN-------LTFEDIDGETGALTVEDKEILKIVAKLLGVDPDKLKKAlTIR 296
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  698 HQLKGGTLqrsTSFRQGPEEsglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPE 777
Cdd:cd14883    297 QINVRGNV---TEIPLKVQE----------ARDNRDAMAKALYSRTFAWLVNHINSCTNPGQKNSRFIGVLDIFGFENFK 363
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  778 WGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELA---FDDLEPVADdsvaAVDQASHLVRSLAhADEA 854
Cdd:cd14883    364 VN------SFEQLCINYTNEKLHKFFNHYVFKLEQEEYEKEGINWShivFTDNQECLD----LIEKPPLGILKLL-DEEC 432
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  855 RgllwlleeealVPGATEDALLDRLFSYYGPQEGDKKgqsPLLRSSKPRhFLLGHSHGTnwVEYNVAGWLNytkQNPATQ 934
Cdd:cd14883    433 R-----------FPKGTDLTYLEKLHAAHEKHPYYEK---PDRRRWKTE-FGVKHYAGE--VTYTVQGFLD---KNKDTQ 492
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  935 --NAPRLLQDSQKKIISNLFLGRAgsatvlsgsiagleggsQLALRRATSMRKTFTTGMAAVKKK-SLCIQIKLQVDALI 1011
Cdd:cd14883    493 qdDLFDLMSRSKNKFVKELFTYPD-----------------LLALTGLSISLGGDTTSRGTSKGKpTVGDTFKHQLQSLV 555
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1012 DTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYP 1091
Cdd:cd14883    556 DVLSATQPWYVRCIKPNSLKEPN-----------------------------VFDDELVLAQLRYAGMLEIIRIRKEGFP 606
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1092 DHMVFSEFRRRFDVLAPHLTKKHGRnyivvDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14883    607 IHLTFKEFVDRYLCLDPRARSADHK-----ETCGAVRALMGLGGLPEDEWQVGKTKVFLR 661
MYSc_Myo11 cd01384
class XI myosin, motor domain; These plant-specific type XI myosin are involved in organelle ...
396-1113 8.96e-65

class XI myosin, motor domain; These plant-specific type XI myosin are involved in organelle transport. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.


Pssm-ID: 276835  Cd Length: 647  Bit Score: 234.11  E-value: 8.96e-65
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  396 VLHTLRQRYGASLLHTYAGPSLL-VLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 474
Cdd:cd01384      3 VLHNLKVRYELDEIYTYTGNILIaVNPFKRLPHLYDAHMMEQYKGAPLGELSPHVFAVADAAYRAMINEGKSQSILVSGE 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  475 SGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQT 552
Cdd:cd01384     83 SGAGKTETTKMLMQYLAYMGGRAVTEGRSVEQ-QVLESnpLLEAFGNAKTVRNNNSSRFGKFVEIQFDDAGRISGAAIRT 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  553 MLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL------NHLAENNVFGIVPLSKPEEkqkaaqqFSKLQAAMKV 626
Cdd:cd01384    162 YLLERSRVVQVSDPERNYHCFYQLCAGAPPEDREKYKLkdpkqfHYLNQSKCFELDGVDDAEE-------YRATRRAMDV 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  627 LAISPEEQKTCWLILASIYHLG----AAGATKEPLEEQDEAAEagrkqfarhEWAQKAAYLLGCSLEELSSAIFKHQlkg 702
Cdd:cd01384    235 VGISEEEQDAIFRVVAAILHLGniefSKGEEDDSSVPKDEKSE---------FHLKAAAELLMCDEKALEDALCKRV--- 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  703 gtlqrstsfRQGPEESGLGEGTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQnpewggSA 782
Cdd:cd01384    303 ---------IVTPDGIITKPLDPDAATLSRDALAKTIYSRLFDWLVDKINRSIGQDPNSKRLIGVLDIYGFE------SF 367
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  783 RGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAF----------DDLEPVADDSVAAVDQASHLVRSlahad 852
Cdd:cd01384    368 KTNSFEQFCINLANEKLQQHFNQHVFKMEQEEYTKEEIDWSYiefvdnqdvlDLIEKKPGGIIALLDEACMFPRS----- 442
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  853 eargllwlleeealvpgaTEDALLDRLFSYYgpqeGDKKgqspllRSSKPR----HFLLGHSHGTnwVEYNVAGWLNYTK 928
Cdd:cd01384    443 ------------------THETFAQKLYQTL----KDHK------RFSKPKlsrtDFTIDHYAGD--VTYQTDLFLDKNK 492
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  929 QN--PATQNaprLLQDSQKKIISNLFlgragsatvlsgsiagleggSQLALRRATSMRKtFTtgmaavkkkSLCIQIKLQ 1006
Cdd:cd01384    493 DYvvAEHQA---LLNASKCPFVAGLF--------------------PPLPREGTSSSSK-FS---------SIGSRFKQQ 539
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1007 VDALIDTIKRSKMHFVHCFLPVAEgwpgeprsassrrvsssseldLPPGDPCEAGLLQldvsllraQLRGSRLLDAMRMY 1086
Cdd:cd01384    540 LQELMETLNTTEPHYIRCIKPNNL---------------------LKPGIFENANVLQ--------QLRCGGVLEAVRIS 590
                          730       740
                   ....*....|....*....|....*..
gi 1907082233 1087 RQGYPDHMVFSEFRRRFDVLAPHLTKK 1113
Cdd:cd01384    591 CAGYPTRKPFEEFLDRFGLLAPEVLKG 617
MYSc_Myh13 cd14923
class II myosin heavy chain 13, motor domain; Myosin motor domain of skeletal muscle myosin ...
395-1151 1.03e-64

class II myosin heavy chain 13, motor domain; Myosin motor domain of skeletal muscle myosin heavy chain 13 (also called MyHC-eo) in mammals, chicken, and green anole. Myh13 is a myosin whose expression is restricted primarily to the extrinsic eye muscles which are specialized for function in eye movement. Class II myosins, also called conventional myosins, are the myosin type responsible for producing muscle contraction in muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276887 [Multi-domain]  Cd Length: 671  Bit Score: 234.58  E-value: 1.03e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 474
Cdd:cd14923      2 AVLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYNPEVVAAYRGKKRQEAPPHIFSISDNAYQFMLTDRDNQSILITGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  475 SGSGKTTSFQHLVQYLATIAgTSGTKVFSVEKWQALSTL----------LEAFGNSPTIMNGSATRFSQILSLDFDQAGQ 544
Cdd:cd14923     82 SGAGKTVNTKRVIQYFATIA-VTGDKKKEQQPGKMQGTLedqiiqanplLEAFGNAKTVRNDNSSRFGKFIRIHFGATGK 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  545 VASASIQTMLLEKLRVARRPASEATFNVFYYLLAcgdaTLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQA-- 622
Cdd:cd14923    161 LASADIETYLLEKSRVTFQLSSERSYHIFYQIMS----NKKPELIDLLLISTNPFDFPFVSQGEVTVASIDDSEELLAtd 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  623 -AMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAgrkqfarhEWAQKAAYLLGCSLEELssaifkhqLK 701
Cdd:cd14923    237 nAIDILGFSSEEKVGIYKLTGAVMHYGNMKFKQKQREEQAEPDGT--------EVADKAGYLMGLNSAEM--------LK 300
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  702 GGTLQRstsFRQGPEESGLGEGTKlSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgs 781
Cdd:cd14923    301 GLCCPR---VKVGNEYVTKGQNVQ-QVTNSVGALAKAVYEKMFLWMVTRINQQLDTKQPRQYFIGVLDIAGFEIFDFN-- 374
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  782 argaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFddlepvaddsvaaVDQASHLVRSLAHADEARGLLWLL 861
Cdd:cd14923    375 ----SLEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWEF-------------IDFGMDLAACIELIEKPMGIFSIL 437
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  862 EEEALVPGATEDALLDRLFSYYGPQEGDKKGQSPLlRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQ 941
Cdd:cd14923    438 EEECMFPKATDTSFKNKLYDQHLGKSNNFQKPKPA-KGKAEAHFSLVHYAGT--VDYNIAGWLDKNK-DPLNETVVGLYQ 513
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  942 DSQKKIISNLFLGRAGSAtvlsgsiAGLEGGSQLALRRATSMRKTfttgMAAVKKKSLciqiklqvDALIDTIKRSKMHF 1021
Cdd:cd14923    514 KSSLKLLSFLFSNYAGAE-------AGDSGGSKKGGKKKGSSFQT----VSAVFRENL--------NKLMTNLRSTHPHF 574
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1022 VHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRR 1101
Cdd:cd14923    575 VRCLIPNETKTPG-----------------------------VMDHYLVMHQLRCNGVLEGIRICRKGFPSRILYADFKQ 625
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1102 RFDVLAPHLTKKHGrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14923    626 RYRILNASAIPEGQ----FIDSKNASEKLLNSIDVDREQYRFGHTKVFFK 671
MYSc_Myh1_mammals cd14910
class II myosin heavy chain 1, motor domain; Myosin motor domain of type IIx skeletal muscle ...
395-1151 2.09e-64

class II myosin heavy chain 1, motor domain; Myosin motor domain of type IIx skeletal muscle myosin heavy chain 1 (also called MYHSA1, MYHa, MyHC-2X/D, MGC133384) in mammals. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276875 [Multi-domain]  Cd Length: 671  Bit Score: 233.85  E-value: 2.09e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 474
Cdd:cd14910      2 AVLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYNAEVVTAYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  475 SGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQALSTL----------LEAFGNSPTIMNGSATRFSQILSLDFDQAGQ 544
Cdd:cd14910     82 SGAGKTVNTKRVIQYFATIAVTGEKKKEEATSGKMQGTLedqiisanplLEAFGNAKTVRNDNSSRFGKFIRIHFGTTGK 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  545 VASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELhlnhLAENNVFGIVPLSKPEEKQKAAQQFSKLQA-- 622
Cdd:cd14910    162 LASADIETYLLEKSRVTFQLKAERSYHIFYQIMSNKKPDLIEML----LITTNPYDYAFVSQGEITVPSIDDQEELMAtd 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  623 -AMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAgrkqfarhEWAQKAAYLLGCSLEELSSAIFKHQLK 701
Cdd:cd14910    238 sAIEILGFTSDERVSIYKLTGAVMHYGNMKFKQKQREEQAEPDGT--------EVADKAAYLQNLNSADLLKALCYPRVK 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  702 GGT--LQRSTSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWG 779
Cdd:cd14910    310 VGNeyVTKGQTVQQ--------------VYNAVGALAKAVYDKMFLWMVTRINQQLDTKQPRQYFIGVLDIAGFEIFDFN 375
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  780 gsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFddlepvaddsvaaVDQASHLVRSLAHADEARGLLW 859
Cdd:cd14910    376 ------SLEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWEF-------------IDFGMDLAACIELIEKPMGIFS 436
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  860 LLEEEALVPGATEDALLDRLFSYYGPQEGDKKGQSPLlRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRL 939
Cdd:cd14910    437 ILEEECMFPKATDTSFKNKLYEQHLGKSNNFQKPKPA-KGKVEAHFSLIHYAGT--VDYNIAGWLDKNK-DPLNETVVGL 512
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  940 LQDSQKKIISNLFLGrAGSATVLSGsiagleGGSQLALRRATSMRKtfttgMAAVKKKSLciqiklqvDALIDTIKRSKM 1019
Cdd:cd14910    513 YQKSSMKTLALLFSG-AAAAEAEEG------GGKKGGKKKGSSFQT-----VSALFRENL--------NKLMTNLRSTHP 572
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1020 HFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEF 1099
Cdd:cd14910    573 HFVRCIIPNETKTPG-----------------------------AMEHELVLHQLRCNGVLEGIRICRKGFPSRILYADF 623
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082233 1100 RRRFDVLAPHLTKKhGRnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14910    624 KQRYKVLNASAIPE-GQ---FIDSKKASEKLLGSIDIDHTQYKFGHTKVFFK 671
MYSc_Myh4 cd14915
class II myosin heavy chain 4, motor domain; Myosin motor domain of skeletal muscle myosin ...
395-1151 1.09e-63

class II myosin heavy chain 4, motor domain; Myosin motor domain of skeletal muscle myosin heavy chain 4 (also called MYH2B, MyHC-2B, MyHC-IIb). Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276879 [Multi-domain]  Cd Length: 671  Bit Score: 231.54  E-value: 1.09e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 474
Cdd:cd14915      2 AVLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYNPEVVTAYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  475 SGSGKTTSFQHLVQYLATIA---------GTSGTKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAG 543
Cdd:cd14915     82 SGAGKTVNTKRVIQYFATIAvtgekkkeeAASGKMQGTLED-QIISAnpLLEAFGNAKTVRNDNSSRFGKFIRIHFGATG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  544 QVASASIQTMLLEKLRVARRPASEATFNVFYYLLAcgdaTLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQA- 622
Cdd:cd14915    161 KLASADIETYLLEKSRVTFQLKAERSYHIFYQIMS----NKKPELIEMLLITTNPYDFAFVSQGEITVPSIDDQEELMAt 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  623 --AMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAgrkqfarhEWAQKAAYLLGCSLEELSSAIFKHQL 700
Cdd:cd14915    237 dsAVDILGFSADEKVAIYKLTGAVMHYGNMKFKQKQREEQAEPDGT--------EVADKAAYLTSLNSADLLKALCYPRV 308
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  701 KGGT--LQRSTSFRQGPEESGlgegtklsalecleGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEW 778
Cdd:cd14915    309 KVGNeyVTKGQTVQQVYNSVG--------------ALAKAIYEKMFLWMVTRINQQLDTKQPRQYFIGVLDIAGFEIFDF 374
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  779 GgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFddlepvaddsvaaVDQASHLVRSLAHADEARGLL 858
Cdd:cd14915    375 N------SLEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWEF-------------IDFGMDLAACIELIEKPMGIF 435
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  859 WLLEEEALVPGATEDALLDRLFSYYGPQEGDKKGQSPLlRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPR 938
Cdd:cd14915    436 SILEEECMFPKATDTSFKNKLYEQHLGKSNNFQKPKPA-KGKAEAHFSLVHYAGT--VDYNIAGWLDKNK-DPLNETVVG 511
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  939 LLQDSQKKIISNLFLGraGSATVLSGSiagleGGSQLALRRATSMRKtfttgMAAVKKKSLciqiklqvDALIDTIKRSK 1018
Cdd:cd14915    512 LYQKSGMKTLAFLFSG--GQTAEAEGG-----GGKKGGKKKGSSFQT-----VSALFRENL--------NKLMTNLRSTH 571
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1019 MHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSE 1098
Cdd:cd14915    572 PHFVRCLIPNETKTPG-----------------------------AMEHELVLHQLRCNGVLEGIRICRKGFPSRILYAD 622
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907082233 1099 FRRRFDVLAPHLTKKhGRnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14915    623 FKQRYKVLNASAIPE-GQ---FIDSKKASEKLLGSIDIDHTQYKFGHTKVFFK 671
MYSc_Myh8 cd14918
class II myosin heavy chain 8, motor domain; Myosin motor domain of perinatal skeletal muscle ...
396-1151 1.43e-63

class II myosin heavy chain 8, motor domain; Myosin motor domain of perinatal skeletal muscle myosin heavy chain 8 (also called MyHC-peri, MyHC-pn). Myosin is a hexameric protein composed of a pair of myosin heavy chains (MYH) and two pairs of nonidentical light chains. A mutation in this gene results in trismus-pseudocamptodactyly syndrome. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276882 [Multi-domain]  Cd Length: 668  Bit Score: 231.16  E-value: 1.43e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  396 VLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGSS 475
Cdd:cd14918      3 VLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYNPEVVAAYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGES 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  476 GSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQ--------ALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVAS 547
Cdd:cd14918     83 GAGKTVNTKRVIQYFATIAVTGEKKKEESGKMQgtledqiiSANPLLEAFGNAKTVRNDNSSRFGKFIRIHFGTTGKLAS 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  548 ASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELhlnhLAENNVFGIVPLSKPEEKQKAAQQFSKLQA---AM 624
Cdd:cd14918    163 ADIETYLLEKSRVTFQLKAERSYHIFYQITSNKKPDLIEML----LITTNPYDYAFVSQGEITVPSIDDQEELMAtdsAI 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  625 KVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAgrkqfarhEWAQKAAYLLGCSLEELSSAIFKHQLKGGT 704
Cdd:cd14918    239 DILGFTPEEKVSIYKLTGAVMHYGNMKFKQKQREEQAEPDGT--------EVADKAAYLQSLNSADLLKALCYPRVKVGN 310
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  705 lqrstsfrqgpEESGLGEgTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsarg 784
Cdd:cd14918    311 -----------EYVTKGQ-TVQQVYNAVGALAKAVYEKMFLWMVTRINQQLDTKQPRQYFIGVLDIAGFEIFDFN----- 373
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  785 aSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFddlepvaddsvaaVDQASHLVRSLAHADEARGLLWLLEEE 864
Cdd:cd14918    374 -SLEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWTF-------------IDFGMDLAACIELIEKPLGIFSILEEE 439
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  865 ALVPGATEDALLDRLFSYYGPQEGDKKgQSPLLRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQ 944
Cdd:cd14918    440 CMFPKATDTSFKNKLYDQHLGKSANFQ-KPKVVKGKAEAHFSLIHYAGT--VDYNITGWLDKNK-DPLNDTVVGLYQKSA 515
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  945 KKIISNLFLGRAGSATVLSGSIAGLEGGSqlalrratsmrkTFTTGMAAVKKkslciqiklQVDALIDTIKRSKMHFVHC 1024
Cdd:cd14918    516 MKTLASLFSTYASAEADSGAKKGAKKKGS------------SFQTVSALFRE---------NLNKLMTNLRSTHPHFVRC 574
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1025 FLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFD 1104
Cdd:cd14918    575 IIPNETKTPG-----------------------------AMEHELVLHQLRCNGVLEGIRICRKGFPSRILYGDFKQRYK 625
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*..
gi 1907082233 1105 VLAPHLTKKhGRnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14918    626 VLNASAIPE-GQ---FIDSKKASEKLLASIDIDHTQYKFGHTKVFFK 668
MYSc_Myh2_mammals cd14912
class II myosin heavy chain 2, motor domain; Myosin motor domain of type IIa skeletal muscle ...
395-1151 1.18e-62

class II myosin heavy chain 2, motor domain; Myosin motor domain of type IIa skeletal muscle myosin heavy chain 2 (also called MYH2A, MYHSA2, MyHC-IIa, MYHas8, MyHC-2A) in mammals. Mutations in this gene results in inclusion body myopathy-3 and familial congenital myopathy. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276877 [Multi-domain]  Cd Length: 673  Bit Score: 228.46  E-value: 1.18e-62
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 474
Cdd:cd14912      2 AVLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYNPEVVTAYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  475 SGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQALSTL----------LEAFGNSPTIMNGSATRFSQILSLDFDQAGQ 544
Cdd:cd14912     82 SGAGKTVNTKRVIQYFATIAVTGEKKKEEITSGKMQGTLedqiisanplLEAFGNAKTVRNDNSSRFGKFIRIHFGTTGK 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  545 VASASIQTMLLEKLRVARRPASEATFNVFYYLlacgDATLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQA-- 622
Cdd:cd14912    162 LASADIETYLLEKSRVTFQLKAERSYHIFYQI----TSNKKPELIEMLLITTNPYDYPFVSQGEISVASIDDQEELMAtd 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  623 -AMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAgrkqfarhEWAQKAAYLLGCSLEELSSAIFKHQLK 701
Cdd:cd14912    238 sAIDILGFTNEEKVSIYKLTGAVMHYGNLKFKQKQREEQAEPDGT--------EVADKAAYLQSLNSADLLKALCYPRVK 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  702 GGT--LQRSTSFRQGPEESGlgegtklsalecleGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWG 779
Cdd:cd14912    310 VGNeyVTKGQTVEQVTNAVG--------------ALAKAVYEKMFLWMVARINQQLDTKQPRQYFIGVLDIAGFEIFDFN 375
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  780 gsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFddlepvaddsvaaVDQASHLVRSLAHADEARGLLW 859
Cdd:cd14912    376 ------SLEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWTF-------------IDFGMDLAACIELIEKPMGIFS 436
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  860 LLEEEALVPGATEDALLDRLFSYYGPQEGDKKgQSPLLRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRL 939
Cdd:cd14912    437 ILEEECMFPKATDTSFKNKLYEQHLGKSANFQ-KPKVVKGKAEAHFSLIHYAGV--VDYNITGWLDKNK-DPLNETVVGL 512
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  940 LQDSQKKIISNLFLGragsATVLSGSIAGleGGSQLALRRATSMRKTfttgMAAVKKKSLciqiklqvDALIDTIKRSKM 1019
Cdd:cd14912    513 YQKSAMKTLAYLFSG----AQTAEGASAG--GGAKKGGKKKGSSFQT----VSALFRENL--------NKLMTNLRSTHP 574
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1020 HFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEF 1099
Cdd:cd14912    575 HFVRCIIPNETKTPG-----------------------------AMEHELVLHQLRCNGVLEGIRICRKGFPSRILYADF 625
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082233 1100 RRRFDVLAPHLTKKhGRnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14912    626 KQRYKVLNASAIPE-GQ---FIDSKKASEKLLASIDIDHTQYKFGHTKVFFK 673
MYSc_Myo29 cd14890
class XXIX myosin, motor domain; Class XXIX myosins are comprised of Stramenopiles and have ...
394-1108 3.18e-62

class XXIX myosin, motor domain; Class XXIX myosins are comprised of Stramenopiles and have very long tail domains consisting of three IQ motifs, short coiled-coil regions, up to 18 CBS domains, a PB1 domain, and a carboxy-terminal transmembrane domain. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276855 [Multi-domain]  Cd Length: 662  Bit Score: 226.97  E-value: 3.18e-62
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMS----RQDQS 468
Cdd:cd14890      1 ASLLHTLRLRYERDEIYTYVGPILISINPyKSIPDLYSEERMLLYHGTTAGELPPHVFAIADHAYTQLIQSgvldPSNQS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  469 IVLLGSSGSGKTTSFQHLVQYLATIagTSGTKVFSVEKWQALST------------------LLEAFGNSPTIMNGSATR 530
Cdd:cd14890     81 IIISGESGAGKTEATKIIMQYLARI--TSGFAQGASGEGEAASEaieqtlgsledrvlssnpLLESFGNAKTLRNDNSSR 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  531 FSQILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLnhlaENNVFGIVPLSK--PE 608
Cdd:cd14890    159 FGKFIEIQFDHHGKIVGAEISNFLLEKTRIVTQNDGERNYHIFYQLLAGADEALRERLKL----QTPVEYFYLRGEcsSI 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  609 EKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAG--ATKEPLEEQDEAAeagrkqfarHEWAQKAAYLLGC 686
Cdd:cd14890    235 PSCDDAKAFAETIRCLSTIGISEENQDAVFGLLAAVLHLGNVDfeSENDTTVLEDATT---------LQSLKLAAELLGV 305
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  687 SLEELSSAIFKHQL--KGGTLQRstsfrqgPEESGLGEgTKLSALeclegmASGLYSELFTLLISLVNRALKSSQHSLCS 764
Cdd:cd14890    306 NEDALEKALLTRQLfvGGKTIVQ-------PQNVEQAR-DKRDAL------AKALYSSLFLWLVSELNRTISSPDDKWGF 371
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  765 MMIVDTPGFQNPEWGGsargasFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADDSVAAVDQA 841
Cdd:cd14890    372 IGVLDIYGFEKFEWNT------FEQLCINYANEKLQRHFNQHMFEVEQVEYSNEGIDwqyITFNDNQACLELIEGKVNGK 445
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  842 SHLVRSLahaDEARGLLWLLEEEALVpgatedALLDRLF--SYYGPQEGDKKGQSPLLRSSK---PRHFLLGHSHGTnwV 916
Cdd:cd14890    446 PGIFITL---DDCWRFKGEEANKKFV------SQLHASFgrKSGSGGTRRGSSQHPHFVHPKfdaDKQFGIKHYAGD--V 514
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  917 EYNVAGwlnYTKQNPATqnaprlLQDSQKKIIsnlflgragsatvlsgsiagleggsqlalrrATSMRktfttgmaAVKK 996
Cdd:cd14890    515 IYDASG---FNEKNNET------LNAEMKELI-------------------------------KQSRR--------SIRE 546
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  997 KSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGEprsassrrvsssseldlppgdpceagLLQLDVSLlraQLRG 1076
Cdd:cd14890    547 VSVGAQFRTQLQELMAKISLTNPRYVRCIKPNETKAPGK--------------------------FDGLDCLR---QLKY 597
                          730       740       750
                   ....*....|....*....|....*....|....*
gi 1907082233 1077 SRLLDAMRMYRQGYP---DHMVFseFrRRFDVLAP 1108
Cdd:cd14890    598 SGMMEAIQIRQQGFAlreEHDSF--F-YDFQVLLP 629
MYSc_Myh6 cd14916
class II myosin heavy chain 6, motor domain; Myosin motor domain of alpha (or fast) cardiac ...
395-1108 5.07e-62

class II myosin heavy chain 6, motor domain; Myosin motor domain of alpha (or fast) cardiac muscle myosin heavy chain 6. Cardiac muscle myosin is a hexamer consisting of two heavy chain subunits, two light chain subunits, and two regulatory subunits. This gene encodes the alpha heavy chain subunit of cardiac myosin. Mutations in this gene cause familial hypertrophic cardiomyopathy and atrial septal defect. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276880 [Multi-domain]  Cd Length: 670  Bit Score: 226.86  E-value: 5.07e-62
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 474
Cdd:cd14916      2 AVLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYNAEVVAAYRGKKRSEAPPHIFSISDNAYQYMLTDRENQSILITGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  475 SGSGKTTSFQHLVQYLATIA--GTSGTKVFSVEKWQAL-------STLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQV 545
Cdd:cd14916     82 SGAGKTVNTKRVIQYFASIAaiGDRSKKENPNANKGTLedqiiqaNPALEAFGNAKTVRNDNSSRFGKFIRIHFGATGKL 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  546 ASASIQTMLLEKLRVARRPASEATFNVFYYLLAcgdaTLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQA--- 622
Cdd:cd14916    162 ASADIETYLLEKSRVIFQLKAERNYHIFYQILS----NKKPELLDMLLVTNNPYDYAFVSQGEVSVASIDDSEELLAtds 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  623 AMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAgrkqfarhEWAQKAAYLLGCSLEELSSAIFKHQLKG 702
Cdd:cd14916    238 AFDVLGFTAEEKAGVYKLTGAIMHYGNMKFKQKQREEQAEPDGT--------EDADKSAYLMGLNSADLLKGLCHPRVKV 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  703 GT--LQRSTSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGg 780
Cdd:cd14916    310 GNeyVTKGQSVQQ--------------VYYSIGALAKSVYEKMFNWMVTRINATLETKQPRQYFIGVLDIAGFEIFDFN- 374
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  781 sargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLepvADDSVAAVDQAshlvrslahaDEARGLLWL 860
Cdd:cd14916    375 -----SFEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWEFIDF---GMDLQACIDLI----------EKPMGIMSI 436
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  861 LEEEALVPGATEDALLDRLF-SYYGPQEGDKKGQSplLRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRL 939
Cdd:cd14916    437 LEEECMFPKASDMTFKAKLYdNHLGKSNNFQKPRN--VKGKQEAHFSLVHYAGT--VDYNILGWLEKNK-DPLNETVVGL 511
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  940 LQDSQKKIISNLFLGRAGSATvlsgsiaGLEGGSQLALRRATSMRKtfttgMAAVKKKSLciqiklqvDALIDTIKRSKM 1019
Cdd:cd14916    512 YQKSSLKLMATLFSTYASADT-------GDSGKGKGGKKKGSSFQT-----VSALHRENL--------NKLMTNLKTTHP 571
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1020 HFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEF 1099
Cdd:cd14916    572 HFVRCIIPNERKAPG-----------------------------VMDNPLVMHQLRCNGVLEGIRICRKGFPNRILYGDF 622

                   ....*....
gi 1907082233 1100 RRRFDVLAP 1108
Cdd:cd14916    623 RQRYRILNP 631
MYSc_Myo42 cd14903
class XLII myosin, motor domain; The class XLII myosins are comprised of Stramenopiles. Not ...
394-1108 1.38e-61

class XLII myosin, motor domain; The class XLII myosins are comprised of Stramenopiles. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276868 [Multi-domain]  Cd Length: 658  Bit Score: 225.04  E-value: 1.38e-61
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLL 472
Cdd:cd14903      1 AAILYNVKKRFLRKLPYTYTGDICIAVNPyQWLPELYTEEQHSKYLNKPKEELPPHVYATSVAAYNHMKRSGRNQSILVS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  473 GSSGSGKTTSFQHLVQYLATIAGtsGTKVFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQT 552
Cdd:cd14903     81 GESGAGKTETTKILMNHLATIAG--GLNDSTIKKIIEVNPLLESFGNAKTVRNDNSSRFGKFTQLQFDKNGTLVGAKCRT 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  553 MLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAennVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPE 632
Cdd:cd14903    159 YLLEKTRVISHERPERNYHIFYQLLASPDVEERLFLDSANEC---AYTGANKTIKIEGMSDRKHFARTKEALSLIGVSEE 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  633 EQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAGRkqfarhEWAQKAAYLLGCSLEELSSAIFKHQLKGGTLQRSTSFR 712
Cdd:cd14903    236 KQEVLFEVLAGILHLGQLQIQSKPNDDEKSAIAPGD------QGAVYATKLLGLSPEALEKALCSRTMRAAGDVYTVPLK 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  713 qgpeesglgegtKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaSFEELCH 792
Cdd:cd14903    310 ------------KDQAEDCRDALAKAIYSNVFDWLVATINASLGNDAKMANHIGVLDIFGFEHFKHN------SFEQFCI 371
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  793 NYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLepvADDS-VAAVDQASHLVRSLAHADEARgllwlleeealvPGAT 871
Cdd:cd14903    372 NYANEKLQQKFTQDVFKTVQIEYEEEGIRWAHIDF---ADNQdVLAVIEDRLGIISLLNDEVMR------------PKGN 436
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  872 EDALLDRLFSYYgpqeGDKKGQSPLLRSSKPRhFLLGHSHGTnwVEYNVAGWLNYTKQNpatqnaprLLQD-------SQ 944
Cdd:cd14903    437 EESFVSKLSSIH----KDEQDVIEFPRTSRTQ-FTIKHYAGP--VTYESLGFLEKHKDA--------LLPDlsdlmrgSS 501
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  945 KKIISNLFLGRAGSATVLSGSIAGLEGgsqlalRRATSMRKTFTTGMaavkkkslciQIKLQVDALIDTIKRSKMHFVHC 1024
Cdd:cd14903    502 KPFLRMLFKEKVESPAAASTSLARGAR------RRRGGALTTTTVGT----------QFKDSLNELMTTIRSTNVHYVRC 565
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1025 FLPVAEGWPGEprsassrrvsssseldlppgdpceagllqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFD 1104
Cdd:cd14903    566 IKPNSIKSPTE-----------------------------LDHLMVVSQLRCAGVIEAIRISRAAYPNRLLHEEFLDKFW 616

                   ....
gi 1907082233 1105 VLAP 1108
Cdd:cd14903    617 LFLP 620
MYSc_Myo1 cd01378
class I myosin, motor domain; Myosin I generates movement at the leading edge in cell motility, ...
394-1027 2.91e-60

class I myosin, motor domain; Myosin I generates movement at the leading edge in cell motility, and class I myosins have been implicated in phagocytosis and vesicle transport. Myosin I, an unconventional myosin, does not form dimers. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. There are 5 myosin subclasses with subclasses c/h, d/g, and a/b have an IQ domain and a TH1 domain. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276829  Cd Length: 652  Bit Score: 220.88  E-value: 2.91e-60
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPslLVLST---RGAPaVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIV 470
Cdd:cd01378      1 EAINENLKKRFENDEIYTYIGH--VLISVnpfKDLG-IYTDEVLESYRGKNRYEVPPHVFALADSAYRNMKSEKENQCVI 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  471 LLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVeKWQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASA 548
Cdd:cd01378     78 ISGESGAGKTEASKRIMQYIAAVSGGSESEVERV-KDMLLASnpLLEAFGNAKTLRNDNSSRFGKYMEIQFDFKGEPVGG 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  549 SIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL------NHLAENNVFGIVPLSKpeekqkaAQQFSKLQA 622
Cdd:cd01378    157 HITNYLLEKSRVVGQIKGERNFHIFYQLLKGASQEYLQELGLqrpeqyYYYSKSGCFDVDGIDD-------AADFKEVLN 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  623 AMKVLAISPEEQKTCWLILASIYHLGAAGATKeplEEQDEAAEAGRKQfarhewAQKAAYLLGCSLEELSSAIFKHQLKG 702
Cdd:cd01378    230 AMKVIGFTEEEQDSIFRILAAILHLGNIQFAE---DEEGNAAISDTSV------LDFVAYLLGVDPDQLEKALTHRTIET 300
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  703 GTLQRST-SFRQGPEEsglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGF----QNp 776
Cdd:cd01378    301 GGGGRSVyEVPLNVEQ----------AAYARDALAKAIYSRLFDWIVERINKSLAAKSGGKKKVIgVLDIYGFeifeKN- 369
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  777 ewggsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELA----FDDlEPVADdsvaavdqashLVRS----- 847
Cdd:cd01378    370 ---------SFEQFCINYVNEKLQQIFIELTLKAEQEEYVREGIEWTpikyFNN-KIICD-----------LIEEkppgi 428
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  848 LAHADEArgllwlleeEALVPGATEDALLDRLFSYYGPQEGDKKGQSPllRSSKPRHFLLGHSHGTnwVEYNVAGwlnYT 927
Cdd:cd01378    429 FAILDDA---------CLTAGDATDQTFLQKLNQLFSNHPHFECPSGH--FELRRGEFRIKHYAGD--VTYNVEG---FL 492
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  928 KQNPAT--QNAPRLLQDSQKKIISNLFlgragsatvlsgsiagLEGGSQLALRRATsmrktfTTGMaavkkkslciQIKL 1005
Cdd:cd01378    493 DKNKDLlfKDLKELMQSSSNPFLRSLF----------------PEGVDLDSKKRPP------TAGT----------KFKN 540
                          650       660
                   ....*....|....*....|..
gi 1907082233 1006 QVDALIDTIKRSKMHFVHCFLP 1027
Cdd:cd01378    541 SANALVETLMKKQPSYIRCIKP 562
MYSc_Myo7 cd01381
class VII myosin, motor domain; These monomeric myosins have been associated with functions in ...
394-1108 8.64e-59

class VII myosin, motor domain; These monomeric myosins have been associated with functions in sensory systems such as vision and hearing. Mammalian myosin VII has a tail with 2 MyTH4 domains, 2 FERM domains, and a SH3 domain. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276832  Cd Length: 648  Bit Score: 216.35  E-value: 8.64e-59
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGpSLLVlstrgapAV--------YSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQ 465
Cdd:cd01381      1 AGILRNLLIRYREKLIYTYTG-SILV-------AVnpyqilpiYTAEQIRLYRNKKIGELPPHIFAIADNAYTNMKRNKR 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  466 DQSIVLLGSSGSGKTTSFQHLVQYLATIAGtsgtKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAG 543
Cdd:cd01381     73 DQCVVISGESGAGKTESTKLILQYLAAISG----QHSWIEQ-QILEAnpILEAFGNAKTIRNDNSSRFGKYIDIHFNKNG 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  544 QVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNH------LAENNVFGIvplskpeEKQKAAQQF 617
Cdd:cd01381    148 VIEGAKIEQYLLEKSRIVSQAPDERNYHIFYCMLAGLSAEEKKKLELGDasdyyyLTQGNCLTC-------EGRDDAAEF 220
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  618 SKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGaTKEPLEEQDEAAEagrkqFARHEWAQKAAYLLGCSLEELSSAIFK 697
Cdd:cd01381    221 ADIRSAMKVLMFTDEEIWDIFKLLAAILHLGNIK-FEATVVDNLDASE-----VRDPPNLERAAKLLEVPKQDLVDALTT 294
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  698 HQL--KGGTLQRSTSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRAL---KSSQHSLCSMMIVDTPG 772
Cdd:cd01381    295 RTIftRGETVVSPLSAEQ--------------ALDVRDAFVKGIYGRLFIWIVNKINSAIykpRGTDSSRTSIGVLDIFG 360
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  773 FQNPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADdsVAAVDQAShlvrSLA 849
Cdd:cd01381    361 FENFEVN------SFEQLCINFANENLQQFFVRHIFKLEQEEYDKEGINwqhIEFVDNQDVLD--LIALKPMN----IMS 428
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  850 HADEargllwlleeEALVPGATEDALLDRLFSYYGPQEGDKKGQSPLLRSSKPRHFLlghshGTnwVEYNVAGWLNytKQ 929
Cdd:cd01381    429 LIDE----------ESKFPKGTDQTMLEKLHSTHGNNKNYLKPKSDLNTSFGINHFA-----GV--VFYDTRGFLE--KN 489
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  930 NPA-TQNAPRLLQDSQKKIISNLFlgragsatvlsgsiagleggsQLALRRATSMRKtfttgmaavKKKSLCIQIKLQVD 1008
Cdd:cd01381    490 RDTfSADLLQLVQSSKNKFLKQLF---------------------NEDISMGSETRK---------KSPTLSSQFRKSLD 539
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1009 ALIDTIKRSKMHFVHCFLPVAEGWPGEprsassrrvsssseldlppgdpceagllqLDVSLLRAQLRGSRLLDAMRMYRQ 1088
Cdd:cd01381    540 QLMKTLSACQPFFVRCIKPNEYKKPML-----------------------------FDRELCVRQLRYSGMMETIRIRKA 590
                          730       740
                   ....*....|....*....|
gi 1907082233 1089 GYPDHMVFSEFRRRFDVLAP 1108
Cdd:cd01381    591 GYPIRHTFEEFVERYRVLVP 610
MYSc_Myo36 cd14897
class XXXVI myosin, motor domain; This class of molluscan myosins contains a motor domain ...
395-953 5.85e-58

class XXXVI myosin, motor domain; This class of molluscan myosins contains a motor domain followed by a GlcAT-I (Beta1,3-glucuronyltransferase I) domain. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276862 [Multi-domain]  Cd Length: 635  Bit Score: 213.78  E-value: 5.85e-58
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCR-REDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14897      2 TIVQTLKSRYNKDKFYTYIGDILVAVNPCKPLPIFDKKHHEEYSNLSvRSQRPPHLFWIADQAYRRLLETGRNQCILVSG 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGTSGTkvFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTM 553
Cdd:cd14897     82 ESGAGKTESTKYMIKHLMKLSPSDDS--DLLDKIVQINPLLEAFGNASTVMNDNSSRFGKFIELHFTENGQLLGAKIDDY 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  554 LLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNH----------LAENNVFGIVplskpEEKQKAAQQFSKLQAA 623
Cdd:cd14897    160 LLEKSRVVHRGNGEKNFHIFYALFAGMSRDRLLYYFLEDpdchrilrddNRNRPVFNDS-----EELEYYRQMFHDLTNI 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  624 MKVLAISPEEQKTCWLILASIYHLgaAGATKEPLEEQDEAaeagrkQFARHEWAQKAAYLLGCSLEELSSAIF--KHQLK 701
Cdd:cd14897    235 MKLIGFSEEDISVIFTILAAILHL--TNIVFIPDEDTDGV------TVADEYPLHAVAKLLGIDEVELTEALIsnVNTIR 306
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  702 GGTLQRSTSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQ-----HSLCSMMIVDTPGFQNp 776
Cdd:cd14897    307 GERIQSWKSLRQ--------------ANDSRDALAKDLYSRLFGWIVGQINRNLWPDKdfqimTRGPSIGILDMSGFEN- 371
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  777 ewggsARGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAfdDLEPVADDSV------------AAVDQASHL 844
Cdd:cd14897    372 -----FKINSFDQLCINLSNERLQQYFNDYVFPRERSEYEIEGIEWR--DIEYHDNDDVlelffkkplgilPLLDEESTF 444
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  845 vrslahadeargllwlleeealvPGATEDALLDRLFSYYGPqegdkkgqSPLLRSSKPRHFLLGHSHGTNWVEYNVAGWL 924
Cdd:cd14897    445 -----------------------PQSTDSSLVQKLNKYCGE--------SPRYVASPGNRVAFGIRHYAEQVTYDADGFL 493
                          570       580
                   ....*....|....*....|....*....
gi 1907082233  925 NYTKQNpATQNAPRLLQDSQKKIISNLFL 953
Cdd:cd14897    494 EKNRDN-LSSDIVGCLLNSNNEFISDLFT 521
MYSc_Myo5 cd01380
class V myosin, motor domain; Myo5, also called heavy chain 12, myoxin, are dimeric myosins ...
395-826 2.15e-57

class V myosin, motor domain; Myo5, also called heavy chain 12, myoxin, are dimeric myosins that transport a variety of intracellular cargo processively along actin filaments, such as melanosomes, synaptic vesicles, vacuoles, and mRNA. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. It also contains a IQ domain and a globular DIL domain. Myosin V is a class of actin-based motor proteins involved in cytoplasmic vesicle transport and anchorage, spindle-pole alignment and mRNA translocation. The protein encoded by this gene is abundant in melanocytes and nerve cells. Mutations in this gene cause Griscelli syndrome type-1 (GS1), Griscelli syndrome type-3 (GS3) and neuroectodermal melanolysosomal disease, or Elejalde disease. Multiple alternatively spliced transcript variants encoding different isoforms have been reported, but the full-length nature of some variants has not been determined. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. Note that the Dictyostelium myoVs are not contained in this child group. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276831 [Multi-domain]  Cd Length: 629  Bit Score: 212.01  E-value: 2.15e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYG-ASLLHTYAGpslLVLStrgapAV--------YSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQ 465
Cdd:cd01380      2 AVLHNLKVRFCqRNAIYTYCG---IVLV-----AInpyedlpiYGEDIIQAYSGQNMGELDPHIFAIAEEAYRQMARDEK 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  466 DQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVfSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAG 543
Cdd:cd01380     74 NQSIIVSGESGAGKTVSAKYAMRYFATVGGSSSGET-QVEE-KVLASnpIMEAFGNAKTTRNDNSSRFGKYIEILFDKNY 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  544 QVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHlAENNVFGIVPLSKPEEKQKAAQQFSKLQAA 623
Cdd:cd01380    152 RIIGANMRTYLLEKSRVVFQAEEERNYHIFYQLCAAASLPELKELHLGS-AEDFFYTNQGGSPVIDGVDDAAEFEETRKA 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  624 MKVLAISPEEQKTCWLILASIYHLG-----AAGATKEPLEEQDEAaeagrkqfarhewAQKAAYLLGCSLEELSSAIFKH 698
Cdd:cd01380    231 LTLLGISEEEQMEIFRILAAILHLGnveikATRNDSASISPDDEH-------------LQIACELLGIDESQLAKWLCKR 297
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  699 QLK--GGTLQRSTSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQ-HSLCSMM-IVDTPGFQ 774
Cdd:cd01380    298 KIVtrSEVIVKPLTLQQ--------------AIVARDALAKHIYAQLFDWIVDRINKALASPVkEKQHSFIgVLDIYGFE 363
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082233  775 NPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDD 826
Cdd:cd01380    364 TFEVN------SFEQFCINYANEKLQQQFNQHVFKLEQEEYVKEEIEWSFID 409
MYSc_Myo10 cd14873
class X myosin, motor domain; Myosin X is an unconventional myosin motor that functions as a ...
395-1111 6.86e-57

class X myosin, motor domain; Myosin X is an unconventional myosin motor that functions as a monomer. In mammalian cells, the motor is found to localize to filopodia. Myosin X walks towards the barbed ends of filaments and is thought to walk on bundles of actin, rather than single filaments, a unique behavior. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. C-terminal to the head domain are a variable number of IQ domains, 2 PH domains, a MyTH4 domain, and a FERM domain. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276840 [Multi-domain]  Cd Length: 651  Bit Score: 210.81  E-value: 6.86e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGpSLL--VLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLL 472
Cdd:cd14873      2 SIMYNLFQRYKRNQIYTYIG-SILasVNPYQPIAGLYEPATMEQYSRRHLGELPPHIFAIANECYRCLWKRHDNQCILIS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  473 GSSGSGKTTSFQHLVQYLATIAGTS-----GTKVFSVEKWQALST-LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVA 546
Cdd:cd14873     81 GESGAGKTESTKLILKFLSVISQQSlelslKEKTSCVEQAILESSpIMEAFGNAKTVYNNNSSRFGKFVQLNICQKGNIQ 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  547 SASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNhLAEN----NVFGIVPLSKPEEKqkaaQQFSKLQA 622
Cdd:cd14873    161 GGRIVDYLLEKNRVVRQNPGERNYHIFYALLAGLEHEEREEFYLS-TPENyhylNQSGCVEDKTISDQ----ESFREVIT 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  623 AMKVLAISPEEQKTCWLILASIYHLGAAgatkepleeqdEAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIfkhqlkg 702
Cdd:cd14873    236 AMEVMQFSKEEVREVSRLLAGILHLGNI-----------EFITAGGAQVSFKTALGRSAELLGLDPTQLTDAL------- 297
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  703 gtLQRSTSFRqGPEESglgegTKLS---ALECLEGMASGLYSELFTLLISLVNRALKSSQHsLCSMMIVDTPGFQNPEWG 779
Cdd:cd14873    298 --TQRSMFLR-GEEIL-----TPLNvqqAVDSRDSLAMALYARCFEWVIKKINSRIKGKED-FKSIGILDIFGFENFEVN 368
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  780 gsargaSFEELCHNYAQDRLQRLFHERTF-LQELERYKEDnieLAFDDLEPVadDSVAAVDQASHLVRSLAHADEargll 858
Cdd:cd14873    369 ------HFEQFNINYANEKLQEYFNKHIFsLEQLEYSREG---LVWEDIDWI--DNGECLDLIEKKLGLLALINE----- 432
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  859 wlleeEALVPGATEDALLDRLFSyygpqeGDKKGQSPLlrssKPR--HFLLGHSHGTNWVEYNVAGWLnytKQNPAT--Q 934
Cdd:cd14873    433 -----ESHFPQATDSTLLEKLHS------QHANNHFYV----KPRvaVNNFGVKHYAGEVQYDVRGIL---EKNRDTfrD 494
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  935 NAPRLLQDSQKKIISNLFlgragsatvlsgsiagleggsqlalRRATSMRKTFTTGMAAVKKK-SLCIQIKLQVDALIDT 1013
Cdd:cd14873    495 DLLNLLRESRFDFIYDLF-------------------------EHVSSRNNQDTLKCGSKHRRpTVSSQFKDSLHSLMAT 549
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1014 IKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDH 1093
Cdd:cd14873    550 LSSSNPFFVRCIKPNMQKMPD-----------------------------QFDQAVVLNQLRYSGMLETVRIRKAGYAVR 600
                          730
                   ....*....|....*...
gi 1907082233 1094 MVFSEFRRRFDVLAPHLT 1111
Cdd:cd14873    601 RPFQDFYKRYKVLMRNLA 618
MYSc_Myo3 cd01379
class III myosin, motor domain; Myosin III has been shown to play a role in the vision process ...
395-832 2.73e-56

class III myosin, motor domain; Myosin III has been shown to play a role in the vision process in insects and in hearing in mammals. Myosin III, an unconventional myosin, does not form dimers. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. They are characterized by an N-terminal protein kinase domain and several IQ domains. Some members also contain WW, SH2, PH, and Y-phosphatase domains. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276830 [Multi-domain]  Cd Length: 633  Bit Score: 208.67  E-value: 2.73e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 474
Cdd:cd01379      2 TIVSQLQKRYSRDQIYTYIGDILIAVNPFQNLGIYTEEHSRLYRGAKRSDNPPHIFAVADAAYQAMIHQKKNQCIVISGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  475 SGSGKTTSFQHLVQYLaTIAGTSGTKVFSvEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTML 554
Cdd:cd01379     82 SGAGKTESANLLVQQL-TVLGKANNRTLE-EKILQVNPLMEAFGNARTVINDNSSRFGKYLEMKFTSTGAVTGARISEYL 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  555 LEKLRVARRPASEATFNVFYYLLA-------CGDATLRTELHLNHLAEnnvFGIVPLSKpEEKQKAAQQFSKLQAAMKVL 627
Cdd:cd01379    160 LEKSRVVHQAIGERNFHIFYYIYAglaedkkLAKYKLPENKPPRYLQN---DGLTVQDI-VNNSGNREKFEEIEQCFKVI 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  628 AISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAaeagRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQL--KGGTL 705
Cdd:cd01379    236 GFTKEEVDSVYSILAAILHIGDIEFTEVESNHQTDK----SSRISNPEALNNVAKLLGIEADELQEALTSHSVvtRGETI 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  706 QRSTSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQHS---LCSMMIVDTPGFQNpewggsA 782
Cdd:cd01379    312 IRNNTVEE--------------ATDARDAMAKALYGRLFSWIVNRINSLLKPDRSAsdePLSIGILDIFGFEN------F 371
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907082233  783 RGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIEL---AFDDLEPVAD 832
Cdd:cd01379    372 QKNSFEQLCINIANEQIQYYFNQHIFAWEQQEYLNEGIDVdliEYEDNRPLLD 424
MYSc_Myo15 cd01387
class XV mammal-like myosin, motor domain; The class XV myosins are monomeric. In vertebrates, ...
394-1151 3.13e-56

class XV mammal-like myosin, motor domain; The class XV myosins are monomeric. In vertebrates, myosin XV appears to be expressed in sensory tissue and play a role in hearing. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. C-terminal to the head domain are 2 MyTH4 domain, a FERM domain, and a SH3 domain. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276838 [Multi-domain]  Cd Length: 657  Bit Score: 209.22  E-value: 3.13e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd01387      1 TTVLWNLKTRYERNLIYTYIGSILVSVNPYKMFDIYGLEQVQQYSGRALGELPPHLFAIANLAFAKMLDAKQNQCVVISG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGTSGTKVfsVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQaGQVASASIQTM 553
Cdd:cd01387     81 ESGSGKTEATKLIMQYLAAVNQRRNNLV--TEQILEATPLLEAFGNAKTVRNDNSSRFGKYLEVFFEG-GVIVGAITSQY 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  554 LLEKLRVARRPASEATFNVFYYLLACGDATLRTE---------LHLNHLAENNVFGivplskpeekQKAAQQFSKLQAAM 624
Cdd:cd01387    158 LLEKSRIVTQAKNERNYHVFYELLAGLPAQLRQKyglqeaekyFYLNQGGNCEIAG----------KSDADDFRRLLAAM 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  625 KVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAGRKqfARHEWaqkAAYLLGCSLEELSSAIfkhqlkggt 704
Cdd:cd01387    228 QVLGFSSEEQDSIFRILASVLHLGNVYFHKRQLRHGQEGVSVGSD--AEIQW---VAHLLQISPEGLQKAL--------- 293
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  705 LQRSTSFRQGPEESGLgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsarg 784
Cdd:cd01387    294 TFKVTETRRERIFTPL---TIDQALDARDAIAKALYALLFSWLVTRVNAIVYSGTQDTLSIAILDIFGFEDLSEN----- 365
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  785 aSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVAddsvaavdqasHLVrslahADEARGLLWLL 861
Cdd:cd01387    366 -SFEQLCINYANENLQYYFNKHVFKLEQEEYIREQIDwteIAFADNQPVI-----------NLI-----SKKPVGILHIL 428
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  862 EEEALVPGATEDALLDRLFSYYGPQEgdkkgqspllRSSKPR----HFLLGHSHGTNWveYNVAGWLNYTKqNPATQNAP 937
Cdd:cd01387    429 DDECNFPQATDHSFLEKCHYHHALNE----------LYSKPRmplpEFTIKHYAGQVW--YQVHGFLDKNR-DQLRQDVL 495
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  938 RLLQDSQKKIISNLFlgragsatvlsgsiagleggSQLALRRATSMRKTFTTGMAAVKKKSLCIQIKLQ--VDALIDTIK 1015
Cdd:cd01387    496 ELLVSSRTRVVAHLF--------------------SSHRAQTDKAPPRLGKGRFVTMKPRTPTVAARFQdsLLQLLEKME 555
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1016 RSKMHFVHCflpvaegwpgeprsassrrvsssseldLPPGDPCEAGLLQLDVSLlrAQLRGSRLLDAMRMYRQGYPDHMV 1095
Cdd:cd01387    556 RCNPWFVRC---------------------------LKPNHKKEPMLFDMDVVM--AQLRYSGMLETIRIRKEGYPVRLP 606
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082233 1096 FSEFRRRFDvlapHLTKKHGRNYIVVDEKRAVeELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd01387    607 FQVFIDRYR----CLVALKLPRPAPGDMCVSL-LSRLCTVTPKDMYRLGATKVFLR 657
MYSc_Myo35 cd14896
class XXXV myosin, motor domain; This class of metazoan myosins contains 2 IQ motifs, 2 MyTH4 ...
394-1107 2.24e-54

class XXXV myosin, motor domain; This class of metazoan myosins contains 2 IQ motifs, 2 MyTH4 domains, a single FERM domain, and an SH3 domain. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276861 [Multi-domain]  Cd Length: 644  Bit Score: 203.09  E-value: 2.24e-54
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14896      1 SSVLLCLKKRFHLGRIYTFGGPILLSLNPHRSLPLFSEEVLASYHPRKALNTTPHIFAIAASAYRLSQSTGQDQCILLSG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIagtsGTKVFSVEKWQALSTL--LEAFGNSPTIMNGSATRFSQILSLDFdQAGQVASASIQ 551
Cdd:cd14896     81 HSGSGKTEAAKKIVQFLSSL----YQDQTEDRLRQPEDVLpiLESFGHAKTILNANASRFGQVLRLHL-QHGVIVGASVS 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  552 TMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHlAEN----NVFGIVPLSKPEEkqkaAQQFSKLQAAMKVL 627
Cdd:cd14896    156 HYLLETSRVVFQAQAERSFHVFYELLAGLDPEEREQLSLQG-PETyyylNQGGACRLQGKED----AQDFEGLLKALQGL 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  628 AISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAgrkqfarhEWA--QKAAYLLGCSLEELSSAIFKHQLKGGTL 705
Cdd:cd14896    231 GLCAEELTAIWAVLAAILQLGNICFSSSERESQEVAAVS--------SWAeiHTAARLLQVPPERLEGAVTHRVTETPYG 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  706 QRSTSFrqgPEEsglgegtklSALECLEGMASGLYSELFTLLISLVNRAL--KSSQHSLCSMMIVDTPGFQnpewggSAR 783
Cdd:cd14896    303 RVSRPL---PVE---------GAIDARDALAKTLYSRLFTWLLKRINAWLapPGEAESDATIGVVDAYGFE------ALR 364
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  784 GASFEELCHNYAQDRLQRLFHERTFLQELERYKEDniELAFDDLEPVADDSVA--AVDQASHLVRSLahadeargllwll 861
Cdd:cd14896    365 VNGLEQLCINLASERLQLFSSQTLLAQEEEECQRE--LLPWVPIPQPPRESCLdlLVDQPHSLLSIL------------- 429
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  862 EEEALVPGATEDALLDRLFSYYGPQEGDKKGQSPLlrsskPRhFLLGHSHGTnwVEYNVAGWLNYTKQ--NPATQNaprL 939
Cdd:cd14896    430 DDQTWLSQATDHTFLQKCHYHHGDHPSYAKPQLPL-----PV-FTVRHYAGT--VTYQVHKFLNRNRDqlDPAVVE---M 498
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  940 LQDSQKKIISNLFlgragsatvlsgsiagLEGGSQLALRRatsmrktfttgmaavKKKSLCIQIKLQVDALIDTIKRSKM 1019
Cdd:cd14896    499 LAQSQLQLVGSLF----------------QEAEPQYGLGQ---------------GKPTLASRFQQSLGDLTARLGRSHV 547
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1020 HFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEF 1099
Cdd:cd14896    548 YFIHCLNPNPGKLPG-----------------------------LFDVGHVTEQLRQAGILEAIGTRSEGFPVRVPFQAF 598

                   ....*...
gi 1907082233 1100 RRRFDVLA 1107
Cdd:cd14896    599 LARFGALG 606
MYSc_Myo17 cd14879
class XVII myosin, motor domain; This fungal myosin which is also known as chitin synthase ...
391-830 2.61e-54

class XVII myosin, motor domain; This fungal myosin which is also known as chitin synthase uses its motor domain to tether its vesicular cargo to peripheral actin. It works in opposition to dynein, contributing to the retention of Mcs1 vesicles at the site of cell growth and increasing vesicle fusion necessary for polarized growth. Class 17 myosins consist of a N-terminal myosin motor domain with Cyt-b5, chitin synthase 2, and a DEK_C domains at it C-terminus. The chitin synthase region contains several transmembrane domains by which myosin 17 is thought to bind secretory vesicles. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276845 [Multi-domain]  Cd Length: 647  Bit Score: 203.17  E-value: 2.61e-54
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  391 LNESSVLHTLRQRYGASLLHTYAGPS-LLVLSTRGAPAVYSEKVMHMFK-------GCRREDMAPHIYAVAQTAYRAMLM 462
Cdd:cd14879      1 PSDDAITSHLASRFRSDLPYTRLGSSaLVAVNPYKYLSSNSDASLGEYGseyydttSGSKEPLPPHAYDLAARAYLRMRR 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  463 SRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTS--GTKVfsVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFD 540
Cdd:cd14879     81 RSEDQAVVFLGETGSGKSESRRLLLRQLLRLSSHSkkGTKL--SSQISAAEFVLDSFGNAKTLTNPNASRFGRYTELQFN 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  541 QAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVF---GIVPL-SKPEEKQkaAQQ 616
Cdd:cd14879    159 ERGRLIGAKVLDYRLERSRVASVPTGERNFHVFYYLLAGASPEERQHLGLDDPSDYALLasyGCHPLpLGPGSDD--AEG 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  617 FSKLQAAMKVLAISPEEQKTCWLILASIYHLGAagatkepLEEQDEAAeagrkqfARHEWA--------QKAAYLLGCSL 688
Cdd:cd14879    237 FQELKTALKTLGFKRKHVAQICQLLAAILHLGN-------LEFTYDHE-------GGEESAvvkntdvlDIVAAFLGVSP 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  689 EELSSAI-FKHQLKGGtlQRSTSFrqgpeesgLG-EGTKLSALEclegMASGLYSELFTLLISLVNRALKSSQHSLCSMM 766
Cdd:cd14879    303 EDLETSLtYKTKLVRK--ELCTVF--------LDpEGAAAQRDE----LARTLYSLLFAWVVETINQKLCAPEDDFATFI 368
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082233  767 -IVDTPGFQNPewgGSARGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELA----FDDLEPV 830
Cdd:cd14879    369 sLLDFPGFQNR---SSTGGNSLDQFCVNFANERLHNYVLRSFFERKAEELEAEGVSVPatsyFDNSDCV 434
MYSc_Myo28 cd14889
class XXVIII myosin, motor domain; These myosins are found in fish, chicken, and mollusks. The ...
396-1106 3.15e-53

class XXVIII myosin, motor domain; These myosins are found in fish, chicken, and mollusks. The tail regions of these class-XXVIII myosins consist of an IQ motif, a short coiled-coil region, and an SH2 domain. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276854  Cd Length: 659  Bit Score: 200.13  E-value: 3.15e-53
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  396 VLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAML----MSRQDQSIVL 471
Cdd:cd14889      3 LLEVLKVRFMQSNIYTYVGDILVAINPFKYLHIYEKEVSQKYKCEKKSSLPPHIFAVADRAYQSMLgrlaRGPKNQCIVI 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  472 LGSSGSGKTTSFQHLVQYLATIAgtSGTKVFSVEKWQaLSTLLEAFGNSPTIMNGSATRFSQILSLDFdQAGQVASASIQ 551
Cdd:cd14889     83 SGESGAGKTESTKLLLRQIMELC--RGNSQLEQQILQ-VNPLLEAFGNAQTVMNDNSSRFGKYIQLRF-RNGHVKGAKIN 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  552 TMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL----NHLAENNVFGivplsKPEEKQKAAQQFSKLQAAMKVL 627
Cdd:cd14889    159 EYLLEKSRVVHQDGGEENFHIFYYMFAGISAEDRENYGLldpgKYRYLNNGAG-----CKREVQYWKKKYDEVCNAMDMV 233
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  628 AISPEEQKTCWLILASIYHLGAAgatkePLEEQDEaaEAGRKQFARHEWAQKAAYLLGCSLEELSsaifkhqlkgGTLQR 707
Cdd:cd14889    234 GFTEQEEVDMFTILAGILSLGNI-----TFEMDDD--EALKVENDSNGWLKAAAGQFGVSEEDLL----------KTLTC 296
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  708 STSFRQGpeESGLGEGTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHS---LCSMMIVDTPGFQNPEWGgsarg 784
Cdd:cd14889    297 TVTFTRG--EQIQRHHTKQQAEDARDSIAKVAYGRVFGWIVSKINQLLAPKDDSsveLREIGILDIFGFENFAVN----- 369
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  785 aSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNI---ELAFDDLEPVAD-------DSVAAVDQASHLvrslahadea 854
Cdd:cd14889    370 -RFEQACINLANEQLQYFFNHHIFLMEQKEYKKEGIdwkEITYKDNKPILDlflnkpiGILSLLDEQSHF---------- 438
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  855 rgllwlleeealvPGATEDALLDRLFSYYGPQEGDKKGQspllrsSKPRHFLLGHSHGTnwVEYNVAGWLNYTKQN-PAT 933
Cdd:cd14889    439 -------------PQATDESFVDKLNIHFKGNSYYGKSR------SKSPKFTVNHYAGK--VTYNASGFLEKNRDTiPAS 497
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  934 QNAprLLQDSQKKIISNLFLGRAGSATVLSGSIAGLEGGSQlalrratsmrktfttGMAAVKKKSLCIQIKLQVDALIDT 1013
Cdd:cd14889    498 IRT--LFINSATPLLSVLFTATRSRTGTLMPRAKLPQAGSD---------------NFNSTRKQSVGAQFKHSLGVLMEK 560
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1014 IKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDH 1093
Cdd:cd14889    561 MFAASPHFVRCIKPNHVKVPG-----------------------------QLDSKYIQDQLRYNGLLETIRIRREGFSWR 611
                          730
                   ....*....|...
gi 1907082233 1094 MVFSEFRRRFDVL 1106
Cdd:cd14889    612 PSFAEFAERYKIL 624
MYSc_Myo9 cd01385
class IX myosin, motor domain; Myosin IX is a processive single-headed motor, which might play ...
395-1108 3.93e-53

class IX myosin, motor domain; Myosin IX is a processive single-headed motor, which might play a role in signalling. It has a N-terminal RA domain, an IQ domain, a C1_1 domain, and a RhoGAP domain. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276836 [Multi-domain]  Cd Length: 690  Bit Score: 200.29  E-value: 3.93e-53
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 474
Cdd:cd01385      2 TLLENLRARFKHGKIYTYVGSILIAVNPFKFLPIYNPKYVKMYQNRRLGKLPPHIFAIADVAYHAMLRKKKNQCIVISGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  475 SGSGKTTSFQHLVQYLaTIAGTSGTKVfSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQT 552
Cdd:cd01385     82 SGSGKTESTNFLLHHL-TALSQKGYGS-GVEQ-TILGAgpVLEAFGNAKTAHNNNSSRFGKFIQVNYRENGMVRGAVVEK 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  553 MLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHlAENNVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPE 632
Cdd:cd01385    159 YLLEKSRIVSQEKNERNYHVFYYLLAGASEEERKELHLKQ-PEDYHYLNQSDCYTLEGEDEKYEFERLKQAMEMVGFLPE 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  633 EQKTCWLILASIYHLGAAgATKEPLEEQDEAAEAGRKqfarhEWAQKAAYLLGCSLEELSSAIF--KHQLKGGTLQRSTS 710
Cdd:cd01385    238 TQRQIFSVLSAVLHLGNI-EYKKKAYHRDESVTVGNP-----EVLDIISELLRVKEETLLEALTtkKTVTVGETLILPYK 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  711 FrqgPEesglgegtklsALECLEGMASGLYSELFTLLISLVNRAL---KSSQHSLC-SMMIVDTPGFQNPEWGgsargaS 786
Cdd:cd01385    312 L---PE-----------AIATRDAMAKCLYSALFDWIVLRINHALlnkKDLEEAKGlSIGVLDIFGFEDFGNN------S 371
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  787 FEELCHNYAQDRLQRLFHERTFLQELERYKEDNIElaFDDLEPVadDSVAAVdqasHLVRSLAHadearGLLWLLEEEAL 866
Cdd:cd01385    372 FEQFCINYANEHLQYYFNQHIFKLEQEEYKKEGIS--WHNIEYT--DNTGCL----QLISKKPT-----GLLCLLDEESN 438
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  867 VPGATEDALLDRLFS-------YYGPQEgdkkgqspllrssKPRHFLLGHSHGTnwVEYNVAGW----LNYTKQNPATqn 935
Cdd:cd01385    439 FPGATNQTLLAKFKQqhkdnkyYEKPQV-------------MEPAFIIAHYAGK--VKYQIKDFreknLDLMRPDIVA-- 501
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  936 aprLLQDSQKKIISNL-----------------FLGRAGSATVLSGSIAGLEGGSQLALRRATSMRktfttGMAAVKKKS 998
Cdd:cd01385    502 ---VLRSSSSAFVRELigidpvavfrwavlrafFRAMAAFREAGRRRAQRTAGHSLTLHDRTTKSL-----LHLHKKKKP 573
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  999 LCIQIKLQV--DALIDTIKRSKMHFVHCFLPVAEGWPgeprsassrrvsssseldlppgdpceaglLQLDVSLLRAQLRG 1076
Cdd:cd01385    574 PSVSAQFQTslSKLMETLGQAEPFFIRCIKSNAEKKP-----------------------------LRFDDELVLRQLRY 624
                          730       740       750
                   ....*....|....*....|....*....|..
gi 1907082233 1077 SRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAP 1108
Cdd:cd01385    625 TGMLETVRIRRSGYSVRYTFQEFITQFQVLLP 656
MYSc_Myo41 cd14902
class XLI myosin, motor domain; The class XLI myosins are comprised of Stramenopiles. Not much ...
395-1113 4.14e-52

class XLI myosin, motor domain; The class XLI myosins are comprised of Stramenopiles. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276867 [Multi-domain]  Cd Length: 716  Bit Score: 197.81  E-value: 4.14e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFK--------GCRREDMAPHIYAVAQTAYRAMLMS-R 464
Cdd:cd14902      2 ALLQALSERFEHDQIYTSIGDILVALNPlKPLPDLYSESQLNAYKasmtstspVSQLSELPPHVFAIGGKAFGGLLKPeR 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  465 QDQSIVLLGSSGSGKTTSFQHLVQYLATI-----AGTSGTKVFSVEKWQALST--LLEAFGNSPTIMNGSATRFSQILSL 537
Cdd:cd14902     82 RNQSILVSGESGSGKTESTKFLMQFLTSVgrdqsSTEQEGSDAVEIGKRILQTnpILESFGNAQTIRNDNSSRFGKFIKI 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  538 DFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAE---NNVFGIVPLSKPEEKQKAA 614
Cdd:cd14902    162 QFGANNEIVGAQIVSYLLEKVRLLHQSPEERSFHIFYELLEGADKTLLDLLGLQKGGKyelLNSYGPSFARKRAVADKYA 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  615 QQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEA-AEAGRKQFArhewaqKAAYLLGCSLEELSS 693
Cdd:cd14902    242 QLYVETVRAFEDTGVGELERLDIFKILAALLHLGNVNFTAENGQEDATAvTAASRFHLA------KCAELMGVDVDKLET 315
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  694 AIFKHQLKGGtlQRSTSFRQGPEEsglgegtklsALECLEGMASGLYSELFTLLISLVN---------RALKSSQHSLCS 764
Cdd:cd14902    316 LLSSREIKAG--VEVMVLKLTPEQ----------AKEICGSLAKAIYGRLFTWLVRRLSdeinyfdsaVSISDEDEELAT 383
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  765 MMIVDTPGFQNPEWGGsargasFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVaddsVAAVDqa 841
Cdd:cd14902    384 IGILDIFGFESLNRNG------FEQLCINYANERLQAQFNEFVFVKEQQIYIAEGIDwknISYPSNAAC----LALFD-- 451
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  842 shlvrslahaDEARGLLWLLEEEALVPGATEDALLDRLFSYYGPQEgdkkgqspllrsskprHFLLGHSHGTnwVEYNVA 921
Cdd:cd14902    452 ----------DKSNGLFSLLDQECLMPKGSNQALSTKFYRYHGGLG----------------QFVVHHFAGR--VCYNVE 503
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  922 GWLNyTKQNPATQNAPRLLQDSQKKIISNLFL-GRAGSATVLSGSiagleggsqlALRRATSMrktfttgmaaVKKKSLC 1000
Cdd:cd14902    504 QFVE-KNTDALPADASDILSSSSNEVVVAIGAdENRDSPGADNGA----------AGRRRYSM----------LRAPSVS 562
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1001 IQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLL 1080
Cdd:cd14902    563 AQFKSQLDRLIVQIGRTEAHYVRCLKPNEVKKPG-----------------------------IFDRERMVEQMRSVGVL 613
                          730       740       750
                   ....*....|....*....|....*....|...
gi 1907082233 1081 DAMRMYRQGYPDHMVFSEFRRRFDVLAPHLTKK 1113
Cdd:cd14902    614 EAVRIARHGYSVRLAHASFIELFSGFKCFLSTR 646
MYSc_Myo43 cd14904
class XLIII myosin, motor domain; The class XLIII myosins are comprised of Stramenopiles. Not ...
395-1108 5.06e-52

class XLIII myosin, motor domain; The class XLIII myosins are comprised of Stramenopiles. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276869  Cd Length: 653  Bit Score: 196.32  E-value: 5.06e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14904      2 SILFNLKKRFAASKPYTYTNDIVIALNPyKWIDNLYGDHLHEQYLKKPRDKLQPHVYATSTAAYKHMLTNEMNQSILVSG 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGtsGTKVFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTM 553
Cdd:cd14904     82 ESGAGKTETTKIVMNHLASVAG--GRKDKTIAKVIDVNPLLESFGNAKTTRNDNSSRFGKFTQLQFDGRGKLIGAKCETY 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  554 LLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPEE 633
Cdd:cd14904    160 LLEKSRVVSIAEGERNYHIFYQLLAGLSSEERKEFGLDPNCQYQYLGDSLAQMQIPGLDDAKLFASTQKSLSLIGLDNDA 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  634 QKTCWLILASIYHLGAAgatkePLEEQDEAAEAgrkqFARHEWAQKAAYLLGCSLEELSSAIfkhqlkggtLQRSTSFRQ 713
Cdd:cd14904    240 QRTLFKILSGVLHLGEV-----MFDKSDENGSR----ISNGSQLSQVAKMLGLPTTRIEEAL---------CNRSVVTRN 301
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  714 gpeESGLGEGTKLSALECLEGMASGLYSELFTLLISLVNRALkSSQHSLCSMMI--VDTPGFQNPEWGGsargasFEELC 791
Cdd:cd14904    302 ---ESVTVPLAPVEAEENRDALAKAIYSKLFDWMVVKINAAI-STDDDRIKGQIgvLDIFGFEDFAHNG------FEQFC 371
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  792 HNYAQDRLQRLFHERTFLQELERYKEDNieLAFDDLEPVADDSVAAVDQASHLVRSLAHaDEARGllwlleeealvPGAT 871
Cdd:cd14904    372 INYANEKLQQKFTTDVFKTVEEEYIREG--LQWDHIEYQDNQGIVEVIDGKMGIIALMN-DHLRQ-----------PRGT 437
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  872 EDALLDRLFSYYGPQEGDKKGQSPllrSSKPRHFLLGHSHGTnwVEYNVAGWLNytKQNPATQN-APRLLQDSQKKIISN 950
Cdd:cd14904    438 EEALVNKIRTNHQTKKDNESIDFP---KVKRTQFIINHYAGP--VTYETVGFME--KHRDTLQNdLLDLVLLSSLDLLTE 510
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  951 LFlgraGSATVLSGSIAGLEGGSQLAlrratsmrktfttgmaavkKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAE 1030
Cdd:cd14904    511 LF----GSSEAPSETKEGKSGKGTKA-------------------PKSLGSQFKTSLSQLMDNIKTTNTHYVRCIKPNAN 567
                          650       660       670       680       690       700       710
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082233 1031 GWPGEprsassrrvsssseldlppgdpceagllqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAP 1108
Cdd:cd14904    568 KSPTE-----------------------------FDKRMVVEQLRSAGVIEAIRITRSGYPSRLTPKELATRYAIMFP 616
MYSc_Myo27 cd14888
class XXVII myosin, motor domain; Not much is known about this myosin class. The catalytic ...
395-1108 5.25e-52

class XXVII myosin, motor domain; Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276853 [Multi-domain]  Cd Length: 667  Bit Score: 196.45  E-value: 5.25e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLL-VLSTRGAPAVYSEKVM--HMFKGcrrEDMAPHIYAVAQTAYRAMLMSRQDQSIVL 471
Cdd:cd14888      2 SILHSLNLRFDIDEIYTFTGPILIaVNPFKTIPGLYSDEMLlkFIQPS---ISKSPHVFSTASSAYQGMCNNKKSQTILI 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  472 LGSSGSGKTTSFQHLVQYLATiAGTSGTKVFSVEKWQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQ-------- 541
Cdd:cd14888     79 SGESGAGKTESTKYVMKFLAC-AGSEDIKKRSLVEAQVLESnpLLEAFGNARTLRNDNSSRFGKFIELQFSKlkskrmsg 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  542 -AGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIV--PLSKPEEKQKAAQQFS 618
Cdd:cd14888    158 dRGRLCGAKIQTYLLEKVRVCDQQEGERNYHIFYQLCAAAREAKNTGLSYEENDEKLAKGADakPISIDMSSFEPHLKFR 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  619 KLQ--------------------AAMKVLAISPEEQKTCWLILASIYHLGAAgatkepLEEQDEAAEAGRKQFA-RHEWA 677
Cdd:cd14888    238 YLTksschelpdvddleefestlYAMQTVGISPEEQNQIFSIVAAILYLGNI------LFENNEACSEGAVVSAsCTDDL 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  678 QKAAYLLGCSLEELSSAifkhqLKGGTLQRSTSFRQGPEESGLGEGTKlsaleclEGMASGLYSELFTLLISLVNRAL-K 756
Cdd:cd14888    312 EKVASLLGVDAEDLLNA-----LCYRTIKTAHEFYTKPLRVDEAEDVR-------DALARALYSCLFDKVVERTNESIgY 379
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  757 SSQHSLCSMMIVDTPGFQnpewggSARGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIElaFDDLE-PVADDSV 835
Cdd:cd14888    380 SKDNSLLFCGVLDIFGFE------CFQLNSFEQLCINFTNERLQQFFNNFVFKCEEKLYIEEGIS--WNPLDfPDNQDCV 451
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  836 AAVDQASHLVrsLAHADEargllwlleeEALVPGATEDALLDRLFSYYGpqeGDKKGQSPllrSSKPRHFLLGHSHGTnw 915
Cdd:cd14888    452 DLLQEKPLGI--FCMLDE----------ECFVPGGKDQGLCNKLCQKHK---GHKRFDVV---KTDPNSFVIVHFAGP-- 511
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  916 VEYNVAGWLNYTKqNPATQNAPRLLQDSQKKIISNLFlgragsatvlsgsiagleggsqlalrrATSMRKTFTTGMAAVK 995
Cdd:cd14888    512 VKYCSDGFLEKNK-DQLSVDAQEVIKNSKNPFISNLF---------------------------SAYLRRGTDGNTKKKK 563
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  996 KKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPgeprsassrrvsssSELDlppgdpceagllQLDVSllrAQLR 1075
Cdd:cd14888    564 FVTVSSEFRNQLDVLMETIDKTEPHFIRCIKPNSQNVP--------------DLFD------------RISVN---EQLK 614
                          730       740       750
                   ....*....|....*....|....*....|...
gi 1907082233 1076 GSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAP 1108
Cdd:cd14888    615 YGGVLQAVQVSRAGYPVRLSHAEFYNDYRILLN 647
MYSc_Myo31 cd14892
class XXXI myosin, motor domain; Class XXXI myosins have a very long neck region consisting of ...
397-1107 8.89e-52

class XXXI myosin, motor domain; Class XXXI myosins have a very long neck region consisting of 17 IQ motifs and 2 tandem ANK repeats that are separated by a PH domain. The myosin classes XXX to XXXIV contain members from Phytophthora species and Hyaloperonospora parasitica. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276857 [Multi-domain]  Cd Length: 656  Bit Score: 195.75  E-value: 8.89e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  397 LHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSekVMHMFKGCRREDMA----PHIYAVAQTAYRAMLMSR----QDQ 467
Cdd:cd14892      4 LDVLRRRYERDAIYTFTADILISINPyKSIPLLYD--VPGFDSQRKEEATAssppPHVFSIAERAYRAMKGVGkgqgTPQ 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  468 SIVLLGSSGSGKTTSFQHLVQYLATI----AGTSGTKVF-----SVEKWQALS-TLLEAFGNSPTIMNGSATRFSQILSL 537
Cdd:cd14892     82 SIVVSGESGAGKTEASKYIMKYLATAsklaKGASTSKGAanaheSIEECVLLSnLILEAFGNAKTIRNDNSSRFGKYIQI 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  538 DFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAEnnvFGIVPLSKPEEKQKA--AQ 615
Cdd:cd14892    162 HYNSDGRIAGASTDHFLLEKSRLVGPDANERNYHIFYQLLAGLDANENAALELTPAES---FLFLNQGNCVEVDGVddAT 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  616 QFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAagATKEPLEEQDEAAeagrKQFARHEWAQKAAYLLGCSLEELSSAI 695
Cdd:cd14892    239 EFKQLRDAMEQLGFDAEFQRPIFEVLAAVLHLGN--VRFEENADDEDVF----AQSADGVNVAKAAGLLGVDAAELMFKL 312
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  696 FKHQLKGGtlqRSTSFRQ--GPEEsglgegtklsALECLEGMASGLYSELFTLLISLVNRAlkSSQHSLCSMMIVDTP-- 771
Cdd:cd14892    313 VTQTTSTA---RGSVLEIklTARE----------AKNALDALCKYLYGELFDWLISRINAC--HKQQTSGVTGGAASPtf 377
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  772 ----------GFQnpewggSARGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADdsvaaV 838
Cdd:cd14892    378 spfigildifGFE------IMPTNSFEQLCINFTNEMLQQQFNKHVFVLEQEVYASEGIDvsaIEFQDNQDCLD-----L 446
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  839 DQASHLvrslahadeargllwlleeeALVPGATEDALLDR-------LFSYYgpQEGDKKGQSpllrSSKPR----HFLL 907
Cdd:cd14892    447 IQKKPL--------------------GLLPLLEEQMLLKRkttdkqlLTIYH--QTHLDKHPH----YAKPRfecdEFVL 500
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  908 GHSHGTnwVEYNVAGWLnyTKQNPATQNAPRLLQDSQKKiisnlflgragsatvlsgsiagleggsqlalrratsmrktF 987
Cdd:cd14892    501 RHYAGD--VTYDVHGFL--AKNNDNLHDDLRDLLRSSSK----------------------------------------F 536
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  988 TTgmaavkkkslciqiklQVDALIDTIKRSKMHFVHCFLPvaegwpgeprsassrrvsssSELDLPPGDPCEagllqldv 1067
Cdd:cd14892    537 RT----------------QLAELMEVLWSTTPSYIKCIKP--------------------NNLKFPGGFSCE-------- 572
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|
gi 1907082233 1068 sLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLA 1107
Cdd:cd14892    573 -LVRDQLIYSGVLEVVRIRREGFPIRRQFEEFYEKFWPLA 611
MYSc_Myo4 cd14872
class IV myosin, motor domain; These myosins all possess a WW domain either N-terminal or ...
395-1151 5.18e-51

class IV myosin, motor domain; These myosins all possess a WW domain either N-terminal or C-terminal to their motor domain and a tail with a MyTH4 domain followed by a SH3 domain in some instances. The monomeric Acanthamoebas were the first identified members of this group and have been joined by Stramenopiles. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276839  Cd Length: 644  Bit Score: 193.07  E-value: 5.18e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVM--HMFKGCRRedMAPHIYAVAQTAYRAMLMSRQDQSIVLL 472
Cdd:cd14872      2 MIVHNLRKRFKNDQIYTNVGTILISVNPFKRLPLYTPTVMdqYMHKGPKE--MPPHTYNIADDAYRAMIVDAMNQSILIS 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  473 GSSGSGKTTSFQHLVQYLATIAGTSGtkvfSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASI 550
Cdd:cd14872     80 GESGAGKTEATKQCLSFFAEVAGSTN----GVEQ-RVLLAnpILEAFGNAKTLRNNNSSRFGKWVEIHFDNRGRICGAST 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  551 QTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNhlaenNVFGIVPLSKPEEKQKA--AQQFSKLQAAMKVLA 628
Cdd:cd14872    155 ENYLLEKSRVVYQIKGERNFHIFYQLLASPDPASRGGWGSS-----AAYGYLSLSGCIEVEGVddVADFEEVVLAMEQLG 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  629 ISPEEQKTCWLILASIYHLGAAGatkepLEEQDEAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKH--QLKGgtlQ 706
Cdd:cd14872    230 FDDADINNVMSLIAAILKLGNIE-----FASGGGKSLVSGSTVANRDVLKEVATLLGVDAATLEEALTSRlmEIKG---C 301
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  707 RSTSFRQGPEEsglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQHSL-CSMMIVDTPGFQNPEWGgsarga 785
Cdd:cd14872    302 DPTRIPLTPAQ----------ATDACDALAKAAYSRLFDWLVKKINESMRPQKGAKtTFIGVLDIFGFEIFEKN------ 365
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  786 SFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADdsvaavdqashLVrslahadEAR--GLLWL 860
Cdd:cd14872    366 SFEQLCINFTNEKLQQHFNQYTFKLEEALYQSEGVKfehIDFIDNQPVLD-----------LI-------EKKqpGLMLA 427
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  861 LEEEALVPGATEDALLDRLfsyyGPQEGDKKGQSPLLRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLL 940
Cdd:cd14872    428 LDDQVKIPKGSDATFMIAA----NQTHAAKSTFVYAEVRTSRTEFIVKHYAGD--VTYDITGFLEKNK-DTLQKDLYVLL 500
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  941 QDSQKKIISNLFlgragsatvlsgsiagleggSQLALRRATSmrktfttgmaavkKKSLCIQIKLQVDALIDTIKRSKMH 1020
Cdd:cd14872    501 SSSKNKLIAVLF--------------------PPSEGDQKTS-------------KVTLGGQFRKQLSALMTALNATEPH 547
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1021 FVHCflpvaegwpgeprsassrrvsssseldLPPGDPCEAGLLQLDVSLlrAQLRGSRLLDAMRMYRQGYPDHMVFSEFR 1100
Cdd:cd14872    548 YIRC---------------------------VKPNQEKRARLFDGFMSL--EQLRYAGVFEAVKIRKTGYPFRYSHERFL 598
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1101 RRFDVLAPHLTKKHGRnyivvDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1151
Cdd:cd14872    599 KRYRFLVKTIAKRVGP-----DDRQRCDLLLKSLKQDFSKVQVGKTRVLYR 644
MYSc_Myo40 cd14901
class XL myosin, motor domain; The class XL myosins are comprised of Stramenopiles. Not much ...
395-1149 4.14e-49

class XL myosin, motor domain; The class XL myosins are comprised of Stramenopiles. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276866 [Multi-domain]  Cd Length: 655  Bit Score: 187.69  E-value: 4.14e-49
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSE--KVMHMFKGCRRED----MAPHIYAVAQTAYRAMLMSR---- 464
Cdd:cd14901      2 SILHVLRRRFAHGLIYTSTGAILVAINPFRRLPLYDDetKEAYYEHGERRAAgerkLPPHVYAVADKAFRAMLFASrgqk 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  465 QDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKW----QALST--LLEAFGNSPTIMNGSATRFSQILSLD 538
Cdd:cd14901     82 CDQSILVSGESGAGKTETTKIIMNYLASVSSATTHGQNATEREnvrdRVLESnpILEAFGNARTNRNNNSSRFGKFIRLG 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  539 FDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFS 618
Cdd:cd14901    162 FASSGSLLGASISTYLLERVRLVSQAKGERNYHIFYELLRGASSDELHALGLTHVEEYKYLNSSQCYDRRDGVDDSVQYA 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  619 KLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKepleeqdEAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKH 698
Cdd:cd14901    242 KTRHAMTTIGMSPDEQISVLQLVAAVLHLGNLCFVK-------KDGEGGTFSMSSLANVRAACDLLGLDMDVLEKTLCTR 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  699 QLKGGTLQRSTSFRqgPEESGLGEgtklsaleclEGMASGLYSELFTLLISLVNRALK--SSQHSLCSMMIVDTPGF--- 773
Cdd:cd14901    315 EIRAGGEYITMPLS--VEQALLTR----------DVVAKTLYAQLFDWLVDRINESIAysESTGASRFIGIVDIFGFeif 382
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  774 -QNpewggsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLePVADDSVAAVDQASHLVRSLahAD 852
Cdd:cd14901    383 aTN----------SLEQLCINFANEKLQQLFGKFVFEMEQDEYVAEAIPWTFVEY-PNNDACVAMFEARPTGLFSL--LD 449
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  853 EargllwlleeEALVPGATEDALLDrlfSYYgpqegDKKGQSPLLRSSK----PRHFLLGHSHGTnwVEYNVAGWLNYTK 928
Cdd:cd14901    450 E----------QCLLPRGNDEKLAN---KYY-----DLLAKHASFSVSKlqqgKRQFVIHHYAGA--VCYATDGFCDKNK 509
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  929 QNPATqNAPRLLQDSqkkiiSNLFLgragSATVLSgsiagleggsqlalrratsmrktfttgmaavkkkslciQIKLQVD 1008
Cdd:cd14901    510 DHVHS-EALALLRTS-----SNAFL----SSTVVA--------------------------------------KFKVQLS 541
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1009 ALIDTIKRSKMHFVHCflpvaegwpgeprsassrrvsssseldLPPGDpcEAGLLQLDVSLLRAQLRGSRLLDAMRMYRQ 1088
Cdd:cd14901    542 SLLEVLNATEPHFIRC---------------------------IKPND--VLSPSEFDAKRVLEQLRCSGVLEAVKISRS 592
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1089 GYPDHMVFSEFRRRFDVLAPHLTKKHGRNYIVVDEKRAVEELLESLDLEKSSCCLGLSRVF 1149
Cdd:cd14901    593 GYPVRFPHDAFVHTYSCLAPDGASDTWKVNELAERLMSQLQHSELNIEHLPPFQVGKTKVF 653
MYSc_Myo46 cd14907
class XLVI myosin, motor domain; The class XLVI myosins are comprised of Alveolata. Not much ...
394-1106 9.91e-49

class XLVI myosin, motor domain; The class XLVI myosins are comprised of Alveolata. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276872 [Multi-domain]  Cd Length: 669  Bit Score: 186.78  E-value: 9.91e-49
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRRE--------DMAPHIYAVAQTAYRAMLMSR 464
Cdd:cd14907      1 AELLINLKKRYQQDKIFTYVGPTLIVMNPyKQIDNLFSEEVMQMYKEQIIQngeyfdikKEPPHIYAIAALAFKQLFENN 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  465 QDQSIVLLGSSGSGKTTSFQHLVQYLATIAG----------------TSGTKVFSVEKwQALST--LLEAFGNSPTIMNG 526
Cdd:cd14907     81 KKQAIVISGESGAGKTENAKYAMKFLTQLSQqeqnseevltltssirATSKSTKSIEQ-KILSCnpILEAFGNAKTVRND 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  527 SATRFSQILSLDFD-QAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL-NHLAENNVFgivPL 604
Cdd:cd14907    160 NSSRFGKYVSILVDkKKRKILGARIQNYLLEKSRVTQQGQGERNYHIFYHLLYGADQQLLQQLGLkNQLSGDRYD---YL 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  605 SKPE----EKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEqDEAAEAGRKQFarhewAQKA 680
Cdd:cd14907    237 KKSNcyevDTINDEKLFKEVQQSFQTLGFTEEEQDSIWRILAAILLLGNLQFDDSTLDD-NSPCCVKNKET-----LQII 310
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  681 AYLLGCSLEELSSAIFKHQLKGGTlQRSTSfrqgpeesglgegtKLSALEC---LEGMASGLYSELFTLLISLVNRAL-- 755
Cdd:cd14907    311 AKLLGIDEEELKEALTTKIRKVGN-QVITS--------------PLSKKECinnRDSLSKELYDRLFNWLVERLNDTImp 375
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  756 ------KSSQHSLCSMMIVDTPGFQNPEwggsarGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE-----LAF 824
Cdd:cd14907    376 kdekdqQLFQNKYLSIGLLDIFGFEVFQ------NNSFEQLCINYTNEKLQQLYISYVFKAEEQEFKEEGLEdylnqLSY 449
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  825 DDLEPVAD--DS-----VAAVDQASHLvrslahadeargllwlleeealvPGATEDALLDRLFSYYGpqeGDKKGQSPll 897
Cdd:cd14907    450 TDNQDVIDllDKppigiFNLLDDSCKL-----------------------ATGTDEKLLNKIKKQHK---NNSKLIFP-- 501
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  898 RSSKPRHFLLGHSHGTnwVEYNVAGWL--NYTKQNPATQNaprLLQDSQKKIISNLFLGRAGSATVLSGSIAGleggsql 975
Cdd:cd14907    502 NKINKDTFTIRHTAKE--VEYNIEGFRekNKDEISQSIIN---CIQNSKNRIISSIFSGEDGSQQQNQSKQKK------- 569
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  976 alrratsmrktfttgmAAVKKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEGwpgeprsassrrvsssseldlppg 1055
Cdd:cd14907    570 ----------------SQKKDKFLGSKFRNQMKQLMNELMQCDVHFIRCIKPNEEK------------------------ 609
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1056 dpcEAGLLQLDVSLLraQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVL 1106
Cdd:cd14907    610 ---KADLFIQGYVLN--QIRYLGVLESIRVRKQGYPYRKSYEDFYKQYSLL 655
MYSc_Myo34 cd14895
class XXXIV myosin, motor domain; Class XXXIV myosins are composed of an IQ motif, a short ...
395-1107 6.45e-46

class XXXIV myosin, motor domain; Class XXXIV myosins are composed of an IQ motif, a short coiled-coil region, 5 tandem ANK repeats, and a carboxy-terminal FYVE domain. The myosin classes XXX to XXXIV contain members from Phytophthora species and Hyaloperonospora parasitica. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276860 [Multi-domain]  Cd Length: 704  Bit Score: 178.99  E-value: 6.45e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSekvMHMFkgcrREDM------APHIYAVAQTAYRAMLM----- 462
Cdd:cd14895      2 AFVDYLAQRYGVDQVYCRSGAVLIAVNPfKHIPGLYD---LHKY----REEMpgwtalPPHVFSIAEGAYRSLRRrlhep 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  463 --SRQDQSIVLLGSSGSGKTTSFQHLVQYLA-----TIAGTSGTKVFSVEKWQALST--LLEAFGNSPTIMNGSATRFSQ 533
Cdd:cd14895     75 gaSKKNQTILVSGESGAGKTETTKFIMNYLAesskhTTATSSSKRRRAISGSELLSAnpILESFGNARTLRNDNSSRFGK 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  534 ILSLDF-----DQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNvFGIVPLSKPE 608
Cdd:cd14895    155 FVRMFFeghelDTSLRMIGTSVETYLLEKVRVVHQNDGERNFHVFYELLAGAADDMKLELQLELLSAQE-FQYISGGQCY 233
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  609 EKQKAAQ---QFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAA--GATKEPLEEQDEAAEAGRKQFARHEWAQKAAY- 682
Cdd:cd14895    234 QRNDGVRddkQFQLVLQSMKVLGFTDVEQAAIWKILSALLHLGNVlfVASSEDEGEEDNGAASAPCRLASASPSSLTVQq 313
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  683 -------LLGCSLEELSSAIFKHQLKGGtlqrstsfrqgpEESGLGEGTKLSALECLEGMASGLYSELFTLLISLVNRAL 755
Cdd:cd14895    314 hldivskLFAVDQDELVSALTTRKISVG------------GETFHANLSLAQCGDARDAMARSLYAFLFQFLVSKVNSAS 381
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  756 KSSQHSLCS-----------MMIVDTPGFQNPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAF 824
Cdd:cd14895    382 PQRQFALNPnkaankdttpcIAVLDIFGFEEFEVN------QFEQFCINYANEKLQYQFIQDILLTEQQAHIEEGIKWNA 455
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  825 DDLEpvaDDSVA--AVDQASHLVRSLAhaDEargllwlleeEALVPGATEDALLDRLFSYYGpqegDKKGQSPLLRSSKP 902
Cdd:cd14895    456 VDYE---DNSVCleMLEQRPSGIFSLL--DE----------ECVVPKGSDAGFARKLYQRLQ----EHSNFSASRTDQAD 516
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  903 RHFLLGHSHGTnwVEYNVAGWLNYTKQNPaTQNAPRLLQDSQKKIISNLFlgragsaTVLSGSIAGLEGGSQLALRRATS 982
Cdd:cd14895    517 VAFQIHHYAGA--VRYQAEGFCEKNKDQP-NAELFSVLGKTSDAHLRELF-------EFFKASESAELSLGQPKLRRRSS 586
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  983 MRKTFTTGMaavkkkslciQIKLQVDALIDTIKRSKMHFVHCflpvaegwpgeprsassrrvsssseldLPPGDPCEAGl 1062
Cdd:cd14895    587 VLSSVGIGS----------QFKQQLASLLDVVQQTQTHYIRC---------------------------IKPNDESASD- 628
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*
gi 1907082233 1063 lQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLA 1107
Cdd:cd14895    629 -QFDMAKVSSQLRYGGVLKAVEIMRQSYPVRMKHADFVKQYRLLV 672
MYSc_Myo6 cd01382
class VI myosin, motor domain; Myosin VI is a monomeric myosin, which moves towards the ...
395-816 1.48e-44

class VI myosin, motor domain; Myosin VI is a monomeric myosin, which moves towards the minus-end of actin filaments, in contrast to most other myosins which moves towards the plus-end of actin filaments. It is thought that myosin VI, unlike plus-end directed myosins, does not use a pure lever arm mechanism, but instead steps with a mechanism analogous to the kinesin neck-linker uncoupling model. It has been implicated in a myriad of functions including: the transport of cytoplasmic organelles, maintenance of normal Golgi morphology, endocytosis, secretion, cell migration, border cell migration during development, and in cancer metastasis playing roles in deafness and retinal development among others. While how this is accomplished is largely unknown there are several interacting proteins that have been identified such as disabled homolog 2 (DAB2), GIPC1, synapse-associated protein 97 (SAP97; also known as DLG1) and optineurin, which have been found to target myosin VI to different cellular compartments. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the minus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276833  Cd Length: 649  Bit Score: 173.59  E-value: 1.48e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd01382      2 TLLNNIRVRYSKDKIYTYVANILIAVNPyFDIPKLYSSETIKSYQGKSLGTLPPHVFAIADKAYRDMKVLKQSQSIIVSG 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIAGTSGTKVFS--VEKwqalSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQ 551
Cdd:cd01382     82 ESGAGKTESTKYILRYLTESWGSGAGPIEQriLEA----NPLLEAFGNAKTVRNNNSSRFGKFVEIHFNEKSSVVGGFVS 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  552 TMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENnvfgivplskpeekqkaAQQFSKLQAAMKVLAISP 631
Cdd:cd01382    158 HYLLEKSRICVQSKEERNYHIFYRLCAGAPEDLREKLLKDPLLDD-----------------VGDFIRMDKAMKKIGLSD 220
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  632 EEQKTCWLILASIYHLGAAgatkePLEEQDEAAEAGRKQFARHEWA-QKAAYLLGCSLEELSSAI---FKHQLKGGTlqR 707
Cdd:cd01382    221 EEKLDIFRVVAAVLHLGNI-----EFEENGSDSGGGCNVKPKSEQSlEYAAELLGLDQDELRVSLttrVMQTTRGGA--K 293
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  708 STSFRQG--PEESGlgegtklSALECLegmASGLYSELFTLLISLVNRAL--KSSQHSLCsmmIVDTPGFQ----Npewg 779
Cdd:cd01382    294 GTVIKVPlkVEEAN-------NARDAL---AKAIYSKLFDHIVNRINQCIpfETSSYFIG---VLDIAGFEyfevN---- 356
                          410       420       430
                   ....*....|....*....|....*....|....*..
gi 1907082233  780 gsargaSFEELCHNYAQDRLQRLFHERTFLQELERYK 816
Cdd:cd01382    357 ------SFEQFCINYCNEKLQQFFNERILKEEQELYE 387
MYSc_Myo30 cd14891
class XXX myosin, motor domain; Myosins of class XXX are composed of an amino-terminal ...
434-821 3.13e-44

class XXX myosin, motor domain; Myosins of class XXX are composed of an amino-terminal SH3-like domain, two IQ motifs, a coiled-coil region and a PX domain. The myosin classes XXX to XXXIV contain members from Phytophthora species and Hyaloperonospora parasitica. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276856  Cd Length: 645  Bit Score: 172.54  E-value: 3.13e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  434 MHMFKGCRREDMAPHIYAVAQTAYRAMLMSR---QDQSIVLLGSSGSGKTTSFQHLVQYLAT-----IAGTSGTKVFSVE 505
Cdd:cd14891     40 KSDYINTPLDPCPPHPYAIAEMAYQQMCLGSgrmQNQSIVISGESGAGKTETSKIILRFLTTravggKKASGQDIEQSSK 119
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  506 KWQALST-----------LLEAFGNSPTIMNGSATRFSQILSLDF-DQAGQVASASIQTMLLEKLRVARRPASEATFNVF 573
Cdd:cd14891    120 KRKLSVTslderlmdtnpILESFGNAKTLRNHNSSRFGKFMKLQFtKDKFKLAGAFIETYLLEKSRLVAQPPGERNFHIF 199
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  574 YYLLACGDATLRTELHlnhlaennvfgivpLSKPEEKQKAAQ-------------QFSKLQAAMKVLAISPEEQKTCWLI 640
Cdd:cd14891    200 YQLLAGASAELLKELL--------------LLSPEDFIYLNQsgcvsddniddaaNFDNVVSALDTVGIDEDLQLQIWRI 265
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  641 LASIYHLGAAGATKEPLEEQD--EAAEAGRKQFArhewaqKAAYLLGCSLEELSSAIfkhqlkggtLQRSTSFRqgpees 718
Cdd:cd14891    266 LAGLLHLGNIEFDEEDTSEGEaeIASESDKEALA------TAAELLGVDEEALEKVI---------TQREIVTR------ 324
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  719 GLGEGTKLSALECL---EGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEwggsaRGASFEELCHNYA 795
Cdd:cd14891    325 GETFTIKRNAREAVysrDAIAKSIYERLFLWIVQQINTSLGHDPDPLPYIGVLDIFGFESFE-----TKNDFEQLLINYA 399
                          410       420
                   ....*....|....*....|....*.
gi 1907082233  796 QDRLQRLFHERTFLQELERYKEDNIE 821
Cdd:cd14891    400 NEALQATFNQQVFIAEQELYKSEGID 425
MYSc_Myo39 cd14900
class XXXIX myosin, motor domain; The class XXXIX myosins are found in Stramenopiles. Not much ...
395-1122 1.63e-43

class XXXIX myosin, motor domain; The class XXXIX myosins are found in Stramenopiles. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276865  Cd Length: 627  Bit Score: 170.10  E-value: 1.63e-43
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMF-------------KGcrREDMAPHIYAVAQTAYRAM 460
Cdd:cd14900      2 TILSALETRFYAQKIYTNTGAILLAVNPfQKLPGLYSSDTMAKYllsfearssstrnKG--SDPMPPHIYQVAGEAYKAM 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  461 LMSR----QDQSIVLLGSSGSGKTTSFQHLVQYLA---------TIAGTSGTKVFSVEKWQAlSTLLEAFGNSPTIMNGS 527
Cdd:cd14900     80 MLGLngvmSDQSILVSGESGSGKTESTKFLMEYLAqagdnnlaaSVSMGKSTSGIAAKVLQT-NILLESFGNARTLRNDN 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  528 ATRFSQILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLAcGDAtlrtelhlnhlaennvfgivplskp 607
Cdd:cd14900    159 SSRFGKFIKLHFTSGGRLTGASIQTYLLEKVRLVSQSKGERNYHIFYEMAI-GAS------------------------- 212
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  608 eEKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGA---AGATKEPLEEQDEAAEAGRKQFARhewaQKAAYLL 684
Cdd:cd14900    213 -EAARKRDMYRRVMDAMDIIGFTPHERAGIFDLLAALLHIGNltfEHDENSDRLGQLKSDLAPSSIWSR----DAAATLL 287
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  685 GCSLEELSSAIFKHQLKGGTLQRSTsfrqgpeesglgegtKLSALEC---LEGMASGLYSELFTLLISLVNRALK----- 756
Cdd:cd14900    288 SVDATKLEKALSVRRIRAGTDFVSM---------------KLSAAQAnnaRDALAKALYGRLFDWLVGKMNAFLKmddss 352
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  757 SSQHSLCSMMIVDTPGF----QNpewggsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEp 829
Cdd:cd14900    353 KSHGGLHFIGILDIFGFevfpKN----------SFEQLCINFANETLQQQFNDYVFKAEQREYESQGVDwkyVEFCDNQ- 421
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  830 vadDSVAAVDQASHLVRSLahADEargllwlleeEALVPGATEDALLDRLFSyygpqegdKKGQSPLLRSSKPRH----F 905
Cdd:cd14900    422 ---DCVNLISQRPTGILSL--IDE----------ECVMPKGSDTTLASKLYR--------ACGSHPRFSASRIQRarglF 478
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  906 LLGHSHGTnwVEYNVAGWLNytkqnpatQNAPRLLQDsqkkiISNLFLGragsatvlsgsiagleggsqlalrratsmrk 985
Cdd:cd14900    479 TIVHYAGH--VEYSTDGFLE--------KNKDVLHQE-----AVDLFVY------------------------------- 512
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  986 tfttgmaavkkkslCIQIKLQVDALIDTIKRSKMHFVHCflpvaegwpgeprsassrrvsssseldLPPGDPCEAGLLQL 1065
Cdd:cd14900    513 --------------GLQFKEQLTTLLETLQQTNPHYVRC---------------------------LKPNDLCKAGIYER 551
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1066 DVSLlrAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVL----APHLTKKHGrnYIVVD 1122
Cdd:cd14900    552 ERVL--NQLRCNGVMEAVRVARAGFPIRLLHDEFVARYFSLarakNRLLAKKQG--TSLPD 608
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1202-1896 1.58e-42

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 171.51  E-value: 1.58e-42
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1202 NIKKNKGVKDWPWWKLFTTVRPLIQVQLSEEQIRNKDE-EIQQLRSKLEKVEKERNELrlssdrlETRISELTSELtder 1280
Cdd:pfam01576  374 NLEKAKQALESENAELQAELRTLQQAKQDSEHKRKKLEgQLQELQARLSESERQRAEL-------AEKLSKLQSEL---- 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1281 ntgESASQLLDAETAERLRTEKEMKELQTQ-YDALKKQMEVMEMEVMEARLIRA--AEINGEVDDDDAGGEWRLKYERAV 1357
Cdd:pfam01576  443 ---ESVSSLLNEAEGKNIKLSKDVSSLESQlQDTQELLQEETRQKLNLSTRLRQleDERNSLQEQLEEEEEAKRNVERQL 519
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1358 REVDF----TKKRLQQELEdKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHE 1433
Cdd:pfam01576  520 STLQAqlsdMKKKLEEDAG-TLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSN 598
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1434 LEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSqeSKDEA 1513
Cdd:pfam01576  599 LEKKQKKFDQMLAEEKAISARYAEERDRAEAEAREKETRALSLARALEEALEAKEELERTNKQLRAEMEDLVS--SKDDV 676
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1514 --SLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEV 1591
Cdd:pfam01576  677 gkNVHELERSKRALEQQVEEMKTQLEELEDELQATEDAKLRLEVNMQALKAQFERDLQARDEQGEEKRRQLVKQVRELEA 756
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1592 QLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLrKDLKRTKALLADAQIMLDHLKNN-----APSKREIAQL 1666
Cdd:pfam01576  757 ELEDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAV-KQLKKLQAQMKDLQRELEEARASrdeilAQSKESEKKL 835
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1667 KN------QLEESeftCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKH 1740
Cdd:pfam01576  836 KNleaellQLQED---LAASERARRQAQQERDELADEIASGASGKSALQDEKRRLEARIAQLEEELEEEQSNTELLNDRL 912
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1741 KAAVAQ---------ASRDMAQMND-LQAQIEESNKekqELQEKLQALQSQVEFLEqsmvdKSLVSRQEAKIRELETRLE 1810
Cdd:pfam01576  913 RKSTLQveqlttelaAERSTSQKSEsARQQLERQNK---ELKAKLQEMEGTVKSKF-----KSSIAALEAKIAQLEEQLE 984
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1811 FEKTQ----VKRLENLASRLKETMEKLTEER---DQRAAAENREKEQNKRLQRQLRDTKEEMSelarkEAEASRKKheLE 1883
Cdd:pfam01576  985 QESRErqaaNKLVRRTEKKLKEVLLQVEDERrhaDQYKDQAEKGNSRMKQLKRQLEEAEEEAS-----RANAARRK--LQ 1057
                          730
                   ....*....|...
gi 1907082233 1884 MDLESLEAANQSL 1896
Cdd:pfam01576 1058 RELDDATESNESM 1070
MYSc_Myo45 cd14906
class XLV myosin, motor domain; The class XLVI myosins are comprised of slime molds ...
395-1119 1.03e-41

class XLV myosin, motor domain; The class XLVI myosins are comprised of slime molds Dictyostelium and Polysphondylium. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276871 [Multi-domain]  Cd Length: 715  Bit Score: 165.92  E-value: 1.03e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRR-EDMAPHIYAVAQTAYRAMLMSRQDQSIVLL 472
Cdd:cd14906      2 IILNNLGKRYKSDSIYTYIGNVLISINPyKDISSIYSNLILNEYKDINQnKSPIPHIYAVALRAYQSMVSEKKNQSIIIS 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  473 GSSGSGKTTSFQHLVQYLATiagTSGTKVF----------SVEKWQALST-LLEAFGNSPTIMNGSATRFSQILSLDFDQ 541
Cdd:cd14906     82 GESGSGKTEASKTILQYLIN---TSSSNQQqnnnnnnnnnSIEKDILTSNpILEAFGNSRTTKNHNSSRFGKFLKIEFRS 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  542 A-GQVASASIQTMLLEKLRVARRP-ASEATFNVFYYLLACGDATLRTELHLNH--------LAENNVFGIV------PLS 605
Cdd:cd14906    159 SdGKIDGASIETYLLEKSRISHRPdNINLSYHIFYYLVYGASKDERSKWGLNNdpskyrylDARDDVISSFksqssnKNS 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  606 KPEEKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGaagaTKEPLEEQDEAAEAGRKQFARHEWAQkAAYLLG 685
Cdd:cd14906    239 NHNNKTESIESFQLLKQSMESMSINKEQCDAIFLSLAAILHLG----NIEFEEDSDFSKYAYQKDKVTASLES-VSKLLG 313
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  686 CSLEELSSAIFKHQLKGGTlqRSTSFRQgPEESGLGEGTKlsaleclEGMASGLYSELFTLLISLVNR-----------A 754
Cdd:cd14906    314 YIESVFKQALLNRNLKAGG--RGSVYCR-PMEVAQSEQTR-------DALSKSLYVRLFKYIVEKINRkfnqntqsndlA 383
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  755 LKSSQHSLCSMMIVDTPGFQNpewggsARGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELA---FDDlepvA 831
Cdd:cd14906    384 GGSNKKNNLFIGVLDIFGFEN------LSSNSLEQLLINFTNEKLQQQFNLNVFENEQKEYLSEGIPWSnsnFID----N 453
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  832 DDSVAAVDQASHLVRSLAHaDEArgllwlleeeaLVPGATEDALLDRLFSYYgpqegdKKGQSPLLRSSKPRHFLLGHSH 911
Cdd:cd14906    454 KECIELIEKKSDGILSLLD-DEC-----------IMPKGSEQSLLEKYNKQY------HNTNQYYQRTLAKGTLGIKHFA 515
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  912 GTnwVEYNVAGWLNYTKQNPATqNAPRLLQDSQKKIISNLFLGRAGSATvlsgsiagleggsqlalrrATSMRKTFTTGM 991
Cdd:cd14906    516 GD--VTYQTDGWLEKNRDSLYS-DVEDLLLASSNFLKKSLFQQQITSTT-------------------NTTKKQTQSNTV 573
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  992 AAvkkkslciQIKLQVDALIDTIKRSKMHFVHCFLPvaegwpgeprsassrrvssssELDLPPGDpceagllqLDVSLLR 1071
Cdd:cd14906    574 SG--------QFLEQLNQLIQTINSTSVHYIRCIKP---------------------NQTMDCNN--------FNNVHVL 616
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*...
gi 1907082233 1072 AQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLTKKHGRNYI 1119
Cdd:cd14906    617 SQLRNVGVLNTIKVRKMGYSYRRDFNQFFSRYKCIVDMYNRKNNNNPK 664
MYSc_Myo16 cd14878
class XVI myosin, motor domain; These XVI type myosins are also known as Neuronal ...
394-1107 2.72e-39

class XVI myosin, motor domain; These XVI type myosins are also known as Neuronal tyrosine-phosphorylated phosphoinositide-3-kinase adapter 3/NYAP3. Myo16 is thought to play a regulatory role in cell cycle progression and has been recently implicated in Schizophrenia. Class XVI myosins are characterized by an N-terminal ankyrin repeat domain and some with chitin synthase domains that arose independently from the ones in the class XVII fungal myosins. They bind protein phosphatase 1 catalytic subunits 1alpha/PPP1CA and 1gamma/PPP1CC. Human Myo16 interacts with ACOT9, ARHGAP26 and PIK3R2 and with components of the WAVE1 complex, CYFIP1 and NCKAP1. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276844 [Multi-domain]  Cd Length: 656  Bit Score: 157.67  E-value: 2.72e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMF---KGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIV 470
Cdd:cd14878      1 SSLLYEIQKRFGNNQIYTFIGDILLLVNPYKELPIYSTMVSQLYlssSGQLCSSLPPHLFSCAERAFHQLFQERRPQCFI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  471 LLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSveKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDF-DQAGQVASAS 549
Cdd:cd14878     81 LSGERGSGKTEASKQIMKHLTCRASSSRTTFDS--RFKHVNCILEAFGHAKTTLNDLSSCFIKYFELQFcERKKHLTGAR 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  550 IQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVfgiVPLSKPEEKQKAA-----QQFSKLQAAM 624
Cdd:cd14878    159 IYTYMLEKSRLVSQPPGQSNFLIFYLLMDGLSAEEKYGLHLNNLCAHRY---LNQTMREDVSTAErslnrEKLAVLKQAL 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  625 KVLAISPEEQKTCWLILASIYHLGAAGATKepLEEQDEAAEAGRKQFarhewaQKAAYLLGCSLEELSSAIFK--HQLKG 702
Cdd:cd14878    236 NVVGFSSLEVENLFVILSAILHLGDIRFTA--LTEADSAFVSDLQLL------EQVAGMLQVSTDELASALTTdiQYFKG 307
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  703 GTLQRSTSFRQGPEESGLgegtklsaleclegMASGLYSELFTLLISLVNRALKsSQHSLCSMM-----IVDTPGFQNpe 777
Cdd:cd14878    308 DMIIRRHTIQIAEFYRDL--------------LAKSLYSRLFSFLVNTVNCCLQ-SQDEQKSMQtldigILDIFGFEE-- 370
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  778 wggsARGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE-------------LAFDDLEPVADDSVaaVDQASHL 844
Cdd:cd14878    371 ----FQKNEFEQLCVNMTNEKMHHYINEVLFLQEQTECVQEGVTmetayspgnqtgvLDFFFQKPSGFLSL--LDEESQM 444
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  845 VRSLAHADEARgllwlleeealVPGATEDALLDRLFSYYGPQEGD--KKGQSPLlrsskprhFLLGHSHGTnwVEYNVAG 922
Cdd:cd14878    445 IWSVEPNLPKK-----------LQSLLESSNTNAVYSPMKDGNGNvaLKDQGTA--------FTVMHYAGR--VMYEIVG 503
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  923 WLNYTKqNPATQNAPRLLQDSQKKIISNLFLGRAgsATVlsgsiaglegGSQLalrratsmrktfttgmaavkKKSLC-I 1001
Cdd:cd14878    504 AIEKNK-DSLSQNLLFVMKTSENVVINHLFQSKL--VTI----------ASQL--------------------RKSLAdI 550
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1002 QIKLQvdalidtikRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLD 1081
Cdd:cd14878    551 IGKLQ---------KCTPHFIHCIKPNNSKLPD-----------------------------TFDNFYVSAQLQYIGVLE 592
                          730       740
                   ....*....|....*....|....*.
gi 1907082233 1082 AMRMYRQGYPDHMVFSEFRRRFDVLA 1107
Cdd:cd14878    593 MVKIFRYGYPVRLSFSDFLSRYKPLA 618
MYSc_Myo26 cd14887
class XXVI myosin, motor domain; These MyTH-FERM myosins are thought to be related to the ...
397-1151 2.16e-38

class XXVI myosin, motor domain; These MyTH-FERM myosins are thought to be related to the other myosins that have a MyTH4 domain such as class III, VII, IX, X , XV, XVI, XVII, XX, XXII, XXV, and XXXIV. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276852  Cd Length: 725  Bit Score: 155.96  E-value: 2.16e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  397 LHTLRQRYGA--------SLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQS 468
Cdd:cd14887      4 LENLYQRYNKayinkenrNCIYTYTGTLLIAVNPYRFFNLYDRQWISRFDTEANSRLVPHPFGLAEFAYCRLVRDRRSQS 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  469 IVLLGSSGSGKTTSFQHLVQYLATI------AGTSGTKvfsvEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQA 542
Cdd:cd14887     84 ILISGESGAGKTETSKHVLTYLAAVsdrrhgADSQGLE----ARLLQSGPVLEAFGNAHTVLNANSSRFGKMLLLHFTGR 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  543 GQVASASIQTMLLEKLRVARRPASEATFNVFYYLlaCGDATLRTELHlnhlaennvfgivplSKPEEKQKAAQQFSKLQA 622
Cdd:cd14887    160 GKLTRASVATYLLANERVVRIPSDEFSFHIFYAL--CNAAVAAATQK---------------SSAGEGDPESTDLRRITA 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  623 AMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAGRKQFARHEWAQKAAYLLgcSLEELSSAIFKHQLKG 702
Cdd:cd14887    223 AMKTVGIGGGEQADIFKLLAAILHLGNVEFTTDQEPETSKKRKLTSVSVGCEETAADRSHSS--EVKCLSSGLKVTEASR 300
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  703 GTLQRSTSFRQGPEESGLGEGTKLS-----------------ALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSM 765
Cdd:cd14887    301 KHLKTVARLLGLPPGVEGEEMLRLAlvsrsvretrsffdldgAAAARDAACKNLYSRAFDAVVARINAGLQRSAKPSESD 380
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  766 MIVDTP--------------GFQNPEWGGSARgasFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIEL-----AFDD 826
Cdd:cd14887    381 SDEDTPsttgtqtigildlfGFEDLRNHSKNR---LEQLCINYANERLHCFLLEQLILNEHMLYTQEGVFQnqdcsAFPF 457
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  827 LEPVADDSVAAVDQASHLvrSLAHADEARGLLWLLEEEALVPGATEDALLdrlfsyygpqegdkkgQSPLLRSSKPRHFL 906
Cdd:cd14887    458 SFPLASTLTSSPSSTSPF--SPTPSFRSSSAFATSPSLPSSLSSLSSSLS----------------SSPPVWEGRDNSDL 519
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  907 LGHShgtnwVEYNVAGWLNYTKQNPAtqnaprlLQDSQKKIISNLFLGRA---------GSATVLSGSIAGLEGGSQLAL 977
Cdd:cd14887    520 FYEK-----LNKNIINSAKYKNITPA-------LSRENLEFTVSHFACDVtydardfcrANREATSDELERLFLACSTYT 587
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  978 RRATSMRKTFTTGMAAvKKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEGwpgeprsassrrvsssseldlppgdp 1057
Cdd:cd14887    588 RLVGSKKNSGVRAISS-RRSTLSAQFASQLQQVLKALQETSCHFIRCVKPNRVQ-------------------------- 640
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1058 cEAGLlqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLTKKHgrnyivVDEKRAVEELLESLDLE 1137
Cdd:cd14887    641 -EAGI--FEDAYVHRQLRCSGMSDLLRVMADGFPCRLPYVELWRRYETKLPMALREA------LTPKMFCKIVLMFLEIN 711
                          810
                   ....*....|....
gi 1907082233 1138 KSSCCLGLSRVFFR 1151
Cdd:cd14887    712 SNSYTFGKTKIFFR 725
MYSc_Myo47 cd14908
class XLVII myosin, motor domain; The class XLVII myosins are comprised of Stramenopiles. Not ...
395-824 2.56e-37

class XLVII myosin, motor domain; The class XLVII myosins are comprised of Stramenopiles. Not much is known about this myosin class. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276873 [Multi-domain]  Cd Length: 682  Bit Score: 151.98  E-value: 2.56e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFK--GCRRE-------DMAPHIYAVAQTAYRAMLM-SR 464
Cdd:cd14908      2 AILHSLSRRFFRGIIYTWTGPVLIAVNPFQRLPLYGKEILESYRqeGLLRSqgiespqALGPHVFAIADRSYRQMMSeIR 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  465 QDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQALSTL---------LEAFGNSPTIMNGSATRFSQIL 535
Cdd:cd14908     82 ASQSILISGESGAGKTESTKIVMLYLTTLGNGEEGAPNEGEELGKLSIMdrvlqsnpiLEAFGNARTLRNDNSSRFGKFI 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  536 SLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGivplskPEEKQKAAQ 615
Cdd:cd14908    162 ELGFNRAGNLLGAKVQTYLLEKVRLPFHASGERNYHIFYQLLRGGDEEEHEKYEFHDGITGGLQL------PNEFHYTGQ 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  616 -------------QFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAagATKEPLEEqDEAAEAGrkQFARHEWAQKAAY 682
Cdd:cd14908    236 ggapdlreftdedGLVYTLKAMRTMGWEESSIDTILDIIAGLLHLGQ--LEFESKEE-DGAAEIA--EEGNEKCLARVAK 310
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  683 LLGCSLEELSSAIFKHQLKGGTLQRSTSFrqgpeesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALK--SSQH 760
Cdd:cd14908    311 LLGVDVDKLLRALTSKIIVVRGKEITTKL------------TPHKAYDARDALAKTIYGALFLWVVATVNSSINweNDKD 378
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082233  761 SLCSMMIVDTPGFQNPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAF 824
Cdd:cd14908    379 IRSSVGVLDIFGFECFAHN------SFEQLCINFTNEALQQQFNQFIFKLEQKEYEKESIEWAF 436
MYSc_Myo19 cd14880
class XIX myosin, motor domain; Monomeric myosin-XIX (Myo19) functions as an actin-based motor ...
394-824 3.45e-37

class XIX myosin, motor domain; Monomeric myosin-XIX (Myo19) functions as an actin-based motor for mitochondrial movement in vertebrate cells. It contains a variable number of IQ domains. Human myo19 contains a motor domain, three IQ motifs, and a short tail. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276846 [Multi-domain]  Cd Length: 658  Bit Score: 151.54  E-value: 3.45e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRR-EDMAPHIYAVAQTAYRAMLMSRQ--DQSI 469
Cdd:cd14880      1 ETVLRCLQARYTADTFYTNAGCTLVALNPfKPVPQLYSPELMREYHAAPQpQKLKPHIFTVGEQTYRNVKSLIEpvNQSI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  470 VLLGSSGSGKTTSFQHLVQYLATIAG--TSGTKVFSVEKWQAL----STLLEAFGNSPTIMNGSATRFSQILSLDFDQAG 543
Cdd:cd14880     81 VVSGESGAGKTWTSRCLMKFYAVVAAspTSWESHKIAERIEQRilnsNPVMEAFGNACTLRNNNSSRFGKFIQLQLNRAQ 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  544 QVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELhlnHLAENNVFGIVPLSkpeEKQKAAQQFSKLQAA 623
Cdd:cd14880    161 QMTGAAVQTYLLEKTRVACQAPSERNFHIFYQICKGASADERLQW---HLPEGAAFSWLPNP---ERNLEEDCFEVTREA 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  624 MKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAGRKQFARhewaqKAAYLLGCSLEELSSAIFKHQLKGG 703
Cdd:cd14880    235 MLHLGIDTPTQNNIFKVLAGLLHLGNIQFADSEDEAQPCQPMDDTKESVR-----TSALLLKLPEDHLLETLQIRTIRAG 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  704 TLQRstSFRQgpeesglgegtKLSALEC---LEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQN-PEw 778
Cdd:cd14880    310 KQQQ--VFKK-----------PCSRAECdtrRDCLAKLIYARLFDWLVSVINSSICADTDSWTTFIgLLDVYGFESfPE- 375
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082233  779 ggsargASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAF 824
Cdd:cd14880    376 ------NSLEQLCINYANEKLQQHFVAHYLRAQQEEYAVEGLEWSF 415
MYSc_Myo38 cd14899
class XXXVIII myosin; The class XXXVIII myosins are comprised of Stramenopiles. Not much is ...
394-1103 4.06e-37

class XXXVIII myosin; The class XXXVIII myosins are comprised of Stramenopiles. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276864 [Multi-domain]  Cd Length: 717  Bit Score: 151.79  E-value: 4.06e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMF----------KGCRREDMAPHIYAVAQTAYRAMLM 462
Cdd:cd14899      1 ASILNALRLRYERHAIYTHIGDILISINPfQDLPQLYGDEILRGYaydhnsqfgdRVTSTDPREPHLFAVARAAYIDIVQ 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  463 SRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKW-------------QALST--LLEAFGNSPTIMNGS 527
Cdd:cd14899     81 NGRSQSILISGESGAGKTEATKIIMTYFAVHCGTGNNNLTNSESIsppaspsrttieeQVLQSnpILEAFGNARTVRNDN 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  528 ATRFSQILSLDF-DQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLA----CGDATLRTELHLNHLAEN-NVFGI 601
Cdd:cd14899    161 SSRFGKFIELRFrDERRRLAGARIRTYLLEKIRVIKQAPHERNFHIFYELLSadnnCVSKEQKQVLALSGGPQSfRLLNQ 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  602 VPLSKPEEKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEA----AEAGRKQFARHEWA 677
Cdd:cd14899    241 SLCSKRRDGVKDGVQFRATKRAMQQLGMSEGEIGGVLEIVAAVLHMGNVDFEQIPHKGDDTVfadeARVMSSTTGAFDHF 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  678 QKAAYLLGCSLEELSSAIFKHQLKGGtlqrSTSFRQGPEESGLGEGTKLSALEClegmasglYSELFTLLISLVNRALK- 756
Cdd:cd14899    321 TKAAELLGVSTEALDHALTKRWLHAS----NETLVVGVDVAHARNTRNALTMEC--------YRLLFEWLVARVNNKLQr 388
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  757 --------------SSQHSLCSMMIVDTPGFQNpewggsARGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIEL 822
Cdd:cd14899    389 qasapwgadesdvdDEEDATDFIGLLDIFGFED------MAENSFEQLCINYANEALQHQFNQYIFEEEQRLYRDEGIRW 462
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  823 AFDDLEpvadDSVAAVDQASHlvrslahadEARGLLWLLEEEALVPGATEDALLDRlfsYYgpQEGDKKGQSPLLRSS-- 900
Cdd:cd14899    463 SFVDFP----NNRACLELFEH---------RPIGIFSLTDQECVFPQGTDRALVAK---YY--LEFEKKNSHPHFRSApl 524
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  901 --KPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKKIISNLflgRAGSATVLSGSIAGLEGGSQLALR 978
Cdd:cd14899    525 iqRTTQFVVAHYAGC--VTYTIDGFLAKNK-DSFCESAAQLLAGSSNPLIQAL---AAGSNDEDANGDSELDGFGGRTRR 598
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  979 RATSmrktfttgmaAVKKKSLCIQIKLQVDALIDTIKRSKMHFVHCflpvaegwpgeprsassrrvsssseldLPPGDPC 1058
Cdd:cd14899    599 RAKS----------AIAAVSVGTQFKIQLNELLSTVRATTPRYVRC---------------------------IKPNDSH 641
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*
gi 1907082233 1059 EAGLLQldVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRF 1103
Cdd:cd14899    642 VGSLFQ--STRVVEQLRSGGVLEAVRVARAGFPVRLTHKQFLGRY 684
PTZ00014 PTZ00014
myosin-A; Provisional
329-1207 1.87e-36

myosin-A; Provisional


Pssm-ID: 240229 [Multi-domain]  Cd Length: 821  Bit Score: 150.57  E-value: 1.87e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  329 LVHRDgfSLASQLKSEELSLPEGKarvkldhdgaILDVDEDDIEKANAP-SCDRLEDLASLVYLNESSVLHTLRQRYGAS 407
Cdd:PTZ00014    56 LVLPG--STGEKLTLKQIDPPTNS----------TFEVKPEHAFNANSQiDPMTYGDIGLLPHTNIPCVLDFLKHRYLKN 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  408 LLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRR-EDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHL 486
Cdd:PTZ00014   124 QIYTTADPLLVAINPFKDLGNTTNDWIRRYRDAKDsDKLPPHVFTTARRALENLHGVKKSQTIIVSGESGAGKTEATKQI 203
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  487 VQYLAtiAGTSGTKVFSVEK--WQAlSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTMLLEKLRVARRP 564
Cdd:PTZ00014   204 MRYFA--SSKSGNMDLKIQNaiMAA-NPVLEAFGNAKTIRNNNSSRFGRFMQLQLGEEGGIRYGSIVAFLLEKSRVVTQE 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  565 ASEATFNVFYYLLACGDATLRTELHLNHLAENNVfgIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASI 644
Cdd:PTZ00014   281 DDERSYHIFYQLLKGANDEMKEKYKLKSLEEYKY--INPKCLDVPGIDDVKDFEEVMESFDSMGLSESQIEDIFSILSGV 358
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  645 YHLGAAGATKEPLEEQDEAAEAGRKQFarhEWAQKAAYLLGCSLEELS-SAIFKHQLKGGTLQRStsfRQGPEESglgEG 723
Cdd:PTZ00014   359 LLLGNVEIEGKEEGGLTDAAAISDESL---EVFNEACELLFLDYESLKkELTVKVTYAGNQKIEG---PWSKDES---EM 429
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  724 TKLSaleclegMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQ----NpewggsargaSFEELCHNYAQDRL 799
Cdd:PTZ00014   430 LKDS-------LSKAVYEKLFLWIIRNLNATIEPPGGFKVFIGMLDIFGFEvfknN----------SLEQLFINITNEML 492
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  800 QRLFHERTFLQELERYKEDNI---ELAFDDLEPVAD------DSVAAV--DQashlvrSLAhadeargllwlleeealvP 868
Cdd:PTZ00014   493 QKNFVDIVFERESKLYKDEGIsteELEYTSNESVIDllcgkgKSVLSIleDQ------CLA------------------P 548
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  869 GATEDALLDRLFSYYGPQEGDKKGqspllRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKKII 948
Cdd:PTZ00014   549 GGTDEKFVSSCNTNLKNNPKYKPA-----KVDSNKNFVIKHTIGD--IQYCASGFLFKNK-DVLRPELVEVVKASPNPLV 620
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  949 SNLFLGragsATVLSGSIA-GLEGGSQLalrratsMRktfttgmaavkkkslciqiklQVDALIDTIKRSKMHFVHCFLP 1027
Cdd:PTZ00014   621 RDLFEG----VEVEKGKLAkGQLIGSQF-------LN---------------------QLDSLMSLINSTEPHFIRCIKP 668
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1028 vaegwpgeprsassrrvssssELDLPPGDPCEAGLLqldvsllrAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLA 1107
Cdd:PTZ00014   669 ---------------------NENKKPLDWNSSKVL--------IQLHSLSILEALQLRQLGFSYRRTFAEFLSQFKYLD 719
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1108 PHLTKKHGrnyivVDEKRAVEELLESLDLEKSSCCLGLSRVFFR---AGTLARLEEQRDEQTSRHLTLFQAACRGYLARQ 1184
Cdd:PTZ00014   720 LAVSNDSS-----LDPKEKAEKLLERSGLPKDSYAIGKTMVFLKkdaAKELTQIQREKLAAWEPLVSVLEALILKIKKKR 794
                          890       900
                   ....*....|....*....|...
gi 1907082233 1185 HFKKRkIQDLAIrcVQKNIKKNK 1207
Cdd:PTZ00014   795 KVRKN-IKSLVR--IQAHLRRHL 814
MYSc_Myo37 cd14898
class XXXVII myosin, motor domain; The class XXXVIII myosins are comprised of fungi. Not much ...
395-1118 3.18e-36

class XXXVII myosin, motor domain; The class XXXVIII myosins are comprised of fungi. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276863  Cd Length: 578  Bit Score: 147.35  E-value: 3.18e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGpsLLVLSTRGAPAVYSEKVMHMFKGCRREdMAPHIYAVAQTAYRAMLMSrQDQSIVLLGS 474
Cdd:cd14898      2 ATLEILEKRYASGKIYTKSG--LVFLALNPYETIYGAGAMKAYLKNYSH-VEPHVYDVAEASVQDLLVH-GNQTIVISGE 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  475 SGSGKTTSFQHLVQYLatIAGTSGTKvfSVEK-WQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDqaGQVASASIQTM 553
Cdd:cd14898     78 SGSGKTENAKLVIKYL--VERTASTT--SIEKlITAANLILEAFGNAKTQLNDNSSRFGKRIKLKFD--GKITGAKFETY 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  554 LLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSkpeekqkaaQQFSKLQAAMKVLAI-SPE 632
Cdd:cd14898    152 LLEKSRVTHHEKGERNFHIFYQFCASKRLNIKNDFIDTSSTAGNKESIVQLS---------EKYKMTCSAMKSLGIaNFK 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  633 EQKTCwliLASIYHLGAAGATKEpleeqdeaaeaGRKQFARHEWAQKAAYLLGCSLEELSSAIFKH--QLKGGTLQRSTS 710
Cdd:cd14898    223 SIEDC---LLGILYLGSIQFVND-----------GILKLQRNESFTEFCKLHNIQEEDFEESLVKFsiQVKGETIEVFNT 288
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  711 FRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRALK-SSQHSLCsmmIVDTPGFQNPEWGGsargasFEE 789
Cdd:cd14898    289 LKQ--------------ARTIRNSMARLLYSNVFNYITASINNCLEgSGERSIS---VLDIFGFEIFESNG------LDQ 345
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  790 LCHNYAQDRLQRLFHERTFLQELERYKEDNIElaFDDLEPVADDSVaavdqashlvrsLAHADEARGLLWLLEEEALVPG 869
Cdd:cd14898    346 LCINWTNEKIQNDFIKKMFRAKQGMYKEEGIE--WPDVEFFDNNQC------------IRDFEKPCGLMDLISEESFNAW 411
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  870 ATEDALLDRLFSYYGPQEGDKKGQSPLLrsskprhfllghSHGTNWVEYNVAGWLNYTKQNpatqnaprllqdSQKKIIS 949
Cdd:cd14898    412 GNVKNLLVKIKKYLNGFINTKARDKIKV------------SHYAGDVEYDLRDFLDKNREK------------GQLLIFK 467
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  950 NLFLGRAGSatvlsgsiagleggsqlalrratsmrktfttgmaavkKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVA 1029
Cdd:cd14898    468 NLLINDEGS-------------------------------------KEDLVKYFKDSMNKLLNSINETQAKYIKCIRPNE 510
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1030 EGWPgeprsassrrvsssseldlppgdpceaglLQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPH 1109
Cdd:cd14898    511 ECRP-----------------------------WCFDRDLVSKQLAECGILETIRLSKQCFPQEIPKDRFEERYRILGIT 561
                          730
                   ....*....|...
gi 1907082233 1110 LTK----KHGRNY 1118
Cdd:cd14898    562 LFEvvdyRKGRTR 574
MYSc_Myo32 cd14893
class XXXII myosin, motor domain; Class XXXII myosins do not contain any IQ motifs, but ...
394-1109 9.83e-33

class XXXII myosin, motor domain; Class XXXII myosins do not contain any IQ motifs, but possess tandem MyTH4 and FERM domains. The myosin classes XXX to XXXIV contain members from Phytophthora species and Hyaloperonospora parasitica. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276858  Cd Length: 741  Bit Score: 138.18  E-value: 9.83e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRRE----------DMAPHIYAVAQTAYRAMLMS 463
Cdd:cd14893      1 NVALYTLRARYRMEQVYTWVDRVLVGVNPVTPLPIYTPDHMQAYNKSREQtplyekdtvnDAPPHVFALAQNALRCMQDA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  464 RQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQALS----------TLLEAFGNSPTIMNGSATRFSQ 533
Cdd:cd14893     81 GEDQAVILLGGMGAGKSEAAKLIVQYLCEIGDETEPRPDSEGASGVLHpigqqilhafTILEAFGNAATRQNRNSSRFAK 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  534 ILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLAC--GDATLRTELHLNHLAENnvFGIVPLSKPEEKQ 611
Cdd:cd14893    161 MISVEFSKHGHVIGGGFTTHYFEKSRVIDCRSHERNFHVFYQVLAGvqHDPTLRDSLEMNKCVNE--FVMLKQADPLATN 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  612 KA--AQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEPleEQDEAAEAGRKQFARHEWA----QKAAYLLG 685
Cdd:cd14893    239 FAldARDYRDLMSSFSALRIRKNQRVEIVRIVAALLHLGNVDFVPDP--EGGKSVGGANSTTVSDAQScalkDPAQILLA 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  686 CSLEELSSAIFKHQLKggtLQRSTSFRQGPEESGLGEGTKLSALECLEGMASGLYSELFTLLISLVN--------RALKS 757
Cdd:cd14893    317 AKLLEVEPVVLDNYFR---TRQFFSKDGNKTVSSLKVVTVHQARKARDTFVRSLYESLFNFLVETLNgilggifdRYEKS 393
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  758 -----SQhslcSMMIVDTPGFQNPEwggsARGASFEELCHNYAQDRLQRLFHERTfLQELERYKEDNIELAFDDLepvAD 832
Cdd:cd14893    394 nivinSQ----GVHVLDMVGFENLT----PSQNSFDQLCFNYWSEKVHHFYVQNT-LAINFSFLEDESQQVENRL---TV 461
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  833 DSVAAVDQASHLVRSLAHaDEARGLLWLLEEEALVPGATEDALLDRLFS-------YYGPQEGDKKGQSPLLRSSKPR-H 904
Cdd:cd14893    462 NSNVDITSEQEKCLQLFE-DKPFGIFDLLTENCKVRLPNDEDFVNKLFSgneavggLSRPNMGADTTNEYLAPSKDWRlL 540
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  905 FLLGHSHGTnwVEYNVAGwlnYTKQNPATQNA--PRLLQDSQkkiisNLFLGRAGSATVLSGSIAglEGGSQLALRRATS 982
Cdd:cd14893    541 FIVQHHCGK--VTYNGKG---LSSKNMLSISStcAAIMQSSK-----NAVLHAVGAAQMAAASSE--KAAKQTEERGSTS 608
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  983 --MRKTFTTGMAAVK-KKSLCIQIKLQVDALIDTIKRSKMHFVHCflpvaegwpgeprsassrrvsssseldLPPGDPCE 1059
Cdd:cd14893    609 skFRKSASSARESKNiTDSAATDVYNQADALLHALNHTGKNFLVC---------------------------IKPNETLE 661
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1060 AGLlqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPH 1109
Cdd:cd14893    662 EGV--FDSAYVMKQIRMNHLVELMQASRSIFTVHLTYGHFFRRYKNVCGH 709
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1241-1959 1.13e-31

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 136.35  E-value: 1.13e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1241 IQQLRSKLEKVEKERNELRLSSDRLETRISELTSE---LTDERNTGESASQLL----DAETAERLR----TEKEMKELQT 1309
Cdd:TIGR02169  165 VAEFDRKKEKALEELEEVEENIERLDLIIDEKRQQlerLRREREKAERYQALLkekrEYEGYELLKekeaLERQKEAIER 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1310 QYDALKKQMEVMEMEVM------EARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTK-KRLQQELEDKMEVEQQSR 1382
Cdd:TIGR02169  245 QLASLEEELEKLTEEISelekrlEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASlERSIAEKERELEDAEERL 324
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1383 RQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKL 1462
Cdd:TIGR02169  325 AKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINEL 404
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1463 QREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDE-------ASLAKVKKQLRDLEAKVKDQEEE 1535
Cdd:TIGR02169  405 KRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEwkleqlaADLSKYEQELYDLKEEYDRVEKE 484
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1536 LDEQAGSIQMLEqAKLRLEMEMERMRQTHSKEMESRDEEV---------------------------------EEARQSC 1582
Cdd:TIGR02169  485 LSKLQRELAEAE-AQARASEERVRGGRAVEEVLKASIQGVhgtvaqlgsvgeryataievaagnrlnnvvvedDAVAKEA 563
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1583 QKKLKQME----------------------------------VQLEEEYEDKQK----------ALREKREL--ESKLST 1616
Cdd:TIGR02169  564 IELLKRRKagratflplnkmrderrdlsilsedgvigfavdlVEFDPKYEPAFKyvfgdtlvveDIEAARRLmgKYRMVT 643
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1617 L-------------------SDQVNQR-DFESEKRLRKDLKRTKALLADAQIMLDHLKNNA--------PSKREIAQLKN 1668
Cdd:TIGR02169  644 LegelfeksgamtggsraprGGILFSRsEPAELQRLRERLEGLKRELSSLQSELRRIENRLdelsqelsDASRKIGEIEK 723
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1669 QLEESEFTcAAAVKAR--------KAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEI-----QNRLEEDQEDMNE 1735
Cdd:TIGR02169  724 EIEQLEQE-EEKLKERleeleedlSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLearlsHSRIPEIQAELSK 802
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1736 LMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSmvdkslVSRQEAKIRELETRLE----F 1811
Cdd:TIGR02169  803 LEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKE------IENLNGKKEELEEELEeleaA 876
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1812 EKTQVKRLENLAS---RLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHElEMDLES 1888
Cdd:TIGR02169  877 LRDLESRLGDLKKerdELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEE-ELSLED 955
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1889 LEAANQSLQADLklafKRIGDLQAAIEDEMEsdENEDLINSLQDmvtkyqkKKNKLEGDSDvdsELEDRVD 1959
Cdd:TIGR02169  956 VQAELQRVEEEI----RALEPVNMLAIQEYE--EVLKRLDELKE-------KRAKLEEERK---AILERIE 1010
MYSc_Myo21 cd14882
class XXI myosin, motor domain; The myosins here are comprised of insects. Leishmania class ...
395-840 2.43e-31

class XXI myosin, motor domain; The myosins here are comprised of insects. Leishmania class XXI myosins do not group with them. Myo21, unlike other myosin proteins, contains UBA-like protein domains and has no structural or functional relationship with the myosins present in other organisms possessing cilia or flagella. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. They have diverse tails with IQ, WW, PX, and Tub domains. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276848  Cd Length: 642  Bit Score: 132.94  E-value: 2.43e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKvMHMFKGCR-REDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14882      2 NILEELRHRYLMGESYTFIGDILLSLNPNEIKQEYPQE-FHAKYRCKsRSDNAPHIFSVADSAYQDMLHHEEPQHIILSG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLATIA-GTSGTkvfsVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQT 552
Cdd:cd14882     81 ESYSGKTTNARLLIKHLCYLGdGNRGA----TGRVESSIKAILALVNAGTPLNADSTRCILQYQLTFGSTGKMSGAIFWM 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  553 MLLEKLRVARRPASEATFNVFYYLLACGDATLRteLHLNHLAENNVFGIVPLSKPEEKQKA----------AQQFSKLQA 622
Cdd:cd14882    157 YQLEKLRVSTTDGNQSNFHIFYYFYDFIEAQNR--LKEYNLKAGRNYRYLRIPPEVPPSKLkyrrddpegnVERYKEFEE 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  623 AMKVLAISPEEQKTCWLILASIYHLGAAgatkePLEEQDEAAEagrkqFARHEWAQKAAYLLGCSLEELSSAIFKHQL-K 701
Cdd:cd14882    235 ILKDLDFNEEQLETVRKVLAAILNLGEI-----RFRQNGGYAE-----LENTEIASRVAELLRLDEKKFMWALTNYCLiK 304
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  702 GGTLQRStsfRQGPEEsglgegtklsALECLEGMASGLYSELFTLLISLVN------RALKSSQHSLcsmMIVDTPGFQN 775
Cdd:cd14882    305 GGSAERR---KHTTEE----------ARDARDVLASTLYSRLVDWIINRINmkmsfpRAVFGDKYSI---SIHDMFGFEC 368
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082233  776 PEWGGsargasFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIelafddlePVAD----DSVAAVDQ 840
Cdd:cd14882    369 FHRNR------LEQLMVNTLNEQMQYHYNQRIFISEMLEMEEEDI--------PTINlrfyDNKTAVDQ 423
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1365-1945 1.42e-30

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 132.37  E-value: 1.42e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1365 KRLQQELED-KMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDS 1443
Cdd:COG1196    216 RELKEELKElEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLA 295
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1444 ELSQAHEETQREKLQREKLQREKDmllaeafSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLR 1523
Cdd:COG1196    296 ELARLEQDIARLEERRRELEERLE-------ELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALL 368
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1524 DLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMEsRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKA 1603
Cdd:COG1196    369 EAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLE-RLERLEEELEELEEALAELEEEEEEEEEALEEA 447
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1604 LREKRELESKLSTLSDQVnqrdfeseKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESeftcAAAVKA 1683
Cdd:COG1196    448 AEEEAELEEEEEALLELL--------AELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEG----VKAALL 515
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1684 RKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQReknEIQNRLEEDQEDMNELMKKHKAAVA-------QASRDMAQMND 1756
Cdd:COG1196    516 LAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQ---NIVVEDDEVAAAAIEYLKAAKAGRAtflpldkIRARAALAAAL 592
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1757 LQAQIEESNKEKQELQEKLQALQSQveFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEE 1836
Cdd:COG1196    593 ARGAIGAAVDLVASDLREADARYYV--LGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRREL 670
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1837 RDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIED 1916
Cdd:COG1196    671 LAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEE 750
                          570       580       590
                   ....*....|....*....|....*....|
gi 1907082233 1917 EMESDENEDL-INSLQDMVTKYQKKKNKLE 1945
Cdd:COG1196    751 EALEELPEPPdLEELERELERLEREIEALG 780
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1225-1947 1.51e-30

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 132.49  E-value: 1.51e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1225 IQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERntgesasqlldaetAERLRTEKEM 1304
Cdd:TIGR02168  218 LKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELR--------------LEVSELEEEI 283
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1305 KELQTQYDALKkqmevmemevmearliraAEINgevddddaggewrlkyeRAVREVDFTKKRLQQeLEDKMEVEQQSRRQ 1384
Cdd:TIGR02168  284 EELQKELYALA------------------NEIS-----------------RLEQQKQILRERLAN-LERQLEELEAQLEE 327
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1385 LERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQR 1464
Cdd:TIGR02168  328 LESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEA 407
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1465 EKDMLLAEAFSLKQQMEEKDLDIAgftqkvvslEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQ 1544
Cdd:TIGR02168  408 RLERLEDRRERLQQEIEELLKKLE---------EAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALD 478
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1545 M----LEQAKLRLEMeMERMRQTHSKEMESRDEEVEEARQ------------SCQKKLKQ-MEVQLEE-----EYEDKQK 1602
Cdd:TIGR02168  479 AaereLAQLQARLDS-LERLQENLEGFSEGVKALLKNQSGlsgilgvlseliSVDEGYEAaIEAALGGrlqavVVENLNA 557
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1603 ALRE----KRELESKLSTLS-DQVNQRDFESEKRLRKDLKRTKALLAD--------AQIMLDHLKNN---APSKREIAQL 1666
Cdd:TIGR02168  558 AKKAiaflKQNELGRVTFLPlDSIKGTEIQGNDREILKNIEGFLGVAKdlvkfdpkLRKALSYLLGGvlvVDDLDNALEL 637
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1667 KNQLEESE----------FTCAAAVKARK-------AMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEED 1729
Cdd:TIGR02168  638 AKKLRPGYrivtldgdlvRPGGVITGGSAktnssilERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQL 717
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1730 QEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMV-DKSLVSRQEAKIRELETR 1808
Cdd:TIGR02168  718 RKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAeAEAEIEELEAQIEQLKEE 797
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1809 LEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLES 1888
Cdd:TIGR02168  798 LKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEA 877
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1889 LEAANQSLQADLKLAFKRIGDLQAAIED-EMESDENEDLINSLQDMVTKYQKKKNKLEGD 1947
Cdd:TIGR02168  878 LLNERASLEEALALLRSELEELSEELRElESKRSELRRELEELREKLAQLELRLEGLEVR 937
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1239-1946 2.85e-30

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 131.45  E-value: 2.85e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1239 EEI-QQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDErntgESASQLLDAEtaeRLRTEKEMKELQTQYDALKKQ 1317
Cdd:pfam01576   74 EEIlHELESRLEEEEERSQQLQNEKKKMQQHIQDLEEQLDEE----EAARQKLQLE---KVTTEAKIKKLEEDILLLEDQ 146
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1318 MEVMEMEVMEARLiRAAEINGEVDDDDAG----GEWRLKYERAVREVDftkKRLQQELEDKMEVEQqSRRQLERRLGDLQ 1393
Cdd:pfam01576  147 NSKLSKERKLLEE-RISEFTSNLAEEEEKakslSKLKNKHEAMISDLE---ERLKKEEKGRQELEK-AKRKLEGESTDLQ 221
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1394 ADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEA 1473
Cdd:pfam01576  222 EQIAELQAQIAELRAQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISELQEDLESERAARNKAEKQRRDLGEEL 301
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1474 FSLKQQMEEkdldiagfTQKVVSLEAELQdissqeSKDEASLAKVKK-----------QLRDLEAKVKDQEEELDEQags 1542
Cdd:pfam01576  302 EALKTELED--------TLDTTAAQQELR------SKREQEVTELKKaleeetrsheaQLQEMRQKHTQALEELTEQ--- 364
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1543 iqmLEQAKlRLEMEMERMRQTHSKEMESRDEEV---EEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLS- 1618
Cdd:pfam01576  365 ---LEQAK-RNKANLEKAKQALESENAELQAELrtlQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQs 440
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1619 --DQVNQRDFESEK---RLRKDLKRTKALLADAQIML-------------------------DHLKNNAPSKREI----- 1663
Cdd:pfam01576  441 elESVSSLLNEAEGkniKLSKDVSSLESQLQDTQELLqeetrqklnlstrlrqledernslqEQLEEEEEAKRNVerqls 520
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1664 ------AQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQRE----------KNEIQNRLE 1727
Cdd:pfam01576  521 tlqaqlSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQElddllvdldhQRQLVSNLE 600
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1728 EDQEDMNELMKKHKAAVAQAS--RDMAQMN---------DLQAQIEESNKEKQELQEKLQALQSQVEFLEQSM--VDKSL 1794
Cdd:pfam01576  601 KKQKKFDQMLAEEKAISARYAeeRDRAEAEareketralSLARALEEALEAKEELERTNKQLRAEMEDLVSSKddVGKNV 680
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1795 VSRQEAKiRELETRLEFEKTQVKRL-------ENLASRLKETME--KLTEERDQRAAAENREkEQNKRLQRQLRDTKEEM 1865
Cdd:pfam01576  681 HELERSK-RALEQQVEEMKTQLEELedelqatEDAKLRLEVNMQalKAQFERDLQARDEQGE-EKRRQLVKQVRELEAEL 758
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1866 SELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMEsdENEDLINSLQDMVTKYQKKKNKLE 1945
Cdd:pfam01576  759 EDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAVKQLKKLQAQMKDLQR--ELEEARASRDEILAQSKESEKKLK 836

                   .
gi 1907082233 1946 G 1946
Cdd:pfam01576  837 N 837
MYSc_Myo25 cd14886
class XXV myosin, motor domain; These myosins are MyTH-FERM myosins that play a role in cell ...
396-821 1.24e-29

class XXV myosin, motor domain; These myosins are MyTH-FERM myosins that play a role in cell adhesion and filopodia formation. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276851  Cd Length: 650  Bit Score: 127.70  E-value: 1.24e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  396 VLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRRE-----DMAPHIYAVAQTAYRAMLMSRQDQSI 469
Cdd:cd14886      3 VIDILRDRFAKDKIYTYAGKLLVALNPfKQIRNLYGTEVIGRYRQADTSrgfpsDLPPHSYAVAQSALNGLISDGISQSC 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  470 VLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVekwqALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVAS 547
Cdd:cd14886     83 IVSGESGAGKTETAKQLMNFFAYGHSTSSTDVQSL----ILGSnpLLESFGNAKTLRNNNSSRFGKFIKLLVGPDGGLKG 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  548 ASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVF-GIVPLSKPE-EKQKaaqQFSKLQAAMK 625
Cdd:cd14886    159 GKITSYMLELSRIEFQSTNERNYHIFYQCIKGLSPEEKKSLGFKSLESYNFLnASKCYDAPGiDDQK---EFAPVRSQLE 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  626 VLaISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAeagrkQFARHEWAQKAAYLLGCSLEELSSAIfkhqlkggtL 705
Cdd:cd14886    236 KL-FSKNEIDSFYKCISGILLAGNIEFSEEGDMGVINAA-----KISNDEDFGKMCELLGIESSKAAQAI---------I 300
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  706 QRSTSFRQGPEESGLgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsarga 785
Cdd:cd14886    301 TKVVVINNETIISPV---TQAQAEVNIRAVAKDLYGALFELCVDTLNEIIQFDADARPWIGILDIYGFEFFERN------ 371
                          410       420       430
                   ....*....|....*....|....*....|....*.
gi 1907082233  786 SFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE 821
Cdd:cd14886    372 TYEQLLINYANERLQQYFINQVFKSEIQEYEIEGID 407
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1250-1893 1.50e-29

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 128.90  E-value: 1.50e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1250 KVEKERNELRLSS--DRLEtRISELTSELtdERNTGESASQlldAETAERLRtekemkELQTQYDALKKQMEVMEMEVME 1327
Cdd:COG1196    171 KERKEEAERKLEAteENLE-RLEDILGEL--ERQLEPLERQ---AEKAERYR------ELKEELKELEAELLLLKLRELE 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1328 ARLIRAAEINGEVDDDDAggewRLKYERAVREVDFTKKRLQ-QELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQL 1406
Cdd:COG1196    239 AELEELEAELEELEAELE----ELEAELAELEAELEELRLElEELELELEEAQAEEYELLAELARLEQDIARLEERRREL 314
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1407 KKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLD 1486
Cdd:COG1196    315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA 394
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1487 IAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSK 1566
Cdd:COG1196    395 AAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAAL 474
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1567 EMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLrkdlkrtkALLADA 1646
Cdd:COG1196    475 LEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAAL--------EAALAA 546
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1647 QIMLDHLKNNAPSKREIAQLK-NQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNR 1725
Cdd:COG1196    547 ALQNIVVEDDEVAAAAIEYLKaAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRT 626
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1726 LEEDQEDM-NELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEfleqsmvdkslvSRQEAKIRE 1804
Cdd:COG1196    627 LVAARLEAaLRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELA------------ERLAEEELE 694
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1805 LETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEM 1884
Cdd:COG1196    695 LEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLER 774

                   ....*....
gi 1907082233 1885 DLESLEAAN 1893
Cdd:COG1196    775 EIEALGPVN 783
MYSc_Myo13 cd14875
class XIII myosin, motor domain; These myosins have an N-terminal motor domain, a light-chain ...
447-1113 2.00e-29

class XIII myosin, motor domain; These myosins have an N-terminal motor domain, a light-chain binding domain, and a C-terminal GPA/Q-rich domain. There is little known about the function of this myosin class. Two of the earliest members identified in this class are green alga Acetabularia cliftonii, Aclmyo1 and Aclmyo2. They are striking with their short tail of Aclmyo1 of 18 residues and the maximum of 7 IQ motifs in Aclmyo2. It is thought that these myosins are involved in organelle transport and tip growth. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276842 [Multi-domain]  Cd Length: 664  Bit Score: 127.23  E-value: 2.00e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  447 PHIYAVAQTAYRAMLM-SRQDQSIVLLGSSGSGKTTSFQHLVQYLA---------TIAGTSGTKVFSVEKWQalSTLLEA 516
Cdd:cd14875     56 PHIWQVAHKAFNAIFVqGLGNQSVVISGESGSGKTENAKMLIAYLGqlsymhssnTSQRSIADKIDENLKWS--NPVMES 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  517 FGNSPTIMNGSATRFSQILSLDFDQA-GQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDAT-------LRTEL 588
Cdd:cd14875    134 FGNARTVRNDNSSRFGKYIKLYFDPTsGVMVGGQTVTYLLEKSRIIMQSPGERNYHIFYEMLAGLSPEekkelggLKTAQ 213
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  589 HLNHLAENNVF---GI--VPLSKPEEkqkaaqqFSKLQAAMKVLAISPEEQKTCWLILASIYHLgaagatkepLEEQDEA 663
Cdd:cd14875    214 DYKCLNGGNTFvrrGVdgKTLDDAHE-------FQNVRHALSMIGVELETQNSIFRVLASILHL---------MEVEFES 277
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  664 AEAGRKQFARHEwaqkaAYLLGCSLEELSSAifkhQLKGGTLQRS-----TSFRQGPEESGLgegtklsalecLEGMASG 738
Cdd:cd14875    278 DQNDKAQIADET-----PFLTACRLLQLDPA----KLRECFLVKSktslvTILANKTEAEGF-----------RNAFCKA 337
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  739 LYSELFTLLISLVNRALKSSQH-SLCSMM-IVDTPGFQNpewggsARGASFEELCHNYAQDRLQRLFHERTFLQELERYK 816
Cdd:cd14875    338 IYVGLFDRLVEFVNASITPQGDcSGCKYIgLLDIFGFEN------FTRNSFEQLCINYANESLQNHYNKYTFINDEEECR 411
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  817 EDNIE---LAFDDlepvADDSVAAVDQASHLVRSLahADEargllwlleeEALVPGATEDALLDRLFSYYGpqegdkkGQ 893
Cdd:cd14875    412 REGIQipkIEFPD----NSECVNMFDQKRTGIFSM--LDE----------ECNFKGGTTERFTTNLWDQWA-------NK 468
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  894 SPLL---RSSKPRHFllGHSHGTNWVEYNVAGWLNytKQNPA-TQNAPRLLQDSQKKIISNLFLGRAGSAtvlsgsiagl 969
Cdd:cd14875    469 SPYFvlpKSTIPNQF--GVNHYAAFVNYNTDEWLE--KNTDAlKEDMYECVSNSTDEFIRTLLSTEKGLA---------- 534
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  970 eggsqlalRRatsmrktfttgmaavkKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssse 1049
Cdd:cd14875    535 --------RR----------------KQTVAIRFQRQLTDLRTELESTETQFIRCIKPNMEASPS--------------- 575
                          650       660       670       680       690       700
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1050 ldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLTKK 1113
Cdd:cd14875    576 --------------FLDNLLVGSQLESAGVLQTIALKRQGYPVRRPIEQFCRYFYLIMPRSTAS 625
MYSc_Myo12 cd14874
class XXXIII myosin, motor domain; Little is known about the XXXIII class of myosins. They ...
394-1108 1.27e-28

class XXXIII myosin, motor domain; Little is known about the XXXIII class of myosins. They are found predominately in nematodes. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276841 [Multi-domain]  Cd Length: 628  Bit Score: 124.60  E-value: 1.27e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVmhmFKGCrredmapHIYAVAQTAYRAML-MSRQDQSIVLL 472
Cdd:cd14874      1 AGIAQNLHERFKKGQTYTKASNVLVFVNDFNKLSIQDQLV---IKKC-------HISGVAENALDRIKsMSSNAESIVFG 70
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  473 GSSGSGKTTSFQHLVQYLATIAGTSGTKVFSvekwQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQT 552
Cdd:cd14874     71 GESGSGKSYNAFQVFKYLTSQPKSKVTTKHS----SAIESVFKSFGCAKTLKNDEATRFGCSIDLLYKRNVLTGLNLKYT 146
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  553 MLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLaeNNVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPE 632
Cdd:cd14874    147 VPLEVPRVISQKPGERNFNVFYEVYHGLNDEMKAKFGIKGL--QKFFYINQGNSTENIQSDVNHFKHLEDALHVLGFSDD 224
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  633 EQKTCWLILASIYHLGAA--GATKEPLEEQDeAAEAGrkQFARHEWaqkAAYLLGCSLEELSSAIFKHQLKGGTLQRSts 710
Cdd:cd14874    225 HCISIYKIISTILHIGNIyfRTKRNPNVEQD-VVEIG--NMSEVKW---VAFLLEVDFDQLVNFLLPKSEDGTTIDLN-- 296
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  711 frqgpeesglgegtklSALECLEGMASGLYSELFTLLISLVNRALKSSQHSlCSMMIVDTPGFQNPEWGGsargasFEEL 790
Cdd:cd14874    297 ----------------AALDNRDSFAMLIYEELFKWVLNRIGLHLKCPLHT-GVISILDHYGFEKYNNNG------VEEF 353
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  791 CHNYAQDRLQRLFHERTFLQELERYKEDNIELafddlepvaDDSVAAVDQASHLVRSLAHADEarGLLWLLEEEALVPGA 870
Cdd:cd14874    354 LINSVNERIENLFVKHSFHDQLVDYAKDGISV---------DYKVPNSIENGKTVELLFKKPY--GLLPLLTDECKFPKG 422
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  871 TEDALLDRL------FSYYGPQegdkkgqspllRSSKPRHFLLGHSHGTNWveYNVAGWLNYTKQNpATQNAPRLLQDSQ 944
Cdd:cd14874    423 SHESYLEHCnlnhtdRSSYGKA-----------RNKERLEFGVRHCIGTTW--YNVTDFFSRNKRI-ISLSAVQLLRSSK 488
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  945 KKIISNLFLGRAGSATVLSGSIAgleggsQLALRRATSmrktfttgmaavkkkslciqiklqvdaLIDTIKRSKMHFVHC 1024
Cdd:cd14874    489 NPIIGLLFESYSSNTSDMIVSQA------QFILRGAQE---------------------------IADKINGSHAHFVRC 535
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1025 FLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFD 1104
Cdd:cd14874    536 IKSNNERQPK-----------------------------KFDIPLVNRQIKNLLLAELLSFRIKGYPVKISKTTFARQYR 586

                   ....
gi 1907082233 1105 VLAP 1108
Cdd:cd14874    587 CLLP 590
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1225-1956 1.94e-28

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 125.67  E-value: 1.94e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1225 IQVQLSE--EQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERntgesasqlldaetAERLRTEK 1302
Cdd:pfam01576  227 LQAQIAElrAQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISELQEDLESER--------------AARNKAEK 292
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1303 EMKELQTQYDALKkqmevmemevmearliraAEINGEVDDDDAGGEWRLKYERAVREVdftKKRLQQELEDKMEVEQQSR 1382
Cdd:pfam01576  293 QRRDLGEELEALK------------------TELEDTLDTTAAQQELRSKREQEVTEL---KKALEEETRSHEAQLQEMR 351
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1383 RQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKL 1462
Cdd:pfam01576  352 QKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAEL 431
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1463 QREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGS 1542
Cdd:pfam01576  432 AEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQLQDTQELLQEETRQKLNLSTRLRQLEDERNSLQEQLEEEEEA 511
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1543 IQMLEQAKLRLEMEMERMRQtHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLS-DQV 1621
Cdd:pfam01576  512 KRNVERQLSTLQAQLSDMKK-KLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQELDDLLvDLD 590
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1622 NQRDFES--EKRLRK---DLKRTKALLADAQIMLDHLKNNAPSKREIA-QLKNQLEESEFTCAAAVKARKAMEVEMEDLH 1695
Cdd:pfam01576  591 HQRQLVSnlEKKQKKfdqMLAEEKAISARYAEERDRAEAEAREKETRAlSLARALEEALEAKEELERTNKQLRAEMEDLV 670
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1696 LQIDDIAK-------AKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDmaqmndLQAQIEESnkek 1768
Cdd:pfam01576  671 SSKDDVGKnvhelerSKRALEQQVEEMKTQLEELEDELQATEDAKLRLEVNMQALKAQFERD------LQARDEQG---- 740
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1769 qelQEKLQALQSQVefleqsmvdkslvsrqeakiRELETRLEFEKTQvkRLENLASRLKETMEKLTEERDQRAAAENREK 1848
Cdd:pfam01576  741 ---EEKRRQLVKQV--------------------RELEAELEDERKQ--RAQAVAAKKKLELDLKELEAQIDAANKGREE 795
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1849 --EQNKRLQRQLRDTKEEMSE--LARKEAEASRKkhELEMDLESLEAANQSLQADLKLAFKrigdlqAAIEDEMESDENE 1924
Cdd:pfam01576  796 avKQLKKLQAQMKDLQRELEEarASRDEILAQSK--ESEKKLKNLEAELLQLQEDLAASER------ARRQAQQERDELA 867
                          730       740       750
                   ....*....|....*....|....*....|...
gi 1907082233 1925 DLINSLQDMVTKYQKKKNKLEGD-SDVDSELED 1956
Cdd:pfam01576  868 DEIASGASGKSALQDEKRRLEARiAQLEEELEE 900
MYSc_Myo14 cd14876
class XIV myosin, motor domain; These myosins localize to plasma membranes of the ...
396-1027 2.02e-28

class XIV myosin, motor domain; These myosins localize to plasma membranes of the intracellular parasites and may be involved in the cell invasion process. Their known functions include: transporting phagosomes to the nucleus and perturbing the developmentally regulated elimination of the macronucleus during conjugation. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. C-terminal to their motor domain these myosins have a MyTH4-FERM protein domain combination. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276843  Cd Length: 649  Bit Score: 123.94  E-value: 2.02e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  396 VLHTLRQRYGASLLHTYAGPSLLVLST---------------RGAPAVysekvmhmfkgcrrEDMAPHIYAVAQTAYRAM 460
Cdd:cd14876      3 VLDFLKHRYLKNQIYTTADPLLVAINPfkdlgnatdewirkyRDAPDL--------------TKLPPHVFYTARRALENL 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  461 LMSRQDQSIVLLGSSGSGKTTSFQHLVQYLAtiAGTSGTKVFSVEKW-QALSTLLEAFGNSPTIMNGSATRFSQILSLDF 539
Cdd:cd14876     69 HGVNKSQTIIVSGESGAGKTEATKQIMRYFA--SAKSGNMDLRIQTAiMAANPVLEAFGNAKTIRNNNSSRFGRFMQLDV 146
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  540 DQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENN--------VFGIVPLskpeekq 611
Cdd:cd14876    147 ASEGGIRYGSVVAFLLEKSRIVTQDDNERSYHIFYQLLKGADSEMKSKYHLLGLKEYKflnpkcldVPGIDDV------- 219
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  612 kaaQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEPLEEQDEAAEAGRkqfARHEWAQKAAYLLGCSLEEL 691
Cdd:cd14876    220 ---ADFEEVLESLKSMGLTEEQIDTVFSIVSGVLLLGNVKITGKTEQGVDDAAAISN---ESLEVFKEACSLLFLDPEAL 293
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  692 SSAIF-KHQLKGGtlQRSTSFRQGPEesglGEGTKLSaleclegMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDT 770
Cdd:cd14876    294 KRELTvKVTKAGG--QEIEGRWTKDD----AEMLKLS-------LAKAMYDKLFLWIIRNLNSTIEPPGGFKNFMGMLDI 360
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  771 PGF---QNpewggsargASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNI---ELAFDDLEPVAD------DSVAAV 838
Cdd:cd14876    361 FGFevfKN---------NSLEQLFINITNEMLQKNFIDIVFERESKLYKDEGIptaELEYTSNAEVIDvlcgkgKSVLSI 431
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  839 --DQAshlvrsLAhadeargllwlleeealvPGATEDALLDRLFSyygpQEGDKKGQSPLLRSSKpRHFLLGHSHGTnwV 916
Cdd:cd14876    432 leDQC------LA------------------PGGSDEKFVSACVS----KLKSNGKFKPAKVDSN-INFIVVHTIGD--I 480
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  917 EYNVAGWLNYTKqNPATQNAPRLLQDSQKKIISNLFLGragsATVLSGSIA-GLEGGSQLalrratsMRktfttgmaavk 995
Cdd:cd14876    481 QYNAEGFLFKNK-DVLRAELVEVVQASTNPVVKALFEG----VVVEKGKIAkGSLIGSQF-------LK----------- 537
                          650       660       670
                   ....*....|....*....|....*....|..
gi 1907082233  996 kkslciqiklQVDALIDTIKRSKMHFVHCFLP 1027
Cdd:cd14876    538 ----------QLESLMGLINSTEPHFIRCIKP 559
MYSc_Myo23 cd14884
class XXIII myosin, motor domain; These myosins are predicted to have a neck region with 1-2 ...
395-1027 1.72e-27

class XXIII myosin, motor domain; These myosins are predicted to have a neck region with 1-2 IQ motifs and a single MyTH4 domain in its C-terminal tail. The lack of a FERM domain here is odd since MyTH4 domains are usually found alongside FERM domains where they bind to microtubules. At any rate these Class XXIII myosins are still proposed to function in the apicomplexan microtubule cytoskeleton. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276850 [Multi-domain]  Cd Length: 685  Bit Score: 121.17  E-value: 1.72e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMF-------KGCRREDMAPHIYAVAQTAYRAMLMSRQD 466
Cdd:cd14884      2 NVLQNLKNRYLKNKIYTFHASLLLALNPyKPLKELYDQDVMNVYlhkksnsAASAAPFPKAHIYDIANMAYKNMRGKLKR 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  467 QSIVLLGSSGSGKTTSFQHLVQYLATIAGTSgTKVFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQ----- 541
Cdd:cd14884     82 QTIVVSGHSGSGKTENCKFLFKYFHYIQTDS-QMTERIDKLIYINNILESMSNATTIKNNNSSRCGRINLLIFEEventq 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  542 ----AGQVASASIQTMLLEKLRVARRPASEATFNVFYYLL-ACGDATLRT-----ELHLNHLAENNVFGIVPLSK----- 606
Cdd:cd14884    161 knmfNGCFRNIKIKILLLEINRCIAHNFGERNFHVFYQVLrGLSDEDLARrnlvrNCGVYGLLNPDESHQKRSVKgtlrl 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  607 --------PEEKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGaagatkepleeqdeaaeagrkqfarhEWAQ 678
Cdd:cd14884    241 gsdsldpsEEEKAKDEKNFVALLHGLHYIKYDERQINEFFDIIAGILHLG--------------------------NRAY 294
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  679 KAAyllgCSLEELSSAIFKHQLKGGTLQ-RSTSFRQgpeesglgEGTKLSALECLEGMASGLYSELFTLLISLVNRALKS 757
Cdd:cd14884    295 KAA----AECLQIEEEDLENVIKYKNIRvSHEVIRT--------ERRKENATSTRDTLIKFIYKKLFNKIIEDINRNVLK 362
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  758 SQ--------------HSLCSmmIVDTPGFQnpewggSARGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIeLA 823
Cdd:cd14884    363 CKekdesdnediysinEAIIS--ILDIYGFE------ELSGNDFDQLCINLANEKLNNYYINNEIEKEKRIYARENI-IC 433
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  824 FDDLEPVADDSVAAVD------------QASHLVRSLAH--ADEARGLLWLLEEEALVPGatedALLDRLFSYYGPQEGD 889
Cdd:cd14884    434 CSDVAPSYSDTLIFIAkifrrldditklKNQGQKKTDDHffRYLLNNERQQQLEGKVSYG----FVLNHDADGTAKKQNI 509
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  890 KKGQspllrsskprhFLLGHSHGTnwVEYNVAGWLnYTKQNPATQNAPRLLQDSqkkiiSNLFLGRAgsatvlsgsiagL 969
Cdd:cd14884    510 KKNI-----------FFIRHYAGL--VTYRINNWI-DKNSDKIETSIETLISCS-----SNRFLREA------------N 558
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082233  970 EGGSQlalRRATSMRKTFttgmaavkkkslciqIKlQVDALIDTIKRSKMHFVHCFLP 1027
Cdd:cd14884    559 NGGNK---GNFLSVSKKY---------------IK-ELDNLFTQLQSTDMYYIRCFLP 597
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1231-1893 1.61e-26

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 119.39  E-value: 1.61e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1231 EEQIRNKDEEIQQLRSKLEKVEKERNELR-----LSSD--RLETRISELTSELTDERNTGESASQLLDAETAERLRTEKE 1303
Cdd:TIGR02168  259 TAELQELEEKLEELRLEVSELEEEIEELQkelyaLANEisRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEE 338
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1304 MKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQeLEDKMEVEQQSRR 1383
Cdd:TIGR02168  339 LAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIER-LEARLERLEDRRE 417
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1384 QLERRLGDLQADSDESQRALQQLKkkCQRLTAELQDTKLHLEGQQVR----NHELEKKQRRFDSELSQAHEETQR----E 1455
Cdd:TIGR02168  418 RLQQEIEELLKKLEEAELKELQAE--LEELEEELEELQEELERLEEAleelREELEEAEQALDAAERELAQLQARldslE 495
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1456 KLQR--EKLQREKDMLLAEAF-------SLKQQME-----EKDLDIA--GFTQKVV--SLEAELQDISSQESKDE---AS 1514
Cdd:TIGR02168  496 RLQEnlEGFSEGVKALLKNQSglsgilgVLSELISvdegyEAAIEAAlgGRLQAVVveNLNAAKKAIAFLKQNELgrvTF 575
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1515 LAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERM--------------RQTHSKEMESR--------- 1571
Cdd:TIGR02168  576 LPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLlggvlvvddldnalELAKKLRPGYRivtldgdlv 655
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1572 ---------DEEVEEARQSCQKKLKQMEVQ---LEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRL---RKDL 1636
Cdd:TIGR02168  656 rpggvitggSAKTNSSILERRREIEELEEKieeLEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQIsalRKDL 735
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1637 KRTKALLADAQIMLDHL-KNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRL 1715
Cdd:TIGR02168  736 ARLEAEVEQLEERIAQLsKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLL 815
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1716 QREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVD-KSL 1794
Cdd:TIGR02168  816 NEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALlRSE 895
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1795 VSRQEAKIRELETRlefektqVKRLENLasrLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARK--- 1871
Cdd:TIGR02168  896 LEELSEELRELESK-------RSELRRE---LEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKied 965
                          730       740
                   ....*....|....*....|...
gi 1907082233 1872 -EAEASRKKHELEMDLESLEAAN 1893
Cdd:TIGR02168  966 dEEEARRRLKRLENKIKELGPVN 988
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1229-1933 4.80e-26

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 117.86  E-value: 4.80e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1229 LSEEqiRNKDEEIQQLRSKLEKVE------------KERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAE 1296
Cdd:TIGR02169  203 LRRE--REKAERYQALLKEKREYEgyellkekealeRQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKK 280
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1297 -RLRTEKEMKELQTQYDALKKQMEVMEMEVMEA--RLIRAAE--INGEVDDDDAGGEWRlKYERAVREVDFTKKRLQQEL 1371
Cdd:TIGR02169  281 iKDLGEEEQLRVKEKIGELEAEIASLERSIAEKerELEDAEErlAKLEAEIDKLLAEIE-ELEREIEEERKRRDKLTEEY 359
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1372 EDKmeveQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDtklhLEGQQVRnheLEKKQRRFDSELSQAHEE 1451
Cdd:TIGR02169  360 AEL----KEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINE----LKRELDR---LQEELQRLSEELADLNAA 428
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1452 TQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKV-- 1529
Cdd:TIGR02169  429 IAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVrg 508
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1530 -KDQEEELD------------------------EQAGS-----------------IQMLEQAKL-RLE-MEMERMRQTHS 1565
Cdd:TIGR02169  509 gRAVEEVLKasiqgvhgtvaqlgsvgeryataiEVAAGnrlnnvvveddavakeaIELLKRRKAgRATfLPLNKMRDERR 588
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1566 -KEMESRD------------------------------EEVEEAR----------------------------------- 1579
Cdd:TIGR02169  589 dLSILSEDgvigfavdlvefdpkyepafkyvfgdtlvvEDIEAARrlmgkyrmvtlegelfeksgamtggsraprggilf 668
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1580 --------QSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEkRLRKDLKRTKALLADAQIMLD 1651
Cdd:TIGR02169  669 srsepaelQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIE-QLEQEEEKLKERLEELEEDLS 747
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1652 HLKNN-APSKREIAQLKNQLEESEftcAAAVKARKAME-VEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEED 1729
Cdd:TIGR02169  748 SLEQEiENVKSELKELEARIEELE---EDLHKLEEALNdLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRL 824
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1730 QEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVD-KSLVSRQEAKIRELETR 1808
Cdd:TIGR02169  825 TLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDlKKERDELEAQLRELERK 904
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1809 LEFEKTQVKRLENLASRLKETMEKLteerdqraaaenreKEQNKRLQRQLRDTKEEMSELARKEAEASRKKhELEMDLES 1888
Cdd:TIGR02169  905 IEELEAQIEKKRKRLSELKAKLEAL--------------EEELSEIEDPKGEDEEIPEEELSLEDVQAELQ-RVEEEIRA 969
                          810       820       830       840
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082233 1889 LEAANQSLQADLKLAFKRIGDLQAAIED-EMESDENEDLINSLQDM 1933
Cdd:TIGR02169  970 LEPVNMLAIQEYEEVLKRLDELKEKRAKlEEERKAILERIEEYEKK 1015
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1368-1944 5.37e-25

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 113.58  E-value: 5.37e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1368 QQELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKlhlegqqvrNHELEKKQR--RFDSEL 1445
Cdd:TIGR04523   35 EKQLEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLN---------DKLKKNKDKinKLNSDL 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1446 SQAHEETQREKLQREKLQREKDmllaeafSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKdeaslakVKKQLRDL 1525
Cdd:TIGR04523  106 SKINSEIKNDKEQKNKLEVELN-------KLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYND-------LKKQKEEL 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1526 EAKVKDQEEELDEQAGSIQMLEQAKLRLEMEM---ERMRQTHsKEMESRDEEVEEARQSCQKKLKQmevqLEEEYEDKQK 1602
Cdd:TIGR04523  172 ENELNLLEKEKLNIQKNIDKIKNKLLKLELLLsnlKKKIQKN-KSLESQISELKKQNNQLKDNIEK----KQQEINEKTT 246
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1603 ALREKrelESKLSTLSDQvNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNnapskrEIAQLKNQLEESeftCAAAVK 1682
Cdd:TIGR04523  247 EISNT---QTQLNQLKDE-QNKIKKQLSEKQKELEQNNKKIKELEKQLNQLKS------EISDLNNQKEQD---WNKELK 313
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1683 AR-KAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKN-------EIQNRLEEDQEDMnELMKKHKAAVAQASRDM-AQ 1753
Cdd:TIGR04523  314 SElKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTnsesensEKQRELEEKQNEI-EKLKKENQSYKQEIKNLeSQ 392
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1754 MNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVD-KSLVSRQ--------------EAKIRELETRLEFEKTQVKR 1818
Cdd:TIGR04523  393 INDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERlKETIIKNnseikdltnqdsvkELIIKNLDNTRESLETQLKV 472
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1819 LENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQA 1898
Cdd:TIGR04523  473 LSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDF 552
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1899 DLKLAF--KRIGDLQAAIEdEMEsDENEDLINS---LQDMVTKYQKKKNKL 1944
Cdd:TIGR04523  553 ELKKENleKEIDEKNKEIE-ELK-QTQKSLKKKqeeKQELIDQKEKEKKDL 601
PTZ00121 PTZ00121
MAEBL; Provisional
1229-1924 8.78e-25

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 114.08  E-value: 8.78e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1229 LSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRL--ETRISELTSELTDERNTgESASQLLDAETAERLRTEKEMKE 1306
Cdd:PTZ00121  1099 KAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKaeDARKAEEARKAEDAKRV-EIARKAEDARKAEEARKAEDAKK 1177
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1307 LQTQYDALKKQMEVMEMEVMEARLIRAAEingevddddaggewRLKYERAVREVdftkkrlqQELEDKMEVEQqSRRQLE 1386
Cdd:PTZ00121  1178 AEAARKAEEVRKAEELRKAEDARKAEAAR--------------KAEEERKAEEA--------RKAEDAKKAEA-VKKAEE 1234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1387 RRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKK--QRRFDSELSQAHEETQREKLQREKLQR 1464
Cdd:PTZ00121  1235 AKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKaeEKKKADEAKKAEEKKKADEAKKKAEEA 1314
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1465 EKdmllAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLE-AKVKDQEEELDEQAGSI 1543
Cdd:PTZ00121  1315 KK----ADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKkEEAKKKADAAKKKAEEK 1390
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1544 QMLEQAKLRLEMEMERMRQTHSKEMESRdeEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQ 1623
Cdd:PTZ00121  1391 KKADEAKKKAEEDKKKADELKKAAAAKK--KADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEE 1468
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1624 RDFESEKRLRKDLKR----TKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCA-AAVKARKAMEVEMEDLHLQI 1698
Cdd:PTZ00121  1469 AKKADEAKKKAEEAKkadeAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAkKAEEAKKADEAKKAEEKKKA 1548
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1699 DDIAKA---KTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKL 1775
Cdd:PTZ00121  1549 DELKKAeelKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKA 1628
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1776 QALQSQVEFLEQSMVDKslvSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQ 1855
Cdd:PTZ00121  1629 EEEKKKVEQLKKKEAEE---KKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1856 R-------------QLRDTKEE----MSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEM 1918
Cdd:PTZ00121  1706 ElkkkeaeekkkaeELKKAEEEnkikAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEEL 1785

                   ....*.
gi 1907082233 1919 ESDENE 1924
Cdd:PTZ00121  1786 DEEDEK 1791
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1226-1871 3.79e-24

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 111.69  E-value: 3.79e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1226 QVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLD----AETAERL-RT 1300
Cdd:TIGR02168  366 ELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEeaelKELQAELeEL 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1301 EKEMKELQTQYDALKKQMEVMEMEVMEARLIR------AAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRL---QQEL 1371
Cdd:TIGR02168  446 EEELEELQEELERLEEALEELREELEEAEQALdaaereLAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLsgiLGVL 525
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1372 EDKMEVEQQSRRQLERRLG-DLQA----DSDESQRALQQLKK--------------KCQRLTAELQDTKLHLEGQQVRNH 1432
Cdd:TIGR02168  526 SELISVDEGYEAAIEAALGgRLQAvvveNLNAAKKAIAFLKQnelgrvtflpldsiKGTEIQGNDREILKNIEGFLGVAK 605
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1433 ELEKKQRRFDSELS------------QAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAE 1500
Cdd:TIGR02168  606 DLVKFDPKLRKALSyllggvlvvddlDNALELAKKLRPGYRIVTLDGDLVRPGGVITGGSAKTNSSILERRREIEELEEK 685
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1501 LQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERmrqtHSKEMESRDEEVEEArq 1580
Cdd:TIGR02168  686 IEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQ----LEERIAQLSKELTEL-- 759
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1581 scQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDfESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSK 1660
Cdd:TIGR02168  760 --EAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALR-EALDELRAELTLLNEEAANLRERLESLERRIAAT 836
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1661 -REIAQLKNQLEESEftcaaavKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKK 1739
Cdd:TIGR02168  837 eRRLEDLEEQIEELS-------EDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESK 909
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1740 HKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQsQVEFleqsMVDKSLVSRQEAKIRELETRLEFEKTQVKRL 1819
Cdd:TIGR02168  910 RSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEY-SLTL----EEAEALENKIEDDEEEARRRLKRLENKIKEL 984
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1820 E--NLASrlketMEKLTEERdQRAAAENREKEQNKRLQRQLRDTKEEMSELARK 1871
Cdd:TIGR02168  985 GpvNLAA-----IEEYEELK-ERYDFLTAQKEDLTEAKETLEEAIEEIDREARE 1032
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1226-1855 1.28e-23

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 109.34  E-value: 1.28e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1226 QVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEMK 1305
Cdd:TIGR04523  111 EIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNID 190
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1306 ELQTQYDALKKQMEVMEMEVMEARLIRAaEINgevddddaggewrlKYERAVREVDFTKKRLQQELEDKMEVEQQSRRQL 1385
Cdd:TIGR04523  191 KIKNKLLKLELLLSNLKKKIQKNKSLES-QIS--------------ELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQL 255
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1386 E----------RRLGDLQADSDESQRALQQLKKKCQRLTAELQDtkLHLEGQQVRNHEL-------EKKQRRFDSELSQA 1448
Cdd:TIGR04523  256 NqlkdeqnkikKQLSEKQKELEQNNKKIKELEKQLNQLKSEISD--LNNQKEQDWNKELkselknqEKKLEEIQNQISQN 333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1449 HEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKdldiagftqkvvslEAELQDISSQESKDEASLAKVKKQLRDLEAK 1528
Cdd:TIGR04523  334 NKIISQLNEQISQLKKELTNSESENSEKQRELEEK--------------QNEIEKLKKENQSYKQEIKNLESQINDLESK 399
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1529 VKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHS------KEMESRDEEVEEARQSCQKKLKQMEVQLEE---EYED 1599
Cdd:TIGR04523  400 IQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIknnseiKDLTNQDSVKELIIKNLDNTRESLETQLKVlsrSINK 479
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1600 KQKALREK-RELESKLSTLsDQVNQRDFESEKRLrKDLKRTKALLADAQIMLDHLKNNapSKREIAQLKNQLEESEFTca 1678
Cdd:TIGR04523  480 IKQNLEQKqKELKSKEKEL-KKLNEEKKELEEKV-KDLTKKISSLKEKIEKLESEKKE--KESKISDLEDELNKDDFE-- 553
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1679 aavkarkamevemedlhLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQ 1758
Cdd:TIGR04523  554 -----------------LKKENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLE 616
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1759 AQIEESNKEKQELQEKLQALQSQVEFLEQSM------VDKSLVSRQEA--KIRELETRL-EFEKTQVKRLENLASRLKET 1829
Cdd:TIGR04523  617 KELEKAKKENEKLSSIIKNIKSKKNKLKQEVkqiketIKEIRNKWPEIikKIKESKTKIdDIIELMKDWLKELSLHYKKY 696
                          650       660
                   ....*....|....*....|....*...
gi 1907082233 1830 MEKLTEERDQRAAAENRE--KEQNKRLQ 1855
Cdd:TIGR04523  697 ITRMIRIKDLPKLEEKYKeiEKELKKLD 724
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1155-1840 8.00e-23

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 106.94  E-value: 8.00e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1155 LARLEEQRdEQTSRHLTLfQAACRGYLARQHFKKRKIQDLAIRCVQKNIKKNKgvkdwpwwklfttvrplIQVQLSEEQI 1234
Cdd:COG1196    202 LEPLERQA-EKAERYREL-KEELKELEAELLLLKLRELEAELEELEAELEELE-----------------AELEELEAEL 262
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1235 RNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQTQYDAL 1314
Cdd:COG1196    263 AELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEEL 342
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1315 KKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREvdftkkrlQQELEDKMEVEQQSRRQLERRLGDLQA 1394
Cdd:COG1196    343 EEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEE--------LLEALRAAAELAAQLEELEEAEEALLE 414
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1395 DSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAF 1474
Cdd:COG1196    415 RLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLL 494
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1475 SLKQQMEEKDLDIAGftqkvvSLEAELQDISSQESKDEASLAKVKKQLRD-----LEAKVKDQEEELDEQAGS-IQMLEQ 1548
Cdd:COG1196    495 LLLEAEADYEGFLEG------VKAALLLAGLRGLAGAVAVLIGVEAAYEAaleaaLAAALQNIVVEDDEVAAAaIEYLKA 568
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1549 AKL--RLEMEMERMRQTHSKEMESRDEEVEEARqscqkklkqmevqLEEEYEDKQKALREKRELESKLSTLSDQvnqrdf 1626
Cdd:COG1196    569 AKAgrATFLPLDKIRARAALAAALARGAIGAAV-------------DLVASDLREADARYYVLGDTLLGRTLVA------ 629
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1627 eseKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEftcaaaVKARKAMEVEMEDLHLQIDDIAKAKT 1706
Cdd:COG1196    630 ---ARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAAL------LEAEAELEELAERLAEEELELEEALL 700
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1707 ALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLE 1786
Cdd:COG1196    701 AEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLEREIEALG 780
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1787 QsmVDksLVSRQEAKirELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQR 1840
Cdd:COG1196    781 P--VN--LLAIEEYE--ELEERYDFLSEQREDLEEARETLEEAIEEIDRETRER 828
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1347-1915 1.30e-22

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 106.41  E-value: 1.30e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1347 GEWRLKYERAVREVDFTKKRLQQELE-----DKMEVEQQSRRQ-LERRLGDLQADSDESQRALQQLKKKCQRLTAELQDT 1420
Cdd:pfam01576   29 KELEKKHQQLCEEKNALQEQLQAETElcaeaEEMRARLAARKQeLEEILHELESRLEEEEERSQQLQNEKKKMQQHIQDL 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1421 KLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMllaeafslkqqmeekdldiagftqkvvsLEAE 1500
Cdd:pfam01576  109 EEQLDEEEAARQKLQLEKVTTEAKIKKLEEDILLLEDQNSKLSKERKL----------------------------LEER 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1501 LQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSkEMESRDEEVEEARQ 1580
Cdd:pfam01576  161 ISEFTSNLAEEEEKAKSLSKLKNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGESTDLQEQIA-ELQAQIAELRAQLA 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1581 SCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLsdqvnQRDFESEKRLRKDLKRTK-----ALLADAQIMLDHLKN 1655
Cdd:pfam01576  240 KKEEELQAALARLEEETAQKNNALKKIRELEAQISEL-----QEDLESERAARNKAEKQRrdlgeELEALKTELEDTLDT 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1656 NAP-----SKR--EIAQLKNQLEESEFTCAAAVKA-RKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLE 1727
Cdd:pfam01576  315 TAAqqelrSKReqEVTELKKALEEETRSHEAQLQEmRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAELR 394
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1728 EDQEDMNELMKKHKAAvaqasrdMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLeqsmvdKSLVSRQEAKIRELEt 1807
Cdd:pfam01576  395 TLQQAKQDSEHKRKKL-------EGQLQELQARLSESERQRAELAEKLSKLQSELESV------SSLLNEAEGKNIKLS- 460
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1808 rlefektqvKRLENLASRLKETMEKLTEERDQRAAAENR----EKEQNKrLQRQLRDTKE-------EMSELARKEAEAS 1876
Cdd:pfam01576  461 ---------KDVSSLESQLQDTQELLQEETRQKLNLSTRlrqlEDERNS-LQEQLEEEEEakrnverQLSTLQAQLSDMK 530
                          570       580       590
                   ....*....|....*....|....*....|....*....
gi 1907082233 1877 RKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIE 1915
Cdd:pfam01576  531 KKLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYD 569
MYSc_Myo24A cd14937
class XXIV A myosin, motor domain; These myosins have a 1-2 IQ motifs in their neck and a ...
394-820 1.80e-22

class XXIV A myosin, motor domain; These myosins have a 1-2 IQ motifs in their neck and a coiled-coil region in their C-terminal tail. The function of the class XXIV myosins remain elusive. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276897  Cd Length: 637  Bit Score: 105.10  E-value: 1.80e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  394 SSVLHTLRQRYGASLLHTYAGPSLLVLStrgaPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14937      1 AEVLNMLALRYKKNYIYTIAEPMLISIN----PYQVIDVDINEYKNKNTNELPPHVYSYAKDAMTDFINTKTNQSIIISG 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLatIAGTSGTKVFSVEKWQAlSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTM 553
Cdd:cd14937     77 ESGSGKTEASKLVIKYY--LSGVKEDNEISNTLWDS-NFILEAFGNAKTLKNNNSSRYGKYIKIELDEYQNIVSSSIEIF 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  554 LLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNhlAENNVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISpEE 633
Cdd:cd14937    154 LLENIRVVSQEEEERGYHIFYQIFNGMSQELKNKYKIR--SENEYKYIVNKNVVIPEIDDAKDFGNLMISFDKMNMH-DM 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  634 QKTCWLILASIYHLGAAgatkepleEQDEAAEAGRKQFAR-----HEWAQKAAYLLGCSLEELSSAIfkhqlkggTLQRS 708
Cdd:cd14937    231 KDDLFLTLSGLLLLGNV--------EYQEIEKGGKTNCSEldknnLELVNEISNLLGINYENLKDCL--------VFTEK 294
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  709 TSFRQGPEesglgegTKLSALECL---EGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQnpewggSARGA 785
Cdd:cd14937    295 TIANQKIE-------IPLSVEESVsicKSISKDLYNKIFSYITKRINNFLNNNKELNNYIGILDIFGFE------IFSKN 361
                          410       420       430
                   ....*....|....*....|....*....|....*
gi 1907082233  786 SFEELCHNYAQDRLQRLFHERTFLQELERYKEDNI 820
Cdd:cd14937    362 SLEQLLINIANEEIHSIYLYIVYEKETELYKAEDI 396
PTZ00121 PTZ00121
MAEBL; Provisional
1151-1940 2.20e-22

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 106.38  E-value: 2.20e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1151 RAGTLARLEEQRDEQTSRHLTLF---QAACRGYLARQHFKKRKIQDlAIRCVQ----KNIKKNKGVKDWPWWKLFTTVRP 1223
Cdd:PTZ00121  1135 KAEDARKAEEARKAEDAKRVEIArkaEDARKAEEARKAEDAKKAEA-ARKAEEvrkaEELRKAEDARKAEAARKAEEERK 1213
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1224 LIQVQLSEEQIR----NKDEEIQQLRSKLEKVEKERNE---LRLSSDRLETRISELTSELTDERNTGESASQLLDAETAE 1296
Cdd:PTZ00121  1214 AEEARKAEDAKKaeavKKAEEAKKDAEEAKKAEEERNNeeiRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKAD 1293
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1297 RLRTEKEMKELqtqyDALKKQMEVMEMEVMEARliRAAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQELEDKME 1376
Cdd:PTZ00121  1294 EAKKAEEKKKA----DEAKKKAEEAKKADEAKK--KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1377 VEQQSRRQLERRLGDLQADSDESQRAlQQLKKKCQrltaelQDTKlhlEGQQVRNHELEKKQRrfdSELSQAHEETQREK 1456
Cdd:PTZ00121  1368 AAEKKKEEAKKKADAAKKKAEEKKKA-DEAKKKAE------EDKK---KADELKKAAAAKKKA---DEAKKKAEEKKKAD 1434
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1457 LQREKLQREKdmllaEAFSLKQQMEEKDldiagftqkvvslEAELQDISSQESKDEASLAKVKKQLRDLEaKVKDQEEEL 1536
Cdd:PTZ00121  1435 EAKKKAEEAK-----KADEAKKKAEEAK-------------KAEEAKKKAEEAKKADEAKKKAEEAKKAD-EAKKKAEEA 1495
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1537 DEQAGSIQMLEQAKLRLEmEMERMRQTHSKEMESRDEE---VEEARQSCQKKlKQMEVQLEEEY---EDKQKALREKREL 1610
Cdd:PTZ00121  1496 KKKADEAKKAAEAKKKAD-EAKKAEEAKKADEAKKAEEakkADEAKKAEEKK-KADELKKAEELkkaEEKKKAEEAKKAE 1573
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1611 ESKLSTL-----SDQVNQRDFESEKRLRKDLKRTKALLA----DAQIMLDHLKNNAPSKREIAQLKNQLEES-------- 1673
Cdd:PTZ00121  1574 EDKNMALrkaeeAKKAEEARIEEVMKLYEEEKKMKAEEAkkaeEAKIKAEELKKAEEEKKKVEQLKKKEAEEkkkaeelk 1653
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1674 ----EFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTalEEQLSRLQREK---NEIQNRLEEDQEDMNELMKKHKAAVAQ 1746
Cdd:PTZ00121  1654 kaeeENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA--AEALKKEAEEAkkaEELKKKEAEEKKKAEELKKAEEENKIK 1731
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1747 ASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLAS-- 1824
Cdd:PTZ00121  1732 AEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANii 1811
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1825 --------RLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSL 1896
Cdd:PTZ00121  1812 eggkegnlVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEIEEADEIE 1891
                          810       820       830       840
                   ....*....|....*....|....*....|....*....|....
gi 1907082233 1897 QADLKlafkrigDLQAAIEDEMESDENEDLINSLQDmVTKYQKK 1940
Cdd:PTZ00121  1892 KIDKD-------DIEREIPNNNMAGKNNDIIDDKLD-KDEYIKR 1927
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1331-1945 3.96e-22

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 104.76  E-value: 3.96e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1331 IRAAEINGEVDDDDA---------GGEwrlKYERA-------VREVDFTKKRLQQELEDKMEVEQQSRRQlERRLGDLQA 1394
Cdd:PRK03918   132 IRQGEIDAILESDESrekvvrqilGLD---DYENAyknlgevIKEIKRRIERLEKFIKRTENIEELIKEK-EKELEEVLR 207
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1395 DSDESQRALQQLKKKCQRLTAELQD---TKLHLEGQQVRNHELEKKQRRFDSELSQAhEETQREKLQREKLQREKDMLLA 1471
Cdd:PRK03918   208 EINEISSELPELREELEKLEKEVKEleeLKEEIEELEKELESLEGSKRKLEEKIREL-EERIEELKKEIEELEEKVKELK 286
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1472 EAfslkQQMEEKDLDIAGFTQKVVSleaELQDISSQESKDEASLAKVKKQLRDLE---AKVKDQEEELDEQAGSIQMLEQ 1548
Cdd:PRK03918   287 EL----KEKAEEYIKLSEFYEEYLD---ELREIEKRLSRLEEEINGIEERIKELEekeERLEELKKKLKELEKRLEELEE 359
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1549 AKLRLEMEMERMRQTHSKEMESRDEEVEEArqscQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQrdFES 1628
Cdd:PRK03918   360 RHELYEEAKAKKEELERLKKRLTGLTPEKL----EKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEE--LKK 433
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1629 EKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEftcAAAVKARKAMEVEMEDLHLQidDIAKAKTAL 1708
Cdd:PRK03918   434 AKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLR---KELRELEKVLKKESELIKLK--ELAEQLKEL 508
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1709 EEQLSRLQREKNEiqnRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQS 1788
Cdd:PRK03918   509 EEKLKKYNLEELE---KKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFE 585
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1789 MVDKSlvsrqEAKIRELET---RLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEM 1865
Cdd:PRK03918   586 SVEEL-----EERLKELEPfynEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEE 660
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1866 SElarkeaEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIED----EMESDENEDLINSLQDMVTKYQKKK 1941
Cdd:PRK03918   661 YE------ELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEErekaKKELEKLEKALERVEELREKVKKYK 734

                   ....
gi 1907082233 1942 NKLE 1945
Cdd:PRK03918   735 ALLK 738
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1352-1874 5.38e-22

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 104.38  E-value: 5.38e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1352 KYERAVREVDFTKKRLQqELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRN 1431
Cdd:PRK03918   225 KLEKEVKELEELKEEIE-ELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYE 303
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1432 HELEKKQR--RFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFtQKVVSLEAELQDISSQES 1509
Cdd:PRK03918   304 EYLDELREieKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELY-EEAKAKKEELERLKKRLT 382
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1510 kdEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSK----EMESRDEEVEEARQSCQKK 1585
Cdd:PRK03918   383 --GLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKcpvcGRELTEEHRKELLEEYTAE 460
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1586 LKQMEVQLEEEYEDKQKALREKRELESKLstlsdqvnqrdfESEKRLRKDLKrtkalladaqiMLDHLKN--NAPSKREI 1663
Cdd:PRK03918   461 LKRIEKELKEIEEKERKLRKELRELEKVL------------KKESELIKLKE-----------LAEQLKEleEKLKKYNL 517
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1664 AQLKNQLEESEFTCAAAVKARKAMEVEMEDLHlQIDDIAKAKTALEEQLSRLQREKNEIQNRL--------EEDQEDMNE 1735
Cdd:PRK03918   518 EELEKKAEEYEKLKEKLIKLKGEIKSLKKELE-KLEELKKKLAELEKKLDELEEELAELLKELeelgfesvEELEERLKE 596
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1736 LMKKH------KAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKSLvSRQEAKIRELETRL 1809
Cdd:PRK03918   597 LEPFYneylelKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEY-EELREEYLELSREL 675
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1810 EFEKTQVKRLENLASRLKETMEKLTEERDQRAAAE------NREKEQNKRLQRQLRDTKEEMSELARKEAE 1874
Cdd:PRK03918   676 AGLRAELEELEKRREEIKKTLEKLKEELEEREKAKkeleklEKALERVEELREKVKKYKALLKERALSKVG 746
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1232-1890 2.51e-21

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 101.64  E-value: 2.51e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1232 EQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESasqlLDAET---AERLRTEKEMK-EL 1307
Cdd:TIGR04523   47 NELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKINK----LNSDLskiNSEIKNDKEQKnKL 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1308 QTQYDALKKQMEVMEMEVmearliraAEINGEV-DDDDAGGEWRLKYERAVREVDFTKKRLQQELEDKMEVEQ------Q 1380
Cdd:TIGR04523  123 EVELNKLEKQKKENKKNI--------DKFLTEIkKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKnidkikN 194
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1381 SRRQLERRLGDLQAdsdesqralqqLKKKCQRLTAELQDTKlhlEGQQVRNHELEKKQRRFdSELSQAHEETQrEKLQRE 1460
Cdd:TIGR04523  195 KLLKLELLLSNLKK-----------KIQKNKSLESQISELK---KQNNQLKDNIEKKQQEI-NEKTTEISNTQ-TQLNQL 258
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1461 KLQREKDmllaeafslKQQMEEKDLDIAGFTQKVVSLEAELQDISSQES-----KDEASLAKVKKQLRDLEAKVKDQEEE 1535
Cdd:TIGR04523  259 KDEQNKI---------KKQLSEKQKELEQNNKKIKELEKQLNQLKSEISdlnnqKEQDWNKELKSELKNQEKKLEEIQNQ 329
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1536 LDEQAGSIQMLEQAKLRLEMEmermRQTHSKEMESRDEEVEEArqscQKKLKQMEVQLEEEYEDKQKALREKRELESKLS 1615
Cdd:TIGR04523  330 ISQNNKIISQLNEQISQLKKE----LTNSESENSEKQRELEEK----QNEIEKLKKENQSYKQEIKNLESQINDLESKIQ 401
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1616 TLSDQVNQRD-----FESEKR-LRKDLKRTKALLADaqimldhlknnapSKREIAQLKNQLEESEftcaaavkarkamev 1689
Cdd:TIGR04523  402 NQEKLNQQKDeqikkLQQEKElLEKEIERLKETIIK-------------NNSEIKDLTNQDSVKE--------------- 453
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1690 emedlhLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASrdmaqmnDLQAQIEESNKEKQ 1769
Cdd:TIGR04523  454 ------LIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKK-------ELEEKVKDLTKKIS 520
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1770 ELQEKLQALQSQVEFLEQSMVDKslvsrqEAKIRELETRLEFE--KTQVKRLENLASRLKETMEKLTEERDQ-RAAAENR 1846
Cdd:TIGR04523  521 SLKEKIEKLESEKKEKESKISDL------EDELNKDDFELKKEnlEKEIDEKNKEIEELKQTQKSLKKKQEEkQELIDQK 594
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082233 1847 EKEQNK-------------RLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLE 1890
Cdd:TIGR04523  595 EKEKKDlikeieekekkisSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIK 651
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1498-1956 5.08e-21

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 101.41  E-value: 5.08e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1498 EAELQDISSQESKDEASLAKVKKQLRDLEAKVkdqeEELDEQAGSIQMLEQAKLRLEMEMERMRQTHS----------KE 1567
Cdd:pfam01576    4 EEEMQAKEEELQKVKERQQKAESELKELEKKH----QQLCEEKNALQEQLQAETELCAEAEEMRARLAarkqeleeilHE 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1568 MESRDEEVEEARQSCQ---KKLKQ----MEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDfESEKRLRKDLKRTK 1640
Cdd:pfam01576   80 LESRLEEEEERSQQLQnekKKMQQhiqdLEEQLDEEEAARQKLQLEKVTTEAKIKKLEEDILLLE-DQNSKLSKERKLLE 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1641 ALLADAQI----------MLDHLKNNAPSKreIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEE 1710
Cdd:pfam01576  159 ERISEFTSnlaeeeekakSLSKLKNKHEAM--ISDLEERLKKEEKGRQELEKAKRKLEGESTDLQEQIAELQAQIAELRA 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1711 QLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVE-FLEQSM 1789
Cdd:pfam01576  237 QLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISELQEDLESERAARNKAEKQRRDLGEELEALKTELEdTLDTTA 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1790 VDKSLVSRQEAKIRELETRLEFE-KTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSEL 1868
Cdd:pfam01576  317 AQQELRSKREQEVTELKKALEEEtRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTL 396
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1869 ARKEAEASRKKHELEMDLESLEA-ANQSLQADLKLAfKRIGDLQAAIEdemesdenedlinSLQDMVTKYQKKKNKLEGD 1947
Cdd:pfam01576  397 QQAKQDSEHKRKKLEGQLQELQArLSESERQRAELA-EKLSKLQSELE-------------SVSSLLNEAEGKNIKLSKD 462
                          490
                   ....*....|
gi 1907082233 1948 -SDVDSELED 1956
Cdd:pfam01576  463 vSSLESQLQD 472
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1222-1978 1.07e-20

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 100.43  E-value: 1.07e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1222 RPLIQVQLSEEQIRNKDEEIQqlRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTE 1301
Cdd:pfam02463  176 KKLIEETENLAELIIDLEELK--LQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEI 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1302 KEMKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQELEDKMEVEQQS 1381
Cdd:pfam02463  254 ESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKKE 333
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1382 RRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREK 1461
Cdd:pfam02463  334 KEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLEL 413
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1462 LQREKDMLLAEAFSL--KQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQ 1539
Cdd:pfam02463  414 ARQLEDLLKEEKKEEleILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQ 493
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1540 AG-SIQMLEQ------AKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELES 1612
Cdd:pfam02463  494 KLeERSQKESkarsglKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTEL 573
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1613 KLStlSDQVNQRDFESEKRLRKDLK------RTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEftcaaavKARKA 1686
Cdd:pfam02463  574 PLG--ARKLRLLIPKLKLPLKSIAVleidpiLNLAQLDKATLEADEDDKRAKVVEGILKDTELTKLKE-------SAKAK 644
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1687 MEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNK 1766
Cdd:pfam02463  645 ESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLA 724
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1767 EKQELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENR 1846
Cdd:pfam02463  725 DRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELR 804
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1847 E--KEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENE 1924
Cdd:pfam02463  805 AleEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKL 884
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082233 1925 DLINS---------LQDMVTKYQKKKNKLEGDSDVDSELEDRVDGVKSWLSKNKGPSKAPSDD 1978
Cdd:pfam02463  885 KDELEskeekekeeKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADE 947
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1224-1864 1.17e-20

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 100.04  E-value: 1.17e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1224 LIQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLE-----TRISELTSELTDERNTGESASQLLDAETAERl 1298
Cdd:TIGR00618  245 LTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINrarkaAPLAAHIKAVTQIEQQAQRIHTELQSKMRSR- 323
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1299 rtekeMKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGgeWRLKYERAVREVDFTKKrLQQELEDKMEVE 1378
Cdd:TIGR00618  324 -----AKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATS--IREISCQQHTLTQHIHT-LQQQKTTLTQKL 395
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1379 QQSRRQLE--RRLGDLQADSDESQRALQQlkkkcqrltaelqdTKLHLEGQQVRNHELEKKQRRFDSElsQAHEETQREK 1456
Cdd:TIGR00618  396 QSLCKELDilQREQATIDTRTSAFRDLQG--------------QLAHAKKQQELQQRYAELCAAAITC--TAQCEKLEKI 459
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1457 LQREKLQ--REKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISS---QESKDEASLAKVKKQLRDLEAKVKD 1531
Cdd:TIGR00618  460 HLQESAQslKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIhpnPARQDIDNPGPLTRRMQRGEQTYAQ 539
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1532 QEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRD---EEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKR 1608
Cdd:TIGR00618  540 LETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNrskEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLR 619
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1609 ELESKLSTLSDQVNQRDFESEKRLRK-DLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAM 1687
Cdd:TIGR00618  620 KLQPEQDLQDVRLHLQQCSQELALKLtALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQLTYWKEML 699
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1688 EVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKE 1767
Cdd:TIGR00618  700 AQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQTGAE 779
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1768 KQELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQvKRLENLASRLKETMEKLTEERDQRAAAENRE 1847
Cdd:TIGR00618  780 LSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLV-QEEEQFLSRLEEKSATLGEITHQLLKYEECS 858
                          650
                   ....*....|....*..
gi 1907082233 1848 KEQNKRLQRQLRDTKEE 1864
Cdd:TIGR00618  859 KQLAQLTQEQAKIIQLS 875
MYSc_Myo44 cd14905
class XLIV myosin, motor domain; There is little known about the function of the myosin XLIV ...
395-847 3.29e-20

class XLIV myosin, motor domain; There is little known about the function of the myosin XLIV class. Members here include cellular slime mold Polysphondylium and soil-living amoeba Dictyostelium. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276870  Cd Length: 673  Bit Score: 97.86  E-value: 3.29e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKgcRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 473
Cdd:cd14905      2 TLINIIQARYKKEIIYTYIGPILVSVNPlRYLPFLHSQELVRNYN--QRRGLPPHLFALAAKAISDMQDFRRDQLIFIGG 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  474 SSGSGKTTSFQHLVQYLAT--IAGTSGTKVFSVEKwqalSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQ 551
Cdd:cd14905     80 ESGSGKSENTKIIIQYLLTtdLSRSKYLRDYILES----GIILESFGHASTDSNHNSSRWGKYFEMFYSLYGEIQGAKLY 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  552 TMLLEKLRVARRPASEATFNVFYYLLAcgdatlrtelhlnhlaennvfGIVplskpeEKQKAAQQFSKLQaamkvlaisp 631
Cdd:cd14905    156 SYFLDENRVTYQNKGERNFHIFYQFLK---------------------GIT------DEEKAAYQLGDIN---------- 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  632 eeqktcwlilaSIYHLGAAGATKEPLEEQDEAAEAGRKQFARHEWAQKAAYLLGCSLE---ELSSAIF-----KHQLKGG 703
Cdd:cd14905    199 -----------SYHYLNQGGSISVESIDDNRVFDRLKMSFVFFDFPSEKIDLIFKTLSfiiILGNVTFfqkngKTEVKDR 267
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  704 TLQRSTSFRQGPEESGLgEGTKLS--------ALECLEGMASGLYSELFTLLISLVNRALKSSQHSLcSMMIVDTPGFQN 775
Cdd:cd14905    268 TLIESLSHNITFDSTKL-ENILISdrsmpvneAVENRDSLARSLYSALFHWIIDFLNSKLKPTQYSH-TLGILDLFGQES 345
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082233  776 PEWGGsargasFEELCHNYAQDRLQRLFHERTFLQELERYKEDNI----ELAFDDLE---PVADDSVAAVDQASHLVRS 847
Cdd:cd14905    346 SQLNG------YEQFSINFLEERLQQIYLQTVLKQEQREYQTERIpwmtPISFKDNEesvEMMEKIINLLDQESKNINS 418
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1230-1936 3.89e-20

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 97.78  E-value: 3.89e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1230 SEEQIRNKDEEIQQLRSKL----EKVEKERNELRLSSDRLETR---ISELTSELTDERNTGESASQLLDAETAERLRTEK 1302
Cdd:TIGR04523   73 SNNKIKILEQQIKDLNDKLkknkDKINKLNSDLSKINSEIKNDkeqKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEK 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1303 EMKELQTQYDALKKQmevmemevmearliraaeingevddddaggewrlkyeraVREVDFTKKRLQQELEDKMEVEQQSR 1382
Cdd:TIGR04523  153 ELEKLNNKYNDLKKQ---------------------------------------KEELENELNLLEKEKLNIQKNIDKIK 193
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1383 RQLERrlgdlqadsdeSQRALQQLKKKCQRLTaELQDTKLHLEGQQ-VRNHELEKKQRRFdSELSQAHEETQrEKLQREK 1461
Cdd:TIGR04523  194 NKLLK-----------LELLLSNLKKKIQKNK-SLESQISELKKQNnQLKDNIEKKQQEI-NEKTTEISNTQ-TQLNQLK 259
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1462 LQREKDmllaeafslKQQMEEKDLDIAGFTQKVVSLEAELQDISSQES-----KDEASLAKVKKQLRDLEAKVKDQEEEL 1536
Cdd:TIGR04523  260 DEQNKI---------KKQLSEKQKELEQNNKKIKELEKQLNQLKSEISdlnnqKEQDWNKELKSELKNQEKKLEEIQNQI 330
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1537 DEQAGSIQMLEQAKLRLEMEmermRQTHSKEMESRDEEVEEArqscQKKLKQMEVQLEEEYEDKQKALREKRELESKLST 1616
Cdd:TIGR04523  331 SQNNKIISQLNEQISQLKKE----LTNSESENSEKQRELEEK----QNEIEKLKKENQSYKQEIKNLESQINDLESKIQN 402
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1617 LSDQVNQRD-----FESEKR-LRKDLKRTKALLADaqimldhlknnapSKREIAQLKNQLEESEftcaaavkarkameve 1690
Cdd:TIGR04523  403 QEKLNQQKDeqikkLQQEKElLEKEIERLKETIIK-------------NNSEIKDLTNQDSVKE---------------- 453
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1691 medlhLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEdmnELMKKHKaavaqasrdmaqmndlqaQIEESNKEKQE 1770
Cdd:TIGR04523  454 -----LIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQK---ELKSKEK------------------ELKKLNEEKKE 507
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1771 LQEKLQALQSQVEFLEQSMVD-KSLVSRQEAKIRELETRLEFEKTQVKRlenlaSRLKETMEKLTEERDQRAAAENREKE 1849
Cdd:TIGR04523  508 LEEKVKDLTKKISSLKEKIEKlESEKKEKESKISDLEDELNKDDFELKK-----ENLEKEIDEKNKEIEELKQTQKSLKK 582
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1850 QNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQ---SLQADLKLAFKRIGDLQAAIEDEMESDENE-- 1924
Cdd:TIGR04523  583 KQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEklsSIIKNIKSKKNKLKQEVKQIKETIKEIRNKwp 662
                          730
                   ....*....|..
gi 1907082233 1925 DLINSLQDMVTK 1936
Cdd:TIGR04523  663 EIIKKIKESKTK 674
Motor_domain cd01363
Myosin and Kinesin motor domain; Myosin and Kinesin motor domain. These ATPases belong to the ...
425-543 1.31e-19

Myosin and Kinesin motor domain; Myosin and Kinesin motor domain. These ATPases belong to the P-loop NTPase family and provide the driving force in myosin and kinesin mediated processes. Some of the names do not match with what is given in the sequence list. This is because they are based on the current nomenclature by Kollmar/Sebe-Pedros.


Pssm-ID: 276814 [Multi-domain]  Cd Length: 170  Bit Score: 88.17  E-value: 1.31e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  425 APAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTK---- 500
Cdd:cd01363     11 LPIYRDSKIIVFYRGFRRSESQPHVFAIADPAYQSMLDGYNNQSIFAYGESGAGKTETMKGVIPYLASVAFNGINKgete 90
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082233  501 ---------VFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAG 543
Cdd:cd01363     91 gwvylteitVTLEDQILQANPILEAFGNAKTTRNENSSRFGKFIEILLDIAG 142
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1231-1958 2.74e-19

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 95.57  E-value: 2.74e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1231 EEQIRNKDEEIQQLRSKL----EKVEKERNELRLSSDRLETRISELTSE---LTDERNTgESASQ----------LLDAE 1293
Cdd:pfam15921   77 ERVLEEYSHQVKDLQRRLnesnELHEKQKFYLRQSVIDLQTKLQEMQMErdaMADIRRR-ESQSQedlrnqlqntVHELE 155
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1294 TAERLRtEKEMKELQTQYDALKKQMEVMEMEVMEARLIRaaeingeVDDDDAGGEWRLKYERA----VREVDFTKKRLQQ 1369
Cdd:pfam15921  156 AAKCLK-EDMLEDSNTQIEQLRKMMLSHEGVLQEIRSIL-------VDFEEASGKKIYEHDSMstmhFRSLGSAISKILR 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1370 ELEDKMEVEQQSRRQLERRLGDLQADS-DESQRALQQLKKKCQRLTAE--LQDTKLHLEGQQVRNhELEKKQRRFDSELS 1446
Cdd:pfam15921  228 ELDTEISYLKGRIFPVEDQLEALKSESqNKIELLLQQHQDRIEQLISEheVEITGLTEKASSARS-QANSIQSQLEIIQE 306
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1447 QAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKdldIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLE 1526
Cdd:pfam15921  307 QARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEDK---IEELEKQLVLANSELTEARTERDQFSQESGNLDDQLQKLL 383
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1527 AKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRqthsKEMESRDEEVeearQSCQKKLKQMEVQLEEEYEDKQKALRE 1606
Cdd:pfam15921  384 ADLHKREKELSLEKEQNKRLWDRDTGNSITIDHLR----RELDDRNMEV----QRLEALLKAMKSECQGQMERQMAAIQG 455
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1607 KRELESKLSTLSDQVnqrdfESEKRLrkdLKRTKALLADAQIMLDHlknnapSKREIAQLKNQLEESE----FTCAAAVK 1682
Cdd:pfam15921  456 KNESLEKVSSLTAQL-----ESTKEM---LRKVVEELTAKKMTLES------SERTVSDLTASLQEKEraieATNAEITK 521
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1683 ARKAMEVEMEDL-HLQIDDIAKAKTALEEQLSRLQ-REKNEIQNRLEEDQEDMNELmkkhkaaVAQASRDMAQMNDLQAQ 1760
Cdd:pfam15921  522 LRSRVDLKLQELqHLKNEGDHLRNVQTECEALKLQmAEKDKVIEILRQQIENMTQL-------VGQHGRTAGAMQVEKAQ 594
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1761 IE-ESNKEKQELQEKlqalqsqvefleqsmvdKSLVSRQEAKIRELETR---LEFEKTqvkRLENLASRLKETMEKLTEE 1836
Cdd:pfam15921  595 LEkEINDRRLELQEF-----------------KILKDKKDAKIRELEARvsdLELEKV---KLVNAGSERLRAVKDIKQE 654
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1837 RDQ-------RAAAENREKEQNKRLQRQLRDTKEEMSELARK-EAEASRKKHELEM---DLESLEAAN-------QSLQA 1898
Cdd:pfam15921  655 RDQllnevktSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKlKMQLKSAQSELEQtrnTLKSMEGSDghamkvaMGMQK 734
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082233 1899 DLKLAFKRIGDLQAAI---EDEMESDENEDLI-----NSLQDMVTKYQKKKNKLEGDSDVDSELEDRV 1958
Cdd:pfam15921  735 QITAKRGQIDALQSKIqflEEAMTNANKEKHFlkeekNKLSQELSTVATEKNKMAGELEVLRSQERRL 802
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1231-1941 2.79e-19

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 95.17  E-value: 2.79e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1231 EEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSE---LTDERNTGESASQLLD---AETAERL-RTEKE 1303
Cdd:pfam05483   98 EAELKQKENKLQENRKIIEAQRKAIQELQFENEKVSLKLEEEIQEnkdLIKENNATRHLCNLLKetcARSAEKTkKYEYE 177
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1304 MKELQTQYDALKKQMEvmemevmeaRLIRAAEiNGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQELEDKmevEQQSRR 1383
Cdd:pfam05483  178 REETRQVYMDLNNNIE---------KMILAFE-ELRVQAENARLEMHFKLKEDHEKIQHLEEEYKKEINDK---EKQVSL 244
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1384 QL------ERRLGDLQADSDESQRALQQLKKKcqrltAELQDTKLHlegqqvrnhELEKKQRRFDSELSQAHEETQREKL 1457
Cdd:pfam05483  245 LLiqitekENKMKDLTFLLEESRDKANQLEEK-----TKLQDENLK---------ELIEKKDHLTKELEDIKMSLQRSMS 310
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1458 QREKLQREKDMLLAEAFSLKQ----QMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQE 1533
Cdd:pfam05483  311 TQKALEEDLQIATKTICQLTEekeaQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKS 390
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1534 EELDEQAG-----SIQMLEQAKLRLEMEM----ERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKAL 1604
Cdd:pfam05483  391 SELEEMTKfknnkEVELEELKKILAEDEKlldeKKQFEKIAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYL 470
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1605 REKRELESKLstlsdqvnqrdfESEKrlrkdLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEEseftcaaaVKAR 1684
Cdd:pfam05483  471 KEVEDLKTEL------------EKEK-----LKNIELTAHCDKLLLENKELTQEASDMTLELKKHQED--------IINC 525
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1685 KAMEvemEDLHLQIDDIAKAKTALEEQLSRLQRE----KNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQ 1760
Cdd:pfam05483  526 KKQE---ERMLKQIENLEEKEMNLRDELESVREEfiqkGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKKQ 602
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1761 IEESNKEKQELQEKLQALQsqveflEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKE----TMEKLTEE 1836
Cdd:pfam05483  603 IENKNKNIEELHQENKALK------KKGSAENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQKEIEdkkiSEEKLLEE 676
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1837 RDQRAA----AENREKEQNKRLQRQLRD---------------TKEEMSELA---RKEAEASRKKHELEMDLESLEAANQ 1894
Cdd:pfam05483  677 VEKAKAiadeAVKLQKEIDKRCQHKIAEmvalmekhkhqydkiIEERDSELGlykNKEQEQSSAKAALEIELSNIKAELL 756
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*..
gi 1907082233 1895 SLQADLklafkrigdlqaaiedEMESDENEDLINSLQDMVTKYQKKK 1941
Cdd:pfam05483  757 SLKKQL----------------EIEKEEKEKLKMEAKENTAILKDKK 787
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1486-1876 2.93e-19

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 95.52  E-value: 2.93e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1486 DIAG---FTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEaKVKDQEEELDEQAGSIQMLEQAKLRLEMEmermrq 1562
Cdd:TIGR02169  161 EIAGvaeFDRKKEKALEELEEVEENIERLDLIIDEKRQQLERLR-REREKAERYQALLKEKREYEGYELLKEKE------ 233
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1563 thskEMESRDEEVEEARQSCQKKLKQMEVQLEEeyedkqkalREKR--ELESKLSTLSDQVNQRDFESEKRLRKDLKRTK 1640
Cdd:TIGR02169  234 ----ALERQKEAIERQLASLEEELEKLTEEISE---------LEKRleEIEQLLEELNKKIKDLGEEEQLRVKEKIGELE 300
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1641 ALLADAQimldhlknnapskREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKN 1720
Cdd:TIGR02169  301 AEIASLE-------------RSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELE 367
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1721 EIQNRLEEDQEDMNELMKKHKAAVaqasrdmAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKslvsrqEA 1800
Cdd:TIGR02169  368 DLRAELEEVDKEFAETRDELKDYR-------EKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGI------EA 434
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082233 1801 KIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRaaaenreKEQNKRLQRQLRDTKeemSELARKEAEAS 1876
Cdd:TIGR02169  435 KINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDL-------KEEYDRVEKELSKLQ---RELAEAEAQAR 500
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1356-1937 8.80e-19

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 93.95  E-value: 8.80e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1356 AVREVDFTKKRLQQELEDkmEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELE 1435
Cdd:PRK02224   177 GVERVLSDQRGSLDQLKA--QIEEKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELE 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1436 kkqrrfdsELSQAHEETQREKLQREklqREKDMLLAEAFSLKQQMEEkdldiagftqkvvsLEAELQDISSQESKDEASL 1515
Cdd:PRK02224   255 --------TLEAEIEDLRETIAETE---REREELAEEVRDLRERLEE--------------LEEERDDLLAEAGLDDADA 309
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1516 AKVKKQLRDLEAKVKDQEEELDEQAGSIQML-EQAK------LRLEMEMERMRqTHSKEMESRDEEVEEARQSCQKKLKQ 1588
Cdd:PRK02224   310 EAVEARREELEDRDEELRDRLEECRVAAQAHnEEAEslredaDDLEERAEELR-EEAAELESELEEAREAVEDRREEIEE 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1589 MEVQLEEEYEDKQKALREKRELESKLSTLSDQVNqRDFESEKRLRKDLKRTKALLADAQIMLDHLKnnAPskrEIAQlkn 1668
Cdd:PRK02224   389 LEEEIEELRERFGDAPVDLGNAEDFLEELREERD-ELREREAELEATLRTARERVEEAEALLEAGK--CP---ECGQ--- 459
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1669 QLEESEFTCAAAVK--ARKAMEVEMEDLHLQIDDIAKAKTALEEqLSRLQREKNeiqnRLEEDQEDMNELMKKHKAAVAQ 1746
Cdd:PRK02224   460 PVEGSPHVETIEEDreRVEELEAELEDLEEEVEEVEERLERAED-LVEAEDRIE----RLEERREDLEELIAERRETIEE 534
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1747 ASRdmaqmndlqaQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKslvsRQEAKirELETRLEFEKTQVKRLENLASRL 1826
Cdd:PRK02224   535 KRE----------RAEELRERAAELEAEAEEKREAAAEAEEEAEEA----REEVA--ELNSKLAELKERIESLERIRTLL 598
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1827 ------KETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHE-----LEMDLESLEAANQS 1895
Cdd:PRK02224   599 aaiadaEDEIERLREKREALAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEeyleqVEEKLDELREERDD 678
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|....
gi 1907082233 1896 LQadlklafKRIGDLQAAIEdEMES--DENEDLINSLQDMVTKY 1937
Cdd:PRK02224   679 LQ-------AEIGAVENELE-ELEElrERREALENRVEALEALY 714
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1296-1945 1.24e-18

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 93.50  E-value: 1.24e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1296 ERLRTEKEM---KELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERA---VREVDFTKKRLQQ 1369
Cdd:pfam02463  154 RRLEIEEEAagsRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEkleLEEEYLLYLDYLK 233
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1370 ELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAH 1449
Cdd:pfam02463  234 LNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDE 313
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1450 EETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLaKVKKQLRDLEAKV 1529
Cdd:pfam02463  314 EKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKK-KLESERLSSAAKL 392
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1530 KDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQthsKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRE 1609
Cdd:pfam02463  393 KEEELELKSEEEKEAQLLLELARQLEDLLKEEK---KEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKK 469
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1610 LESKLSTLSDQVNQRDfESEKRLRKDLKRTKALLADAQIMLDhlKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAM-- 1687
Cdd:pfam02463  470 SEDLLKETQLVKLQEQ-LELLLSRQKLEERSQKESKARSGLK--VLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAis 546
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1688 ---EVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDM----NELMKKHKAAVAQASRDMAQMNDLQAQ 1760
Cdd:pfam02463  547 tavIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAvleiDPILNLAQLDKATLEADEDDKRAKVVE 626
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1761 IEESNKEKQELQEKLQALQSQVefLEQSMVDKSLVSRQEAKIRELETRLEfekTQVKRLENLASRLKETMEKLTEERDQR 1840
Cdd:pfam02463  627 GILKDTELTKLKESAKAKESGL--RKGVSLEEGLAEKSEVKASLSELTKE---LLEIQELQEKAESELAKEEILRRQLEI 701
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1841 AAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRigdLQAAIEDEMES 1920
Cdd:pfam02463  702 KKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKS---ELSLKEKELAE 778
                          650       660
                   ....*....|....*....|....*
gi 1907082233 1921 DENEDLINSLQDMVTKYQKKKNKLE 1945
Cdd:pfam02463  779 EREKTEKLKVEEEKEEKLKAQEEEL 803
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1663-1925 7.83e-18

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 90.77  E-value: 7.83e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1663 IAQLKNQLEESEftcAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKA 1742
Cdd:COG1196    195 LGELERQLEPLE---RQAEKAERYRELKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEE 271
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1743 AVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKSL-VSRQEAKIRELETRLEFEKTQVKRLEN 1821
Cdd:COG1196    272 LRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEeLAELEEELEELEEELEELEEELEEAEE 351
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1822 LASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLK 1901
Cdd:COG1196    352 ELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALA 431
                          250       260
                   ....*....|....*....|....
gi 1907082233 1902 LAFKRIGDLQAAIEDEMESDENED 1925
Cdd:COG1196    432 ELEEEEEEEEEALEEAAEEEAELE 455
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1336-1735 1.10e-17

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 90.50  E-value: 1.10e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1336 INGEVDDDDAGgewRLKYERAVREVDFTKKRLQQ---ELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQR 1412
Cdd:TIGR02168  661 ITGGSAKTNSS---ILERRREIEELEEKIEELEEkiaELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLAR 737
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1413 LTAELQdtklhlegqqvrnhELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQ 1492
Cdd:TIGR02168  738 LEAEVE--------------QLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALRE 803
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1493 KVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRD 1572
Cdd:TIGR02168  804 ALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERA 883
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1573 eEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNqrdfesekRLRKDLKRTK-ALLADAQIMLD 1651
Cdd:TIGR02168  884 -SLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLE--------GLEVRIDNLQeRLSEEYSLTLE 954
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1652 HLKNN--------APSKREIAQLKNQLEE---------SEFtcaAAVKARKamevemEDLHLQIDDIAKAKTALEEQLSR 1714
Cdd:TIGR02168  955 EAEALenkieddeEEARRRLKRLENKIKElgpvnlaaiEEY---EELKERY------DFLTAQKEDLTEAKETLEEAIEE 1025
                          410       420
                   ....*....|....*....|.
gi 1907082233 1715 LQReknEIQNRLEEDQEDMNE 1735
Cdd:TIGR02168 1026 IDR---EARERFKDTFDQVNE 1043
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1363-1945 1.51e-17

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 90.03  E-value: 1.51e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1363 TKKRLQQELEDKMEVEQQSRRQLeRRLGDLQADSDESQRALQQLKKKCQRLTA-----ELQDTKLHLEGQQVRNHELEKK 1437
Cdd:TIGR00618  223 VLEKELKHLREALQQTQQSHAYL-TQKREAQEEQLKKQQLLKQLRARIEELRAqeavlEETQERINRARKAAPLAAHIKA 301
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1438 QRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQ--MEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASL 1515
Cdd:TIGR00618  302 VTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQrrLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQHI 381
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1516 AKVKKQLRDLEAKVK---DQEEELDEQAGSIQMLEQAKLRLEMEMERMRqthsKEMESRDEEVEEARQSCQKKLkQMEVQ 1592
Cdd:TIGR00618  382 HTLQQQKTTLTQKLQslcKELDILQREQATIDTRTSAFRDLQGQLAHAK----KQQELQQRYAELCAAAITCTA-QCEKL 456
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1593 LEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRL-------RKDLKRTKALLADAQIMLDHLKNNAPSKR---E 1662
Cdd:TIGR00618  457 EKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLlelqeepCPLCGSCIHPNPARQDIDNPGPLTRRMQRgeqT 536
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1663 IAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEdmnELMKKHKA 1742
Cdd:TIGR00618  537 YAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSE---AEDMLACE 613
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1743 AVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKSLVSRQ---------EAKIRELETRLEFEK 1813
Cdd:TIGR00618  614 QHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVlpkellasrQLALQKMQSEKEQLT 693
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1814 TQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEA--------SRKKHELEMD 1885
Cdd:TIGR00618  694 YWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMHQARTVlkarteahFNNNEEVTAA 773
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082233 1886 L------ESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEDLINSLQdMVTKYQKKKNKLE 1945
Cdd:TIGR00618  774 LqtgaelSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCET-LVQEEEQFLSRLE 838
MYSc_Myo24B cd14938
class XXIV B myosin, motor domain; These myosins have a 1-2 IQ motifs in their neck and a ...
395-577 9.36e-17

class XXIV B myosin, motor domain; These myosins have a 1-2 IQ motifs in their neck and a coiled-coil region in their C-terminal tail. The functions of these myosins remain elusive. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276898 [Multi-domain]  Cd Length: 713  Bit Score: 86.81  E-value: 9.36e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKgCRR--EDMAPHIYAVAQTAYRAMLMSRQDQSIVLL 472
Cdd:cd14938      2 SVLYHLKERFKNNKFYTKMGPLLIFINPKINNNINNEETIEKYK-CIDciEDLSLNEYHVVHNALKNLNELKRNQSIIIS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  473 GSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQAL---------------------STLLEAFGNSPTIMNGSATRF 531
Cdd:cd14938     81 GESGSGKSEIAKNIINFIAYQVKGSRRLPTNLNDQEEDnihneentdyqfnmsemlkhvNVVMEAFGNAKTVKNNNSSRF 160
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082233  532 SQILSLDFDQAgQVASASIQTMLLEKLRVARRPASEATFNVFYYLL 577
Cdd:cd14938    161 SKFCTIHIENE-EIKSFHIKKFLLDKERLINRKANENSFNIFYYII 205
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
1222-1951 1.33e-16

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 87.02  E-value: 1.33e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1222 RPLIQVQLSEEQIRNKDEEIQ--QLRSKLEKVEKE-------RNELRLSSDRLETR---ISELTSELTDERNTGESASQL 1289
Cdd:TIGR00606  351 RLQLQADRHQEHIRARDSLIQslATRLELDGFERGpfserqiKNFHTLVIERQEDEaktAAQLCADLQSKERLKQEQADE 430
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1290 LDAETAERLRT--------EKEMKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVR--- 1358
Cdd:TIGR00606  431 IRDEKKGLGRTielkkeilEKKQEELKFVIKELQQLEGSSDRILELDQELRKAERELSKAEKNSLTETLKKEVKSLQnek 510
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1359 -EVDFTKKRLQQELE-----------------DKMEVEQQSRRQLERRLGDLQA------DSDESQRALQQLKKKCQRLT 1414
Cdd:TIGR00606  511 aDLDRKLRKLDQEMEqlnhhtttrtqmemltkDKMDKDEQIRKIKSRHSDELTSllgyfpNKKQLEDWLHSKSKEINQTR 590
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1415 AELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHE-----------ETQREKLQR--EKLQREKDMLLAEAFSLKQQME 1481
Cdd:TIGR00606  591 DRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDklfdvcgsqdeESDLERLKEeiEKSSKQRAMLAGATAVYSQFIT 670
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1482 EKDLDIAG---FTQKVVSLEAELQDISsqeSKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEME 1558
Cdd:TIGR00606  671 QLTDENQSccpVCQRVFQTEAELQEFI---SDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIP 747
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1559 RMRQTHSK------EMESRDEEVEEARQSCQKKLKQMEV---------QLEEEYEDKQKALREK-RELESKLSTLS-DQV 1621
Cdd:TIGR00606  748 ELRNKLQKvnrdiqRLKNDIEEQETLLGTIMPEEESAKVcltdvtimeRFQMELKDVERKIAQQaAKLQGSDLDRTvQQV 827
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1622 NQRDFESEKRLRK---DLKRTKALLADAQIMLDHLKNNApskreiaqlkNQLEESEFTCAAAVKARKAME-------VEM 1691
Cdd:TIGR00606  828 NQEKQEKQHELDTvvsKIELNRKLIQDQQEQIQHLKSKT----------NELKSEKLQIGTNLQRRQQFEeqlvelsTEV 897
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1692 EDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEED----QEDMNELMKKHKAAVA---------------QASRDMA 1752
Cdd:TIGR00606  898 QSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSnkkaQDKVNDIKEKVKNIHGymkdienkiqdgkddYLKQKET 977
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1753 QMNDLQAQIEESNKEKQELQEKLQALQSQVE---FLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLasRLKET 1829
Cdd:TIGR00606  978 ELNTVNAQLEECEKHQEKINEDMRLMRQDIDtqkIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQMQVL--QMKQE 1055
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1830 MEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEAsrKKHELEMDLESLEAANQSL--------QADLK 1901
Cdd:TIGR00606 1056 HQKLEENIDLIKRNHVLALGRQKGYEKEIKHFKKELREPQFRDAEE--KYREMMIVMRTTELVNKDLdiyyktldQAIMK 1133
                          810       820       830       840       850       860
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1902 LAFKRIGDLQAAIED--------------EMESDENEDLinSLQDMVTKYQKKKNKLEGDSDVD 1951
Cdd:TIGR00606 1134 FHSMKMEEINKIIRDlwrstyrgqdieyiEIRSDADENV--SASDKRRNYNYRVVMLKGDTALD 1195
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1187-1776 2.07e-16

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 85.84  E-value: 2.07e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1187 KKRKIQDLAIRCVQKNIKKNKgvkdwpwwKLFTTVRPLIQVQLS-EEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRL 1265
Cdd:TIGR04523  194 NKLLKLELLLSNLKKKIQKNK--------SLESQISELKKQNNQlKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKI 265
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1266 ETRISELTSELtderntgESASQLLDaetaerlRTEKEMKELQTQYDALKKQmevmemevmearliRAAEINGEVDDDDA 1345
Cdd:TIGR04523  266 KKQLSEKQKEL-------EQNNKKIK-------ELEKQLNQLKSEISDLNNQ--------------KEQDWNKELKSELK 317
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1346 GGEWRL--------KYERAVREVDFTKKRLQQELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKkcqrLTAEL 1417
Cdd:TIGR04523  318 NQEKKLeeiqnqisQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKN----LESQI 393
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1418 QDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDmllaeafSLKQQMEEKDLDIAGFTQKVVSL 1497
Cdd:TIGR04523  394 NDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIK-------DLTNQDSVKELIIKNLDNTRESL 466
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1498 EAELQDISSQESKDEASLAKVKKQLrdleakvKDQEEELDeqagsiqMLEQAKLRLEMEMermrqthsKEMESRdeevee 1577
Cdd:TIGR04523  467 ETQLKVLSRSINKIKQNLEQKQKEL-------KSKEKELK-------KLNEEKKELEEKV--------KDLTKK------ 518
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1578 arqscQKKLKQMEVQLEeeyedkqkalREKRELESKLSTLSDQVNQRDFESEKRLRKD-----------LKRTKALLADA 1646
Cdd:TIGR04523  519 -----ISSLKEKIEKLE----------SEKKEKESKISDLEDELNKDDFELKKENLEKeideknkeieeLKQTQKSLKKK 583
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1647 QIMLDHLKNNapSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNR- 1725
Cdd:TIGR04523  584 QEEKQELIDQ--KEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIKETIKEIRNKw 661
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1726 ---------LEEDQEDMNELMKKHK----AAVAQASRDMAQMNDLQaQIEESNKEKQELQEKLQ 1776
Cdd:TIGR04523  662 peiikkikeSKTKIDDIIELMKDWLkelsLHYKKYITRMIRIKDLP-KLEEKYKEIEKELKKLD 724
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1232-1719 2.21e-16

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 85.59  E-value: 2.21e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1232 EQIRNKDEEIQQLRSKLEKVEKERNELrlssDRLETRISELTSELTDERNTGESASQLLDAET--AERLRTEKEMKELQT 1309
Cdd:COG4717     71 KELKELEEELKEAEEKEEEYAELQEEL----EELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAELPE 146
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1310 QYDALKKQMEVMEmevmearliraaeingevddddaggEWRLKYERAVREVDFTKKRLQQELEDKMEVEQQSRRQLERRL 1389
Cdd:COG4717    147 RLEELEERLEELR-------------------------ELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEEL 201
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1390 GDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQvrnhelekkqrrfdselsqaheetqreklQREKLQREKDML 1469
Cdd:COG4717    202 EELQQRLAELEEELEEAQEELEELEEELEQLENELEAAA-----------------------------LEERLKEARLLL 252
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1470 LAEA-----FSLKQQMEEKDLDIAGFTQKVVSLeaeLQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQ 1544
Cdd:COG4717    253 LIAAallalLGLGGSLLSLILTIAGVLFLVLGL---LALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALG 329
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1545 MLEQAKL-RLEMEMERMRQTHSKEMESRDEE----VEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSD 1619
Cdd:COG4717    330 LPPDLSPeELLELLDRIEELQELLREAEELEeelqLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEELEELEE 409
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1620 QVNQR--------DFESEKRLRKDLKRTKALLADAQIMLDHLknnapsKREIAQLKNQLE--ESEFTCAAAVKARKAMEV 1689
Cdd:COG4717    410 QLEELlgeleellEALDEEELEEELEELEEELEELEEELEEL------REELAELEAELEqlEEDGELAELLQELEELKA 483
                          490       500       510
                   ....*....|....*....|....*....|
gi 1907082233 1690 EMEDLHLQIDDIAKAKTALEEQLSRLQREK 1719
Cdd:COG4717    484 ELRELAEEWAALKLALELLEEAREEYREER 513
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1413-1913 2.64e-16

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 85.20  E-value: 2.64e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1413 LTAELQDTKLHLEGQQVRNHELEKKQRRfdsELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQ 1492
Cdd:COG4717     47 LLERLEKEADELFKPQGRKPELNLKELK---ELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEK 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1493 KVvsleaELQDISSQESKDEASLAKVKKQLRDLEAkvkdQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRD 1572
Cdd:COG4717    124 LL-----QLLPLYQELEALEAELAELPERLEELEE----RLEELRELEEELEELEAELAELQEELEELLEQLSLATEEEL 194
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1573 EEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQvnqrdfesekrlrKDLKRTKALLADAQIMLdh 1652
Cdd:COG4717    195 QDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALE-------------ERLKEARLLLLIAAALL-- 259
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1653 lknnapskrEIAQLKNQLEESEFTCAAAVKARKAMevemedLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQED 1732
Cdd:COG4717    260 ---------ALLGLGGSLLSLILTIAGVLFLVLGL------LALLFLLLAREKASLGKEAEELQALPALEELEEEELEEL 324
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1733 MNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKlQALQSQVEFLEQSMVDKslvsrqEAKIRELETRLEFE 1812
Cdd:COG4717    325 LAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLE-ELEQEIAALLAEAGVED------EEELRAALEQAEEY 397
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1813 KTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQnkRLQRQLRDTKEEMSELARKEAEASRKKHELEMD--LESLE 1890
Cdd:COG4717    398 QELKEELEELEEQLEELLGELEELLEALDEEELEEELE--ELEEELEELEEELEELREELAELEAELEQLEEDgeLAELL 475
                          490       500
                   ....*....|....*....|...
gi 1907082233 1891 AANQSLQADLKLAFKRIGDLQAA 1913
Cdd:COG4717    476 QELEELKAELRELAEEWAALKLA 498
MYSc_Myo20 cd14881
class XX myosin, motor domain; These class 20 myosins are primarily insect myosins with such ...
395-1124 1.08e-15

class XX myosin, motor domain; These class 20 myosins are primarily insect myosins with such members as Drosophila, Daphnia, and mosquitoes. These myosins contain a single IQ motif in the neck region. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276847 [Multi-domain]  Cd Length: 633  Bit Score: 83.24  E-value: 1.08e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  395 SVLHTLRQRYGASLLHTYAGPSLLVLST---RGAPAVYSEKvmhmfkgcRREDMAPHIYAVAQTAYRAMLMSRQDQSIVL 471
Cdd:cd14881      2 AVMKCLQARFYAKEFFTNVGPILLSVNPyrdVGNPLTLTST--------RSSPLAPQLLKVVQEAVRQQSETGYPQAIIL 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  472 LGSSGSGKTTSFQHLVQYLATIAGtSGTKVFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQaGQVASASIQ 551
Cdd:cd14881     74 SGTSGSGKTYASMLLLRQLFDVAG-GGPETDAFKHLAAAFTVLRSLGSAKTATNSESSRIGHFIEVQVTD-GALYRTKIH 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  552 TMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLN--------HLAENNVFgivplskpEEKQKAAQQFSKLQAA 623
Cdd:cd14881    152 CYFLDQTRVIRPLPGEKNYHIFYQMLAGLSQEERVKLHLDgyspanlrYLSHGDTR--------QNEAEDAARFQAWKAC 223
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  624 MKVLAISpeeqktcWL----ILASIYHLGAAGATKEPLEEQDEAAEAGRKQfarhewaqkAAYLLGCSleelSSAIFKhq 699
Cdd:cd14881    224 LGILGIP-------FLdvvrVLAAVLLLGNVQFIDGGGLEVDVKGETELKS---------VAALLGVS----GAALFR-- 281
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  700 lkgGTLQRSTSFRQGPEESGLGEGTKLSALECLegmASGLYSElftLLISLVNRAlkSSQHSLCSMM----------IVD 769
Cdd:cd14881    282 ---GLTTRTHNARGQLVKSVCDANMSNMTRDAL---AKALYCR---TVATIVRRA--NSLKRLGSTLgthatdgfigILD 350
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  770 TPGFQNPewggsaRGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELafdDLEPVADDSVAAVDqashLVRSLa 849
Cdd:cd14881    351 MFGFEDP------KPSQLEHLCINLCAETMQHFYNTHIFKSSIESCRDEGIQC---EVEVDYVDNVPCID----LISSL- 416
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  850 hadeARGLLWLLEEEALVPGATEdalldrlfSYYGPQEGDKKGQSPLLRSSK--PRHFLLGHSHGTnwVEYNVAGWLnyt 927
Cdd:cd14881    417 ----RTGLLSMLDVECSPRGTAE--------SYVAKIKVQHRQNPRLFEAKPqdDRMFGIRHFAGR--VVYDASDFL--- 479
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  928 kqnpatqnaprllqDSQKKIISNlflgragsatvlsgsiagleggSQLALRRATSMRKTFTTGMAavkkkslciQIKLQV 1007
Cdd:cd14881    480 --------------DTNRDVVPD----------------------DLVAVFYKQNCNFGFATHTQ---------DFHTRL 514
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1008 DALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYR 1087
Cdd:cd14881    515 DNLLRTLVHARPHFVRCIRSNTTETPN-----------------------------HFDRGTVVRQIRSLQVLETVNLMA 565
                          730       740       750
                   ....*....|....*....|....*....|....*..
gi 1907082233 1088 QGYPDHMVFSEFRRRFDVLAPHLTKKHGRNYIVVDEK 1124
Cdd:cd14881    566 GGYPHRMRFKAFNARYRLLAPFRLLRRVEEKALEDCA 602
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1225-1741 1.14e-15

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 83.57  E-value: 1.14e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1225 IQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLET---------RISELTSELTDERNTGE-------SASQ 1288
Cdd:PRK03918   245 KELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKElkekaeeyiKLSEFYEEYLDELREIEkrlsrleEEIN 324
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1289 LLDAETAERLRTEKEMKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEvddddaggewRLKYERAVREVDFTKKRLQ 1368
Cdd:PRK03918   325 GIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELE----------RLKKRLTGLTPEKLEKELE 394
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1369 QELEDKMEVEQQsRRQLERRLGDLQADSDESQRALQQLKK---KCQRLTAELQDtklhlegqqvrnHELEKKQRRFDSEL 1445
Cdd:PRK03918   395 ELEKAKEEIEEE-ISKITARIGELKKEIKELKKAIEELKKakgKCPVCGRELTE------------EHRKELLEEYTAEL 461
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1446 SQAHEETQREKLQREKLQREKDMLlaEAFSLKQQMEEKDLDIAgftQKVVSLEAELQDISSQE-SKDEASLAKVKKQLRD 1524
Cdd:PRK03918   462 KRIEKELKEIEEKERKLRKELREL--EKVLKKESELIKLKELA---EQLKELEEKLKKYNLEElEKKAEEYEKLKEKLIK 536
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1525 LEAKVKDQEEELDEQagsiqmleqaklrlememermrqthsKEMESRDEEVEEARQSCQKKLKQMEVQLEEE----YEDK 1600
Cdd:PRK03918   537 LKGEIKSLKKELEKL--------------------------EELKKKLAELEKKLDELEEELAELLKELEELgfesVEEL 590
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1601 QKALREKRELESKLSTLSDQVnqRDFESEKRLRKDLKRTkalLADAQIMLDHLKNNAPSKR-EIAQLKNQLEESEFTCAA 1679
Cdd:PRK03918   591 EERLKELEPFYNEYLELKDAE--KELEREEKELKKLEEE---LDKAFEELAETEKRLEELRkELEELEKKYSEEEYEELR 665
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082233 1680 AVKARKAMEV-----EMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNrLEEDQEDMNELMKKHK 1741
Cdd:PRK03918   666 EEYLELSRELaglraELEELEKRREEIKKTLEKLKEELEEREKAKKELEK-LEKALERVEELREKVK 731
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1231-1712 1.40e-15

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 83.69  E-value: 1.40e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1231 EEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISEL---TSELTDERNTGESAsqlldaetaeRLRTEKEMKEL 1307
Cdd:pfam01576  656 ERTNKQLRAEMEDLVSSKDDVGKNVHELERSKRALEQQVEEMktqLEELEDELQATEDA----------KLRLEVNMQAL 725
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1308 QTQYDAlkkqmevmemevmearliraaEINGEvddDDAGGEWRLKYERAVREvdftkkrLQQELEDKMEVEQQ---SRRQ 1384
Cdd:pfam01576  726 KAQFER---------------------DLQAR---DEQGEEKRRQLVKQVRE-------LEAELEDERKQRAQavaAKKK 774
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1385 LERRLGDLQADSDESQR----ALQQLKK---KCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKL 1457
Cdd:pfam01576  775 LELDLKELEAQIDAANKgreeAVKQLKKlqaQMKDLQRELEEARASRDEILAQSKESEKKLKNLEAELLQLQEDLAASER 854
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1458 QREKLQREKDmllaeafslkqqmeekdldiagftqkvvsleaELQDISSQESKDEASLAKVKkqlRDLEAKVKDQEEELD 1537
Cdd:pfam01576  855 ARRQAQQERD--------------------------------ELADEIASGASGKSALQDEK---RRLEARIAQLEEELE 899
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1538 EQAGSIQMLEQAKLRLEMEMERMRQTHSKEmESRDEEVEEARQSCQKKLKQMEVQL-EEEYEDKQKALREKRELESKLST 1616
Cdd:pfam01576  900 EEQSNTELLNDRLRKSTLQVEQLTTELAAE-RSTSQKSESARQQLERQNKELKAKLqEMEGTVKSKFKSSIAALEAKIAQ 978
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1617 LSDQVNQrdfESEKRLR--KDLKRTKALLADAQIMLDHLKNNAPSKREIA--------QLKNQLEESEFTCAAAVKARKA 1686
Cdd:pfam01576  979 LEEQLEQ---ESRERQAanKLVRRTEKKLKEVLLQVEDERRHADQYKDQAekgnsrmkQLKRQLEEAEEEASRANAARRK 1055
                          490       500
                   ....*....|....*....|....*.
gi 1907082233 1687 MEVEMEDLHLQIDDIAKAKTALEEQL 1712
Cdd:pfam01576 1056 LQRELDDATESNESMNREVSTLKSKL 1081
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1225-1571 2.15e-15

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 82.80  E-value: 2.15e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1225 IQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLET-------RISELTSELTDERNTGESASQLLDAETAER 1297
Cdd:TIGR02168  698 KALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAeveqleeRIAQLSKELTELEAEIEELEERLEEAEEEL 777
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1298 LRTEKEMKELQTQYDALKKQMEVMEMevmearliRAAEINGEVDD-DDAGGEWRLKYERAVREVDFTKKRLqQELEDKME 1376
Cdd:TIGR02168  778 AEAEAEIEELEAQIEQLKEELKALRE--------ALDELRAELTLlNEEAANLRERLESLERRIAATERRL-EDLEEQIE 848
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1377 VEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREK 1456
Cdd:TIGR02168  849 ELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLE 928
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1457 LQREKLQREKDMLLaEAFSLKQQMeekdldiagftqkvvsleaELQDISSQESKDEASLAKVKKQLRDLEAKVK------ 1530
Cdd:TIGR02168  929 LRLEGLEVRIDNLQ-ERLSEEYSL-------------------TLEEAEALENKIEDDEEEARRRLKRLENKIKelgpvn 988
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*....
gi 1907082233 1531 --------DQEEELDEQAGSIQMLEQAKLRLEMEMERMrqthSKEMESR 1571
Cdd:TIGR02168  989 laaieeyeELKERYDFLTAQKEDLTEAKETLEEAIEEI----DREARER 1033
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1605-1961 2.20e-15

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 82.81  E-value: 2.20e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1605 REKRELESKLSTLS--DQVNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTcaAAVK 1682
Cdd:TIGR02169  153 VERRKIIDEIAGVAefDRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGY--ELLK 230
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1683 ARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDM-AQMNDLQAQI 1761
Cdd:TIGR02169  231 EKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELeAEIASLERSI 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1762 EESNKEKQELQEKLQ-------ALQSQVEFLEQSMVDKSL--------VSRQEAKIRELETRLEFEKTQVKRLENLASRL 1826
Cdd:TIGR02169  311 AEKERELEDAEERLAkleaeidKLLAEIEELEREIEEERKrrdklteeYAELKEELEDLRAELEEVDKEFAETRDELKDY 390
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1827 KETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKR 1906
Cdd:TIGR02169  391 REKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQE 470
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082233 1907 IGDLQA---AIEDEMESDENEdlinsLQDMVTKYQKKKNKLEGDSDVDSELEDRVDGV 1961
Cdd:TIGR02169  471 LYDLKEeydRVEKELSKLQRE-----LAEAEAQARASEERVRGGRAVEEVLKASIQGV 523
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1504-1956 2.54e-15

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 82.38  E-value: 2.54e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1504 ISSQESKD---EASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEmemERMRQTHSKEMESRD-------- 1572
Cdd:TIGR04523   28 ANKQDTEEkqlEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILE---QQIKDLNDKLKKNKDkinklnsd 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1573 -----EEVEEARQSCQKK---LKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNqrdfesekrlrkDLKRTKALLA 1644
Cdd:TIGR04523  105 lskinSEIKNDKEQKNKLeveLNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYN------------DLKKQKEELE 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1645 DAQIMLDHLKNNApsKREIAQLKNQLEESEFTCAAAVKARKamevEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQN 1724
Cdd:TIGR04523  173 NELNLLEKEKLNI--QKNIDKIKNKLLKLELLLSNLKKKIQ----KNKSLESQISELKKQNNQLKDNIEKKQQEINEKTT 246
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1725 RLEEDQEDMNELMKKHKaavaqasRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVD------KSLVSRQ 1798
Cdd:TIGR04523  247 EISNTQTQLNQLKDEQN-------KIKKQLSEKQKELEQNNKKIKELEKQLNQLKSEISDLNNQKEQdwnkelKSELKNQ 319
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1799 EAKIRELETRLEFEKTQVKRLENLASRLKETMEKLT---EERDQRAAAENRE----KEQNKRLQRQLRDTKEEMSELARK 1871
Cdd:TIGR04523  320 EKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSEsenSEKQRELEEKQNEieklKKENQSYKQEIKNLESQINDLESK 399
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1872 EAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESD-ENEDLINSLQDMVTKYQKKKNKLEGD-SD 1949
Cdd:TIGR04523  400 IQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDsVKELIIKNLDNTRESLETQLKVLSRSiNK 479

                   ....*..
gi 1907082233 1950 VDSELED 1956
Cdd:TIGR04523  480 IKQNLEQ 486
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1366-1871 2.71e-15

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 82.66  E-value: 2.71e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1366 RLQQELEDKMEVEQQSRRQLERR--LGDLQADSDESQRALQQLKkkcqrlTAELQDTKLHLEGQQVRNHELEKKQRRFDS 1443
Cdd:COG4913    229 ALVEHFDDLERAHEALEDAREQIelLEPIRELAERYAAARERLA------ELEYLRAALRLWFAQRRLELLEAELEELRA 302
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1444 ELSQAHEETQREKLQREKLQREKDMLLAEAFS--------LKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASL 1515
Cdd:COG4913    303 ELARLEAELERLEARLDALREELDELEAQIRGnggdrleqLEREIERLERELEERERRRARLEALLAALGLPLPASAEEF 382
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1516 AKVKKQLR------------------DLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKE---------- 1567
Cdd:COG4913    383 AALRAEAAallealeeelealeealaEAEAALRDLRRELRELEAEIASLERRKSNIPARLLALRDALAEAlgldeaelpf 462
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1568 ----MESRDEE-------------------VEEARQSC------QKKLKQmEVQLEEEYEDKQKALREK-------RELE 1611
Cdd:COG4913    463 vgelIEVRPEEerwrgaiervlggfaltllVPPEHYAAalrwvnRLHLRG-RLVYERVRTGLPDPERPRldpdslaGKLD 541
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1612 SKLSTLSDQVNQ---RDF-----ESEKRLRKDlkrTKALLADAQImldhlKNNApSKREIaQLKNQLEESEFTCAAAVKA 1683
Cdd:COG4913    542 FKPHPFRAWLEAelgRRFdyvcvDSPEELRRH---PRAITRAGQV-----KGNG-TRHEK-DDRRRIRSRYVLGFDNRAK 611
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1684 RKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNelmkkhkaaVAQASRDMAQMNDLQAQIEE 1763
Cdd:COG4913    612 LAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEID---------VASAEREIAELEAELERLDA 682
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1764 SNKEKQELQEKLQALQSQVEFLEQsmvdksLVSRQEAKIRELETRLEFEKTQVK----RLENLASRLKETMEKLTEERDQ 1839
Cdd:COG4913    683 SSDDLAALEEQLEELEAELEELEE------ELDELKGEIGRLEKELEQAEEELDelqdRLEAAEDLARLELRALLEERFA 756
                          570       580       590
                   ....*....|....*....|....*....|..
gi 1907082233 1840 RAAAENREKEQNKRLQRQLRDTKEEMSELARK 1871
Cdd:COG4913    757 AALGDAVERELRENLEERIDALRARLNRAEEE 788
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1623-1956 4.23e-15

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 82.03  E-value: 4.23e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1623 QRDFESEKRLRK---DLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEEseftcaaavkarkamevemedlhLQID 1699
Cdd:TIGR02168  172 ERRKETERKLERtreNLDRLEDILNELERQLKSLERQAEKAERYKELKAELRE-----------------------LELA 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1700 DIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELmkkhkaavaqasrdmaqmndlQAQIEESNKEKQELQEKLQALQ 1779
Cdd:TIGR02168  229 LLVLRLEELREELEELQEELKEAEEELEELTAELQEL---------------------EEKLEELRLEVSELEEEIEELQ 287
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1780 SqvEFLEQSmvdkslvsrqeAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLR 1859
Cdd:TIGR02168  288 K--ELYALA-----------NEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELE 354
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1860 DTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEDLINSLQDMVTKYQK 1939
Cdd:TIGR02168  355 SLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAE 434
                          330
                   ....*....|....*..
gi 1907082233 1940 KKNKLEGDSDVDSELED 1956
Cdd:TIGR02168  435 LKELQAELEELEEELEE 451
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1228-1879 7.42e-15

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 81.17  E-value: 7.42e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1228 QLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDaetaERLRTEKEMKEL 1307
Cdd:pfam02463  380 KLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQ----GKLTEEKEELEK 455
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1308 QTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQELE--DKMEVEQQSRRQL 1385
Cdd:pfam02463  456 QELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVggRIISAHGRLGDLG 535
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1386 ERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQrrfDSELSQAHEETQREKLQREKLQRE 1465
Cdd:pfam02463  536 VAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLK---LPLKSIAVLEIDPILNLAQLDKAT 612
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1466 KDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQE------SKDEASLAKVKKQLRDLEAKVKDQEEELDEQ 1539
Cdd:pfam02463  613 LEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEeglaekSEVKASLSELTKELLEIQELQEKAESELAKE 692
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1540 AGSIQMLEQAKLRLEMEmERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSD 1619
Cdd:pfam02463  693 EILRRQLEIKKKEQREK-EELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSL 771
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1620 QVNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQID 1699
Cdd:pfam02463  772 KEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKL 851
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1700 DIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVaqasrdMAQMNDLQAQIEESNKEKQELQEKLQALQ 1779
Cdd:pfam02463  852 AEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEK------EEKKELEEESQKLNLLEEKENEIEERIKE 925
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1780 SQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLEN---LASRLKETMEKLTEERDQRAAAENREKEQN----K 1852
Cdd:pfam02463  926 EAEILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLakeELGKVNLMAIEEFEEKEERYNKDELEKERLeeekK 1005
                          650       660
                   ....*....|....*....|....*..
gi 1907082233 1853 RLQRQLRDTKEEMSELARKEAEASRKK 1879
Cdd:pfam02463 1006 KLIRAIIEETCQRLKEFLELFVSINKG 1032
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1235-1840 1.56e-14

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 80.08  E-value: 1.56e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1235 RNKDEEIQQLRSKLEkvEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDaETAERLrteKEMKELQTQYDAL 1314
Cdd:PRK02224   183 SDQRGSLDQLKAQIE--EKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRD-EADEVL---EEHEERREELETL 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1315 KkqmevmemevmearliraAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQELED---KMEVEQQSRRQLERRLGD 1391
Cdd:PRK02224   257 E------------------AEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDllaEAGLDDADAEAVEARREE 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1392 LQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDmlla 1471
Cdd:PRK02224   319 LEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIE---- 394
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1472 eafSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKD-------QEEELDEQAGSIQ 1544
Cdd:PRK02224   395 ---ELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLEAgkcpecgQPVEGSPHVETIE 471
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1545 MLEQAKLRLEMEMERMRQTHskemESRDEEVEEArqscqKKLKQMEVQLEEeyedkqkaLREKRELESKLSTlsdqvNQR 1624
Cdd:PRK02224   472 EDRERVEELEAELEDLEEEV----EEVEERLERA-----EDLVEAEDRIER--------LEERREDLEELIA-----ERR 529
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1625 DFESEKRLRKDLKRTKAlladaqimlDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVE----MEDLHLQIDD 1700
Cdd:PRK02224   530 ETIEEKRERAEELRERA---------AELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKEriesLERIRTLLAA 600
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1701 IAKAKTALE---EQLSRLQREKNEIQNRLEEDQEDMNELMKKH-KAAVAQASRDMAQMNDLQAQIEEsnkEKQELQEKLQ 1776
Cdd:PRK02224   601 IADAEDEIErlrEKREALAELNDERRERLAEKRERKRELEAEFdEARIEEAREDKERAEEYLEQVEE---KLDELREERD 677
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082233 1777 ALQSQVefleqSMVDKSLvsrqeAKIRELETRLEFEKTQVKRLENL---ASRLKETMEKLTEERDQR 1840
Cdd:PRK02224   678 DLQAEI-----GAVENEL-----EELEELRERREALENRVEALEALydeAEELESMYGDLRAELRQR 734
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1232-1868 3.06e-14

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 79.19  E-value: 3.06e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1232 EQIRNKDEEIQQLRSKLEKVEKERNELRLssDRLETRISELTSELTDERntgesasQLLDAETAERLRTEKEMKELQTQY 1311
Cdd:COG4913    255 EPIRELAERYAAARERLAELEYLRAALRL--WFAQRRLELLEAELEELR-------AELARLEAELERLEARLDALREEL 325
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1312 DALKKQMevmemevmearliraaeingevddDDAGGEwRLkyERAVREVdftkKRLQQELEDKmeveQQSRRQLERRLGD 1391
Cdd:COG4913    326 DELEAQI------------------------RGNGGD-RL--EQLEREI----ERLERELEER----ERRRARLEALLAA 370
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1392 LQADSDESQRALQQLKKKCQRLTAELQDtklHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQR----EKLQREKD 1467
Cdd:COG4913    371 LGLPLPASAEEFAALRAEAAALLEALEE---ELEALEEALAEAEAALRDLRRELRELEAEIASLERRKsnipARLLALRD 447
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1468 MLlAEAFSLKqqmeEKDLDIAGftqkvvsleaELQDISSQESK------------------DEASLAKVKKQL--RDLEA 1527
Cdd:COG4913    448 AL-AEALGLD----EAELPFVG----------ELIEVRPEEERwrgaiervlggfaltllvPPEHYAAALRWVnrLHLRG 512
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1528 KVKDQEEELDEQAGSIQMLEQAKL--RLEMEMERMRQTHSKEMESRD-----EEVEEARQ-------SCQKKL------K 1587
Cdd:COG4913    513 RLVYERVRTGLPDPERPRLDPDSLagKLDFKPHPFRAWLEAELGRRFdyvcvDSPEELRRhpraitrAGQVKGngtrheK 592
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1588 QMEVQLEEEY------EDKQKALREKR-ELESKLSTLSDQVNQRdfeseKRLRKDLKRTKALLADAQIMLDHLKNNAPSK 1660
Cdd:COG4913    593 DDRRRIRSRYvlgfdnRAKLAALEAELaELEEELAEAEERLEAL-----EAELDALQERREALQRLAEYSWDEIDVASAE 667
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1661 REIAQLKNQ---LEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEE----DQEDM 1733
Cdd:COG4913    668 REIAELEAElerLDASSDDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAaedlARLEL 747
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1734 NELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQveFLEQSMVDKSLVSRQEAKIRELETRLEfek 1813
Cdd:COG4913    748 RALLEERFAAALGDAVERELRENLEERIDALRARLNRAEEELERAMRA--FNREWPAETADLDADLESLPEYLALLD--- 822
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082233 1814 tqvkRLENlaSRLKETMEKLteeRDQRAAAENREKEQ-NKRLQRQLRDTKEEMSEL 1868
Cdd:COG4913    823 ----RLEE--DGLPEYEERF---KELLNENSIEFVADlLSKLRRAIREIKERIDPL 869
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1660-1876 4.71e-14

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 76.34  E-value: 4.71e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1660 KREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKK 1739
Cdd:COG4942     33 QQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAELLRA 112
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1740 -------HKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQsmvdkslvsRQEAKIRELETRLEFE 1812
Cdd:COG4942    113 lyrlgrqPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRA---------ELEAERAELEALLAEL 183
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1813 KTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEAS 1876
Cdd:COG4942    184 EEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAG 247
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1544-1945 4.86e-14

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 77.89  E-value: 4.86e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1544 QMLEQaklRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEvQLEEEYEDKQKALREKRELESKLSTLSDQVNQ 1623
Cdd:COG4717     45 AMLLE---RLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEE-EYAELQEELEELEEELEELEAELEELREELEK 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1624 -RDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPS----KREIAQLKNQLEESEFTCAAAVK-ARKAMEVEMEDLHLQ 1697
Cdd:COG4717    121 lEKLLQLLPLYQELEALEAELAELPERLEELEERLEElrelEEELEELEAELAELQEELEELLEqLSLATEEELQDLAEE 200
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1698 IDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHK-----------AAVA--------------------- 1745
Cdd:COG4717    201 LEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERlkearlllliaAALLallglggsllsliltiagvlf 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1746 ---------------QASRDMAQMNDLQAQIEESNKEKQELQEKLQALQ----SQVEFLEQSMVDKSLVSRQEAKIRELE 1806
Cdd:COG4717    281 lvlgllallflllarEKASLGKEAEELQALPALEELEEEELEELLAALGlppdLSPEELLELLDRIEELQELLREAEELE 360
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1807 TRLEFEKTQvKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEAsrkkhELEMDL 1886
Cdd:COG4717    361 EELQLEELE-QEIAALLAEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEALDEE-----ELEEEL 434
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082233 1887 ESLEAANQSLQADLKLAFKRIGDLQAAIEdEMESDEnedlinSLQDMVTKYQKKKNKLE 1945
Cdd:COG4717    435 EELEEELEELEEELEELREELAELEAELE-QLEEDG------ELAELLQELEELKAELR 486
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1381-1946 7.07e-14

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 78.03  E-value: 7.07e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1381 SRRQLERRLGDLQADSDESQRALQqlkkkcQRLTAelqdtKLHLEGQQ--------------------VRNHELEKKQ-- 1438
Cdd:COG4913    156 DIRALKARLKKQGVEFFDSFSAYL------ARLRR-----RLGIGSEKalrllhktqsfkpigdlddfVREYMLEEPDtf 224
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1439 RRFDS------ELSQAHEETQREKLQREKLQREKDmlLAEAF-SLKQQMEEKDLDIAGFTQKVVSLEAELQdissqeskd 1511
Cdd:COG4913    225 EAADAlvehfdDLERAHEALEDAREQIELLEPIRE--LAERYaAARERLAELEYLRAALRLWFAQRRLELL--------- 293
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1512 EASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKL--------RLEMEMERMRQTHskemesrdEEVEEARQSCQ 1583
Cdd:COG4913    294 EAELEELRAELARLEAELERLEARLDALREELDELEAQIRgnggdrleQLEREIERLEREL--------EERERRRARLE 365
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1584 KKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQImLDHLKNNAPskREI 1663
Cdd:COG4913    366 ALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIAS-LERRKSNIP--ARL 442
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1664 AQLKNQLEEseftcAAAVKARKA------MEVEMED--------------------------------------LHLQID 1699
Cdd:COG4913    443 LALRDALAE-----ALGLDEAELpfvgelIEVRPEEerwrgaiervlggfaltllvppehyaaalrwvnrlhlrGRLVYE 517
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1700 DI------AKAKTALEEQLSR-LQREKNEIQNRLE------------EDQEDmnelMKKHKAAVAQA-----SRDMAQMN 1755
Cdd:COG4913    518 RVrtglpdPERPRLDPDSLAGkLDFKPHPFRAWLEaelgrrfdyvcvDSPEE----LRRHPRAITRAgqvkgNGTRHEKD 593
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1756 DL-------------QAQIEESNKEKQELQEKLQALQSQVEFLEQSMvdKSLVSRQEAKIRELETRleFEKTQVKRLENL 1822
Cdd:COG4913    594 DRrrirsryvlgfdnRAKLAALEAELAELEEELAEAEERLEALEAEL--DALQERREALQRLAEYS--WDEIDVASAERE 669
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1823 ASRLKETMEKLTEERDQRAAAENREKEQNKR---LQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEA-ANQSLQA 1898
Cdd:COG4913    670 IAELEAELERLDASSDDLAALEEQLEELEAEleeLEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDlARLELRA 749
                          650       660       670       680
                   ....*....|....*....|....*....|....*....|....*...
gi 1907082233 1899 DLKLAFkrigdlQAAIEDEMESDENEDLINSLQDMVTKYQKKKNKLEG 1946
Cdd:COG4913    750 LLEERF------AAALGDAVERELRENLEERIDALRARLNRAEEELER 791
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1343-1955 8.53e-14

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 78.17  E-value: 8.53e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1343 DDAGGEWRLKYERAVREVDFTKKRLQQeledkmeveqqsrrQLERRLGDLQADSDESQRALQQLKKKCQRLTaELQDTKL 1422
Cdd:TIGR01612 1099 DDFGKEENIKYADEINKIKDDIKNLDQ--------------KIDHHIKALEEIKKKSENYIDEIKAQINDLE-DVADKAI 1163
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1423 HLEGQQvrnhELEKKQRRFDSELSQA---HEETQreKLQREKLQREKDmllaeafslKQQMEE-KDLDIAgFTQKVVSLE 1498
Cdd:TIGR01612 1164 SNDDPE----EIEKKIENIVTKIDKKkniYDEIK--KLLNEIAEIEKD---------KTSLEEvKGINLS-YGKNLGKLF 1227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1499 aeLQDISSQESKDEASLAKVKKQLRDLEaKVKDQEEELDEQAGsIQMLEQAklrlemEMERMRQTHSKEM------ESRD 1572
Cdd:TIGR01612 1228 --LEKIDEEKKKSEHMIKAMEAYIEDLD-EIKEKSPEIENEMG-IEMDIKA------EMETFNISHDDDKdhhiisKKHD 1297
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1573 EEVEEARQscqKKLKQMEVQLEE-EYEDKQKAL--------REKRELESKLSTLSDQVNQRDFESEKRLRKDLKR-TKAL 1642
Cdd:TIGR01612 1298 ENISDIRE---KSLKIIEDFSEEsDINDIKKELqknlldaqKHNSDINLYLNEIANIYNILKLNKIKKIIDEVKEyTKEI 1374
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1643 LADAQIMLDHLKNnapSKREIAQLKNQLeeSEFTCAAAVKArkamEVEMEDLHLQIDDIAKAKTALEEQLSRLQ------ 1716
Cdd:TIGR01612 1375 EENNKNIKDELDK---SEKLIKKIKDDI--NLEECKSKIES----TLDDKDIDECIKKIKELKNHILSEESNIDtyfkna 1445
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1717 REKNE----IQNRLEEDQEDMNELMKKHKAavaQASRDMA-QMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSmvd 1791
Cdd:TIGR01612 1446 DENNEnvllLFKNIEMADNKSQHILKIKKD---NATNDHDfNINELKEHIDKSKGCKDEADKNAKAIEKNKELFEQY--- 1519
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1792 KSLVSRQEAKIRELETRLEFEKTQvKRLENLASRLKETMEKLTEErdqraaAENREKEQNKRLQRQLRDTKEemselARK 1871
Cdd:TIGR01612 1520 KKDVTELLNKYSALAIKNKFAKTK-KDSEIIIKEIKDAHKKFILE------AEKSEQKIKEIKKEKFRIEDD-----AAK 1587
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1872 EAEASRKKHELEMDLESLEaaNQSLQ-ADLKlafKRIGDLQAAIED-------------EMESDENEDLINSLQDMVTKY 1937
Cdd:TIGR01612 1588 NDKSNKAAIDIQLSLENFE--NKFLKiSDIK---KKINDCLKETESiekkissfsidsqDTELKENGDNLNSLQEFLESL 1662
                          650       660
                   ....*....|....*....|..
gi 1907082233 1938 QKKKNKLEGDS----DVDSELE 1955
Cdd:TIGR01612 1663 KDQKKNIEDKKkeldELDSEIE 1684
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1404-1937 2.99e-13

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 75.78  E-value: 2.99e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1404 QQLKKKCQRLTAELQDTKLHLEGQQVRNHElekKQRRFDSELSQAHEETQREKLQREKLQREkDMLLAEAFSLKQQMEEK 1483
Cdd:TIGR00618  190 KSLHGKAELLTLRSQLLTLCTPCMPDTYHE---RKQVLEKELKHLREALQQTQQSHAYLTQK-REAQEEQLKKQQLLKQL 265
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1484 DLDIAGFT--QKVVSLEAELQDISSQESK---DEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEME 1558
Cdd:TIGR00618  266 RARIEELRaqEAVLEETQERINRARKAAPlaaHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRR 345
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1559 RMRQTHSKEMESRDE-EVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFE--SEKRLRKD 1635
Cdd:TIGR00618  346 LLQTLHSQEIHIRDAhEVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRtsAFRDLQGQ 425
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1636 LKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTcaaAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRL 1715
Cdd:TIGR00618  426 LAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQ---SLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEE 502
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1716 QREKNE-----------------IQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQAL 1778
Cdd:TIGR00618  503 PCPLCGscihpnparqdidnpgpLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRS 582
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1779 QSQVEFLEQSMVD------------KSLVSRQEAKIRELETRLE------FEKTQVKRLENLASRLKETMEKLTEERDQR 1840
Cdd:TIGR00618  583 KEDIPNLQNITVRlqdlteklseaeDMLACEQHALLRKLQPEQDlqdvrlHLQQCSQELALKLTALHALQLTLTQERVRE 662
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1841 AAAENREKEQNKRLQRQLRDTKEE-------------------MSELARKEAEASRKKHELEMDLESLEaanQSLQADLK 1901
Cdd:TIGR00618  663 HALSIRVLPKELLASRQLALQKMQsekeqltywkemlaqcqtlLRELETHIEEYDREFNEIENASSSLG---SDLAARED 739
                          570       580       590
                   ....*....|....*....|....*....|....*.
gi 1907082233 1902 LAFKRIGDLQAAIEDEMESDENEDLINSLQDMVTKY 1937
Cdd:TIGR00618  740 ALNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQ 775
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1561-1990 5.42e-13

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 74.67  E-value: 5.42e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1561 RQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKrLRKDLKRTK 1640
Cdd:TIGR04523   31 QDTEEKQLEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKINK-LNSDLSKIN 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1641 AlladaQIMLD-HLKNNapSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREK 1719
Cdd:TIGR04523  110 S-----EIKNDkEQKNK--LEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEK 182
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1720 NEIQN----------RLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSm 1789
Cdd:TIGR04523  183 LNIQKnidkiknkllKLELLLSNLKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDE- 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1790 vDKSLVSRQEAKIRELETrlefEKTQVKRLENLASRLKETMEKLteerdqraaaeNREKEQN--KRLQRQLRDTKEEMSE 1867
Cdd:TIGR04523  262 -QNKIKKQLSEKQKELEQ----NNKKIKELEKQLNQLKSEISDL-----------NNQKEQDwnKELKSELKNQEKKLEE 325
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1868 LarkeaeasrkkhelemdlesleaanqslqadlklafkrigdlqaaiedEMESDENEDLINSLQDMVTKYQKKKNKLEGD 1947
Cdd:TIGR04523  326 I------------------------------------------------QNQISQNNKIISQLNEQISQLKKELTNSESE 357
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....
gi 1907082233 1948 -SDVDSELEDRVDGVKSWLSKNKGPSKAPSddgSLKSSRTALNS 1990
Cdd:TIGR04523  358 nSEKQRELEEKQNEIEKLKKENQSYKQEIK---NLESQINDLES 398
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1445-1647 5.71e-13

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 73.26  E-value: 5.71e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1445 LSQAHEETQREKlQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRD 1524
Cdd:COG4942     16 AAQADAAAEAEA-ELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAE 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1525 LEAKVKDQEEELDEQAGSIQML-EQAKLRLEM------EMERMRQTHSKEMESRDEEVEEARQScQKKLKQMEVQLEEEY 1597
Cdd:COG4942     95 LRAELEAQKEELAELLRALYRLgRQPPLALLLspedflDAVRRLQYLKYLAPARREQAEELRAD-LAELAALRAELEAER 173
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1598 EDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQ 1647
Cdd:COG4942    174 AELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAE 223
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1702-1906 5.92e-13

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 73.26  E-value: 5.92e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1702 AKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQ 1781
Cdd:COG4942     19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAE 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1782 VEFLEQSMVD--------------KSLVSRQEAK-----IRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAA 1842
Cdd:COG4942     99 LEAQKEELAEllralyrlgrqpplALLLSPEDFLdavrrLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEA 178
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1843 AENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKR 1906
Cdd:COG4942    179 LLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAER 242
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
1351-1903 8.31e-13

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 74.09  E-value: 8.31e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1351 LKYERAVREVDFTKKRLqqeLEDKMEVEQQSRRQLERRLGDLQaDSDESQRALQQL----KKKCQRLTAELQDTKLHLEG 1426
Cdd:pfam10174   41 LKKERALRKEEAARISV---LKEQYRVTQEENQHLQLTIQALQ-DELRAQRDLNQLlqqdFTTSPVDGEDKFSTPELTEE 116
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1427 QQVRNHELEKKQRRFDSELSQAHEE------TQREKL--QREKLQREKDMLLAEAFSLKQQMEEKDLD--IAGFTQKVVS 1496
Cdd:pfam10174  117 NFRRLQSEHERQAKELFLLRKTLEEmelrieTQKQTLgaRDESIKKLLEMLQSKGLPKKSGEEDWERTrrIAEAEMQLGH 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1497 LEAELQDISSQESKDEASLaKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVE 1576
Cdd:pfam10174  197 LEVLLDQKEKENIHLREEL-HRRNQLQPDPAKTKALQTVIEMKDTKISSLERNIRDLEDEVQMLKTNGLLHTEDREEEIK 275
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1577 --EARQSCQKKLKQMEVQLEEEYEDKQKalrEKRELESKLSTLSDQvNQRDFESEKRLRKDLKRTKALLADAQIMLDHLK 1654
Cdd:pfam10174  276 qmEVYKSHSKFMKNKIDQLKQELSKKES---ELLALQTKLETLTNQ-NSDCKQHIEVLKESLTAKEQRAAILQTEVDALR 351
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1655 NNAPSKREIAQLKNQ----LEESEFTCAAA---------VKARKAMEVE--MEDLHLQIDDIAKAKTALEEQLSRLQREK 1719
Cdd:pfam10174  352 LRLEEKESFLNKKTKqlqdLTEEKSTLAGEirdlkdmldVKERKINVLQkkIENLQEQLRDKDKQLAGLKERVKSLQTDS 431
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1720 NEIQNRL---EE---DQEDMNELMKKHKAAVAQASRDmaqmndlqaQIEESNKEKQELQEKLQALQSQVEFLEQSMVD-- 1791
Cdd:pfam10174  432 SNTDTALttlEEalsEKERIIERLKEQREREDRERLE---------ELESLKKENKDLKEKVSALQPELTEKESSLIDlk 502
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1792 ---KSLVSR---QEAKIRELETRLEFEKTQVKRLENlasRLKETMEKlteERDQRAAAENREKEQNkrLQRqlrdtkeem 1865
Cdd:pfam10174  503 ehaSSLASSglkKDSKLKSLEIAVEQKKEECSKLEN---QLKKAHNA---EEAVRTNPEINDRIRL--LEQ--------- 565
                          570       580       590
                   ....*....|....*....|....*....|....*....
gi 1907082233 1866 sELARKEAEASRKKHELEMDLESL-EAANQSLQADLKLA 1903
Cdd:pfam10174  566 -EVARYKEESGKAQAEVERLLGILrEVENEKNDKDKKIA 603
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
1354-1892 8.61e-13

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 74.49  E-value: 8.61e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1354 ERAVREVDFTKKRLQQELEDKMEVEQQSRRQLERRL----GDLQADSDESQRALQQLKkkcQRLTAELQDTKLHLEGQQV 1429
Cdd:pfam12128  275 ASRQEERQETSAELNQLLRTLDDQWKEKRDELNGELsaadAAVAKDRSELEALEDQHG---AFLDADIETAAADQEQLPS 351
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1430 RNHELEKKQRRFDSeLSQAHEETQREKLQREKL--QREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQ 1507
Cdd:pfam12128  352 WQSELENLEERLKA-LTGKHQDVTAKYNRRRSKikEQNNRDIAGIKDKLAKIREARDRQLAVAEDDLQALESELREQLEA 430
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1508 eSKDEASLAKVKKQLRDLEAKVK-----DQEEELDEQAGSIQMLEQAKLRLEM---EMERMrQTHSKEMESRDEEVEEAR 1579
Cdd:pfam12128  431 -GKLEFNEEEYRLKSRLGELKLRlnqatATPELLLQLENFDERIERAREEQEAanaEVERL-QSELRQARKRRDQASEAL 508
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1580 QSCQKKLKQMEVQLEE-------------EYEDKQKALREK---RELESKLSTLSDQVNQRDFESEKR------LRKDLK 1637
Cdd:pfam12128  509 RQASRRLEERQSALDElelqlfpqagtllHFLRKEAPDWEQsigKVISPELLHRTDLDPEVWDGSVGGelnlygVKLDLK 588
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1638 RTkallaDAQIMLDHlknNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDiakAKTALEEQLSRLQR 1717
Cdd:pfam12128  589 RI-----DVPEWAAS---EEELRERLDKAEEALQSAREKQAAAEEQLVQANGELEKASREETF---ARTALKNARLDLRR 657
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1718 EKNEIQNrleeDQEDMNELMKKHKAAVAQASRDMAQ-----MNDLQAQIEESNKEKQELQ-EKLQALQSQVEFLEQSM-- 1789
Cdd:pfam12128  658 LFDEKQS----EKDKKNKALAERKDSANERLNSLEAqlkqlDKKHQAWLEEQKEQKREARtEKQAYWQVVEGALDAQLal 733
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1790 VDKSLVSRQEAKIRELETRLEFEKTQVKRL---ENLASRLKETMEKLTE-----ERDQRAAAENRE------KEQNKRLQ 1855
Cdd:pfam12128  734 LKAAIAARRSGAKAELKALETWYKRDLASLgvdPDVIAKLKREIRTLERkieriAVRRQEVLRYFDwyqetwLQRRPRLA 813
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|.
gi 1907082233 1856 RQLRDTKEEMSE----LARKEAEASRKKHELEMDLESLEAA 1892
Cdd:pfam12128  814 TQLSNIERAISElqqqLARLIADTKLRRAKLEMERKASEKQ 854
PRK01156 PRK01156
chromosome segregation protein; Provisional
1496-1970 1.13e-12

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 73.78  E-value: 1.13e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1496 SLEAELQDISSQESKDEAS---LAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMermrqthsKEMESRD 1572
Cdd:PRK01156   177 MLRAEISNIDYLEEKLKSSnleLENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSAL--------NELSSLE 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1573 EEVeearqscqkklKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRkDLKRTKALLADAQIMLDH 1652
Cdd:PRK01156   249 DMK-----------NRYESEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYIN-DYFKYKNDIENKKQILSN 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1653 LKNNAPSKREIAQLKNQLEE--SEFTcaaavkarkAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQ 1730
Cdd:PRK01156   317 IDAEINKYHAIIKKLSVLQKdyNDYI---------KKKSRYDDLNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIE 387
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1731 EDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSM-----------------VDKS 1793
Cdd:PRK01156   388 RMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMemlngqsvcpvcgttlgEEKS 467
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1794 L---------VSRQEAKIRELEtrlefekTQVKRLENLASRLKETMEKLTEERDQRAAAENRekeQNKRLQRQLRDTKEE 1864
Cdd:PRK01156   468 NhiinhynekKSRLEEKIREIE-------IEVKDIDEKIVDLKKRKEYLESEEINKSINEYN---KIESARADLEDIKIK 537
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1865 MSELARKEAEASRKKHELE-MDLESLEA------------------ANQSLQADLKLAFKRIGDLQAAIEDEMESDE--N 1923
Cdd:PRK01156   538 INELKDKHDKYEEIKNRYKsLKLEDLDSkrtswlnalavislidieTNRSRSNEIKKQLNDLESRLQEIEIGFPDDKsyI 617
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*..
gi 1907082233 1924 EDLINSLQDMVTKYQKKKNKLEGDSDVDSELEDRVDGVKSWLSKNKG 1970
Cdd:PRK01156   618 DKSIREIENEANNLNNKYNEIQENKILIEKLRGKIDNYKKQIAEIDS 664
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1554-1894 4.73e-12

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 71.69  E-value: 4.73e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1554 EMEMERMRQTH---SKEMESRD--EEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTlsDQVNQRDFES 1628
Cdd:pfam17380  295 KMEQERLRQEKeekAREVERRRklEEAEKARQAEMDRQAAIYAEQERMAMERERELERIRQEERKREL--ERIRQEEIAM 372
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1629 EKRLRKDLKRtkalladaqIMLDHLKNNAPSKREI-AQLKNQLEESEFTcaaavKARKAMEVEMEDLHLQIDDiakaktA 1707
Cdd:pfam17380  373 EISRMRELER---------LQMERQQKNERVRQELeAARKVKILEEERQ-----RKIQQQKVEMEQIRAEQEE------A 432
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1708 LEEQLSRLQREKNEIQNRLEEDqedmnELMKKHkaavaqasrdmaQMNDLQAQIEESNKEKQELqEKLQALQSQVEFLEQ 1787
Cdd:pfam17380  433 RQREVRRLEEERAREMERVRLE-----EQERQQ------------QVERLRQQEEERKRKKLEL-EKEKRDRKRAEEQRR 494
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1788 SMVDKSLVSRQEAKIRELETRLEFEKtqvkrlenlasRLKETMEKLTEERDQRAAAENREKEQN----KRLQRQLRDTKE 1863
Cdd:pfam17380  495 KILEKELEERKQAMIEEERKRKLLEK-----------EMEERQKAIYEEERRREAEEERRKQQEmeerRRIQEQMRKATE 563
                          330       340       350
                   ....*....|....*....|....*....|...
gi 1907082233 1864 EMSEL--ARKEAEASRKKHELEMDLESLEAANQ 1894
Cdd:pfam17380  564 ERSRLeaMEREREMMRQIVESEKARAEYEATTP 596
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1689-1850 8.34e-12

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 67.26  E-value: 8.34e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1689 VEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQ---EDMNELMKKHKAAVAQASRDMAQMNDLQAQIeESN 1765
Cdd:COG1579     10 LDLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKtelEDLEKEIKRLELEIEEVEARIKKYEEQLGNV-RNN 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1766 KEkqelqekLQALQSQVEFLEQsmvdksLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAEN 1845
Cdd:COG1579     89 KE-------YEALQKEIESLKR------RISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELE 155

                   ....*
gi 1907082233 1846 REKEQ 1850
Cdd:COG1579    156 AELEE 160
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1232-1848 8.79e-12

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 71.24  E-value: 8.79e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1232 EQIRNKDEE-IQQLRSKLEKVEKERNELRLSSD--RLETRISELTSELTDERNTGESASQLLDaETAErlrTEKEmkelQ 1308
Cdd:TIGR01612 1135 EEIKKKSENyIDEIKAQINDLEDVADKAISNDDpeEIEKKIENIVTKIDKKKNIYDEIKKLLN-EIAE---IEKD----K 1206
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1309 TQYDALKKQMEVMEMEVMEARLiraaeinGEVDDDDAGGEWRLK-YERAVREVDFTKKRlQQELEDKMEVEQQSRRQLEr 1387
Cdd:TIGR01612 1207 TSLEEVKGINLSYGKNLGKLFL-------EKIDEEKKKSEHMIKaMEAYIEDLDEIKEK-SPEIENEMGIEMDIKAEME- 1277
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1388 rLGDLQADSDESQRALQQLKKKcqrltaELQDtklhlegqqVRNHELEKKQRRF-DSELSQAHEETQREKLQREKLQREK 1466
Cdd:TIGR01612 1278 -TFNISHDDDKDHHIISKKHDE------NISD---------IREKSLKIIEDFSeESDINDIKKELQKNLLDAQKHNSDI 1341
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1467 DMLLAEAFSLKQQMEEKDL-----DIAGFTQKVvslEAELQDISSQESKDEASLAKVKKQLRDLEAKVKdQEEELDEQ-- 1539
Cdd:TIGR01612 1342 NLYLNEIANIYNILKLNKIkkiidEVKEYTKEI---EENNKNIKDELDKSEKLIKKIKDDINLEECKSK-IESTLDDKdi 1417
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1540 AGSIQMLEQAKLRLEMEmERMRQTHSKEMESRDEEV-------EEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELES 1612
Cdd:TIGR01612 1418 DECIKKIKELKNHILSE-ESNIDTYFKNADENNENVlllfkniEMADNKSQHILKIKKDNATNDHDFNINELKEHIDKSK 1496
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1613 KLSTLSDQvNQRDFESEKRLRKDLKRTKALLADAQIMLDhLKNN-APSKREIAQLKNQLEE--SEFTCAAAVKARKAMEV 1689
Cdd:TIGR01612 1497 GCKDEADK-NAKAIEKNKELFEQYKKDVTELLNKYSALA-IKNKfAKTKKDSEIIIKEIKDahKKFILEAEKSEQKIKEI 1574
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1690 EMEDLHLQiDDIAKA----KTALEEQLSrlqrekneIQNrLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESn 1765
Cdd:TIGR01612 1575 KKEKFRIE-DDAAKNdksnKAAIDIQLS--------LEN-FENKFLKISDIKKKINDCLKETESIEKKISSFSIDSQDT- 1643
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1766 kekqELQEKLQALQSQVEFLEqSMVDkslvsrQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAEN 1845
Cdd:TIGR01612 1644 ----ELKENGDNLNSLQEFLE-SLKD------QKKNIEDKKKELDELDSEIEKIEIDVDQHKKNYEIGIIEKIKEIAIAN 1712

                   ...
gi 1907082233 1846 REK 1848
Cdd:TIGR01612 1713 KEE 1715
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
1223-1869 9.22e-12

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 71.02  E-value: 9.22e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1223 PLIQVQLSeeqirNKDEEIQQLRSKLEKVEKERNELRLSSD-RLETRISELTSELTDERntgesasqlldaETAERLRTE 1301
Cdd:pfam12128  350 PSWQSELE-----NLEERLKALTGKHQDVTAKYNRRRSKIKeQNNRDIAGIKDKLAKIR------------EARDRQLAV 412
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1302 KEmKELQTQYDALKKQMEVMEMEVMEARL---IRAAEINGEVDDDDAGGEWRLKYERAVREVDftkkRLQQELEDKMEve 1378
Cdd:pfam12128  413 AE-DDLQALESELREQLEAGKLEFNEEEYrlkSRLGELKLRLNQATATPELLLQLENFDERIE----RAREEQEAANA-- 485
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1379 QQSRRQLERRLgdLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQrrfdselSQAHEETQREKLQ 1458
Cdd:pfam12128  486 EVERLQSELRQ--ARKRRDQASEALRQASRRLEERQSALDELELQLFPQAGTLLHFLRKE-------APDWEQSIGKVIS 556
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1459 REKLQREKDMLLAEAFSLKQQ-------MEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKD 1531
Cdd:pfam12128  557 PELLHRTDLDPEVWDGSVGGElnlygvkLDLKRIDVPEWAASEEELRERLDKAEEALQSAREKQAAAEEQLVQANGELEK 636
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1532 QEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKR-EL 1610
Cdd:pfam12128  637 ASREETFARTALKNARLDLRRLFDEKQSEKDKKNKALAERKDSANERLNSLEAQLKQLDKKHQAWLEEQKEQKREARtEK 716
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1611 ESKL----STLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLDHL----KNNAPSKREIAQLKNQLEESEFTCAAAVK 1682
Cdd:pfam12128  717 QAYWqvveGALDAQLALLKAAIAARRSGAKAELKALETWYKRDLASLgvdpDVIAKLKREIRTLERKIERIAVRRQEVLR 796
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1683 ARKAME----VEMEDLHLQIDDIAKAKTALEEQLSRLQ------REKNEIQNRLEEDQED-MNELMKKHKAAVAQASRDM 1751
Cdd:pfam12128  797 YFDWYQetwlQRRPRLATQLSNIERAISELQQQLARLIadtklrRAKLEMERKASEKQQVrLSENLRGLRCEMSKLATLK 876
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1752 AQMNDLQAQ--IEESNKEKQELQEKLQ----ALQSQVEFLEQSMVDKS---------LVSRQEAKIRELETRLEFEKTQV 1816
Cdd:pfam12128  877 EDANSEQAQgsIGERLAQLEDLKLKRDylseSVKKYVEHFKNVIADHSgsglaetweSLREEDHYQNDKGIRLLDYRKLV 956
                          650       660       670       680       690       700
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082233 1817 KRLENLASRLKETMEKLTEER------------DQRAAAENREKEQNKRLQRQLRDTK--EEMSELA 1869
Cdd:pfam12128  957 PYLEQWFDVRVPQSIMVLREQvsilgvdltefyDVLADFDRRIASFSRELQREVGEEAffEGVSESA 1023
46 PHA02562
endonuclease subunit; Provisional
1608-1871 9.59e-12

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 70.43  E-value: 9.59e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1608 RELESKLSTLSDQVN--------QRDFESEKRlrkdlKRTKALLADAQIMLDHLKNNAPS-KREIAQLKNQLEEseftca 1678
Cdd:PHA02562   177 RELNQQIQTLDMKIDhiqqqiktYNKNIEEQR-----KKNGENIARKQNKYDELVEEAKTiKAEIEELTDELLN------ 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1679 aavkarkaMEVEMEDLHLQIDDIAKAKTALEEQLSRLQR-----EKNEIQNRLEEDQEDMNELMKKHKAavaqasrdmaQ 1753
Cdd:PHA02562   246 --------LVMDIEDPSAALNKLNTAAAKIKSKIEQFQKvikmyEKGGVCPTCTQQISEGPDRITKIKD----------K 307
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1754 MNDLQAQIEESNKEKQELQEKLqalqsqVEFLEQSMvdkslvsrqeaKIRELETRLEFEKTQVKRLENLASRLKETMEKL 1833
Cdd:PHA02562   308 LKELQHSLEKLDTAIDELEEIM------DEFNEQSK-----------KLLELKNKISTNKQSLITLVDKAKKVKAAIEEL 370
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1907082233 1834 TEERDQRaaaenreKEQNKRLQRQLRDTKEEMSELARK 1871
Cdd:PHA02562   371 QAEFVDN-------AEELAKLQDELDKIVKTKSELVKE 401
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1226-1806 1.05e-11

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 70.92  E-value: 1.05e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1226 QVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGEsasqlLDAETAERLRTEKE-M 1304
Cdd:pfam15921  469 QLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVD-----LKLQELQHLKNEGDhL 543
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1305 KELQTQYDALKKQMEVMEMEVMEAR--LIRAAEINGEvDDDDAGGewrLKYERAVREVDFTKKRLQ-QELEDKMEVEQQS 1381
Cdd:pfam15921  544 RNVQTECEALKLQMAEKDKVIEILRqqIENMTQLVGQ-HGRTAGA---MQVEKAQLEKEINDRRLElQEFKILKDKKDAK 619
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1382 RRQLERRLGDLQADS-------DESQRALQQLKKKCQRLTAELQDTKLHLEG---------QQVRN--HELEKKQRRFDS 1443
Cdd:pfam15921  620 IRELEARVSDLELEKvklvnagSERLRAVKDIKQERDQLLNEVKTSRNELNSlsedyevlkRNFRNksEEMETTTNKLKM 699
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1444 ELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLE---------------------AELQ 1502
Cdd:pfam15921  700 QLKSAQSELEQTRNTLKSMEGSDGHAMKVAMGMQKQITAKRGQIDALQSKIQFLEeamtnankekhflkeeknklsQELS 779
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1503 DISSQESKDEASLAKVKKQLRDLEAKVKDQEEELD-------EQAGSIQMLEQAKLRLEM-------EMERMRQTHSKEM 1568
Cdd:pfam15921  780 TVATEKNKMAGELEVLRSQERRLKEKVANMEVALDkaslqfaECQDIIQRQEQESVRLKLqhtldvkELQGPGYTSNSSM 859
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1569 ESR-DEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALRE--KRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLAD 1645
Cdd:pfam15921  860 KPRlLQPASFTRTHSNVPSSQSTASFLSHHSRKTNALKEdpTRDLKQLLQELRSVINEEPTVQLSKAEDKGRAPSLGALD 939
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1646 AQIMLDHLKNNAPSK---REIAQLKNQLEESEFTCAAAVKARKAMevEMEDLHLQIDDIAKAKTALEEQLSR---LQREK 1719
Cdd:pfam15921  940 DRVRDCIIESSLRSDichSSSNSLQTEGSKSSETCSREPVLLHAG--ELEDPSSCFTFPSTASPSVKNSASRsfhSSPKK 1017
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1720 NEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQmnDLQAQ-IEESNKEKQELQEKLQALQSQVEFLE---QSMvdKSLV 1795
Cdd:pfam15921 1018 SPVHSLLTSSAEGSIGSSSQYRSAKTIHSPDSVK--DSQSLpIETTGKTCRKLQNRLESLQTLVEDLQlknQAM--SSMI 1093
                          650
                   ....*....|.
gi 1907082233 1796 SRQEAKIRELE 1806
Cdd:pfam15921 1094 RNQEKRIQKVK 1104
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1232-1727 1.16e-11

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 70.45  E-value: 1.16e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1232 EQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERntGESASQLLDAETAERLRTEKEMKELQTQy 1311
Cdd:PRK02224   251 EELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLL--AEAGLDDADAEAVEARREELEDRDEELR- 327
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1312 DALKKQMevmemevmearlIRAAEINGEVddddaggewrlkyERAVREVDftkkrlqqELEDKMEVEQQSRRQLERRLGD 1391
Cdd:PRK02224   328 DRLEECR------------VAAQAHNEEA-------------ESLREDAD--------DLEERAEELREEAAELESELEE 374
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1392 LQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEkkqrrfdSELSQAHEetqREKLQREKLQREKDMlLA 1471
Cdd:PRK02224   375 AREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELR-------EERDELRE---REAELEATLRTARER-VE 443
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1472 EAFSLK---------QQMEEKDL--DIAGFTQKVVSLEAELQDISSQESKDEA------SLAKVKKQLRDLEAKVK---- 1530
Cdd:PRK02224   444 EAEALLeagkcpecgQPVEGSPHveTIEEDRERVEELEAELEDLEEEVEEVEErleraeDLVEAEDRIERLEERREdlee 523
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1531 ---DQEEELDEQAGSIQMLEQAKLRLEMEMERMRQThSKEMESRDEEVEEARQSCQKKLKQMEVQLE------------E 1595
Cdd:PRK02224   524 liaERRETIEEKRERAEELRERAAELEAEAEEKREA-AAEAEEEAEEAREEVAELNSKLAELKERIEslerirtllaaiA 602
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1596 EYEDKQKALREKRElesKLSTLSDQvnQRDFESEKRLRKDlkrtkalladaqimldhlknnapskreiaQLKNQLEESEF 1675
Cdd:PRK02224   603 DAEDEIERLREKRE---ALAELNDE--RRERLAEKRERKR-----------------------------ELEAEFDEARI 648
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1676 TCAAAVKARKA-----MEVEMEDLHLQIDDIAKAKTALE---EQLSRLQREKNEIQNRLE 1727
Cdd:PRK02224   649 EEAREDKERAEeyleqVEEKLDELREERDDLQAEIGAVEnelEELEELRERREALENRVE 708
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
1375-1962 1.57e-11

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 70.25  E-value: 1.57e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1375 MEVEQQSRRQLERRLGDLQADsDESQRALQQLKKkcQRLTAELQDTKlHLEGQQVRNHELEKKQRRFDS------ELSQA 1448
Cdd:pfam12128  188 MHSKEGKFRDVKSMIVAILED-DGVVPPKSRLNR--QQVEHWIRDIQ-AIAGIMKIRPEFTKLQQEFNTlesaelRLSHL 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1449 HEETQREKLQREKLQREKDMLLAEafslkqqmeekdldiagFTQKVVSLEAELQdissqESKDEASLakvkkQLRDLEAK 1528
Cdd:pfam12128  264 HFGYKSDETLIASRQEERQETSAE-----------------LNQLLRTLDDQWK-----EKRDELNG-----ELSAADAA 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1529 VKDQEEEL---DEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEE-------YE 1598
Cdd:pfam12128  317 VAKDRSELealEDQHGAFLDADIETAAADQEQLPSWQSELENLEERLKALTGKHQDVTAKYNRRRSKIKEQnnrdiagIK 396
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1599 DKQKALREKRE------------LESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLDhlknnapskreiaqL 1666
Cdd:pfam12128  397 DKLAKIREARDrqlavaeddlqaLESELREQLEAGKLEFNEEEYRLKSRLGELKLRLNQATATPE--------------L 462
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1667 KNQLEESEFTCAAAVKARKAMEVEMEDLHlqiDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQ 1746
Cdd:pfam12128  463 LLQLENFDERIERAREEQEAANAEVERLQ---SELRQARKRRDQASEALRQASRRLEERQSALDELELQLFPQAGTLLHF 539
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1747 ASRDMAQMNDLQAQIEESNK-EKQELQEKLQALQSQVEF--------LEQSMVDKSLVSRQEAKIR--ELETRLEFEKTQ 1815
Cdd:pfam12128  540 LRKEAPDWEQSIGKVISPELlHRTDLDPEVWDGSVGGELnlygvkldLKRIDVPEWAASEEELRERldKAEEALQSAREK 619
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1816 VKRLENLASRLKETMEKLT-EERDQRAAAENREKEQnKRL---QRQLRDTKEEMSELARKEAEASRKK--HELEMDLESL 1889
Cdd:pfam12128  620 QAAAEEQLVQANGELEKASrEETFARTALKNARLDL-RRLfdeKQSEKDKKNKALAERKDSANERLNSleAQLKQLDKKH 698
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907082233 1890 EAANQSLQADLKLAfkRIGDLQAAIEDEMESDENEDLINslQDMVTKYQKKKNKLEG-DSDVDSELEDR-VDGVK 1962
Cdd:pfam12128  699 QAWLEEQKEQKREA--RTEKQAYWQVVEGALDAQLALLK--AAIAARRSGAKAELKAlETWYKRDLASLgVDPDV 769
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1379-1603 1.96e-11

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 68.25  E-value: 1.96e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1379 QQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQ 1458
Cdd:COG4942     19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAE 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1459 REKLQREKDMLLAEAFSLKQQMEEKDL-DIAGFTQKVVSLEAeLQDISSQeskDEASLAKVKKQLRDLEAKVKDQEEELD 1537
Cdd:COG4942     99 LEAQKEELAELLRALYRLGRQPPLALLlSPEDFLDAVRRLQY-LKYLAPA---RREQAEELRADLAELAALRAELEAERA 174
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082233 1538 EQAGSIQMLEQAKLRLEMEMERMRQT---HSKEMESRDEEVEEARQScQKKLKQMEVQLEEEYEDKQKA 1603
Cdd:COG4942    175 ELEALLAELEEERAALEALKAERQKLlarLEKELAELAAELAELQQE-AEELEALIARLEAEAAAAAER 242
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1239-1556 2.02e-11

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 70.10  E-value: 2.02e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1239 EEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTD-ERNTGE---SASQLLDAETAERLRTEKEMKELQTQYDAL 1314
Cdd:TIGR02169  674 AELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDaSRKIGEiekEIEQLEQEEEKLKERLEELEEDLSSLEQEI 753
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1315 -----KKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRlKYERAVREVDFTKKRLQ----------QELEDKMEVEQ 1379
Cdd:TIGR02169  754 envksELKELEARIEELEEDLHKLEEALNDLEARLSHSRIP-EIQAELSKLEEEVSRIEarlreieqklNRLTLEKEYLE 832
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1380 QSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQR 1459
Cdd:TIGR02169  833 KEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQI 912
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1460 EKLQREKDMLLAEAFSLKQQMEE-------------KDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLE 1526
Cdd:TIGR02169  913 EKKRKRLSELKAKLEALEEELSEiedpkgedeeipeEELSLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELK 992
                          330       340       350
                   ....*....|....*....|....*....|
gi 1907082233 1527 AKVKDQEEELDEQAGSIQMLEQAKLRLEME 1556
Cdd:TIGR02169  993 EKRAKLEEERKAILERIEEYEKKKREVFME 1022
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
1493-1899 2.29e-11

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 69.38  E-value: 2.29e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1493 KVVSLEAELQDISSQESKDEASLakvKKQLRDLEAKVKDQEEELDEQAgsiqmleQAKLRLEMEMERMRQTHSKEMESRD 1572
Cdd:pfam05557    3 ELIESKARLSQLQNEKKQMELEH---KRARIELEKKASALKRQLDRES-------DRNQELQKRIRLLEKREAEAEEALR 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1573 EEVEEARQsCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFEsekrlrkdLKRTKALLADAQIMLDH 1652
Cdd:pfam05557   73 EQAELNRL-KKKYLEALNKKLNEKESQLADAREVISCLKNELSELRRQIQRAELE--------LQSTNSELEELQERLDL 143
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1653 LKNNApskREIAQLKNQLEESEFTCAAAVKARKAMEVEMEdlhLQIDDIAKAKTALEEQLS--------RLQREKNEIQN 1724
Cdd:pfam05557  144 LKAKA---SEAEQLRQNLEKQQSSLAEAEQRIKELEFEIQ---SQEQDSEIVKNSKSELARipelekelERLREHNKHLN 217
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1725 RLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQ----------ALQSQVEFLEQSmvDKSL 1794
Cdd:pfam05557  218 ENIENKLLLKEEVEDLKRKLEREEKYREEAATLELEKEKLEQELQSWVKLAQdtglnlrspeDLSRRIEQLQQR--EIVL 295
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1795 VSRQ-----------------EAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERD-QRAAAENREKEQN-KRLQ 1855
Cdd:pfam05557  296 KEENssltssarqlekarrelEQELAQYLKKIEDLNKKLKRHKALVRRLQRRVLLLTKERDgYRAILESYDKELTmSNYS 375
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*
gi 1907082233 1856 RQLRDTKEEMSELARK-EAEASRKKHELEMDLESLEAANQSLQAD 1899
Cdd:pfam05557  376 PQLLERIEEAEDMTQKmQAHNEEMEAQLSVAEEELGGYKQQAQTL 420
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1239-1456 2.87e-11

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 67.87  E-value: 2.87e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1239 EEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQTQYDALKKQm 1318
Cdd:COG4942     20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAE- 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1319 eVMEMEVMEARLIRAAEINGEVD--------DDDAGGEWRLKYERAVREVDFTK----KRLQQELEDKMEVEQQSRRQLE 1386
Cdd:COG4942     99 -LEAQKEELAELLRALYRLGRQPplalllspEDFLDAVRRLQYLKYLAPARREQaeelRADLAELAALRAELEAERAELE 177
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1387 RRLGDLQadsdESQRALQQLKKKCQRLTAELQDTKlhlEGQQVRNHELEKKQRRFDSELSQAHEETQREK 1456
Cdd:COG4942    178 ALLAELE----EERAALEALKAERQKLLARLEKEL---AELAAELAELQQEAEELEALIARLEAEAAAAA 240
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
1229-1777 3.34e-11

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 68.61  E-value: 3.34e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1229 LSEEQIRNKDEEIQQLRSKLEKVEKERNElrlsSDRLETRISELTSELTDErnTGESASQLLDAETAERlRTEKEMKELQ 1308
Cdd:pfam05557   66 EAEEALREQAELNRLKKKYLEALNKKLNE----KESQLADAREVISCLKNE--LSELRRQIQRAELELQ-STNSELEELQ 138
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1309 TQYDALKKqmevmemevmearliRAAEINGEVDDDDAGGEWRLKYERAVREVDFtkkRLQQELEDKMEVEQQSRRQLerR 1388
Cdd:pfam05557  139 ERLDLLKA---------------KASEAEQLRQNLEKQQSSLAEAEQRIKELEF---EIQSQEQDSEIVKNSKSELA--R 198
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1389 LGDLQadsdesqRALQQLKKKCQRLTaELQDTKLHLEGQQvrnHELEKKQRRfdselsqahEETQREKLQreKLQREKDM 1468
Cdd:pfam05557  199 IPELE-------KELERLREHNKHLN-ENIENKLLLKEEV---EDLKRKLER---------EEKYREEAA--TLELEKEK 256
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1469 LLAEAFSLKQQMEEKDLDIA---GFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVK-------DQEEELDE 1538
Cdd:pfam05557  257 LEQELQSWVKLAQDTGLNLRspeDLSRRIEQLQQREIVLKEENSSLTSSARQLEKARRELEQELAqylkkieDLNKKLKR 336
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1539 QAGSIQMLEQAKLRLEMEMERMRQ------------THSKEMESRDEEVEEARQSCQKKLKQMEVQLEeeyedkqKALRE 1606
Cdd:pfam05557  337 HKALVRRLQRRVLLLTKERDGYRAilesydkeltmsNYSPQLLERIEEAEDMTQKMQAHNEEMEAQLS-------VAEEE 409
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1607 KRELESKLSTLSDQVnqrdfesekrlrkDLKRTKALLADAqimldhlknnAPSKREIAQLKNQLEESEFTCAAAVKARKA 1686
Cdd:pfam05557  410 LGGYKQQAQTLEREL-------------QALRQQESLADP----------SYSKEEVDSLRRKLETLELERQRLREQKNE 466
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1687 MEVEMED--------------LHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRdma 1752
Cdd:pfam05557  467 LEMELERrclqgdydpkktkvLHLSMNPAAEAYQQRKNQLEKLQAEIERLKRLLKKLEDDLEQVLRLPETTSTMNFK--- 543
                          570       580
                   ....*....|....*....|....*
gi 1907082233 1753 QMNDLQAQIEESNKEKQELQEKLQA 1777
Cdd:pfam05557  544 EVLDLRKELESAELKNQRLKEVFQA 568
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1228-1669 3.91e-11

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 68.99  E-value: 3.91e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1228 QLSEEQiRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETR-------ISELTSELTDERNTGESASQLLDAETAE-RLR 1299
Cdd:pfam15921  367 QFSQES-GNLDDQLQKLLADLHKREKELSLEKEQNKRLWDRdtgnsitIDHLRRELDDRNMEVQRLEALLKAMKSEcQGQ 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1300 TEKEMKELQTQYDALKKQMEVMEMEVMEARLIRAAeingeVDDDDAGGEWRLKYERAVREVDFTKKRLQQELEDKMEVEQ 1379
Cdd:pfam15921  446 MERQMAAIQGKNESLEKVSSLTAQLESTKEMLRKV-----VEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEIT 520
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1380 QSRRQLERRLGDLQADSDESQRaLQQLKKKCQRLTAEL--QDTKLHLEGQQVRNHelekkqrrfdSELSQAHEET----Q 1453
Cdd:pfam15921  521 KLRSRVDLKLQELQHLKNEGDH-LRNVQTECEALKLQMaeKDKVIEILRQQIENM----------TQLVGQHGRTagamQ 589
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1454 REKLQREKLQREKDMLLAEAFSLKqqmEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQE 1533
Cdd:pfam15921  590 VEKAQLEKEINDRRLELQEFKILK---DKKDAKIRELEARVSDLELEKVKLVNAGSERLRAVKDIKQERDQLLNEVKTSR 666
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1534 EELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQS-------------------------------C 1582
Cdd:pfam15921  667 NELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQTRNTlksmegsdghamkvamgmqkqitakrgqidaL 746
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1583 QKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKrLRKDLKRTKALLADAQIMLDHLKNNAPSKRE 1662
Cdd:pfam15921  747 QSKIQFLEEAMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAGELEV-LRSQERRLKEKVANMEVALDKASLQFAECQD 825

                   ....*..
gi 1907082233 1663 IAQLKNQ 1669
Cdd:pfam15921  826 IIQRQEQ 832
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1705-1913 4.76e-11

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 67.16  E-value: 4.76e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1705 KTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKL--QALQSQV 1782
Cdd:COG3883     18 IQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELgeRARALYR 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1783 EFLEQSMVDKSLVSRQeakIRELETRLEFektqvkrLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTK 1862
Cdd:COG3883     98 SGGSVSYLDVLLGSES---FSDFLDRLSA-------LSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELE 167
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1863 EEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAA 1913
Cdd:COG3883    168 AAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAA 218
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
1424-1810 5.67e-11

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 67.61  E-value: 5.67e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1424 LEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREkdmllaeafsLKQQMEEKDLDIAGFTQKVVSLEAELQD 1503
Cdd:pfam07888   36 LEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRRE----------LESRVAELKEELRQSREKHEELEEKYKE 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1504 ISsqeskdeASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQScQ 1583
Cdd:pfam07888  106 LS-------ASSEELSEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELERMKERAKKAGAQRKEEEAERKQL-Q 177
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1584 KKLKQMEVQLeeeyedkqkalrekRELESKLSTLSDQVNQRDfESEKRLRKDLKRTKALLADAQimlDHLKNNAPSKREI 1663
Cdd:pfam07888  178 AKLQQTEEEL--------------RSLSKEFQELRNSLAQRD-TQVLQLQDTITTLTQKLTTAH---RKEAENEALLEEL 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1664 AQLKNQLEESEFTCAA-------AVKARKAMEVEMEDLHLQIDD----IAKAKTALEEQLSRLQREKNEIQNRLEEDQED 1732
Cdd:pfam07888  240 RSLQERLNASERKVEGlgeelssMAAQRDRTQAELHQARLQAAQltlqLADASLALREGRARWAQERETLQQSAEADKDR 319
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1733 M----NELMKKHKAAVAQASR------DMAQMNDL-QAQIEESNKEKQELQEKLQALQSQVEFLeqsmvdksLVSRQE-- 1799
Cdd:pfam07888  320 IeklsAELQRLEERLQEERMEreklevELGREKDCnRVQLSESRRELQELKASLRVAQKEKEQL--------QAEKQEll 391
                          410
                   ....*....|.
gi 1907082233 1800 AKIRELETRLE 1810
Cdd:pfam07888  392 EYIRQLEQRLE 402
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1213-1644 8.00e-11

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 67.78  E-value: 8.00e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1213 PWWKLFTTVRPLiQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERntgESASQLLDA 1292
Cdd:PRK03918   359 ERHELYEEAKAK-KEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELK---KAIEELKKA 434
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1293 E----TAERLRTEKEMKELQTQYDA-LKKQMEVMEMEVMEARLIRAAEIngEVDDDDAGGEWRLKYERAVREVDFTKKRL 1367
Cdd:PRK03918   435 KgkcpVCGRELTEEHRKELLEEYTAeLKRIEKELKEIEEKERKLRKELR--ELEKVLKKESELIKLKELAEQLKELEEKL 512
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1368 QQELEDKMEVEQQSRRQLERRLGDLQADS---DESQRALQQLKKKCQRLTAELQDTKLHLEgqqvrnhELEKKQRRFDSE 1444
Cdd:PRK03918   513 KKYNLEELEKKAEEYEKLKEKLIKLKGEIkslKKELEKLEELKKKLAELEKKLDELEEELA-------ELLKELEELGFE 585
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1445 LSQAHEETQREklqREKLQREkdmlLAEAFSLKQQMEEKDldiagftQKVVSLEAELQDISSQESKDEASLAKVKKQLRD 1524
Cdd:PRK03918   586 SVEELEERLKE---LEPFYNE----YLELKDAEKELEREE-------KELKKLEEELDKAFEELAETEKRLEELRKELEE 651
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1525 LEAKVKDQEEEldeqagsiqMLEQAKLRLEMEMERMRqthskemeSRDEEVEEARQSCQKKLKqmevQLEEEYEDKQKAL 1604
Cdd:PRK03918   652 LEKKYSEEEYE---------ELREEYLELSRELAGLR--------AELEELEKRREEIKKTLE----KLKEELEEREKAK 710
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|
gi 1907082233 1605 REKRELESKLSTLsdqvnqrdfeseKRLRKDLKRTKALLA 1644
Cdd:PRK03918   711 KELEKLEKALERV------------EELREKVKKYKALLK 738
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1570-1937 1.19e-10

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 67.51  E-value: 1.19e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1570 SRDEEVEEARqscqkklkqmEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQrdfesekrLRKDLKRTKALLADAQIM 1649
Cdd:pfam01576    1 TRQEEEMQAK----------EEELQKVKERQQKAESELKELEKKHQQLCEEKNA--------LQEQLQAETELCAEAEEM 62
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1650 LDHLKNNAPSKREIAQ-LKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTAL-------------------- 1708
Cdd:pfam01576   63 RARLAARKQELEEILHeLESRLEEEEERSQQLQNEKKKMQQHIQDLEEQLDEEEAARQKLqlekvtteakikkleedill 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1709 -EEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQE--------------LQE 1773
Cdd:pfam01576  143 lEDQNSKLSKERKLLEERISEFTSNLAEEEEKAKSLSKLKNKHEAMISDLEERLKKEEKGRQElekakrklegestdLQE 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1774 KLQALQSQVEFLEQSMvdkslvSRQEAKIRELETRLEFEKTQ----VKRLENLASRLKETMEKLTEERDQRAAAEnreke 1849
Cdd:pfam01576  223 QIAELQAQIAELRAQL------AKKEEELQAALARLEEETAQknnaLKKIRELEAQISELQEDLESERAARNKAE----- 291
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1850 qnkrlqRQLRDTKEEMSELarkeaeasrkKHELEMDLESlEAANQSLQADLKlafKRIGDLQAAIEDEMESDENEdlins 1929
Cdd:pfam01576  292 ------KQRRDLGEELEAL----------KTELEDTLDT-TAAQQELRSKRE---QEVTELKKALEEETRSHEAQ----- 346

                   ....*...
gi 1907082233 1930 LQDMVTKY 1937
Cdd:pfam01576  347 LQEMRQKH 354
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
1691-1945 1.21e-10

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 64.93  E-value: 1.21e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1691 MEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQE 1770
Cdd:COG1340      3 TDELSSSLEELEEKIEELREEIEELKEKRDELNEELKELAEKRDELNAQVKELREEAQELREKRDELNEKVKELKEERDE 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1771 LQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFE-KTQVKRLEN---LASRLKETMEKLteerdQRAAAENR 1846
Cdd:COG1340     83 LNEKLNELREELDELRKELAELNKAGGSIDKLRKEIERLEWRqQTEVLSPEEekeLVEKIKELEKEL-----EKAKKALE 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1847 EKEQNKRLQRQLRDTKEEMSELARKEAEASRK--KHELEMDlESLEAANQsLQADLKLAFKRIGDLQAAIEDEMES-DEN 1923
Cdd:COG1340    158 KNEKLKELRAELKELRKEAEEIHKKIKELAEEaqELHEEMI-ELYKEADE-LRKEADELHKEIVEAQEKADELHEEiIEL 235
                          250       260
                   ....*....|....*....|..
gi 1907082233 1924 EDLINSLQDMVTKYQKKKNKLE 1945
Cdd:COG1340    236 QKELRELRKELKKLRKKQRALK 257
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
1351-1945 1.35e-10

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 67.38  E-value: 1.35e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1351 LKYERAVREVDFTKKRLQQ---ELEDKME-VEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQdtklhLEG 1426
Cdd:TIGR00606  265 MKLDNEIKALKSRKKQMEKdnsELELKMEkVFQGTDEQLNDLYHNHQRTVREKERELVDCQRELEKLNKERR-----LLN 339
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1427 QQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISS 1506
Cdd:TIGR00606  340 QEKTELLVEQGRLQLQADRHQEHIRARDSLIQSLATRLELDGFERGPFSERQIKNFHTLVIERQEDEAKTAAQLCADLQS 419
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1507 QESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKL 1586
Cdd:TIGR00606  420 KERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRILELDQELRKAERELSKAEKNSLTETL 499
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1587 KQMEVQLEEeyedkqkalrEKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMldhlKNNAPSKREIAQL 1666
Cdd:TIGR00606  500 KKEVKSLQN----------EKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIR----KIKSRHSDELTSL 565
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1667 ------KNQLEEsefTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRL------EEDQEDMN 1734
Cdd:TIGR00606  566 lgyfpnKKQLED---WLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLfdvcgsQDEESDLE 642
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1735 ELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQ-ALQSQVEFLEqsmvdksLVSRQEAKIRELETRLEFEK 1813
Cdd:TIGR00606  643 RLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQSCCPVCQrVFQTEAELQE-------FISDLQSKLRLAPDKLKSTE 715
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1814 TQVKRLEnlasRLKETMEKLTEERDQRAAAENRE-KEQNKRLQRQLRDTKEEMSELARKEA--EASRKKHELEMDLESLE 1890
Cdd:TIGR00606  716 SELKKKE----KRRDEMLGLAPGRQSIIDLKEKEiPELRNKLQKVNRDIQRLKNDIEEQETllGTIMPEEESAKVCLTDV 791
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082233 1891 AANQSLQADLKlafkrigDLQAAIEDEMESDENEDLINSLQDMVTKYQKKKNKLE 1945
Cdd:TIGR00606  792 TIMERFQMELK-------DVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQHELD 839
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
1546-1953 1.42e-10

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 66.69  E-value: 1.42e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1546 LEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEvQLEEEYEDKQKALREKRELesklstlsdqvNQRD 1625
Cdd:pfam05557   14 LQNEKKQMELEHKRARIELEKKASALKRQLDRESDRNQELQKRIR-LLEKREAEAEEALREQAEL-----------NRLK 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1626 FESEKRLRKDLKRTKALLADAQIMLDHLKNnapskrEIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAK 1705
Cdd:pfam05557   82 KKYLEALNKKLNEKESQLADAREVISCLKN------ELSELRRQIQRAELELQSTNSELEELQERLDLLKAKASEAEQLR 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1706 TALEEQLSRLQREKNEIQNrLE---EDQEDMNELMKKHKAAVAQasrdMAQMNDLQAQIEESNKEKQELQEKLQALQSQV 1782
Cdd:pfam05557  156 QNLEKQQSSLAEAEQRIKE-LEfeiQSQEQDSEIVKNSKSELAR----IPELEKELERLREHNKHLNENIENKLLLKEEV 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1783 EFLEQSMvdkslvSRQE---AKIRELETRLEFEKTQVKRLENLASRLKETM---EKLTEERDQRAAAENREKEQNKRLQR 1856
Cdd:pfam05557  231 EDLKRKL------EREEkyrEEAATLELEKEKLEQELQSWVKLAQDTGLNLrspEDLSRRIEQLQQREIVLKEENSSLTS 304
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1857 QLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIE---DEME----SDENEDLINS 1929
Cdd:pfam05557  305 SARQLEKARRELEQELAQYLKKIEDLNKKLKRHKALVRRLQRRVLLLTKERDGYRAILEsydKELTmsnySPQLLERIEE 384
                          410       420
                   ....*....|....*....|....
gi 1907082233 1930 LQDMVTKYQKKKNKLEGDSDVDSE 1953
Cdd:pfam05557  385 AEDMTQKMQAHNEEMEAQLSVAEE 408
46 PHA02562
endonuclease subunit; Provisional
1697-1963 1.64e-10

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 66.19  E-value: 1.64e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1697 QIDDIAKAKT-ALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAqasrdmaqmnDLQAQIEESNKEKQELQEKL 1775
Cdd:PHA02562   167 EMDKLNKDKIrELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIA----------RKQNKYDELVEEAKTIKAEI 236
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1776 QALQSQVEFLEQSMVDkslVSRQEAKIRELETRLefeKTQVKRLENLASRLKE------TMEKLTEERDQRAAAENREKE 1849
Cdd:PHA02562   237 EELTDELLNLVMDIED---PSAALNKLNTAAAKI---KSKIEQFQKVIKMYEKggvcptCTQQISEGPDRITKIKDKLKE 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1850 QNKRLqRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDemESDENEDLINS 1929
Cdd:PHA02562   311 LQHSL-EKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVD--NAEELAKLQDE 387
                          250       260       270
                   ....*....|....*....|....*....|....
gi 1907082233 1930 LQDMVTKYQKKKNKLEGDSDVDSELEDrvDGVKS 1963
Cdd:PHA02562   388 LDKIVKTKSELVKEKYHRGIVTDLLKD--SGIKA 419
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
1443-1831 1.98e-10

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 66.01  E-value: 1.98e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1443 SELSQAHEETQREKLQREKLQRE--KDmLLAEAFS-------LKQQMEEKDLDIAGFTQKVVS---LEAE--LQDISSQE 1508
Cdd:PRK04778   129 QELLESEEKNREEVEQLKDLYRElrKS-LLANRFSfgpaldeLEKQLENLEEEFSQFVELTESgdyVEAReiLDQLEEEL 207
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1509 SKDEASLAKVKKQLRDLEAKVKDQEEELdeQAGSIQMLEQ----AKLRLEMEMERMRQTHSK---EMESRD-EEVEEARQ 1580
Cdd:PRK04778   208 AALEQIMEEIPELLKELQTELPDQLQEL--KAGYRELVEEgyhlDHLDIEKEIQDLKEQIDEnlaLLEELDlDEAEEKNE 285
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1581 SCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEkRLR-------KDLKRTKALLADAQIM---- 1649
Cdd:PRK04778   286 EIQERIDQLYDILEREVKARKYVEKNSDTLPDFLEHAKEQNKELKEEID-RVKqsytlneSELESVRQLEKQLESLekqy 364
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1650 LDHLKNNAPSKREIAQLKNQLEESEftcaaavKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLE-- 1727
Cdd:PRK04778   365 DEITERIAEQEIAYSELQEELEEIL-------KQLEEIEKEQEKLSEMLQGLRKDELEAREKLERYRNKLHEIKRYLEks 437
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1728 ------EDQEDMNELMKKH-KAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSM----------- 1789
Cdd:PRK04778   438 nlpglpEDYLEMFFEVSDEiEALAEELEEKPINMEAVNRLLEEATEDVETLEEETEELVENATLTEQLIqyanryrsdne 517
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082233 1790 -VDKSLVS--------RQEAKIRELETRLE-FEKTQVKRLENLASRLKETME 1831
Cdd:PRK04778   518 eVAEALNEaerlfreyDYKAALEIIATALEkVEPGVTKRIEDSYEKEKETIR 569
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1573-1798 3.20e-10

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 64.40  E-value: 3.20e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1573 EEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKrLRKDLKRTKALLADAQIMLDH 1652
Cdd:COG4942     23 AEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAA-LEAELAELEKEIAELRAELEA 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1653 LKNNAPSKREIAQLKNQLEESEF-----TCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQ---N 1724
Cdd:COG4942    102 QKEELAELLRALYRLGRQPPLALllspeDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEallA 181
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1725 RLEEDQEDMNELMKKHKAAVAQASRDMAQmndLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKSLVSRQ 1798
Cdd:COG4942    182 ELEEERAALEALKAERQKLLARLEKELAE---LAAELAELQQEAEELEALIARLEAEAAAAAERTPAAGFAALK 252
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1226-1504 3.56e-10

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 65.86  E-value: 3.56e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1226 QVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAE-----TAERLRT 1300
Cdd:TIGR02169  724 EIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSripeiQAELSKL 803
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1301 EKEMKELQTQYDALKKQMEVMEMEVMEARliraAEINGEVDDDDaggEWRLKYERAVREVDFTKKRLQqELEDKMEVEQQ 1380
Cdd:TIGR02169  804 EEEVSRIEARLREIEQKLNRLTLEKEYLE----KEIQELQEQRI---DLKEQIKSIEKEIENLNGKKE-ELEEELEELEA 875
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1381 SRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQrEKLQRE 1460
Cdd:TIGR02169  876 ALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPE-EELSLE 954
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1461 KLQREKDMLLAEAFSLK-------QQMEEKDLDIAGFTQKVVSLEAELQDI 1504
Cdd:TIGR02169  955 DVQAELQRVEEEIRALEpvnmlaiQEYEEVLKRLDELKEKRAKLEEERKAI 1005
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1506-1945 3.59e-10

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 65.76  E-value: 3.59e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1506 SQESKDEASLAKVKKQLRDLEAKVKDQ--EEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEmeSRDEEVEEARQSCQ 1583
Cdd:TIGR00618  182 ALMEFAKKKSLHGKAELLTLRSQLLTLctPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYL--TQKREAQEEQLKKQ 259
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1584 KKLKQMEVQLEE------EYEDKQKALREKRELEsKLSTLSDQVNQRDFESEkRLRKDLKRTKALLADAQIMLDHLKNNA 1657
Cdd:TIGR00618  260 QLLKQLRARIEElraqeaVLEETQERINRARKAA-PLAAHIKAVTQIEQQAQ-RIHTELQSKMRSRAKLLMKRAAHVKQQ 337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1658 PSKREIAQLKNQL--EESEFTCAAAVKA--RKAMEVEMEDLHlQIDDIAKAKTALEEQ-------LSRLQREKNEIQNRL 1726
Cdd:TIGR00618  338 SSIEEQRRLLQTLhsQEIHIRDAHEVATsiREISCQQHTLTQ-HIHTLQQQKTTLTQKlqslckeLDILQREQATIDTRT 416
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1727 EEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSqvefLEQSMVDKSLVSRQEAKIRELE 1806
Cdd:TIGR00618  417 SAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKE----REQQLQTKEQIHLQETRKKAVV 492
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1807 -TRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQnKRLQRQLRDTKEEMSELARKEAEASRKKHELEMD 1885
Cdd:TIGR00618  493 lARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTY-AQLETSEEDVYHQLTSERKQRASLKEQMQEIQQS 571
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1886 LESLEAANQSLQADLKLAFKRIGDLQAAIedEMESDENEDLINSLQDMVTKYQKKKNKLE 1945
Cdd:TIGR00618  572 FSILTQCDNRSKEDIPNLQNITVRLQDLT--EKLSEAEDMLACEQHALLRKLQPEQDLQD 629
PRK01156 PRK01156
chromosome segregation protein; Provisional
1437-1989 3.63e-10

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 65.69  E-value: 3.63e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1437 KQRRFDSELSQAHEETQR---EKLQREKLQREKDMLLAEAFSLKQQMEekdlDIAGFTQKVVSLEAELQDISSQESKDEA 1513
Cdd:PRK01156   136 GQGEMDSLISGDPAQRKKildEILEINSLERNYDKLKDVIDMLRAEIS----NIDYLEEKLKSSNLELENIKKQIADDEK 211
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1514 SLAKVKKQLRDLEAKVKDQEEE---LDEQAGSIQMLEQAKLRLEME---------MERMRQTHSKEMESRDEEVEEARQS 1581
Cdd:PRK01156   212 SHSITLKEIERLSIEYNNAMDDynnLKSALNELSSLEDMKNRYESEiktaesdlsMELEKNNYYKELEERHMKIINDPVY 291
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1582 CQKKLKQMEVQLEEEYEDKQKALRE-KRELES------KLSTLsdQVNQRDFESEKRLRKDLKRTKALLADaqimlDHLK 1654
Cdd:PRK01156   292 KNRNYINDYFKYKNDIENKKQILSNiDAEINKyhaiikKLSVL--QKDYNDYIKKKSRYDDLNNQILELEG-----YEMD 364
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1655 NNApSKREIAQLKNQLEE----SEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQ 1730
Cdd:PRK01156   365 YNS-YLKSIESLKKKIEEysknIERMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELS 443
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1731 EDMNELMKKHKAAVAQASRDMAQMNDLqaqIEESNKEKQELQEKLQALQSQVEFLEQSMVD-KSLVSRQEA-KIRELET- 1807
Cdd:PRK01156   444 RNMEMLNGQSVCPVCGTTLGEEKSNHI---INHYNEKKSRLEEKIREIEIEVKDIDEKIVDlKKRKEYLESeEINKSINe 520
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1808 --RLEFEKTQVKRLENLASRLKETMEKLT-----------EERDQR----------------AAAENREKEQNKRLQRQL 1858
Cdd:PRK01156   521 ynKIESARADLEDIKIKINELKDKHDKYEeiknrykslklEDLDSKrtswlnalavislidiETNRSRSNEIKKQLNDLE 600
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1859 RDTKEEMSELARKEA--EASRKKHELEMD--------LESLEAANQSLQADLKLAFKRIGDLQAAIEDEME-----SDEN 1923
Cdd:PRK01156   601 SRLQEIEIGFPDDKSyiDKSIREIENEANnlnnkyneIQENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEitsriNDIE 680
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082233 1924 EDLINS---LQDMVTKYQKKKNKLEGDSDVDSELEDRVDGVKSWLSKNKGPSKAPSDdgsLKSSRTALN 1989
Cdd:PRK01156   681 DNLKKSrkaLDDAKANRARLESTIEILRTRINELSDRINDINETLESMKKIKKAIGD---LKRLREAFD 746
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1692-1897 3.69e-10

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 65.42  E-value: 3.69e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1692 EDLHLQIDDIAKAKTALEEQLsrlqrekNEIQNRLEEDQEDMNELMKKHKAAVAQAsrdmaQMNDLQAQIEESNKEKQEL 1771
Cdd:COG3206    164 QNLELRREEARKALEFLEEQL-------PELRKELEEAEAALEEFRQKNGLVDLSE-----EAKLLLQQLSELESQLAEA 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1772 QEKLQALQSQVEFLEQSM----------VDKSLVSRQEAKIRELETRLEFEKTQV-----------KRLENLASRLKETM 1830
Cdd:COG3206    232 RAELAEAEARLAALRAQLgsgpdalpelLQSPVIQQLRAQLAELEAELAELSARYtpnhpdvialrAQIAALRAQLQQEA 311
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082233 1831 EKLTEE-RDQRAAAENREKEqnkrLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQ 1897
Cdd:COG3206    312 QRILASlEAELEALQAREAS----LQAQLAQLEARLAELPELEAELRRLEREVEVARELYESLLQRLE 375
PRK01156 PRK01156
chromosome segregation protein; Provisional
1231-1773 6.40e-10

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 64.92  E-value: 6.40e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1231 EEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRIS-------ELTSELTDERNTGESASQLL---DAETAERLRT 1300
Cdd:PRK01156   248 EDMKNRYESEIKTAESDLSMELEKNNYYKELEERHMKIINdpvyknrNYINDYFKYKNDIENKKQILsniDAEINKYHAI 327
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1301 EKEMKELQTQYDA-LKKQMevmemevmearliRAAEINGEVDDDDaggEWRLKYERAVREVDFTKKRLQQELEDKMEVEQ 1379
Cdd:PRK01156   328 IKKLSVLQKDYNDyIKKKS-------------RYDDLNNQILELE---GYEMDYNSYLKSIESLKKKIEEYSKNIERMSA 391
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1380 QSRRQLERRLGD---LQADSDESQRALQQLKKKCQRLTA----------ELQDTKLHLEGQQV----RNHELEKKQRR-- 1440
Cdd:PRK01156   392 FISEILKIQEIDpdaIKKELNEINVKLQDISSKVSSLNQriralrenldELSRNMEMLNGQSVcpvcGTTLGEEKSNHii 471
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1441 --FDSELSQAHEETQREKLQREKLQREKDMLLaeafSLKQQMEEKDLD-IAGFTQKVVSLEAELQDissqeskDEASLAK 1517
Cdd:PRK01156   472 nhYNEKKSRLEEKIREIEIEVKDIDEKIVDLK----KRKEYLESEEINkSINEYNKIESARADLED-------IKIKINE 540
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1518 VKKQlrdleakvKDQEEELDEQAGSIQmLEQAKLRLEMEMERMRQTHSKEME---SRDEEVEEARQSCQKKLKQMEVQLE 1594
Cdd:PRK01156   541 LKDK--------HDKYEEIKNRYKSLK-LEDLDSKRTSWLNALAVISLIDIEtnrSRSNEIKKQLNDLESRLQEIEIGFP 611
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1595 EEYEDKQKALrekRELESKLSTLSDQVNQrdFESEKRLRKDLKRTkalladaqimLDHLKNNAPSKREIAQLKNQLeese 1674
Cdd:PRK01156   612 DDKSYIDKSI---REIENEANNLNNKYNE--IQENKILIEKLRGK----------IDNYKKQIAEIDSIIPDLKEI---- 672
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1675 ftcaaAVKARKaMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMnELMKKHKAAVAQAS--RDMA 1752
Cdd:PRK01156   673 -----TSRIND-IEDNLKKSRKALDDAKANRARLESTIEILRTRINELSDRINDINETL-ESMKKIKKAIGDLKrlREAF 745
                          570       580
                   ....*....|....*....|....*..
gi 1907082233 1753 QMNDLQAQIEES------NKEKQELQE 1773
Cdd:PRK01156   746 DKSGVPAMIRKSasqamtSLTRKYLFE 772
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1231-1676 7.80e-10

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 64.55  E-value: 7.80e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1231 EEQIRNKD-EEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDE-----------RNTGESASQLLDAETAERL 1298
Cdd:COG4913    329 EAQIRGNGgDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPASaeefaalraeaAALLEALEEELEALEEALA 408
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1299 RTEKEMKELQTQYDALKKQMEV---------MEMEVMEARLIRAAEING----------EVDDDDAggEWRL-------- 1351
Cdd:COG4913    409 EAEAALRDLRRELRELEAEIASlerrksnipARLLALRDALAEALGLDEaelpfvgeliEVRPEEE--RWRGaiervlgg 486
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1352 ---------KYERAVRE----------VDFTKKRLQQELEDKMEVEQQS----------------RRQLERRLGDLQADS 1396
Cdd:COG4913    487 faltllvppEHYAAALRwvnrlhlrgrLVYERVRTGLPDPERPRLDPDSlagkldfkphpfrawlEAELGRRFDYVCVDS 566
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1397 desqraLQQLKKKCQRLTAELQ----------DTKLHLEGQQV-------RNHELEKKQRRFDSELSQAHEETQREKLQR 1459
Cdd:COG4913    567 ------PEELRRHPRAITRAGQvkgngtrhekDDRRRIRSRYVlgfdnraKLAALEAELAELEEELAEAEERLEALEAEL 640
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1460 EKLQRekdmlLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDIssqeSKDEASLAKVKKQLRDLEAKVKDQEEELDEQ 1539
Cdd:COG4913    641 DALQE-----RREALQRLAEYSWDEIDVASAEREIAELEAELERL----DASSDDLAALEEQLEELEAELEELEEELDEL 711
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1540 AGSIQMLEQAKLRLEMEMERMRQTHSkemESRDEEVEEARQSCQKKLKQmeVQLEEEYEDKQKALREKRE-LESKLSTLS 1618
Cdd:COG4913    712 KGEIGRLEKELEQAEEELDELQDRLE---AAEDLARLELRALLEERFAA--ALGDAVERELRENLEERIDaLRARLNRAE 786
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082233 1619 DQVN------QRDFESEKRlrkDLKRTKALLADAQIMLDHLKNNA-PSKRE-IAQLKNQLEESEFT 1676
Cdd:COG4913    787 EELEramrafNREWPAETA---DLDADLESLPEYLALLDRLEEDGlPEYEErFKELLNENSIEFVA 849
PRK01156 PRK01156
chromosome segregation protein; Provisional
1231-1832 1.12e-09

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 64.15  E-value: 1.12e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1231 EEQIRNKDEEIQQLRSKLEKVEKernELRLSSDRLEtRISELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQTq 1310
Cdd:PRK01156   189 EEKLKSSNLELENIKKQIADDEK---SHSITLKEIE-RLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYESEIKTAES- 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1311 yDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYeravrevDFTKKRlqqELEDKMEVEQQSRRQLERRLG 1390
Cdd:PRK01156   264 -DLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKN-------DIENKK---QILSNIDAEINKYHAIIKKLS 332
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1391 DLQADSDEsqraLQQLKKKCQRLTAELQDTKLHLEGQQ--VRNHELEKKQRRfdselsqahEETQREKLQREKLQREKDM 1468
Cdd:PRK01156   333 VLQKDYND----YIKKKSRYDDLNNQILELEGYEMDYNsyLKSIESLKKKIE---------EYSKNIERMSAFISEILKI 399
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1469 LLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLR------DL-EAKVKDQEEELDEQAG 1541
Cdd:PRK01156   400 QEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMEMLNGQSVcpvcgtTLgEEKSNHIINHYNEKKS 479
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1542 SiqmLEQAKLRLEMEMERM--RQTHSKEMESR--DEEVEEARQScQKKLKQMEVQLeEEYEDKQKALREKrelESKLSTL 1617
Cdd:PRK01156   480 R---LEEKIREIEIEVKDIdeKIVDLKKRKEYleSEEINKSINE-YNKIESARADL-EDIKIKINELKDK---HDKYEEI 551
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1618 SDQVNQRDFESekrlrKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEeseftcaaavKARKAMEVEMEDLHLQ 1697
Cdd:PRK01156   552 KNRYKSLKLED-----LDSKRTSWLNALAVISLIDIETNRSRSNEIKKQLNDLE----------SRLQEIEIGFPDDKSY 616
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1698 IDDIAKaktALEEQLSRLQREKNEIQNrLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQ- 1776
Cdd:PRK01156   617 IDKSIR---EIENEANNLNNKYNEIQE-NKILIEKLRGKIDNYKKQIAEIDSIIPDLKEITSRINDIEDNLKKSRKALDd 692
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082233 1777 ALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLaSRLKETMEK 1832
Cdd:PRK01156   693 AKANRARLESTIEILRTRINELSDRINDINETLESMKKIKKAIGDL-KRLREAFDK 747
mukB PRK04863
chromosome partition protein MukB;
1355-1903 1.13e-09

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 64.21  E-value: 1.13e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1355 RAVREVDFTKKRLQQELEDKMEVEQQSRRQLERR--LGDLQADSDES----QRALQQLKKKCQRLTA------ELQDTKL 1422
Cdd:PRK04863   534 RAERLLAEFCKRLGKNLDDEDELEQLQEELEARLesLSESVSEARERrmalRQQLEQLQARIQRLAArapawlAAQDALA 613
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1423 HLEGQQvrNHELEKKQRRfdSELSQAHEETQRE-KLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKV--VSLeA 1499
Cdd:PRK04863   614 RLREQS--GEEFEDSQDV--TEYMQQLLERERElTVERDELAARKQALDEEIERLSQPGGSEDPRLNALAERFggVLL-S 688
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1500 EL-QDISSQESKD-EASLAKVKKQL--RDLEAkVKDQEEELDEQAGSIQMLEQAKLRL--------EMEMERMRQTHSKE 1567
Cdd:PRK04863   689 EIyDDVSLEDAPYfSALYGPARHAIvvPDLSD-AAEQLAGLEDCPEDLYLIEGDPDSFddsvfsveELEKAVVVKIADRQ 767
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1568 ME-SRDEEV----EEARQscqKKLKQMEVQLEEEYEDKQKALREKREL-----------------------ESKLSTLSD 1619
Cdd:PRK04863   768 WRySRFPEVplfgRAARE---KRIEQLRAEREELAERYATLSFDVQKLqrlhqafsrfigshlavafeadpEAELRQLNR 844
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1620 QVNQR-----DFES-EKRLRKDLKRTKALLADAQIMLDHLKNNAPSK--REIAQLKNQLEESEFtcAAAVKARKAMEVEm 1691
Cdd:PRK04863   845 RRVELeralaDHESqEQQQRSQLEQAKEGLSALNRLLPRLNLLADETlaDRVEEIREQLDEAEE--AKRFVQQHGNALA- 921
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1692 edlhlQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQE---DMNELM-KKHKAAVAQASRDMAQMNDLQAQIEESNKE 1767
Cdd:PRK04863   922 -----QLEPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQqafALTEVVqRRAHFSYEDAAEMLAKNSDLNEKLRQRLEQ 996
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1768 KQELQEKL--QALQSQVEFLEQSMVDKSLVSRQEAKireLETRLEFEKtqvkRLENLASRLKETME-KLTEERDQRAAAE 1844
Cdd:PRK04863   997 AEQERTRAreQLRQAQAQLAQYNQVLASLKSSYDAK---RQMLQELKQ----ELQDLGVPADSGAEeRARARRDELHARL 1069
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082233 1845 NREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEmdlESLEAANQSLQADLKLA 1903
Cdd:PRK04863  1070 SANRSRRNQLEKQLTFCEAEMDNLTKKLRKLERDYHEMR---EQVVNAKAGWCAVLRLV 1125
EzrA pfam06160
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ...
1432-1790 1.43e-09

Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.


Pssm-ID: 428797 [Multi-domain]  Cd Length: 542  Bit Score: 63.33  E-value: 1.43e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1432 HELEKKQRRFDSELSQ--AHEETQREKLQ--REKLQREKDMLLAEAFS-------LKQQMEEKDLDIAGFTQKVVS---L 1497
Cdd:pfam06160   96 DDIEEDIKQILEELDEllESEEKNREEVEelKDKYRELRKTLLANRFSygpaideLEKQLAEIEEEFSQFEELTESgdyL 175
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1498 EAE--LQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELdeQAGSIQMLEQ----AKLRLEMEMERMRQTHSKEMES- 1570
Cdd:pfam06160  176 EARevLEKLEEETDALEELMEDIPPLYEELKTELPDQLEEL--KEGYREMEEEgyalEHLNVDKEIQQLEEQLEENLALl 253
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1571 ---RDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQ----------------------VNQRD 1625
Cdd:pfam06160  254 enlELDEAEEALEEIEERIDQLYDLLEKEVDAKKYVEKNLPEIEDYLEHAEEQnkelkeelervqqsytlnenelERVRG 333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1626 FESE-KRLRKDLKRTKALLADAQI----MLDHLKnnapskreiaQLKNQLEEseftcaaavkarkaMEVEMEDLHLQIDD 1700
Cdd:pfam06160  334 LEKQlEELEKRYDEIVERLEEKEVayseLQEELE----------EILEQLEE--------------IEEEQEEFKESLQS 389
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1701 IAKAKTALEEQLSRLQREKNEIQNRLEEDQ-----EDMNELMKKHKAAVAQASRDMAQ----MNDLQAQIEESNKEKQEL 1771
Cdd:pfam06160  390 LRKDELEAREKLDEFKLELREIKRLVEKSNlpglpESYLDYFFDVSDEIEDLADELNEvplnMDEVNRLLDEAQDDVDTL 469
                          410
                   ....*....|....*....
gi 1907082233 1772 QEKLQALQSQVEFLEQSMV 1790
Cdd:pfam06160  470 YEKTEELIDNATLAEQLIQ 488
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1592-1979 1.53e-09

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 62.61  E-value: 1.53e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1592 QLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQimldhlknnapskREIAQLKNQLE 1671
Cdd:COG4372      3 RLGEKVGKARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAR-------------EELEQLEEELE 69
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1672 ESEftcaaavKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDM 1751
Cdd:COG4372     70 QAR-------SELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQ 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1752 AQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKETME 1831
Cdd:COG4372    143 SEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLE 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1832 KLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQ 1911
Cdd:COG4372    223 AKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLL 302
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082233 1912 AAIEDEMESDENEDLINSLQDMVTKYQKKKNKLEGDSDVDSELEDRVDGVKSWLSKNKGPSKAPSDDG 1979
Cdd:COG4372    303 NLAALSLIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVGLLDNDVLELLSKGAEAGVADG 370
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1155-1571 1.53e-09

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 63.25  E-value: 1.53e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1155 LARLEEQRDEQTSRHLTLFQAacRGYLARQHFKKRKIQDLAIRCVQKNIKKNKGVKDWPWWKLFTTVRP-LIQVQLSEEQ 1233
Cdd:COG4717     73 LKELEEELKEAEEKEEEYAEL--QEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAeLAELPERLEE 150
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1234 IRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELT-SELTDERNTGESASQLLDAETAERLRTEKEMKELQTQYD 1312
Cdd:COG4717    151 LEERLEELRELEEELEELEAELAELQEELEELLEQLSLATeEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELE 230
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1313 ALKKQMEVMEMEVMEARLIRAAEINGEV------DDDDAGGEWRLK---------------YERAVREVDFTKKRLQQEL 1371
Cdd:COG4717    231 QLENELEAAALEERLKEARLLLLIAAALlallglGGSLLSLILTIAgvlflvlgllallflLLAREKASLGKEAEELQAL 310
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1372 EDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHE---LEKKQRRFDSELSQA 1448
Cdd:COG4717    311 PALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIaalLAEAGVEDEEELRAA 390
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1449 HEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAgfTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLE-- 1526
Cdd:COG4717    391 LEQAEEYQELKEELEELEEQLEELLGELEELLEALDEEEL--EEELEELEEELEELEEELEELREELAELEAELEQLEed 468
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082233 1527 -------AKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESR 1571
Cdd:COG4717    469 gelaellQELEELKAELRELAEEWAALKLALELLEEAREEYREERLPPVLER 520
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
1359-1859 1.64e-09

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 63.22  E-value: 1.64e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1359 EVDFTKKRLQQELEDKMEVEQQSRRQLerrlgdlqadsDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQ 1438
Cdd:pfam05557    3 ELIESKARLSQLQNEKKQMELEHKRAR-----------IELEKKASALKRQLDRESDRNQELQKRIRLLEKREAEAEEAL 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1439 RrfdsELSQAHEETQREKLQREKLQREKDMLLAEA----FSLKQQMEEKDLDIAGFTQKVVSLEAELQDIS--------- 1505
Cdd:pfam05557   72 R----EQAELNRLKKKYLEALNKKLNEKESQLADAreviSCLKNELSELRRQIQRAELELQSTNSELEELQerldllkak 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1506 -SQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRL----EMEME----RMRQTHSKEMESRDEEVE 1576
Cdd:pfam05557  148 aSEAEQLRQNLEKQQSSLAEAEQRIKELEFEIQSQEQDSEIVKNSKSELaripELEKElerlREHNKHLNENIENKLLLK 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1577 EARQSCQKKLKQME------VQLEEEYEDKQKAL---------------------REKRELESKLSTLSDQV-------- 1621
Cdd:pfam05557  228 EEVEDLKRKLEREEkyreeaATLELEKEKLEQELqswvklaqdtglnlrspedlsRRIEQLQQREIVLKEENssltssar 307
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1622 ----NQRDFESEKR--------LRKDLKRTKALLADAQimldhlKNNAPSKREIAQLKNQLE--ESEFTCAAAVKARKAM 1687
Cdd:pfam05557  308 qlekARRELEQELAqylkkiedLNKKLKRHKALVRRLQ------RRVLLLTKERDGYRAILEsyDKELTMSNYSPQLLER 381
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1688 EVEMEDLhlqIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNelMKKHKAAVAQASRDMAQMNDLQAQIEESNKE 1767
Cdd:pfam05557  382 IEEAEDM---TQKMQAHNEEMEAQLSVAEEELGGYKQQAQTLERELQ--ALRQQESLADPSYSKEEVDSLRRKLETLELE 456
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1768 KQELQEKLQALQSQVEFLE-QSMVDKS------LVSRQEAKIRELEtrlefeKTQVKRLENLASRLKETMEKLTEERDQR 1840
Cdd:pfam05557  457 RQRLREQKNELEMELERRClQGDYDPKktkvlhLSMNPAAEAYQQR------KNQLEKLQAEIERLKRLLKKLEDDLEQV 530
                          570
                   ....*....|....*....
gi 1907082233 1841 AAAENREKEQNKRLQRQLR 1859
Cdd:pfam05557  531 LRLPETTSTMNFKEVLDLR 549
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1661-1842 2.49e-09

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 59.94  E-value: 2.49e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1661 REIAQLKNQLEESEftcaaavKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELmkkh 1740
Cdd:COG1579     17 SELDRLEHRLKELP-------AELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNV---- 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1741 kaavaqasRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSmvdkslVSRQEAKIRELETRLEFEKtqvKRLE 1820
Cdd:COG1579     86 --------RNNKEYEALQKEIESLKRRISDLEDEILELMERIEELEEE------LAELEAELAELEAELEEKK---AELD 148
                          170       180
                   ....*....|....*....|..
gi 1907082233 1821 NLASRLKETMEKLTEERDQRAA 1842
Cdd:COG1579    149 EELAELEAELEELEAEREELAA 170
CCDC22 pfam05667
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 ...
1367-1781 5.38e-09

Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 (CCDC22) is involved in regulation of NF-kappa-B signalling; the function may involve association with COMMD8 and a CUL1-dependent E3 ubiquitin ligase complex. It is part of the OMMD/CCDC22/CCDC93 (CCC) complex, which interacts with the multisubunit WASH complex required for endosomal deposition of F-actin and cargo trafficking in conjunction with the retromer. This entry also includes CCDC22 homologs from animals and plants.


Pssm-ID: 461708 [Multi-domain]  Cd Length: 600  Bit Score: 61.58  E-value: 5.38e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1367 LQQELEDKMEVEQQSrrqLERRLGDLQADSDESQRALQQLKkkcQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELS 1446
Cdd:pfam05667  216 LAAAQEWEEEWNSQG---LASRLTPEEYRKRKRTKLLKRIA---EQLRSAALAGTEATSGASRSAQDLAELLSSFSGSST 289
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1447 QAHEETQREKLQR-EKLQREKDMLLAEAFSLKQQMEEKDLDIAGfTQKVVSLEAELQDISSqeskdeaSLAKVKKQLRDL 1525
Cdd:pfam05667  290 TDTGLTKGSRFTHtEKLQFTNEAPAATSSPPTKVETEEELQQQR-EEELEELQEQLEDLES-------SIQELEKEIKKL 361
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1526 EAKVKDQEEELDEQagsiqmlEQAKLRLEMEMERMRQTHsKEMESRDEEVEEARQSCQKKLKQMeVQLEEEYEDKQKAL- 1604
Cdd:pfam05667  362 ESSIKQVEEELEEL-------KEQNEELEKQYKVKKKTL-DLLPDAEENIAKLQALVDASAQRL-VELAGQWEKHRVPLi 432
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1605 REKRELESKLSTlsdqvnqRDFESEKRLR--KDLK-RTKALLADAQimldhlknnapSKRE-IAQLKNQLEE-------S 1673
Cdd:pfam05667  433 EEYRALKEAKSN-------KEDESQRKLEeiKELReKIKEVAEEAK-----------QKEElYKQLVAEYERlpkdvsrS 494
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1674 EFTcaaavkaRKAMEVeMEDLHLQIDDIAKaktALEEQLSrLQREKNEIQNRLEEDQEDMNELM---KKHKAAVAQASRD 1750
Cdd:pfam05667  495 AYT-------RRILEI-VKNIKKQKEEITK---ILSDTKS-LQKEINSLTGKLDRTFTVTDELVfkdAKKDESVRKAYKY 562
                          410       420       430
                   ....*....|....*....|....*....|....*...
gi 1907082233 1751 MAQMNDLQAQ----IEESNK---EKQELQEKLQALQSQ 1781
Cdd:pfam05667  563 LAALHENCEQliqtVEETGTimrEIRDLEEQIETESGK 600
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1258-1787 5.58e-09

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 61.89  E-value: 5.58e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1258 LRLSSDR------LETRI-----SELTSELTD---ERNTG-ESASQLLDAETAERLRTEKEMKELQTQYDaLKKQMEVME 1322
Cdd:COG3096    187 LRSSSDRskfyrlIEASLyggisSAITRSLRDyllPENSGvRKAFQDMEAALRENRMTLEAIRVTQSDRD-LFKHLITEA 265
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1323 MEVMEARLIRAAEingevddddaggEWRLKYERAV--REVDFTKKRLQQELEDKM-----EVEQQSRRQ--LERrlgDLQ 1393
Cdd:COG3096    266 TNYVAADYMRHAN------------ERRELSERALelRRELFGARRQLAEEQYRLvemarELEELSAREsdLEQ---DYQ 330
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1394 ADSDESQRALQ--QLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDML-- 1469
Cdd:COG3096    331 AASDHLNLVQTalRQQEKIERYQEDLEELTERLEEQEEVVEEAAEQLAEAEARLEAAEEEVDSLKSQLADYQQALDVQqt 410
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1470 -----------LAEAfslKQQMEEKDLDIAGFTQKVVSLEAELQDISSQeskdeaslakvkkqLRDLEAKVKDQEEELDE 1538
Cdd:COG3096    411 raiqyqqavqaLEKA---RALCGLPDLTPENAEDYLAAFRAKEQQATEE--------------VLELEQKLSVADAARRQ 473
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1539 QAGSIQMLEqaklRLEMEMERmrqthSKEMESRDEEVEEARQscQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLS 1618
Cdd:COG3096    474 FEKAYELVC----KIAGEVER-----SQAWQTARELLRRYRS--QQALAQRLQQLRAQLAELEQRLRQQQNAERLLEEFC 542
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1619 dqvnqrdfeseKRLRKDLkrtkallaDAQIMLDHLKnnapskreiAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQI 1698
Cdd:COG3096    543 -----------QRIGQQL--------DAAEELEELL---------AELEAQLEELEEQAAEAVEQRSELRQQLEQLRARI 594
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1699 D---DIAKAKTALEEQLSRLQREKNEiqnrleedqedmnelmkkhkaAVAQASRDMAQMNDLQAQIEESNKEKQELQEKL 1775
Cdd:COG3096    595 KelaARAPAWLAAQDALERLREQSGE---------------------ALADSQEVTAAMQQLLEREREATVERDELAARK 653
                          570
                   ....*....|..
gi 1907082233 1776 QALQSQVEFLEQ 1787
Cdd:COG3096    654 QALESQIERLSQ 665
CCCAP pfam15964
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in ...
1363-1782 5.79e-09

Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in eukaryotes. CCCAP is also known as SDCCAG8, serologically defined colon cancer antigen 8. It is associated with the centrosome.


Pssm-ID: 435040 [Multi-domain]  Cd Length: 703  Bit Score: 61.46  E-value: 5.79e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1363 TKKRLQQELEDKMEVEQQSRRQLerrlgdlqadSDESQR---ALQQLKKKCQRLT-AELQDTKLHLEGQQVRNhELEKKQ 1438
Cdd:pfam15964  301 TIERLTKERDDLMSALVSVRSSL----------AEAQQRessAYEQVKQAVQMTEeANFEKTKALIQCEQLKS-ELERQK 369
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1439 RRFDSEL-SQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAK 1517
Cdd:pfam15964  370 ERLEKELaSQQEKRAQEKEALRKEMKKEREELGATMLALSQNVAQLEAQVEKVTREKNSLVSQLEEAQKQLASQEMDVTK 449
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1518 VKKQLRDLEAKVKDQEEELdeqagsiqmleqaklrlEMEMERMRQTHSKEMESRDEEVEEARQscqkKLKQMEVQLEEEY 1597
Cdd:pfam15964  450 VCGEMRYQLNQTKMKKDEA-----------------EKEHREYRTKTGRQLEIKDQEIEKLGL----ELSESKQRLEQAQ 508
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1598 EDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKnnapskREIAQLKNQLEESEFTC 1677
Cdd:pfam15964  509 QDAARAREECLKLTELLGESEHQLHLTRLEKESIQQSFSNEAKAQALQAQQREQELT------QKMQQMEAQHDKTVNEQ 582
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1678 AAAVKARKAMEVEMED----LHLQIDDIAKAKTALEEQLSR----LQREKNEIQNRLEEDQED------MNELMK----- 1738
Cdd:pfam15964  583 YSLLTSQNTFIAKLKEecctLAKKLEEITQKSRSEVEQLSQekeyLQDRLEKLQKRNEELEEQcvqhgrMHERMKqrlrq 662
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082233 1739 --KHKAAVAQasrdmaQMNDLQAQIEESNKEKQELQEKLQALQSQV 1782
Cdd:pfam15964  663 ldKHCQATAQ------QLVQLLSKQNQLFKERQNLTEEVQSLRSQV 702
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1718-1956 5.91e-09

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 61.85  E-value: 5.91e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1718 EKNEIQNRLEEDQEDMNELMKKHKAAVaQASRdmaQMNDLQaQIEESNKEKQELQEKLQALQSQVEFLEqsmvdkslVSR 1797
Cdd:COG4913    219 EEPDTFEAADALVEHFDDLERAHEALE-DARE---QIELLE-PIRELAERYAAARERLAELEYLRAALR--------LWF 285
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1798 QEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERD----QRAAAENREKEQnkrLQRQLRDTKEEMSELARKEA 1873
Cdd:COG4913    286 AQRRLELLEAELEELRAELARLEAELERLEARLDALREELDeleaQIRGNGGDRLEQ---LEREIERLERELEERERRRA 362
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1874 EASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMesDENEDLINSLQDMVTKYQKKKNKLE-GDSDVDS 1952
Cdd:COG4913    363 RLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEAL--AEAEAALRDLRRELRELEAEIASLErRKSNIPA 440

                   ....
gi 1907082233 1953 ELED 1956
Cdd:COG4913    441 RLLA 444
PRK11281 PRK11281
mechanosensitive channel MscK;
1607-1872 1.01e-08

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 61.08  E-value: 1.01e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1607 KRELESKLSTLSDQvnqRDFESEKrlrkdlkrtKALLADAQIMLDHLKNNAPSKREIAQLKNQLEEseftcaAAVKARKA 1686
Cdd:PRK11281    38 EADVQAQLDALNKQ---KLLEAED---------KLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQ------APAKLRQA 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1687 MEvEMEDlhLQIDDIAKAKTALEEQ-LSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESN 1765
Cdd:PRK11281   100 QA-ELEA--LKDDNDEETRETLSTLsLRQLESRLAQTLDQLQNAQNDLAEYNSQLVSLQTQPERAQAALYANSQRLQQIR 176
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1766 K---------------EKQELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELET----RLEfekTQVKRLENLAS-- 1824
Cdd:PRK11281   177 NllkggkvggkalrpsQRVLLQAEQALLNAQNDLQRKSLEGNTQLQDLLQKQRDYLTariqRLE---HQLQLLQEAINsk 253
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082233 1825 RLKETMEKLTEERDQRAAAEN-------REKEQNKRLQRQLRDTKEEMSELARKE 1872
Cdd:PRK11281   254 RLTLSEKTVQEAQSQDEAARIqanplvaQELEINLQLSQRLLKATEKLNTLTQQN 308
Filament pfam00038
Intermediate filament protein;
1664-1928 1.05e-08

Intermediate filament protein;


Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 59.16  E-value: 1.05e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1664 AQLKNQLEESEFTCAAAVKARKAM-EVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEdqedmnELMKKHKA 1742
Cdd:pfam00038   28 KLLETKISELRQKKGAEPSRLYSLyEKEIEDLRRQLDTLTVERARLQLELDNLRLAAEDFRQKYED------ELNLRTSA 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1743 AvaqasrdmAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSmvdkslvsrQEAKIRELETRLEFEKTQVK----R 1818
Cdd:pfam00038  102 E--------NDLVGLRKDLDEATLARVDLEAKIESLKEELAFLKKN---------HEEEVRELQAQVSDTQVNVEmdaaR 164
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1819 LENLASRLKETMEKLTEE-RDQRAAAENREKEQNKRLQRQ-------LRDTKEEMSELarkeaeaSRKKHELEMDLESLE 1890
Cdd:pfam00038  165 KLDLTSALAEIRAQYEEIaAKNREEAEEWYQSKLEELQQAaarngdaLRSAKEEITEL-------RRTIQSLEIELQSLK 237
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1891 AANQSLQA-----------DLKLAFKRIGDLQAA---IEDEMES--DENEDLIN 1928
Cdd:pfam00038  238 KQKASLERqlaeteeryelQLADYQELISELEAElqeTRQEMARqlREYQELLN 291
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
1238-1859 2.29e-08

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 59.84  E-value: 2.29e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1238 DEEIQQLRSKLEKVEKERNELRLSSDRLETRIS--ELTSELTDE-----------RNTGESASQLLDAETAERLRTEKEM 1304
Cdd:pfam10174  115 EENFRRLQSEHERQAKELFLLRKTLEEMELRIEtqKQTLGARDEsikkllemlqsKGLPKKSGEEDWERTRRIAEAEMQL 194
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1305 KELQTQYD-------ALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGgewrlKYERAVREVDFTKKRLQQELEDKMEV 1377
Cdd:pfam10174  195 GHLEVLLDqkekeniHLREELHRRNQLQPDPAKTKALQTVIEMKDTKIS-----SLERNIRDLEDEVQMLKTNGLLHTED 269
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1378 EQQSRRQLE----------RRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEgqqVRNHELEKKQRRFD----- 1442
Cdd:pfam10174  270 REEEIKQMEvykshskfmkNKIDQLKQELSKKESELLALQTKLETLTNQNSDCKQHIE---VLKESLTAKEQRAAilqte 346
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1443 -SELSQAHEETQR------EKLQRekLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDissqesKDeasl 1515
Cdd:pfam10174  347 vDALRLRLEEKESflnkktKQLQD--LTEEKSTLAGEIRDLKDMLDVKERKINVLQKKIENLQEQLRD------KD---- 414
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1516 akvkKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQsCQKKLKQMEVQLEE 1595
Cdd:pfam10174  415 ----KQLAGLKERVKSLQTDSSNTDTALTTLEEALSEKERIIERLKEQREREDRERLEELESLKK-ENKDLKEKVSALQP 489
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1596 EYEDKQKALREKRE-----------LESKLSTLSDQVNQRdFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIA 1664
Cdd:pfam10174  490 ELTEKESSLIDLKEhasslassglkKDSKLKSLEIAVEQK-KEECSKLENQLKKAHNAEEAVRTNPEINDRIRLLEQEVA 568
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1665 QLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDiakaktaLEEQLSRLQREkneiQNRLEEDQEDMNELMKKHKAAV 1744
Cdd:pfam10174  569 RYKEESGKAQAEVERLLGILREVENEKNDKDKKIAE-------LESLTLRQMKE----QNKKVANIKHGQQEMKKKGAQL 637
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1745 AQASR---DMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKS--LVSRQEAKIRELETRLEF-------- 1811
Cdd:pfam10174  638 LEEARrreDNLADNSQQLQLEELMGALEKTRQELDATKARLSSTQQSLAEKDghLTNLRAERRKQLEEILEMkqeallaa 717
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1812 --EK-TQVKRLENLASRLKETMEKLteerdqraAAENREKEqnkRLQRQLR 1859
Cdd:pfam10174  718 isEKdANIALLELSSSKKKKTQEEV--------MALKREKD---RLVHQLK 757
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
1224-1627 2.37e-08

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 59.14  E-value: 2.37e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1224 LIQVQLsEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERntgESASQLLDAETaerlRTEKE 1303
Cdd:pfam07888   31 LLQNRL-EECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAELK---EELRQSREKHE----ELEEK 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1304 MKELQTQYDALKKQMevmemevmearliraaeingevdddDAGGEWRLKYERAVREVDFTKKRLQQ---ELEDKMEVEQQ 1380
Cdd:pfam07888  103 YKELSASSEELSEEK-------------------------DALLAQRAAHEARIRELEEDIKTLTQrvlERETELERMKE 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1381 SRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETqreklqre 1460
Cdd:pfam07888  158 RAKKAGAQRKEEEAERKQLQAKLQQTEEELRSLSKEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKE-------- 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1461 klqrekdmllAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQA 1540
Cdd:pfam07888  230 ----------AENEALLEELRSLQERLNASERKVEGLGEELSSMAAQRDRTQAELHQARLQAAQLTLQLADASLALREGR 299
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1541 GSIQMlEQAKLRLEMEMERMR-QTHSKEMESRDEEVEEARQSCQKklkqMEVQLEEEYEDKQKALRE-KRELESKLSTLs 1618
Cdd:pfam07888  300 ARWAQ-ERETLQQSAEADKDRiEKLSAELQRLEERLQEERMEREK----LEVELGREKDCNRVQLSEsRRELQELKASL- 373

                   ....*....
gi 1907082233 1619 dQVNQRDFE 1627
Cdd:pfam07888  374 -RVAQKEKE 381
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1348-1689 2.65e-08

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 59.37  E-value: 2.65e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1348 EWRLKYE-RAVREVDFTKKRLQQELEDKMEVEQQSRRQLERRlgdlqadsdeSQRALQQLKKKCQRltaELQDTKLHLEG 1426
Cdd:pfam17380  255 EYTVRYNgQTMTENEFLNQLLHIVQHQKAVSERQQQEKFEKM----------EQERLRQEKEEKAR---EVERRRKLEEA 321
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1427 QQVRNHELEKKQRRFDSELSQAHE-ETQREKLQREKLQREKDMLLAEAFSLK-QQMEEKDLDIAGFTQKVVSLEAELQDI 1504
Cdd:pfam17380  322 EKARQAEMDRQAAIYAEQERMAMErERELERIRQEERKRELERIRQEEIAMEiSRMRELERLQMERQQKNERVRQELEAA 401
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1505 SSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQagsIQMLEQAKLRlemEMERMRQTHSKEMES----RDEEVEEARQ 1580
Cdd:pfam17380  402 RKVKILEEERQRKIQQQKVEMEQIRAEQEEARQRE---VRRLEEERAR---EMERVRLEEQERQQQverlRQQEEERKRK 475
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1581 SCQKKLKQMEVQLEEEY----------EDKQKALREKR-------ELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALL 1643
Cdd:pfam17380  476 KLELEKEKRDRKRAEEQrrkilekeleERKQAMIEEERkrkllekEMEERQKAIYEEERRREAEEERRKQQEMEERRRIQ 555
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082233 1644 ADAQIMLDHLKNNAPSKREiAQLKNQLEESEftcaaavKARKAMEV 1689
Cdd:pfam17380  556 EQMRKATEERSRLEAMERE-REMMRQIVESE-------KARAEYEA 593
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1660-1876 2.68e-08

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 59.26  E-value: 2.68e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1660 KREIAQLKNQLEESEftcaAAVKARKAmEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEdqedMNELMKK 1739
Cdd:COG3206    181 EEQLPELRKELEEAE----AALEEFRQ-KNGLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAA----LRAQLGS 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1740 HKAAVAQASRDmAQMNDLQAQIEESNKEKQELQEKL-------QALQSQVEFLEQSmvdksLVSRQEAKIRELETRLEFE 1812
Cdd:COG3206    252 GPDALPELLQS-PVIQQLRAQLAELEAELAELSARYtpnhpdvIALRAQIAALRAQ-----LQQEAQRILASLEAELEAL 325
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1813 KTQVKRLENLASRLKETMEKLTEERDQRAaaenrekeqnkRLQRQLRDTKEEMSELARKEAEAS 1876
Cdd:COG3206    326 QAREASLQAQLAQLEARLAELPELEAELR-----------RLEREVEVARELYESLLQRLEEAR 378
mukB PRK04863
chromosome partition protein MukB;
1230-1903 2.98e-08

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 59.59  E-value: 2.98e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1230 SEEQIRNKDEEIQQLRSKLEKVEKERNEL----RLSSDRL------------ETRISELTSELTDERNTGESASQLLDAE 1293
Cdd:PRK04863   298 SRRQLAAEQYRLVEMARELAELNEAESDLeqdyQAASDHLnlvqtalrqqekIERYQADLEELEERLEEQNEVVEEADEQ 377
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1294 TAER----LRTEKEMKELQTQ-------YDALkkQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAvREVDF 1362
Cdd:PRK04863   378 QEENearaEAAEEEVDELKSQladyqqaLDVQ--QTRAIQYQQAVQALERAKQLCGLPDLTADNAEDWLEEFQA-KEQEA 454
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1363 TKKRLQqeLEDKMEVEQQSRRQLERRLGDLQ--ADSDESQRALQQLKKKCQRL-TAELQDTKLhlegQQVRNHElekkqr 1439
Cdd:PRK04863   455 TEELLS--LEQKLSVAQAAHSQFEQAYQLVRkiAGEVSRSEAWDVARELLRRLrEQRHLAEQL----QQLRMRL------ 522
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1440 rfdSELSQAHEETQR-EKLQREKLQREKDMLLAEAFsLKQQMEEkdldiagftqkvvsLEAELQDISSQeskdeaslakv 1518
Cdd:PRK04863   523 ---SELEQRLRQQQRaERLLAEFCKRLGKNLDDEDE-LEQLQEE--------------LEARLESLSES----------- 573
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1519 KKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRdEEVEEARQSCQKKLKQMEVQlEEEYE 1598
Cdd:PRK04863   574 VSEARERRMALRQQLEQLQARIQRLAARAPAWLAAQDALARLREQSGEEFEDS-QDVTEYMQQLLERERELTVE-RDELA 651
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1599 dkqkalREKRELESKLSTLSdqvnQRDFESEKRLRKDLKRTKA-LLAD--AQIMLDhlknNAP----------------- 1658
Cdd:PRK04863   652 ------ARKQALDEEIERLS----QPGGSEDPRLNALAERFGGvLLSEiyDDVSLE----DAPyfsalygparhaivvpd 717
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1659 ---SKREIAQLKNQLEE------------------SEFTCAAAVK-----------------ARKAMEVEMEDLHLQIDD 1700
Cdd:PRK04863   718 lsdAAEQLAGLEDCPEDlyliegdpdsfddsvfsvEELEKAVVVKiadrqwrysrfpevplfGRAAREKRIEQLRAEREE 797
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1701 IAkaktaleEQLSRLQREKNEIQnRLeedQEDMNELMKKHkAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQS 1780
Cdd:PRK04863   798 LA-------ERYATLSFDVQKLQ-RL---HQAFSRFIGSH-LAVAFEADPEAELRQLNRRRVELERALADHESQEQQQRS 865
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1781 QVEFLEQSMV-------------DKSLVSRQEAkIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAEnRE 1847
Cdd:PRK04863   866 QLEQAKEGLSalnrllprlnllaDETLADRVEE-IREQLDEAEEAKRFVQQHGNALAQLEPIVSVLQSDPEQFEQLK-QD 943
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082233 1848 KEQNKRLQRQLRDTKEEMSEL-ARKEAEASrkkHELEMDLESLEAANQSLQADLKLA 1903
Cdd:PRK04863   944 YQQAQQTQRDAKQQAFALTEVvQRRAHFSY---EDAAEMLAKNSDLNEKLRQRLEQA 997
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1662-1959 5.02e-08

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 58.51  E-value: 5.02e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1662 EIAQLKNQLEESEFTCAAAVKARKAMEV---EMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMK 1738
Cdd:PRK02224   214 ELAELDEEIERYEEQREQARETRDEADEvleEHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEE 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1739 KHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKL-QALQSQVEFLEQsmvdkslVSRQEAKIRELETRLEFEKTQVK 1817
Cdd:PRK02224   294 ERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLeECRVAAQAHNEE-------AESLREDADDLEERAEELREEAA 366
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1818 RLEnlaSRLKETMEKLTEERDQRAAAENREKEQNKRLQR---QLRDTKEEMSELARKEAEASRKKHELEMDLESLE---A 1891
Cdd:PRK02224   367 ELE---SELEEAREAVEDRREEIEELEEEIEELRERFGDapvDLGNAEDFLEELREERDELREREAELEATLRTARervE 443
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082233 1892 ANQSLQADLKLA-----FKRIGDLQAAIEDEMESDENEDLINSLQDMVTKYQKKKNKLEGDSDVDSELEDRVD 1959
Cdd:PRK02224   444 EAEALLEAGKCPecgqpVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEE 516
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
1474-1858 6.30e-08

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 57.92  E-value: 6.30e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1474 FSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQ-------EEELDEQAGSIQML 1546
Cdd:PRK04778   101 RKAKHEINEIESLLDLIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSLLANrfsfgpaLDELEKQLENLEEE 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1547 -------------EQAK---LRLEMEMERMRQThskeMesrdEEVEEARQSCQKKLKQmevQLEEEYEDKQKALREKREL 1610
Cdd:PRK04778   181 fsqfveltesgdyVEAReilDQLEEELAALEQI----M----EEIPELLKELQTELPD---QLQELKAGYRELVEEGYHL 249
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1611 ESKlstlsdqvnqrDFESE-KRLRKDLKRTKALLADAQimLDHLK-NNAPSKREIAQLKNQLEeseftcaAAVKARKAME 1688
Cdd:PRK04778   250 DHL-----------DIEKEiQDLKEQIDENLALLEELD--LDEAEeKNEEIQERIDQLYDILE-------REVKARKYVE 309
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1689 VEMEDLHLQIDDIAKAKTALEEQLSRLQ-----------------REKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDM 1751
Cdd:PRK04778   310 KNSDTLPDFLEHAKEQNKELKEEIDRVKqsytlneselesvrqleKQLESLEKQYDEITERIAEQEIAYSELQEELEEIL 389
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1752 AQMNDLQAQIEESN-------KEKQELQEKLQALQSQVE----FLEQS---------MVDKSLVSRQeakIRELETRLE- 1810
Cdd:PRK04778   390 KQLEEIEKEQEKLSemlqglrKDELEAREKLERYRNKLHeikrYLEKSnlpglpedyLEMFFEVSDE---IEALAEELEe 466
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082233 1811 --FEKTQVKRLENLASRLKETMEKLTEE-RDQRAAAE------NREKEQNKRLQRQL 1858
Cdd:PRK04778   467 kpINMEAVNRLLEEATEDVETLEEETEElVENATLTEqliqyaNRYRSDNEEVAEAL 523
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1657-1852 8.91e-08

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 56.76  E-value: 8.91e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1657 APSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNEL 1736
Cdd:COG3883     12 AFADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGER 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1737 ---MKKHKAAV-----------------------AQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMV 1790
Cdd:COG3883     92 araLYRSGGSVsyldvllgsesfsdfldrlsalsKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKA 171
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082233 1791 D-KSLVSRQEAKIRELETRlefEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNK 1852
Cdd:COG3883    172 ElEAQQAEQEALLAQLSAE---EAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAA 231
EzrA pfam06160
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ...
1517-1865 1.06e-07

Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.


Pssm-ID: 428797 [Multi-domain]  Cd Length: 542  Bit Score: 57.17  E-value: 1.06e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1517 KVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQaklrlememermrqthsKEMESRdEEVEEAR---QSCQKKLKQMEVQL 1593
Cdd:pfam06160   83 KAKKALDEIEELLDDIEEDIKQILEELDELLE-----------------SEEKNR-EEVEELKdkyRELRKTLLANRFSY 144
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1594 EEEYEDKQKALrekRELESKLSTLSDQVNQRDFESEK----RLRKDLKRTKALLADAQIMLDHLKNNAPskREIAQLKN- 1668
Cdd:pfam06160  145 GPAIDELEKQL---AEIEEEFSQFEELTESGDYLEARevleKLEEETDALEELMEDIPPLYEELKTELP--DQLEELKEg 219
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1669 --QLEESEFtcaaaVKARKAMEVEMEDLHlqiDDIAKAKTALEE-QLSRLQREKNEIQNRLEEDQEDM-------NELMK 1738
Cdd:pfam06160  220 yrEMEEEGY-----ALEHLNVDKEIQQLE---EQLEENLALLENlELDEAEEALEEIEERIDQLYDLLekevdakKYVEK 291
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1739 KHKAAVAQASRDMAQMNDLQAQIEESNK----------EKQELQEKLQALQSQVEFLEQSMVDKSLV-SRQEAKIRELET 1807
Cdd:pfam06160  292 NLPEIEDYLEHAEEQNKELKEELERVQQsytlneneleRVRGLEKQLEELEKRYDEIVERLEEKEVAySELQEELEEILE 371
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082233 1808 RLE-FEKTQVKrlenlasrLKETMEKLTEErdqraaaENREKEQNKRLQRQLRDTKEEM 1865
Cdd:pfam06160  372 QLEeIEEEQEE--------FKESLQSLRKD-------ELEAREKLDEFKLELREIKRLV 415
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
1522-1872 1.12e-07

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 56.47  E-value: 1.12e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1522 LRDLEAKV------KDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEE-VEEARQSCQKKLKQMEVQLE 1594
Cdd:pfam13868    8 LRELNSKLlaakcnKERDAQIAEKKRIKAEEKEEERRLDEMMEEERERALEEEEEKEEErKEERKRYRQELEEQIEEREQ 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1595 EEYEDKQKALREKRELesklstLSDQVNQRDFESEKRLRKDLKRTKAlladaqimldhlknnapsKREIAQLKNQLEese 1674
Cdd:pfam13868   88 KRQEEYEEKLQEREQM------DEIVERIQEEDQAEAEEKLEKQRQL------------------REEIDEFNEEQA--- 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1675 ftcaaavKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQ----REKNEIQNRLEEDQEDMNELMKKHKAAVAQASRD 1750
Cdd:pfam13868  141 -------EWKELEKEEEREEDERILEYLKEKAEREEEREAEReeieEEKEREIARLRAQQEKAQDEKAERDELRAKLYQE 213
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1751 MAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMV-----DKSLVSRQEAKIRELEtRLEFEKTQVKRLENLasR 1825
Cdd:pfam13868  214 EQERKERQKEREEAEKKARQRQELQQAREEQIELKERRLAeeaerEEEEFERMLRKQAEDE-EIEQEEAEKRRMKRL--E 290
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*..
gi 1907082233 1826 LKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKE 1872
Cdd:pfam13868  291 HRRELEKQIEEREEQRAAEREEELEEGERLREEEAERRERIEEERQK 337
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1601-1777 1.15e-07

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 54.93  E-value: 1.15e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1601 QKALREKRELESKLSTLSDQVNQRDfESEKRLRKDLKRTKALLADAQIMLDHLKNnapskrEIAQLKNQLEESEFTCAAA 1680
Cdd:COG1579     13 QELDSELDRLEHRLKELPAELAELE-DELAALEARLEAAKTELEDLEKEIKRLEL------EIEEVEARIKKYEEQLGNV 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1681 VKAR--KAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAqasrdmaqmnDLQ 1758
Cdd:COG1579     86 RNNKeyEALQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELA----------ELE 155
                          170
                   ....*....|....*....
gi 1907082233 1759 AQIEESNKEKQELQEKLQA 1777
Cdd:COG1579    156 AELEELEAEREELAAKIPP 174
FAM184 pfam15665
Family with sequence similarity 184, A and B; The function of FAM184 is not known.
1397-1611 1.21e-07

Family with sequence similarity 184, A and B; The function of FAM184 is not known.


Pssm-ID: 464788 [Multi-domain]  Cd Length: 211  Bit Score: 54.67  E-value: 1.21e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1397 DESQRALQQLKK----KCQRLTAELQDTKLHLEGQQVRNHELEKKqrrfdselSQAHEETQREKlqreklQREKDMLLAE 1472
Cdd:pfam15665   10 DEHEAEIQALKEaheeEIQQILAETREKILQYKSKIGEELDLKRR--------IQTLEESLEQH------ERMKRQALTE 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1473 AFSLKQQMEEKDL-DIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQL-RDLEAKVKDQEEELDEQAGSIQMLEQAK 1550
Cdd:pfam15665   76 FEQYKRRVEERELkAEAEHRQRVVELSREVEEAKRAFEEKLESFEQLQAQFeQEKRKALEELRAKHRQEIQELLTTQRAQ 155
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907082233 1551 LRLEM-EMERMRQTHSKEMESRDEEVEEarqscqkkLKQMEVQLEEEYEDK---QKALREkRELE 1611
Cdd:pfam15665  156 SASSLaEQEKLEELHKAELESLRKEVED--------LRKEKKKLAEEYEQKlskAQAFYE-RELE 211
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1232-1962 1.43e-07

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 57.53  E-value: 1.43e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1232 EQIRNKDEEIQQLRSKLEKVEKER------NELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAeRLRTEKEMK 1305
Cdd:PTZ00440  1190 EEIESYKKDIDQVKKNMSKERNDHlttfeyNAYYDKATASYENIEELTTEAKGLKGEANRSTNVDELKEI-KLQVFSYLQ 1268
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1306 ELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEwrlKYER-AVREVDFTKkRLQQELEDKMEVEQQSRRQ 1384
Cdd:PTZ00440  1269 QVIKENNKMENALHEIKNMYEFLISIDSEKILKEILNSTKKAE---EFSNdAKKELEKTD-NLIKQVEAKIEQAKEHKNK 1344
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1385 LERRLGDLQADSDesqraLQQLKKKCQRLTAELQDTKLHLEgqqvrnhELEKKQRRFDSELSQAH------------EET 1452
Cdd:PTZ00440  1345 IYGSLEDKQIDDE-----IKKIEQIKEEISNKRKEINKYLS-------NIKSNKEKCDLHVRNASrgkdkidflnkhEAI 1412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1453 QREKLQREKLQREKDML------LAEAFSLKQQMEEKDLDIAGFTQKVVSLeaeLQD--ISSQESKDEaslaKVKKQLRD 1524
Cdd:PTZ00440  1413 EPSNSKEVNIIKITDNInkckqySNEAMETENKADENNDSIIKYEKEITNI---LNNssILGKKTKLE----KKKKEATN 1485
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1525 LEAKVKDQEEELDEQAGSIQmleqaklrlememERMRQTHSKEMESRDEEV------EEARQSCQKKLKQMEVQLEEEYE 1598
Cdd:PTZ00440  1486 IMDDINGEHSIIKTKLTKSS-------------EKLNQLNEQPNIKREGDVlnndksTIAYETIQYNLGRVKHNLLNILN 1552
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1599 DKQKA---LREKRELESKLSTLSDQVNQRDFES-EKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEES- 1673
Cdd:PTZ00440  1553 IKDEIetiLNKAQDLMRDISKISKIVENKNLENlNDKEADYVKYLDNILKEKQLMEAEYKKLNEIYSDVDNIEKELKKHk 1632
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1674 --------EFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTAL-------EEQLSRLQREKNEIQNRLEEDQEDMNELMK 1738
Cdd:PTZ00440  1633 knyeigllEKVIEINKNIKLYMDSTKESLNSLVNNFSSLFNNFylnkyniNENLEKYKKKLNEIYNEFMESYNIIQEKMK 1712
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1739 KhkaavaqASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQsMVDKSLVSRQEAKIRELETRLEFEKTQVKR 1818
Cdd:PTZ00440  1713 E-------VSNDDVDYNEAKTLREEAQKEEVNLNNKEEEAKKYLNDIKK-QESFRFILYMKEKLDELSKMCKQQYNIVDE 1784
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1819 LENLASRLKETMEKLTEErDQRAAAENREKEQNKRLQRQLRDT-KEE-------------------MSELARKE--AEAS 1876
Cdd:PTZ00440  1785 GYNYIKKKIEYIKTLNDE-NNLSDSLNQAEDKNKEVANLTHYTnKNEaknllghvvksanfigikiMTGLQPTEltPDAS 1863
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1877 RKK-------HELEMDLESLEAANQSLQAD---------------LKLAF---KRIGDLQAAIEDEMESDENEDLINSLQ 1931
Cdd:PTZ00440  1864 LETapeltfeSENNSDLELDHLSSNKNELDvykniqdayksslqiLKYSDdidKKQRDCNKLVEDGNEIYLKSTAINELK 1943
                          810       820       830
                   ....*....|....*....|....*....|.
gi 1907082233 1932 DMVTKYQKKKNKLEGDSDVDSELEDRVDGVK 1962
Cdd:PTZ00440  1944 NMINSVKNKESAISNKIDNVSNKLSELNKIT 1974
46 PHA02562
endonuclease subunit; Provisional
1499-1682 2.24e-07

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 56.18  E-value: 2.24e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1499 AELQDISSQESKDEASLakvKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMER------MRQTH------SK 1566
Cdd:PHA02562   216 ARKQNKYDELVEEAKTI---KAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQfqkvikMYEKGgvcptcTQ 292
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1567 EMESRDEEVEEARQS---CQKKLKQMEVQLEEEYE---DKQKALREKRELESKLSTLsDQVNQRDFESEKRLRKDLKRTK 1640
Cdd:PHA02562   293 QISEGPDRITKIKDKlkeLQHSLEKLDTAIDELEEimdEFNEQSKKLLELKNKISTN-KQSLITLVDKAKKVKAAIEELQ 371
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1907082233 1641 ALLADaqimldhlknnapSKREIAQLKNQLEESEFTCAAAVK 1682
Cdd:PHA02562   372 AEFVD-------------NAEELAKLQDELDKIVKTKSELVK 400
mukB PRK04863
chromosome partition protein MukB;
1512-1874 2.46e-07

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 56.50  E-value: 2.46e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1512 EASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQ----AKLRLEMEMERMRQThskemesrdeeveEARQSCQKKLK 1587
Cdd:PRK04863   292 RRELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQdyqaASDHLNLVQTALRQQ-------------EKIERYQADLE 358
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1588 QMEVQLEEEYEDKQKALREKRELESKLSTLSDQVnqrdfesekrlrkdlKRTKALLADAQIMLDHLKnnapsKREIA--Q 1665
Cdd:PRK04863   359 ELEERLEEQNEVVEEADEQQEENEARAEAAEEEV---------------DELKSQLADYQQALDVQQ-----TRAIQyqQ 418
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1666 LKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQreknEIQNRLEEdqedmneLMKKHKAAVA 1745
Cdd:PRK04863   419 AVQALERAKQLCGLPDLTADNAEDWLEEFQAKEQEATEELLSLEQKLSVAQ----AAHSQFEQ-------AYQLVRKIAG 487
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1746 QASRDMAQmNDLQAQIEESNKEKQELQeKLQALQSQVEFLEQsmvDKSLVSRQEAKIRELETRLEFEKTQVKRLENLASR 1825
Cdd:PRK04863   488 EVSRSEAW-DVARELLRRLREQRHLAE-QLQQLRMRLSELEQ---RLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEE 562
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*....
gi 1907082233 1826 LKETMEKLTEErdQRAAAENREkeqnkRLQRQLRDTKEEMSELARKEAE 1874
Cdd:PRK04863   563 LEARLESLSES--VSEARERRM-----ALRQQLEQLQARIQRLAARAPA 604
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
1583-1957 4.31e-07

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 55.35  E-value: 4.31e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1583 QKKLKQMEVQ-LEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLR--KDLKRTKALLadaQIMLDHLKNNAPS 1659
Cdd:COG5185    165 FGKLTQELNQnLKKLEIFGLTLGLLKGISELKKAEPSGTVNSIKESETGNLGseSTLLEKAKEI---INIEEALKGFQDP 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1660 KREIAQLKNQLEESEFtcaaAVKARKamEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEE-----DQEDMN 1734
Cdd:COG5185    242 ESELEDLAQTSDKLEK----LVEQNT--DLRLEKLGENAESSKRLNENANNLIKQFENTKEKIAEYTKSidikkATESLE 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1735 ELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEfleqSMVDKSLVSRQEAKIRELETRLEFEKT 1814
Cdd:COG5185    316 EQLAAAEAEQELEESKRETETGIQNLTAEIEQGQESLTENLEAIKEEIE----NIVGEVELSKSSEELDSFKDTIESTKE 391
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1815 ----QVKRLENLASRLKETMEKLTEERDQraaaenrekeQNKRLQRQLRDTKEEMSELARK--EAEASRKKHELEMDLES 1888
Cdd:COG5185    392 sldeIPQNQRGYAQEILATLEDTLKAADR----------QIEELQRQIEQATSSNEEVSKLlnELISELNKVMREADEES 461
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1889 LE-------AANQSLQADLKLAFKRIGDLQAAI-----EDEMESDENEDLINSLQDMVTKYQKKKNKLEGDSDVDSELED 1956
Cdd:COG5185    462 QSrleeaydEINRSVRSKKEDLNEELTQIESRVstlkaTLEKLRAKLERQLEGVRSKLDQVAESLKDFMRARGYAHILAL 541

                   .
gi 1907082233 1957 R 1957
Cdd:COG5185    542 E 542
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1568-1781 5.44e-07

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 55.02  E-value: 5.44e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1568 MESRDEEVEEARQSCQKKLKQMEVQLEEeyedKQKALREKRElESKLSTLSDQVNQRDfESEKRLRKDLKRTKALLADAQ 1647
Cdd:COG3206    166 LELRREEARKALEFLEEQLPELRKELEE----AEAALEEFRQ-KNGLVDLSEEAKLLL-QQLSELESQLAEARAELAEAE 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1648 IMLDHLKNNAPSKRE----------IAQLKNQLEESEFTCAAAVK-------ARKAMEVEMEDLHLQIDD-IAKAKTALE 1709
Cdd:COG3206    240 ARLAALRAQLGSGPDalpellqspvIQQLRAQLAELEAELAELSArytpnhpDVIALRAQIAALRAQLQQeAQRILASLE 319
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907082233 1710 EQLSRLQREKNEIQNRLEEdqedmnelmkkHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQ 1781
Cdd:COG3206    320 AELEALQAREASLQAQLAQ-----------LEARLAELPELEAELRRLEREVEVARELYESLLQRLEEARLA 380
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
1231-1728 5.90e-07

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 55.21  E-value: 5.90e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1231 EEQIRNKDEEIQQLRSKLEKVEKERNE-------LRLSSDRLETRISELTSELTderntgesasqlldaetaerlRTEKE 1303
Cdd:pfam10174  246 ERNIRDLEDEVQMLKTNGLLHTEDREEeikqmevYKSHSKFMKNKIDQLKQELS---------------------KKESE 304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1304 MKELQTQYDALK------KQMEVMEMEVMEARLIRAAEINGEVDdddaggEWRLKYERAVREVDFTKKRLQQELEDK--- 1374
Cdd:pfam10174  305 LLALQTKLETLTnqnsdcKQHIEVLKESLTAKEQRAAILQTEVD------ALRLRLEEKESFLNKKTKQLQDLTEEKstl 378
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1375 ----------MEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHL--------EGQQV-----RN 1431
Cdd:pfam10174  379 ageirdlkdmLDVKERKINVLQKKIENLQEQLRDKDKQLAGLKERVKSLQTDSSNTDTALttleealsEKERIierlkEQ 458
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1432 HELEKKQRRFDSELSQAHEETQREK---LQREKLQREKDM--LLAEAFSLKQQMEEKD-----LDIAGFTQK--VVSLEA 1499
Cdd:pfam10174  459 REREDRERLEELESLKKENKDLKEKvsaLQPELTEKESSLidLKEHASSLASSGLKKDsklksLEIAVEQKKeeCSKLEN 538
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1500 ELQDISSQESKDEASlAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAkLRlEMEMERmrqtHSKEMESRDEEVEEAR 1579
Cdd:pfam10174  539 QLKKAHNAEEAVRTN-PEINDRIRLLEQEVARYKEESGKAQAEVERLLGI-LR-EVENEK----NDKDKKIAELESLTLR 611
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1580 QSCQKKLKQMEVQLEEEyEDKQKALREKRELESKLSTLSDQVNQRDFE----SEKRLRKDLKRTKALLADAQIMLD---- 1651
Cdd:pfam10174  612 QMKEQNKKVANIKHGQQ-EMKKKGAQLLEEARRREDNLADNSQQLQLEelmgALEKTRQELDATKARLSSTQQSLAekdg 690
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082233 1652 HLKNNAPSKREiaQLKNQLEESEFTCAAAVKARKAmevemeDLHLQIDDIAKAKTALEEQLSrLQREKNEIQNRLEE 1728
Cdd:pfam10174  691 HLTNLRAERRK--QLEEILEMKQEALLAAISEKDA------NIALLELSSSKKKKTQEEVMA-LKREKDRLVHQLKQ 758
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1226-1919 6.49e-07

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 55.34  E-value: 6.49e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1226 QVQLSEEQIRNKDEEIQQLRSKLEKVEKERNEL------------RLSSDRLETRISELTSELTDERNtgESASQLLDAE 1293
Cdd:COG3096    376 QLAEAEARLEAAEEEVDSLKSQLADYQQALDVQqtraiqyqqavqALEKARALCGLPDLTPENAEDYL--AAFRAKEQQA 453
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1294 TAERLRTEKEM---KELQTQYDAlkkqmevmemevmEARLIRAaeINGEVDDDDAggewrlkYERAvREVdftkkrLQQE 1370
Cdd:COG3096    454 TEEVLELEQKLsvaDAARRQFEK-------------AYELVCK--IAGEVERSQA-------WQTA-REL------LRRY 504
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1371 LEDKMEVEQQSrrQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDtklhlegqqvrNHELEKKQRRFDSELSQAHE 1450
Cdd:COG3096    505 RSQQALAQRLQ--QLRAQLAELEQRLRQQQNAERLLEEFCQRIGQQLDA-----------AEELEELLAELEAQLEELEE 571
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1451 ETQREKLQREKLQREKDMLLAEAFSLKQQmeekdldiAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVK 1530
Cdd:COG3096    572 QAAEAVEQRSELRQQLEQLRARIKELAAR--------APAWLAAQDALERLREQSGEALADSQEVTAAMQQLLEREREAT 643
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1531 DQEEEldeqagsiqmLEQAKLRLEMEMERMRQTHSkemesrdeeVEEARqscqkkLKQME-----VQLEEEYEDkqKALR 1605
Cdd:COG3096    644 VERDE----------LAARKQALESQIERLSQPGG---------AEDPR------LLALAerlggVLLSEIYDD--VTLE 696
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1606 EKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKRE-----IAQLKN-QLEESEFTcAA 1679
Cdd:COG3096    697 DAPYFSALYGPARHAIVVPDLSAVKEQLAGLEDCPEDLYLIEGDPDSFDDSVFDAEEledavVVKLSDrQWRYSRFP-EV 775
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1680 AVKARKAMEVEMEDLHLQIDDIAK--AKTAL----------------------------EEQLSRLQREKNEIQNRLEED 1729
Cdd:COG3096    776 PLFGRAAREKRLEELRAERDELAEqyAKASFdvqklqrlhqafsqfvgghlavafapdpEAELAALRQRRSELERELAQH 855
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1730 QEDmnelMKKHKAAVAQASRDMAQMNDLQAQIEESNKEkqELQEKLQALQSQVEFLEQsmvDKSLVSRQEAKIRELETRL 1809
Cdd:COG3096    856 RAQ----EQQLRQQLDQLKEQLQLLNKLLPQANLLADE--TLADRLEELREELDAAQE---AQAFIQQHGKALAQLEPLV 926
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1810 ----------EFEKTQVKRLENLASRLKETMEKLTEERDQRAA----------AENRekEQNKRLQRQLRDTKEEMSElA 1869
Cdd:COG3096    927 avlqsdpeqfEQLQADYLQAKEQQRRLKQQIFALSEVVQRRPHfsyedavgllGENS--DLNEKLRARLEQAEEARRE-A 1003
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082233 1870 RKEAEASRKKHE----LEMDLES-LEAANQSLQAdlklAFKRIGDLQAAIEDEME 1919
Cdd:COG3096   1004 REQLRQAQAQYSqynqVLASLKSsRDAKQQTLQE----LEQELEELGVQADAEAE 1054
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1348-1970 1.00e-06

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 54.67  E-value: 1.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1348 EWRLK------YERAVREvDFTK----KRLQQELEDKMEVEQQSRRQLERRLGDL------------------------- 1392
Cdd:TIGR01612  531 DQNIKaklykeIEAGLKE-SYELaknwKKLIHEIKKELEEENEDSIHLEKEIKDLfdkyleiddeiiyinklklelkeki 609
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1393 --QADSDESQRALQQLKKKCQRLTA---ELQDTKLHLEGQQVRNHEleKKQRRFDSELSQAHEEtQREKLQRE--KLQRE 1465
Cdd:TIGR01612  610 knISDKNEYIKKAIDLKKIIENNNAyidELAKISPYQVPEHLKNKD--KIYSTIKSELSKIYED-DIDALYNElsSIVKE 686
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1466 KDMLLAEAfslKQQMEEKDLDIAGFTQKVVSLEAE-----LQDISSQESKDEASLAKVKKQL-----RDLEAKVKDQEEE 1535
Cdd:TIGR01612  687 NAIDNTED---KAKLDDLKSKIDKEYDKIQNMETAtvelhLSNIENKKNELLDIIVEIKKHIhgeinKDLNKILEDFKNK 763
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1536 LDEQAGSIQMLEQAKLRLEM---EMERMRQTHSKEMESRDEEVEEARQSCQKK---LKQMEVQLEEEYEDKQKALREKRE 1609
Cdd:TIGR01612  764 EKELSNKINDYAKEKDELNKyksKISEIKNHYNDQINIDNIKDEDAKQNYDKSkeyIKTISIKEDEIFKIINEMKFMKDD 843
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1610 LESKLSTLSDQVN--QRDFESEKRLRKDL-KRTKALLADAQIMlDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKA 1686
Cdd:TIGR01612  844 FLNKVDKFINFENncKEKIDSEHEQFAELtNKIKAEISDDKLN-DYEKKFNDSKSLINEINKSIEEEYQNINTLKKVDEY 922
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1687 MEVeMEDLHLQIDDIAKAKTALEEQLSR---LQREKNEIQ----NRLEEDQED-MNELMKKHKAAV-----AQASRDMAQ 1753
Cdd:TIGR01612  923 IKI-CENTKESIEKFHNKQNILKEILNKnidTIKESNLIEksykDKFDNTLIDkINELDKAFKDASlndyeAKNNELIKY 1001
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1754 MNDLQA------------QIEESNKEKQELQEKLQALQSQVEFLE--------------QSMVDKSLVSRQEAKIRELET 1807
Cdd:TIGR01612 1002 FNDLKAnlgknkenmlyhQFDEKEKATNDIEQKIEDANKNIPNIEiaihtsiyniideiEKEIGKNIELLNKEILEEAEI 1081
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1808 RL-EFEKTQVK-RLENLASRLKETMEKLTEERdqraaaeNREKEQNKRLQRQLRDTKEEMSELarkeaeasRKKHELEMD 1885
Cdd:TIGR01612 1082 NItNFNEIKEKlKHYNFDDFGKEENIKYADEI-------NKIKDDIKNLDQKIDHHIKALEEI--------KKKSENYID 1146
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1886 leslEAANQslqadlklafkrIGDLQAAIEDEMESDENEDLINSLQDMVTKYQKKKNKLEGDSDVDSELEdRVDGVKSWL 1965
Cdd:TIGR01612 1147 ----EIKAQ------------INDLEDVADKAISNDDPEEIEKKIENIVTKIDKKKNIYDEIKKLLNEIA-EIEKDKTSL 1209

                   ....*
gi 1907082233 1966 SKNKG 1970
Cdd:TIGR01612 1210 EEVKG 1214
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1667-1945 1.24e-06

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 53.97  E-value: 1.24e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1667 KNQLEESEFTCAAAVKARKAMEVEMEDLHLQiddiakaktaLEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQ 1746
Cdd:pfam17380  315 RRKLEEAEKARQAEMDRQAAIYAEQERMAME----------RERELERIRQEERKRELERIRQEEIAMEISRMRELERLQ 384
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1747 ASRdmaQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLASRL 1826
Cdd:pfam17380  385 MER---QQKNERVRQELEAARKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAREMERVRLEEQERQQQ 461
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1827 KETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKE-EMSELARKEAEASRKKHELEMDLESLEAANQSLQAdlklafK 1905
Cdd:pfam17380  462 VERLRQQEEERKRKKLELEKEKRDRKRAEEQRRKILEkELEERKQAMIEEERKRKLLEKEMEERQKAIYEEER------R 535
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1907082233 1906 RIGDLQAAIEDEMESDENedlinsLQDMVTKYQKKKNKLE 1945
Cdd:pfam17380  536 REAEEERRKQQEMEERRR------IQEQMRKATEERSRLE 569
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1238-1539 1.51e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 53.77  E-value: 1.51e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1238 DEEIQQLRSKLEKVEKERNELRLSSDRLEtrisELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQTQYDALKKQ 1317
Cdd:COG4913    660 EIDVASAEREIAELEAELERLDASSDDLA----ALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDR 735
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1318 MEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQELEDKMeveqqsrRQLERRLGDLQADSD 1397
Cdd:COG4913    736 LEAAEDLARLELRALLEERFAAALGDAVERELRENLEERIDALRARLNRAEEELERAM-------RAFNREWPAETADLD 808
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1398 ESQRALQQLKKKCQRLTAElqdtklHLEGQQVRNHELEKKQ-----RRFDSELSQAHEETqreklqREKLQREKDMLLAE 1472
Cdd:COG4913    809 ADLESLPEYLALLDRLEED------GLPEYEERFKELLNENsiefvADLLSKLRRAIREI------KERIDPLNDSLKRI 876
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082233 1473 AFS----LKQQMEE-KDLDIAGFTQkvvsleaELQDISSQESKDEASLAKVK-KQLRDLEAKVKDQEEELDEQ 1539
Cdd:COG4913    877 PFGpgryLRLEARPrPDPEVREFRQ-------ELRAVTSGASLFDEELSEARfAALKRLIERLRSEEEESDRR 942
PLN03229 PLN03229
acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional
1551-1901 1.65e-06

acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional


Pssm-ID: 178768 [Multi-domain]  Cd Length: 762  Bit Score: 53.70  E-value: 1.65e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1551 LRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQmevQLEEEYEDKQKALrekrELESKLSTLSDqvnqrdfESEK 1630
Cdd:PLN03229   432 RELEGEVEKLKEQILKAKESSSKPSELALNEMIEKLKK---EIDLEYTEAVIAM----GLQERLENLRE-------EFSK 497
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1631 RLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEeseftcaaavkarkamevemedlhlQIDDIAKAKTALEE 1710
Cdd:PLN03229   498 ANSQDQLMHPVLMEKIEKLKDEFNKRLSRAPNYLSLKYKLD-------------------------MLNEFSRAKALSEK 552
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1711 QlSRLQREKNEIQNRLEE--DQEDMNELMKKHKAAVAQ--ASRDMAQMNDLQAQIEESNKEKQ-ELQEKLQALQSQVEFL 1785
Cdd:PLN03229   553 K-SKAEKLKAEINKKFKEvmDRPEIKEKMEALKAEVASsgASSGDELDDDLKEKVEKMKKEIElELAGVLKSMGLEVIGV 631
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1786 EQsmvdKSLVSRQEAKIRELETRLEFEKTQV-KRLENL--ASRLKETMEKLTEERDQRAAAEN-REKEQNKRLQRQLRDT 1861
Cdd:PLN03229   632 TK----KNKDTAEQTPPPNLQEKIESLNEEInKKIERVirSSDLKSKIELLKLEVAKASKTPDvTEKEKIEALEQQIKQK 707
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|.
gi 1907082233 1862 -KEEMSELARKEaeasrKKHELEMDLESLEAANQSLQADLK 1901
Cdd:PLN03229   708 iAEALNSSELKE-----KFEELEAELAAARETAAESNGSLK 743
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1232-1637 1.67e-06

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 53.51  E-value: 1.67e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1232 EQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQTQY 1311
Cdd:PRK02224   349 EDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELRERE 428
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1312 DALKKQMEVMEMEVMEARLIRAA----EINGEVDDD---DAGGEWRLKYEravrevdftkkRLQQELEDkMEVEQQSRRQ 1384
Cdd:PRK02224   429 AELEATLRTARERVEEAEALLEAgkcpECGQPVEGSphvETIEEDRERVE-----------ELEAELED-LEEEVEEVEE 496
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1385 LERRLGDLQADSDESQRALQQLKKKCQRLT---AELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREK 1461
Cdd:PRK02224   497 RLERAEDLVEAEDRIERLEERREDLEELIAerrETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAE 576
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1462 LQREKDMLLAEAFSLkQQMEEKDLDIAGFTQKVVSLEAELQDIssQESKDEAslakvKKQLRDLEAKVKDQEEELDEQAg 1541
Cdd:PRK02224   577 LNSKLAELKERIESL-ERIRTLLAAIADAEDEIERLREKREAL--AELNDER-----RERLAEKRERKRELEAEFDEAR- 647
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1542 sIQMLEQAKLRLEMEMErmrqthskEMESRDEEVEEARQSCQKKLKQMEVQLE--EEYEDKQKALREKRElesKLSTLSD 1619
Cdd:PRK02224   648 -IEEAREDKERAEEYLE--------QVEEKLDELREERDDLQAEIGAVENELEelEELRERREALENRVE---ALEALYD 715
                          410
                   ....*....|....*...
gi 1907082233 1620 QVNQRDfESEKRLRKDLK 1637
Cdd:PRK02224   716 EAEELE-SMYGDLRAELR 732
mukB PRK04863
chromosome partition protein MukB;
1372-1794 1.74e-06

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 53.81  E-value: 1.74e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1372 EDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKkkcqRLTAelqdTKLHLEGQQVRNHELEKKQRR---FDSELSQA 1448
Cdd:PRK04863   785 EKRIEQLRAEREELAERYATLSFDVQKLQRLHQAFS----RFIG----SHLAVAFEADPEAELRQLNRRrveLERALADH 856
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1449 HEETQREKLQREKLQREKDML--LAEAFSLkqqmeekdLDIAGFTQKVVSLEAELQdissQESKDEASLAKVKKQLrdle 1526
Cdd:PRK04863   857 ESQEQQQRSQLEQAKEGLSALnrLLPRLNL--------LADETLADRVEEIREQLD----EAEEAKRFVQQHGNAL---- 920
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1527 akvkdqeEELDEQAGSIQMLEQaklrlemEMERMRQthskemesrdeEVEEARQScQKKLKQ--------MEVQLEEEYE 1598
Cdd:PRK04863   921 -------AQLEPIVSVLQSDPE-------QFEQLKQ-----------DYQQAQQT-QRDAKQqafaltevVQRRAHFSYE 974
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1599 DKQKALREKRELESKLSTLSDQVNQ-RDfesekRLRKDLKRTKALLADAQIMLDHLKNNAPSKREI-AQLKNQLEesEFT 1676
Cdd:PRK04863   975 DAAEMLAKNSDLNEKLRQRLEQAEQeRT-----RAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMlQELKQELQ--DLG 1047
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1677 CAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVA--QASRDMAQM 1754
Cdd:PRK04863  1048 VPADSGAEERARARRDELHARLSANRSRRNQLEKQLTFCEAEMDNLTKKLRKLERDYHEMREQVVNAKAgwCAVLRLVKD 1127
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|
gi 1907082233 1755 NDLqaqieesnkEKQELQEKLQALQSQVeflEQSMVDKSL 1794
Cdd:PRK04863  1128 NGV---------ERRLHRRELAYLSADE---LRSMSDKAL 1155
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1370-1745 1.80e-06

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 52.60  E-value: 1.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1370 ELEDKMEVEQQSRRQLERRLGDLQADSDESQRA----LQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSEL 1445
Cdd:COG4372      3 RLGEKVGKARLSLFGLRPKTGILIAALSEQLRKalfeLDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEEL 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1446 SQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEkdldiagFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDL 1525
Cdd:COG4372     83 EELNEQLQAAQAELAQAQEELESLQEEAEELQEELEE-------LQKERQDLEQQRKQLEAQIAELQSEIAEREEELKEL 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1526 EAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEearqscqKKLKQMEVQLEEEYEDKQKALR 1605
Cdd:COG4372    156 EEQLESLQEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELAEAE-------KLIESLPRELAEELLEAKDSLE 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1606 EKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARK 1685
Cdd:COG4372    229 AKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALS 308
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1686 AMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVA 1745
Cdd:COG4372    309 LIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVGLLDNDVLELLSKGAEAGVA 368
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1444-1634 2.06e-06

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 51.46  E-value: 2.06e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1444 ELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQlR 1523
Cdd:COG1579     11 DLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNN-K 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1524 DLEAKVKdqeeELDEQAGSIQMLEQAKLRLEMEMERMRqthskemesrdEEVEEARQSCQKKLKQMEvQLEEEYEDKQKA 1603
Cdd:COG1579     90 EYEALQK----EIESLKRRISDLEDEILELMERIEELE-----------EELAELEAELAELEAELE-EKKAELDEELAE 153
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1907082233 1604 LREKRE-LESKLSTLSDQVNQRDFESEKRLRK 1634
Cdd:COG1579    154 LEAELEeLEAEREELAAKIPPELLALYERIRK 185
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1379-1712 2.55e-06

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 52.21  E-value: 2.55e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1379 QQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQ 1458
Cdd:COG4372     30 SEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEE 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1459 REKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLR-----DLEAKVKDQE 1533
Cdd:COG4372    110 AEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQalseaEAEQALDELL 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1534 EELDEQAGSIQMLEQAKLRLEMEMERMRQTHS--KEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELE 1611
Cdd:COG4372    190 KEANRNAEKEEELAEAEKLIESLPRELAEELLeaKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILV 269
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1612 SKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEM 1691
Cdd:COG4372    270 EKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVG 349
                          330       340
                   ....*....|....*....|.
gi 1907082233 1692 EDLHLQIDDIAKAKTALEEQL 1712
Cdd:COG4372    350 LLDNDVLELLSKGAEAGVADG 370
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1383-1539 2.59e-06

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 51.08  E-value: 2.59e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1383 RQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLhlegqQVRNHELEKKQrrfdselSQAHEETQREKLQREKL 1462
Cdd:COG1579     20 DRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEK-----EIKRLELEIEE-------VEARIKKYEEQLGNVRN 87
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082233 1463 QREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQ 1539
Cdd:COG1579     88 NKEYEALQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAE 164
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
1492-1777 2.84e-06

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 51.45  E-value: 2.84e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1492 QKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQmleqaKLRLEMemermrQTHSKEMESR 1571
Cdd:COG1340      1 SKTDELSSSLEELEEKIEELREEIEELKEKRDELNEELKELAEKRDELNAQVK-----ELREEA------QELREKRDEL 69
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1572 DEEVEEARQSCQKKLKQMEvQLEEEYEDKQKALREKRELESKLSTLSDQVNQ---------RDFESEKRLRKDLKRTKAL 1642
Cdd:COG1340     70 NEKVKELKEERDELNEKLN-ELREELDELRKELAELNKAGGSIDKLRKEIERlewrqqtevLSPEEEKELVEKIKELEKE 148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1643 LADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEI 1722
Cdd:COG1340    149 LEKAKKALEKNEKLKELRAELKELRKEAEEIHKKIKELAEEAQELHEEMIELYKEADELRKEADELHKEIVEAQEKADEL 228
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082233 1723 QNRLEEDQEDMNELMKKHKAAvaqasRDMAQMNDLQAQIEESNKEKQELQEKLQA 1777
Cdd:COG1340    229 HEEIIELQKELRELRKELKKL-----RKKQRALKREKEKEELEEKAEEIFEKLKK 278
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1453-1643 2.98e-06

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 52.71  E-value: 2.98e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1453 QREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQK--VVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVK 1530
Cdd:COG3206    164 QNLELRREEARKALEFLEEQLPELRKELEEAEAALEEFRQKngLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLA 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1531 DQEEELDE---------QAGSIQMLEQAKLRLEMEMERMRQTHSKE---MESRDEEVEEARQSCQKKLKQMEVQLEEEYE 1598
Cdd:COG3206    244 ALRAQLGSgpdalpellQSPVIQQLRAQLAELEAELAELSARYTPNhpdVIALRAQIAALRAQLQQEAQRILASLEAELE 323
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1907082233 1599 DKQKALREKRELESKLSTLSDQVNQRDFESEkRLRKDLKRTKALL 1643
Cdd:COG3206    324 ALQAREASLQAQLAQLEARLAELPELEAELR-RLEREVEVARELY 367
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1231-1900 3.11e-06

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 53.03  E-value: 3.11e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1231 EEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQTQ 1310
Cdd:COG3096    514 LQQLRAQLAELEQRLRQQQNAERLLEEFCQRIGQQLDAAEELEELLAELEAQLEELEEQAAEAVEQRSELRQQLEQLRAR 593
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1311 YDALKKQMEVMEMEVmeARLIRAAEINGE-VDDDDAGGEWRLKYERAVREVDFTKKRLQQEledKMEVEQQSRRQLerrl 1389
Cdd:COG3096    594 IKELAARAPAWLAAQ--DALERLREQSGEaLADSQEVTAAMQQLLEREREATVERDELAAR---KQALESQIERLS---- 664
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1390 gdlQADSDESQRaLQQLkkkCQRLTAEL-----QDTKLH-------LEGQQ-----VRNHELEKKQ-------------- 1438
Cdd:COG3096    665 ---QPGGAEDPR-LLAL---AERLGGVLlseiyDDVTLEdapyfsaLYGPArhaivVPDLSAVKEQlagledcpedlyli 737
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1439 ----RRFDSELSQAHEETQREKLQREKLQ----------------REK--DMLLAEAFSLKQQMEEKDLDIagftQKVVS 1496
Cdd:COG3096    738 egdpDSFDDSVFDAEELEDAVVVKLSDRQwrysrfpevplfgraaREKrlEELRAERDELAEQYAKASFDV----QKLQR 813
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1497 LEAELQDISSQ------ESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAgsiQMLEQAKLRLEMEMERMRQTHSKEMES 1570
Cdd:COG3096    814 LHQAFSQFVGGhlavafAPDPEAELAALRQRRSELERELAQHRAQEQQLR---QQLDQLKEQLQLLNKLLPQANLLADET 890
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1571 RDEEVEEARQscqkklkQMEVQLEEEYEDKQ--KALREkreLESKLSTLsdqvnQRDFESEKRLRKDLKRTKALLADAQI 1648
Cdd:COG3096    891 LADRLEELRE-------ELDAAQEAQAFIQQhgKALAQ---LEPLVAVL-----QSDPEQFEQLQADYLQAKEQQRRLKQ 955
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1649 MLDHLKNnapskreiaqlknqleeseftcaaaVKARKAmevemedlHLQIDDIAKaktaleeqlsrlqrekneiqnRLEE 1728
Cdd:COG3096    956 QIFALSE-------------------------VVQRRP--------HFSYEDAVG---------------------LLGE 981
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1729 DQeDMNELMKkHKAAVAQASRDMA--QMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSM------VDKSLVSRQEA 1800
Cdd:COG3096    982 NS-DLNEKLR-ARLEQAEEARREAreQLRQAQAQYSQYNQVLASLKSSRDAKQQTLQELEQELeelgvqADAEAEERARI 1059
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1801 KIRELETRL--------EFEKTQVKR---LENLASRLKETMEKLTEERDQRAAAenreKEQNKRLQRQLRDTKEE----M 1865
Cdd:COG3096   1060 RRDELHEELsqnrsrrsQLEKQLTRCeaeMDSLQKRLRKAERDYKQEREQVVQA----KAGWCAVLRLARDNDVErrlhR 1135
                          730       740       750
                   ....*....|....*....|....*....|....*
gi 1907082233 1866 SELARKEAEASRkkhelEMDLESLEAANQSlQADL 1900
Cdd:COG3096   1136 RELAYLSADELR-----SMSDKALGALRLA-VADN 1164
MutS2 COG1193
dsDNA-specific endonuclease/ATPase MutS2 [Replication, recombination and repair];
1509-1624 3.94e-06

dsDNA-specific endonuclease/ATPase MutS2 [Replication, recombination and repair];


Pssm-ID: 440806 [Multi-domain]  Cd Length: 784  Bit Score: 52.45  E-value: 3.94e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1509 SKDEASLAKVkkqLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRqthskemESRDEEVEEARQSCQKKLKQ 1588
Cdd:COG1193    510 GEESIDVEKL---IEELERERRELEEEREEAERLREELEKLREELEEKLEELE-------EEKEEILEKAREEAEEILRE 579
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1907082233 1589 MEVQLEE---EYEDKQKALREKRELESKLSTLSDQVNQR 1624
Cdd:COG1193    580 ARKEAEElirELREAQAEEEELKEARKKLEELKQELEEK 618
MYSc_Myo33 cd14894
class myosin, motor domain; Class XXXIII myosins have variable numbers of IQ domain and 2 ...
513-799 4.26e-06

class myosin, motor domain; Class XXXIII myosins have variable numbers of IQ domain and 2 tandem ANK repeats that are separated by a PH domain. The myosin classes XXX to XXXIV contain members from Phytophthora species and Hyaloperonospora parasitica. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276859 [Multi-domain]  Cd Length: 871  Bit Score: 52.44  E-value: 4.26e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  513 LLEAFGNSPTIMNGSATRFSQILSLDF-----DQAGQVASASIQTMLLEKLRVARRPASEA------TFNVFYYLLACGD 581
Cdd:cd14894    255 VLEAFGHATTSMNLNSSRFGKMTTLQVafglhPWEFQICGCHISPFLLEKSRVTSERGRESgdqnelNFHILYAMVAGVN 334
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  582 A-----TLRTELHLNHL----------AENNVFGIVplSKPEEKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYH 646
Cdd:cd14894    335 AfpfmrLLAKELHLDGIdcsaltylgrSDHKLAGFV--SKEDTWKKDVERWQQVIDGLDELNVSPDEQKTIFKVLSAVLW 412
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  647 LGAAgatkePLEEQDEAAEAGRKQFARHEWAQKAAYLLGC-SLEELSSAIFKhqlKGGTLQRSTSFRQGPEESGLGEGTK 725
Cdd:cd14894    413 LGNI-----ELDYREVSGKLVMSSTGALNAPQKVVELLELgSVEKLERMLMT---KSVSLQSTSETFEVTLEKGQVNHVR 484
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  726 lsaleclEGMASGLYSELFTLLISLVNRALKSS-------QHSLCS----------MMIVDTPGFQNpewggsARGASFE 788
Cdd:cd14894    485 -------DTLARLLYQLAFNYVVFVMNEATKMSalstdgnKHQMDSnasapeavslLKIVDVFGFED------LTHNSLD 551
                          330
                   ....*....|.
gi 1907082233  789 ELCHNYAQDRL 799
Cdd:cd14894    552 QLCINYLSEKL 562
mukB PRK04863
chromosome partition protein MukB;
1230-1846 4.65e-06

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 52.27  E-value: 4.65e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1230 SEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQT 1309
Cdd:PRK04863   514 QLQQLRMRLSELEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQA 593
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1310 QYDALKKQMEVMEMEVmeARLIRAAEINGEVDDDDAGGEwrlkyeravrevDFTKKRLQQELEDKMEVEQ--QSRRQLER 1387
Cdd:PRK04863   594 RIQRLAARAPAWLAAQ--DALARLREQSGEEFEDSQDVT------------EYMQQLLERERELTVERDElaARKQALDE 659
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1388 RLGDLQADSDESQRALQQLkkkCQRLTAEL-----QDTKLH-------LEGQQ-----VRNHELEKKQ------------ 1438
Cdd:PRK04863   660 EIERLSQPGGSEDPRLNAL---AERFGGVLlseiyDDVSLEdapyfsaLYGPArhaivVPDLSDAAEQlagledcpedly 736
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1439 ------RRFDSELSQAhEETQREKLQRE-----------------KLQREK--DMLLAEAFSLKQQMEEKDLDIagftQK 1493
Cdd:PRK04863   737 liegdpDSFDDSVFSV-EELEKAVVVKIadrqwrysrfpevplfgRAAREKriEQLRAEREELAERYATLSFDV----QK 811
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1494 VVSLEAELQDISSQ------ESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAgsiQMLEQAKLRLEMEMERMRQTHSKE 1567
Cdd:PRK04863   812 LQRLHQAFSRFIGShlavafEADPEAELRQLNRRRVELERALADHESQEQQQR---SQLEQAKEGLSALNRLLPRLNLLA 888
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1568 MESRDEEVEEARQscqkklkqmevQLEEEYEDK------QKALrekRELESKLSTLsdQVNQRDFEsekRLRKDLKRTKA 1641
Cdd:PRK04863   889 DETLADRVEEIRE-----------QLDEAEEAKrfvqqhGNAL---AQLEPIVSVL--QSDPEQFE---QLKQDYQQAQQ 949
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1642 LLADAQIMLDHLKNnapskreiaqlknqleeseftcaaaVKARKAmevemedlHLQIDDIAK---AKTALEEQLSRlqre 1718
Cdd:PRK04863   950 TQRDAKQQAFALTE-------------------------VVQRRA--------HFSYEDAAEmlaKNSDLNEKLRQ---- 992
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1719 kneiqnRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEF--LEQSMVDKS--- 1793
Cdd:PRK04863   993 ------RLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQDLGVPADSgaEERARARRDelh 1066
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1794 -LVSRQEAKIRELETRLEFEKTqvkRLENLASRLKETMEKLTEERDQRAAAENR 1846
Cdd:PRK04863  1067 aRLSANRSRRNQLEKQLTFCEA---EMDNLTKKLRKLERDYHEMREQVVNAKAG 1117
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1512-1872 4.65e-06

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 52.26  E-value: 4.65e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1512 EASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQ----AKLRLEMEMERMRQThsKEMESRDEEVEEArqscQKKLK 1587
Cdd:COG3096    291 RRELFGARRQLAEEQYRLVEMARELEELSARESDLEQdyqaASDHLNLVQTALRQQ--EKIERYQEDLEEL----TERLE 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1588 QMEVQLEEEYEDKQKAlREKRElesklstlsdqvnqrdfesekRLRKDLKRTKALLADAQIMLDHLKNNAPSKReiaQLK 1667
Cdd:COG3096    365 EQEEVVEEAAEQLAEA-EARLE---------------------AAEEEVDSLKSQLADYQQALDVQQTRAIQYQ---QAV 419
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1668 NQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEedqedmneLMKKHKAAV-AQ 1746
Cdd:COG3096    420 QALEKARALCGLPDLTPENAEDYLAAFRAKEQQATEEVLELEQKLSVADAARRQFEKAYE--------LVCKIAGEVeRS 491
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1747 ASRDMAQmndlqaQIEESNKEKQELQEKLQALQSQVEFLEQsmvdksLVSRQEAKIRELEtrlEFEKTQVKRLENlASRL 1826
Cdd:COG3096    492 QAWQTAR------ELLRRYRSQQALAQRLQQLRAQLAELEQ------RLRQQQNAERLLE---EFCQRIGQQLDA-AEEL 555
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082233 1827 KETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKE 1872
Cdd:COG3096    556 EELLAELEAQLEELEEQAAEAVEQRSELRQQLEQLRARIKELAARA 601
PRK12704 PRK12704
phosphodiesterase; Provisional
1517-1672 5.83e-06

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 51.70  E-value: 5.83e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1517 KVKKQLRDLEAKVKDQEEELDEQAGSIQmlEQAKLRLEMEMERMRQTHSKEMESRDEEVEEArqscQKKLKQMEVQLEEE 1596
Cdd:PRK12704    28 IAEAKIKEAEEEAKRILEEAKKEAEAIK--KEALLEAKEEIHKLRNEFEKELRERRNELQKL----EKRLLQKEENLDRK 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1597 YEDKQKALREKRELESKLSTLSDQVNQRDFESEK---RLRKDLKRTKALLAD--AQIMLDHLKNNApsKREIAQLKNQLE 1671
Cdd:PRK12704   102 LELLEKREEELEKKEKELEQKQQELEKKEEELEElieEQLQELERISGLTAEeaKEILLEKVEEEA--RHEAAVLIKEIE 179

                   .
gi 1907082233 1672 E 1672
Cdd:PRK12704   180 E 180
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1292-1540 5.94e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 50.92  E-value: 5.94e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1292 AETAERLRTEKEMKELQTQYDALKKQMEVMEMevmearliRAAEINGEVDDDDaggewrlkyeravREVDFTKKRLqQEL 1371
Cdd:COG4942     17 AQADAAAEAEAELEQLQQEIAELEKELAALKK--------EEKALLKQLAALE-------------RRIAALARRI-RAL 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1372 EDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLtAELQDTKLHLEGQQVRnhELEKKQRRFDSELSQAHEE 1451
Cdd:COG4942     75 EQELAALEAELAELEKEIAELRAELEAQKEELAELLRALYRL-GRQPPLALLLSPEDFL--DAVRRLQYLKYLAPARREQ 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1452 TQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKD 1531
Cdd:COG4942    152 AEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIAR 231

                   ....*....
gi 1907082233 1532 QEEELDEQA 1540
Cdd:COG4942    232 LEAEAAAAA 240
HMMR_N pfam15905
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of ...
1627-1926 7.84e-06

Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of eukaryotic hyaluronan-mediated motility receptor proteins. The protein is functionally associated with BRCA1 and thus predicted to be a common, low-penetrance breast cancer candidate.


Pssm-ID: 464932 [Multi-domain]  Cd Length: 329  Bit Score: 50.58  E-value: 7.84e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1627 ESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMevemedlhlqiddiAKAKT 1706
Cdd:pfam15905   60 ELKKKSQKNLKESKDQKELEKEIRALVQERGEQDKRLQALEEELEKVEAKLNAAVREKTSL--------------SASVA 125
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1707 ALEEQLSRLQREKNEIQNRLEED--QEDMN----ELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQS 1780
Cdd:pfam15905  126 SLEKQLLELTRVNELLKAKFSEDgtQKKMSslsmELMKLRNKLEAKMKEVMAKQEGMEGKLQVTQKNLEHSKGKVAQLEE 205
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1781 QVEFLEQSMVDkslvsrqeakireletrlefEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRD 1860
Cdd:pfam15905  206 KLVSTEKEKIE--------------------EKSETEKLLEYITELSCVSEQVEKYKLDIAQLEELLKEKNDEIESLKQS 265
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082233 1861 TKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKlafKRIGDLQAAIedEMESDENEDL 1926
Cdd:pfam15905  266 LEEKEQELSKQIKDLNEKCKLLESEKEELLREYEEKEQTLN---AELEELKEKL--TLEEQEHQKL 326
Filament pfam00038
Intermediate filament protein;
1236-1561 1.01e-05

Intermediate filament protein;


Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 49.92  E-value: 1.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1236 NKDEEIQQLRSKL----EKVEkernELRLSSDRLETRISELTSELTDERN--------TGESASQLLDAETAERLRTEKE 1303
Cdd:pfam00038    1 NEKEQLQELNDRLasyiDKVR----FLEQQNKLLETKISELRQKKGAEPSrlyslyekEIEDLRRQLDTLTVERARLQLE 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1304 MKELQTQYDALKKQMEVMEMEVMEARliraAEINGEVDDDDAGGEWRLKYERAVR----EVDFTKKRLQQELedkmeveq 1379
Cdd:pfam00038   77 LDNLRLAAEDFRQKYEDELNLRTSAE----NDLVGLRKDLDEATLARVDLEAKIEslkeELAFLKKNHEEEV-------- 144
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1380 qsrRQLERRLGDlqadsdeSQRALQQLKKKCQRLTAELQDTKLHLEGQQVRN-HELEKKQRRFDSELSQAhEETQREKLQ 1458
Cdd:pfam00038  145 ---RELQAQVSD-------TQVNVEMDAARKLDLTSALAEIRAQYEEIAAKNrEEAEEWYQSKLEELQQA-AARNGDALR 213
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1459 REKlqrekdmllAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDIssqESKDEASLAKVKKQLRDLEAKVKDQEEELDE 1538
Cdd:pfam00038  214 SAK---------EEITELRRTIQSLEIELQSLKKQKASLERQLAET---EERYELQLADYQELISELEAELQETRQEMAR 281
                          330       340
                   ....*....|....*....|...
gi 1907082233 1539 QAGSIQMLEQAKLRLEMEMERMR 1561
Cdd:pfam00038  282 QLREYQELLNVKLALDIEIATYR 304
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1583-1954 1.02e-05

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 51.21  E-value: 1.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1583 QKKLKQMEVQLEEEYEDKQKALREKRelesklsTLSDQVNQrdfesEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKRE 1662
Cdd:TIGR01612  458 KSKLKALEKRFFEIFEEEWGSYDIKK-------DIDENSKQ-----DNTVKLILMRMKDFKDIIDFMELYKPDEVPSKNI 525
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1663 IAqlknqleeseFTCAAAVKARKAMEVEmedlhlqiddiAKAKTALEeQLSRLQREKNEIQNRLEEDQEDMNELMKkhka 1742
Cdd:TIGR01612  526 IG----------FDIDQNIKAKLYKEIE-----------AGLKESYE-LAKNWKKLIHEIKKELEEENEDSIHLEK---- 579
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1743 avaqasrdmaQMNDLQAQ----IEES---NKEKQELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQ 1815
Cdd:TIGR01612  580 ----------EIKDLFDKyleiDDEIiyiNKLKLELKEKIKNISDKNEYIKKAIDLKKIIENNNAYIDELAKISPYQVPE 649
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1816 -VKRLENLASRLKETMEKLTEerDQRAAAENrekEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELE-MDLESLEAAN 1893
Cdd:TIGR01612  650 hLKNKDKIYSTIKSELSKIYE--DDIDALYN---ELSSIVKENAIDNTEDKAKLDDLKSKIDKEYDKIQnMETATVELHL 724
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1894 QSLQADLKLAFKRIGDLQAAIEDEMesdeNEDLINSLQDMVTKYQKKKNKLEGDSDVDSEL 1954
Cdd:TIGR01612  725 SNIENKKNELLDIIVEIKKHIHGEI----NKDLNKILEDFKNKEKELSNKINDYAKEKDEL 781
GOLGA2L5 pfam15070
Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein ...
1573-1891 1.02e-05

Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein family remains unknown. This family of proteins is thought to be found in the Golgi apparatus of eukaryotes.


Pssm-ID: 464485 [Multi-domain]  Cd Length: 521  Bit Score: 50.83  E-value: 1.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1573 EEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDF----------ESEKRLRKDLKRTKAL 1642
Cdd:pfam15070   18 ENLKEEGAVWQQKMQQLSEQVRTLREEKERSVSQVQELETSLAELKNQAAVPPAeeeqppagpsEEEQRLQEEAEQLQKE 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1643 LADAQIML-DHLKNNAPSKREIAQLKNQLEESEFTC---AAAVKARKAMeveMEDLHLQIDDIAKAKT---ALEEQLSRL 1715
Cdd:pfam15070   98 LEALAGQLqAQVQDNEQLSRLNQEQEQRLLELERAAerwGEQAEDRKQI---LEDMQSDRATISRALSqnrELKEQLAEL 174
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1716 Q-------REKNEIQNRLEEDQEDMNELMKKhkaaVAQASRDMAQMNDlqaQIEESNKEKQELQEK----LQALQSQVEF 1784
Cdd:pfam15070  175 QngfvkltNENMELTSALQSEQHVKKELAKK----LGQLQEELGELKE---TLELKSQEAQSLQEQrdqyLAHLQQYVAA 247
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1785 LEQSMVDKSLVSRQEAKIRELETRLEFEKTQVK-RLENLASRLKETMEKLTeerdqrAAAenrekEQNKRLQRQLRDTKE 1863
Cdd:pfam15070  248 YQQLASEKEELHKQYLLQTQLMDRLQHEEVQGKvAAEMARQELQETQERLE------ALT-----QQNQQLQAQLSLLAN 316
                          330       340       350
                   ....*....|....*....|....*....|..
gi 1907082233 1864 EMS----ELARKEAEASRKKHELEMDLESLEA 1891
Cdd:pfam15070  317 PGEgdglESEEEEEEAPRPSLSIPEDFESREA 348
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
1573-1881 1.12e-05

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 50.73  E-value: 1.12e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1573 EEVEEARQSCQKKLKQMEvQLEEEYED--------KQKALREKRELESKLSTLSDQVNQRDFESEKRLRKDLKRtkallA 1644
Cdd:COG5185    232 EEALKGFQDPESELEDLA-QTSDKLEKlveqntdlRLEKLGENAESSKRLNENANNLIKQFENTKEKIAEYTKS-----I 305
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1645 DAQIMLDHLKNNAPSKREIAQLKNQLEESEftcaAAVKARKA-MEVEMEDLHLQIDDIAKAKTAL--EEQLSRLQREKNE 1721
Cdd:COG5185    306 DIKKATESLEEQLAAAEAEQELEESKRETE----TGIQNLTAeIEQGQESLTENLEAIKEEIENIvgEVELSKSSEELDS 381
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1722 IQNRLE----EDQEDMNELMKKHKAAVAQASRDMA----QMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKS 1793
Cdd:COG5185    382 FKDTIEstkeSLDEIPQNQRGYAQEILATLEDTLKaadrQIEELQRQIEQATSSNEEVSKLLNELISELNKVMREADEES 461
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1794 LVSRQEAK---IRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELAR 1870
Cdd:COG5185    462 QSRLEEAYdeiNRSVRSKKEDLNEELTQIESRVSTLKATLEKLRAKLERQLEGVRSKLDQVAESLKDFMRARGYAHILAL 541
                          330
                   ....*....|.
gi 1907082233 1871 KEAEASRKKHE 1881
Cdd:COG5185    542 ENLIPASELIQ 552
PLN02939 PLN02939
transferase, transferring glycosyl groups
1437-1844 1.18e-05

transferase, transferring glycosyl groups


Pssm-ID: 215507 [Multi-domain]  Cd Length: 977  Bit Score: 51.06  E-value: 1.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1437 KQRRFDSElsqaheETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLD-IAGFTQKVVSLEAELQDISSQesKDEASL 1515
Cdd:PLN02939    38 RRRGFSSQ------QKKKRGKNIAPKQRSSNSKLQSNTDENGQLENTSLRtVMELPQKSTSSDDDHNRASMQ--RDEAIA 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1516 AKVKKQLRdleaKVKDQEEELDEQAGS-IQMLEQAklrlEMEMERMRQTHSKEMESRDEEVEEaRQSCQKKLKQMEVQLE 1594
Cdd:PLN02939   110 AIDNEQQT----NSKDGEQLSDFQLEDlVGMIQNA----EKNILLLNQARLQALEDLEKILTE-KEALQGKINILEMRLS 180
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1595 EEYEDKQKALREKRELEsklsTLSDQVNQRDFESEKRLRKDLKRTKALLADaqimLDHLK-NNAPSKREIAQLKNQL--- 1670
Cdd:PLN02939   181 ETDARIKLAAQEKIHVE----ILEEQLEKLRNELLIRGATEGLCVHSLSKE----LDVLKeENMLLKDDIQFLKAELiev 252
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1671 EESEFTCAAAVKARKAMEVEMEDLHLQIddiakakTALEEQLSRLQREKNEIqnrLEEDQEDMNELMKKHKAAVAQASRD 1750
Cdd:PLN02939   253 AETEERVFKLEKERSLLDASLRELESKF-------IVAQEDVSKLSPLQYDC---WWEKVENLQDLLDRATNQVEKAALV 322
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1751 MAQMNDLQAQI---EESNKEKQELQEKLQalqsQVEFLEQSMvdKSLVSRQEAKIRELETrlefektQVKRLENLASRLK 1827
Cdd:PLN02939   323 LDQNQDLRDKVdklEASLKEANVSKFSSY----KVELLQQKL--KLLEERLQASDHEIHS-------YIQLYQESIKEFQ 389
                          410
                   ....*....|....*..
gi 1907082233 1828 ETMEKLTEERDQRAAAE 1844
Cdd:PLN02939   390 DTLSKLKEESKKRSLEH 406
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
1666-1900 1.25e-05

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 50.28  E-value: 1.25e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1666 LKNQLEESEFTCAAAVKARKAMEVEMEDlhlQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVA 1745
Cdd:pfam07888   32 LQNRLEECLQERAELLQAQEAANRQREK---EKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEKYKELSA 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1746 QASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVD-KSLVSRQEAKIRELETRLEFEKTQVKRLENLAS 1824
Cdd:pfam07888  109 SSEELSEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELERmKERAKKAGAQRKEEEAERKQLQAKLQQTEEELR 188
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1825 RLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRD----------TKEEMSELARKEAEASRKKHELEMDLESLEAANQ 1894
Cdd:pfam07888  189 SLSKEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTahrkeaeneaLLEELRSLQERLNASERKVEGLGEELSSMAAQRD 268

                   ....*.
gi 1907082233 1895 SLQADL 1900
Cdd:pfam07888  269 RTQAEL 274
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1412-1777 1.36e-05

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 49.90  E-value: 1.36e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1412 RLTAELQDTKLHLEGQQVRNHELEKKQrrfDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFT 1491
Cdd:COG4372      3 RLGEKVGKARLSLFGLRPKTGILIAAL---SEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLE 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1492 QKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMErMRQTHSKEMESR 1571
Cdd:COG4372     80 EELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIA-EREEELKELEEQ 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1572 DEEVEEARQSCQKKLKQMEVQleEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLD 1651
Cdd:COG4372    159 LESLQEELAALEQELQALSEA--EAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALS 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1652 HLKNNAPSKREIAQLKNQLEESEFTcAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQE 1731
Cdd:COG4372    237 ALLDALELEEDKEELLEEVILKEIE-ELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALED 315
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1732 DM----NELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQA 1777
Cdd:COG4372    316 ALlaalLELAKKLELALAILLAELADLLQLLLVGLLDNDVLELLSKGAEA 365
mukB PRK04863
chromosome partition protein MukB;
1465-1787 1.56e-05

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 50.73  E-value: 1.56e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1465 EKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEAS-------LAKVKKQLR-------------D 1524
Cdd:PRK04863   280 ERRVHLEEALELRRELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQDyqaasdhLNLVQTALRqqekieryqadleE 359
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1525 LEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMR-------------QTHSKEMESRDEEVEEARQSCQK---KLKQ 1588
Cdd:PRK04863   360 LEERLEEQNEVVEEADEQQEENEARAEAAEEEVDELKsqladyqqaldvqQTRAIQYQQAVQALERAKQLCGLpdlTADN 439
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1589 MEVQLEEeYEDKQKALREK-RELESKLSTLSDQVNQRD------------------FESEKRLRKDLKRTKALLADAQIM 1649
Cdd:PRK04863   440 AEDWLEE-FQAKEQEATEElLSLEQKLSVAQAAHSQFEqayqlvrkiagevsrseaWDVARELLRRLREQRHLAEQLQQL 518
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1650 LDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEE- 1728
Cdd:PRK04863   519 RMRLSELEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQARIQRl 598
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082233 1729 ---------DQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQ 1787
Cdd:PRK04863   599 aarapawlaAQDALARLREQSGEEFEDSQDVTEYMQQLLERERELTVERDELAARKQALDEEIERLSQ 666
mukB PRK04863
chromosome partition protein MukB;
1571-1912 1.96e-05

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 50.34  E-value: 1.96e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1571 RDEEVEEArQSCQKKLKQMEVQLEEEyedkQKALREkreLESKLSTLSdqvnqrdfESEKRLRKDLKRTKALLADAQIML 1650
Cdd:PRK04863   281 RRVHLEEA-LELRRELYTSRRQLAAE----QYRLVE---MARELAELN--------EAESDLEQDYQAASDHLNLVQTAL 344
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1651 DHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKA----KTALEEQLSRLQREKNEIQnRL 1726
Cdd:PRK04863   345 RQQEKIERYQADLEELEERLEEQNEVVEEADEQQEENEARAEAAEEEVDELKSQladyQQALDVQQTRAIQYQQAVQ-AL 423
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1727 EEDQEdMNELMKkhkAAVAQASRDMAQmndLQAQIEESNKEKQELQEKL---QALQSQVEF---LEQSMVDKslVSRQEA 1800
Cdd:PRK04863   424 ERAKQ-LCGLPD---LTADNAEDWLEE---FQAKEQEATEELLSLEQKLsvaQAAHSQFEQayqLVRKIAGE--VSRSEA 494
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1801 K--IRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDqraaAENREKEQNKRLQRQLRDtkEEMSELARKEAEASRK 1878
Cdd:PRK04863   495 WdvARELLRRLREQRHLAEQLQQLRMRLSELEQRLRQQQR----AERLLAEFCKRLGKNLDD--EDELEQLQEELEARLE 568
                          330       340       350
                   ....*....|....*....|....*....|....
gi 1907082233 1879 khELEMDLESLEAANQSLQADLKLAFKRIGDLQA 1912
Cdd:PRK04863   569 --SLSESVSEARERRMALRQQLEQLQARIQRLAA 600
PRK10246 PRK10246
exonuclease subunit SbcC; Provisional
1372-1925 2.29e-05

exonuclease subunit SbcC; Provisional


Pssm-ID: 182330 [Multi-domain]  Cd Length: 1047  Bit Score: 49.80  E-value: 2.29e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1372 EDKMEVEQQSRRQLE----RRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNH---------ELEKKQ 1438
Cdd:PRK10246   231 EEKQLLTAQQQQQQSlnwlTRLDELQQEASRRQQALQQALAAEEKAQPQLAALSLAQPARQLRPHweriqeqsaALAHTR 310
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1439 RRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDlDIAGFTQKVVSLEAELqdisSQESKDEASLAKV 1518
Cdd:PRK10246   311 QQIEEVNTRLQSTMALRARIRHHAAKQSAELQAQQQSLNTWLAEHD-RFRQWNNELAGWRAQF----SQQTSDREQLRQW 385
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1519 KKQLRDLEAKVKDQEE-----ELDEQAGSI-QMLEQAKLRlememERMRQTHSK--EMESRDEEVEEARQSCQKKLKQME 1590
Cdd:PRK10246   386 QQQLTHAEQKLNALPAitltlTADEVAAALaQHAEQRPLR-----QRLVALHGQivPQQKRLAQLQVAIQNVTQEQTQRN 460
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1591 VQLEEeyedKQKALREKRELESKLSTLSDQvnqrdfesEKRLrKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKnqL 1670
Cdd:PRK10246   461 AALNE----MRQRYKEKTQQLADVKTICEQ--------EARI-KDLEAQRAQLQAGQPCPLCGSTSHPAVEAYQALE--P 525
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1671 EESEFTCAAAVKARKAMEVEMEDLHLQIDdiakaktALEEQLsrlQREKNEIQNRLEEDQEdmneLMKKHKAAVAQASRD 1750
Cdd:PRK10246   526 GVNQSRLDALEKEVKKLGEEGAALRGQLD-------ALTKQL---QRDESEAQSLRQEEQA----LTQQWQAVCASLNIT 591
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1751 MAQMNDLQAQIEESNKEKQELQE--KLQALQSQVEfleqsmvdkslvsRQEAKIRELETRLEFEKTQVK-RLENLASRLK 1827
Cdd:PRK10246   592 LQPQDDIQPWLDAQEEHERQLRLlsQRHELQGQIA-------------AHNQQIIQYQQQIEQRQQQLLtALAGYALTLP 658
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1828 ETMEKLT--EERDQRAAAENREKEQNKRLQRQ------LRDTKEEMSELARKEAEAS----RKKHElemDLESLEAANQS 1895
Cdd:PRK10246   659 QEDEEASwlATRQQEAQSWQQRQNELTALQNRiqqltpLLETLPQSDDLPHSEETVAldnwRQVHE---QCLSLHSQLQT 735
                          570       580       590
                   ....*....|....*....|....*....|
gi 1907082233 1896 LQADLKLAFKRIGDLQAAIEDEMESDENED 1925
Cdd:PRK10246   736 LQQQDVLEAQRLQKAQAQFDTALQASVFDD 765
WEMBL pfam05701
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required ...
1382-1806 2.33e-05

Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required for the chloroplast avoidance response under high intensity blue light. This avoidance response consists in the relocation of chloroplasts on the anticlinal side of exposed cells. Acts in association with PMI2 to maintain the velocity of chloroplast photo-relocation movement via the regulation of cp-actin filaments. Thus several member-sequences are described as "myosin heavy chain-like".


Pssm-ID: 461718 [Multi-domain]  Cd Length: 562  Bit Score: 49.64  E-value: 2.33e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1382 RRQLERRLGDLQADSDESQRALQ-------QLKKKCQRLTAELQDTKLHLEGQQVrnhelEKKQRRFDSELSQ------- 1447
Cdd:pfam05701   37 RKLVELELEKVQEEIPEYKKQSEaaeaakaQVLEELESTKRLIEELKLNLERAQT-----EEAQAKQDSELAKlrveeme 111
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1448 ---AHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEkdldiagfTQKVVSLEAELQDISSQESKDEASLAK-VKKQLR 1523
Cdd:pfam05701  112 qgiADEASVAAKAQLEVAKARHAAAVAELKSVKEELES--------LRKEYASLVSERDIAIKRAEEAVSASKeIEKTVE 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1524 DLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKlKQMEVQLEEEYEdkqKA 1603
Cdd:pfam05701  184 ELTIELIATKESLESAHAAHLEAEEHRIGAALAREQDKLNWEKELKQAEEELQRLNQQLLSA-KDLKSKLETASA---LL 259
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1604 LREKREL----ESKLSTLSDQvNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIA-QLKNQLEEsEFTCA 1678
Cdd:pfam05701  260 LDLKAELaaymESKLKEEADG-EGNEKKTSTSIQAALASAKKELEEVKANIEKAKDEVNCLRVAAaSLRSELEK-EKAEL 337
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1679 AAVKARKAMEvemedlhlqiddiAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKhkaaVAQASRDMAQMNDLQ 1758
Cdd:pfam05701  338 ASLRQREGMA-------------SIAVSSLEAELNRTKSEIALVQAKEKEAREKMVELPKQ----LQQAAQEAEEAKSLA 400
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*...
gi 1907082233 1759 AQIEESNKEKQELQEKLQALQSQVEfleqsmvdkslvSRQEAKIRELE 1806
Cdd:pfam05701  401 QAAREELRKAKEEAEQAKAAASTVE------------SRLEAVLKEIE 436
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
22-305 2.53e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 49.78  E-value: 2.53e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   22 PKPGPPGKEEGAPEASSKDGSAPPRSP--QPATSPVPSETSQTAKSPEPTltmnglGAASAEGPNEEAQGLSRKRVANAV 99
Cdd:PHA03307   106 PTPPGPSSPDPPPPTPPPASPPPSPAPdlSEMLRPVGSPGPPPAASPPAA------GASPAAVASDAASSRQAALPLSSP 179
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  100 RKVV-SKVLPGEEPGTAKEPPGRGARSPEH--PARGKKGEKAATSPKPPPPPPPPPAPPKPETK-KEAAKDELSVGLRSL 175
Cdd:PHA03307   180 EETArAPSSPPAEPPPSTPPAAASPRPPRRssPISASASSPAPAPGRSAADDAGASSSDSSSSEsSGCGWGPENECPLPR 259
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  176 MSRGRGKDHKSRGKQPPGKGEKAPSQEPASTGKSGSPDMvdSPKKAGSPAKPETPNKRCSPAPAQE------LADPNLAG 249
Cdd:PHA03307   260 PAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSP--SPSSPGSGPAPSSPRASSSSSSSREssssstSSSSESSR 337
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082233  250 PKLNPSGKQQAKSPASDvqqeTGADPEVRLSPAEEAHQRLERIFTASLDPEAASPA 305
Cdd:PHA03307   338 GAAVSPGPSPSRSPSPS----RPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRR 389
ATG16 pfam08614
Autophagy protein 16 (ATG16); Autophagy is a ubiquitous intracellular degradation system for ...
1490-1557 2.73e-05

Autophagy protein 16 (ATG16); Autophagy is a ubiquitous intracellular degradation system for eukaryotic cells. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. ATG16 (also known as Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate in the yeast autophagy pathway.


Pssm-ID: 462536 [Multi-domain]  Cd Length: 176  Bit Score: 46.85  E-value: 2.73e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1490 FTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQML--EQAKLRLEMEM 1557
Cdd:pfam08614   76 LAQRLVDLNEELQELEKKLREDERRLAALEAERAQLEEKLKDREEELREKRKLNQDLqdELVALQLQLNM 145
CCCAP pfam15964
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in ...
1368-1867 2.80e-05

Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in eukaryotes. CCCAP is also known as SDCCAG8, serologically defined colon cancer antigen 8. It is associated with the centrosome.


Pssm-ID: 435040 [Multi-domain]  Cd Length: 703  Bit Score: 49.52  E-value: 2.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1368 QQELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKlhleGQQVRNHELEKKQRrfDSELSQ 1447
Cdd:pfam15964  219 RLELEKLKLLYEAKTEVLESQVKSLRKDLAESQKTCEDLKERLKHKESLVAAST----SSRVGGLCLKCAQH--EAVLAQ 292
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1448 AHEETQREKLQRekLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSleaelqdisSQESKDEASLAKVKK--QLRDL 1525
Cdd:pfam15964  293 THTNVHMQTIER--LTKERDDLMSALVSVRSSLAEAQQRESSAYEQVKQ---------AVQMTEEANFEKTKAliQCEQL 361
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1526 EAKVKDQEEELDEQAGSiqmlEQAKLRLEMEMERmrqthsKEMESRDEEVEEARQSCQKKLKQMEVQLEeeyedkqKALR 1605
Cdd:pfam15964  362 KSELERQKERLEKELAS----QQEKRAQEKEALR------KEMKKEREELGATMLALSQNVAQLEAQVE-------KVTR 424
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1606 EKRELESKLSTLSDQVNQRDFESEK---RLRKDLKRTKALLADAQimLDHLKNNAPSKR-------EIAQLKNQLEESef 1675
Cdd:pfam15964  425 EKNSLVSQLEEAQKQLASQEMDVTKvcgEMRYQLNQTKMKKDEAE--KEHREYRTKTGRqleikdqEIEKLGLELSES-- 500
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1676 tcaaavkaRKAMEVEMEDLHLQIDDIAKaktaLEEQLSRLQREKNeiqnrleedqedmneLMKKHKAAVAQASRDMAQMN 1755
Cdd:pfam15964  501 --------KQRLEQAQQDAARAREECLK----LTELLGESEHQLH---------------LTRLEKESIQQSFSNEAKAQ 553
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1756 DLQAQIEEsnkekQELQEKLQALQSQvefleqsmvdkslvsrQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTE 1835
Cdd:pfam15964  554 ALQAQQRE-----QELTQKMQQMEAQ----------------HDKTVNEQYSLLTSQNTFIAKLKEECCTLAKKLEEITQ 612
                          490       500       510
                   ....*....|....*....|....*....|..
gi 1907082233 1836 ERDQRAAAENREKEQNKRLQRQLRDTKEEMSE 1867
Cdd:pfam15964  613 KSRSEVEQLSQEKEYLQDRLEKLQKRNEELEE 644
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
99-281 3.53e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 49.30  E-value: 3.53e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   99 VRKVVSKVLPGEEPGTAK--EPPgRGARSPEHPARGKkGEKAATSPKPPPPPPPPPAPpkpETKKEAAKDELSVGLRSlm 176
Cdd:PTZ00449   489 IKKSKKKLAPIEEEDSDKhdEPP-EGPEASGLPPKAP-GDKEGEEGEHEDSKESDEPK---EGGKPGETKEGEVGKKP-- 561
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  177 srGRGKDHKSrgkqppgkgEKAP--SQEPASTGKSGSPDMVDSPKKAGSPAKPETPNKRCSPAPAQELADPNlagpklNP 254
Cdd:PTZ00449   562 --GPAKEHKP---------SKIPtlSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPK------SP 624
                          170       180
                   ....*....|....*....|....*..
gi 1907082233  255 SGKQQAKSPASDVQQETGADPEVRLSP 281
Cdd:PTZ00449   625 KRPESPKSPKRPPPPQRPSSPERPEGP 651
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
1354-1611 3.89e-05

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 48.38  E-value: 3.89e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1354 ERAVREVDFTKKRLQQELEDKMEVEQQSRRQLERRlgdlqadsdESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRnHE 1433
Cdd:pfam13868   51 EERERALEEEEEKEEERKEERKRYRQELEEQIEER---------EQKRQEEYEEKLQEREQMDEIVERIQEEDQAEA-EE 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1434 LEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKdldiagftqkvvsLEAELQDIssqeskdEA 1513
Cdd:pfam13868  121 KLEKQRQLREEIDEFNEEQAEWKELEKEEEREEDERILEYLKEKAEREEE-------------REAEREEI-------EE 180
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1514 SLAKVKKQLRDLEAKVKDQEEELDEQagsiqmleQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQK----KLKQM 1589
Cdd:pfam13868  181 EKEREIARLRAQQEKAQDEKAERDEL--------RAKLYQEEQERKERQKEREEAEKKARQRQELQQAREEqielKERRL 252
                          250       260
                   ....*....|....*....|..
gi 1907082233 1590 EVQLEEEYEDKQKALREKRELE 1611
Cdd:pfam13868  253 AEEAEREEEEFERMLRKQAEDE 274
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1475-1673 4.05e-05

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 48.29  E-value: 4.05e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1475 SLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLE 1554
Cdd:COG3883     27 ELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARALYRSGGSVSYLD 106
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1555 M---------------EMERMRQTHSKEMesrdEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSD 1619
Cdd:COG3883    107 VllgsesfsdfldrlsALSKIADADADLL----EELKADKAELEAKKAELEAKLAELEALKAELEAAKAELEAQQAEQEA 182
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1620 QVNQrdfesekrLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEES 1673
Cdd:COG3883    183 LLAQ--------LSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAA 228
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1743-1924 4.12e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 47.23  E-value: 4.12e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1743 AVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEqsmvdkslvsrqeAKIRELETRLEFEKTQVKRLENL 1822
Cdd:COG1579      1 AMPEDLRALLDLQELDSELDRLEHRLKELPAELAELEDELAALE-------------ARLEAAKTELEDLEKEIKRLELE 67
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1823 ASRLKETMEKLTEErdQRAAAENREKEQnkrLQRQLRDTKEEMSELARKEAEasrkkheLEMDLESLEAANQSLQADLKL 1902
Cdd:COG1579     68 IEEVEARIKKYEEQ--LGNVRNNKEYEA---LQKEIESLKRRISDLEDEILE-------LMERIEELEEELAELEAELAE 135
                          170       180
                   ....*....|....*....|..
gi 1907082233 1903 AFKRIGDLQAAIEDEMESDENE 1924
Cdd:COG1579    136 LEAELEEKKAELDEELAELEAE 157
Crescentin pfam19220
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ...
1662-1931 4.18e-05

Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteriztic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.


Pssm-ID: 437057 [Multi-domain]  Cd Length: 401  Bit Score: 48.53  E-value: 4.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1662 EIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHK 1741
Cdd:pfam19220   91 RLAKLEAALREAEAAKEELRIELRDKTAQAEALERQLAAETEQNRALEEENKALREEAQAAEKALQRAEGELATARERLA 170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1742 AAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVD-KSLVSRQEAKIRELETRLEFEKT-QVKRL 1819
Cdd:pfam19220  171 LLEQENRRLQALSEEQAAELAELTRRLAELETQLDATRARLRALEGQLAAeQAERERAEAQLEEAVEAHRAERAsLRMKL 250
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1820 ENLASRLKETMEKLTEERDQ--RAAAENREKEQN--------KRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESL 1889
Cdd:pfam19220  251 EALTARAAATEQLLAEARNQlrDRDEAIRAAERRlkeasierDTLERRLAGLEADLERRTQQFQEMQRARAELEERAEML 330
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1890 EAANQSLQADLKLAFKRIGDLQAAIE---DEMESDE------NEDLINSLQ 1931
Cdd:pfam19220  331 TKALAAKDAALERAEERIASLSDRIAeltKRFEVERaaleqaNRRLKEELQ 381
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1225-1544 4.75e-05

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 48.36  E-value: 4.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1225 IQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEM 1304
Cdd:COG4372     24 ILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEEL 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1305 KELQTQYDALKKQMevmemevmearliraaeingevddddaggewrlkyeravrevdftkKRLQQELEDKmeveQQSRRQ 1384
Cdd:COG4372    104 ESLQEEAEELQEEL----------------------------------------------EELQKERQDL----EQQRKQ 133
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1385 LERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKlhlegQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQR 1464
Cdd:COG4372    134 LEAQIAELQSEIAEREEELKELEEQLESLQEELAALE-----QELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKL 208
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1465 EKDMLLAEAfslKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQ 1544
Cdd:COG4372    209 IESLPRELA---EELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALEL 285
MscS_porin pfam12795
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ...
1723-1916 4.89e-05

Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin is towards the N-terminal of the molecules. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.


Pssm-ID: 432790 [Multi-domain]  Cd Length: 238  Bit Score: 47.30  E-value: 4.89e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1723 QNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEfleqsmvDKSLVSRQEAKI 1802
Cdd:pfam12795    8 AKLDEAAKKKLLQDLQQALSLLDKIDASKQRAAAYQKALDDAPAELRELRQELAALQAKAE-------AAPKEILASLSL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1803 RELETRLEFEKTQvkrLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQR------QLRDTKEEMSELARKEAEAS 1876
Cdd:pfam12795   81 EELEQRLLQTSAQ---LQELQNQLAQLNSQLIELQTRPERAQQQLSEARQRLQQirnrlnGPAPPGEPLSEAQRWALQAE 157
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1907082233 1877 RKKHELEMDLESLEAANQSLQADL-----KLAFKRIGDLQAAIED 1916
Cdd:pfam12795  158 LAALKAQIDMLEQELLSNNNRQDLlkarrDLLTLRIQRLEQQLQA 202
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
1630-1916 5.06e-05

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 47.99  E-value: 5.06e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1630 KRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVE-----MEDLHLQIDDIAKA 1704
Cdd:pfam13868    9 RELNSKLLAAKCNKERDAQIAEKKRIKAEEKEEERRLDEMMEEERERALEEEEEKEEERKEerkryRQELEEQIEEREQK 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1705 KtalEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQE-KLQALQSQVE 1783
Cdd:pfam13868   89 R---QEEYEEKLQEREQMDEIVERIQEEDQAEAEEKLEKQRQLREEIDEFNEEQAEWKELEKEEEREEDeRILEYLKEKA 165
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1784 FLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKR-----LQRQL 1858
Cdd:pfam13868  166 EREEEREAEREEIEEEKEREIARLRAQQEKAQDEKAERDELRAKLYQEEQERKERQKEREEAEKKARQRQelqqaREEQI 245
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082233 1859 RDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRigDLQAAIED 1916
Cdd:pfam13868  246 ELKERRLAEEAEREEEEFERMLRKQAEDEEIEQEEAEKRRMKRLEHRR--ELEKQIEE 301
CCCAP pfam15964
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in ...
1218-1593 5.25e-05

Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in eukaryotes. CCCAP is also known as SDCCAG8, serologically defined colon cancer antigen 8. It is associated with the centrosome.


Pssm-ID: 435040 [Multi-domain]  Cd Length: 703  Bit Score: 48.75  E-value: 5.25e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1218 FTTVRPLIQV-QLSEEQIRNKDEEIQQLRSKLEKVEKERNELRlssdrletriSELTSELTDERNTGESASQLLDAETAE 1296
Cdd:pfam15964  349 FEKTKALIQCeQLKSELERQKERLEKELASQQEKRAQEKEALR----------KEMKKEREELGATMLALSQNVAQLEAQ 418
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1297 RLRTEKEMKELQTQYDALKKQMEVMemevmearliraaeingEVDDDDAGGEWR-------LKYERAVREVDFTKKRLQQ 1369
Cdd:pfam15964  419 VEKVTREKNSLVSQLEEAQKQLASQ-----------------EMDVTKVCGEMRyqlnqtkMKKDEAEKEHREYRTKTGR 481
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1370 ELEDKmeveQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDT--KLHLEGQqvrnhELEKKQRRFDSEL-S 1446
Cdd:pfam15964  482 QLEIK----DQEIEKLGLELSESKQRLEQAQQDAARAREECLKLTELLGESehQLHLTRL-----EKESIQQSFSNEAkA 552
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1447 QAHEETQREK----------LQREKLQREKDMLLAEAFSLkqqmeekdldIAGFTQKVVSLEAELQDISSQESKDEASLA 1516
Cdd:pfam15964  553 QALQAQQREQeltqkmqqmeAQHDKTVNEQYSLLTSQNTF----------IAKLKEECCTLAKKLEEITQKSRSEVEQLS 622
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1517 KVKKQLRDLEAKVKDQEEELDEQAgsIQ---MLEQAKLRLEmEMERMRQTHSK---EMESRDEEVEEARQSCQKKLKQME 1590
Cdd:pfam15964  623 QEKEYLQDRLEKLQKRNEELEEQC--VQhgrMHERMKQRLR-QLDKHCQATAQqlvQLLSKQNQLFKERQNLTEEVQSLR 699

                   ...
gi 1907082233 1591 VQL 1593
Cdd:pfam15964  700 SQV 702
Rab5-bind pfam09311
Rabaptin-like protein; Members of this family are predominantly found in Rabaptin and allow ...
1627-1900 6.19e-05

Rabaptin-like protein; Members of this family are predominantly found in Rabaptin and allow for binding to the GTPase Rab5. This interaction is necessary and sufficient for Rab5-dependent recruitment of Rabaptin5 to early endosomal membranes.


Pssm-ID: 462752 [Multi-domain]  Cd Length: 307  Bit Score: 47.65  E-value: 6.19e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1627 ESEKRLRKDLKRTKALLADAQimlDHLKNNAPSKREIAQLKNQL-EESEFTCAAAVKARKAMEVEMEDLHLQIDdiaKAK 1705
Cdd:pfam09311   16 EQEAETRDQVKKLQEMLRQAN---DQLEKTMKDKKELEDKMNQLsEETSNQVSTLAKRNQKSETLLDELQQAFS---QAK 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1706 TALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVaqaSRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFL 1785
Cdd:pfam09311   90 RNFQDQLAVLMDSREQVSDELVRLQKDNESLQGKHSLHV---SLQQAEKFDMPDTVQELQELVLKYREELIEVRTAADHM 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1786 EQSMVDKSLVSRQEAKiRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLqRQLRDTKEEM 1865
Cdd:pfam09311  167 EEKLKAEILFLKEQIQ-AEQCLKENLEETLQAEIENCKEEIASISSLKVELERIKAEKEQLENGLTEKI-RQLEDLQTTK 244
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1907082233 1866 SELARKEAEASRKKHELEMDLESLEAANQSLQADL 1900
Cdd:pfam09311  245 GSLETQLKKETNEKAAVEQLVFEEKNKAQRLQTEL 279
HMMR_N pfam15905
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of ...
1397-1774 6.47e-05

Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of eukaryotic hyaluronan-mediated motility receptor proteins. The protein is functionally associated with BRCA1 and thus predicted to be a common, low-penetrance breast cancer candidate.


Pssm-ID: 464932 [Multi-domain]  Cd Length: 329  Bit Score: 47.50  E-value: 6.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1397 DESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHE-ETQREKLQREKLQREKDMllaeafs 1475
Cdd:pfam15905   24 EKSQRFRKQKAAESQPNLNNSKDASTPATARKVKSLELKKKSQKNLKESKDQKElEKEIRALVQERGEQDKRL------- 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1476 lkQQMEEKdldiagftqkVVSLEAELQDISSQESKDEASLAKVKKQLRDLeAKVKDqeeeldeqagsiqmLEQAKLRLEM 1555
Cdd:pfam15905   97 --QALEEE----------LEKVEAKLNAAVREKTSLSASVASLEKQLLEL-TRVNE--------------LLKAKFSEDG 149
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1556 EMERMRQTHSKEMESRDEeVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRkd 1635
Cdd:pfam15905  150 TQKKMSSLSMELMKLRNK-LEAKMKEVMAKQEGMEGKLQVTQKNLEHSKGKVAQLEEKLVSTEKEKIEEKSETEKLLE-- 226
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1636 lkrtkalladaqimldhlknnapskrEIAQLKNQLEESEftcaaavKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRL 1715
Cdd:pfam15905  227 --------------------------YITELSCVSEQVE-------KYKLDIAQLEELLKEKNDEIESLKQSLEEKEQEL 273
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082233 1716 QREKNEIQNRLEEDQEDMNELMKKHKAavaQASRDMAQMNDLQAQIEESNKEKQELQEK 1774
Cdd:pfam15905  274 SKQIKDLNEKCKLLESEKEELLREYEE---KEQTLNAELEELKEKLTLEEQEHQKLQQK 329
HOOK pfam05622
HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from ...
1391-1886 6.56e-05

HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organizms. The different members of the human gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three human HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas this central coiled-coil motif mediates homodimerization and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes, whereas both HOOK1 and HOOK2 are localized to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head. This entry includes the central coiled-coiled domain and the divergent C-terminal domain.


Pssm-ID: 461694 [Multi-domain]  Cd Length: 528  Bit Score: 48.15  E-value: 6.56e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1391 DLQADSDESQralqQLKKKCQRL---TAELQDTKLHLegqQVRNHELEKKQRRFDSELSQA--------HEETQREKLQR 1459
Cdd:pfam05622    1 DLSEAQEEKD----ELAQRCHELdqqVSLLQEEKNSL---QQENKKLQERLDQLESGDDSGtpggkkylLLQKQLEQLQE 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1460 EKLQREKDmllAEAFSLKQQMEEKD-LDIAGFTQKVVSLEAElqdisSQESKDE-----ASLAKVKKqlrdLEAKV---K 1530
Cdd:pfam05622   74 ENFRLETA---RDDYRIKCEELEKEvLELQHRNEELTSLAEE-----AQALKDEmdilrESSDKVKK----LEATVetyK 141
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1531 DQEEELDEQAGSIQMLEqaklrlEMEMERMRQTHSKEMESRDEEVEEAR-QSCQKKLKQMEVQLEEEYEDKQKALREKRE 1609
Cdd:pfam05622  142 KKLEDLGDLRRQVKLLE------ERNAEYMQRTLQLEEELKKANALRGQlETYKRQVQELHGKLSEESKKADKLEFEYKK 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1610 LESKLSTLsdqvnQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPS--------------------KREIAQLknQ 1669
Cdd:pfam05622  216 LEEKLEAL-----QKEKERLIIERDTLRETNEELRCAQLQQAELSQADALlspssdpgdnlaaeimpaeiREKLIRL--Q 288
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1670 LEESEFTCaaavKARKAMEVEMEDLHLQIDDIAKAKTALEEQLsRLQREK-NEIQNRLEEDQEDMNELMKKHKAAVAqas 1748
Cdd:pfam05622  289 HENKMLRL----GQEGSYRERLTELQQLLEDANRRKNELETQN-RLANQRiLELQQQVEELQKALQEQGSKAEDSSL--- 360
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1749 rdmaqmndLQAQIEESNKEKQELQEKLQALQSQVEFLEQsmvdkSLVSRQEAKIRELETRLefektQVKRLENLA--SRL 1826
Cdd:pfam05622  361 --------LKQKLEEHLEKLHEAQSELQKKKEQIEELEP-----KQDSNLAQKIDELQEAL-----RKKDEDMKAmeERY 422
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1827 KETMEKLTEE-RDQRAAAENREKEQNKRLQRQLRDtKEEMSELARKEAEASRKKHELEMDL 1886
Cdd:pfam05622  423 KKYVEKAKSViKTLDPKQNPASPPEIQALKNQLLE-KDKKIEHLERDFEKSKLQREQEEKL 482
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
1557-1903 6.76e-05

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 48.21  E-value: 6.76e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1557 MERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQ----LEEEYEDKQKALREKRELESKLSTLSDQVNQRDFE----- 1627
Cdd:pfam07111   21 LERRLDTQRPTVTMWEQDVSGDGQGPGRRGRSLELEgsqaLSQQAELISRQLQELRRLEEEVRLLRETSLQQKMRleaqa 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1628 --------SEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTcAAAVKARKAMEVEMEDLHLQID 1699
Cdd:pfam07111  101 meldalavAEKAGQAEAEGLRAALAGAEMVRKNLEEGSQRELEEIQRLHQEQLSSLT-QAHEEALSSLTSKAEGLEKSLN 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1700 DIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMN---ELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQ 1776
Cdd:pfam07111  180 SLETKRAGEAKQLAEAQKEAELLRKQLSKTQEELEaqvTLVESLRKYVGEQVPPEVHSQTWELERQELLDTMQHLQEDRA 259
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1777 ALQSQVEFLE---QSMVdKSLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEerdQRAAAENREKEQNKR 1853
Cdd:pfam07111  260 DLQATVELLQvrvQSLT-HMLALQEEELTRKIQPSDSLEPEFPKKCRSLLNRWREKVFALMV---QLKAQDLEHRDSVKQ 335
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1854 LQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLA 1903
Cdd:pfam07111  336 LRGQVAELQEQVTSQSQEQAILQRALQDKAAEVEVERMSAKGLQMELSRA 385
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1482-1641 8.16e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 46.46  E-value: 8.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1482 EKDLDIAGFTQKVVSLEAELQDIssqeskdEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEmemERMR 1561
Cdd:COG1579     14 ELDSELDRLEHRLKELPAELAEL-------EDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYE---EQLG 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1562 QTHS-KEMESRDEEVEEARQScQKKLKQMEVQLEEEYEDKQKALRE-KRELESKLSTLSDQVNQRDfESEKRLRKDLKRT 1639
Cdd:COG1579     84 NVRNnKEYEALQKEIESLKRR-ISDLEDEILELMERIEELEEELAElEAELAELEAELEEKKAELD-EELAELEAELEEL 161

                   ..
gi 1907082233 1640 KA 1641
Cdd:COG1579    162 EA 163
DUF2046 pfam09755
Uncharacterized conserved protein H4 (DUF2046); This is the conserved N-terminal 350 residues ...
1536-1888 8.23e-05

Uncharacterized conserved protein H4 (DUF2046); This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain.


Pssm-ID: 401633 [Multi-domain]  Cd Length: 304  Bit Score: 47.13  E-value: 8.23e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1536 LDEQAGSIQMLEQAKLRLEMEMERMRqTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKREleskls 1615
Cdd:pfam09755   23 REQLQKRIESLQQENRVLKMELETYK-LRCKALQEENRALRQASVNIQAKAEQEEEFISNTLLKKIQALKKEKE------ 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1616 TLSDQVNQRdfesEKRLRKDLKRTkalladaqimLDHLKnnapskREIAQLKNQLEeseftcaaavkarKAMEVEMEDLH 1695
Cdd:pfam09755   96 TLAMNYEQE----EEFLTNDLSRK----------LTQLR------QEKVELEQTLE-------------QEQEYQVNKLM 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1696 LQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDM-NELMKKhkaavaqasrdmaqMNDLQAqieesnkEKQELQEK 1774
Cdd:pfam09755  143 RKIEKLEAETLNKQTNLEQLRREKVELENTLEQEQEALvNRLWKR--------------MDKLEA-------EKRLLQEK 201
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1775 LQALQSQVEFLEQSMVDKSLVSRQEAKIRELetrlefeKTQVKRLENlasRLKETMEKLTEERDQRAAAENREKEQNKRL 1854
Cdd:pfam09755  202 LDQPVSAPPSPRDSTSEGDTAQNLTAHIQYL-------RKEVERLRR---QLATAQQEHTEKMAQYAQEERHIREENLRL 271
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1907082233 1855 QRQLRDTKEEMSELARK--EAEASrkkheLEMDLES 1888
Cdd:pfam09755  272 QRKLQLEMERREALCRHlsESESS-----LEMDEER 302
DUF4686 pfam15742
Domain of unknown function (DUF4686); This family of proteins is found in eukaryotes. Proteins ...
1529-1863 8.26e-05

Domain of unknown function (DUF4686); This family of proteins is found in eukaryotes. Proteins in this family are typically between 498 and 775 amino acids in length. There is a conserved DLK sequence motif.


Pssm-ID: 464838 [Multi-domain]  Cd Length: 384  Bit Score: 47.37  E-value: 8.26e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1529 VKDQEEELDEQAGSIQMLEQAKLRL---------EMEMER-----MRQTHSKEMESRDEEVEEARQSCQKKL--KQMEVQ 1592
Cdd:pfam15742    1 VSSGEKLKYQQQEEVQQLRQNLQRLqilctsaekELRYERgknldLKQHNSLLQEENIKIKAELKQAQQKLLdsTKMCSS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1593 LEEEYEDKQKALREkRELESKLSTLSDQvnqrdfeSEKRLRKDLKRTKALLADAQimldhlknnapskREIAQLKNQLEE 1672
Cdd:pfam15742   81 LTAEWKHCQQKIRE-LELEVLKQAQSIK-------SQNSLQEKLAQEKSRVADAE-------------EKILELQQKLEH 139
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1673 SEFTCAAAVKA--RKAMEVEMEDLhlqIDDIAKAKTAL-EEQLSR--LQREKNEIQNRLEEDQEDMNELMKKHKAAVAQA 1747
Cdd:pfam15742  140 AHKVCLTDTCIleKKQLEERIKEA---SENEAKLKQQYqEEQQKRklLDQNVNELQQQVRSLQDKEAQLEMTNSQQQLRI 216
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1748 SRDMAQMNDLQAQI---EESNKEKQELQEKLQALQSQVEFL--EQSMVDKSL----------VSRQEAKIRELETRLEFE 1812
Cdd:pfam15742  217 QQQEAQLKQLENEKrksDEHLKSNQELSEKLSSLQQEKEALqeELQQVLKQLdvhvrkynekHHHHKAKLRRAKDRLVHE 296
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082233 1813 KTQ----VKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKE 1863
Cdd:pfam15742  297 VEQrderIKQLENEIGILQQQSEKEKAFQKQVTAQNEILLLEKRKLLEQLTEQEE 351
Golgin_A5 pfam09787
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ...
1364-1562 8.76e-05

Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.


Pssm-ID: 462900 [Multi-domain]  Cd Length: 305  Bit Score: 47.06  E-value: 8.76e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1364 KKRLQQELEDKMEVEQ-QSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDtklhLEGQqvrnHELEKKQRRfd 1442
Cdd:pfam09787   23 KEKLIASLKEGSGVEGlDSSTALTLELEELRQERDLLREEIQKLRGQIQQLRTELQE----LEAQ----QQEEAESSR-- 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1443 SELSQAHEETQREKLQREKLQrekdmllAEAFSLKQQM----EEKDLDIAGFTQKVVSLEAELQDISSQ-ESKDEASLAK 1517
Cdd:pfam09787   93 EQLQELEEQLATERSARREAE-------AELERLQEELryleEELRRSKATLQSRIKDREAEIEKLRNQlTSKSQSSSSQ 165
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1907082233 1518 vkkqlRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQ 1562
Cdd:pfam09787  166 -----SELENRLHQLTETLIQKQTMLEALSTEKNSLVLQLERMEQ 205
Mitofilin pfam09731
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. ...
1583-1898 8.82e-05

Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains.


Pssm-ID: 430783 [Multi-domain]  Cd Length: 618  Bit Score: 47.83  E-value: 8.82e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1583 QKKLKQMEVQLEE---EYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPS 1659
Cdd:pfam09731  124 QEKEKALEEVLKEaisKAESATAVAKEAKDDAIQAVKAHTDSLKEASDTAEISREKATDSALQKAEALAEKLKEVINLAK 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1660 KREIAQLKNQLEESEFTCAAAVKARKAME--VEMEDLHLQIDDIAKAKTALEEQLSRLQREK--NEIQNRLEEDQEDMNE 1735
Cdd:pfam09731  204 QSEEEAAPPLLDAAPETPPKLPEHLDNVEekVEKAQSLAKLVDQYKELVASERIVFQQELVSifPDIIPVLKEDNLLSND 283
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1736 lmkKHKAAVAQASRDMAQmndLQAQIEEsnKEKQELQEKLQALQSQVEFLEQSmvDKSLVSRQEAKIRELET--RLEFEK 1813
Cdd:pfam09731  284 ---DLNSLIAHAHREIDQ---LSKKLAE--LKKREEKHIERALEKQKEELDKL--AEELSARLEEVRAADEAqlRLEFER 353
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1814 TQVKRLENLASRLKETMEKLTEERDQRaaAENREKEQNKRLQRQ-LRDTKEEMSE----LARKEAEASRKKHELEMDLES 1888
Cdd:pfam09731  354 EREEIRESYEEKLRTELERQAEAHEEH--LKDVLVEQEIELQREfLQDIKEKVEEeragRLLKLNELLANLKGLEKATSS 431
                          330
                   ....*....|.
gi 1907082233 1889 -LEAANQSLQA 1898
Cdd:pfam09731  432 hSEVEDENRKA 442
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
1228-1599 1.09e-04

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 47.64  E-value: 1.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1228 QLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETrisELTSELTDERNTGESASQlldaETAERLRTEKEMKEL 1307
Cdd:COG5185    235 LKGFQDPESELEDLAQTSDKLEKLVEQNTDLRLEKLGENA---ESSKRLNENANNLIKQFE----NTKEKIAEYTKSIDI 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1308 QTQYDALKKQmevmemevmearlIRAAEINGEvddddaggewrlkYERAVREVDFTKKRLQQELEdkmeveqQSRRQLER 1387
Cdd:COG5185    308 KKATESLEEQ-------------LAAAEAEQE-------------LEESKRETETGIQNLTAEIE-------QGQESLTE 354
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1388 RLGDLQADSDESQRALQQlkkkcQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKL-QREKLQREK 1466
Cdd:COG5185    355 NLEAIKEEIENIVGEVEL-----SKSSEELDSFKDTIESTKESLDEIPQNQRGYAQEILATLEDTLKAADrQIEELQRQI 429
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1467 DmllaeafSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAK--VKKQLRDLEAKVKDQEEELDEQAGSiq 1544
Cdd:COG5185    430 E-------QATSSNEEVSKLLNELISELNKVMREADEESQSRLEEAYDEINrsVRSKKEDLNEELTQIESRVSTLKAT-- 500
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082233 1545 mLEQAKLRLEMEMERMRQTHSKEMESRDEEvEEARQSCQKKLKQMEVQLEEEYED 1599
Cdd:COG5185    501 -LEKLRAKLERQLEGVRSKLDQVAESLKDF-MRARGYAHILALENLIPASELIQA 553
Tropomyosin pfam00261
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 ...
1752-1962 1.13e-04

Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites. The protein is best known for its role in regulating the interaction between actin and myosin in muscle contraction, but is also involved in the organization and dynamics of the cytoskeleton in non-muscle cells. There are multiple cell-specific isoforms, expressed by alternative promoters and alternative RNA processing of at least four genes. Muscle isoforms of tropomyosin are characterized by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region.


Pssm-ID: 459736 [Multi-domain]  Cd Length: 235  Bit Score: 46.18  E-value: 1.13e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1752 AQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSmvdkslVSRQEAKIRELETrlEFEKTQvKRLENLASRLKEtME 1831
Cdd:pfam00261    1 KKMQQIKEELDEAEERLKEAMKKLEEAEKRAEKAEAE------VAALNRRIQLLEE--ELERTE-ERLAEALEKLEE-AE 70
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1832 KLTEERDQ-RAAAENREKEQNKR---LQRQLRDTKEEMSELARKEAEASRKKHELEMDLE-------SLEAANQSLQADL 1900
Cdd:pfam00261   71 KAADESERgRKVLENRALKDEEKmeiLEAQLKEAKEIAEEADRKYEEVARKLVVVEGDLEraeeraeLAESKIVELEEEL 150
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1901 KLAFKRIGDLQAAIEDEMES-DENEDLINSLQD----MVTKYQ---KKKNKLEGDSDvdsELEDRVDGVK 1962
Cdd:pfam00261  151 KVVGNNLKSLEASEEKASEReDKYEEQIRFLTEklkeAETRAEfaeRSVQKLEKEVD---RLEDELEAEK 217
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1372-1581 1.15e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 47.13  E-value: 1.15e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1372 EDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEE 1451
Cdd:COG3883     15 DPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARA 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1452 TQREKLQREKLqrekDMLL-AEAFS--------LKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQL 1522
Cdd:COG3883     95 LYRSGGSVSYL----DVLLgSESFSdfldrlsaLSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAK 170
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082233 1523 RDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQS 1581
Cdd:COG3883    171 AELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAA 229
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
1795-1912 1.16e-04

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 47.55  E-value: 1.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1795 VSRQEAKIRELEtrlefekTQVKRLENLASRLKETMEKLteerdqraaaenreKEQNKRLQRQLRDTKEEMSELARKEAE 1874
Cdd:COG2433    408 LTEEEEEIRRLE-------EQVERLEAEVEELEAELEEK--------------DERIERLERELSEARSEERREIRKDRE 466
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 1907082233 1875 ASRKKHE---LEMDLESLEAANQSLQADLKLaFKRIGDLQA 1912
Cdd:COG2433    467 ISRLDREierLERELEEERERIEELKRKLER-LKELWKLEH 506
Golgin_A5 pfam09787
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ...
1706-1891 1.28e-04

Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.


Pssm-ID: 462900 [Multi-domain]  Cd Length: 305  Bit Score: 46.67  E-value: 1.28e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1706 TALEEQLSRLQREKN----EIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQ 1781
Cdd:pfam09787   43 TALTLELEELRQERDllreEIQKLRGQIQQLRTELQELEAQQQEEAESSREQLQELEEQLATERSARREAEAELERLQEE 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1782 VEFLEQSMVdKSLVSRQEaKIRELETRLEFEKTQVK--------------RLENLASRL--KETM-EKLTEERDQRAAAE 1844
Cdd:pfam09787  123 LRYLEEELR-RSKATLQS-RIKDREAEIEKLRNQLTsksqssssqselenRLHQLTETLiqKQTMlEALSTEKNSLVLQL 200
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 1907082233 1845 NREKEQNKRLQ-RQLRDTKEEMSELARKEAEASRKKHELEMDLESLEA 1891
Cdd:pfam09787  201 ERMEQQIKELQgEGSNGTSINMEGISDGEGTRLRNVPGLFSESDSDRA 248
PRK12704 PRK12704
phosphodiesterase; Provisional
1397-1540 1.37e-04

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 47.08  E-value: 1.37e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1397 DESQRALQQLKKKCQrLTA--ELQDTKLHLEGQ-QVRNHELEKKQRRFdselsQAHEETQREKLqrEKLQREKDMLLAEA 1473
Cdd:PRK12704    45 EEAKKEAEAIKKEAL-LEAkeEIHKLRNEFEKElRERRNELQKLEKRL-----LQKEENLDRKL--ELLEKREEELEKKE 116
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1474 FSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQeSKDEAS---LAKVKKQLR-DLEAKVKDQEEELDEQA 1540
Cdd:PRK12704   117 KELEQKQQELEKKEEELEELIEEQLQELERISGL-TAEEAKeilLEKVEEEARhEAAVLIKEIEEEAKEEA 186
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
1226-1595 1.46e-04

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 47.14  E-value: 1.46e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1226 QVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRIseltseLTDERNTGESASQLldaetaerlrtEKEMK 1305
Cdd:PRK04778   113 LLDLIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSL------LANRFSFGPALDEL-----------EKQLE 175
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1306 ELQTQYDalkkqmevmemevmearliRAAEINGEVDdddaggewrlkYERAVREVDFTKKRLQQeLEDKME------VEQ 1379
Cdd:PRK04778   176 NLEEEFS-------------------QFVELTESGD-----------YVEAREILDQLEEELAA-LEQIMEeipellKEL 224
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1380 QSR------------RQLER---RLGDLQADSDesqraLQQLKKKCQRLTAELQDtkLHLEGQQVRNHELEKK------- 1437
Cdd:PRK04778   225 QTElpdqlqelkagyRELVEegyHLDHLDIEKE-----IQDLKEQIDENLALLEE--LDLDEAEEKNEEIQERidqlydi 297
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1438 -QRRFDSElSQAHEETQREKLQREKLQREKDMLLAEAFSLKQ--QMEEKDLDIA-GFTQKVVSLEAELQDISSQESKDEA 1513
Cdd:PRK04778   298 lEREVKAR-KYVEKNSDTLPDFLEHAKEQNKELKEEIDRVKQsyTLNESELESVrQLEKQLESLEKQYDEITERIAEQEI 376
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1514 SLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQ---THSKEMESRD-----EEVEEARQSCQKK 1585
Cdd:PRK04778   377 AYSELQEELEEILKQLEEIEKEQEKLSEMLQGLRKDELEAREKLERYRNklhEIKRYLEKSNlpglpEDYLEMFFEVSDE 456
                          410
                   ....*....|
gi 1907082233 1586 LKQMEVQLEE 1595
Cdd:PRK04778   457 IEALAEELEE 466
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1802-1967 1.63e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 46.98  E-value: 1.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1802 IRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLqRQLRDTKEEMSELARKEAEASRKKHE 1881
Cdd:PRK03918   171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREEL-EKLEKEVKELEELKEEIEELEKELES 249
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1882 LEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEDLINSLQDMVTKYQKKKNKLEGDSdvdSELEDRVDGV 1961
Cdd:PRK03918   250 LEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRL---SRLEEEINGI 326

                   ....*.
gi 1907082233 1962 KSWLSK 1967
Cdd:PRK03918   327 EERIKE 332
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
1703-2075 1.71e-04

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 47.35  E-value: 1.71e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1703 KAKTALEEQLSRLQ--REKNEIQNRLEEDQedmnelmkkhkaavAQASRDMAQMNDLQAQIEESNKEKQELQEKLqaLQS 1780
Cdd:PTZ00108  1031 AKKKDLVKELKKLGyvRFKDIIKKKSEKIT--------------AEEEEGAEEDDEADDEDDEEELGAAVSYDYL--LSM 1094
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1781 QVEFLEQSMVDKsLVSRQEAKIRELEtrlEFEKTQVKR--LENLAsRLKETMEKlTEERDQRAAAENREKEQNKRL-QRQ 1857
Cdd:PTZ00108  1095 PIWSLTKEKVEK-LNAELEKKEKELE---KLKNTTPKDmwLEDLD-KFEEALEE-QEEVEEKEIAKEQRLKSKTKGkASK 1168
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1858 LRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEDLINSLQDMVTKY 1937
Cdd:PTZ00108  1169 LRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKN 1248
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1938 QKKKNKLEGDSDVDSELED-----RVDGVKSWLSKNKGPSKAPSDDGSLKSSRTalNSLPKEGKGPEERPSSVLSSLSYR 2012
Cdd:PTZ00108  1249 NSSKSSEDNDEFSSDDLSKegkpkNAPKRVSAVQYSPPPPSKRPDGESNGGSKP--SSPTKKKVKKRLEGSLAALKKKKK 1326
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082233 2013 KRLTlkdsiggTGDQDSLLTTLSERATSPERPPRKARggPREEPDQaqgrrcEELDDCSSIFS 2075
Cdd:PTZ00108  1327 SEKK-------TARKKKSKTRVKQASASQSSRLLRRP--RKKKSDS------SSEDDDDSEVD 1374
PRK11281 PRK11281
mechanosensitive channel MscK;
1226-1447 1.75e-04

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 47.21  E-value: 1.75e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1226 QVQLSEEQIRNKDEEIQQLRSKLEKVEKERNElRLSSDRLETRISELTSELTDERNT-GESASQLLDAETAERlRTEKEM 1304
Cdd:PRK11281    88 QLAQAPAKLRQAQAELEALKDDNDEETRETLS-TLSLRQLESRLAQTLDQLQNAQNDlAEYNSQLVSLQTQPE-RAQAAL 165
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1305 KELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAggewRLKYERAVREVDFTKKRLQQELEDKMEVEQQsrrQ 1384
Cdd:PRK11281   166 YANSQRLQQIRNLLKGGKVGGKALRPSQRVLLQAEQALLNA----QNDLQRKSLEGNTQLQDLLQKQRDYLTARIQ---R 238
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082233 1385 LERRLGDLQADSdeSQRALQQLKKKCQRltAELQDtklhlEGQQVRNHELEKKQRRFDSELSQ 1447
Cdd:PRK11281   239 LEHQLQLLQEAI--NSKRLTLSEKTVQE--AQSQD-----EAARIQANPLVAQELEINLQLSQ 292
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1238-1417 1.90e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 45.30  E-value: 1.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1238 DEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQT--QYDALK 1315
Cdd:COG1579     16 DSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNnkEYEALQ 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1316 KQmevmemEVMEARLIRAAeingevddddaggewrlkyERAVREVDFTKKRLQQELEDKMEVEQQSRRQLERRLGDLQAD 1395
Cdd:COG1579     96 KE------IESLKRRISDL-------------------EDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEE 150
                          170       180
                   ....*....|....*....|..
gi 1907082233 1396 SDESQRALQQLKKKCQRLTAEL 1417
Cdd:COG1579    151 LAELEAELEELEAEREELAAKI 172
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1225-1535 1.95e-04

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 46.64  E-value: 1.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1225 IQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEM 1304
Cdd:pfam05483  457 IQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQEERMLKQI 536
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1305 KELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQELEDK---MEVEQQS 1381
Cdd:pfam05483  537 ENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKKQIENKnknIEELHQE 616
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1382 RRQLERRlgdlqadSDESQRALQQLKKKCQRLTAELQDTKLHLeGQQVRNHELEKKQRRFDSElsQAHEETQREKLQRE- 1460
Cdd:pfam05483  617 NKALKKK-------GSAENKQLNAYEIKVNKLELELASAKQKF-EEIIDNYQKEIEDKKISEE--KLLEEVEKAKAIADe 686
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1461 --KLQREKDML----LAEAFSLKQQ--------MEEKDLDIAGFTQK-------VVSLEAELQDIssqeskdEASLAKVK 1519
Cdd:pfam05483  687 avKLQKEIDKRcqhkIAEMVALMEKhkhqydkiIEERDSELGLYKNKeqeqssaKAALEIELSNI-------KAELLSLK 759
                          330
                   ....*....|....*....
gi 1907082233 1520 KQL---RDLEAKVKDQEEE 1535
Cdd:pfam05483  760 KQLeieKEEKEKLKMEAKE 778
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
16-304 2.08e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.09  E-value: 2.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   16 EEDKGVPKPGPPGKEE--GAPEASSKDGSAPPRSPQPAT-SPVPSETSQT---AKSPEPTLTMNGLGAASAEGPNEEAQ- 88
Cdd:PHA03307   136 EMLRPVGSPGPPPAASppAAGASPAAVASDAASSRQAALpLSSPEETARApssPPAEPPPSTPPAAASPRPPRRSSPISa 215
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   89 -------GLSRKRVANAVRKVVSKVLP-------GEEPGTAKEPPGRGaRSPEHPARGKKGEKAATSPKPPPPPPPPPAP 154
Cdd:PHA03307   216 sasspapAPGRSAADDAGASSSDSSSSessgcgwGPENECPLPRPAPI-TLPTRIWEASGWNGPSSRPGPASSSSSPRER 294
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  155 PKPETKKEAAKDELSVGLRSLMSRGR----GKDHKSRGKQPPGKGEKAPSQEPASTGKSGSPdmvDSPKKAGSPAKPETP 230
Cdd:PHA03307   295 SPSPSPSSPGSGPAPSSPRASSSSSSsresSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRP---PPPADPSSPRKRPRP 371
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  231 NkRCSPAPAQELADPNL--AGPKLNPSGKQQAKSPASdvqqeTGADPEVRLSPAEEAHQRL---ERIFTASLD--PEAAS 303
Cdd:PHA03307   372 S-RAPSSPAASAGRPTRrrARAAVAGRARRRDATGRF-----PAGRPRPSPLDAGAASGAFyarYPLLTPSGEpwPGSPP 445

                   .
gi 1907082233  304 P 304
Cdd:PHA03307   446 P 446
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
10-311 2.15e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 46.61  E-value: 2.15e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   10 WEQKVKEEDKGVPKPGPPGKEEGapeaSSKDGSaPPRSPQPATSP--VPSETSQTAKSPEPTLTMNglgaASAEG--PNE 85
Cdd:PTZ00449   481 FTQEIKKLIKKSKKKLAPIEEED----SDKHDE-PPEGPEASGLPpkAPGDKEGEEGEHEDSKESD----EPKEGgkPGE 551
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   86 EAQGLSRKRVANAVRKVVSKVlpgeePGTAKEPPG-RGARSPEHPARGKKGEKAATSPKpppppppppappkpETKKEAA 164
Cdd:PTZ00449   552 TKEGEVGKKPGPAKEHKPSKI-----PTLSKKPEFpKDPKHPKDPEEPKKPKRPRSAQR--------------PTRPKSP 612
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  165 KDELSVGLRSlmSRGRGKDHKSRgKQPPgkgekaPSQEPAStgksgsPDMVDSPKKAGSPAKPETPNKRCSPAPAQELAD 244
Cdd:PTZ00449   613 KLPELLDIPK--SPKRPESPKSP-KRPP------PPQRPSS------PERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYD 677
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082233  245 PNLAGP------KLNPSGKQQAKSPASDVQQETGADPEVRLSPAEEAHQRLERI-FTASLDPEAASPAYSQAKT 311
Cdd:PTZ00449   678 DYLDAAaksketKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFpFEPIGDPDAEQPDDIEFFT 751
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
20-321 2.26e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.52  E-value: 2.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   20 GVPKPGPPGKEEGAPEASSKDGSAPPRSPQ---PATSPVPSETSQTAKSPEPTLTMNGLGAASAEGPNEEAQGLSRKRVA 96
Cdd:PRK07764   392 GAPAAAAPSAAAAAPAAAPAPAAAAPAAAAapaPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPA 471
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   97 NAVRKVVSkvlPGEEPGTAKEPPGRgARSPEHPARGKKGEKAATSPKP----PPPPPPPPAPPKPETKKEAA-----KDE 167
Cdd:PRK07764   472 AAPEPTAA---PAPAPPAAPAPAAA-PAAPAAPAAPAGADDAATLRERwpeiLAAVPKRSRKTWAILLPEATvlgvrGDT 547
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  168 LSVG-----LRSLMSRGR-------------GKDHK--------------SRGKQPPGKGEKAPSQEPASTGKSGSPDMV 215
Cdd:PRK07764   548 LVLGfstggLARRFASPGnaevlvtalaeelGGDWQveavvgpapgaaggEGPPAPASSGPPEEAARPAAPAAPAAPAAP 627
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  216 DSPKKAGSPAKPETPNKRCSPAPAQELADPNLAGPKLNPSGKQQAKSPASDVQQETGADPEVRLSPAEEAhqrlERIFTA 295
Cdd:PRK07764   628 APAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAA----PAQPAP 703
                          330       340
                   ....*....|....*....|....*.
gi 1907082233  296 SLDPEAASPAYSQAKTEEQIAAEEAW 321
Cdd:PRK07764   704 APAATPPAGQADDPAAQPPQAAQGAS 729
PRK11637 PRK11637
AmiB activator; Provisional
1660-1870 2.29e-04

AmiB activator; Provisional


Pssm-ID: 236942 [Multi-domain]  Cd Length: 428  Bit Score: 46.22  E-value: 2.29e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1660 KREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIddiakakTALEEQLSRLQREKNEiQNRLEEDQEDMNELMKK 1739
Cdd:PRK11637    67 QQQRASLLAQLKKQEEAISQASRKLRETQNTLNQLNKQI-------DELNASIAKLEQQQAA-QERLLAAQLDAAFRQGE 138
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1740 HKAAVAQASRDMAQ-------------------MNDLQAQIEESNKEKQELQEKlQALQSQVEFLEQSMVDKSLVSRQEA 1800
Cdd:PRK11637   139 HTGLQLILSGEESQrgerilayfgylnqarqetIAELKQTREELAAQKAELEEK-QSQQKTLLYEQQAQQQKLEQARNER 217
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082233 1801 K--IRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRD-------TKEEMSELAR 1870
Cdd:PRK11637   218 KktLTGLESSLQKDQQQLSELRANESRLRDSIARAEREAKARAEREAREAARVRDKQKQAKRkgstykpTESERSLMSR 296
PRK12705 PRK12705
hypothetical protein; Provisional
1519-1699 2.42e-04

hypothetical protein; Provisional


Pssm-ID: 237178 [Multi-domain]  Cd Length: 508  Bit Score: 46.24  E-value: 2.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1519 KKQLRDLEAKVKDQEEELDEQAgsiqMLEQAKLRLEMEMERMRqtHSKEMESRDEEVEEARQscQKKLKQMEVQLEEEYE 1598
Cdd:PRK12705    27 KRQRLAKEAERILQEAQKEAEE----KLEAALLEAKELLLRER--NQQRQEARREREELQRE--EERLVQKEEQLDARAE 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1599 DKQKALREKRELESKLSTLSDQVNQRdfesEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEEsEFTCA 1678
Cdd:PRK12705    99 KLDNLENQLEEREKALSARELELEEL----EKQLDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQRVKKIEE-EADLE 173
                          170       180
                   ....*....|....*....|.
gi 1907082233 1679 AAVKARKAMEVEMEDLHLQID 1699
Cdd:PRK12705   174 AERKAQNILAQAMQRIASETA 194
DUF4670 pfam15709
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ...
1716-1895 2.66e-04

Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.


Pssm-ID: 464815 [Multi-domain]  Cd Length: 522  Bit Score: 46.10  E-value: 2.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1716 QREKNEIQNRLEEDQEDMNELMKKHkaaVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVE----FLEQSMVD 1791
Cdd:pfam15709  353 KRREQEEQRRLQQEQLERAEKMREE---LELEQQRRFEEIRLRKQRLEEERQRQEEEERKQRLQLQAAqeraRQQQEEFR 429
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1792 KSLVSRQEAKIRELETRLEFEKTQVKRLE-NLASRLKETMEKLTEERDQraaaenrekeqnkrLQRQlrdtKEEMSELAR 1870
Cdd:pfam15709  430 RKLQELQRKKQQEEAERAEAEKQRQKELEmQLAEEQKRLMEMAEEERLE--------------YQRQ----KQEAEEKAR 491
                          170       180
                   ....*....|....*....|....*
gi 1907082233 1871 KEAEASRKKHELEMDLESLEAANQS 1895
Cdd:pfam15709  492 LEAEERRQKEEEAARLALEEAMKQA 516
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
1607-1781 3.24e-04

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 45.39  E-value: 3.24e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  1607 KRELESKLSTLSdqvnqrdfESEKRLRKDLKRTKALLADAQIMLDHLKnnapskREIAQLKNQleeseftcaaavkarka 1686
Cdd:smart00787  146 KEGLDENLEGLK--------EDYKLLMKELELLNSIKPKLRDRKDALE------EELRQLKQL----------------- 194
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  1687 mEVEMEDLHLQIDDIAKAKtaleeqLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEE--- 1763
Cdd:smart00787  195 -EDELEDCDPTELDRAKEK------LKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQcrg 267
                           170
                    ....*....|....*....
gi 1907082233  1764 -SNKEKQELQEKLQALQSQ 1781
Cdd:smart00787  268 fTFKEIEKLKEQLKLLQSL 286
CCDC73 pfam15818
Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil ...
1703-1954 3.29e-04

Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil containing proteins. The function is not known. The alternative name is sarcoma antigen NY-SAR-79.


Pssm-ID: 464893 [Multi-domain]  Cd Length: 1048  Bit Score: 46.09  E-value: 3.29e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1703 KAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVA----QASRDMAQMND------LQAQIEEsnKEKQELQ 1772
Cdd:pfam15818   21 EAETQYEEQIGKIIVETQELKWQKETLQNQKETLAKQHKEAMAvfkkQLQMKMCALEEekgkyqLATEIKE--KEIEGLK 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1773 EKLQALQSQVEFLEQSmvdkslVSRQEAKIRELETRLEFEKTQVKRLEN----LASR---LKETMEKLtEERDQRAAAEN 1845
Cdd:pfam15818   99 ETLKALQVSKYSLQKK------VSEMEQKLQLHLLAKEDHHKQLNEIEKyyatITGQfglVKENHGKL-EQNVQEAIQLN 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1846 -REKEQNKRLQRQLRDTKEEM----SELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFkrigDLQAAIEDEME- 1919
Cdd:pfam15818  172 kRLSALNKKQESEICSLKKELkkvtSDLIKSKVTCQYKMGEENINLTIKEQKFQELQERLNMEL----ELNKKINEEITh 247
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1907082233 1920 -SDENEDLINSLQDMVTKYQKkknKLEGDSDVDSEL 1954
Cdd:pfam15818  248 iQEEKQDIIISFQHMQQLLQQ---QTQANTEMEAEL 280
PRK12704 PRK12704
phosphodiesterase; Provisional
1667-1841 3.75e-04

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 45.54  E-value: 3.75e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1667 KNQLEESEFTCAAAVK-ARKAMEVEMEDLHLQI-DDIAKAKTALEEQLsrlqREKNEIQNRLEEDQEDMNELMKKHKAAV 1744
Cdd:PRK12704    30 EAKIKEAEEEAKRILEeAKKEAEAIKKEALLEAkEEIHKLRNEFEKEL----RERRNELQKLEKRLLQKEENLDRKLELL 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1745 AQASRDMAQMNDLQAQIEESNKEKQELQEKLQalQSQVEFLEQSMvdkSLvSRQEAK---IRELETRLEFEKtqvkrlen 1821
Cdd:PRK12704   106 EKREEELEKKEKELEQKQQELEKKEEELEELI--EEQLQELERIS---GL-TAEEAKeilLEKVEEEARHEA-------- 171
                          170       180
                   ....*....|....*....|
gi 1907082233 1822 lASRLKETMEKLTEERDQRA 1841
Cdd:PRK12704   172 -AVLIKEIEEEAKEEADKKA 190
secA PRK12903
preprotein translocase subunit SecA; Reviewed
1749-1925 4.11e-04

preprotein translocase subunit SecA; Reviewed


Pssm-ID: 237258 [Multi-domain]  Cd Length: 925  Bit Score: 45.82  E-value: 4.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1749 RDMAQMNDLQAQIEESNKEKQELQEKLqALQSQVEFLEQSMVDKSLvsrqeaKIRELETRLEFEKTQVKRLENLASRL-K 1827
Cdd:PRK12903   741 VQYSQKNPYQVYTEEGTKKFNILLQEI-AYDVIVSLFNNPNAEKIL------IITEILSDGINNSDINDRPQELIDQIiE 813
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1828 ETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFK-R 1906
Cdd:PRK12903   814 SEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKdV 893
                          170
                   ....*....|....*....
gi 1907082233 1907 IGDLQAAIEDEMESDENED 1925
Cdd:PRK12903   894 LIKLVISSDEIKQDEKTTK 912
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1418-1939 4.33e-04

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 45.98  E-value: 4.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1418 QDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQreKLQREKLQREK-DMLLAEAFSLKQQMEEKDLDIagFTQKVVS 1496
Cdd:PTZ00440   617 EKNDLQEKVKYILNKFYKGDLQELLDELSHFLDDHK--YLYHEAKSKEDlQTLLNTSKNEYEKLEFMKSDN--IDNIIKN 692
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1497 LEAELQDISSQesKDEAslakVKKQLRDLEAKVKDQEEELDEQAGSIQM-LEQAKlrlemEMERMRQTHSKEMESRDEEV 1575
Cdd:PTZ00440   693 LKKELQNLLSL--KENI----IKKQLNNIEQDISNSLNQYTIKYNDLKSsIEEYK-----EEEEKLEVYKHQIINRKNEF 761
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1576 EEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQV--NQRDFESEKRLRKDLK--------RTKALL-- 1643
Cdd:PTZ00440   762 ILHLYENDKDLPDGKNTYEEFLQYKDTILNKENKISNDINILKENKknNQDLLNSYNILIQKLEahtekndeELKQLLqk 841
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1644 ---ADAQIMLDHLKNNAPSKREIAQ-LKNQLEESEFTCAAAVKARKAMEVEMEDLHLqIDDIAKAKTALEEQLS------ 1713
Cdd:PTZ00440   842 fptEDENLNLKELEKEFNENNQIVDnIIKDIENMNKNINIIKTLNIAINRSNSNKQL-VEHLLNNKIDLKNKLEqhmkii 920
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1714 ----------------RLQREKNEIQNRLeeDQEDMNEL-MKKHKA-AVAQASRDMAQMND--LQAQIEESNKEKQELQE 1773
Cdd:PTZ00440   921 ntdniiqkneklnllnNLNKEKEKIEKQL--SDTKINNLkMQIEKTlEYYDKSKENINGNDgtHLEKLDKEKDEWEHFKS 998
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1774 KLQALQSQVEFLEQSMVDksLVSRQEAKIRELETRLEFEKTQ-----VKRLENLASRLKETMEKLTeerdqraAAENREK 1848
Cdd:PTZ00440   999 EIDKLNVNYNILNKKIDD--LIKKQHDDIIELIDKLIKEKGKeieekVDQYISLLEKMKTKLSSFH-------FNIDIKK 1069
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1849 EQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAF----KRIGDLQAAIE---DEMESD 1921
Cdd:PTZ00440  1070 YKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHEHVVNADKEKNKQTEHYnkkkKSLEKIYKQMEktlKELENM 1149
                          570
                   ....*....|....*....
gi 1907082233 1922 ENEDL-INSLQDMVTKYQK 1939
Cdd:PTZ00440  1150 NLEDItLNEVNEIEIEYER 1168
PspA COG1842
Phage shock protein A [Transcription, Signal transduction mechanisms];
1679-1870 4.49e-04

Phage shock protein A [Transcription, Signal transduction mechanisms];


Pssm-ID: 441447 [Multi-domain]  Cd Length: 217  Bit Score: 44.05  E-value: 4.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1679 AAVKARKAMEVEMEDLHlqiDDIAKAKTALEEQLSRLQRekneIQNRLEEDQEDMNELMKKHKAAVAQASRDMA------ 1752
Cdd:COG1842     20 KAEDPEKMLDQAIRDME---EDLVEARQALAQVIANQKR----LERQLEELEAEAEKWEEKARLALEKGREDLArealer 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1753 ------QMNDLQAQIEESNKEKQELQEKLQALQSQVEfleqsmvdkslvsRQEAKIRELETRLEFEKTQVKRLENLAS-- 1824
Cdd:COG1842     93 kaeleaQAEALEAQLAQLEEQVEKLKEALRQLESKLE-------------ELKAKKDTLKARAKAAKAQEKVNEALSGid 159
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1825 -----RLKETMEKLTEERDQRAAAEnREKEQNKRLQRQLRDTKEEMS---ELAR 1870
Cdd:COG1842    160 sddatSALERMEEKIEEMEARAEAA-AELAAGDSLDDELAELEADSEvedELAA 212
GAS pfam13851
Growth-arrest specific micro-tubule binding; This family is the highly conserved central ...
1455-1606 4.84e-04

Growth-arrest specific micro-tubule binding; This family is the highly conserved central region of a number of metazoan proteins referred to as growth-arrest proteins. In mouse, Gas8 is predominantly a testicular protein, whose expression is developmentally regulated during puberty and spermatogenesis. In humans, it is absent in infertile males who lack the ability to generate gametes. The localization of Gas8 in the motility apparatus of post-meiotic gametocytes and mature spermatozoa, together with the detection of Gas8 also in cilia at the apical surfaces of epithelial cells lining the pulmonary bronchi and Fallopian tubes suggests that the Gas8 protein may have a role in the functioning of motile cellular appendages. Gas8 is a microtubule-binding protein localized to regions of dynein regulation in mammalian cells.


Pssm-ID: 464001 [Multi-domain]  Cd Length: 200  Bit Score: 43.74  E-value: 4.84e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1455 EKLQREKLQREKDMLLAEAFSLKQQMEEKdldIAGFTQKVVSLEAELQdissQESKDEASLAKVKKQLRDLEAKVKDQEE 1534
Cdd:pfam13851   34 EIAELKKKEERNEKLMSEIQQENKRLTEP---LQKAQEEVEELRKQLE----NYEKDKQSLKNLKARLKVLEKELKDLKW 106
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082233 1535 ELDeqagsiqMLEQAKLRLEMEMErmrqthskEMESRDEE-VEEARQSCQKK---LKQMEVQLEEEYEDKQKALRE 1606
Cdd:pfam13851  107 EHE-------VLEQRFEKVERERD--------ELYDKFEAaIQDVQQKTGLKnllLEKKLQALGETLEKKEAQLNE 167
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
25-295 5.49e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 45.36  E-value: 5.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   25 GPPGKEEGAPEASSKDGSAPPRSPQPATSPVPSETSQTAKSPEPTLTMNGLGAASAEGPNEEAQGLSRKRVANAVRKVVS 104
Cdd:PRK07764   596 GGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGG 675
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  105 KVLPGEEPGTAKEPPGRGARSPEHPARgkkgekaatspkpppppppppappkpetkkeaakdelsvglrslmSRGRGKDH 184
Cdd:PRK07764   676 AAPAAPPPAPAPAAPAAPAGAAPAQPA---------------------------------------------PAPAATPP 710
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  185 KSRGKQPPgkGEKAPSQEPASTGKSGSPDMVDSPKKAGSPAKPETPNKrcSPAPAQELADPNLAGPKLNPSgkQQAKSPA 264
Cdd:PRK07764   711 AGQADDPA--AQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPA--QPPPPPAPAPAAAPAAAPPPS--PPSEEEE 784
                          250       260       270
                   ....*....|....*....|....*....|.
gi 1907082233  265 SDVQQETGADPEVRLSPAEEAHQRLERIFTA 295
Cdd:PRK07764   785 MAEDDAPSMDDEDRRDAEEVAMELLEEELGA 815
Uso1_p115_C pfam04871
Uso1 / p115 like vesicle tethering protein, C terminal region; Also known as General vesicular ...
1800-1925 5.96e-04

Uso1 / p115 like vesicle tethering protein, C terminal region; Also known as General vesicular transport factor, Transcytosis associate protein (TAP) and Vesicle docking protein, this myosin-shaped molecule consists of an N-terminal globular head region, a coiled-coil tail which mediates dimerization, and a short C-terminal acidic region. p115 tethers COP1 vesicles to the Golgi by binding the coiled coil proteins giantin (on the vesicles) and GM130 (on the Golgi), via its C-terminal acidic region. It is required for intercisternal transport in the golgi stack. This family consists of the acidic C-terminus, which binds to the golgins giantin and GM130. p115 is thought to juxtapose two membranes by binding giantin with one acidic region, and GM130 with another.


Pssm-ID: 461461 [Multi-domain]  Cd Length: 121  Bit Score: 42.00  E-value: 5.96e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1800 AKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAEnrekEQNKRLQRQLRDTKEEMSELARKEAEASRKK 1879
Cdd:pfam04871    1 AKKSELESEASSLKNENTELKAELQELSKQYNSLEQKESQAKELE----AEVKKLEEALKKLKAELSEEKQKEKEKQSEL 76
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082233 1880 HELEMDLESLEAANQSLQADLKlafKRIGDLQAAIEDEMESDENED 1925
Cdd:pfam04871   77 DDLLLLLGDLEEKVEKYKARLK---ELGEEVLSDDEDDDEDDEEDD 119
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1382-1599 6.20e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 45.01  E-value: 6.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1382 RRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTK-----LHLEGQQvrnHELEKKQRRFDSELSQAHEETQREK 1456
Cdd:COG3206    163 EQNLELRREEARKALEFLEEQLPELRKELEEAEAALEEFRqknglVDLSEEA---KLLLQQLSELESQLAEARAELAEAE 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1457 LQREKLQREKDMLLAEAFSLKQqmeekDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQ---- 1532
Cdd:COG3206    240 ARLAALRAQLGSGPDALPELLQ-----SPVIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQQEaqri 314
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1533 ----EEELDEQAGSIQMLEQAKLRLEMEMERMRQThSKEMESRDEEVEEARQ---SCQKKLKQMEVQLEEEYED 1599
Cdd:COG3206    315 laslEAELEALQAREASLQAQLAQLEARLAELPEL-EAELRRLEREVEVARElyeSLLQRLEEARLAEALTVGN 387
MscS_porin pfam12795
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ...
1622-1893 6.39e-04

Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin is towards the N-terminal of the molecules. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.


Pssm-ID: 432790 [Multi-domain]  Cd Length: 238  Bit Score: 43.83  E-value: 6.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1622 NQRDFESEKRLRKDLKRTKALLADAQImldhlknnapSKREIAQLKNQLEEseftcaAAVKARKAMEvemEDLHLQIDDI 1701
Cdd:pfam12795    8 AKLDEAAKKKLLQDLQQALSLLDKIDA----------SKQRAAAYQKALDD------APAELRELRQ---ELAALQAKAE 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1702 AKAKTALEEQ-LSRLQREKNEIQNRLEEDQEDMNELMKkhkaavaqasrdmaQMNDLQAQIEESNKEKQELQEKLQALQS 1780
Cdd:pfam12795   69 AAPKEILASLsLEELEQRLLQTSAQLQELQNQLAQLNS--------------QLIELQTRPERAQQQLSEARQRLQQIRN 134
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1781 QvefLEQSMVDKSLVSrqEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRaaaenreKEQNKRLQRQLRD 1860
Cdd:pfam12795  135 R---LNGPAPPGEPLS--EAQRWALQAELAALKAQIDMLEQELLSNNNRQDLLKARRDLL-------TLRIQRLEQQLQA 202
                          250       260       270
                   ....*....|....*....|....*....|...
gi 1907082233 1861 TKEEMSELARKEAEASRKkhELEMDLESLEAAN 1893
Cdd:pfam12795  203 LQELLNEKRLQEAEQAVA--QTEQLAEEAAGDH 233
GOLGA2L5 pfam15070
Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein ...
1708-1897 6.42e-04

Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein family remains unknown. This family of proteins is thought to be found in the Golgi apparatus of eukaryotes.


Pssm-ID: 464485 [Multi-domain]  Cd Length: 521  Bit Score: 45.05  E-value: 6.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1708 LEEQLSRLQREKNEIQNRLEEDQEDMNELmkKHKAAVAQASRDM------AQMNDLQAQIEESNKEKQELQEKLQAlqsQ 1781
Cdd:pfam15070   34 LSEQVRTLREEKERSVSQVQELETSLAEL--KNQAAVPPAEEEQppagpsEEEQRLQEEAEQLQKELEALAGQLQA---Q 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1782 VEFLEQSmvdKSLVSRQEAKIRELETRLEFEKTQVKRLEnlasRLKETME--KLTeerDQRAAAENRE-KEQNKRLQRQ- 1857
Cdd:pfam15070  109 VQDNEQL---SRLNQEQEQRLLELERAAERWGEQAEDRK----QILEDMQsdRAT---ISRALSQNRElKEQLAELQNGf 178
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082233 1858 LRDTKEEM-------------SELARKEAEASRKKHEL--EMDLESLEAanQSLQ 1897
Cdd:pfam15070  179 VKLTNENMeltsalqseqhvkKELAKKLGQLQEELGELkeTLELKSQEA--QSLQ 231
CENP-F_leu_zip pfam10473
Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; Cenp-F, a centromeric kinetochore, ...
1703-1828 6.45e-04

Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway. There are several leucine-rich repeats along the sequence of LEK1 that are considered to be zippers, though they do not appear to be binding DNA directly in this instance.


Pssm-ID: 463102 [Multi-domain]  Cd Length: 140  Bit Score: 42.28  E-value: 6.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1703 KAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQV 1782
Cdd:pfam10473    3 KKQLHVLEKLKESERKADSLKDKVENLERELEMSEENQELAILEAENSKAEVETLKAEIEEMAQNLRDLELDLVTLRSEK 82
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082233 1783 EFLEQSMVDKslvsrqEAKIRELETRLEFEKTQVKRLENLASRLKE 1828
Cdd:pfam10473   83 ENLTKELQKK------QERVSELESLNSSLENLLEEKEQEKVQMKE 122
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
1733-1949 6.49e-04

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 44.28  E-value: 6.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1733 MNELMKKHKAAVAQASRDMAqMNDLQAQIEESNKEKQELQEKLQALQSQV-EFLEQSMVDKSlvsrqeaKIRELETRLEf 1811
Cdd:cd22656     96 ILELIDDLADATDDEELEEA-KKTIKALLDDLLKEAKKYQDKAAKVVDKLtDFENQTEKDQT-------ALETLEKALK- 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1812 ektqvkrlenlasrlketmEKLTEErdqraaAENREKEQNKRLQRQLrdtKEEMSELARKEAEASRKKHELEMDLESLEA 1891
Cdd:cd22656    167 -------------------DLLTDE------GGAIARKEIKDLQKEL---EKLNEEYAAKLKAKIDELKALIADDEAKLA 218
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082233 1892 ANQSLQADLKLAFKRIGDLQAAIEDEMESDENedLINSLQDMVTKYQKKKNKLEGDSD 1949
Cdd:cd22656    219 AALRLIADLTAADTDLDNLLALIGPAIPALEK--LQGAWQAIATDLDSLKDLLEDDIS 274
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1238-1418 7.16e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 45.01  E-value: 7.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1238 DEEIQQLRSKLEKVEKERNELRLSSDrletriseltseLTDERNTGESASQLLDAETAERLRTEKEMKELQTQYDALKKQ 1317
Cdd:COG3206    181 EEQLPELRKELEEAEAALEEFRQKNG------------LVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQ 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1318 MEVMEMEVMEAR--------LIRAAEINGEVDdddaggEWRLKY-------ERAVREVDFTKKRLQQELEDKMEVEQQSR 1382
Cdd:COG3206    249 LGSGPDALPELLqspviqqlRAQLAELEAELA------ELSARYtpnhpdvIALRAQIAALRAQLQQEAQRILASLEAEL 322
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1907082233 1383 RQLERRLGDLQADSDESQRALQQLKKKCQRLtAELQ 1418
Cdd:COG3206    323 EALQAREASLQAQLAQLEARLAELPELEAEL-RRLE 357
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
1237-1542 7.20e-04

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 44.95  E-value: 7.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1237 KDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAE---TAERLRTEKEMKELQTQYDA 1313
Cdd:COG5185    266 RLEKLGENAESSKRLNENANNLIKQFENTKEKIAEYTKSIDIKKATESLEEQLAAAEaeqELEESKRETETGIQNLTAEI 345
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1314 LKKQMEVMEmevmearliRAAEINGEVDDDDAGGEWRLKYER---AVREVDFTKKRLQQELEDKMEVEQQSRRQLERRLG 1390
Cdd:COG5185    346 EQGQESLTE---------NLEAIKEEIENIVGEVELSKSSEEldsFKDTIESTKESLDEIPQNQRGYAQEILATLEDTLK 416
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1391 DLQADSDESQRALQQLKKKcqrlTAELQDTKLHLEgqqvrnHELEKKQRRFDSELSQAHEETQREKLQR-----EKLQRE 1465
Cdd:COG5185    417 AADRQIEELQRQIEQATSS----NEEVSKLLNELI------SELNKVMREADEESQSRLEEAYDEINRSvrskkEDLNEE 486
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1466 KDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLE---AELQDISSQESKDEASLAKVKKQlRDLEAKvKDQEEELDEQAGS 1542
Cdd:COG5185    487 LTQIESRVSTLKATLEKLRAKLERQLEGVRSKLdqvAESLKDFMRARGYAHILALENLI-PASELI-QASNAKTDGQAAN 564
RecN COG0497
DNA repair ATPase RecN [Replication, recombination and repair];
1380-1623 7.36e-04

DNA repair ATPase RecN [Replication, recombination and repair];


Pssm-ID: 440263 [Multi-domain]  Cd Length: 555  Bit Score: 44.68  E-value: 7.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1380 QSRRQLERRLGDLQADSDESQRALQQLKKKCQRLT-AELQdtklhlEGQQVrnhELEKKQRRfdseLSQAheetqrEKLq 1458
Cdd:COG0497    165 RAWRALKKELEELRADEAERARELDLLRFQLEELEaAALQ------PGEEE---ELEEERRR----LSNA------EKL- 224
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1459 REKLQREKDML----------LAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESK-------DEASLAKVKK- 1520
Cdd:COG0497    225 REALQEALEALsggeggaldlLGQALRALERLAEYDPSLAELAERLESALIELEEAASELRRyldslefDPERLEEVEEr 304
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1521 --QLRDLEAKVKDQEEELdeqagsIQMLEQAKLRLEmemermrqthskEMESRDEEVEEarqscqkkLKQMEVQLEEEYE 1598
Cdd:COG0497    305 laLLRRLARKYGVTVEEL------LAYAEELRAELA------------ELENSDERLEE--------LEAELAEAEAELL 358
                          250       260
                   ....*....|....*....|....*
gi 1907082233 1599 DKQKALREKRELESKlsTLSDQVNQ 1623
Cdd:COG0497    359 EAAEKLSAARKKAAK--KLEKAVTA 381
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
1372-1776 7.55e-04

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 45.00  E-value: 7.55e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1372 EDKMEVEQQSRRQLERRLGDLQADSDESQ--------RALQQLKKKCQRLTAELQDTKlhlegqqvrNHELEKKQRrfdS 1443
Cdd:NF033838    54 ESQKEHAKEVESHLEKILSEIQKSLDKRKhtqnvalnKKLSDIKTEYLYELNVLKEKS---------EAELTSKTK---K 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1444 ELSQAHEETQREKLQREKLQREKDMLLAEA-FSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESkdEASLAKVKKQL 1522
Cdd:NF033838   122 ELDAAFEQFKKDTLEPGKKVAEATKKVEEAeKKAKDQKEEDRRNYPTNTYKTLELEIAESDVEVKKA--ELELVKEEAKE 199
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1523 RDLEAKVKDQEEELDEQAGSIQMLEQAKlrlememermrqthskemESRDEEVEEARQSCQKKLKQ-MEVQLEEEYEDKQ 1601
Cdd:NF033838   200 PRDEEKIKQAKAKVESKKAEATRLEKIK------------------TDREKAEEEAKRRADAKLKEaVEKNVATSEQDKP 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1602 KAlREKRELESKLSTlsdqvnqrdfesekrlrKDLKRTKALLADAQIMLDHLKNnaPSKREiaqlKNQLEESEFTCAAAV 1681
Cdd:NF033838   262 KR-RAKRGVLGEPAT-----------------PDKKENDAKSSDSSVGEETLPS--PSLKP----EKKVAEAEKKVEEAK 317
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1682 KARKAMEVE---------MEDLHLQiddIAKAKTALEE-QLSRLQREKNEIQNrleedQEDMNELMKKHKAAVAQASRdM 1751
Cdd:NF033838   318 KKAKDQKEEdrrnyptntYKTLELE---IAESDVKVKEaELELVKEEAKEPRN-----EEKIKQAKAKVESKKAEATR-L 388
                          410       420
                   ....*....|....*....|....*
gi 1907082233 1752 AQMNDLQAQIEESNKEKQELQEKLQ 1776
Cdd:NF033838   389 EKIKTDRKKAEEEAKRKAAEEDKVK 413
HOOK pfam05622
HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from ...
1290-1556 8.01e-04

HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organizms. The different members of the human gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three human HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas this central coiled-coil motif mediates homodimerization and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes, whereas both HOOK1 and HOOK2 are localized to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head. This entry includes the central coiled-coiled domain and the divergent C-terminal domain.


Pssm-ID: 461694 [Multi-domain]  Cd Length: 528  Bit Score: 44.68  E-value: 8.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1290 LDAETAERLRTEKEMKELQTQYDALKKQMEVMEMEVMEARliraaEINGEV-----------DDDDAGGEWRLKYERAVR 1358
Cdd:pfam05622  199 LSEESKKADKLEFEYKKLEEKLEALQKEKERLIIERDTLR-----ETNEELrcaqlqqaelsQADALLSPSSDPGDNLAA 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1359 EV---DFTKKRLQQELEDKMEVEQQSrRQLERRLGDLQADSDESQRALQQL----KKKCQR---LTAELQDTKLHLEGQQ 1428
Cdd:pfam05622  274 EImpaEIREKLIRLQHENKMLRLGQE-GSYRERLTELQQLLEDANRRKNELetqnRLANQRileLQQQVEELQKALQEQG 352
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1429 VRNHELEKKQRRFDS---ELSQAHEETQREKLQREKLQREKDMLLAE-AFSLKQQMEEKDLDIAGFTQK----------- 1493
Cdd:pfam05622  353 SKAEDSSLLKQKLEEhleKLHEAQSELQKKKEQIEELEPKQDSNLAQkIDELQEALRKKDEDMKAMEERykkyvekaksv 432
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082233 1494 VVSLEAELQDISSQEskdeasLAKVKKQLRDLEAKVKDQEEEldeqagsiqmLEQAKLRLEME 1556
Cdd:pfam05622  433 IKTLDPKQNPASPPE------IQALKNQLLEKDKKIEHLERD----------FEKSKLQREQE 479
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1487-1718 8.25e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 44.44  E-value: 8.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1487 IAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQaklrlememermrqthsk 1566
Cdd:COG3883      4 LALAAPTPAFADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQA------------------ 65
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1567 EMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESK-LSTLSDQVNQRDF--ESEKRLRKDLKRTKALL 1643
Cdd:COG3883     66 EIDKLQAEIAEAEAEIEERREELGERARALYRSGGSVSYLDVLLGSEsFSDFLDRLSALSKiaDADADLLEELKADKAEL 145
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907082233 1644 ADAQIMLDHLKNnapskrEIAQLKNQLEESEftcAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQRE 1718
Cdd:COG3883    146 EAKKAELEAKLA------ELEALKAELEAAK---AELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAA 211
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
1403-1620 8.96e-04

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 44.74  E-value: 8.96e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1403 LQQLKKKCQRLTAELQdTKLHLEGQQV-RNHELEKKQRRFDSELSQAHEetqreklqrEKLQREKDMLLaeafSLKQQME 1481
Cdd:pfam07111  483 LEQLREERNRLDAELQ-LSAHLIQQEVgRAREQGEAERQQLSEVAQQLE---------QELQRAQESLA----SVGQQLE 548
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1482 EKDLDIAGFTQKVVSLEAEL---QDISSQESKDEasLAKVKKQLRDLEAKVKDQ-EEELDEQAGSIQMLEQAKLRLEMEM 1557
Cdd:pfam07111  549 VARQGQQESTEEAASLRQELtqqQEIYGQALQEK--VAEVETRLREQLSDTKRRlNEARREQAKAVVSLRQIQHRATQEK 626
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082233 1558 ERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQ 1620
Cdd:pfam07111  627 ERNQELRRLQDEARKEEGQRLARRVQELERDKNLMLATLQQEGLLSRYKQQRLLAVLPSGLDK 689
PRK05771 PRK05771
V-type ATP synthase subunit I; Validated
1521-1835 9.42e-04

V-type ATP synthase subunit I; Validated


Pssm-ID: 235600 [Multi-domain]  Cd Length: 646  Bit Score: 44.53  E-value: 9.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1521 QLRDLeaKVKDQEEELDEQAGSIQMLEQA--KLRLEMEMERMRQthSKEMESRDEEVEEARQSCQKKLKQmevqLEEEYE 1598
Cdd:PRK05771    32 HIEDL--KEELSNERLRKLRSLLTKLSEAldKLRSYLPKLNPLR--EEKKKVSVKSLEELIKDVEEELEK----IEKEIK 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1599 DKQKALR----EKRELESKLSTLSdqvNQRDFESEKRLRKDLKRTKALLADAQimldhlKNNAPSKREIAQLKNQLEESE 1674
Cdd:PRK05771   104 ELEEEISelenEIKELEQEIERLE---PWGNFDLDLSLLLGFKYVSVFVGTVP------EDKLEELKLESDVENVEYIST 174
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1675 F----TCAAAVKARKAMEV--EMEDLHLQIDDIAKAKTALEEqLSRLQREKNEIQNRLEEDQEDMNELMKKHKaavaqas 1748
Cdd:PRK05771   175 DkgyvYVVVVVLKELSDEVeeELKKLGFERLELEEEGTPSEL-IREIKEELEEIEKERESLLEELKELAKKYL------- 246
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1749 rdmaqmNDLQAQIEESNKEKQELQEKLQALQS----QVE-FLEQSMVD--KSLVSRQEAK---IRELETRLEFEKTQVKr 1818
Cdd:PRK05771   247 ------EELLALYEYLEIELERAEALSKFLKTdktfAIEgWVPEDRVKklKELIDKATGGsayVEFVEPDEEEEEVPTK- 319
                          330
                   ....*....|....*..
gi 1907082233 1819 LENlaSRLKETMEKLTE 1835
Cdd:PRK05771   320 LKN--PKFIKPFESLTE 334
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
1230-1316 9.71e-04

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 44.46  E-value: 9.71e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1230 SEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERntgesasqlldAETAERLRTEKEMKELQT 1309
Cdd:COG2433    404 EERELTEEEEEIRRLEEQVERLEAEVEELEAELEEKDERIERLERELSEAR-----------SEERREIRKDREISRLDR 472

                   ....*..
gi 1907082233 1310 QYDALKK 1316
Cdd:COG2433    473 EIERLER 479
PHA03247 PHA03247
large tegument protein UL36; Provisional
18-305 1.00e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 1.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   18 DKGVPKPGP-PGKEEGAPEASSKDGSAPP-----RSP---------QPATSPVPSETSQ---TAKSPEPTLTMNGLGAAS 79
Cdd:PHA03247  2565 DRSVPPPRPaPRPSEPAVTSRARRPDAPPqsarpRAPvddrgdprgPAPPSPLPPDTHApdpPPPSPSPAANEPDPHPPP 2644
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   80 AEGPNEEAQGLSRKRVANAVRKVVSKVLPGEEPGTAKEPPGRGARSPEHPArgkkgekaatspkpppppPPPPAPPKPET 159
Cdd:PHA03247  2645 TVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL------------------TSLADPPPPPP 2706
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  160 KKEAAKDELSVGLRSLMSRGRGkdhksRGKQPPGKGEKAPSQEPASTGKSGSPDMVDSPKKAGSPAKPETPNKRCSPAPa 239
Cdd:PHA03247  2707 TPEPAPHALVSATPLPPGPAAA-----RQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP- 2780
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082233  240 qeladPNLAGPKLNPSGKQQAKSPASdvqqETGADPEVRLSPAEEAHQRLERIFTASLDPEAASPA 305
Cdd:PHA03247  2781 -----RRLTRPAVASLSESRESLPSP----WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPT 2837
PRK05431 PRK05431
seryl-tRNA synthetase; Provisional
1691-1782 1.04e-03

seryl-tRNA synthetase; Provisional


Pssm-ID: 235461 [Multi-domain]  Cd Length: 425  Bit Score: 43.90  E-value: 1.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1691 MEDLHLQIDDIAKAKTALE--------EQLSRLQREKNEIQNRLEEDQEDMNEL---MKKHKAAVAQASRDMAQMNDLQA 1759
Cdd:PRK05431     1 MLDIKLIRENPEAVKEALAkrgfpldvDELLELDEERRELQTELEELQAERNALskeIGQAKRKGEDAEALIAEVKELKE 80
                           90       100
                   ....*....|....*....|...
gi 1907082233 1760 QIEESNKEKQELQEKLQALQSQV 1782
Cdd:PRK05431    81 EIKALEAELDELEAELEELLLRI 103
PRK11281 PRK11281
mechanosensitive channel MscK;
1496-1900 1.15e-03

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 44.52  E-value: 1.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1496 SLEAeLQDISSQESKDEASLAKVKKQLRDLeAKVKDQEEELDEqagsiqmLEQaklRLEMEMERMRQThSKEMES-RDEE 1574
Cdd:PRK11281    44 QLDA-LNKQKLLEAEDKLVQQDLEQTLALL-DKIDRQKEETEQ-------LKQ---QLAQAPAKLRQA-QAELEAlKDDN 110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1575 VEEARQSCQK-KLKQMEVQLEE---EYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRL---RKDLKRT----KALL 1643
Cdd:PRK11281   111 DEETRETLSTlSLRQLESRLAQtldQLQNAQNDLAEYNSQLVSLQTQPERAQAALYANSQRLqqiRNLLKGGkvggKALR 190
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1644 ADAQIMLdhlknNApskrEIAQLKNQLEESEFTCAAAVKarkameveMEDLHLQIDDIAKAKTA-LEEQLSRLQREKNei 1722
Cdd:PRK11281   191 PSQRVLL-----QA----EQALLNAQNDLQRKSLEGNTQ--------LQDLLQKQRDYLTARIQrLEHQLQLLQEAIN-- 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1723 QNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQI--------EESNK-EKQELQEKLQ---ALQSQVEFLEQSMV 1790
Cdd:PRK11281   252 SKRLTLSEKTVQEAQSQDEAARIQANPLVAQELEINLQLsqrllkatEKLNTlTQQNLRVKNWldrLTQSERNIKEQISV 331
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1791 DK-SLV-SR----------QEAKIREL-----ETRLE-FEKTQvKRleNLASRLKETMEKLteERDQRAAAENREKEQNK 1852
Cdd:PRK11281   332 LKgSLLlSRilyqqqqalpSADLIEGLadriaDLRLEqFEINQ-QR--DALFQPDAYIDKL--EAGHKSEVTDEVRDALL 406
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907082233 1853 rlqrQLRDTKEEM-----SELARKEAEASrkkhELEMDLESLEAANQSLQADL 1900
Cdd:PRK11281   407 ----QLLDERRELldqlnKQLNNQLNLAI----NLQLNQQQLLSVSDSLQSTL 451
hsdR PRK11448
type I restriction enzyme EcoKI subunit R; Provisional
1820-1929 1.19e-03

type I restriction enzyme EcoKI subunit R; Provisional


Pssm-ID: 236912 [Multi-domain]  Cd Length: 1123  Bit Score: 44.17  E-value: 1.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1820 ENLASRLKETMEKLTEERDQRAaaenREKEQNKRLQRQLRDTKEEMSELArKEAEAsrKKHELEMDLESLEAANQSLQAD 1899
Cdd:PRK11448   141 ENLLHALQQEVLTLKQQLELQA----REKAQSQALAEAQQQELVALEGLA-AELEE--KQQELEAQLEQLQEKAAETSQE 213
                           90       100       110
                   ....*....|....*....|....*....|
gi 1907082233 1900 LKLAFKRIGDlQAAIEDEMESDENEDLINS 1929
Cdd:PRK11448   214 RKQKRKEITD-QAAKRLELSEEETRILIDQ 242
PHA03247 PHA03247
large tegument protein UL36; Provisional
32-320 1.24e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 1.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   32 GAPEASSKDGSAPPrSPQPATSPvPSETSQTAKSPEPtltmnglgaasAEGPNEEAQGlSRKRVANAVRKVVSKVLPGEE 111
Cdd:PHA03247  2539 GLEELASDDAGDPP-PPLPPAAP-PAAPDRSVPPPRP-----------APRPSEPAVT-SRARRPDAPPQSARPRAPVDD 2604
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  112 PGtakEPPGRGARSPEHPARGKKGEKAATSPKPPPPPPPPPAPPKPEtkKEAAKDELSVGLRSLMSRGRGKDHKSRGKQP 191
Cdd:PHA03247  2605 RG---DPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPP--PERPRDDPAPGRVSRPRRARRLGRAAQASSP 2679
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  192 PGKGEK-----------------APSQEPASTGKSGSPDMVDSP----KKAGSPAKPETPNKRCSPAPAQELADPNLAGP 250
Cdd:PHA03247  2680 PQRPRRraarptvgsltsladppPPPPTPEPAPHALVSATPLPPgpaaARQASPALPAAPAPPAVPAGPATPGGPARPAR 2759
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  251 KLNPSGKQQAKSPASDVQQETGADPEVRLSPAEEAHQRLERIFTASLDPEAASPAYSQAKTEEQIAAEEA 320
Cdd:PHA03247  2760 PPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLP 2829
DUF1633 pfam07794
Protein of unknown function (DUF1633); This family contains sequences derived from a group of ...
1519-1710 1.38e-03

Protein of unknown function (DUF1633); This family contains sequences derived from a group of hypothetical proteins expressed by Arabidopsis thaliana. These sequences are highly similar and the region concerned is about 100 residues long.


Pssm-ID: 116408 [Multi-domain]  Cd Length: 698  Bit Score: 44.11  E-value: 1.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1519 KKQLRDLEAKVK-DQEEELDEQAgsiQMLEQAKLRLEMEMermrqTHSKEMESRDEEveearqscqkKLKQMEVQLEEEY 1597
Cdd:pfam07794  457 ERAIREEDPHLGaDQDREVRFGA---EGIVPGIERLKIEL-----STSKDLEKGYAE----------KIGFMEMEFGGLE 518
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1598 EDKQKALREKRELESKLSTLSDQVnqRDFESEKR-LRKDLKRTKALLADAQI-MLDHLKNNAPSKREIAQLKNQLEESEF 1675
Cdd:pfam07794  519 ADKQMARNQIHRLEEKKDELSKKV--LDLTSIAQgAKKAVHDAKVELAAAYLkLLAGIKDKWVAKKEFTVLEGQAAEVES 596
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1907082233 1676 TCA---AAVKARKAMEVEMEDLHLQIDDIaKAKTALEE 1710
Cdd:pfam07794  597 NLAlidQITKAAIDLTLEKPRFQAEIDDL-EARCKLKE 633
ERM_helical pfam20492
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related ...
1762-1880 1.42e-03

Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related proteins, ezrin, radixin and moesin. Ezrin was first identified as a constituent of microvilli, radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions, and moesin as a heparin binding protein. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain, an extended alpha-helical domain and a charged C-terminal domain (pfam00769). Ezrin, radixin and merlin also contain a polyproline linker region between the helical and C-terminal domains. The N-terminal domain is highly conserved and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily, designated the FERM domain. ERM proteins crosslink actin filaments with plasma membranes. They co-localize with CD44 at actin filament plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains. This is the alpha-helical domain, which is involved in intramolecular masking of protein-protein interaction sites, regulating the activity of this proteins.


Pssm-ID: 466641 [Multi-domain]  Cd Length: 120  Bit Score: 40.67  E-value: 1.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1762 EESNKEKQELQEKLQALQSQVEFLEQSMvdkslvSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKL------TE 1835
Cdd:pfam20492    2 EEAEREKQELEERLKQYEEETKKAQEEL------EESEETAEELEEERRQAEEEAERLEQKRQEAEEEKERLeesaemEA 75
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 1907082233 1836 ERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKH 1880
Cdd:pfam20492   76 EEKEQLEAELAEAQEEIARLEEEVERKEEEARRLQEELEEAREEE 120
OmpH pfam03938
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
1852-1946 1.50e-03

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 461098 [Multi-domain]  Cd Length: 140  Bit Score: 41.02  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1852 KRLQRQLRDTKEEM-SELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEdLINSL 1930
Cdd:pfam03938   18 KAAQAQLEKKFKKRqAELEAKQKELQKLYEELQKDGALLEEEREEKEQELQKKEQELQQLQQKAQQELQKKQQE-LLQPI 96
                           90
                   ....*....|....*.
gi 1907082233 1931 QDMVTKYQKKKNKLEG 1946
Cdd:pfam03938   97 QDKINKAIKEVAKEKG 112
growth_prot_Scy NF041483
polarized growth protein Scy;
1427-1901 1.53e-03

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 44.05  E-value: 1.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1427 QQVRNHELEKKQRRFDSE--LSQAHEETQR----EKLQREKLQREkdmLLAEAFSLKQQMEEK------------DLDIA 1488
Cdd:NF041483    76 QLLRNAQIQADQLRADAEreLRDARAQTQRilqeHAEHQARLQAE---LHTEAVQRRQQLDQElaerrqtveshvNENVA 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1489 GFTQKVVSLEAE---LQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEM-------- 1557
Cdd:NF041483   153 WAEQLRARTESQarrLLDESRAEAEQALAAARAEAERLAEEARQRLGSEAESARAEAEAILRRARKDAERLLnaastqaq 232
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1558 ------ERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNqrdfesEKR 1631
Cdd:NF041483   233 eatdhaEQLRSSTAAESDQARRQAAELSRAAEQRMQEAEEALREARAEAEKVVAEAKEAAAKQLASAESAN------EQR 306
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1632 LRKdlkrtkalladaqimldhlknnapSKREIAQLKNQ-LEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEE 1710
Cdd:NF041483   307 TRT------------------------AKEEIARLVGEaTKEAEALKAEAEQALADARAEAEKLVAEAAEKARTVAAEDT 362
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1711 --QLSRLQREKNEIQNRLEEDQEDM-----NELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQ--------ELQEKL 1775
Cdd:NF041483   363 aaQLAKAARTAEEVLTKASEDAKATtraaaEEAERIRREAEAEADRLRGEAADQAEQLKGAAKDDTkeyraktvELQEEA 442
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1776 QALQSQVEFLEQSMVDKSLVSRQEAKireletrlefeKTQVKRLENLASRLKETMEKLTEERDQ--RAAAENREKEQNKR 1853
Cdd:NF041483   443 RRLRGEAEQLRAEAVAEGERIRGEAR-----------REAVQQIEEAARTAEELLTKAKADADElrSTATAESERVRTEA 511
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1854 LQR--QLRDTKEEMSELARKEAEasRKKHELEMDLESLEAANQSLQADLK 1901
Cdd:NF041483   512 IERatTLRRQAEETLERTRAEAE--RLRAEAEEQAEEVRAAAERAARELR 559
PRK11281 PRK11281
mechanosensitive channel MscK;
1658-1955 1.63e-03

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 43.75  E-value: 1.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1658 PSKREIaqlKNQLEeseftcaaAVKARKAMEVE-------MEDLHLQIDDIAKAK---TALEEQLSRLQREKNEIQNRLE 1727
Cdd:PRK11281    36 PTEADV---QAQLD--------ALNKQKLLEAEdklvqqdLEQTLALLDKIDRQKeetEQLKQQLAQAPAKLRQAQAELE 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1728 EdqedmnelMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQvefleqsmvdksLVSRQEAKIR---E 1804
Cdd:PRK11281   105 A--------LKDDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQNDLAEYNSQ------------LVSLQTQPERaqaA 164
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1805 LETRLefekTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELArkeaeasRKKHELem 1884
Cdd:PRK11281   165 LYANS----QRLQQIRNLLKGGKVGGKALRPSQRVLLQAEQALLNAQNDLQRKSLEGNTQLQDLL-------QKQRDY-- 231
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082233 1885 dlesLEAANQSLQadlklafKRIGDLQAAIedemesdeNEDLINSLQDMVTKYQ--KKKNKLEGDSDVDSELE 1955
Cdd:PRK11281   232 ----LTARIQRLE-------HQLQLLQEAI--------NSKRLTLSEKTVQEAQsqDEAARIQANPLVAQELE 285
PRK12704 PRK12704
phosphodiesterase; Provisional
1738-1885 1.70e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 43.61  E-value: 1.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1738 KKHKAAVAQASRDMAQMndlqaqIEESNKEKQELQeKLQALQSQVEFLE-QSMVDKSLVSRqEAKIRELETRLEFEKTQV 1816
Cdd:PRK12704    27 KIAEAKIKEAEEEAKRI------LEEAKKEAEAIK-KEALLEAKEEIHKlRNEFEKELRER-RNELQKLEKRLLQKEENL 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1817 KRLENLASRLKETMEKLTEERDQR--------AAAENREKEQNKRLQRQLRDTKEE-----MSELARK-EAEASRKKHEL 1882
Cdd:PRK12704    99 DRKLELLEKREEELEKKEKELEQKqqelekkeEELEELIEEQLQELERISGLTAEEakeilLEKVEEEaRHEAAVLIKEI 178

                   ...
gi 1907082233 1883 EMD 1885
Cdd:PRK12704   179 EEE 181
GAS pfam13851
Growth-arrest specific micro-tubule binding; This family is the highly conserved central ...
1367-1528 1.75e-03

Growth-arrest specific micro-tubule binding; This family is the highly conserved central region of a number of metazoan proteins referred to as growth-arrest proteins. In mouse, Gas8 is predominantly a testicular protein, whose expression is developmentally regulated during puberty and spermatogenesis. In humans, it is absent in infertile males who lack the ability to generate gametes. The localization of Gas8 in the motility apparatus of post-meiotic gametocytes and mature spermatozoa, together with the detection of Gas8 also in cilia at the apical surfaces of epithelial cells lining the pulmonary bronchi and Fallopian tubes suggests that the Gas8 protein may have a role in the functioning of motile cellular appendages. Gas8 is a microtubule-binding protein localized to regions of dynein regulation in mammalian cells.


Pssm-ID: 464001 [Multi-domain]  Cd Length: 200  Bit Score: 41.82  E-value: 1.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1367 LQQELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQD---TKLHLEGQQVRNHELEKKQRrfds 1443
Cdd:pfam13851   27 LIKSLKEEIAELKKKEERNEKLMSEIQQENKRLTEPLQKAQEEVEELRKQLENyekDKQSLKNLKARLKVLEKELK---- 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1444 ELSQAHEEtqreKLQR-EKLQREKDMLLAE----AFSLKQQMEEKDLDIAgftQKVVSL-------EAELQDISSQESKD 1511
Cdd:pfam13851  103 DLKWEHEV----LEQRfEKVERERDELYDKfeaaIQDVQQKTGLKNLLLE---KKLQALgetlekkEAQLNEVLAAANLD 175
                          170
                   ....*....|....*...
gi 1907082233 1512 EASLAKVKKQLRD-LEAK 1528
Cdd:pfam13851  176 PDALQAVTEKLEDvLESK 193
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
1798-1906 1.77e-03

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 43.69  E-value: 1.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1798 QEAKIRELETRLEFEKTQVKRLENLASRLKETMEklteERDQRAAaenREKEQNKRLQRQLRDTKEEMSELARKEAEASR 1877
Cdd:COG2433    404 EERELTEEEEEIRRLEEQVERLEAEVEELEAELE----EKDERIE---RLERELSEARSEERREIRKDREISRLDREIER 476
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1907082233 1878 KKHELE---MDLESLEAANQSLQADLKLAFKR 1906
Cdd:COG2433    477 LERELEeerERIEELKRKLERLKELWKLEHSG 508
GBP_C pfam02841
Guanylate-binding protein, C-terminal domain; Transcription of the anti-viral ...
1367-1504 1.93e-03

Guanylate-binding protein, C-terminal domain; Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP.


Pssm-ID: 460721 [Multi-domain]  Cd Length: 297  Bit Score: 42.66  E-value: 1.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1367 LQQELEDKmEVEQQSRRQLERRLgdlqadsDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRnheLEKKQRRFDSELS 1446
Cdd:pfam02841  178 LQEFLQSK-EAVEEAILQTDQAL-------TAKEKAIEAERAKAEAAEAEQELLREKQKEEEQM---MEAQERSYQEHVK 246
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082233 1447 QAHEETQREklqREKLQREKDMLLAeafslKQQMEEKDLDIAGFTQKVVSLEAELQDI 1504
Cdd:pfam02841  247 QLIEKMEAE---REQLLAEQERMLE-----HKLQEQEELLKEGFKTEAESLQKEIQDL 296
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
1727-1877 1.98e-03

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 43.31  E-value: 1.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1727 EEDQEDMNELMKKHKAAVAQASRDMAQ-MNDLQAQIEESNKEKQELQEKLQalqsqvefleqsmvdkslvsRQEAKIREL 1805
Cdd:COG2433    387 EKELPEEEPEAEREKEHEERELTEEEEeIRRLEEQVERLEAEVEELEAELE--------------------EKDERIERL 446
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907082233 1806 ETRLEFEKTQVKRlenlasrlketmekltEERDQRaAAENREKEqNKRLQRQLRDTKEEMSELARKEAEASR 1877
Cdd:COG2433    447 ERELSEARSEERR----------------EIRKDR-EISRLDRE-IERLERELEEERERIEELKRKLERLKE 500
PHA03247 PHA03247
large tegument protein UL36; Provisional
18-314 2.05e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 2.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   18 DKGVPKPGPPGKEEGAPEASSKDGSAPPRSPQPATSPVPSETSQTAKSPEPtltmnglgaASAEGPNEEAQGLSRKRVAN 97
Cdd:PHA03247  2620 DTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRL---------GRAAQASSPPQRPRRRAARP 2690
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   98 AVRKVVSKVLP---GEEPGTAKEPPGRGARSPEHPARGKKGEKAATSPKPPPPPPPPPAPPKPETKKeAAKDELSVGLRS 174
Cdd:PHA03247  2691 TVGSLTSLADPpppPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP-ARPPTTAGPPAP 2769
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  175 LMSRGRGKDHKSRGKQPPGKG-----EKAPSQEPASTGKSGSPDMVDSPKKAGSPAKPETPnkrcsPAPAQELADPNLAG 249
Cdd:PHA03247  2770 APPAAPAAGPPRRLTRPAVASlsesrESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPP-----PTSAQPTAPPPPPG 2844
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907082233  250 PKLNPSGKQQAKSPASDVQQETGADPEVRlSPAEEAHQRLERIFTASLDPEAASPAYSQAKTEEQ 314
Cdd:PHA03247  2845 PPPPSLPLGGSVAPGGDVRRRPPSRSPAA-KPAAPARPPVRRLARPAVSRSTESFALPPDQPERP 2908
SHE3 pfam17078
SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an ...
1691-1858 2.23e-03

SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an RNA-binding protein that binds specific mRNAs, including the mRNA of Ash1, which is invalid in cell-fate determination. She3 acts as an adapter protein that docks the myosin motor Myo4p onto an Ash1-She2p ribonucleoprotein complex. She3 seems to bind to Myo4p and Shep2p via different domains.


Pssm-ID: 293683 [Multi-domain]  Cd Length: 228  Bit Score: 42.03  E-value: 2.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1691 MEDLHLQIDDIAKAKTALEEQ----LSRL----QRE----------KNEIQN----------RLEEDQEDMNELMKKHKA 1742
Cdd:pfam17078    5 IESLHDQIDALTKTNLQLTVQsqnlLSKLeiaqQKEskflenlaslKHENDNlssmlnrkerRLKDLEDQLSELKNSYEE 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1743 AVaqasrdmAQMNDLQAQIEESNKEKQELQEKLQALQSQVEfleqSMVDKSLVSRQ--EAKIRELETRLE-FEKTQVKRL 1819
Cdd:pfam17078   85 LT-------ESNKQLKKRLENSSASETTLEAELERLQIQYD----ALVDSQNEYKDhyQQEINTLQESLEdLKLENEKQL 153
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1907082233 1820 ENLASRL----KETMEKLTEERDQRAAAENREKEQNKRLQRQL 1858
Cdd:pfam17078  154 ENYQQRIssndKDIDTKLDSYNNKFKNLDNIYVNKNNKLLTKL 196
AAA_13 pfam13166
AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA ...
1507-1871 2.29e-03

AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA superfamily. Many of the proteins in this family are conjugative transfer proteins. This family includes the PrrC protein that is thought to be the active component of the anticodon nuclease.


Pssm-ID: 463796 [Multi-domain]  Cd Length: 712  Bit Score: 43.13  E-value: 2.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1507 QESKD-EASLAKVKKQLRDLEAKVKDQEEELdeqagsiqmlEQAKLRLEMEMERMRQTHSKEMESR-DEEVEEARQSCQK 1584
Cdd:pfam13166   89 EESIEiQEKIAKLKKEIKDHEEKLDAAEANL----------QKLDKEKEKLEADFLDECWKKIKRKkNSALSEALNGFKY 158
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1585 KLKQMEVQLEE--EYEDKQKALREKRELESKLSTLSDQ---------VNQRDFESEKRLRKDLKRTKALLADAQIMLDHL 1653
Cdd:pfam13166  159 EANFKSRLLREieKDNFNAGVLLSDEDRKAALATVFSDnkpeiapltFNVIDFDALEKAEILIQKVIGKSSAIEELIKNP 238
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1654 KNNAPSKREIAQLKNQLEESEFtC--------AAAVKAR--KAMEVEMEDLHLQIDDIAKAKTALEEQL----------S 1713
Cdd:pfam13166  239 DLADWVEQGLELHKAHLDTCPF-CgqplpaerKAALEAHfdDEFTEFQNRLQKLIEKVESAISSLLAQLpavsdlasllS 317
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1714 RLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASR---------DMAQMNDLQAQIEES----NKEKQELQEKLQALQS 1780
Cdd:pfam13166  318 AFELDVEDIESEAEVLNSQLDGLRRALEAKRKDPFKsieldsvdaKIESINDLVASINELiakhNEITDNFEEEKNKAKK 397
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1781 QVEfleqsmvdKSLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLteerdqraaaenreKEQNKRLQRQLRD 1860
Cdd:pfam13166  398 KLR--------LHLVEEFKSEIDEYKDKYAGLEKAINSLEKEIKNLEAEIKKL--------------REEIKELEAQLRD 455
                          410
                   ....*....|....
gi 1907082233 1861 TK---EEMSELARK 1871
Cdd:pfam13166  456 HKpgaDEINKLLKA 469
CCCAP pfam15964
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in ...
1635-1891 2.37e-03

Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in eukaryotes. CCCAP is also known as SDCCAG8, serologically defined colon cancer antigen 8. It is associated with the centrosome.


Pssm-ID: 435040 [Multi-domain]  Cd Length: 703  Bit Score: 43.36  E-value: 2.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1635 DLKRTKALladaqIMLDHLKNNAPSKREiaQLKNQLE-ESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLS 1713
Cdd:pfam15964  348 NFEKTKAL-----IQCEQLKSELERQKE--RLEKELAsQQEKRAQEKEALRKEMKKEREELGATMLALSQNVAQLEAQVE 420
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1714 RLQREKNEIQNRLEEDQEDmnelMKKHKAAVAQASRDMA-QMNDLQAQIEESNKEKQELQEKlqaLQSQVEFLEQsmvdk 1792
Cdd:pfam15964  421 KVTREKNSLVSQLEEAQKQ----LASQEMDVTKVCGEMRyQLNQTKMKKDEAEKEHREYRTK---TGRQLEIKDQ----- 488
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1793 slvsrqeaKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDqraaaenrEKEQNKRLQRQLRDTKEE-MSELARK 1871
Cdd:pfam15964  489 --------EIEKLGLELSESKQRLEQAQQDAARAREECLKLTELLG--------ESEHQLHLTRLEKESIQQsFSNEAKA 552
                          250       260
                   ....*....|....*....|.
gi 1907082233 1872 EA-EASRKKHELEMDLESLEA 1891
Cdd:pfam15964  553 QAlQAQQREQELTQKMQQMEA 573
PHA03247 PHA03247
large tegument protein UL36; Provisional
17-303 2.38e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 2.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   17 EDKGVPKPGP-PGKEEGAPEasskDGSAPPRSPQPATSPVPSET--------------------SQTAKSPEPTLTMNGL 75
Cdd:PHA03247  2485 EARFPFAAGAaPDPGGGGPP----DPDAPPAPSRLAPAILPDEPvgepvhprmltwirgleelaSDDAGDPPPPLPPAAP 2560
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   76 GAAS---------AEGPNEEAQGlSRKRVANAVRKVVSKVLPGEEPGtakEPPGRGARSPEHPARGKKGEKAATSPKPPP 146
Cdd:PHA03247  2561 PAAPdrsvppprpAPRPSEPAVT-SRARRPDAPPQSARPRAPVDDRG---DPRGPAPPSPLPPDTHAPDPPPPSPSPAAN 2636
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  147 PPPPPPAPPKPEtkKEAAKDELSVGLRSLMSRGRGKDHKSRGKQPPgKGEKAPSQEPAS---TGKSGSPDMVDSPKKAGS 223
Cdd:PHA03247  2637 EPDPHPPPTVPP--PERPRDDPAPGRVSRPRRARRLGRAAQASSPP-QRPRRRAARPTVgslTSLADPPPPPPTPEPAPH 2713
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  224 PAKPETP-------NKRCSPAPAQELADPNLAGPKLNPSGKQQAKSPASDVQQETGADPEVrlsPAEEAHQRLERIFTAS 296
Cdd:PHA03247  2714 ALVSATPlppgpaaARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA---PAAGPPRRLTRPAVAS 2790

                   ....*..
gi 1907082233  297 LDPEAAS 303
Cdd:PHA03247  2791 LSESRES 2797
GOLGA2L5 pfam15070
Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein ...
1239-1733 2.41e-03

Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein family remains unknown. This family of proteins is thought to be found in the Golgi apparatus of eukaryotes.


Pssm-ID: 464485 [Multi-domain]  Cd Length: 521  Bit Score: 43.13  E-value: 2.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1239 EEIQQLRSKLEKVEKERnelrlssDRLETRISELTSELTDERNTGESASQLLDAETA----ERLRTEKEMKELQTQYDAL 1314
Cdd:pfam15070   29 QKMQQLSEQVRTLREEK-------ERSVSQVQELETSLAELKNQAAVPPAEEEQPPAgpseEEQRLQEEAEQLQKELEAL 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1315 kkqmevmemevmearlirAAEINGEVDDDDagGEWRLKYERAVREVDftkkrLQQELEDKMEVEQQSRRQLErrlgDLQA 1394
Cdd:pfam15070  102 ------------------AGQLQAQVQDNE--QLSRLNQEQEQRLLE-----LERAAERWGEQAEDRKQILE----DMQS 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1395 DSDESQRALQQLKKKCQRLtAELQD-------------TKLHLEgQQVRNhELEKKQRRFDSELSQAHEETQREKLQREK 1461
Cdd:pfam15070  153 DRATISRALSQNRELKEQL-AELQNgfvkltnenmeltSALQSE-QHVKK-ELAKKLGQLQEELGELKETLELKSQEAQS 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1462 LQREKDMLLAE----AFSLKQQMEEKDLdiagfTQKVVSLEAELQD-ISSQESKDEASLAKVKKQLRD----LEAKVKDQ 1532
Cdd:pfam15070  230 LQEQRDQYLAHlqqyVAAYQQLASEKEE-----LHKQYLLQTQLMDrLQHEEVQGKVAAEMARQELQEtqerLEALTQQN 304
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1533 EEEldeQAGSIQMLEQAK---LRLEMEMERMRQTH---SKEMESRDEEVE---EARQSCQKKLKQMEVQLEEEYedkqka 1603
Cdd:pfam15070  305 QQL---QAQLSLLANPGEgdgLESEEEEEEAPRPSlsiPEDFESREAMVAffnSALAQAEEERAELRRQLKEQK------ 375
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1604 lREKRELESKLSTLSDQvnQRDFESEKRLRKD---LKRTKALladaQIMLDHLKNNAPS-KREIAQLKNQLEESEFTCAa 1679
Cdd:pfam15070  376 -RRCRRLAQQAAPAQEE--PEHEAHAPGTGGDsvpVEVHQAL----QVAMEKLQSRFTElMQEKADLKERVEELEHRCI- 447
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082233 1680 avkarkamevemeDLHLQIDDIAKAKTALEEQ---LSRLQREKNEIQNRLEEDQEDM 1733
Cdd:pfam15070  448 -------------QLSGETDTIGEYIALYQSQraiLKQRHREKEEYISRLAQDKEEM 491
OmpH pfam03938
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
1519-1606 2.57e-03

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 461098 [Multi-domain]  Cd Length: 140  Bit Score: 40.25  E-value: 2.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1519 KKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYE 1598
Cdd:pfam03938   18 KAAQAQLEKKFKKRQAELEAKQKELQKLYEELQKDGALLEEEREEKEQELQKKEQELQQLQQKAQQELQKKQQELLQPIQ 97

                   ....*....
gi 1907082233 1599 DK-QKALRE 1606
Cdd:pfam03938   98 DKiNKAIKE 106
KLF9_13_N-like cd21975
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ...
20-130 2.63e-03

Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.


Pssm-ID: 409240 [Multi-domain]  Cd Length: 163  Bit Score: 40.83  E-value: 2.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   20 GVPKPGPPGKEEGAPEASSKDGSAPPRSPQPATSPVPSETSQTAKSPEPTLTMNGLGAAS----AEGPNEEAQGLSrKRV 95
Cdd:cd21975     15 GAVVHGVRPDPEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGADSPGLVTAAPHLLAANvlapLRGPSVEGSSLE-SGD 93
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 1907082233   96 ANAVRKVVSKVLPGEEPGTAKEPPGRGARSPEHPA 130
Cdd:cd21975     94 ADMGSDSDVAPASGAAASTSPESSSDAASSPSPLS 128
PRK11281 PRK11281
mechanosensitive channel MscK;
1285-1544 3.02e-03

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 42.98  E-value: 3.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1285 SASQLLDAETAERLRTEKEmkeLQTQYDALKKQMEVmemevmearliraaeingEVDDddaggewrlkyeravrevdftk 1364
Cdd:PRK11281    22 LSSAFARAASNGDLPTEAD---VQAQLDALNKQKLL------------------EAED---------------------- 58
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1365 KRLQQELE------DKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAE-LQDTKL-HLEGQQV-RNHELE 1435
Cdd:PRK11281    59 KLVQQDLEqtlallDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDDNDEETREtLSTLSLrQLESRLAqTLDQLQ 138
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1436 KKQRRF---DSELSQAHeeTQREKLQRE---KLQR---------------------EKDMLLAEAFSLKQQMEEKDLDIA 1488
Cdd:PRK11281   139 NAQNDLaeyNSQLVSLQ--TQPERAQAAlyaNSQRlqqirnllkggkvggkalrpsQRVLLQAEQALLNAQNDLQRKSLE 216
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907082233 1489 GFTQkvvsleaeLQDIsSQESKDEASL--AKVKKQLRDLE----AKVKDQEEELDEQAGSIQ 1544
Cdd:PRK11281   217 GNTQ--------LQDL-LQKQRDYLTAriQRLEHQLQLLQeainSKRLTLSEKTVQEAQSQD 269
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1467-1944 3.06e-03

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 43.28  E-value: 3.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1467 DMLLAEAFSLKQQMEEKDLDIagfTQKVVSLEAELQDISSQESKDEASL-----------AKVKKQLRDLEAKVKDQEEE 1535
Cdd:PTZ00440   601 EELINEALFNKEKFINEKNDL---QEKVKYILNKFYKGDLQELLDELSHflddhkylyheAKSKEDLQTLLNTSKNEYEK 677
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1536 LDEQAGSI--QMLEQAKLRLEmemermrqtHSKEMESR--DEEVEEARQSCQKKLKQMEvqleEEYEDKQKALREKRELE 1611
Cdd:PTZ00440   678 LEFMKSDNidNIIKNLKKELQ---------NLLSLKENiiKKQLNNIEQDISNSLNQYT----IKYNDLKSSIEEYKEEE 744
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1612 SKLSTLSDQVNQRdfesEKRLRKDLKRTKALLADAQIMLDHLKNNapsKREIAQLKNQLEESEFTCAAAVKARKAMEVEM 1691
Cdd:PTZ00440   745 EKLEVYKHQIINR----KNEFILHLYENDKDLPDGKNTYEEFLQY---KDTILNKENKISNDINILKENKKNNQDLLNSY 817
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1692 EDLHLQIDDIAKAKTALEEQLsrLQREKNEIQN-RLEEDQEDMNELMKKhkaavaqasrdmaqMNDLQAQIEESNKekqe 1770
Cdd:PTZ00440   818 NILIQKLEAHTEKNDEELKQL--LQKFPTEDENlNLKELEKEFNENNQI--------------VDNIIKDIENMNK---- 877
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1771 lqeKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEfektqvkrlenlaSRLKETMEKLTEERDQRAAAENREKEQ 1850
Cdd:PTZ00440   878 ---NINIIKTLNIAINRSNSNKQLVEHLLNNKIDLKNKLE-------------QHMKIINTDNIIQKNEKLNLLNNLNKE 941
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1851 NKRLQRQLRDTKeeMSELarkeaeasrkKHELEMDLESLEAANQSLQADLKLafkrigDLQAAIEDEMESDENEDLINSL 1930
Cdd:PTZ00440   942 KEKIEKQLSDTK--INNL----------KMQIEKTLEYYDKSKENINGNDGT------HLEKLDKEKDEWEHFKSEIDKL 1003
                          490
                   ....*....|....
gi 1907082233 1931 QdmvTKYQKKKNKL 1944
Cdd:PTZ00440  1004 N---VNYNILNKKI 1014
Tropomyosin pfam00261
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 ...
1661-1904 3.13e-03

Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites. The protein is best known for its role in regulating the interaction between actin and myosin in muscle contraction, but is also involved in the organization and dynamics of the cytoskeleton in non-muscle cells. There are multiple cell-specific isoforms, expressed by alternative promoters and alternative RNA processing of at least four genes. Muscle isoforms of tropomyosin are characterized by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region.


Pssm-ID: 459736 [Multi-domain]  Cd Length: 235  Bit Score: 41.55  E-value: 3.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1661 REIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKH 1740
Cdd:pfam00261    1 KKMQQIKEELDEAEERLKEAMKKLEEAEKRAEKAEAEVAALNRRIQLLEEELERTEERLAEALEKLEEAEKAADESERGR 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1741 KAAVAQASRDMAQMNDLQAQI-------EESNKEKQELQEKLQALQSQVEFLEQSmvdkslVSRQEAKIRELETRLEFEK 1813
Cdd:pfam00261   81 KVLENRALKDEEKMEILEAQLkeakeiaEEADRKYEEVARKLVVVEGDLERAEER------AELAESKIVELEEELKVVG 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1814 TQVKRLENLAsrlketmekltEERDQRaaaENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAAN 1893
Cdd:pfam00261  155 NNLKSLEASE-----------EKASER---EDKYEEQIRFLTEKLKEAETRAEFAERSVQKLEKEVDRLEDELEAEKEKY 220
                          250
                   ....*....|.
gi 1907082233 1894 QSLQADLKLAF 1904
Cdd:pfam00261  221 KAISEELDQTL 231
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1556-1729 3.15e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 41.45  E-value: 3.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1556 EMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEvQLEEEYEDKQKALREKRELESKLSTLSDQV-NQRDFES-EKRLr 1633
Cdd:COG1579     21 RLEHRLKELPAELAELEDELAALEARLEAAKTELE-DLEKEIKRLELEIEEVEARIKKYEEQLGNVrNNKEYEAlQKEI- 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1634 KDLKRTKALLADaqimldhlknnapskrEIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAktaLEEQLS 1713
Cdd:COG1579     99 ESLKRRISDLED----------------EILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAE---LEAELE 159
                          170
                   ....*....|....*.
gi 1907082233 1714 RLQREKNEIQNRLEED 1729
Cdd:COG1579    160 ELEAEREELAAKIPPE 175
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
22-346 3.30e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.67  E-value: 3.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   22 PKPGPPGKEEGAPEASSKDGSAP---PRSPQPATSPVPSETSQT---------------------AKSPEPTLTMNGLGA 77
Cdd:PRK07764   468 PAPAAAPEPTAAPAPAPPAAPAPaaaPAAPAAPAAPAGADDAATlrerwpeilaavpkrsrktwaILLPEATVLGVRGDT 547
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   78 ASAEGPNE-EAQGLSRKRVANAVRKVVSKVL-------------PGEEPGTAKEPPGRGARSPEHPARGKKGEKAATSPK 143
Cdd:PRK07764   548 LVLGFSTGgLARRFASPGNAEVLVTALAEELggdwqveavvgpaPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAP 627
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  144 PPPPPPPPPAPPKPETKKEAAKDELSVGLRSLMSRGRGKDHksrgkQPPGKGEKAPSQEPASTGKSGSPDMVDSPKKAGS 223
Cdd:PRK07764   628 APAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDG-----WPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPA 702
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  224 PAKPETPNKRCSPAPAQELADPNLAGPKLNPSGKQQAKSPASDVQQETGADPEVRLSPAEEAHQRleriftasLDPEAAS 303
Cdd:PRK07764   703 PAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPA--------AAPAAAP 774
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1907082233  304 PAYSQAktEEQIAAEEAWYETEKvwLVHRDGFSLASQLKSEEL 346
Cdd:PRK07764   775 PPSPPS--EEEEMAEDDAPSMDD--EDRRDAEEVAMELLEEEL 813
DUF4795 pfam16043
Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. ...
1720-1845 3.39e-03

Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 285 and 978 amino acids in length.


Pssm-ID: 464990 [Multi-domain]  Cd Length: 181  Bit Score: 40.75  E-value: 3.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1720 NEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEK-------QELQEK--LQALQSQVEF------ 1784
Cdd:pfam16043   10 DQLQALILDLQEELEKLSETTSELSERLQQRQKHLEALYQQIEKLEKVKadkevveEELDEKadKEALASKVSRdqfdet 89
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082233 1785 ---LEQSMVDK-SLVSRQEAKIR---ELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAEN 1845
Cdd:pfam16043   90 leeLNQMLQELlDKLEGQEDAWKkalETLSEELDTKLDRLELDPLKELLERRIKALQKLLQEGSEELD 157
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
1367-1606 3.49e-03

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here


Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 41.28  E-value: 3.49e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1367 LQQELEDKMEVEQQSRRQLERRLGDLQADSDESQraLQQLKKKCQRLTAELQdtklhleGQQVRNHELEKKQRRFDSELS 1446
Cdd:cd00176      1 KLQQFLRDADELEAWLSEKEELLSSTDYGDDLES--VEALLKKHEALEAELA-------AHEERVEALNELGEQLIEEGH 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1447 QAHEETQReklQREKLQREKDMLLAEAFSLKQQMEEKdLDIAGFTQKVVSLEAELQDISSQESKDE--ASLAKVKKQLRD 1524
Cdd:cd00176     72 PDAEEIQE---RLEELNQRWEELRELAEERRQRLEEA-LDLQQFFRDADDLEQWLEEKEAALASEDlgKDLESVEELLKK 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1525 LeakvKDQEEELDEQAGSIQMLEQAKLRLEmemermrqthskemESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKAL 1604
Cdd:cd00176    148 H----KELEEELEAHEPRLKSLNELAEELL--------------EEGHPDADEEIEEKLEELNERWEELLELAEERQKKL 209

                   ..
gi 1907082233 1605 RE 1606
Cdd:cd00176    210 EE 211
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
18-250 3.55e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 42.53  E-value: 3.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   18 DKGVPKPGPPGKEEGAPEASSKDGSAPPRSPQPATSPVPSETSQTAKSPEPtltmnglgaaSAEGPNEEAQGLSRKRVAN 97
Cdd:PRK07003   444 DGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAP----------RAAAPSAATPAAVPDARAP 513
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   98 AVRKVVSKVLPGEEPGTAKEPPGRGARSPehPARGkkgekaatspkpppppppppappkpeTKKEAAKDEL-SVGLRSLM 176
Cdd:PRK07003   514 AAASREDAPAAAAPPAPEARPPTPAAAAP--AARA--------------------------GGAAAALDVLrNAGMRVSS 565
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082233  177 SRGRGkdhksrgkqpPGKGEKAPSQEPASTGKSGSPDMVDSPKKAGSPAKPETPNKRCSPAPAQelADPNLAGP 250
Cdd:PRK07003   566 DRGAR----------AAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQA--AESRGAPP 627
Caldesmon pfam02029
Caldesmon;
1558-1883 3.69e-03

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 42.55  E-value: 3.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1558 ERMRQTHSKEMESR-DEEVEEARQSCQKKLKQMEVQLEEEYEDKQKalrekrelesklstLSDQVNQRDFESEKRLRKDL 1636
Cdd:pfam02029   18 ERRRQKEEEEPSGQvTESVEPNEHNSYEEDSELKPSGQGGLDEEEA--------------FLDRTAKREERRQKRLQEAL 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1637 KRTKALLadaqimldhlKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEdlhlqiddiakaktalEEQLSRlq 1716
Cdd:pfam02029   84 ERQKEFD----------PTIADEKESVAERKENNEEEENSSWEKEEKRDSRLGRYK----------------EEETEI-- 135
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1717 REKNEIQNRLEEDQEDMNELMKKHKAAVAQASR-DMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQS------M 1789
Cdd:pfam02029  136 REKEYQENKWSTEVRQAEEEGEEEEDKSEEAEEvPTENFAKEEVKDEKIKKEKKVKYESKVFLDQKRGHPEVKsqngeeE 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1790 VDKSLVS--RQEAKIRELETRLEFEKTQV---KRLENLASRLKET----MEKLtEERDQRAAAE----NREKEQNKRL-- 1854
Cdd:pfam02029  216 VTKLKVTtkRRQGGLSQSQEREEEAEVFLeaeQKLEELRRRRQEKeseeFEKL-RQKQQEAELEleelKKKREERRKLle 294
                          330       340
                   ....*....|....*....|....*....
gi 1907082233 1855 QRQLRDTKEEMSELARKEAEASRKKHELE 1883
Cdd:pfam02029  295 EEEQRRKQEEAERKLREEEEKRRMKEEIE 323
Prefoldin_2 pfam01920
Prefoldin subunit; This family includes prefoldin subunits that are not detected by pfam02996.
1752-1836 3.84e-03

Prefoldin subunit; This family includes prefoldin subunits that are not detected by pfam02996.


Pssm-ID: 396482 [Multi-domain]  Cd Length: 102  Bit Score: 39.13  E-value: 3.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1752 AQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQS--------MVDKSLVSR-QEAKIRELETRLEFEKTQVKRLENL 1822
Cdd:pfam01920    2 NKFQQLQQQLQLLAQQIKQLETQLKELELALEELELLdedtkvykLIGDVLVKQdKEEVKEQLEERKETLEKEIKTLEKQ 81
                           90
                   ....*....|....
gi 1907082233 1823 ASRLKETMEKLTEE 1836
Cdd:pfam01920   82 LEKLEKELEELKEE 95
ATG17_like pfam04108
Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ...
1685-1871 4.19e-03

Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ATG17 and ATG11, conserved across eukaryotes. ATG17 forms a complex with ATG29 and ATG31, critical for both PAS (preautophagosomal structure) formation and autophagy. Together with ATG13, it is required for ATG1 kinase activation. ATG11 is a scaffold protein required for the cytoplasm-to-vacuole targeting (Cvt) pathway during starvation and to recruit ATG proteins to the pre-autophagosome. It is also required for ATG1 kinase activation. In many eukaryotes, ATG11 (the orthologue in mammals is RB1-inducible coiled-coil protein 1 (RB1CC1) and in S. pombe is Taz1-interacting factor 1 (taf1)) is essential for bulk autophagy, except in S.cerevisiae. ATG17 and ATG11 are large similar proteins, both predicted to be almost entirely helical, containing conserved coiled-coil regions and lack obvious functional motifs.


Pssm-ID: 427715 [Multi-domain]  Cd Length: 360  Bit Score: 41.99  E-value: 4.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1685 KAMEVEMEDLHLQIDDIAKAktaLEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLqaQIEES 1764
Cdd:pfam04108   20 RSLLEELVVLLAKIAFLRRG---LSVQLANLEKVREGLEKVLNELKKDFKQLLKDLDAALERLEETLDKLRNT--PVEPA 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1765 NKEKQELQEKLqalqsqvefleQSMVDKSLVSRQEAKIRELETRLefeKTQVKRLENLASRLKETMEKLTEERDQRAAAe 1844
Cdd:pfam04108   95 LPPGEEKQKTL-----------LDFIDEDSVEILRDALKELIDEL---QAAQESLDSDLKRFDDDLRDLQKELESLSSP- 159
                          170       180
                   ....*....|....*....|....*..
gi 1907082233 1845 NREKEQNKRLQRQLRDTKEEMSELARK 1871
Cdd:pfam04108  160 SESISLIPTLLKELESLEEEMASLLES 186
DUF4200 pfam13863
Domain of unknown function (DUF4200); This family is found in eukaryotes. It is a coiled-coil ...
1683-1787 4.20e-03

Domain of unknown function (DUF4200); This family is found in eukaryotes. It is a coiled-coil domain of unknwon function.


Pssm-ID: 464003 [Multi-domain]  Cd Length: 119  Bit Score: 39.47  E-value: 4.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1683 ARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIE 1762
Cdd:pfam13863   11 VQLALDAKREEIERLEELLKQREEELEKKEQELKEDLIKFDKFLKENDAKRRRALKKAEEETKLKKEKEKEIKKLTAQIE 90
                           90       100
                   ....*....|....*....|....*
gi 1907082233 1763 ESNKEKQELQEKLQALQSQVEFLEQ 1787
Cdd:pfam13863   91 ELKSEISKLEEKLEEYKPYEDFLEK 115
ATG17_like pfam04108
Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ...
1523-1879 4.27e-03

Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ATG17 and ATG11, conserved across eukaryotes. ATG17 forms a complex with ATG29 and ATG31, critical for both PAS (preautophagosomal structure) formation and autophagy. Together with ATG13, it is required for ATG1 kinase activation. ATG11 is a scaffold protein required for the cytoplasm-to-vacuole targeting (Cvt) pathway during starvation and to recruit ATG proteins to the pre-autophagosome. It is also required for ATG1 kinase activation. In many eukaryotes, ATG11 (the orthologue in mammals is RB1-inducible coiled-coil protein 1 (RB1CC1) and in S. pombe is Taz1-interacting factor 1 (taf1)) is essential for bulk autophagy, except in S.cerevisiae. ATG17 and ATG11 are large similar proteins, both predicted to be almost entirely helical, containing conserved coiled-coil regions and lack obvious functional motifs.


Pssm-ID: 427715 [Multi-domain]  Cd Length: 360  Bit Score: 41.99  E-value: 4.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1523 RDLEAKVKDQEEELDEQagsIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVeearqscQKKLKQmevqleeeYEDKQK 1602
Cdd:pfam04108   52 EGLEKVLNELKKDFKQL---LKDLDAALERLEETLDKLRNTPVEPALPPGEEK-------QKTLLD--------FIDEDS 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1603 AlrekRELESKLSTLSDQVNQRdfesEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEftcaaavk 1682
Cdd:pfam04108  114 V----EILRDALKELIDELQAA----QESLDSDLKRFDDDLRDLQKELESLSSPSESISLIPTLLKELESLE-------- 177
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1683 arKAMEVEMEDLHLQIDDIAKAKTALEEqlsrlqrEKNEIQNRLEEDQEDMNELMKKHKAAVaqasRDMAQMND-LQAQI 1761
Cdd:pfam04108  178 --EEMASLLESLTNHYDQCVTAVKLTEG-------GRAEMLEVLENDARELDDVVPELQDRL----DEMENNYErLQKLL 244
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1762 EESNKEKQELQEKLQalqsqvefleqsmvdkslvsrqeaKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEE-RDQR 1840
Cdd:pfam04108  245 EQKNSLIDELLSALQ------------------------LIAEIQSRLPEYLAALKEFEERWEEEKETIEDYLSElEDLR 300
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082233 1841 AAAEN-------------REKEQNKRLQRQLRDTKEEMSELARKEAEAsRKK 1879
Cdd:pfam04108  301 EFYEGfpsaygsllleveRRREWAEKMKKILRKLAEELDRLQEEERKR-REK 351
CAGE1 pfam15066
Cancer-associated gene protein 1 family; CAGE-1 is a family of proteins overexpressed in ...
1667-1891 4.58e-03

Cancer-associated gene protein 1 family; CAGE-1 is a family of proteins overexpressed in tumour tissues compared with surrounding tissues. CAGE-1 gene showed testis-specific expression among normal tissues and displayed wide expression in a variety of cancer cell lines and cancer tissues. CAGE-1 is predominantly expressed during post-meiotic stages. It localizes to the acrosomal matrix and acrosomal granule showing it to be a component of the acrosome of mammalian spermatids and spermatozoa.


Pssm-ID: 464481  Cd Length: 528  Bit Score: 42.13  E-value: 4.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1667 KNQLEESEFTCAAAVKARKAMEVEMEDLHLQ--IDDIAKAKTALEEQLS---RLQREKNEIQNRLEEDQEdmnelmkkhk 1741
Cdd:pfam15066  329 KQQMQIQDLQCSNLYLEKKVKELQMKITKQQvfVDIINKLKENVEELIEdkyNVILEKNDINKTLQNLQE---------- 398
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1742 aavaqasrdmaQMNDLQAQIEESNKEKQELQEKLQALQ-SQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLE 1820
Cdd:pfam15066  399 -----------ILANTQKHLQESRKEKETLQLELKKIKvNYVHLQERYITEMQQKNKSVSQCLEMDKTLSKKEEEVERLQ 467
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082233 1821 nlasRLKETMEKLTEerdqrAAAENREKEQNKRLQRQLRDTKeemsELARKEAEASRKKHELEMDLESLEA 1891
Cdd:pfam15066  468 ----QLKGELEKATT-----SALDLLKREKETREQEFLSLQE----EFQKHEKENLEERQKLKSRLEKLVA 525
ALIX_LYPXL_bnd pfam13949
ALIX V-shaped domain binding to HIV; The binding of the LYPxL motif of late HIV p6Gag and EIAV ...
1692-1945 4.69e-03

ALIX V-shaped domain binding to HIV; The binding of the LYPxL motif of late HIV p6Gag and EIAV p9Gag to this domain is necessary for viral budding.This domain is generally central between an N-terminal Bro1 domain, pfam03097 and a C-terminal proline-rich domain. The retroviruses thus used this domain to hijack the ESCRT system of the cell.


Pssm-ID: 464053 [Multi-domain]  Cd Length: 294  Bit Score: 41.45  E-value: 4.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1692 EDLHLQIDDIAKAKTALEEQLsrlqrekNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMA------QMNDLQAQIEESN 1765
Cdd:pfam13949   20 ERLEKSLDDLPKLKQRNREIL-------DEAEKLLDEEESEDEQLRAKYGTRWTRPPSSELtatlraEIRKYREILEQAS 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1766 KEKQELQEKLQALQSQVEFLEQSMVD--KSLVSRQEAKIREletrlefektqvkRLENLASRLKETMEKLTEERDQRAAA 1843
Cdd:pfam13949   93 ESDSQVRSKFREHEEDLELLSGPDEDleAFLPSSRRAKNSP-------------SVEEQVAKLRELLNKLNELKREREQL 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1844 EN--REKEQNKRLQRQLRDTKEEMSELARKEA---EASRKKHELEMDLESLEAANQSLQADLKLAFKRIgdLQAAIEDEM 1918
Cdd:pfam13949  160 LKdlKEKARNDDISPKLLLEKARLIAPNQEEQlfeEELEKYDPLQNRLEQNLHKQEELLKEITEANNEF--LQDKRVDSE 237
                          250       260
                   ....*....|....*....|....*..
gi 1907082233 1919 ESDENEDLINSLQDMVTKYQKKKNKLE 1945
Cdd:pfam13949  238 KQRQREEALQKLENAYDKYKELVSNLQ 264
LXG pfam04740
LXG domain of WXG superfamily; This domain is present is the N-terminal region of a group of ...
1719-1814 5.40e-03

LXG domain of WXG superfamily; This domain is present is the N-terminal region of a group of polymorphic toxin proteins in bacteria. It is predicted to use Type VII secretion pathway to mediate export of bacterial toxins.


Pssm-ID: 428100 [Multi-domain]  Cd Length: 202  Bit Score: 40.69  E-value: 5.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1719 KNEIQNRLEEDQEDMNELMKKHKAAVAQASR----DMAQMNDLQAQIEESNKEKQELQEKLQAL-QSQVEFLEQSmvdKS 1793
Cdd:pfam04740  101 EHELENGLKKAKEKTEELTDEINSILASVSDivslPKLSDSEVQDSLQKAKKKVKDTIEKLYDFdQEQTSELSEL---EA 177
                           90       100
                   ....*....|....*....|.
gi 1907082233 1794 LVSRQEAKIRELETRLEFEKT 1814
Cdd:pfam04740  178 DLQALKTYVSELEEMTSNGKI 198
PRK11637 PRK11637
AmiB activator; Provisional
1711-1900 6.28e-03

AmiB activator; Provisional


Pssm-ID: 236942 [Multi-domain]  Cd Length: 428  Bit Score: 41.60  E-value: 6.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1711 QLSRLQREKNEIQNRLEEDQEDMNEL---MKKHKAAVAQASRdmaQMNDLQAQIEESNKEKQELQEKLQALQSQvefleQ 1787
Cdd:PRK11637    48 QLKSIQQDIAAKEKSVRQQQQQRASLlaqLKKQEEAISQASR---KLRETQNTLNQLNKQIDELNASIAKLEQQ-----Q 119
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1788 SMVDKSLVSRQEAKIRE-----LETRLEFEKTQVK-RLENLASRL----KETMEKLTEERDQrAAAENREKEQNKRLQRQ 1857
Cdd:PRK11637   120 AAQERLLAAQLDAAFRQgehtgLQLILSGEESQRGeRILAYFGYLnqarQETIAELKQTREE-LAAQKAELEEKQSQQKT 198
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1907082233 1858 LRDTKEEmSELARKEAEASRKKhelemDLESLEAANQSLQADL 1900
Cdd:PRK11637   199 LLYEQQA-QQQKLEQARNERKK-----TLTGLESSLQKDQQQL 235
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1230-1453 6.50e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 41.93  E-value: 6.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1230 SEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELtdeRNTGESASQLLDAETAERLRTekEMKELQT 1309
Cdd:COG3206    203 QKNGLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQL---GSGPDALPELLQSPVIQQLRA--QLAELEA 277
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1310 QYDALkkqmevmemevmearliraaeingevddddaggewRLKYERAVREVdftkKRLQQELEdkmEVEQQSRRQLERRL 1389
Cdd:COG3206    278 ELAEL-----------------------------------SARYTPNHPDV----IALRAQIA---ALRAQLQQEAQRIL 315
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082233 1390 GDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSeLSQAHEETQ 1453
Cdd:COG3206    316 ASLEAELEALQAREASLQAQLAQLEARLAELPELEAELRRLEREVEVARELYES-LLQRLEEAR 378
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
15-305 7.91e-03

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 41.59  E-value: 7.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   15 KEEDKGVPKPGPPGKEEGAPEASSKDGSAPPRSPQPATSPVPSETSQTAKSPEPTLTMnglgAASAEGPNEEAQGLSRKR 94
Cdd:COG5180    225 RPEAASSPKVDPPSTSEARSRPATVDAQPEMRPPADAKERRRAAIGDTPAAEPPGLPV----LEAGSEPQSDAPEAETAR 300
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233   95 VANavRKVVSKVLPGEEPGtakEPPGrGARSPEHPARGKKGEKAATSPKPPPPPPPPPAPPK--------PETKKEAAKD 166
Cdd:COG5180    301 PID--VKGVASAPPATRPV---RPPG-GARDPGTPRPGQPTERPAGVPEAASDAGQPPSAYPpaeeavpgKPLEQGAPRP 374
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  167 ELSVGLRSLMSRGRGKD-HKSRGKQPPGKGEKAPSQEPASTGKSGSPDMVDSPKKAG---SPAKPETPNKRCSP-APAQE 241
Cdd:COG5180    375 GSSGGDGAPFQPPNGAPqPGLGRRGAPGPPMGAGDLVQAALDGGGRETASLGGAAGGagqGPKADFVPGDAESVsGPAGL 454
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907082233  242 LADP-NLAGPKLNPSGKQQAKSPASDVQQETGADPEVRLSPAEEAHQRLERIFTASLDPEAASPA 305
Cdd:COG5180    455 ADQAgAAASTAMADFVAPVTDATPVDVADVLGVRPDAILGGNVAPASGLDAETRIIEAEGAPATE 519
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1230-1448 8.00e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 40.97  E-value: 8.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1230 SEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDerntgesasqlldaetaerlrTEKEMKELQT 1309
Cdd:COG3883     14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEA---------------------LQAEIDKLQA 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1310 QYDALKKQmeVMEMEVMEARLIRAAEINGEVDDDDA---GGE------WRLKYERAVREVDFTKKRLQQELEDKMEVEQQ 1380
Cdd:COG3883     73 EIAEAEAE--IEERREELGERARALYRSGGSVSYLDvllGSEsfsdflDRLSALSKIADADADLLEELKADKAELEAKKA 150
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082233 1381 SRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQA 1448
Cdd:COG3883    151 ELEAKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAA 218
TMF_TATA_bd pfam12325
TATA element modulatory factor 1 TATA binding; This is the C-terminal conserved coiled coil ...
1814-1889 8.35e-03

TATA element modulatory factor 1 TATA binding; This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organization in animal and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family.


Pssm-ID: 432481 [Multi-domain]  Cd Length: 115  Bit Score: 38.30  E-value: 8.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1814 TQVKRLENLASRLKETMEKLTEERDQrAAAE----NREKEQNKRLQRQLRDTKEEMSELARKEA-------EASRKKHEL 1882
Cdd:pfam12325   19 STIRRLEGELASLKEELARLEAQRDE-ARQEivklMKENEELKELKKELEELEKELKELEQRYEttlellgEKSEEVEEL 97

                   ....*..
gi 1907082233 1883 EMDLESL 1889
Cdd:pfam12325   98 KADVEDL 104
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1225-1951 8.45e-03

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 41.74  E-value: 8.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1225 IQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNelrlssDRLETRISEltseltderntgesasqlLDAETAERLRTEKEM 1304
Cdd:PTZ00440   945 IEKQLSDTKINNLKMQIEKTLEYYDKSKENIN------GNDGTHLEK------------------LDKEKDEWEHFKSEI 1000
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1305 KELQTQYDALKKQMEVMemevmearliraaeINGEVDD---------DDAGGEWRLKYERAVREVDFTKKRLQ-----QE 1370
Cdd:PTZ00440  1001 DKLNVNYNILNKKIDDL--------------IKKQHDDiielidkliKEKGKEIEEKVDQYISLLEKMKTKLSsfhfnID 1066
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1371 LE-DKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKcqrltaeLQDTKLHLEGQQVRNHELEKKQRRFDSELSQAH 1449
Cdd:PTZ00440  1067 IKkYKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNK-------SHEHVVNADKEKNKQTEHYNKKKKSLEKIYKQM 1139
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1450 EETQREKLQREklqrEKDMLLAEAFSLKQQMEEK--DLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDL-- 1525
Cdd:PTZ00440  1140 EKTLKELENMN----LEDITLNEVNEIEIEYERIliDHIVEQINNEAKKSKTIMEEIESYKKDIDQVKKNMSKERNDHlt 1215
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1526 EAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQThsKEMESRDEEVEEARQscqkklkqmevQLEEEYEDKQKALR 1605
Cdd:PTZ00440  1216 TFEYNAYYDKATASYENIEELTTEAKGLKGEANRSTNV--DELKEIKLQVFSYLQ-----------QVIKENNKMENALH 1282
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1606 EKRELESKL-STLSDQV-------NQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNApSKREIAQLKNQLEESeftC 1677
Cdd:PTZ00440  1283 EIKNMYEFLiSIDSEKIlkeilnsTKKAEEFSNDAKKELEKTDNLIKQVEAKIEQAKEHK-NKIYGSLEDKQIDDE---I 1358
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1678 AAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEI---QNRLEEDQEDMNEL-MKKHKAAVAQASRDMAQ 1753
Cdd:PTZ00440  1359 KKIEQIKEEISNKRKEINKYLSNIKSNKEKCDLHVRNASRGKDKIdflNKHEAIEPSNSKEVnIIKITDNINKCKQYSNE 1438
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1754 MNDLQAQIEESNKEKQELQEKLQALQSQVEFLEqsMVDKSLVSRQEAkiRELETRLEFEKTQVK-RLENLASRLKETMEK 1832
Cdd:PTZ00440  1439 AMETENKADENNDSIIKYEKEITNILNNSSILG--KKTKLEKKKKEA--TNIMDDINGEHSIIKtKLTKSSEKLNQLNEQ 1514
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1833 LTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELarkeaeaSRKKHELEmdlESLEAANqslqaDLKLAFKRIGDLQA 1912
Cdd:PTZ00440  1515 PNIKREGDVLNNDKSTIAYETIQYNLGRVKHNLLNI-------LNIKDEIE---TILNKAQ-----DLMRDISKISKIVE 1579
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|...
gi 1907082233 1913 AIE-DEMESDENE--DLINSLQDMVTKYQKKKNKLEG-DSDVD 1951
Cdd:PTZ00440  1580 NKNlENLNDKEADyvKYLDNILKEKQLMEAEYKKLNEiYSDVD 1622
SynN cd00179
Syntaxin N-terminus domain; syntaxins are nervous system-specific proteins implicated in the ...
1712-1867 8.69e-03

Syntaxin N-terminus domain; syntaxins are nervous system-specific proteins implicated in the docking of synaptic vesicles with the presynaptic plasma membrane; they are a family of receptors for intracellular transport vesicles; each target membrane may be identified by a specific member of the syntaxin family; syntaxins contain a moderately well conserved amino-terminal domain, called Habc, whose structure is an antiparallel three-helix bundle; a linker of about 30 amino acids connects this to the carboxy-terminal region, designated H3 (t_SNARE), of the syntaxin cytoplasmic domain; the highly conserved H3 region forms a single, long alpha-helix when it is part of the core SNARE complex and anchors the protein on the cytoplasmic surface of cellular membranes; H3 is not included in defining this domain


Pssm-ID: 238105 [Multi-domain]  Cd Length: 151  Bit Score: 39.19  E-value: 8.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1712 LSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQsmvd 1791
Cdd:cd00179      1 LEEFFEEVEEIRGNIDKISEDVEELQKLHSQLLTAPDADPELKQELESLVQEIKKLAKEIKGKLKELEESNEQNEA---- 76
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082233 1792 ksLVSRQEAKIReletrlefeKTQVkrlENLASRLKETMEKLTEERdQRAAAENREkeqnkRLQRQLRDTKEEMSE 1867
Cdd:cd00179     77 --LNGSSVDRIR---------KTQH---SGLSKKFVEVMTEFNKAQ-RKYRERYKE-----RIQRQLEITGGEATD 132
HAUS-augmin3 pfam14932
HAUS augmin-like complex subunit 3; This domain is subunit three of the augmin complex found ...
1692-1785 8.95e-03

HAUS augmin-like complex subunit 3; This domain is subunit three of the augmin complex found from Drosophila to humans. The HAUS-augmin complex is made up of eight subunits. The augmin complex interacts with gamma-TuRC, and attenuation of this interaction severely impairs spindle MT generation. Furthermore, we provide evidence that human augmin plays critical and non-redundant roles in the kinetochore-MT attachment and also central spindle formation during anaphase in human cells.The HAUS complex is required for mitotic spindle assembly and for maintenance of centrosome integrity.


Pssm-ID: 464384 [Multi-domain]  Cd Length: 261  Bit Score: 40.38  E-value: 8.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1692 EDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKekqEL 1771
Cdd:pfam14932   70 EALEESLEEIREATEDLEAELQELQKTKQLKINRLNKLQAQASSLSQGLRALVAEEEEAAKQLEELQEELAALNA---KT 146
                           90
                   ....*....|....
gi 1907082233 1772 QEKLQALQSQVEFL 1785
Cdd:pfam14932  147 NNVLQSLQSEVKEL 160
mS26_Tt cd23695
Tetrahymena thermophila ribosomal protein mS26 and similar proteins; Ribosomal protein mS26 is ...
1348-1670 8.96e-03

Tetrahymena thermophila ribosomal protein mS26 and similar proteins; Ribosomal protein mS26 is a component of small subunit (SSU) in Tetrahymena thermophila mitochondrial ribosome (mitoribosome). The structure of the mitoribosome reveals an assembly of 94-ribosomal proteins and four-rRNAs with an additional protein mass of ~700 kDa on the small subunit; the large mitoribosomal subunit (LSU) lacks 5S rRNA.


Pssm-ID: 467909 [Multi-domain]  Cd Length: 496  Bit Score: 41.35  E-value: 8.96e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1348 EWRLK--YERAVREVDFTKKRLQQELEDKMEveqqsrrQLERRLGDLQADSDESqraLQQLKKKCQRLTAELQDtklhle 1425
Cdd:cd23695    193 LFKLLkeYQDAKAIIIEDFRESSEEGAEKLE-------KLEKAFATLLKNYKEE---LEEPEKQLEFMQKRLLD------ 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1426 gqqvrnheLEKKQRRFDSELSQAH--EETQREKLQREKLQREKdmllaeaFSLKQQMEEKDLDIAGFTQKVVSLEAELQD 1503
Cdd:cd23695    257 --------LYNLLRLWGQYITIVKmpDSVVRDIMNKTQARPEV-------AKLNSKQELEDAKNRKRDTEENEFDDDYES 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1504 ISSQESKDEaslakvkkqlrdleakvkdqEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQ 1583
Cdd:cd23695    322 ADEGETSDE--------------------EDEIEEENFQLQKEKKKEEELNAEFNIAKNSLYKFSPQNDKNVVDDRDFYS 381
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1584 KklKQMEVQLEEE------------YEDKQKALREKRELESkLSTLSDQVNQRDFESEKRLRKDLKrtkalladAQIMLD 1651
Cdd:cd23695    382 G--VDLENVFPRAllnnlndftgldFQNVKEILNNEEKLKI-IQGEDDQNDQEDFNNPRKFQTSLI--------VQTYKQ 450
                          330       340
                   ....*....|....*....|.
gi 1907082233 1652 HLKN--NAPSKREIAQLKNQL 1670
Cdd:cd23695    451 KINNldAESLTRATQEKKNDI 471
46 PHA02562
endonuclease subunit; Provisional
1366-1546 9.50e-03

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 41.15  E-value: 9.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1366 RLQQELEDKMEVEQQSRRQLER---RLGDLQADSDESQRALQQLKKKCQRLTAELQ----DTKLHLEG-------QQVrn 1431
Cdd:PHA02562   217 RKQNKYDELVEEAKTIKAEIEEltdELLNLVMDIEDPSAALNKLNTAAAKIKSKIEqfqkVIKMYEKGgvcptctQQI-- 294
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233 1432 HELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLA---EAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQE 1508
Cdd:PHA02562   295 SEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEqskKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEF 374
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1907082233 1509 SKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQML 1546
Cdd:PHA02562   375 VDNAEELAKLQDELDKIVKTKSELVKEKYHRGIVTDLL 412
OmpH smart00935
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
1491-1588 9.62e-03

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 214922 [Multi-domain]  Cd Length: 140  Bit Score: 38.72  E-value: 9.62e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082233  1491 TQKVVSLEAELQDISSQeskDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSiqMLEQAKLRLEMEMERMRQTHSKEMES 1570
Cdd:smart00935    6 VQKILQESPAGKAAQKQ---LEKEFKKRQAELEKLEKELQKLKEKLQKDAAT--LSEAAREKKEKELQKKVQEFQRKQQK 80
                            90
                    ....*....|....*...
gi 1907082233  1571 RDEEVEEARQSCQKKLKQ 1588
Cdd:smart00935   81 LQQDLQKRQQEELQKILD 98
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH