View
Concise Results
Standard Results
Full Results
Nipped-B homolog (Drosophila), isoform CRA_c [Homo sapiens]
Protein Classification
List of domain hits
Name
Accession
Description
Interval
E-value
SCC2
cd23958
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid ...
1263-2473
0e+00
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid cohesion protein 2 (Scc2) and its homolog (Scc2 homolog, also called Nipped-B-like protein or NIPBL). Scc2/NIPBL and Scc4 form a complex that is responsible for loading the cohesin protein onto sister chromatids during mitosis and meiosis. Cohesin is a ring-shaped protein complex that encircles the sister chromatids and helps to hold them together until they are ready to be separated during cell division. In addition to its role in chromosome segregation, cohesin also plays important roles in other cellular processes such as transcription, chromosome condensation, and DNA repair.
:Pssm-ID: 467937 [Multi-domain]
Cd Length: 1197
Bit Score: 1442.10
E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1263 V KV L N ILE K NI Q DG SK L S t L LNHNNDTEE EERLW RDLIME R VTKS ADA C LT tin I M TSP NM PK AV Y I ED V IERV IQYT KF 1342
Cdd:cd23958 3 V RL L T ILE R NI R DG ES L D - L DLDESQEDD EERLW LLERID R ALEA ADA S LT --- I L TSP GL PK QL Y S ED L IERV VDFL KF 78
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1343 H L Q NT L YP Q YDPVYR L D PHGGGL lss K A KRAK C S TH K QRVIVM L Y NK V C DIV S S L S ELL EI Q L LTD TT ILQ VSSMG I T PF 1422
Cdd:cd23958 79 Q L E NT I YP A YDPVYR S D SSAKAG --- K K KRAK A S SK K KKSVST L L NK L C ELL S L L A ELL SL Q S LTD SV ILQ LVYLA I S PF 155
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1423 FVE ---- NV S ELQL C A I KL V T AV FSRY EKH RQ L I L EEI FT SLA R LP T SKR S LR N FRLN SSD mdgepm Y IQMVTAL V LQL I 1498
Cdd:cd23958 156 FVE navs NV D ELQL S A L KL L T SI FSRY PDQ RQ F I I EEI LS SLA K LP S SKR N LR Q FRLN DGK ------ S IQMVTAL L LQL V 229
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1499 Q CV V H LP SS EK D S N ----- A E EDSNKKI D QDVVITN SYE T A M R T A QN FLS IF L K KC GS K -- QGEE DYRPLFENFVQDLL S 1571
Cdd:cd23958 230 Q SS V K LP NL EK E S S rdksl E E DSDELLE D EESALAK SYE S A V R I A SY FLS FL L Q KC TK K kk EKDT DYRPLFENFVQDLL T 309
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1572 TV N K PEWPAAELLLSLLGRLLV HQ FSNK S T EMAL RV AS LD Y LG TV AARLRKDA VT skmdqgsierilkqvsggedei QQ L 1651
Cdd:cd23958 310 VL N L PEWPAAELLLSLLGRLLV SI FSNK K T DANA RV MA LD L LG LI AARLRKDA LA ---------------------- EE L 367
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1652 QKALLDYL D EN TET DPSL VFS R K FY I AQW F RD TTL E T EKA M K sqkdeessegthh A K E I E T T GQIMHRA E N RKKFL R S I i 1731
Cdd:cd23958 368 QKALLDYL A EN SSS DPSL ESA R G FY L AQW L RD LSN E L EKA E K ------------- A A E E E D T ILKLELS E L RKKFL D S K - 433
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1732 kttpsq FSTLKMNSDTVDYD DA C L IV R Y LAS M RP FA QSFD IY L T Q I L RV L G E N A IAV RTKA M K C LS E VV AV DPSIL ARL D 1811
Cdd:cd23958 434 ------ ILSKEEEASPLSRE DA K L LY R A LAS Q RP LS QSFD PI L K Q L L SS L D E P A VTL RTKA L K A LS L VV EA DPSIL GDP D 507
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1812 M QR G V H GRL M D N S T SVREAAVEL L G RFVLC RP Q LAEQYY D M LI ERILDTG I SVRKRVIKILRDI CIEQ P T F PKITEM CV K 1891
Cdd:cd23958 508 V QR A V E GRL L D S S A SVREAAVEL V G KYISS RP D LAEQYY E M IA ERILDTG V SVRKRVIKILRDI YLRT P D F EIKVDI CV R 587
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1892 MI RR V ND - EE G IK K L VNE TFQ K LWFTP T P HN ----- DKE AMTRKI L N I T DVVAACR d T G Y D WF EQLL QN LLKS E ED SSY K 1965
Cdd:cd23958 588 LL RR I ND e EE S IK D L ARK TFQ E LWFTP F P ES sspaq DKE SLAERV L L I V DVVAACR - K G L D LL EQLL KR LLKS K ED KED K 666
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1966 P V K KAC T QLVD N LVE H IL KY EE SLAD S dnkgv NSGR LVAC IT TL F LF S K IR P - Q L M V K HA M T M QPYL TT KCST QN D FM V I 2044
Cdd:cd23958 667 S V R KAC K QLVD C LVE L IL EL EE DDDE S ----- SESD LVAC LS TL H LF A K AD P k L L L V E HA E T L QPYL KS KCST RE D QQ V L 741
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2045 CN V AK IL EL V V PL ME HPSE T FL ATI EEDL M KL II K YGM TV V Q HCVS CL G AVVNK V T Q N FKFVWACFNRYYGAIS K L K S Q H 2124
Cdd:cd23958 742 RY V LR IL RS V L PL LS HPSE S FL EEL EEDL L KL LL K HSV TV L Q EAIA CL C AVVNK L T K N YERLRKALQSCLKLLR K Y K R Q A 821
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2125 QE DP NNT sll TNK P A LLR S L FTV G A L C R HF DFD L E DFKGN ----- S K VNI K DK V LE LL MY FTK HS - DE E V QT KA IIG LGF 2198
Cdd:cd23958 822 NL DP SSL --- KED P K LLR L L YIL G L L A R YC DFD S E RDDFE kaplk T K ESV K EL V FD LL LF FTK PP i DE D V RK KA LQA LGF 898
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2199 AF I Q HP S L MFEQ EV KN L YNN IL SD kn S S VN LK I QVL K NLQ TY LQ E E DT RM QQ AD RD WKK VA K Q ----- E D L KEMGD VS SG 2273
Cdd:cd23958 899 LC I A HP K L FLSP EV LK L LDE IL AS -- G S LK LK L QVL R NLQ EF LQ A E EK RM EA AD AE WKK NS K A advkv L D G KEMGD AD SG 976
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2274 MS SSIMQ L YLK QV LE AFFHTQ S S VR HF AL N V IA L T L N QGL I HP V QCVP Y LIA MG TDP E PA M R NK A DQQ L V E IDK KY AGFI 2353
Cdd:cd23958 977 VA SSIMQ R YLK DI LE LCLSSD S Q VR LA AL K V LE L I L R QGL V HP I QCVP T LIA LE TDP N PA I R KL A LRL L K E LHE KY ESLV 1056
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2354 HM K AVA G MKMSY Q V Q QAINT cl KDPV RGFR Q D ESSS AL CSH LYS MI RGNR QH RR A FL I SLL N LFD D ------ TAKT D VTM 2427
Cdd:cd23958 1057 ES K YLE G VRLAF Q Y Q KRLAG -- DTRG RGFR T D SPPT AL LGR LYS LL RGNR KS RR K FL K SLL K LFD F dlkkss DSPS D LDF 1134
1210 1220 1230 1240
....*....|....*....|....*....|....*....|....*..
gi 119576354 2428 LL YI A D NLA CF PYQTQ E EPLF IM H H ID IT LSV S GS N LLQ SF - K E S MV 2473
Cdd:cd23958 1135 LL FL A E NLA FL PYQTQ D EPLF VI H T ID RI LSV T GS S LLQ AI a K A S QA 1181
PspC_subgroup_2 super family
cl41463
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
597-832
3.04e-15
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
The actual alignment was detected with superfamily member NF033839 :Pssm-ID: 468202 [Multi-domain]
Cd Length: 557
Bit Score: 81.74
E-value: 3.04e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 597 PE TPK KKSDPEL S -- K SE M KQ S - ESRLA E S KP N - E NRLV E T K SSEN K LETK V ET Q T E EL K QNESRTT E CKQN E sti V E P K 672
Cdd:NF033839 281 QD TPK EPGNKKP S ap K PG M QP S p QPEKK E V KP E p E TPKP E V K PQLE K PKPE V KP Q P E KP K PEVKPQL E TPKP E --- V K P Q 357
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 673 qnenr LSDT KP NDNK Q NNGRSETT K SR PETPK QKGESR PE T PK QKSDGH PE T PK QKGDGR PE T PK QKGESR PE T PK QKNE 752
Cdd:NF033839 358 ----- PEKP KP EVKP Q PEKPKPEV K PQ PETPK PEVKPQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEVK 432
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 753 GR PE T PK HRHDNRRDSG KP STEKK PE VS K HKQDTKSDS P R ------ LKSERAEAL K QRP D GR -- S VSES L RR D hdn KQ K S 824
Cdd:NF033839 433 PQ PE K PK PEVKPQPEKP KP EVKPQ PE TP K PEVKPQPEK P K pevkpq PEKPKPDNS K PQA D DK kp S TPNN L SK D --- KQ P S 509
....*...
gi 119576354 825 DDRGES E R 832
Cdd:NF033839 510 NQASTN E K 517
PTZ00121 super family
cl31754
MAEBL; Provisional
476-1092
3.65e-12
MAEBL; Provisional
The actual alignment was detected with superfamily member PTZ00121 :Pssm-ID: 173412 [Multi-domain]
Cd Length: 2084
Bit Score: 72.87
E-value: 3.65e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 476 E IERI E RESAIERERFSK E VQDK D KPL KK RKQDSYPQ EA GGATGGNRPASQETGSTGNGSRP A LMVSIDLHQ A GRVDSQ A 555
Cdd:PTZ00121 1282 E LKKA E EKKKADEAKKAE E KKKA D EAK KK AEEAKKAD EA KKKAEEAKKKADAAKKKAEEAKK A AEAAKAEAE A AADEAE A 1361
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 556 SITQ - DSDSI KK P E EI K QCND A PVSVLQEDIVGSL K STP E N hpet P KKK S D p EL S K SEMKQSESRL A ES K PN E NR lvet K 634
Cdd:PTZ00121 1362 AEEK a EAAEK KK E E AK K KADA A KKKAEEKKKADEA K KKA E E ---- D KKK A D - EL K K AAAAKKKADE A KK K AE E KK ---- K 1432
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 635 SS E N K LETKVETQTE E L K Q nes RTT E C K QN E S ti VEP K QN E NRLS D t KPNDNKQNNGRSETT K SRP E TP K Q K GES -- RPE 712
Cdd:PTZ00121 1433 AD E A K KKAEEAKKAD E A K K --- KAE E A K KA E E -- AKK K AE E AKKA D - EAKKKAEEAKKADEA K KKA E EA K K K ADE ak KAA 1506
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 713 TP K Q K S D ghp E TP K QK gdgrp E TP K QKGESRP E TP K QKN E GRPETP K HRH D NRRDS --- G K PSTE KK P E VS K HKQDT K SD 789
Cdd:PTZ00121 1507 EA K K K A D --- E AK K AE ----- E AK K ADEAKKA E EA K KAD E AKKAEE K KKA D ELKKA eel K K AEEK KK A E EA K KAEED K NM 1578
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 790 SP R lkse R AE AL K QRPDG R SV ------------- S E SLRRDHDN K Q K SDD -- RG E S E RHRGD Q SR ------ VRRP E T L RS 848
Cdd:PTZ00121 1579 AL R ---- K AE EA K KAEEA R IE evmklyeeekkmk A E EAKKAEEA K I K AEE lk KA E E E KKKVE Q LK kkeaee KKKA E E L KK 1654
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 849 SSRN ------------- E HGI K SDSS K TDKLER K HRH E SGDSRE --------- RPSSG E Q K SRPDSPR v K QGDS NK SRSD 906
Cdd:PTZ00121 1655 AEEE nkikaaeeakkae E DKK K AEEA K KAEEDE K KAA E ALKKEA eeakkaeel KKKEA E E K KKAEELK - K AEEE NK IKAE 1733
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 907 KLGFKSPTS K ---- DD K RT E GN K S K VDTN K AHPDN KAE fpsyllgg RSGAL K NF VI PKIKRDK D gnvtq E TKK ME MKGEP 982
Cdd:PTZ00121 1734 EAKKEAEED K kkae EA K KD E EE K K K IAHL K KEEEK KAE -------- EIRKE K EA VI EEELDEE D ----- E KRR ME VDKKI 1800
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 983 KD KVEKIGLVEDLN K GAKP V VVLQ K LSL D DVQ K LIK D REDKSRSSLKPIK ------- N KPSKSNKGSI D QSVL K E L PPEL 1055
Cdd:PTZ00121 1801 KD IFDNFANIIEGG K EGNL V INDS K EME D SAI K EVA D SKNMQLEEADAFE khkfnkn N ENGEDGNKEA D FNKE K D L KEDD 1880
650 660 670 680
....*....|....*....|....*....|....*....|.
gi 119576354 1056 LA EIE STM plc E RV K MN K ---- R KRSTV N EKP K YAE I SS D E 1092
Cdd:PTZ00121 1881 EE EIE EAD --- E IE K ID K ddie R EIPNN N MAG K NND I ID D K 1918
Name
Accession
Description
Interval
E-value
SCC2
cd23958
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid ...
1263-2473
0e+00
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid cohesion protein 2 (Scc2) and its homolog (Scc2 homolog, also called Nipped-B-like protein or NIPBL). Scc2/NIPBL and Scc4 form a complex that is responsible for loading the cohesin protein onto sister chromatids during mitosis and meiosis. Cohesin is a ring-shaped protein complex that encircles the sister chromatids and helps to hold them together until they are ready to be separated during cell division. In addition to its role in chromosome segregation, cohesin also plays important roles in other cellular processes such as transcription, chromosome condensation, and DNA repair.
Pssm-ID: 467937 [Multi-domain]
Cd Length: 1197
Bit Score: 1442.10
E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1263 V KV L N ILE K NI Q DG SK L S t L LNHNNDTEE EERLW RDLIME R VTKS ADA C LT tin I M TSP NM PK AV Y I ED V IERV IQYT KF 1342
Cdd:cd23958 3 V RL L T ILE R NI R DG ES L D - L DLDESQEDD EERLW LLERID R ALEA ADA S LT --- I L TSP GL PK QL Y S ED L IERV VDFL KF 78
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1343 H L Q NT L YP Q YDPVYR L D PHGGGL lss K A KRAK C S TH K QRVIVM L Y NK V C DIV S S L S ELL EI Q L LTD TT ILQ VSSMG I T PF 1422
Cdd:cd23958 79 Q L E NT I YP A YDPVYR S D SSAKAG --- K K KRAK A S SK K KKSVST L L NK L C ELL S L L A ELL SL Q S LTD SV ILQ LVYLA I S PF 155
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1423 FVE ---- NV S ELQL C A I KL V T AV FSRY EKH RQ L I L EEI FT SLA R LP T SKR S LR N FRLN SSD mdgepm Y IQMVTAL V LQL I 1498
Cdd:cd23958 156 FVE navs NV D ELQL S A L KL L T SI FSRY PDQ RQ F I I EEI LS SLA K LP S SKR N LR Q FRLN DGK ------ S IQMVTAL L LQL V 229
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1499 Q CV V H LP SS EK D S N ----- A E EDSNKKI D QDVVITN SYE T A M R T A QN FLS IF L K KC GS K -- QGEE DYRPLFENFVQDLL S 1571
Cdd:cd23958 230 Q SS V K LP NL EK E S S rdksl E E DSDELLE D EESALAK SYE S A V R I A SY FLS FL L Q KC TK K kk EKDT DYRPLFENFVQDLL T 309
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1572 TV N K PEWPAAELLLSLLGRLLV HQ FSNK S T EMAL RV AS LD Y LG TV AARLRKDA VT skmdqgsierilkqvsggedei QQ L 1651
Cdd:cd23958 310 VL N L PEWPAAELLLSLLGRLLV SI FSNK K T DANA RV MA LD L LG LI AARLRKDA LA ---------------------- EE L 367
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1652 QKALLDYL D EN TET DPSL VFS R K FY I AQW F RD TTL E T EKA M K sqkdeessegthh A K E I E T T GQIMHRA E N RKKFL R S I i 1731
Cdd:cd23958 368 QKALLDYL A EN SSS DPSL ESA R G FY L AQW L RD LSN E L EKA E K ------------- A A E E E D T ILKLELS E L RKKFL D S K - 433
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1732 kttpsq FSTLKMNSDTVDYD DA C L IV R Y LAS M RP FA QSFD IY L T Q I L RV L G E N A IAV RTKA M K C LS E VV AV DPSIL ARL D 1811
Cdd:cd23958 434 ------ ILSKEEEASPLSRE DA K L LY R A LAS Q RP LS QSFD PI L K Q L L SS L D E P A VTL RTKA L K A LS L VV EA DPSIL GDP D 507
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1812 M QR G V H GRL M D N S T SVREAAVEL L G RFVLC RP Q LAEQYY D M LI ERILDTG I SVRKRVIKILRDI CIEQ P T F PKITEM CV K 1891
Cdd:cd23958 508 V QR A V E GRL L D S S A SVREAAVEL V G KYISS RP D LAEQYY E M IA ERILDTG V SVRKRVIKILRDI YLRT P D F EIKVDI CV R 587
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1892 MI RR V ND - EE G IK K L VNE TFQ K LWFTP T P HN ----- DKE AMTRKI L N I T DVVAACR d T G Y D WF EQLL QN LLKS E ED SSY K 1965
Cdd:cd23958 588 LL RR I ND e EE S IK D L ARK TFQ E LWFTP F P ES sspaq DKE SLAERV L L I V DVVAACR - K G L D LL EQLL KR LLKS K ED KED K 666
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1966 P V K KAC T QLVD N LVE H IL KY EE SLAD S dnkgv NSGR LVAC IT TL F LF S K IR P - Q L M V K HA M T M QPYL TT KCST QN D FM V I 2044
Cdd:cd23958 667 S V R KAC K QLVD C LVE L IL EL EE DDDE S ----- SESD LVAC LS TL H LF A K AD P k L L L V E HA E T L QPYL KS KCST RE D QQ V L 741
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2045 CN V AK IL EL V V PL ME HPSE T FL ATI EEDL M KL II K YGM TV V Q HCVS CL G AVVNK V T Q N FKFVWACFNRYYGAIS K L K S Q H 2124
Cdd:cd23958 742 RY V LR IL RS V L PL LS HPSE S FL EEL EEDL L KL LL K HSV TV L Q EAIA CL C AVVNK L T K N YERLRKALQSCLKLLR K Y K R Q A 821
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2125 QE DP NNT sll TNK P A LLR S L FTV G A L C R HF DFD L E DFKGN ----- S K VNI K DK V LE LL MY FTK HS - DE E V QT KA IIG LGF 2198
Cdd:cd23958 822 NL DP SSL --- KED P K LLR L L YIL G L L A R YC DFD S E RDDFE kaplk T K ESV K EL V FD LL LF FTK PP i DE D V RK KA LQA LGF 898
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2199 AF I Q HP S L MFEQ EV KN L YNN IL SD kn S S VN LK I QVL K NLQ TY LQ E E DT RM QQ AD RD WKK VA K Q ----- E D L KEMGD VS SG 2273
Cdd:cd23958 899 LC I A HP K L FLSP EV LK L LDE IL AS -- G S LK LK L QVL R NLQ EF LQ A E EK RM EA AD AE WKK NS K A advkv L D G KEMGD AD SG 976
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2274 MS SSIMQ L YLK QV LE AFFHTQ S S VR HF AL N V IA L T L N QGL I HP V QCVP Y LIA MG TDP E PA M R NK A DQQ L V E IDK KY AGFI 2353
Cdd:cd23958 977 VA SSIMQ R YLK DI LE LCLSSD S Q VR LA AL K V LE L I L R QGL V HP I QCVP T LIA LE TDP N PA I R KL A LRL L K E LHE KY ESLV 1056
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2354 HM K AVA G MKMSY Q V Q QAINT cl KDPV RGFR Q D ESSS AL CSH LYS MI RGNR QH RR A FL I SLL N LFD D ------ TAKT D VTM 2427
Cdd:cd23958 1057 ES K YLE G VRLAF Q Y Q KRLAG -- DTRG RGFR T D SPPT AL LGR LYS LL RGNR KS RR K FL K SLL K LFD F dlkkss DSPS D LDF 1134
1210 1220 1230 1240
....*....|....*....|....*....|....*....|....*..
gi 119576354 2428 LL YI A D NLA CF PYQTQ E EPLF IM H H ID IT LSV S GS N LLQ SF - K E S MV 2473
Cdd:cd23958 1135 LL FL A E NLA FL PYQTQ D EPLF VI H T ID RI LSV T GS S LLQ AI a K A S QA 1181
Nipped-B_C
pfam12830
Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or ...
2275-2456
8.60e-70
Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or sister chromatid cohesion proteins.
Pssm-ID: 463722
Cd Length: 180
Bit Score: 232.43
E-value: 8.60e-70
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2275 S S SIM Q L YLK QV LE AFFHTQSS VR HF AL N V I AL T L N QGL I HP VQ C V P Y LIA MG T D P E P AM R NK A DQQLV E IDK K YAGFIH 2354
Cdd:pfam12830 1 C S ALV Q R YLK HI LE ICLSSDDQ VR LL AL E V L AL I L R QGL V HP KE C I P T LIA LE T S P N P YI R KL A FELHK E LHE K HESLLE 80
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2355 MKAVA G MKMSYQV Q QAINTC lkdpvrgf RQD E SSSALC S H LYS MI R G N RQH R RA FL I SL LN LF D D ------ TAKT D VTM L 2428
Cdd:pfam12830 81 SRYME G IRLAFEY Q RRVLSG -------- ATL E PPTSFL S L LYS LL R S N KKS R KK FL K SL VK LF F D ldlsse SSPS D LDF L 152
170 180
....*....|....*....|....*...
gi 119576354 2429 LYI A D NLA CF PYQTQ E E P LF IM HHID IT 2456
Cdd:pfam12830 153 RFL A E NLA FL PYQTQ D E V LF LI HHID RI 180
PspC_subgroup_2
NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
597-832
3.04e-15
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain]
Cd Length: 557
Bit Score: 81.74
E-value: 3.04e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 597 PE TPK KKSDPEL S -- K SE M KQ S - ESRLA E S KP N - E NRLV E T K SSEN K LETK V ET Q T E EL K QNESRTT E CKQN E sti V E P K 672
Cdd:NF033839 281 QD TPK EPGNKKP S ap K PG M QP S p QPEKK E V KP E p E TPKP E V K PQLE K PKPE V KP Q P E KP K PEVKPQL E TPKP E --- V K P Q 357
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 673 qnenr LSDT KP NDNK Q NNGRSETT K SR PETPK QKGESR PE T PK QKSDGH PE T PK QKGDGR PE T PK QKGESR PE T PK QKNE 752
Cdd:NF033839 358 ----- PEKP KP EVKP Q PEKPKPEV K PQ PETPK PEVKPQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEVK 432
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 753 GR PE T PK HRHDNRRDSG KP STEKK PE VS K HKQDTKSDS P R ------ LKSERAEAL K QRP D GR -- S VSES L RR D hdn KQ K S 824
Cdd:NF033839 433 PQ PE K PK PEVKPQPEKP KP EVKPQ PE TP K PEVKPQPEK P K pevkpq PEKPKPDNS K PQA D DK kp S TPNN L SK D --- KQ P S 509
....*...
gi 119576354 825 DDRGES E R 832
Cdd:NF033839 510 NQASTN E K 517
PspC_subgroup_2
NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
590-790
3.60e-13
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain]
Cd Length: 557
Bit Score: 75.19
E-value: 3.60e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 590 K STPENH PE T PK KKSD P E L SK s EMKQSESRLAES KP NENRLV E TKSS E N K LET kv ET QTE E L K - Q N E SRTT E C K QNEST i 668
Cdd:NF033839 329 K PEVKPQ PE K PK PEVK P Q L ET - PKPEVKPQPEKP KP EVKPQP E KPKP E V K PQP -- ET PKP E V K p Q P E KPKP E V K PQPEK - 404
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 669 vepkqnenrlsd T KP NDNK Q NNGRSETT K SR PE T PK QKGESR PE T PK QKSDGH PE T PK QKGDGR PETPK QKGESR PE T PK 748
Cdd:NF033839 405 ------------ P KP EVKP Q PEKPKPEV K PQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEVKPQ PETPK PEVKPQ PE K PK 472
170 180 190 200
....*....|....*....|....*....|....*....|....*
gi 119576354 749 QKNEGR PE T PK hr H DN RR --- D SG KPST EK kp EV SK H KQ DTKSD S 790
Cdd:NF033839 473 PEVKPQ PE K PK -- P DN SK pqa D DK KPST PN -- NL SK D KQ PSNQA S 513
PTZ00121
PTZ00121
MAEBL; Provisional
476-1092
3.65e-12
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain]
Cd Length: 2084
Bit Score: 72.87
E-value: 3.65e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 476 E IERI E RESAIERERFSK E VQDK D KPL KK RKQDSYPQ EA GGATGGNRPASQETGSTGNGSRP A LMVSIDLHQ A GRVDSQ A 555
Cdd:PTZ00121 1282 E LKKA E EKKKADEAKKAE E KKKA D EAK KK AEEAKKAD EA KKKAEEAKKKADAAKKKAEEAKK A AEAAKAEAE A AADEAE A 1361
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 556 SITQ - DSDSI KK P E EI K QCND A PVSVLQEDIVGSL K STP E N hpet P KKK S D p EL S K SEMKQSESRL A ES K PN E NR lvet K 634
Cdd:PTZ00121 1362 AEEK a EAAEK KK E E AK K KADA A KKKAEEKKKADEA K KKA E E ---- D KKK A D - EL K K AAAAKKKADE A KK K AE E KK ---- K 1432
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 635 SS E N K LETKVETQTE E L K Q nes RTT E C K QN E S ti VEP K QN E NRLS D t KPNDNKQNNGRSETT K SRP E TP K Q K GES -- RPE 712
Cdd:PTZ00121 1433 AD E A K KKAEEAKKAD E A K K --- KAE E A K KA E E -- AKK K AE E AKKA D - EAKKKAEEAKKADEA K KKA E EA K K K ADE ak KAA 1506
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 713 TP K Q K S D ghp E TP K QK gdgrp E TP K QKGESRP E TP K QKN E GRPETP K HRH D NRRDS --- G K PSTE KK P E VS K HKQDT K SD 789
Cdd:PTZ00121 1507 EA K K K A D --- E AK K AE ----- E AK K ADEAKKA E EA K KAD E AKKAEE K KKA D ELKKA eel K K AEEK KK A E EA K KAEED K NM 1578
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 790 SP R lkse R AE AL K QRPDG R SV ------------- S E SLRRDHDN K Q K SDD -- RG E S E RHRGD Q SR ------ VRRP E T L RS 848
Cdd:PTZ00121 1579 AL R ---- K AE EA K KAEEA R IE evmklyeeekkmk A E EAKKAEEA K I K AEE lk KA E E E KKKVE Q LK kkeaee KKKA E E L KK 1654
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 849 SSRN ------------- E HGI K SDSS K TDKLER K HRH E SGDSRE --------- RPSSG E Q K SRPDSPR v K QGDS NK SRSD 906
Cdd:PTZ00121 1655 AEEE nkikaaeeakkae E DKK K AEEA K KAEEDE K KAA E ALKKEA eeakkaeel KKKEA E E K KKAEELK - K AEEE NK IKAE 1733
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 907 KLGFKSPTS K ---- DD K RT E GN K S K VDTN K AHPDN KAE fpsyllgg RSGAL K NF VI PKIKRDK D gnvtq E TKK ME MKGEP 982
Cdd:PTZ00121 1734 EAKKEAEED K kkae EA K KD E EE K K K IAHL K KEEEK KAE -------- EIRKE K EA VI EEELDEE D ----- E KRR ME VDKKI 1800
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 983 KD KVEKIGLVEDLN K GAKP V VVLQ K LSL D DVQ K LIK D REDKSRSSLKPIK ------- N KPSKSNKGSI D QSVL K E L PPEL 1055
Cdd:PTZ00121 1801 KD IFDNFANIIEGG K EGNL V INDS K EME D SAI K EVA D SKNMQLEEADAFE khkfnkn N ENGEDGNKEA D FNKE K D L KEDD 1880
650 660 670 680
....*....|....*....|....*....|....*....|.
gi 119576354 1056 LA EIE STM plc E RV K MN K ---- R KRSTV N EKP K YAE I SS D E 1092
Cdd:PTZ00121 1881 EE EIE EAD --- E IE K ID K ddie R EIPNN N MAG K NND I ID D K 1918
PspC_subgroup_2
NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
591-806
2.56e-10
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain]
Cd Length: 557
Bit Score: 65.95
E-value: 2.56e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 591 ST PEN ---- H P E TP KKKSD P ELSKSEM K Q S ESRLAES K PNENRL V E T KS S ----------- ENKLETKVETQTE EL KQNE 655
Cdd:NF033839 162 PQ PEN pehq K P T TP APDTK P SPQPEGK K P S VPDINQE K EKAKLA V A T YM S kilddiqkhhl QKEKHRQIVALIK EL DELK 241
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 656 SRTTECKQ N ES T I VE PKQNENRL - S D TKPNDN K QNN G RSET T ----- KSR P ET PK QKGESR P ETP K QKSDGH PETPK QKG 729
Cdd:NF033839 242 KQALSEID N VN T K VE IENTVHKI f A D MDAVVT K FKK G LTQD T pkepg NKK P SA PK PGMQPS P QPE K KEVKPE PETPK PEV 321
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 119576354 730 DGRP E T PK QKGESR PE T PK QKNEGRP ETPK HRHDNRRDSG KP st E K KP EVS K H K QDT K SDSPRL K S E - RAEAL K QR P D 806
Cdd:NF033839 322 KPQL E K PK PEVKPQ PE K PK PEVKPQL ETPK PEVKPQPEKP KP -- E V KP QPE K P K PEV K PQPETP K P E v KPQPE K PK P E 397
ftsN
TIGR02223
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a ...
591-778
1.22e-07
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a number of Proteobacteria. The N-terminal 30 residue region tends to by Lys/Arg-rich, and is followed by a membrane-spanning region. This is followed by an acidic low-complexity region of variable length and a well-conserved C-terminal domain of two tandem regions matched by pfam05036 (Sporulation related repeat), found in several cell division and sporulation proteins. The role of FtsN as a suppressor for other cell division mutations is poorly understood; it may involve cell wall hydrolysis. [Cellular processes, Cell division]
Pssm-ID: 274041 [Multi-domain]
Cd Length: 298
Bit Score: 55.85
E-value: 1.22e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 591 S TPE N H PET PKK K SDP E LSKSEMK --- QS E S R LAESKPN E N R L V ETKSS E NKLETKVETQTEE L KQNESRTT E ckqnest 667
Cdd:TIGR02223 51 S KQA N E PET LQP K NQT E NGETAAD lpp KP E E R WSYIEEL E A R E V LINDP E EPSNGGGVEESAQ L TAEQRQLL E ------- 123
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 668 ive PK Q NEN R LSDTKPNDNKQNNGRSETTKSRPETP K QKGESRP E TP K QK sdgh P ET P K QKGDGRPETP KQK GESRPETP 747
Cdd:TIGR02223 124 --- QM Q ADM R AAEKVLATAPSEQTVAVEARKQTAEK K PQKARTA E AQ K TP ---- V ET E K IASKVKEAKQ KQK ALPKQTAE 196
170 180 190
....*....|....*....|....*....|.
gi 119576354 748 K Q K N EGRP ET P kh RHDNRR D SG KP STEK K P E 778
Cdd:TIGR02223 197 T Q S N SKPI ET A -- PKADKA D KT KP KPKE K A E 225
PTZ00449
PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
476-894
7.81e-07
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain]
Cd Length: 943
Bit Score: 55.08
E-value: 7.81e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 476 EI ERIERE S AIERERFSK E VQ DK - D K P LKKRKQDSY P QE A G G --- ATG G NRPA S Q E TGSTGN G SR P AL mvsidlhqagrv 551
Cdd:PTZ00449 484 EI KKLIKK S KKKLAPIEE E DS DK h D E P PEGPEASGL P PK A P G dke GEE G EHED S K E SDEPKE G GK P GE ------------ 551
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 552 dsqasi T QDSDSI KKP EEI K QCNDAPVSV L QEDIVGSLKSTPENH PE T PKK KSD P ELSKSEMKQSESR L A E SK -- P NENR 629
Cdd:PTZ00449 552 ------ T KEGEVG KKP GPA K EHKPSKIPT L SKKPEFPKDPKHPKD PE E PKK PKR P RSAQRPTRPKSPK L P E LL di P KSPK 625
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 630 LV E TKS S ENKLETKVETQTE E LKQNESRTTEC K QNE S TIV -- E PK QN E N ---------- RLSD TK PNDNKQNNGR S ETTK 697
Cdd:PTZ00449 626 RP E SPK S PKRPPPPQRPSSP E RPEGPKIIKSP K PPK S PKP pf D PK FK E K fyddyldaaa KSKE TK TTVVLDESFE S ILKE 705
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 698 SR PETP KQKGES - RP ET PK QKS D gh P E T P KQK g D G R P ETPKQKGESRPET P KQKNEGRP ETP KHRH ------------ D N 764
Cdd:PTZ00449 706 TL PETP GTPFTT p RP LP PK LPR D -- E E F P FEP - I G D P DAEQPDDIEFFTP P EEERTFFH ETP ADTP lpdilaeefkee D I 782
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 765 RRDS G K P - STE K K P EV - S K H KQDTKS D S P R L -- K SE R AEA L K - QRP D GR S VSESLRR D HDN K QKSDD R GE S erh RG D QSR 839
Cdd:PTZ00449 783 HAET G E P d EAM K R P DS p S E H EDKPPG D H P S L pk K RH R LDG L A l STT D LE S DAGRIAK D ASG K IVKLK R SK S --- FD D LTT 859
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 119576354 840 V RRP E TLRSSS R ---- NEH G IKS D SSK T DKL E R KH RH E SGDS R -- ER PS SGEQK S R P DS P R 894
Cdd:PTZ00449 860 V EEA E EMGAEA R kivv DDD G TEA D DED T HPP E E KH KS E VRRR R pp KK PS KPKKP S K P KK P K 920
Caldesmon
pfam02029
Caldesmon;
480-890
4.30e-04
Caldesmon;
Pssm-ID: 460421 [Multi-domain]
Cd Length: 495
Bit Score: 45.63
E-value: 4.30e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 480 IE R E SAIE RER FSKEVQDK dkplkk R K Q DSYPQEA G GA T GGNR P ASQETGSTGNGSR P ALMVSI D LHQ A G rvdsqasitq 559
Cdd:pfam02029 1 IE D E EEAA RER RRRAREER ------ R R Q KEEEEPS G QV T ESVE P NEHNSYEEDSELK P SGQGGL D EEE A F ---------- 64
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 560 d S D SIK K P EE IK Q CNDAPVSVL Q EDIVGSLKSTP E NHP E TPKKKSDP E L S KS E MK - QSE SRL AES K PN E NRLV E TKSS EN 638
Cdd:pfam02029 65 - L D RTA K R EE RR Q KRLQEALER Q KEFDPTIADEK E SVA E RKENNEEE E N S SW E KE e KRD SRL GRY K EE E TEIR E KEYQ EN 143
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 639 K LE T -------------- K V E TQT E ELKQ N ESRTTECKQNESTIVEP K QNENRLS D T K PN dnkqnngrsettksrpe T P K 704
Cdd:pfam02029 144 K WS T evrqaeeegeeeed K S E EAE E VPTE N FAKEEVKDEKIKKEKKV K YESKVFL D Q K RG ----------------- H P E 206
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 705 Q K GESRP E tp KQKSDGHPETPK Q K G DGRPETPKQKG E SRP E TPKQKN E G R petpkhrhdn RR DSG K P S T E KKP ev SKH KQ 784
Cdd:pfam02029 207 V K SQNGE E -- EVTKLKVTTKRR Q G G LSQSQEREEEA E VFL E AEQKLE E L R ---------- RR RQE K E S E E FEK -- LRQ KQ 272
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 785 dtksds PRLKS E RA E AL K Q R PDG R SV - S E SLR R DHDNKQKSDD R G E S E RH R GDQSRV RR ----------- PE T lr SSS RN 852
Cdd:pfam02029 273 ------ QEAEL E LE E LK K K R EER R KL l E E EEQ R RKQEEAERKL R E E E E KR R MKEEIE RR raeaaekrqkl PE D -- SSS EG 344
410 420 430
....*....|....*....|....*....|....*...
gi 119576354 853 EHGI K SD S S K TDK L ERKH R H E SGDSRERP SS GEQ K SR P 890
Cdd:pfam02029 345 KKPF K CF S P K GSS L KITE R A E FLNKSLQK SS SVK K TH P 382
PspC_subgroup_1
NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
475-759
1.52e-03
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain]
Cd Length: 684
Bit Score: 43.85
E-value: 1.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 475 AE I E RIE R ES A IER E RFS K E V -- QDK DKP l K K R KQDSYPQ E AGGATGGNRP A SQETG S T G NGSR P A lmvsidlhqagrv D 552
Cdd:NF033838 233 AE E E AKR R AD A KLK E AVE K N V at SEQ DKP - K R R AKRGVLG E PATPDKKEND A KSSDS S V G EETL P S ------------- P 298
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 553 S QASITQDSDSI KK P EE I - K QCN D apvsvl Q ED ivgslk STPE N H P ETPK K KSDP E LSK S EM K QS E SR L aeskpnen R LV 631
Cdd:NF033838 299 S LKPEKKVAEAE KK V EE A k K KAK D ------ Q KE ------ EDRR N Y P TNTY K TLEL E IAE S DV K VK E AE L -------- E LV 358
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 632 ETKSS E NKL E T K VETQTEEL kqnesrtt E C K QN E S T IV E PKQNENR lsd TKPNDN K QNNGRSETT K SR P - E T P KQKGESR 710
Cdd:NF033838 359 KEEAK E PRN E E K IKQAKAKV -------- E S K KA E A T RL E KIKTDRK --- KAEEEA K RKAAEEDKV K EK P a E Q P QPAPAPQ 427
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 711 PE T P KQ K SDGHP E T PK - Q K GDGR ---------- P E TPKQKGESR P et PK QKNEGR P E TPK 759
Cdd:NF033838 428 PE K P AP K PEKPA E Q PK a E K PADQ qaeedyarrs E E EYNRLTQQQ P -- PK TEKPAQ P S TPK 485
Name
Accession
Description
Interval
E-value
SCC2
cd23958
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid ...
1263-2473
0e+00
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid cohesion protein 2 (Scc2) and its homolog (Scc2 homolog, also called Nipped-B-like protein or NIPBL). Scc2/NIPBL and Scc4 form a complex that is responsible for loading the cohesin protein onto sister chromatids during mitosis and meiosis. Cohesin is a ring-shaped protein complex that encircles the sister chromatids and helps to hold them together until they are ready to be separated during cell division. In addition to its role in chromosome segregation, cohesin also plays important roles in other cellular processes such as transcription, chromosome condensation, and DNA repair.
Pssm-ID: 467937 [Multi-domain]
Cd Length: 1197
Bit Score: 1442.10
E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1263 V KV L N ILE K NI Q DG SK L S t L LNHNNDTEE EERLW RDLIME R VTKS ADA C LT tin I M TSP NM PK AV Y I ED V IERV IQYT KF 1342
Cdd:cd23958 3 V RL L T ILE R NI R DG ES L D - L DLDESQEDD EERLW LLERID R ALEA ADA S LT --- I L TSP GL PK QL Y S ED L IERV VDFL KF 78
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1343 H L Q NT L YP Q YDPVYR L D PHGGGL lss K A KRAK C S TH K QRVIVM L Y NK V C DIV S S L S ELL EI Q L LTD TT ILQ VSSMG I T PF 1422
Cdd:cd23958 79 Q L E NT I YP A YDPVYR S D SSAKAG --- K K KRAK A S SK K KKSVST L L NK L C ELL S L L A ELL SL Q S LTD SV ILQ LVYLA I S PF 155
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1423 FVE ---- NV S ELQL C A I KL V T AV FSRY EKH RQ L I L EEI FT SLA R LP T SKR S LR N FRLN SSD mdgepm Y IQMVTAL V LQL I 1498
Cdd:cd23958 156 FVE navs NV D ELQL S A L KL L T SI FSRY PDQ RQ F I I EEI LS SLA K LP S SKR N LR Q FRLN DGK ------ S IQMVTAL L LQL V 229
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1499 Q CV V H LP SS EK D S N ----- A E EDSNKKI D QDVVITN SYE T A M R T A QN FLS IF L K KC GS K -- QGEE DYRPLFENFVQDLL S 1571
Cdd:cd23958 230 Q SS V K LP NL EK E S S rdksl E E DSDELLE D EESALAK SYE S A V R I A SY FLS FL L Q KC TK K kk EKDT DYRPLFENFVQDLL T 309
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1572 TV N K PEWPAAELLLSLLGRLLV HQ FSNK S T EMAL RV AS LD Y LG TV AARLRKDA VT skmdqgsierilkqvsggedei QQ L 1651
Cdd:cd23958 310 VL N L PEWPAAELLLSLLGRLLV SI FSNK K T DANA RV MA LD L LG LI AARLRKDA LA ---------------------- EE L 367
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1652 QKALLDYL D EN TET DPSL VFS R K FY I AQW F RD TTL E T EKA M K sqkdeessegthh A K E I E T T GQIMHRA E N RKKFL R S I i 1731
Cdd:cd23958 368 QKALLDYL A EN SSS DPSL ESA R G FY L AQW L RD LSN E L EKA E K ------------- A A E E E D T ILKLELS E L RKKFL D S K - 433
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1732 kttpsq FSTLKMNSDTVDYD DA C L IV R Y LAS M RP FA QSFD IY L T Q I L RV L G E N A IAV RTKA M K C LS E VV AV DPSIL ARL D 1811
Cdd:cd23958 434 ------ ILSKEEEASPLSRE DA K L LY R A LAS Q RP LS QSFD PI L K Q L L SS L D E P A VTL RTKA L K A LS L VV EA DPSIL GDP D 507
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1812 M QR G V H GRL M D N S T SVREAAVEL L G RFVLC RP Q LAEQYY D M LI ERILDTG I SVRKRVIKILRDI CIEQ P T F PKITEM CV K 1891
Cdd:cd23958 508 V QR A V E GRL L D S S A SVREAAVEL V G KYISS RP D LAEQYY E M IA ERILDTG V SVRKRVIKILRDI YLRT P D F EIKVDI CV R 587
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1892 MI RR V ND - EE G IK K L VNE TFQ K LWFTP T P HN ----- DKE AMTRKI L N I T DVVAACR d T G Y D WF EQLL QN LLKS E ED SSY K 1965
Cdd:cd23958 588 LL RR I ND e EE S IK D L ARK TFQ E LWFTP F P ES sspaq DKE SLAERV L L I V DVVAACR - K G L D LL EQLL KR LLKS K ED KED K 666
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1966 P V K KAC T QLVD N LVE H IL KY EE SLAD S dnkgv NSGR LVAC IT TL F LF S K IR P - Q L M V K HA M T M QPYL TT KCST QN D FM V I 2044
Cdd:cd23958 667 S V R KAC K QLVD C LVE L IL EL EE DDDE S ----- SESD LVAC LS TL H LF A K AD P k L L L V E HA E T L QPYL KS KCST RE D QQ V L 741
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2045 CN V AK IL EL V V PL ME HPSE T FL ATI EEDL M KL II K YGM TV V Q HCVS CL G AVVNK V T Q N FKFVWACFNRYYGAIS K L K S Q H 2124
Cdd:cd23958 742 RY V LR IL RS V L PL LS HPSE S FL EEL EEDL L KL LL K HSV TV L Q EAIA CL C AVVNK L T K N YERLRKALQSCLKLLR K Y K R Q A 821
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2125 QE DP NNT sll TNK P A LLR S L FTV G A L C R HF DFD L E DFKGN ----- S K VNI K DK V LE LL MY FTK HS - DE E V QT KA IIG LGF 2198
Cdd:cd23958 822 NL DP SSL --- KED P K LLR L L YIL G L L A R YC DFD S E RDDFE kaplk T K ESV K EL V FD LL LF FTK PP i DE D V RK KA LQA LGF 898
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2199 AF I Q HP S L MFEQ EV KN L YNN IL SD kn S S VN LK I QVL K NLQ TY LQ E E DT RM QQ AD RD WKK VA K Q ----- E D L KEMGD VS SG 2273
Cdd:cd23958 899 LC I A HP K L FLSP EV LK L LDE IL AS -- G S LK LK L QVL R NLQ EF LQ A E EK RM EA AD AE WKK NS K A advkv L D G KEMGD AD SG 976
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2274 MS SSIMQ L YLK QV LE AFFHTQ S S VR HF AL N V IA L T L N QGL I HP V QCVP Y LIA MG TDP E PA M R NK A DQQ L V E IDK KY AGFI 2353
Cdd:cd23958 977 VA SSIMQ R YLK DI LE LCLSSD S Q VR LA AL K V LE L I L R QGL V HP I QCVP T LIA LE TDP N PA I R KL A LRL L K E LHE KY ESLV 1056
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2354 HM K AVA G MKMSY Q V Q QAINT cl KDPV RGFR Q D ESSS AL CSH LYS MI RGNR QH RR A FL I SLL N LFD D ------ TAKT D VTM 2427
Cdd:cd23958 1057 ES K YLE G VRLAF Q Y Q KRLAG -- DTRG RGFR T D SPPT AL LGR LYS LL RGNR KS RR K FL K SLL K LFD F dlkkss DSPS D LDF 1134
1210 1220 1230 1240
....*....|....*....|....*....|....*....|....*..
gi 119576354 2428 LL YI A D NLA CF PYQTQ E EPLF IM H H ID IT LSV S GS N LLQ SF - K E S MV 2473
Cdd:cd23958 1135 LL FL A E NLA FL PYQTQ D EPLF VI H T ID RI LSV T GS S LLQ AI a K A S QA 1181
Nipped-B_C
pfam12830
Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or ...
2275-2456
8.60e-70
Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or sister chromatid cohesion proteins.
Pssm-ID: 463722
Cd Length: 180
Bit Score: 232.43
E-value: 8.60e-70
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2275 S S SIM Q L YLK QV LE AFFHTQSS VR HF AL N V I AL T L N QGL I HP VQ C V P Y LIA MG T D P E P AM R NK A DQQLV E IDK K YAGFIH 2354
Cdd:pfam12830 1 C S ALV Q R YLK HI LE ICLSSDDQ VR LL AL E V L AL I L R QGL V HP KE C I P T LIA LE T S P N P YI R KL A FELHK E LHE K HESLLE 80
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 2355 MKAVA G MKMSYQV Q QAINTC lkdpvrgf RQD E SSSALC S H LYS MI R G N RQH R RA FL I SL LN LF D D ------ TAKT D VTM L 2428
Cdd:pfam12830 81 SRYME G IRLAFEY Q RRVLSG -------- ATL E PPTSFL S L LYS LL R S N KKS R KK FL K SL VK LF F D ldlsse SSPS D LDF L 152
170 180
....*....|....*....|....*...
gi 119576354 2429 LYI A D NLA CF PYQTQ E E P LF IM HHID IT 2456
Cdd:pfam12830 153 RFL A E NLA FL PYQTQ D E V LF LI HHID RI 180
PspC_subgroup_2
NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
597-832
3.04e-15
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain]
Cd Length: 557
Bit Score: 81.74
E-value: 3.04e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 597 PE TPK KKSDPEL S -- K SE M KQ S - ESRLA E S KP N - E NRLV E T K SSEN K LETK V ET Q T E EL K QNESRTT E CKQN E sti V E P K 672
Cdd:NF033839 281 QD TPK EPGNKKP S ap K PG M QP S p QPEKK E V KP E p E TPKP E V K PQLE K PKPE V KP Q P E KP K PEVKPQL E TPKP E --- V K P Q 357
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 673 qnenr LSDT KP NDNK Q NNGRSETT K SR PETPK QKGESR PE T PK QKSDGH PE T PK QKGDGR PE T PK QKGESR PE T PK QKNE 752
Cdd:NF033839 358 ----- PEKP KP EVKP Q PEKPKPEV K PQ PETPK PEVKPQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEVK 432
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 753 GR PE T PK HRHDNRRDSG KP STEKK PE VS K HKQDTKSDS P R ------ LKSERAEAL K QRP D GR -- S VSES L RR D hdn KQ K S 824
Cdd:NF033839 433 PQ PE K PK PEVKPQPEKP KP EVKPQ PE TP K PEVKPQPEK P K pevkpq PEKPKPDNS K PQA D DK kp S TPNN L SK D --- KQ P S 509
....*...
gi 119576354 825 DDRGES E R 832
Cdd:NF033839 510 NQASTN E K 517
PspC_subgroup_2
NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
590-790
3.60e-13
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain]
Cd Length: 557
Bit Score: 75.19
E-value: 3.60e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 590 K STPENH PE T PK KKSD P E L SK s EMKQSESRLAES KP NENRLV E TKSS E N K LET kv ET QTE E L K - Q N E SRTT E C K QNEST i 668
Cdd:NF033839 329 K PEVKPQ PE K PK PEVK P Q L ET - PKPEVKPQPEKP KP EVKPQP E KPKP E V K PQP -- ET PKP E V K p Q P E KPKP E V K PQPEK - 404
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 669 vepkqnenrlsd T KP NDNK Q NNGRSETT K SR PE T PK QKGESR PE T PK QKSDGH PE T PK QKGDGR PETPK QKGESR PE T PK 748
Cdd:NF033839 405 ------------ P KP EVKP Q PEKPKPEV K PQ PE K PK PEVKPQ PE K PK PEVKPQ PE K PK PEVKPQ PETPK PEVKPQ PE K PK 472
170 180 190 200
....*....|....*....|....*....|....*....|....*
gi 119576354 749 QKNEGR PE T PK hr H DN RR --- D SG KPST EK kp EV SK H KQ DTKSD S 790
Cdd:NF033839 473 PEVKPQ PE K PK -- P DN SK pqa D DK KPST PN -- NL SK D KQ PSNQA S 513
PTZ00121
PTZ00121
MAEBL; Provisional
476-1092
3.65e-12
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain]
Cd Length: 2084
Bit Score: 72.87
E-value: 3.65e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 476 E IERI E RESAIERERFSK E VQDK D KPL KK RKQDSYPQ EA GGATGGNRPASQETGSTGNGSRP A LMVSIDLHQ A GRVDSQ A 555
Cdd:PTZ00121 1282 E LKKA E EKKKADEAKKAE E KKKA D EAK KK AEEAKKAD EA KKKAEEAKKKADAAKKKAEEAKK A AEAAKAEAE A AADEAE A 1361
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 556 SITQ - DSDSI KK P E EI K QCND A PVSVLQEDIVGSL K STP E N hpet P KKK S D p EL S K SEMKQSESRL A ES K PN E NR lvet K 634
Cdd:PTZ00121 1362 AEEK a EAAEK KK E E AK K KADA A KKKAEEKKKADEA K KKA E E ---- D KKK A D - EL K K AAAAKKKADE A KK K AE E KK ---- K 1432
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 635 SS E N K LETKVETQTE E L K Q nes RTT E C K QN E S ti VEP K QN E NRLS D t KPNDNKQNNGRSETT K SRP E TP K Q K GES -- RPE 712
Cdd:PTZ00121 1433 AD E A K KKAEEAKKAD E A K K --- KAE E A K KA E E -- AKK K AE E AKKA D - EAKKKAEEAKKADEA K KKA E EA K K K ADE ak KAA 1506
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 713 TP K Q K S D ghp E TP K QK gdgrp E TP K QKGESRP E TP K QKN E GRPETP K HRH D NRRDS --- G K PSTE KK P E VS K HKQDT K SD 789
Cdd:PTZ00121 1507 EA K K K A D --- E AK K AE ----- E AK K ADEAKKA E EA K KAD E AKKAEE K KKA D ELKKA eel K K AEEK KK A E EA K KAEED K NM 1578
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 790 SP R lkse R AE AL K QRPDG R SV ------------- S E SLRRDHDN K Q K SDD -- RG E S E RHRGD Q SR ------ VRRP E T L RS 848
Cdd:PTZ00121 1579 AL R ---- K AE EA K KAEEA R IE evmklyeeekkmk A E EAKKAEEA K I K AEE lk KA E E E KKKVE Q LK kkeaee KKKA E E L KK 1654
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 849 SSRN ------------- E HGI K SDSS K TDKLER K HRH E SGDSRE --------- RPSSG E Q K SRPDSPR v K QGDS NK SRSD 906
Cdd:PTZ00121 1655 AEEE nkikaaeeakkae E DKK K AEEA K KAEEDE K KAA E ALKKEA eeakkaeel KKKEA E E K KKAEELK - K AEEE NK IKAE 1733
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 907 KLGFKSPTS K ---- DD K RT E GN K S K VDTN K AHPDN KAE fpsyllgg RSGAL K NF VI PKIKRDK D gnvtq E TKK ME MKGEP 982
Cdd:PTZ00121 1734 EAKKEAEED K kkae EA K KD E EE K K K IAHL K KEEEK KAE -------- EIRKE K EA VI EEELDEE D ----- E KRR ME VDKKI 1800
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 983 KD KVEKIGLVEDLN K GAKP V VVLQ K LSL D DVQ K LIK D REDKSRSSLKPIK ------- N KPSKSNKGSI D QSVL K E L PPEL 1055
Cdd:PTZ00121 1801 KD IFDNFANIIEGG K EGNL V INDS K EME D SAI K EVA D SKNMQLEEADAFE khkfnkn N ENGEDGNKEA D FNKE K D L KEDD 1880
650 660 670 680
....*....|....*....|....*....|....*....|.
gi 119576354 1056 LA EIE STM plc E RV K MN K ---- R KRSTV N EKP K YAE I SS D E 1092
Cdd:PTZ00121 1881 EE EIE EAD --- E IE K ID K ddie R EIPNN N MAG K NND I ID D K 1918
PspC_subgroup_2
NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
591-806
2.56e-10
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain]
Cd Length: 557
Bit Score: 65.95
E-value: 2.56e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 591 ST PEN ---- H P E TP KKKSD P ELSKSEM K Q S ESRLAES K PNENRL V E T KS S ----------- ENKLETKVETQTE EL KQNE 655
Cdd:NF033839 162 PQ PEN pehq K P T TP APDTK P SPQPEGK K P S VPDINQE K EKAKLA V A T YM S kilddiqkhhl QKEKHRQIVALIK EL DELK 241
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 656 SRTTECKQ N ES T I VE PKQNENRL - S D TKPNDN K QNN G RSET T ----- KSR P ET PK QKGESR P ETP K QKSDGH PETPK QKG 729
Cdd:NF033839 242 KQALSEID N VN T K VE IENTVHKI f A D MDAVVT K FKK G LTQD T pkepg NKK P SA PK PGMQPS P QPE K KEVKPE PETPK PEV 321
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 119576354 730 DGRP E T PK QKGESR PE T PK QKNEGRP ETPK HRHDNRRDSG KP st E K KP EVS K H K QDT K SDSPRL K S E - RAEAL K QR P D 806
Cdd:NF033839 322 KPQL E K PK PEVKPQ PE K PK PEVKPQL ETPK PEVKPQPEKP KP -- E V KP QPE K P K PEV K PQPETP K P E v KPQPE K PK P E 397
Cohesin_HEAT
pfam12765
HEAT repeat associated with sister chromatid cohesion; This HEAT repeat is found most ...
1794-1835
2.72e-09
HEAT repeat associated with sister chromatid cohesion; This HEAT repeat is found most frequently in sister chromatid cohesion proteins such as Nipped-B. HEAT repeats are found tandemly repeated in many proteins, and they appear to serve as flexible scaffolding on which other components can assemble.
Pssm-ID: 403845 [Multi-domain]
Cd Length: 42
Bit Score: 54.77
E-value: 2.72e-09
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 119576354 1794 K C LS EV V AV DPSIL ARL D MQRGVHG RL M D N S T SVR E AA V ELL 1835
Cdd:pfam12765 1 K A LS SL V EK DPSIL DSP D VKEAISR RL T D S S P SVR D AA L ELL 42
PTZ00121
PTZ00121
MAEBL; Provisional
607-1192
3.46e-09
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain]
Cd Length: 2084
Bit Score: 62.85
E-value: 3.46e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 607 EL S K S E -- M K QSES R L AE skp N E NRLV E TKSS E NKLETKVETQT EE L K QN -- E SRTT E CKQ N EST I VEPKQNENRLSDTK 682
Cdd:PTZ00121 1192 EL R K A E da R K AEAA R K AE --- E E RKAE E ARKA E DAKKAEAVKKA EE A K KD ae E AKKA E EER N NEE I RKFEEARMAHFARR 1268
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 683 PNDN K QNNG R SETTKSRP E TP K QKG E SRPETP K Q K S D ghp E TP K QKGDGR - PETP K Q K G E S --- RPETP K Q K N E --- GRP 755
Cdd:PTZ00121 1269 QAAI K AEEA R KADELKKA E EK K KAD E AKKAEE K K K A D --- E AK K KAEEAK k ADEA K K K A E E akk KADAA K K K A E eak KAA 1345
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 756 E TP K HRHDNRR D SGKPST EK KPEVS K H K QDT K sdsprlks ER A E A L K QRPDG - RSVS E SLRRDHDN K Q K S D D -- RGESER 832
Cdd:PTZ00121 1346 E AA K AEAEAAA D EAEAAE EK AEAAE K K K EEA K -------- KK A D A A K KKAEE k KKAD E AKKKAEED K K K A D E lk KAAAAK 1417
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 833 HRG D QSRVRRP E TLRS ssr N E HGI K SDSS K TDKLER K HRH E SGDSR E RPSSG E QKSRP D sp RV K QGDSNKSRS D KLGF K S 912
Cdd:PTZ00121 1418 KKA D EAKKKAE E KKKA --- D E AKK K AEEA K KADEAK K KAE E AKKAE E AKKKA E EAKKA D -- EA K KKAEEAKKA D EAKK K A 1492
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 913 PTS K D d K RT E GN K SKVDTN KA HPDN KAE fpsyllggrsgalknfvip KI K RDKDGNVTQ E T KK ME -- M K G E P K D K VEKIG 990
Cdd:PTZ00121 1493 EEA K K - K AD E AK K AAEAKK KA DEAK KAE ------------------- EA K KADEAKKAE E A KK AD ea K K A E E K K K ADELK 1552
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 991 LV E D L N K GA kpvvvl Q K LSLDDVQ K LIK D REDKS R SSLKPI K NKPSKSNKGSIDQSVL K ELPP E LLAEI E STMPLC E RV K 1070
Cdd:PTZ00121 1553 KA E E L K K AE ------ E K KKAEEAK K AEE D KNMAL R KAEEAK K AEEARIEEVMKLYEEE K KMKA E EAKKA E EAKIKA E EL K 1626
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1071 MNKRKRST V N -------- EK P K YA E ISSD E DNDSDE A F E SSR K RH --- KK DDDKAWEY E ERDRRSSGDHRRSGHSHEGRR 1139
Cdd:PTZ00121 1627 KAEEEKKK V E qlkkkeae EK K K AE E LKKA E EENKIK A A E EAK K AE edk KK AEEAKKAE E DEKKAAEALKKEAEEAKKAEE 1706
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|...
gi 119576354 1140 SSGGGRYRNRSPSDSDMEDYSPPPSLS E VARKMKK kekq K K R KA Y E P K LTP EE 1192
Cdd:PTZ00121 1707 LKKKEAEEKKKAEELKKAEEENKIKAE E AKKEAEE ---- D K K KA E E A K KDE EE 1755
PTZ00449
PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
704-1139
6.96e-08
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain]
Cd Length: 943
Bit Score: 58.55
E-value: 6.96e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 704 K QKGESRPETPKQK SD G H petpkqkg D GR PE T P KQK G ES r P ET P kqkne G RP E TPKHR H DNRRD S GK P STEK KP EVS K HK 783
Cdd:PTZ00449 490 K KSKKKLAPIEEED SD K H -------- D EP PE G P EAS G LP - P KA P ----- G DK E GEEGE H EDSKE S DE P KEGG KP GET K EG 555
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 784 QDT K SDS P -- RL K SERAEA L KQR P DGRSVSESLRRDHDN K Q ksddrge SE R H R GD Q SRV R RPETLRSSS rnehgik S D SS 861
Cdd:PTZ00449 556 EVG K KPG P ak EH K PSKIPT L SKK P EFPKDPKHPKDPEEP K K ------- PK R P R SA Q RPT R PKSPKLPEL ------- L D IP 621
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 862 K TD K lerkh R H ES GD S RE RP SSGEQK S --- RP DS P RVKQ g DSNKSR S D K LG F k S P TS K ---- DD KRTEGN KSK VDTNKAH 934
Cdd:PTZ00449 622 K SP K ----- R P ES PK S PK RP PPPQRP S spe RP EG P KIIK - SPKPPK S P K PP F - D P KF K ekfy DD YLDAAA KSK ETKTTVV 694
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 935 P D NKA E F ---- PSYLLG G RSGALKNFVI PK IK RD KDGNVT ------- QETKKM E MKGE P KDKVEKIG lv E DLNKGAK P VV 1003
Cdd:PTZ00449 695 L D ESF E S ilke TLPETP G TPFTTPRPLP PK LP RD EEFPFE pigdpda EQPDDI E FFTP P EEERTFFH -- E TPADTPL P DI 772
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1004 VLQKLSLD D VQKLIKDREDKSRSSLK P IKNKPSKS nk G SIDQSVL K ELPPEL LA ---- EI ES TM ------ PLCER VK MNK 1073
Cdd:PTZ00449 773 LAEEFKEE D IHAETGEPDEAMKRPDS P SEHEDKPP -- G DHPSLPK K RHRLDG LA lstt DL ES DA griakd ASGKI VK LKR 850
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 119576354 1074 R K R ---- S TV N E K ---- PKYAE I SS D E D NDS -- DE AFESSRKR HK K dddkawey E E R D RR SSGDHRRSGHSHEGRR 1139
Cdd:PTZ00449 851 S K S fddl T TV E E A eemg AEARK I VV D D D GTE ad DE DTHPPEEK HK S -------- E V R R RR PPKKPSKPKKPSKPKK 918
ftsN
TIGR02223
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a ...
591-778
1.22e-07
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a number of Proteobacteria. The N-terminal 30 residue region tends to by Lys/Arg-rich, and is followed by a membrane-spanning region. This is followed by an acidic low-complexity region of variable length and a well-conserved C-terminal domain of two tandem regions matched by pfam05036 (Sporulation related repeat), found in several cell division and sporulation proteins. The role of FtsN as a suppressor for other cell division mutations is poorly understood; it may involve cell wall hydrolysis. [Cellular processes, Cell division]
Pssm-ID: 274041 [Multi-domain]
Cd Length: 298
Bit Score: 55.85
E-value: 1.22e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 591 S TPE N H PET PKK K SDP E LSKSEMK --- QS E S R LAESKPN E N R L V ETKSS E NKLETKVETQTEE L KQNESRTT E ckqnest 667
Cdd:TIGR02223 51 S KQA N E PET LQP K NQT E NGETAAD lpp KP E E R WSYIEEL E A R E V LINDP E EPSNGGGVEESAQ L TAEQRQLL E ------- 123
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 668 ive PK Q NEN R LSDTKPNDNKQNNGRSETTKSRPETP K QKGESRP E TP K QK sdgh P ET P K QKGDGRPETP KQK GESRPETP 747
Cdd:TIGR02223 124 --- QM Q ADM R AAEKVLATAPSEQTVAVEARKQTAEK K PQKARTA E AQ K TP ---- V ET E K IASKVKEAKQ KQK ALPKQTAE 196
170 180 190
....*....|....*....|....*....|.
gi 119576354 748 K Q K N EGRP ET P kh RHDNRR D SG KP STEK K P E 778
Cdd:TIGR02223 197 T Q S N SKPI ET A -- PKADKA D KT KP KPKE K A E 225
PTZ00449
PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
476-894
7.81e-07
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain]
Cd Length: 943
Bit Score: 55.08
E-value: 7.81e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 476 EI ERIERE S AIERERFSK E VQ DK - D K P LKKRKQDSY P QE A G G --- ATG G NRPA S Q E TGSTGN G SR P AL mvsidlhqagrv 551
Cdd:PTZ00449 484 EI KKLIKK S KKKLAPIEE E DS DK h D E P PEGPEASGL P PK A P G dke GEE G EHED S K E SDEPKE G GK P GE ------------ 551
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 552 dsqasi T QDSDSI KKP EEI K QCNDAPVSV L QEDIVGSLKSTPENH PE T PKK KSD P ELSKSEMKQSESR L A E SK -- P NENR 629
Cdd:PTZ00449 552 ------ T KEGEVG KKP GPA K EHKPSKIPT L SKKPEFPKDPKHPKD PE E PKK PKR P RSAQRPTRPKSPK L P E LL di P KSPK 625
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 630 LV E TKS S ENKLETKVETQTE E LKQNESRTTEC K QNE S TIV -- E PK QN E N ---------- RLSD TK PNDNKQNNGR S ETTK 697
Cdd:PTZ00449 626 RP E SPK S PKRPPPPQRPSSP E RPEGPKIIKSP K PPK S PKP pf D PK FK E K fyddyldaaa KSKE TK TTVVLDESFE S ILKE 705
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 698 SR PETP KQKGES - RP ET PK QKS D gh P E T P KQK g D G R P ETPKQKGESRPET P KQKNEGRP ETP KHRH ------------ D N 764
Cdd:PTZ00449 706 TL PETP GTPFTT p RP LP PK LPR D -- E E F P FEP - I G D P DAEQPDDIEFFTP P EEERTFFH ETP ADTP lpdilaeefkee D I 782
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 765 RRDS G K P - STE K K P EV - S K H KQDTKS D S P R L -- K SE R AEA L K - QRP D GR S VSESLRR D HDN K QKSDD R GE S erh RG D QSR 839
Cdd:PTZ00449 783 HAET G E P d EAM K R P DS p S E H EDKPPG D H P S L pk K RH R LDG L A l STT D LE S DAGRIAK D ASG K IVKLK R SK S --- FD D LTT 859
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 119576354 840 V RRP E TLRSSS R ---- NEH G IKS D SSK T DKL E R KH RH E SGDS R -- ER PS SGEQK S R P DS P R 894
Cdd:PTZ00449 860 V EEA E EMGAEA R kivv DDD G TEA D DED T HPP E E KH KS E VRRR R pp KK PS KPKKP S K P KK P K 920
PRK12678
PRK12678
transcription termination factor Rho; Provisional
693-907
1.02e-05
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain]
Cd Length: 672
Bit Score: 51.06
E-value: 1.02e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 693 SE T TKSRPETPKQKGESRPETPKQKSDGHPETPKQKGDGRPETPKQKGESRPETPKQKNEG R PETPKHRHDN R RDSG K PS 772
Cdd:PRK12678 67 AA T PAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQA R ERRERGEAAR R GAAR K AG 146
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 773 TEKKPEVSKHKQ D TKSDSPRLKSER aealk Q R PD G R svsesl R R D HDNKQKSDD RG ES E RHRG D QSRVR R PETLRSSS R N 852
Cdd:PRK12678 147 EGGEQPATEARA D AAERTEEEERDE ----- R R RR G D ------ R E D RQAEAERGE RG RR E ERGR D GDDRD R RDRREQGD R R 215
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 119576354 853 E HG iksdsskt DKLERKH R HESGDS R E R PSSGEQKS R P D SPRVKQG D SNKSRSDK 907
Cdd:PRK12678 216 E ER -------- GRRDGGD R RGRRRR R D R RDARGDDN R E D RGDRDGD D GEGRGGRR 262
PRK12678
PRK12678
transcription termination factor Rho; Provisional
692-889
2.49e-05
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain]
Cd Length: 672
Bit Score: 49.90
E-value: 2.49e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 692 RSETTKSRPETPKQKGESRPETPKQKSDGHPETPKQKGDGRPETPKQKGES R PETPKQKNEG R PETP K HRHDNRRDSGKP 771
Cdd:PRK12678 77 ARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQA R ERRERGEAAR R GAAR K AGEGGEQPATEA 156
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 772 STEKKPEVSKHKQ D TKSD -- SPRLKSER AE ALKQRPDGRSVSESLR RD HDNKQKSD DR G E SERH R GDQS R VR R PE tl R SS 849
Cdd:PRK12678 157 RADAAERTEEEER D ERRR rg DREDRQAE AE RGERGRREERGRDGDD RD RRDRREQG DR R E ERGR R DGGD R RG R RR -- R RD 234
170 180 190 200
....*....|....*....|....*....|....*....|
gi 119576354 850 S R NEH G iks D SSKT D KLE R KHRHES G DSRE R PSSGEQKS R 889
Cdd:PRK12678 235 R R DAR G --- D DNRE D RGD R DGDDGE G RGGR R GRRFRDRD R 271
PTZ00108
PTZ00108
DNA topoisomerase 2-like protein; Provisional
878-1113
5.58e-05
DNA topoisomerase 2-like protein; Provisional
Pssm-ID: 240271 [Multi-domain]
Cd Length: 1388
Bit Score: 48.89
E-value: 5.58e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 878 RERPSSGEQKS R PD S PRVKQGDSNKSRSD K LGF K SPTSKDDKRTEGNKSKVDTNKAHP D N K --- AEF P SYLLGGR SG ALK 954
Cdd:PTZ00108 1146 EVEEKEIAKEQ R LK S KTKGKASKLRKPKL K KKE K KKKKSSADKSKKASVVGNSKRVDS D E K rkl DDK P DNKKSNS SG SDQ 1225
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 955 NFVIPKIKRD K DGN V TQETK K m EMKGEPKDKVEKIGLVE DL N K GA KP VVVLQKL S LDDVQKLIKDREDKSR S SLKPIKNK 1034
Cdd:PTZ00108 1226 EDDEEQKTKP K KSS V KRLKS K - KNNSSKSSEDNDEFSSD DL S K EG KP KNAPKRV S AVQYSPPPPSKRPDGE S NGGSKPSS 1304
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1035 P S K SNKGSIDQSV L KE L PPELLA E IEST mplc ERV K MNK R ------- KR S TVNEK P KYAEIS S DEDN D S D EAFES S RKRH 1107
Cdd:PTZ00108 1305 P T K KKVKKRLEGS L AA L KKKKKS E KKTA ---- RKK K SKT R vkqasas QS S RLLRR P RKKKSD S SSED D D D SEVDD S EDED 1380
....*.
gi 119576354 1108 KK DD DK 1113
Cdd:PTZ00108 1381 DE DD ED 1386
dnaA
PRK14086
chromosomal replication initiator protein DnaA;
700-889
2.35e-04
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain]
Cd Length: 617
Bit Score: 46.74
E-value: 2.35e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 700 P ET P KQKGE S R PE T P KQKSDGHPETPKQKG D G RP ETPK --- Q KGES RP ET P KQKNEGR ---- P ETPKHRHDNRRDS G K P S 772
Cdd:PRK14086 97 P PP P HARRT S E PE L P RPGRRPYEGYGGPRA D D RP PGLP rqd Q LPTA RP AY P AYQQRPE pgaw P RAADDYGWQQQRL G F P P 176
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 773 TEKKPEVS khkqdtk S DS P RLKSE R AEALKQ RP D grsvseslr R D HDNKQKSDD R GESE R H R G D qs R VR RPE TLRSSSRN 852
Cdd:PRK14086 177 RAPYASPA ------- S YA P EQERD R EPYDAG RP E --------- Y D QRRRDYDHP R PDWD R P R R D -- R TD RPE PPPGAGHV 238
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 119576354 853 EH G IKSDSSKT D KLERKH R ----- HESGDSRER P SS GE QKS R 889
Cdd:PRK14086 239 HR G GPGPPERD D APVVPI R psapg PLAAQPAPA P GP GE PTA R 280
PTZ00108
PTZ00108
DNA topoisomerase 2-like protein; Provisional
590-838
4.07e-04
DNA topoisomerase 2-like protein; Provisional
Pssm-ID: 240271 [Multi-domain]
Cd Length: 1388
Bit Score: 46.19
E-value: 4.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 590 KSTPENHPETPK K KSD P E L S K S E M K QSE S r L A ESKPNENRLVET K SSENKLET K VETQTEEL K Q N E S RTTECKQN E STIV 669
Cdd:PTZ00108 1156 QRLKSKTKGKAS K LRK P K L K K K E K K KKK S - S A DKSKKASVVGNS K RVDSDEKR K LDDKPDNK K S N S S GSDQEDDE E QKTK 1234
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 670 EP K QNEN RL SDT K P N D n KQNNGRSETTK S RPETPKQ K GESR P E t PKQKSDGH P ET P KQ kgdgr PETPKQK G E S R P ET P KQ 749
Cdd:PTZ00108 1235 PK K SSVK RL KSK K N N S - SKSSEDNDEFS S DDLSKEG K PKNA P K - RVSAVQYS P PP P SK ----- RPDGESN G G S K P SS P TK 1307
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 750 K NEGRPETPK hrhdnrrd SGKPSTE KK P E VSKHKQD t KS DSPRLKSERAEALKQRPDG R SVSESLRRDH D NKQKS DD RGE 829
Cdd:PTZ00108 1308 K KVKKRLEGS -------- LAALKKK KK S E KKTARKK - KS KTRVKQASASQSSRLLRRP R KKKSDSSSED D DDSEV DD SED 1378
....*....
gi 119576354 830 SERHRGDQS 838
Cdd:PTZ00108 1379 EDDEDDEDD 1387
Caldesmon
pfam02029
Caldesmon;
480-890
4.30e-04
Caldesmon;
Pssm-ID: 460421 [Multi-domain]
Cd Length: 495
Bit Score: 45.63
E-value: 4.30e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 480 IE R E SAIE RER FSKEVQDK dkplkk R K Q DSYPQEA G GA T GGNR P ASQETGSTGNGSR P ALMVSI D LHQ A G rvdsqasitq 559
Cdd:pfam02029 1 IE D E EEAA RER RRRAREER ------ R R Q KEEEEPS G QV T ESVE P NEHNSYEEDSELK P SGQGGL D EEE A F ---------- 64
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 560 d S D SIK K P EE IK Q CNDAPVSVL Q EDIVGSLKSTP E NHP E TPKKKSDP E L S KS E MK - QSE SRL AES K PN E NRLV E TKSS EN 638
Cdd:pfam02029 65 - L D RTA K R EE RR Q KRLQEALER Q KEFDPTIADEK E SVA E RKENNEEE E N S SW E KE e KRD SRL GRY K EE E TEIR E KEYQ EN 143
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 639 K LE T -------------- K V E TQT E ELKQ N ESRTTECKQNESTIVEP K QNENRLS D T K PN dnkqnngrsettksrpe T P K 704
Cdd:pfam02029 144 K WS T evrqaeeegeeeed K S E EAE E VPTE N FAKEEVKDEKIKKEKKV K YESKVFL D Q K RG ----------------- H P E 206
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 705 Q K GESRP E tp KQKSDGHPETPK Q K G DGRPETPKQKG E SRP E TPKQKN E G R petpkhrhdn RR DSG K P S T E KKP ev SKH KQ 784
Cdd:pfam02029 207 V K SQNGE E -- EVTKLKVTTKRR Q G G LSQSQEREEEA E VFL E AEQKLE E L R ---------- RR RQE K E S E E FEK -- LRQ KQ 272
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 785 dtksds PRLKS E RA E AL K Q R PDG R SV - S E SLR R DHDNKQKSDD R G E S E RH R GDQSRV RR ----------- PE T lr SSS RN 852
Cdd:pfam02029 273 ------ QEAEL E LE E LK K K R EER R KL l E E EEQ R RKQEEAERKL R E E E E KR R MKEEIE RR raeaaekrqkl PE D -- SSS EG 344
410 420 430
....*....|....*....|....*....|....*...
gi 119576354 853 EHGI K SD S S K TDK L ERKH R H E SGDSRERP SS GEQ K SR P 890
Cdd:pfam02029 345 KKPF K CF S P K GSS L KITE R A E FLNKSLQK SS SVK K TH P 382
PRK12678
PRK12678
transcription termination factor Rho; Provisional
692-899
4.75e-04
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain]
Cd Length: 672
Bit Score: 45.67
E-value: 4.75e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 692 R SETTKSRPETPKQKGESRPETPKQKSDGHPETPKQKGDG R PETPKQKGES R PETP K QKNE G RPETPKH R H D NRRDSGKP 771
Cdd:PRK12678 88 R QAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQA R ERRERGEAAR R GAAR K AGEG G EQPATEA R A D AAERTEEE 167
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 772 STEKKPEVSKHKQDTKSDSPRLKSE R A E ALKQRP D GRSVSESLRR D HDNKQKSD D R G ESERH R GDQS R VRRPE tl RSSSR 851
Cdd:PRK12678 168 ERDERRRRGDREDRQAEAERGERGR R E E RGRDGD D RDRRDRREQG D RREERGRR D G G DRRGR R RRRD R RDARG -- DDNRE 245
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 119576354 852 NEHGIKS D SSKTDKLE R KH R HESG D S R E R PSSGEQKS R P ds P RVKQG D 899
Cdd:PRK12678 246 DRGDRDG D DGEGRGGR R GR R FRDR D R R G R RGGDGGNE R E -- P ELRED D 291
PDS5
cd19953
Sister chromatid cohesion protein PDS5; Pds5 plays a crucial role in sister chromatid cohesion. ...
1786-1882
4.83e-04
Sister chromatid cohesion protein PDS5; Pds5 plays a crucial role in sister chromatid cohesion. Together with WapI and Scc3, it is involved in the release of the cohesin complex from chromosomes during S phase. The core of the cohesin complex consists of a coiled-coiled heterodimer of Smc1 and Smc30, together with Scc1 (also called kleisin). Pds5 interacts with Scc1 via a conserved patch on the surface of its heat repeats. Pds5 also promotes the acetylation of Smc3 that protects cohesin from releasing activity in G2 phase.
Pssm-ID: 410996 [Multi-domain]
Cd Length: 630
Bit Score: 45.59
E-value: 4.83e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1786 IA VR TK A M K C L SEVV A VDP S - IL A R ldmqrg VH -------- GR LM D N S TS VR E A A VE LLGRFV L CR P Q LAE QYYDM L IE R 1856
Cdd:cd19953 259 VD VR LL A T K L L GKMF A EKG S a GF A Q ------ TY pslwkefl GR FN D K S PE VR L A W VE SAKHIL L NH P D LAE DILEA L KK R 332
90 100
....*....|....*....|....*.
gi 119576354 1857 I LD TGIS VR KRVI K ILR D ICI E QPTF 1882
Cdd:cd19953 333 L LD PDEK VR LAAV K AIC D LAY E DLLH 358
PDS5
pfam20168
Sister chromatid cohesion protein PDS5 protein; This entry represents the Sister chromatid ...
1785-1928
5.38e-04
Sister chromatid cohesion protein PDS5 protein; This entry represents the Sister chromatid cohesion protein PDS5. The large PDS5 molecule is exclusively alpha helical, composed of a large number of HEAT-like repeats and helical extensions/additions that deviate from the HEAT repeat pattern.
Pssm-ID: 466319 [Multi-domain]
Cd Length: 1051
Bit Score: 45.66
E-value: 5.38e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1785 AI AVR TKAMKCLSEVVAVD P SI la R LDMQRGVHG RL M D NSTS VR E AAV ELL G RF ------- V LCRPQ L AE qyydm L I ER I 1857
Cdd:pfam20168 297 SV AVR IAWVEAAKQILLNH P DL -- R SEILEALKD RL L D PDEK VR L AAV KAI G DL dyetllh V VSEKL L KT ----- L A ER L 369
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1858 L D TGI SVRK RVI K I L RDI ------- CI E QP tf PKIT E MCV ---- K MIR -- RV ND E E g I KK LV NETFQKLWF t P TPHN D K E 1924
Cdd:pfam20168 370 R D KKP SVRK EAL K T L AKL ynvayge IE E GD -- EEAI E KFG wipn K ILH ly YI ND P E - I RA LV ERVLFEYLL - P ALLD D E E 445
....
gi 119576354 1925 AMT R 1928
Cdd:pfam20168 446 RVK R 449
PspC_subgroup_1
NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
475-759
1.52e-03
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain]
Cd Length: 684
Bit Score: 43.85
E-value: 1.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 475 AE I E RIE R ES A IER E RFS K E V -- QDK DKP l K K R KQDSYPQ E AGGATGGNRP A SQETG S T G NGSR P A lmvsidlhqagrv D 552
Cdd:NF033838 233 AE E E AKR R AD A KLK E AVE K N V at SEQ DKP - K R R AKRGVLG E PATPDKKEND A KSSDS S V G EETL P S ------------- P 298
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 553 S QASITQDSDSI KK P EE I - K QCN D apvsvl Q ED ivgslk STPE N H P ETPK K KSDP E LSK S EM K QS E SR L aeskpnen R LV 631
Cdd:NF033838 299 S LKPEKKVAEAE KK V EE A k K KAK D ------ Q KE ------ EDRR N Y P TNTY K TLEL E IAE S DV K VK E AE L -------- E LV 358
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 632 ETKSS E NKL E T K VETQTEEL kqnesrtt E C K QN E S T IV E PKQNENR lsd TKPNDN K QNNGRSETT K SR P - E T P KQKGESR 710
Cdd:NF033838 359 KEEAK E PRN E E K IKQAKAKV -------- E S K KA E A T RL E KIKTDRK --- KAEEEA K RKAAEEDKV K EK P a E Q P QPAPAPQ 427
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 711 PE T P KQ K SDGHP E T PK - Q K GDGR ---------- P E TPKQKGESR P et PK QKNEGR P E TPK 759
Cdd:NF033838 428 PE K P AP K PEKPA E Q PK a E K PADQ qaeedyarrs E E EYNRLTQQQ P -- PK TEKPAQ P S TPK 485
PTZ00108
PTZ00108
DNA topoisomerase 2-like protein; Provisional
610-919
2.70e-03
DNA topoisomerase 2-like protein; Provisional
Pssm-ID: 240271 [Multi-domain]
Cd Length: 1388
Bit Score: 43.50
E-value: 2.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 610 KS E MKQS E SR L AES K pne N RLVET -- KSSEN K L E TKV E t QT EE LKQN E SRTTECKQNESTIVEP K QNENR L SDTKPNDN K 687
Cdd:PTZ00108 1108 NA E LEKK E KE L EKL K --- N TTPKD mw LEDLD K F E EAL E - EQ EE VEEK E IAKEQRLKSKTKGKAS K LRKPK L KKKEKKKK K 1183
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 688 qnngr S ETT KS RPETPKQ kgesrp ETPKQK SD GHPETPKQKGDGRPETPKQKG E SRP E TPKQKNEGRPETP K HRHD N RRD 767
Cdd:PTZ00108 1184 ----- S SAD KS KKASVVG ------ NSKRVD SD EKRKLDDKPDNKKSNSSGSDQ E DDE E QKTKPKKSSVKRL K SKKN N SSK 1252
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 768 S GKPST E KKPEVSKHKQDT K SDSP R LKS er AEALKQR P DG R S vseslrrdh D NKQKSDDRGE S ERHRGDQS R VR rpetlr 847
Cdd:PTZ00108 1253 S SEDND E FSSDDLSKEGKP K NAPK R VSA -- VQYSPPP P SK R P --------- D GESNGGSKPS S PTKKKVKK R LE ------ 1315
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 119576354 848 sssrnehg IKSDSS K TD K LER K HRHESGD S RE R P -- S S GE Q K SR PDSPRV K QGDSNK S RS D KLGFKSPTSKD D K 919
Cdd:PTZ00108 1316 -------- GSLAAL K KK K KSE K KTARKKK S KT R V kq A S AS Q S SR LLRRPR K KKSDSS S ED D DDSEVDDSEDE D D 1381
PRK08581
PRK08581
amidase domain-containing protein;
585-869
2.74e-03
amidase domain-containing protein;
Pssm-ID: 236304 [Multi-domain]
Cd Length: 619
Bit Score: 43.24
E-value: 2.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 585 IVGS L k ST P ENHPET P K K K S DPELSKSEM K Q S ESR l AE SK PNENRLVETKSSE N KLET kv ETQTEELKQNE S R T TE ckqn 664
Cdd:PRK08581 16 VLPT L - TS P TAYADD P Q K D S TAKTTSHDS K K S NDD - ET SK DTSSKDTDKADNN N TSNQ -- DNNDKKFSTID S S T SD ---- 87
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 665 EST I VEPKQNENRLSDTKP -- ND NK QNNGR S E TT KSRPETPKQKGE S RP E T P KQKSDGHPETP K QKGDGRPETPKQKGES 742
Cdd:PRK08581 88 SNN I IDFIYKNLPQTNINQ ll TK NK YDDNY S L TT LIQNLFNLNSDI S DY E Q P RNSEKSTNDSN K NSDSSIKNDTDTQSSK 167
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 743 RPETPK QK NEGRPE T PKHRHDNRRD S G KP STEKKPEVSKHKQ DT KSDSP rl K S ERAEALKQRPDGRSVSES lr RDHDN K Q 822
Cdd:PRK08581 168 QDKADN QK APSSNN T KPSTSNKQPN S P KP TQPNQSNSQPASD DT ANQKS -- S S KDNQSMSDSALDSILDQY -- SEDAK K T 243
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 119576354 823 KS D DRGE S ERHRGDQ S RVRR P E t L RSSSRNE H GI K SDS S KTDKLERK 869
Cdd:PRK08581 244 QK D YASQ S KKDKTET S NTKN P Q - L PTQDELK H KS K PAQ S FENDVNQS 289
PTZ00112
PTZ00112
origin recognition complex 1 protein; Provisional
588-785
3.19e-03
origin recognition complex 1 protein; Provisional
Pssm-ID: 240274 [Multi-domain]
Cd Length: 1164
Bit Score: 43.05
E-value: 3.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 588 SL K STPE N HPETPKK K S D PELS K SEMKQ SE SRLAESKPNEN R LVETKSS ENK LET K VETQTEELKQNES R TTE C KQNE S T 667
Cdd:PTZ00112 219 ND K NKEK N KEKDKNI K K D RDGD K QTKRN SE KSKVQNSHFDV R ILRSYTK ENK KDE K NVVSGIRSSVLLK R KSQ C LRKD S Y 298
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 668 IVEPK Q NENRLS D T K p N DNKQ NNG R S ETTKSRPETPKQK G ES R P etpkqk S DGH P ET P KQ K GDGRPE T PKQ K g ESRPETP 747
Cdd:PTZ00112 299 VYSNH Q KKAKTG D P K - N IIHR NNG S S NSNNDDTSSSNHL G SN R I ------ S NRN P SS P YK K QTTTKH T NNT K - NNKYNKT 370
170 180 190
....*....|....*....|....*....|....*...
gi 119576354 748 K QKNEGRPETPK H RHD N R R D S GK P ST E K K PEVSKH K QD 785
Cdd:PTZ00112 371 K TTQKFNHPLRH H ATI N K R S S ML P MS E Q K GRGASE K SE 408
PTZ00121
PTZ00121
MAEBL; Provisional
563-1201
3.93e-03
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain]
Cd Length: 2084
Bit Score: 42.82
E-value: 3.93e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 563 S I K K P EE IKQCNDAPVSVLQE DI VGSLK stp ENHP E TPKKKSDPELSKSEM K Q S ESRLAESKPNE NR LV E TKSSENKL et 642
Cdd:PTZ00121 1025 N I E K I EE LTEYGNNDDVLKEK DI IDEDI --- DGNH E GKAEAKAHVGQDEGL K P S YKDFDFDAKED NR AD E ATEEAFGK -- 1099
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 643 kvetq T EE L K QN E SRTT E CKQNESTIVEPKQNENRLSDTK pndnkqnng RS E TTKSRP E TP K QKGES R P E TPKQKS D GHP 722
Cdd:PTZ00121 1100 ----- A EE A K KT E TGKA E EARKAEEAKKKAEDARKAEEAR --------- KA E DARKAE E AR K AEDAK R V E IARKAE D ARK 1165
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 723 - E TPKQKG D GRPETPKQ K G E S -- RP E TPKQKNEG R PETPKHRHDNR R --- DSG K PSTE KK P E VS K HKQDT K S D SPRL K se 796
Cdd:PTZ00121 1166 a E EARKAE D AKKAEAAR K A E E vr KA E ELRKAEDA R KAEAARKAEEE R kae EAR K AEDA KK A E AV K KAEEA K K D AEEA K -- 1243
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 797 R AE ALKQRPDG R SVS E S l R RD H DNKQKSDDRG E s E RHRG D QSR ---- VRRPETLRSSSR ---- N E HGI K SDSS - K T D KLE 867
Cdd:PTZ00121 1244 K AE EERNNEEI R KFE E A - R MA H FARRQAAIKA E - E ARKA D ELK kaee KKKADEAKKAEE kkka D E AKK K AEEA k K A D EAK 1321
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 868 R K HRHESGDSRERPSSG E QKSRPDSPRVKQGDSNKSRSDKLGF K SPTSKDD K RTEGN K SKVDTN KA HPDN KA E --- FPSY 944
Cdd:PTZ00121 1322 K K AEEAKKKADAAKKKA E EAKKAAEAAKAEAEAAADEAEAAEE K AEAAEKK K EEAKK K ADAAKK KA EEKK KA D eak KKAE 1401
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 945 LLGGRSGA LK NFVIP K I K R D KDGNVTQ E T KK ME mkg E P K D K V E KIGLVEDLN K G A K pvvvl QKLSLDDVQ K li K DR E D K S 1024
Cdd:PTZ00121 1402 EDKKKADE LK KAAAA K K K A D EAKKKAE E K KK AD --- E A K K K A E EAKKADEAK K K A E ----- EAKKAEEAK K -- K AE E A K K 1471
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1025 RSSL K PIKNKPS K SNKGSIDQSVL K ELPP E LLAEI E STMPLC E RV K MNKR K RS ---- TVN E KP K YA E I - SSD E DNDS DE A 1099
Cdd:PTZ00121 1472 ADEA K KKAEEAK K ADEAKKKAEEA K KKAD E AKKAA E AKKKAD E AK K AEEA K KA deak KAE E AK K AD E A k KAE E KKKA DE L 1551
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1100 FESSRKRHKKDDD KA W E YEERDRRSSGDH R RS --- GHSH E G R RSSGGGR Y RNRSPSDS dm E DYSPPPSLSEV A RKM KK K E 1176
Cdd:PTZ00121 1552 KKAEELKKAEEKK KA E E AKKAEEDKNMAL R KA eea KKAE E A R IEEVMKL Y EEEKKMKA -- E EAKKAEEAKIK A EEL KK A E 1629
650 660
....*....|....*....|....*
gi 119576354 1177 KQ KK RKAYEP K LTP EE MMDSSTF K R 1201
Cdd:PTZ00121 1630 EE KK KVEQLK K KEA EE KKKAEEL K K 1654
PRK14949
PRK14949
DNA polymerase III subunits gamma and tau; Provisional
337-679
6.55e-03
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237863 [Multi-domain]
Cd Length: 944
Bit Score: 42.02
E-value: 6.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 337 E Q S EKAAMYDIISSPSK DST K ltl RL S R V RS S DM D Q Q E D MI S GVE NS NVSEN -- D IPFNVQYPGQTSKTPITPQDINR PL 414
Cdd:PRK14949 473 E A S SSLDADNSAVPEQI DST A --- EQ S V V NP S VT D T Q V D DT S ASN NS AADNT vd D NYSAEDTLESNGLDEGDYAQDSA PL 549
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 415 N A A Q CLSQQEQTAFLP A NQVPVLQQNTSVA A KQPQTSVVQN Q QQISQQGPIYDEVE ------ LDA - LA E ierie R E S AIE 487
Cdd:PRK14949 550 D A Y Q DDYVAFSSESYN A LSDDEQHSANVQS A QSAAEAQPSS Q SLSPISAVTTAAAS ladddi LDA v LA A ----- R D S LLS 624
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 488 r ERFSKEVQDK D -- K PLKK RK QDSY P QE A GG A tggnr PA S QETG S TGNGSRP A LMVSIDLHQAGRVD S QASITQD S D S IK 565
Cdd:PRK14949 625 - DLDALSPKEG D gk K SSAD RK PKTP P SR A PP A ----- SL S KPAS S PDASQTS A SFDLDPDFELATHQ S VPEAALA S G S AP 698
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 566 K P EEIKQCN D A P vsvlqedivgslkst P ENHPETPKKKS D PELSKS E MKQ SES RLAE S KPNENRL vetkss E NKLETKVE 645
Cdd:PRK14949 699 A P PPVPDPY D R P --------------- P WEEAPEVASAN D GPNNAA E GNL SES VEDA S NSELQAV ------ E QQATHQPQ 757
330 340 350
....*....|....*....|....*....|....
gi 119576354 646 T Q T E elkqn ESRTTECKQNES T IV E PKQN E NR L S 679
Cdd:PRK14949 758 V Q A E ----- AQSPASTTALTQ T SS E VQDT E LN L V 786
PTZ00108
PTZ00108
DNA topoisomerase 2-like protein; Provisional
862-1103
6.76e-03
DNA topoisomerase 2-like protein; Provisional
Pssm-ID: 240271 [Multi-domain]
Cd Length: 1388
Bit Score: 41.96
E-value: 6.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 862 K TDK L ER K HRHESGDS R ERPSSGEQ K SRPD S PRV K QGDSNKSRSD K LGFKSPTS K D D KR teg NKS K VDTNKAHPDNKA E F 941
Cdd:PTZ00108 1154 K EQR L KS K TKGKASKL R KPKLKKKE K KKKK S SAD K SKKASVVGNS K RVDSDEKR K L D DK --- PDN K KSNSSGSDQEDD E E 1230
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 942 PSYLLGGR S GALKNFVIPKIKRDKDG N VTQETKKMEMK G E PK DKVEKIGL V EDLNKGAKPVVVLQKLSLDDVQKLI K DRE 1021
Cdd:PTZ00108 1231 QKTKPKKS S VKRLKSKKNNSSKSSED N DEFSSDDLSKE G K PK NAPKRVSA V QYSPPPPSKRPDGESNGGSKPSSPT K KKV 1310
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 1022 D K SRSSLKPIKN K PS KS N K GSIDQSVL K ELPPELL A EIE S TM plce RVKMN K R K RSTVN E KPKYA E ISSD ED N D SDEAFE 1101
Cdd:PTZ00108 1311 K K RLEGSLAALK K KK KS E K KTARKKKS K TRVKQAS A SQS S RL ---- LRRPR K K K SDSSS E DDDDS E VDDS ED E D DEDDED 1386
..
gi 119576354 1102 SS 1103
Cdd:PTZ00108 1387 DD 1388
DUF612
pfam04747
Protein of unknown function, DUF612; This family includes several uncharacterized proteins ...
468-906
9.86e-03
Protein of unknown function, DUF612; This family includes several uncharacterized proteins from Caenorhabditis elegans.
Pssm-ID: 282585 [Multi-domain]
Cd Length: 511
Bit Score: 41.20
E-value: 9.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 468 E V E LDAL A EI E RIERESAI E R ER FS KE VQD K DKP LKK RKQDSYPQE A GG A TGGNRPASQETG ST GNGSRPALM V S idlhq 547
Cdd:pfam04747 106 E A E AKKR A AQ E EEHKQWKA E Q ER IQ KE QEK K EAD LKK LQAEKKKEK A VK A EKAEKAEKTKKA ST PAPVEEEIV V K ----- 180
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 548 agrvdsqa SITQ D SDSIKK PE eikqcndapvsvlqedivgsl KS TP E N H P ET P KKKSDP --- ELS K SEM K Q SES RLAESK 624
Cdd:pfam04747 181 -------- KVAN D RSAAPA PE --------------------- PK TP T N T P AE P AEQVQE itg KKN K KNK K K SES EATAAP 231
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 625 PNENRL VE TK --- SS E NKLETKVETQTEELKQNE S RTTECKQNES T I VEP KQNENRLSDTKPND NK QNNGR SE TT K SRP E 701
Cdd:pfam04747 232 ASVEQV VE QP kvv TE E PHQQAAPQEKKNKKNKRK S ESENVPAASE T P VEP VVETTPPASENQKK NK KDKKK SE SE K VVE E 311
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 702 TPKQKG esr P ETP K QKS D GHPE ----- T P K QKGDGR P - ETP KQKG E SRP E TPKQKNEGRPE TP khrhdnrrdsgk P S TE K 775
Cdd:pfam04747 312 PVQAEA --- P KSK K PTA D DNMD fldfv T A K EEPKDE P a ETP AAPV E EVV E NVVENVVEKST TP ------------ P A TE N 376
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119576354 776 K PEVS K H K Q dt KS D S PRLKSERA E ALKQR P DGRS V S E SLRRDHD NK Q K S - D D RGE SE RHRGDQSR V RRPETLRSSSRNEH 854
Cdd:pfam04747 377 K KKNK K D K K -- KS E S EKVTEQPV E SAPAP P QVEQ V V E TTPPASE NK K K N k K D KKK SE SEKAVEEP V QAAPSSKKPTADDN 454
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....
gi 119576354 855 GIKS D -- SS K T DK L E RKHR H ESGDSRER P SSGEQKSRPDSPRV K QGDSN K SRSD 906
Cdd:pfam04747 455 MDFL D fv TA K P DK S E SVEE H IAAPMIVE P AHADEETAAAAEGK K KNKKD K KKKE 508
Blast search parameters
Data Source:
Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd Low complexity filter: no Composition Based Adjustment: yes E-value threshold: 0.01