Conserved domains on  [gi|734521674|gb|AJA06044|]
mKate2/TALER10 fusion protein [Cloning vector pT9_T9x3_72-mKate2-2A-TALER10-4xTarget_T18a]

List of domain hits

Name Accession Description Interval E-value
AvrBs3 super family cl49339
type III secretion system effector avirulence protein AvrBs3;
258-1405 0e+00

type III secretion system effector avirulence protein AvrBs3;


The actual alignment was detected with superfamily member NF041308:

Pssm-ID: 469205 [Multi-domain]  Cd Length: 1179  Bit Score: 1829.19  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  258 IRPRRPSPARELLPGPQPDRVQPTADRGVSAPAGSPLDGLPARRTVSRTRLPSPPAPSPAFSAGSFSDLLRPFDPSLLDT 337
Cdd:NF041308    4 IRSRTPSPAREPQAGSQPDGVQPIAGRLVSTAASSPLDGLPARPAMSRTRQPATPAPSPAFSVGSFSDLLRQFDPSLFDP 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  338 SLLDSMPAVGTPHTAAAPAEWDEAQSALRAADDPPPTVRVAVTAARP--PRAKPAPRRRAAQPSDASPAAQVDLRTLGYS 415
Cdd:NF041308   84 SLFDSSPAFGAHHADAAPGEMDEVQSGLRAADDPQSHLSAAVTAPSPtpPRTQAAARRRSAQTSDASPAESVDLSTLGYT 163
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  416 QQQQEKIKPKVRSTVAQHHEALVGHGFTHAHIVALSQHPAALGTVAVTYQHIITALPEATHEDIVGVGKQWSGARALEAL 495
Cdd:NF041308  164 QQQQEQIKPNARSTVAQHHAALVGHGFTHAHIVELSKHAAALGTVADRYQAIIAVLPEATHKDIVEVGKQWSGARALQAL 243
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  496 LTDAGELRGPPLQLDTGQLVKIAKRGGVTAMEAVHASRNALTGAPLNLTPDQVVAIASNIGGKQALETVQRLLPVLCQD- 574
Cdd:NF041308  244 LMVAEELRGPPLQLDTGQLIKIAKRGGAPAVEAVHASRNALTGAPLHLTPHQVVAIASNNGGKQALETVQRLLPVLCQPp 323
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  575 HGLTPDQVVAIASHDGGKQALETVQRLLPVLCQ-DHGLTPDQVVAIASHDGGKQALETVQRLLPVLCQ-DHGLTPDQVVA 652
Cdd:NF041308  324 HGLTPEQVVAIASNDGGKQALETVQRLLPVLCQaEHGLTPDQVVAIASNIGGKPALETVQRLLPVLCQpPHGLTPDQVVA 403
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  653 IASNGGGKQALETVQRLLPVLCQD-HGLTPDQVVAIASHDGGKQALETVQRLLPVLCQDHGLTPDQVVAIASNIGGKQAL 731
Cdd:NF041308  404 IASNDGGKQALETVQRLLPVLCQApHGLTPDQVVAIASNDGGKQALETVQRLLPELCQAHGLTPDQVVAIASNGGGKQAL 483
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  732 ETVQRLLPVLCQ-DHGLTPDQVVAIASNGGGKQALETVQRLLPVLCQ-DHGLTPDQVVAIASHDGGKQALETVQRLLPVL 809
Cdd:NF041308  484 ETVQRLLPVLCQpPHGLTPEQVVAIASNGGGKQALETVQRLLPVLCQpPHGLTPEQVVAIASHDGGKQALETVHRLLPVL 563
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  810 CQdhgltpdqvvaiasniggkqaletvqrllpvlcQDHGLTPDQVVAIASNNGGKQALETVQRLLPVLCQD-HGLTPDQV 888
Cdd:NF041308  564 CQ---------------------------------APHGLTPEQVVAIASHNGGKQALETVQRLLPVLCQRpYGLTPNQV 610
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  889 VAIASNNGGKQALETVQRLLPVLCQ-DHGLTPDQVVAIASNIGGKQALETVQRLLPVLCQD-HGLTPDQVVAIASNIGGK 966
Cdd:NF041308  611 VAIASNDGGKQALETVQRLLPVLCQaPHGLTPDQVVAIASNGGGKQALETVQRLLPVLCQRpHGLTPHQVVAIASNDGGK 690
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  967 QALETVQRLLPVLCQ-DHGLTPDQVVAIASHDGGKQALETVQRLLPVLCQD-HGLTPDQVVAIASNIGGKQALETVQRLL 1044
Cdd:NF041308  691 QALETVQRLLPVLCQpPYGLTPEQVVAIASNNGGKQALETVQRLLPVLCQRpHGLTPDQVVAIASNDGGKQALETVQRLL 770
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674 1045 PVLCQD-HGLTPDQVVAIASNGGGKQALETVQRLLPVLCQD-HGLTPDQVVAIASNNGGKQALETVQRLLPVLCQDHGLT 1122
Cdd:NF041308  771 PVLCQPpHGLTPDQVVAIASNDGGKQALETVQRLLPVLCDApHGLTPHQVVAIASNIGGRQALETVQRLLPVLCQAHGLT 850
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674 1123 PDQVVAIASNGGGKQALETVQRLLPVLCQ-DHGLTPDQVVAIASNGGGKQALESIVAQLSRPDPALAALTNDHLVALACL 1201
Cdd:NF041308  851 PDQVVAIASNNGGKQALETVQRLLPVLCQpPHGLTPHQVVAIASNIGGKQALESVVAQLSSPDPALAALTNDRLVALACI 930
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674 1202 GGRPAMDAVKKGLPHAPELIRRVNRRIGERTSHRVADYAQVVRVLEFFQCHSHPAYAFDEAMTQFGMSRNGLVQLFRRVG 1281
Cdd:NF041308  931 GGRPALNAVKKGLPHAVALIRKMNNRVPERTAHLVADLTQVVRVLSFFQCHSNPAQAFHEAMTQFEMSRQGLLQLFRRVG 1010
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674 1282 VTELEARGGTLPPASQRWDRILQASGMkraKPSPTSAQTPDQASLHAFADSLERDLDAPSPMHEGDQTRASSRKRSRSDR 1361
Cdd:NF041308 1011 VTELEARSGTLPPASQRWQRILHALGL---KPSSASAQTPGQESLHAFADSLERELDAPSPMQDASQAGSSSRKRSRSDD 1087
                        1130      1140      1150      1160
                  ....*....|....*....|....*....|....*....|....*.
gi 734521674 1362 AVTGPSAQQAVEVRVPEQRDALHLP--LSWRVKRPRTRIWGGLPDP 1405
Cdd:NF041308 1088 PVHGFPAQQIAEALIPEHRDAPHLLplSSWGAKRRRSRIAGGLPDP 1133
GFP pfam01353
Green fluorescent protein;
10-220 1.58e-48

Green fluorescent protein;


Pssm-ID: 426217  Cd Length: 211  Bit Score: 171.98  E-value: 1.58e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674    10 MHMKLYMEGTVNNHHFKCTSEGEGKPYEGTQTMRIKAVEGgPLPFAFDILATSFMYgsKTFINHTQGiPDFFKQSFPEG- 88
Cdd:pfam01353    1 MTHDLHMEGSVNGHEFDIVGGGNGNPNDGSLETKVKSTKG-ALPFSPYLLAPHL*Y--YQYLPFPDG-TSPFQAAVENGg 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674    89 FTWERVTTYEDGGVLTATQDTSLQDGCLIYNVKIRGVNFPSNGPVMQKKTLGWEASTETL-YPADGGLEGRADMALKLVG 167
Cdd:pfam01353   77 YQVHRTFKFEDGGVLTIVFTYTYEGGHIKGEFTFQGSGFPPDGPVMTKSLTGWDPSVEKMiPRNDKTLVGDINWSLKLTD 156
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 734521674   168 GGHLICNLKTTYRSKKP-AKNLKMPGVYYVDRRLERIKEADKETYVEQHEVAVA 220
Cdd:pfam01353  157 GKRYRAQVVTNYTFAKPvPAGLKLPPPHFVFRKIERTGSKTEINLVEQQKAFVD 210
 
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
861-894 1.31e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 51.68  E-value: 1.31e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 734521674   861 NGGKQALETVQRLLPVLCQdHGLTPDQVVAIASN 894
Cdd:pfam03377    1 DGGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ...
210-722 1.81e-05

Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 49.49  E-value: 1.81e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  210 TYVEQHEVAvarycdLPSKLGHRQLEGRGSLLTCGDVEENPGPTGLSTIRPRRPSPARELLPGPQPDRVQPTADRGVSAP 289
Cdd:COG3321   865 TYPFQREDA------AAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAALALAAAALAALLALVALAAA 938
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  290 AGSPLDGLPARRTVSRTRLPSPPAPSPAFSAGSFSDLLRPFDPSLLDTSLLDSMPAVGTPHTAAAPAEWDEAQSALRAAD 369
Cdd:COG3321   939 AAALLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAALALLAAAALLLAAAAAA 1018
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  370 DPPPTVRVAVTAARPPRAKPAPRRRAAQPSDASPAAQVDLRTLGYSQQQQEKIKPKVRSTVAQHHEALVGHGFTHAHIVA 449
Cdd:COG3321  1019 AALLALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELALAAAALALAAALAAAALALA 1098
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  450 LSQHPAALGTVAVTYQHIITALPEATHEDIVGVGKQWSGARALEALLTDAGELRGPPLQLDTGQLVKIAKRGGVTAMEAV 529
Cdd:COG3321  1099 LAALAAALLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAALAAALAAALLAAAALLLALA 1178
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  530 HASRNALTGAPLNLTPDQVVAIASNIGGKQALETVQRLLPVLCQDHGLTPDQVVAIASHDGGKQALETVQRLLPVLCQDH 609
Cdd:COG3321  1179 LALAAALAAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALALLALAAAAAAVAALAAAAAALLAA 1258
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  610 GLTPDQVVAIASHDGGKQALETVQRLLPVLcqDHGLTPDQVVAIASNGGGKQALETVQRLLPVLCQDHGLTPDQVVAIAS 689
Cdd:COG3321  1259 LAALALLAAAAGLAALAAAAAAAAAALALA--AAAAAAAAALAALLAAAAAAAAAAAAAAAAAALAAALLAAALAALAAA 1336
                         490       500       510
                  ....*....|....*....|....*....|...
gi 734521674  690 HDGGKQALETVQRLLPVLCQDHGLTPDQVVAIA 722
Cdd:COG3321  1337 VAAALALAAAAAAAAAAAAAAAAAAALAAAAGA 1369
PHA03378 PHA03378
EBNA-3B; Provisional
244-422 5.73e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 47.75  E-value: 5.73e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  244 GDVEENPGPTGLSTIRPRRPSPARELLPGPQPDRVQPtadrgvsaPAGSPLDGLPARRTVSRTRLPSPPAPSPAFSAGSF 323
Cdd:PHA03378  670 GHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRP--------PAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAP 741
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  324 SDLLRPfdpslldtsllDSMPAVGTPHTAAAPAEWDEAQSALRAADDPPPtvRVAVTAARPPRAKPAPRrraaQPSDASP 403
Cdd:PHA03378  742 GRARPP-----------AAAPGRARPPAAAPGRARPPAAAPGAPTPQPPP--QAPPAPQQRPRGAPTPQ----PPPQAGP 804
                         170       180
                  ....*....|....*....|
gi 734521674  404 AA-QVDLRTLGYSQQQQEKI 422
Cdd:PHA03378  805 TSmQLMPRAAPGQQGPTKQI 824
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
258-399 1.20e-03

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 42.83  E-value: 1.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  258 IRPRRPSPARELLPGPQPDRVQPTADRG--VSAPAGSPLDGLPARRTVSRTRLPSPPAPSPAFSAGSFSDLLrPFDPSLL 335
Cdd:NF040712  188 IDPDFGRPLRPLATVPRLAREPADARPEevEPAPAAEGAPATDSDPAEAGTPDDLASARRRRAGVEQPEDEP-VGPGAAP 266
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 734521674  336 DTSLLDSMPAVGTPHTAAAPAEWDEAQSALRA--ADDPPPTVRVAVTAARPPRAKPAPRRRAAQPS 399
Cdd:NF040712  267 AAEPDEATRDAGEPPAPGAAETPEAAEPPAPApaAPAAPAAPEAEEPARPEPPPAPKPKRRRRRAS 332
 
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
895-928 1.31e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 51.68  E-value: 1.31e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 734521674   895 NGGKQALETVQRLLPVLCQdHGLTPDQVVAIASN 928
Cdd:pfam03377    1 DGGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
1099-1132 1.31e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 51.68  E-value: 1.31e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 734521674  1099 NGGKQALETVQRLLPVLCQdHGLTPDQVVAIASN 1132
Cdd:pfam03377    1 DGGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
623-656 1.35e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 51.68  E-value: 1.35e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 734521674   623 DGGKQALETVQRLLPVLCQdHGLTPDQVVAIASN 656
Cdd:pfam03377    1 DGGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
691-724 1.35e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 51.68  E-value: 1.35e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 734521674   691 DGGKQALETVQRLLPVLCQdHGLTPDQVVAIASN 724
Cdd:pfam03377    1 DGGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
793-826 1.35e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 51.68  E-value: 1.35e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 734521674   793 DGGKQALETVQRLLPVLCQdHGLTPDQVVAIASN 826
Cdd:pfam03377    1 DGGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
997-1030 1.35e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 51.68  E-value: 1.35e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 734521674   997 DGGKQALETVQRLLPVLCQdHGLTPDQVVAIASN 1030
Cdd:pfam03377    1 DGGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
589-622 1.83e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 51.30  E-value: 1.83e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 734521674   589 DGGKQALETVQRLLPVLCQdHGLTPDQVVAIASH 622
Cdd:pfam03377    1 DGGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
725-758 1.85e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 51.30  E-value: 1.85e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 734521674   725 IGGKQALETVQRLLPVLCQdHGLTPDQVVAIASN 758
Cdd:pfam03377    1 DGGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
827-860 1.85e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 51.30  E-value: 1.85e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 734521674   827 IGGKQALETVQRLLPVLCQdHGLTPDQVVAIASN 860
Cdd:pfam03377    1 DGGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
929-962 1.85e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 51.30  E-value: 1.85e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 734521674   929 IGGKQALETVQRLLPVLCQdHGLTPDQVVAIASN 962
Cdd:pfam03377    1 DGGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
1031-1064 1.85e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 51.30  E-value: 1.85e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 734521674  1031 IGGKQALETVQRLLPVLCQdHGLTPDQVVAIASN 1064
Cdd:pfam03377    1 DGGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
555-588 2.50e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 50.91  E-value: 2.50e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 734521674   555 IGGKQALETVQRLLPVLCQdHGLTPDQVVAIASH 588
Cdd:pfam03377    1 DGGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
963-996 2.50e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 50.91  E-value: 2.50e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 734521674   963 IGGKQALETVQRLLPVLCQdHGLTPDQVVAIASH 996
Cdd:pfam03377    1 DGGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
1066-1098 5.16e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 50.14  E-value: 5.16e-08
                           10        20        30
                   ....*....|....*....|....*....|...
gi 734521674  1066 GGKQALETVQRLLPVLCQdHGLTPDQVVAIASN 1098
Cdd:pfam03377    2 GGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
1134-1166 5.16e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 50.14  E-value: 5.16e-08
                           10        20        30
                   ....*....|....*....|....*....|...
gi 734521674  1134 GGKQALETVQRLLPVLCQdHGLTPDQVVAIASN 1166
Cdd:pfam03377    2 GGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
658-690 6.99e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 49.75  E-value: 6.99e-08
                           10        20        30
                   ....*....|....*....|....*....|...
gi 734521674   658 GGKQALETVQRLLPVLCQdHGLTPDQVVAIASH 690
Cdd:pfam03377    2 GGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
760-792 6.99e-08

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 49.75  E-value: 6.99e-08
                           10        20        30
                   ....*....|....*....|....*....|...
gi 734521674   760 GGKQALETVQRLLPVLCQdHGLTPDQVVAIASH 792
Cdd:pfam03377    2 GGAQALEAVLEHGPALRQ-RGFSRADIVKIAGN 33
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ...
210-722 1.81e-05

Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 49.49  E-value: 1.81e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  210 TYVEQHEVAvarycdLPSKLGHRQLEGRGSLLTCGDVEENPGPTGLSTIRPRRPSPARELLPGPQPDRVQPTADRGVSAP 289
Cdd:COG3321   865 TYPFQREDA------AAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAALALAAAALAALLALVALAAA 938
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  290 AGSPLDGLPARRTVSRTRLPSPPAPSPAFSAGSFSDLLRPFDPSLLDTSLLDSMPAVGTPHTAAAPAEWDEAQSALRAAD 369
Cdd:COG3321   939 AAALLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAALALLAAAALLLAAAAAA 1018
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  370 DPPPTVRVAVTAARPPRAKPAPRRRAAQPSDASPAAQVDLRTLGYSQQQQEKIKPKVRSTVAQHHEALVGHGFTHAHIVA 449
Cdd:COG3321  1019 AALLALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELALAAAALALAAALAAAALALA 1098
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  450 LSQHPAALGTVAVTYQHIITALPEATHEDIVGVGKQWSGARALEALLTDAGELRGPPLQLDTGQLVKIAKRGGVTAMEAV 529
Cdd:COG3321  1099 LAALAAALLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAALAAALAAALLAAAALLLALA 1178
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  530 HASRNALTGAPLNLTPDQVVAIASNIGGKQALETVQRLLPVLCQDHGLTPDQVVAIASHDGGKQALETVQRLLPVLCQDH 609
Cdd:COG3321  1179 LALAAALAAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALALLALAAAAAAVAALAAAAAALLAA 1258
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  610 GLTPDQVVAIASHDGGKQALETVQRLLPVLcqDHGLTPDQVVAIASNGGGKQALETVQRLLPVLCQDHGLTPDQVVAIAS 689
Cdd:COG3321  1259 LAALALLAAAAGLAALAAAAAAAAAALALA--AAAAAAAAALAALLAAAAAAAAAAAAAAAAAALAAALLAAALAALAAA 1336
                         490       500       510
                  ....*....|....*....|....*....|...
gi 734521674  690 HDGGKQALETVQRLLPVLCQDHGLTPDQVVAIA 722
Cdd:COG3321  1337 VAAALALAAAAAAAAAAAAAAAAAAALAAAAGA 1369
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
422-453 2.34e-05

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 42.44  E-value: 2.34e-05
                           10        20        30
                   ....*....|....*....|....*....|..
gi 734521674   422 IKPKVRSTVAQHHEALVGHGFTHAHIVALSQH 453
Cdd:pfam03377    2 GGAQALEAVLEHGPALRQRGFSRADIVKIAGN 33
PHA03378 PHA03378
EBNA-3B; Provisional
244-422 5.73e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 47.75  E-value: 5.73e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  244 GDVEENPGPTGLSTIRPRRPSPARELLPGPQPDRVQPtadrgvsaPAGSPLDGLPARRTVSRTRLPSPPAPSPAFSAGSF 323
Cdd:PHA03378  670 GHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRP--------PAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAP 741
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  324 SDLLRPfdpslldtsllDSMPAVGTPHTAAAPAEWDEAQSALRAADDPPPtvRVAVTAARPPRAKPAPRrraaQPSDASP 403
Cdd:PHA03378  742 GRARPP-----------AAAPGRARPPAAAPGRARPPAAAPGAPTPQPPP--QAPPAPQQRPRGAPTPQ----PPPQAGP 804
                         170       180
                  ....*....|....*....|
gi 734521674  404 AA-QVDLRTLGYSQQQQEKI 422
Cdd:PHA03378  805 TSmQLMPRAAPGQQGPTKQI 824
PHA03247 PHA03247
large tegument protein UL36; Provisional
252-405 5.28e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 5.28e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  252 PTGLSTIRPRRPSPARELLPGPQPDRVQPTADRGVSAPAGSPLDGLPARRTVSRTRLPSPPAPSPAFSAGSFSDlLRPFD 331
Cdd:PHA03247 2720 PLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE-SRESL 2798
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  332 PSLLDTSlldSMPAVGTPHTAAAPAewdeaqSALRAADDPPPTVRVAVTAARPPRAKPAP--------------RRRAAQ 397
Cdd:PHA03247 2799 PSPWDPA---DPPAAVLAPAAALPP------AASPAGPLPPPTSAQPTAPPPPPGPPPPSlplggsvapggdvrRRPPSR 2869

                  ....*...
gi 734521674  398 PSDASPAA 405
Cdd:PHA03247 2870 SPAAKPAA 2877
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
258-399 1.20e-03

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 42.83  E-value: 1.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  258 IRPRRPSPARELLPGPQPDRVQPTADRG--VSAPAGSPLDGLPARRTVSRTRLPSPPAPSPAFSAGSFSDLLrPFDPSLL 335
Cdd:NF040712  188 IDPDFGRPLRPLATVPRLAREPADARPEevEPAPAAEGAPATDSDPAEAGTPDDLASARRRRAGVEQPEDEP-VGPGAAP 266
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 734521674  336 DTSLLDSMPAVGTPHTAAAPAEWDEAQSALRA--ADDPPPTVRVAVTAARPPRAKPAPRRRAAQPS 399
Cdd:NF040712  267 AAEPDEATRDAGEPPAPGAAETPEAAEPPAPApaAPAAPAAPEAEEPARPEPPPAPKPKRRRRRAS 332
PRK10307 PRK10307
colanic acid biosynthesis glycosyltransferase WcaI;
712-770 1.59e-03

colanic acid biosynthesis glycosyltransferase WcaI;


Pssm-ID: 236670 [Multi-domain]  Cd Length: 412  Bit Score: 42.66  E-value: 1.59e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 734521674  712 GLTPDQVVAIAS-NIGGKQALETV----QRLlpvlcQDHgltPDQVVAIASNGGGKQALETVQR 770
Cdd:PRK10307  224 GLPDGKKIVLYSgNIGEKQGLELVidaaRRL-----RDR---PDLIFVICGQGGGKARLEKMAQ 279
PRK10307 PRK10307
colanic acid biosynthesis glycosyltransferase WcaI;
1018-1076 1.59e-03

colanic acid biosynthesis glycosyltransferase WcaI;


Pssm-ID: 236670 [Multi-domain]  Cd Length: 412  Bit Score: 42.66  E-value: 1.59e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 734521674 1018 GLTPDQVVAIAS-NIGGKQALETV----QRLlpvlcQDHgltPDQVVAIASNGGGKQALETVQR 1076
Cdd:PRK10307  224 GLPDGKKIVLYSgNIGEKQGLELVidaaRRL-----RDR---PDLIFVICGQGGGKARLEKMAQ 279
TAL_effector pfam03377
TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair ...
520-554 1.68e-03

TAL effector repeat; The proteins in this family bind to DNA. Each repeat binds to a base pair in a predictable way. The structure shows that each repeat is composed of two alpha helices.


Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 37.43  E-value: 1.68e-03
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 734521674   520 RGGVTAMEAVHASRNALTGAplNLTPDQVVAIASN 554
Cdd:pfam03377    1 DGGAQALEAVLEHGPALRQR--GFSRADIVKIAGN 33
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
342-405 1.77e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.05  E-value: 1.77e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 734521674  342 SMPAVGTPHTAAAPAEWDEAQSALRAADDPPPTVRVAVTAARPPRAKPAPRRRAAQPSDASPAA 405
Cdd:PRK07764  433 PAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAA 496
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
260-402 1.95e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.56  E-value: 1.95e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  260 PRRPSPARELLP------GPQPDRVQPTADRGVSAPAGSPLDGLPARRTVSRTRLPSPPAPSPAFSAGSFSDLLRPFDps 333
Cdd:PRK12323  423 PARRSPAPEALAaarqasARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWE-- 500
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 734521674  334 lldtSLLDSMPAVGTPHTAAAPAEWDEAQS---ALRAADDPPPTVRVAVTAARPPRAKPAPRRRAAQ-PSDAS 402
Cdd:PRK12323  501 ----ELPPEFASPAPAQPDAAPAGWVAESIpdpATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPrPPRAS 569
PHA03247 PHA03247
large tegument protein UL36; Provisional
244-407 2.94e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 2.94e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  244 GDVEENPGPTGLSTIRPRRPSPArellPGPQPDRVQPTADRGVSAPAGSPLDGLPARRTVSRTRLPSPPAPSPAFSA--- 320
Cdd:PHA03247 2606 GDPRGPAPPSPLPPDTHAPDPPP----PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSppq 2681
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  321 -----------GSFSDLLRPFDPSllDTSLLDSMPAVGTPHTAAAPAEWDEAQSALRAADDPPPTVRVAVTAArPPRAKP 389
Cdd:PHA03247 2682 rprrraarptvGSLTSLADPPPPP--PTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPG-GPARPA 2758
                         170
                  ....*....|....*...
gi 734521674  390 APRRRAAQPSDASPAAQV 407
Cdd:PHA03247 2759 RPPTTAGPPAPAPPAAPA 2776
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ...
345-858 6.07e-03

Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 41.40  E-value: 6.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  345 AVGTPHTAAAPAEWDEAQSALRAADDPPPT------VRVAVTAARPPRAKPAPRRRAAQPSDASPAAQVDLRTLGYSQQQ 418
Cdd:COG3321   836 ALAQLWVAGVPVDWSALYPGRGRRRVPLPTypfqreDAAAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAA 915
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  419 QEKIKPKVRSTVAQHHEALVGHGFTHAHIVALSQHPAALGTVAVTYQHIITALPEATHEDIVGVGKQWSGARALEALLTD 498
Cdd:COG3321   916 AAALALAAAALAALLALVALAAAAAALLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAA 995
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  499 AGELRGPPLQLDTGQLVKIAKRGGVTAMEAVHASRNALTGAPLNLTPDQVVAIASNIGGKQALETVQRLLPVLCQDHGLT 578
Cdd:COG3321   996 LAAAAALALLAAAALLLAAAAAAAALLALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALA 1075
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  579 PDQVVAIASHDGGKQALETVQRLLPVLcQDHGLTPDQVVAIASHDGGKQALETVQRLLPVLCQDHGLTPDQVVAIASNGG 658
Cdd:COG3321  1076 ELALAAAALALAAALAAAALALALAAL-AAALLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAA 1154
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  659 GKQALETVQRLLPVLCQDHGLTPDQVVAIASHDGGKQALETVQRLLPVLCQDHGLTPDQVVAIASNIGGKQALETVQRLL 738
Cdd:COG3321  1155 AAAALAAALAAALLAAAALLLALALALAAALAAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALA 1234
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  739 PVLCQDHGLTPDQVVAIASNGGGKQALETVQRLLPVLCQDHGLTPDQVVAIASHDGGKQALETVQRLLPVLCQDHGLTPD 818
Cdd:COG3321  1235 LLALAAAAAAVAALAAAAAALLAALAALALLAAAAGLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAA 1314
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|
gi 734521674  819 QVVAIASNIGGKQALETVQRLLPVLCQDHGLTPDQVVAIA 858
Cdd:COG3321  1315 AAAAAALAAALLAAALAALAAAVAAALALAAAAAAAAAAA 1354
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
326-427 6.33e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.85  E-value: 6.33e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 734521674  326 LLR--PFDPSLLDTSLLDSMPAVGTPHTAAAPAEWDEAQSALRAADDPPPTVRVAVTAARPPRAKPAPRRRAAQ----PS 399
Cdd:PRK14951  358 LLRllAFKPAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAaapaAA 437
                          90       100
                  ....*....|....*....|....*...
gi 734521674  400 DASPAAQVDLRTLGYSQQQQEKIKPKVR 427
Cdd:PRK14951  438 PAAAPAAVALAPAPPAQAAPETVAIPVR 465
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
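
Every hit on this page carries a statistics line in the same layout (`Pssm-ID`, `Cd Length`, `Bit Score`, `E-value`), and the preset options give the E-value threshold applied to the search. The sketch below is one possible way to read such a line and compare its E-value with that threshold. The regular expression and the names `STAT_LINE`, `parse_stat_line`, and `passes_threshold` are illustrative assumptions and not part of any NCBI tool.

```python
import re

# Pattern for the per-hit statistics line as it appears on this page.
STAT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+).*?"
    r"Cd Length:\s*(?P<length>\d+)\s+"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s+"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_stat_line(line):
    """Return the numeric fields of a statistics line, or None if it does not match."""
    match = STAT_LINE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm"]),
        "cd_length": int(match["length"]),
        "bit_score": float(match["bits"]),
        "e_value": float(match["evalue"]),
    }


def passes_threshold(stats, threshold=0.01):
    """True when the hit's E-value is below the threshold (0.01 on this page)."""
    return stats["e_value"] < threshold


if __name__ == "__main__":
    example = "Pssm-ID: 397449 [Multi-domain]  Cd Length: 33  Bit Score: 51.68  E-value: 1.31e-08"
    stats = parse_stat_line(example)
    print(stats, passes_threshold(stats))
```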

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.