Conserved domains on  [gi|2569268747|ref|WP_306702044|]

hypothetical protein [Escherichia coli]

Protein Classification

similar to recombinase Flp protein (domain architecture ID 10083047)

protein similar to recombinase Flp protein


List of domain hits

Name  Accession  Description  Interval  E-value
INT_Flp_C  cd00217  Flp Tyrosine-based site-specific recombinases (also called integrases), C-terminal catalytic ...  7-421  0e+00

Flp Tyrosine-based site-specific recombinases (also called integrases), C-terminal catalytic domain; Yeast Flp-like recombinases mediate the amplification of the 2 micron circular plasmid copy number by catalyzing the intra-molecular recombination between two inverted repeats during replication. They belong to the DNA breaking-rejoining enzyme superfamily, which also includes prokaryotic tyrosine recombinases and type IB topoisomerases. These enzymes share the same fold in their catalytic domain, which contains six conserved active site residues, and the same overall reaction mechanism. Flp-like recombinases are almost exclusively found in yeast and are highly diverged in sequence from the prokaryotic tyrosine recombinases. They cleave their target DNA in trans with a composite active site in which the catalytic tyrosine is provided by a protomer bound to a site other than the one being cleaved. Thus each active site within Flp complexes is assembled by domain swapping and contains catalytic residues from two different monomers. Two DNA segments are synapsed by the tetrameric enzyme, carrying the nucleophilic tyrosine in each active site with only two of the four monomers active at a given time. The catalytic domain is linked through a flexible loop to the N-terminal domain, which is largely responsible for non-specific DNA binding and isomerization. Its overall fold is similar to the SAM domain fold also found in the N-terminal domains of lambda integrase and XerD recombinase.


Pssm-ID: 271174 [Multi-domain]  Cd Length: 410  Bit Score: 594.05  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2569268747   7 LCKTPPKVLVRQFVERFERPSGEKIALCAAELTYLCWMITHNGTAIKRATFMSYNTIISNSLSFDIVNKSLQFKYKTQKA 86
Cdd:cd00217     1 LIPVRPAILIELFLELFGKDKIEDKRKLASLLTYLILMAFPAITEVKRGTFRKYKTIISNSLSFDYSRKTIQFKYRLKKN 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2569268747  87 TILEASLKKLIPAWEFTIIPYYGQKHQSDITDIVSSlqlqFESSEEADKGNSHSKKMLKALLSEGESIWEITEKILNSFE 166
Cdd:cd00217    81 RLLQKGLEDAEPPYKFVILSDKRQEENLFIIDKVPL----EPNTESKHIRNSEVNLEFTNILSEKESIWKIIYKILDSFE 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2569268747 167 YTSRFTKTKTLYQFLFLATFINCGRFSDIKNVDPKSFKLVQNKYLGVIIQCLVTETKTSVSRHIYFFSARGRIDPLVYLD 246
Cdd:cd00217   157 ENTSRTTTKARYKLLLLATFTNCCRISDLKNLDPSTFELVKNKYLGTIVRAHVTETKTRISRTVYFFPARGRCDLLLALD 236
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2569268747 247 EFLRNSEPVLKRVNRTGNsssNKQEYQLLKDNLVRSYNKALKKNAPYSIFAIKNGPKSHIGRHLMTSFLSMKGLTELTNV 326
Cdd:cd00217   237 EYLRICKPIPKTVVSDQN---VNQKYQLLKESLVRSYNKFLSKHPAEPIFKIKNGPKSHLGRHLMASFLSKNELDKEANS 313
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2569268747 327 VGNWSDKRA--SAVARTTYTHQITAIPDHYFALVSRYYAYDPISKEmIALKDETNPIEEWQHIEQLKGSA-EGSIRYPAW 403
Cdd:cd00217   314 LGNWSKVREigSAVARRNYTHTITPCPDSLFAFISGYYQISPEGSE-IELNNNKNPMERVHTIPELPTSEdELQLRYGHW 392
                         410
                  ....*....|....*...
gi 2569268747 404 NGIISQEVLDYLSSYINR 421
Cdd:cd00217   393 AKIISHDVLAFLSEYSRK 410
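The alignment above pairs query residues with positions in the cd00217 model. As a minimal sketch of how such a block can be summarized, the following Python counts identical columns in one pair of rows. The function name `count_identities` is hypothetical, and the code assumes that lowercase letters mark residues not aligned to the model and that `-` marks a gap; both kinds of column are skipped.

```python
def count_identities(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two equal-length rows."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    identical = aligned = 0
    for q, s in zip(query, subject):
        # Assumption: only columns where both rows show an uppercase residue
        # are aligned; gaps ('-') and lowercase residues are skipped.
        if not (q.isupper() and s.isupper()):
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Final block of the cd00217 alignment, copied from the rows above.
query = "NGIISQEVLDYLSSYINR"    # gi 2569268747, residues 404-421
subject = "AKIISHDVLAFLSEYSRK"  # Cdd:cd00217, positions 393-410

identical, aligned = count_identities(query, subject)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

Concatenating the query rows and the model rows of every block before calling the function would give the figure for the whole hit rather than for the last block only.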
 
Flp_C  pfam05202  Recombinase Flp protein;  137-379  3.79e-144

Recombinase Flp protein;


Pssm-ID: 398741  Cd Length: 254  Bit Score: 410.61  E-value: 3.79e-144
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2569268747 137 NSHSKKMLKALLSEGESIWEITEKILNSFEYTSRFTKTKTLYQFLFLATFINCGRFSDIKNVDPKSFKLVQNKYLGVIIQ 216
Cdd:pfam05202   1 KFHQKKLEKALLNEGEDIWDITEKCFAMFENHSRETKSCILYKFIFLATFINACRFSDIINLDPKSFHLKKNKYLGTIIC 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2569268747 217 CLVTETKTSVSRHIYFFSARGR-IDPLVYLDEFLRNSE-------PVLKRVNRTGNSSSNKQEYQLLKDNLVRSYNKALK 288
Cdd:pfam05202  81 CHTFETKNNIPRHIQFFPARGRgCDMLQLLDEFLKINEngpfeyvPMLKNKNPIGNSNDNKQEYQFFKDGLGAAYNKALK 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2569268747 289 KNAPYSIFAIKNGPKSHIGRHLMTSFLSMKGLTELTNVVGNWSDK---RASAVARTTYTHQITAIPDHYFALVSRYYAYD 365
Cdd:pfam05202 161 KHAAHHIFAIKNAPKSDIGIHLMINFLNKIGLQEEGHRLGNFSDKcpiDASALAKRNFTHQITACHDHRDALRAIISAYD 240
                         250
                  ....*....|....
gi 2569268747 366 PISKEMIALKDETN 379
Cdd:pfam05202 241 PISKEMIALKDEMN 254
 
Flp_N  pfam03930  Recombinase Flp protein N-terminus;  44-128  1.52e-28

Recombinase Flp protein N-terminus;


Pssm-ID: 397837  Cd Length: 82  Bit Score: 107.12  E-value: 1.52e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2569268747  44 MITHNGTAIKRATFMSYNTIISNSLSFDIVNKSLQFKYKTQKATILEASLKKLIPAWEFTIipyYGQKHQSDITDIVSSL 123
Cdd:pfam03930   1 MATRKLTEIKRSTFTKYRRIISQSLQYDSSNKTVSFEYHLKRPTELKEGLSKAFKPYNFVI---KSHKKPTSMTTLFSSL 77

                  ....*
gi 2569268747 124 QLQFE 128
Cdd:pfam03930  78 HLKKE 82
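The three hits listed above (cd00217, pfam05202, pfam03930) each report an interval on the query and an E-value. The sketch below, with hypothetical names `DomainHit`, `parse_hit`, and `contains`, shows one way to hold those values and check how the intervals relate to each other. The values are copied from the hit list; nothing here calls an NCBI API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    name: str
    accession: str
    start: int   # first query residue covered (1-based, inclusive)
    end: int     # last query residue covered (inclusive)
    evalue: float

    @property
    def span(self) -> int:
        """Number of query residues covered by the hit."""
        return self.end - self.start + 1


def parse_hit(name: str, accession: str, interval: str, evalue: str) -> DomainHit:
    """Build a DomainHit from the text fields of one hit-list row."""
    start, end = (int(part) for part in interval.split("-"))
    return DomainHit(name, accession, start, end, float(evalue))


def contains(outer: DomainHit, inner: DomainHit) -> bool:
    """True if inner's interval lies entirely within outer's interval."""
    return outer.start <= inner.start and inner.end <= outer.end


hits = [
    parse_hit("INT_Flp_C", "cd00217", "7-421", "0e+00"),
    parse_hit("Flp_C", "pfam05202", "137-379", "3.79e-144"),
    parse_hit("Flp_N", "pfam03930", "44-128", "1.52e-28"),
]

for hit in hits:
    nested = contains(hits[0], hit)
    print(f"{hit.accession}: {hit.span} residues, within cd00217 interval: {nested}")
```

With these values, both Pfam intervals fall inside the cd00217 interval, and the Flp_N interval ends before the Flp_C interval begins.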
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.