NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1002286839|ref|XP_015647722|]
View 

pentatricopeptide repeat-containing protein At3g06920 [Oryza sativa Japonica Group]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
321-781 3.03e-42

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 166.98  E-value: 3.03e-42
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  321 AYNTMImgygSAGRFEDAYKLLERLRERGcipsvvsfnsiltcLGKKRKVDEAlSLFEVMKKDaepnsstyniiidmlcl 400
Cdd:PLN03218   376 AYNRLL----RDGRIKDCIDLLEDMEKRG--------------LLDMDKIYHA-KFFKACKKQ----------------- 419
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  401 gGRVEEAYRIldemehASLFPN--LLTVNIMVDRLCKARKLEEAYKIFESASQRGCNPDCVTYCSLIDGLGKKGQVDEAY 478
Cdd:PLN03218   420 -RAVKEAFRF------AKLIRNptLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMF 492
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  479 RLFEKMLDAGHNANPVVYTSLIRNFFIHGRKEDGHKIFKELIRRGCKPDLTLLNTYMDCVFKAGEVEKGRMIFEDIR--S 556
Cdd:PLN03218   493 EVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKaeT 572
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  557 YGFLPDVRSYSILIHGLTKAGQARETSNIFHAMKQQGFALDARAYNAVVDGFCKSGKVHKAYEILEEMKEKCVQPTVATY 636
Cdd:PLN03218   573 HPIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFF 652
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  637 GAIVDGLAKIDRLDEAYMLFEEAKSKGIELNVVLYSSLIDGFGKVGRIDEAYLILEEMMKKGLTPNVYTWNSLLDALVKA 716
Cdd:PLN03218   653 SALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEG 732
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1002286839  717 EEINEALVCFQSMKEMKCPPNTYTYSILINGLCRVQKYNKAFVFWQDMQKQGLVPNVVTYTTMIS 781
Cdd:PLN03218   733 NQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCITG 797
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
247-296 8.28e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 66.23  E-value: 8.28e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 247 PDIVLYNVCIDCFGKAGNVDMAWKFFHELKAQGLKPDDVSYTSMIWVLCK 296
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
771-820 3.92e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 55.83  E-value: 3.92e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 771 PNVVTYTTMISGLAKVGNITDAYSLFERFKANGGIPDAASFNALIEGMSN 820
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 super family cl38385
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
806-855 1.06e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam13041:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.51  E-value: 1.06e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 806 PDAASFNALIEGMSNANRAMEAYQVFEETRLRGCRINIKSCISLLDALNK 855
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
285-317 2.65e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


:

Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.67  E-value: 2.65e-05
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002286839 285 VSYTSMIWVLCKAGRLGEAEELFAQMEAERSVP 317
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEP 33
PPR_2 super family cl38385
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
181-226 5.66e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam13041:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 41.19  E-value: 5.66e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 1002286839 181 AYTVLIGALAEARRPERALELLRQMQEVGYEVGVHLFTTLVRALAR 226
Cdd:pfam13041   5 TYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
DnrI super family cl34649
DNA-binding transcriptional regulator DnrI/AfsR/EmbR, SARP family, contains BTAD domain ...
111-239 3.91e-04

DNA-binding transcriptional regulator DnrI/AfsR/EmbR, SARP family, contains BTAD domain [Transcription];


The actual alignment was detected with superfamily member COG3629:

Pssm-ID: 442847 [Multi-domain]  Cd Length: 244  Bit Score: 42.90  E-value: 3.91e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 111 ADAYHAVLPFLHHDLAALEKVLEEmAVLGYGLPNQACAHLAAALVRARR-----LDDAVLAVAVMRRLKFRpAFSAYTVL 185
Cdd:COG3629    86 GGGYRLDVDPDEVDLHRFERLVAR-ARAAAGDPAAAAALLREALALWRGpplagLPFEPWLEAERARLEEL-RLAALEAL 163
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1002286839 186 IGALAEARRPERALELLRQ------MQEVGYEVgvhlfttLVRALAREGQVADALA--------LVDE 239
Cdd:COG3629   164 AEALLALGRHAEALAELRRlvaahpLRERLHRL-------LMRALYRAGRRAEALAayrrlrrrLADE 224
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
321-781 3.03e-42

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 166.98  E-value: 3.03e-42
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  321 AYNTMImgygSAGRFEDAYKLLERLRERGcipsvvsfnsiltcLGKKRKVDEAlSLFEVMKKDaepnsstyniiidmlcl 400
Cdd:PLN03218   376 AYNRLL----RDGRIKDCIDLLEDMEKRG--------------LLDMDKIYHA-KFFKACKKQ----------------- 419
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  401 gGRVEEAYRIldemehASLFPN--LLTVNIMVDRLCKARKLEEAYKIFESASQRGCNPDCVTYCSLIDGLGKKGQVDEAY 478
Cdd:PLN03218   420 -RAVKEAFRF------AKLIRNptLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMF 492
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  479 RLFEKMLDAGHNANPVVYTSLIRNFFIHGRKEDGHKIFKELIRRGCKPDLTLLNTYMDCVFKAGEVEKGRMIFEDIR--S 556
Cdd:PLN03218   493 EVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKaeT 572
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  557 YGFLPDVRSYSILIHGLTKAGQARETSNIFHAMKQQGFALDARAYNAVVDGFCKSGKVHKAYEILEEMKEKCVQPTVATY 636
Cdd:PLN03218   573 HPIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFF 652
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  637 GAIVDGLAKIDRLDEAYMLFEEAKSKGIELNVVLYSSLIDGFGKVGRIDEAYLILEEMMKKGLTPNVYTWNSLLDALVKA 716
Cdd:PLN03218   653 SALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEG 732
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1002286839  717 EEINEALVCFQSMKEMKCPPNTYTYSILINGLCRVQKYNKAFVFWQDMQKQGLVPNVVTYTTMIS 781
Cdd:PLN03218   733 NQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCITG 797
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
701-750 1.24e-17

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 77.02  E-value: 1.24e-17
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 701 PNVYTWNSLLDALVKAEEINEALVCFQSMKEMKCPPNTYTYSILINGLCR 750
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
247-296 8.28e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 66.23  E-value: 8.28e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 247 PDIVLYNVCIDCFGKAGNVDMAWKFFHELKAQGLKPDDVSYTSMIWVLCK 296
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
771-820 3.92e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 55.83  E-value: 3.92e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 771 PNVVTYTTMISGLAKVGNITDAYSLFERFKANGGIPDAASFNALIEGMSN 820
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
739-773 2.54e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 47.45  E-value: 2.54e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002286839 739 YTYSILINGLCRVQKYNKAFVFWQDMQKQGLVPNV 773
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
250-284 4.23e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 46.68  E-value: 4.23e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002286839 250 VLYNVCIDCFGKAGNVDMAWKFFHELKAQGLKPDD 284
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
177-452 7.36e-06

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 48.57  E-value: 7.36e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 177 PAFSAYTVLIGALA--EARRPERALELLRQMQEVGYE-VGVHLftTLVRALAREGQVADALALVDEVKGSCLEPDIVLYN 253
Cdd:COG2956     4 PVAAALGWYFKGLNylLNGQPDKAIDLLEEALELDPEtVEAHL--ALGNLYRRRGEYDRAIRIHQKLLERDPDRAEALLE 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 254 VCIDCFgKAGNVDMAWKFFheLKAQGLKPDDVS-YTSMIWVLCKAGRLGEAEELFAQMEaERSVPCAYAYNTMIMGYGSA 332
Cdd:COG2956    82 LAQDYL-KAGLLDRAEELL--EKLLELDPDDAEaLRLLAEIYEQEGDWEKAIEVLERLL-KLGPENAHAYCELAELYLEQ 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 333 GRFEDAYKLLERLRE--RGCIPSVVSFNSILTclgKKRKVDEALSLFEVMKKDAEPNSSTYNIIIDMLCLGGRVEEAYRI 410
Cdd:COG2956   158 GDYDEAIEALEKALKldPDCARALLLLAELYL---EQGDYEEAIAALERALEQDPDYLPALPRLAELYEKLGDPEEALEL 234
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|..
gi 1002286839 411 LDemEHASLFPNLLTVNIMVDRLCKARKLEEAYKIFESASQR 452
Cdd:COG2956   235 LR--KALELDPSDDLLLALADLLERKEGLEAALALLERQLRR 274
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
806-855 1.06e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.51  E-value: 1.06e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 806 PDAASFNALIEGMSNANRAMEAYQVFEETRLRGCRINIKSCISLLDALNK 855
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
285-317 2.65e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.67  E-value: 2.65e-05
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002286839 285 VSYTSMIWVLCKAGRLGEAEELFAQMEAERSVP 317
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEP 33
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
774-808 3.49e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.29  E-value: 3.49e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002286839 774 VTYTTMISGLAKVGNITDAYSLFERFKANGGIPDA 808
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
181-226 5.66e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 41.19  E-value: 5.66e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 1002286839 181 AYTVLIGALAEARRPERALELLRQMQEVGYEVGVHLFTTLVRALAR 226
Cdd:pfam13041   5 TYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
285-311 6.09e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 40.53  E-value: 6.09e-05
                          10        20
                  ....*....|....*....|....*..
gi 1002286839 285 VSYTSMIWVLCKAGRLGEAEELFAQME 311
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMK 27
DnrI COG3629
DNA-binding transcriptional regulator DnrI/AfsR/EmbR, SARP family, contains BTAD domain ...
111-239 3.91e-04

DNA-binding transcriptional regulator DnrI/AfsR/EmbR, SARP family, contains BTAD domain [Transcription];


Pssm-ID: 442847 [Multi-domain]  Cd Length: 244  Bit Score: 42.90  E-value: 3.91e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 111 ADAYHAVLPFLHHDLAALEKVLEEmAVLGYGLPNQACAHLAAALVRARR-----LDDAVLAVAVMRRLKFRpAFSAYTVL 185
Cdd:COG3629    86 GGGYRLDVDPDEVDLHRFERLVAR-ARAAAGDPAAAAALLREALALWRGpplagLPFEPWLEAERARLEEL-RLAALEAL 163
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1002286839 186 IGALAEARRPERALELLRQ------MQEVGYEVgvhlfttLVRALAREGQVADALA--------LVDE 239
Cdd:COG3629   164 AEALLALGRHAEALAELRRlvaahpLRERLHRL-------LMRALYRAGRRAEALAayrrlrrrLADE 224
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
213-261 7.60e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 38.11  E-value: 7.60e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002286839 213 GVHLFTTLVRALAREGQVADALALVDEVKGSCLEPDIVLYNVCIDCFGK 261
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
BTAD smart01043
Bacterial transcriptional activator domain; Found in the DNRI/REDD/AFSR family of regulators. ...
124-235 9.16e-04

Bacterial transcriptional activator domain; Found in the DNRI/REDD/AFSR family of regulators. This region of AFSR along with the C terminal region is capable of independently directing actinorhodin production. This family contains TPR repeats.


Pssm-ID: 198111 [Multi-domain]  Cd Length: 145  Bit Score: 40.36  E-value: 9.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  124 DLAALEKVLEEMAVLGYGLPNQACAHL--AAALVRARRLDD-AVLAVAVMRRLKFRPAF-SAYTVLIGALAEARRPERAL 199
Cdd:smart01043   2 DVDRFERLVAAARAALAADPEAALALLeaALALYRGPLLADvPDEDWAEAERERLRELRlEALEALAEALLALGRHEEAL 81
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 1002286839  200 ELLRQMQEV--GYEVGVHLfttLVRALAREGQVADALA 235
Cdd:smart01043  82 ALLERLLALdpLRERLHRL---LMRALYRAGRRAEALR 116
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
321-781 3.03e-42

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 166.98  E-value: 3.03e-42
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  321 AYNTMImgygSAGRFEDAYKLLERLRERGcipsvvsfnsiltcLGKKRKVDEAlSLFEVMKKDaepnsstyniiidmlcl 400
Cdd:PLN03218   376 AYNRLL----RDGRIKDCIDLLEDMEKRG--------------LLDMDKIYHA-KFFKACKKQ----------------- 419
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  401 gGRVEEAYRIldemehASLFPN--LLTVNIMVDRLCKARKLEEAYKIFESASQRGCNPDCVTYCSLIDGLGKKGQVDEAY 478
Cdd:PLN03218   420 -RAVKEAFRF------AKLIRNptLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMF 492
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  479 RLFEKMLDAGHNANPVVYTSLIRNFFIHGRKEDGHKIFKELIRRGCKPDLTLLNTYMDCVFKAGEVEKGRMIFEDIR--S 556
Cdd:PLN03218   493 EVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKaeT 572
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  557 YGFLPDVRSYSILIHGLTKAGQARETSNIFHAMKQQGFALDARAYNAVVDGFCKSGKVHKAYEILEEMKEKCVQPTVATY 636
Cdd:PLN03218   573 HPIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFF 652
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  637 GAIVDGLAKIDRLDEAYMLFEEAKSKGIELNVVLYSSLIDGFGKVGRIDEAYLILEEMMKKGLTPNVYTWNSLLDALVKA 716
Cdd:PLN03218   653 SALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEG 732
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1002286839  717 EEINEALVCFQSMKEMKCPPNTYTYSILINGLCRVQKYNKAFVFWQDMQKQGLVPNVVTYTTMIS 781
Cdd:PLN03218   733 NQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCITG 797
PLN03077 PLN03077
Protein ECB2; Provisional
221-862 4.35e-31

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 131.13  E-value: 4.35e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 221 VRALAREGQVADALALVDEVKGSCLEPD----IVLYNVCidcfgkagnvdmAWKFFHELKAQGLKPDDVSYTS------- 289
Cdd:PLN03077   58 LRALCSHGQLEQALKLLESMQELRVPVDedayVALFRLC------------EWKRAVEEGSRVCSRALSSHPSlgvrlgn 125
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 290 -MIWVLCKAGRLGEAEELFAQMeAERSVpcaYAYNTMIMGYGSAGRFEDAYKLLERLRERGCIPSVVSFNSIL-TCLG-- 365
Cdd:PLN03077  126 aMLSMFVRFGELVHAWYVFGKM-PERDL---FSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLrTCGGip 201
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 366 ---KKRKVDealslFEVMKKDAEPNSSTYNIIIDMLCLGGRVEEAYRILDEMehaslfpnlltvnimvdrlckARKleea 442
Cdd:PLN03077  202 dlaRGREVH-----AHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRM---------------------PRR---- 251
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 443 ykifesasqrgcnpDCVTYCSLIDGLGKKGQVDEAYRLFEKMldaghnanpvvytsliRNFFIhgrkedghkifkelirr 522
Cdd:PLN03077  252 --------------DCISWNAMISGYFENGECLEGLELFFTM----------------RELSV----------------- 284
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 523 gcKPDLTLLNTYMDCVFKAGEVEKGRMIFEDIRSYGFLPDVRSYSILIHGLTKAGQARETSNIFHAMKQQgfalDARAYN 602
Cdd:PLN03077  285 --DPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK----DAVSWT 358
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 603 AVVDGFCKSGKVHKAYEILEEMKEKCVQPTVATYGAIVDGLAKIDRLDEAYMLFEEAKSKGIELNVVLYSSLIDGFGKVG 682
Cdd:PLN03077  359 AMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCK 438
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 683 RIDEAYLILEEMMKKgltpNVYTWNSLLDALVKAEEINEALVCFQSMKeMKCPPNTYTYSILIN-----GLCRVQKYNKA 757
Cdd:PLN03077  439 CIDKALEVFHNIPEK----DVISWTSIIAGLRLNNRCFEALIFFRQML-LTLKPNSVTLIAALSacariGALMCGKEIHA 513
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 758 FVFWQDMQKQGLVPN-------------------------VVTYTTMISGLAKVGNITDAYSLFERFKANGGIPDAASFN 812
Cdd:PLN03077  514 HVLRTGIGFDGFLPNalldlyvrcgrmnyawnqfnshekdVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFI 593
                         650       660       670       680       690
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1002286839 813 ALIEGMSNANRAMEAYQVFEETRLR-GCRINIKSCISLLDALNKSECLEQA 862
Cdd:PLN03077  594 SLLCACSRSGMVTQGLEYFHSMEEKySITPNLKHYACVVDLLGRAGKLTEA 644
PLN03218 PLN03218
maturation of RBCL 1; Provisional
227-675 9.27e-31

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 130.77  E-value: 9.27e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  227 EGQVADALALVDEVKGSCLEPDIVLYNVCidcfGKAGNVDMAWKFFHELKAQGLKPDDVSYTSMIWVLCKAGRLGEAEEL 306
Cdd:PLN03218   419 QRAVKEAFRFAKLIRNPTLSTFNMLMSVC----ASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEV 494
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  307 FAQMEAERSVPCAYAYNTMIMGYGSAGRFEDAYKLLERLRERGCIPSVVSFNSILTCLGKKRKVDEALSLFEVMKKDA-- 384
Cdd:PLN03218   495 FHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAEThp 574
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  385 -EPNSSTYNIIIDMLCLGGRVE---EAYRILDEMEHASlFPNLLTvnIMVDRLCKARKLEEAYKIFESASQRGCNPDCVT 460
Cdd:PLN03218   575 iDPDHITVGALMKACANAGQVDrakEVYQMIHEYNIKG-TPEVYT--IAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVF 651
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  461 YCSLIDGLGKKGQVDEAYRLFEKMLDAGHNANPVVYTSLirnffihgrkedghkifkelirrgckpdltllntyMDCVFK 540
Cdd:PLN03218   652 FSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSL-----------------------------------MGACSN 696
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  541 AGEVEKGRMIFEDIRSYGFLPDVRSysilihgltkagqaretsnifhamkqqgfaldaraYNAVVDGFCKSGKVHKAYEI 620
Cdd:PLN03218   697 AKNWKKALELYEDIKSIKLRPTVST-----------------------------------MNALITALCEGNQLPKALEV 741
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1002286839  621 LEEMKEKCVQPTVATYGAIVDGLAKIDRLDEAYMLFEEAKSKGIELNVVLYSSLI 675
Cdd:PLN03218   742 LSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT 796
PLN03218 PLN03218
maturation of RBCL 1; Provisional
589-862 1.17e-25

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 114.20  E-value: 1.17e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  589 MKQQGFALDARAYNAVVDGFCKSGKVHKAYEILEEMKEKCVQPTVATYGAIVDGLAKIDRLDEAYMLFEEAKSKGIELNV 668
Cdd:PLN03218   463 VQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDR 542
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  669 VLYSSLIDGFGKVGRIDEAYLILEEMM--KKGLTPNVYTWNSLLDALVKAEEINEALVCFQSMKE--MKCPPNTYTysIL 744
Cdd:PLN03218   543 VVFNALISACGQSGAVDRAFDVLAEMKaeTHPIDPDHITVGALMKACANAGQVDRAKEVYQMIHEynIKGTPEVYT--IA 620
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  745 INGLCRVQKYNKAFVFWQDMQKQGLVPNVVTYTTMISGLAKVGNITDAYSLFERFKANGGIPDAASFNALIEGMSNANRA 824
Cdd:PLN03218   621 VNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNW 700
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1002286839  825 MEAYQVFEETRLRGCRINIKSCISLLDALNKSECLEQA 862
Cdd:PLN03218   701 KKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKA 738
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
168-664 2.51e-24

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 109.19  E-value: 2.51e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 168 AVMRRLKFRPAFSAYTVLIGALAEARRPERALELLRqMQEVG--YEVGVHLFTTLVRALAREGQVADALALVDEVKGSCL 245
Cdd:PLN03081   76 ARLDDTQIRKSGVSLCSQIEKLVACGRHREALELFE-ILEAGcpFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGF 154
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 246 EPDIVLYNVCIDCFGKAGNVDMAWKFFHELKAQGLkpddVSYTSMIWVLCKAGRLGEAEELFAQMEAERSVPCAYAYNTM 325
Cdd:PLN03081  155 EPDQYMMNRVLLMHVKCGMLIDARRLFDEMPERNL----ASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVM 230
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 326 IMGYGSAGRFEDAYKLLERLRERGCIPSVVSFNSILTCLGKKRKVDEALSLFEVMkkdaePNSST--YNIIIDMLCLGGR 403
Cdd:PLN03081  231 LRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGM-----PEKTTvaWNSMLAGYALHGY 305
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 404 VEEAYRILDEMEHASLFPNLLTVNIMVDRLCKARKLEEAYKIFESASQRGCNPDCVTYCSLIDGLGKKGQVDEAYRLFEK 483
Cdd:PLN03081  306 SEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDR 385
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 484 MldagHNANPVVYTSLIRNFFIHGRKEDGHKIFKELIRRGCKPD-LTLLNTYMDCVFkAGEVEKGRMIFEDI-RSYGFLP 561
Cdd:PLN03081  386 M----PRKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNhVTFLAVLSACRY-SGLSEQGWEIFQSMsENHRIKP 460
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 562 DVRSYSILIHGLTKAGQARETsniFHAMKQQGFALDARAYNAVVDgfckSGKVHKAYEILEEMKEKC--VQP-TVATYGA 638
Cdd:PLN03081  461 RAMHYACMIELLGREGLLDEA---YAMIRRAPFKPTVNMWAALLT----ACRIHKNLELGRLAAEKLygMGPeKLNNYVV 533
                         490       500
                  ....*....|....*....|....*.
gi 1002286839 639 IVDGLAKIDRLDEAYMLFEEAKSKGI 664
Cdd:PLN03081  534 LLNLYNSSGRQAEAAKVVETLKRKGL 559
PLN03218 PLN03218
maturation of RBCL 1; Provisional
127-553 2.33e-21

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 100.34  E-value: 2.33e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  127 ALEKVLEEMAVLGYglpnQACAHLAAALV----RARRLDDAVLAVAVMRRLKFRPAFSAYTVLIGALAEARRPERALELL 202
Cdd:PLN03218   490 AMFEVFHEMVNAGV----EANVHTFGALIdgcaRAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVL 565
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  203 RQMQEVGYEVGVHLFT--TLVRALAREGQVADAL---ALVDE--VKGSclePDIvlYNVCIDCFGKAGNVDMAWKFFHEL 275
Cdd:PLN03218   566 AEMKAETHPIDPDHITvgALMKACANAGQVDRAKevyQMIHEynIKGT---PEV--YTIAVNSCSQKGDWDFALSIYDDM 640
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  276 KAQGLKPDDVSYTSMIWVLCKAGRLGEAEELFAQMEAERSVPCAYAYNTMiMGYGS-AGRFEDAYKLLERLRERGCIPSV 354
Cdd:PLN03218   641 KKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSL-MGACSnAKNWKKALELYEDIKSIKLRPTV 719
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  355 vsfnsiltclgkkrkvdealslfevmkkdaepnsSTYNIIIDMLCLGGRVEEAYRILDEMEHASLFPNLLTVNIMVDRLC 434
Cdd:PLN03218   720 ----------------------------------STMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASE 765
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  435 KARKLEEAYKIFESASQRGCNPDcVTYCSLIDGLGKkgqvdeayRLFEKMLDAGhnaNPVVYtslirnfFIHGRKEDGHK 514
Cdd:PLN03218   766 RKDDADVGLDLLSQAKEDGIKPN-LVMCRCITGLCL--------RRFEKACALG---EPVVS-------FDSGRPQIENK 826
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*.
gi 1002286839  515 -------IFKELIRRGCKPDLTLLNTYMDCVFKAGEVEKGRMIFED 553
Cdd:PLN03218   827 wtswalmVYRETISAGTLPTMEVLSQVLGCLQLPHDATLRNRLIEN 872
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
371-841 6.16e-20

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 95.32  E-value: 6.16e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 371 DEALSLFEVMKKDA--EPNSSTYNIIIDMlCLGGR-VEEAYRILDEMEHASLFPNLLTVNIMVDRLCKARKLEEAYKIFE 447
Cdd:PLN03081  104 REALELFEILEAGCpfTLPASTYDALVEA-CIALKsIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFD 182
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 448 SASQRgcnpDCVTYCSLIDGLGKKGQVDEAYRLFEKMLDAGHNANPVVYTSLIRNFFIHGRKEDGHKIFKELIRRGCKPD 527
Cdd:PLN03081  183 EMPER----NLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVVGD 258
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 528 LTLLNTYMDCVFKAGEVEKGRMIFEDirsygfLPD--VRSYSILIHGLTKAGQARETSNIFHAMKQQGFALDARAYNAVV 605
Cdd:PLN03081  259 TFVSCALIDMYSKCGDIEDARCVFDG------MPEktTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMI 332
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 606 DGFCKSGKVHKAYEILEEMKEKCVQPTVATYGAIVDGLAKIDRLDEAYMLFEEAKSKgielNVVLYSSLIDGFGKVGRID 685
Cdd:PLN03081  333 RIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRK----NLISWNALIAGYGNHGRGT 408
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 686 EAYLILEEMMKKGLTPNVYTWNSLLDALVKAEEINEALVCFQSMKE-MKCPPNTYTYSILINGLCRVQKYNKAFVFwqdM 764
Cdd:PLN03081  409 KAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSEnHRIKPRAMHYACMIELLGREGLLDEAYAM---I 485
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1002286839 765 QKQGLVPNVVTYTTMISGLAKVGNITDAYSLFERFKANGgiPDA-ASFNALIEGMSNANRAMEAYQVFEETRLRGCRI 841
Cdd:PLN03081  486 RRAPFKPTVNMWAALLTACRIHKNLELGRLAAEKLYGMG--PEKlNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSM 561
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
507-861 6.22e-18

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 88.77  E-value: 6.22e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 507 GRKEDGHKIFKEliRRGCKPDLTLLNTYMDCVFkAGEVEKGRMIFEDIRSYG-FLPDVRSYSILIHGLTKAGQARETSNI 585
Cdd:PLN03081   69 EVSESKDARLDD--TQIRKSGVSLCSQIEKLVA-CGRHREALELFEILEAGCpFTLPASTYDALVEACIALKSIRCVKAV 145
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 586 FHAMKQQGFALDARAYNAVVDGFCKSGKVHKAYEILEEMKEKcvqpTVATYGAIVDGLAKIDRLDEAYMLFEE------- 658
Cdd:PLN03081  146 YWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPER----NLASWGTIIGGLVDAGNYREAFALFREmwedgsd 221
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 659 ----------------------------AKSKGIELNVVLYSSLIDGFGKVGRIDEAYLILEEMMKKgltpNVYTWNSLL 710
Cdd:PLN03081  222 aeprtfvvmlrasaglgsaragqqlhccVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEK----TTVAWNSML 297
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 711 DALVKAEEINEALVCFQSMKEMKCPPNTYTYSILINGLCRVQKYNKAFVFWQDMQKQGLVPNVVTYTTMISGLAKVGNIT 790
Cdd:PLN03081  298 AGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRME 377
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1002286839 791 DAYSLFERFKANGGIpdaaSFNALIEGMSNANRAMEAYQVFEETRLRGCRINIKSCISLLDALNKSECLEQ 861
Cdd:PLN03081  378 DARNVFDRMPRKNLI----SWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQ 444
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
701-750 1.24e-17

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 77.02  E-value: 1.24e-17
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 701 PNVYTWNSLLDALVKAEEINEALVCFQSMKEMKCPPNTYTYSILINGLCR 750
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
736-785 1.39e-16

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 74.32  E-value: 1.39e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 736 PNTYTYSILINGLCRVQKYNKAFVFWQDMQKQGLVPNVVTYTTMISGLAK 785
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
247-296 8.28e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 66.23  E-value: 8.28e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 247 PDIVLYNVCIDCFGKAGNVDMAWKFFHELKAQGLKPDDVSYTSMIWVLCK 296
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
352-399 8.45e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 66.23  E-value: 8.45e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002286839 352 PSVVSFNSILTCLGKKRKVDEALSLFEVMKKDA-EPNSSTYNIIIDMLC 399
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGvKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
667-715 1.14e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 65.85  E-value: 1.14e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002286839 667 NVVLYSSLIDGFGKVGRIDEAYLILEEMMKKGLTPNVYTWNSLLDALVK 715
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
597-645 3.26e-12

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 61.61  E-value: 3.26e-12
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002286839 597 DARAYNAVVDGFCKSGKVHKAYEILEEMKEKCVQPTVATYGAIVDGLAK 645
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03077 PLN03077
Protein ECB2; Provisional
170-598 4.92e-12

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 69.88  E-value: 4.92e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 170 MRRLKFRPAFSAYTVLIGAlAEARRPERaleLLRQMQ----EVGYEVGVHLFTTLVRALAREGQVADALALVDEVKGScl 245
Cdd:PLN03077  279 MRELSVDPDLMTITSVISA-CELLGDER---LGREMHgyvvKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK-- 352
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 246 epDIVLYNVCIDCFGKAGNVDMAWKFFHELKAQGLKPDDVSYTSMIWVLCKAGRLGEAEELFAQMEAERSVPCAYAYNTM 325
Cdd:PLN03077  353 --DAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANAL 430
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 326 IMGYGSAGRFEDAYKLLERLRERgcipSVVSFNSILTCLGKKRKVDEALSLFEVMKKDAEPNSSTY-------------- 391
Cdd:PLN03077  431 IEMYSKCKCIDKALEVFHNIPEK----DVISWTSIIAGLRLNNRCFEALIFFRQMLLTLKPNSVTLiaalsacarigalm 506
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 392 ---------------------NIIIDMLCLGGRVEEAYRILDEMEHaslfpNLLTVNIMVDRLCKARKLEEAYKIFESAS 450
Cdd:PLN03077  507 cgkeihahvlrtgigfdgflpNALLDLYVRCGRMNYAWNQFNSHEK-----DVVSWNILLTGYVAHGKGSMAVELFNRMV 581
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 451 QRGCNPDCVTYCSLIDGLGKKGQVDEAYRLFEKMLDAGHNA-NPVVYTSLIRNFFIHGRKEDGHKIFKELirrGCKPDLT 529
Cdd:PLN03077  582 ESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKYSITpNLKHYACVVDLLGRAGKLTEAYNFINKM---PITPDPA 658
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1002286839 530 LLNTYMDCVFKAGEVEKGRM----IFE-DIRSYGFlpdvrsYSILIHGLTKAGQARETSNIFHAMKQQGFALDA 598
Cdd:PLN03077  659 VWGALLNACRIHRHVELGELaaqhIFElDPNSVGY------YILLCNLYADAGKWDEVARVRKTMRENGLTVDP 726
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
452-484 1.33e-11

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 59.67  E-value: 1.33e-11
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002286839 452 RGCNPDCVTYCSLIDGLGKKGQVDEAYRLFEKM 484
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
386-435 2.15e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 59.30  E-value: 2.15e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 386 PNSSTYNIIIDMLCLGGRVEEAYRILDEMEHASLFPNLLTVNIMVDRLCK 435
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
421-470 5.18e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 58.53  E-value: 5.18e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 421 PNLLTVNIMVDRLCKARKLEEAYKIFESASQRGCNPDCVTYCSLIDGLGK 470
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
317-366 5.44e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 58.14  E-value: 5.44e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 317 PCAYAYNTMIMGYGSAGRFEDAYKLLERLRERGCIPSVVSFNSILTCLGK 366
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
631-680 1.86e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 56.60  E-value: 1.86e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 631 PTVATYGAIVDGLAKIDRLDEAYMLFEEAKSKGIELNVVLYSSLIDGFGK 680
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
655-715 1.96e-10

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 56.98  E-value: 1.96e-10
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1002286839 655 LFEEAKSKGIELNVVLYSSLIDGFGKVGRIDEAYLILEEMMKKGLTPNVYTWNSLLDALVK 715
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGG 62
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
771-820 3.92e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 55.83  E-value: 3.92e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 771 PNVVTYTTMISGLAKVGNITDAYSLFERFKANGGIPDAASFNALIEGMSN 820
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03077 PLN03077
Protein ECB2; Provisional
146-414 6.28e-10

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 62.94  E-value: 6.28e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 146 ACAHLAaalvrarRLDDAVLavavMRRLKFRPAFSAYTVLIGAL----AEARRPERALELLRQMQevgyEVGVHLFTTLV 221
Cdd:PLN03077  398 ACACLG-------DLDVGVK----LHELAERKGLISYVVVANALiemySKCKCIDKALEVFHNIP----EKDVISWTSII 462
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 222 RALAREGQVADALALVDEVKGScLEPDIVLYNVCIDCFGKAGNVdMAWKFFHE--LKAqGLKPDDVSYTSMIWVLCKAGR 299
Cdd:PLN03077  463 AGLRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALSACARIGAL-MCGKEIHAhvLRT-GIGFDGFLPNALLDLYVRCGR 539
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 300 LGEAEELFAQMEAERSvpcayAYNTMIMGYGSAGRFEDAYKLLERLRERGCIPSVVSFNSILTCLGKKRKVDEALSLFEV 379
Cdd:PLN03077  540 MNYAWNQFNSHEKDVV-----SWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHS 614
                         250       260       270
                  ....*....|....*....|....*....|....*..
gi 1002286839 380 MKKD--AEPNSSTYNIIIDMLCLGGRVEEAYRILDEM 414
Cdd:PLN03077  615 MEEKysITPNLKHYACVVDLLGRAGKLTEAYNFINKM 651
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
382-415 1.52e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 53.89  E-value: 1.52e-09
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002286839 382 KDAEPNSSTYNIIIDMLCLGGRVEEAYRILDEME 415
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
456-503 1.63e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 53.91  E-value: 1.63e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1002286839 456 PDCVTYCSLIDGLGKKGQVDEAYRLFEKMLDAGHNANPVVYTSLIRNF 503
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
279-311 1.68e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 50.81  E-value: 1.68e-08
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002286839 279 GLKPDDVSYTSMIWVLCKAGRLGEAEELFAQME 311
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
662-694 2.49e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 50.42  E-value: 2.49e-08
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002286839 662 KGIELNVVLYSSLIDGFGKVGRIDEAYLILEEM 694
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
561-610 2.51e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 50.82  E-value: 2.51e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 561 PDVRSYSILIHGLTKAGQARETSNIFHAMKQQGFALDARAYNAVVDGFCK 610
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
585-640 4.10e-08

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 50.43  E-value: 4.10e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1002286839 585 IFHAMKQQGFALDARAYNAVVDGFCKSGKVHKAYEILEEMKEKCVQPTVATYGAIV 640
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
593-625 6.13e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 49.27  E-value: 6.13e-08
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002286839 593 GFALDARAYNAVVDGFCKSGKVHKAYEILEEMK 625
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
768-798 7.53e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 48.88  E-value: 7.53e-08
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002286839 768 GLVPNVVTYTTMISGLAKVGNITDAYSLFER 798
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDE 32
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
762-815 8.38e-08

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 49.66  E-value: 8.38e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1002286839 762 QDMQKQGLVPNVVTYTTMISGLAKVGNITDAYSLFERFKANGGIPDAASFNALI 815
Cdd:pfam13812   4 REMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
282-329 8.80e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 49.28  E-value: 8.80e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002286839 282 PDDVSYTSMIWVLCKAGRLGEAEELFAQMEaERSVPC-AYAYNTMIMGY 329
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMK-KRGVKPnVYTYTILINGL 48
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
689-746 9.33e-08

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 49.66  E-value: 9.33e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1002286839 689 LILEEMMKKGLTPNVYTWNSLLDALVKAEEINEALVCFQSMKEMKCPPNTYTYSILIN 746
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILG 58
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
739-773 2.54e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 47.45  E-value: 2.54e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002286839 739 YTYSILINGLCRVQKYNKAFVFWQDMQKQGLVPNV 773
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
526-575 3.08e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 47.74  E-value: 3.08e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 526 PDLTLLNTYMDCVFKAGEVEKGRMIFEDIRSYGFLPDVRSYSILIHGLTK 575
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
669-703 3.61e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 47.06  E-value: 3.61e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002286839 669 VLYSSLIDGFGKVGRIDEAYLILEEMMKKGLTPNV 703
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
250-284 4.23e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 46.68  E-value: 4.23e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002286839 250 VLYNVCIDCFGKAGNVDMAWKFFHELKAQGLKPDD 284
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
320-354 4.80e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 46.68  E-value: 4.80e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002286839 320 YAYNTMIMGYGSAGRFEDAYKLLERLRERGCIPSV 354
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
574-716 5.54e-07

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 51.24  E-value: 5.54e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 574 TKAGQARETSNIFHAMKQQGFALDARAYNAVVDgFCKSG----------KVHKAYEILEEMKEKCVQPTVATYGAIVDGL 643
Cdd:pfam17177  22 SKHADATGALALYDAAKAEGVRLAQYHYNVLLY-LCSKAadatdlkpqlAADRGFEVFEAMKAQGVSPNEATYTAVARLA 100
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1002286839 644 AKIDRLDEAYMLFEEAKSKGIELNVVLYSSLIDGFGKVGRIDEAYLILEEMMKKGLTPNVYTWNSLLDALVKA 716
Cdd:pfam17177 101 AAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKA 173
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
734-764 1.25e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 45.41  E-value: 1.25e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002286839 734 CPPNTYTYSILINGLCRVQKYNKAFVFWQDM 764
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
599-633 1.56e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 45.14  E-value: 1.56e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002286839 599 RAYNAVVDGFCKSGKVHKAYEILEEMKEKCVQPTV 633
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
459-489 1.68e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 45.15  E-value: 1.68e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002286839 459 VTYCSLIDGLGKKGQVDEAYRLFEKMLDAGH 489
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
320-350 1.84e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 45.15  E-value: 1.84e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002286839 320 YAYNTMIMGYGSAGRFEDAYKLLERLRERGC 350
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
459-492 2.03e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 44.75  E-value: 2.03e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002286839 459 VTYCSLIDGLGKKGQVDEAYRLFEKMLDAGHNAN 492
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
652-800 2.36e-06

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 49.32  E-value: 2.36e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 652 AYMLFEEAKSKGIEL-----NVVLY--SSLIDGFG--KVGRIDEAYLILEEMMKKGLTPNVYTWNSLLDALVKAEEINEA 722
Cdd:pfam17177  30 ALALYDAAKAEGVRLaqyhyNVLLYlcSKAADATDlkPQLAADRGFEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLA 109
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1002286839 723 LVCFQSMKEMKCPPNTYTYSILINGLCRVQKYNKAFVFWQDMQKQGLVPNVVTYTTMISGLAKVGNITDAYSLFERFK 800
Cdd:pfam17177 110 FDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRADKVYAYLHRLR 187
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
704-738 2.78e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 44.37  E-value: 2.78e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002286839 704 YTWNSLLDALVKAEEINEALVCFQSMKEMKCPPNT 738
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
390-422 5.62e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.60  E-value: 5.62e-06
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002286839 390 TYNIIIDMLCLGGRVEEAYRILDEMEHASLFPN 422
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
177-452 7.36e-06

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 48.57  E-value: 7.36e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 177 PAFSAYTVLIGALA--EARRPERALELLRQMQEVGYE-VGVHLftTLVRALAREGQVADALALVDEVKGSCLEPDIVLYN 253
Cdd:COG2956     4 PVAAALGWYFKGLNylLNGQPDKAIDLLEEALELDPEtVEAHL--ALGNLYRRRGEYDRAIRIHQKLLERDPDRAEALLE 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 254 VCIDCFgKAGNVDMAWKFFheLKAQGLKPDDVS-YTSMIWVLCKAGRLGEAEELFAQMEaERSVPCAYAYNTMIMGYGSA 332
Cdd:COG2956    82 LAQDYL-KAGLLDRAEELL--EKLLELDPDDAEaLRLLAEIYEQEGDWEKAIEVLERLL-KLGPENAHAYCELAELYLEQ 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 333 GRFEDAYKLLERLRE--RGCIPSVVSFNSILTclgKKRKVDEALSLFEVMKKDAEPNSSTYNIIIDMLCLGGRVEEAYRI 410
Cdd:COG2956   158 GDYDEAIEALEKALKldPDCARALLLLAELYL---EQGDYEEAIAALERALEQDPDYLPALPRLAELYEKLGDPEEALEL 234
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|..
gi 1002286839 411 LDemEHASLFPNLLTVNIMVDRLCKARKLEEAYKIFESASQR 452
Cdd:COG2956   235 LR--KALELDPSDDLLLALADLLERKEGLEAALALLERQLRR 274
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
600-629 7.53e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 43.22  E-value: 7.53e-06
                          10        20        30
                  ....*....|....*....|....*....|
gi 1002286839 600 AYNAVVDGFCKSGKVHKAYEILEEMKEKCV 629
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
348-381 7.75e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 43.10  E-value: 7.75e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002286839 348 RGCIPSVVSFNSILTCLGKKRKVDEALSLFEVMK 381
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
806-855 1.06e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.51  E-value: 1.06e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 806 PDAASFNALIEGMSNANRAMEAYQVFEETRLRGCRINIKSCISLLDALNK 855
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
669-699 1.16e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 42.84  E-value: 1.16e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002286839 669 VLYSSLIDGFGKVGRIDEAYLILEEMMKKGL 699
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
690-835 1.40e-05

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 47.01  E-value: 1.40e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 690 ILEEMMKKGLTPNVYTWNSLLDALVKAEEINEALVC------FQSMKEMK---CPPNTYTYSILINGLCRVQKYNKAFVF 760
Cdd:pfam17177  33 LYDAAKAEGVRLAQYHYNVLLYLCSKAADATDLKPQlaadrgFEVFEAMKaqgVSPNEATYTAVARLAAAKGDGDLAFDL 112
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1002286839 761 WQDMQKQGLVPNVVTYTTMISGLAKVGNITDAYSLFERFKANGGIPDAASFNALIEGMSNANRAMEAYQVFEETR 835
Cdd:pfam17177 113 VKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRADKVYAYLHRLR 187
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
306-366 1.70e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 43.12  E-value: 1.70e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1002286839 306 LFAQMEAERSVPCAYAYNTMIMGYGSAGRFEDAYKLLERLRERGCIPSVVSFNSILTCLGK 366
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGG 62
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
285-317 2.65e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.67  E-value: 2.65e-05
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002286839 285 VSYTSMIWVLCKAGRLGEAEELFAQMEAERSVP 317
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEP 33
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
620-680 3.22e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 42.35  E-value: 3.22e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1002286839 620 ILEEMKEKCVQPTVATYGAIVDGLAKIDRLDEAYMLFEEAKSKGIELNVVLYSSLIDGFGK 680
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGG 62
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
296-487 3.47e-05

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 45.85  E-value: 3.47e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 296 KAGRLGEAEELFAQMEAERSVPCAYAYNTMImgygsagrfedaykllerlrergcipSVVSFNSILTCLGKKRKVDEALS 375
Cdd:pfam17177  23 KHADATGALALYDAAKAEGVRLAQYHYNVLL--------------------------YLCSKAADATDLKPQLAADRGFE 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 376 LFEVMKKD-AEPNSSTYNIIIDMLCLGGRVEEAYRILDEMEHASLFPNLLTVNIMVDRLCKARKLEEAYKIFESASQRGC 454
Cdd:pfam17177  77 VFEAMKAQgVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGV 156
                         170       180       190
                  ....*....|....*....|....*....|...
gi 1002286839 455 NPDCVTYCSLIDGLGKKGQVDEAYRLFEKMLDA 487
Cdd:pfam17177 157 ELEEPELAALLKVSAKAGRADKVYAYLHRLRDA 189
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
774-808 3.49e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.29  E-value: 3.49e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002286839 774 VTYTTMISGLAKVGNITDAYSLFERFKANGGIPDA 808
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
539-653 4.66e-05

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 45.47  E-value: 4.66e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 539 FKAGEVEKGRMIFEDIRSYGFLPDVRSYSILIHGLTKAGQARETSNIFHAMKQQGFALDARAYNAVVDGFCKSGKVHKAY 618
Cdd:pfam17177  66 KPQLAADRGFEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAY 145
                          90       100       110
                  ....*....|....*....|....*....|....*
gi 1002286839 619 EILEEMKEKCVQPTVATYGAIVDGLAKIDRLDEAY 653
Cdd:pfam17177 146 EVEEHMLAHGVELEEPELAALLKVSAKAGRADKVY 180
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
181-226 5.66e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 41.19  E-value: 5.66e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 1002286839 181 AYTVLIGALAEARRPERALELLRQMQEVGYEVGVHLFTTLVRALAR 226
Cdd:pfam13041   5 TYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
285-311 6.09e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 40.53  E-value: 6.09e-05
                          10        20
                  ....*....|....*....|....*..
gi 1002286839 285 VSYTSMIWVLCKAGRLGEAEELFAQME 311
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMK 27
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
425-458 9.59e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.13  E-value: 9.59e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002286839 425 TVNIMVDRLCKARKLEEAYKIFESASQRGCNPDC 458
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
419-447 1.14e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 40.02  E-value: 1.14e-04
                          10        20
                  ....*....|....*....|....*....
gi 1002286839 419 LFPNLLTVNIMVDRLCKARKLEEAYKIFE 447
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLD 31
Spy COG3914
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational ...
111-344 1.22e-04

Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443119 [Multi-domain]  Cd Length: 658  Bit Score: 45.75  E-value: 1.22e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 111 ADAYHAVLPFLHHDLAALEKVLEEMAVLGYGLPNQACAHLAAALVRARRLDDAVLAVAVMRRLKFRPAFSAYTvligALA 190
Cdd:COG3914    14 AAALLAAAAAAELALAAELEAAALAAALGLALLLLAALAEAAAAALLALAAGEAAAAAAALLLLAALLELAAL----LLQ 89
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 191 EARRPERALELLRQMQEVGYEVGVHLFTtLVRALAREGQVADALAlvdevkgsCLEpdivlynvcidcfgkagnvdmawk 270
Cdd:COG3914    90 ALGRYEEALALYRRALALNPDNAEALFN-LGNLLLALGRLEEALA--------ALR------------------------ 136
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1002286839 271 ffhelKAQGLKPDDVS-YTSMIWVLCKAGRLGEAEELFAQmeAERSVP-CAYAYNTMIMGYGSAGRFEDAYKLLER 344
Cdd:COG3914   137 -----RALALNPDFAEaYLNLGEALRRLGRLEEAIAALRR--ALELDPdNAEALNNLGNALQDLGRLEEAIAAYRR 205
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
201-261 1.23e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 40.80  E-value: 1.23e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1002286839 201 LLRQMQEVGYEVGVHLFTTLVRALAREGQVADALALVDEVKGSCLEPDIVLYNVCIDCFGK 261
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGG 62
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
774-803 1.23e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 39.76  E-value: 1.23e-04
                          10        20        30
                  ....*....|....*....|....*....|
gi 1002286839 774 VTYTTMISGLAKVGNITDAYSLFERFKANG 803
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
494-527 1.37e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 39.75  E-value: 1.37e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002286839 494 VVYTSLIRNFFIHGRKEDGHKIFKELIRRGCKPD 527
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
627-658 1.42e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 39.64  E-value: 1.42e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1002286839 627 KCVQPTVATYGAIVDGLAKIDRLDEAYMLFEE 658
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDE 32
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
270-330 1.60e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 40.42  E-value: 1.60e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1002286839 270 KFFHELKAQGLKPDDVSYTSMIWVLCKAGRLGEAEELFAQMEAERSVPCAYAYNTMIMGYG 330
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIG 61
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
181-348 2.01e-04

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 43.84  E-value: 2.01e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 181 AYTVLIGALAEARRPERALELLRQMQEVGYEVGVHLFtTLVRALAREGQVADALALVDEVKGscLEP-DIVLYNVCIDCF 259
Cdd:COG0457    10 AYNNLGLAYRRLGRYEEAIEDYEKALELDPDDAEALY-NLGLAYLRLGRYEEALADYEQALE--LDPdDAEALNNLGLAL 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 260 GKAGNVDMAWKFFHelKAQGLKPDDV-SYTSMIWVLCKAGRLGEAEELFAQmeAERSVP-CAYAYNTMIMGYGSAGRFED 337
Cdd:COG0457    87 QALGRYEEALEDYD--KALELDPDDAeALYNLGLALLELGRYDEAIEAYER--ALELDPdDADALYNLGIALEKLGRYEE 162
                         170
                  ....*....|.
gi 1002286839 338 AYKLLERLRER 348
Cdd:COG0457   163 ALELLEKLEAA 173
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
565-692 2.09e-04

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 43.54  E-value: 2.09e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 565 SYSILIHGLTKAGQARETSN---------IFHAMKQQGFALDARAYNAVVDGFCKSGKVHKAYEILEEMKEKCVQPTVAT 635
Cdd:pfam17177  48 HYNVLLYLCSKAADATDLKPqlaadrgfeVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRS 127
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1002286839 636 YGAIVDGLAKIDRLDEAYMLFEEAKSKGIELNVVLYSSLIDGFGKVGRIDEAYLILE 692
Cdd:pfam17177 128 YSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRADKVYAYLH 184
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
225-412 2.78e-04

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 43.15  E-value: 2.78e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 225 AREGQVADALALVDEVKGSCLEPDIVLYNVCIDCFGKAGN---------VDMAWKFFHELKAQGLKPDDVSYTSMIWVLC 295
Cdd:pfam17177  22 SKHADATGALALYDAAKAEGVRLAQYHYNVLLYLCSKAADatdlkpqlaADRGFEVFEAMKAQGVSPNEATYTAVARLAA 101
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 296 KAGRLGEAEELFAQMEaersvpcayayntmimgygsagrfedAYKLLERLRergcipsvvSFNSILTCLGKKRKVDEALS 375
Cdd:pfam17177 102 AKGDGDLAFDLVKEME--------------------------AAGVSPRLR---------SYSPALHAYCEAGDADKAYE 146
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 1002286839 376 LFEVMKK-DAEPNSSTYNIIIDMLCLGGRVEEAYRILD 412
Cdd:pfam17177 147 VEEHMLAhGVELEEPELAALLKVSAKAGRADKVYAYLH 184
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
340-397 2.80e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 39.65  E-value: 2.80e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1002286839 340 KLLERLRERGCIPSVVSFNSILTCLGKKRKVDEALSLFEVMKKDA-EPNSSTYNIIIDM 397
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGiKPTLDTYNAILGV 59
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
355-383 2.99e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.60  E-value: 2.99e-04
                          10        20
                  ....*....|....*....|....*....
gi 1002286839 355 VSFNSILTCLGKKRKVDEALSLFEVMKKD 383
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEK 29
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
635-668 3.03e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.98  E-value: 3.03e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002286839 635 TYGAIVDGLAKIDRLDEAYMLFEEAKSKGIELNV 668
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
697-730 3.14e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 38.87  E-value: 3.14e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002286839 697 KGLTPNVYTWNSLLDALVKAEEINEALVCFQSMK 730
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
297-554 3.78e-04

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 43.18  E-value: 3.78e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 297 AGRLGEAEELFAQMEAERSvPCAYAYNTMIMGYGSAGRFEDAYKLLERLRERGcipsvVSFNSILTCLG----KKRKVDE 372
Cdd:COG2956    21 NGQPDKAIDLLEEALELDP-ETVEAHLALGNLYRRRGEYDRAIRIHQKLLERD-----PDRAEALLELAqdylKAGLLDR 94
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 373 ALSLFEVMKKDAEPNSSTYNIIIDMLCLGGRVEEAYRILDEMEHASLfPNLLTVNIMVDRLCKARKLEEAYKIFESASQR 452
Cdd:COG2956    95 AEELLEKLLELDPDDAEALRLLAEIYEQEGDWEKAIEVLERLLKLGP-ENAHAYCELAELYLEQGDYDEAIEALEKALKL 173
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 453 gcNPDCV-TYCSLIDGLGKKGQVDEAYRLFEKMLDAgHNANPVVYTSLIRNFFIHGRKEDGHKIFKELIRRgcKPDLTLL 531
Cdd:COG2956   174 --DPDCArALLLLAELYLEQGDYEEAIAALERALEQ-DPDYLPALPRLAELYEKLGDPEEALELLRKALEL--DPSDDLL 248
                         250       260
                  ....*....|....*....|...
gi 1002286839 532 NTYMDCVFKAGEVEKGRMIFEDI 554
Cdd:COG2956   249 LALADLLERKEGLEAALALLERQ 271
DnrI COG3629
DNA-binding transcriptional regulator DnrI/AfsR/EmbR, SARP family, contains BTAD domain ...
111-239 3.91e-04

DNA-binding transcriptional regulator DnrI/AfsR/EmbR, SARP family, contains BTAD domain [Transcription];


Pssm-ID: 442847 [Multi-domain]  Cd Length: 244  Bit Score: 42.90  E-value: 3.91e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 111 ADAYHAVLPFLHHDLAALEKVLEEmAVLGYGLPNQACAHLAAALVRARR-----LDDAVLAVAVMRRLKFRpAFSAYTVL 185
Cdd:COG3629    86 GGGYRLDVDPDEVDLHRFERLVAR-ARAAAGDPAAAAALLREALALWRGpplagLPFEPWLEAERARLEEL-RLAALEAL 163
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1002286839 186 IGALAEARRPERALELLRQ------MQEVGYEVgvhlfttLVRALAREGQVADALA--------LVDE 239
Cdd:COG3629   164 AEALLALGRHAEALAELRRlvaahpLRERLHRL-------LMRALYRAGRRAEALAayrrlrrrLADE 224
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
244-276 5.29e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 38.10  E-value: 5.29e-04
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002286839 244 CLEPDIVLYNVCIDCFGKAGNVDMAWKFFHELK 276
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
389-415 5.33e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.83  E-value: 5.33e-04
                          10        20
                  ....*....|....*....|....*..
gi 1002286839 389 STYNIIIDMLCLGGRVEEAYRILDEME 415
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMK 27
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
296-522 5.65e-04

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 42.79  E-value: 5.65e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 296 KAGRLGEAEELFAQMeAERSVPCAYAYNTMIMGYGSAGRFEDAYKLLERLRERGcIPSVVSFNSILTCLGKKRKVDEALS 375
Cdd:COG2956    54 RRGEYDRAIRIHQKL-LERDPDRAEALLELAQDYLKAGLLDRAEELLEKLLELD-PDDAEALRLLAEIYEQEGDWEKAIE 131
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 376 LFEVMKKDAEPNSSTYNIIIDMLCLGGRVEEAYRILDEMehASLFPNLLTVNIMVDRLC-KARKLEEAYKIFESASQRgc 454
Cdd:COG2956   132 VLERLLKLGPENAHAYCELAELYLEQGDYDEAIEALEKA--LKLDPDCARALLLLAELYlEQGDYEEAIAALERALEQ-- 207
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1002286839 455 NPDCV-TYCSLIDGLGKKGQVDEAYRLFEKMLDagHNANPVVYTSLIRNFFIHGRKEDGHKIFKELIRR 522
Cdd:COG2956   208 DPDYLpALPRLAELYEKLGDPEEALELLRKALE--LDPSDDLLLALADLLERKEGLEAALALLERQLRR 274
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
704-734 6.62e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.83  E-value: 6.62e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002286839 704 YTWNSLLDALVKAEEINEALVCFQSMKEMKC 734
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
475-587 7.08e-04

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 42.00  E-value: 7.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 475 DEAYRLFEKMLDAGHNANPVVYTSLIRnffIHGRKEDG---HKIFKELIRRGCKPDLTLLNTYMDCVFKAGEVEKGRMIF 551
Cdd:pfam17177  72 DRGFEVFEAMKAQGVSPNEATYTAVAR---LAAAKGDGdlaFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVE 148
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 1002286839 552 EDIRSYGFLPDVRSYSILIHGLTKAGQARETSNIFH 587
Cdd:pfam17177 149 EHMLAHGVELEEPELAALLKVSAKAGRADKVYAYLH 184
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
213-261 7.60e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 38.11  E-value: 7.60e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002286839 213 GVHLFTTLVRALAREGQVADALALVDEVKGSCLEPDIVLYNVCIDCFGK 261
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PilF COG3063
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
258-348 7.63e-04

Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];


Pssm-ID: 442297 [Multi-domain]  Cd Length: 94  Bit Score: 39.38  E-value: 7.63e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 258 CFGKAGNVDMAWKFFheLKAQGLKPDDV-SYTSMIWVLCKAGRLGEAEELFAQMEAERSVPcaYAYNTMIMGYGSAGRFE 336
Cdd:COG3063     1 LYLKLGDLEEAEEYY--EKALELDPDNAdALNNLGLLLLEQGRYDEAIALEKALKLDPNNA--EALLNLAELLLELGDYD 76
                          90
                  ....*....|..
gi 1002286839 337 DAYKLLERLRER 348
Cdd:COG3063    77 EALAYLERALEL 88
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
250-280 8.14e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.44  E-value: 8.14e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002286839 250 VLYNVCIDCFGKAGNVDMAWKFFHELKAQGL 280
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
BTAD smart01043
Bacterial transcriptional activator domain; Found in the DNRI/REDD/AFSR family of regulators. ...
124-235 9.16e-04

Bacterial transcriptional activator domain; Found in the DNRI/REDD/AFSR family of regulators. This region of AFSR along with the C terminal region is capable of independently directing actinorhodin production. This family contains TPR repeats.


Pssm-ID: 198111 [Multi-domain]  Cd Length: 145  Bit Score: 40.36  E-value: 9.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839  124 DLAALEKVLEEMAVLGYGLPNQACAHL--AAALVRARRLDD-AVLAVAVMRRLKFRPAF-SAYTVLIGALAEARRPERAL 199
Cdd:smart01043   2 DVDRFERLVAAARAALAADPEAALALLeaALALYRGPLLADvPDEDWAEAERERLRELRlEALEALAEALLALGRHEEAL 81
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 1002286839  200 ELLRQMQEV--GYEVGVHLfttLVRALAREGQVADALA 235
Cdd:smart01043  82 ALLERLLALdpLRERLHRL---LMRALYRAGRRAEALR 116
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
322-346 1.58e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 36.55  E-value: 1.58e-03
                          10        20
                  ....*....|....*....|....*
gi 1002286839 322 YNTMIMGYGSAGRFEDAYKLLERLR 346
Cdd:pfam12854  10 YNTLINGLCRAGRVDEAFELLDEME 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
355-383 1.76e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 36.66  E-value: 1.76e-03
                          10        20
                  ....*....|....*....|....*....
gi 1002286839 355 VSFNSILTCLGKKRKVDEALSLFEVMKKD 383
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKER 29
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
236-291 2.06e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 37.34  E-value: 2.06e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1002286839 236 LVDEVKGSCLEPDIVLYNVCIDCFGKAGNVDMAWKFFHELKAQGLKPDDVSYTSMI 291
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
479-535 3.48e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 36.57  E-value: 3.48e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1002286839 479 RLFEKMLDAGHNANPVVYTSLIRNFFIHGRKEDGHKIFKELIRRGCKPDLTLLNTYM 535
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
BepA COG4783
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ...
222-348 4.14e-03

Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443813 [Multi-domain]  Cd Length: 139  Bit Score: 38.63  E-value: 4.14e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 222 RALAREGQVADALALVDEVKGscLEPD-IVLYNVCIDCFGKAGNVDMAWKFFHELKAqgLKPDDV-SYTSMIWVLCKAGR 299
Cdd:COG4783    12 QALLLAGDYDEAEALLEKALE--LDPDnPEAFALLGEILLQLGDLDEAIVLLHEALE--LDPDEPeARLNLGLALLKAGD 87
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002286839 300 LGEAEELFAQ-MEAERSVPCAYAYNTMImgYGSAGRFEDAYKLLERLRER 348
Cdd:COG4783    88 YDEALALLEKaLKLDPEHPEAYLRLARA--YRALGRPDEAIAALEKALEL 135
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
564-598 4.76e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.51  E-value: 4.76e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002286839 564 RSYSILIHGLTKAGQARETSNIFHAMKQQGFALDA 598
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
739-769 4.87e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.13  E-value: 4.87e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002286839 739 YTYSILINGLCRVQKYNKAFVFWQDMQKQGL 769
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
635-664 5.22e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.13  E-value: 5.22e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 1002286839 635 TYGAIVDGLAKIDRLDEAYMLFEEAKSKGI 664
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
409-470 5.42e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 36.18  E-value: 5.42e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1002286839 409 RILDEMEHASLFPNLLTVNIMVDRLCKARKLEEAYKIFESASQRGCNPDCVTYCSLIDGLGK 470
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGG 62
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
180-209 5.59e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.13  E-value: 5.59e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 1002286839 180 SAYTVLIGALAEARRPERALELLRQMQEVG 209
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
726-781 7.80e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 35.80  E-value: 7.80e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1002286839 726 FQSMKEMKCPPNTYTYSILINGLCRVQKYNKAFVFWQDMQKQGLVPNVVTYTTMIS 781
Cdd:pfam13812   3 LREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILG 58
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
802-833 8.56e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 34.63  E-value: 8.56e-03
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1002286839 802 NGGIPDAASFNALIEGMSNANRAMEAYQVFEE 833
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDE 32
TadD COG5010
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ...
196-348 8.80e-03

Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444034 [Multi-domain]  Cd Length: 155  Bit Score: 38.02  E-value: 8.80e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 196 ERALELLRQMQEVGYEVGVHLFTTLVRALAREGQVADALALVDEVKGSCLEPDIVLYNVCIDCFGKAGNVDMAWKFFHel 275
Cdd:COG5010     1 ARALEGFDRLPLYLLLLTKLRTLVEKYEAALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLGDFEESLALLE-- 78
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1002286839 276 KAQGLKPDDVS-YTSMIWVLCKAGRLGEAEELFAQmeAERSVP-CAYAYNTMIMGYGSAGRFEDAYKLLERLRER 348
Cdd:COG5010    79 QALQLDPNNPElYYNLALLYSRSGDKDEAKEYYEK--ALALSPdNPNAYSNLAALLLSLGQDDEAKAALQRALGT 151
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
373-488 9.30e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 38.53  E-value: 9.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002286839 373 ALSLFEVMKKDAEP-NSSTYNIIIDmLCLGGRV----------EEAYRILDEMEHASLFPNLLTVNIMVdRLC-KARKLE 440
Cdd:pfam17177  30 ALALYDAAKAEGVRlAQYHYNVLLY-LCSKAADatdlkpqlaaDRGFEVFEAMKAQGVSPNEATYTAVA-RLAaAKGDGD 107
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 1002286839 441 EAYKIFESASQRGCNPDCVTYCSLIDGLGKKGQVDEAYRLFEKMLDAG 488
Cdd:pfam17177 108 LAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHG 155
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH