NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1002246397|ref|XP_015627373|]
View 

pentatricopeptide repeat-containing protein At1g09900 [Oryza sativa Japonica Group]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 13329975)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner

CATH:  1.25.40.10
Gene Ontology:  GO:0003723
SCOP:  4001344

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
78-487 1.05e-37

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 148.87  E-value: 1.05e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397   78 RGELDEALRLVGSARRPDAGTCAALIKKLSASGRTAEARRVL-----AACGPDVMAYNAMVAGYCGAGQLDAARRLVAEM 152
Cdd:PLN03218   419 QRAVKEAFRFAKLIRNPTLSTFNMLMSVCASSQDIDGALRVLrlvqeAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEM 498
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  153 P---VEPDAYTYNTLIRGlCGR-GRTANALAVLDEMLRRRCVPDVVTYTILLEAtCKRSGYKQ-AMKLLDEMR--DKGCT 225
Cdd:PLN03218   499 VnagVEANVHTFGALIDG-CARaGQVAKAFGAYGIMRSKNVKPDRVVFNALISA-CGQSGAVDrAFDVLAEMKaeTHPID 576
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  226 PDIVTYNVVVNGICQEGRVDDAIEFLKNLPSYGCEPNTVSYNIVLKGLCTAERWEDAEELMGEMGQKGCPPNVVTFNMLI 305
Cdd:PLN03218   577 PDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALV 656
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  306 SFLCRKGLVEPALEVLEQIPKYGCTPNSLSYNPLLHAFCKQKKMDKAMAFLDLMVSRGCYPDIVSYNTLLTALCRSGEVD 385
Cdd:PLN03218   657 DVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLP 736
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  386 vavellhqlkdkgcapvlisyntvidgltkagktkEALELLNEMVSKGLQPDIITYSTIAAGLCREDRIEDAIRAFGKVQ 465
Cdd:PLN03218   737 -----------------------------------KALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAK 781
                          410       420
                   ....*....|....*....|..
gi 1002246397  466 DMGIRPNTVLYNAIIlGLCKRR 487
Cdd:PLN03218   782 EDGIKPNLVMCRCIT-GLCLRR 802
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
471-519 2.52e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 67.00  E-value: 2.52e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002246397 471 PNTVLYNAIILGLCKRRETHSAIDLFAYMIGNGCMPNESTYTILIEGLA 519
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
366-415 4.54e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 63.54  E-value: 4.54e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002246397 366 PDIVSYNTLLTALCRSGEVDVAVELLHQLKDKGCAPVLISYNTVIDGLTK 415
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 super family cl38385
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
506-539 5.40e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam13041:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 38.11  E-value: 5.40e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002246397 506 PNESTYTILIEGLAYEGLIKEARDLLDELCSRGV 539
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGV 34
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
78-487 1.05e-37

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 148.87  E-value: 1.05e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397   78 RGELDEALRLVGSARRPDAGTCAALIKKLSASGRTAEARRVL-----AACGPDVMAYNAMVAGYCGAGQLDAARRLVAEM 152
Cdd:PLN03218   419 QRAVKEAFRFAKLIRNPTLSTFNMLMSVCASSQDIDGALRVLrlvqeAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEM 498
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  153 P---VEPDAYTYNTLIRGlCGR-GRTANALAVLDEMLRRRCVPDVVTYTILLEAtCKRSGYKQ-AMKLLDEMR--DKGCT 225
Cdd:PLN03218   499 VnagVEANVHTFGALIDG-CARaGQVAKAFGAYGIMRSKNVKPDRVVFNALISA-CGQSGAVDrAFDVLAEMKaeTHPID 576
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  226 PDIVTYNVVVNGICQEGRVDDAIEFLKNLPSYGCEPNTVSYNIVLKGLCTAERWEDAEELMGEMGQKGCPPNVVTFNMLI 305
Cdd:PLN03218   577 PDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALV 656
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  306 SFLCRKGLVEPALEVLEQIPKYGCTPNSLSYNPLLHAFCKQKKMDKAMAFLDLMVSRGCYPDIVSYNTLLTALCRSGEVD 385
Cdd:PLN03218   657 DVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLP 736
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  386 vavellhqlkdkgcapvlisyntvidgltkagktkEALELLNEMVSKGLQPDIITYSTIAAGLCREDRIEDAIRAFGKVQ 465
Cdd:PLN03218   737 -----------------------------------KALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAK 781
                          410       420
                   ....*....|....*....|..
gi 1002246397  466 DMGIRPNTVLYNAIIlGLCKRR 487
Cdd:PLN03218   782 EDGIKPNLVMCRCIT-GLCLRR 802
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
331-380 2.86e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 69.70  E-value: 2.86e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002246397 331 PNSLSYNPLLHAFCKQKKMDKAMAFLDLMVSRGCYPDIVSYNTLLTALCR 380
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
471-519 2.52e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 67.00  E-value: 2.52e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002246397 471 PNTVLYNAIILGLCKRRETHSAIDLFAYMIGNGCMPNESTYTILIEGLA 519
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
366-415 4.54e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 63.54  E-value: 4.54e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002246397 366 PDIVSYNTLLTALCRSGEVDVAVELLHQLKDKGCAPVLISYNTVIDGLTK 415
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
404-438 5.41e-08

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 48.99  E-value: 5.41e-08
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002246397 404 ISYNTVIDGLTKAGKTKEALELLNEMVSKGLQPDI 438
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
77-286 2.36e-04

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 43.18  E-value: 2.36e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  77 RRGELDEALRLVGSARRPD---AGTCAALIKKLSASGRTAEA----RRVLAACGPDVMAYNAMVAGYCGAGQLDAA---- 145
Cdd:COG2956    54 RRGEYDRAIRIHQKLLERDpdrAEALLELAQDYLKAGLLDRAeellEKLLELDPDDAEALRLLAEIYEQEGDWEKAievl 133
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 146 RRLVAEMPVEPDAYTYntLIRGLCGRGRTANALAVLDEMLRRRcvPDVVTYTILLEATCKRSG-YKQAMKLLDEMRDKgc 224
Cdd:COG2956   134 ERLLKLGPENAHAYCE--LAELYLEQGDYDEAIEALEKALKLD--PDCARALLLLAELYLEQGdYEEAIAALERALEQ-- 207
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1002246397 225 TPD-IVTYNVVVNGICQEGRVDDAIEFLKNLpsYGCEPNTVSYNIVLKGLCTAERWEDAEELM 286
Cdd:COG2956   208 DPDyLPALPRLAELYEKLGDPEEALELLRKA--LELDPSDDLLLALADLLERKEGLEAALALL 268
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
474-508 4.32e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.82  E-value: 4.32e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002246397 474 VLYNAIILGLCKRRETHSAIDLFAYMIGNGCMPNE 508
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
506-539 5.40e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 38.11  E-value: 5.40e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002246397 506 PNESTYTILIEGLAYEGLIKEARDLLDELCSRGV 539
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGV 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
510-540 4.40e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.12  E-value: 4.40e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002246397 510 TYTILIEGLAYEGLIKEARDLLDELCSRGVV 540
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIE 32
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
78-487 1.05e-37

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 148.87  E-value: 1.05e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397   78 RGELDEALRLVGSARRPDAGTCAALIKKLSASGRTAEARRVL-----AACGPDVMAYNAMVAGYCGAGQLDAARRLVAEM 152
Cdd:PLN03218   419 QRAVKEAFRFAKLIRNPTLSTFNMLMSVCASSQDIDGALRVLrlvqeAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEM 498
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  153 P---VEPDAYTYNTLIRGlCGR-GRTANALAVLDEMLRRRCVPDVVTYTILLEAtCKRSGYKQ-AMKLLDEMR--DKGCT 225
Cdd:PLN03218   499 VnagVEANVHTFGALIDG-CARaGQVAKAFGAYGIMRSKNVKPDRVVFNALISA-CGQSGAVDrAFDVLAEMKaeTHPID 576
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  226 PDIVTYNVVVNGICQEGRVDDAIEFLKNLPSYGCEPNTVSYNIVLKGLCTAERWEDAEELMGEMGQKGCPPNVVTFNMLI 305
Cdd:PLN03218   577 PDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALV 656
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  306 SFLCRKGLVEPALEVLEQIPKYGCTPNSLSYNPLLHAFCKQKKMDKAMAFLDLMVSRGCYPDIVSYNTLLTALCRSGEVD 385
Cdd:PLN03218   657 DVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLP 736
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  386 vavellhqlkdkgcapvlisyntvidgltkagktkEALELLNEMVSKGLQPDIITYSTIAAGLCREDRIEDAIRAFGKVQ 465
Cdd:PLN03218   737 -----------------------------------KALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAK 781
                          410       420
                   ....*....|....*....|..
gi 1002246397  466 DMGIRPNTVLYNAIIlGLCKRR 487
Cdd:PLN03218   782 EDGIKPNLVMCRCIT-GLCLRR 802
PLN03218 PLN03218
maturation of RBCL 1; Provisional
155-507 2.34e-32

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 132.69  E-value: 2.34e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  155 EPDAYTYNTLIRGLCGRGRTANALAVLDEMLRRRCVPDVVTYTILLeATCKRSGYKQAM-KLLDEMRDKGCTPDIVTYNV 233
Cdd:PLN03218   434 NPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLI-STCAKSGKVDAMfEVFHEMVNAGVEANVHTFGA 512
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  234 VVNGICQEGRVDDAIEFLKNLPSYGCEPNTVSYNIVLKGLCTAERWEDAEELMGEMGQKGCP--PNVVTFNMLISFLCRK 311
Cdd:PLN03218   513 LIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKAETHPidPDHITVGALMKACANA 592
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  312 GLVEPALEVLEQIPKYGCTPNSLSYNPLLHAFCKQKKMDKAMAFLDLMVSRGCYPDIVSYNTLLTALCRSGEVDVAVELL 391
Cdd:PLN03218   593 GQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEIL 672
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  392 HQLKDKGCAPVLISYNTVIDGLTKAGKTKEALELLNEMVSKGLQPDIITYSTIAAGLCREDRIEDAIRAFGKVQDMGIRP 471
Cdd:PLN03218   673 QDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCP 752
                          330       340       350
                   ....*....|....*....|....*....|....*..
gi 1002246397  472 NTVLYnAIILGLCKRR-ETHSAIDLFAYMIGNGCMPN 507
Cdd:PLN03218   753 NTITY-SILLVASERKdDADVGLDLLSQAKEDGIKPN 788
PLN03218 PLN03218
maturation of RBCL 1; Provisional
161-539 1.11e-30

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 127.69  E-value: 1.11e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  161 YNTLIRGlcgrGRTANALAVLDEMLRR--------------------RCVPDVV------------TYTILLEATCKRSG 208
Cdd:PLN03218   377 YNRLLRD----GRIKDCIDLLEDMEKRglldmdkiyhakffkackkqRAVKEAFrfaklirnptlsTFNMLMSVCASSQD 452
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  209 YKQAMKLLDEMRDKGCTPDIVTYNVVVNGICQEGRVDDAIEFLKNLPSYGCEPNTVSYNIVLKGLCTAERWEDAEELMGE 288
Cdd:PLN03218   453 IDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGI 532
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  289 MGQKGCPPNVVTFNMLISFLCRKGLVEPALEVL-----EQIPkygCTPNSLSYNPLLHAFCKQKKMDKAMAFLDLMVSRG 363
Cdd:PLN03218   533 MRSKNVKPDRVVFNALISACGQSGAVDRAFDVLaemkaETHP---IDPDHITVGALMKACANAGQVDRAKEVYQMIHEYN 609
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  364 CYPDIVSYNTLLTALCRSGEVDVAVELLHQLKDKGCAPVLISYNTVIDGLTKAGKTKEALELLNEMVSKGLQPDIITYST 443
Cdd:PLN03218   610 IKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSS 689
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  444 IAAGLCREDRIEDAIRAFGKVQDMGIRPNTVLYNAIILGLCKRRETHSAIDLFAYMIGNGCMPNESTYTILIEGLAYEGL 523
Cdd:PLN03218   690 LMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDD 769
                          410
                   ....*....|....*.
gi 1002246397  524 IKEARDLLDELCSRGV 539
Cdd:PLN03218   770 ADVGLDLLSQAKEDGI 785
PLN03077 PLN03077
Protein ECB2; Provisional
125-429 1.21e-22

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 102.62  E-value: 1.21e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 125 DVMAYNAMVAGYCGAGQLDAARRLVAEMPVEpDAYTYNTLIRGLCGRGRTANALAVLDEMLRRRCVPDVVTYTILLEATC 204
Cdd:PLN03077  322 DVSVCNSLIQMYLSLGSWGEAEKVFSRMETK-DAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACA 400
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 205 KRSGYKQAMKLLDEMRDKGCTPDIVTYNVVVNGICQEGRVDDAIEFLKNLPsygcEPNTVSYNIVLKGLCTAERWEDAEE 284
Cdd:PLN03077  401 CLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIP----EKDVISWTSIIAGLRLNNRCFEALI 476
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 285 LMGEMGQKgCPPNVVTFNMLISFLCRKGLVEPALEVLEQIPKYGC-----TPNSL------------------------- 334
Cdd:PLN03077  477 FFRQMLLT-LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIgfdgfLPNALldlyvrcgrmnyawnqfnshekdvv 555
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 335 SYNPLLHAFCKQKKMDKAMAFLDLMVSRGCYPDIVSYNTLLTALCRSGEVDVAVELLHQLKDK-GCAPVLISYNTVIDGL 413
Cdd:PLN03077  556 SWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKySITPNLKHYACVVDLL 635
                         330
                  ....*....|....*.
gi 1002246397 414 TKAGKTKEALELLNEM 429
Cdd:PLN03077  636 GRAGKLTEAYNFINKM 651
PLN03077 PLN03077
Protein ECB2; Provisional
71-486 4.43e-20

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 94.53  E-value: 4.43e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  71 RLRSLVRRGELDEALRLVGSARR----PDAGTCAALIK----KLSAS-------------------------------GR 111
Cdd:PLN03077   57 QLRALCSHGQLEQALKLLESMQElrvpVDEDAYVALFRlcewKRAVEegsrvcsralsshpslgvrlgnamlsmfvrfGE 136
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 112 TAEARRVLAACGP-DVMAYNAMVAGYCGAGQLDAARRLVAEM---PVEPDAYTYNTLIRGlCG------RGRTANALAVL 181
Cdd:PLN03077  137 LVHAWYVFGKMPErDLFSWNVLVGGYAKAGYFDEALCLYHRMlwaGVRPDVYTFPCVLRT-CGgipdlaRGREVHAHVVR 215
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 182 ----DEMlrrrcvpDVVTYTILLEATCkrSGYKQAMKLLDEMrdkgCTPDIVTYNVVVNGICQEGRVDDAIEFLKNLPSY 257
Cdd:PLN03077  216 fgfeLDV-------DVVNALITMYVKC--GDVVSARLVFDRM----PRRDCISWNAMISGYFENGECLEGLELFFTMREL 282
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 258 GCEPN--TVSYNIVLKGLCTAERWedAEELMGEMGQKGCPPNVVTFNMLISFLCRKGLVEPALEVLEQIPkygcTPNSLS 335
Cdd:PLN03077  283 SVDPDlmTITSVISACELLGDERL--GREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRME----TKDAVS 356
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 336 YNPLLHAFCKQKKMDKAMAFLDLMVSRGCYPDIVSYNTLLTALCRSGEVDVAVElLHQLKDKGCapvLISY----NTVID 411
Cdd:PLN03077  357 WTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVK-LHELAERKG---LISYvvvaNALIE 432
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1002246397 412 GLTKAGKTKEALELLNEMVSKglqpDIITYSTIAAGLCREDRIEDAIRAFGKVQdMGIRPNTVLYNAiILGLCKR 486
Cdd:PLN03077  433 MYSKCKCIDKALEVFHNIPEK----DVISWTSIIAGLRLNNRCFEALIFFRQML-LTLKPNSVTLIA-ALSACAR 501
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
168-530 1.71e-19

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 92.24  E-value: 1.71e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 168 LCGRGRTANALAvldEMLRRRC--VPDVVTYTILLEATCKRSGYKQAMKLLDEMRDKGCTPDIVTYNVVVNGICQEGRVD 245
Cdd:PLN03081   99 ACGRHREALELF---EILEAGCpfTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLI 175
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 246 DAIEFLKNLPsygcEPNTVSYNIVLKGLCTAERWEDAEELMGEMGQKGCPPNVVTFNMLISFLCRKGLVEPALEVLEQIP 325
Cdd:PLN03081  176 DARRLFDEMP----ERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVL 251
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 326 KYGCTPNSLSYNPLLHAFCKQKKMDKAMAFLDLMVSRgcypDIVSYNTLLTALCRSGEVDVAVELLHQLKDKGCAPVLIS 405
Cdd:PLN03081  252 KTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEK----TTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFT 327
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 406 YNTVIDGLTKAGKTKEALELLNEMVSKGLQPDIITYSTIAAGLCREDRIEDAIRAFgkvqDMGIRPNTVLYNAIILGLCK 485
Cdd:PLN03081  328 FSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVF----DRMPRKNLISWNALIAGYGN 403
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*
gi 1002246397 486 RRETHSAIDLFAYMIGNGCMPNESTYTILIEGLAYEGLIKEARDL 530
Cdd:PLN03081  404 HGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEI 448
PLN03218 PLN03218
maturation of RBCL 1; Provisional
266-540 9.15e-19

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 90.32  E-value: 9.15e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  266 YNIVLKglctAERWEDAEELMGEMGQKGcppnvvTFNML----ISFL--CRK-GLVEPALEVLEQIPkygcTPNSLSYNP 338
Cdd:PLN03218   377 YNRLLR----DGRIKDCIDLLEDMEKRG------LLDMDkiyhAKFFkaCKKqRAVKEAFRFAKLIR----NPTLSTFNM 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  339 LLHAFCKQKKMDKAMAFLDLMVSRGCYPDIVSYNTLLTALCRSGEVDVAVELLHQLKDKGCAPVLISYNTVIDGLTKAGK 418
Cdd:PLN03218   443 LMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQ 522
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  419 TKEALELLNEMVSKGLQPDIITYST-IAAglCREDRIEDaiRAFGKVQDMG-----IRPNTVLYNAIILGLCKRRETHSA 492
Cdd:PLN03218   523 VAKAFGAYGIMRSKNVKPDRVVFNAlISA--CGQSGAVD--RAFDVLAEMKaethpIDPDHITVGALMKACANAGQVDRA 598
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 1002246397  493 IDLFAYMIGNGCMPNESTYTILIEGLAYEGLIKEARDLLDELCSRGVV 540
Cdd:PLN03218   599 KEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVK 646
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
75-434 2.05e-18

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 88.77  E-value: 2.05e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  75 LVRRGELDEALRL------VGSArrPDAGTCAALIKKLSASGrTAEARRVLAAC------GPDVMAYNAMVAGYCGAGQL 142
Cdd:PLN03081  199 LVDAGNYREAFALfremweDGSD--AEPRTFVVMLRASAGLG-SARAGQQLHCCvlktgvVGDTFVSCALIDMYSKCGDI 275
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 143 DAARRLVAEMPvEPDAYTYNTLIRGLCGRGRTANALAVLDEMLRRRCVPDVVTYTILLEATCKRSGYKQAMKLLDEMRDK 222
Cdd:PLN03081  276 EDARCVFDGMP-EKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRT 354
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 223 GCTPDIVTYNVVVNGICQEGRVDDAIEFLKNLPsygcEPNTVSYNIVLKGLCTAERWEDAEELMGEMGQKGCPPNVVTFN 302
Cdd:PLN03081  355 GFPLDIVANTALVDLYSKWGRMEDARNVFDRMP----RKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFL 430
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 303 MLISFLCRKGLVEPALEVLEQIPK-YGCTPNSLSYNPLLHAFCKQKKMDKAMAfldlMVSRGCY-PDIVSYNTLLTAlCR 380
Cdd:PLN03081  431 AVLSACRYSGLSEQGWEIFQSMSEnHRIKPRAMHYACMIELLGREGLLDEAYA----MIRRAPFkPTVNMWAALLTA-CR 505
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1002246397 381 SGE----VDVAVELLHQLKDKGCAPVLISYNTVIdgltKAGKTKEALELLNEMVSKGL 434
Cdd:PLN03081  506 IHKnlelGRLAAEKLYGMGPEKLNNYVVLLNLYN----SSGRQAEAAKVVETLKRKGL 559
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
75-458 3.92e-18

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 88.00  E-value: 3.92e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  75 LVRRGELDEALRL-----VGSARRPDAGTCAALikkLSASGRTAEARRVLAACG--------PDVMAYNAMVAGYCGAGQ 141
Cdd:PLN03081   97 LVACGRHREALELfeileAGCPFTLPASTYDAL---VEACIALKSIRCVKAVYWhvessgfePDQYMMNRVLLMHVKCGM 173
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 142 LDAARRLVAEMPvEPDAYTYNTLIRGLCGRGRTANALAVLDEMLRRRCVPDVVTYTILLEATCKrSGYKQAMKLLDEMRD 221
Cdd:PLN03081  174 LIDARRLFDEMP-ERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAG-LGSARAGQQLHCCVL 251
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 222 K-GCTPDIVTYNVVVNGICQEGRVDDAIEFLKNLPsygcEPNTVSYNIVLKGLCTAERWEDAEELMGEMGQKGCPPNVVT 300
Cdd:PLN03081  252 KtGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMP----EKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFT 327
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 301 FNMLISFLCRKGLVEPALEVLEQIPKYGCTPNSLSYNPLLHAFCKQKKMDKAMAFLDLMVSRgcypDIVSYNTLLTALCR 380
Cdd:PLN03081  328 FSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRK----NLISWNALIAGYGN 403
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1002246397 381 SGEVDVAVELLHQLKDKGCAPVLISYNTVIDGLTKAGKTKEALELLNEMV-SKGLQPDIITYSTIAAGLCREDRIEDAI 458
Cdd:PLN03081  404 HGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSeNHRIKPRAMHYACMIELLGREGLLDEAY 482
PLN03077 PLN03077
Protein ECB2; Provisional
77-480 1.80e-15

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 79.89  E-value: 1.80e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  77 RRGELDEAL----RLVGSARRPDAGTCAALIKKLS-----ASGRTAEARRVLAACGPDVMAYNAMVAGYCGAGQLDAARR 147
Cdd:PLN03077  164 KAGYFDEALclyhRMLWAGVRPDVYTFPCVLRTCGgipdlARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARL 243
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 148 LVAEMPVEpDAYTYNTLIRGLCGRGRTANALAVLDEMLRRRCVPDVVTYTILLEAT---------------CKRSGY--- 209
Cdd:PLN03077  244 VFDRMPRR-DCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSVISACellgderlgremhgyVVKTGFavd 322
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 210 -----------------KQAMKLLDEMRDKgctpDIVTYNVVVNGICQEGRVDDAIEFLKNLPSYGCEPNTVSYNIVLKG 272
Cdd:PLN03077  323 vsvcnsliqmylslgswGEAEKVFSRMETK----DAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSA 398
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 273 LCTAERWEDAEELMGEMGQKGCPPNVVTFNMLISFLCRKGLVEPALEVLEQIPKygctPNSLSYNPLLHAFCKQKKMDKA 352
Cdd:PLN03077  399 CACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPE----KDVISWTSIIAGLRLNNRCFEA 474
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 353 MAFLDLMVSRgCYPDIVSYNTLLTALCRSGEVDVAVEL-LHQLKD-----------------------------KGCAPV 402
Cdd:PLN03077  475 LIFFRQMLLT-LKPNSVTLIAALSACARIGALMCGKEIhAHVLRTgigfdgflpnalldlyvrcgrmnyawnqfNSHEKD 553
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1002246397 403 LISYNTVIDGLTKAGKTKEALELLNEMVSKGLQPDIITYSTIAAGLCREDRIEDAIRAFGKVQD-MGIRPNTVLYNAII 480
Cdd:PLN03077  554 VVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEkYSITPNLKHYACVV 632
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
331-380 2.86e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 69.70  E-value: 2.86e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002246397 331 PNSLSYNPLLHAFCKQKKMDKAMAFLDLMVSRGCYPDIVSYNTLLTALCR 380
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
261-310 3.84e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 69.31  E-value: 3.84e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002246397 261 PNTVSYNIVLKGLCTAERWEDAEELMGEMGQKGCPPNVVTFNMLISFLCR 310
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
156-205 3.87e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 69.31  E-value: 3.87e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002246397 156 PDAYTYNTLIRGLCGRGRTANALAVLDEMLRRRCVPDVVTYTILLEATCK 205
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
226-274 8.46e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 68.54  E-value: 8.46e-15
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002246397 226 PDIVTYNVVVNGICQEGRVDDAIEFLKNLPSYGCEPNTVSYNIVLKGLC 274
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
191-240 9.51e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 68.54  E-value: 9.51e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002246397 191 PDVVTYTILLEATCKRSGYKQAMKLLDEMRDKGCTPDIVTYNVVVNGICQ 240
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
401-450 1.29e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 68.16  E-value: 1.29e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002246397 401 PVLISYNTVIDGLTKAGKTKEALELLNEMVSKGLQPDIITYSTIAAGLCR 450
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
471-519 2.52e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 67.00  E-value: 2.52e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002246397 471 PNTVLYNAIILGLCKRRETHSAIDLFAYMIGNGCMPNESTYTILIEGLA 519
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
124-169 3.58e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 66.62  E-value: 3.58e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002246397 124 PDVMAYNAMVAGYCGAGQLDAARRLVAEMP---VEPDAYTYNTLIRGLC 169
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKkrgVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
366-415 4.54e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 63.54  E-value: 4.54e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002246397 366 PDIVSYNTLLTALCRSGEVDVAVELLHQLKDKGCAPVLISYNTVIDGLTK 415
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
296-345 9.91e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 62.77  E-value: 9.91e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002246397 296 PNVVTFNMLISFLCRKGLVEPALEVLEQIPKYGCTPNSLSYNPLLHAFCK 345
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
203-396 1.09e-11

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 64.34  E-value: 1.09e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 203 TCKRSG-YKQAMKLLDEMRDKGCTPDIVTYNVVVNgICQEGrvDDAIEFLKNLPSYgcepntvsynivlkglctaerweD 281
Cdd:pfam17177  20 KCSKHAdATGALALYDAAKAEGVRLAQYHYNVLLY-LCSKA--ADATDLKPQLAAD-----------------------R 73
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 282 AEELMGEMGQKGCPPNVVTFNMLISFLCRKGLVEPALEVLEQIPKYGCTPNSLSYNPLLHAFCKQKKMDKAMAFLDLMVS 361
Cdd:pfam17177  74 GFEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLA 153
                         170       180       190
                  ....*....|....*....|....*....|....*
gi 1002246397 362 RGCYPDIVSYNTLLTALCRSGEVDVAVELLHQLKD 396
Cdd:pfam17177 154 HGVELEEPELAALLKVSAKAGRADKVYAYLHRLRD 188
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
180-238 3.98e-10

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 55.83  E-value: 3.98e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1002246397 180 VLDEMLRRRCVPDVVTYTILLEATCKRSGYKQAMKLLDEMRDKGCTPDIVTYNVVVNGI 238
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PLN03077 PLN03077
Protein ECB2; Provisional
136-510 2.22e-09

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 60.25  E-value: 2.22e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 136 YCGAGQLDAARRLVAEMPVEPDAYTYNTLIRGLCGRGRTANALAVLDEMLRRRCVPDVVTYTILLEATCKRSGYKQAMKL 215
Cdd:PLN03077   29 YWNFHGRKRSRGLSVAASSSSSTHDSNSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFRLCEWKRAVEEGSRV 108
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 216 LDEMRDKGCTPDIVTYNVVVNGICQEGRVDDAIEFLKNLPsygcEPNTVSYNIVLKGLCTAERWEDAEELMGEMGQKGCP 295
Cdd:PLN03077  109 CSRALSSHPSLGVRLGNAMLSMFVRFGELVHAWYVFGKMP----ERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVR 184
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 296 PNVVTFNMLISfLCrKGLVEPAL--EVLEQIPKYGCTPNSLSYNPLLHAFCKQKKMDKAMAFLDLMVSRgcypDIVSYNT 373
Cdd:PLN03077  185 PDVYTFPCVLR-TC-GGIPDLARgrEVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRR----DCISWNA 258
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 374 LLTALCRSGEVDVAVELLHQLKDKGCAPVLISYNTVIDGLTKAGKTKEALELLNEMVSKGLQPDIITYSTIAAGLCREDR 453
Cdd:PLN03077  259 MISGYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGS 338
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1002246397 454 IEDAIRAFGKVQdmgiRPNTVLYNAIILGLCKRRETHSAIDLFAYMIGNGCMPNEST 510
Cdd:PLN03077  339 WGEAEKVFSRME----TKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEIT 391
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
222-252 3.16e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 52.35  E-value: 3.16e-09
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002246397 222 KGCTPDIVTYNVVVNGICQEGRVDDAIEFLK 252
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLD 31
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
436-485 5.41e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.98  E-value: 5.41e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002246397 436 PDIITYSTIAAGLCREDRIEDAIRAFGKVQDMGIRPNTVLYNAIILGLCK 485
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
292-323 7.63e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 51.19  E-value: 7.63e-09
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1002246397 292 KGCPPNVVTFNMLISFLCRKGLVEPALEVLEQ 323
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDE 32
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
362-395 1.27e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 50.42  E-value: 1.27e-08
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002246397 362 RGCYPDIVSYNTLLTALCRSGEVDVAVELLHQLK 395
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
397-430 1.75e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 50.04  E-value: 1.75e-08
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002246397 397 KGCAPVLISYNTVIDGLTKAGKTKEALELLNEMV 430
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
154-184 2.86e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 49.65  E-value: 2.86e-08
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002246397 154 VEPDAYTYNTLIRGLCGRGRTANALAVLDEM 184
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
404-438 5.41e-08

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 48.99  E-value: 5.41e-08
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002246397 404 ISYNTVIDGLTKAGKTKEALELLNEMVSKGLQPDI 438
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
425-487 5.52e-08

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 49.66  E-value: 5.52e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1002246397 425 LLNEMVSKGLQPDIITYSTIAAGLCREDRIEDAIRAFGKVQDMGIRPNTVLYNAIIlGLCKRR 487
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL-GVIGGR 63
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
356-410 1.13e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 48.89  E-value: 1.13e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1002246397 356 LDLMVSRGCYPDIVSYNTLLTALCRSGEVDVAVELLHQLKDKGCAPVLISYNTVI 410
Cdd:pfam13812   3 LREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
258-289 1.14e-07

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 47.73  E-value: 1.14e-07
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1002246397 258 GCEPNTVSYNIVLKGLCTAERWEDAEELMGEM 289
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
159-193 1.91e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 47.45  E-value: 1.91e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002246397 159 YTYNTLIRGLCGRGRTANALAVLDEMLRRRCVPDV 193
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
335-368 2.15e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 47.06  E-value: 2.15e-07
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002246397 335 SYNPLLHAFCKQKKMDKAMAFLDLMVSRGCYPDI 368
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
404-434 4.19e-07

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 46.30  E-value: 4.19e-07
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002246397 404 ISYNTVIDGLTKAGKTKEALELLNEMVSKGL 434
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
214-271 1.39e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 45.81  E-value: 1.39e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1002246397 214 KLLDEMRDKGCTPDIVTYNVVVNGICQEGRVDDAIEFLKNLPSYGCEPNTVSYNIVLK 271
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILG 58
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
328-360 2.88e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 43.87  E-value: 2.88e-06
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002246397 328 GCTPNSLSYNPLLHAFCKQKKMDKAMAFLDLMV 360
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
264-298 3.51e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.60  E-value: 3.51e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002246397 264 VSYNIVLKGLCTAERWEDAEELMGEMGQKGCPPNV 298
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PLN03077 PLN03077
Protein ECB2; Provisional
232-540 4.94e-06

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 49.46  E-value: 4.94e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 232 NVVVNGICQEGRVDDAIEFLKNLPSYGCEPNTVSYnIVLKGLCtaeRWEDAEElMGEMGQKGCPPNVVTF-----NMLIS 306
Cdd:PLN03077   55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAY-VALFRLC---EWKRAVE-EGSRVCSRALSSHPSLgvrlgNAMLS 129
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 307 FLCRKGLVEPALEVLEQIPKYgctpNSLSYNPLLHAFCKQKKMDKAMAFLDLMVSRGCYPDIVSYNTLLTA------LCR 380
Cdd:PLN03077  130 MFVRFGELVHAWYVFGKMPER----DLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTcggipdLAR 205
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 381 SGEVDVavellHQLKdKGCAPVLISYNTVIDGLTKAGKTKEALELLNEMVSKglqpDIITYSTIAAGLCREDRIEDAIRA 460
Cdd:PLN03077  206 GREVHA-----HVVR-FGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRR----DCISWNAMISGYFENGECLEGLEL 275
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 461 FGKVQDMGIRPNTVLYNAIILGLCKRRETHSAIDLFAYMIGNGCMPNESTYTILIEGLAYEGLIKEARDLLDELCSRGVV 540
Cdd:PLN03077  276 FFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAV 355
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
187-220 5.19e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 43.10  E-value: 5.19e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002246397 187 RRCVPDVVTYTILLEATCKRSGYKQAMKLLDEMR 220
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
285-343 6.35e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 43.89  E-value: 6.35e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1002246397 285 LMGEMGQKGCPPNVVTFNMLISFLCRKGLVEPALEVLEQIPKYGCTPNSLSYNPLLHAF 343
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
502-533 6.77e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 42.72  E-value: 6.77e-06
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1002246397 502 NGCMPNESTYTILIEGLAYEGLIKEARDLLDE 533
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDE 32
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
369-401 8.25e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 42.83  E-value: 8.25e-06
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002246397 369 VSYNTLLTALCRSGEVDVAVELLHQLKDKGCAP 401
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEP 33
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
432-463 9.73e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 42.33  E-value: 9.73e-06
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1002246397 432 KGLQPDIITYSTIAAGLCREDRIEDAIRAFGK 463
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDE 32
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
258-306 1.17e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 43.12  E-value: 1.17e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002246397 258 GCEPNTVSYNIVLKGLCTAERWEDAEELMGEMGQKGCPPNVVTFNMLIS 306
Cdd:pfam13812  10 GIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILG 58
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
390-444 1.51e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 42.73  E-value: 1.51e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1002246397 390 LLHQLKDKGCAPVLISYNTVIDGLTKAGKTKEALELLNEMVSKGLQPDIITYSTI 444
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAI 56
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
229-263 1.88e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 41.67  E-value: 1.88e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002246397 229 VTYNVVVNGICQEGRVDDAIEFLKNLPSYGCEPNT 263
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
369-399 2.98e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 40.91  E-value: 2.98e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002246397 369 VSYNTLLTALCRSGEVDVAVELLHQLKDKGC 399
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
320-377 5.72e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 41.19  E-value: 5.72e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1002246397 320 VLEQIPKYGCTPNSLSYNPLLHAFCKQKKMDKAMAFLDLMVSRGCYPDIVSYNTLLTA 377
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGV 59
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
467-499 1.08e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 39.64  E-value: 1.08e-04
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002246397 467 MGIRPNTVLYNAIILGLCKRRETHSAIDLFAYM 499
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
264-294 1.46e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.99  E-value: 1.46e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002246397 264 VSYNIVLKGLCTAERWEDAEELMGEMGQKGC 294
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
380-516 1.52e-04

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 43.15  E-value: 1.52e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 380 RSGEVDVAVELLHQLKDKGCAPVLISYNTVIDGLTKAGKTKE---------ALELLNEMVSKGLQPDIITYSTIAAgLCR 450
Cdd:pfam17177  23 KHADATGALALYDAAKAEGVRLAQYHYNVLLYLCSKAADATDlkpqlaadrGFEVFEAMKAQGVSPNEATYTAVAR-LAA 101
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1002246397 451 EDRieDAIRAFGKVQDM---GIRPNTVLYNAIILGLCKRRETHSAIDLFAYMIGNGCMPNESTYTILIE 516
Cdd:pfam17177 102 AKG--DGDLAFDLVKEMeaaGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLK 168
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
122-152 2.34e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 38.48  E-value: 2.34e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002246397 122 CGPDVMAYNAMVAGYCGAGQLDAARRLVAEM 152
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
77-286 2.36e-04

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 43.18  E-value: 2.36e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  77 RRGELDEALRLVGSARRPD---AGTCAALIKKLSASGRTAEA----RRVLAACGPDVMAYNAMVAGYCGAGQLDAA---- 145
Cdd:COG2956    54 RRGEYDRAIRIHQKLLERDpdrAEALLELAQDYLKAGLLDRAeellEKLLELDPDDAEALRLLAEIYEQEGDWEKAievl 133
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 146 RRLVAEMPVEPDAYTYntLIRGLCGRGRTANALAVLDEMLRRRcvPDVVTYTILLEATCKRSG-YKQAMKLLDEMRDKgc 224
Cdd:COG2956   134 ERLLKLGPENAHAYCE--LAELYLEQGDYDEAIEALEKALKLD--PDCARALLLLAELYLEQGdYEEAIAALERALEQ-- 207
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1002246397 225 TPD-IVTYNVVVNGICQEGRVDDAIEFLKNLpsYGCEPNTVSYNIVLKGLCTAERWEDAEELM 286
Cdd:COG2956   208 DPDyLPALPRLAELYEKLGDPEEALELLRKA--LELDPSDDLLLALADLLERKEGLEAALALL 268
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
159-189 2.63e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.22  E-value: 2.63e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002246397 159 YTYNTLIRGLCGRGRTANALAVLDEMLRRRC 189
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
335-364 2.66e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.22  E-value: 2.66e-04
                          10        20        30
                  ....*....|....*....|....*....|
gi 1002246397 335 SYNPLLHAFCKQKKMDKAMAFLDLMVSRGC 364
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
154-206 3.01e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 39.26  E-value: 3.01e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 1002246397 154 VEPDAYTYNTLIRGLCGRGRTANALAVLDEMLRRRCVPDVVTYTILLEATCKR 206
Cdd:pfam13812  11 IQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
474-508 4.32e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.82  E-value: 4.32e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002246397 474 VLYNAIILGLCKRRETHSAIDLFAYMIGNGCMPNE 508
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
439-473 4.40e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.82  E-value: 4.40e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002246397 439 ITYSTIAAGLCREDRIEDAIRAFGKVQDMGIRPNT 473
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
506-539 5.40e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 38.11  E-value: 5.40e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002246397 506 PNESTYTILIEGLAYEGLIKEARDLLDELCSRGV 539
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGV 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
459-515 5.55e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 38.49  E-value: 5.55e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1002246397 459 RAFGKVQDMGIRPNTVLYNAIILGLCKRRETHSAIDLFAYMIGNGCMPNESTYTILI 515
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
317-436 6.67e-04

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 41.23  E-value: 6.67e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 317 ALEVLEQIPKYGCTPNSLSYNPLLH---------AFCKQKKMDKAMAFLDLMVSRGCYPDIVSYnTLLTALC-RSGEVDV 386
Cdd:pfam17177  30 ALALYDAAKAEGVRLAQYHYNVLLYlcskaadatDLKPQLAADRGFEVFEAMKAQGVSPNEATY-TAVARLAaAKGDGDL 108
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002246397 387 AVELLHQLKDKGCAPVLISYNTVIDGLTKAGKTKEALELLNEMVSKGLQP 436
Cdd:pfam17177 109 AFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVEL 158
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
194-228 1.10e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 36.66  E-value: 1.10e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002246397 194 VTYTILLEATCKRSGYKQAMKLLDEMRDKGCTPDI 228
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
299-333 1.30e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 36.28  E-value: 1.30e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002246397 299 VTFNMLISFLCRKGLVEPALEVLEQIPKYGCTPNS 333
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PLN03218 PLN03218
maturation of RBCL 1; Provisional
406-527 1.35e-03

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 41.79  E-value: 1.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  406 YNTVIDGLTKAGKTKEALELLNEMVSKGL--------------------------------QPDIITYSTIAAGLCREDR 453
Cdd:PLN03218   373 YIDAYNRLLRDGRIKDCIDLLEDMEKRGLldmdkiyhakffkackkqravkeafrfaklirNPTLSTFNMLMSVCASSQD 452
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1002246397  454 IEDAIRAFGKVQDMGIRPNTVLYNAIILGLCKRRETHSAIDLFAYMIGNGCMPNESTYTILIEGLAYEGLIKEA 527
Cdd:PLN03218   453 IDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKA 526
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
509-539 2.12e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.90  E-value: 2.12e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002246397 509 STYTILIEGLAYEGLIKEARDLLDELCSRGV 539
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
194-224 2.53e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.52  E-value: 2.53e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002246397 194 VTYTILLEATCKRSGYKQAMKLLDEMRDKGC 224
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
299-329 3.56e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.13  E-value: 3.56e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002246397 299 VTFNMLISFLCRKGLVEPALEVLEQIPKYGC 329
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
510-540 4.40e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.12  E-value: 4.40e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002246397 510 TYTILIEGLAYEGLIKEARDLLDELCSRGVV 540
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIE 32
PLN03077 PLN03077
Protein ECB2; Provisional
72-273 4.85e-03

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 39.83  E-value: 4.85e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397  72 LRSLVRRGELDEAL----RLVGSARRPDAGTCAALIKKLSASGRTAEARRVLA------ACGPDVMAYNAMVAGYCGAGQ 141
Cdd:PLN03077  561 LTGYVAHGKGSMAVelfnRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHsmeekySITPNLKHYACVVDLLGRAGK 640
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 142 LDAARRLVAEMPVEPDAYTYNTLIRGlCGRGRTANalavLDEMLRRRCV---PDVVTYTILLEATCKRSG-YKQAMKLLD 217
Cdd:PLN03077  641 LTEAYNFINKMPITPDPAVWGALLNA-CRIHRHVE----LGELAAQHIFeldPNSVGYYILLCNLYADAGkWDEVARVRK 715
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1002246397 218 EMRDKGCTPDIVTYNVVVNGICQEGRVDDaiEFlknlpsygcEPNTVSYNIVLKGL 273
Cdd:PLN03077  716 TMRENGLTVDPGCSWVEVKGKVHAFLTDD--ES---------HPQIKEINTVLEGF 760
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
127-158 5.63e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 34.74  E-value: 5.63e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002246397 127 MAYNAMVAGYCGAGQLDAARRLVAEMP---VEPDA 158
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKergIEPDV 35
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
158-271 8.63e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 37.76  E-value: 8.63e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 158 AYTYNTLIRgLCGRGRTA----------NALAVLDEMLRRRCVPDVVTYTILLEATCKRSGYKQAMKLLDEMRDKGCTPD 227
Cdd:pfam17177  46 QYHYNVLLY-LCSKAADAtdlkpqlaadRGFEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPR 124
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 1002246397 228 IVTYNVVVNGICQEGRVDDAIEFLKNLPSYGCEPNTVSYNIVLK 271
Cdd:pfam17177 125 LRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLK 168
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
352-471 8.78e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 37.76  E-value: 8.78e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002246397 352 AMAFLDLMVSRGCYPDIVSYNTLLTaLCRSGEVDVAV----------ELLHQLKDKGCAPVLISYNTVIDGLTKAGKTKE 421
Cdd:pfam17177  30 ALALYDAAKAEGVRLAQYHYNVLLY-LCSKAADATDLkpqlaadrgfEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDL 108
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002246397 422 ALELLNEMVSKGLQPDIITYSTIAAGLCREDRIEDAIRAFGKVQDMGIRP 471
Cdd:pfam17177 109 AFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVEL 158
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
474-504 9.92e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 33.98  E-value: 9.92e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002246397 474 VLYNAIILGLCKRRETHSAIDLFAYMIGNGC 504
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH