NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1002264281|ref|XP_015636473|]
View 

pentatricopeptide repeat-containing protein At1g06710, mitochondrial isoform X2 [Oryza sativa Japonica Group]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 1001960)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner; similar to Arabidopsis pentatricopeptide repeat proteins that play constitutive, often essential roles in mitochondria and chloroplasts, probably via binding to organellar transcripts

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
438-799 2.84e-41

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 164.66  E-value: 2.84e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  438 FDKAFQLIKEMMRKGFVPDTSTYSKVITFLCHATKVEKAFLLFQEMKMVGVTPDVYTYTILIDSFCKAGLIEQAQWLFEE 517
Cdd:PLN03218   453 IDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGI 532
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  518 MRSVGCSPTVVTYTALIHAYLKAKQVPQANDIFHRM------VDagcrPNDVTYGALVDGLCKAGNISKAFEVYaKLIGT 591
Cdd:PLN03218   533 MRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMkaethpID----PDHITVGALMKACANAGQVDRAKEVY-QMIHE 607
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  592 SDSADSdfyfpcedrhtlaPNVvtYGALVDGLCKAHKVDHAHELLDAMLSSGCEPNHIVYDALIDGFCKAGKIDSAQEVF 671
Cdd:PLN03218   608 YNIKGT-------------PEV--YTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEIL 672
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  672 LQMTKCGYLPSVHTYTSLIDRMFKDGRLDLAMKVLSQMLKDSCTPNVVTYTAMIDGLCRIGESEKALKLLSLMEEKGCSP 751
Cdd:PLN03218   673 QDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCP 752
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1002264281  752 NVVTYTALIDGLGKAGKIDLSLDLFTQMSRKGCSPNYVTYRVLINhLC 799
Cdd:PLN03218   753 NTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCITG-LC 799
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
651-954 6.82e-25

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 111.89  E-value: 6.82e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  651 YDALIDGFCKAGKIDSAQEVFLQMTKCGYLPSVHTYTSLIDRMFKDGRLDLAMKVLSQMLKDSCTPNVVTYTAMIDGLCR 730
Cdd:PLN03218   440 FNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCAR 519
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  731 IGESEKALKLLSLMEEKGCSPNVVTYTALIDGLGKAGKIDLSLDLFTQMS--RKGCSPNYVTYRVLINHLCAAGLLDKAR 808
Cdd:PLN03218   520 AGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKaeTHPIDPDHITVGALMKACANAGQVDRAK 599
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  809 lllgemkQTYwpKYLQGYRcaIQGFSKsfiaslgileemesygtvpiapVYGMLIDCFSKAGRLEIAMELHKEMMEvpSS 888
Cdd:PLN03218   600 -------EVY--QMIHEYN--IKGTPE----------------------VYTIAVNSCSQKGDWDFALSIYDDMKK--KG 644
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1002264281  889 VKTDNDMYASLIQALCLASQVEEAFRLYSEMTRRGFVPELSVFVCLIKGLVEVKKWDEALQLCYGI 954
Cdd:PLN03218   645 VKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDI 710
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
139-353 1.34e-11

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 68.75  E-value: 1.34e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  139 GACYDALADAlGFDGRARDAERLLREIGeedreVLGRL--LNVLVRRCCRGGMWNEALEELGRLKDFGYRPSKVTYNALV 216
Cdd:PLN03218   583 GALMKACANA-GQVDRAKEVYQMIHEYN-----IKGTPevYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALV 656
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  217 QVLSSAGQVDLGFRVQKEMSESGFCMDRFTVGCFAHALCKEGRWADALDMIER-EDFKLDTVLCTH--MISGLMEASYFD 293
Cdd:PLN03218   657 DVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDiKSIKLRPTVSTMnaLITALCEGNQLP 736
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  294 EAMSFLHRMRCNSCIPNVVTYRTLLSGFLKKKQLGWCKRIINMMMTEGCNPNPSLFNSLV 353
Cdd:PLN03218   737 KALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT 796
PPR_2 super family cl38385
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
344-392 7.79e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam13041:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 40.81  E-value: 7.79e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002264281 344 PNPSLFNSLVHSYCNEKDYAYAYKLLNRMTTCGCPPGYVVYNIFIGSIC 392
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
438-799 2.84e-41

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 164.66  E-value: 2.84e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  438 FDKAFQLIKEMMRKGFVPDTSTYSKVITFLCHATKVEKAFLLFQEMKMVGVTPDVYTYTILIDSFCKAGLIEQAQWLFEE 517
Cdd:PLN03218   453 IDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGI 532
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  518 MRSVGCSPTVVTYTALIHAYLKAKQVPQANDIFHRM------VDagcrPNDVTYGALVDGLCKAGNISKAFEVYaKLIGT 591
Cdd:PLN03218   533 MRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMkaethpID----PDHITVGALMKACANAGQVDRAKEVY-QMIHE 607
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  592 SDSADSdfyfpcedrhtlaPNVvtYGALVDGLCKAHKVDHAHELLDAMLSSGCEPNHIVYDALIDGFCKAGKIDSAQEVF 671
Cdd:PLN03218   608 YNIKGT-------------PEV--YTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEIL 672
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  672 LQMTKCGYLPSVHTYTSLIDRMFKDGRLDLAMKVLSQMLKDSCTPNVVTYTAMIDGLCRIGESEKALKLLSLMEEKGCSP 751
Cdd:PLN03218   673 QDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCP 752
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1002264281  752 NVVTYTALIDGLGKAGKIDLSLDLFTQMSRKGCSPNYVTYRVLINhLC 799
Cdd:PLN03218   753 NTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCITG-LC 799
PLN03218 PLN03218
maturation of RBCL 1; Provisional
651-954 6.82e-25

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 111.89  E-value: 6.82e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  651 YDALIDGFCKAGKIDSAQEVFLQMTKCGYLPSVHTYTSLIDRMFKDGRLDLAMKVLSQMLKDSCTPNVVTYTAMIDGLCR 730
Cdd:PLN03218   440 FNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCAR 519
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  731 IGESEKALKLLSLMEEKGCSPNVVTYTALIDGLGKAGKIDLSLDLFTQMS--RKGCSPNYVTYRVLINHLCAAGLLDKAR 808
Cdd:PLN03218   520 AGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKaeTHPIDPDHITVGALMKACANAGQVDRAK 599
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  809 lllgemkQTYwpKYLQGYRcaIQGFSKsfiaslgileemesygtvpiapVYGMLIDCFSKAGRLEIAMELHKEMMEvpSS 888
Cdd:PLN03218   600 -------EVY--QMIHEYN--IKGTPE----------------------VYTIAVNSCSQKGDWDFALSIYDDMKK--KG 644
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1002264281  889 VKTDNDMYASLIQALCLASQVEEAFRLYSEMTRRGFVPELSVFVCLIKGLVEVKKWDEALQLCYGI 954
Cdd:PLN03218   645 VKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDI 710
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
716-765 4.43e-18

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 78.56  E-value: 4.43e-18
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002264281 716 PNVVTYTAMIDGLCRIGESEKALKLLSLMEEKGCSPNVVTYTALIDGLGK 765
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03218 PLN03218
maturation of RBCL 1; Provisional
139-353 1.34e-11

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 68.75  E-value: 1.34e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  139 GACYDALADAlGFDGRARDAERLLREIGeedreVLGRL--LNVLVRRCCRGGMWNEALEELGRLKDFGYRPSKVTYNALV 216
Cdd:PLN03218   583 GALMKACANA-GQVDRAKEVYQMIHEYN-----IKGTPevYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALV 656
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  217 QVLSSAGQVDLGFRVQKEMSESGFCMDRFTVGCFAHALCKEGRWADALDMIER-EDFKLDTVLCTH--MISGLMEASYFD 293
Cdd:PLN03218   657 DVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDiKSIKLRPTVSTMnaLITALCEGNQLP 736
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  294 EAMSFLHRMRCNSCIPNVVTYRTLLSGFLKKKQLGWCKRIINMMMTEGCNPNPSLFNSLV 353
Cdd:PLN03218   737 KALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT 796
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
275-323 9.50e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 54.68  E-value: 9.50e-10
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002264281 275 DTVLCTHMISGLMEASYFDEAMSFLHRMRCNSCIPNVVTYRTLLSGFLK 323
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
719-753 4.29e-08

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 49.76  E-value: 4.29e-08
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002264281 719 VTYTAMIDGLCRIGESEKALKLLSLMEEKGCSPNV 753
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
896-938 8.29e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.89  E-value: 8.29e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 1002264281 896 YASLIQALCLASQVEEAFRLYSEMTRRGFVPELSVFVCLIKGL 938
Cdd:pfam13041   6 YNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
344-392 7.79e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 40.81  E-value: 7.79e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002264281 344 PNPSLFNSLVHSYCNEKDYAYAYKLLNRMTTCGCPPGYVVYNIFIGSIC 392
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
895-926 2.47e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.98  E-value: 2.47e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1002264281 895 MYASLIQALCLASQVEEAFRLYSEMTRRGFVP 926
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEP 33
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
609-922 9.53e-04

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 42.02  E-value: 9.53e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 609 LAPNVVTYGALVDGLC--KAHKVDHAHELLDAMLSSgcEPNHI-VYDALIDGFCKAGKIDSAQEVFLQMTKcgYLP-SVH 684
Cdd:COG2956     2 LLPVAAALGWYFKGLNylLNGQPDKAIDLLEEALEL--DPETVeAHLALGNLYRRRGEYDRAIRIHQKLLE--RDPdRAE 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 685 TYTSLIDRMFKDGRLDLAMKVLSQMLKDSCTpNVVTYTAMIDGLCRIGESEKALKLLSLMEEKGcSPNVVTYTALIDGLG 764
Cdd:COG2956    78 ALLELAQDYLKAGLLDRAEELLEKLLELDPD-DAEALRLLAEIYEQEGDWEKAIEVLERLLKLG-PENAHAYCELAELYL 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 765 KAGKIDLSLDLFTQMSRKgcSPNYVtyrvlinhlcaaglldKARLLLGEMkqtywpkYLQG--YRCAIQGFSKsfiaslg 842
Cdd:COG2956   156 EQGDYDEAIEALEKALKL--DPDCA----------------RALLLLAEL-------YLEQgdYEEAIAALER------- 203
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 843 ILEEMESYgtvpiAPVYGMLIDCFSKAGRLEIAMELHKEMMEVPSSVkTDNDMYASLIQALclaSQVEEAFRLYSEMTRR 922
Cdd:COG2956   204 ALEQDPDY-----LPALPRLAELYEKLGDPEEALELLRKALELDPSD-DLLLALADLLERK---EGLEAALALLERQLRR 274
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
210-243 2.59e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 36.28  E-value: 2.59e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002264281 210 VTYNALVQVLSSAGQVDLGFRVQKEMSESGFCMD 243
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
438-799 2.84e-41

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 164.66  E-value: 2.84e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  438 FDKAFQLIKEMMRKGFVPDTSTYSKVITFLCHATKVEKAFLLFQEMKMVGVTPDVYTYTILIDSFCKAGLIEQAQWLFEE 517
Cdd:PLN03218   453 IDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGI 532
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  518 MRSVGCSPTVVTYTALIHAYLKAKQVPQANDIFHRM------VDagcrPNDVTYGALVDGLCKAGNISKAFEVYaKLIGT 591
Cdd:PLN03218   533 MRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMkaethpID----PDHITVGALMKACANAGQVDRAKEVY-QMIHE 607
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  592 SDSADSdfyfpcedrhtlaPNVvtYGALVDGLCKAHKVDHAHELLDAMLSSGCEPNHIVYDALIDGFCKAGKIDSAQEVF 671
Cdd:PLN03218   608 YNIKGT-------------PEV--YTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEIL 672
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  672 LQMTKCGYLPSVHTYTSLIDRMFKDGRLDLAMKVLSQMLKDSCTPNVVTYTAMIDGLCRIGESEKALKLLSLMEEKGCSP 751
Cdd:PLN03218   673 QDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCP 752
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1002264281  752 NVVTYTALIDGLGKAGKIDLSLDLFTQMSRKGCSPNYVTYRVLINhLC 799
Cdd:PLN03218   753 NTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCITG-LC 799
PLN03218 PLN03218
maturation of RBCL 1; Provisional
436-815 2.83e-40

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 161.58  E-value: 2.83e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  436 GKFDKAFQLIKEMMRKGFVPDTSTYSKVITFLCHATK-VEKAFLLFQEMKmvgvTPDVYTYTILIDSFCKAGLIEQAQWL 514
Cdd:PLN03218   384 GRIKDCIDLLEDMEKRGLLDMDKIYHAKFFKACKKQRaVKEAFRFAKLIR----NPTLSTFNMLMSVCASSQDIDGALRV 459
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  515 FEEMRSVGCSPTVVTYTALIHAYLKAKQVPQANDIFHRMVDAGCRPNDVTYGALVDGLCKAGNISKAFEVYAKLIGTSds 594
Cdd:PLN03218   460 LRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKN-- 537
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  595 adsdfyfpcedrhtLAPNVVTYGALVDGLCKAHKVDHAHELLDAMLSSG--CEPNHIVYDALIDGFCKAGKIDSAQEVFL 672
Cdd:PLN03218   538 --------------VKPDRVVFNALISACGQSGAVDRAFDVLAEMKAEThpIDPDHITVGALMKACANAGQVDRAKEVYQ 603
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  673 QMTKCGYLPSVHTYTSLIDRMFKDGRLDLAMKVLSQMLKDSCTPNVVTYTAMIDGLCRIGESEKALKLLSLMEEKGCSPN 752
Cdd:PLN03218   604 MIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLG 683
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1002264281  753 VVTYTALIDGLGKAGKIDLSLDLFTQMSRKGCSPNYVTYRVLINHLCAAGLLDKARLLLGEMK 815
Cdd:PLN03218   684 TVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMK 746
PLN03218 PLN03218
maturation of RBCL 1; Provisional
152-625 2.18e-30

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 129.61  E-value: 2.18e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  152 DGRARDAERLLREIGEEDREVLGRLLNVLVRRCCRGgmwNEALEELGRLKDFGYRPSKVTYNALVQVLSSAGQVDLGFRV 231
Cdd:PLN03218   383 DGRIKDCIDLLEDMEKRGLLDMDKIYHAKFFKACKK---QRAVKEAFRFAKLIRNPTLSTFNMLMSVCASSQDIDGALRV 459
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  232 QKEMSESGFcmdrftvgcfahalckegrwadaldmieredfKLDTVLCTHMISGLMEASYFDEAMSFLHRMRCNSCIPNV 311
Cdd:PLN03218   460 LRLVQEAGL--------------------------------KADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANV 507
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  312 VTYRTLLSGFLKKKQLGWCKRIINMMMTEGCNPNPSLFNSLVhSYCNEKD-YAYAYKLLNRMTTCGCP--PGYVVYNIFI 388
Cdd:PLN03218   508 HTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALI-SACGQSGaVDRAFDVLAEMKAETHPidPDHITVGALM 586
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  389 GSIC--GQeklpspdlLDLAEKIYgEMLAANCVLNKVNVANFARCLCG-VGKFDKAFQLIKEMMRKGFVPDTSTYSKVIT 465
Cdd:PLN03218   587 KACAnaGQ--------VDRAKEVY-QMIHEYNIKGTPEVYTIAVNSCSqKGDWDFALSIYDDMKKKGVKPDEVFFSALVD 657
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  466 FLCHATKVEKAFLLFQEMKMVGVTPDVYTYTILIDSFCKAGLIEQAQWLFEEMRSVGCSPTVVTYTALIHAYLKAKQVPQ 545
Cdd:PLN03218   658 VAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPK 737
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  546 ANDIFHRMVDAGCRPNDVTYGALVDGLCKAGNISKAFEVYAKLIGTSdsadsdfyfpcedrhtLAPNVVTYGALVdGLCK 625
Cdd:PLN03218   738 ALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDG----------------IKPNLVMCRCIT-GLCL 800
PLN03218 PLN03218
maturation of RBCL 1; Provisional
651-954 6.82e-25

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 111.89  E-value: 6.82e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  651 YDALIDGFCKAGKIDSAQEVFLQMTKCGYLPSVHTYTSLIDRMFKDGRLDLAMKVLSQMLKDSCTPNVVTYTAMIDGLCR 730
Cdd:PLN03218   440 FNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCAR 519
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  731 IGESEKALKLLSLMEEKGCSPNVVTYTALIDGLGKAGKIDLSLDLFTQMS--RKGCSPNYVTYRVLINHLCAAGLLDKAR 808
Cdd:PLN03218   520 AGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKaeTHPIDPDHITVGALMKACANAGQVDRAK 599
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  809 lllgemkQTYwpKYLQGYRcaIQGFSKsfiaslgileemesygtvpiapVYGMLIDCFSKAGRLEIAMELHKEMMEvpSS 888
Cdd:PLN03218   600 -------EVY--QMIHEYN--IKGTPE----------------------VYTIAVNSCSQKGDWDFALSIYDDMKK--KG 644
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1002264281  889 VKTDNDMYASLIQALCLASQVEEAFRLYSEMTRRGFVPELSVFVCLIKGLVEVKKWDEALQLCYGI 954
Cdd:PLN03218   645 VKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDI 710
PLN03077 PLN03077
Protein ECB2; Provisional
178-779 3.80e-22

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 103.01  E-value: 3.80e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 178 NVLVRRCCRGGMWNEALEELGRLKDFGYRPSKVTYNALVQVLSSAGQVDLGFRVQKEMSESgfcMDRFTV---GCFAHAL 254
Cdd:PLN03077   55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFRLCEWKRAVEEGSRVCSRALSS---HPSLGVrlgNAMLSMF 131
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 255 CKEGRWADAL----DMIEREDFKLDTvlcthMISGLMEASYFDEAMSFLHRMRCNSCIPNVVTYRTLLSGFLKKKQLGWC 330
Cdd:PLN03077  132 VRFGELVHAWyvfgKMPERDLFSWNV-----LVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARG 206
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 331 KRIINMMMTEGCNPNPSLFNSLVHSYCNEKDYAYAYKLLNRMTTCGCppgyVVYNIFIGS-----ICGQ---------EK 396
Cdd:PLN03077  207 REVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDC----ISWNAMISGyfengECLEglelfftmrEL 282
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 397 LPSPDLLDLAEKIYGEMLAANCVLNK--------------VNVAN-FARCLCGVGKFDKAFQLIKEMMRKgfvpDTSTYS 461
Cdd:PLN03077  283 SVDPDLMTITSVISACELLGDERLGRemhgyvvktgfavdVSVCNsLIQMYLSLGSWGEAEKVFSRMETK----DAVSWT 358
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 462 KVITFLCHATKVEKAFLLFQEMKMVGVTPDVYTYTILIDSFCKAGLIEQAQWLFEEMRSVGCSPTVVTYTALIHAYLKAK 541
Cdd:PLN03077  359 AMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCK 438
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 542 QVPQANDIFHRMVdagcRPNDVTYGALVDGLCKAGNISKAFEVYAKLIGtsdsadsdfyfpcedrhTLAPNVVTY----- 616
Cdd:PLN03077  439 CIDKALEVFHNIP----EKDVISWTSIIAGLRLNNRCFEALIFFRQMLL-----------------TLKPNSVTLiaals 497
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 617 -----GALVDGlckahKVDHAHELLDAMLSSGCEPNhivydALIDGFCKAGKIDSAQEVFLQMTKcgylpSVHTYTSLID 691
Cdd:PLN03077  498 acariGALMCG-----KEIHAHVLRTGIGFDGFLPN-----ALLDLYVRCGRMNYAWNQFNSHEK-----DVVSWNILLT 562
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 692 RMFKDGRLDLAMKVLSQMLKDSCTPNVVTYTAMIDGLCRIGESEKALKLLSLMEEK-GCSPNVVTYTALIDGLGKAGKID 770
Cdd:PLN03077  563 GYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKySITPNLKHYACVVDLLGRAGKLT 642

                  ....*....
gi 1002264281 771 LSLDLFTQM 779
Cdd:PLN03077  643 EAYNFINKM 651
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
716-765 4.43e-18

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 78.56  E-value: 4.43e-18
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002264281 716 PNVVTYTAMIDGLCRIGESEKALKLLSLMEEKGCSPNVVTYTALIDGLGK 765
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
464-902 7.01e-18

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 88.77  E-value: 7.01e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 464 ITFLCHATKVEKAFLLFQEMKMVG-VTPDVYTYTILIDSFCKAGLIEQAQWLFEEMRSVGCSPTVVTYTALIHAYLKAKQ 542
Cdd:PLN03081   94 IEKLVACGRHREALELFEILEAGCpFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGM 173
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 543 VPQANDIFHRMVDAgcrpNDVTYGALVDGLCKAGNISKAFEVYAKLIGTSDSADSdfyfpcedrHTLAPNVVTYGALvdG 622
Cdd:PLN03081  174 LIDARRLFDEMPER----NLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEP---------RTFVVMLRASAGL--G 238
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 623 LCKAHKVDHAhelldAMLSSGCEPNHIVYDALIDGFCKAGKIDSAQEVFLQMTK----------CGYlpSVHTYT----- 687
Cdd:PLN03081  239 SARAGQQLHC-----CVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEkttvawnsmlAGY--ALHGYSeealc 311
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 688 -----------------SLIDRMF-KDGRLDLAMKVLSQMLKDSCTPNVVTYTAMIDGLCRIGESEKALKLLSLMEEKgc 749
Cdd:PLN03081  312 lyyemrdsgvsidqftfSIMIRIFsRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRK-- 389
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 750 spNVVTYTALIDGLGKAGKIDLSLDLFTQMSRKGCSPNYVTYRVLINHLCAAGLLDkarlllgemkqtywpkylQGYRca 829
Cdd:PLN03081  390 --NLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSE------------------QGWE-- 447
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1002264281 830 iqgfsksfiaslgILEEM-ESYGTVPIAPVYGMLIDCFSKAGRLEIAMELHKEmmevpSSVKTDNDMYASLIQA 902
Cdd:PLN03081  448 -------------IFQSMsENHRIKPRAMHYACMIELLGREGLLDEAYAMIRR-----APFKPTVNMWAALLTA 503
PLN03077 PLN03077
Protein ECB2; Provisional
430-947 1.51e-17

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 87.98  E-value: 1.51e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 430 RCLCGVGKFDKAFQLIKEMMRKGFVPDTSTYSKVITfLCHATK-VEKAFLLFQEMKMVGVTPDVYTYTILIDSFCKAGLI 508
Cdd:PLN03077   59 RALCSHGQLEQALKLLESMQELRVPVDEDAYVALFR-LCEWKRaVEEGSRVCSRALSSHPSLGVRLGNAMLSMFVRFGEL 137
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 509 EQAQWLFEEMRSvgcsPTVVTYTALIHAYLKAKQVPQANDIFHRMVDAGCRPNDVTYGALVDGLCKAGNISKAFEVYAKL 588
Cdd:PLN03077  138 VHAWYVFGKMPE----RDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHV 213
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 589 IGTSDSADSD-------FYFPCEDRHTL--------APNVVTYGALVDGLCKAHKVDHAHELLDAMLSSGCEPNHIVYDA 653
Cdd:PLN03077  214 VRFGFELDVDvvnalitMYVKCGDVVSArlvfdrmpRRDCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITS 293
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 654 LIDGFCKAGKIDSAQEVFLQMTKCGYLPSVHTYTSLIDRMFKDGRLDLAMKVLSQMLkdscTPNVVTYTAMIDGLCRIGE 733
Cdd:PLN03077  294 VISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRME----TKDAVSWTAMISGYEKNGL 369
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 734 SEKALKLLSLMEEKGCSPN-----------------------------------VVTYTALIDGLGKAGKIDLSLDLFTQ 778
Cdd:PLN03077  370 PDKALETYALMEQDNVSPDeitiasvlsacaclgdldvgvklhelaerkglisyVVVANALIEMYSKCKCIDKALEVFHN 449
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 779 MSRKgcspNYVTYRVLINHLCAAGLLDKARLLLGEMKQTYWPKylqgyrcaiqgfSKSFIASL------GIL---EEMES 849
Cdd:PLN03077  450 IPEK----DVISWTSIIAGLRLNNRCFEALIFFRQMLLTLKPN------------SVTLIAALsacariGALmcgKEIHA 513
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 850 Y---------GTVPIAPV-------------------------YGMLIDCFSKAGRLEIAMELHKEMMEvpSSVKTDNDM 895
Cdd:PLN03077  514 HvlrtgigfdGFLPNALLdlyvrcgrmnyawnqfnshekdvvsWNILLTGYVAHGKGSMAVELFNRMVE--SGVNPDEVT 591
                         570       580       590       600       610
                  ....*....|....*....|....*....|....*....|....*....|...
gi 1002264281 896 YASLIQALCLASQVEEAFRLYSEMTRR-GFVPELSVFVCLIKGLVEVKKWDEA 947
Cdd:PLN03077  592 FISLLCACSRSGMVTQGLEYFHSMEEKySITPNLKHYACVVDLLGRAGKLTEA 644
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
681-730 8.00e-16

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 72.01  E-value: 8.00e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002264281 681 PSVHTYTSLIDRMFKDGRLDLAMKVLSQMLKDSCTPNVVTYTAMIDGLCR 730
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
751-799 4.41e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 70.08  E-value: 4.41e-15
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002264281 751 PNVVTYTALIDGLGKAGKIDLSLDLFTQMSRKGCSPNYVTYRVLINHLC 799
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
525-574 4.91e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 69.70  E-value: 4.91e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002264281 525 PTVVTYTALIHAYLKAKQVPQANDIFHRMVDAGCRPNDVTYGALVDGLCK 574
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
455-504 1.15e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 68.93  E-value: 1.15e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002264281 455 PDTSTYSKVITFLCHATKVEKAFLLFQEMKMVGVTPDVYTYTILIDSFCK 504
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
490-539 4.54e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 67.00  E-value: 4.54e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002264281 490 PDVYTYTILIDSFCKAGLIEQAQWLFEEMRSVGCSPTVVTYTALIHAYLK 539
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
611-660 3.67e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 64.69  E-value: 3.67e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002264281 611 PNVVTYGALVDGLCKAHKVDHAHELLDAMLSSGCEPNHIVYDALIDGFCK 660
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
417-556 3.42e-12

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 66.65  E-value: 3.42e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 417 NCVLNKVNVANFARCLCGVGKFDKAFQLIKEMMRKGFVPDTSTYSKVITFLCHATKVEKAFLLFQEMKMVGVTPDVYTYT 496
Cdd:pfam17177  50 NVLLYLCSKAADATDLKPQLAADRGFEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYS 129
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 497 ILIDSFCKAGLIEQAQWLFEEMRSVGCSPTVVTYTALIHAYLKAKQVPQANDIFHRMVDA 556
Cdd:pfam17177 130 PALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRADKVYAYLHRLRDA 189
PLN03218 PLN03218
maturation of RBCL 1; Provisional
139-353 1.34e-11

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 68.75  E-value: 1.34e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  139 GACYDALADAlGFDGRARDAERLLREIGeedreVLGRL--LNVLVRRCCRGGMWNEALEELGRLKDFGYRPSKVTYNALV 216
Cdd:PLN03218   583 GALMKACANA-GQVDRAKEVYQMIHEYN-----IKGTPevYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALV 656
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  217 QVLSSAGQVDLGFRVQKEMSESGFCMDRFTVGCFAHALCKEGRWADALDMIER-EDFKLDTVLCTH--MISGLMEASYFD 293
Cdd:PLN03218   657 DVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDiKSIKLRPTVSTMnaLITALCEGNQLP 736
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  294 EAMSFLHRMRCNSCIPNVVTYRTLLSGFLKKKQLGWCKRIINMMMTEGCNPNPSLFNSLV 353
Cdd:PLN03218   737 KALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT 796
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
188-670 4.04e-11

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 66.82  E-value: 4.04e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 188 GMWNEALE--ELGRLKdFGYRPSKVTYNALVqvLSSAGQVDLGF--RVQKEMSESGFcmdrftvgcfahalckegrwada 263
Cdd:PLN03081  101 GRHREALElfEILEAG-CPFTLPASTYDALV--EACIALKSIRCvkAVYWHVESSGF----------------------- 154
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 264 ldmiEREDFKLDTVLCTHMISGLM-EA-SYFDEamsflhrmrcnscIP--NVVTYRTLLSGFLKKKQLGWCKRIINMMMT 339
Cdd:PLN03081  155 ----EPDQYMMNRVLLMHVKCGMLiDArRLFDE-------------MPerNLASWGTIIGGLVDAGNYREAFALFREMWE 217
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 340 EGCNPNPSLFNSLVHSYCNEKDyAYAYKLLNrmtTCGCPPGyVVYNIFIGsiCGqeklpspdLLDLAEKIyGEMLAANCV 419
Cdd:PLN03081  218 DGSDAEPRTFVVMLRASAGLGS-ARAGQQLH---CCVLKTG-VVGDTFVS--CA--------LIDMYSKC-GDIEDARCV 281
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 420 LNKV---NVANFARCLCGV---GKFDKAFQLIKEMMRKGFVPDTSTYSKVITFLCHATKVEKAFLLFQEMKMVGVTPDVY 493
Cdd:PLN03081  282 FDGMpekTTVAWNSMLAGYalhGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIV 361
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 494 TYTILIDSFCKAGLIEQAQWLFEEMRsvgcSPTVVTYTALIHAYLKAKQVPQANDIFHRMVDAGCRPNDVTYGALVDGLC 573
Cdd:PLN03081  362 ANTALVDLYSKWGRMEDARNVFDRMP----RKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACR 437
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 574 KAGNISKAFEV----------------YA---KLIGTSDSADSDFYFPceDRHTLAPNVVTYGALVDGlCKAHKVDH--- 631
Cdd:PLN03081  438 YSGLSEQGWEIfqsmsenhrikpramhYAcmiELLGREGLLDEAYAMI--RRAPFKPTVNMWAALLTA-CRIHKNLElgr 514
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|.
gi 1002264281 632 -AHELLDAMlssgcEPNHI-VYDALIDGFCKAGKIDSAQEV 670
Cdd:PLN03081  515 lAAEKLYGM-----GPEKLnNYVVLLNLYNSSGRQAEAAKV 550
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
478-537 4.50e-11

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 58.91  E-value: 4.50e-11
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 478 LLFQEMKMVGVTPDVYTYTILIDSFCKAGLIEQAQWLFEEMRSVGCSPTVVTYTALIHAY 537
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
646-695 6.77e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 58.14  E-value: 6.77e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002264281 646 PNHIVYDALIDGFCKAGKIDSAQEVFLQMTKCGYLPSVHTYTSLIDRMFK 695
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
487-519 1.27e-10

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 56.97  E-value: 1.27e-10
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002264281 487 GVTPDVYTYTILIDSFCKAGLIEQAQWLFEEMR 519
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
275-323 9.50e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 54.68  E-value: 9.50e-10
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002264281 275 DTVLCTHMISGLMEASYFDEAMSFLHRMRCNSCIPNVVTYRTLLSGFLK 323
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
359-523 1.76e-09

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 58.56  E-value: 1.76e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 359 EKDYAYAYKLLNRMTTCGCPPGYVVYNIFIgSICGQEKLPSPDLLDLAEK----IYGEMLAANCVLNKVNVANFARCLCG 434
Cdd:pfam17177  24 HADATGALALYDAAKAEGVRLAQYHYNVLL-YLCSKAADATDLKPQLAADrgfeVFEAMKAQGVSPNEATYTAVARLAAA 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 435 VGKFDKAFQLIKEMMRKGFVPDTSTYSKVITFLCHATKVEKAFLLFQEMKMVGVTPDVYTYTILIDSFCKAGLIEQAQWL 514
Cdd:pfam17177 103 KGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRADKVYAY 182
                         170
                  ....*....|
gi 1002264281 515 FEEMR-SVGC 523
Cdd:pfam17177 183 LHRLRdAVRQ 192
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
704-765 1.91e-09

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 54.29  E-value: 1.91e-09
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1002264281 704 KVLSQMLKDSCTPNVVTYTAMIDGLCRIGESEKALKLLSLMEEKGCSPNVVTYTALIDGLGK 765
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGG 62
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
643-675 4.09e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 52.73  E-value: 4.09e-09
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002264281 643 GCEPNHIVYDALIDGFCKAGKIDSAQEVFLQMT 675
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
739-796 4.38e-09

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 53.52  E-value: 4.38e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1002264281 739 KLLSLMEEKGCSPNVVTYTALIDGLGKAGKIDLSLDLFTQMSRKGCSPNYVTYRVLIN 796
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILG 58
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
514-573 6.11e-09

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 53.13  E-value: 6.11e-09
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 514 LFEEMRSVGCSPTVVTYTALIHAYLKAKQVPQANDIFHRMVDAGCRPNDVTYGALVdGLC 573
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL-GVI 60
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
747-780 1.18e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 51.19  E-value: 1.18e-08
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002264281 747 KGCSPNVVTYTALIDGLGKAGKIDLSLDLFTQMS 780
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
714-745 1.30e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 51.19  E-value: 1.30e-08
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1002264281 714 CTPNVVTYTAMIDGLCRIGESEKALKLLSLME 745
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
632-770 2.57e-08

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 55.48  E-value: 2.57e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 632 AHELLDAMLSSGCEPNHIVYDALIDGFCKAG---------KIDSAQEVFLQMTKCGYLPSVHTYTSLIDRMFKDGRLDLA 702
Cdd:pfam17177  30 ALALYDAAKAEGVRLAQYHYNVLLYLCSKAAdatdlkpqlAADRGFEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLA 109
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1002264281 703 MKVLSQMLKDSCTPNVVTYTAMIDGLCRIGESEKALKLLSLMEEKGCSPNVVTYTALIDGLGKAGKID 770
Cdd:pfam17177 110 FDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRAD 177
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
719-753 4.29e-08

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 49.76  E-value: 4.29e-08
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002264281 719 VTYTAMIDGLCRIGESEKALKLLSLMEEKGCSPNV 753
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
476-588 9.26e-08

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 53.55  E-value: 9.26e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 476 AFLLFQEMKMVGVTPDVYTYTILIDSFCKAGL---------IEQAQWLFEEMRSVGCSPTVVTYTALIHAYLKAKQVPQA 546
Cdd:pfam17177  30 ALALYDAAKAEGVRLAQYHYNVLLYLCSKAADatdlkpqlaADRGFEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLA 109
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 1002264281 547 NDIFHRMVDAGCRPNDVTYGALVDGLCKAGNISKAFEVYAKL 588
Cdd:pfam17177 110 FDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHM 151
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
560-625 1.56e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 48.51  E-value: 1.56e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1002264281 560 PNDVTYGALVDGLCKAGNISKAFEVYAKLIgtsdsadsdfyfpcedRHTLAPNVVTYGALVDGLCK 625
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMK----------------KRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
557-584 1.69e-07

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 48.11  E-value: 1.69e-07
                          10        20
                  ....*....|....*....|....*...
gi 1002264281 557 GCRPNDVTYGALVDGLCKAGNISKAFEV 584
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFEL 29
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
609-639 2.98e-07

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 47.34  E-value: 2.98e-07
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002264281 609 LAPNVVTYGALVDGLCKAHKVDHAHELLDAM 639
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
441-589 5.35e-07

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 51.24  E-value: 5.35e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 441 AFQLIKEMMRKGFVPDTSTYSkVITFLCHA----------TKVEKAFLLFQEMKMVGVTPDVYTYTILIDSFCKAGLIEQ 510
Cdd:pfam17177  30 ALALYDAAKAEGVRLAQYHYN-VLLYLCSKaadatdlkpqLAADRGFEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDL 108
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1002264281 511 AQWLFEEMRSVGCSPTVVTYTALIHAYLKAKQVPQANDIFHRMVDAGCRPNDVTYGALVDGLCKAGNISKAFEVYAKLI 589
Cdd:pfam17177 109 AFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRADKVYAYLHRLR 187
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
670-729 5.47e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 47.35  E-value: 5.47e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 670 VFLQMTKCGYLPSVHTYTSLIDRMFKDGRLDLAMKVLSQMLKDSCTPNVVTYTAMIdGLC 729
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL-GVI 60
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
719-749 5.63e-07

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 46.69  E-value: 5.63e-07
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002264281 719 VTYTAMIDGLCRIGESEKALKLLSLMEEKGC 749
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
430-469 6.73e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 46.97  E-value: 6.73e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|
gi 1002264281 430 RCLCGVGKFDKAFQLIKEMMRKGFVPDTSTYSKVITFLCH 469
Cdd:pfam13041  11 NGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
782-815 1.40e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 45.41  E-value: 1.40e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002264281 782 KGCSPNYVTYRVLINHLCAAGLLDKARLLLGEMK 815
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
754-787 1.44e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 45.52  E-value: 1.44e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002264281 754 VTYTALIDGLGKAGKIDLSLDLFTQMSRKGCSPN 787
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
614-647 1.72e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 45.14  E-value: 1.72e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002264281 614 VTYGALVDGLCKAHKVDHAHELLDAMLSSGCEPN 647
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
493-523 2.52e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 44.76  E-value: 2.52e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002264281 493 YTYTILIDSFCKAGLIEQAQWLFEEMRSVGC 523
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
678-709 6.52e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 43.49  E-value: 6.52e-06
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1002264281 678 GYLPSVHTYTSLIDRMFKDGRLDLAMKVLSQM 709
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
896-938 8.29e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.89  E-value: 8.29e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 1002264281 896 YASLIQALCLASQVEEAFRLYSEMTRRGFVPELSVFVCLIKGL 938
Cdd:pfam13041   6 YNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
309-358 9.61e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.51  E-value: 9.61e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002264281 309 PNVVTYRTLLSGFLKKKQLGWCKRIINMMMTEGCNPNPSLFNSLVHSYCN 358
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03218 PLN03218
maturation of RBCL 1; Provisional
144-284 1.00e-05

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 49.49  E-value: 1.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281  144 ALADALGFDGRARDAERLLREIGEEDREVLGRLLNVLVRRCCRGGMWNEALEELGRLKDFGYRPSKVTYNALVQVLSSAG 223
Cdd:PLN03218   654 ALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGN 733
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1002264281  224 QVDLGFRVQKEMSESGFCMDRFTVGCFAHALCKEGRWADALDMIE--RED-FKLDTVLCTHMIS 284
Cdd:PLN03218   734 QLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSqaKEDgIKPNLVMCRCITG 797
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
493-527 1.14e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 42.83  E-value: 1.14e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002264281 493 YTYTILIDSFCKAGLIEQAQWLFEEMRSVGCSPTV 527
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
528-562 1.71e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 42.44  E-value: 1.71e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002264281 528 VTYTALIHAYLKAKQVPQANDIFHRMVDAGCRPND 562
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
264-321 3.21e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 42.35  E-value: 3.21e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1002264281 264 LDMIEREDFKLDTVLCTHMISGLMEASYFDEAMSFLHRMRCNSCIPNVVTYRTLLSGF 321
Cdd:pfam13812   3 LREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
754-784 3.46e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 41.30  E-value: 3.46e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002264281 754 VTYTALIDGLGKAGKIDLSLDLFTQMSRKGC 784
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
612-749 4.14e-05

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 45.85  E-value: 4.14e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 612 NVVTY----GALVDGLCKAHKVDHAHELLDAMLSSGCEPNHIVYDALIDGFCKAGKIDSAQEVFLQMTKCGYLPSVHTYT 687
Cdd:pfam17177  50 NVLLYlcskAADATDLKPQLAADRGFEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYS 129
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1002264281 688 SLIDRMFKDGRLDLAMKVLSQMLKDSCTPNVVTYTAMIDGLCRIGESEKALKLL-SLMEEKGC 749
Cdd:pfam17177 130 PALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRADKVYAYLhRLRDAVRQ 192
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
744-807 4.30e-05

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 45.85  E-value: 4.30e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1002264281 744 MEEKGCSPNVVTYTALIDGLGKAGKIDLSLDLFTQMSRKGCSPNYVTYRVLINHLCAAGLLDKA 807
Cdd:pfam17177  81 MKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKA 144
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
444-502 5.51e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 41.96  E-value: 5.51e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1002264281 444 LIKEMMRKGFVPDTSTYSKVITFLCHATKVEKAFLLFQEMKMVGVTPDVYTYTILIDSF 502
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
660-846 6.16e-05

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 45.08  E-value: 6.16e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 660 KAGKIDSAQEVFLQMTKCGYLPSVHTYTSLI---------DRMFKDGRLDLAMKVLSQMLKDSCTPNVVTYTAMIDGLCR 730
Cdd:pfam17177  23 KHADATGALALYDAAKAEGVRLAQYHYNVLLylcskaadaTDLKPQLAADRGFEVFEAMKAQGVSPNEATYTAVARLAAA 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 731 IGESEKALKLLSLMEEKGCSPNVVTYTALIDGLGKAGKIDLSLDLFTQMSRKGCSPNYVTYRVLINHLCAAGLLDKarll 810
Cdd:pfam17177 103 KGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRADK---- 178
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 1002264281 811 lgemkqTYwpKYLQGYRCAIQGFSKSfiaSLGILEE 846
Cdd:pfam17177 179 ------VY--AYLHRLRDAVRQVSES---TAGVLEE 203
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
684-718 7.61e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.52  E-value: 7.61e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002264281 684 HTYTSLIDRMFKDGRLDLAMKVLSQMLKDSCTPNV 718
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
344-392 7.79e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 40.81  E-value: 7.79e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002264281 344 PNPSLFNSLVHSYCNEKDYAYAYKLLNRMTTCGCPPGYVVYNIFIGSIC 392
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
635-690 1.17e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 40.80  E-value: 1.17e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1002264281 635 LLDAMLSSGCEPNHIVYDALIDGFCKAGKIDSAQEVFLQMTKCGYLPSVHTYTSLI 690
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
430-457 1.84e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 39.36  E-value: 1.84e-04
                          10        20
                  ....*....|....*....|....*...
gi 1002264281 430 RCLCGVGKFDKAFQLIKEMMRKGFVPDT 457
Cdd:TIGR00756   8 DGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
207-256 2.28e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 39.65  E-value: 2.28e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002264281 207 PSKVTYNALVQVLSSAGQVDLGFRVQKEMSESGFCMDRFTVGCFAHALCK 256
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
895-926 2.47e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.98  E-value: 2.47e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1002264281 895 MYASLIQALCLASQVEEAFRLYSEMTRRGFVP 926
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEP 33
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
522-554 4.86e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 38.10  E-value: 4.86e-04
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002264281 522 GCSPTVVTYTALIHAYLKAKQVPQANDIFHRMV 554
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
649-678 4.95e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.22  E-value: 4.95e-04
                          10        20        30
                  ....*....|....*....|....*....|
gi 1002264281 649 IVYDALIDGFCKAGKIDSAQEVFLQMTKCG 678
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
528-558 5.51e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.22  E-value: 5.51e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002264281 528 VTYTALIHAYLKAKQVPQANDIFHRMVDAGC 558
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
773-946 5.54e-04

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 42.38  E-value: 5.54e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 773 LDLFTQMSRKGCSPNYVTYRVLInHLCAAGLLDKArlllgemkqtywpkyLQGYRCAIQGFSksfiaslgILEEMESYGT 852
Cdd:pfam17177  31 LALYDAAKAEGVRLAQYHYNVLL-YLCSKAADATD---------------LKPQLAADRGFE--------VFEAMKAQGV 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 853 VPIAPVYGMLIDCFSKAGRLEIAMELHKEMME--VPSSVKTdndmYASLIQALCLASQVEEAFRLYSEMTRRGFVPELSV 930
Cdd:pfam17177  87 SPNEATYTAVARLAAAKGDGDLAFDLVKEMEAagVSPRLRS----YSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPE 162
                         170
                  ....*....|....*.
gi 1002264281 931 FVCLIKglVEVKKWDE 946
Cdd:pfam17177 163 LAALLK--VSAKAGRA 176
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
614-644 6.84e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.83  E-value: 6.84e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002264281 614 VTYGALVDGLCKAHKVDHAHELLDAMLSSGC 644
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
451-484 9.11e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 37.33  E-value: 9.11e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002264281 451 KGFVPDTSTYSKVITFLCHATKVEKAFLLFQEMK 484
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
609-922 9.53e-04

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 42.02  E-value: 9.53e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 609 LAPNVVTYGALVDGLC--KAHKVDHAHELLDAMLSSgcEPNHI-VYDALIDGFCKAGKIDSAQEVFLQMTKcgYLP-SVH 684
Cdd:COG2956     2 LLPVAAALGWYFKGLNylLNGQPDKAIDLLEEALEL--DPETVeAHLALGNLYRRRGEYDRAIRIHQKLLE--RDPdRAE 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 685 TYTSLIDRMFKDGRLDLAMKVLSQMLKDSCTpNVVTYTAMIDGLCRIGESEKALKLLSLMEEKGcSPNVVTYTALIDGLG 764
Cdd:COG2956    78 ALLELAQDYLKAGLLDRAEELLEKLLELDPD-DAEALRLLAEIYEQEGDWEKAIEVLERLLKLG-PENAHAYCELAELYL 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 765 KAGKIDLSLDLFTQMSRKgcSPNYVtyrvlinhlcaaglldKARLLLGEMkqtywpkYLQG--YRCAIQGFSKsfiaslg 842
Cdd:COG2956   156 EQGDYDEAIEALEKALKL--DPDCA----------------RALLLLAEL-------YLEQgdYEEAIAALER------- 203
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 843 ILEEMESYgtvpiAPVYGMLIDCFSKAGRLEIAMELHKEMMEVPSSVkTDNDMYASLIQALclaSQVEEAFRLYSEMTRR 922
Cdd:COG2956   204 ALEQDPDY-----LPALPRLAELYEKLGDPEEALELLRKALELDPSD-DLLLALADLLERK---EGLEAALALLERQLRR 274
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
649-683 1.04e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.43  E-value: 1.04e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002264281 649 IVYDALIDGFCKAGKIDSAQEVFLQMTKCGYLPSV 683
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
859-904 1.31e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 37.34  E-value: 1.31e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 1002264281 859 YGMLIDCFSKAGRLEIAMELHKEMMEvpSSVKTDNDMYASLIQALC 904
Cdd:pfam13041   6 YNTLINGYCKKGKVEEAFKLFNEMKK--RGVKPNVYTYTILINGLC 49
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
340-373 1.45e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 36.94  E-value: 1.45e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002264281 340 EGCNPNPSLFNSLVHSYCNEKDYAYAYKLLNRMT 373
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
332-393 2.06e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 37.34  E-value: 2.06e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1002264281 332 RIINMMMTEGCNPNPSLFNSLVHSYCNEKDYAYAYKLLNRMTTCGCPPGYVVYNIFIGSICG 393
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGG 62
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
895-924 2.36e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 36.29  E-value: 2.36e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 1002264281 895 MYASLIQALCLASQVEEAFRLYSEMTRRGF 924
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
210-243 2.59e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 36.28  E-value: 2.59e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002264281 210 VTYNALVQVLSSAGQVDLGFRVQKEMSESGFCMD 243
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
459-492 2.65e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 36.28  E-value: 2.65e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002264281 459 TYSKVITFLCHATKVEKAFLLFQEMKMVGVTPDV 492
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
786-816 3.06e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 36.57  E-value: 3.06e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002264281 786 PNYVTYRVLINHLCAAGLLDKARLLLGEMKQ 816
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKK 31
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
539-709 3.52e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 40.07  E-value: 3.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 539 KAKQVPQANDIFHRMVDAGCRPNDVTYGALVDgLCKAGN----------ISKAFEVYAKLigtsdSADSdfyfpcedrht 608
Cdd:pfam17177  23 KHADATGALALYDAAKAEGVRLAQYHYNVLLY-LCSKAAdatdlkpqlaADRGFEVFEAM-----KAQG----------- 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002264281 609 LAPNVVTYGALVDGLCKAHKVDHAHELLDAMLSSGCEPNHIVYDALIDGFCKAGKIDSAQEVFLQMTKCGYLPSVHTYTS 688
Cdd:pfam17177  86 VSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAA 165
                         170       180
                  ....*....|....*....|.
gi 1002264281 689 LIDRMFKDGRldlAMKVLSQM 709
Cdd:pfam17177 166 LLKVSAKAGR---ADKVYAYL 183
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
858-884 4.34e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.52  E-value: 4.34e-03
                          10        20
                  ....*....|....*....|....*..
gi 1002264281 858 VYGMLIDCFSKAGRLEIAMELHKEMME 884
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKE 28
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
277-311 5.59e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.51  E-value: 5.59e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002264281 277 VLCTHMISGLMEASYFDEAMSFLHRMRCNSCIPNV 311
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
563-589 5.65e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.51  E-value: 5.65e-03
                          10        20
                  ....*....|....*....|....*..
gi 1002264281 563 VTYGALVDGLCKAGNISKAFEVYAKLI 589
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMK 27
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
605-655 8.24e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 35.80  E-value: 8.24e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1002264281 605 DRHTLAPNVVTYGALVDGLCKAHKVDHAHELLDAMLSSGCEPNHIVYDALI 655
Cdd:pfam13812   7 VRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
858-884 8.97e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 34.74  E-value: 8.97e-03
                          10        20
                  ....*....|....*....|....*..
gi 1002264281 858 VYGMLIDCFSKAGRLEIAMELHKEMME 884
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKE 28
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH