NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1002314886|ref|XP_015620812|]
View 

putative pentatricopeptide repeat-containing protein At3g16890, mitochondrial [Oryza sativa Japonica Group]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 13329975)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner

CATH:  1.25.40.10
Gene Ontology:  GO:0003723
SCOP:  4001344

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
181-592 2.57e-28

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 121.91  E-value: 2.57e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  181 RLGLRPSTAVYNALIAASVRAGAVDTAYLRFQQMPADGCRPDHFTYNSLVHGVCRRGIVDEAVRLVRQMEGEGIRPNVFT 260
Cdd:PLN03218   430 KLIRNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHT 509
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  261 YTMLVDGFCNAGRVEEAFRMLDKMKEKGVAPSEATYRTLVHGVFRCLERDKAYRMLSDwLGHETslHPsachtmlyclsk 340
Cdd:PLN03218   510 FGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAE-MKAET--HP------------ 574
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  341 kdmakeavefekrmsargYLLDSTAFGIVMSCALKCLEVSDLCELLDSFIKNG--GNPgfDVYIMVIKSLLNCKNFSKAN 418
Cdd:PLN03218   575 ------------------IDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNikGTP--EVYTIAVNSCSQKGDWDFAL 634
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  419 HYLGHMVLKGLLSSVMSYNMVIDCFVKAGAVDKAEEIVKEMQDKGFLPNLVTFNTLISGYSKLGNVHNAKVVLKMLMEHG 498
Cdd:PLN03218   635 SIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIK 714
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  499 FMPDIITFTSLIDGLCNTHQLDDAFVCFEEMAEWGVRPNAQTYNVLMHTLCSAGHVNKAIDLLNKMKIDGVTPDAYSFNA 578
Cdd:PLN03218   715 LRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRC 794
                          410
                   ....*....|....
gi 1002314886  579 LIlSFCrMRKVDKA 592
Cdd:PLN03218   795 IT-GLC-LRRFEKA 806
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
571-620 1.17e-16

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 73.94  E-value: 1.17e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002314886 571 PDAYSFNALILSFCRMRKVDKAEDIFNDMVRFGVVPDSYTYNSLIKALCD 620
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 super family cl38385
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
606-656 8.36e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam13041:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 40.42  E-value: 8.36e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1002314886 606 PDSYTYNSLIKALCDERRVNKAKEILFARERSGCStSNNQSYWPIVAALAK 656
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVK-PNVYTYTILINGLCK 50
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
181-592 2.57e-28

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 121.91  E-value: 2.57e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  181 RLGLRPSTAVYNALIAASVRAGAVDTAYLRFQQMPADGCRPDHFTYNSLVHGVCRRGIVDEAVRLVRQMEGEGIRPNVFT 260
Cdd:PLN03218   430 KLIRNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHT 509
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  261 YTMLVDGFCNAGRVEEAFRMLDKMKEKGVAPSEATYRTLVHGVFRCLERDKAYRMLSDwLGHETslHPsachtmlyclsk 340
Cdd:PLN03218   510 FGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAE-MKAET--HP------------ 574
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  341 kdmakeavefekrmsargYLLDSTAFGIVMSCALKCLEVSDLCELLDSFIKNG--GNPgfDVYIMVIKSLLNCKNFSKAN 418
Cdd:PLN03218   575 ------------------IDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNikGTP--EVYTIAVNSCSQKGDWDFAL 634
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  419 HYLGHMVLKGLLSSVMSYNMVIDCFVKAGAVDKAEEIVKEMQDKGFLPNLVTFNTLISGYSKLGNVHNAKVVLKMLMEHG 498
Cdd:PLN03218   635 SIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIK 714
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  499 FMPDIITFTSLIDGLCNTHQLDDAFVCFEEMAEWGVRPNAQTYNVLMHTLCSAGHVNKAIDLLNKMKIDGVTPDAYSFNA 578
Cdd:PLN03218   715 LRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRC 794
                          410
                   ....*....|....
gi 1002314886  579 LIlSFCrMRKVDKA 592
Cdd:PLN03218   795 IT-GLC-LRRFEKA 806
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
571-620 1.17e-16

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 73.94  E-value: 1.17e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002314886 571 PDAYSFNALILSFCRMRKVDKAEDIFNDMVRFGVVPDSYTYNSLIKALCD 620
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
501-549 1.23e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 71.24  E-value: 1.23e-15
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002314886 501 PDIITFTSLIDGLCNTHQLDDAFVCFEEMAEWGVRPNAQTYNVLMHTLC 549
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
259-293 2.62e-08

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 50.15  E-value: 2.62e-08
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002314886 259 FTYTMLVDGFCNAGRVEEAFRMLDKMKEKGVAPSE 293
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
606-656 8.36e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 40.42  E-value: 8.36e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1002314886 606 PDSYTYNSLIKALCDERRVNKAKEILFARERSGCStSNNQSYWPIVAALAK 656
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVK-PNVYTYTILINGLCK 50
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
518-676 7.35e-03

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 38.83  E-value: 7.35e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 518 QLDDAFVCFEEMAEwgVRPN-AQTYNVLMHTLCSAGHVNKAIDLLNK-MKIDGVTPDAYsfNALILSFCRMRKVDKAEDI 595
Cdd:COG0457    23 RYEEAIEDYEKALE--LDPDdAEALYNLGLAYLRLGRYEEALADYEQaLELDPDDAEAL--NNLGLALQALGRYEEALED 98
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 596 FNDMVRFGvvP-DSYTYNSLIKALCDERRVNKAKEILfarERS-GCSTSNNQSYWPIVAALAKMGQFSEAGELMDKYLSR 673
Cdd:COG0457    99 YDKALELD--PdDAEALYNLGLALLELGRYDEAIEAY---ERAlELDPDDADALYNLGIALEKLGRYEEALELLEKLEAA 173

                  ...
gi 1002314886 674 NAQ 676
Cdd:COG0457   174 ALA 176
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
609-640 7.87e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 34.35  E-value: 7.87e-03
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1002314886 609 YTYNSLIKALCDERRVNKAKEILFARERSGCS 640
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIE 32
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
181-592 2.57e-28

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 121.91  E-value: 2.57e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  181 RLGLRPSTAVYNALIAASVRAGAVDTAYLRFQQMPADGCRPDHFTYNSLVHGVCRRGIVDEAVRLVRQMEGEGIRPNVFT 260
Cdd:PLN03218   430 KLIRNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHT 509
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  261 YTMLVDGFCNAGRVEEAFRMLDKMKEKGVAPSEATYRTLVHGVFRCLERDKAYRMLSDwLGHETslHPsachtmlyclsk 340
Cdd:PLN03218   510 FGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAE-MKAET--HP------------ 574
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  341 kdmakeavefekrmsargYLLDSTAFGIVMSCALKCLEVSDLCELLDSFIKNG--GNPgfDVYIMVIKSLLNCKNFSKAN 418
Cdd:PLN03218   575 ------------------IDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNikGTP--EVYTIAVNSCSQKGDWDFAL 634
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  419 HYLGHMVLKGLLSSVMSYNMVIDCFVKAGAVDKAEEIVKEMQDKGFLPNLVTFNTLISGYSKLGNVHNAKVVLKMLMEHG 498
Cdd:PLN03218   635 SIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIK 714
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  499 FMPDIITFTSLIDGLCNTHQLDDAFVCFEEMAEWGVRPNAQTYNVLMHTLCSAGHVNKAIDLLNKMKIDGVTPDAYSFNA 578
Cdd:PLN03218   715 LRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRC 794
                          410
                   ....*....|....
gi 1002314886  579 LIlSFCrMRKVDKA 592
Cdd:PLN03218   795 IT-GLC-LRRFEKA 806
PLN03218 PLN03218
maturation of RBCL 1; Provisional
424-638 2.27e-21

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 99.57  E-value: 2.27e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  424 MVLKGLLSSVMSYNMVIDCFVKAGAVDKAEEIVKEMQDKGFLPNLVTFNTLISGYSKLGNVHNAKVVL-KMLME-HGFMP 501
Cdd:PLN03218   498 MVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLaEMKAEtHPIDP 577
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  502 DIITFTSLIDGLCNTHQLDDAFVCFEEMAEWGVRPNAQTYNVLMHTLCSAGHVNKAIDLLNKMKIDGVTPDAYSFNALIL 581
Cdd:PLN03218   578 DHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVD 657
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  582 SFCRMRKVDKAEDIFNDM----VRFGVVPDSY-------------------------------TYNSLIKALCDERRVNK 626
Cdd:PLN03218   658 VAGHAGDLDKAFEILQDArkqgIKLGTVSYSSlmgacsnaknwkkalelyediksiklrptvsTMNALITALCEGNQLPK 737
                          250
                   ....*....|..
gi 1002314886  627 AKEILFARERSG 638
Cdd:PLN03218   738 ALEVLSEMKRLG 749
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
156-627 9.18e-21

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 97.25  E-value: 9.18e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 156 VCALVESWGRLGLARYAHEVFvQMPRLG--LRPSTAVYNALIAASVRAGAVDTAYLRFQQMPADGCRPDHFTYNS--LVH 231
Cdd:PLN03081   90 LCSQIEKLVACGRHREALELF-EILEAGcpFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRvlLMH 168
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 232 GVCrrGIVDEAVRLVRQMEGEgirpNVFTYTMLVDGFCNAGRVEEAFRMLDKMKEKGVAPSEATYRTLVHGVfrclerdk 311
Cdd:PLN03081  169 VKC--GMLIDARRLFDEMPER----NLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRAS-------- 234
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 312 ayrmlsdwlgheTSLHPSACHTMLYClskkdmakeavefekrmsargylldstafgivmsCALKCLEVSDL---CELLDS 388
Cdd:PLN03081  235 ------------AGLGSARAGQQLHC----------------------------------CVLKTGVVGDTfvsCALIDM 268
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 389 FIKnggnpgfdvyimviksllnCKNFSKANHYLGHMVLKgllsSVMSYNMVIDCFVKAGAVDKAEEIVKEMQDKGFLPNL 468
Cdd:PLN03081  269 YSK-------------------CGDIEDARCVFDGMPEK----TTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQ 325
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 469 VTFNTLISGYSKLGNVHNAKVVLKMLMEHGFMPDIITFTSLIDGLCNTHQLDDAFVCFEEMAewgvRPNAQTYNVLMHTL 548
Cdd:PLN03081  326 FTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMP----RKNLISWNALIAGY 401
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 549 CSAGHVNKAIDLLNKMKIDGVTPDAYSFNAlILSFCRMRK-VDKAEDIFNDMVR-FGVVPDSYTYNSLIKALCDERRVNK 626
Cdd:PLN03081  402 GNHGRGTKAVEMFERMIAEGVAPNHVTFLA-VLSACRYSGlSEQGWEIFQSMSEnHRIKPRAMHYACMIELLGREGLLDE 480

                  .
gi 1002314886 627 A 627
Cdd:PLN03081  481 A 481
PLN03218 PLN03218
maturation of RBCL 1; Provisional
421-630 3.28e-20

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 95.71  E-value: 3.28e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  421 LGHMVLKGLLSSVMSYNMvidCFVKA----GAVDKAEEIVKEMQDkgflPNLVTFNTLISGYSKLGNVHNAKVVLKMLME 496
Cdd:PLN03218   393 LEDMEKRGLLDMDKIYHA---KFFKAckkqRAVKEAFRFAKLIRN----PTLSTFNMLMSVCASSQDIDGALRVLRLVQE 465
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  497 HGFMPDIITFTSLIDGLCNTHQLDDAFVCFEEMAEWGVRPNAQTYNVLMHTLCSAGHVNKAIDLLNKMKIDGVTPDAYSF 576
Cdd:PLN03218   466 AGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVF 545
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1002314886  577 NALILSFCRMRKVDKAEDIFNDMVRFG--VVPDSYTYNSLIKALCDERRVNKAKEI 630
Cdd:PLN03218   546 NALISACGQSGAVDRAFDVLAEMKAEThpIDPDHITVGALMKACANAGQVDRAKEV 601
PLN03218 PLN03218
maturation of RBCL 1; Provisional
389-631 1.32e-19

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 93.79  E-value: 1.32e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  389 FIKNGGNPGFDVYIMVIKSLLNCKNFSKAnhyLGHMVL---KGLLSSVMSYNMVIDCFVKAGAVDKAEEIVKEMQDKGFL 465
Cdd:PLN03218   428 FAKLIRNPTLSTFNMLMSVCASSQDIDGA---LRVLRLvqeAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVE 504
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  466 PNLVTFNTLISGYSKLGNVHNAKVVLKMLMEHGFMPDIITFTSLIDGLCNTHQLDDAFVCFEEM-AEWG-VRPNAQTYNV 543
Cdd:PLN03218   505 ANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMkAETHpIDPDHITVGA 584
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  544 LMHTLCSAGHVNKAIDLLNKMKIDGV--TPDAYSfnaLILSFCRMR-KVDKAEDIFNDMVRFGVVPDSYTYNSLIKALCD 620
Cdd:PLN03218   585 LMKACANAGQVDRAKEVYQMIHEYNIkgTPEVYT---IAVNSCSQKgDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGH 661
                          250
                   ....*....|.
gi 1002314886  621 ERRVNKAKEIL 631
Cdd:PLN03218   662 AGDLDKAFEIL 672
PLN03077 PLN03077
Protein ECB2; Provisional
157-593 3.64e-19

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 92.22  E-value: 3.64e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 157 CALVESWGRLGLARyAHEVFVQMPRLGLRPSTAVYNALIAASVRAGAVDTAYLRFQQMPadgcRPDHFTYNSLVHGVCRR 236
Cdd:PLN03077  192 CVLRTCGGIPDLAR-GREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMP----RRDCISWNAMISGYFEN 266
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 237 GIVDEAVRLVRQMEGEGIRPNVFTYT-------MLVD--------------------GFCN--------AGRVEEAFRML 281
Cdd:PLN03077  267 GECLEGLELFFTMRELSVDPDLMTITsvisaceLLGDerlgremhgyvvktgfavdvSVCNsliqmylsLGSWGEAEKVF 346
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 282 DKMKEKGVApseaTYRTLVHGVFRCLERDKA---YRMLSdwlghETSLHP---------SACHtmlyCLSKKDMAKEAVE 349
Cdd:PLN03077  347 SRMETKDAV----SWTAMISGYEKNGLPDKAletYALME-----QDNVSPdeitiasvlSACA----CLGDLDVGVKLHE 413
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 350 FEKRMSARGYLLDSTAFgIVMSCALKCLEvsdlcELLDSFiKNGGNPGFDVYIMVIKSL-LNCKNFsKANHYLGHMVLKG 428
Cdd:PLN03077  414 LAERKGLISYVVVANAL-IEMYSKCKCID-----KALEVF-HNIPEKDVISWTSIIAGLrLNNRCF-EALIFFRQMLLTL 485
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 429 LLSSVMSYNMVIDCfVKAGAVDKAEEIVKE-----MQDKGFLPN-------------------------LVTFNTLISGY 478
Cdd:PLN03077  486 KPNSVTLIAALSAC-ARIGALMCGKEIHAHvlrtgIGFDGFLPNalldlyvrcgrmnyawnqfnshekdVVSWNILLTGY 564
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 479 SKLGNVHNAKVVLKMLMEHGFMPDIITFTSLIDGLCNTHQLDDAFVCFEEMAE-WGVRPNAQTYNVLMHTLCSAGHVNKA 557
Cdd:PLN03077  565 VAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEkYSITPNLKHYACVVDLLGRAGKLTEA 644
                         490       500       510
                  ....*....|....*....|....*....|....*..
gi 1002314886 558 IDLLNKMKIdgvTPDAYSFNALiLSFCRM-RKVDKAE 593
Cdd:PLN03077  645 YNFINKMPI---TPDPAVWGAL-LNACRIhRHVELGE 677
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
571-620 1.17e-16

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 73.94  E-value: 1.17e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002314886 571 PDAYSFNALILSFCRMRKVDKAEDIFNDMVRFGVVPDSYTYNSLIKALCD 620
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03077 PLN03077
Protein ECB2; Provisional
151-669 2.76e-16

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 82.98  E-value: 2.76e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 151 VSEELVCALVE--SWGRL---GLARYAHeVFVQMPRLGLRpstaVYNALIAASVRAGAVDTAYLRFQQMPadgcRPDHFT 225
Cdd:PLN03077   84 VDEDAYVALFRlcEWKRAveeGSRVCSR-ALSSHPSLGVR----LGNAMLSMFVRFGELVHAWYVFGKMP----ERDLFS 154
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 226 YNSLVHGVCRRGIVDEAVRLVRQMEGEGIRPNVFTYTMLV---DGFCNAGRVEEAFRMLDKMkekGVAPSEATYRTLVHG 302
Cdd:PLN03077  155 WNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLrtcGGIPDLARGREVHAHVVRF---GFELDVDVVNALITM 231
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 303 VFRCLERdKAYRMLSDWLGHETSLHPSAchtMLYCLSKKDMAKEAVEFEKRMsaRGYLLDS---TAFGIVMSCALkcLEV 379
Cdd:PLN03077  232 YVKCGDV-VSARLVFDRMPRRDCISWNA---MISGYFENGECLEGLELFFTM--RELSVDPdlmTITSVISACEL--LGD 303
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 380 SDLCELLDSFIKNggnPGFDVYIMVIKSLLncknfsKANHYLGHMVLKGLLSS------VMSYNMVIDCFVKAGAVDKAE 453
Cdd:PLN03077  304 ERLGREMHGYVVK---TGFAVDVSVCNSLI------QMYLSLGSWGEAEKVFSrmetkdAVSWTAMISGYEKNGLPDKAL 374
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 454 EIVKEMQDKGFLPNLVTFNTLISGYSKLGN---------------------VHNA--------KVVLKMLMEHGFMP--D 502
Cdd:PLN03077  375 ETYALMEQDNVSPDEITIASVLSACACLGDldvgvklhelaerkglisyvvVANAliemyskcKCIDKALEVFHNIPekD 454
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 503 IITFTSLIDGLCNTHQLDDAFVCFEEMAeWGVRPNAQTYNVLMHTLCSAGHV--NKAID---LLNKMKIDGVTPDAY--- 574
Cdd:PLN03077  455 VISWTSIIAGLRLNNRCFEALIFFRQML-LTLKPNSVTLIAALSACARIGALmcGKEIHahvLRTGIGFDGFLPNALldl 533
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 575 ----------------------SFNALILSFCRMRKVDKAEDIFNDMVRFGVVPDSYTYNSLIKALCDERRVNKAKEILF 632
Cdd:PLN03077  534 yvrcgrmnyawnqfnshekdvvSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFH 613
                         570       580       590
                  ....*....|....*....|....*....|....*..
gi 1002314886 633 ARERSGCSTSNNQSYWPIVAALAKMGQFSEAGELMDK 669
Cdd:PLN03077  614 SMEEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINK 650
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
501-549 1.23e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 71.24  E-value: 1.23e-15
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002314886 501 PDIITFTSLIDGLCNTHQLDDAFVCFEEMAEWGVRPNAQTYNVLMHTLC 549
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
256-302 2.39e-15

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 70.47  E-value: 2.39e-15
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 1002314886 256 PNVFTYTMLVDGFCNAGRVEEAFRMLDKMKEKGVAPSEATYRTLVHG 302
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILING 47
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
160-316 2.27e-14

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 72.43  E-value: 2.27e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 160 VESWGRLGLARYAHEVFVQMPRLGLRPSTAVYNALI---------AASVRAGAVDTAYLRFQQMPADGCRPDHFTYNSLV 230
Cdd:pfam17177  18 LDKCSKHADATGALALYDAAKAEGVRLAQYHYNVLLylcskaadaTDLKPQLAADRGFEVFEAMKAQGVSPNEATYTAVA 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 231 HGVCRRGIVDEAVRLVRQMEGEGIRPNVFTYTMLVDGFCNAGRVEEAFRMLDKMKEKGVAPSEATYRTLVHGVFRCLERD 310
Cdd:pfam17177  98 RLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRAD 177

                  ....*.
gi 1002314886 311 KAYRML 316
Cdd:pfam17177 178 KVYAYL 183
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
221-270 2.85e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 67.39  E-value: 2.85e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002314886 221 PDHFTYNSLVHGVCRRGIVDEAVRLVRQMEGEGIRPNVFTYTMLVDGFCN 270
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
536-585 2.21e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 64.69  E-value: 2.21e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002314886 536 PNAQTYNVLMHTLCSAGHVNKAIDLLNKMKIDGVTPDAYSFNALILSFCR 585
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
252-285 3.80e-13

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 63.52  E-value: 3.80e-13
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002314886 252 EGIRPNVFTYTMLVDGFCNAGRVEEAFRMLDKMK 285
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PLN03218 PLN03218
maturation of RBCL 1; Provisional
157-315 9.30e-13

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 71.83  E-value: 9.30e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  157 CALVESWGrlglarYAHEVFVQMPRLGLRPSTAVYNALIAASVRAGAVDTAYLRFQQMPADGCRPDHFTYNSLVHGVCRR 236
Cdd:PLN03218   624 CSQKGDWD------FALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNA 697
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1002314886  237 GIVDEAVRLVRQMEGEGIRPNVFTYTMLVDGFCNAGRVEEAFRMLDKMKEKGVAPSEATYRTLvhgvFRCLERDKAYRM 315
Cdd:PLN03218   698 KNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSIL----LVASERKDDADV 772
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
466-515 1.56e-12

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 62.38  E-value: 1.56e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002314886 466 PNLVTFNTLISGYSKLGNVHNAKVVLKMLMEHGFMPDIITFTSLIDGLCN 515
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03218 PLN03218
maturation of RBCL 1; Provisional
156-318 2.19e-12

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 70.68  E-value: 2.19e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  156 VCALVESWGRLGLARYAHEVFVQMPRLGLRPSTAVYNALIAASVRAGAVDTAYLRFQQMPADGCRPDHFTYNSLVHGVCR 235
Cdd:PLN03218   582 VGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGH 661
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  236 RGIVDEAVRLVRQMEGEGIRPNVFTYTMLVDGFCNAGRVEEAFRMLDKMKEKGVAPSEATYRTLVHGVFRCLERDKAYRM 315
Cdd:PLN03218   662 AGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEV 741

                   ...
gi 1002314886  316 LSD 318
Cdd:PLN03218   742 LSE 744
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
432-480 2.83e-12

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 61.61  E-value: 2.83e-12
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002314886 432 SVMSYNMVIDCFVKAGAVDKAEEIVKEMQDKGFLPNLVTFNTLISGYSK 480
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
217-250 3.93e-11

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 58.13  E-value: 3.93e-11
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002314886 217 DGCRPDHFTYNSLVHGVCRRGIVDEAVRLVRQME 250
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
244-300 1.68e-10

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 56.98  E-value: 1.68e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1002314886 244 RLVRQMEGEGIRPNVFTYTMLVDGFCNAGRVEEAFRMLDKMKEKGVAPSEATYRTLV 300
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PLN03218 PLN03218
maturation of RBCL 1; Provisional
158-309 2.19e-10

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 64.13  E-value: 2.19e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886  158 ALVESWGRLGLARYAHEVFVQMPRLGLRPSTAVYNALIAASVRAGAVDTAYLRFQQMPADGCRPDHFTYNSLVHGVCRRG 237
Cdd:PLN03218   654 ALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGN 733
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1002314886  238 IVDEAVRLVRQMEGEGIRPNVFTYTMLVDGFCNAGRVEEAFRMLDKMKEKGVAPSEATYRTLVhGVfrCLER 309
Cdd:PLN03218   734 QLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT-GL--CLRR 802
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
186-235 2.93e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 53.14  E-value: 2.93e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002314886 186 PSTAVYNALIAASVRAGAVDTAYLRFQQMPADGCRPDHFTYNSLVHGVCR 235
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03077 PLN03077
Protein ECB2; Provisional
267-617 5.44e-09

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 59.48  E-value: 5.44e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 267 GFCNAGRVEEAFRMLDKMKEKGVAPSEATYRTLvhgvFRCLERDKAYRMLSDWLGHETSLHPSAC----HTMLYCLSKKD 342
Cdd:PLN03077   60 ALCSHGQLEQALKLLESMQELRVPVDEDAYVAL----FRLCEWKRAVEEGSRVCSRALSSHPSLGvrlgNAMLSMFVRFG 135
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 343 MAKEAVEFEKRMSARgyllDSTAFGIVMSCALKCLEVSDLCELLDSFIKNGGNPgfDVYIM--VIKSLLNCKNFSKANHY 420
Cdd:PLN03077  136 ELVHAWYVFGKMPER----DLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRP--DVYTFpcVLRTCGGIPDLARGREV 209
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 421 LGHMVLKGLLSSVMSYNMVIDCFVKAGAVDKAEEIVKEMQDKgflpNLVTFNTLISGYSKLGNVHNAKVVLKMLMEHGFM 500
Cdd:PLN03077  210 HAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRR----DCISWNAMISGYFENGECLEGLELFFTMRELSVD 285
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 501 PDIITFTSLID--GLCNTHQLDdafvcfEEMAEWGVRP----NAQTYNVLMHTLCSAGHVNKAIDLLNKMKidgvTPDAY 574
Cdd:PLN03077  286 PDLMTITSVISacELLGDERLG------REMHGYVVKTgfavDVSVCNSLIQMYLSLGSWGEAEKVFSRME----TKDAV 355
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|...
gi 1002314886 575 SFNALILSFCRMRKVDKAEDIFNDMVRFGVVPDSYTYNSLIKA 617
Cdd:PLN03077  356 SWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSA 398
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
519-631 9.30e-09

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 56.25  E-value: 9.30e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 519 LDDAFVCFEEMAEWGVRPNAQTYNVLMHtLCSA-GHVNKAIDLLNKMKIDGVTPDAYSFNALILSFCRMRKVDKAEDIFN 597
Cdd:pfam17177  71 ADRGFEVFEAMKAQGVSPNEATYTAVAR-LAAAkGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEE 149
                          90       100       110
                  ....*....|....*....|....*....|....
gi 1002314886 598 DMVRFGVVPDSYTYNSLIKALCDERRVNKAKEIL 631
Cdd:pfam17177 150 HMLAHGVELEEPELAALLKVSAKAGRADKVYAYL 183
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
259-293 2.62e-08

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 50.15  E-value: 2.62e-08
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002314886 259 FTYTMLVDGFCNAGRVEEAFRMLDKMKEKGVAPSE 293
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
526-630 3.48e-08

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 54.32  E-value: 3.48e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 526 FEEMAEWGVRPNAQTYNVLMHtLCS------AGHVNKAID----LLNKMKIDGVTPDAYSFNALILSFCRMRKVDKAEDI 595
Cdd:pfam17177  34 YDAAKAEGVRLAQYHYNVLLY-LCSkaadatDLKPQLAADrgfeVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLAFDL 112
                          90       100       110
                  ....*....|....*....|....*....|....*
gi 1002314886 596 FNDMVRFGVVPDSYTYNSLIKALCDERRVNKAKEI 630
Cdd:pfam17177 113 VKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEV 147
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
455-514 5.10e-08

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 50.05  E-value: 5.10e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 455 IVKEMQDKGFLPNLVTFNTLISGYSKLGNVHNAKVVLKMLMEHGFMPDIITFTSLIdGLC 514
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL-GVI 60
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
345-674 6.11e-08

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 56.03  E-value: 6.11e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 345 KEAVEFEKRMSARG-YLLDSTAFGIVMSCALKCLEVSDLCELLDSFIKNGGNPGFDVYIMVIKSLLNCKNFSKANHYLGH 423
Cdd:PLN03081  104 REALELFEILEAGCpFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDE 183
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 424 MVLKGLLSsvmsYNMVIDCFVKAGAVDKAEEIVKEMQDKGFLPNLVTFNTLISGYSKLGNVHNAKVVLKMLMEHGFMPDI 503
Cdd:PLN03081  184 MPERNLAS----WGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVVGDT 259
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 504 ITFTSLIDGLCNTHQLDDAFVCFEEMAEWGVrpnaQTYNVLMHTLCSAGHVNKAIDLLNKMKIDGVTPDAYSFNALILSF 583
Cdd:PLN03081  260 FVSCALIDMYSKCGDIEDARCVFDGMPEKTT----VAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIF 335
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 584 CRMRKVDKAEDIFNDMVRFGVVPDSYTYNSLIKALCDERRVNKAKEILFARERsgcstSNNQSYWPIVAALAKMGQFSEA 663
Cdd:PLN03081  336 SRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPR-----KNLISWNALIAGYGNHGRGTKA 410
                         330
                  ....*....|.
gi 1002314886 664 GELMDKYLSRN 674
Cdd:PLN03081  411 VEMFERMIAEG 421
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
532-565 7.31e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 48.88  E-value: 7.31e-08
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002314886 532 WGVRPNAQTYNVLMHTLCSAGHVNKAIDLLNKMK 565
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
498-545 2.17e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 48.12  E-value: 2.17e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1002314886 498 GFMPDIITFTSLIDGLCNTHQLDDAFVCFEEMAEWGVRPNAQTYNVLM 545
Cdd:pfam13812  10 GIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
574-608 2.44e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 47.06  E-value: 2.44e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002314886 574 YSFNALILSFCRMRKVDKAEDIFNDMVRFGVVPDS 608
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
560-621 3.15e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 47.74  E-value: 3.15e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1002314886 560 LLNKMKIDGVTPDAYSFNALILSFCRMRKVDKAEDIFNDMVRFGVVPDSYTYNSLIKALCDE 621
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
259-289 4.44e-07

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 46.30  E-value: 4.44e-07
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002314886 259 FTYTMLVDGFCNAGRVEEAFRMLDKMKEKGV 289
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
567-600 4.57e-07

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 46.57  E-value: 4.57e-07
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002314886 567 DGVTPDAYSFNALILSFCRMRKVDKAEDIFNDMV 600
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
497-529 6.19e-07

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 46.18  E-value: 6.19e-07
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002314886 497 HGFMPDIITFTSLIDGLCNTHQLDDAFVCFEEM 529
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
140-318 8.67e-07

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 52.18  E-value: 8.67e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 140 LVAEVRGCGCEVSEELVCALVESWGRLGLARYAHEVFVQMPRLGLrPSTAVYN-ALIAASVRAGAVDTAYLRFQQMPadg 218
Cdd:PLN03081  312 LYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGF-PLDIVANtALVDLYSKWGRMEDARNVFDRMP--- 387
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 219 cRPDHFTYNSLVHGVCRRGIVDEAVRLVRQMEGEGIRPNVFTYTMLVDGFCNAGRVEEAFRMLDKMKE-KGVAPSEATYR 297
Cdd:PLN03081  388 -RKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSEnHRIKPRAMHYA 466
                         170       180
                  ....*....|....*....|.
gi 1002314886 298 TLVHGVFRCLERDKAYRMLSD 318
Cdd:PLN03081  467 CMIELLGREGLLDEAYAMIRR 487
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
200-326 1.30e-06

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 49.70  E-value: 1.30e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 200 RAGAVDTAYLRFQQMPADGCRPDHFTYNSLVHgVCRRGIV----------DEAVRLVRQMEGEGIRPNVFTYTMLVDGFC 269
Cdd:pfam17177  23 KHADATGALALYDAAKAEGVRLAQYHYNVLLY-LCSKAADatdlkpqlaaDRGFEVFEAMKAQGVSPNEATYTAVARLAA 101
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1002314886 270 NAGRVEEAFRMLDKMKEKGVAPSEATYRTLVHGVFRCLERDKAYRMLSDWLGHETSL 326
Cdd:pfam17177 102 AKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVEL 158
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
424-480 1.48e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 45.81  E-value: 1.48e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1002314886 424 MVLKGLLSSVMSYNMVIDCFVKAGAVDKAEEIVKEMQDKGFLPNLVTFNTLISGYSK 480
Cdd:pfam13812   6 MVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGG 62
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
602-631 1.89e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 44.64  E-value: 1.89e-06
                          10        20        30
                  ....*....|....*....|....*....|
gi 1002314886 602 FGVVPDSYTYNSLIKALCDERRVNKAKEIL 631
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELL 30
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
462-493 2.30e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 44.64  E-value: 2.30e-06
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002314886 462 KGFLPNLVTFNTLISGYSKLGNVHNAKVVL-KM 493
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLdEM 33
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
540-573 2.47e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 44.37  E-value: 2.47e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002314886 540 TYNVLMHTLCSAGHVNKAIDLLNKMKIDGVTPDA 573
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
224-258 4.19e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.60  E-value: 4.19e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002314886 224 FTYNSLVHGVCRRGIVDEAVRLVRQMEGEGIRPNV 258
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
211-265 7.49e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 43.89  E-value: 7.49e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1002314886 211 FQQMPADGCRPDHFTYNSLVHGVCRRGIVDEAVRLVRQMEGEGIRPNVFTYTMLV 265
Cdd:pfam13812   3 LREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
434-467 7.78e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 42.83  E-value: 7.78e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002314886 434 MSYNMVIDCFVKAGAVDKAEEIVKEMQDKGFLPN 467
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
175-234 8.43e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 43.89  E-value: 8.43e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 175 VFVQMPRLGLRPSTAVYNALIAASVRAGAVDTAYLRFQQMPADGCRPDHFTYNSLVhGVC 234
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL-GVI 60
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
423-501 8.88e-06

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 47.39  E-value: 8.88e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1002314886 423 HMVLKGLLSSVMSYNMVIDCFVKAGAVDKAEEIVKEMQDKGFLPNLVTFNTLISGYSKLGNVHNAKVVLKMLMEHGFMP 501
Cdd:pfam17177  80 AMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVEL 158
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
435-464 2.45e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 41.68  E-value: 2.45e-05
                          10        20        30
                  ....*....|....*....|....*....|
gi 1002314886 435 SYNMVIDCFVKAGAVDKAEEIVKEMQDKGF 464
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
574-604 2.45e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 41.68  E-value: 2.45e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002314886 574 YSFNALILSFCRMRKVDKAEDIFNDMVRFGV 604
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
526-580 3.05e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 42.35  E-value: 3.05e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1002314886 526 FEEMAEWGVRPNAQTYNVLMHTLCSAGHVNKAIDLLNKMKIDGVTPDAYSFNALI 580
Cdd:pfam13812   3 LREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
504-538 3.81e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.90  E-value: 3.81e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002314886 504 ITFTSLIDGLCNTHQLDDAFVCFEEMAEWGVRPNA 538
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
427-459 5.92e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 40.41  E-value: 5.92e-05
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002314886 427 KGLLSSVMSYNMVIDCFVKAGAVDKAEEIVKEM 459
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
469-503 6.81e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.52  E-value: 6.81e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1002314886 469 VTFNTLISGYSKLGNVHNAKVVLKMLMEHGFMPDI 503
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
606-656 8.36e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 40.42  E-value: 8.36e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1002314886 606 PDSYTYNSLIKALCDERRVNKAKEILFARERSGCStSNNQSYWPIVAALAK 656
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVK-PNVYTYTILINGLCK 50
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
469-499 1.83e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.99  E-value: 1.83e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002314886 469 VTFNTLISGYSKLGNVHNAKVVLKMLMEHGF 499
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PLN03077 PLN03077
Protein ECB2; Provisional
350-622 3.21e-04

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 44.07  E-value: 3.21e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 350 FEKRMSARGYLLDSTAFGIVMSCALKCL----EVSDLCELLDSFIKNGGNPGFDVYIMVIKSllnCKNfsKANHYLGHMV 425
Cdd:PLN03077   34 GRKRSRGLSVAASSSSSTHDSNSQLRALcshgQLEQALKLLESMQELRVPVDEDAYVALFRL---CEW--KRAVEEGSRV 108
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 426 LKGLLSSVMSY-----NMVIDCFVKAGAVDKAEEIVKEMQDKgflpNLVTFNTLISGYSKLGNVHNAKVVLKMLMEHGFM 500
Cdd:PLN03077  109 CSRALSSHPSLgvrlgNAMLSMFVRFGELVHAWYVFGKMPER----DLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVR 184
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 501 PDIITFTSLIDGLCNTHQLDDAFVCFEEMAEWGVRPNAQTYNVLMHTLCSAGHVNKAIDLLNKMKidgvTPDAYSFNALI 580
Cdd:PLN03077  185 PDVYTFPCVLRTCGGIPDLARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMP----RRDCISWNAMI 260
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*
gi 1002314886 581 LSFCRMRKVDKAEDIFNDMVRFGVVPDSYTYNSLIKA---LCDER 622
Cdd:PLN03077  261 SGYFENGECLEGLELFFTMRELSVDPDLMTITSVISAcelLGDER 305
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
224-254 4.75e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.83  E-value: 4.75e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002314886 224 FTYNSLVHGVCRRGIVDEAVRLVRQMEGEGI 254
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
540-569 7.24e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.44  E-value: 7.24e-04
                          10        20        30
                  ....*....|....*....|....*....|
gi 1002314886 540 TYNVLMHTLCSAGHVNKAIDLLNKMKIDGV 569
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
115-289 7.55e-04

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 42.94  E-value: 7.55e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 115 ARDGGVRRALVGALWRRGPVVLSGRLVAEVRGCGCEVSEELVCALVESWGRLGLARYAHEVFVQMPRLGLRPSTAVYNAL 194
Cdd:PLN03081  186 ERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCAL 265
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 195 IAASVRAGAVDTAYLRFQQMPadgcRPDHFTYNSLVHGVCRRGIVDEAVRLVRQMEGEGIRPNVFTY------------- 261
Cdd:PLN03081  266 IDMYSKCGDIEDARCVFDGMP----EKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFsimirifsrlall 341
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 1002314886 262 ----------------------TMLVDGFCNAGRVEEAFRMLDKMKEKGV 289
Cdd:PLN03081  342 ehakqahaglirtgfpldivanTALVDLYSKWGRMEDARNVFDRMPRKNL 391
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
504-534 1.07e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.06  E-value: 1.07e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1002314886 504 ITFTSLIDGLCNTHQLDDAFVCFEEMAEWGV 534
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
518-676 7.35e-03

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 38.83  E-value: 7.35e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 518 QLDDAFVCFEEMAEwgVRPN-AQTYNVLMHTLCSAGHVNKAIDLLNK-MKIDGVTPDAYsfNALILSFCRMRKVDKAEDI 595
Cdd:COG0457    23 RYEEAIEDYEKALE--LDPDdAEALYNLGLAYLRLGRYEEALADYEQaLELDPDDAEAL--NNLGLALQALGRYEEALED 98
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002314886 596 FNDMVRFGvvP-DSYTYNSLIKALCDERRVNKAKEILfarERS-GCSTSNNQSYWPIVAALAKMGQFSEAGELMDKYLSR 673
Cdd:COG0457    99 YDKALELD--PdDAEALYNLGLALLELGRYDEAIEAY---ERAlELDPDDADALYNLGIALEKLGRYEEALELLEKLEAA 173

                  ...
gi 1002314886 674 NAQ 676
Cdd:COG0457   174 ALA 176
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
609-640 7.87e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 34.35  E-value: 7.87e-03
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1002314886 609 YTYNSLIKALCDERRVNKAKEILFARERSGCS 640
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIE 32
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH