NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1002231757|ref|XP_015613947|]
View 

putative pentatricopeptide repeat-containing protein At3g25970 [Oryza sativa Japonica Group]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03077 super family cl33629
Protein ECB2; Provisional
130-820 3.21e-140

Protein ECB2; Provisional


The actual alignment was detected with superfamily member PLN03077:

Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 436.20  E-value: 3.21e-140
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 130 TAMVSAITAAGDAGAALRLFAEMSEEGVVPNGFALAAALKACTVGSDLGFTPQVHAQAVKLEGLLDPYVSSSLVEAYVSC 209
Cdd:PLN03077   55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFRLCEWKRAVEEGSRVCSRALSSHPSLGVRLGNAMLSMFVRF 134
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 210 GEVDVAERALLDSPVRSDVSWNALLNEYARDGDYAKVMLVFDKLVESGDEISKYTLPTVLKCCMELGLAKSGQAVHGLVI 289
Cdd:PLN03077  135 GELVHAWYVFGKMPERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVV 214
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 290 KRGLETDRVLNNCLIEMYSKCLSAEDAYEVFARIDEPDVVHCSLMISCFDRHDMAPEAFDIFMQMSDMGVKPNQYTFVGL 369
Cdd:PLN03077  215 RFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSV 294
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 370 AIVASRTGDVNLCRSIHAHIVKSGFSRTKGVCDAIVGMYVKTGAVQDAILAFDLMQGPDIASWNTLLSGFYSGNNCEHGL 449
Cdd:PLN03077  295 ISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKAL 374
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 450 RIFKELICEGVLANKYTYVGILRCCTSLMDLRFGCQVHACVLKSGFQGDYDVSKMLLDMYVQAGCFTNARLVFDRLKERD 529
Cdd:PLN03077  375 ETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEKD 454
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 530 VFSWTVVMSTYAKTDEGEKAIECFRSMLRENKrPNDATLATSLSVCSDLACLGSGLQLHSYTIKSGWNSS-VVSSALVDM 608
Cdd:PLN03077  455 VISWTSIIAGLRLNNRCFEALIFFRQMLLTLK-PNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFDgFLPNALLDL 533
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 609 YVKCGNLADAEMLFDeSDTHDLVEWNTIICGYAQHGHGYKALEAFQEMIDEGNVPDEITFVGVLSACSHAGLLDEGRRYF 688
Cdd:PLN03077  534 YVRCGRMNYAWNQFN-SHEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYF 612
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 689 KLLSSVYGITPTLEHYACMVDILAKAGKLAEAESLINEMPLTPDASLWKTILGACRMHGNIEIAERAAEKLFESQPDDIS 768
Cdd:PLN03077  613 HSMEEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQHIFELDPNSVG 692
                         650       660       670       680       690
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1002231757 769 SCILLSNIYADLKRWNDVAKLRSMLVDRGVKKEPGCSWIEINGKLHVFLSQD 820
Cdd:PLN03077  693 YYILLCNLYADAGKWDEVARVRKTMRENGLTVDPGCSWVEVKGKVHAFLTDD 744
PPR super family cl29710
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
90-121 5.35e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam12854:

Pssm-ID: 421891 [Multi-domain]  Cd Length: 34  Bit Score: 38.10  E-value: 5.35e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1002231757  90 GPRPDAFLHDSLLNMYCKCGRLADARSVFDGM 121
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
 
Name Accession Description Interval E-value
PLN03077 PLN03077
Protein ECB2; Provisional
130-820 3.21e-140

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 436.20  E-value: 3.21e-140
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 130 TAMVSAITAAGDAGAALRLFAEMSEEGVVPNGFALAAALKACTVGSDLGFTPQVHAQAVKLEGLLDPYVSSSLVEAYVSC 209
Cdd:PLN03077   55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFRLCEWKRAVEEGSRVCSRALSSHPSLGVRLGNAMLSMFVRF 134
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 210 GEVDVAERALLDSPVRSDVSWNALLNEYARDGDYAKVMLVFDKLVESGDEISKYTLPTVLKCCMELGLAKSGQAVHGLVI 289
Cdd:PLN03077  135 GELVHAWYVFGKMPERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVV 214
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 290 KRGLETDRVLNNCLIEMYSKCLSAEDAYEVFARIDEPDVVHCSLMISCFDRHDMAPEAFDIFMQMSDMGVKPNQYTFVGL 369
Cdd:PLN03077  215 RFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSV 294
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 370 AIVASRTGDVNLCRSIHAHIVKSGFSRTKGVCDAIVGMYVKTGAVQDAILAFDLMQGPDIASWNTLLSGFYSGNNCEHGL 449
Cdd:PLN03077  295 ISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKAL 374
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 450 RIFKELICEGVLANKYTYVGILRCCTSLMDLRFGCQVHACVLKSGFQGDYDVSKMLLDMYVQAGCFTNARLVFDRLKERD 529
Cdd:PLN03077  375 ETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEKD 454
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 530 VFSWTVVMSTYAKTDEGEKAIECFRSMLRENKrPNDATLATSLSVCSDLACLGSGLQLHSYTIKSGWNSS-VVSSALVDM 608
Cdd:PLN03077  455 VISWTSIIAGLRLNNRCFEALIFFRQMLLTLK-PNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFDgFLPNALLDL 533
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 609 YVKCGNLADAEMLFDeSDTHDLVEWNTIICGYAQHGHGYKALEAFQEMIDEGNVPDEITFVGVLSACSHAGLLDEGRRYF 688
Cdd:PLN03077  534 YVRCGRMNYAWNQFN-SHEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYF 612
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 689 KLLSSVYGITPTLEHYACMVDILAKAGKLAEAESLINEMPLTPDASLWKTILGACRMHGNIEIAERAAEKLFESQPDDIS 768
Cdd:PLN03077  613 HSMEEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQHIFELDPNSVG 692
                         650       660       670       680       690
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1002231757 769 SCILLSNIYADLKRWNDVAKLRSMLVDRGVKKEPGCSWIEINGKLHVFLSQD 820
Cdd:PLN03077  693 YYILLCNLYADAGKWDEVARVRKTMRENGLTVDPGCSWVEVKGKVHAFLTDD 744
E_motif pfam20431
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ...
746-808 1.92e-19

E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466580 [Multi-domain]  Cd Length: 63  Bit Score: 82.59  E-value: 1.92e-19
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1002231757 746 HGNIEIAERAAEKLFESQPDDISSCILLSNIYADLKRWNDVAKLRSMLVDRGVKKEPGCSWIE 808
Cdd:pfam20431   1 YSNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSWIE 63
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
540-789 1.54e-05

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 47.42  E-value: 1.54e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 540 YAKTDEGEKAIECFRSMLRENkrPNDATLATSLSVC------SDLAclgsgLQLHSYTIKSGWNSSVVSSALVDMYVKCG 613
Cdd:COG2956    18 YLLNGQPDKAIDLLEEALELD--PETVEAHLALGNLyrrrgeYDRA-----IRIHQKLLERDPDRAEALLELAQDYLKAG 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 614 NLADAEMLFD---ESDTHDLVEWNTIICGYAQHGHGYKALEAFQEMIDEGnvPDEITFVGVLSAC-SHAGLLDEGRRYF- 688
Cdd:COG2956    91 LLDRAEELLEkllELDPDDAEALRLLAEIYEQEGDWEKAIEVLERLLKLG--PENAHAYCELAELyLEQGDYDEAIEALe 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 689 KLLSSVYGITPTLEHYAcmvDILAKAGKLAEAESLINEMP-LTPDASLWKTILGACRMH-GNIEIAERAAEKLFESQPDD 766
Cdd:COG2956   169 KALKLDPDCARALLLLA---ELYLEQGDYEEAIAALERALeQDPDYLPALPRLAELYEKlGDPEEALELLRKALELDPSD 245
                         250       260
                  ....*....|....*....|...
gi 1002231757 767 iSSCILLSNIYADLKRWNDVAKL 789
Cdd:COG2956   246 -DLLLALADLLERKEGLEAALAL 267
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
633-665 1.10e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.13  E-value: 1.10e-04
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002231757 633 WNTIICGYAQHGHGYKALEAFQEMIDEGNVPDE 665
Cdd:TIGR00756   3 YNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
90-121 5.35e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 38.10  E-value: 5.35e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1002231757  90 GPRPDAFLHDSLLNMYCKCGRLADARSVFDGM 121
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
 
Name Accession Description Interval E-value
PLN03077 PLN03077
Protein ECB2; Provisional
130-820 3.21e-140

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 436.20  E-value: 3.21e-140
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 130 TAMVSAITAAGDAGAALRLFAEMSEEGVVPNGFALAAALKACTVGSDLGFTPQVHAQAVKLEGLLDPYVSSSLVEAYVSC 209
Cdd:PLN03077   55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFRLCEWKRAVEEGSRVCSRALSSHPSLGVRLGNAMLSMFVRF 134
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 210 GEVDVAERALLDSPVRSDVSWNALLNEYARDGDYAKVMLVFDKLVESGDEISKYTLPTVLKCCMELGLAKSGQAVHGLVI 289
Cdd:PLN03077  135 GELVHAWYVFGKMPERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVV 214
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 290 KRGLETDRVLNNCLIEMYSKCLSAEDAYEVFARIDEPDVVHCSLMISCFDRHDMAPEAFDIFMQMSDMGVKPNQYTFVGL 369
Cdd:PLN03077  215 RFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSV 294
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 370 AIVASRTGDVNLCRSIHAHIVKSGFSRTKGVCDAIVGMYVKTGAVQDAILAFDLMQGPDIASWNTLLSGFYSGNNCEHGL 449
Cdd:PLN03077  295 ISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKAL 374
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 450 RIFKELICEGVLANKYTYVGILRCCTSLMDLRFGCQVHACVLKSGFQGDYDVSKMLLDMYVQAGCFTNARLVFDRLKERD 529
Cdd:PLN03077  375 ETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEKD 454
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 530 VFSWTVVMSTYAKTDEGEKAIECFRSMLRENKrPNDATLATSLSVCSDLACLGSGLQLHSYTIKSGWNSS-VVSSALVDM 608
Cdd:PLN03077  455 VISWTSIIAGLRLNNRCFEALIFFRQMLLTLK-PNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFDgFLPNALLDL 533
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 609 YVKCGNLADAEMLFDeSDTHDLVEWNTIICGYAQHGHGYKALEAFQEMIDEGNVPDEITFVGVLSACSHAGLLDEGRRYF 688
Cdd:PLN03077  534 YVRCGRMNYAWNQFN-SHEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYF 612
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 689 KLLSSVYGITPTLEHYACMVDILAKAGKLAEAESLINEMPLTPDASLWKTILGACRMHGNIEIAERAAEKLFESQPDDIS 768
Cdd:PLN03077  613 HSMEEKYSITPNLKHYACVVDLLGRAGKLTEAYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQHIFELDPNSVG 692
                         650       660       670       680       690
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1002231757 769 SCILLSNIYADLKRWNDVAKLRSMLVDRGVKKEPGCSWIEINGKLHVFLSQD 820
Cdd:PLN03077  693 YYILLCNLYADAGKWDEVARVRKTMRENGLTVDPGCSWVEVKGKVHAFLTDD 744
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
338-825 5.69e-108

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 346.86  E-value: 5.69e-108
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 338 FDRHDMAPEAFDIFMQMS--DMGVKpnqyTFVGLAIVASRTGDVNLCRSIHAHIVKSGFSRTKGVCDAIVGMYVKTGAVQ 415
Cdd:PLN03081  100 CGRHREALELFEILEAGCpfTLPAS----TYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLI 175
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 416 DAILAFDLMQGPDIASWNTLLSGFYSGNNCEHGLRIFKELICEGVLANKYTYVGILRCCTSLMDLRFGCQVHACVLKSGF 495
Cdd:PLN03081  176 DARRLFDEMPERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGV 255
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 496 QGDYDVSKMLLDMYVQAGCFTNARLVFDRLKERDVFSWTVVMSTYAKTDEGEKAIECFRSMLRENKRPNDATLATSLSVC 575
Cdd:PLN03081  256 VGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIF 335
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 576 SDLACLGSGLQLHSYTIKSGWNSSVVS-SALVDMYVKCGNLADAEMLFDESDTHDLVEWNTIICGYAQHGHGYKALEAFQ 654
Cdd:PLN03081  336 SRLALLEHAKQAHAGLIRTGFPLDIVAnTALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTKAVEMFE 415
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 655 EMIDEGNVPDEITFVGVLSACSHAGLLDEGRRYFKLLSSVYGITPTLEHYACMVDILAKAGKLAEAESLINEMPLTPDAS 734
Cdd:PLN03081  416 RMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSENHRIKPRAMHYACMIELLGREGLLDEAYAMIRRAPFKPTVN 495
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 735 LWKTILGACRMHGNIEIAERAAEKLFESQPDDISSCILLSNIYADLKRWNDVAKLRSMLVDRGVKKEPGCSWIEINGKLH 814
Cdd:PLN03081  496 MWAALLTACRIHKNLELGRLAAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSMHPACTWIEVKKQDH 575
                         490
                  ....*....|.
gi 1002231757 815 VFLSQDGCAKY 825
Cdd:PLN03081  576 SFFSGDRLHPQ 586
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
61-367 1.64e-51

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 192.01  E-value: 1.64e-51
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757  61 RSYAAALQGCAASRALRRGKALHARLLRSGPRPDAFLHDSLLNMYCKCGRLADARSVFDGMPHRDVVAWTAMVSAITAAG 140
Cdd:PLN03081  124 STYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPERNLASWGTIIGGLVDAG 203
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 141 DAGAALRLFAEMSEEGVVPNGFALAAALKACTVGSDLGFTPQVHAQAVKLEGLLDPYVSSSLVEAYVSCGEVDVAERALL 220
Cdd:PLN03081  204 NYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFD 283
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 221 DSPVRSDVSWNALLNEYARDGDYAKVMLVFDKLVESGDEISKYTLPTVLKCCMELGLAKSGQAVHGLVIKRGLETDRVLN 300
Cdd:PLN03081  284 GMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVAN 363
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1002231757 301 NCLIEMYSKCLSAEDAYEVFARIDEPDVVHCSLMISCFDRHDMAPEAFDIFMQMSDMGVKPNQYTFV 367
Cdd:PLN03081  364 TALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFL 430
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
59-322 1.80e-31

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 131.53  E-value: 1.80e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757  59 ELRSYAAALQGCAASRALRRGKALHARLLRSGPRPDAFLHDSLLNMYCKCGRLADARSVFDGMPHRDVVAWTAMVSAITA 138
Cdd:PLN03081  223 EPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYAL 302
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 139 AGDAGAALRLFAEMSEEGVVPNGFALAAALKACTVGSDLGFTPQVHAQAVKLEGLLDPYVSSSLVEAYVSCGEVDVAERA 218
Cdd:PLN03081  303 HGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNV 382
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 219 LLDSPVRSDVSWNALLNEYARDGDYAKVMLVFDKLVESGDEISKYTLPTVLKCCMELGLAKSGQAVHGLVIKRGLETDRV 298
Cdd:PLN03081  383 FDRMPRKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSENHRIKPRA 462
                         250       260
                  ....*....|....*....|....*
gi 1002231757 299 LN-NCLIEMYSKCLSAEDAYEVFAR 322
Cdd:PLN03081  463 MHyACMIELLGREGLLDEAYAMIRR 487
E_motif pfam20431
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ...
746-808 1.92e-19

E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466580 [Multi-domain]  Cd Length: 63  Bit Score: 82.59  E-value: 1.92e-19
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1002231757 746 HGNIEIAERAAEKLFESQPDDISSCILLSNIYADLKRWNDVAKLRSMLVDRGVKKEPGCSWIE 808
Cdd:pfam20431   1 YSNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSWIE 63
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
629-677 1.37e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 48.51  E-value: 1.37e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1002231757 629 DLVEWNTIICGYAQHGHGYKALEAFQEMIDEGNVPDEITFVGVLSACSH 677
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03218 PLN03218
maturation of RBCL 1; Provisional
308-708 1.79e-07

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 54.88  E-value: 1.79e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757  308 SKCLSAED---AYEVFARIDE----PDVVHCSLMISCFDRHDMAPEAFDIFMQMSDMGVKPNQYTFVGLAIVASRTGDVn 380
Cdd:PLN03218   445 SVCASSQDidgALRVLRLVQEaglkADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQV- 523
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757  381 lCRSIHAHivksGFSRTKGV------CDAIVGMYVKTGAVQDailAFDLMQ---------GPDIASWNTLLSGFYSGNNC 445
Cdd:PLN03218   524 -AKAFGAY----GIMRSKNVkpdrvvFNALISACGQSGAVDR---AFDVLAemkaethpiDPDHITVGALMKACANAGQV 595
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757  446 EHGLRIFKELicegvlaNKYTYVGILRCCTSlmdlrfgcQVHACVLKsgfqGDYDVskmlldmyvqagcftnARLVFDRL 525
Cdd:PLN03218   596 DRAKEVYQMI-------HEYNIKGTPEVYTI--------AVNSCSQK----GDWDF----------------ALSIYDDM 640
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757  526 KERDV------FSWTVVMSTYAKtdEGEKAIECFRSMLRENKRPNDATLATSLSVCSDLACLGSGLQLHSyTIKSGWNSS 599
Cdd:PLN03218   641 KKKGVkpdevfFSALVDVAGHAG--DLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYE-DIKSIKLRP 717
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757  600 VVSS--ALVDMyvkcgnLADAEMLfdesdthdlvewntiicgyaqhghgYKALEAFQEMIDEGNVPDEITFVGVLSACSH 677
Cdd:PLN03218   718 TVSTmnALITA------LCEGNQL-------------------------PKALEVLSEMKRLGLCPNTITYSILLVASER 766
                          410       420       430
                   ....*....|....*....|....*....|...
gi 1002231757  678 AGLLDEGrryFKLLSSVY--GITPTLEHYACMV 708
Cdd:PLN03218   767 KDDADVG---LDLLSQAKedGIKPNLVMCRCIT 796
PLN03218 PLN03218
maturation of RBCL 1; Provisional
125-500 3.21e-07

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 54.11  E-value: 3.21e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757  125 DVVAWTAMVSAITAAGDAGAALRLFAEMSEEGVVPNGFALAAALKACTVGSDLgftpqvhAQAVKLEGLL-------DPY 197
Cdd:PLN03218   471 DCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQV-------AKAFGAYGIMrsknvkpDRV 543
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757  198 VSSSLVEAyvsCGEVDVAERA---LLD-----SPVRSD-VSWNALLNEYARDGDYAKVMLVFDKLVESGDEISKYTLPTV 268
Cdd:PLN03218   544 VFNALISA---CGQSGAVDRAfdvLAEmkaetHPIDPDhITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIA 620
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757  269 LKCCMELGLAKSGQAVHGLVIKRGLETDRVLNNCLIEM----------------------------YSK----CLSAED- 315
Cdd:PLN03218   621 VNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVaghagdldkafeilqdarkqgiklgtvsYSSlmgaCSNAKNw 700
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757  316 --AYEVFARID----EPDVVHCSLMISCFDRHDMAPEAFDIFMQMSDMGVKPNQYTFVGLAIVASRTGDVNLCRSIHAHI 389
Cdd:PLN03218   701 kkALELYEDIKsiklRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQA 780
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757  390 VKSGFSRTKGVCDAIVGM----YVKTGAVQDAILAFDLMQgPDIAS-WNTLlsgfysgnncehGLRIFKELICEGVLANK 464
Cdd:PLN03218   781 KEDGIKPNLVMCRCITGLclrrFEKACALGEPVVSFDSGR-PQIENkWTSW------------ALMVYRETISAGTLPTM 847
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|
gi 1002231757  465 YTYVGILRC----CTSLMDLRF---------GCQVHA-CVLKSGFqGDYD 500
Cdd:PLN03218   848 EVLSQVLGClqlpHDATLRNRLienlgisadSQKQSNlSTLVDGF-GEYD 896
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
326-366 6.16e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.89  E-value: 6.16e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 1002231757 326 PDVVHCSLMISCFDRHDMAPEAFDIFMQMSDMGVKPNQYTF 366
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTY 41
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
427-474 1.26e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.12  E-value: 1.26e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1002231757 427 PDIASWNTLLSGFYSGNNCEHGLRIFKELICEGVLANKYTYVGILRCC 474
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PLN03218 PLN03218
maturation of RBCL 1; Provisional
504-732 1.28e-05

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 49.11  E-value: 1.28e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757  504 MLLDMYVQAGCFTNARLVFDRLKER----DVFSWTVVMSTYAKTDEGEKAIECFRSMLRENKRPNdatlatslsvcsdla 579
Cdd:PLN03218   442 MLMSVCASSQDIDGALRVLRLVQEAglkaDCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEAN--------------- 506
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757  580 clgsglqLHSYtiksgwnssvvsSALVDMYVKCGNLADA----EMLFDESDTHDLVEWNTIICGYAQHGHGYKALEAFQE 655
Cdd:PLN03218   507 -------VHTF------------GALIDGCARAGQVAKAfgayGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAE 567
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757  656 MIDEGN--VPDEITFVGVLSACSHAGLLDEGRRYFKLLSSvYGITPTLEHYACMVDILAKAGKLAEAESLINEMP---LT 730
Cdd:PLN03218   568 MKAETHpiDPDHITVGALMKACANAGQVDRAKEVYQMIHE-YNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKkkgVK 646

                   ..
gi 1002231757  731 PD 732
Cdd:PLN03218   647 PD 648
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
540-789 1.54e-05

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 47.42  E-value: 1.54e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 540 YAKTDEGEKAIECFRSMLRENkrPNDATLATSLSVC------SDLAclgsgLQLHSYTIKSGWNSSVVSSALVDMYVKCG 613
Cdd:COG2956    18 YLLNGQPDKAIDLLEEALELD--PETVEAHLALGNLyrrrgeYDRA-----IRIHQKLLERDPDRAEALLELAQDYLKAG 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 614 NLADAEMLFD---ESDTHDLVEWNTIICGYAQHGHGYKALEAFQEMIDEGnvPDEITFVGVLSAC-SHAGLLDEGRRYF- 688
Cdd:COG2956    91 LLDRAEELLEkllELDPDDAEALRLLAEIYEQEGDWEKAIEVLERLLKLG--PENAHAYCELAELyLEQGDYDEAIEALe 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 689 KLLSSVYGITPTLEHYAcmvDILAKAGKLAEAESLINEMP-LTPDASLWKTILGACRMH-GNIEIAERAAEKLFESQPDD 766
Cdd:COG2956   169 KALKLDPDCARALLLLA---ELYLEQGDYEEAIAALERALeQDPDYLPALPRLAELYEKlGDPEEALELLRKALELDPSD 245
                         250       260
                  ....*....|....*....|...
gi 1002231757 767 iSSCILLSNIYADLKRWNDVAKL 789
Cdd:COG2956   246 -DLLLALADLLERKEGLEAALAL 267
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
124-160 3.79e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 41.58  E-value: 3.79e-05
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 1002231757 124 RDVVAWTAMVSAITAAGDAGAALRLFAEMSEEGVVPN 160
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPN 37
BepA COG4783
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ...
704-783 7.43e-05

Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443813 [Multi-domain]  Cd Length: 139  Bit Score: 43.64  E-value: 7.43e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002231757 704 YACMVDILAKAGKLAEAESLINE-MPLTPDASLWKTILG-ACRMHGNIEIAERAAEKLFESQPDDISSCILLSNIYADLK 781
Cdd:COG4783    41 FALLGEILLQLGDLDEAIVLLHEaLELDPDEPEARLNLGlALLKAGDYDEALALLEKALKLDPEHPEAYLRLARAYRALG 120

                  ..
gi 1002231757 782 RW 783
Cdd:COG4783   121 RP 122
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
528-567 9.45e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 40.42  E-value: 9.45e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|
gi 1002231757 528 RDVFSWTVVMSTYAKTDEGEKAIECFRSMLRENKRPNDAT 567
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYT 40
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
633-665 1.10e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.13  E-value: 1.10e-04
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1002231757 633 WNTIICGYAQHGHGYKALEAFQEMIDEGNVPDE 665
Cdd:TIGR00756   3 YNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
633-660 3.79e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.22  E-value: 3.79e-04
                          10        20
                  ....*....|....*....|....*...
gi 1002231757 633 WNTIICGYAQHGHGYKALEAFQEMIDEG 660
Cdd:pfam01535   3 YNSLISGYCKNGKLEEALELFKEMKEKG 30
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
90-121 5.35e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 38.10  E-value: 5.35e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1002231757  90 GPRPDAFLHDSLLNMYCKCGRLADARSVFDGM 121
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
228-272 3.79e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 36.19  E-value: 3.79e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 1002231757 228 VSWNALLNEYARDGDYAKVMLVFDKLVESGDEISKYTLPTVLKCC 272
Cdd:pfam13041   4 VTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
127-160 5.48e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.12  E-value: 5.48e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1002231757 127 VAWTAMVSAITAAGDAGAALRLFAEMSEEGVVPN 160
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
125-171 8.12e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 35.41  E-value: 8.12e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 1002231757 125 DVVAWTAMVSAITAAGDAGAALRLFAEMSEEGVVPNGFALAAALKAC 171
Cdd:pfam13812  14 NVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
228-257 8.15e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 34.75  E-value: 8.15e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 1002231757 228 VSWNALLNEYARDGDYAKVMLVFDKLVESG 257
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH