Conserved domains on  [gi|2217391915|ref|XP_047298017|]
host cell factor 1 isoform X15 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
Kelch_5 pfam13854
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. Swiss:Q04652 has been shown to be related to galactose oxidase, for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
16-48 2.43e-03

Pssm-ID: 433528 [Multi-domain]  Cd Length: 41  Bit Score: 37.16  E-value: 2.43e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 2217391915   16 PRARAGHCAVAINTRLYIWSGRDGYRKAWNNQV 48
Cdd:pfam13854    1 PVPRYGHCAVTVGDYIYLYGGYTGGEGQPSDDV 33
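
As a rough sanity check on a short hit like this one, the two aligned rows can be compared column by column. The sketch below counts identical residues in the Kelch_5 alignment shown above. The function name `count_identities` is hypothetical, and the strings are copied from the alignment rows. This is only an illustration: CD-Search scores hits against a position-specific scoring matrix, not by simple identity.

```python
def count_identities(query_row: str, model_row: str) -> tuple[int, int]:
    """Return (identical columns, compared columns) for two alignment rows."""
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same number of columns")
    identical = 0
    compared = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-":
            continue  # gap columns are not compared
        compared += 1
        if q.upper() == m.upper():
            identical += 1
    return identical, compared


# Rows copied from the Kelch_5 (pfam13854) alignment, query residues 16-48.
query_row = "PRARAGHCAVAINTRLYIWSGRDGYRKAWNNQV"
model_row = "PVPRYGHCAVTVGDYIYLYGGYTGGEGQPSDDV"

identical, compared = count_identities(query_row, model_row)
print(f"{identical}/{compared} identical columns")
```
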
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between two strands. Approximately 2% of all animal proteins contain the FN3 repeat, including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
1601-1630 3.49e-03

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 38.63  E-value: 3.49e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2217391915 1601 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1630
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
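
The "Cd Length" field and the Cdd row coordinates together show how much of each domain model an alignment spans. The sketch below computes that fraction for the two hits above; `model_coverage` is a hypothetical helper, and the coordinates are read from the Cdd rows and Cd Length values. Low coverage, as for FN3 here (only the last 30 of 93 model positions), suggests a partial match rather than a complete domain.

```python
def model_coverage(model_from: int, model_to: int, model_length: int) -> float:
    """Fraction of the domain model spanned by the aligned interval."""
    if not 1 <= model_from <= model_to <= model_length:
        raise ValueError("model interval must lie within 1..model_length")
    return (model_to - model_from + 1) / model_length


# (model start, model end, Cd Length) taken from the alignments above.
hits = {
    "pfam13854": (1, 33, 41),
    "cd00063": (64, 93, 93),
}

for accession, (start, end, length) in hits.items():
    print(f"{accession}: {model_coverage(start, end, length):.0%} of the model")
```
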
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
163-468 3.54e-03

The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.21  E-value: 3.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391915  163 TVPGSSISVPTAARTQGVPAVLKVTgPQATTGTPLVTMRPASQAGKAPVTVTSLPAgvrMVVPTQSAQGTVIGSSPQMSG 242
Cdd:pfam05109  507 TSPTSAVTTPTPNATSPTPAVTTPT-PNATSPTLGKTSPTSAVTTPTPNATSPTPA---VTTPTPNATIPTLGKTSPTSA 582
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391915  243 MaalaaaaaaTQKIPPSSAPTV-LSVPAGTTIVKTMAVTPGTTTLPATVKVASSPVM-----VSNPATRMLKTAAAQVGT 316
Cdd:pfam05109  583 V---------TTPTPNATSPTVgETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTtgqhnITSSSTSSMSLRPSSISE 653
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391915  317 SVSSAT---NTSTRPIIT-VHKSGTVTVAQQAQVVTTVVGGVTktitlvKSPISVPGGSALISNLGKvmSVVQTKPVQTS 392
Cdd:pfam05109  654 TLSPSTsdnSTSHMPLLTsAHPTGGENITQVTPASTSTHHVST------SSPAPRPGTTSQASGPGN--SSTSTKPGEVN 725
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217391915  393 AVTGQastgPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKptilgiSSVSPSTTKPGTTTIIKT 468
Cdd:pfam05109  726 VTKGT----PPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGAR------TSTEPTTDYGGDSTTPRT 791
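
In gapped alignments like the one above, each row's start and end coordinates should agree with the number of residues in that row, with dashes counted as gaps and lowercase letters still counted as residues. The sketch below checks this for the first block of the pfam05109 alignment. The helper names are hypothetical, and reading lowercase letters as residues that are present but not aligned to the model is an assumption about the display format.

```python
def residues_in_row(row: str) -> int:
    """Count residues in an alignment row, ignoring gap characters."""
    return sum(1 for ch in row if ch != "-")


def row_is_consistent(start: int, row: str, end: int) -> bool:
    """True if the row's residue count matches its start and end coordinates."""
    return start + residues_in_row(row) - 1 == end


# First block of the pfam05109 alignment, copied from the rows above.
query_row = "TVPGSSISVPTAARTQGVPAVLKVTgPQATTGTPLVTMRPASQAGKAPVTVTSLPAgvrMVVPTQSAQGTVIGSSPQMSG"
model_row = "TSPTSAVTTPTPNATSPTPAVTTPT-PNATSPTLGKTSPTSAVTTPTPNATSPTPA---VTTPTPNATIPTLGKTSPTSA"

print(row_is_consistent(163, query_row, 242))  # query coordinates
print(row_is_consistent(507, model_row, 582))  # model coordinates
```
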
PHA03247 super family cl33720
large tegument protein UL36; Provisional
825-1317 4.16e-03

The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 4.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391915  825 GANHQRDARRACAAGTPAVIRISVATGALEAAQGSKSQCQTRQTSATSTTMTVMATGAPCSAGPLLG----PSMAREPGG 900
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpPPPPPTPEP 2710
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391915  901 RSPAFVQLAPL---SSKVRLSSPSIKDLPAGRHSHA-----VSTAAMTRSSVGAGEPRMAPVCESLQGGSPSTTVTVTAL 972
Cdd:PHA03247  2711 APHALVSATPLppgPAAARQASPALPAAPAPPAVPAgpatpGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391915  973 EALLCPSATVTQVCSNPPCETHETGTTNTATTSNAG----SAQRVCSNPPCETHETGTTHTATTATSNGGTGQPEGGQQP 1048
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplppPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391915 1049 PAGRPcethqttstgttmsvsvgallpdATSSHRTVESglevAAAPSVTPQAGTALLAPFPTQRvcsnPPCETHETGTTH 1128
Cdd:PHA03247  2871 PAAKP-----------------------AAPARPPVRR----LARPAVSRSTESFALPPDQPER----PPQPQAPPPPQP 2919
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391915 1129 TATTVTSNMSSNQDPPPAASDQGEVESTQGDSVNITSSSAITTTVSSTLTRAVTTVTQSTPVPGPSVPKISSMTETAPRA 1208
Cdd:PHA03247  2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391915 1209 LTTEVPIPAkiTVTIANTETSDMPFSAVDILQPPEELQvspgprqqlpprqllQSASTALMGESAEVLSASQTPELPAAV 1288
Cdd:PHA03247  3000 SLSRVSSWA--SSLALHEETDPPPVSLKQTLWPPDDTE---------------DSDADSLFDSDSERSDLEALDPLPPEP 3062
                          490       500
                   ....*....|....*....|....*....
gi 2217391915 1289 DLSSTGEPSSGQESAGSAVVATVVVQPPP 1317
Cdd:PHA03247  3063 HDPFAHEPDPATPEAGARESPSSQFGPPP 3091
 
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  Database: CDSEARCH/cdd
  Low complexity filter: no
  Composition Based Adjustment: yes
  E-value threshold: 0.01
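
The E-value threshold decides which hits appear in the list. As a minimal sketch, the filter below applies the 0.01 cutoff to the four hits reported on this page; the tuple layout and the function name `passes_threshold` are hypothetical, and treating the cutoff as inclusive is an assumption. All four E-values fall between 1e-03 and 1e-02, so a tenfold stricter cutoff would leave no hits.

```python
E_VALUE_THRESHOLD = 0.01  # from the search parameters above

# (name, accession, E-value) as listed in the domain hits on this page.
hits = [
    ("Kelch_5", "pfam13854", 2.43e-03),
    ("FN3", "cd00063", 3.49e-03),
    ("Herpes_BLLF1", "pfam05109", 3.54e-03),
    ("PHA03247", "PHA03247", 4.16e-03),
]


def passes_threshold(e_value: float, threshold: float = E_VALUE_THRESHOLD) -> bool:
    """Assumed rule: keep a hit if its E-value does not exceed the threshold."""
    return e_value <= threshold


reported = [name for name, _, e in hits if passes_threshold(e)]
strict = [name for name, _, e in hits if passes_threshold(e, threshold=0.001)]

print("reported at 0.01:", reported)
print("reported at 0.001:", strict)
```
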
