NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2217391889|ref|XP_047298014|]
View 

host cell factor 1 isoform X11 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
25-344 7.86e-22

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


:

Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 97.53  E-value: 7.86e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889   25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldidtltwNKPSLsgvaPLPRS 256
Cdd:COG3055    143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 2217391889  337 WSG--RDGYR 344
Cdd:COG3055    257 IGGetKPGVR 266
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1062-1554 1.14e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1062 GANHQRDARRACAAGTPAVIRISVATGALEAAQGSKSQCQTRQTSATSTTMTVMATGAPCSAGPLLG----PSMAREPGG 1137
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpPPPPPTPEP 2710
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1138 RSPAFVQLAPL---SSKVRLSSPSIKDLPAGRHSHA-----VSTAAMTRSSVGAGEPRMAPVCESLQGGSPSTTVTVTAL 1209
Cdd:PHA03247  2711 APHALVSATPLppgPAAARQASPALPAAPAPPAVPAgpatpGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1210 EALLCPSATVTQVCSNPPCETHETGTTNTATTSNAG----SAQRVCSNPPCETHETGTTHTATTATSNGGTGQPEGGQQP 1285
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplppPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1286 PAGRPcethqttstgttmsvsvgallpdATSSHRTVESglevAAAPSVTPQAGTALLAPFPTQRvcsnPPCETHETGTTH 1365
Cdd:PHA03247  2871 PAAKP-----------------------AAPARPPVRR----LARPAVSRSTESFALPPDQPER----PPQPQAPPPPQP 2919
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1366 TATTVTSNMSSNQDPPPAASDQGEVESTQGDSVNITSSSAITTTVSSTLTRAVTTVTQSTPVPGPSVPKISSMTETAPRA 1445
Cdd:PHA03247  2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1446 LTTEVPIPAkiTVTIANTETSDMPFSAVDILQPPEELQvspgprqqlpprqllQSASTALMGESAEVLSASQTPELPAAV 1525
Cdd:PHA03247  3000 SLSRVSSWA--SSLALHEETDPPPVSLKQTLWPPDDTE---------------DSDADSLFDSDSERSDLEALDPLPPEP 3062
                          490       500
                   ....*....|....*....|....*....
gi 2217391889 1526 DLSSTGEPSSGQESAGSAVVATVVVQPPP 1554
Cdd:PHA03247  3063 HDPFAHEPDPATPEAGARESPSSQFGPPP 3091
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1838-1867 2.48e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.02  E-value: 2.48e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2217391889 1838 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1867
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1873-1978 6.03e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 37.86  E-value: 6.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1873 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVylaiqssqaggELKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 1951
Cdd:cd00063      1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVV-----------EYREKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
                           90       100
                   ....*....|....*....|....*..
gi 2217391889 1952 HIDYTtkpaiiFRIAARNEKGYGPATQ 1978
Cdd:cd00063     67 GTEYE------FRVRAVNGGGESPPSE 87
PHA03255 super family cl31530
BDLF3; Provisional
617-757 7.64e-03

BDLF3; Provisional


The actual alignment was detected with superfamily member PHA03255:

Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 40.27  E-value: 7.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  617 VMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGTILKLVTSADGKPTTI-ITTTQASGAGTKPTILGISSVSPSTT 695
Cdd:PHA03255    14 MILICETSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSApITTTAILSTNTTTVTSTGTTVTPVPT 93
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217391889  696 KPGTTTIIKTIPMSAIITQAGATGVTSSPGIKSPITIITTKVmTSGTGAPAKIITAVPKIAT 757
Cdd:PHA03255    94 TSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSST-TSATTRITNATTLAPTLSS 154
 
Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
25-344 7.86e-22

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 97.53  E-value: 7.86e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889   25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldidtltwNKPSLsgvaPLPRS 256
Cdd:COG3055    143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 2217391889  337 WSG--RDGYR 344
Cdd:COG3055    257 IGGetKPGVR 266
PLN02193 PLN02193
nitrile-specifier protein
27-322 4.94e-16

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 83.08  E-value: 4.94e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889   27 GPVPRPRHGHRAVAIKelIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGFVCDGTRLLVFGGMVE 102
Cdd:PLN02193   162 GPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLGVRMVSIGSTLYVFGGRDA 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  103 YGKYsNDLYELQASRWEWKRLkakTP-KNGppPCPRLGHSFSLVGNKCYLFGGLANDSEdpknniPRYLNDLYILELRpg 181
Cdd:PLN02193   240 SRQY-NGFYSFDTTTNEWKLL---TPvEEG--PTPRSFHSMAADEENVYVFGGVSATAR------LKTLDSYNIVDKK-- 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  182 sgvvaWDIPITygvlppPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIDTLTWNKPSLSGVAPLPRSLHSAT 261
Cdd:PLN02193   306 -----WFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWTQVETFGVRPSERSVFASA 374
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217391889  262 TIGNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRAR 322
Cdd:PLN02193   375 AVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSR 430
Kelch_1 pfam01344
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
32-69 7.75e-05

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 41.83  E-value: 7.75e-05
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2217391889   32 PRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQW 69
Cdd:pfam01344    1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
PHA03247 PHA03247
large tegument protein UL36; Provisional
1062-1554 1.14e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1062 GANHQRDARRACAAGTPAVIRISVATGALEAAQGSKSQCQTRQTSATSTTMTVMATGAPCSAGPLLG----PSMAREPGG 1137
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpPPPPPTPEP 2710
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1138 RSPAFVQLAPL---SSKVRLSSPSIKDLPAGRHSHA-----VSTAAMTRSSVGAGEPRMAPVCESLQGGSPSTTVTVTAL 1209
Cdd:PHA03247  2711 APHALVSATPLppgPAAARQASPALPAAPAPPAVPAgpatpGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1210 EALLCPSATVTQVCSNPPCETHETGTTNTATTSNAG----SAQRVCSNPPCETHETGTTHTATTATSNGGTGQPEGGQQP 1285
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplppPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1286 PAGRPcethqttstgttmsvsvgallpdATSSHRTVESglevAAAPSVTPQAGTALLAPFPTQRvcsnPPCETHETGTTH 1365
Cdd:PHA03247  2871 PAAKP-----------------------AAPARPPVRR----LARPAVSRSTESFALPPDQPER----PPQPQAPPPPQP 2919
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1366 TATTVTSNMSSNQDPPPAASDQGEVESTQGDSVNITSSSAITTTVSSTLTRAVTTVTQSTPVPGPSVPKISSMTETAPRA 1445
Cdd:PHA03247  2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1446 LTTEVPIPAkiTVTIANTETSDMPFSAVDILQPPEELQvspgprqqlpprqllQSASTALMGESAEVLSASQTPELPAAV 1525
Cdd:PHA03247  3000 SLSRVSSWA--SSLALHEETDPPPVSLKQTLWPPDDTE---------------DSDADSLFDSDSERSDLEALDPLPPEP 3062
                          490       500
                   ....*....|....*....|....*....
gi 2217391889 1526 DLSSTGEPSSGQESAGSAVVATVVVQPPP 1554
Cdd:PHA03247  3063 HDPFAHEPDPATPEAGARESPSSQFGPPP 3091
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1838-1867 2.48e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.02  E-value: 2.48e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2217391889 1838 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1867
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1873-1978 6.03e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 37.86  E-value: 6.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1873 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVylaiqssqaggELKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 1951
Cdd:cd00063      1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVV-----------EYREKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
                           90       100
                   ....*....|....*....|....*..
gi 2217391889 1952 HIDYTtkpaiiFRIAARNEKGYGPATQ 1978
Cdd:cd00063     67 GTEYE------FRVRAVNGGGESPPSE 87
PHA03255 PHA03255
BDLF3; Provisional
617-757 7.64e-03

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 40.27  E-value: 7.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  617 VMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGTILKLVTSADGKPTTI-ITTTQASGAGTKPTILGISSVSPSTT 695
Cdd:PHA03255    14 MILICETSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSApITTTAILSTNTTTVTSTGTTVTPVPT 93
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217391889  696 KPGTTTIIKTIPMSAIITQAGATGVTSSPGIKSPITIITTKVmTSGTGAPAKIITAVPKIAT 757
Cdd:PHA03255    94 TSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSST-TSATTRITNATTLAPTLSS 154
 
Name Accession Description Interval E-value
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
25-344 7.86e-22

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 97.53  E-value: 7.86e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889   25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055      3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055     79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldidtltwNKPSLsgvaPLPRS 256
Cdd:COG3055    143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055    199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
                          330
                   ....*....|
gi 2217391889  337 WSG--RDGYR 344
Cdd:COG3055    257 IGGetKPGVR 266
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
76-345 3.00e-17

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 84.05  E-value: 3.00e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889   76 GDIP-PGCAAYGFVCDGtRLLVFGGMvEYGKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLV-GNKCYLFG 153
Cdd:COG3055      7 PDLPtPRSEAAAALLDG-KVYVAGGL-SGGSASNSFEVYDPATNTWSEL-------APLPGPPRHHAAAVAqDGKLYVFG 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  154 GLandseDPKNNIPRYLNDLYILELRPGSgvvaWdipITYGVLPPPRESHTAVVYtekDNKKskLVIYGGMSGCRLGDLW 233
Cdd:COG3055     78 GF-----TGANPSSTPLNDVYVYDPATNT----W---TKLAPMPTPRGGATALLL---DGKI--YVVGGWDDGGNVAWVE 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  234 TLDIDTLTWNKPslsGVAPLPRSLHSATTIGN-KMYVFGGwvplvmddVKVATHEKEWkctntlaclnldtmawetilmd 312
Cdd:COG3055    141 VYDPATGTWTQL---APLPTPRDHLAAAVLPDgKILVIGG--------RNGSGFSNTW---------------------- 187
                          250       260       270
                   ....*....|....*....|....*....|...
gi 2217391889  313 TLEDNIPRARAGHCAVAINTRLYIWSGRDGYRK 345
Cdd:COG3055    188 TTLAPLPTARAGHAAAVLGGKILVFGGESGFSD 220
PLN02193 PLN02193
nitrile-specifier protein
27-322 4.94e-16

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 83.08  E-value: 4.94e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889   27 GPVPRPRHGHRAVAIKelIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGFVCDGTRLLVFGGMVE 102
Cdd:PLN02193   162 GPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLGVRMVSIGSTLYVFGGRDA 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  103 YGKYsNDLYELQASRWEWKRLkakTP-KNGppPCPRLGHSFSLVGNKCYLFGGLANDSEdpknniPRYLNDLYILELRpg 181
Cdd:PLN02193   240 SRQY-NGFYSFDTTTNEWKLL---TPvEEG--PTPRSFHSMAADEENVYVFGGVSATAR------LKTLDSYNIVDKK-- 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  182 sgvvaWDIPITygvlppPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIDTLTWNKPSLSGVAPLPRSLHSAT 261
Cdd:PLN02193   306 -----WFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWTQVETFGVRPSERSVFASA 374
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217391889  262 TIGNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRAR 322
Cdd:PLN02193   375 AVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSR 430
PLN02153 PLN02153
epithiospecifier protein
12-330 3.12e-15

epithiospecifier protein


Pssm-ID: 177814 [Multi-domain]  Cd Length: 341  Bit Score: 79.26  E-value: 3.12e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889   12 AVLLQPRWKRVVGWSGPVPRPRHGHRAVAIKELIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGF 87
Cdd:PLN02153     2 APTLQGGWIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGElkpNEHIDKDLYVFDFNTHTWSIAPANGDVPRiSCLGVRM 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889   88 VCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKRLKAKTPKNGPPPcpRLGHSFSLVGNKCYLFGGLANDS-------- 159
Cdd:PLN02153    82 VAVGTKLYIFGGRDEKREF-SDFYSYDTVKNEWTFLTKLDEEGGPEA--RTFHSMASDENHVYVFGGVSKGGlmktperf 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  160 ----------------EDPKNNiprylndlyiLELRPGSG--VVAWDIPITYGVLpppreshTAVVYTEKDNKKSKLVIY 221
Cdd:PLN02153   159 rtieayniadgkwvqlPDPGEN----------FEKRGGAGfaVVQGKIWVVYGFA-------TSILPGGKSDYESNAVQF 221
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  222 ggmsgcrlgdlwtLDIDTLTWNKPSLSGVAPLPRSLHSATTIGNKMYVFGGWvplVMDDVKvaTHEKEWKCTNTLACLNL 301
Cdd:PLN02153   222 -------------FDPASGKWTEVETTGAKPSARSVFAHAVVGKYIIIFGGE---VWPDLK--GHLGPGTLSNEGYALDT 283
                          330       340
                   ....*....|....*....|....*....
gi 2217391889  302 DTMAWETiLMDTLEDNIPRARAGHCAVAI 330
Cdd:PLN02153   284 ETLVWEK-LGECGEPAMPRGWTAYTTATV 311
PLN02193 PLN02193
nitrile-specifier protein
126-272 3.27e-08

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 58.43  E-value: 3.27e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  126 KTPKNGPPPCPRLGHSFSLVGNKCYLFGGlandSEDPKNNIPRYlndLYILELRPGSgvvaWDIPITYGVLPppresHTA 205
Cdd:PLN02193   155 KVEQKGEGPGLRCSHGIAQVGNKIYSFGG----EFTPNQPIDKH---LYVFDLETRT----WSISPATGDVP-----HLS 218
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217391889  206 VVYTEKDNKKSKLVIYGGMSGCR-LGDLWTLDIDTLTWNKPSLSGVAPLPRSLHSATTIGNKMYVFGG 272
Cdd:PLN02193   219 CLGVRMVSIGSTLYVFGGRDASRqYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVYVFGG 286
PRK14131 PRK14131
N-acetylneuraminate epimerase;
18-99 1.73e-05

N-acetylneuraminate epimerase;


Pssm-ID: 237617 [Multi-domain]  Cd Length: 376  Bit Score: 49.24  E-value: 1.73e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889   18 RWKRVVGWSGPvprPRHGHRAVAIKELIVVFGG----GNEG---IVDELHVYNTATNQWFIPAVRGdiPPGCA-AYGFVC 89
Cdd:PRK14131    63 GWTKIAAFPGG---PREQAVAAFIDGKLYVFGGigktNSEGspqVFDDVYKYDPKTNSWQKLDTRS--PVGLAgHVAVSL 137
                           90
                   ....*....|
gi 2217391889   90 DGTRLLVFGG 99
Cdd:PRK14131   138 HNGKAYITGG 147
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
27-114 4.33e-05

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 47.46  E-value: 4.33e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889   27 GPVPRPRHGHRAVAIKELIVVFGGGNeGIVDELHVYNTATNQWFipaVRGDIPPGCAAYGFVCDGTRLLVFGGMVEYGKY 106
Cdd:COG3055    191 APLPTARAGHAAAVLGGKILVFGGES-GFSDEVEAYDPATNTWT---ALGELPTPRHGHAAVLTDGKVYVIGGETKPGVR 266

                   ....*...
gi 2217391889  107 SNDLYELQ 114
Cdd:COG3055    267 TPLVTSAE 274
PLN02193 PLN02193
nitrile-specifier protein
242-393 7.47e-05

nitrile-specifier protein


Pssm-ID: 177844 [Multi-domain]  Cd Length: 470  Bit Score: 47.64  E-value: 7.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  242 WNKPSLSGVAPLPRSLHSATTIGNKMYVFGG-WVPlvmdDVKVATHekewkctntLACLNLDTMAWEtilMDTLEDNIPR 320
Cdd:PLN02193   153 WIKVEQKGEGPGLRCSHGIAQVGNKIYSFGGeFTP----NQPIDKH---------LYVFDLETRTWS---ISPATGDVPH 216
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  321 ARA-GHCAVAINTRLYIWSGRDGYRK--AWNNQVCCKDLWYLET--EKPPPPARVQLVRANTNSLEVSWGAVATA----- 390
Cdd:PLN02193   217 LSClGVRMVSIGSTLYVFGGRDASRQynGFYSFDTTTNEWKLLTpvEEGPTPRSFHSMAADEENVYVFGGVSATArlktl 296

                   ...
gi 2217391889  391 DSY 393
Cdd:PLN02193   297 DSY 299
Kelch_1 pfam01344
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
32-69 7.75e-05

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 41.83  E-value: 7.75e-05
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2217391889   32 PRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQW 69
Cdd:pfam01344    1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
Kelch_3 pfam13415
Galactose oxidase, central domain;
264-330 1.12e-04

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 41.51  E-value: 1.12e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217391889  264 GNKMYVFGGWVPLVMDdvkvathekewkCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAI 330
Cdd:pfam13415    1 GDKLYIFGGLGFDGQT------------RLNDLYVYDLDTNTWTQI------GDLPPPRSGHSATYI 49
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
241-348 1.51e-04

N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 45.92  E-value: 1.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  241 TWNK-PSLsgvaPLPRSLHSATTIGNKMYVFGGWvplvmddvkvatheKEWKCTNTLACLNLDTMAWETIlmdtleDNIP 319
Cdd:COG3055      2 TWSSlPDL----PTPRSEAAAALLDGKVYVAGGL--------------SGGSASNSFEVYDPATNTWSEL------APLP 57
                           90       100       110
                   ....*....|....*....|....*....|
gi 2217391889  320 RARAGH-CAVAINTRLYIWSGRDGYRKAWN 348
Cdd:COG3055     58 GPPRHHaAAVAQDGKLYVFGGFTGANPSST 87
Kelch_3 pfam13415
Galactose oxidase, central domain;
215-263 3.74e-04

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 39.97  E-value: 3.74e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2217391889  215 KSKLVIYGG---MSGCRLGDLWTLDIDTLTWNKPslsGVAPLPRSLHSATTI 263
Cdd:pfam13415    1 GDKLYIFGGlgfDGQTRLNDLYVYDLDTNTWTQI---GDLPPPRSGHSATYI 49
PHA03247 PHA03247
large tegument protein UL36; Provisional
1062-1554 1.14e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1062 GANHQRDARRACAAGTPAVIRISVATGALEAAQGSKSQCQTRQTSATSTTMTVMATGAPCSAGPLLG----PSMAREPGG 1137
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpPPPPPTPEP 2710
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1138 RSPAFVQLAPL---SSKVRLSSPSIKDLPAGRHSHA-----VSTAAMTRSSVGAGEPRMAPVCESLQGGSPSTTVTVTAL 1209
Cdd:PHA03247  2711 APHALVSATPLppgPAAARQASPALPAAPAPPAVPAgpatpGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1210 EALLCPSATVTQVCSNPPCETHETGTTNTATTSNAG----SAQRVCSNPPCETHETGTTHTATTATSNGGTGQPEGGQQP 1285
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplppPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1286 PAGRPcethqttstgttmsvsvgallpdATSSHRTVESglevAAAPSVTPQAGTALLAPFPTQRvcsnPPCETHETGTTH 1365
Cdd:PHA03247  2871 PAAKP-----------------------AAPARPPVRR----LARPAVSRSTESFALPPDQPER----PPQPQAPPPPQP 2919
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1366 TATTVTSNMSSNQDPPPAASDQGEVESTQGDSVNITSSSAITTTVSSTLTRAVTTVTQSTPVPGPSVPKISSMTETAPRA 1445
Cdd:PHA03247  2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1446 LTTEVPIPAkiTVTIANTETSDMPFSAVDILQPPEELQvspgprqqlpprqllQSASTALMGESAEVLSASQTPELPAAV 1525
Cdd:PHA03247  3000 SLSRVSSWA--SSLALHEETDPPPVSLKQTLWPPDDTE---------------DSDADSLFDSDSERSDLEALDPLPPEP 3062
                          490       500
                   ....*....|....*....|....*....
gi 2217391889 1526 DLSSTGEPSSGQESAGSAVVATVVVQPPP 1554
Cdd:PHA03247  3063 HDPFAHEPDPATPEAGARESPSSQFGPPP 3091
Kelch_4 pfam13418
Galactose oxidase, central domain;
32-80 1.32e-03

Galactose oxidase, central domain;


Pssm-ID: 433191 [Multi-domain]  Cd Length: 49  Bit Score: 38.36  E-value: 1.32e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2217391889   32 PRHGHRAVAIKE-LIVVFGG--GNEGIVDELHVYNTATNQWfipAVRGDIPP 80
Cdd:pfam13418    1 PRAYHTSTSIPDdTIYLFGGegEDGTLLSDLWVFDLSTNEW---TRLGSLPS 49
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1838-1867 2.48e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 39.02  E-value: 2.48e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2217391889 1838 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1867
Cdd:cd00063     64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
Kelch_5 pfam13854
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
30-67 2.58e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 433528 [Multi-domain]  Cd Length: 41  Bit Score: 37.54  E-value: 2.58e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2217391889   30 PRPRHGHRAVAIKELIVVFGG---GNEGIVDELHVYNTATN 67
Cdd:pfam13854    1 PVPRYGHCAVTVGDYIYLYGGytgGEGQPSDDVYVLSLPTF 41
Kelch_5 pfam13854
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
319-351 2.76e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 433528 [Multi-domain]  Cd Length: 41  Bit Score: 37.16  E-value: 2.76e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 2217391889  319 PRARAGHCAVAINTRLYIWSGRDGYRKAWNNQV 351
Cdd:pfam13854    1 PVPRYGHCAVTVGDYIYLYGGYTGGEGQPSDDV 33
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1873-1978 6.03e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 37.86  E-value: 6.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889 1873 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVylaiqssqaggELKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 1951
Cdd:cd00063      1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVV-----------EYREKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
                           90       100
                   ....*....|....*....|....*..
gi 2217391889 1952 HIDYTtkpaiiFRIAARNEKGYGPATQ 1978
Cdd:cd00063     67 GTEYE------FRVRAVNGGGESPPSE 87
PHA03255 PHA03255
BDLF3; Provisional
617-757 7.64e-03

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 40.27  E-value: 7.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217391889  617 VMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGTILKLVTSADGKPTTI-ITTTQASGAGTKPTILGISSVSPSTT 695
Cdd:PHA03255    14 MILICETSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSApITTTAILSTNTTTVTSTGTTVTPVPT 93
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217391889  696 KPGTTTIIKTIPMSAIITQAGATGVTSSPGIKSPITIITTKVmTSGTGAPAKIITAVPKIAT 757
Cdd:PHA03255    94 TSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSST-TSATTRITNATTLAPTLSS 154
Kelch_3 pfam13415
Galactose oxidase, central domain;
146-208 9.19e-03

Galactose oxidase, central domain;


Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 36.11  E-value: 9.19e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217391889  146 GNKCYLFGGLANDSEDpknniprYLNDLYilELRPGSGVVAwdipiTYGVLPPPRESHTAVVY 208
Cdd:pfam13415    1 GDKLYIFGGLGFDGQT-------RLNDLY--VYDLDTNTWT-----QIGDLPPPRSGHSATYI 49
Kelch_1 pfam01344
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ...
254-273 9.36e-03

Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.


Pssm-ID: 396078 [Multi-domain]  Cd Length: 46  Bit Score: 36.05  E-value: 9.36e-03
                           10        20
                   ....*....|....*....|
gi 2217391889  254 PRSLHSATTIGNKMYVFGGW 273
Cdd:pfam01344    1 RRSGAGVVVVGGKIYVIGGF 20
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH