|
Name |
Accession |
Description |
Interval |
E-value |
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
14-332 |
4.85e-20 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; :
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 90.99 E-value: 4.85e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 14 SFTGPVPRARHGHRAVAIRELMIIFGGGNEG-IADELHVYNTATNQWflpaVRGDIPPGCAAHGFVC--DGTRILVFGGM 90
Cdd:COG3055 4 SSLPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGGF 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 91 VEYGRYSN-----ELYELQASRWlwkkvkphpPPSGLPPCPRLGHSFSLYGNKCYLFGGlanesedsnNNVPRYLNDFYE 165
Cdd:COG3055 80 TGANPSSTplndvYVYDPATNTW---------TKLAPMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEV 141
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 166 LELQHGSgvvgWSipvTKGVVPSPRESHTAVIyckkdSGSPKMYVFGGMCGARLDDLWqldletmswskpETKGTVPLPR 245
Cdd:COG3055 142 YDPATGT----WT---QLAPLPTPRDHLAAAV-----LPDGKILVIGGRNGSGFSNTW------------TTLAPLPTAR 197
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 246 SLHTASVIGNKMYIFGGwvphkgentETSPHDCEWRctssfsyLNLDTTEWTTLvsdsqedkkNSRPRPRAGHCAVAIGT 325
Cdd:COG3055 198 AGHAAAVLGGKILVFGG---------ESGFSDEVEA-------YDPATNTWTAL---------GELPTPRHGHAAVLTDG 252
|
....*..
gi 7019405 326 RLYFWSG 332
Cdd:COG3055 253 KVYVIGG 259
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
618-672 |
3.25e-05 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases. :
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 43.25 E-value: 3.25e-05
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*
gi 7019405 618 PKGKQSISKVGNADVPDYSLlKKQDLVPGTGYRFRVAAINGCGIGPFSKISEFKT 672
Cdd:cd00063 40 EKGSGDWKEVEVTPGSETSY-TLTGLKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
678-778 |
6.32e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases. :
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.40 E-value: 6.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 678 PGAPSAVRISKNVEG-IHLSWEPPTSPSGNILEYSaylaIRTAQIQDNPSQLVfmRIYCGLKTSCIVTagQLaNAHIDYT 756
Cdd:cd00063 1 PSPPTNLRVTDVTSTsVTLSWTPPEDDGGPITGYV----VEYREKGSGDWKEV--EVTPGSETSYTLT--GL-KPGTEYE 71
|
90 100
....*....|....*....|..
gi 7019405 757 srpaivFRISAKNEKGYGPATQ 778
Cdd:cd00063 72 ------FRVRAVNGGGESPPSE 87
|
|
| Kelch_5 |
pfam13854 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
312-355 |
1.35e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415. :
Pssm-ID: 433528 [Multi-domain] Cd Length: 41 Bit Score: 37.16 E-value: 1.35e-03
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 7019405 312 PRPRAGHCAVAIGTRLYFWSGRDGYkkalNSQVcCKDLWYLDTE 355
Cdd:pfam13854 1 PVPRYGHCAVTVGDYIYLYGGYTGG----EGQP-SDDVYVLSLP 39
|
|
| FN3 super family |
cl27307 |
Fibronectin type 3 domain [General function prediction only]; |
535-704 |
6.92e-03 |
|
Fibronectin type 3 domain [General function prediction only]; The actual alignment was detected with superfamily member COG3401:
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 39.99 E-value: 6.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 535 STNGAVVKDETSLTTFSTKSEVDETY--ALPATKISRVETHATATPFSKETPSNPVATVKAGERQWCDVGIFKNNTALVS 612
Cdd:COG3401 82 VAVAAAPPTATGLTTLTGSGSVGGATntGLTSSDEVPSPAVGTATTATAVAGGAATAGTYALGAGLYGVDGANASGTTAS 161
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 613 Q-------FYLLPKGKQSISKVGNADVPDYSLLKKQDLVPGTGYRFRVAAINGCGIGPFSKISEFKTcIPGFPGAPSAVR 685
Cdd:COG3401 162 SvagagvvVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESAPSNEVSVTT-PTTPPSAPTGLT 240
|
170 180
....*....|....*....|
gi 7019405 686 ISKNVEG-IHLSWEPPTSPS 704
Cdd:COG3401 241 ATADTPGsVTLSWDPVTESD 260
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
14-332 |
4.85e-20 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 90.99 E-value: 4.85e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 14 SFTGPVPRARHGHRAVAIRELMIIFGGGNEG-IADELHVYNTATNQWflpaVRGDIPPGCAAHGFVC--DGTRILVFGGM 90
Cdd:COG3055 4 SSLPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGGF 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 91 VEYGRYSN-----ELYELQASRWlwkkvkphpPPSGLPPCPRLGHSFSLYGNKCYLFGGlanesedsnNNVPRYLNDFYE 165
Cdd:COG3055 80 TGANPSSTplndvYVYDPATNTW---------TKLAPMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEV 141
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 166 LELQHGSgvvgWSipvTKGVVPSPRESHTAVIyckkdSGSPKMYVFGGMCGARLDDLWqldletmswskpETKGTVPLPR 245
Cdd:COG3055 142 YDPATGT----WT---QLAPLPTPRDHLAAAV-----LPDGKILVIGGRNGSGFSNTW------------TTLAPLPTAR 197
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 246 SLHTASVIGNKMYIFGGwvphkgentETSPHDCEWRctssfsyLNLDTTEWTTLvsdsqedkkNSRPRPRAGHCAVAIGT 325
Cdd:COG3055 198 AGHAAAVLGGKILVFGG---------ESGFSDEVEA-------YDPATNTWTAL---------GELPTPRHGHAAVLTDG 252
|
....*..
gi 7019405 326 RLYFWSG 332
Cdd:COG3055 253 KVYVIGG 259
|
|
| PLN02193 |
PLN02193 |
nitrile-specifier protein |
2-262 |
5.94e-12 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 68.83 E-value: 5.94e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 2 AAPSLL-NWRRV-SSFTGPVPRARHGHRAVAIRelMIIFGGG---NEGIADELHVYNTATNQWFLPAVRGDIPP-GCAAH 75
Cdd:PLN02193 145 STPKLLgKWIKVeQKGEGPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLGV 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 76 GFVCDGTRILVFGGMvEYGRYSNELYELQASRWLWKKVkphpPPSGLPPCPRLGHSFSLYGNKCYLFGGLANESE----D 151
Cdd:PLN02193 223 RMVSIGSTLYVFGGR-DASRQYNGFYSFDTTTNEWKLL----TPVEEGPTPRSFHSMAADEENVYVFGGVSATARlktlD 297
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 152 SNNNVPRylndfyelELQHGSgvvgwsipvtkgvvpSPRESHTAVIYCKKDSGSPKMYVFGGMCGARLDDLWQLDLETMS 231
Cdd:PLN02193 298 SYNIVDK--------KWFHCS---------------TPGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDK 354
|
250 260 270
....*....|....*....|....*....|.
gi 7019405 232 WSKPETKGTVPLPRSLHTASVIGNKMYIFGG 262
Cdd:PLN02193 355 WTQVETFGVRPSERSVFASAAVGKHIVIFGG 385
|
|
| Kelch_3 |
pfam13415 |
Galactose oxidase, central domain; |
207-253 |
1.85e-06 |
|
Galactose oxidase, central domain;
Pssm-ID: 433188 [Multi-domain] Cd Length: 49 Bit Score: 45.36 E-value: 1.85e-06
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 7019405 207 KMYVFGGMC---GARLDDLWQLDLETMSWSKPetkGTVPLPRSLHTASVI 253
Cdd:pfam13415 3 KLYIFGGLGfdgQTRLNDLYVYDLDTNTWTQI---GDLPPPRSGHSATYI 49
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
618-672 |
3.25e-05 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 43.25 E-value: 3.25e-05
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*
gi 7019405 618 PKGKQSISKVGNADVPDYSLlKKQDLVPGTGYRFRVAAINGCGIGPFSKISEFKT 672
Cdd:cd00063 40 EKGSGDWKEVEVTPGSETSY-TLTGLKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
641-665 |
2.76e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 40.48 E-value: 2.76e-04
10 20
....*....|....*....|....*
gi 7019405 641 QDLVPGTGYRFRVAAINGCGIGPFS 665
Cdd:pfam00041 61 TGLKPGTEYEVRVQAVNGGGEGPPS 85
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
678-778 |
6.32e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.40 E-value: 6.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 678 PGAPSAVRISKNVEG-IHLSWEPPTSPSGNILEYSaylaIRTAQIQDNPSQLVfmRIYCGLKTSCIVTagQLaNAHIDYT 756
Cdd:cd00063 1 PSPPTNLRVTDVTSTsVTLSWTPPEDDGGPITGYV----VEYREKGSGDWKEV--EVTPGSETSYTLT--GL-KPGTEYE 71
|
90 100
....*....|....*....|..
gi 7019405 757 srpaivFRISAKNEKGYGPATQ 778
Cdd:cd00063 72 ------FRVRAVNGGGESPPSE 87
|
|
| Kelch_5 |
pfam13854 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
312-355 |
1.35e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 433528 [Multi-domain] Cd Length: 41 Bit Score: 37.16 E-value: 1.35e-03
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 7019405 312 PRPRAGHCAVAIGTRLYFWSGRDGYkkalNSQVcCKDLWYLDTE 355
Cdd:pfam13854 1 PVPRYGHCAVTVGDYIYLYGGYTGG----EGQP-SDDVYVLSLP 39
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
535-704 |
6.92e-03 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 39.99 E-value: 6.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 535 STNGAVVKDETSLTTFSTKSEVDETY--ALPATKISRVETHATATPFSKETPSNPVATVKAGERQWCDVGIFKNNTALVS 612
Cdd:COG3401 82 VAVAAAPPTATGLTTLTGSGSVGGATntGLTSSDEVPSPAVGTATTATAVAGGAATAGTYALGAGLYGVDGANASGTTAS 161
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 613 Q-------FYLLPKGKQSISKVGNADVPDYSLLKKQDLVPGTGYRFRVAAINGCGIGPFSKISEFKTcIPGFPGAPSAVR 685
Cdd:COG3401 162 SvagagvvVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESAPSNEVSVTT-PTTPPSAPTGLT 240
|
170 180
....*....|....*....|
gi 7019405 686 ISKNVEG-IHLSWEPPTSPS 704
Cdd:COG3401 241 ATADTPGsVTLSWDPVTESD 260
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
14-332 |
4.85e-20 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 90.99 E-value: 4.85e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 14 SFTGPVPRARHGHRAVAIRELMIIFGGGNEG-IADELHVYNTATNQWflpaVRGDIPPGCAAHGFVC--DGTRILVFGGM 90
Cdd:COG3055 4 SSLPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGGF 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 91 VEYGRYSN-----ELYELQASRWlwkkvkphpPPSGLPPCPRLGHSFSLYGNKCYLFGGlanesedsnNNVPRYLNDFYE 165
Cdd:COG3055 80 TGANPSSTplndvYVYDPATNTW---------TKLAPMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEV 141
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 166 LELQHGSgvvgWSipvTKGVVPSPRESHTAVIyckkdSGSPKMYVFGGMCGARLDDLWqldletmswskpETKGTVPLPR 245
Cdd:COG3055 142 YDPATGT----WT---QLAPLPTPRDHLAAAV-----LPDGKILVIGGRNGSGFSNTW------------TTLAPLPTAR 197
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 246 SLHTASVIGNKMYIFGGwvphkgentETSPHDCEWRctssfsyLNLDTTEWTTLvsdsqedkkNSRPRPRAGHCAVAIGT 325
Cdd:COG3055 198 AGHAAAVLGGKILVFGG---------ESGFSDEVEA-------YDPATNTWTAL---------GELPTPRHGHAAVLTDG 252
|
....*..
gi 7019405 326 RLYFWSG 332
Cdd:COG3055 253 KVYVIGG 259
|
|
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
9-264 |
2.02e-18 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 86.36 E-value: 2.02e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 9 WRRVSSFTGPvprARHGHRAVAIRELMIIFGG-----GNEGIADELHVYNTATNQWFlpaVRGDIPPGCAAHGFVCDGTR 83
Cdd:COG3055 50 WSELAPLPGP---PRHHAAAVAQDGKLYVFGGftganPSSTPLNDVYVYDPATNTWT---KLAPMPTPRGGATALLLDGK 123
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 84 ILVFGGMVEYGRYSN-ELYELQASRWlwkkvkphpPPSGLPPCPRLGHS-FSLYGNKCYLFGGlANESEDSNNnvpryln 161
Cdd:COG3055 124 IYVVGGWDDGGNVAWvEVYDPATGTW---------TQLAPLPTPRDHLAaAVLPDGKILVIGG-RNGSGFSNT------- 186
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 162 dfyelelqhgsgvvgWSipvTKGVVPSPRESHTAVIYckkdsgSPKMYVFGGMCGArLDDLWQLDLETMSWSkpeTKGTV 241
Cdd:COG3055 187 ---------------WT---TLAPLPTARAGHAAAVL------GGKILVFGGESGF-SDEVEAYDPATNTWT---ALGEL 238
|
250 260
....*....|....*....|...
gi 7019405 242 PLPRSLHTASVIGNKMYIFGGWV 264
Cdd:COG3055 239 PTPRHGHAAVLTDGKVYVIGGET 261
|
|
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
134-335 |
7.27e-17 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 81.74 E-value: 7.27e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 134 LYGNKCYLFGGLANESedsnnnvprYLNDFYELELQHGSgvvgWSipvTKGVVPSPRESHTAVIYCKKdsgspKMYVFGG 213
Cdd:COG3055 20 LLDGKVYVAGGLSGGS---------ASNSFEVYDPATNT----WS---ELAPLPGPPRHHAAAVAQDG-----KLYVFGG 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 214 MCGAR-----LDDLWQLDLETMSWSKpetKGTVPLPRSLHTASVIGNKMYIFGGWVPHKGentetsphdcewrcTSSFSY 288
Cdd:COG3055 79 FTGANpsstpLNDVYVYDPATNTWTK---LAPMPTPRGGATALLLDGKIYVVGGWDDGGN--------------VAWVEV 141
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 7019405 289 LNLDTTEWTTLVSDsqedkknsrPRPRAGHCA-VAIGTRLYFWSGRDG 335
Cdd:COG3055 142 YDPATGTWTQLAPL---------PTPRDHLAAaVLPDGKILVIGGRNG 180
|
|
| PLN02193 |
PLN02193 |
nitrile-specifier protein |
2-262 |
5.94e-12 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 68.83 E-value: 5.94e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 2 AAPSLL-NWRRV-SSFTGPVPRARHGHRAVAIRelMIIFGGG---NEGIADELHVYNTATNQWFLPAVRGDIPP-GCAAH 75
Cdd:PLN02193 145 STPKLLgKWIKVeQKGEGPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLGV 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 76 GFVCDGTRILVFGGMvEYGRYSNELYELQASRWLWKKVkphpPPSGLPPCPRLGHSFSLYGNKCYLFGGLANESE----D 151
Cdd:PLN02193 223 RMVSIGSTLYVFGGR-DASRQYNGFYSFDTTTNEWKLL----TPVEEGPTPRSFHSMAADEENVYVFGGVSATARlktlD 297
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 152 SNNNVPRylndfyelELQHGSgvvgwsipvtkgvvpSPRESHTAVIYCKKDSGSPKMYVFGGMCGARLDDLWQLDLETMS 231
Cdd:PLN02193 298 SYNIVDK--------KWFHCS---------------TPGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDK 354
|
250 260 270
....*....|....*....|....*....|.
gi 7019405 232 WSKPETKGTVPLPRSLHTASVIGNKMYIFGG 262
Cdd:PLN02193 355 WTQVETFGVRPSERSVFASAAVGKHIVIFGG 385
|
|
| PLN02153 |
PLN02153 |
epithiospecifier protein |
16-264 |
3.25e-10 |
|
epithiospecifier protein
Pssm-ID: 177814 [Multi-domain] Cd Length: 341 Bit Score: 62.31 E-value: 3.25e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 16 TGPVPRARHGHRAVAirELMIIFGGG---NEGIADELHVYNTATNQWFLPAVRGDIPP-GCAAHGFVCDGTRILVFGGMV 91
Cdd:PLN02153 18 KGPGPRCSHGIAVVG--DKLYSFGGElkpNEHIDKDLYVFDFNTHTWSIAPANGDVPRiSCLGVRMVAVGTKLYIFGGRD 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 92 EYGRYSN-ELYELQASRWLWkkvkPHPPPSGLPPCPRLGHSFSLYGNKCYLFGGLaneSEDSNNNVPRYLNDFYELELQH 170
Cdd:PLN02153 96 EKREFSDfYSYDTVKNEWTF----LTKLDEEGGPEARTFHSMASDENHVYVFGGV---SKGGLMKTPERFRTIEAYNIAD 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 171 GsgvvGWSipvtkgVVPSPRESHTA--------------VIYCKKDSGSPkmyvfGGMCGARLDDLWQLDLETMSWSKPE 236
Cdd:PLN02153 169 G----KWV------QLPDPGENFEKrggagfavvqgkiwVVYGFATSILP-----GGKSDYESNAVQFFDPASGKWTEVE 233
|
250 260
....*....|....*....|....*...
gi 7019405 237 TKGTVPLPRSLHTASVIGNKMYIFGGWV 264
Cdd:PLN02153 234 TTGAKPSARSVFAHAVVGKYIIIFGGEV 261
|
|
| PLN02193 |
PLN02193 |
nitrile-specifier protein |
77-262 |
5.67e-07 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 53.04 E-value: 5.67e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 77 FVCDGTRILVFggmveYGRYSNELYELQASRWL---------WKKVKPHPPPSGLppcpRLGHSFSLYGNKCYLFGGlan 147
Cdd:PLN02193 116 FVLQGGKIVGF-----HGRSTDVLHSLGAYISLpstpkllgkWIKVEQKGEGPGL----RCSHGIAQVGNKIYSFGG--- 183
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 148 eSEDSNNNVPRYLNDFyELELQhgsgvvGWSIPVTKGVVPspresHTAVIYCKKDSGSPKMYVFGGMCGAR-LDDLWQLD 226
Cdd:PLN02193 184 -EFTPNQPIDKHLYVF-DLETR------TWSISPATGDVP-----HLSCLGVRMVSIGSTLYVFGGRDASRqYNGFYSFD 250
|
170 180 190
....*....|....*....|....*....|....*...
gi 7019405 227 LETMSWS--KPETKGtvPLPRSLHTASVIGNKMYIFGG 262
Cdd:PLN02193 251 TTTNEWKllTPVEEG--PTPRSFHSMAADEENVYVFGG 286
|
|
| Kelch_3 |
pfam13415 |
Galactose oxidase, central domain; |
207-253 |
1.85e-06 |
|
Galactose oxidase, central domain;
Pssm-ID: 433188 [Multi-domain] Cd Length: 49 Bit Score: 45.36 E-value: 1.85e-06
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 7019405 207 KMYVFGGMC---GARLDDLWQLDLETMSWSKPetkGTVPLPRSLHTASVI 253
Cdd:pfam13415 3 KLYIFGGLGfdgQTRLNDLYVYDLDTNTWTQI---GDLPPPRSGHSATYI 49
|
|
| PLN02193 |
PLN02193 |
nitrile-specifier protein |
232-375 |
4.45e-06 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 49.95 E-value: 4.45e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 232 WSKPETKGTVPLPRSLHTASVIGNKMYIFGgwvphkGENTETSPHDcewrctSSFSYLNLDTTEWTtlVSDSQEDKKNSR 311
Cdd:PLN02193 153 WIKVEQKGEGPGLRCSHGIAQVGNKIYSFG------GEFTPNQPID------KHLYVFDLETRTWS--ISPATGDVPHLS 218
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 7019405 312 PrprAGHCAVAIGTRLYFWSGRDGYKK--ALNSQVCCKDLWYLDT--EKPPAPSQVQLIKATTNSFHV 375
Cdd:PLN02193 219 C---LGVRMVSIGSTLYVFGGRDASRQynGFYSFDTTTNEWKLLTpvEEGPTPRSFHSMAADEENVYV 283
|
|
| PRK14131 |
PRK14131 |
N-acetylneuraminate epimerase; |
9-89 |
6.60e-06 |
|
N-acetylneuraminate epimerase;
Pssm-ID: 237617 [Multi-domain] Cd Length: 376 Bit Score: 49.24 E-value: 6.60e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 9 WRRVSSFTGPvprARHGHRAVAIRELMIIFGG----GNEG---IADELHVYNTATNQWFLPAVRGdiPPGCAAH-GFVCD 80
Cdd:PRK14131 64 WTKIAAFPGG---PREQAVAAFIDGKLYVFGGigktNSEGspqVFDDVYKYDPKTNSWQKLDTRS--PVGLAGHvAVSLH 138
|
....*....
gi 7019405 81 GTRILVFGG 89
Cdd:PRK14131 139 NGKAYITGG 147
|
|
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
8-104 |
7.08e-06 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 48.61 E-value: 7.08e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 8 NWRRVSSFtgPVPRARHGHRAVAIRelmIIFGGGNEGIADELHVYNTATNQWFlpaVRGDIPPGCAAHGFVCDGTRILVF 87
Cdd:COG3055 186 TWTTLAPL--PTARAGHAAAVLGGK---ILVFGGESGFSDEVEAYDPATNTWT---ALGELPTPRHGHAAVLTDGKVYVI 257
|
90
....*....|....*..
gi 7019405 88 GGMVEYGRYSNELYELQ 104
Cdd:COG3055 258 GGETKPGVRTPLVTSAE 274
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
618-672 |
3.25e-05 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 43.25 E-value: 3.25e-05
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*
gi 7019405 618 PKGKQSISKVGNADVPDYSLlKKQDLVPGTGYRFRVAAINGCGIGPFSKISEFKT 672
Cdd:cd00063 40 EKGSGDWKEVEVTPGSETSY-TLTGLKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
641-665 |
2.76e-04 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 40.48 E-value: 2.76e-04
10 20
....*....|....*....|....*
gi 7019405 641 QDLVPGTGYRFRVAAINGCGIGPFS 665
Cdd:pfam00041 61 TGLKPGTEYEVRVQAVNGGGEGPPS 85
|
|
| Kelch_3 |
pfam13415 |
Galactose oxidase, central domain; |
254-323 |
3.03e-04 |
|
Galactose oxidase, central domain;
Pssm-ID: 433188 [Multi-domain] Cd Length: 49 Bit Score: 39.20 E-value: 3.03e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 254 GNKMYIFGGWVPHKGEntetsphdcewrCTSSFSYLNLDTTEWTTLvsdsqedkkNSRPRPRAGHCAVAI 323
Cdd:pfam13415 1 GDKLYIFGGLGFDGQT------------RLNDLYVYDLDTNTWTQI---------GDLPPPRSGHSATYI 49
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
678-778 |
6.32e-04 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.40 E-value: 6.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 678 PGAPSAVRISKNVEG-IHLSWEPPTSPSGNILEYSaylaIRTAQIQDNPSQLVfmRIYCGLKTSCIVTagQLaNAHIDYT 756
Cdd:cd00063 1 PSPPTNLRVTDVTSTsVTLSWTPPEDDGGPITGYV----VEYREKGSGDWKEV--EVTPGSETSYTLT--GL-KPGTEYE 71
|
90 100
....*....|....*....|..
gi 7019405 757 srpaivFRISAKNEKGYGPATQ 778
Cdd:cd00063 72 ------FRVRAVNGGGESPPSE 87
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
642-703 |
6.46e-04 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 43.07 E-value: 6.46e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 7019405 642 DLVPGTGYRFRVAAINGCGI-GPFSKISEFKTCIPGfPGAPSAVRISKNVE-GIHLSWEPPTSP 703
Cdd:COG3401 291 GLTNGTTYYYRVTAVDAAGNeSAPSNVVSVTTDLTP-PAAPSGLTATAVGSsSITLSWTASSDA 353
|
|
| Kelch_1 |
pfam01344 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
244-301 |
9.68e-04 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 396078 [Multi-domain] Cd Length: 46 Bit Score: 37.59 E-value: 9.68e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 7019405 244 PRSLHTASVIGNKMYIFGGWvphkgentetsphdCEWRCTSSFSYLNLDTTEWTTLVS 301
Cdd:pfam01344 1 RRSGAGVVVVGGKIYVIGGF--------------DGNQSLNSVEVYDPETNTWSKLPS 44
|
|
| NanM |
COG3055 |
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; |
231-337 |
9.92e-04 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 41.68 E-value: 9.92e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 231 SWSkpeTKGTVPLPRSLHTASVIGNKMYIFGGWvphkGENTETSPHDCewrctssfsyLNLDTTEWTTLVSDSQEdkkns 310
Cdd:COG3055 2 TWS---SLPDLPTPRSEAAAALLDGKVYVAGGL----SGGSASNSFEV----------YDPATNTWSELAPLPGP----- 59
|
90 100
....*....|....*....|....*..
gi 7019405 311 rprPRAGHCAVAIGTRLYFWSGRDGYK 337
Cdd:COG3055 60 ---PRHHAAAVAQDGKLYVFGGFTGAN 83
|
|
| Kelch_5 |
pfam13854 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
312-355 |
1.35e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 433528 [Multi-domain] Cd Length: 41 Bit Score: 37.16 E-value: 1.35e-03
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 7019405 312 PRPRAGHCAVAIGTRLYFWSGRDGYkkalNSQVcCKDLWYLDTE 355
Cdd:pfam13854 1 PVPRYGHCAVTVGDYIYLYGGYTGG----EGQP-SDDVYVLSLP 39
|
|
| Kelch_4 |
pfam13418 |
Galactose oxidase, central domain; |
189-234 |
1.50e-03 |
|
Galactose oxidase, central domain;
Pssm-ID: 433191 [Multi-domain] Cd Length: 49 Bit Score: 37.21 E-value: 1.50e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 7019405 189 PRESHTAViYCKKDSGspkmYVFGGMC--GARLDDLWQLDLETMSWSK 234
Cdd:pfam13418 1 PRAYHTST-SIPDDTI----YLFGGEGedGTLLSDLWVFDLSTNEWTR 43
|
|
| Kelch_5 |
pfam13854 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
187-229 |
1.86e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 433528 [Multi-domain] Cd Length: 41 Bit Score: 36.77 E-value: 1.86e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 7019405 187 PSPRESHTAVIYckkdsgSPKMYVFGGMCGAR---LDDLWQLDLET 229
Cdd:pfam13854 1 PVPRYGHCAVTV------GDYIYLYGGYTGGEgqpSDDVYVLSLPT 40
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
535-704 |
6.92e-03 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 39.99 E-value: 6.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 535 STNGAVVKDETSLTTFSTKSEVDETY--ALPATKISRVETHATATPFSKETPSNPVATVKAGERQWCDVGIFKNNTALVS 612
Cdd:COG3401 82 VAVAAAPPTATGLTTLTGSGSVGGATntGLTSSDEVPSPAVGTATTATAVAGGAATAGTYALGAGLYGVDGANASGTTAS 161
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 7019405 613 Q-------FYLLPKGKQSISKVGNADVPDYSLLKKQDLVPGTGYRFRVAAINGCGIGPFSKISEFKTcIPGFPGAPSAVR 685
Cdd:COG3401 162 SvagagvvVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESAPSNEVSVTT-PTTPPSAPTGLT 240
|
170 180
....*....|....*....|
gi 7019405 686 ISKNVEG-IHLSWEPPTSPS 704
Cdd:COG3401 241 ATADTPGsVTLSWDPVTESD 260
|
|
| Kelch_3 |
pfam13415 |
Galactose oxidase, central domain; |
136-198 |
9.16e-03 |
|
Galactose oxidase, central domain;
Pssm-ID: 433188 [Multi-domain] Cd Length: 49 Bit Score: 34.96 E-value: 9.16e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 7019405 136 GNKCYLFGGLANESEDsnnnvprYLNDFYELELQHGSgvvgWsipVTKGVVPSPRESHTAVIY 198
Cdd:pfam13415 1 GDKLYIFGGLGFDGQT-------RLNDLYVYDLDTNT----W---TQIGDLPPPRSGHSATYI 49
|
|
| Kelch_1 |
pfam01344 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
22-59 |
9.72e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 396078 [Multi-domain] Cd Length: 46 Bit Score: 34.90 E-value: 9.72e-03
10 20 30
....*....|....*....|....*....|....*....
gi 7019405 22 ARHGHRAVAIRELMIIFGGGNEGIA-DELHVYNTATNQW 59
Cdd:pfam01344 1 RRSGAGVVVVGGKIYVIGGFDGNQSlNSVEVYDPETNTW 39
|
|
|