|
Name |
Accession |
Description |
Interval |
E-value |
| Arm_2 |
pfam04826 |
Armadillo-like; This domain contains armadillo-like repeats. Proteins containing this domain ... |
2104-2324 |
7.35e-87 |
|
Armadillo-like; This domain contains armadillo-like repeats. Proteins containing this domain interact with numerous other proteins, through these interactions they are involved in a wide variety of processes including carcinogenesis, control of cellular ageing and survival, regulation of circadian rhythm and lysosomal sorting of G protein-coupled receptors. :
Pssm-ID: 461447 [Multi-domain] Cd Length: 221 Bit Score: 283.15 E-value: 7.35e-87
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 2104 LDPHDLEKLICMIEMTEDPSVHEIATNALYNSADYPYPQEIDRNIGGISVIQSLLSNPYPNVRQKALNALNNISVAAENH 2183
Cdd:pfam04826 1 LEPQELDKLLALLKSSEDPFIHEIALITLGNSAAYPFNQDIIRDLGGIPIIANLLSDSNPEVKEKALNALNNLSMNVENQ 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 2184 RTVKTYLSQVCEDTVTYPLNSNVQVAGLRLIKHLTITSEYQHMVTNYISEFLRLLALGSGETKDHVLGMLVNFSKNPSMT 2263
Cdd:pfam04826 81 KKIKVYVNQVCKDIVSSPLNSEVQLAGLKLLTNLSVTNDYHHLVVSYIRDFFSLLSQGNEKTKFQVLKVLLNLSENPAMT 160
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 569011201 2264 RDLLIANAPTALINIFSKKETKENILNALLLFENINRHFKKRGKTYPQDRFSKTSLYFLFQ 2324
Cdd:pfam04826 161 RELLSAQVPSSLLSLFNSKEAKENLLRALTIFENINFHLKKEAKLFVKEHFTKGSLFSIFR 221
|
|
| MSCRAMM_ClfA super family |
cl41352 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
396-808 |
2.86e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif. The actual alignment was detected with superfamily member NF033609:
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.52 E-value: 2.86e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 396 DTQPQAVTSDQTEAMPDAKVKGKGNASAMAKAGAKANTKTSSQTDALPDAGDKNRSDNNVTAKAETGIDMVSCTQTEPVA 475
Cdd:NF033609 553 EIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSAS 632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 476 KDNANTTSKEGAQATGQSQGEALPNTKGKARGKAKAKCKAAAGTDTKTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKA 555
Cdd:NF033609 633 DSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 712
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 556 GAKADQKASGQpqpvancqnealpgtknkvkgnpnptpkteaGTATTSSAQTNVTSSQGETTPGAKNKAKGNRNSVPKAG 635
Cdd:NF033609 713 DSDSDSDSDSD-------------------------------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 761
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 636 AGPDTTGSAQSQTVANSHSEALPGAKNKVKSNSNVVPKAEAGVGACPQSVPASQGTALTGTKTKVKGNSSAVSKPDAGAG 715
Cdd:NF033609 762 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 841
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 716 TMGSAHAKTAANSQGETLPGSKNKVKGNSNAVPkaeagagttePNQPQAEALLGARNKVKGNSNSVP----KAESGASTI 791
Cdd:NF033609 842 SDSDSDSDSDSDSDSESDSNSDSESGSNNNVVP----------PNSPKNGTNASNKNEAKDSKEPLPdtgsEDEANTSLI 911
|
410
....*....|....*..
gi 569011201 792 LALASSQAEALLGARNK 808
Cdd:NF033609 912 WGLLASLGSLLLFRRKK 928
|
|
| COG4625 super family |
cl34793 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
1007-1515 |
4.35e-05 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown]; The actual alignment was detected with superfamily member COG4625:
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 49.01 E-value: 4.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1007 NEVALGSWVSAGDQATGAIWAGAQTTDGTWVADKASAGSWTGAENQISAGSWVVSGNQAIAGPWAVSQVTDGSWPAVQAS 1086
Cdd:COG4625 11 GGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGT 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1087 GVSWVVDQATGTWTVAENQTGAVSWAGAGNIVSIGYWTGAVDQTNAVSWTGTTDQVGVEVKPRFEDQASEKGSWVVAGVQ 1166
Cdd:COG4625 91 GGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGG 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1167 TSGETRLGSEDQSSGRSWTETVDQANAASRLGTVDQAGGTSWAGTGDQVGGVSTSGSADQSSSGSWAGTRNLAGERSWTG 1246
Cdd:COG4625 171 GGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGG 250
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1247 TGDQSDGAAKPGFENQTSDEGSWAGTIGQPSGGSKSVSEAQSAGRSWADSADQLSGGFLVGPLDQANGESQPVSGELAAS 1326
Cdd:COG4625 251 GGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 330
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1327 GVDQTSGGGcwtGSGDQSGGESRLGPRDQSNGESWPGTGDQTGGWYCTYTGTQTIGGGPWVGVSGQDVGGPKPVHMNQAT 1406
Cdd:COG4625 331 GGGAGGGGG---SGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGG 407
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1407 GGAWLGTGTQVSAVSWTGDQVGGCSKPGFEDQAIGGGFWAGAGDQTTGGSrpavsedqssGGVSWGGAGGHVIGGSRTDQ 1486
Cdd:COG4625 408 TGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAG----------GSGGGAGAGGGSGSGAGTLT 477
|
490 500
....*....|....*....|....*....
gi 569011201 1487 SSGTSWPGMGSQVSGGswiGSVDQTSGCT 1515
Cdd:COG4625 478 LTGNNTYTGTTTVNGG---GNYTQSAGST 503
|
|
| WEMBL super family |
cl25644 |
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required ... |
109-253 |
2.55e-03 |
|
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required for the chloroplast avoidance response under high intensity blue light. This avoidance response consists in the relocation of chloroplasts on the anticlinal side of exposed cells. Acts in association with PMI2 to maintain the velocity of chloroplast photo-relocation movement via the regulation of cp-actin filaments. Thus several member-sequences are described as "myosin heavy chain-like". The actual alignment was detected with superfamily member pfam05701:
Pssm-ID: 461718 [Multi-domain] Cd Length: 562 Bit Score: 43.09 E-value: 2.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 109 LEEEIE-TQSETsslvetmvmtEAVTLTESTSQAKEVTMKEAVTQTDAEAEavgkkeavtqtKAKAWAMAGRAEVKK--E 185
Cdd:pfam05701 354 LEAELNrTKSEI----------ALVQAKEKEAREKMVELPKQLQQAAQEAE-----------EAKSLAQAAREELRKakE 412
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 569011201 186 AMTQTKAEARTlaeketeinrvtvTQSEVLAVTKEVVKIGTMNETGVVAeamIRPLEETVSVTRTQSE 253
Cdd:pfam05701 413 EAEQAKAAAST-------------VESRLEAVLKEIEAAKASEKLALAA---IKALQESESSAESTNQ 464
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Arm_2 |
pfam04826 |
Armadillo-like; This domain contains armadillo-like repeats. Proteins containing this domain ... |
2104-2324 |
7.35e-87 |
|
Armadillo-like; This domain contains armadillo-like repeats. Proteins containing this domain interact with numerous other proteins, through these interactions they are involved in a wide variety of processes including carcinogenesis, control of cellular ageing and survival, regulation of circadian rhythm and lysosomal sorting of G protein-coupled receptors.
Pssm-ID: 461447 [Multi-domain] Cd Length: 221 Bit Score: 283.15 E-value: 7.35e-87
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 2104 LDPHDLEKLICMIEMTEDPSVHEIATNALYNSADYPYPQEIDRNIGGISVIQSLLSNPYPNVRQKALNALNNISVAAENH 2183
Cdd:pfam04826 1 LEPQELDKLLALLKSSEDPFIHEIALITLGNSAAYPFNQDIIRDLGGIPIIANLLSDSNPEVKEKALNALNNLSMNVENQ 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 2184 RTVKTYLSQVCEDTVTYPLNSNVQVAGLRLIKHLTITSEYQHMVTNYISEFLRLLALGSGETKDHVLGMLVNFSKNPSMT 2263
Cdd:pfam04826 81 KKIKVYVNQVCKDIVSSPLNSEVQLAGLKLLTNLSVTNDYHHLVVSYIRDFFSLLSQGNEKTKFQVLKVLLNLSENPAMT 160
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 569011201 2264 RDLLIANAPTALINIFSKKETKENILNALLLFENINRHFKKRGKTYPQDRFSKTSLYFLFQ 2324
Cdd:pfam04826 161 RELLSAQVPSSLLSLFNSKEAKENLLRALTIFENINFHLKKEAKLFVKEHFTKGSLFSIFR 221
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
396-808 |
2.86e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.52 E-value: 2.86e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 396 DTQPQAVTSDQTEAMPDAKVKGKGNASAMAKAGAKANTKTSSQTDALPDAGDKNRSDNNVTAKAETGIDMVSCTQTEPVA 475
Cdd:NF033609 553 EIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSAS 632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 476 KDNANTTSKEGAQATGQSQGEALPNTKGKARGKAKAKCKAAAGTDTKTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKA 555
Cdd:NF033609 633 DSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 712
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 556 GAKADQKASGQpqpvancqnealpgtknkvkgnpnptpkteaGTATTSSAQTNVTSSQGETTPGAKNKAKGNRNSVPKAG 635
Cdd:NF033609 713 DSDSDSDSDSD-------------------------------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 761
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 636 AGPDTTGSAQSQTVANSHSEALPGAKNKVKSNSNVVPKAEAGVGACPQSVPASQGTALTGTKTKVKGNSSAVSKPDAGAG 715
Cdd:NF033609 762 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 841
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 716 TMGSAHAKTAANSQGETLPGSKNKVKGNSNAVPkaeagagttePNQPQAEALLGARNKVKGNSNSVP----KAESGASTI 791
Cdd:NF033609 842 SDSDSDSDSDSDSDSESDSNSDSESGSNNNVVP----------PNSPKNGTNASNKNEAKDSKEPLPdtgsEDEANTSLI 911
|
410
....*....|....*..
gi 569011201 792 LALASSQAEALLGARNK 808
Cdd:NF033609 912 WGLLASLGSLLLFRRKK 928
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
365-714 |
3.81e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.14 E-value: 3.81e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 365 VAKIETEATSSATMDGGKDANAKAMTDVNVTDTQPQAVTSDQTEAMPDAKVKGKGNASAMAKAGAKANTKTSSQTDALPD 444
Cdd:NF033609 554 IEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASD 633
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 445 AGDKNRSDNNVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGEALPNTKGKARGKAKAKCKAAAGTDTKTC 524
Cdd:NF033609 634 SDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 713
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 525 AQPQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQPVANCQNEALPGTKNKVKGNPNPTPKTEAGTATTSS 604
Cdd:NF033609 714 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 793
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 605 AQTNvTSSQGETTPGAKNKAKGNRNSVPKAGAGPDTTGSAQSQTVANSHSEALPGAKNKVKSNSNVVPKAEAGVGACPQS 684
Cdd:NF033609 794 SDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNV 872
|
330 340 350
....*....|....*....|....*....|
gi 569011201 685 VPASQGTALTGTKTKVKGNSSAVSKPDAGA 714
Cdd:NF033609 873 VPPNSPKNGTNASNKNEAKDSKEPLPDTGS 902
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
267-603 |
4.33e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.14 E-value: 4.33e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 267 NDMSVVVTGVDTKSCAQSQAVAIIQNDDMAGAEADK-EDLKNMSKAGSGIDMKVPGQPHIAANNLAEAVPGAKNDAWDNA 345
Cdd:NF033609 571 SDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSaSDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDS 650
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 346 KDICEAEADIRTcliQSETVAKIETEATSSATMDGGKDANAKAMTDVNV---TDTQPQAVTSDQTEAMPDAKVKGKGNAS 422
Cdd:NF033609 651 DSDSDSDSDSDS---DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdsdSDSDSDSDSDSDSDSDSDSDSDSDSDSD 727
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 423 AMAKAGAKANTKTSSQTDALPDAGDKNRSDNNVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGEALPNTK 502
Cdd:NF033609 728 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 807
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 503 GKARGKAKAKCKAAAGTDTKTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQPVANCQNEALPGTK 582
Cdd:NF033609 808 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNK 887
|
330 340
....*....|....*....|..
gi 569011201 583 NKVKGNPNPTPKTEA-GTATTS 603
Cdd:NF033609 888 NEAKDSKEPLPDTGSeDEANTS 909
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
1007-1515 |
4.35e-05 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 49.01 E-value: 4.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1007 NEVALGSWVSAGDQATGAIWAGAQTTDGTWVADKASAGSWTGAENQISAGSWVVSGNQAIAGPWAVSQVTDGSWPAVQAS 1086
Cdd:COG4625 11 GGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGT 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1087 GVSWVVDQATGTWTVAENQTGAVSWAGAGNIVSIGYWTGAVDQTNAVSWTGTTDQVGVEVKPRFEDQASEKGSWVVAGVQ 1166
Cdd:COG4625 91 GGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGG 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1167 TSGETRLGSEDQSSGRSWTETVDQANAASRLGTVDQAGGTSWAGTGDQVGGVSTSGSADQSSSGSWAGTRNLAGERSWTG 1246
Cdd:COG4625 171 GGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGG 250
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1247 TGDQSDGAAKPGFENQTSDEGSWAGTIGQPSGGSKSVSEAQSAGRSWADSADQLSGGFLVGPLDQANGESQPVSGELAAS 1326
Cdd:COG4625 251 GGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 330
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1327 GVDQTSGGGcwtGSGDQSGGESRLGPRDQSNGESWPGTGDQTGGWYCTYTGTQTIGGGPWVGVSGQDVGGPKPVHMNQAT 1406
Cdd:COG4625 331 GGGAGGGGG---SGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGG 407
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1407 GGAWLGTGTQVSAVSWTGDQVGGCSKPGFEDQAIGGGFWAGAGDQTTGGSrpavsedqssGGVSWGGAGGHVIGGSRTDQ 1486
Cdd:COG4625 408 TGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAG----------GSGGGAGAGGGSGSGAGTLT 477
|
490 500
....*....|....*....|....*....
gi 569011201 1487 SSGTSWPGMGSQVSGGswiGSVDQTSGCT 1515
Cdd:COG4625 478 LTGNNTYTGTTTVNGG---GNYTQSAGST 503
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
454-797 |
8.50e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 47.99 E-value: 8.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 454 NVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGE-------ALPNTKGKARGKAKAKCKAAAGTDTKTCAQ 526
Cdd:pfam05109 457 NLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTeskapdmTSPTSAVTTPTPNATSPTPAVTTPTPNATS 536
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 527 PQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQP---VANCQNEALPGTKNKVKGNPNPTPKTEAGTATTS 603
Cdd:pfam05109 537 PTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPtsaVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTP 616
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 604 SAQTNVTSSQGETTPGAKNKAKGNRNSVPKAGAGPDTTGSAQSQTVANSHSEALPGAKNKVKSNSNVVPKAEAGVGACPQ 683
Cdd:pfam05109 617 VVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVST 696
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 684 SVPASQGTalTGTKTKVKGNSSAVSKPDAGAGTMGS----AHAKTAANSQGETLPG-SKNKVKGNSNAVPKAEAGAGTTE 758
Cdd:pfam05109 697 SSPAPRPG--TTSQASGPGNSSTSTKPGEVNVTKGTppknATSPQAPSGQKTAVPTvTSTGGKANSTTGGKHTTGHGART 774
|
330 340 350
....*....|....*....|....*....|....*....
gi 569011201 759 PNQPQAEAllgarnkvkGNSNSVPKAESGASTILALASS 797
Cdd:pfam05109 775 STEPTTDY---------GGDSTTPRTRYNATTYLPPSTS 804
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1084-1370 |
1.79e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 46.92 E-value: 1.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1084 QASGVSWVVDQATGtwtvaENQTGAVSWAgAGNIVSIGYWTgavdqTNAVSwTGTTDQVGVEVKPRFEDQASEKGSWVVA 1163
Cdd:NF033849 279 HGSTRGWSHTQSTS-----ESESTGQSSS-VGTSESQSHGT-----TEGTS-TTDSSSHSQSSSYNVSSGTGVSSSHSDG 346
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1164 GVQTSGETRLGSEDQSSGRSWTETVdqanaaSRLGTVDQAGGTSW-AGTGDQVGGVSTSGSADQSSSGSwagtrNLAGER 1242
Cdd:NF033849 347 TSQSTSISHSESSSESTGTSVGHST------SSSVSSSESSSRSSsSGVSGGFSGGIAGGGVTSEGLGA-----SQGGSE 415
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1243 SW-TGTGDQSDGAAKPGFENQTSDEGsWAGTIGQPSGGSKSVSEAQsaGRSWADSADQLSGgflvgpldQANGESQPVSg 1321
Cdd:NF033849 416 GWgSGDSVQSVSQSYGSSSSTGTSSG-HSDSSSHSTSSGQADSVSQ--GTSWSEGTGTSQG--------QSVGTSESWS- 483
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 569011201 1322 elaASGVDQTSGGGCWTGSGDQSGGESRLGPRDQSNGESWPGTGDQTGG 1370
Cdd:NF033849 484 ---TSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSG 529
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
362-636 |
2.16e-04 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 46.83 E-value: 2.16e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 362 SETVAKIETEATSSATMDGGKDANAKAMTDvnvTDTQPQAVTSDQTEAMPDAKVKGKGNASAMAKAGAKANTKTSSQTDA 441
Cdd:NF033609 646 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 722
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 442 LPDAGDKNRSDNNVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGEALPNTKGKARGKAKAKCKAAAGTDT 521
Cdd:NF033609 723 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 802
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 522 KTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQPVANCQNEALPGTKNKVKGNPNPTPKTEAGTAT 601
Cdd:NF033609 803 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGT 882
|
250 260 270
....*....|....*....|....*....|....*
gi 569011201 602 TSSaqtnvtssqgettpgAKNKAKGNRNSVPKAGA 636
Cdd:NF033609 883 NAS---------------NKNEAKDSKEPLPDTGS 902
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1031-1300 |
2.95e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 46.15 E-value: 2.95e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1031 TTDGTWVADKASAGSWTGAENQISAG---SWVVSGNQAIAGPWAVSQVTDGSWPAVQASGVSWVVDQATGTWTVAENQTG 1107
Cdd:NF033849 281 STRGWSHTQSTSESESTGQSSSVGTSesqSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSS 360
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1108 AVSWAGAGNIVSIGYWTGAVDQTNAvSWtGTTDQVGVEVKP------RFEDQASEKGSWVVA-GVQTSGETRLGSEDQSS 1180
Cdd:NF033849 361 ESTGTSVGHSTSSSVSSSESSSRSS-SS-GVSGGFSGGIAGggvtseGLGASQGGSEGWGSGdSVQSVSQSYGSSSSTGT 438
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1181 GRSWTETVDQANAASRLGTVDQAGGTSwAGTGDQVGgVSTSGSADQSSSGSWAGTRNlageRSWTGTGDQSDGAAKPGFE 1260
Cdd:NF033849 439 SSGHSDSSSHSTSSGQADSVSQGTSWS-EGTGTSQG-QSVGTSESWSTSQSETDSVG----DSTGTSESVSQGDGRSTGR 512
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 569011201 1261 NQTSDEG-SWAGTIGQPSGGSKSVSEAQSAGRSWA---DSADQL 1300
Cdd:NF033849 513 SESQGTSlGTSGGRTSGAGGSMGLGPSISLGKSYQwedDPAILL 556
|
|
| PRK08691 |
PRK08691 |
DNA polymerase III subunits gamma and tau; Validated |
589-762 |
5.72e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236333 [Multi-domain] Cd Length: 709 Bit Score: 45.08 E-value: 5.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 589 PNPTPKTEAGTATTSSAQTNVTSSQGETTPGAKNKAKgnrNSVPKAGAGPDTT----GSAQ-SQTVANSHSEALPGAKNK 663
Cdd:PRK08691 394 PQPRPEAETAQTPVQTASAAAMPSEGKTAGPVSNQEN---NDVPPWEDAPDEAqtaaGTAQtSAKSIQTASEAETPPENQ 470
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 664 VKSNSNVVPKAEAGVGACPQSVPASqgtaltgtktkvkgnSSAVSKPdagAGTMGSAHAKTAANSQGETLPGSKNKVKGN 743
Cdd:PRK08691 471 VSKNKAADNETDAPLSEVPSENPIQ---------------ATPNDEA---VETETFAHEAPAEPFYGYGFPDNDCPPEDG 532
|
170 180
....*....|....*....|....*..
gi 569011201 744 --------SNAVPKAEAGAGTTEPNQP 762
Cdd:PRK08691 533 aeipppdwEHAAPADTAGGGADEEAEA 559
|
|
| WEMBL |
pfam05701 |
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required ... |
109-253 |
2.55e-03 |
|
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required for the chloroplast avoidance response under high intensity blue light. This avoidance response consists in the relocation of chloroplasts on the anticlinal side of exposed cells. Acts in association with PMI2 to maintain the velocity of chloroplast photo-relocation movement via the regulation of cp-actin filaments. Thus several member-sequences are described as "myosin heavy chain-like".
Pssm-ID: 461718 [Multi-domain] Cd Length: 562 Bit Score: 43.09 E-value: 2.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 109 LEEEIE-TQSETsslvetmvmtEAVTLTESTSQAKEVTMKEAVTQTDAEAEavgkkeavtqtKAKAWAMAGRAEVKK--E 185
Cdd:pfam05701 354 LEAELNrTKSEI----------ALVQAKEKEAREKMVELPKQLQQAAQEAE-----------EAKSLAQAAREELRKakE 412
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 569011201 186 AMTQTKAEARTlaeketeinrvtvTQSEVLAVTKEVVKIGTMNETGVVAeamIRPLEETVSVTRTQSE 253
Cdd:pfam05701 413 EAEQAKAAAST-------------VESRLEAVLKEIEAAKASEKLALAA---IKALQESESSAESTNQ 464
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Arm_2 |
pfam04826 |
Armadillo-like; This domain contains armadillo-like repeats. Proteins containing this domain ... |
2104-2324 |
7.35e-87 |
|
Armadillo-like; This domain contains armadillo-like repeats. Proteins containing this domain interact with numerous other proteins, through these interactions they are involved in a wide variety of processes including carcinogenesis, control of cellular ageing and survival, regulation of circadian rhythm and lysosomal sorting of G protein-coupled receptors.
Pssm-ID: 461447 [Multi-domain] Cd Length: 221 Bit Score: 283.15 E-value: 7.35e-87
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 2104 LDPHDLEKLICMIEMTEDPSVHEIATNALYNSADYPYPQEIDRNIGGISVIQSLLSNPYPNVRQKALNALNNISVAAENH 2183
Cdd:pfam04826 1 LEPQELDKLLALLKSSEDPFIHEIALITLGNSAAYPFNQDIIRDLGGIPIIANLLSDSNPEVKEKALNALNNLSMNVENQ 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 2184 RTVKTYLSQVCEDTVTYPLNSNVQVAGLRLIKHLTITSEYQHMVTNYISEFLRLLALGSGETKDHVLGMLVNFSKNPSMT 2263
Cdd:pfam04826 81 KKIKVYVNQVCKDIVSSPLNSEVQLAGLKLLTNLSVTNDYHHLVVSYIRDFFSLLSQGNEKTKFQVLKVLLNLSENPAMT 160
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 569011201 2264 RDLLIANAPTALINIFSKKETKENILNALLLFENINRHFKKRGKTYPQDRFSKTSLYFLFQ 2324
Cdd:pfam04826 161 RELLSAQVPSSLLSLFNSKEAKENLLRALTIFENINFHLKKEAKLFVKEHFTKGSLFSIFR 221
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
396-808 |
2.86e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.52 E-value: 2.86e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 396 DTQPQAVTSDQTEAMPDAKVKGKGNASAMAKAGAKANTKTSSQTDALPDAGDKNRSDNNVTAKAETGIDMVSCTQTEPVA 475
Cdd:NF033609 553 EIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSAS 632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 476 KDNANTTSKEGAQATGQSQGEALPNTKGKARGKAKAKCKAAAGTDTKTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKA 555
Cdd:NF033609 633 DSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 712
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 556 GAKADQKASGQpqpvancqnealpgtknkvkgnpnptpkteaGTATTSSAQTNVTSSQGETTPGAKNKAKGNRNSVPKAG 635
Cdd:NF033609 713 DSDSDSDSDSD-------------------------------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 761
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 636 AGPDTTGSAQSQTVANSHSEALPGAKNKVKSNSNVVPKAEAGVGACPQSVPASQGTALTGTKTKVKGNSSAVSKPDAGAG 715
Cdd:NF033609 762 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 841
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 716 TMGSAHAKTAANSQGETLPGSKNKVKGNSNAVPkaeagagttePNQPQAEALLGARNKVKGNSNSVP----KAESGASTI 791
Cdd:NF033609 842 SDSDSDSDSDSDSDSESDSNSDSESGSNNNVVP----------PNSPKNGTNASNKNEAKDSKEPLPdtgsEDEANTSLI 911
|
410
....*....|....*..
gi 569011201 792 LALASSQAEALLGARNK 808
Cdd:NF033609 912 WGLLASLGSLLLFRRKK 928
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
365-714 |
3.81e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.14 E-value: 3.81e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 365 VAKIETEATSSATMDGGKDANAKAMTDVNVTDTQPQAVTSDQTEAMPDAKVKGKGNASAMAKAGAKANTKTSSQTDALPD 444
Cdd:NF033609 554 IEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASD 633
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 445 AGDKNRSDNNVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGEALPNTKGKARGKAKAKCKAAAGTDTKTC 524
Cdd:NF033609 634 SDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 713
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 525 AQPQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQPVANCQNEALPGTKNKVKGNPNPTPKTEAGTATTSS 604
Cdd:NF033609 714 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 793
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 605 AQTNvTSSQGETTPGAKNKAKGNRNSVPKAGAGPDTTGSAQSQTVANSHSEALPGAKNKVKSNSNVVPKAEAGVGACPQS 684
Cdd:NF033609 794 SDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNV 872
|
330 340 350
....*....|....*....|....*....|
gi 569011201 685 VPASQGTALTGTKTKVKGNSSAVSKPDAGA 714
Cdd:NF033609 873 VPPNSPKNGTNASNKNEAKDSKEPLPDTGS 902
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
267-603 |
4.33e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.14 E-value: 4.33e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 267 NDMSVVVTGVDTKSCAQSQAVAIIQNDDMAGAEADK-EDLKNMSKAGSGIDMKVPGQPHIAANNLAEAVPGAKNDAWDNA 345
Cdd:NF033609 571 SDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSaSDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDS 650
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 346 KDICEAEADIRTcliQSETVAKIETEATSSATMDGGKDANAKAMTDVNV---TDTQPQAVTSDQTEAMPDAKVKGKGNAS 422
Cdd:NF033609 651 DSDSDSDSDSDS---DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdsdSDSDSDSDSDSDSDSDSDSDSDSDSDSD 727
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 423 AMAKAGAKANTKTSSQTDALPDAGDKNRSDNNVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGEALPNTK 502
Cdd:NF033609 728 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 807
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 503 GKARGKAKAKCKAAAGTDTKTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQPVANCQNEALPGTK 582
Cdd:NF033609 808 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNK 887
|
330 340
....*....|....*....|..
gi 569011201 583 NKVKGNPNPTPKTEA-GTATTS 603
Cdd:NF033609 888 NEAKDSKEPLPDTGSeDEANTS 909
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
1007-1515 |
4.35e-05 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 49.01 E-value: 4.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1007 NEVALGSWVSAGDQATGAIWAGAQTTDGTWVADKASAGSWTGAENQISAGSWVVSGNQAIAGPWAVSQVTDGSWPAVQAS 1086
Cdd:COG4625 11 GGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGT 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1087 GVSWVVDQATGTWTVAENQTGAVSWAGAGNIVSIGYWTGAVDQTNAVSWTGTTDQVGVEVKPRFEDQASEKGSWVVAGVQ 1166
Cdd:COG4625 91 GGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGG 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1167 TSGETRLGSEDQSSGRSWTETVDQANAASRLGTVDQAGGTSWAGTGDQVGGVSTSGSADQSSSGSWAGTRNLAGERSWTG 1246
Cdd:COG4625 171 GGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGG 250
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1247 TGDQSDGAAKPGFENQTSDEGSWAGTIGQPSGGSKSVSEAQSAGRSWADSADQLSGGFLVGPLDQANGESQPVSGELAAS 1326
Cdd:COG4625 251 GGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 330
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1327 GVDQTSGGGcwtGSGDQSGGESRLGPRDQSNGESWPGTGDQTGGWYCTYTGTQTIGGGPWVGVSGQDVGGPKPVHMNQAT 1406
Cdd:COG4625 331 GGGAGGGGG---SGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGG 407
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1407 GGAWLGTGTQVSAVSWTGDQVGGCSKPGFEDQAIGGGFWAGAGDQTTGGSrpavsedqssGGVSWGGAGGHVIGGSRTDQ 1486
Cdd:COG4625 408 TGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAG----------GSGGGAGAGGGSGSGAGTLT 477
|
490 500
....*....|....*....|....*....
gi 569011201 1487 SSGTSWPGMGSQVSGGswiGSVDQTSGCT 1515
Cdd:COG4625 478 LTGNNTYTGTTTVNGG---GNYTQSAGST 503
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
454-797 |
8.50e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 47.99 E-value: 8.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 454 NVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGE-------ALPNTKGKARGKAKAKCKAAAGTDTKTCAQ 526
Cdd:pfam05109 457 NLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTeskapdmTSPTSAVTTPTPNATSPTPAVTTPTPNATS 536
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 527 PQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQP---VANCQNEALPGTKNKVKGNPNPTPKTEAGTATTS 603
Cdd:pfam05109 537 PTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPtsaVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTP 616
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 604 SAQTNVTSSQGETTPGAKNKAKGNRNSVPKAGAGPDTTGSAQSQTVANSHSEALPGAKNKVKSNSNVVPKAEAGVGACPQ 683
Cdd:pfam05109 617 VVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVST 696
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 684 SVPASQGTalTGTKTKVKGNSSAVSKPDAGAGTMGS----AHAKTAANSQGETLPG-SKNKVKGNSNAVPKAEAGAGTTE 758
Cdd:pfam05109 697 SSPAPRPG--TTSQASGPGNSSTSTKPGEVNVTKGTppknATSPQAPSGQKTAVPTvTSTGGKANSTTGGKHTTGHGART 774
|
330 340 350
....*....|....*....|....*....|....*....
gi 569011201 759 PNQPQAEAllgarnkvkGNSNSVPKAESGASTILALASS 797
Cdd:pfam05109 775 STEPTTDY---------GGDSTTPRTRYNATTYLPPSTS 804
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1084-1370 |
1.79e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 46.92 E-value: 1.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1084 QASGVSWVVDQATGtwtvaENQTGAVSWAgAGNIVSIGYWTgavdqTNAVSwTGTTDQVGVEVKPRFEDQASEKGSWVVA 1163
Cdd:NF033849 279 HGSTRGWSHTQSTS-----ESESTGQSSS-VGTSESQSHGT-----TEGTS-TTDSSSHSQSSSYNVSSGTGVSSSHSDG 346
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1164 GVQTSGETRLGSEDQSSGRSWTETVdqanaaSRLGTVDQAGGTSW-AGTGDQVGGVSTSGSADQSSSGSwagtrNLAGER 1242
Cdd:NF033849 347 TSQSTSISHSESSSESTGTSVGHST------SSSVSSSESSSRSSsSGVSGGFSGGIAGGGVTSEGLGA-----SQGGSE 415
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1243 SW-TGTGDQSDGAAKPGFENQTSDEGsWAGTIGQPSGGSKSVSEAQsaGRSWADSADQLSGgflvgpldQANGESQPVSg 1321
Cdd:NF033849 416 GWgSGDSVQSVSQSYGSSSSTGTSSG-HSDSSSHSTSSGQADSVSQ--GTSWSEGTGTSQG--------QSVGTSESWS- 483
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 569011201 1322 elaASGVDQTSGGGCWTGSGDQSGGESRLGPRDQSNGESWPGTGDQTGG 1370
Cdd:NF033849 484 ---TSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSG 529
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
362-636 |
2.16e-04 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 46.83 E-value: 2.16e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 362 SETVAKIETEATSSATMDGGKDANAKAMTDvnvTDTQPQAVTSDQTEAMPDAKVKGKGNASAMAKAGAKANTKTSSQTDA 441
Cdd:NF033609 646 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 722
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 442 LPDAGDKNRSDNNVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGEALPNTKGKARGKAKAKCKAAAGTDT 521
Cdd:NF033609 723 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 802
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 522 KTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQPVANCQNEALPGTKNKVKGNPNPTPKTEAGTAT 601
Cdd:NF033609 803 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGT 882
|
250 260 270
....*....|....*....|....*....|....*
gi 569011201 602 TSSaqtnvtssqgettpgAKNKAKGNRNSVPKAGA 636
Cdd:NF033609 883 NAS---------------NKNEAKDSKEPLPDTGS 902
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1031-1300 |
2.95e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 46.15 E-value: 2.95e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1031 TTDGTWVADKASAGSWTGAENQISAG---SWVVSGNQAIAGPWAVSQVTDGSWPAVQASGVSWVVDQATGTWTVAENQTG 1107
Cdd:NF033849 281 STRGWSHTQSTSESESTGQSSSVGTSesqSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSS 360
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1108 AVSWAGAGNIVSIGYWTGAVDQTNAvSWtGTTDQVGVEVKP------RFEDQASEKGSWVVA-GVQTSGETRLGSEDQSS 1180
Cdd:NF033849 361 ESTGTSVGHSTSSSVSSSESSSRSS-SS-GVSGGFSGGIAGggvtseGLGASQGGSEGWGSGdSVQSVSQSYGSSSSTGT 438
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1181 GRSWTETVDQANAASRLGTVDQAGGTSwAGTGDQVGgVSTSGSADQSSSGSWAGTRNlageRSWTGTGDQSDGAAKPGFE 1260
Cdd:NF033849 439 SSGHSDSSSHSTSSGQADSVSQGTSWS-EGTGTSQG-QSVGTSESWSTSQSETDSVG----DSTGTSESVSQGDGRSTGR 512
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 569011201 1261 NQTSDEG-SWAGTIGQPSGGSKSVSEAQSAGRSWA---DSADQL 1300
Cdd:NF033849 513 SESQGTSlGTSGGRTSGAGGSMGLGPSISLGKSYQwedDPAILL 556
|
|
| PRK08691 |
PRK08691 |
DNA polymerase III subunits gamma and tau; Validated |
589-762 |
5.72e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236333 [Multi-domain] Cd Length: 709 Bit Score: 45.08 E-value: 5.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 589 PNPTPKTEAGTATTSSAQTNVTSSQGETTPGAKNKAKgnrNSVPKAGAGPDTT----GSAQ-SQTVANSHSEALPGAKNK 663
Cdd:PRK08691 394 PQPRPEAETAQTPVQTASAAAMPSEGKTAGPVSNQEN---NDVPPWEDAPDEAqtaaGTAQtSAKSIQTASEAETPPENQ 470
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 664 VKSNSNVVPKAEAGVGACPQSVPASqgtaltgtktkvkgnSSAVSKPdagAGTMGSAHAKTAANSQGETLPGSKNKVKGN 743
Cdd:PRK08691 471 VSKNKAADNETDAPLSEVPSENPIQ---------------ATPNDEA---VETETFAHEAPAEPFYGYGFPDNDCPPEDG 532
|
170 180
....*....|....*....|....*..
gi 569011201 744 --------SNAVPKAEAGAGTTEPNQP 762
Cdd:PRK08691 533 aeipppdwEHAAPADTAGGGADEEAEA 559
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
955-1139 |
2.31e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.20 E-value: 2.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 955 DWENSVSWTEDESGATIGPWTGAANdkAGLVSSWAVACDESSIKSWTGARPENEVALGSWVSAGDQATGAIWAGAQTTDG 1034
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGA--AATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTST 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1035 TWVADKASAGSWTGAenqisagswvvsgnqaiagPWAVSQVTDGSWPAVQASGVSWVVDQATGTWTVAENQTGAVSWAGA 1114
Cdd:COG3469 79 TATATAAAAAATSTS-------------------ATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAS 139
|
170 180
....*....|....*....|....*
gi 569011201 1115 GNIVSIGYWTGAVDQTNAVSWTGTT 1139
Cdd:COG3469 140 ATSSAGSTTTTTTVSGTETATGGTT 164
|
|
| WEMBL |
pfam05701 |
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required ... |
109-253 |
2.55e-03 |
|
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required for the chloroplast avoidance response under high intensity blue light. This avoidance response consists in the relocation of chloroplasts on the anticlinal side of exposed cells. Acts in association with PMI2 to maintain the velocity of chloroplast photo-relocation movement via the regulation of cp-actin filaments. Thus several member-sequences are described as "myosin heavy chain-like".
Pssm-ID: 461718 [Multi-domain] Cd Length: 562 Bit Score: 43.09 E-value: 2.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 109 LEEEIE-TQSETsslvetmvmtEAVTLTESTSQAKEVTMKEAVTQTDAEAEavgkkeavtqtKAKAWAMAGRAEVKK--E 185
Cdd:pfam05701 354 LEAELNrTKSEI----------ALVQAKEKEAREKMVELPKQLQQAAQEAE-----------EAKSLAQAAREELRKakE 412
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 569011201 186 AMTQTKAEARTlaeketeinrvtvTQSEVLAVTKEVVKIGTMNETGVVAeamIRPLEETVSVTRTQSE 253
Cdd:pfam05701 413 EAEQAKAAAST-------------VESRLEAVLKEIEAAKASEKLALAA---IKALQESESSAESTNQ 464
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
339-686 |
3.51e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.60 E-value: 3.51e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 339 NDAWDNAKDICEAEADIRTCLIQSETVAKIETEATSSATMDG-GKDANAKAMTDVNVTDTQPqavTSDQTEAMPDAKVKG 417
Cdd:pfam05109 497 NGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTlGKTSPTSAVTTPTPNATSP---TPAVTTPTPNATIPT 573
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 418 KGNASAMAKAGAKANTKTSsqtdalPDAGDKNrSDNNVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGEA 497
Cdd:pfam05109 574 LGKTSPTSAVTTPTPNATS------PTVGETS-PQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSL 646
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 498 LPNTKGKARGKAKAKCKAAAGTdTKTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGqpqpvaNCQNEA 577
Cdd:pfam05109 647 RPSSISETLSPSTSDNSTSHMP-LLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPG------NSSTST 719
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 578 LPGTKNKVKGNP--NPT-PKTEAGTATTSSAQTNVTSSQGETTPGAKNKAKGNRNSV-PKAGAGPDTTgsaQSQTVANSH 653
Cdd:pfam05109 720 KPGEVNVTKGTPpkNATsPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTePTTDYGGDST---TPRTRYNAT 796
|
330 340 350
....*....|....*....|....*....|....*..
gi 569011201 654 SEALPGAKNKVKS----NSNVVPKAEAGVGACPQSVP 686
Cdd:pfam05109 797 TYLPPSTSSKLRPrwtfTSPPVTTAQATVPVPPTSQP 833
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
1060-1553 |
8.14e-03 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 41.69 E-value: 8.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1060 VSGNQAIAGPWAVSQVTDGSWPAVQASGVSWVVDQATGTWTVAENQTGAVSWAGAGNIVSIGYWTGAVDQTNAVSWTGTT 1139
Cdd:COG4625 3 GGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGG 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1140 DQVGVEVKPRFEDQASEKGSWVVAGVQTSGETRLGSEDQSSGRSWTETVDQANAASRLGTVDQAGGTSWAGTGDQVGGVS 1219
Cdd:COG4625 83 GGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGG 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1220 TSGSADQSSSGSWAGTRNLAGERSWTGTGDQSDGAAKPGFENQTSDEGSwAGTIGQPSGGSKSVSEAQSAGRSWADSADQ 1299
Cdd:COG4625 163 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGG-GGAGGGGGGGGGGGGGGGGGGGGGGGGGGG 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1300 LSGGFLVGPLDQANGESQPVSGELAASGVDQTSGGGCWTGSGDQSGGESRLGPRDQSNGESWPGTGDQTGGWYCTYTGTQ 1379
Cdd:COG4625 242 GGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 321
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1380 TIGGGPWVGVSGQDVGGPKPVHMNQATGGAWLGTGTQVSAVSWTGDQVGGCSKPGFEDQAIGGGFWAGAGDQTTGGSRPA 1459
Cdd:COG4625 322 GGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGG 401
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1460 VSEDQSSGGVSWGGAGGHVIGGSRTDQSSGTSWPGMGSQVSGGSWIGSVDQTSGcTKSGFEDQTCAGGSWVGTGEQTSGD 1539
Cdd:COG4625 402 GGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGG-AGGSGGGAGAGGGSGSGAGTLTLTG 480
|
490
....*....|....
gi 569011201 1540 SWPGSRASNEASGG 1553
Cdd:COG4625 481 NNTYTGTTTVNGGG 494
|
|
| AidA |
COG3468 |
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ... |
1015-1425 |
8.16e-03 |
|
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442691 [Multi-domain] Cd Length: 846 Bit Score: 41.47 E-value: 8.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1015 VSAGDQATGAIWAGAQTTDGTWVADKASAGSWTGAENQISAGSWVVSGNQAIAGPWAVSQVTDGSWPAVQASGVSWVVDQ 1094
Cdd:COG3468 1 TASGGGGGATGLGGGGTGGGGGLGGTGGGNAGLGIGNGGGGGAASGSGAGGVAGNGGGGGGGAGGGGGGAGSGGGLAGAG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1095 ATGTWTVAENQTGAVSWAGAGNIVSIGYWTGAVDQTNAVSWTGTTDQVGVEVKPRFEDQASEKGSWVVAGVQTSGETRLG 1174
Cdd:COG3468 81 SGGTGGNSTGGGGGNSGTGGTGGGGGGGGSGNGGGGGGGGGGGGTGGGGGGGTGSAGGGGGGGGGGTGVGGTGAAAAGGG 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1175 SEDQSSGRSWTETVDQANAASRLGTVDQAGGTSW--AGTGDQVGGVSTSGSADQSSSGSWAGTRNLAGERSWTGTGDQSD 1252
Cdd:COG3468 161 TGSGGGGSGGGGGAGGGGGGGAGGSGGAGSTGSGagGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGGGVGG 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1253 GA--AKPGFENQTSDEGSWAGTIGQPSGGSKSVSEAQSAGRSWADSADQLSGGFLVGPLDQANGESQPVSGELAASGVDQ 1330
Cdd:COG3468 241 GGgsAGGTGGGGLTGGGAAGTGGGGGGTGTGSGGGGGGGANGGGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGGGGGS 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 1331 TSGGGCWTGSGDQSGGESR-----------------LGPRDQSNGESWPGTGDQTGGWYCTYTGTQTIGGGPWVGVSGQD 1393
Cdd:COG3468 321 NAGGGSGGGGGGGGGGGGGgttlngagsagggtgaaLAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDGVGTGLTTGGT 400
|
410 420 430
....*....|....*....|....*....|..
gi 569011201 1394 VGGPKPVHMNQATGGAWLGTGTQVSAVSWTGD 1425
Cdd:COG3468 401 GNNGGGGVGGGGGGGLTLTGGTLTVNGNYTGN 432
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
469-798 |
9.21e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 41.10 E-value: 9.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 469 TQTEPVAKDNANTTSKEGAQATGQSQGEALPNTKGKARGKAKAKCKAAAGTDTKTCAQPQAGTKTQAEALSDSKVdsksd 548
Cdd:pfam17823 88 AEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAI----- 162
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 549 sngvskAGAKADQKASGQPQPVANCQNEALPGTKnkvkGNPNPTPKTEAGTATTSSAQTNVTSSQGETTPGAKNKAKGNR 628
Cdd:pfam17823 163 ------AAASAPHAASPAPRTAASSTTAASSTTA----ASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVG 232
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 629 NSVPKAGAGPDTTGSAQSQTVAN--SHSEALPGAKNKVKSNSNV---------VPKAEAGVGACPQSVPASQGTALTGTK 697
Cdd:pfam17823 233 NSSPAAGTVTAAVGTVTPAALATlaAAAGTVASAAGTINMGDPHarrlspakhMPSDTMARNPAAPMGAQAQGPIIQVST 312
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569011201 698 TKVKGNSSAVSKPDAGAGTMGSAHAKTAANSQGETLPGSKNKVKGNSNA---------VPKAEAGAGTTEPNQ------- 761
Cdd:pfam17823 313 DQPVHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASpvpvlhtsmIPEVEATSPTTQPSPllptqga 392
|
330 340 350 360
....*....|....*....|....*....|....*....|..
gi 569011201 762 -----PQAEALLGARNKVKGNSNSVPKAESGASTILALASSQ 798
Cdd:pfam17823 393 agpgiLLAPEQVATEATAGTASAGPTPRSSGDPKTLAMASCQ 434
|
|
|