Conserved domains on  [gi|595582345|ref|NP_001277699|]

trophinin isoform 1 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
859-2081 6.57e-33

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 140.67  E-value: 6.57e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  859 NADPTTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 938
Cdd:COG3210   294 DTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLT 373
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  939 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 1018
Cdd:COG3210   374 TAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSG 453
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1019 ISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSF 1098
Cdd:COG3210   454 TTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTG 533
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1099 STASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPT 1178
Cdd:COG3210   534 GDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGT 613
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1179 TSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVC 1258
Cdd:COG3210   614 ITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNA 693
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1259 FGSSPYSGAGFGGTLST-SISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFG-----GTLSTSVSFGGSSGANAGFGGT 1332
Cdd:COG3210   694 ATGGTLNNAGNTLTISTgSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGvtitsGNAGTLSIGLTANTTASGTTLT 773
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1333 LNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLN 1412
Cdd:COG3210   774 LANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTT 853
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1413 GSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALN 1492
Cdd:COG3210   854 SDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAA 933
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1493 NSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLN 1572
Cdd:COG3210   934 GGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSG 1013
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1573 NSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAIS 1652
Cdd:COG3210  1014 AIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHT 1093
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1653 NSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNST 1732
Cdd:COG3210  1094 LGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAAT 1173
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1733 NCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFS 1812
Cdd:COG3210  1174 TTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGT 1253
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1813 GGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFS 1892
Cdd:COG3210  1254 GDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNS 1333
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1893 VSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGP 1972
Cdd:COG3210  1334 GGVNAGGGTINTTAANTGLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGN 1413
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1973 STGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFG 2052
Cdd:COG3210  1414 NGGVSGTTVAGTTGSSATTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAG 1493
                        1210      1220
                  ....*....|....*....|....*....
gi 595582345 2053 GGLNTSAGFSGGPPSTGTGFGGGASSHGG 2081
Cdd:COG3210  1494 VAGATASNGGTSTGAGGTAGGTTAEVAKA 1522
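
The summary line for this hit packs several values into one string. As a minimal sketch (not part of the CD-Search output), the Python below parses such a line and computes how much of the COG3210 model is covered by the aligned model positions 294-1522. The names `parse_hit_summary` and `model_coverage`, and the regular expression itself, are assumptions based only on the line format shown on this page.

```python
import re

# Hypothetical pattern for a hit summary line as it appears on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return the fields of one summary line as a dict of typed values."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by the aligned model positions."""
    return (model_end - model_start + 1) / cd_length


line = ("Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  "
        "Bit Score: 140.67  E-value: 6.57e-33")
hit = parse_hit_summary(line)
# Model positions 294-1522 are taken from the Cdd:COG3210 alignment rows.
coverage = model_coverage(294, 1522, hit["cd_length"])
print(hit)
print(f"model coverage: {coverage:.3f}")
```

For this hit the aligned span covers 1229 of the model's 1698 positions, so the match is to part of the model rather than to its full length.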
MAGE pfam01454
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
596-756 2.13e-24

MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.


Pssm-ID: 426270  Cd Length: 205  Bit Score: 103.12  E-value: 2.13e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   596 LVKYLLVKDQTKIPIKRSDMLKDVIQEYE-DYFPEIIERASYALEKMFRVNLKEID--------------------KQNN 654
Cdd:pfam01454    1 LVRYALACEYQRTPIRREDISKKVLGENRkRLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   655 LYILIST---QESSAGIMGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLH---PGVKHSLFG 719
Cdd:pfam01454   81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 595582345   720 EVKKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 756
Cdd:pfam01454  161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
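
A rough identity figure for an alignment block can be counted directly from its two rows. The sketch below is illustrative only: the function name `aligned_identity` is hypothetical, and it assumes that lowercase letters mark unaligned residues and that `-` marks gaps, so both kinds of column are skipped. It is applied to the last MAGE block (query 720-756 against pfam01454 161-198).

```python
def aligned_identity(query_row, model_row):
    """Count (identical, aligned) columns in one pair of alignment rows.

    Columns containing a gap ('-') or a lowercase (assumed unaligned)
    residue in either row are ignored.
    """
    if len(query_row) != len(model_row):
        raise ValueError("rows must have the same number of columns")
    identical = aligned = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


# Rows copied from the final MAGE alignment block on this page.
query = "EVKKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS"
model = "NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA"
identical, aligned = aligned_identity(query, model)
print(f"{identical}/{aligned} identical aligned columns")
```

On this block the count is 10 identical residues out of 36 aligned columns. The E-value reported for the hit is derived from the profile score, not from a simple count like this one.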
MscS_porin super family cl25507
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ...
268-445 2.01e-09

Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin is towards the N-terminal of the molecules. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.


The actual alignment was detected with superfamily member pfam12795:

Pssm-ID: 432790 [Multi-domain]  Cd Length: 238  Bit Score: 60.01  E-value: 2.01e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   268 QTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASnrqigASNRQTE 347
Cdd:pfam12795   10 LDEAAKKKLLQDLQQALSLLDKIDASKQRAAAYQKALDDAPAELRELRQELAALQAKAEAAPKEILAS-----LSLEELE 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   348 vsSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASsrQTEASSRQTEASS 427
Cdd:pfam12795   85 --QRLLQTSAQLQELQNQLAQLNSQLIELQTRPERAQQQLSEARQRLQQIRNRLNGPAPPGEPLS--EAQRWALQAELAA 160
                          170
                   ....*....|....*...
gi 595582345   428 RQTEASSRQIEASAAAVR 445
Cdd:pfam12795  161 LKAQIDMLEQELLSNNNR 178
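
Each alignment row carries its own coordinates, which allows a simple consistency check: the number of residues in the row, gap characters excluded, should equal end minus start plus one. The sketch below assumes rows are whitespace-separated as on this page; `parse_alignment_row` and `row_is_consistent` are hypothetical helper names.

```python
def parse_alignment_row(row):
    """Split a row such as 'gi 595582345   428 RQTE...AAVR 445'.

    The last three whitespace-separated fields are taken to be
    start, sequence, end; everything before them is the label.
    """
    parts = row.split()
    if len(parts) < 4:
        raise ValueError(f"not an alignment row: {row!r}")
    start, seq, end = int(parts[-3]), parts[-2], int(parts[-1])
    label = " ".join(parts[:-3])
    return label, start, seq, end


def row_is_consistent(row):
    """True if the residue count (gaps excluded) matches the coordinates."""
    _, start, seq, end = parse_alignment_row(row)
    residues = sum(1 for ch in seq if ch != "-")
    return residues == end - start + 1


rows = [
    "gi 595582345   428 RQTEASSRQIEASAAAVR 445",
    "Cdd:pfam12795  161 LKAQIDMLEQELLSNNNR 178",
    "Cdd:pfam01454  161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198",
]
for row in rows:
    label, start, seq, end = parse_alignment_row(row)
    print(label, start, end, row_is_consistent(row))
```

A row that fails this check usually indicates that the text was truncated or altered during extraction.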
growth_prot_Scy super family cl49463
polarized growth protein Scy;
97-524 1.17e-07

polarized growth protein Scy;


The actual alignment was detected with superfamily member NF041483:

Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 57.53  E-value: 1.17e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   97 SQASATTEAPNIQASVTSQTQKAKTMRVTPKVSLTGSEDATtqlkpplQALNLPVTTPTIQTPVANESANSLAS--TAVN 174
Cdd:NF041483  293 AKQLASAESANEQRTRTAKEEIARLVGEATKEAEALKAEAE-------QALADARAEAEKLVAEAAEKARTVAAedTAAQ 365
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  175 KSKKASTANNAANKTVPSAAEISLAsAATHTVTTQGQAAKETGSIQTIAATA------RSKKNSKGKRtpAKTTNTDNEY 248
Cdd:NF041483  366 LAKAARTAEEVLTKASEDAKATTRA-AAEEAERIRREAEAEADRLRGEAADQaeqlkgAAKDDTKEYR--AKTVELQEEA 442
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  249 V----EA----SNAIEASSRQIGASGRqtEASnRQIEASSRQTEA-------------SNRQTEASSRQTEASSRQT--- 304
Cdd:NF041483  443 RrlrgEAeqlrAEAVAEGERIRGEARR--EAV-QQIEEAARTAEElltkakadadelrSTATAESERVRTEAIERATtlr 519
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  305 ----ETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIgasnrqTEVSSRQIEAsnrqigasnRQTEASNRqigASNRQ 380
Cdd:NF041483  520 rqaeETLERTRAEAERLRAEAEEQAEEVRAAAERAAREL------REETERAIAA---------RQAEAAEE---LTRLH 581
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  381 TEASNRQIGASNRQTDASN------RQT-DASNRQ-TEASSR--------QTEASSRQTEASSrqtEASSRQIEASAAAV 444
Cdd:NF041483  582 TEAEERLTAAEEALADARAeaerirREAaEETERLrTEAAERirtlqaqaEQEAERLRTEAAA---DASAARAEGENVAV 658
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  445 RPKkprgkkgnnkgSNSASEPSeappaiqtvtnhalsvtvriRRGSRARKAANKNRAtESQAQIAEQGAQASEASISALE 524
Cdd:NF041483  659 RLR-----------SEAAAEAE--------------------RLKSEAQESADRVRA-EAAAAAERVGTEAAEALAAAQE 706
 
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1434-1791 3.72e-20

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which has a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 98.15  E-value: 3.72e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1434 SAGFGGAMNtnATFGGALNSNAGFGGaistSTNFGGALNNSAGFGGAMNTSASFGgalnNSAGFGGAISTNATFGGALNN 1513
Cdd:NF033849  220 SISFGVSLP--MMYAANLGQSAGTGY----GESVGHSTSQGQSHSVGTSESHSVG----TSQSQSHTTGHGSTRGWSHTQ 289
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1514 SAGFGGAISTNATFGGALNNSAGF--GGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGAL 1591
Cdd:NF033849  290 STSESESTGQSSSVGTSESQSHGTteGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGH 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1592 TNSAGFGGAISTSASFGGALNnsAGFGGAIstsasfGGALNNSAGFGGAISTNASFGGAISNSpDFGGAFSTSVGFG--- 1668
Cdd:NF033849  370 STSSSVSSSESSSRSSSSGVS--GGFSGGI------AGGGVTSEGLGASQGGSEGWGSGDSVQ-SVSQSYGSSSSTGtss 440
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1669 GTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSlcfGSASNTnlcFGGSNSTncfsgatsanfNEGHS 1748
Cdd:NF033849  441 GHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTS---QSETDS---VGDSTGT-----------SESVS 503
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|...
gi 595582345 1749 ISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASF 1791
Cdd:NF033849  504 QGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLGKSY 546
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1494-1832 4.41e-20

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which has a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 98.15  E-value: 4.41e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1494 SAGFGgaISTNATFGGALNNSAG------FGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASF 1567
Cdd:NF033849  220 SISFG--VSLPMMYAANLGQSAGtgygesVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1568 GgvLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASF 1647
Cdd:NF033849  298 G--QSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSV 375
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1648 GGAISNSPDFggAFSTSVGFGGTLNttdfgsthsnsisfgSAPTTSVSFGGSHSTNLCFGGApstslcfGSASNTNLCFG 1727
Cdd:NF033849  376 SSSESSSRSS--SSGVSGGFSGGIA---------------GGGVTSEGLGASQGGSEGWGSG-------DSVQSVSQSYG 431
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1728 GSNSTNCFSGATSanfNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTS----------TGFGGSLGPSASFNGGLGT 1797
Cdd:NF033849  432 SSSSTGTSSGHSD---SSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSeswstsqsetDSVGDSTGTSESVSQGDGR 508
                         330       340       350
                  ....*....|....*....|....*....|....*
gi 595582345 1798 STGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFN 1832
Cdd:NF033849  509 STGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1454-1821 9.74e-18

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which has a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 90.45  E-value: 9.74e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1454 NAGFGgaISTSTNFGGALNNSAGFGGAMNTSASFggalnnSAGFGGAISTNATFGGAlnNSAGFGGAISTNATFGGALNN 1533
Cdd:NF033849  220 SISFG--VSLPMMYAANLGQSAGTGYGESVGHST------SQGQSHSVGTSESHSVG--TSQSQSHTTGHGSTRGWSHTQ 289
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1534 SAGFGGAISTSASFGG--TLNNSASFGGAINTSASFggvlnnSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGAL 1611
Cdd:NF033849  290 STSESESTGQSSSVGTseSQSHGTTEGTSTTDSSSH------SQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSEST 363
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1612 NNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSpdFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSapT 1691
Cdd:NF033849  364 GTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEG--LGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT--S 439
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1692 TSVSFGGSHSTNLcfGGAPSTSLCFGSASNTnlcfgGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGF 1771
Cdd:NF033849  440 SGHSDSSSHSTSS--GQADSVSQGTSWSEGT-----GTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGR 512
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|
gi 595582345 1772 GSSLGTStgfggslgpsasfnggLGTSTGFGGGLGTSTDFSGGLNHNADF 1821
Cdd:NF033849  513 SESQGTS----------------LGTSGGRTSGAGGSMGLGPSISLGKSY 546
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1592-1958 7.29e-17

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which has a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 87.37  E-value: 7.29e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1592 TNSAGFGgaISTSASFGGALNNSAG--FGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGG 1669
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQSAGtgYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESE 295
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1670 TLNTTD-FGSTHSNSISFGSAPTTSVSFGGSHSTnlcfggapSTSLCFGSASNTNLCFGGSNStncfsgaTSANFNEGHS 1748
Cdd:NF033849  296 STGQSSsVGTSESQSHGTTEGTSTTDSSSHSQSS--------SYNVSSGTGVSSSHSDGTSQS-------TSISHSESSS 360
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1749 ISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGglnhnadFNGGLGNS 1828
Cdd:NF033849  361 ESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQS-------VSQSYGSS 433
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1829 AGFngglntntdfggelGTSAGFGDGLGSSTSFG--AGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGPN 1906
Cdd:NF033849  434 SST--------------GTSSGHSDSSSHSTSSGqaDSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTS 499
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|..
gi 595582345 1907 ASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSngpssiVGFsgGPSTGAG 1958
Cdd:NF033849  500 ESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGS------MGL--GPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1574-1873 1.01e-16

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which has a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 86.98  E-value: 1.01e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1574 SAGFGgaINTSANFGGALTNSAG------FGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASF 1647
Cdd:NF033849  220 SISFG--VSLPMMYAANLGQSAGtgygesVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1648 GGAISNspdfGGAFSTSVGFGGTLNTTDfGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNlcfG 1727
Cdd:NF033849  298 GQSSSV----GTSESQSHGTTEGTSTTD-SSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVG---H 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1728 GSNSTNCFSGATSANFNEGHSISFGNGLS----TSAGFGNGLGTSAGFGSSLG---TSTGFGGSLGPSASF----NGGLG 1796
Cdd:NF033849  370 STSSSVSSSESSSRSSSSGVSGGFSGGIAgggvTSEGLGASQGGSEGWGSGDSvqsVSQSYGSSSSTGTSSghsdSSSHS 449
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 595582345 1797 TSTGFGGGLGTSTDFSGGLNHNAdfNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAG 1873
Cdd:NF033849  450 TSSGQADSVSQGTSWSEGTGTSQ--GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSG 524
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1222-1547 1.41e-15

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 83.13  E-value: 1.41e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1222 ALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVcfgsspysgagfggTLSTSISFGGSPSTNTGFGGTLSTSVSFGA 1301
Cdd:NF033849  252 SQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSH--------------TQSTSESESTGQSSSVGTSESQSHGTTEGT 317
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1302 SSSTSSDFGGTLSTSVSFGgssganAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGgai 1381
Cdd:NF033849  318 STTDSSSHSQSSSYNVSSG------TGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG--- 388
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1382 nTSAGFGSTLNSSASFGSALSTSASfggvlnGSAGFGGAlntnatfGGVLNGSAGFGGAMNTNATFGGALNS--NAGFGG 1459
Cdd:NF033849  389 -VSGGFSGGIAGGGVTSEGLGASQG------GSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQ 454
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1460 AISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSA---- 1535
Cdd:NF033849  455 ADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAggsm 534
                         330
                  ....*....|..
gi 595582345 1536 GFGGAISTSASF 1547
Cdd:NF033849  535 GLGPSISLGKSY 546
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1339-1647 3.50e-12

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 71.96  E-value: 3.50e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1339 FGGAISTSTGFGSalnnSANFGGAISTSFsgvlnsSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFG 1418
Cdd:NF033849  231 YAANLGQSAGTGY----GESVGHSTSQGQ------SHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQS 300
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1419 GALNTNATFG-GVLNG-----SAGFGGAMNTNATFG----GALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFG 1488
Cdd:NF033849  301 SSVGTSESQShGTTEGtsttdSSSHSQSSSYNVSSGtgvsSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1489 GALNNSAGFGGAISTNATFGGALnnSAGFGGAISTNATFG---GALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSA 1565
Cdd:NF033849  381 SSRSSSSGVSGGFSGGIAGGGVT--SEGLGASQGGSEGWGsgdSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSV 458
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1566 SFGGVL--NNSAGFGGAINTSANFG--GALTNSAGF--GGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGG 1639
Cdd:NF033849  459 SQGTSWseGTGTSQGQSVGTSESWStsQSETDSVGDstGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGP 538

                  ....*...
gi 595582345 1640 AISTNASF 1647
Cdd:NF033849  539 SISLGKSY 546
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1356-1702 7.74e-12

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 71.19  E-value: 7.74e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1356 SANFGGAISTSFSGvlNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSA 1435
Cdd:NF033849  220 SISFGVSLPMMYAA--NLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1436 GFGGAMNTNATfggaLNSNAGFGGAISTSTNF--GGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNN 1513
Cdd:NF033849  298 GQSSSVGTSES----QSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSS 373
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1514 SAGFGGAISTNATFGgalnNSAGFGGAIS----TSASFGGTLNNSASFGgaintsaSFGGVLNNSAGFGGAINTSANFGG 1589
Cdd:NF033849  374 SVSSSESSSRSSSSG----VSGGFSGGIAgggvTSEGLGASQGGSEGWG-------SGDSVQSVSQSYGSSSSTGTSSGH 442
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1590 ALTNSAGFGgaISTSASFGGALNNSAGFGGAISTSASfggalnNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFG- 1668
Cdd:NF033849  443 SDSSSHSTS--SGQADSVSQGTSWSEGTGTSQGQSVG------TSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSe 514
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 595582345 1669 --GTLNTTDFGSTHSNSISFGSAPttSVSFGGSHST 1702
Cdd:NF033849  515 sqGTSLGTSGGRTSGAGGSMGLGP--SISLGKSYQW 548
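
In the alignment rows of this listing, each row gives a label, a start coordinate, the aligned string, and an end coordinate. A quick consistency check is that the number of residues in the aligned string equals end - start + 1. The sketch below assumes that `-` is the only gap character and that lowercase letters are still residues, which matches the rows shown here; `check_row` is a hypothetical helper name.

```python
def check_row(row):
    """Return (label, start, end, ok) for one alignment row.

    ok is True when the count of non-gap characters in the aligned
    string equals end - start + 1.
    """
    parts = row.split()
    # The last three whitespace-separated fields are start, aligned string, end;
    # everything before them is the row label ("gi 595582345" or "Cdd:NF033849").
    start, aligned, end = int(parts[-3]), parts[-2], int(parts[-1])
    residues = sum(1 for ch in aligned if ch != "-")  # "-" assumed to be the only gap symbol
    label = " ".join(parts[:-3])
    return label, start, end, residues == end - start + 1


# Example: the final query row of the 1356-1702 hit.
result = check_row("gi 595582345 1669 --GTLNTTDFGSTHSNSISFGSAPttSVSFGGSHST 1702")
```

A row that fails this check usually indicates that the text was truncated or re-wrapped when the page was copied.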
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1736-2064 2.07e-10

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 66.18  E-value: 2.07e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1736 SGATSANFNEGHSISFGNGLSTSAgfGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGL 1815
Cdd:NF033849  216 QGQKSISFGVSLPMMYAANLGQSA--GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSE 293
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1816 NHNAdfngGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTS--DGFAGNLGTNTGFGGTLGTGAGFSV 1893
Cdd:NF033849  294 SEST----GQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSShsDGTSQSTSISHSESSSESTGTSVGH 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1894 SLNNGNGFGNGPNASFNRGLNTiiGFGSGSNTSnGFTGEpntGSSFSNGPSSIVGFSGG-PSTGAGFCSGPSTGGFGGGP 1972
Cdd:NF033849  370 STSSSVSSSESSSRSSSSGVSG--GFSGGIAGG-GVTSE---GLGASQGGSEGWGSGDSvQSVSQSYGSSSSTGTSSGHS 443
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1973 STGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFG 2052
Cdd:NF033849  444 DSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTS 523
                         330
                  ....*....|..
gi 595582345 2053 GGLNTSAGFSGG 2064
Cdd:NF033849  524 GGRTSGAGGSMG 535
MscS_porin pfam12795
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ...
268-445 2.01e-09

Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin domain lies towards the N-terminus of the molecule. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.


Pssm-ID: 432790 [Multi-domain]  Cd Length: 238  Bit Score: 60.01  E-value: 2.01e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   268 QTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASnrqigASNRQTE 347
Cdd:pfam12795   10 LDEAAKKKLLQDLQQALSLLDKIDASKQRAAAYQKALDDAPAELRELRQELAALQAKAEAAPKEILAS-----LSLEELE 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   348 vsSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASsrQTEASSRQTEASS 427
Cdd:pfam12795   85 --QRLLQTSAQLQELQNQLAQLNSQLIELQTRPERAQQQLSEARQRLQQIRNRLNGPAPPGEPLS--EAQRWALQAELAA 160
                          170
                   ....*....|....*...
gi 595582345   428 RQTEASSRQIEASAAAVR 445
Cdd:pfam12795  161 LKAQIDMLEQELLSNNNR 178
auto_AIDA-I NF033176
autotransporter adhesin AIDA-I;
1318-1785 1.30e-08

autotransporter adhesin AIDA-I;


Pssm-ID: 380183 [Multi-domain]  Cd Length: 1287  Bit Score: 60.44  E-value: 1.30e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1318 SFGGSSGANAGFGGTLNSSTsfGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSaSF 1397
Cdd:NF033176   72 SNGQTSNATVNSGGIQNVNN--GGKTTSTTVNSSGAQNVGNSGTAISTIVNSGGVQRVSSGGVTSATSLSGGAQNIY-NL 148
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1398 GSALSTSASFGGVLNGSAGfGGALNTNATFGGVLNGSAGfGGAMNTNATFGGALNSNAGfGGAISTSTNFGGALNNSAGf 1477
Cdd:NF033176  149 GHASNTVIFNGGNQTIFSG-GISDDTNISSGGQQRVSSG-GVASNTTINSSGTQNILSG-GSTVSTHISSGGNQYISAG- 224
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1478 GGAMNTSASFGGALNNSAgfgGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGfGGAISTSASFGGTLNNSaSF 1557
Cdd:NF033176  225 GNASATVVSSGGFQRVSS---GGTATGTVLSGGTQNVSSGGSAISTSVYSSGVQTVYAG-ATVTDTTVNSGGKQNIS-SG 299
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1558 GGAINTSASFGGVLNNsagFGGAINTSANFGGALTNSAGfGGAISTSASFGGALNNSAGfGGAISTSASFGGALNNSAGf 1637
Cdd:NF033176  300 GIVSGTIVNSSGTQNI---YSGGSALSANIKGSQIVNSD-GTAINTLVNDGGYQHIRNG-GVASGTIINQSGRVNISSG- 373
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1638 GGAISTNASFGGAISNSPDfGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSapTTSVSFGGSHSTNLCFGGAPSTSLCFG 1717
Cdd:NF033176  374 GYAESTIINSGGTQSVLSG-GYASGTLINNSGRENVSNGGSAYNTIINAGG--NQYIYSNGEASGTTVNTSGFQRVNSGG 450
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 595582345 1718 SASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSL 1785
Cdd:NF033176  451 TATGTKLSGGNQNVSSGGKAIAAEVYSGGKQTVYAGGEASGTQIFDGGVVNVSGGSVSGASVNLNGRL 518
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
262-468 5.43e-08

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 57.22  E-value: 5.43e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  262 IGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGA 341
Cdd:COG4372    26 IAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELES 105
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  342 SNRQTEVSSRQIEASNRQIGASNRQTEASNRQIgasnrqteaSNRQIGASNRQTDASNRQTDASNRQTEASSRQTEASSR 421
Cdd:COG4372   106 LQEEAEELQEELEELQKERQDLEQQRKQLEAQI---------AELQSEIAEREEELKELEEQLESLQEELAALEQELQAL 176
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 595582345  422 QTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSEA 468
Cdd:COG4372   177 SEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEA 223
growth_prot_Scy NF041483
polarized growth protein Scy;
97-524 1.17e-07

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 57.53  E-value: 1.17e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   97 SQASATTEAPNIQASVTSQTQKAKTMRVTPKVSLTGSEDATtqlkpplQALNLPVTTPTIQTPVANESANSLAS--TAVN 174
Cdd:NF041483  293 AKQLASAESANEQRTRTAKEEIARLVGEATKEAEALKAEAE-------QALADARAEAEKLVAEAAEKARTVAAedTAAQ 365
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  175 KSKKASTANNAANKTVPSAAEISLAsAATHTVTTQGQAAKETGSIQTIAATA------RSKKNSKGKRtpAKTTNTDNEY 248
Cdd:NF041483  366 LAKAARTAEEVLTKASEDAKATTRA-AAEEAERIRREAEAEADRLRGEAADQaeqlkgAAKDDTKEYR--AKTVELQEEA 442
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  249 V----EA----SNAIEASSRQIGASGRqtEASnRQIEASSRQTEA-------------SNRQTEASSRQTEASSRQT--- 304
Cdd:NF041483  443 RrlrgEAeqlrAEAVAEGERIRGEARR--EAV-QQIEEAARTAEElltkakadadelrSTATAESERVRTEAIERATtlr 519
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  305 ----ETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIgasnrqTEVSSRQIEAsnrqigasnRQTEASNRqigASNRQ 380
Cdd:NF041483  520 rqaeETLERTRAEAERLRAEAEEQAEEVRAAAERAAREL------REETERAIAA---------RQAEAAEE---LTRLH 581
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  381 TEASNRQIGASNRQTDASN------RQT-DASNRQ-TEASSR--------QTEASSRQTEASSrqtEASSRQIEASAAAV 444
Cdd:NF041483  582 TEAEERLTAAEEALADARAeaerirREAaEETERLrTEAAERirtlqaqaEQEAERLRTEAAA---DASAARAEGENVAV 658
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  445 RPKkprgkkgnnkgSNSASEPSeappaiqtvtnhalsvtvriRRGSRARKAANKNRAtESQAQIAEQGAQASEASISALE 524
Cdd:NF041483  659 RLR-----------SEAAAEAE--------------------RLKSEAQESADRVRA-EAAAAAERVGTEAAEALAAAQE 706
PTZ00121 PTZ00121
MAEBL; Provisional
250-524 7.08e-07

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 54.76  E-value: 7.08e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  250 EASNAIEASSRqiGASGRQTEASNRQIEAssRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASN 329
Cdd:PTZ00121 1197 EDARKAEAARK--AEEERKAEEARKAEDA--KKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAI 1272
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  330 RQIE---------ASNRQIGASNRQTEVSSRQIEASN-----RQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQT 395
Cdd:PTZ00121 1273 KAEEarkadelkkAEEKKKADEAKKAEEKKKADEAKKkaeeaKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEA 1352
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  396 DASNRQTDASNRQTEASSRQTEASSRQTEASSRQTEASSRQIEASAAAVRPKKP----RGKKGNNKGSNSASEPSEAPPA 471
Cdd:PTZ00121 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKadelKKAAAAKKKADEAKKKAEEKKK 1432
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|...
gi 595582345  472 IQTVTNHALSVtvriRRGSRARKAANKNRATESQAQIAEQGAQASEASISALE 524
Cdd:PTZ00121 1433 ADEAKKKAEEA----KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1028-1427 1.66e-05

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 50.39  E-value: 1.66e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1028 GRNSITFG-SVPNT-SANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGgapststsfstasISF 1105
Cdd:NF033849  217 GQKSISFGvSLPMMyAANLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHG-------------STR 283
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1106 GGApststslstasisfggapststsfstasisfggapststslstasisfggapsiNSSSGGSSVSFGGAPTTSTSFSG 1185
Cdd:NF033849  284 GWS------------------------------------------------------HTQSTSESESTGQSSSVGTSESQ 309
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1186 GPCISFGgapcTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYS 1265
Cdd:NF033849  310 SHGTTEG----TSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSS 385
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1266 GAGFGGTLSTSIsfGGSPSTNTGFGGTLSTSVSFGASSSTSSdFGGTLSTSVSFGGSSGA--NAGFGGTLNSSTSFGGAI 1343
Cdd:NF033849  386 SSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWGSGDSVQS-VSQSYGSSSSTGTSSGHsdSSSHSTSSGQADSVSQGT 462
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1344 STSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNT 1423
Cdd:NF033849  463 SWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISL 542

                  ....
gi 595582345 1424 NATF 1427
Cdd:NF033849  543 GKSY 546
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
206-525 7.28e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 48.13  E-value: 7.28e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   206 VTTQGQAAKETGSIqtiaATARSKKNSKgkrtpakTTNTDNEYVEASNAIEASSRQIgasgrqTEASNRQIEAssrQTEA 285
Cdd:TIGR02168  648 VTLDGDLVRPGGVI----TGGSAKTNSS-------ILERRREIEELEEKIEELEEKI------AELEKALAEL---RKEL 707
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   286 SNRQTEASSRQteassRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNR 365
Cdd:TIGR02168  708 EELEEELEQLR-----KELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEA 782
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   366 QTEASNRQIGASNRQTEASNRQIGASNRQTDASNRqtdasnRQTEASSRQtEASSRQTEASSRQTEASSRQIEASAAAVr 445
Cdd:TIGR02168  783 EIEELEAQIEQLKEELKALREALDELRAELTLLNE------EAANLRERL-ESLERRIAATERRLEDLEEQIEELSEDI- 854
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   446 pkkprgkKGNNKgsnSASEPSEAPPAIQTVTNHAL----SVTVRIRRG-SRARKAANKNRATESQAQIAEQGAQASEASI 520
Cdd:TIGR02168  855 -------ESLAA---EIEELEELIEELESELEALLneraSLEEALALLrSELEELSEELRELESKRSELRRELEELREKL 924

                   ....*
gi 595582345   521 SALET 525
Cdd:TIGR02168  925 AQLEL 929
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
869-1089 2.79e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 46.15  E-value: 2.79e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  869 NQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGfggiSNPSGGFGGisnpSGGFGGISNPSGGFGGISNPSGGFGG 948
Cdd:NF033849  310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDG----TSQSTSISH----SESSSESTGTSVGHSTSSSVSSSESS 381
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  949 ISNPSGGFggisnpSGGFGGISNPSGGFggisnpSGGFGGISNPSGGFGGisnpSGGFGGISNPSGGFGGISNPSG-GFG 1027
Cdd:NF033849  382 SRSSSSGV------SGGFSGGIAGGGVT------SEGLGASQGGSEGWGS----GDSVQSVSQSYGSSSSTGTSSGhSDS 445
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 595582345 1028 GRNSITFGsvpnTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFG 1089
Cdd:NF033849  446 SSHSTSSG----QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVS 503
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1239-1459 2.86e-04

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 45.81  E-value: 2.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  1239 FSGAVSTTTGFGGTLSTSVCFGSSPYS--GAGFGGTLSTSISFGGSPSTNTGFGGTLstsvsFGASSSTSSDFGGTLSTS 1316
Cdd:pfam15967    6 FGGGPGSTATAGGGFSFGAAAASNPGStgGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASST 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  1317 VSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSAs 1396
Cdd:pfam15967   81 AATGPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTT- 159
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 595582345  1397 fgsALSTSASFGGVLNgsaGFGGALNTNATFGGVLNGSAGFGGAMNTNATFgGALNSNAGFGG 1459
Cdd:pfam15967  160 ---AVSTGLSLGSTLT---SLGGSLFQNTNSTGLGQTTLGLTLLATSTAPV-SAPAASEGLGG 215
PHA02515 PHA02515
hypothetical protein; Provisional
1294-1505 3.67e-04

hypothetical protein; Provisional


Pssm-ID: 107197 [Multi-domain]  Cd Length: 508  Bit Score: 45.54  E-value: 3.67e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1294 STSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGgtlNSSTSFGGAISTSTGFGSALNNSANFG--GAISTSFSGVL 1371
Cdd:PHA02515  175 TVAASVGAVDTVAGDLGGTWAAGVSYDFGSIAVPPIG---NTSPPGGNIVIVANSIGNVDTVAENIGdvSTVSTHLSSML 251
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1372 -------------------NSSASFGGAINTSAGFGSTLNSSASfgSALSTSASFGGVLNGSAGFGGALNTNATFGGVLN 1432
Cdd:PHA02515  252 avandidsvvsvagdleniDAVADNAANINTVAGANANVNTVAS--NILDVGTVAGNIDDVQAVAGNAANINVVADNADN 329
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 595582345 1433 GSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGA--LNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNA 1505
Cdd:PHA02515  330 INATAANQANINAAVGNADNINAAVANQANINAVVGNAnnINAVAANEGNVNTVVDNLADVQTVAGIAADVSTVA 404
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
272-490 3.75e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 45.67  E-value: 3.75e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  272 SNRQIEASSRQ-TEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTEVSS 350
Cdd:NF033609   33 SSKEADASENSvTQSDSASNESKSNDSSSVSAAPKTDDTNVSDTKTSSNTNNGETSVAQNPAQQETTQSASTNATTEETP 112
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  351 RQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSR--QTEASSRQTEASSR 428
Cdd:NF033609  113 VTGEATTTATNQANTPATTQSSNTNAEELVNQTSNETTSNDTNTVSSVNSPQNSTNAENVSTTQdtSTEATPSNNESAPQ 192
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 595582345  429 QTEASSRQIeaSAAAVRPKKPRgkkgnNKGSNSASEPSEAPPAIQTVTNHALSVTVRIRRGS 490
Cdd:NF033609  193 STDASNKDV--VNQAVNTSAPR-----MRAFSLAAVAADAPAAGTDITNQLTNVTVGIDSGT 247
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
20-243 5.51e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.83  E-value: 5.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345    20 PAGSLGLPFSPDVQSETT---EKDPPIASRSKKNKNKKNSIKPMDKTTPAPPPVPSANDNASNKPKVTLQALNLPMFTQI 96
Cdd:pfam05109  442 PNTTTGLPSSTHVPTNLTapaSTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNAT 521
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345    97 SQASA-TTEAPNIQASVTSQTQKAKTMrVTPKVSLTGSEDATTQLKPPLQALNLPVTTPTiqTPVANESANSLASTAVNK 175
Cdd:pfam05109  522 SPTPAvTTPTPNATSPTLGKTSPTSAV-TTPTPNATSPTPAVTTPTPNATIPTLGKTSPT--SAVTTPTPNATSPTVGET 598
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 595582345   176 SKKASTANNAANKTVPSAAEISLASAATHTVTTqGQAAKETGSIQTIAATARSKKNSKGKRTPAKTTN 243
Cdd:pfam05109  599 SPQANTTNHTLGGTSSTPVVTSPPKNATSAVTT-GQHNITSSSTSSMSLRPSSISETLSPSTSDNSTS 665
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
1632-1860 6.99e-03

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and may therefore contribute to inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 41.14  E-value: 6.99e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1632 NNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNlcfggaPS 1711
Cdd:cd21118   133 QGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQGAVAQPGYGTVRGNNQNSGCTN------PP 206
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1712 TSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASF 1791
Cdd:cd21118   207 PSGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSGGSNGGSSGNSGSGSGGSSSGGSN 286
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 595582345 1792 NGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTS 1860
Cdd:cd21118   287 GWGGSSSSGGSGGSGGGNKPECNNPGNDVRMAGGGGSQGSKESSGSHGSNGGNGQAEAVGGLNTLNSDA 355
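
Each hit in this listing carries a statistics line (Pssm-ID, Cd Length, Bit Score, E-value) and a query interval. The following Python sketch shows one way such lines could be read programmatically. The field layout is inferred from the lines in this listing rather than from an NCBI specification, and the names `STATS_RE`, `parse_stats_line`, and `interval_length` are hypothetical helpers introduced only for illustration.

```python
import re

# Pattern for the per-hit statistics line as it appears in this listing.
# Assumption: the layout is inferred from the page text, not from an official format definition.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<hit_type>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats_line(line):
    """Return the statistics of one domain hit as a dict, or None if the line does not match."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "hit_type": match.group("hit_type"),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def interval_length(interval):
    """Length in residues of an inclusive query interval such as '1632-1860'."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1


# Example using the dermokine hit shown above.
hit = parse_stats_line(
    "Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 41.14  E-value: 6.99e-03"
)
print(hit)
print(interval_length("1632-1860"))
```

Treating the interval as inclusive matches the alignment rulers, which count one column per aligned query residue when the query row contains no gaps.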
 
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
863-2078 1.01e-31

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 136.43  E-value: 1.01e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  863 TTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 942
Cdd:COG3210   289 GASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGN 368
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  943 SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 1022
Cdd:COG3210   369 GGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGG 448
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1023 SGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSFSTAS 1102
Cdd:COG3210   449 LTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNAT 528
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1103 ISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPTTSTS 1182
Cdd:COG3210   529 SGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSA 608
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1183 FSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSS 1262
Cdd:COG3210   609 GATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTG 688
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1263 PYSGAGFGGTLSTS---ISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDfggtlSTSVSFGGSSGANAGFGGTLNSSTSF 1339
Cdd:COG3210   689 TTLNAATGGTLNNAgntLTISTGSITVTGQIGALANANGDTVTFGNLGT-----GATLTLNAGVTITSGNAGTLSIGLTA 763
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1340 GGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGG 1419
Cdd:COG3210   764 NTTASGTTLTLANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGS 843
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1420 ALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTS--TNFGGALNNSAGFGGAMNTSASFGGALNNSAGF 1497
Cdd:COG3210   844 NTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTgtANAGTLTNLGTTTNAASGNGAVLATVTATGTGG 923
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1498 GGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGF 1577
Cdd:COG3210   924 GGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSG 1003
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1578 GGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDF 1657
Cdd:COG3210  1004 TTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTA 1083
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1658 GGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSG 1737
Cdd:COG3210  1084 QASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASA 1163
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1738 ATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLNH 1817
Cdd:COG3210  1164 GDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGS 1243
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1818 NADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNN 1897
Cdd:COG3210  1244 FVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTA 1323
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1898 GNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGPSTGPG 1977
Cdd:COG3210  1324 TGTAVAAVNSGGVNAGGGTINTTAANTGLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGG 1403
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1978 FGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFGGGLNT 2057
Cdd:COG3210  1404 VTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIG 1483
                        1210      1220
                  ....*....|....*....|.
gi 595582345 2058 SAGFSGGPPSTGTGFGGGASS 2078
Cdd:COG3210  1484 GTTTGGNGAGVAGATASNGGT 1504
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
859-2081 5.80e-31

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 134.12  E-value: 5.80e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  859 NADPTTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 938
Cdd:COG3210   129 TGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGV 208
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  939 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 1018
Cdd:COG3210   209 LANAGGGTAGGVASANSTLTGGVVAAGTGAGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTA 288
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1019 ISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSF 1098
Cdd:COG3210   289 GASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGN 368
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1099 STASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPT 1178
Cdd:COG3210   369 GGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGG 448
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1179 TSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVC 1258
Cdd:COG3210   449 LTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNAT 528
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1259 FGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTS 1338
Cdd:COG3210   529 SGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSA 608
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1339 FGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFG 1418
Cdd:COG3210   609 GATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTG 688
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1419 GALNTNATF-----GGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTN-FGGALNNSAGFGGAMNTSASFGGALN 1492
Cdd:COG3210   689 TTLNAATGGtlnnaGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGATLtLNAGVTITSGNAGTLSIGLTANTTAS 768
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1493 NSAGFGGAISTNATFGGALNNS-AGFGGAISTNATFGGALNNS---AGFGGAISTSASFGGTLNNSASFGGAINTSASFG 1568
Cdd:COG3210   769 GTTLTLANANGNTSAGATLDNAgAEISIDITADGTITAAGTTAinvTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDT 848
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1569 GVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFG 1648
Cdd:COG3210   849 TTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTG 928
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1649 GAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGG 1728
Cdd:COG3210   929 GNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTAST 1008
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1729 SNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTS 1808
Cdd:COG3210  1009 TGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGA 1088
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1809 TDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTG 1888
Cdd:COG3210  1089 GTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTA 1168
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1889 AGFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGF 1968
Cdd:COG3210  1169 VAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAG 1248
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1969 GGGPSTGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTS 2048
Cdd:COG3210  1249 SASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAV 1328
                        1210      1220      1230
                  ....*....|....*....|....*....|...
gi 595582345 2049 TGFGGGLNTSAGFSGGPPSTGTGFGGGASSHGG 2081
Cdd:COG3210  1329 AAVNSGGVNAGGGTINTTAANTGLNGGNGATDS 1361
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
863-2077 1.96e-30

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 132.20  E-value: 1.96e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  863 TTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 942
Cdd:COG3210   477 GNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTT 556
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  943 SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 1022
Cdd:COG3210   557 AASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTG 636
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1023 SGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSFSTAS 1102
Cdd:COG3210   637 SAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVT 716
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1103 ISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASI--SFGGAPSINSSSGGSSVSFGGAPTTS 1180
Cdd:COG3210   717 GQIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASgtTLTLANANGNTSAGATLDNAGAEISI 796
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1181 TSFSGGPCISFGgapcttASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFG 1260
Cdd:COG3210   797 DITADGTITAAG------TTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSL 870
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1261 SSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFG 1340
Cdd:COG3210   871 AATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQG 950
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1341 GAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGA 1420
Cdd:COG3210   951 NAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTAS 1030
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1421 LNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGA 1500
Cdd:COG3210  1031 ATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTT 1110
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1501 ISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGA 1580
Cdd:COG3210  1111 TSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAAT 1190
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1581 INTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGA 1660
Cdd:COG3210  1191 EGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGAT 1270
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1661 FSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATS 1740
Cdd:COG3210  1271 STVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANT 1350
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1741 ANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLNHNAD 1820
Cdd:COG3210  1351 GLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSA 1430
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1821 FNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNG 1900
Cdd:COG3210  1431 TTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGG 1510
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1901 FGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGPSTGPGFGG 1980
Cdd:COG3210  1511 TAGGTTAEVAKASLEGGEGTYGGSSVAEAGTGGGILGAVSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQAPT 1590
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1981 PSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFGGGLNTSAG 2060
Cdd:COG3210  1591 AGNTATLTLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSGAVNGAGNGWAV 1670
                        1210
                  ....*....|....*..
gi 595582345 2061 FSGGPPSTGTGFGGGAS 2077
Cdd:COG3210  1671 DLTDATLAGLGGATTAA 1687
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
889-2078 1.92e-29

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 129.12  E-value: 1.92e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  889 ITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 968
Cdd:COG3210     1 GSGGLAGTTGNKTIGVDIAVTTTAATLGSNTAGTSGLNILGSGGVGTAGGIASNAGTTASTSGGSGTAGGVGNTSASTGG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  969 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAP 1048
Cdd:COG3210    81 IGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGN 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1049 SISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPST 1128
Cdd:COG3210   161 NTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAGV 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1129 STSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSg 1208
Cdd:COG3210   241 ISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTAAG- 319
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1209 fgstlcSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTG 1288
Cdd:COG3210   320 ------ITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGN 393
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1289 FGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFS 1368
Cdd:COG3210   394 ASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVT 473
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1369 GVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFG 1448
Cdd:COG3210   474 NSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGAS 553
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1449 GALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFG 1528
Cdd:COG3210   554 GTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAG 633
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1529 GALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFG-----GAINTSANFGGALTNSAGFGGAIST 1603
Cdd:COG3210   634 LTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATggttgTTLNAATGGTLNNAGNTLTISTGSI 713
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1604 SASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNS 1683
Cdd:COG3210   714 TVTGQIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNAGAE 793
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1684 ISFGSAPTTSVSFGGSHSTNLcFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGN 1763
Cdd:COG3210   794 ISIDITADGTITAAGTTAINV-TGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAA 872
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1764 GLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGG 1843
Cdd:COG3210   873 TAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNA 952
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1844 ELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGS 1923
Cdd:COG3210   953 GLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASAT 1032
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1924 NTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGPSTGPGFGGPSTGPGFGGPSTGGGFGGPNTGG 2003
Cdd:COG3210  1033 GTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTS 1112
                        1130      1140      1150      1160      1170      1180      1190
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 595582345 2004 GFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFGGGLNTSAGFSGGPPSTGTGFGGGASS 2078
Cdd:COG3210  1113 TGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADS 1187
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
859-1958 2.76e-29

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 128.73  E-value: 2.76e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  859 NADPTTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 938
Cdd:COG3210   513 GLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGT 592
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  939 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 1018
Cdd:COG3210   593 GTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGG 672
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1019 ISNPSGGFGGRNSITFGSVPNTSAN---------FSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFG 1089
Cdd:COG3210   673 GTTGTVTSGATGGTTGTTLNAATGGtlnnagntlTISTGSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGVTITSG 752
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1090 GAPSTSTSFSTASISFGGApsTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGS 1169
Cdd:COG3210   753 NAGTLSIGLTANTTASGTT--LTLANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGL 830
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1170 SVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGF 1249
Cdd:COG3210   831 TGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNG 910
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1250 GGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGF 1329
Cdd:COG3210   911 AVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGV 990
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1330 GGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGG 1409
Cdd:COG3210   991 IAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTT 1070
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1410 VLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGamNTSASFGG 1489
Cdd:COG3210  1071 GGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAA--GAGTLTGL 1148
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1490 ALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGG 1569
Cdd:COG3210  1149 VAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTG 1228
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1570 VLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGG 1649
Cdd:COG3210  1229 NTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAG 1308
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1650 AISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGS 1729
Cdd:COG3210  1309 ANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAG 1388
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1730 NSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTST 1809
Cdd:COG3210  1389 NNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGA 1468
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1810 DFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGA 1889
Cdd:COG3210  1469 GGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTTAEVAKASLEGGEGTYGGSSVAEAGTGGGILGA 1548
                        1050      1060      1070      1080      1090      1100
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 595582345 1890 GFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAG 1958
Cdd:COG3210  1549 VSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQAPTAGNTATLTLSLAEGTNAEYGGTTNVTS 1617
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
957-2081 2.72e-27

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 122.18  E-value: 2.72e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  957 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGS 1036
Cdd:COG3210     1 GSGGLAGTTGNKTIGVDIAVTTTAATLGSNTAGTSGLNILGSGGVGTAGGIASNAGTTASTSGGSGTAGGVGNTSASTGG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1037 VPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSApfcNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLS 1116
Cdd:COG3210    81 IGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAAS---ATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSG 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1117 TASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPC 1196
Cdd:COG3210   158 AGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTG 237
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1197 TTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTS 1276
Cdd:COG3210   238 AGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTA 317
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1277 ISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSS------GANAGFGGTLNSSTSFGGAISTSTGFG 1350
Cdd:COG3210   318 AGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTgtgnggGLTTAGAGTVASTVGTATASTGNASST 397
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1351 SALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGV 1430
Cdd:COG3210   398 TVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSAG 477
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1431 LNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGA 1510
Cdd:COG3210   478 NTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTA 557
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1511 LNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGA 1590
Cdd:COG3210   558 ASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGS 637
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1591 LTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSpDFGGAFSTSVGFGGT 1670
Cdd:COG3210   638 AVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNN-AGNTLTISTGSITVT 716
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1671 LNTTDFGSTHSNSISFGSAPT-TSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSI 1749
Cdd:COG3210   717 GQIGALANANGDTVTFGNLGTgATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNAGAEISI 796
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1750 SFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSA 1829
Cdd:COG3210   797 DITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAAS 876
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1830 GFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGPNASF 1909
Cdd:COG3210   877 ITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSA 956
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1910 NRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGGGPSTGPGFGGPSTGPGFGG 1989
Cdd:COG3210   957 ASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGT 1036
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1990 PSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFGGGLNTSAGFSGGPPSTG 2069
Cdd:COG3210  1037 AATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGV 1116
                        1130
                  ....*....|..
gi 595582345 2070 TGFGGGASSHGG 2081
Cdd:COG3210  1117 TASKVGGTTTVG 1128
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
859-1958 4.47e-27

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 121.41  E-value: 4.47e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  859 NADPTTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 938
Cdd:COG3210   558 ASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGS 637
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  939 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 1018
Cdd:COG3210   638 AVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTG 717
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1019 ISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSG--TPSTSAPFCNTASISFGGAPSTST 1096
Cdd:COG3210   718 QIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTltLANANGNTSAGATLDNAGAEISID 797
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1097 SFSTASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGA 1176
Cdd:COG3210   798 ITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASI 877
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1177 PTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTS 1256
Cdd:COG3210   878 TVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAA 957
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1257 VCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSS 1336
Cdd:COG3210   958 SASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTA 1037
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1337 TSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAG 1416
Cdd:COG3210  1038 ATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVT 1117
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1417 FGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAG 1496
Cdd:COG3210  1118 ASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTD 1197
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1497 FGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAG 1576
Cdd:COG3210  1198 LKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNA 1277
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1577 FGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPD 1656
Cdd:COG3210  1278 GATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGNG 1357
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1657 FGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFS 1736
Cdd:COG3210  1358 ATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGGT 1437
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1737 GATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLN 1816
Cdd:COG3210  1438 GNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTTA 1517
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1817 HNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLN 1896
Cdd:COG3210  1518 EVAKASLEGGEGTYGGSSVAEAGTGGGILGAVSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQAPTAGNTATL 1597
                        1050      1060      1070      1080      1090      1100
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 595582345 1897 NGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAG 1958
Cdd:COG3210  1598 TLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSG 1659
MAGE pfam01454
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
596-756 2.13e-24

MAGE homology domain; Members of the MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein Nse3, which is part of the Smc5-6 complex and has been demonstrated to be important for meiosis.


Pssm-ID: 426270  Cd Length: 205  Bit Score: 103.12  E-value: 2.13e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   596 LVKYLLVKDQTKIPIKRSDMLKDVIQEYE-DYFPEIIERASYALEKMFRVNLKEID--------------------KQNN 654
Cdd:pfam01454    1 LVRYALACEYQRTPIRREDISKKVLGENRkRLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   655 LYILIST---QESSAGIMGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLH---PGVKHSLFG 719
Cdd:pfam01454   81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 595582345   720 EVKKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 756
Cdd:pfam01454  161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1434-1791 3.72e-20

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 98.15  E-value: 3.72e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1434 SAGFGGAMNtnATFGGALNSNAGFGGaistSTNFGGALNNSAGFGGAMNTSASFGgalnNSAGFGGAISTNATFGGALNN 1513
Cdd:NF033849  220 SISFGVSLP--MMYAANLGQSAGTGY----GESVGHSTSQGQSHSVGTSESHSVG----TSQSQSHTTGHGSTRGWSHTQ 289
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1514 SAGFGGAISTNATFGGALNNSAGF--GGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGAL 1591
Cdd:NF033849  290 STSESESTGQSSSVGTSESQSHGTteGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGH 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1592 TNSAGFGGAISTSASFGGALNnsAGFGGAIstsasfGGALNNSAGFGGAISTNASFGGAISNSpDFGGAFSTSVGFG--- 1668
Cdd:NF033849  370 STSSSVSSSESSSRSSSSGVS--GGFSGGI------AGGGVTSEGLGASQGGSEGWGSGDSVQ-SVSQSYGSSSSTGtss 440
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1669 GTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSlcfGSASNTnlcFGGSNSTncfsgatsanfNEGHS 1748
Cdd:NF033849  441 GHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTS---QSETDS---VGDSTGT-----------SESVS 503
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|...
gi 595582345 1749 ISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASF 1791
Cdd:NF033849  504 QGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLGKSY 546
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1494-1832 4.41e-20

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 98.15  E-value: 4.41e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1494 SAGFGgaISTNATFGGALNNSAG------FGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASF 1567
Cdd:NF033849  220 SISFG--VSLPMMYAANLGQSAGtgygesVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1568 GgvLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASF 1647
Cdd:NF033849  298 G--QSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSV 375
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1648 GGAISNSPDFggAFSTSVGFGGTLNttdfgsthsnsisfgSAPTTSVSFGGSHSTNLCFGGApstslcfGSASNTNLCFG 1727
Cdd:NF033849  376 SSSESSSRSS--SSGVSGGFSGGIA---------------GGGVTSEGLGASQGGSEGWGSG-------DSVQSVSQSYG 431
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1728 GSNSTNCFSGATSanfNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTS----------TGFGGSLGPSASFNGGLGT 1797
Cdd:NF033849  432 SSSSTGTSSGHSD---SSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSeswstsqsetDSVGDSTGTSESVSQGDGR 508
                         330       340       350
                  ....*....|....*....|....*....|....*
gi 595582345 1798 STGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFN 1832
Cdd:NF033849  509 STGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1454-1821 9.74e-18

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 90.45  E-value: 9.74e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1454 NAGFGgaISTSTNFGGALNNSAGFGGAMNTSASFggalnnSAGFGGAISTNATFGGAlnNSAGFGGAISTNATFGGALNN 1533
Cdd:NF033849  220 SISFG--VSLPMMYAANLGQSAGTGYGESVGHST------SQGQSHSVGTSESHSVG--TSQSQSHTTGHGSTRGWSHTQ 289
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1534 SAGFGGAISTSASFGG--TLNNSASFGGAINTSASFggvlnnSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGAL 1611
Cdd:NF033849  290 STSESESTGQSSSVGTseSQSHGTTEGTSTTDSSSH------SQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSEST 363
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1612 NNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSpdFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSapT 1691
Cdd:NF033849  364 GTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEG--LGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT--S 439
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1692 TSVSFGGSHSTNLcfGGAPSTSLCFGSASNTnlcfgGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGF 1771
Cdd:NF033849  440 SGHSDSSSHSTSS--GQADSVSQGTSWSEGT-----GTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGR 512
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|
gi 595582345 1772 GSSLGTStgfggslgpsasfnggLGTSTGFGGGLGTSTDFSGGLNHNADF 1821
Cdd:NF033849  513 SESQGTS----------------LGTSGGRTSGAGGSMGLGPSISLGKSY 546
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1592-1958 7.29e-17

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 87.37  E-value: 7.29e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1592 TNSAGFGgaISTSASFGGALNNSAG--FGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGG 1669
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQSAGtgYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESE 295
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1670 TLNTTD-FGSTHSNSISFGSAPTTSVSFGGSHSTnlcfggapSTSLCFGSASNTNLCFGGSNStncfsgaTSANFNEGHS 1748
Cdd:NF033849  296 STGQSSsVGTSESQSHGTTEGTSTTDSSSHSQSS--------SYNVSSGTGVSSSHSDGTSQS-------TSISHSESSS 360
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1749 ISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGglnhnadFNGGLGNS 1828
Cdd:NF033849  361 ESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQS-------VSQSYGSS 433
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1829 AGFngglntntdfggelGTSAGFGDGLGSSTSFG--AGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGPN 1906
Cdd:NF033849  434 SST--------------GTSSGHSDSSSHSTSSGqaDSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTS 499
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|..
gi 595582345 1907 ASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSngpssiVGFsgGPSTGAG 1958
Cdd:NF033849  500 ESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGS------MGL--GPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1574-1873 1.01e-16

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 86.98  E-value: 1.01e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1574 SAGFGgaINTSANFGGALTNSAG------FGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASF 1647
Cdd:NF033849  220 SISFG--VSLPMMYAANLGQSAGtgygesVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1648 GGAISNspdfGGAFSTSVGFGGTLNTTDfGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNlcfG 1727
Cdd:NF033849  298 GQSSSV----GTSESQSHGTTEGTSTTD-SSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVG---H 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1728 GSNSTNCFSGATSANFNEGHSISFGNGLS----TSAGFGNGLGTSAGFGSSLG---TSTGFGGSLGPSASF----NGGLG 1796
Cdd:NF033849  370 STSSSVSSSESSSRSSSSGVSGGFSGGIAgggvTSEGLGASQGGSEGWGSGDSvqsVSQSYGSSSSTGTSSghsdSSSHS 449
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 595582345 1797 TSTGFGGGLGTSTDFSGGLNHNAdfNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAG 1873
Cdd:NF033849  450 TSSGQADSVSQGTSWSEGTGTSQ--GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSG 524
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1222-1547 1.41e-15

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 83.13  E-value: 1.41e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1222 ALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVcfgsspysgagfggTLSTSISFGGSPSTNTGFGGTLSTSVSFGA 1301
Cdd:NF033849  252 SQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSH--------------TQSTSESESTGQSSSVGTSESQSHGTTEGT 317
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1302 SSSTSSDFGGTLSTSVSFGgssganAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGgai 1381
Cdd:NF033849  318 STTDSSSHSQSSSYNVSSG------TGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG--- 388
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1382 nTSAGFGSTLNSSASFGSALSTSASfggvlnGSAGFGGAlntnatfGGVLNGSAGFGGAMNTNATFGGALNS--NAGFGG 1459
Cdd:NF033849  389 -VSGGFSGGIAGGGVTSEGLGASQG------GSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQ 454
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1460 AISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSA---- 1535
Cdd:NF033849  455 ADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAggsm 534
                         330
                  ....*....|..
gi 595582345 1536 GFGGAISTSASF 1547
Cdd:NF033849  535 GLGPSISLGKSY 546
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
1221-1763 4.05e-14

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 78.27  E-value: 4.05e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1221 SALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFG 1300
Cdd:COG5295    64 AAATAGAGSGGTSATAASSVASGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAA 143
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1301 ASSSTSSDFGGTLSTSVSFGGSSGANAGFGGT-LNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGG 1379
Cdd:COG5295   144 STGGSSAAGGSNTATATGSSTANAATAAAGATsTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGV 223
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1380 AINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGG 1459
Cdd:COG5295   224 NAGAATGSAASAGGSASAGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGA 303
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1460 AISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGG 1539
Cdd:COG5295   304 ANATAGGGNAGSGGGGAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATA 383
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1540 AISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGG 1619
Cdd:COG5295   384 AGNAAGAAGAGSAGSGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAA 463
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1620 AISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGS 1699
Cdd:COG5295   464 NVGAATTAASAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAG 543
                         490       500       510       520       530       540
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 595582345 1700 HSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGN 1763
Cdd:COG5295   544 GGSTTAATGTNSVAVGNNTATGANSVALGAGSVASGANSVSVGAAGAENVAAGATDTDAVNGGG 607
AidA COG3468
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ...
1468-1879 4.41e-14

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 78.06  E-value: 4.41e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1468 GGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASF 1547
Cdd:COG3468     1 TASGGGGGATGLGGGGTGGGGGLGGTGGGNAGLGIGNGGGGGAASGSGAGGVAGNGGGGGGGAGGGGGGAGSGGGLAGAG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1548 GGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSagfgGAISTSASF 1627
Cdd:COG3468    81 SGGTGGNSTGGGGGNSGTGGTGGGGGGGGSGNGGGGGGGGGGGGTGGGGGGGTGSAGGGGGGGGGGT----GVGGTGAAA 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1628 GGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFG 1707
Cdd:COG3468   157 AGGGTGSGGGGSGGGGGAGGGGGGGAGGSGGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGG 236
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1708 GAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGP 1787
Cdd:COG3468   237 GVGGGGGSAGGTGGGGLTGGGAAGTGGGGGGTGTGSGGGGGGGANGGGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGG 316
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1788 SASFNGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVT 1867
Cdd:COG3468   317 GGGSNAGGGSGGGGGGGGGGGGGGTTLNGAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDGVGTGLT 396
                         410
                  ....*....|..
gi 595582345 1868 SDGFAGNLGTNT 1879
Cdd:COG3468   397 TGGTGNNGGGGV 408
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
1221-1799 1.45e-13

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 76.35  E-value: 1.45e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1221 SALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGA--GFGGTLSTSISFGGSPSTNTGFGGTLSTSVS 1298
Cdd:COG5295    19 SGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAatAGAGSGGTSATAASSVASGGASAATAASTGT 98
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1299 FGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFG 1378
Cdd:COG5295    99 GNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATATGSSTANAATAAAGATSTS 178
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1379 GAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFG 1458
Cdd:COG5295   179 ASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASAGAASGNATTASASSVSG 258
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1459 GAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFG 1538
Cdd:COG5295   259 SAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAAALGSAGGSSGVGTASGA 338
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1539 GAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFG 1618
Cdd:COG5295   339 SAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGSSTGASAGGGASAAGGAA 418
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1619 GAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGG 1698
Cdd:COG5295   419 AGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSSAAIAGATATGAGAA 498
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1699 SHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTS 1778
Cdd:COG5295   499 AGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGNNTATGANSVALGAGSVAS 578
                         570       580
                  ....*....|....*....|.
gi 595582345 1779 TGFGGSLGPSASFNGGLGTST 1799
Cdd:COG5295   579 GANSVSVGAAGAENVAAGATD 599
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
1215-1723 7.67e-13

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 74.04  E-value: 7.67e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1215 STNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLS 1294
Cdd:COG5295    88 ASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATATGSSTANA 167
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1295 TSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSS 1374
Cdd:COG5295   168 ATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASAGAASGN 247
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1375 ASFGGAINTSAGFGSTLNSS----ASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGA 1450
Cdd:COG5295   248 ATTASASSVSGSAVAAGTAStattASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAAALGSAG 327
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1451 LNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGA 1530
Cdd:COG5295   328 GSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGSSTGASA 407
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1531 LNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGA 1610
Cdd:COG5295   408 GGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSSAAI 487
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1611 LNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGS--------THSN 1682
Cdd:COG5295   488 AGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSvavgnntaTGAN 567
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|....*
gi 595582345 1683 SISFGSAPTT----SVSFGGSHSTNLCFGGAPSTSLCFGSASNTN 1723
Cdd:COG5295   568 SVALGAGSVAsganSVSVGAAGAENVAAGATDTDAVNGGGAVATG 612
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
1351-1942 1.43e-12

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 73.27  E-value: 1.43e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1351 SALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGST-----LNSSASFGSALSTSASFGGVLNGSAGFGGALNTNA 1425
Cdd:COG5295     1 SASNAGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGsaatsSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1426 TFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNA 1505
Cdd:COG5295    81 SSVASGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATAT 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1506 TFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSA 1585
Cdd:COG5295   161 GSSTANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSAS 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1586 NfgGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSV 1665
Cdd:COG5295   241 A--GAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGG 318
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1666 GFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANFNE 1745
Cdd:COG5295   319 GAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGS 398
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1746 GHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLnhnadfNGGL 1825
Cdd:COG5295   399 GGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAA------TTAA 472
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1826 GNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGP 1905
Cdd:COG5295   473 SAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATG 552
                         570       580       590
                  ....*....|....*....|....*....|....*..
gi 595582345 1906 NASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNG 1942
Cdd:COG5295   553 TNSVAVGNNTATGANSVALGAGSVASGANSVSVGAAG 589
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
1445-1952 1.66e-12

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 72.89  E-value: 1.66e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1445 ATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTN 1524
Cdd:COG4625     1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1525 ATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTS 1604
Cdd:COG4625    81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1605 ASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSI 1684
Cdd:COG4625   161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1685 SFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLCFGGSNSTNCFSGATSANfneghsISFGNGLSTSAGFGNG 1764
Cdd:COG4625   241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGG------GGGGGGGGGGGGGGGG 314
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1765 LGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGE 1844
Cdd:COG4625   315 GGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGG 394
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1845 LGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAGFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSN 1924
Cdd:COG4625   395 GAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAG 474
                         490       500
                  ....*....|....*....|....*...
gi 595582345 1925 TSNGFTGEPNTGSSFSNGPSSIVGFSGG 1952
Cdd:COG4625   475 TLTLTGNNTYTGTTTVNGGGNYTQSAGS 502
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
1106-1596 3.44e-12

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 72.12  E-value: 3.44e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1106 GGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPTTSTSFSG 1185
Cdd:COG4625    11 GGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGT 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1186 GPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYS 1265
Cdd:COG4625    91 GGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGG 170
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1266 GAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAIST 1345
Cdd:COG4625   171 GGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGG 250
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1346 STGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNA 1425
Cdd:COG4625   251 GGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 330
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1426 TFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNA 1505
Cdd:COG4625   331 GGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGG 410
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1506 TFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSA 1585
Cdd:COG4625   411 GGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTV 490
                         490
                  ....*....|.
gi 595582345 1586 NFGGALTNSAG 1596
Cdd:COG4625   491 NGGGNYTQSAG 501
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1339-1647 3.50e-12

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 71.96  E-value: 3.50e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1339 FGGAISTSTGFGSalnnSANFGGAISTSFsgvlnsSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFG 1418
Cdd:NF033849  231 YAANLGQSAGTGY----GESVGHSTSQGQ------SHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQS 300
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1419 GALNTNATFG-GVLNG-----SAGFGGAMNTNATFG----GALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFG 1488
Cdd:NF033849  301 SSVGTSESQShGTTEGtsttdSSSHSQSSSYNVSSGtgvsSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1489 GALNNSAGFGGAISTNATFGGALnnSAGFGGAISTNATFG---GALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSA 1565
Cdd:NF033849  381 SSRSSSSGVSGGFSGGIAGGGVT--SEGLGASQGGSEGWGsgdSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSV 458
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1566 SFGGVL--NNSAGFGGAINTSANFG--GALTNSAGF--GGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGG 1639
Cdd:NF033849  459 SQGTSWseGTGTSQGQSVGTSESWStsQSETDSVGDstGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGP 538

                  ....*...
gi 595582345 1640 AISTNASF 1647
Cdd:NF033849  539 SISLGKSY 546
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1356-1702 7.74e-12

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 71.19  E-value: 7.74e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1356 SANFGGAISTSFSGvlNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSA 1435
Cdd:NF033849  220 SISFGVSLPMMYAA--NLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1436 GFGGAMNTNATfggaLNSNAGFGGAISTSTNF--GGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNN 1513
Cdd:NF033849  298 GQSSSVGTSES----QSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSS 373
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1514 SAGFGGAISTNATFGgalnNSAGFGGAIS----TSASFGGTLNNSASFGgaintsaSFGGVLNNSAGFGGAINTSANFGG 1589
Cdd:NF033849  374 SVSSSESSSRSSSSG----VSGGFSGGIAgggvTSEGLGASQGGSEGWG-------SGDSVQSVSQSYGSSSSTGTSSGH 442
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1590 ALTNSAGFGgaISTSASFGGALNNSAGFGGAISTSASfggalnNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFG- 1668
Cdd:NF033849  443 SDSSSHSTS--SGQADSVSQGTSWSEGTGTSQGQSVG------TSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSe 514
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 595582345 1669 --GTLNTTDFGSTHSNSISFGSAPttSVSFGGSHST 1702
Cdd:NF033849  515 sqGTSLGTSGGRTSGAGGSMGLGP--SISLGKSYQW 548
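
Each alignment row in this listing carries a label, a start coordinate, the aligned string (with `-` for gaps), and an end coordinate. A minimal sketch of how such a row could be sanity-checked follows, assuming the row layout seen in this listing; the names `ROW_RE` and `parse_alignment_row` are hypothetical and not part of any NCBI tool. It counts non-gap characters and compares the count with `end - start + 1`.

```python
import re

# Matches rows such as
#   "gi 595582345 1669 --GTLNTT...SHST 1702"
#   "Cdd:NF033849  515 sqGTSLGT...SYQW 548"
# Assumption: label, start, aligned string, and end are whitespace-separated.
ROW_RE = re.compile(
    r"^(?P<label>gi\s+\d+|Cdd:\S+)\s+"
    r"(?P<start>\d+)\s+"
    r"(?P<seq>\S+)\s+"
    r"(?P<end>\d+)$"
)


def parse_alignment_row(line):
    """Return row fields plus a coordinate consistency flag, or None if no match."""
    m = ROW_RE.match(line.strip())
    if m is None:
        return None
    start = int(m.group("start"))
    end = int(m.group("end"))
    seq = m.group("seq")
    # Gap columns ('-') do not consume a residue position.
    residues = sum(1 for ch in seq if ch != "-")
    return {
        "label": m.group("label"),
        "start": start,
        "end": end,
        "residues": residues,
        "consistent": residues == end - start + 1,
    }


row = parse_alignment_row(
    "gi 595582345 1669 --GTLNTTDFGSTHSNSISFGSAPttSVSFGGSHST 1702"
)
print(row["residues"], row["consistent"])
```

Lowercase letters are counted as residues here, since in the rows above they still advance the coordinate.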
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1736-2064 2.07e-10

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 66.18  E-value: 2.07e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1736 SGATSANFNEGHSISFGNGLSTSAgfGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTDFSGGL 1815
Cdd:NF033849  216 QGQKSISFGVSLPMMYAANLGQSA--GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSE 293
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1816 NHNAdfngGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTS--DGFAGNLGTNTGFGGTLGTGAGFSV 1893
Cdd:NF033849  294 SEST----GQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSShsDGTSQSTSISHSESSSESTGTSVGH 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1894 SLNNGNGFGNGPNASFNRGLNTiiGFGSGSNTSnGFTGEpntGSSFSNGPSSIVGFSGG-PSTGAGFCSGPSTGGFGGGP 1972
Cdd:NF033849  370 STSSSVSSSESSSRSSSSGVSG--GFSGGIAGG-GVTSE---GLGASQGGSEGWGSGDSvQSVSQSYGSSSSTGTSSGHS 443
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1973 STGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTGFG 2052
Cdd:NF033849  444 DSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTS 523
                         330
                  ....*....|..
gi 595582345 2053 GGLNTSAGFSGG 2064
Cdd:NF033849  524 GGRTSGAGGSMG 535
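
The per-hit statistics line (Pssm-ID, Cd Length, Bit Score, E-value) has a regular layout, so it can be split into typed fields with a regular expression. The following is a sketch under the assumption that every statistics line follows the format seen in this listing; `STATS_RE` and `parse_hit_stats` are hypothetical names, not an NCBI API.

```python
import re

# Matches lines such as
#   "Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 66.18  E-value: 2.07e-10"
# Assumption: the bracketed hit type is optional; the other fields are always present.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<hit_type>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_stats(line):
    """Return the statistics of one domain hit as a dict, or None if no match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "hit_type": m.group("hit_type"),  # e.g. "Multi-domain", or None
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


stats = parse_hit_stats(
    "Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 66.18  E-value: 2.07e-10"
)
print(stats)
```

Parsed this way, hits can be sorted or filtered by E-value without re-reading the alignment blocks.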
MscS_porin pfam12795
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ...
268-445 2.01e-09

Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin domain lies towards the N-terminus of the molecule. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.


Pssm-ID: 432790 [Multi-domain]  Cd Length: 238  Bit Score: 60.01  E-value: 2.01e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   268 QTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASnrqigASNRQTE 347
Cdd:pfam12795   10 LDEAAKKKLLQDLQQALSLLDKIDASKQRAAAYQKALDDAPAELRELRQELAALQAKAEAAPKEILAS-----LSLEELE 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   348 vsSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASsrQTEASSRQTEASS 427
Cdd:pfam12795   85 --QRLLQTSAQLQELQNQLAQLNSQLIELQTRPERAQQQLSEARQRLQQIRNRLNGPAPPGEPLS--EAQRWALQAELAA 160
                          170
                   ....*....|....*...
gi 595582345   428 RQTEASSRQIEASAAAVR 445
Cdd:pfam12795  161 LKAQIDMLEQELLSNNNR 178
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
863-1511 5.69e-09

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 61.33  E-value: 5.69e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  863 TTNVLFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 942
Cdd:COG4625    28 AGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGG 107
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  943 SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 1022
Cdd:COG4625   108 GGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGGGGGGG 187
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1023 SGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSFSTAS 1102
Cdd:COG4625   188 GGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGG 267
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1103 ISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSINSSSGGSSVSFGGAPTTSTS 1182
Cdd:COG4625   268 GGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGG 347
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1183 FSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSS 1262
Cdd:COG4625   348 GGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGT 427
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1263 PYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGA 1342
Cdd:COG4625   428 GAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGGNYTQSAGSTLAVE 507
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1343 ISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALN 1422
Cdd:COG4625   508 VDAANSDRLVVTGTATLNGGTVVVLAGGYAPGTTYTILAVAAALDALAGNGDLSALYNALAALDAAAARAALDQLSGEIH 587
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1423 TNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALnnsAGFGGAIS 1502
Cdd:COG4625   588 ASAAAALLQASRALRDALSNRLRALRGAGAAGDAAAEGWGVWAQGFGSWGDQDGDGGAAGYDSSTGGLL---VGADYRLG 664

                  ....*....
gi 595582345 1503 TNATFGGAL 1511
Cdd:COG4625   665 DNWRLGVAL 673
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
917-1432 7.53e-09

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 60.95  E-value: 7.53e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  917 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGF 996
Cdd:COG4625     1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  997 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPST 1076
Cdd:COG4625    81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1077 SAPFCNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISF 1156
Cdd:COG4625   161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1157 GGAPSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTS 1236
Cdd:COG4625   241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1237 TVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTS 1316
Cdd:COG4625   321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1317 VSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGS-TLNSSA 1395
Cdd:COG4625   401 GGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAgTLTLTG 480
                         490       500       510
                  ....*....|....*....|....*....|....*..
gi 595582345 1396 SFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLN 1432
Cdd:COG4625   481 NNTYTGTTTVNGGGNYTQSAGSTLAVEVDAANSDRLV 517
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
997-1512 9.96e-09

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 60.56  E-value: 9.96e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  997 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPST 1076
Cdd:COG4625     2 GGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGG 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1077 SAPFCNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISF 1156
Cdd:COG4625    82 GGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAG 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1157 GGAPSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTS 1236
Cdd:COG4625   162 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGG 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1237 TVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSSDFGGTLSTS 1316
Cdd:COG4625   242 GGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 321
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1317 VSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSAS 1396
Cdd:COG4625   322 GGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGG 401
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1397 FGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAG 1476
Cdd:COG4625   402 GGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGN 481
                         490       500       510
                  ....*....|....*....|....*....|....*.
gi 595582345 1477 FGGAMNTSASFGGALNNSAGFGGAISTNATFGGALN 1512
Cdd:COG4625   482 NTYTGTTTVNGGGNYTQSAGSTLAVEVDAANSDRLV 517
auto_AIDA-I NF033176
autotransporter adhesin AIDA-I;
1318-1785 1.30e-08

autotransporter adhesin AIDA-I;


Pssm-ID: 380183 [Multi-domain]  Cd Length: 1287  Bit Score: 60.44  E-value: 1.30e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1318 SFGGSSGANAGFGGTLNSSTsfGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSaSF 1397
Cdd:NF033176   72 SNGQTSNATVNSGGIQNVNN--GGKTTSTTVNSSGAQNVGNSGTAISTIVNSGGVQRVSSGGVTSATSLSGGAQNIY-NL 148
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1398 GSALSTSASFGGVLNGSAGfGGALNTNATFGGVLNGSAGfGGAMNTNATFGGALNSNAGfGGAISTSTNFGGALNNSAGf 1477
Cdd:NF033176  149 GHASNTVIFNGGNQTIFSG-GISDDTNISSGGQQRVSSG-GVASNTTINSSGTQNILSG-GSTVSTHISSGGNQYISAG- 224
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1478 GGAMNTSASFGGALNNSAgfgGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGfGGAISTSASFGGTLNNSaSF 1557
Cdd:NF033176  225 GNASATVVSSGGFQRVSS---GGTATGTVLSGGTQNVSSGGSAISTSVYSSGVQTVYAG-ATVTDTTVNSGGKQNIS-SG 299
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1558 GGAINTSASFGGVLNNsagFGGAINTSANFGGALTNSAGfGGAISTSASFGGALNNSAGfGGAISTSASFGGALNNSAGf 1637
Cdd:NF033176  300 GIVSGTIVNSSGTQNI---YSGGSALSANIKGSQIVNSD-GTAINTLVNDGGYQHIRNG-GVASGTIINQSGRVNISSG- 373
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1638 GGAISTNASFGGAISNSPDfGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSapTTSVSFGGSHSTNLCFGGAPSTSLCFG 1717
Cdd:NF033176  374 GYAESTIINSGGTQSVLSG-GYASGTLINNSGRENVSNGGSAYNTIINAGG--NQYIYSNGEASGTTVNTSGFQRVNSGG 450
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 595582345 1718 SASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSL 1785
Cdd:NF033176  451 TATGTKLSGGNQNVSSGGKAIAAEVYSGGKQTVYAGGEASGTQIFDGGVVNVSGGSVSGASVNLNGRL 518
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
1559-2060 1.77e-08

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 59.79  E-value: 1.77e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1559 GAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFG 1638
Cdd:COG4625     1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1639 GAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGS 1718
Cdd:COG4625    81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1719 ASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTS 1798
Cdd:COG4625   161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1799 TGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTN 1878
Cdd:COG4625   241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1879 TGFGGTLGTGAGFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAG 1958
Cdd:COG4625   321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1959 FCSGPSTGGFGGGPSTGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTA 2038
Cdd:COG4625   401 GGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTG 480
                         490       500
                  ....*....|....*....|..
gi 595582345 2039 AGFGSGLSTSTGfGGGLNTSAG 2060
Cdd:COG4625   481 NNTYTGTTTVNG-GGNYTQSAG 501
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
999-1646 2.68e-08

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 59.40  E-value: 2.68e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  999 ISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSA 1078
Cdd:COG5295     1 SASNAGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1079 PFCNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTAsisfGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGG 1158
Cdd:COG5295    81 SSVASGGASAATAASTGTGNTAGTAATVAGAASSGSATNA----GASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNT 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1159 APSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTV 1238
Cdd:COG5295   157 ATATGSSTANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAG 236
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1239 FSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTSsdfGGTLSTSVS 1318
Cdd:COG5295   237 GSASAGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAA---NATAGGGNA 313
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1319 FGGSSGANAG--FGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSAS 1396
Cdd:COG5295   314 GSGGGGAAALgsAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGA 393
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1397 FGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAG 1476
Cdd:COG5295   394 GSAGSGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAAS 473
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1477 FGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSAS 1556
Cdd:COG5295   474 AAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGT 553
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1557 FGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAG 1636
Cdd:COG5295   554 NSVAVGNNTATGANSVALGAGSVASGANSVSVGAAGAENVAAGATDTDAVNGGGAVATGDNSVAVGNNAQASGANSVALG 633
                         650
                  ....*....|
gi 595582345 1637 FGGAISTNAS 1646
Cdd:COG5295   634 AGATATANNS 643
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
1566-2071 3.73e-08

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 59.02  E-value: 3.73e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1566 SFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNA 1645
Cdd:COG4625     1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1646 SFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNLC 1725
Cdd:COG4625    81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1726 FGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGL 1805
Cdd:COG4625   161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1806 GTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTL 1885
Cdd:COG4625   241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1886 GTGAGFSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGfcsgpST 1965
Cdd:COG4625   321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGG-----GG 395
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1966 GGFGGGPSTGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGL 2045
Cdd:COG4625   396 AGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGT 475
                         490       500
                  ....*....|....*....|....*.
gi 595582345 2046 STSTGFGGGLNTSAGFSGGPPSTGTG 2071
Cdd:COG4625   476 LTLTGNNTYTGTTTVNGGGNYTQSAG 501
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
262-468 5.43e-08

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 57.22  E-value: 5.43e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  262 IGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGA 341
Cdd:COG4372    26 IAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELES 105
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  342 SNRQTEVSSRQIEASNRQIGASNRQTEASNRQIgasnrqteaSNRQIGASNRQTDASNRQTDASNRQTEASSRQTEASSR 421
Cdd:COG4372   106 LQEEAEELQEELEELQKERQDLEQQRKQLEAQI---------AELQSEIAEREEELKELEEQLESLQEELAALEQELQAL 176
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 595582345  422 QTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSEA 468
Cdd:COG4372   177 SEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEA 223
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
268-522 9.60e-08

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 56.76  E-value: 9.60e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  268 QTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTE 347
Cdd:COG3883    17 QIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARALY 96
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  348 VSSRQI-------EASN-----RQIGASNRQTEASNRQIGA-SNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSR 414
Cdd:COG3883    97 RSGGSVsyldvllGSESfsdflDRLSALSKIADADADLLEElKADKAELEAKKAELEAKLAELEALKAELEAAKAELEAQ 176
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  415 QTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSEAPPAIQTVTNHALSVTVRIRRGSRARK 494
Cdd:COG3883   177 QAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGAAGAAG 256
                         250       260
                  ....*....|....*....|....*...
gi 595582345  495 AANKNRATESQAQIAEQGAQASEASISA 522
Cdd:COG3883   257 AAAGSAGAAGAAAGAAGAGAAAASAAGG 284
growth_prot_Scy NF041483
polarized growth protein Scy;
97-524 1.17e-07

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 57.53  E-value: 1.17e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   97 SQASATTEAPNIQASVTSQTQKAKTMRVTPKVSLTGSEDATtqlkpplQALNLPVTTPTIQTPVANESANSLAS--TAVN 174
Cdd:NF041483  293 AKQLASAESANEQRTRTAKEEIARLVGEATKEAEALKAEAE-------QALADARAEAEKLVAEAAEKARTVAAedTAAQ 365
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  175 KSKKASTANNAANKTVPSAAEISLAsAATHTVTTQGQAAKETGSIQTIAATA------RSKKNSKGKRtpAKTTNTDNEY 248
Cdd:NF041483  366 LAKAARTAEEVLTKASEDAKATTRA-AAEEAERIRREAEAEADRLRGEAADQaeqlkgAAKDDTKEYR--AKTVELQEEA 442
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  249 V----EA----SNAIEASSRQIGASGRqtEASnRQIEASSRQTEA-------------SNRQTEASSRQTEASSRQT--- 304
Cdd:NF041483  443 RrlrgEAeqlrAEAVAEGERIRGEARR--EAV-QQIEEAARTAEElltkakadadelrSTATAESERVRTEAIERATtlr 519
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  305 ----ETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIgasnrqTEVSSRQIEAsnrqigasnRQTEASNRqigASNRQ 380
Cdd:NF041483  520 rqaeETLERTRAEAERLRAEAEEQAEEVRAAAERAAREL------REETERAIAA---------RQAEAAEE---LTRLH 581
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  381 TEASNRQIGASNRQTDASN------RQT-DASNRQ-TEASSR--------QTEASSRQTEASSrqtEASSRQIEASAAAV 444
Cdd:NF041483  582 TEAEERLTAAEEALADARAeaerirREAaEETERLrTEAAERirtlqaqaEQEAERLRTEAAA---DASAARAEGENVAV 658
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  445 RPKkprgkkgnnkgSNSASEPSeappaiqtvtnhalsvtvriRRGSRARKAANKNRAtESQAQIAEQGAQASEASISALE 524
Cdd:NF041483  659 RLR-----------SEAAAEAE--------------------RLKSEAQESADRVRA-EAAAAAERVGTEAAEALAAAQE 706
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
255-525 1.52e-07

Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 56.06  E-value: 1.52e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  255 IEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEA 334
Cdd:COG4372    40 LDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEE 119
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  335 SNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIgaSNRQTEASNRQIGASNRQTDASNRQTDASNRQTEassR 414
Cdd:COG4372   120 LQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQL--ESLQEELAALEQELQALSEAEAEQALDELLKEAN---R 194
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  415 QTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSEAPPAIQTVT-NHALSVTVRIRRGSRAR 493
Cdd:COG4372   195 NAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVIlKEIEELELAILVEKDTE 274
                         250       260       270
                  ....*....|....*....|....*....|..
gi 595582345  494 KAANKNRATESQAQIAEQGAQASEASISALET 525
Cdd:COG4372   275 EEELEIAALELEALEEAALELKLLALLLNLAA 306
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
264-442 2.40e-07

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 55.16  E-value: 2.40e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  264 ASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASN 343
Cdd:COG4942    17 AQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELR 96
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  344 RQTEvssRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDA-SNRQTEASSRQTEASSRQ 422
Cdd:COG4942    97 AELE---AQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEElRADLAELAALRAELEAER 173
                         170       180
                  ....*....|....*....|
gi 595582345  423 TEASSRQTEASSRQIEASAA 442
Cdd:COG4942   174 AELEALLAELEEERAALEAL 193
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
251-459 2.89e-07

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 55.16  E-value: 2.89e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  251 ASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNR 330
Cdd:COG4942    18 QADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRA 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  331 QIEASNRQIGASNRQTEVSSRQ-------------------------IEASNRQIGASNRQTEASNRQIGASNRQTEASN 385
Cdd:COG4942    98 ELEAQKEELAELLRALYRLGRQpplalllspedfldavrrlqylkylAPARREQAEELRADLAELAALRAELEAERAELE 177
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 595582345  386 RQIgASNRQTDASNRQTDASNRQTEASSRQTEASSRQT----EASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGS 459
Cdd:COG4942   178 ALL-AELEEERAALEALKAERQKLLARLEKELAELAAElaelQQEAEELEALIARLEAEAAAAAERTPAAGFAALKGK 254
Tar COG0840
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
250-442 3.71e-07

Pssm-ID: 440602 [Multi-domain]  Cd Length: 533  Bit Score: 55.03  E-value: 3.71e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  250 EASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIgasnRQIMASNRQIG--- 326
Cdd:COG0840   292 ETAAAMEELSATVQEVAENAQQAAELAEEASELAEEGGEVVEEAVEGIEEIRESVEETAETI----EELGESSQEIGeiv 367
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  327 -------------ASNRQIEAS------------------------------NRQIGASNRQTEVSSRQIEASNRQIGAS 363
Cdd:COG0840   368 dviddiaeqtnllALNAAIEAArageagrgfavvadevrklaersaeatkeiEELIEEIQSETEEAVEAMEEGSEEVEEG 447
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  364 NRQTEASN---RQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSRQTEASSRQTEASSRQTEASSRQIEAS 440
Cdd:COG0840   448 VELVEEAGealEEIVEAVEEVSDLIQEIAAASEEQSAGTEEVNQAIEQIAAAAQENAASVEEVAAAAEELAELAEELQEL 527

                  ..
gi 595582345  441 AA 442
Cdd:COG0840   528 VS 529
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
254-524 6.90e-07

Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 53.75  E-value: 6.90e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  254 AIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIE 333
Cdd:COG4372    25 LIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELE 104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  334 ASNRQigASNRQTEVSsrQIEASNRQIGASNRQTEASNRQIGAS--NRQTEASNRQIGASNRQTDASNRQTDASNRQTEA 411
Cdd:COG4372   105 SLQEE--AEELQEELE--ELQKERQDLEQQRKQLEAQIAELQSEiaEREEELKELEEQLESLQEELAALEQELQALSEAE 180
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  412 SSRQTEASSRQTEASSRQTEASSRQIEA-------SAAAVRPKKPRGKKGNNKGSNSASEPSEAPPAIQTVTNHALSVTV 484
Cdd:COG4372   181 AEQALDELLKEANRNAEKEEELAEAEKLieslpreLAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEI 260
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 595582345  485 RIRRGSRARKAANKNRATESQAQIAEQGAQASEASISALE 524
Cdd:COG4372   261 EELELAILVEKDTEEEELEIAALELEALEEAALELKLLAL 300
PTZ00121 PTZ00121
MAEBL; Provisional
250-524 7.08e-07

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 54.76  E-value: 7.08e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  250 EASNAIEASSRqiGASGRQTEASNRQIEAssRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASN 329
Cdd:PTZ00121 1197 EDARKAEAARK--AEEERKAEEARKAEDA--KKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAI 1272
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  330 RQIE---------ASNRQIGASNRQTEVSSRQIEASN-----RQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQT 395
Cdd:PTZ00121 1273 KAEEarkadelkkAEEKKKADEAKKAEEKKKADEAKKkaeeaKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEA 1352
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  396 DASNRQTDASNRQTEASSRQTEASSRQTEASSRQTEASSRQIEASAAAVRPKKP----RGKKGNNKGSNSASEPSEAPPA 471
Cdd:PTZ00121 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKadelKKAAAAKKKADEAKKKAEEKKK 1432
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|...
gi 595582345  472 IQTVTNHALSVtvriRRGSRARKAANKNRATESQAQIAEQGAQASEASISALE 524
Cdd:PTZ00121 1433 ADEAKKKAEEA----KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
1492-2053 1.03e-06

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 54.01  E-value: 1.03e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1492 NNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVL 1571
Cdd:COG5295     2 ASNAGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAAS 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1572 NNSAGFGGAINTSANFGGA-LTNSAGFGGAISTSASfggalnNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGA 1650
Cdd:COG5295    82 SVASGGASAATAASTGTGNtAGTAATVAGAASSGSA------TNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSN 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1651 ISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNLCFGGAPSTSLCFGSASNTNlcfGGSN 1730
Cdd:COG5295   156 TATATGSSTANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAA---TGSA 232
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1731 STNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASFNGGLGTSTGFGGGLGTSTD 1810
Cdd:COG5295   233 ASAGGSASAGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGN 312
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1811 FSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTSFGAGLVTSDGFAGNLGTNTGFGGTLGTGAG 1890
Cdd:COG5295   313 AGSGGGGAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAG 392
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1891 FSVSLNNGNGFGNGPNASFNRGLNTIIGFGSGSNTSNGFTGEPNTGSSFSNGPSSIVGFSGGPSTGAGFCSGPSTGGFGG 1970
Cdd:COG5295   393 AGSAGSGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAA 472
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1971 GPSTGPGFGGPSTGPGFGGPSTGGGFGGPNTGGGFGGPSTGGGFGGPSTGGGFGGPSTGGGFGGPSTAAGFGSGLSTSTG 2050
Cdd:COG5295   473 SAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATG 552

                  ...
gi 595582345 2051 FGG 2053
Cdd:COG5295   553 TNS 555
YhjY COG5571
Uncharacterized conserved protein YhjY, contains autotransporter beta-barrel domain [General ...
1227-1654 1.41e-06

Uncharacterized conserved protein YhjY, contains autotransporter beta-barrel domain [General function prediction only];


Pssm-ID: 444313 [Multi-domain]  Cd Length: 648  Bit Score: 53.34  E-value: 1.41e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1227 TSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLSTSVSFGASSSTS 1306
Cdd:COG5571     5 SAAGSLGYLASASSNAATAPGLAAATASAAGAAGLGAASTASSLSGASLALLAAQALGAGLSGTNGFSGGAGSSSGTGPT 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1307 SDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAG 1386
Cdd:COG5571    85 ANGGLAGAGGVDLAGAGGGGGASGLAGGAGGAGGTAAAGGAAAAGGGAAGNAATAAAAAAAGTALQLSGLTTAGAVGGVA 164
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1387 FGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTN 1466
Cdd:COG5571   165 GTAALNGATANTGLGAAAALAAAAAAAAAAAAAAAAAAAAATAAAAAAAAAAAAAVLASPAPAAGGAAAAAAGAAAAAAS 244
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1467 FGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSAS 1546
Cdd:COG5571   245 AAANAATQANLLLLALALGSNGNAVGLNAVGLANEAAAPGAVGGDAGSTGATPSTLSSASCVASSLTAANANTLYAAADT 324
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1547 FGGTLNNSASFGGAINTSASFGGVlnnsAGFGGAINTSANFGGALTNSAGFGGAiSTSASFGGALNNSAGFGGAISTSAS 1626
Cdd:COG5571   325 AGPAGATAALAAAAAAVLASAAAV----AQAALALAAAGGQARSLAVAAGQGRG-ARGGQTRGGGGAGGTTGGGVGAGGG 399
                         410       420
                  ....*....|....*....|....*...
gi 595582345 1627 FGGALNNSAGFGGAISTNASFGGAISNS 1654
Cdd:COG5571   400 DGDGPNLTLGVDYRLSDNLLLGAALSYG 427
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
909-1546 1.60e-06

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 53.24  E-value: 1.60e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  909 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 988
Cdd:COG5295     1 SASNAGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  989 ISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSFSGGANS 1068
Cdd:COG5295    81 SSVASGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATAT 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1069 SFSGTPSTSAPFCNTASISFGGAPSTSTSFSTASISFGGAPSTSTSLSTASISFGGAPSTSTSFSTASISFGGAPSTSTS 1148
Cdd:COG5295   161 GSSTANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSAS 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1149 LSTASISFGGAPSINSSSGGSSVSFGGAPTTSTSFSGGPCISFGGAPCTTASISGGASSGFGSTLCSTNPGFSALSTNTS 1228
Cdd:COG5295   241 AGAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGA 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1229 FGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSIS-FGGSPSTNTGFGGTLSTSVSFGASSSTSS 1307
Cdd:COG5295   321 AALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGaAATSSSGGSATAAGNAAGAAGAGSAGSGG 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1308 DFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGF 1387
Cdd:COG5295   401 SSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAA 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1388 GSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNF 1467
Cdd:COG5295   481 ATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGN 560
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1468 GGALNNSAGFGGAMNTSASFGGALNNSAGFG----GAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAIST 1543
Cdd:COG5295   561 NTATGANSVALGAGSVASGANSVSVGAAGAEnvaaGATDTDAVNGGGAVATGDNSVAVGNNAQASGANSVALGAGATATA 640

                  ...
gi 595582345 1544 SAS 1546
Cdd:COG5295   641 NNS 643
PPE COG5651
PPE-repeat protein [Function unknown];
1399-1630 1.95e-06

Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 52.59  E-value: 1.95e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1399 SALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFG--GAMNTNATFGGALNSNAGFGGAISTSTNFGGAlnnsag 1476
Cdd:COG5651   159 AAAVALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFAnlGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFA------ 232
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1477 fGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNSAS 1556
Cdd:COG5651   233 -GTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLG 311
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 595582345 1557 FGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGA 1630
Cdd:COG5651   312 AGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
MscS_porin pfam12795
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ...
250-436 1.99e-06

Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin domain lies towards the N-terminus of the molecule. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.


Pssm-ID: 432790 [Multi-domain]  Cd Length: 238  Bit Score: 51.15  E-value: 1.99e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   250 EASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASN 329
Cdd:pfam12795   48 DAPAELRELRQELAALQAKAEAAPKEILASLSLEELEQRLLQTSAQLQELQNQLAQLNSQLIELQTRPERAQQQLSEARQ 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   330 RQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNR-QIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQ 408
Cdd:pfam12795  128 RLQQIRNRLNGPAPPGEPLSEAQRWALQAELAALKAQIDMLEQeLLSNNNRQDLLKARRDLLTLRIQRLEQQLQALQELL 207
                          170       180
                   ....*....|....*....|....*...
gi 595582345   409 TEasSRQTEAssRQTEASSRQTEASSRQ 436
Cdd:pfam12795  208 NE--KRLQEA--EQAVAQTEQLAEEAAG 231
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
268-524 3.37e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 52.63  E-value: 3.37e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  268 QTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEAsnrqigASNRQTE 347
Cdd:COG1196   219 KEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEE------AQAEEYE 292
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  348 VSSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSRQTEASSRQTEASS 427
Cdd:COG1196   293 LLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEA 372
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  428 RQTEASSRQIEASAAAVRPKKprgkkgnnkgsnsasepsEAPPAIQTVTNHALSVTVRIRRGSRARKAANKNRATESQAQ 507
Cdd:COG1196   373 ELAEAEEELEELAEELLEALR------------------AAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELE 434
                         250
                  ....*....|....*..
gi 595582345  508 IAEQGAQASEASISALE 524
Cdd:COG1196   435 EEEEEEEEALEEAAEEE 451
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
250-455 1.14e-05

Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 49.90  E-value: 1.14e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  250 EASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASN 329
Cdd:COG4372    56 QAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLE 135
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  330 RQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNRQ--IGASNRQTEASNRQIGASNRQTDASNRQTDASNR 407
Cdd:COG4372   136 AQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAeaEQALDELLKEANRNAEKEEELAEAEKLIESLPRE 215
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 595582345  408 QTEASSRQTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGKKGN 455
Cdd:COG4372   216 LAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEEL 263
COG4935 COG4935
Regulatory P domain of the subtilisin-like proprotein convertases and other proteases ...
1216-1664 1.31e-05

Regulatory P domain of the subtilisin-like proprotein convertases and other proteases [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443962 [Multi-domain]  Cd Length: 641  Bit Score: 50.20  E-value: 1.31e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1216 TNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLST 1295
Cdd:COG4935    96 GVVAVAGAGLAATASGAAAGAVAAAANGNTGAGPGSGGTGGGSGGAGAAAAAAALSAAGAAVGVAAVAGAAGGGGGVGVA 175
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1296 SVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSA 1375
Cdd:COG4935   176 AAVGVVLGAGLVADGGNGGGGAVAGGAAGGGGGGGGGGGLGGAAGGGGAGLAAAGGGGGGAAAAAAAGVGGLGAAATAAA 255
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1376 SFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNA 1455
Cdd:COG4935   256 ADGGGGGGAGAAGAGGSAGAAAGGAGAGVVGAAAGGGDAALGGAVGAAGTGNAAAAAAASAGSGGGGGSAAAAGAAAAAA 335
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1456 GFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSA 1535
Cdd:COG4935   336 AAAAGAAAGVSGAASVVAGASGGGAGTAAAAGGGAAAAAAGGAAAAGAAAGAAAGAAAGAAAAGGVASAAGAVGAGTAAG 415
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1536 GFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSA 1615
Cdd:COG4935   416 ASATAAVSTGAASGSSTTSSTGTTATATGLGGGADAGSTSTGTGSAAGAAGGTTTATSGLASSTTAAAAAAAAGLATTAA 495
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|..
gi 595582345 1616 GFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNS---PDFGGAFSTS 1664
Cdd:COG4935   496 VAAGAAGAAAAAATAASVGGATGAAGTTNSTATFSNTTDvaiPDNGPAGVTS 547
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1028-1427 1.66e-05

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 50.39  E-value: 1.66e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1028 GRNSITFG-SVPNT-SANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGgapststsfstasISF 1105
Cdd:NF033849  217 GQKSISFGvSLPMMyAANLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHG-------------STR 283
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1106 GGApststslstasisfggapststsfstasisfggapststslstasisfggapsiNSSSGGSSVSFGGAPTTSTSFSG 1185
Cdd:NF033849  284 GWS------------------------------------------------------HTQSTSESESTGQSSSVGTSESQ 309
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1186 GPCISFGgapcTTASISGGASSGFGSTLCSTNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGFGGTLSTSVCFGSSPYS 1265
Cdd:NF033849  310 SHGTTEG----TSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSS 385
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1266 GAGFGGTLSTSIsfGGSPSTNTGFGGTLSTSVSFGASSSTSSdFGGTLSTSVSFGGSSGA--NAGFGGTLNSSTSFGGAI 1343
Cdd:NF033849  386 SSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWGSGDSVQS-VSQSYGSSSSTGTSSGHsdSSSHSTSSGQADSVSQGT 462
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1344 STSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNT 1423
Cdd:NF033849  463 SWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISL 542

                  ....
gi 595582345 1424 NATF 1427
Cdd:NF033849  543 GKSY 546
Tar COG0840
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
162-380 3.49e-05

Pssm-ID: 440602 [Multi-domain]  Cd Length: 533  Bit Score: 48.86  E-value: 3.49e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  162 NESANSLASTAVNKSKKASTANNAANKTVPSAAEISlasAATHTVTTQGQAAKETgSIQTIAATARSKKN-SKGKRTPAK 240
Cdd:COG0840   266 ASASEELAASAEELAAGAEEQAASLEETAAAMEELS---ATVQEVAENAQQAAEL-AEEASELAEEGGEVvEEAVEGIEE 341
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  241 TTNTDNEYVEASNAIEASSRQIG-------------------AS---------GR--------------QTEASNRQIEA 278
Cdd:COG0840   342 IRESVEETAETIEELGESSQEIGeivdviddiaeqtnllalnAAieaarageaGRgfavvadevrklaeRSAEATKEIEE 421
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  279 ssrQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQImasnRQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNR 358
Cdd:COG0840   422 ---LIEEIQSETEEAVEAMEEGSEEVEEGVELVEEAGEAL----EEIVEAVEEVSDLIQEIAAASEEQSAGTEEVNQAIE 494
                         250       260
                  ....*....|....*....|..
gi 595582345  359 QIGASNRQTEASNRQIGASNRQ 380
Cdd:COG0840   495 QIAAAAQENAASVEEVAAAAEE 516
PPE COG5651
PPE-repeat protein [Function unknown];
1270-1470 4.05e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 48.35  E-value: 4.05e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1270 GGTLSTSISFGGSPSTNTGFGGTLSTS---VSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGGTLNSSTSFGGAISTS 1346
Cdd:COG5651   178 GGLLGAQNAGSGNTSSNPGFANLGLTGlnqVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAGAGASA 257
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1347 TGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNAT 1426
Cdd:COG5651   258 ALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAA 337
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 595582345 1427 FGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGA 1470
Cdd:COG5651   338 GAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGG 381
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
206-525 7.28e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 48.13  E-value: 7.28e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   206 VTTQGQAAKETGSIqtiaATARSKKNSKgkrtpakTTNTDNEYVEASNAIEASSRQIgasgrqTEASNRQIEAssrQTEA 285
Cdd:TIGR02168  648 VTLDGDLVRPGGVI----TGGSAKTNSS-------ILERRREIEELEEKIEELEEKI------AELEKALAEL---RKEL 707
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   286 SNRQTEASSRQteassRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNR 365
Cdd:TIGR02168  708 EELEEELEQLR-----KELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEA 782
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   366 QTEASNRQIGASNRQTEASNRQIGASNRQTDASNRqtdasnRQTEASSRQtEASSRQTEASSRQTEASSRQIEASAAAVr 445
Cdd:TIGR02168  783 EIEELEAQIEQLKEELKALREALDELRAELTLLNE------EAANLRERL-ESLERRIAATERRLEDLEEQIEELSEDI- 854
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   446 pkkprgkKGNNKgsnSASEPSEAPPAIQTVTNHAL----SVTVRIRRG-SRARKAANKNRATESQAQIAEQGAQASEASI 520
Cdd:TIGR02168  855 -------ESLAA---EIEELEELIEELESELEALLneraSLEEALALLrSELEELSEELRELESKRSELRRELEELREKL 924

                   ....*
gi 595582345   521 SALET 525
Cdd:TIGR02168  925 AQLEL 929
Tar COG0840
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
241-445 1.89e-04

Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];


Pssm-ID: 440602 [Multi-domain]  Cd Length: 533  Bit Score: 46.55  E-value: 1.89e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  241 TTNTDNEYVEASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMA 320
Cdd:COG0840   230 DVDSKDEIGQLADAFNRMIENLRELVGQVRESAEQVASASEELAASAEELAAGAEEQAASLEETAAAMEELSATVQEVAE 309
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  321 S------------------NRQIGASNRQIEASNRQIGASNR------------------------QT------------ 346
Cdd:COG0840   310 NaqqaaelaeeaselaeegGEVVEEAVEGIEEIRESVEETAEtieelgessqeigeivdviddiaeQTnllalnaaieaa 389
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  347 --------------EV---------SSRQIEAsnrQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQ----TDASN 399
Cdd:COG0840   390 rageagrgfavvadEVrklaersaeATKEIEE---LIEEIQSETEEAVEAMEEGSEEVEEGVELVEEAGEAleeiVEAVE 466
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 595582345  400 RQTDASNRQTEASSRQTEASS------RQTEASSRQTEASSRQIEASAAAVR 445
Cdd:COG0840   467 EVSDLIQEIAAASEEQSAGTEevnqaiEQIAAAAQENAASVEEVAAAAEELA 518
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
245-519 2.58e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 46.21  E-value: 2.58e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   245 DNEYVEASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTE--TSNRQIgasnrqimASN 322
Cdd:TIGR02169  222 EYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKdlGEEEQL--------RVK 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   323 RQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIGASNRQTEA-SNRQIGASNRQTDASNR- 400
Cdd:TIGR02169  294 EKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKlTEEYAELKEELEDLRAEl 373
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   401 -QTDASNRQT--EASSRQTEASSRQTEASSRQTEAS-----SRQIEASAAAVRPKKPRGKKGNNKgsnSASEPSEAPPAI 472
Cdd:TIGR02169  374 eEVDKEFAETrdELKDYREKLEKLKREINELKRELDrlqeeLQRLSEELADLNAAIAGIEAKINE---LEEEKEDKALEI 450
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 595582345   473 QTVTNHaLSVTVRIRRGSRARKAANKN-------RATESQAQIAEQGAQASEAS 519
Cdd:TIGR02169  451 KKQEWK-LEQLAADLSKYEQELYDLKEeydrvekELSKLQRELAEAEAQARASE 503
YjbI COG1357
Uncharacterized conserved protein YjbI, contains pentapeptide repeats [Function unknown];
1453-1627 2.65e-04

Uncharacterized conserved protein YjbI, contains pentapeptide repeats [Function unknown];


Pssm-ID: 440968 [Multi-domain]  Cd Length: 178  Bit Score: 43.78  E-value: 2.65e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1453 SNAGFGGAISTSTNFGGAlNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGAln 1532
Cdd:COG1357     8 SGADLSGADLSGADLSGA-NLSGALSGANLSGANLSGANLTGANLSGADLSGADLSGANLSGADLSGANLTGADLSGA-- 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1533 NSAGFGGAISTSASFGGtlnnsASFGGAINTSASFGGvlnnsAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALN 1612
Cdd:COG1357    85 NLANLSGANLSGANLSG-----ANLRGANLSGANLSG-----ADLSGADLSGANLSGADLSGANLSGANLSGADLSGADL 154
                         170
                  ....*....|....*
gi 595582345 1613 NSAGFGGAISTSASF 1627
Cdd:COG1357   155 SGANLSGANLSGANL 169
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
319-404 2.66e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 45.53  E-value: 2.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  319 MASNRQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDAS 398
Cdd:COG4942    16 AAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAEL 95

                  ....*.
gi 595582345  399 NRQTDA 404
Cdd:COG4942    96 RAELEA 101
COG5412 COG5412
Phage-related protein [Mobilome: prophages, transposons];
1215-1643 2.79e-04

Phage-related protein [Mobilome: prophages, transposons];


Pssm-ID: 444167 [Multi-domain]  Cd Length: 704  Bit Score: 46.23  E-value: 2.79e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1215 STNPGFSALSTNTSFGSAPTTSTVFSGAVSTTTGfGGTLSTSVCFGSSPYSGAGFGGTLSTSISFGGSPSTNTGFGGTLS 1294
Cdd:COG5412     7 SAKEAASAALLLAQAKAADSELTAASGGVVSAAA-KAQGSIAQLGKIGAAAGAEAALADSSLAFATLAAALGATVAGASL 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1295 TSVSFGASSSTSSDFGGTLstsvsfGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSS 1374
Cdd:COG5412    86 LLAAGGARAKGSAAAAAAL------GAVAAAAKVLNGALAAAGAALAATQALAAAATGAKGEANAAAKAGGAAALASAGL 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1375 ASFGGAINTSAGFGSTLNSSASFGSALSTSASFGGVLNGSAGFGGALNTNAtfggvlngsAGFGGAMNTNATFGGALNSN 1454
Cdd:COG5412   160 AAAGAAAAASALAAAGAIAKAILSASKLSGQALAGQSAAAGGALEAAAAAA---------AGAAAAGAAAAAATAASALL 230
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1455 AGFGGAISTSTNFGGALNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNATFGGALnnsagfgGAISTNATFGGALNNS 1534
Cdd:COG5412   231 ALAALQGLAAGAATGAAAGAAGAAGLGAAGAGAGQAAALLGLVAGAEASGGTAGGAV-------AGLAAGLAAAAGASAN 303
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1535 AGFGGAISTSASFGGTLNNSAsfGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNS 1614
Cdd:COG5412   304 LGAAAAASFGASLAASAGVDT--AAAALAAAEAIADGSLVAGLGSAGTVLSTLSGAVGGLEGAIGQLGAAGGLGSALGGL 381
                         410       420       430
                  ....*....|....*....|....*....|....
gi 595582345 1615 AGFGGAISTS-----ASFGGALNNSAGFGGAIST 1643
Cdd:COG5412   382 TGPIGIVIAAiaaliAAFVALWKNSETFRNLVQG 415
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
869-1089 2.79e-04

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 46.15  E-value: 2.79e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  869 NQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGfggiSNPSGGFGGisnpSGGFGGISNPSGGFGGISNPSGGFGG 948
Cdd:NF033849  310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDG----TSQSTSISH----SESSSESTGTSVGHSTSSSVSSSESS 381
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  949 ISNPSGGFggisnpSGGFGGISNPSGGFggisnpSGGFGGISNPSGGFGGisnpSGGFGGISNPSGGFGGISNPSG-GFG 1027
Cdd:NF033849  382 SRSSSSGV------SGGFSGGIAGGGVT------SEGLGASQGGSEGWGS----GDSVQSVSQSYGSSSSTGTSSGhSDS 445
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 595582345 1028 GRNSITFGsvpnTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFG 1089
Cdd:NF033849  446 SSHSTSSG----QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVS 503
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1239-1459 2.86e-04

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 45.81  E-value: 2.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  1239 FSGAVSTTTGFGGTLSTSVCFGSSPYS--GAGFGGTLSTSISFGGSPSTNTGFGGTLstsvsFGASSSTSSDFGGTLSTS 1316
Cdd:pfam15967    6 FGGGPGSTATAGGGFSFGAAAASNPGStgGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASST 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  1317 VSFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSGVLNSSASFGGAINTSAGFGSTLNSSAs 1396
Cdd:pfam15967   81 AATGPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTT- 159
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 595582345  1397 fgsALSTSASFGGVLNgsaGFGGALNTNATFGGVLNGSAGFGGAMNTNATFgGALNSNAGFGG 1459
Cdd:pfam15967  160 ---AVSTGLSLGSTLT---SLGGSLFQNTNSTGLGQTTLGLTLLATSTAPV-SAPAASEGLGG 215
COG5412 COG5412
Phage-related protein [Mobilome: prophages, transposons];
1318-1666 2.91e-04

Phage-related protein [Mobilome: prophages, transposons];


Pssm-ID: 444167 [Multi-domain]  Cd Length: 704  Bit Score: 45.84  E-value: 2.91e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1318 SFGGSSGANAGFGGTLNSSTSFGGAISTSTGFGSALNNSANFGGAISTSFSgvLNSSASFGGAINTSAGFGSTLNSSASF 1397
Cdd:COG5412    35 VVSAAAKAQGSIAQLGKIGAAAGAEAALADSSLAFATLAAALGATVAGASL--LLAAGGARAKGSAAAAAALGAVAAAAK 112
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1398 GSALSTSASFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGALNNSAGF 1477
Cdd:COG5412   113 VLNGALAAAGAALAATQALAAAATGAKGEANAAAKAGGAAALASAGLAAAGAAAAASALAAAGAIAKAILSASKLSGQAL 192
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1478 GGAMNTSASFGGALN---NSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTSASFGGTLNNS 1554
Cdd:COG5412   193 AGQSAAAGGALEAAAaaaAGAAAAGAAAAAATAASALLALAALQGLAAGAATGAAAGAAGAAGLGAAGAGAGQAAALLGL 272
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1555 AsFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNS 1634
Cdd:COG5412   273 V-AGAEASGGTAGGAVAGLAAGLAAAAGASANLGAAAAASFGASLAASAGVDTAAAALAAAEAIADGSLVAGLGSAGTVL 351
                         330       340       350
                  ....*....|....*....|....*....|..
gi 595582345 1635 AGFGGAISTNASFGGAISNSPDFGGAFSTSVG 1666
Cdd:COG5412   352 STLSGAVGGLEGAIGQLGAAGGLGSALGGLTG 383
PRK09039 PRK09039
peptidoglycan-binding protein;
259-380 3.34e-04

peptidoglycan-binding protein;


Pssm-ID: 181619 [Multi-domain]  Cd Length: 343  Bit Score: 45.34  E-value: 3.34e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  259 SRQIgaSGRQTE--ASNRQIEASSRQ---TEASNRQTEASSRQTEASSRQTETSNRQIGASN----RQIMASNRQIGASN 329
Cdd:PRK09039   45 SREI--SGKDSAldRLNSQIAELADLlslERQGNQDLQDSVANLRASLSAAEAERSRLQALLaelaGAGAAAEGRAGELA 122
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 595582345  330 RQIeASNRQIGA-SNRQTEVSSRQIEASNRQIGASNRQTEASNRQIGASNRQ 380
Cdd:PRK09039  123 QEL-DSEKQVSArALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQAK 173
PHA02515 PHA02515
hypothetical protein; Provisional
1294-1505 3.67e-04

hypothetical protein; Provisional


Pssm-ID: 107197 [Multi-domain]  Cd Length: 508  Bit Score: 45.54  E-value: 3.67e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1294 STSVSFGASSSTSSDFGGTLSTSVSFGGSSGANAGFGgtlNSSTSFGGAISTSTGFGSALNNSANFG--GAISTSFSGVL 1371
Cdd:PHA02515  175 TVAASVGAVDTVAGDLGGTWAAGVSYDFGSIAVPPIG---NTSPPGGNIVIVANSIGNVDTVAENIGdvSTVSTHLSSML 251
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1372 -------------------NSSASFGGAINTSAGFGSTLNSSASfgSALSTSASFGGVLNGSAGFGGALNTNATFGGVLN 1432
Cdd:PHA02515  252 avandidsvvsvagdleniDAVADNAANINTVAGANANVNTVAS--NILDVGTVAGNIDDVQAVAGNAANINVVADNADN 329
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 595582345 1433 GSAGFGGAMNTNATFGGALNSNAGFGGAISTSTNFGGA--LNNSAGFGGAMNTSASFGGALNNSAGFGGAISTNA 1505
Cdd:PHA02515  330 INATAANQANINAAVGNADNINAAVANQANINAVVGNAnnINAVAANEGNVNTVVDNLADVQTVAGIAADVSTVA 404
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
272-490 3.75e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 45.67  E-value: 3.75e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  272 SNRQIEASSRQ-TEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTEVSS 350
Cdd:NF033609   33 SSKEADASENSvTQSDSASNESKSNDSSSVSAAPKTDDTNVSDTKTSSNTNNGETSVAQNPAQQETTQSASTNATTEETP 112
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  351 RQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSR--QTEASSRQTEASSR 428
Cdd:NF033609  113 VTGEATTTATNQANTPATTQSSNTNAEELVNQTSNETTSNDTNTVSSVNSPQNSTNAENVSTTQdtSTEATPSNNESAPQ 192
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 595582345  429 QTEASSRQIeaSAAAVRPKKPRgkkgnNKGSNSASEPSEAPPAIQTVTNHALSVTVRIRRGS 490
Cdd:NF033609  193 STDASNKDV--VNQAVNTSAPR-----MRAFSLAAVAADAPAAGTDITNQLTNVTVGIDSGT 247
PPE COG5651
PPE-repeat protein [Function unknown];
889-1092 4.31e-04

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 44.88  E-value: 4.31e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  889 ITNPSGGFGGiSNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGG 968
Cdd:COG5651   174 ITNPGGLLGA-QNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAG 252
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  969 ISNPSGGFGGISNP--SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSS 1046
Cdd:COG5651   253 AGASAALASLAATLlnASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGA 332
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*.
gi 595582345 1047 APSISFGDTPNTSTSFSGGANSSFSGTPSTSAPFCNTASISFGGAP 1092
Cdd:COG5651   333 AAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAA 378
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
1373-1606 7.56e-04

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 44.66  E-value: 7.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  1373 SSASFGGAINTSAGFGSTLnssaSFGSALSTSA-SFGGVLNGSAGFGGALNTNATFGGVLNGSAGFGGAMNTNATFGGAL 1451
Cdd:pfam15967    2 SGFSFGGGPGSTATAGGGF----SFGAAAASNPgSTGGFSFGTLGAAPAATATTTTATLGLGGGLFGQKPATGFTFGTPA 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  1452 NSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSAsfGGALNNSAGFGGAISTNATFGGALNNSAGFGGAISTNATFGGAL 1531
Cdd:pfam15967   78 SSTAATGPTGLTLGTPAATTAASTGFSLGFNKPA--ASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTP 155
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 595582345  1532 NNSAgfggAISTSASFGGTLNnsaSFGGAINTSASFGGVLNNSAGFGGAINTSANFgGALTNSAGFGGAISTSAS 1606
Cdd:pfam15967  156 ATTT----AVSTGLSLGSTLT---SLGGSLFQNTNSTGLGQTTLGLTLLATSTAPV-SAPAASEGLGGLDFSTSS 222
PRK09039 PRK09039
peptidoglycan-binding protein;
250-360 7.84e-04

peptidoglycan-binding protein;


Pssm-ID: 181619 [Multi-domain]  Cd Length: 343  Bit Score: 43.80  E-value: 7.84e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  250 EASNAIEASSRQIGASGRQTEASNRQIEASsrQTEASNRQTEASSRQTE-ASSRQTEtsnRQIGASnrqimaSNRQIGAS 328
Cdd:PRK09039   74 QGNQDLQDSVANLRASLSAAEAERSRLQAL--LAELAGAGAAAEGRAGElAQELDSE---KQVSAR------ALAQVELL 142
                          90       100       110
                  ....*....|....*....|....*....|..
gi 595582345  329 NRQIEASNRQIGASNRQTEVSSRQIEASNRQI 360
Cdd:PRK09039  143 NQQIAALRRQLAALEAALDASEKRDRESQAKI 174
Keratin_2_head pfam16208
Keratin type II head;
877-1015 8.50e-04

Keratin type II head;


Pssm-ID: 465068 [Multi-domain]  Cd Length: 156  Bit Score: 41.95  E-value: 8.50e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   877 SFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPS-GGFGGISNPSGGFGGISNPSGGFGGISNPSGG 955
Cdd:pfam16208    1 GFSSCSAVVPSRSRRSYSSVSSSRRGGGGGGGGGGGGGGFGSRSLYNlGGSKSISISVAGGGSRPGSGFGFGGGGGGGFG 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   956 FGGISNPSGGFGGISNPSGGFGGISNPSGGFGGisnpsGGFGGisnpSGGFGGISNPSGG 1015
Cdd:pfam16208   81 GGFGGGGGGGFGGGGGFGGGFGGGGYGGGGFGG-----GGFGG----RGGFGGPPCPPGG 131
PPE COG5651
PPE-repeat protein [Function unknown];
881-1079 1.09e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 43.73  E-value: 1.09e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  881 GAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSG----GFGGiSNPSGGFGGISNPSGGFGGISNPSGGF 956
Cdd:COG5651   182 GAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGpgntGFAG-TGAAAGAAAAAAAAAAAAGAGASAALA 260
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  957 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGS 1036
Cdd:COG5651   261 SLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAAGAA 340
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|...
gi 595582345 1037 VPNTSANFSSAPSISFGDTPNTSTSFSGGANSSFSGTPSTSAP 1079
Cdd:COG5651   341 AGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGA 383
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
176-452 1.52e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 43.35  E-value: 1.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  176 SKKASTANNAANKTvpsAAEISLASAATHT-VTTQGQAAKETGSIQTIAATARSKKNskgkRTPAKTTNTDNEYVEASNA 254
Cdd:COG4372     9 GKARLSLFGLRPKT---GILIAALSEQLRKaLFELDKLQEELEQLREELEQAREELE----QLEEELEQARSELEQLEEE 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  255 IEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQTETSNRQIGASNRQIMASNRQIGASNRQIEA 334
Cdd:COG4372    82 LEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLES 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  335 SNRQIgaSNRQTEVSSRQIEASNRQIgasNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSR 414
Cdd:COG4372   162 LQEEL--AALEQELQALSEAEAEQAL---DELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALS 236
                         250       260       270
                  ....*....|....*....|....*....|....*...
gi 595582345  415 QTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGK 452
Cdd:COG4372   237 ALLDALELEEDKEELLEEVILKEIEELELAILVEKDTE 274
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
260-435 1.58e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 43.75  E-value: 1.58e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  260 RQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSRQtetsnRQIGASNRQImasnRQIGASNRQIEASNRQI 339
Cdd:COG4913   624 EELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAE-----REIAELEAEL----ERLDASSDDLAALEEQL 694
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  340 GASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNrQIGASNRQTDASNRQTDASNRQTEASSRQtEAS 419
Cdd:COG4913   695 EELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAE-DLARLELRALLEERFAAALGDAVERELRE-NLE 772
                         170
                  ....*....|....*.
gi 595582345  420 SRQTEASSRQTEASSR 435
Cdd:COG4913   773 ERIDALRARLNRAEEE 788
Tar COG0840
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
162-426 1.68e-03

Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];


Pssm-ID: 440602 [Multi-domain]  Cd Length: 533  Bit Score: 43.47  E-value: 1.68e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  162 NESANSLASTAVNKSKKASTANNAANKTvpsAAEISLASAATHTVTTqgqaaketgSIQTIAATARskknskgkrtpakt 241
Cdd:COG0840   259 RESAEQVASASEELAASAEELAAGAEEQ---AASLEETAAAMEELSA---------TVQEVAENAQ-------------- 312
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  242 tntdneyvEASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASN---RQTEASSR--------------QT------- 297
Cdd:COG0840   313 --------QAAELAEEASELAEEGGEVVEEAVEGIEEIRESVEETAetiEELGESSQeigeivdviddiaeQTnllalna 384
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  298 --EA-----------------------SSRQTETSNRQIGASNRQIMASNRQIGASNRQIEASNRQI---GASNRQTEVS 349
Cdd:COG0840   385 aiEAarageagrgfavvadevrklaerSAEATKEIEELIEEIQSETEEAVEAMEEGSEEVEEGVELVeeaGEALEEIVEA 464
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 595582345  350 SRQIEASNRQIGAsnrqteasnrqigASNRQTEASNrQIGASNRQTDASNRQTDASNRQTEASSRQTEASSRQTEAS 426
Cdd:COG0840   465 VEEVSDLIQEIAA-------------ASEEQSAGTE-EVNQAIEQIAAAAQENAASVEEVAAAAEELAELAEELQEL 527
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
310-543 2.91e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 42.12  E-value: 2.91e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  310 QIGASNRQIMASNRQIGASNRQIEASNRQIGASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIgaSNRQTEASNR--- 386
Cdd:COG3883    17 QIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEI--EERREELGERara 94
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  387 -------------------------QIGASNRQTDASNR--------QTDASNRQTEASSRQTEASSRQTEASSRQTEAS 433
Cdd:COG3883    95 lyrsggsvsyldvllgsesfsdfldRLSALSKIADADADlleelkadKAELEAKKAELEAKLAELEALKAELEAAKAELE 174
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  434 SRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSEAPPAIQTVTNHALSVTVRIRRGSRARKAANKNRATESQAQIAEQGA 513
Cdd:COG3883   175 AQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGAAGA 254
                         250       260       270
                  ....*....|....*....|....*....|
gi 595582345  514 QASEASISALETQVAAAVQALADDYLAQLS 543
Cdd:COG3883   255 AGAAAGSAGAAGAAAGAAGAGAAAASAAGG 284
PPE COG5651
PPE-repeat protein [Function unknown];
903-1109 2.99e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 42.19  E-value: 2.99e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  903 SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNP 982
Cdd:COG5651   177 PGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAGAGAS 256
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  983 SGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGRNSITFGSVPNTSANFSSAPSISFGDTPNTSTSF 1062
Cdd:COG5651   257 AALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAA 336
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 595582345 1063 SGGANSSFSGTPSTSAPFCNTASISFGGAPSTSTSFSTASISFGGAP 1109
Cdd:COG5651   337 AGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGA 383
PPE COG5651
PPE-repeat protein [Function unknown];
867-1049 3.14e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 42.19  E-value: 3.14e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  867 LFNQGATTRNSFSDGAGISFGGITNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGF 946
Cdd:COG5651   202 LTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASS 281
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  947 GGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISNPSGGFGGISN-PSGGFGGISNPSGGFGGISNPSGG 1025
Cdd:COG5651   282 AATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAaGAAAGAGAAAAAAAGGAGGGGGGA 361
                         170       180
                  ....*....|....*....|....
gi 595582345 1026 FGGRNSITFGSVPNTSANFSSAPS 1049
Cdd:COG5651   362 LGAGGGGGSAGAAAGAASGGGAAA 385
PHA00430 PHA00430
tail fiber protein
325-467 3.26e-03

tail fiber protein


Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 42.57  E-value: 3.26e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  325 IGASNR-QIEASNRQI----GASNRQTEVSSRQIEASNRQIGASNRQTEASNRQIGASNRQTEASNRQIGASNRQTDASN 399
Cdd:PHA00430  121 IGVNNDgHLDARGRRIvnlaDAVDDGDAVPLGQIKTWNQSAWNARNEANRSRNEADRARNQAERFNNESGASATNTKQWR 200
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 595582345  400 RQTDASNRQTE-----ASSRQTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNKGSNSASEPSE 467
Cdd:PHA00430  201 SEADGSNSEANrfkgyADSMTSSVEAAKGQAESSSKEANTAGDYATKAAASASAAHASEVNAANSATAAATSA 273
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1522-1713 3.30e-03

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 42.43  E-value: 3.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1522 STNATFGGALNNSAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFGGALTNSAGFGGAI 1601
Cdd:COG3469    13 GGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATST 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1602 STSASFGGALNNSAGfggaiSTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHS 1681
Cdd:COG3469    93 SATLVATSTASGANT-----GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTS 167
                         170       180       190
                  ....*....|....*....|....*....|..
gi 595582345 1682 NSISFGSAPTTSvSFGGSHSTNLCFGGAPSTS 1713
Cdd:COG3469   168 TTTTTTSASTTP-SATTTATATTASGATTPSA 198
34 PHA02584
long tail fiber, proximal subunit; Provisional
1412-1595 5.23e-03

long tail fiber, proximal subunit; Provisional


Pssm-ID: 222890 [Multi-domain]  Cd Length: 1229  Bit Score: 42.05  E-value: 5.23e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1412 NGSAGFGGALNTNATFggVLNGSAGFGGAMNTNATF------GGALNSNAGFGGAISTSTNFGGALNNSAGFGGAMNTSA 1485
Cdd:PHA02584  908 NGSLTFTKNTNLSAPL--VSSSTATFGGSVTANSTLttqntsNGTVVVVDETSIAFYSQNNTTGNIVFNIDGTVDPINVN 985
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1486 SFGGALNNSAG-FGGAISTNatfGGALNNSAGFGGAISTNATFGGALNNSagfggAISTSASFGGTLNNSASFGGAINTS 1564
Cdd:PHA02584  986 ANGTLNATGVAtNGRAVYAE---GGGIARTNNAARAITGGFTIRNDGSTT-----VFLLTAAGDQTGGFNGLKSLIINNA 1057
                         170       180       190
                  ....*....|....*....|....*....|.
gi 595582345 1565 ASFGGVLNNSAGFGGAINTSanfgGALTNSA 1595
Cdd:PHA02584 1058 NGQVTINDNYIINAGGTIMS----GGLTVNS 1084
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
20-243 5.51e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.83  E-value: 5.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345    20 PAGSLGLPFSPDVQSETT---EKDPPIASRSKKNKNKKNSIKPMDKTTPAPPPVPSANDNASNKPKVTLQALNLPMFTQI 96
Cdd:pfam05109  442 PNTTTGLPSSTHVPTNLTapaSTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNAT 521
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345    97 SQASA-TTEAPNIQASVTSQTQKAKTMrVTPKVSLTGSEDATTQLKPPLQALNLPVTTPTiqTPVANESANSLASTAVNK 175
Cdd:pfam05109  522 SPTPAvTTPTPNATSPTLGKTSPTSAV-TTPTPNATSPTPAVTTPTPNATIPTLGKTSPT--SAVTTPTPNATSPTVGET 598
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 595582345   176 SKKASTANNAANKTVPSAAEISLASAATHTVTTqGQAAKETGSIQTIAATARSKKNSKGKRTPAKTTN 243
Cdd:pfam05109  599 SPQANTTNHTLGGTSSTPVVTSPPKNATSAVTT-GQHNITSSSTSSMSLRPSSISETLSPSTSDNSTS 665
YjbI COG1357
Uncharacterized conserved protein YjbI, contains pentapeptide repeats [Function unknown];
1503-1660 5.96e-03

Uncharacterized conserved protein YjbI, contains pentapeptide repeats [Function unknown];


Pssm-ID: 440968 [Multi-domain]  Cd Length: 178  Bit Score: 39.92  E-value: 5.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1503 TNATFGGALNNSAGFGGAISTNATFGGALNNsAGFGGAISTSASFGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAIN 1582
Cdd:COG1357     3 SGADLSGADLSGADLSGADLSGANLSGALSG-ANLSGANLSGANLTGANLSGADLSGADLSGANLSGADLSGANLTGADL 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1583 TSAN---FGGALTNSAGFGGAISTSASFGGALNNSAGFGGAISTSASFGGALNNSAGFGGAISTNASFGGAISNSPDFGG 1659
Cdd:COG1357    82 SGANlanLSGANLSGANLSGANLRGANLSGANLSGADLSGADLSGANLSGADLSGANLSGANLSGADLSGADLSGANLSG 161

                  .
gi 595582345 1660 A 1660
Cdd:COG1357   162 A 162
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
1632-1860 6.99e-03

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 41.14  E-value: 6.99e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1632 NNSAGFGGAISTNASFGGAISNSPDFGGAFSTSVGFGGTLNTTDFGSTHSNSISFGSAPTTSVSFGGSHSTNlcfggaPS 1711
Cdd:cd21118   133 QGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQGAVAQPGYGTVRGNNQNSGCTN------PP 206
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1712 TSLCFGSASNTNLCFGGSNSTNCFSGATSANFNEGHSISFGNGLSTSAGFGNGLGTSAGFGSSLGTSTGFGGSLGPSASF 1791
Cdd:cd21118   207 PSGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSGGSNGGSSGNSGSGSGGSSSGGSN 286
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 595582345 1792 NGGLGTSTGFGGGLGTSTDFSGGLNHNADFNGGLGNSAGFNGGLNTNTDFGGELGTSAGFGDGLGSSTS 1860
Cdd:cd21118   287 GWGGSSSSGGSGGSGGGNKPECNNPGNDVRMAGGGGSQGSKESSGSHGSNGGNGQAEAVGGLNTLNSDA 355
34 PHA02584
long tail fiber, proximal subunit; Provisional
1390-1588 7.27e-03

long tail fiber, proximal subunit; Provisional


Pssm-ID: 222890 [Multi-domain]  Cd Length: 1229  Bit Score: 41.66  E-value: 7.27e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1390 TLNSSASFGSALSTSASFggVLNGSAGFGGALNTNATF------GGVLNGSAGFGGAMNTNATFGGALNSNAGFGGAIST 1463
Cdd:PHA02584  906 TVNGSLTFTKNTNLSAPL--VSSSTATFGGSVTANSTLttqntsNGTVVVVDETSIAFYSQNNTTGNIVFNIDGTVDPIN 983
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345 1464 STNFGGALNNSAG-FGGAMNTSasfGGALNNSAGFGGAISTNATFGGALNNSAG-----FGGAISTNATFGGALNNSAGF 1537
Cdd:PHA02584  984 VNANGTLNATGVAtNGRAVYAE---GGGIARTNNAARAITGGFTIRNDGSTTVFlltaaGDQTGGFNGLKSLIINNANGQ 1060
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 595582345 1538 -----------GGAISTsasfGGTLNNSASFGGAINTSASFGGVLNNSAGFGGAINTSANFG 1588
Cdd:PHA02584 1061 vtindnyiinaGGTIMS----GGLTVNSRIRSQGTKASYTRAPTADTVGFWSVDINDSATYN 1118
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
223-516 8.05e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 40.97  E-value: 8.05e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  223 AATARSKKNSKGKRTPAKTTNTDNEYVEASNAIEASSRQIGASGRQTEASNRQIEASSRQTEASNRQTEASSRQTEASSR 302
Cdd:COG3883    14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERAR 93
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  303 QTETSNRQIGASNRQIMASN-----RQIGASNRQIEASNRQIgasnrqTEVSSRQIEASNRQIGASNRQTEASNRQIGAS 377
Cdd:COG3883    94 ALYRSGGSVSYLDVLLGSESfsdflDRLSALSKIADADADLL------EELKADKAELEAKKAELEAKLAELEALKAELE 167
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345  378 NRQTEASNRQIGASNRQTDASNRQTDASNRQTEASSRQTEASSRQTEASSRQTEASSRQIEASAAAVRPKKPRGKKGNNK 457
Cdd:COG3883   168 AAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAA 247
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 595582345  458 GSNSASEPSEAPPAIQTVTNHALSVTVRIRRGSRARKAANKNRATESQAQIAEQGAQAS 516
Cdd:COG3883   248 GAGAAGAAGAAAGSAGAAGAAAGAAGAGAAAASAAGGGAGGAGGGGGGGGAASGGSGGG 306
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
29-297 8.74e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.10  E-value: 8.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345    29 SPDVQSETTekdppiaSRSKKNKNKKNSIKPMDKTTPAPPPVPSANDNASNKPKVTLQALNLPmftQISQASATTEAPNI 108
Cdd:pfam17823   83 STEVTAEHT-------PHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALP---SEAFSAPRAAACRA 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   109 QASVTSQTQKAKTMRVTPKVSLTGSEDATTQLKPPLQALNLP-----VTTPTIQTPVanesaNSLASTAVNKSKKASTAN 183
Cdd:pfam17823  153 NASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSApttaaSSAPATLTPA-----RGISTAATATGHPAAGTA 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 595582345   184 NAANKTVPSAAEISLASAATHTVTTQGQAAKETGSIQTIAATARSKKNSKGKRTPAKTTNTDNeyveasnaieASSRQIG 263
Cdd:pfam17823  228 LAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDT----------MARNPAA 297
                          250       260       270
                   ....*....|....*....|....*....|....
gi 595582345   264 ASGRQTEASNRQIEASSRQTEASNRQTEASSRQT 297
Cdd:pfam17823  298 PMGAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTT 331
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
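
For readers who want to work with this hit list programmatically, the summary line of each hit ("Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...") can be parsed and checked against the E-value threshold of 0.01 listed above. The following is a minimal sketch and not part of any NCBI tool: the function names `parse_summary` and `hits_within_threshold` are hypothetical, and treating the threshold as inclusive is an assumption.

```python
import re

# Matches summary lines in the layout used by the hit list above, e.g.
# "Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 43.75  E-value: 1.58e-03"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<hit_type>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "hit_type": m.group("hit_type"),
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def hits_within_threshold(lines, threshold=0.01):
    """Keep parsed hits whose E-value is at or below the threshold (inclusive by assumption)."""
    hits = [h for h in map(parse_summary, lines) if h is not None]
    return [h for h in hits if h["e_value"] <= threshold]
```

Lines that are not summary lines (alignment rows, descriptions, blank lines) are skipped, so the whole text of the page can be passed in line by line.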

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.