NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|568923858|ref|XP_006502049|]
View 

hornerin isoform X1 [Mus musculus]

Protein Classification

S-100 domain-containing protein( domain architecture ID 10082979)

S-100 domain-containing protein contains the Ca-binding EF-hand motif; similar to Homo sapiens S100 proteins that are implicated in intracellular and extracellular regulatory activities

CATH:  1.10.238.10
Gene Ontology:  GO:0005509
PubMed:  2479149|10191494
SCOP:  3001983

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
S-100 cd00213
S-100: S-100 domain, which represents the largest family within the superfamily of proteins ...
2-89 1.68e-35

S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents the interaction site of S100 proteins with their target proteins. There is experimental evidence showing that many S100 proteins have multiple binding partners with diverse mode of interaction with different targets. In addition to S100 proteins (such as S100A1,-3,-4,-6,-7,-10,-11,and -13), this group includes the ''fused'' gene family, a group of calcium binding S100-related proteins. The ''fused'' gene family includes multifunctional epidermal differentiation proteins - profilaggrin, trichohyalin, repetin, hornerin, and cornulin; functionally these proteins are associated with keratin intermediate filaments and partially crosslinked to the cell envelope. These ''fused'' gene proteins contain N-terminal sequence with two Ca-binding EF-hands motif, which may be associated with calcium signaling in epidermal cells and autoprocessing in a calcium-dependent manner. In contrast to S100 proteins, "fused" gene family proteins contain an extraordinary high number of almost perfect peptide repeats with regular array of polar and charged residues similar to many known cell envelope proteins.


:

Pssm-ID: 238131 [Multi-domain]  Cd Length: 88  Bit Score: 131.07  E-value: 1.68e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858    2 PKLLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKL 81
Cdd:cd00213     1 SELEKAIETIIDVFHKYSGKEGDKDTLSKKELKELLETELPNFLKNQKDPEAVDKIMKDLDVNKDGKVDFQEFLVLIGKL 80

                  ....*...
gi 568923858   82 TKACNKII 89
Cdd:cd00213    81 AVACHEFF 88
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2750-3051 1.22e-08

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 61.17  E-value: 1.22e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2750 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 2827
Cdd:NF033849  236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2828 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 2905
Cdd:NF033849  308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2906 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 2985
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 2986 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRGRSTSRESSCS 3051
Cdd:NF033849  466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLGTSGGRTSGAG 531
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1738-2003 4.30e-08

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 59.63  E-value: 4.30e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1738 THNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGrcgASSGQTSGCGSGQSTRYgEQGSGS 1817
Cdd:NF033849  264 SHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEG---TSTTDSSSHSQSSSYNV-SSGTGV 339
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1818 RNSSTQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGsFSGQTEGSQQHGSCCGQSSGYGQneygsGH 1897
Cdd:NF033849  340 SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG-VSGGFSGGIAGGGVTSEGLGASQ-----GG 413
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1898 SASSGQQGSHYSQSSSYGTHNSGGSPS--SSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQGQA-SGSGRYGASSG 1974
Cdd:NF033849  414 SEGWGSGDSVQSVSQSYGSSSSTGTSSghSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESwSTSQSETDSVG 493
                         250       260
                  ....*....|....*....|....*....
gi 568923858 1975 QTSGCGSGQStrygeQGSGSrnssTQSRG 2003
Cdd:NF033849  494 DSTGTSESVS-----QGDGR----STGRS 513
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1508-1817 4.83e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 4.83e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1508 QGRSSRGGQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRS 1586
Cdd:NF033849  253 QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSY 331
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1587 GRSSGLGQYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQ 1666
Cdd:NF033849  332 NVSSGTGVSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGV 402
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1667 RYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQS 1746
Cdd:NF033849  403 TSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWS 465
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568923858 1747 SSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSgrcGASSGQTSGCGSGQSTRYGEQGSGS 1817
Cdd:NF033849  466 EGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1187-1478 5.58e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.78  E-value: 5.58e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1187 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 1264
Cdd:NF033849  236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1265 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 1342
Cdd:NF033849  308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1343 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 1422
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 1423 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRG 1478
Cdd:NF033849  466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLG 521
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2402-2693 5.58e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.78  E-value: 5.58e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2402 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 2479
Cdd:NF033849  236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2480 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 2557
Cdd:NF033849  308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2558 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 2637
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 2638 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRG 2693
Cdd:NF033849  466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLG 521
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2272-2473 6.22e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.78  E-value: 6.22e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2272 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 2351
Cdd:NF033849  319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2352 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 2425
Cdd:NF033849  398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 2426 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 2473
Cdd:NF033849  478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
471-773 1.60e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 54.24  E-value: 1.60e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  471 GQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLG 549
Cdd:NF033849  260 GTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG 338
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  550 QYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSG 629
Cdd:NF033849  339 VSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  630 ESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQSSSTQWSH 709
Cdd:NF033849  410 SQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWSEGTGTSQ 472
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568923858  710 GSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSgrcGASSGQTSGCGSGQSTRYGEQGSGS 773
Cdd:NF033849  473 GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
709-910 4.36e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 53.09  E-value: 4.36e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  709 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 788
Cdd:NF033849  319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  789 SSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 862
Cdd:NF033849  398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858  863 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 910
Cdd:NF033849  478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
278-602 4.99e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 52.70  E-value: 4.99e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  278 YSSGSSEEPGFTHGSGRKNSSTCGKNGSYS-GQSTGR-HQQGFGSSHELESGQSITSANHGSHSNQSSCSGTRECGSSES 355
Cdd:NF033849  237 QSAGTGYGESVGHSTSQGQSHSVGTSESHSvGTSQSQsHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEG 316
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  356 SmkkthvsgsghSSSTGKYTSTSGQNYNSTRQGCGQGKSSGSEQygassgqsSGCSSGQSTRYGEQGSGSRNSSTQSRGR 435
Cdd:NF033849  317 T-----------STTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQ--------STSISHSESSSESTGTSVGHSTSSSVSS 377
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  436 STSRESSTSQQFGSGSGRSSGFSQGGSGQGRSSRGGQQGsfSGQTEGSQQHGSCCGQSSGYGQNeygSGHSASSGQQGSH 515
Cdd:NF033849  378 SESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEG--WGSGDSVQSVSQSYGSSSSTGTS---SGHSDSSSHSTSS 452
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  516 ySQSSSYGTHNSGGSPSSSQRGHGsrSGRSSGLGQYGSPSGQTSSSTRQGSGQGQASGsgrYGASSGQTSGCGSGQSTRY 595
Cdd:NF033849  453 -GQADSVSQGTSWSEGTGTSQGQS--VGTSESWSTSQSETDSVGDSTGTSESVSQGDG---RSTGRSESQGTSLGTSGGR 526

                  ....*..
gi 568923858  596 GEQGSGS 602
Cdd:NF033849  527 TSGAGGS 533
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1057-1258 8.70e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 51.93  E-value: 8.70e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1057 HGSGSEQSSGLGHYGSTSGQTASSTRHRSGQGQASGSGRcGASSGQTSGCGSGQSTRYDEQGSGSRNSSTQSRGRSTSRE 1136
Cdd:NF033849  319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1137 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 1210
Cdd:NF033849  398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 1211 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 1258
Cdd:NF033849  478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2086-2341 4.71e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.07  E-value: 4.71e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2086 THNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGrcgASSGQTSGCGSGQSTRYGEQGSGS 2165
Cdd:NF033849  286 SHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG---VSSSHSDGTSQSTSISHSESSSES 362
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2166 RNSST-QSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGsfsgQTSGRSQHQSGSRHGSGSGQFPISGQ 2244
Cdd:NF033849  363 TGTSVgHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQG----GSEGWGSGDSVQSVSQSYGSSSSTGT 438
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2245 QGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSG 2324
Cdd:NF033849  439 SSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ-GDGRSTGRSESQG 517
                         250
                  ....*....|....*..
gi 568923858 2325 QSTrygeQGSGSRNSST 2341
Cdd:NF033849  518 TSL----GTSGGRTSGA 530
 
Name Accession Description Interval E-value
S-100 cd00213
S-100: S-100 domain, which represents the largest family within the superfamily of proteins ...
2-89 1.68e-35

S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents the interaction site of S100 proteins with their target proteins. There is experimental evidence showing that many S100 proteins have multiple binding partners with diverse mode of interaction with different targets. In addition to S100 proteins (such as S100A1,-3,-4,-6,-7,-10,-11,and -13), this group includes the ''fused'' gene family, a group of calcium binding S100-related proteins. The ''fused'' gene family includes multifunctional epidermal differentiation proteins - profilaggrin, trichohyalin, repetin, hornerin, and cornulin; functionally these proteins are associated with keratin intermediate filaments and partially crosslinked to the cell envelope. These ''fused'' gene proteins contain N-terminal sequence with two Ca-binding EF-hands motif, which may be associated with calcium signaling in epidermal cells and autoprocessing in a calcium-dependent manner. In contrast to S100 proteins, "fused" gene family proteins contain an extraordinary high number of almost perfect peptide repeats with regular array of polar and charged residues similar to many known cell envelope proteins.


Pssm-ID: 238131 [Multi-domain]  Cd Length: 88  Bit Score: 131.07  E-value: 1.68e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858    2 PKLLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKL 81
Cdd:cd00213     1 SELEKAIETIIDVFHKYSGKEGDKDTLSKKELKELLETELPNFLKNQKDPEAVDKIMKDLDVNKDGKVDFQEFLVLIGKL 80

                  ....*...
gi 568923858   82 TKACNKII 89
Cdd:cd00213    81 AVACHEFF 88
S_100 pfam01023
S-100/ICaBP type calcium binding domain; The S-100 domain is a subfamily of the EF-hand ...
4-48 4.48e-15

S-100/ICaBP type calcium binding domain; The S-100 domain is a subfamily of the EF-hand calcium binding proteins.


Pssm-ID: 460028  Cd Length: 45  Bit Score: 71.31  E-value: 4.48e-15
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 568923858     4 LLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNP 48
Cdd:pfam01023    1 LERAIETIIDVFHKYAGKEGDKDTLSKKELKELLEKELPNFLKNQ 45
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2750-3051 1.22e-08

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 61.17  E-value: 1.22e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2750 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 2827
Cdd:NF033849  236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2828 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 2905
Cdd:NF033849  308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2906 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 2985
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 2986 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRGRSTSRESSCS 3051
Cdd:NF033849  466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLGTSGGRTSGAG 531
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1738-2003 4.30e-08

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 59.63  E-value: 4.30e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1738 THNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGrcgASSGQTSGCGSGQSTRYgEQGSGS 1817
Cdd:NF033849  264 SHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEG---TSTTDSSSHSQSSSYNV-SSGTGV 339
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1818 RNSSTQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGsFSGQTEGSQQHGSCCGQSSGYGQneygsGH 1897
Cdd:NF033849  340 SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG-VSGGFSGGIAGGGVTSEGLGASQ-----GG 413
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1898 SASSGQQGSHYSQSSSYGTHNSGGSPS--SSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQGQA-SGSGRYGASSG 1974
Cdd:NF033849  414 SEGWGSGDSVQSVSQSYGSSSSTGTSSghSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESwSTSQSETDSVG 493
                         250       260
                  ....*....|....*....|....*....
gi 568923858 1975 QTSGCGSGQStrygeQGSGSrnssTQSRG 2003
Cdd:NF033849  494 DSTGTSESVS-----QGDGR----STGRS 513
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1508-1817 4.83e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 4.83e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1508 QGRSSRGGQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRS 1586
Cdd:NF033849  253 QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSY 331
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1587 GRSSGLGQYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQ 1666
Cdd:NF033849  332 NVSSGTGVSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGV 402
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1667 RYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQS 1746
Cdd:NF033849  403 TSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWS 465
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568923858 1747 SSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSgrcGASSGQTSGCGSGQSTRYGEQGSGS 1817
Cdd:NF033849  466 EGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1187-1478 5.58e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.78  E-value: 5.58e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1187 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 1264
Cdd:NF033849  236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1265 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 1342
Cdd:NF033849  308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1343 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 1422
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 1423 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRG 1478
Cdd:NF033849  466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLG 521
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2402-2693 5.58e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.78  E-value: 5.58e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2402 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 2479
Cdd:NF033849  236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2480 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 2557
Cdd:NF033849  308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2558 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 2637
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 2638 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRG 2693
Cdd:NF033849  466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLG 521
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1405-1606 6.22e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.78  E-value: 6.22e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1405 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 1484
Cdd:NF033849  319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1485 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 1558
Cdd:NF033849  398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 1559 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 1606
Cdd:NF033849  478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2272-2473 6.22e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.78  E-value: 6.22e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2272 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 2351
Cdd:NF033849  319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2352 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 2425
Cdd:NF033849  398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 2426 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 2473
Cdd:NF033849  478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1402-1630 1.17e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 54.63  E-value: 1.17e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1402 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 1473
Cdd:NF033849  308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1474 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 1553
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 1554 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 1630
Cdd:NF033849  465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2269-2497 1.17e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 54.63  E-value: 1.17e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2269 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 2340
Cdd:NF033849  308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2341 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 2420
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 2421 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 2497
Cdd:NF033849  465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2617-2845 1.17e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 54.63  E-value: 1.17e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2617 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 2688
Cdd:NF033849  308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2689 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 2768
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 2769 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 2845
Cdd:NF033849  465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
471-773 1.60e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 54.24  E-value: 1.60e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  471 GQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLG 549
Cdd:NF033849  260 GTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG 338
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  550 QYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSG 629
Cdd:NF033849  339 VSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  630 ESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQSSSTQWSH 709
Cdd:NF033849  410 SQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWSEGTGTSQ 472
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568923858  710 GSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSgrcGASSGQTSGCGSGQSTRYGEQGSGS 773
Cdd:NF033849  473 GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1863-2165 1.60e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 54.24  E-value: 1.60e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1863 GQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLG 1941
Cdd:NF033849  260 GTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG 338
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1942 QYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSG 2021
Cdd:NF033849  339 VSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2022 ESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQSSSTQWSH 2101
Cdd:NF033849  410 SQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWSEGTGTSQ 472
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568923858 2102 GSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSgrcGASSGQTSGCGSGQSTRYGEQGSGS 2165
Cdd:NF033849  473 GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1753-1954 4.36e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 53.09  E-value: 4.36e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1753 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 1832
Cdd:NF033849  319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1833 SSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 1906
Cdd:NF033849  398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 1907 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 1954
Cdd:NF033849  478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
709-910 4.36e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 53.09  E-value: 4.36e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  709 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 788
Cdd:NF033849  319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  789 SSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 862
Cdd:NF033849  398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858  863 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 910
Cdd:NF033849  478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
278-602 4.99e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 52.70  E-value: 4.99e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  278 YSSGSSEEPGFTHGSGRKNSSTCGKNGSYS-GQSTGR-HQQGFGSSHELESGQSITSANHGSHSNQSSCSGTRECGSSES 355
Cdd:NF033849  237 QSAGTGYGESVGHSTSQGQSHSVGTSESHSvGTSQSQsHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEG 316
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  356 SmkkthvsgsghSSSTGKYTSTSGQNYNSTRQGCGQGKSSGSEQygassgqsSGCSSGQSTRYGEQGSGSRNSSTQSRGR 435
Cdd:NF033849  317 T-----------STTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQ--------STSISHSESSSESTGTSVGHSTSSSVSS 377
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  436 STSRESSTSQQFGSGSGRSSGFSQGGSGQGRSSRGGQQGsfSGQTEGSQQHGSCCGQSSGYGQNeygSGHSASSGQQGSH 515
Cdd:NF033849  378 SESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEG--WGSGDSVQSVSQSYGSSSSTGTS---SGHSDSSSHSTSS 452
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  516 ySQSSSYGTHNSGGSPSSSQRGHGsrSGRSSGLGQYGSPSGQTSSSTRQGSGQGQASGsgrYGASSGQTSGCGSGQSTRY 595
Cdd:NF033849  453 -GQADSVSQGTSWSEGTGTSQGQS--VGTSESWSTSQSETDSVGDSTGTSESVSQGDG---RSTGRSESQGTSLGTSGGR 526

                  ....*..
gi 568923858  596 GEQGSGS 602
Cdd:NF033849  527 TSGAGGS 533
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
706-934 8.20e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 51.93  E-value: 8.20e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  706 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 777
Cdd:NF033849  308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  778 TQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 857
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858  858 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 934
Cdd:NF033849  465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1057-1258 8.70e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 51.93  E-value: 8.70e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1057 HGSGSEQSSGLGHYGSTSGQTASSTRHRSGQGQASGSGRcGASSGQTSGCGSGQSTRYDEQGSGSRNSSTQSRGRSTSRE 1136
Cdd:NF033849  319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1137 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 1210
Cdd:NF033849  398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 1211 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 1258
Cdd:NF033849  478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1054-1282 1.64e-05

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 51.16  E-value: 1.64e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1054 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRHRSGQGQASGSGRcGASSGQTSGCGSGQSTRYDEQGSGSRNSS 1125
Cdd:NF033849  308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1126 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 1205
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 1206 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 1282
Cdd:NF033849  465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
819-1121 1.72e-05

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 50.77  E-value: 1.72e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  819 GQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLG 897
Cdd:NF033849  260 GTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG 338
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  898 QYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSG 977
Cdd:NF033849  339 VSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  978 ESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQSSSTQWSH 1057
Cdd:NF033849  410 SQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWSEGTGTSQ 472
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568923858 1058 GSGSEQSSGLGHYGSTSGQTASSTRHRSGQGQASGSgrcGASSGQTSGCGSGQSTRYDEQGSGS 1121
Cdd:NF033849  473 GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1793-2054 4.03e-05

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 49.62  E-value: 4.03e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1793 GASSGQTSGCGSGQStrygeQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQT 1872
Cdd:NF033849  232 AANLGQSAGTGYGES-----VGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTS 306
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1873 EgSQQHGSCCGQSSGYGQ-NEYGSGHSASSGQ-QGSHYSQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQYGSPSGQT 1950
Cdd:NF033849  307 E-SQSHGTTEGTSTTDSSsHSQSSSYNVSSGTgVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSS 385
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1951 SSSTRQGSGQGQASG---SGRYGASSGQTSGCGSGqstrYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFS 2027
Cdd:NF033849  386 SSGVSGGFSGGIAGGgvtSEGLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQG 461
                         250       260
                  ....*....|....*....|....*..
gi 568923858 2028 QGGSGQGRSSRGGQQGSFSGQTSGRSQ 2054
Cdd:NF033849  462 TSWSEGTGTSQGQSVGTSESWSTSQSE 488
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1455-1706 9.61e-05

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 48.46  E-value: 9.61e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1455 GSGQSTRYGEQGSGSRNSST-QSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQqHGSC 1533
Cdd:NF033849  240 GTGYGESVGHSTSQGQSHSVgTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTT-EGTS 318
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1534 CGQSSGYGQneyGSGHSASSGQ-QGSHYSQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQ 1612
Cdd:NF033849  319 TTDSSSHSQ---SSSYNVSSGTgVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSG 395
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1613 GQASG---SGRYGASSGQTSGCGSGqstrYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFSQGGSGQGRSS 1689
Cdd:NF033849  396 GIAGGgvtSEGLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTS 471
                         250
                  ....*....|....*..
gi 568923858 1690 RGGQQGSFSGQTSGRSQ 1706
Cdd:NF033849  472 QGQSVGTSESWSTSQSE 488
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2434-2689 2.02e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 47.31  E-value: 2.02e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2434 GTHNSGGSPSSSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQGQASG-----SGRYGASSGQTSGCGSGQSTRYGE 2508
Cdd:NF033849  234 NLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGwshtqSTSESESTGQSSSVGTSESQSHGT 313
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2509 Q-----GSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQHQSGSRHGSG 2583
Cdd:NF033849  314 TegtstTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGF 393
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2584 SGQFPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASS 2663
Cdd:NF033849  394 SGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGT-GTSQ 472
                         250       260
                  ....*....|....*....|....*.
gi 568923858 2664 GQTSGCGSGQSTRYGEQGSGSRNSST 2689
Cdd:NF033849  473 GQSVGTSESWSTSQSETDSVGDSTGT 498
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
421-662 4.30e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 46.15  E-value: 4.30e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  421 QGSGSRNSSTQSRGRSTSRESSTSQQFGSGSGRSSGFSQGGSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQ-N 499
Cdd:NF033849  247 VGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSE-SQSHGTTEGTSTTDSSsH 325
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  500 EYGSGHSASSGQ-QGSHYSQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQGQASG---SG 575
Cdd:NF033849  326 SQSSSYNVSSGTgVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGgvtSE 405
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  576 RYGASSGQTSGCGSGqstrYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSG 655
Cdd:NF033849  406 GLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481

                  ....*..
gi 568923858  656 QTSGRSQ 662
Cdd:NF033849  482 WSTSQSE 488
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
723-959 7.97e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 7.97e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  723 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRgrstsresstsqrygsgsg 800
Cdd:PHA03307  210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIW------------------- 270
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  801 gssGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 880
Cdd:PHA03307  271 ---EASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568923858  881 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 959
Cdd:PHA03307  336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1767-2003 7.97e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 7.97e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1767 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRgrstsresstsqrygsgsg 1844
Cdd:PHA03307  210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIW------------------- 270
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1845 gssGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 1924
Cdd:PHA03307  271 ---EASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568923858 1925 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 2003
Cdd:PHA03307  336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1419-1655 1.25e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 1.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1419 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 1496
Cdd:PHA03307  210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1497 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 1576
Cdd:PHA03307  281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568923858 1577 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 1655
Cdd:PHA03307  336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2286-2522 1.25e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 1.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2286 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 2363
Cdd:PHA03307  210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2364 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 2443
Cdd:PHA03307  281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568923858 2444 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 2522
Cdd:PHA03307  336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2634-2870 1.25e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 1.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2634 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 2711
Cdd:PHA03307  210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2712 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 2791
Cdd:PHA03307  281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568923858 2792 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 2870
Cdd:PHA03307  336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1071-1307 1.47e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 1.47e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1071 GSTSGQTASSTRHRSGQGQASGSGR--CGASSGQTSGCGSGQSTRYDEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 1148
Cdd:PHA03307  210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1149 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 1228
Cdd:PHA03307  281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568923858 1229 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 1307
Cdd:PHA03307  336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1621-1941 3.60e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.46  E-value: 3.60e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1621 YGASSGQTSGCGSGQStrygeQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSGQ 1700
Cdd:NF033849  231 YAANLGQSAGTGYGES-----VGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGT 305
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1701 TSGRSQHQSGSRHGSGSGQFpisgqqgshhghsSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQG 1780
Cdd:NF033849  306 SESQSHGTTEGTSTTDSSSH-------------SQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTS 372
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1781 SGQGQASGSGRcgASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSS 1860
Cdd:NF033849  373 SSVSSSESSSR--SSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHST 450
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1861 RGGQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-------GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSR 1933
Cdd:NF033849  451 SSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSqsetdsvGDSTGTSESVSQ-GDGRSTGRSESQGTSLGTSGGRTSG 529

                  ....*...
gi 568923858 1934 SGRSSGLG 1941
Cdd:NF033849  530 AGGSMGLG 537
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2086-2341 4.71e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.07  E-value: 4.71e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2086 THNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGrcgASSGQTSGCGSGQSTRYGEQGSGS 2165
Cdd:NF033849  286 SHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG---VSSSHSDGTSQSTSISHSESSSES 362
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2166 RNSST-QSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGsfsgQTSGRSQHQSGSRHGSGSGQFPISGQ 2244
Cdd:NF033849  363 TGTSVgHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQG----GSEGWGSGDSVQSVSQSYGSSSSTGT 438
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2245 QGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSG 2324
Cdd:NF033849  439 SSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ-GDGRSTGRSESQG 517
                         250
                  ....*....|....*..
gi 568923858 2325 QSTrygeQGSGSRNSST 2341
Cdd:NF033849  518 TSL----GTSGGRTSGA 530
 
Name Accession Description Interval E-value
S-100 cd00213
S-100: S-100 domain, which represents the largest family within the superfamily of proteins ...
2-89 1.68e-35

S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents the interaction site of S100 proteins with their target proteins. There is experimental evidence showing that many S100 proteins have multiple binding partners with diverse mode of interaction with different targets. In addition to S100 proteins (such as S100A1,-3,-4,-6,-7,-10,-11,and -13), this group includes the ''fused'' gene family, a group of calcium binding S100-related proteins. The ''fused'' gene family includes multifunctional epidermal differentiation proteins - profilaggrin, trichohyalin, repetin, hornerin, and cornulin; functionally these proteins are associated with keratin intermediate filaments and partially crosslinked to the cell envelope. These ''fused'' gene proteins contain N-terminal sequence with two Ca-binding EF-hands motif, which may be associated with calcium signaling in epidermal cells and autoprocessing in a calcium-dependent manner. In contrast to S100 proteins, "fused" gene family proteins contain an extraordinary high number of almost perfect peptide repeats with regular array of polar and charged residues similar to many known cell envelope proteins.


Pssm-ID: 238131 [Multi-domain]  Cd Length: 88  Bit Score: 131.07  E-value: 1.68e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858    2 PKLLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKL 81
Cdd:cd00213     1 SELEKAIETIIDVFHKYSGKEGDKDTLSKKELKELLETELPNFLKNQKDPEAVDKIMKDLDVNKDGKVDFQEFLVLIGKL 80

                  ....*...
gi 568923858   82 TKACNKII 89
Cdd:cd00213    81 AVACHEFF 88
calgranulins cd05030
Calgranulins: S-100 domain found in proteins belonging to the Calgranulin subgroup of the S100 ...
3-87 1.48e-17

Calgranulins: S-100 domain found in proteins belonging to the Calgranulin subgroup of the S100 family of EF-hand calcium-modulated proteins, including S100A8, S100A9, and S100A12 . Note that the S-100 hierarchy, to which this Calgranulin group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. These proteins are expressed mainly in granulocytes, and are involved in inflammation, allergy, and neuritogenesis, as well as in host-parasite response. Calgranulins are modulated not only by calcium, but also by other metals such as zinc and copper. Structural data suggested that calgranulins may exist in multiple structural forms, homodimers, as well as hetero-oligomers. For example, the S100A8/S100A9 complex called calprotectin plays important roles in the regulation of inflammatory processes, wound repair, and regulating zinc-dependent enzymes as well as microbial growth.


Pssm-ID: 240156 [Multi-domain]  Cd Length: 88  Bit Score: 80.08  E-value: 1.48e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858    3 KLLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKLT 82
Cdd:cd05030     2 ELEKAIETIINVFHQYSVRKGHPDTLYKKEFKQLVEKELPNFLKKEKNQKAIDKIFEDLDTNQDGQLSFEEFLVLVIKVG 81

                  ....*
gi 568923858   83 KACNK 87
Cdd:cd05030    82 VAAHE 86
S-100A10_like cd05031
S-100A10_like: S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of ...
4-88 2.14e-17

S-100A10_like: S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family of EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1_like group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutation in both of the calcium binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data supports the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium free state and is therefore involved in many diverse biological functions.


Pssm-ID: 240157 [Multi-domain]  Cd Length: 94  Bit Score: 79.77  E-value: 2.14e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858    4 LLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKLTK 83
Cdd:cd05031     3 LEHAMESLILTFHRYAGKDGDKNTLSRKELKKLMEKELSEFLKNQKDPMAVDKIMKDLDQNRDGKVNFEEFVSLVAGLSI 82

                  ....*
gi 568923858   84 ACNKI 88
Cdd:cd05031    83 ACEEY 87
S-100A1 cd05025
S-100A1: S-100A1 domain found in proteins similar to S100A1. S100A1 is a calcium-binding ...
3-87 1.93e-15

S-100A1: S-100A1 domain found in proteins similar to S100A1. S100A1 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. As is the case with many other members of S100 protein family, S100A1 is implicated in intracellular and extracellular regulatory activities, including interaction with myosin-associated twitchin kinase, actin-capping protein CapZ, sinapsin I, and tubulin. Structural data suggests that S100A1 proteins exist within cells as antiparallel homodimers, while heterodimers with S100A4 and S100B also has been reported. Upon binding calcium S100A1 changes conformation to expose a hydrophobic cleft which is the interaction site of S100A1 with its more that 20 known target proteins.


Pssm-ID: 240152 [Multi-domain]  Cd Length: 92  Bit Score: 74.15  E-value: 1.93e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858    3 KLLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKLT 82
Cdd:cd05025     3 ELETAMETLINVFHAHSGKEGDKYKLSKKELKDLLQTELSDFLDAQKDADAVDKIMKELDENGDGEVDFQEFVVLVAALT 82

                  ....*
gi 568923858   83 KACNK 87
Cdd:cd05025    83 VACNN 87
S_100 pfam01023
S-100/ICaBP type calcium binding domain; The S-100 domain is a subfamily of the EF-hand ...
4-48 4.48e-15

S-100/ICaBP type calcium binding domain; The S-100 domain is a subfamily of the EF-hand calcium binding proteins.


Pssm-ID: 460028  Cd Length: 45  Bit Score: 71.31  E-value: 4.48e-15
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 568923858     4 LLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNP 48
Cdd:pfam01023    1 LERAIETIIDVFHKYAGKEGDKDTLSKKELKELLEKELPNFLKNQ 45
S-100Z cd05026
S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain ...
1-86 4.69e-15

S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain family within the EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100Z group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately.S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100Z is normally expressed in various tissues, with its highest level of expression being in spleen and leukocytes. The function of S100Z remains unclear. Preliminary structural data suggests that S100Z is homodimer, however a heterodimer with S100P has been reported. S100Z is capable of binding calcium ions. When calcium binds to S110Z, the protein experiences a conformational change, which exposes hydrophobic surfaces on the protein. In comparison with their normal tissue counterparts, S100Z gene expression appears to be deregulated in some tumor tissues.


Pssm-ID: 240153 [Multi-domain]  Cd Length: 93  Bit Score: 72.98  E-value: 4.69e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858    1 MPKLLE-SIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMIL 79
Cdd:cd05026     1 MPTQLEgAMDTLIRIFHNYSGKEGDRYKLSKGELKELLQRELTDFLSSQKDPMLVDKIMNDLDSNKDNEVDFNEFVVLVA 80

                  ....*..
gi 568923858   80 KLTKACN 86
Cdd:cd05026    81 ALTVACN 87
S-100B cd05027
S-100B: S-100B domain found in proteins similar to S100B. S100B is a calcium-binding protein ...
3-87 1.08e-12

S-100B: S-100B domain found in proteins similar to S100B. S100B is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100B group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100B is most abundant in glial cells of the central nervous system, predominately in astrocytes. S100B is involved in signal transduction via the inhibition of protein phoshorylation, regulation of enzyme activity and by affecting the calcium homeostasis. Upon calcium binding the S100B homodimer changes conformation to expose a hydrophobic cleft, which represents the interaction site of S100B with its more than 20 known target proteins. These target proteins include several cellular architecture proteins such as tubulin and GFAP; S100B can inhibit polymerization of these oligomeric molecules. Furthermore, S100B inhibits the phosphorylation of multiple kinase substrates including the Alzheimer protein tau and neuromodulin (GAP-43) through a calcium-sensitive interaction with the protein substrates.


Pssm-ID: 240154 [Multi-domain]  Cd Length: 88  Bit Score: 66.03  E-value: 1.08e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858    3 KLLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKLT 82
Cdd:cd05027     2 ELEKAMVALIDVFHQYSGREGDKHKLKKSELKELINNELSHFLEEIKEQEVVDKVMETLDSDGDGECDFQEFMAFVAMVT 81

                  ....*
gi 568923858   83 KACNK 87
Cdd:cd05027    82 TACHE 86
S-100A10 cd05024
S-100A10: A subgroup of the S-100A10 domain found in proteins similar to S100A10. S100A10 is a ...
3-86 7.86e-10

S-100A10: A subgroup of the S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family of EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A10 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutation in both of the calcium binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data supports the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium free state and is therefore involved in many diverse biological functions.


Pssm-ID: 240151 [Multi-domain]  Cd Length: 91  Bit Score: 57.93  E-value: 7.86e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858    3 KLLESIVTVIDVFYQYAteyGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKLT 82
Cdd:cd05024     2 ELEHSMEKMMLTFHKFA---GEKNYLNRDDLQKLMEKEFSEFLKNQNDPMAVDKIMKDLDDCRDGKVGFQSFFSLIAGLL 78

                  ....
gi 568923858   83 KACN 86
Cdd:cd05024    79 IACN 82
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2750-3051 1.22e-08

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 61.17  E-value: 1.22e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2750 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 2827
Cdd:NF033849  236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2828 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 2905
Cdd:NF033849  308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2906 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 2985
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 2986 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRGRSTSRESSCS 3051
Cdd:NF033849  466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLGTSGGRTSGAG 531
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1738-2003 4.30e-08

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 59.63  E-value: 4.30e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1738 THNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGrcgASSGQTSGCGSGQSTRYgEQGSGS 1817
Cdd:NF033849  264 SHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEG---TSTTDSSSHSQSSSYNV-SSGTGV 339
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1818 RNSSTQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGsFSGQTEGSQQHGSCCGQSSGYGQneygsGH 1897
Cdd:NF033849  340 SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG-VSGGFSGGIAGGGVTSEGLGASQ-----GG 413
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1898 SASSGQQGSHYSQSSSYGTHNSGGSPS--SSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQGQA-SGSGRYGASSG 1974
Cdd:NF033849  414 SEGWGSGDSVQSVSQSYGSSSSTGTSSghSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESwSTSQSETDSVG 493
                         250       260
                  ....*....|....*....|....*....
gi 568923858 1975 QTSGCGSGQStrygeQGSGSrnssTQSRG 2003
Cdd:NF033849  494 DSTGTSESVS-----QGDGR----STGRS 513
S-100A11 cd05023
S-100A11: S-100A11 domain found in proteins similar to S100A11. S100A11 is a member of the ...
7-85 1.89e-07

S-100A11: S-100A11 domain found in proteins similar to S100A11. S100A11 is a member of the S-100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A11 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100 proteins have also been associated with a variety of pathological events, including neoplastic transformation and neurodegenerative diseases such as Alzheimer's, usually via over expression of the protein. S100A11 is expressed in smooth muscle and other tissues and involves in calcium-dependent membrane aggregation, which is important for cell vesiculation . As is the case for many other S100 proteins, S100A11 is homodimer, which is able to form a heterodimer with S100B through subunit exchange. Ca2+ binding to S100A11 results in a conformational change in the protein, exposing a hydrophobic surface that interacts with target proteins. In addition to binding to annexin A1 and A6 S100A11 also interacts with actin and transglutaminase.


Pssm-ID: 240150 [Multi-domain]  Cd Length: 89  Bit Score: 51.31  E-value: 1.89e-07
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568923858    7 SIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKLTKAC 85
Cdd:cd05023     7 CIESLIAVFQKYAGKDGDSYQLSKTEFLSFMNTELASFTKNQKDPGVLDRMMKKLDLNSDGQLDFQEFLNLIGGLAVAC 85
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1508-1817 4.83e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 4.83e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1508 QGRSSRGGQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRS 1586
Cdd:NF033849  253 QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSY 331
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1587 GRSSGLGQYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQ 1666
Cdd:NF033849  332 NVSSGTGVSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGV 402
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1667 RYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQS 1746
Cdd:NF033849  403 TSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWS 465
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568923858 1747 SSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSgrcGASSGQTSGCGSGQSTRYGEQGSGS 1817
Cdd:NF033849  466 EGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1187-1478 5.58e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.78  E-value: 5.58e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1187 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 1264
Cdd:NF033849  236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1265 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 1342
Cdd:NF033849  308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1343 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 1422
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 1423 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRG 1478
Cdd:NF033849  466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLG 521
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2402-2693 5.58e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.78  E-value: 5.58e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2402 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 2479
Cdd:NF033849  236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2480 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 2557
Cdd:NF033849  308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2558 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 2637
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 2638 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRG 2693
Cdd:NF033849  466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLG 521
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1405-1606 6.22e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.78  E-value: 6.22e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1405 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 1484
Cdd:NF033849  319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1485 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 1558
Cdd:NF033849  398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 1559 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 1606
Cdd:NF033849  478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2272-2473 6.22e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.78  E-value: 6.22e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2272 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 2351
Cdd:NF033849  319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2352 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 2425
Cdd:NF033849  398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 2426 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 2473
Cdd:NF033849  478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1402-1630 1.17e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 54.63  E-value: 1.17e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1402 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 1473
Cdd:NF033849  308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1474 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 1553
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 1554 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 1630
Cdd:NF033849  465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2269-2497 1.17e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 54.63  E-value: 1.17e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2269 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 2340
Cdd:NF033849  308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2341 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 2420
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 2421 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 2497
Cdd:NF033849  465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2617-2845 1.17e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 54.63  E-value: 1.17e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2617 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 2688
Cdd:NF033849  308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2689 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 2768
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 2769 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 2845
Cdd:NF033849  465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
471-773 1.60e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 54.24  E-value: 1.60e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  471 GQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLG 549
Cdd:NF033849  260 GTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG 338
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  550 QYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSG 629
Cdd:NF033849  339 VSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  630 ESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQSSSTQWSH 709
Cdd:NF033849  410 SQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWSEGTGTSQ 472
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568923858  710 GSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSgrcGASSGQTSGCGSGQSTRYGEQGSGS 773
Cdd:NF033849  473 GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1863-2165 1.60e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 54.24  E-value: 1.60e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1863 GQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLG 1941
Cdd:NF033849  260 GTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG 338
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1942 QYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSG 2021
Cdd:NF033849  339 VSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2022 ESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQSSSTQWSH 2101
Cdd:NF033849  410 SQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWSEGTGTSQ 472
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568923858 2102 GSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSgrcGASSGQTSGCGSGQSTRYGEQGSGS 2165
Cdd:NF033849  473 GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
S-100A13 cd05022
S-100A13: S-100A13 domain found in proteins similar to S100A13. S100A13 is a calcium-binding ...
6-84 2.59e-06

S-100A13: S-100A13 domain found in proteins similar to S100A13. S100A13 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A13 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100A13 is involved in the cellular export of interleukin-1 (IL-1) and of fibroblast growth factor-1 (FGF-1), which plays an important role in angiogenesis and tissue regeneration. Export is based on the CuII-dependent formation of multiprotein complexes containing the S100A13 protein. Assembly of these complexes occurs near the inner surface of the plasma membrane. Binding of two Ca(II) ions per monomer triggers key conformational changes leading to the creation of two identical and symmetrical Cu(II)-binding sites on the surface of the protein, close to the interface between the two monomers. These Cu(II)-binding sites are unique among the S100 proteins, which are reported to bind Cu(II) or Zn(II) ions in addition to Ca(II) ions. In addition, the three-dimensional structure of S100A13 differs significantly from those of other S100 proteins; the hydrophobic pocket that largely contributes to protein-protein interactions in other S100 proteins is absent in S100A13. The structure of S100A13 contains a large patch of negatively charged residues flanked by dense cationic clusters, formed mostly from positively charged residues from the C-terminal end, which plays major role in binding FGF-1.


Pssm-ID: 240149 [Multi-domain]  Cd Length: 89  Bit Score: 48.11  E-value: 2.59e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568923858    6 ESIVTVIDVFYQYATEyGNCDMLSKEEMKELLVTEFHQILKnpdDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKLTKA 84
Cdd:cd05022     5 KAIETLVSNFHKASVK-GGKESLTASEFQELLTQQLPHLLK---DVEGLEEKMKNLDVNQDSKLSFEEFWELIGELAKA 79
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1753-1954 4.36e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 53.09  E-value: 4.36e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1753 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 1832
Cdd:NF033849  319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1833 SSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 1906
Cdd:NF033849  398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 1907 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 1954
Cdd:NF033849  478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
709-910 4.36e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 53.09  E-value: 4.36e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  709 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 788
Cdd:NF033849  319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  789 SSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 862
Cdd:NF033849  398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858  863 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 910
Cdd:NF033849  478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
278-602 4.99e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 52.70  E-value: 4.99e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  278 YSSGSSEEPGFTHGSGRKNSSTCGKNGSYS-GQSTGR-HQQGFGSSHELESGQSITSANHGSHSNQSSCSGTRECGSSES 355
Cdd:NF033849  237 QSAGTGYGESVGHSTSQGQSHSVGTSESHSvGTSQSQsHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEG 316
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  356 SmkkthvsgsghSSSTGKYTSTSGQNYNSTRQGCGQGKSSGSEQygassgqsSGCSSGQSTRYGEQGSGSRNSSTQSRGR 435
Cdd:NF033849  317 T-----------STTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQ--------STSISHSESSSESTGTSVGHSTSSSVSS 377
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  436 STSRESSTSQQFGSGSGRSSGFSQGGSGQGRSSRGGQQGsfSGQTEGSQQHGSCCGQSSGYGQNeygSGHSASSGQQGSH 515
Cdd:NF033849  378 SESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEG--WGSGDSVQSVSQSYGSSSSTGTS---SGHSDSSSHSTSS 452
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  516 ySQSSSYGTHNSGGSPSSSQRGHGsrSGRSSGLGQYGSPSGQTSSSTRQGSGQGQASGsgrYGASSGQTSGCGSGQSTRY 595
Cdd:NF033849  453 -GQADSVSQGTSWSEGTGTSQGQS--VGTSESWSTSQSETDSVGDSTGTSESVSQGDG---RSTGRSESQGTSLGTSGGR 526

                  ....*..
gi 568923858  596 GEQGSGS 602
Cdd:NF033849  527 TSGAGGS 533
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
706-934 8.20e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 51.93  E-value: 8.20e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  706 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 777
Cdd:NF033849  308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  778 TQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 857
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858  858 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 934
Cdd:NF033849  465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1057-1258 8.70e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 51.93  E-value: 8.70e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1057 HGSGSEQSSGLGHYGSTSGQTASSTRHRSGQGQASGSGRcGASSGQTSGCGSGQSTRYDEQGSGSRNSSTQSRGRSTSRE 1136
Cdd:NF033849  319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1137 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 1210
Cdd:NF033849  398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 1211 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 1258
Cdd:NF033849  478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
S-100A6 cd05029
S-100A6: S-100A6 domain found in proteins similar to S100A6. S100A6 is a member of the S100 ...
4-89 1.17e-05

S-100A6: S-100A6 domain found in proteins similar to S100A6. S100A6 is a member of the S100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A6 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100A6 is normally expressed in the G1 phase of the cell cycle in neuronal cells. The function of S100A6 remains unclear, but evidence suggests that it is involved in cell cycle regulation and exocytosis. S100A6 may also be involved in tumorigenesis; the protein is overexpressed in several tumors. Ca2+ binding to S100A6 leads to a conformational change in the protein, which exposes a hydrophobic surface for interaction with target proteins. Several such proteins have been identified: glyceraldehyde-3-phosphate dehydrogenase , annexins 2, 6 and 11 and Calcyclin-Binding Protein (CacyBP).


Pssm-ID: 240155 [Multi-domain]  Cd Length: 88  Bit Score: 45.99  E-value: 1.17e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858    4 LLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFhQILKNPDDPDTVDiIMQNLDRDHNHKVDFTEYLLMILKLTK 83
Cdd:cd05029     5 LDQAIGLLVAIFHKYSGREGDKNTLSKKELKELIQKEL-TIGSKLQDAEIAK-LMEDLDRNKDQEVNFQEYVTFLGALAL 82

                  ....*.
gi 568923858   84 ACNKII 89
Cdd:cd05029    83 IYNEAL 88
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1054-1282 1.64e-05

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 51.16  E-value: 1.64e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1054 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRHRSGQGQASGSGRcGASSGQTSGCGSGQSTRYDEQGSGSRNSS 1125
Cdd:NF033849  308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1126 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 1205
Cdd:NF033849  387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 568923858 1206 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 1282
Cdd:NF033849  465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
819-1121 1.72e-05

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 50.77  E-value: 1.72e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  819 GQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLG 897
Cdd:NF033849  260 GTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG 338
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  898 QYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSG 977
Cdd:NF033849  339 VSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  978 ESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQSSSTQWSH 1057
Cdd:NF033849  410 SQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWSEGTGTSQ 472
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568923858 1058 GSGSEQSSGLGHYGSTSGQTASSTRHRSGQGQASGSgrcGASSGQTSGCGSGQSTRYDEQGSGS 1121
Cdd:NF033849  473 GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1793-2054 4.03e-05

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 49.62  E-value: 4.03e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1793 GASSGQTSGCGSGQStrygeQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQT 1872
Cdd:NF033849  232 AANLGQSAGTGYGES-----VGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTS 306
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1873 EgSQQHGSCCGQSSGYGQ-NEYGSGHSASSGQ-QGSHYSQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQYGSPSGQT 1950
Cdd:NF033849  307 E-SQSHGTTEGTSTTDSSsHSQSSSYNVSSGTgVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSS 385
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1951 SSSTRQGSGQGQASG---SGRYGASSGQTSGCGSGqstrYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFS 2027
Cdd:NF033849  386 SSGVSGGFSGGIAGGgvtSEGLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQG 461
                         250       260
                  ....*....|....*....|....*..
gi 568923858 2028 QGGSGQGRSSRGGQQGSFSGQTSGRSQ 2054
Cdd:NF033849  462 TSWSEGTGTSQGQSVGTSESWSTSQSE 488
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1455-1706 9.61e-05

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 48.46  E-value: 9.61e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1455 GSGQSTRYGEQGSGSRNSST-QSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQqHGSC 1533
Cdd:NF033849  240 GTGYGESVGHSTSQGQSHSVgTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTT-EGTS 318
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1534 CGQSSGYGQneyGSGHSASSGQ-QGSHYSQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQ 1612
Cdd:NF033849  319 TTDSSSHSQ---SSSYNVSSGTgVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSG 395
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1613 GQASG---SGRYGASSGQTSGCGSGqstrYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFSQGGSGQGRSS 1689
Cdd:NF033849  396 GIAGGgvtSEGLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTS 471
                         250
                  ....*....|....*..
gi 568923858 1690 RGGQQGSFSGQTSGRSQ 1706
Cdd:NF033849  472 QGQSVGTSESWSTSQSE 488
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2434-2689 2.02e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 47.31  E-value: 2.02e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2434 GTHNSGGSPSSSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQGQASG-----SGRYGASSGQTSGCGSGQSTRYGE 2508
Cdd:NF033849  234 NLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGwshtqSTSESESTGQSSSVGTSESQSHGT 313
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2509 Q-----GSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQHQSGSRHGSG 2583
Cdd:NF033849  314 TegtstTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGF 393
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2584 SGQFPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASS 2663
Cdd:NF033849  394 SGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGT-GTSQ 472
                         250       260
                  ....*....|....*....|....*.
gi 568923858 2664 GQTSGCGSGQSTRYGEQGSGSRNSST 2689
Cdd:NF033849  473 GQSVGTSESWSTSQSETDSVGDSTGT 498
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
421-662 4.30e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 46.15  E-value: 4.30e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  421 QGSGSRNSSTQSRGRSTSRESSTSQQFGSGSGRSSGFSQGGSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQ-N 499
Cdd:NF033849  247 VGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSE-SQSHGTTEGTSTTDSSsH 325
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  500 EYGSGHSASSGQ-QGSHYSQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQGQASG---SG 575
Cdd:NF033849  326 SQSSSYNVSSGTgVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGgvtSE 405
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  576 RYGASSGQTSGCGSGqstrYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSG 655
Cdd:NF033849  406 GLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481

                  ....*..
gi 568923858  656 QTSGRSQ 662
Cdd:NF033849  482 WSTSQSE 488
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
723-959 7.97e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 7.97e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  723 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRgrstsresstsqrygsgsg 800
Cdd:PHA03307  210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIW------------------- 270
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858  801 gssGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 880
Cdd:PHA03307  271 ---EASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568923858  881 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 959
Cdd:PHA03307  336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1767-2003 7.97e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 7.97e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1767 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRgrstsresstsqrygsgsg 1844
Cdd:PHA03307  210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIW------------------- 270
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1845 gssGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 1924
Cdd:PHA03307  271 ---EASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568923858 1925 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 2003
Cdd:PHA03307  336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1419-1655 1.25e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 1.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1419 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 1496
Cdd:PHA03307  210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1497 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 1576
Cdd:PHA03307  281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568923858 1577 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 1655
Cdd:PHA03307  336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2286-2522 1.25e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 1.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2286 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 2363
Cdd:PHA03307  210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2364 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 2443
Cdd:PHA03307  281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568923858 2444 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 2522
Cdd:PHA03307  336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2634-2870 1.25e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 1.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2634 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 2711
Cdd:PHA03307  210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2712 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 2791
Cdd:PHA03307  281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568923858 2792 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 2870
Cdd:PHA03307  336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1071-1307 1.47e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 1.47e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1071 GSTSGQTASSTRHRSGQGQASGSGR--CGASSGQTSGCGSGQSTRYDEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 1148
Cdd:PHA03307  210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1149 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 1228
Cdd:PHA03307  281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568923858 1229 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 1307
Cdd:PHA03307  336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
EF-hand_7 pfam13499
EF-hand domain pair;
28-79 1.95e-03

EF-hand domain pair;


Pssm-ID: 463900 [Multi-domain]  Cd Length: 67  Bit Score: 39.16  E-value: 1.95e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 568923858    28 LSKEEMKELLVTEFhqiLKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMIL 79
Cdd:pfam13499   19 LDVEELKKLLRKLE---EGEPLSDEEVEELFKEFDLDKDGRISFEEFLELYS 67
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1621-1941 3.60e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.46  E-value: 3.60e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1621 YGASSGQTSGCGSGQStrygeQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSGQ 1700
Cdd:NF033849  231 YAANLGQSAGTGYGES-----VGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGT 305
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1701 TSGRSQHQSGSRHGSGSGQFpisgqqgshhghsSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQG 1780
Cdd:NF033849  306 SESQSHGTTEGTSTTDSSSH-------------SQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTS 372
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1781 SGQGQASGSGRcgASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSS 1860
Cdd:NF033849  373 SSVSSSESSSR--SSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHST 450
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 1861 RGGQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-------GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSR 1933
Cdd:NF033849  451 SSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSqsetdsvGDSTGTSESVSQ-GDGRSTGRSESQGTSLGTSGGRTSG 529

                  ....*...
gi 568923858 1934 SGRSSGLG 1941
Cdd:NF033849  530 AGGSMGLG 537
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2086-2341 4.71e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.07  E-value: 4.71e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2086 THNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGrcgASSGQTSGCGSGQSTRYGEQGSGS 2165
Cdd:NF033849  286 SHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG---VSSSHSDGTSQSTSISHSESSSES 362
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2166 RNSST-QSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGsfsgQTSGRSQHQSGSRHGSGSGQFPISGQ 2244
Cdd:NF033849  363 TGTSVgHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQG----GSEGWGSGDSVQSVSQSYGSSSSTGT 438
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568923858 2245 QGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSG 2324
Cdd:NF033849  439 SSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ-GDGRSTGRSESQG 517
                         250
                  ....*....|....*..
gi 568923858 2325 QSTrygeQGSGSRNSST 2341
Cdd:NF033849  518 TSL----GTSGGRTSGA 530
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH