Conserved domains on  [gi|1720404921|ref|XP_030108641|]

hornerin isoform X3 [Mus musculus]

Protein Classification

S-100 domain-containing protein (domain architecture ID 10082979)

S-100 domain-containing protein contains the Ca-binding EF-hand motif; similar to Homo sapiens S100 proteins that are implicated in intracellular and extracellular regulatory activities

CATH:  1.10.238.10
Gene Ontology:  GO:0005509
PubMed:  2479149|10191494
SCOP:  3001983

List of domain hits

Name Accession Description Interval E-value
S-100 cd00213
S-100: S-100 domain, which represents the largest family within the superfamily of proteins ...
2-89 1.59e-35

S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains; other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in the extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation end products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure, exposing a hydrophobic cleft. This hydrophobic surface is the interaction site of S100 proteins with their target proteins. Experimental evidence shows that many S100 proteins have multiple binding partners, with diverse modes of interaction with different targets. In addition to S100 proteins (such as S100A1, -3, -4, -6, -7, -10, -11, and -13), this group includes the "fused" gene family, a group of calcium-binding S100-related proteins. The "fused" gene family includes multifunctional epidermal differentiation proteins (profilaggrin, trichohyalin, repetin, hornerin, and cornulin); functionally, these proteins are associated with keratin intermediate filaments and are partially crosslinked to the cell envelope. These "fused" gene proteins contain an N-terminal sequence with two Ca-binding EF-hand motifs, which may be associated with calcium signaling in epidermal cells and autoprocessing in a calcium-dependent manner. In contrast to S100 proteins, "fused" gene family proteins contain an extraordinarily high number of almost perfect peptide repeats with a regular array of polar and charged residues, similar to many known cell envelope proteins.



Pssm-ID: 238131 [Multi-domain]  Cd Length: 88  Bit Score: 131.07  E-value: 1.59e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921    2 PKLLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKL 81
Cdd:cd00213      1 SELEKAIETIIDVFHKYSGKEGDKDTLSKKELKELLETELPNFLKNQKDPEAVDKIMKDLDVNKDGKVDFQEFLVLIGKL 80

                   ....*...
gi 1720404921   82 TKACNKII 89
Cdd:cd00213     81 AVACHEFF 88
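
The alignment above can be summarized as a percent identity between the query and the cd00213 consensus row. The following is a minimal sketch, not part of the CDD output: the two strings are copied from the alignment rows above with the 80-column and 8-column blocks concatenated, and the helper name `percent_identity` is hypothetical. It assumes that columns containing a `-` on either side are gaps and should be skipped.

```python
# Aligned rows copied from the alignment above (query = gi 1720404921,
# subject = Cdd:cd00213); the 80-column and 8-column blocks are concatenated.
QUERY = (
    "PKLLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKL"
    "TKACNKII"
)
SUBJECT = (
    "SELEKAIETIIDVFHKYSGKEGDKDTLSKKELKELLETELPNFLKNQKDPEAVDKIMKDLDVNKDGKVDFQEFLVLIGKL"
    "AVACHEFF"
)


def percent_identity(a: str, b: str) -> float:
    """Percent of gap-free alignment columns whose residues are identical.

    Hypothetical helper: columns with '-' in either row are ignored
    (an assumption about how gaps should be treated).
    """
    if len(a) != len(b):
        raise ValueError("aligned rows must have the same number of columns")
    columns = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    value = percent_identity(QUERY, SUBJECT)
    print(f"{value:.1f}% identity over {len(QUERY)} aligned columns")
```

Percent identity is only a rough companion to the bit score and E-value reported by CDD, which are derived from a position-specific scoring matrix rather than from simple residue matches.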
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2573-2874 8.39e-09

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 61.56  E-value: 8.39e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2573 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 2650
Cdd:NF033849   236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2651 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 2728
Cdd:NF033849   308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2729 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 2808
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 2809 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRGRSTSRESSCS 2874
Cdd:NF033849   466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLGTSGGRTSGAG 531
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1561-1826 3.15e-08

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 60.02  E-value: 3.15e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1561 THNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGrcgASSGQTSGCGSGQSTRYgEQGSGS 1640
Cdd:NF033849   264 SHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEG---TSTTDSSSHSQSSSYNV-SSGTGV 339
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1641 RNSSTQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGsFSGQTEGSQQHGSCCGQSSGYGQneygsGH 1720
Cdd:NF033849   340 SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG-VSGGFSGGIAGGGVTSEGLGASQ-----GG 413
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1721 SASSGQQGSHYSQSSSYGTHNSGGSPS--SSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQGQA-SGSGRYGASSG 1797
Cdd:NF033849   414 SEGWGSGDSVQSVSQSYGSSSSTGTSSghSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESwSTSQSETDSVG 493
                          250       260
                   ....*....|....*....|....*....
gi 1720404921 1798 QTSGCGSGQStrygeQGSGSrnssTQSRG 1826
Cdd:NF033849   494 DSTGTSESVS-----QGDGR----STGRS 513
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1331-1640 3.40e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.55  E-value: 3.40e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1331 QGRSSRGGQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRS 1409
Cdd:NF033849   253 QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSY 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1410 GRSSGLGQYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQ 1489
Cdd:NF033849   332 NVSSGTGVSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGV 402
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1490 RYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQS 1569
Cdd:NF033849   403 TSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWS 465
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720404921 1570 SSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSgrcGASSGQTSGCGSGQSTRYGEQGSGS 1640
Cdd:NF033849   466 EGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1010-1301 3.89e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 3.89e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1010 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 1087
Cdd:NF033849   236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1088 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 1165
Cdd:NF033849   308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1166 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 1245
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 1246 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRG 1301
Cdd:NF033849   466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLG 521
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2225-2516 3.89e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 3.89e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2225 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 2302
Cdd:NF033849   236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2303 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 2380
Cdd:NF033849   308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2381 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 2460
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 2461 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRG 2516
Cdd:NF033849   466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLG 521
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2095-2296 4.23e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 4.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2095 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 2174
Cdd:NF033849   319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2175 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 2248
Cdd:NF033849   398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 2249 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 2296
Cdd:NF033849   478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
532-733 3.04e-06

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 53.47  E-value: 3.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  532 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 611
Cdd:NF033849   319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  612 SSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 685
Cdd:NF033849   398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921  686 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 733
Cdd:NF033849   478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
880-1081 5.97e-06

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 52.31  E-value: 5.97e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  880 HGSGSEQSSGLGHYGSTSGQTASSTRHRSGQGQASGSGRcGASSGQTSGCGSGQSTRYDEQGSGSRNSSTQSRGRSTSRE 959
Cdd:NF033849   319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  960 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 1033
Cdd:NF033849   398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 1034 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 1081
Cdd:NF033849   478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1909-2164 3.79e-03

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.07  E-value: 3.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1909 THNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGrcgASSGQTSGCGSGQSTRYGEQGSGS 1988
Cdd:NF033849   286 SHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG---VSSSHSDGTSQSTSISHSESSSES 362
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1989 RNSST-QSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGsfsgQTSGRSQHQSGSRHGSGSGQFPISGQ 2067
Cdd:NF033849   363 TGTSVgHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQG----GSEGWGSGDSVQSVSQSYGSSSSTGT 438
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2068 QGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSG 2147
Cdd:NF033849   439 SSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ-GDGRSTGRSESQG 517
                          250
                   ....*....|....*..
gi 1720404921 2148 QSTrygeQGSGSRNSST 2164
Cdd:NF033849   518 TSL----GTSGGRTSGA 530
 
Name Accession Description Interval E-value
S-100 cd00213
S-100: S-100 domain, which represents the largest family within the superfamily of proteins ...
2-89 1.59e-35

S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains; other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in the extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation end products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure, exposing a hydrophobic cleft. This hydrophobic surface is the interaction site of S100 proteins with their target proteins. Experimental evidence shows that many S100 proteins have multiple binding partners, with diverse modes of interaction with different targets. In addition to S100 proteins (such as S100A1, -3, -4, -6, -7, -10, -11, and -13), this group includes the "fused" gene family, a group of calcium-binding S100-related proteins. The "fused" gene family includes multifunctional epidermal differentiation proteins (profilaggrin, trichohyalin, repetin, hornerin, and cornulin); functionally, these proteins are associated with keratin intermediate filaments and are partially crosslinked to the cell envelope. These "fused" gene proteins contain an N-terminal sequence with two Ca-binding EF-hand motifs, which may be associated with calcium signaling in epidermal cells and autoprocessing in a calcium-dependent manner. In contrast to S100 proteins, "fused" gene family proteins contain an extraordinarily high number of almost perfect peptide repeats with a regular array of polar and charged residues, similar to many known cell envelope proteins.


Pssm-ID: 238131 [Multi-domain]  Cd Length: 88  Bit Score: 131.07  E-value: 1.59e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921    2 PKLLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKL 81
Cdd:cd00213      1 SELEKAIETIIDVFHKYSGKEGDKDTLSKKELKELLETELPNFLKNQKDPEAVDKIMKDLDVNKDGKVDFQEFLVLIGKL 80

                   ....*...
gi 1720404921   82 TKACNKII 89
Cdd:cd00213     81 AVACHEFF 88
S_100 pfam01023
S-100/ICaBP type calcium binding domain; The S-100 domain is a subfamily of the EF-hand ...
4-48 4.25e-15

S-100/ICaBP type calcium binding domain; The S-100 domain is a subfamily of the EF-hand calcium binding proteins.


Pssm-ID: 460028  Cd Length: 45  Bit Score: 71.31  E-value: 4.25e-15
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1720404921    4 LLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNP 48
Cdd:pfam01023    1 LERAIETIIDVFHKYAGKEGDKDTLSKKELKELLEKELPNFLKNQ 45
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2573-2874 8.39e-09

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 61.56  E-value: 8.39e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2573 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 2650
Cdd:NF033849   236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2651 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 2728
Cdd:NF033849   308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2729 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 2808
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 2809 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRGRSTSRESSCS 2874
Cdd:NF033849   466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLGTSGGRTSGAG 531
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1561-1826 3.15e-08

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 60.02  E-value: 3.15e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1561 THNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGrcgASSGQTSGCGSGQSTRYgEQGSGS 1640
Cdd:NF033849   264 SHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEG---TSTTDSSSHSQSSSYNV-SSGTGV 339
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1641 RNSSTQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGsFSGQTEGSQQHGSCCGQSSGYGQneygsGH 1720
Cdd:NF033849   340 SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG-VSGGFSGGIAGGGVTSEGLGASQ-----GG 413
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1721 SASSGQQGSHYSQSSSYGTHNSGGSPS--SSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQGQA-SGSGRYGASSG 1797
Cdd:NF033849   414 SEGWGSGDSVQSVSQSYGSSSSTGTSSghSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESwSTSQSETDSVG 493
                          250       260
                   ....*....|....*....|....*....
gi 1720404921 1798 QTSGCGSGQStrygeQGSGSrnssTQSRG 1826
Cdd:NF033849   494 DSTGTSESVS-----QGDGR----STGRS 513
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1331-1640 3.40e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.55  E-value: 3.40e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1331 QGRSSRGGQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRS 1409
Cdd:NF033849   253 QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSY 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1410 GRSSGLGQYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQ 1489
Cdd:NF033849   332 NVSSGTGVSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGV 402
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1490 RYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQS 1569
Cdd:NF033849   403 TSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWS 465
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720404921 1570 SSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSgrcGASSGQTSGCGSGQSTRYGEQGSGS 1640
Cdd:NF033849   466 EGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1010-1301 3.89e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 3.89e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1010 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 1087
Cdd:NF033849   236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1088 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 1165
Cdd:NF033849   308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1166 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 1245
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 1246 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRG 1301
Cdd:NF033849   466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLG 521
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2225-2516 3.89e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 3.89e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2225 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 2302
Cdd:NF033849   236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2303 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 2380
Cdd:NF033849   308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2381 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 2460
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 2461 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRG 2516
Cdd:NF033849   466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLG 521
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1228-1429 4.23e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 4.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1228 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 1307
Cdd:NF033849   319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1308 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 1381
Cdd:NF033849   398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 1382 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 1429
Cdd:NF033849   478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2095-2296 4.23e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 4.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2095 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 2174
Cdd:NF033849   319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2175 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 2248
Cdd:NF033849   398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 2249 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 2296
Cdd:NF033849   478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1225-1453 8.23e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.01  E-value: 8.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1225 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 1296
Cdd:NF033849   308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1297 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 1376
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 1377 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 1453
Cdd:NF033849   465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2092-2320 8.23e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.01  E-value: 8.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2092 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 2163
Cdd:NF033849   308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2164 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 2243
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 2244 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 2320
Cdd:NF033849   465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2440-2668 8.23e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.01  E-value: 8.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2440 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 2511
Cdd:NF033849   308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2512 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 2591
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 2592 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 2668
Cdd:NF033849   465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1686-1988 1.12e-06

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 54.63  E-value: 1.12e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1686 GQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLG 1764
Cdd:NF033849   260 GTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG 338
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1765 QYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSG 1844
Cdd:NF033849   339 VSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1845 ESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQSSSTQWSH 1924
Cdd:NF033849   410 SQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWSEGTGTSQ 472
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720404921 1925 GSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSgrcGASSGQTSGCGSGQSTRYGEQGSGS 1988
Cdd:NF033849   473 GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1576-1777 3.04e-06

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 53.47  E-value: 3.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1576 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 1655
Cdd:NF033849   319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1656 SSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 1729
Cdd:NF033849   398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 1730 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 1777
Cdd:NF033849   478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
532-733 3.04e-06

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 53.47  E-value: 3.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  532 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 611
Cdd:NF033849   319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  612 SSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 685
Cdd:NF033849   398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921  686 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 733
Cdd:NF033849   478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
529-757 5.92e-06

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 52.31  E-value: 5.92e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  529 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 600
Cdd:NF033849   308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  601 TQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 680
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921  681 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 757
Cdd:NF033849   465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
880-1081 5.97e-06

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 52.31  E-value: 5.97e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  880 HGSGSEQSSGLGHYGSTSGQTASSTRHRSGQGQASGSGRcGASSGQTSGCGSGQSTRYDEQGSGSRNSSTQSRGRSTSRE 959
Cdd:NF033849   319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  960 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 1033
Cdd:NF033849   398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 1034 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 1081
Cdd:NF033849   478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
877-1105 1.16e-05

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 51.54  E-value: 1.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  877 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRHRSGQGQASGSGRcGASSGQTSGCGSGQSTRYDEQGSGSRNSS 948
Cdd:NF033849   308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  949 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 1028
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 1029 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 1105
Cdd:NF033849   465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
642-944 1.23e-05

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 51.16  E-value: 1.23e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  642 GQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLG 720
Cdd:NF033849   260 GTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG 338
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  721 QYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSG 800
Cdd:NF033849   339 VSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  801 ESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQSSSTQWSH 880
Cdd:NF033849   410 SQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWSEGTGTSQ 472
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720404921  881 GSGSEQSSGLGHYGSTSGQTASSTRHRSGQGQASGSgrcGASSGQTSGCGSGQSTRYDEQGSGS 944
Cdd:NF033849   473 GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1616-1879 2.98e-05

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 50.00  E-value: 2.98e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1616 GASSGQTSGCGSGQStrygeQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQT 1695
Cdd:NF033849   232 AANLGQSAGTGYGES-----VGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTS 306
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1696 EgSQQHGSCCGQSSGYGQ-NEYGSGHSASSGQ-QGSHYSQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQYGSPSGQT 1773
Cdd:NF033849   307 E-SQSHGTTEGTSTTDSSsHSQSSSYNVSSGTgVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSS 385
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1774 SSSTRQGSGQGQASG---SGRYGASSGQTSGCGSGqstrYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFS 1850
Cdd:NF033849   386 SSGVSGGFSGGIAGGgvtSEGLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQG 461
                          250       260
                   ....*....|....*....|....*....
gi 1720404921 1851 QGGSGQGRSSRGGQQGSFSGQTSGRSQHQ 1879
Cdd:NF033849   462 TSWSEGTGTSQGQSVGTSESWSTSQSETD 490
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1278-1531 6.93e-05

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 48.85  E-value: 6.93e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1278 GSGQSTRYGEQGSGSRNSST-QSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQqHGSC 1356
Cdd:NF033849   240 GTGYGESVGHSTSQGQSHSVgTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTT-EGTS 318
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1357 CGQSSGYGQneyGSGHSASSGQ-QGSHYSQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQ 1435
Cdd:NF033849   319 TTDSSSHSQ---SSSYNVSSGTgVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSG 395
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1436 GQASG---SGRYGASSGQTSGCGSGqstrYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFSQGGSGQGRSS 1512
Cdd:NF033849   396 GIAGGgvtSEGLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTS 471
                          250
                   ....*....|....*....
gi 1720404921 1513 RGGQQGSFSGQTSGRSQHQ 1531
Cdd:NF033849   472 QGQSVGTSESWSTSQSETD 490
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2257-2512 1.65e-04

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 47.69  E-value: 1.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2257 GTHNSGGSPSSSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQGQASG-----SGRYGASSGQTSGCGSGQSTRYGE 2331
Cdd:NF033849   234 NLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGwshtqSTSESESTGQSSSVGTSESQSHGT 313
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2332 Q-----GSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQHQSGSRHGSG 2406
Cdd:NF033849   314 TegtstTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGF 393
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2407 SGQFPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASS 2486
Cdd:NF033849   394 SGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGT-GTSQ 472
                          250       260
                   ....*....|....*....|....*.
gi 1720404921 2487 GQTSGCGSGQSTRYGEQGSGSRNSST 2512
Cdd:NF033849   473 GQSVGTSESWSTSQSETDSVGDSTGT 498
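
The summary line of each hit (Pssm-ID, Cd Length, Bit Score, E-value) follows a fixed pattern in this listing, so it can be split into fields with a regular expression. The following is a minimal sketch, assuming the line layout shown above; the function name parse_hit_summary and the returned field names are illustrative and not part of any NCBI tool.

```python
import re

# Matches lines such as:
# "Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 47.69  E-value: 1.65e-04"
# The bracketed hit type is optional (the pfam hit in this listing has none).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<hit_type>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return the fields of a hit summary line as a dict, or None if no match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "hit_type": match.group("hit_type"),  # None when no [..] tag is present
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }
```

Collecting these dictionaries for every hit makes it straightforward to sort or filter the list by E-value or bit score.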
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
546-782 6.42e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 6.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  546 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRgrstsresstsqrygsgsg 623
Cdd:PHA03307   210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIW------------------- 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  624 gssGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 703
Cdd:PHA03307   271 ---EASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720404921  704 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 782
Cdd:PHA03307   336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1590-1826 6.42e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 6.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1590 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRgrstsresstsqrygsgsg 1667
Cdd:PHA03307   210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIW------------------- 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1668 gssGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 1747
Cdd:PHA03307   271 ---EASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720404921 1748 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 1826
Cdd:PHA03307   336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1242-1478 1.01e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 1.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1242 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 1319
Cdd:PHA03307   210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1320 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 1399
Cdd:PHA03307   281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720404921 1400 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 1478
Cdd:PHA03307   336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2109-2345 1.01e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 1.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2109 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 2186
Cdd:PHA03307   210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2187 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 2266
Cdd:PHA03307   281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720404921 2267 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 2345
Cdd:PHA03307   336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2457-2693 1.01e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 1.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2457 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 2534
Cdd:PHA03307   210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2535 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 2614
Cdd:PHA03307   281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720404921 2615 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 2693
Cdd:PHA03307   336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
894-1130 1.21e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 1.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  894 GSTSGQTASSTRHRSGQGQASGSGR--CGASSGQTSGCGSGQSTRYDEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 971
Cdd:PHA03307   210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  972 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 1051
Cdd:PHA03307   281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720404921 1052 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 1130
Cdd:PHA03307   336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1444-1764 2.73e-03

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.46  E-value: 2.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1444 YGASSGQTSGCGSGQStrygeQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSGQ 1523
Cdd:NF033849   231 YAANLGQSAGTGYGES-----VGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGT 305
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1524 TSGRSQHQSGSRHGSGSGQFpisgqqgshhghsSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQG 1603
Cdd:NF033849   306 SESQSHGTTEGTSTTDSSSH-------------SQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTS 372
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1604 SGQGQASGSGRcgASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSS 1683
Cdd:NF033849   373 SSVSSSESSSR--SSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHST 450
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1684 RGGQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-------GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSR 1756
Cdd:NF033849   451 SSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSqsetdsvGDSTGTSESVSQ-GDGRSTGRSESQGTSLGTSGGRTSG 529

                   ....*...
gi 1720404921 1757 SGRSSGLG 1764
Cdd:NF033849   530 AGGSMGLG 537
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1909-2164 3.79e-03

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.07  E-value: 3.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1909 THNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGrcgASSGQTSGCGSGQSTRYGEQGSGS 1988
Cdd:NF033849   286 SHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG---VSSSHSDGTSQSTSISHSESSSES 362
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1989 RNSST-QSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGsfsgQTSGRSQHQSGSRHGSGSGQFPISGQ 2067
Cdd:NF033849   363 TGTSVgHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQG----GSEGWGSGDSVQSVSQSYGSSSSTGT 438
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2068 QGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSG 2147
Cdd:NF033849   439 SSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ-GDGRSTGRSESQG 517
                          250
                   ....*....|....*..
gi 1720404921 2148 QSTrygeQGSGSRNSST 2164
Cdd:NF033849   518 TSL----GTSGGRTSGA 530
 
Name Accession Description Interval E-value
S-100 cd00213
S-100: S-100 domain, which represents the largest family within the superfamily of proteins ...
2-89 1.59e-35

S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains; other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation end products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure, exposing a hydrophobic cleft. This hydrophobic surface represents the interaction site of S100 proteins with their target proteins. There is experimental evidence that many S100 proteins have multiple binding partners, with diverse modes of interaction with different targets. In addition to S100 proteins (such as S100A1, -3, -4, -6, -7, -10, -11, and -13), this group includes the "fused" gene family, a group of calcium-binding S100-related proteins. The "fused" gene family includes multifunctional epidermal differentiation proteins: profilaggrin, trichohyalin, repetin, hornerin, and cornulin; functionally, these proteins are associated with keratin intermediate filaments and are partially crosslinked to the cell envelope. These "fused" gene proteins contain an N-terminal sequence with two Ca-binding EF-hand motifs, which may be associated with calcium signaling in epidermal cells and with autoprocessing in a calcium-dependent manner.
In contrast to S100 proteins, "fused" gene family proteins contain an extraordinarily high number of almost perfect peptide repeats with a regular array of polar and charged residues, similar to many known cell envelope proteins.


Pssm-ID: 238131 [Multi-domain]  Cd Length: 88  Bit Score: 131.07  E-value: 1.59e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921    2 PKLLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKL 81
Cdd:cd00213      1 SELEKAIETIIDVFHKYSGKEGDKDTLSKKELKELLETELPNFLKNQKDPEAVDKIMKDLDVNKDGKVDFQEFLVLIGKL 80

                   ....*...
gi 1720404921   82 TKACNKII 89
Cdd:cd00213     81 AVACHEFF 88
calgranulins cd05030
Calgranulins: S-100 domain found in proteins belonging to the Calgranulin subgroup of the S100 ...
3-87 1.40e-17

Calgranulins: S-100 domain found in proteins belonging to the Calgranulin subgroup of the S100 family of EF-hand calcium-modulated proteins, including S100A8, S100A9, and S100A12. Note that the S-100 hierarchy, to which this Calgranulin group belongs, contains only S-100 EF-hand domains; other EF-hands have been modeled separately. These proteins are expressed mainly in granulocytes and are involved in inflammation, allergy, and neuritogenesis, as well as in host-parasite response. Calgranulins are modulated not only by calcium, but also by other metals such as zinc and copper. Structural data suggest that calgranulins may exist in multiple structural forms, including homodimers and hetero-oligomers. For example, the S100A8/S100A9 complex, called calprotectin, plays important roles in regulating inflammatory processes, wound repair, zinc-dependent enzymes, and microbial growth.


Pssm-ID: 240156 [Multi-domain]  Cd Length: 88  Bit Score: 80.08  E-value: 1.40e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921    3 KLLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKLT 82
Cdd:cd05030      2 ELEKAIETIINVFHQYSVRKGHPDTLYKKEFKQLVEKELPNFLKKEKNQKAIDKIFEDLDTNQDGQLSFEEFLVLVIKVG 81

                   ....*
gi 1720404921   83 KACNK 87
Cdd:cd05030     82 VAAHE 86
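
Each hit header ends with a line that gives the query interval and the E-value (for example "3-87 1.40e-17"). As a rough sketch, assuming that two-field layout, the line can be turned into numbers and the length of the matched query span computed; the name parse_interval_line and the dictionary keys are hypothetical.

```python
def parse_interval_line(line):
    """Parse a line like '3-87 1.40e-17' into start, end, span and E-value."""
    interval, e_value = line.split()
    start, end = (int(part) for part in interval.split("-"))
    return {
        "start": start,
        "end": end,
        # Residue positions are inclusive on both ends.
        "span": end - start + 1,
        "e_value": float(e_value),
    }
```

The span is counted on the query sequence only; it says nothing by itself about how much of the domain model is covered.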
S-100A10_like cd05031
S-100A10_like: S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of ...
4-88 2.02e-17

S-100A10_like: S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A10_like group belongs, contains only S-100 EF-hand domains; other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutations in both of the calcium-binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data support the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium-free state and is therefore involved in many diverse biological functions.


Pssm-ID: 240157 [Multi-domain]  Cd Length: 94  Bit Score: 79.77  E-value: 2.02e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921    4 LLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKLTK 83
Cdd:cd05031      3 LEHAMESLILTFHRYAGKDGDKNTLSRKELKKLMEKELSEFLKNQKDPMAVDKIMKDLDQNRDGKVNFEEFVSLVAGLSI 82

                   ....*
gi 1720404921   84 ACNKI 88
Cdd:cd05031     83 ACEEY 87
S-100A1 cd05025
S-100A1: S-100A1 domain found in proteins similar to S100A1. S100A1 is a calcium-binding ...
3-87 1.83e-15

S-100A1: S-100A1 domain found in proteins similar to S100A1. S100A1 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1 group belongs, contains only S-100 EF-hand domains; other EF-hands have been modeled separately. As is the case with many other members of the S100 protein family, S100A1 is implicated in intracellular and extracellular regulatory activities, including interaction with myosin-associated twitchin kinase, actin-capping protein CapZ, synapsin I, and tubulin. Structural data suggest that S100A1 proteins exist within cells as antiparallel homodimers, while heterodimers with S100A4 and S100B have also been reported. Upon binding calcium, S100A1 changes conformation to expose a hydrophobic cleft, which is the interaction site of S100A1 with its more than 20 known target proteins.


Pssm-ID: 240152 [Multi-domain]  Cd Length: 92  Bit Score: 74.15  E-value: 1.83e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921    3 KLLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKLT 82
Cdd:cd05025      3 ELETAMETLINVFHAHSGKEGDKYKLSKKELKDLLQTELSDFLDAQKDADAVDKIMKELDENGDGEVDFQEFVVLVAALT 82

                   ....*
gi 1720404921   83 KACNK 87
Cdd:cd05025     83 VACNN 87
S_100 pfam01023
S-100/ICaBP type calcium binding domain; The S-100 domain is a subfamily of the EF-hand ...
4-48 4.25e-15

S-100/ICaBP type calcium binding domain; The S-100 domain is a subfamily of the EF-hand calcium binding proteins.


Pssm-ID: 460028  Cd Length: 45  Bit Score: 71.31  E-value: 4.25e-15
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1720404921    4 LLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNP 48
Cdd:pfam01023    1 LERAIETIIDVFHKYAGKEGDKDTLSKKELKELLEKELPNFLKNQ 45
S-100Z cd05026
S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain ...
1-86 4.45e-15

S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain family within the EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100Z group belongs, contains only S-100 EF-hand domains; other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100Z is normally expressed in various tissues, with its highest level of expression being in spleen and leukocytes. The function of S100Z remains unclear. Preliminary structural data suggest that S100Z is a homodimer; however, a heterodimer with S100P has been reported. S100Z is capable of binding calcium ions. When calcium binds to S100Z, the protein undergoes a conformational change, which exposes hydrophobic surfaces on the protein. In some tumor tissues, S100Z gene expression appears to be deregulated in comparison with the normal tissue counterparts.


Pssm-ID: 240153 [Multi-domain]  Cd Length: 93  Bit Score: 72.98  E-value: 4.45e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921    1 MPKLLE-SIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMIL 79
Cdd:cd05026      1 MPTQLEgAMDTLIRIFHNYSGKEGDRYKLSKGELKELLQRELTDFLSSQKDPMLVDKIMNDLDSNKDNEVDFNEFVVLVA 80

                   ....*..
gi 1720404921   80 KLTKACN 86
Cdd:cd05026     81 ALTVACN 87
S-100B cd05027
S-100B: S-100B domain found in proteins similar to S100B. S100B is a calcium-binding protein ...
3-87 1.03e-12

S-100B: S-100B domain found in proteins similar to S100B. S100B is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100B group belongs, contains only S-100 EF-hand domains; other EF-hands have been modeled separately. S100B is most abundant in glial cells of the central nervous system, predominantly in astrocytes. S100B is involved in signal transduction via the inhibition of protein phosphorylation, the regulation of enzyme activity, and effects on calcium homeostasis. Upon calcium binding, the S100B homodimer changes conformation to expose a hydrophobic cleft, which represents the interaction site of S100B with its more than 20 known target proteins. These target proteins include several cellular architecture proteins such as tubulin and GFAP; S100B can inhibit polymerization of these oligomeric molecules. Furthermore, S100B inhibits the phosphorylation of multiple kinase substrates, including the Alzheimer protein tau and neuromodulin (GAP-43), through a calcium-sensitive interaction with the protein substrates.


Pssm-ID: 240154 [Multi-domain]  Cd Length: 88  Bit Score: 66.03  E-value: 1.03e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921    3 KLLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKLT 82
Cdd:cd05027      2 ELEKAMVALIDVFHQYSGREGDKHKLKKSELKELINNELSHFLEEIKEQEVVDKVMETLDSDGDGECDFQEFMAFVAMVT 81

                   ....*
gi 1720404921   83 KACNK 87
Cdd:cd05027     82 TACHE 86
S-100A10 cd05024
S-100A10: A subgroup of the S-100A10 domain found in proteins similar to S100A10. S100A10 is a ...
3-86 7.45e-10

S-100A10: A subgroup of the S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A10 group belongs, contains only S-100 EF-hand domains; other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutations in both of the calcium-binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data support the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium-free state and is therefore involved in many diverse biological functions.


Pssm-ID: 240151 [Multi-domain]  Cd Length: 91  Bit Score: 57.93  E-value: 7.45e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921    3 KLLESIVTVIDVFYQYAteyGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKLT 82
Cdd:cd05024      2 ELEHSMEKMMLTFHKFA---GEKNYLNRDDLQKLMEKEFSEFLKNQNDPMAVDKIMKDLDDCRDGKVGFQSFFSLIAGLL 78

                   ....
gi 1720404921   83 KACN 86
Cdd:cd05024     79 IACN 82
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2573-2874 8.39e-09

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 61.56  E-value: 8.39e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2573 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 2650
Cdd:NF033849   236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2651 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 2728
Cdd:NF033849   308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2729 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 2808
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 2809 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRGRSTSRESSCS 2874
Cdd:NF033849   466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLGTSGGRTSGAG 531
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1561-1826 3.15e-08

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 60.02  E-value: 3.15e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1561 THNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGrcgASSGQTSGCGSGQSTRYgEQGSGS 1640
Cdd:NF033849   264 SHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEG---TSTTDSSSHSQSSSYNV-SSGTGV 339
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1641 RNSSTQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGsFSGQTEGSQQHGSCCGQSSGYGQneygsGH 1720
Cdd:NF033849   340 SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG-VSGGFSGGIAGGGVTSEGLGASQ-----GG 413
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1721 SASSGQQGSHYSQSSSYGTHNSGGSPS--SSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQGQA-SGSGRYGASSG 1797
Cdd:NF033849   414 SEGWGSGDSVQSVSQSYGSSSSTGTSSghSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESwSTSQSETDSVG 493
                          250       260
                   ....*....|....*....|....*....
gi 1720404921 1798 QTSGCGSGQStrygeQGSGSrnssTQSRG 1826
Cdd:NF033849   494 DSTGTSESVS-----QGDGR----STGRS 513
S-100A11 cd05023
S-100A11: S-100A11 domain found in proteins similar to S100A11. S100A11 is a member of the ...
7-85 1.83e-07

S-100A11: S-100A11 domain found in proteins similar to S100A11. S100A11 is a member of the S-100 domain family within the EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A11 group belongs, contains only S-100 EF-hand domains; other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100 proteins have also been associated with a variety of pathological events, including neoplastic transformation and neurodegenerative diseases such as Alzheimer's, usually via overexpression of the protein. S100A11 is expressed in smooth muscle and other tissues and is involved in calcium-dependent membrane aggregation, which is important for cell vesiculation. As is the case for many other S100 proteins, S100A11 is a homodimer, which is able to form a heterodimer with S100B through subunit exchange. Ca2+ binding to S100A11 results in a conformational change in the protein, exposing a hydrophobic surface that interacts with target proteins. In addition to binding to annexin A1 and A6, S100A11 also interacts with actin and transglutaminase.


Pssm-ID: 240150 [Multi-domain]  Cd Length: 89  Bit Score: 51.31  E-value: 1.83e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720404921    7 SIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFHQILKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKLTKAC 85
Cdd:cd05023      7 CIESLIAVFQKYAGKDGDSYQLSKTEFLSFMNTELASFTKNQKDPGVLDRMMKKLDLNSDGQLDFQEFLNLIGGLAVAC 85
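
Each hit in this listing carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch of how such a line could be read into numeric fields for sorting or filtering hits. The function name `parse_stats_line`, the field names, and the regular expression are illustrative assumptions based only on the layout visible on this page; they are not part of any NCBI tool or API.

```python
import re

# Assumed layout, taken from the statistics lines shown in this listing:
#   Pssm-ID: <int> [<tag>]  Cd Length: <int>  Bit Score: <float>  E-value: <float>
# The bracketed tag (e.g. "Multi-domain") is treated as optional.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats_line(line):
    """Return a dict of typed fields, or None if the line does not match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "tag": m.group("tag"),  # None when no bracketed tag is present
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


# Statistics line copied from the S-100A11 (cd05023) hit above.
example = (
    "Pssm-ID: 240150 [Multi-domain]  Cd Length: 89  "
    "Bit Score: 51.31  E-value: 1.83e-07"
)
hit = parse_stats_line(example)
```

With the fields parsed this way, hits could be ranked by `e_value` or filtered by `bit_score`. Lines that do not follow the assumed layout return `None` rather than raising an error.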
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1331-1640 3.40e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.55  E-value: 3.40e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1331 QGRSSRGGQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRS 1409
Cdd:NF033849   253 QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSY 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1410 GRSSGLGQYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQ 1489
Cdd:NF033849   332 NVSSGTGVSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGV 402
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1490 RYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQS 1569
Cdd:NF033849   403 TSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWS 465
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720404921 1570 SSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSgrcGASSGQTSGCGSGQSTRYGEQGSGS 1640
Cdd:NF033849   466 EGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1010-1301 3.89e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 3.89e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1010 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 1087
Cdd:NF033849   236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1088 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 1165
Cdd:NF033849   308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1166 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 1245
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 1246 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRG 1301
Cdd:NF033849   466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLG 521
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2225-2516 3.89e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 3.89e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2225 GQSSGYGqneYGSGHSASSGQqgshySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGL--GQYGSPSGQTSSSTRQGSGQ 2302
Cdd:NF033849   236 GQSAGTG---YGESVGHSTSQ-----GQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWshTQSTSESESTGQSSSVGTSE 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2303 GQASGSGRyGASSGQTSGCGSGQS-TRYGEQGSGSRNSSTQSRGRSTSRESSTSQ-RYGSGSGESSGFSQGGSGQGRSSR 2380
Cdd:NF033849   308 SQSHGTTE-GTSTTDSSSHSQSSSyNVSSGTGVSSSHSDGTSQSTSISHSESSSEsTGTSVGHSTSSSVSSSESSSRSSS 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2381 GGQQGSFSGQTSGRSQHQSGSRHGSGSGQfPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTS 2460
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSE-GWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 2461 GQTASSTRQGSGQGQA-SGSGRCGASSGQTSGCGSGQSTRYGeQGSGSRNSSTQSRG 2516
Cdd:NF033849   466 EGTGTSQGQSVGTSESwSTSQSETDSVGDSTGTSESVSQGDG-RSTGRSESQGTSLG 521
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1228-1429 4.23e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 4.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1228 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 1307
Cdd:NF033849   319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1308 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 1381
Cdd:NF033849   398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 1382 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 1429
Cdd:NF033849   478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2095-2296 4.23e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 56.17  E-value: 4.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2095 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 2174
Cdd:NF033849   319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2175 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 2248
Cdd:NF033849   398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 2249 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 2296
Cdd:NF033849   478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1225-1453 8.23e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.01  E-value: 8.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1225 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 1296
Cdd:NF033849   308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1297 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 1376
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 1377 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 1453
Cdd:NF033849   465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2092-2320 8.23e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.01  E-value: 8.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2092 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 2163
Cdd:NF033849   308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2164 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 2243
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 2244 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 2320
Cdd:NF033849   465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2440-2668 8.23e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 55.01  E-value: 8.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2440 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 2511
Cdd:NF033849   308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2512 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 2591
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 2592 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 2668
Cdd:NF033849   465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1686-1988 1.12e-06

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 54.63  E-value: 1.12e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1686 GQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLG 1764
Cdd:NF033849   260 GTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG 338
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1765 QYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSG 1844
Cdd:NF033849   339 VSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1845 ESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQSSSTQWSH 1924
Cdd:NF033849   410 SQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWSEGTGTSQ 472
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720404921 1925 GSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSgrcGASSGQTSGCGSGQSTRYGEQGSGS 1988
Cdd:NF033849   473 GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
S-100A13 cd05022
S-100A13: S-100A13 domain found in proteins similar to S100A13. S100A13 is a calcium-binding ...
6-84 2.58e-06

S-100A13: S-100A13 domain found in proteins similar to S100A13. S100A13 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A13 group belongs, contains only S-100 EF-hand domains; other EF-hands have been modeled separately. S100A13 is involved in the cellular export of interleukin-1 (IL-1) and of fibroblast growth factor-1 (FGF-1), which plays an important role in angiogenesis and tissue regeneration. Export is based on the Cu(II)-dependent formation of multiprotein complexes containing the S100A13 protein. Assembly of these complexes occurs near the inner surface of the plasma membrane. Binding of two Ca(II) ions per monomer triggers key conformational changes leading to the creation of two identical and symmetrical Cu(II)-binding sites on the surface of the protein, close to the interface between the two monomers. These Cu(II)-binding sites are unique among the S100 proteins, which are reported to bind Cu(II) or Zn(II) ions in addition to Ca(II) ions. In addition, the three-dimensional structure of S100A13 differs significantly from those of other S100 proteins; the hydrophobic pocket that largely contributes to protein-protein interactions in other S100 proteins is absent in S100A13. The structure of S100A13 contains a large patch of negatively charged residues flanked by dense cationic clusters, formed mostly from positively charged residues from the C-terminal end, which plays a major role in binding FGF-1.


Pssm-ID: 240149 [Multi-domain]  Cd Length: 89  Bit Score: 48.11  E-value: 2.58e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720404921    6 ESIVTVIDVFYQYATEyGNCDMLSKEEMKELLVTEFHQILKnpdDPDTVDIIMQNLDRDHNHKVDFTEYLLMILKLTKA 84
Cdd:cd05022      5 KAIETLVSNFHKASVK-GGKESLTASEFQELLTQQLPHLLK---DVEGLEEKMKNLDVNQDSKLSFEEFWELIGELAKA 79
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1576-1777 3.04e-06

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 53.47  E-value: 3.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1576 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 1655
Cdd:NF033849   319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1656 SSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 1729
Cdd:NF033849   398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 1730 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 1777
Cdd:NF033849   478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
532-733 3.04e-06

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 53.47  E-value: 3.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  532 HGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRE 611
Cdd:NF033849   319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  612 SSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 685
Cdd:NF033849   398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921  686 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 733
Cdd:NF033849   478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
529-757 5.92e-06

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 52.31  E-value: 5.92e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  529 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSGQSTRYGEQGSGSRNSS 600
Cdd:NF033849   308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  601 TQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 680
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921  681 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 757
Cdd:NF033849   465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
880-1081 5.97e-06

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 52.31  E-value: 5.97e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  880 HGSGSEQSSGLGHYGSTSGQTASSTRHRSGQGQASGSGRcGASSGQTSGCGSGQSTRYDEQGSGSRNSSTQSRGRSTSRE 959
Cdd:NF033849   319 TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  960 SSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQQ-----HGSCCGQSSGYGQNE-YGSGHSASSGQQGS 1033
Cdd:NF033849   398 AGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHsdsssHSTSSGQADSVSQGTsWSEGTGTSQGQSVG 477
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 1034 H-----YSQSSSYGT-HNSGGSPSSSQ---RGHGSRSGRSSGLGQYGSPSGQTSSST 1081
Cdd:NF033849   478 TseswsTSQSETDSVgDSTGTSESVSQgdgRSTGRSESQGTSLGTSGGRTSGAGGSM 534
S-100A6 cd05029
S-100A6: S-100A6 domain found in proteins similar to S100A6. S100A6 is a member of the S100 ...
4-89 1.11e-05

S-100A6: S-100A6 domain found in proteins similar to S100A6. S100A6 is a member of the S100 domain family within the EF-hand Ca2+-binding protein superfamily. Note that the S-100 hierarchy, to which this S-100A6 group belongs, contains only S-100 EF-hand domains; other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100A6 is normally expressed in the G1 phase of the cell cycle in neuronal cells. The function of S100A6 remains unclear, but evidence suggests that it is involved in cell cycle regulation and exocytosis. S100A6 may also be involved in tumorigenesis; the protein is overexpressed in several tumors. Ca2+ binding to S100A6 leads to a conformational change in the protein, which exposes a hydrophobic surface for interaction with target proteins. Several such proteins have been identified: glyceraldehyde-3-phosphate dehydrogenase, annexins 2, 6 and 11, and Calcyclin-Binding Protein (CacyBP).


Pssm-ID: 240155 [Multi-domain]  Cd Length: 88  Bit Score: 45.99  E-value: 1.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921    4 LLESIVTVIDVFYQYATEYGNCDMLSKEEMKELLVTEFhQILKNPDDPDTVDiIMQNLDRDHNHKVDFTEYLLMILKLTK 83
Cdd:cd05029      5 LDQAIGLLVAIFHKYSGREGDKNTLSKKELKELIQKEL-TIGSKLQDAEIAK-LMEDLDRNKDQEVNFQEYVTFLGALAL 82

                   ....*.
gi 1720404921   84 ACNKII 89
Cdd:cd05029     83 IYNEAL 88
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
877-1105 1.16e-05

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 51.54  E-value: 1.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  877 QWSHGS--------GSEQSSGLGHYGSTSGQTASSTRHRSGQGQASGSGRcGASSGQTSGCGSGQSTRYDEQGSGSRNSS 948
Cdd:NF033849   308 SQSHGTtegtsttdSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE-SSSESTGTSVGHSTSSSVSSSESSSRSSS 386
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  949 TQSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEgSQQHGSCCGQSSGYGQNEyGSGHSASS 1028
Cdd:NF033849   387 SGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGT-SSGHSDSSSHSTSSGQAD-SVSQGTSW 464
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720404921 1029 GQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQygspsgqtSSSTRQGSGQGQASGSGRYGASSGQTSG 1105
Cdd:NF033849   465 SEGTGT-SQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ--------GDGRSTGRSESQGTSLGTSGGRTSGAGG 532
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
642-944 1.23e-05

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 51.16  E-value: 1.23e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  642 GQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLG 720
Cdd:NF033849   260 GTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSeSESTGQSSSVGT-SESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG 338
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  721 QYGSPSGQTSSSTRQGSGQGQASGSGrygassgqtSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSG 800
Cdd:NF033849   339 VSSSHSDGTSQSTSISHSESSSESTG---------TSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  801 ESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQhqsgsrhgsgsgqfpisGQQGSHHGHSSSSGTHNSGSSQSSSTQWSH 880
Cdd:NF033849   410 SQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGH-----------------SDSSSHSTSSGQADSVSQGTSWSEGTGTSQ 472
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720404921  881 GSGSEQSSGLGHYGSTSGQTASSTRHRSGQGQASGSgrcGASSGQTSGCGSGQSTRYDEQGSGS 944
Cdd:NF033849   473 GQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGR---STGRSESQGTSLGTSGGRTSGAGGS 533
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1616-1879 2.98e-05

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 50.00  E-value: 2.98e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1616 GASSGQTSGCGSGQStrygeQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGSFSGQT 1695
Cdd:NF033849   232 AANLGQSAGTGYGES-----VGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTS 306
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1696 EgSQQHGSCCGQSSGYGQ-NEYGSGHSASSGQ-QGSHYSQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQYGSPSGQT 1773
Cdd:NF033849   307 E-SQSHGTTEGTSTTDSSsHSQSSSYNVSSGTgVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSS 385
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1774 SSSTRQGSGQGQASG---SGRYGASSGQTSGCGSGqstrYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFS 1850
Cdd:NF033849   386 SSGVSGGFSGGIAGGgvtSEGLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQG 461
                          250       260
                   ....*....|....*....|....*....
gi 1720404921 1851 QGGSGQGRSSRGGQQGSFSGQTSGRSQHQ 1879
Cdd:NF033849   462 TSWSEGTGTSQGQSVGTSESWSTSQSETD 490
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1278-1531 6.93e-05

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 48.85  E-value: 6.93e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1278 GSGQSTRYGEQGSGSRNSST-QSRGRSTSRESSTSQRFGSGSGGSSGFSQGRSGQGRSSRGGQQGSFSGQTEGSQqHGSC 1356
Cdd:NF033849   240 GTGYGESVGHSTSQGQSHSVgTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTT-EGTS 318
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1357 CGQSSGYGQneyGSGHSASSGQ-QGSHYSQSSSYGTHNSGGSPSSSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQ 1435
Cdd:NF033849   319 TTDSSSHSQ---SSSYNVSSGTgVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSG 395
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1436 GQASG---SGRYGASSGQTSGCGSGqstrYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFSQGGSGQGRSS 1512
Cdd:NF033849   396 GIAGGgvtSEGLGASQGGSEGWGSG----DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTS 471
                          250
                   ....*....|....*....
gi 1720404921 1513 RGGQQGSFSGQTSGRSQHQ 1531
Cdd:NF033849   472 QGQSVGTSESWSTSQSETD 490
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
2257-2512 1.65e-04

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 47.69  E-value: 1.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2257 GTHNSGGSPSSSQRGHGSRSGRSSGLGQYGSPSGQTSSSTRQGSGQGQASG-----SGRYGASSGQTSGCGSGQSTRYGE 2331
Cdd:NF033849   234 NLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGwshtqSTSESESTGQSSSVGTSESQSHGT 313
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2332 Q-----GSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSGQTSGRSQHQSGSRHGSG 2406
Cdd:NF033849   314 TegtstTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGF 393
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2407 SGQFPISGQQGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASS 2486
Cdd:NF033849   394 SGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGT-GTSQ 472
                          250       260
                   ....*....|....*....|....*.
gi 1720404921 2487 GQTSGCGSGQSTRYGEQGSGSRNSST 2512
Cdd:NF033849   473 GQSVGTSESWSTSQSETDSVGDSTGT 498
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
546-782 6.42e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 6.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  546 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRgrstsresstsqrygsgsg 623
Cdd:PHA03307   210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIW------------------- 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  624 gssGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 703
Cdd:PHA03307   271 ---EASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720404921  704 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 782
Cdd:PHA03307   336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1590-1826 6.42e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 6.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1590 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRgrstsresstsqrygsgsg 1667
Cdd:PHA03307   210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIW------------------- 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1668 gssGFSQGGSGQGRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 1747
Cdd:PHA03307   271 ---EASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720404921 1748 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 1826
Cdd:PHA03307   336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1242-1478 1.01e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 1.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1242 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 1319
Cdd:PHA03307   210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1320 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 1399
Cdd:PHA03307   281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720404921 1400 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 1478
Cdd:PHA03307   336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2109-2345 1.01e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 1.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2109 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 2186
Cdd:PHA03307   210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2187 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 2266
Cdd:PHA03307   281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720404921 2267 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 2345
Cdd:PHA03307   336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2457-2693 1.01e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 1.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2457 GSTSGQTASSTRQGSGQGQASGSGR--CGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 2534
Cdd:PHA03307   210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2535 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 2614
Cdd:PHA03307   281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720404921 2615 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 2693
Cdd:PHA03307   336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
894-1130 1.21e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 1.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  894 GSTSGQTASSTRHRSGQGQASGSGR--CGASSGQTSGCGSGQSTRYDEQGSGSRNSSTQSRGRSTSRESSTsqrfgsgsg 971
Cdd:PHA03307   210 SSPISASASSPAPAPGRSAADDAGAssSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSS--------- 280
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921  972 gssgfsqgrsgqgRSSRGGQQGSFSGQTEGSQQHGSCCGQSSgygqneygSGHSASSGQQGSHYSQSSSygthNSGGSPS 1051
Cdd:PHA03307   281 -------------RPGPASSSSSPRERSPSPSPSSPGSGPAP--------SSPRASSSSSSSRESSSSS----TSSSSES 335
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720404921 1052 SSQRGhgSRSGRSSGlgqyGSPSGQTSSSTRQGSGQGQASGSGRYGASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRG 1130
Cdd:PHA03307   336 SRGAA--VSPGPSPS----RSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
EF-hand_7 pfam13499
EF-hand domain pair;
28-79 1.84e-03

EF-hand domain pair;


Pssm-ID: 463900 [Multi-domain]  Cd Length: 67  Bit Score: 39.16  E-value: 1.84e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1720404921   28 LSKEEMKELLVTEFhqiLKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMIL 79
Cdd:pfam13499   19 LDVEELKKLLRKLE---EGEPLSDEEVEELFKEFDLDKDGRISFEEFLELYS 67
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1444-1764 2.73e-03

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.46  E-value: 2.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1444 YGASSGQTSGCGSGQStrygeQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGESSGFSQGGSGQGRSSRGGQQGSFSGQ 1523
Cdd:NF033849   231 YAANLGQSAGTGYGES-----VGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGT 305
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1524 TSGRSQHQSGSRHGSGSGQFpisgqqgshhghsSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQG 1603
Cdd:NF033849   306 SESQSHGTTEGTSTTDSSSH-------------SQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTS 372
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1604 SGQGQASGSGRcgASSGQTSGCGSGQSTRYGEQGSGSRNSSTQSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSS 1683
Cdd:NF033849   373 SSVSSSESSSR--SSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHST 450
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1684 RGGQQGSFSGQTEGSQQHGSCCGQSSGYGQNEYGS-------GHSASSGQQGSHySQSSSYGTHNSGGSPSSSQRGHGSR 1756
Cdd:NF033849   451 SSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSqsetdsvGDSTGTSESVSQ-GDGRSTGRSESQGTSLGTSGGRTSG 529

                   ....*...
gi 1720404921 1757 SGRSSGLG 1764
Cdd:NF033849   530 AGGSMGLG 537
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1909-2164 3.79e-03

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.07  E-value: 3.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1909 THNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGrcgASSGQTSGCGSGQSTRYGEQGSGS 1988
Cdd:NF033849   286 SHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTG---VSSSHSDGTSQSTSISHSESSSES 362
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 1989 RNSST-QSRGRSTSRESSTSQRYGSGSGGSSGFSQGGSGQGRSSRGGQQGsfsgQTSGRSQHQSGSRHGSGSGQFPISGQ 2067
Cdd:NF033849   363 TGTSVgHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQG----GSEGWGSGDSVQSVSQSYGSSSSTGT 438
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720404921 2068 QGSHHGHSSSSGTHNSGSSQSSSTQWSHGSGSEQSSGLGHYGSTSGQTASSTRQGSGQGQASGSGRcGASSGQTSGCGSG 2147
Cdd:NF033849   439 SSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQ-GDGRSTGRSESQG 517
                          250
                   ....*....|....*..
gi 1720404921 2148 QSTrygeQGSGSRNSST 2164
Cdd:NF033849   518 TSL----GTSGGRTSGA 530
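
The paired rows in each alignment block above can be compared column by column. The following is a minimal Python sketch, not part of the CD-Search output, that counts identical residues between a query row and a Cdd row. It assumes, from how the rows appear in this listing, that `-` marks a gap and that lowercase query letters mark residues not aligned to the domain model; the function name `count_identities` is hypothetical.

```python
from typing import Tuple


def count_identities(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Assumption: a column counts as aligned only when both rows carry an
    uppercase letter; dashes and lowercase letters are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        # Skip gap columns and residues marked as unaligned (lowercase).
        if not (q.isalpha() and q.isupper() and s.isalpha() and s.isupper()):
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the EF-hand_7 (pfam13499) hit, residues 28-79.
query_row = "LSKEEMKELLVTEFhqiLKNPDDPDTVDIIMQNLDRDHNHKVDFTEYLLMIL"
cdd_row = "LDVEELKKLLRKLE---EGEPLSDEEVEELFKEFDLDKDGRISFEEFLELYS"
identical, aligned = count_identities(query_row, cdd_row)
print(f"{identical} identical residues over {aligned} aligned columns")
```

The ratio of the two counts is only a rough percent identity. For low-complexity, serine-rich segments such as the NF033849 hits, a high ratio may reflect composition rather than homology.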
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
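
To check the listed hits against the E-value threshold given above, the per-hit summary lines (those beginning with `Pssm-ID:`) can be parsed from this text. The sketch below is a hedged illustration in Python and is not part of any NCBI tool. The names `HitScore`, `parse_score_line`, and `passes_threshold` are hypothetical, the regular expression is fitted only to the line layout seen in this listing, and treating the cutoff as inclusive is an assumption.

```python
import re
from typing import NamedTuple, Optional


class HitScore(NamedTuple):
    pssm_id: int
    cd_length: int
    bit_score: float
    e_value: float


# Matches lines such as:
# "Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 51.16  E-value: 1.23e-05"
SCORE_LINE = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([\d.]+)\s+E-value:\s*([\d.eE+-]+)"
)


def parse_score_line(line: str) -> Optional[HitScore]:
    """Parse one per-hit summary line; return None if the line does not match."""
    m = SCORE_LINE.search(line)
    if m is None:
        return None
    return HitScore(
        pssm_id=int(m.group(1)),
        cd_length=int(m.group(2)),
        bit_score=float(m.group(3)),
        e_value=float(m.group(4)),
    )


def passes_threshold(hit: HitScore, threshold: float = 0.01) -> bool:
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit.e_value <= threshold


line = "Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 43.07  E-value: 3.79e-03"
hit = parse_score_line(line)
if hit is not None:
    print(hit, passes_threshold(hit))
```

The default of 0.01 mirrors the threshold reported on this page. A stricter cutoff can be passed as `threshold` to retain only the stronger hits.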

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-3.