NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|403310651|ref|NP_001258113|]
View 

trophinin isoform 7 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
667-1034 1.31e-24

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 111.25  E-value: 1.31e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  667 NTNASFGcaVSTSASFSGAVSTSAcfsgapitnpgfGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAH 746
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQSA------------GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR 283
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  747 GTSlcFGGAPSTSLCFGSASNTnlcfggppSTSACFSGATSpsfcDGPSTSTGFSFGNGLSTNAGFGGGLNTSagfgGGL 826
Cdd:NF033849  284 GWS--HTQSTSESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSS----HSD 345
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  827 GTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGlgTSAGFGGGL----VTSDGFGGGLGTNASF 902
Cdd:NF033849  346 GTSQSTSISHSESSS------ESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIagggVTSEGLGASQGGSEGW 417
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  903 GSTLGTSaGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 982
Cdd:NF033849  418 GSGDSVQ-SVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDST 496
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 403310651  983 GPSTSgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFgsgaaSLG-ACGFSYG 1034
Cdd:NF033849  497 GTSES-VSQGDGRSTGRSESQGTSLGTSGGRTSGAGG-----SMGlGPSISLG 543
MAGE pfam01454
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
54-214 2.72e-23

MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.


:

Pssm-ID: 426270  Cd Length: 205  Bit Score: 98.88  E-value: 2.72e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651    54 LVKYLLVKDQTKIPIKRSDMLRDVIQEYDE-YFPEIIERASYTLEKMFRVNLKEID--------------------KQSS 112
Cdd:pfam01454    1 LVRYALACEYQRTPIRREDISKKVLGENRKrLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   113 LYILIST---QESSAGILGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLR---PGVRHSLFG 177
Cdd:pfam01454   81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 403310651   178 EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 214
Cdd:pfam01454  161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
457-741 1.34e-15

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 81.98  E-value: 1.34e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  457 STTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSI------CFGGSPCTST 530
Cdd:NF033849  244 GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSEsqshgtTEGTSTTDSS 323
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  531 GFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGF----GGALNTSASFGSVLNTSTGFGGAMSTSADFGGTL 606
Cdd:NF033849  324 SHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTsvghSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVT 403
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  607 STSvcFGGSPGTSVSFGSA---LNTNAGYGGAVSTNTDFGGTLST--SVCFGGSPSTSAGFGGALNTNASFGCAVSTSAS 681
Cdd:NF033849  404 SEG--LGASQGGSEGWGSGdsvQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 403310651  682 FSG----AVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGaaPSTSVS 741
Cdd:NF033849  482 WSTsqseTDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLG--PSISLG 543
 
Name Accession Description Interval E-value
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
667-1034 1.31e-24

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 111.25  E-value: 1.31e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  667 NTNASFGcaVSTSASFSGAVSTSAcfsgapitnpgfGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAH 746
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQSA------------GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR 283
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  747 GTSlcFGGAPSTSLCFGSASNTnlcfggppSTSACFSGATSpsfcDGPSTSTGFSFGNGLSTNAGFGGGLNTSagfgGGL 826
Cdd:NF033849  284 GWS--HTQSTSESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSS----HSD 345
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  827 GTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGlgTSAGFGGGL----VTSDGFGGGLGTNASF 902
Cdd:NF033849  346 GTSQSTSISHSESSS------ESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIagggVTSEGLGASQGGSEGW 417
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  903 GSTLGTSaGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 982
Cdd:NF033849  418 GSGDSVQ-SVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDST 496
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 403310651  983 GPSTSgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFgsgaaSLG-ACGFSYG 1034
Cdd:NF033849  497 GTSES-VSQGDGRSTGRSESQGTSLGTSGGRTSGAGG-----SMGlGPSISLG 543
MAGE pfam01454
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
54-214 2.72e-23

MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.


Pssm-ID: 426270  Cd Length: 205  Bit Score: 98.88  E-value: 2.72e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651    54 LVKYLLVKDQTKIPIKRSDMLRDVIQEYDE-YFPEIIERASYTLEKMFRVNLKEID--------------------KQSS 112
Cdd:pfam01454    1 LVRYALACEYQRTPIRREDISKKVLGENRKrLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   113 LYILIST---QESSAGILGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLR---PGVRHSLFG 177
Cdd:pfam01454   81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 403310651   178 EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 214
Cdd:pfam01454  161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
537-893 9.77e-21

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 98.92  E-value: 9.77e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  537 STSVSFGgsSSTSANFGGTLSTSicfdGSPSTGAGFGGAL--NTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGG 614
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQS----AGTGYGESVGHSTsqGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQST 291
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  615 SPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSG 694
Cdd:NF033849  292 SESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHST 371
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  695 APITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGgahgtslcfggapstslcfGSASNTNLCFGG 774
Cdd:NF033849  372 SSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSG-------------------DSVQSVSQSYGS 432
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  775 PPSTSacfsgaTSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDggLGTSAGFGG 854
Cdd:NF033849  433 SSSTG------TSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDS--TGTSESVSQ 504
                         330       340       350
                  ....*....|....*....|....*....|....*....
gi 403310651  855 GPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFG 893
Cdd:NF033849  505 GDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
567-905 2.45e-20

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 97.38  E-value: 2.45e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  567 STGAGFGGALNTSASfgsvLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggTL 646
Cdd:NF033849  238 SAGTGYGESVGHSTS----QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSE----SQ 309
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  647 STSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACfsgapiTNPGFGGAFSTSAGFGGALSTAADFGGTP 726
Cdd:NF033849  310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSIS------HSESSSESTGTSVGHSTSSSVSSSESSSR 383
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  727 SNSIGFGAAPSTSVSFGGAhgTSLCFGGAPSTSLCFGSAsntnlcfGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGl 806
Cdd:NF033849  384 SSSSGVSGGFSGGIAGGGV--TSEGLGASQGGSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSG- 453
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  807 sTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGL 886
Cdd:NF033849  454 -QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQS------ETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGR 526
                         330
                  ....*....|....*....
gi 403310651  887 VTSDGFGGGLGTNASFGST 905
Cdd:NF033849  527 TSGAGGSMGLGPSISLGKS 545
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
467-883 8.35e-19

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 92.38  E-value: 8.35e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  467 STSTSFG-SAPTtstVFSSALSTSTGFGGilSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGS 545
Cdd:NF033849  218 QKSISFGvSLPM---MYAANLGQSAGTGY--GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTS 292
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  546 SSTSANFGGTLSTSIcfdgSPSTGAGFGGALNTSASF--GSVLNTSTGFGGAMSTSAdfggtlstsvcfGGSPGTSVSFG 623
Cdd:NF033849  293 ESESTGQSSSVGTSE----SQSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSD------------GTSQSTSISHS 356
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  624 SALNTNAGYGGAVSTNTDFGGTLSTSVCFggSPSTSAGFGGALNtnasfgcavstsasfsgavstsacfsGAPITNPGFG 703
Cdd:NF033849  357 ESSSESTGTSVGHSTSSSVSSSESSSRSS--SSGVSGGFSGGIA--------------------------GGGVTSEGLG 408
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  704 GAFSTSAGFGGALSTAAdFGGTPSNSIGFGAapSTSVSFGGAHGTSLcfggapstslcfgsasntnlcfggppSTSACFS 783
Cdd:NF033849  409 ASQGGSEGWGSGDSVQS-VSQSYGSSSSTGT--SSGHSDSSSHSTSS--------------------------GQADSVS 459
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  784 GATSPSfcDGPSTSTGFSFGNGlstnagfggglnTSAGFGGGLGTSAGFSGGLSTSSGfdGGLGTSAGFGGGPGTSTGFG 863
Cdd:NF033849  460 QGTSWS--EGTGTSQGQSVGTS------------ESWSTSQSETDSVGDSTGTSESVS--QGDGRSTGRSESQGTSLGTS 523
                         410       420
                  ....*....|....*....|
gi 403310651  864 GGLGTSAGFSGGLGTSAGFG 883
Cdd:NF033849  524 GGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
457-741 1.34e-15

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 81.98  E-value: 1.34e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  457 STTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSI------CFGGSPCTST 530
Cdd:NF033849  244 GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSEsqshgtTEGTSTTDSS 323
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  531 GFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGF----GGALNTSASFGSVLNTSTGFGGAMSTSADFGGTL 606
Cdd:NF033849  324 SHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTsvghSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVT 403
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  607 STSvcFGGSPGTSVSFGSA---LNTNAGYGGAVSTNTDFGGTLST--SVCFGGSPSTSAGFGGALNTNASFGCAVSTSAS 681
Cdd:NF033849  404 SEG--LGASQGGSEGWGSGdsvQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 403310651  682 FSG----AVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGaaPSTSVS 741
Cdd:NF033849  482 WSTsqseTDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLG--PSISLG 543
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
329-1028 8.04e-14

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 76.34  E-value: 8.04e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  329 GASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSF 408
Cdd:COG3210   791 GAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSL 870
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  409 SSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALST 488
Cdd:COG3210   871 AATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQG 950
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  489 STGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPST 568
Cdd:COG3210   951 NAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTAS 1030
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  569 GAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLST 648
Cdd:COG3210  1031 ATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTT 1110
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  649 SVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSN 728
Cdd:COG3210  1111 TSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAAT 1190
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  729 SIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLST 808
Cdd:COG3210  1191 EGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGAT 1270
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  809 NAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVT 888
Cdd:COG3210  1271 STVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANT 1350
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  889 SDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIV 968
Cdd:COG3210  1351 GLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSA 1430
                         650       660       670       680       690       700
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  969 GFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1028
Cdd:COG3210  1431 TTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGN 1490
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
361-683 1.28e-11

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 68.88  E-value: 1.28e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  361 TFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSFSSEASISF--GGMPCTSASFSGgvsssfsgpl 438
Cdd:NF033849  246 SVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSsvGTSESQSHGTTE---------- 315
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  439 STSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLST 518
Cdd:NF033849  316 GTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSG 395
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  519 SIcfGGSPCTSTGFGGTLSTSVSFGGSSSTSanfggtlSTSICFDGSPSTGAGFGGALNTSASFGSvlNTSTGFGGAMST 598
Cdd:NF033849  396 GI--AGGGVTSEGLGASQGGSEGWGSGDSVQ-------SVSQSYGSSSSTGTSSGHSDSSSHSTSS--GQADSVSQGTSW 464
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  599 SADFGGTLSTSVcfggspGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVST 678
Cdd:NF033849  465 SEGTGTSQGQSV------GTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGP 538

                  ....*
gi 403310651  679 SASFS 683
Cdd:NF033849  539 SISLG 543
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
699-923 8.56e-09

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 59.30  E-value: 8.56e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   699 NPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPstslcFGSASNTNLCFGGPPST 778
Cdd:pfam15967    5 SFGGGPGSTATAGGGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASS 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   779 SAC--FSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLgtsaGFSGGLSTSSGFDGGLGTSAGFGGGP 856
Cdd:pfam15967   80 TAAtgPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGL----SLGSVLTSTAAQQGATGFTLNLGGTP 155
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 403310651   857 GTSTGFGGGLGTSagfsgglGTSAGFGGGLVTSDGfGGGLGTNASFGSTLGTSAGFSGGLSTSDGFG 923
Cdd:pfam15967  156 ATTTAVSTGLSLG-------STLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLG 214
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
403-888 9.28e-08

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 56.33  E-value: 9.28e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  403 STSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVF 482
Cdd:COG4625    14 GGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGV 93
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  483 SSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICF 562
Cdd:COG4625    94 GGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGG 173
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  563 DGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVcfGGSPGTSVSFGSALNTNAGYGGAVSTNTDF 642
Cdd:COG4625   174 GGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGG--GGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 251
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  643 GGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADF 722
Cdd:COG4625   252 GGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 331
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  723 GGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSF 802
Cdd:COG4625   332 GGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGG 411
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  803 GNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 882
Cdd:COG4625   412 GAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVN 491

                  ....*.
gi 403310651  883 GGGLVT 888
Cdd:COG4625   492 GGGNYT 497
PTZ00395 PTZ00395
Sec24-related protein; Provisional
612-834 2.19e-05

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 48.92  E-value: 2.19e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  612 FGGSPGTSVSFGSALNTNAGYGgavstNTDFGGTLSTSvcfggSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSAC 691
Cdd:PTZ00395  339 YGGFHDGSPNAASAGAPFNGLG-----NQADGGHINQV-----HPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAG 408
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  692 FSGAPITNPGFggafsTSAGFggalsTAADFGGTPSNSigfgaAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNlc 771
Cdd:PTZ00395  409 FSNAGYSNPGN-----SNPGY-----NNAPNSNTPYNN-----PPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSN-- 471
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 403310651  772 fgGPPSTS----ACFSGATSPSFCDGPS---------TSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSG 834
Cdd:PTZ00395  472 --APPSSAkdhhSAYHAAYQHRAANQPAanlptanqpAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNG 545
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
824-1027 6.10e-04

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 43.45  E-value: 6.10e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  824 GGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGfggGLVTSDGFGGGLGTNASFG 903
Cdd:cd21118   125 GGHGAYGSQGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQ---GAVAQPGYGTVRGNNQNSG 201
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  904 STLGTSAGFSGGLSTSDGfGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSivGFSGGPSTGVGFCSG 983
Cdd:cd21118   202 CTNPPPSGSHESFSNSGG-SSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSG--GSNGGSSGNSGSGSG 278
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 403310651  984 PSTSGFSGGPSTGAGFGGGPNTGAGFG---GGPSTSAGFGSGAASLG 1027
Cdd:cd21118   279 GSSSGGSNGWGGSSSSGGSGGSGGGNKpecNNPGNDVRMAGGGGSQG 325
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
525-710 1.46e-03

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 42.35  E-value: 1.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   525 SPCTSTG---FGGTLSTSVSFGGSSSTSANFGGTLstsicFDGSPSTGAGFGGALNTSASFGSVLNT----------STG 591
Cdd:pfam15967   28 SNPGSTGgfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASSTAATGPTGLTlgtpaattaaSTG 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   592 FGGAMSTSAdfGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGS---PSTSAGFGGALNT 668
Cdd:pfam15967  103 FSLGFNKPA--ASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTTAVSTGlslGSTLTSLGGSLFQ 180
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 403310651   669 NASfGCAVSTSASFSGAVSTSACFSGAPITNPGFGGA-FSTSA 710
Cdd:pfam15967  181 NTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGGLdFSTSS 222
 
Name Accession Description Interval E-value
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
667-1034 1.31e-24

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 111.25  E-value: 1.31e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  667 NTNASFGcaVSTSASFSGAVSTSAcfsgapitnpgfGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAH 746
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQSA------------GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR 283
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  747 GTSlcFGGAPSTSLCFGSASNTnlcfggppSTSACFSGATSpsfcDGPSTSTGFSFGNGLSTNAGFGGGLNTSagfgGGL 826
Cdd:NF033849  284 GWS--HTQSTSESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSS----HSD 345
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  827 GTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGlgTSAGFGGGL----VTSDGFGGGLGTNASF 902
Cdd:NF033849  346 GTSQSTSISHSESSS------ESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIagggVTSEGLGASQGGSEGW 417
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  903 GSTLGTSaGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCS 982
Cdd:NF033849  418 GSGDSVQ-SVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDST 496
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 403310651  983 GPSTSgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFgsgaaSLG-ACGFSYG 1034
Cdd:NF033849  497 GTSES-VSQGDGRSTGRSESQGTSLGTSGGRTSGAGG-----SMGlGPSISLG 543
MAGE pfam01454
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ...
54-214 2.72e-23

MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.


Pssm-ID: 426270  Cd Length: 205  Bit Score: 98.88  E-value: 2.72e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651    54 LVKYLLVKDQTKIPIKRSDMLRDVIQEYDE-YFPEIIERASYTLEKMFRVNLKEID--------------------KQSS 112
Cdd:pfam01454    1 LVRYALACEYQRTPIRREDISKKVLGENRKrLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   113 LYILIST---QESSAGILGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLR---PGVRHSLFG 177
Cdd:pfam01454   81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 403310651   178 EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 214
Cdd:pfam01454  161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
537-893 9.77e-21

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 98.92  E-value: 9.77e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  537 STSVSFGgsSSTSANFGGTLSTSicfdGSPSTGAGFGGAL--NTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGG 614
Cdd:NF033849  218 QKSISFG--VSLPMMYAANLGQS----AGTGYGESVGHSTsqGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQST 291
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  615 SPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSG 694
Cdd:NF033849  292 SESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHST 371
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  695 APITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGgahgtslcfggapstslcfGSASNTNLCFGG 774
Cdd:NF033849  372 SSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSG-------------------DSVQSVSQSYGS 432
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  775 PPSTSacfsgaTSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDggLGTSAGFGG 854
Cdd:NF033849  433 SSSTG------TSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDS--TGTSESVSQ 504
                         330       340       350
                  ....*....|....*....|....*....|....*....
gi 403310651  855 GPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFG 893
Cdd:NF033849  505 GDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
567-905 2.45e-20

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 97.38  E-value: 2.45e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  567 STGAGFGGALNTSASfgsvLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTdfggTL 646
Cdd:NF033849  238 SAGTGYGESVGHSTS----QGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSE----SQ 309
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  647 STSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACfsgapiTNPGFGGAFSTSAGFGGALSTAADFGGTP 726
Cdd:NF033849  310 SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSIS------HSESSSESTGTSVGHSTSSSVSSSESSSR 383
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  727 SNSIGFGAAPSTSVSFGGAhgTSLCFGGAPSTSLCFGSAsntnlcfGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGl 806
Cdd:NF033849  384 SSSSGVSGGFSGGIAGGGV--TSEGLGASQGGSEGWGSG-------DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSG- 453
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  807 sTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGfdgglgTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGL 886
Cdd:NF033849  454 -QADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQS------ETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGR 526
                         330
                  ....*....|....*....
gi 403310651  887 VTSDGFGGGLGTNASFGST 905
Cdd:NF033849  527 TSGAGGSMGLGPSISLGKS 545
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
467-883 8.35e-19

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 92.38  E-value: 8.35e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  467 STSTSFG-SAPTtstVFSSALSTSTGFGGilSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGS 545
Cdd:NF033849  218 QKSISFGvSLPM---MYAANLGQSAGTGY--GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTS 292
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  546 SSTSANFGGTLSTSIcfdgSPSTGAGFGGALNTSASF--GSVLNTSTGFGGAMSTSAdfggtlstsvcfGGSPGTSVSFG 623
Cdd:NF033849  293 ESESTGQSSSVGTSE----SQSHGTTEGTSTTDSSSHsqSSSYNVSSGTGVSSSHSD------------GTSQSTSISHS 356
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  624 SALNTNAGYGGAVSTNTDFGGTLSTSVCFggSPSTSAGFGGALNtnasfgcavstsasfsgavstsacfsGAPITNPGFG 703
Cdd:NF033849  357 ESSSESTGTSVGHSTSSSVSSSESSSRSS--SSGVSGGFSGGIA--------------------------GGGVTSEGLG 408
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  704 GAFSTSAGFGGALSTAAdFGGTPSNSIGFGAapSTSVSFGGAHGTSLcfggapstslcfgsasntnlcfggppSTSACFS 783
Cdd:NF033849  409 ASQGGSEGWGSGDSVQS-VSQSYGSSSSTGT--SSGHSDSSSHSTSS--------------------------GQADSVS 459
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  784 GATSPSfcDGPSTSTGFSFGNGlstnagfggglnTSAGFGGGLGTSAGFSGGLSTSSGfdGGLGTSAGFGGGPGTSTGFG 863
Cdd:NF033849  460 QGTSWS--EGTGTSQGQSVGTS------------ESWSTSQSETDSVGDSTGTSESVS--QGDGRSTGRSESQGTSLGTS 523
                         410       420
                  ....*....|....*....|
gi 403310651  864 GGLGTSAGFSGGLGTSAGFG 883
Cdd:NF033849  524 GGRTSGAGGSMGLGPSISLG 543
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
457-741 1.34e-15

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 81.98  E-value: 1.34e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  457 STTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSI------CFGGSPCTST 530
Cdd:NF033849  244 GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSEsqshgtTEGTSTTDSS 323
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  531 GFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGF----GGALNTSASFGSVLNTSTGFGGAMSTSADFGGTL 606
Cdd:NF033849  324 SHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTsvghSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVT 403
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  607 STSvcFGGSPGTSVSFGSA---LNTNAGYGGAVSTNTDFGGTLST--SVCFGGSPSTSAGFGGALNTNASFGCAVSTSAS 681
Cdd:NF033849  404 SEG--LGASQGGSEGWGSGdsvQSVSQSYGSSSSTGTSSGHSDSSshSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSES 481
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 403310651  682 FSG----AVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGaaPSTSVS 741
Cdd:NF033849  482 WSTsqseTDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLG--PSISLG 543
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
329-1028 8.04e-14

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 76.34  E-value: 8.04e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  329 GASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSF 408
Cdd:COG3210   791 GAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSL 870
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  409 SSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALST 488
Cdd:COG3210   871 AATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQG 950
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  489 STGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPST 568
Cdd:COG3210   951 NAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTAS 1030
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  569 GAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLST 648
Cdd:COG3210  1031 ATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTT 1110
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  649 SVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSN 728
Cdd:COG3210  1111 TSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAAT 1190
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  729 SIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLST 808
Cdd:COG3210  1191 EGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGAT 1270
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  809 NAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVT 888
Cdd:COG3210  1271 STVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANT 1350
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  889 SDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIV 968
Cdd:COG3210  1351 GLNGGNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSA 1430
                         650       660       670       680       690       700
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  969 GFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1028
Cdd:COG3210  1431 TTGTGGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGN 1490
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
361-683 1.28e-11

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 68.88  E-value: 1.28e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  361 TFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSFSSEASISF--GGMPCTSASFSGgvsssfsgpl 438
Cdd:NF033849  246 SVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSsvGTSESQSHGTTE---------- 315
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  439 STSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLST 518
Cdd:NF033849  316 GTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSG 395
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  519 SIcfGGSPCTSTGFGGTLSTSVSFGGSSSTSanfggtlSTSICFDGSPSTGAGFGGALNTSASFGSvlNTSTGFGGAMST 598
Cdd:NF033849  396 GI--AGGGVTSEGLGASQGGSEGWGSGDSVQ-------SVSQSYGSSSSTGTSSGHSDSSSHSTSS--GQADSVSQGTSW 464
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  599 SADFGGTLSTSVcfggspGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVST 678
Cdd:NF033849  465 SEGTGTSQGQSV------GTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGP 538

                  ....*
gi 403310651  679 SASFS 683
Cdd:NF033849  539 SISLG 543
AidA COG3468
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ...
483-914 7.85e-11

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 66.51  E-value: 7.85e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  483 SSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICF 562
Cdd:COG3468     3 SGGGGGATGLGGGGTGGGGGLGGTGGGNAGLGIGNGGGGGAASGSGAGGVAGNGGGGGGGAGGGGGGAGSGGGLAGAGSG 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  563 DGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSvcfGGSPGTSVSFGSALNTNAGYGGAVSTNTDF 642
Cdd:COG3468    83 GTGGNSTGGGGGNSGTGGTGGGGGGGGSGNGGGGGGGGGGGGTGGGG---GGGTGSAGGGGGGGGGGTGVGGTGAAAAGG 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  643 GGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADF 722
Cdd:COG3468   160 GTGSGGGGSGGGGGAGGGGGGGAGGSGGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGGGVG 239
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  723 GGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSF 802
Cdd:COG3468   240 GGGGSAGGTGGGGLTGGGAAGTGGGGGGTGTGSGGGGGGGANGGGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGGGGG 319
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  803 GNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 882
Cdd:COG3468   320 SNAGGGSGGGGGGGGGGGGGGTTLNGAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDGVGTGLTTGG 399
                         410       420       430
                  ....*....|....*....|....*....|..
gi 403310651  883 GGGLVTSDGFGGGLGTNASFGSTLGTSAGFSG 914
Cdd:COG3468   400 TGNNGGGGVGGGGGGGLTLTGGTLTVNGNYTG 431
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
314-1028 1.68e-10

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 65.56  E-value: 1.68e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  314 AQENADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSA 393
Cdd:COG3210   606 GSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGG 685
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  394 ASISFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFG 473
Cdd:COG3210   686 TTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGVTITSGNAGTLSIGLTANT 765
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  474 SAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFG 553
Cdd:COG3210   766 TASGTTLTLANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNT 845
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  554 GTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYG 633
Cdd:COG3210   846 TDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGG 925
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  634 GAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFG 713
Cdd:COG3210   926 LTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTT 1005
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  714 GALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTslcfGSASNTNLCFGGPPSTSACFSGATSPSFCDG 793
Cdd:COG3210  1006 ASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNA----SGISGGNAAALTASGTAGTTGGTAASNGGGG 1081
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  794 PSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFS 873
Cdd:COG3210  1082 TAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSA 1161
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  874 GGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGE 953
Cdd:COG3210  1162 SAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQT 1241
                         650       660       670       680       690       700       710
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 403310651  954 PSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1028
Cdd:COG3210  1242 GSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGT 1316
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
317-948 2.22e-10

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 65.17  E-value: 2.22e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  317 NADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASI 396
Cdd:COG3210   115 TLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGV 194
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  397 SFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAP 476
Cdd:COG3210   195 TGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIG 274
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  477 TTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTL 556
Cdd:COG3210   275 TTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGT 354
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  557 STSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAV 636
Cdd:COG3210   355 TGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLG 434
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  637 STNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSAcfsgapITNPGFGGAFSTSAGFGGAL 716
Cdd:COG3210   435 ITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGG------GIGTVTTNATISNNAGGDAN 508
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  717 STAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPST 796
Cdd:COG3210   509 GIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSA 588
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  797 STGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGL 876
Cdd:COG3210   589 TGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAG 668
                         570       580       590       600       610       620       630
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 403310651  877 GTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTST 948
Cdd:COG3210   669 GTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGAT 740
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
439-915 2.35e-09

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 61.72  E-value: 2.35e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  439 STSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLST 518
Cdd:COG4625    18 GGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGG 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  519 SICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMST 598
Cdd:COG4625    98 GGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGG 177
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  599 SADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVST 678
Cdd:COG4625   178 GGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGG 257
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  679 SASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPST 758
Cdd:COG4625   258 NGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 337
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  759 SLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLST 838
Cdd:COG4625   338 GGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGG 417
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 403310651  839 SSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGG 915
Cdd:COG4625   418 GAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGG 494
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
699-923 8.56e-09

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 59.30  E-value: 8.56e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   699 NPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPstslcFGSASNTNLCFGGPPST 778
Cdd:pfam15967    5 SFGGGPGSTATAGGGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASS 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   779 SAC--FSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLgtsaGFSGGLSTSSGFDGGLGTSAGFGGGP 856
Cdd:pfam15967   80 TAAtgPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGL----SLGSVLTSTAAQQGATGFTLNLGGTP 155
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 403310651   857 GTSTGFGGGLGTSagfsgglGTSAGFGGGLVTSDGfGGGLGTNASFGSTLGTSAGFSGGLSTSDGFG 923
Cdd:pfam15967  156 ATTTAVSTGLSLG-------STLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLG 214
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
403-888 9.28e-08

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 56.33  E-value: 9.28e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  403 STSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVF 482
Cdd:COG4625    14 GGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGV 93
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  483 SSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTLSTSICF 562
Cdd:COG4625    94 GGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGG 173
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  563 DGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVcfGGSPGTSVSFGSALNTNAGYGGAVSTNTDF 642
Cdd:COG4625   174 GGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGG--GGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 251
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  643 GGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGALSTAADF 722
Cdd:COG4625   252 GGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 331
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  723 GGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFCDGPSTSTGFSF 802
Cdd:COG4625   332 GGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGG 411
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  803 GNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 882
Cdd:COG4625   412 GAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVN 491

                  ....*.
gi 403310651  883 GGGLVT 888
Cdd:COG4625   492 GGGNYT 497
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
455-1034 5.05e-07

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 54.01  E-value: 5.05e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  455 TLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGG 534
Cdd:COG5295     5 AGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAASSVA 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  535 TLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGG 614
Cdd:COG5295    85 SGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATATGSST 164
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  615 SPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSG 694
Cdd:COG5295   165 ANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASAGAA 244
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  695 APITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIG--FGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCF 772
Cdd:COG5295   245 SGNATTASASSVSGSAVAAGTASTATTASTTAASGAAgtATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAAALG 324
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  773 GGPPSTSACFSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGF 852
Cdd:COG5295   325 SAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGSSTG 404
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  853 GGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDR 932
Cdd:COG5295   405 ASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSS 484
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  933 GLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGG 1012
Cdd:COG5295   485 AAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGNNTAT 564
                         570       580
                  ....*....|....*....|..
gi 403310651 1013 PSTSAGFGSGAASLGACGFSYG 1034
Cdd:COG5295   565 GANSVALGAGSVASGANSVSVG 586
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
370-871 8.18e-07

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 53.24  E-value: 8.18e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  370 ASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGAS 449
Cdd:COG4625     1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  450 SGFGGTLSTTAGFSGVLSTSTSFGSAPTTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTS 529
Cdd:COG4625    81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  530 TGFGGTLSTSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTS 609
Cdd:COG4625   161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  610 VCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTS 689
Cdd:COG4625   241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  690 ACFSGAPITNPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTN 769
Cdd:COG4625   321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  770 LCFGGPPSTSACFSGATSPSFcDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTS 849
Cdd:COG4625   401 GGGGAGGTGGGGAGGGGGAAG-GGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLT 479
                         490       500
                  ....*....|....*....|..
gi 403310651  850 AGFGGGPGTSTGFGGGLGTSAG 871
Cdd:COG4625   480 GNNTYTGTTTVNGGGNYTQSAG 501
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
585-799 1.64e-06

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 52.06  E-value: 1.64e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  585 VLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGG 664
Cdd:COG3469     1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  665 ALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNpgfggafSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGG 744
Cdd:COG3469    81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTT-------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTV 153
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 403310651  745 AHGTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACfSGATSPSFCDGPSTSTG 799
Cdd:COG3469   154 SGTETATGGTTTTSTTTTTTSASTTPSATTTATATTA-SGATTPSATTTATTTGP 207
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
652-884 3.58e-06

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 50.82  E-value: 3.58e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   652 FGGSPSTSAGFGGALntnaSFGCAVSTSASFSGAVSTSAcFSGAPITNPGfggAFSTSAGFGGALstaadFGGTPSNSIG 731
Cdd:pfam15967    6 FGGGPGSTATAGGGF----SFGAAAASNPGSTGGFSFGT-LGAAPAATAT---TTTATLGLGGGL-----FGQKPATGFT 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   732 FGAAPSTSvsfgGAHGTSLCFGGAPSTSlcfgSASNTNLCFGGPPSTSAcfsgATSPSFCDGPSTSTGFSFGNGLSTNAG 811
Cdd:pfam15967   73 FGTPASST----AATGPTGLTLGTPAAT----TAASTGFSLGFNKPAAS----ATPFSLPASSTSGGGLSLGSVLTSTAA 140
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 403310651   812 FGGGLNTSAGFGGGLGTSAGFSGGL---STSSGFDGGLGTSAGfGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGG 884
Cdd:pfam15967  141 QQGATGFTLNLGGTPATTTAVSTGLslgSTLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
798-1034 8.94e-06

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 49.67  E-value: 8.94e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   798 TGFSFGNGLSTNAGFGGGLNtsagFGGGLGTSAGFSGGLstssGFDGGLGTSAGFGGGPGTSTGFGGGLgtsagFSGGLG 877
Cdd:pfam15967    2 SGFSFGGGPGSTATAGGGFS----FGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGL-----FGQKPA 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   878 TSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSrpnaSFDRGLSTIIGFGSGSNTSTGFTGEPSTS 957
Cdd:pfam15967   69 TGFTFGTPASSTAATGPTGLTLGTPAATTAASTGFSLGFNKPAASAT----PFSLPASSTSGGGLSLGSVLTSTAAQQGA 144
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   958 TGFSSGPSSIVGFSGGPSTGVGFCSGPSTSG---FSGGPSTGAGfgggpNTGAGFGGGPSTSAGFGSGAASLGACGFSYG 1034
Cdd:pfam15967  145 TGFTLNLGGTPATTTAVSTGLSLGSTLTSLGgslFQNTNSTGLG-----QTTLGLTLLATSTAPVSAPAASEGLGGLDFS 219
PTZ00395 PTZ00395
Sec24-related protein; Provisional
612-834 2.19e-05

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 48.92  E-value: 2.19e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  612 FGGSPGTSVSFGSALNTNAGYGgavstNTDFGGTLSTSvcfggSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSAC 691
Cdd:PTZ00395  339 YGGFHDGSPNAASAGAPFNGLG-----NQADGGHINQV-----HPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAG 408
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  692 FSGAPITNPGFggafsTSAGFggalsTAADFGGTPSNSigfgaAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNlc 771
Cdd:PTZ00395  409 FSNAGYSNPGN-----SNPGY-----NNAPNSNTPYNN-----PPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSN-- 471
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 403310651  772 fgGPPSTS----ACFSGATSPSFCDGPS---------TSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSG 834
Cdd:PTZ00395  472 --APPSSAkdhhSAYHAAYQHRAANQPAanlptanqpAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNG 545
PPE COG5651
PPE-repeat protein [Function unknown];
772-1004 3.10e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 47.58  E-value: 3.10e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  772 FGGPPSTSACFSGATSPSFCDGPSTSTGFSFGNglstnagfggglntsAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAG 851
Cdd:COG5651   167 FTQPPPTITNPGGLLGAQNAGSGNTSSNPGFAN---------------LGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGF 231
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  852 FGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFD 931
Cdd:COG5651   232 AGTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLG 311
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 403310651  932 RGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPN 1004
Cdd:COG5651   312 AGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAA 384
PPE COG5651
PPE-repeat protein [Function unknown];
833-1028 3.16e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 47.58  E-value: 3.16e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  833 SGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGtnaSFGSTLGTSAGF 912
Cdd:COG5651   177 PGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAA---AAAAAAAAAAGA 253
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  913 SGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGG 992
Cdd:COG5651   254 GASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAA 333
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 403310651  993 PSTGAGFGGGPNTGAGFGGGPSTSAGFGSGAASLGA 1028
Cdd:COG5651   334 AAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGG 369
PPE COG5651
PPE-repeat protein [Function unknown];
803-1021 6.49e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 46.42  E-value: 6.49e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  803 GNGLSTNAGFGGGLNTSAGFGgglgtSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGF 882
Cdd:COG5651   178 GGLLGAQNAGSGNTSSNPGFA-----NLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAG 252
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  883 GGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDgfgsrPNASFDRGLSTIIGFGSGSNTSTGFTGePSTSTGFSS 962
Cdd:COG5651   253 AGASAALASLAATLLNASSLGLAATAASSAATNLGLAG-----SPLGLAGGGAGAAAATGLGLGAGGAAG-AAGATGAGA 326
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 403310651  963 GPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGS 1021
Cdd:COG5651   327 ALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
632-854 1.21e-04

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 45.81  E-value: 1.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   632 YGGAVSTNTDFGGTLSTSVCFGGSPSTSAG--FGGALNTNASFGCAVSTSASFSGAVstsacFSGAPITNPGFGGAFSTS 709
Cdd:pfam15967    6 FGGGPGSTATAGGGFSFGAAAASNPGSTGGfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASST 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   710 AGFGGALSTaadfGGTPSNSigfgAAPSTSVSFGGAHGTslcfGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPS 789
Cdd:pfam15967   81 AATGPTGLT----LGTPAAT----TAASTGFSLGFNKPA----ASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFT 148
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 403310651   790 FCDGPSTSTGFSFGNGL---STNAGFGGGLNTSAGfGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGG 854
Cdd:pfam15967  149 LNLGGTPATTTAVSTGLslgSTLTSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
793-1001 1.53e-04

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 45.81  E-value: 1.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   793 GPSTSTGFSFGNGLSTNAGFGGGLntsaGFGGGLGTSAGFSGGLSTSSGFDGGLgtsagFGGGPGTSTGFGGGLGTSAGF 872
Cdd:pfam15967   13 TATAGGGFSFGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASSTAAT 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   873 SGGLGTSAGFGGGLVTSDGFGGGLGTNASFGS--TLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGfGSGSNTSTGF 950
Cdd:pfam15967   84 GPTGLTLGTPAATTAASTGFSLGFNKPAASATpfSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLG-GTPATTTAVS 162
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 403310651   951 TGEP--STSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGG 1001
Cdd:pfam15967  163 TGLSlgSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
PTZ00395 PTZ00395
Sec24-related protein; Provisional
861-1007 3.41e-04

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 44.68  E-value: 3.41e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  861 GFGGGL--GTSAGF-SGGLGTSAGFGG-GLVTSD---GFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPnasfdrg 933
Cdd:PTZ00395  341 GFHDGSpnAASAGApFNGLGNQADGGHiNQVHPDargAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAG------- 413
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 403310651  934 lstiigFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPstsgFSGGPSTGAGFGGGPNTGA 1007
Cdd:PTZ00395  414 ------YSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLP----YSNTPYSNAPLSNAPPSSA 477
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
785-1006 5.56e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.97  E-value: 5.56e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  785 ATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGG 864
Cdd:COG3469     1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  865 GLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFgsrpNASFDRGLSTIIGFGSGS 944
Cdd:COG3469    81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGA----SATSSAGSTTTTTTVSGT 156
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 403310651  945 NTSTGFTGEPSTSTGFSSGPSSivgfsggPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTG 1006
Cdd:COG3469   157 ETATGGTTTTSTTTTTTSASTT-------PSATTTATATTASGATTPSATTTATTTGPPTPG 211
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
824-1027 6.10e-04

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 43.45  E-value: 6.10e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  824 GGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGfggGLVTSDGFGGGLGTNASFG 903
Cdd:cd21118   125 GGHGAYGSQGGPGVQGHGIPGGTGGPWASGGNYGTNSLGGSVGQGGNGGPLNYGTNSQ---GAVAQPGYGTVRGNNQNSG 201
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  904 STLGTSAGFSGGLSTSDGfGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSivGFSGGPSTGVGFCSG 983
Cdd:cd21118   202 CTNPPPSGSHESFSNSGG-SSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSG--GSNGGSSGNSGSGSG 278
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 403310651  984 PSTSGFSGGPSTGAGFGGGPNTGAGFG---GGPSTSAGFGSGAASLG 1027
Cdd:cd21118   279 GSSSGGSNGWGGSSSSGGSGGSGGGNKpecNNPGNDVRMAGGGGSQG 325
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
317-859 7.31e-04

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 43.61  E-value: 7.31e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  317 NADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSGGPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASI 396
Cdd:COG4625     2 GGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGG 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  397 SFGCAHSTSTSFSSEASISFGGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAP 476
Cdd:COG4625    82 GGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAG 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  477 TTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGSSSTSANFGGTL 556
Cdd:COG4625   162 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGG 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  557 STSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAV 636
Cdd:COG4625   242 GGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 321
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  637 STNTDFGGTLSTSVCFGGSPSTSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGAL 716
Cdd:COG4625   322 GGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGG 401
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  717 STAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSlcfGSASNTNLCFGGPPSTSACFSGATSPSFCDGPST 796
Cdd:COG4625   402 GGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGG---ATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTL 478
                         490       500       510       520       530       540
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 403310651  797 STGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGLSTSSGFDGGLGTSAGFGGGPGTS 859
Cdd:COG4625   479 TGNNTYTGTTTVNGGGNYTQSAGSTLAVEVDAANSDRLVVTGTATLNGGTVVVLAGGYAPGTT 541
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
525-710 1.46e-03

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 42.35  E-value: 1.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   525 SPCTSTG---FGGTLSTSVSFGGSSSTSANFGGTLstsicFDGSPSTGAGFGGALNTSASFGSVLNT----------STG 591
Cdd:pfam15967   28 SNPGSTGgfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASSTAATGPTGLTlgtpaattaaSTG 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   592 FGGAMSTSAdfGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGS---PSTSAGFGGALNT 668
Cdd:pfam15967  103 FSLGFNKPA--ASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGTPATTTAVSTGlslGSTLTSLGGSLFQ 180
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 403310651   669 NASfGCAVSTSASFSGAVSTSACFSGAPITNPGFGGA-FSTSA 710
Cdd:pfam15967  181 NTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLGGLdFSTSS 222
PPE COG5651
PPE-repeat protein [Function unknown];
856-1028 7.14e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 39.88  E-value: 7.14e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  856 PGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLS 935
Cdd:COG5651   171 PPTITNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAA 250
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  936 TIIGFGSGSNTSTGFTGEPSTSTGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPST 1015
Cdd:COG5651   251 AGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGA 330
                         170
                  ....*....|...
gi 403310651 1016 SAGFGSGAASLGA 1028
Cdd:COG5651   331 GAAAAAAGAAAGA 343
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
592-768 7.30e-03

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 40.04  E-value: 7.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   592 FGGAMSTSADFGGTLSTSVCFGGSPGTS--VSFGSALNTNAGYGGAVSTNTDFGGTLstsvcFGGSPSTSAGFGGALNTN 669
Cdd:pfam15967    6 FGGGPGSTATAGGGFSFGAAAASNPGSTggFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASST 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651   670 ASFGCAVSTSASFSGAVSTSACFSgapitnPGFGGAFSTSAGFGGALSTAADFGGTPSNSIGFGAAPSTSVSFGGAHGTS 749
Cdd:pfam15967   81 AATGPTGLTLGTPAATTAASTGFS------LGFNKPAASATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLNLGGT 154
                          170
                   ....*....|....*....
gi 403310651   750 LCFGGAPSTSLCFGSASNT 768
Cdd:pfam15967  155 PATTTAVSTGLSLGSTLTS 173
PRK13729 PRK13729
conjugal transfer pilus assembly protein TraB; Provisional
851-895 8.81e-03

conjugal transfer pilus assembly protein TraB; Provisional


Pssm-ID: 184281 [Multi-domain]  Cd Length: 475  Bit Score: 39.81  E-value: 8.81e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 403310651  851 GFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGG 895
Cdd:PRK13729  324 GWAWGAGFVDGIGQGMERASQPAVGLGATAAYGAGDVLKMGIGGG 368
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
704-931 9.84e-03

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 39.60  E-value: 9.84e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  704 GAFSTSAGFGGALSTAADFGGTPSNSIG-FGAAPSTSVSFGGAHGTSLCFGGAPSTSLC---FGSASNTNL---CFGGPP 776
Cdd:cd21118   128 GAYGSQGGPGVQGHGIPGGTGGPWASGGnYGTNSLGGSVGQGGNGGPLNYGTNSQGAVAqpgYGTVRGNNQnsgCTNPPP 207
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 403310651  777 STSACFSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLN--TSAGFGGGLGTSAGFSGGLSTSSG-FDGGLGTSAGFG 853
Cdd:cd21118   208 SGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNggNNGSSSSNSGNSGGSNGGSSGNSGsGSGGSSSGGSNG 287
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 403310651  854 GGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGLGTNASFGSTLGTSAgfSGGLSTSDGFGSRPNASFD 931
Cdd:cd21118   288 WGGSSSSGGSGGSGGGNKPECNNPGNDVRMAGGGGSQGSKESSGSHGSNGGNGQAEA--VGGLNTLNSDASTLPFNFD 363
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH