Conserved domains on  [gi|562154454|ref|NP_001274094|]
nuclear pore complex protein Nup98-Nup96 isoform 3 [Mus musculus]


List of domain hits

Name Accession Description Interval E-value
Nucleoporin2 pfam04096
Nucleoporin autopeptidase; The nuclear pore complex protein plays a role in bidirectional ...
721-863 5.02e-64

Nucleoporin autopeptidase; The nuclear pore complex protein plays a role in bidirectional transport across the nucleoporin complex in nucleocytoplasmic transport. The mammalian nuclear pore complex (NPC) is composed of approximately 30 unique proteins, collectively known as nucleoporins. This family includes yeast family members such as Nup145p as well as vertebrate Nup98. The NUP C-terminal domains of Nup98 and Nup145 possess peptidase S59 autoproteolytic activity. The autoproteolytic sites of Nup98 and Nup145 each occur immediately C-terminal to the NUP C-terminal domain. Thus, although this domain occurs in the middle of each precursor polypeptide, it winds up at the C-terminal end of the N-terminal cleavage product. Cleavage of the peptide chains is necessary for proper targeting to the nuclear pore.



Pssm-ID: 461171  Cd Length: 143  Bit Score: 213.12  E-value: 5.02e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   721 KVGYYTIPSMDDLAKITnEKGECIVSDFTIGRKGYGSIYFEGDVNLTNLNLDDI----VHIRRKEVIVYVDDNQKPPVGE 796
Cdd:pfam04096    1 KGDYWTSPSLEELKKMS-REQLSSVENFTVGRKGYGSVRFLGPVDLTGLDLDEIfgkiVKFEPREVTVYPDESSKPPVGQ 79
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 562154454   797 GLNRKAEVTLDGVWPTDKTSRCLIKSPDRLADINYEGRLEavsRKQGAQFKEYRPETGSWVFKVSHF 863
Cdd:pfam04096   80 GLNVPATITLENVWPRDKDTKEPIKDPSGPRLEKHIERLK---RVQGTEFVSYDPETGTWTFKVEHF 143
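The description above places the autoproteolytic site immediately C-terminal to the domain. A minimal Python sketch of what that implies for residue ranges follows. It assumes the cleavage point coincides with the end of the reported interval (residue 863), which the page does not state explicitly. The precursor length is a hypothetical placeholder, since the full length of isoform 3 is not given here, and the function name is illustrative.

```python
def cleavage_products(precursor_length: int, domain_end: int):
    """Return 1-based inclusive residue ranges for the two cleavage products.

    Assumes a single cut immediately after `domain_end`, so the domain
    ends up at the C-terminal end of the N-terminal product.
    """
    if not 1 <= domain_end < precursor_length:
        raise ValueError("domain_end must lie inside the precursor")
    n_terminal = (1, domain_end)
    c_terminal = (domain_end + 1, precursor_length)
    return n_terminal, c_terminal


# Hypothetical placeholder: the real precursor length is not shown on this page.
HYPOTHETICAL_PRECURSOR_LENGTH = 1800

n_product, c_product = cleavage_products(HYPOTHETICAL_PRECURSOR_LENGTH, 863)
print("N-terminal product:", n_product)
print("C-terminal product:", c_product)
```

Under these assumptions the pfam04096 hit (721-863) sits at the very end of the N-terminal product, matching the description.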
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
25-334 9.66e-10

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 63.10  E-value: 9.66e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   25 QNTGFGTTSGGAFGTS-AFGSSNNTGGLFGNSQTKPGGL---FGTSSFSQPATSTSTGFGFGTSTGTSNSLFGTASTGTS 100
Cdd:NF033849  255 QSHSVGTSESHSVGTSqSQSHTTGHGSTRGWSHTQSTSEsesTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVS 334
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  101 LFSSQNNAFAQNKPTGFGnFGTSTSSGGLFGTT--NTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGV 178
Cdd:NF033849  335 SGTGVSSSHSDGTSQSTS-ISHSESSSESTGTSvgHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGG 413
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  179 STNISTKhqcitamkeyesksleelrlEDYQANRKGPQNQVGGGTTAglfGSSPATSSATGLFSSSTTNSAFSYGQNKTA 258
Cdd:NF033849  414 SEGWGSG--------------------DSVQSVSQSYGSSSSTGTSS---GHSDSSSHSTSSGQADSVSQGTSWSEGTGT 470
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 562154454  259 fgTSTTGFGTNPGglFGQQNQQTTSlFSKPFGQATTTPN-TGFSFGNTSTLGQPSTNTMGlfgvtQASQPGGLFGTA 334
Cdd:NF033849  471 --SQGQSVGTSES--WSTSQSETDS-VGDSTGTSESVSQgDGRSTGRSESQGTSLGTSGG-----RTSGAGGSMGLG 537
Nucleoporin_FG2 super family cl37900
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
228-465 2.09e-03

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


The actual alignment was detected with superfamily member pfam15967:

Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 42.35  E-value: 2.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   228 FGSSPATSSATGlfsssttnSAFSYGQNKTAFGTSTTGFGTNPGGLFGQQNQQTT----SLFSKPFGQattTPNTGFSFG 303
Cdd:pfam15967    6 FGGGPGSTATAG--------GGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTtatlGLGGGLFGQ---KPATGFTFG 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   304 NTSTlgqpstntmglfgVTQASQPGGLFGTATNTSTGTafgtgtglfgqpNTGFGavgstlFGNNK----LTTFGTSTTS 379
Cdd:pfam15967   75 TPAS-------------STAATGPTGLTLGTPAATTAA------------STGFS------LGFNKpaasATPFSLPASS 123
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   380 APSFGTTSGGLFGFGTNNSGSSIF----GSKPAAGTLGTGLGTGFGTALGAGqASLFGNNQPKiggPLGTGAFGAPGFNT 455
Cdd:pfam15967  124 TSGGGLSLGSVLTSTAAQQGATGFtlnlGGTPATTTAVSTGLSLGSTLTSLG-GSLFQNTNST---GLGQTTLGLTLLAT 199
                          250
                   ....*....|
gi 562154454   456 STAILGFGAP 465
Cdd:pfam15967  200 STAPVSAPAA 209
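The Nucleoporin_FG2 description notes that these proteins carry many repeats of the FG motif. As a rough illustration, the Python sketch below counts FG and GLFG occurrences in one query row copied from the ser_rich_anae_1 alignment above (residues 25-100). The function name is hypothetical. Treating gap characters as removable and lowercase letters as ordinary residues is an assumption about how the alignment rows are displayed.

```python
import re


def count_fg_motifs(segment: str) -> dict:
    """Count FG and GLFG motifs in an alignment row or plain sequence.

    Gap characters and anything that is not a letter are dropped, and
    lowercase residues are upper-cased before counting.
    """
    seq = re.sub(r"[^A-Za-z]", "", segment).upper()
    # Lookahead patterns count every start position, including overlaps.
    return {
        "FG": len(re.findall(r"(?=FG)", seq)),
        "GLFG": len(re.findall(r"(?=GLFG)", seq)),
    }


# Query row for residues 25-100, copied from the alignment above.
query_row = (
    "QNTGFGTTSGGAFGTS-AFGSSNNTGGLFGNSQTKPGGL---FGTSSFSQPATSTSTGFGFGTSTGTSNSLFGTASTGTS"
)

print(count_fg_motifs(query_row))
```

The same function can be applied to any other query row on this page to compare FG density between regions.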
 
Name Accession Description Interval E-value
Nucleoporin2 pfam04096
Nucleoporin autopeptidase; The nuclear pore complex protein plays a role in bidirectional ...
721-863 5.02e-64

Nucleoporin autopeptidase; The nuclear pore complex protein plays a role in bidirectional transport across the nucleoporin complex in nucleocytoplasmic transport. The mammalian nuclear pore complex (NPC) is composed of approximately 30 unique proteins, collectively known as nucleoporins. This family includes yeast family members such as Nup145p as well as vertebrate Nup98. The NUP C-terminal domains of Nup98 and Nup145 possess peptidase S59 autoproteolytic activity. The autoproteolytic sites of Nup98 and Nup145 each occur immediately C-terminal to the NUP C-terminal domain. Thus, although this domain occurs in the middle of each precursor polypeptide, it winds up at the C-terminal end of the N-terminal cleavage product. Cleavage of the peptide chains is necessary for proper targeting to the nuclear pore.


Pssm-ID: 461171  Cd Length: 143  Bit Score: 213.12  E-value: 5.02e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   721 KVGYYTIPSMDDLAKITnEKGECIVSDFTIGRKGYGSIYFEGDVNLTNLNLDDI----VHIRRKEVIVYVDDNQKPPVGE 796
Cdd:pfam04096    1 KGDYWTSPSLEELKKMS-REQLSSVENFTVGRKGYGSVRFLGPVDLTGLDLDEIfgkiVKFEPREVTVYPDESSKPPVGQ 79
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 562154454   797 GLNRKAEVTLDGVWPTDKTSRCLIKSPDRLADINYEGRLEavsRKQGAQFKEYRPETGSWVFKVSHF 863
Cdd:pfam04096   80 GLNVPATITLENVWPRDKDTKEPIKDPSGPRLEKHIERLK---RVQGTEFVSYDPETGTWTFKVEHF 143
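To put a number on how closely the query matches the pfam04096 model, the Python sketch below counts identical residues over aligned columns for the first block of the alignment above. It assumes that dashes mark gaps and that lowercase letters mark unaligned residues, which is how these rows appear to be formatted. Both kinds of column are skipped. The function name is hypothetical.

```python
def alignment_identity(query_row: str, subject_row: str):
    """Return (identical, aligned) column counts for two alignment rows.

    Columns containing a gap ('-') or a lowercase (assumed unaligned)
    residue in either row are not counted as aligned.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# First block of the pfam04096 alignment (query residues 721-796).
query = "KVGYYTIPSMDDLAKITnEKGECIVSDFTIGRKGYGSIYFEGDVNLTNLNLDDI----VHIRRKEVIVYVDDNQKPPVGE"
model = "KGDYWTSPSLEELKKMS-REQLSSVENFTVGRKGYGSVRFLGPVDLTGLDLDEIfgkiVKFEPREVTVYPDESSKPPVGQ"

identical, aligned = alignment_identity(query, model)
print(f"{identical} identical of {aligned} aligned columns "
      f"({100 * identical / aligned:.1f}%)")
```

Concatenating both blocks of the alignment before calling the function gives the figure for the whole 721-863 interval.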
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
25-334 9.66e-10

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 63.10  E-value: 9.66e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   25 QNTGFGTTSGGAFGTS-AFGSSNNTGGLFGNSQTKPGGL---FGTSSFSQPATSTSTGFGFGTSTGTSNSLFGTASTGTS 100
Cdd:NF033849  255 QSHSVGTSESHSVGTSqSQSHTTGHGSTRGWSHTQSTSEsesTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVS 334
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  101 LFSSQNNAFAQNKPTGFGnFGTSTSSGGLFGTT--NTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGV 178
Cdd:NF033849  335 SGTGVSSSHSDGTSQSTS-ISHSESSSESTGTSvgHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGG 413
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  179 STNISTKhqcitamkeyesksleelrlEDYQANRKGPQNQVGGGTTAglfGSSPATSSATGLFSSSTTNSAFSYGQNKTA 258
Cdd:NF033849  414 SEGWGSG--------------------DSVQSVSQSYGSSSSTGTSS---GHSDSSSHSTSSGQADSVSQGTSWSEGTGT 470
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 562154454  259 fgTSTTGFGTNPGglFGQQNQQTTSlFSKPFGQATTTPN-TGFSFGNTSTLGQPSTNTMGlfgvtQASQPGGLFGTA 334
Cdd:NF033849  471 --SQGQSVGTSES--WSTSQSETDS-VGDSTGTSESVSQgDGRSTGRSESQGTSLGTSGG-----RTSGAGGSMGLG 537
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
26-449 2.41e-09

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 61.72  E-value: 2.41e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTSNSLFGTASTGTSLFSSQ 105
Cdd:COG4625    85 GGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGG 164
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTK 185
Cdd:COG4625   165 GGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 244
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  186 HQCITAMKEYESKSLeelrleDYQANRKGPQNQVGGGTTAGLFGSSPATSSATGLFSSSTTNSAFSYGQNKTAFGTSTTG 265
Cdd:COG4625   245 GGGAGGGGGGGGGNG------GGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 318
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  266 FGTNPGGLFGQQNQQTTSLFSKPFGQATTTPNTGFSFGNTSTLGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGT 345
Cdd:COG4625   319 GGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGG 398
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  346 GTGLFGQPNTGFGAVGSTLFGNNKLTTFGTSTTSAPSFGTTSGGLFGFGTNNSGSSIFGSKPAAGTLGTGLGTGFGTALG 425
Cdd:COG4625   399 GGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTL 478
                         410       420
                  ....*....|....*....|....
gi 562154454  426 AGQASLFGNNQPKIGGPLGTGAFG 449
Cdd:COG4625   479 TGNNTYTGTTTVNGGGNYTQSAGS 502
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
44-263 4.26e-08

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 57.37  E-value: 4.26e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454    44 SSNNTGGLFGNSQTKPGGL-FGTSSFSQPatSTSTGFGFGTSTGTSNSlfgTASTGTSLFSSQNNAFAQNKPTGFGnfgt 122
Cdd:pfam15967    2 SGFSFGGGPGSTATAGGGFsFGAAAASNP--GSTGGFSFGTLGAAPAA---TATTTTATLGLGGGLFGQKPATGFT---- 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   123 stssgglFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNIS---TKHQCITAMKEYESKS 199
Cdd:pfam15967   73 -------FGTPASSTAATGPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGgglSLGSVLTSTAAQQGAT 145
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 562154454   200 LEELRLedyqanrkGPQNQVGGGTTAGLFGSSPATSSATGLFSSSTTNSAFSYGQNKTAFGTST 263
Cdd:pfam15967  146 GFTLNL--------GGTPATTTAVSTGLSLGSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATST 201
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
228-465 2.09e-03

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 42.35  E-value: 2.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   228 FGSSPATSSATGlfsssttnSAFSYGQNKTAFGTSTTGFGTNPGGLFGQQNQQTT----SLFSKPFGQattTPNTGFSFG 303
Cdd:pfam15967    6 FGGGPGSTATAG--------GGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTtatlGLGGGLFGQ---KPATGFTFG 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   304 NTSTlgqpstntmglfgVTQASQPGGLFGTATNTSTGTafgtgtglfgqpNTGFGavgstlFGNNK----LTTFGTSTTS 379
Cdd:pfam15967   75 TPAS-------------STAATGPTGLTLGTPAATTAA------------STGFS------LGFNKpaasATPFSLPASS 123
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   380 APSFGTTSGGLFGFGTNNSGSSIF----GSKPAAGTLGTGLGTGFGTALGAGqASLFGNNQPKiggPLGTGAFGAPGFNT 455
Cdd:pfam15967  124 TSGGGLSLGSVLTSTAAQQGATGFtlnlGGTPATTTAVSTGLSLGSTLTSLG-GSLFQNTNST---GLGQTTLGLTLLAT 199
                          250
                   ....*....|
gi 562154454   456 STAILGFGAP 465
Cdd:pfam15967  200 STAPVSAPAA 209
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
72-416 3.67e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 41.53  E-value: 3.67e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   72 ATSTSTGFGFGTSTGTSNSLfgTASTGTSLFSSQNNAFAQNKPTGFG-NFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGP 150
Cdd:NF033849  236 GQSAGTGYGESVGHSTSQGQ--SHSVGTSESHSVGTSQSQSHTTGHGsTRGWSHTQSTSESESTGQSSSVGTSESQSHGT 313
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  151 SSFTAapTGTTIKFNPPTGTDTMVKAGVSTNISTKHQCITAMKEYESKSLeelrledyqanrkGPQNQVGGGTTAGLFGS 230
Cdd:NF033849  314 TEGTS--TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSE-------------STGTSVGHSTSSSVSSS 378
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  231 SPATSSATGLFSSSTTNSAFSYGQNKTAFGTS---TTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTPNtGFSFGNTST 307
Cdd:NF033849  379 ESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASqggSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSH-STSSGQADS 457
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  308 LGQPSTNTMGLfGVTQASQPGGlfgtatntstgtafgtgtglfGQPNTGFGAVGSTlFGNnkltTFGTSTTSAPSFGTTS 387
Cdd:NF033849  458 VSQGTSWSEGT-GTSQGQSVGT---------------------SESWSTSQSETDS-VGD----STGTSESVSQGDGRST 510
                         330       340
                  ....*....|....*....|....*....
gi 562154454  388 GglfgfgtNNSGSSIFGSKPAAGTLGTGL 416
Cdd:NF033849  511 G-------RSESQGTSLGTSGGRTSGAGG 532
34 PHA02584
long tail fiber, proximal subunit; Provisional
25-263 5.88e-03

long tail fiber, proximal subunit; Provisional


Pssm-ID: 222890 [Multi-domain]  Cd Length: 1229  Bit Score: 40.89  E-value: 5.88e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   25 QNTGFGTTSGGAFGTSAFGSSNNTGglfgnsqtkpgglfGTSSFSQPATSTSTGF-GFGTSTGTSNSLFGTASTGTSLFS 103
Cdd:PHA02584  944 QNTSNGTVVVVDETSIAFYSQNNTT--------------GNIVFNIDGTVDPINVnANGTLNATGVATNGRAVYAEGGGI 1009
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  104 SQNNAFAQNKPTGFGNF-GTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNI 182
Cdd:PHA02584 1010 ARTNNAARAITGGFTIRnDGSTTVFLLTAAGDQTGGFNGLKSLIINNANGQVTINDNYIINAGGTIMSGGLTVNSRIRSQ 1089
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  183 STKHQCITAMKEyeskslEELRLEDYQANRKGPQNQVGGGTTAGLFGSSPATSSATGlfsssttNSAFSYGQNKTAFGTS 262
Cdd:PHA02584 1090 GTKASYTRAPTA------DTVGFWSVDINDSATYNQFPGYFQMVTKTKSPGTLTQFG-------NTLDSLYQDWSPDGRT 1156

                  .
gi 562154454  263 T 263
Cdd:PHA02584 1157 T 1157
PPE COG5651
PPE-repeat protein [Function unknown];
233-465 8.66e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 39.88  E-value: 8.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  233 ATSSATGLFSSST-----TNSAFSYGQNKTAFGTSTTGFGTNPGGLFGQQNQQTTSLFS--KPFGQATTTPNTGFSfgnt 305
Cdd:COG5651   157 ASAAAVALTPFTQppptiTNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSgsGPIGLNSGPGNTGFA---- 232
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  306 stlgqpSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGTGTGLFGQPNTGFGAVGSTLFGNNkLTTFGTSTTSAPSFGT 385
Cdd:COG5651   233 ------GTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLA-GSPLGLAGGGAGAAAA 305
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  386 TSGGLFGFGTNNSGSSIFGSKPAAGTLGTGLGTGFGTALGAGQASLFGNNQPKIGGPLGTGAFGAPGFNTSTAILGFGAP 465
Cdd:COG5651   306 TGLGLGAGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
 
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
40-149 1.80e-06

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 47.23  E-value: 1.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454    40 SAFGSSNNT-GGLFGNSQTKP---GGLFGTSSFSQPATSTSTGFGFGTSTGTSNSLFGTASTGTslfssqnnafaqnkpt 115
Cdd:pfam13634    1 GLFGAATSTsGGLFGNTSTTAasgGGLFGAASTATATTSGGGLFGNSSSNAPSGGLFGATNTTT---------------- 64
                           90       100       110
                   ....*....|....*....|....*....|....
gi 562154454   116 gfgnfgTSTSSGGLFGTTNTTSNPfgSTSGSLFG 149
Cdd:pfam13634   65 ------QTATGGGLFGNNAATTTS--TTGGGLFG 90
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
239-332 2.91e-06

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 46.46  E-value: 2.91e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   239 GLFSSSTTNSAFSYGQNKTAFGTSTTGFGTNpgglfgqQNQQTTSLFSKPFGQATTTPNTGFSFGNTSTLGQPSTNTmGL 318
Cdd:pfam13634    1 GLFGAATSTSGGLFGNTSTTAASGGGLFGAA-------STATATTSGGGLFGNSSSNAPSGGLFGATNTTTQTATGG-GL 72
                           90
                   ....*....|....*...
gi 562154454   319 FGVTQASQP----GGLFG 332
Cdd:pfam13634   73 FGNNAATTTsttgGGLFG 90
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
24-484 7.70e-05

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 47.07  E-value: 7.70e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   24 GQNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTSNSLFGTASTGTSLFS 103
Cdd:COG3210   825 TATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTN 904
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  104 SQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNIS 183
Cdd:COG3210   905 AASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSA 984
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  184 TKHQCIT-AMKEYESKSLEELRLEDYQANRKGPQNQVGGGTTAGLFGSSPATSSATGLFSSSTTNSAFSYGQNKTAFGTS 262
Cdd:COG3210   985 GSTGGVIaATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTAS 1064
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  263 TTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTPNTGFSFGNTSTLGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTA 342
Cdd:COG3210  1065 GTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGT 1144
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  343 FGTGTGLFGQPNTGFGAVGSTLFGNNKLTTFGTSTTSAPSFGTTSGGLFGFGTNNSGSSIFGSKPAAGTLGTGLGTGFGT 422
Cdd:COG3210  1145 LTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTNVTTTTTLTA 1224
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 562154454  423 ALGAGQASLFGNNQPKIGGPLGTGAFGAPGFNTSTAILGFGAPQAPVALTDPNASAAQQAVL 484
Cdd:COG3210  1225 SDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGSTV 1286
PPE COG5651
PPE-repeat protein [Function unknown];
31-183 9.33e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 46.04  E-value: 9.33e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   31 TTSGGAFGTSAFGSSNN-------TGGLFGNSQTKPGGL-FGTSSFSQPATSTSTGFgFGTSTGTSNSLFGTASTGTSLF 102
Cdd:COG5651   175 TNPGGLLGAQNAGSGNTssnpgfaNLGLTGLNQVGIGGLnSGSGPIGLNSGPGNTGF-AGTGAAAGAAAAAAAAAAAAGA 253
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  103 SSQNN-----AFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAG 177
Cdd:COG5651   254 GASAAlaslaATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAA 333

                  ....*.
gi 562154454  178 VSTNIS 183
Cdd:COG5651   334 AAAAGA 339
AidA COG3468
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ...
26-375 1.28e-04

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 46.48  E-value: 1.28e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTSNSLFGTASTGTSLFSSQ 105
Cdd:COG3468   100 GTGGGGGGGGSGNGGGGGGGGGGGGTGGGGGGGTGSAGGGGGGGGGGTGVGGTGAAAAGGGTGSGGGGSGGGGGAGGGGG 179
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  106 NNAFAQnkpTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTK 185
Cdd:COG3468   180 GGAGGS---GGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGGGVGGGGGSAGGTGGGGLTGG 256
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  186 HQCITAMKEYESKSLEELRLEDYQANRKGPQNQVGGGTTAGLFGSSPATSSATGLFSSSTTNSAFSYGQNKTAFGTSTTG 265
Cdd:COG3468   257 GAAGTGGGGGGTGTGSGGGGGGGANGGGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGGGGGSNAGGGSGGGGGGGGGG 336
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  266 FGTNPGGLFGQQNQQTTSlfskpfGQATTTPNTGFSFGNTSTLGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGT 345
Cdd:COG3468   337 GGGGTTLNGAGSAGGGTG------AALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDGVGTGLTTGGTGNNGGGGVGG 410
                         330       340       350
                  ....*....|....*....|....*....|
gi 562154454  346 GTGLFGQPNTGFGAVGSTLFGNNKLTTFGT 375
Cdd:COG3468   411 GGGGGLTLTGGTLTVNGNYTGNNGTLVLNT 440
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
272-366 2.63e-04

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 41.06  E-value: 2.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   272 GLFGQQNQQTTSLFSkpfGQATTTPNTGFSFGNTSTLGQPSTNTMGLFGVTQASQPGGLFGTATNtstGTAFGTGTGLFG 351
Cdd:pfam13634    1 GLFGAATSTSGGLFG---NTSTTAASGGGLFGAASTATATTSGGGLFGNSSSNAPSGGLFGATNT---TTQTATGGGLFG 74
                           90
                   ....*....|....*.
gi 562154454   352 Q-PNTGFGAVGSTLFG 366
Cdd:pfam13634   75 NnAATTTSTTGGGLFG 90
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
26-172 6.15e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.97  E-value: 6.15e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTST----------GTSNSLFGTA 95
Cdd:COG3469    52 AASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTvtttstgagsVTSTTSSTAG 131
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 562154454   96 STGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGlfgTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDT 172
Cdd:COG3469   132 STTTSGASATSSAGSTTTTTTVSGTETATGGTT---TTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTT 205
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
5-167 9.60e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 43.20  E-value: 9.60e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454    5 SFGTPFGGSTGGFGTTSTFGQNTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTS 84
Cdd:COG3469    54 SGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGST 133
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   85 TGTSNSLFGTASTGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNpfgSTSGSLFGPSSFTAAPTGTTIKF 164
Cdd:COG3469   134 TTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATA---TTASGATTPSATTTATTTGPPTP 210

                  ...
gi 562154454  165 NPP 167
Cdd:COG3469   211 GLP 213
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
26-401 1.05e-03

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 43.23  E-value: 1.05e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   26 NTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTSTGTSNSLFGTASTGTSLFSSQ 105
Cdd:COG4625   173 GGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGG 252
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  106 NNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTK 185
Cdd:COG4625   253 GGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 332
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  186 hqcitamkeyesksleelrleDYQANRKGPQNQVGGGTTAGLFGSSPATSSATGLFSSSTTNSAFSYGQNKTAFGTSTTG 265
Cdd:COG4625   333 ---------------------GAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGG 391
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  266 FGTNPGGLFGQQNQQTTSLFSKPFGQATTTPNTGFSFGNTSTLGQPSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGT 345
Cdd:COG4625   392 GGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGS 471
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 562154454  346 GTGLFGQPNTGFGAVGSTLFGNNKLTTFGTSTTSAPSFGTTSGGLFGFGTNNSGSS 401
Cdd:COG4625   472 GAGTLTLTGNNTYTGTTTVNGGGNYTQSAGSTLAVEVDAANSDRLVVTGTATLNGG 527
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
27-182 1.95e-03

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 42.35  E-value: 1.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454    27 TGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTST-----------------GFGFGT------ 83
Cdd:pfam15967    2 SGFSFGGGPGSTATAGGGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTtatlglggglfgqkpatGFTFGTpassta 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454    84 STGTSNSLFGTASTGTSlfSSQNNAFAQNKPTG----FGNFGTSTSSGGL-FGTTNTTSNPFGSTSGSLFG----PSSFT 154
Cdd:pfam15967   82 ATGPTGLTLGTPAATTA--ASTGFSLGFNKPAAsatpFSLPASSTSGGGLsLGSVLTSTAAQQGATGFTLNlggtPATTT 159
                          170       180
                   ....*....|....*....|....*...
gi 562154454   155 AAPTGTTIKFNPPTGTDTMVKAGVSTNI 182
Cdd:pfam15967  160 AVSTGLSLGSTLTSLGGSLFQNTNSTGL 187
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
228-465 2.09e-03

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 42.35  E-value: 2.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   228 FGSSPATSSATGlfsssttnSAFSYGQNKTAFGTSTTGFGTNPGGLFGQQNQQTT----SLFSKPFGQattTPNTGFSFG 303
Cdd:pfam15967    6 FGGGPGSTATAG--------GGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTtatlGLGGGLFGQ---KPATGFTFG 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   304 NTSTlgqpstntmglfgVTQASQPGGLFGTATNTSTGTafgtgtglfgqpNTGFGavgstlFGNNK----LTTFGTSTTS 379
Cdd:pfam15967   75 TPAS-------------STAATGPTGLTLGTPAATTAA------------STGFS------LGFNKpaasATPFSLPASS 123
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   380 APSFGTTSGGLFGFGTNNSGSSIF----GSKPAAGTLGTGLGTGFGTALGAGqASLFGNNQPKiggPLGTGAFGAPGFNT 455
Cdd:pfam15967  124 TSGGGLSLGSVLTSTAAQQGATGFtlnlGGTPATTTAVSTGLSLGSTLTSLG-GSLFQNTNST---GLGQTTLGLTLLAT 199
                          250
                   ....*....|
gi 562154454   456 STAILGFGAP 465
Cdd:pfam15967  200 STAPVSAPAA 209
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
72-416 3.67e-03

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 41.53  E-value: 3.67e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   72 ATSTSTGFGFGTSTGTSNSLfgTASTGTSLFSSQNNAFAQNKPTGFG-NFGTSTSSGGLFGTTNTTSNPFGSTSGSLFGP 150
Cdd:NF033849  236 GQSAGTGYGESVGHSTSQGQ--SHSVGTSESHSVGTSQSQSHTTGHGsTRGWSHTQSTSESESTGQSSSVGTSESQSHGT 313
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  151 SSFTAapTGTTIKFNPPTGTDTMVKAGVSTNISTKHQCITAMKEYESKSLeelrledyqanrkGPQNQVGGGTTAGLFGS 230
Cdd:NF033849  314 TEGTS--TTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSE-------------STGTSVGHSTSSSVSSS 378
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  231 SPATSSATGLFSSSTTNSAFSYGQNKTAFGTS---TTGFGTNPGGLFGQQNQQTTSLFSKPFGQATTTPNtGFSFGNTST 307
Cdd:NF033849  379 ESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASqggSEGWGSGDSVQSVSQSYGSSSSTGTSSGHSDSSSH-STSSGQADS 457
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  308 LGQPSTNTMGLfGVTQASQPGGlfgtatntstgtafgtgtglfGQPNTGFGAVGSTlFGNnkltTFGTSTTSAPSFGTTS 387
Cdd:NF033849  458 VSQGTSWSEGT-GTSQGQSVGT---------------------SESWSTSQSETDS-VGD----STGTSESVSQGDGRST 510
                         330       340
                  ....*....|....*....|....*....
gi 562154454  388 GglfgfgtNNSGSSIFGSKPAAGTLGTGL 416
Cdd:NF033849  511 G-------RSESQGTSLGTSGGRTSGAGG 532
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
25-93 4.98e-03

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 37.21  E-value: 4.98e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 562154454    25 QNTGFGTTSGGAFGTSAFGSSNNTGG-LFGNSQTKP--GGLFGTSSFSQPATSTSTGFGFGTSTGTSN---SLFG 93
Cdd:pfam13634   16 NTSTTAASGGGLFGAASTATATTSGGgLFGNSSSNApsGGLFGATNTTTQTATGGGLFGNNAATTTSTtggGLFG 90
34 PHA02584
long tail fiber, proximal subunit; Provisional
25-263 5.88e-03

long tail fiber, proximal subunit; Provisional


Pssm-ID: 222890 [Multi-domain]  Cd Length: 1229  Bit Score: 40.89  E-value: 5.88e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   25 QNTGFGTTSGGAFGTSAFGSSNNTGglfgnsqtkpgglfGTSSFSQPATSTSTGF-GFGTSTGTSNSLFGTASTGTSLFS 103
Cdd:PHA02584  944 QNTSNGTVVVVDETSIAFYSQNNTT--------------GNIVFNIDGTVDPINVnANGTLNATGVATNGRAVYAEGGGI 1009
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  104 SQNNAFAQNKPTGFGNF-GTSTSSGGLFGTTNTTSNPFGSTSGSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNI 182
Cdd:PHA02584 1010 ARTNNAARAITGGFTIRnDGSTTVFLLTAAGDQTGGFNGLKSLIINNANGQVTINDNYIINAGGTIMSGGLTVNSRIRSQ 1089
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  183 STKHQCITAMKEyeskslEELRLEDYQANRKGPQNQVGGGTTAGLFGSSPATSSATGlfsssttNSAFSYGQNKTAFGTS 262
Cdd:PHA02584 1090 GTKASYTRAPTA------DTVGFWSVDINDSATYNQFPGYFQMVTKTKSPGTLTQFG-------NTLDSLYQDWSPDGRT 1156

                  .
gi 562154454  263 T 263
Cdd:PHA02584 1157 T 1157
PTZ00473 PTZ00473
Plasmodium Vir superfamily; Provisional
26-100 6.66e-03

Plasmodium Vir superfamily; Provisional


Pssm-ID: 240430 [Multi-domain]  Cd Length: 420  Bit Score: 40.22  E-value: 6.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454   26 NTGFGTTSGGAF--------------GTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFgFGTSTGTSNSL 91
Cdd:PTZ00473  315 RGPYNANYGGQFnsrsgrtgssesirGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQS-GGGSTYGGSST 393

                  ....*....
gi 562154454   92 FGTASTGTS 100
Cdd:PTZ00473  394 FDGSSRGSS 402
PPE COG5651
PPE-repeat protein [Function unknown];
233-465 8.66e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 39.88  E-value: 8.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  233 ATSSATGLFSSST-----TNSAFSYGQNKTAFGTSTTGFGTNPGGLFGQQNQQTTSLFS--KPFGQATTTPNTGFSfgnt 305
Cdd:COG5651   157 ASAAAVALTPFTQppptiTNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSgsGPIGLNSGPGNTGFA---- 232
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  306 stlgqpSTNTMGLFGVTQASQPGGLFGTATNTSTGTAFGTGTGLFGQPNTGFGAVGSTLFGNNkLTTFGTSTTSAPSFGT 385
Cdd:COG5651   233 ------GTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLA-GSPLGLAGGGAGAAAA 305
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 562154454  386 TSGGLFGFGTNNSGSSIFGSKPAAGTLGTGLGTGFGTALGAGQASLFGNNQPKIGGPLGTGAFGAPGFNTSTAILGFGAP 465
Cdd:COG5651   306 TGLGLGAGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
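
For readers who want to work with these hits programmatically, the following is a minimal sketch, not an NCBI tool, of how the per-hit summary lines on this page (the ones starting with "Pssm-ID:") could be parsed and checked against the E-value threshold listed in the search parameters. The names `SUMMARY_RE`, `parse_summary` and `passes_threshold` are hypothetical, and treating the threshold as inclusive is an assumption, since the page does not say how the cutoff is applied.

```python
import re

# Hypothetical pattern for the per-hit summary line as it appears on this page,
# e.g. "Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 46.46  E-value: 2.91e-06".
# The bracketed tag is optional because some hits carry no "[Multi-domain]" marker.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("tag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 46.46  E-value: 2.91e-06"
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

Every hit listed on this page has an E-value below 0.01, which is consistent with the threshold having been applied when the results were generated.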
