|
Name |
Accession |
Description |
Interval |
E-value |
| 2a57 |
TIGR00939 |
Equilibrative Nucleoside Transporter (ENT); [Transport and binding proteins, Nucleosides, ... |
361-720 |
3.95e-11 |
|
Equilibrative Nucleoside Transporter (ENT); [Transport and binding proteins, Nucleosides, purines and pyrimidines]
Pssm-ID: 273352 Cd Length: 437 Bit Score: 65.89 E-value: 3.95e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 361 FFWGLCSLLSWNTIVAAMPYIQTL----------------PVPNIGNVVTMCSLIPLLFGTCLDAL--FDFNMILMHDIS 422
Cdd:TIGR00939 2 FMLGVGVLLPWNAFITAPQYFIEYykyaqnipeaipssskMWKHFNTYYTLASQLPSLLFNSLNLFliFRIPVTVRLLGG 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 423 AAINLSIFILVLMTQKAICSPYVIVGFVAAAGLVNG---ALMGTTSRIVLKNFGSR-LQPFFAGQAISAIFTTSLSVLFY 498
Cdd:TIGR00939 82 LVILLVVVILVMVLVKVQTSETGFFVTTMASVVIINsgmALLQGSLFGLAGVFPSTySSAVMSGQGLAGVLTSLAMILVK 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 499 AFNLDS----------YVVACIMLVLGCVLELISAWITYAKF----------------------SEVIKLPR-------- 538
Cdd:TIGR00939 162 ASGNDShglkksalgyFGTPCVVQLICIVCYLLLPKLPFARYylqkkldkgagedetkgelrskAEQNGIPHggdqpspt 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 539 LQKFSTFVNQTKEQMTGLKKGLVEA-FSMNKTTMVSVILMFVFVTMYFPAVIADFTYEMN---DAWSKCSLFLAFAMGDY 614
Cdd:TIGR00939 242 LVLDWEKEPESPDEPQKPLKTSVWVvFTKVWLLAFSVVFVFTVTLSVFPAITTAVTSSGLglsNWFYPIICFLLFNLFDW 321
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 615 ---TIKSCVMILKKKPRSALIILFLtNSILILILFLCNVKPSylHFFQPLIKHYVPMYMVCYLFGAVH----SFNMLLIT 687
Cdd:TIGR00939 322 lgrSLTSKFMWPDEDSRWLPILSFL-RVLFIPLFLLCNYPQR--SRLPVFFPGDAYFIILMLLFGFSNgylgSLSMCLAP 398
|
410 420 430
....*....|....*....|....*....|....
gi 422933831 688 ANIKTHTSH-AGSLLQMCIYIGLGLGSIISLGFR 720
Cdd:TIGR00939 399 RQVDPHEREvAGALMVIFLLVGLALGAVLSFLFV 432
|
|
| Nucleoside_tran |
pfam01733 |
Nucleoside transporter; This is a family of nucleoside transporters. In mammalian cells ... |
451-720 |
5.05e-08 |
|
Nucleoside transporter; This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their own de novo synthesis pathways. Also in this family is mouse and human nucleolar protein HNP36, a protein of unknown function; although it has been hypothesized to be a plasma membrane nucleoside transporter.
Pssm-ID: 396340 Cd Length: 286 Bit Score: 55.12 E-value: 5.05e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 451 AAAGLVNGALMGttsriVLKNFGSR-LQPFFAGQAISAIFTTSLSVLFYAFNLDS-------YVVACIMLVLGCVLELIS 522
Cdd:pfam01733 9 SFNALLQNSLFG-----LAGVFPSTyTQAVMSGQGLAGVLVSVVMILTKASNSDLsesallyFGTSIVVVIVCAICLNVL 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 523 AWITYAKFSEVIKLprlqKFSTFVNQtKEQMTGLKKGLVEAFSMNKTTMVSVILMFVFVTMYFPAVIADFTYEMNDAWS- 601
Cdd:pfam01733 84 KKLRFYKYHWQLKL----EGPTFDNS-DEQEENLKVLFFVILAKIKVLALSVVLIFTVTLSVFPAVTAEVVSSLPTWEAy 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 602 --KCSLFLAFAMGDY---TIKSCVMILKKKPRsALIILFLTNSILILILFLCNVKPSYlHFFQPLIKHYVPMYMVCYLFG 676
Cdd:pfam01733 159 fiPVLCFLLFNLFDWlgrSLAAWFMWPGQDPR-WLPVLVILRFLFIPLFLLCNYKPQS-RALPVLFESDAWFIIFMCLFA 236
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 422933831 677 AVH----SFNMLLITANIKTHTSH-AGSLLQMCIYIGLGLGSIISLGFR 720
Cdd:pfam01733 237 FSNgylsSLCMCYAPKSVPPHESEtAGAIMVFFLCLGLALGAVISFVFR 285
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
60-241 |
1.95e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.83 E-value: 1.95e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 60 NVSSVSVLTTTIGTSSVVASSTTDQGHGTSSGVSAATSGhNNGSTSTTVATTTSATATTTTTTSTTPAATTESSTTGASS 139
Cdd:COG3469 31 SVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASST-AATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGT 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 140 ASTSDASTTGTTAASTSSATTTVAPVASTSSASTTVAPVASTSSASTTVAPVASTSSASTTVATGASTTVASSTTGTSAN 219
Cdd:COG3469 110 STVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATT 189
|
170 180
....*....|....*....|..
gi 422933831 220 TTSASTLEPSSTTTKRTPSASS 241
Cdd:COG3469 190 ASGATTPSATTTATTTGPPTPG 211
|
|
| PRK10905 |
PRK10905 |
cell division protein DamX; Validated |
162-240 |
2.65e-05 |
|
cell division protein DamX; Validated
Pssm-ID: 236792 [Multi-domain] Cd Length: 328 Bit Score: 46.85 E-value: 2.65e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 162 VAPVASTSSASTTVAPVASTSS--ASTTVAPVASTSSASTTVATGASTTVASSTTGTSANTTSASTLEPSSTTTKRTPSA 239
Cdd:PRK10905 176 PKPVAQTPKRTEPAAPVASTKApaATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKSAPSSHYTLQLSSS 255
|
.
gi 422933831 240 S 240
Cdd:PRK10905 256 S 256
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
164-241 |
5.17e-05 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 44.56 E-value: 5.17e-05
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 422933831 164 PVASTSSASTTVAPVASTSSASTTVAPVASTssastTVATGASTTVASSTTGTSANTTSA-STLEPSSTTTKRTPSASS 241
Cdd:pfam09595 103 PLEPAAKTKPSEHEPANPPDASNRLSPPDAS-----TAAIREARTFRKPSTGKRNNPSSAqSDQSPPRANHEAIGRANP 176
|
|
| SP4_N |
cd22536 |
N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins ... |
166-241 |
1.52e-03 |
|
N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. Human SP4 is a risk gene of multiple psychiatric disorders including schizophrenia, bipolar disorder, and major depression. SP4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP4.
Pssm-ID: 411773 [Multi-domain] Cd Length: 623 Bit Score: 41.83 E-value: 1.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 166 ASTSSASTTVAPVASTSSASTTVAPVASTSSASTTVATG---ASTTVASSTTG--TSANTTSASTLEPSSTTTKRTPSAS 240
Cdd:cd22536 268 GGVSNGNQLVSTPITTASVSTMPESPSSSTTCTTTASTSltsSDTLVSSAETGqyASTAASSERTEEEPQTSAAESEAQS 347
|
.
gi 422933831 241 S 241
Cdd:cd22536 348 S 348
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
69-240 |
7.42e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 39.60 E-value: 7.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 69 TTIGTSSVVASSTTD-------QGHGTSSGVSAATS-----GHNNGSTSTTVATTTSATATTTTTTSTTPAATTESSTTG 136
Cdd:NF033849 313 TTEGTSTTDSSSHSQsssynvsSGTGVSSSHSDGTSqstsiSHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGG 392
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 137 ASSASTSDASTTGTTAAS--------TSSATTTVAPVASTSSASTTVAPVASTSSASTTVApVASTSSASTTVATGASTT 208
Cdd:NF033849 393 FSGGIAGGGVTSEGLGASqggsegwgSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSG-QADSVSQGTSWSEGTGTS 471
|
170 180 190
....*....|....*....|....*....|..
gi 422933831 209 VASSTTGTSANTTSASTLEPSSTTTKRTPSAS 240
Cdd:NF033849 472 QGQSVGTSESWSTSQSETDSVGDSTGTSESVS 503
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| 2a57 |
TIGR00939 |
Equilibrative Nucleoside Transporter (ENT); [Transport and binding proteins, Nucleosides, ... |
361-720 |
3.95e-11 |
|
Equilibrative Nucleoside Transporter (ENT); [Transport and binding proteins, Nucleosides, purines and pyrimidines]
Pssm-ID: 273352 Cd Length: 437 Bit Score: 65.89 E-value: 3.95e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 361 FFWGLCSLLSWNTIVAAMPYIQTL----------------PVPNIGNVVTMCSLIPLLFGTCLDAL--FDFNMILMHDIS 422
Cdd:TIGR00939 2 FMLGVGVLLPWNAFITAPQYFIEYykyaqnipeaipssskMWKHFNTYYTLASQLPSLLFNSLNLFliFRIPVTVRLLGG 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 423 AAINLSIFILVLMTQKAICSPYVIVGFVAAAGLVNG---ALMGTTSRIVLKNFGSR-LQPFFAGQAISAIFTTSLSVLFY 498
Cdd:TIGR00939 82 LVILLVVVILVMVLVKVQTSETGFFVTTMASVVIINsgmALLQGSLFGLAGVFPSTySSAVMSGQGLAGVLTSLAMILVK 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 499 AFNLDS----------YVVACIMLVLGCVLELISAWITYAKF----------------------SEVIKLPR-------- 538
Cdd:TIGR00939 162 ASGNDShglkksalgyFGTPCVVQLICIVCYLLLPKLPFARYylqkkldkgagedetkgelrskAEQNGIPHggdqpspt 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 539 LQKFSTFVNQTKEQMTGLKKGLVEA-FSMNKTTMVSVILMFVFVTMYFPAVIADFTYEMN---DAWSKCSLFLAFAMGDY 614
Cdd:TIGR00939 242 LVLDWEKEPESPDEPQKPLKTSVWVvFTKVWLLAFSVVFVFTVTLSVFPAITTAVTSSGLglsNWFYPIICFLLFNLFDW 321
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 615 ---TIKSCVMILKKKPRSALIILFLtNSILILILFLCNVKPSylHFFQPLIKHYVPMYMVCYLFGAVH----SFNMLLIT 687
Cdd:TIGR00939 322 lgrSLTSKFMWPDEDSRWLPILSFL-RVLFIPLFLLCNYPQR--SRLPVFFPGDAYFIILMLLFGFSNgylgSLSMCLAP 398
|
410 420 430
....*....|....*....|....*....|....
gi 422933831 688 ANIKTHTSH-AGSLLQMCIYIGLGLGSIISLGFR 720
Cdd:TIGR00939 399 RQVDPHEREvAGALMVIFLLVGLALGAVLSFLFV 432
|
|
| Nucleoside_tran |
pfam01733 |
Nucleoside transporter; This is a family of nucleoside transporters. In mammalian cells ... |
451-720 |
5.05e-08 |
|
Nucleoside transporter; This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their own de novo synthesis pathways. Also in this family is mouse and human nucleolar protein HNP36, a protein of unknown function; although it has been hypothesized to be a plasma membrane nucleoside transporter.
Pssm-ID: 396340 Cd Length: 286 Bit Score: 55.12 E-value: 5.05e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 451 AAAGLVNGALMGttsriVLKNFGSR-LQPFFAGQAISAIFTTSLSVLFYAFNLDS-------YVVACIMLVLGCVLELIS 522
Cdd:pfam01733 9 SFNALLQNSLFG-----LAGVFPSTyTQAVMSGQGLAGVLVSVVMILTKASNSDLsesallyFGTSIVVVIVCAICLNVL 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 523 AWITYAKFSEVIKLprlqKFSTFVNQtKEQMTGLKKGLVEAFSMNKTTMVSVILMFVFVTMYFPAVIADFTYEMNDAWS- 601
Cdd:pfam01733 84 KKLRFYKYHWQLKL----EGPTFDNS-DEQEENLKVLFFVILAKIKVLALSVVLIFTVTLSVFPAVTAEVVSSLPTWEAy 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 602 --KCSLFLAFAMGDY---TIKSCVMILKKKPRsALIILFLTNSILILILFLCNVKPSYlHFFQPLIKHYVPMYMVCYLFG 676
Cdd:pfam01733 159 fiPVLCFLLFNLFDWlgrSLAAWFMWPGQDPR-WLPVLVILRFLFIPLFLLCNYKPQS-RALPVLFESDAWFIIFMCLFA 236
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 422933831 677 AVH----SFNMLLITANIKTHTSH-AGSLLQMCIYIGLGLGSIISLGFR 720
Cdd:pfam01733 237 FSNgylsSLCMCYAPKSVPPHESEtAGAIMVFFLCLGLALGAVISFVFR 285
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
60-241 |
1.95e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.83 E-value: 1.95e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 60 NVSSVSVLTTTIGTSSVVASSTTDQGHGTSSGVSAATSGhNNGSTSTTVATTTSATATTTTTTSTTPAATTESSTTGASS 139
Cdd:COG3469 31 SVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASST-AATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGT 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 140 ASTSDASTTGTTAASTSSATTTVAPVASTSSASTTVAPVASTSSASTTVAPVASTSSASTTVATGASTTVASSTTGTSAN 219
Cdd:COG3469 110 STVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATT 189
|
170 180
....*....|....*....|..
gi 422933831 220 TTSASTLEPSSTTTKRTPSASS 241
Cdd:COG3469 190 ASGATTPSATTTATTTGPPTPG 211
|
|
| PRK10905 |
PRK10905 |
cell division protein DamX; Validated |
162-240 |
2.65e-05 |
|
cell division protein DamX; Validated
Pssm-ID: 236792 [Multi-domain] Cd Length: 328 Bit Score: 46.85 E-value: 2.65e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 162 VAPVASTSSASTTVAPVASTSS--ASTTVAPVASTSSASTTVATGASTTVASSTTGTSANTTSASTLEPSSTTTKRTPSA 239
Cdd:PRK10905 176 PKPVAQTPKRTEPAAPVASTKApaATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKSAPSSHYTLQLSSS 255
|
.
gi 422933831 240 S 240
Cdd:PRK10905 256 S 256
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
60-247 |
3.03e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.05 E-value: 3.03e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 60 NVSSVSVLTTTIGTSSVVASSTTDQGHGTSSGVSAATSGHNNGSTSTTVATTTSATATTTTTTSTTPAATTESSTTGASS 139
Cdd:COG3469 17 ATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATL 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 140 ASTSDASTTGTTAASTSsatttVAPVASTSSASTTVAPVASTSSASTTVAPVASTSSASTTVATGASTTVASSTTGTSAN 219
Cdd:COG3469 97 VATSTASGANTGTSTVT-----TTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTT 171
|
170 180
....*....|....*....|....*...
gi 422933831 220 TTSASTLEPSSTTTKRTPSASSTTSKTP 247
Cdd:COG3469 172 TTSASTTPSATTTATATTASGATTPSAT 199
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
169-238 |
3.10e-05 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 46.05 E-value: 3.10e-05
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 422933831 169 SSASTTVAPVASTSSAST--TVAPVASTSSASTTVATGASTTVASSTTGTSANTTSASTLEPSSTTTKRTPS 238
Cdd:PHA03255 82 TSTGTTVTPVPTTSNASTinVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLS 153
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
19-235 |
3.44e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.05 E-value: 3.44e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 19 TNSTLAASTSQVTTASQATLSTTATLPPTTASTAATDTTTTNVSSVSVLTTTIGTSSVVASSTTDQGHGTSSGVSAATSG 98
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 99 HNNGSTSTTVATTTSATATTTTTTSTTPAATTESSTTGASSASTSDASTTGTTAASTSSATTTVAPVASTSSASTTVAPV 178
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETAT 160
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 422933831 179 ASTSSASTTVAPvaSTSSASTTVATGASTTVASSTTGTSANTTSASTLEPSSTTTKR 235
Cdd:COG3469 161 GGTTTTSTTTTT--TSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
164-241 |
5.17e-05 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 44.56 E-value: 5.17e-05
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 422933831 164 PVASTSSASTTVAPVASTSSASTTVAPVASTssastTVATGASTTVASSTTGTSANTTSA-STLEPSSTTTKRTPSASS 241
Cdd:pfam09595 103 PLEPAAKTKPSEHEPANPPDASNRLSPPDAS-----TAAIREARTFRKPSTGKRNNPSSAqSDQSPPRANHEAIGRANP 176
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
13-226 |
7.47e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.90 E-value: 7.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 13 TQATLATNSTLAASTSQVTTASQATlsttatlppTTASTAATDTTTTNVSSVSVLTTTIGTSSVVASSTTDQGHGTSSGV 92
Cdd:COG3469 14 GASATAVTLLGAAATAASVTLTAAT---------ATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATA 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 93 SAATSGHNNGSTSTTVATTTSATATTTTTTSTTPAATTeSSTTGASSASTSDASTTGTTAASTSSATTTVAPVASTSSAS 172
Cdd:COG3469 85 AAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSV-TSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGT 163
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 422933831 173 TTVAPVASTSSASTTVAPVASTSSASTTVATGASTTVASSTTGTSANTTSASTL 226
Cdd:COG3469 164 TTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKHVL 217
|
|
| PRK12495 |
PRK12495 |
hypothetical protein; Provisional |
163-269 |
7.91e-05 |
|
hypothetical protein; Provisional
Pssm-ID: 183558 [Multi-domain] Cd Length: 226 Bit Score: 44.47 E-value: 7.91e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 163 APVASTSSASTTVAPVA-STSSASTTVAPVASTSSASTTVATGASTTVASSTTGTSANTTSASTLEPSSTTTKRTPSASS 241
Cdd:PRK12495 82 AEATAPSDAGSQASPDDdAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAAARDGPTPDPTAQPATPDERRSPRQRP 161
|
90 100
....*....|....*....|....*...
gi 422933831 242 TTSKTPVTVTTtvqtdQGQQGDGDVQPA 269
Cdd:PRK12495 162 PVSGEPPTPST-----PDAHVAGTLQAA 184
|
|
| PRK11907 |
PRK11907 |
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase; |
162-234 |
4.72e-04 |
|
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
Pssm-ID: 237019 [Multi-domain] Cd Length: 814 Bit Score: 43.69 E-value: 4.72e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 422933831 162 VAPVASTSSASTTVAPVAS--TSSASTTVAPVASTSS--ASTTVATGASTTVASSTTgTSANTTSASTLEPSSTTTK 234
Cdd:PRK11907 32 VTTTPATSTEAEQTTPVESdaTEEADNTETPVAATTAaeAPSSSETAETSDPTSEAT-DTTTSEARTVTPAATETSK 107
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
163-224 |
9.90e-04 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 41.94 E-value: 9.90e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 422933831 163 APVASTSSASTTVAPVASTSSASTTVAPV---ASTSSASTTVATGASTTVASSTTGTSANTTSAS 224
Cdd:PRK10856 185 TPTNSQTPAVATAPAPAVDPQQNAVVAPSqanVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAA 249
|
|
| SP4_N |
cd22536 |
N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins ... |
166-241 |
1.52e-03 |
|
N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. Human SP4 is a risk gene of multiple psychiatric disorders including schizophrenia, bipolar disorder, and major depression. SP4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP4.
Pssm-ID: 411773 [Multi-domain] Cd Length: 623 Bit Score: 41.83 E-value: 1.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 166 ASTSSASTTVAPVASTSSASTTVAPVASTSSASTTVATG---ASTTVASSTTG--TSANTTSASTLEPSSTTTKRTPSAS 240
Cdd:cd22536 268 GGVSNGNQLVSTPITTASVSTMPESPSSSTTCTTTASTSltsSDTLVSSAETGqyASTAASSERTEEEPQTSAAESEAQS 347
|
.
gi 422933831 241 S 241
Cdd:cd22536 348 S 348
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
167-240 |
2.76e-03 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 40.39 E-value: 2.76e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 422933831 167 STSSASTTVAPVASTSSASTTVAPVASTSSASTTVATGASTTVASSTTGTSANTTSASTLEPSSTTTKRTPSAS 240
Cdd:PRK10856 168 TTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQ 241
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
61-241 |
3.56e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 40.51 E-value: 3.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 61 VSSVSVLTTTIGTSSVVASSTtdqghGTSSGVSAATSGHNNGSTSTTVATTTSATATTTTTTSTTPAATTESSTTGASSA 140
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVT-----LLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSST 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 141 STSDASTTGTTAASTSSATTTVAPVA------STSSASTTVAPVASTSSASTTVAPVASTSSASTTVATGASTTVASS-- 212
Cdd:COG3469 76 TSTTATATAAAAAATSTSATLVATSTasgantGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVsg 155
|
170 180 190
....*....|....*....|....*....|....*.
gi 422933831 213 -------TTGTSANTTSASTLEPSSTTTKRTPSASS 241
Cdd:COG3469 156 tetatggTTTTSTTTTTTSASTTPSATTTATATTAS 191
|
|
| SP4_N |
cd22536 |
N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins ... |
162-241 |
4.67e-03 |
|
N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. Human SP4 is a risk gene of multiple psychiatric disorders including schizophrenia, bipolar disorder, and major depression. SP4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP4.
Pssm-ID: 411773 [Multi-domain] Cd Length: 623 Bit Score: 40.29 E-value: 4.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 162 VAPVASTSSASTTVAPVASTSSASTTvapVASTSSASTTVATGASTTVASSTTGTSANTTSastlEPSSTTTKRTPSASS 241
Cdd:cd22536 277 VSTPITTASVSTMPESPSSSTTCTTT---ASTSLTSSDTLVSSAETGQYASTAASSERTEE----EPQTSAAESEAQSSS 349
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
15-241 |
7.19e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 39.56 E-value: 7.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 15 ATLATNSTLAASTSQVTTASQATLSTTATLPPTTASTAATDTTTTNVSSVSVLTTTIGTSSVVASSTTDQGHGTSSGVSA 94
Cdd:pfam17823 79 AHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAP 158
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 95 ATSghnngsTSTTVATTTSATATTTTTTSTTPAATTESSTTGASSASTSDASTTGTTAASTSSATTTVAPVASTSSAST- 173
Cdd:pfam17823 159 RAA------IAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVg 232
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 422933831 174 TVAPVASTSSASTTVAPVASTSSASTTVATGASTTVASSTTGTSANTTSASTLEPSSTTTkRTPSASS 241
Cdd:pfam17823 233 NSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMA-RNPAAPM 299
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
163-230 |
7.35e-03 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 39.24 E-value: 7.35e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 422933831 163 APVASTSSASTT-----VAPVASTSSASTTVAPVASTSSASTTVATGASTTVASSTTGTSANTTSASTLEPSS 230
Cdd:PRK10856 177 APAAPVDTTPTNsqtpaVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAA 249
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
69-240 |
7.42e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 39.60 E-value: 7.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 69 TTIGTSSVVASSTTD-------QGHGTSSGVSAATS-----GHNNGSTSTTVATTTSATATTTTTTSTTPAATTESSTTG 136
Cdd:NF033849 313 TTEGTSTTDSSSHSQsssynvsSGTGVSSSHSDGTSqstsiSHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGG 392
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 137 ASSASTSDASTTGTTAAS--------TSSATTTVAPVASTSSASTTVAPVASTSSASTTVApVASTSSASTTVATGASTT 208
Cdd:NF033849 393 FSGGIAGGGVTSEGLGASqggsegwgSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSG-QADSVSQGTSWSEGTGTS 471
|
170 180 190
....*....|....*....|....*....|..
gi 422933831 209 VASSTTGTSANTTSASTLEPSSTTTKRTPSAS 240
Cdd:NF033849 472 QGQSVGTSESWSTSQSETDSVGDSTGTSESVS 503
|
|
| motB |
PRK12799 |
flagellar motor protein MotB; Reviewed |
164-273 |
7.58e-03 |
|
flagellar motor protein MotB; Reviewed
Pssm-ID: 183756 [Multi-domain] Cd Length: 421 Bit Score: 39.31 E-value: 7.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 422933831 164 PVASTSSASTTVAPVASTSSASTTVAPVASTSSASTtvaTGASTTVASSTTGTSANTTSASTLEPSSTTTKRTPSASSTT 243
Cdd:PRK12799 300 PVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTT---QSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQP 376
|
90 100 110
....*....|....*....|....*....|
gi 422933831 244 SKTPVTVTTtvqtdqgQQGDGDVQPAKNDP 273
Cdd:PRK12799 377 QPMSTTETQ-------QSSTGNITSTANGP 399
|
|
| PRK11907 |
PRK11907 |
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase; |
165-240 |
8.38e-03 |
|
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
Pssm-ID: 237019 [Multi-domain] Cd Length: 814 Bit Score: 39.45 E-value: 8.38e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 422933831 165 VASTSSASTTVAPVASTSSASTTVAPVASTSsASTTVATGASTTVASSTTGTSANTTSASTLEPSSTTTKRTPSAS 240
Cdd:PRK11907 11 VALTLALLTASNPKLAQAEEIVTTTPATSTE-AEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAETSDPTS 85
|
|
| PRK13042 |
PRK13042 |
superantigen-like protein SSL4; Reviewed; |
165-238 |
9.25e-03 |
|
superantigen-like protein SSL4; Reviewed;
Pssm-ID: 183854 [Multi-domain] Cd Length: 291 Bit Score: 38.85 E-value: 9.25e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 422933831 165 VASTSSASTTVAP---VASTSSASTTVAPVASTSSASTTvaTGASTTVASSTTGTSANTTSASTLE-PSSTTTKRTPS 238
Cdd:PRK13042 22 TTTTQAANATTPSstkVEAPQSTPPSTKVEAPQSKPNAT--TPPSTKVEAPQQTPNATTPSSTKVEtPQSPTTKQVPT 97
|
|
|