Conserved domains on [gi|578830424|ref|XP_005257116|]

collagen alpha-1(I) chain isoform X3 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
COLFI pfam01410
Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
921-1157 2.09e-159

Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.



Pssm-ID: 460199  Cd Length: 233  Bit Score: 472.60  E-value: 2.09e-159
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424   921 RDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDLKMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPS 1000
Cdd:pfam01410    1 RDEEVMATLKSLSQQIENIRSPDGSKKNPARTCRDLKLCHPDWKSGEYWIDPNQGCTRDAIKVFCNFETGETCIYPTKAS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  1001 VAQKNWYISKNPkdkrHVWFGESMTDGFQFEYGGQGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQTGNLKK 1080
Cdd:pfam01410   81 IPRKNWWTKESK----HVWFGEFMNGGSQFSYGVDGVGPSVAAVQLTFLRLLSTEASQNITYHCKNSVAYMDQATGNLKK 156
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 578830424  1081 ALLLQGSNEIEIRAEGNSRFTYSVTVDGCTSHTGAWGKTVIEYKTTKTSRLPIIDVAPLDVGAPDQEFGFDVGPVCF 1157
Cdd:pfam01410  157 ALLLQGSNDEEIRAEGNSRFTYTVLEDGCTKRTGQWGKTVIEYRTQKVSRLPIVDIAPMDIGGADQEFGVEVGPVCF 233
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
233-458 3.17e-30

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 125.40  E-value: 3.17e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  233 GEAGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGENGAPGQMGPRGLPGErgr 312
Cdd:NF038329  120 GEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP--- 196
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  313 pgapgpagpageRGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDlgapgpsGARGERGFPGERGVQGPPGPAGPRG 392
Cdd:NF038329  197 ------------RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-------GQQGPDGDPGPTGEDGPQGPDGPAG 257
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 578830424  393 ANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRG 458
Cdd:NF038329  258 KDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPG 323
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
602-806 7.66e-22

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 99.98  E-value: 7.66e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  602 GPRGETGPAgrpgevgppGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGP 681
Cdd:NF038329  117 GEKGEPGPA---------GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGP 187
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  682 SGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSPGaKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRG 761
Cdd:NF038329  188 AGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRG 266
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 578830424  762 ETGPAGPAGPVGPVGARGPAGPQGPRGDKGETGEQGDRGIKGHRG 806
Cdd:NF038329  267 EAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDG 311
VWC pfam00093
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ...
40-95 1.18e-21

von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.



Pssm-ID: 278520  Cd Length: 57  Bit Score: 89.02  E-value: 1.18e-21
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 578830424    40 CVQNGLRYHDRDVWKPEPCRICVCDNGKVLCDDVICdETKNCPGA--EVPEGECCPVC 95
Cdd:pfam00093    1 CVQNGVVYENGETWKPDLCTICTCDDGKVLCDKIIC-PPLDCPNPrlEIPPGECCPVC 57
SPT5 super family cl34925
Transcription elongation factor SPT5 [Transcription];
352-576 1.51e-03

Transcription elongation factor SPT5 [Transcription];


The actual alignment was detected with superfamily member COG5164:

Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 42.71  E-value: 1.51e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  352 PGEQGVPGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAG 431
Cdd:COG5164     6 PGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQ 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  432 LPG---PKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPgDKGESGPSGPAGPTGA-RGAPGDRGEPGPPGPAGF 507
Cdd:COG5164    86 NQGgtrPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPP-SGGSTTPPGDGGSTPPgPGSTGPGGSTTPPGDGGS 164
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 578830424  508 AGPPGADGQPGAKGEPGDAGAKGDAGPPGPAGPAGPPGPIGNVGAPGAKGARGSAGPPGATGFPGAAGR 576
Cdd:COG5164   165 TTPPGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTGPKDQ 233
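
Each hit above carries a summary line such as `921-1157 2.09e-159`, giving the matched interval on the query and the E-value. The sketch below is one way to read such a line and compute how many query residues the interval covers. The names `HitSummary` and `parse_interval_line` are illustrative, not part of any NCBI tool, and the code assumes the line holds exactly two whitespace-separated fields.

```python
from typing import NamedTuple


class HitSummary(NamedTuple):
    """Interval and E-value taken from one hit summary line (hypothetical helper type)."""
    start: int
    end: int
    e_value: float

    @property
    def span(self) -> int:
        # Intervals in the listing are inclusive at both ends.
        return self.end - self.start + 1


def parse_interval_line(line: str) -> HitSummary:
    """Parse a line of the form '<start>-<end> <e-value>'."""
    interval, e_value = line.split()
    start, end = interval.split("-")
    return HitSummary(int(start), int(end), float(e_value))


colfi = parse_interval_line("921-1157 2.09e-159")
vwc = parse_interval_line("40-95 1.18e-21")
```

For the COLFI hit the interval 921-1157 covers 237 query residues, compared with a reported Cd Length of 233 for the model.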
 
Name Accession Description Interval E-value
COLFI pfam01410
Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
921-1157 2.09e-159

Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 460199  Cd Length: 233  Bit Score: 472.60  E-value: 2.09e-159
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424   921 RDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDLKMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPS 1000
Cdd:pfam01410    1 RDEEVMATLKSLSQQIENIRSPDGSKKNPARTCRDLKLCHPDWKSGEYWIDPNQGCTRDAIKVFCNFETGETCIYPTKAS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  1001 VAQKNWYISKNPkdkrHVWFGESMTDGFQFEYGGQGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQTGNLKK 1080
Cdd:pfam01410   81 IPRKNWWTKESK----HVWFGEFMNGGSQFSYGVDGVGPSVAAVQLTFLRLLSTEASQNITYHCKNSVAYMDQATGNLKK 156
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 578830424  1081 ALLLQGSNEIEIRAEGNSRFTYSVTVDGCTSHTGAWGKTVIEYKTTKTSRLPIIDVAPLDVGAPDQEFGFDVGPVCF 1157
Cdd:pfam01410  157 ALLLQGSNDEEIRAEGNSRFTYTVLEDGCTKRTGQWGKTVIEYRTQKVSRLPIVDIAPMDIGGADQEFGVEVGPVCF 233
COLFI smart00038
Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
922-1158 4.73e-140

Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 197483  Cd Length: 232  Bit Score: 422.26  E-value: 4.73e-140
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424    922 DLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDLKMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPSV 1001
Cdd:smart00038    1 DEEVFASLKSLNNQIEQLKSPTGSRKNPARTCKDLKLCHPEWKSGEYWVDPNQGCIRDAIKVFCNFETGETCVSPSPSSI 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424   1002 AQKNWYISKNPkdkrHVWFGESMTDGFQFEYGGQGSDPaDVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQTGNLKKA 1081
Cdd:smart00038   81 PRKTWYSGKSK----HVWFGETMNGGFKFSYGDSEGPP-VGVVQLTFLRLLSTEAHQNITYHCKNSVAYMDEATGNLKKA 155
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 578830424   1082 LLLQGSNEIEIRAEGNSRFTYSVTVDGCTSHTGAWGKTVIEYKTTKTSRLPIIDVAPLDVGAPDQEFGFDVGPVCFL 1158
Cdd:smart00038  156 LRLRGSNDVELSAEGNSKFTYEVLEDGCQKRTGKWGKTVIEYRTKKTERLPIVDIAPSDIGGPDQEFGVEIGPVCFS 232
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
233-458 3.17e-30

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 125.40  E-value: 3.17e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  233 GEAGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGENGAPGQMGPRGLPGErgr 312
Cdd:NF038329  120 GEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP--- 196
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  313 pgapgpagpageRGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDlgapgpsGARGERGFPGERGVQGPPGPAGPRG 392
Cdd:NF038329  197 ------------RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-------GQQGPDGDPGPTGEDGPQGPDGPAG 257
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 578830424  393 ANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRG 458
Cdd:NF038329  258 KDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPG 323
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
273-518 1.50e-26

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 114.23  E-value: 1.50e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  273 LDGAKGDAGPAGPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGPAGERGEQGPAGSPGFQGlpgpagPPGEAGKP 352
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDG------EAGAKGPA 188
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  353 GEQGVPGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGaPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGL 432
Cdd:NF038329  189 GEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGE 267
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  433 PGPKGDRGDAGPKGADGSPGKDGVRGltgpigppgpagapgDKGESGPSGPAGPTGARGAPGDRGEPGPPGPAGFAGPPG 512
Cdd:NF038329  268 AGPDGPDGKDGERGPVGPAGKDGQNG---------------KDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDG 332

                  ....*.
gi 578830424  513 ADGQPG 518
Cdd:NF038329  333 KDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
202-453 2.88e-23

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 104.22  E-value: 2.88e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  202 QGPPGEPGEPGAsgpmgprgppgppgkNGDDGEAGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAG 281
Cdd:NF038329  137 RGDRGETGPAGP---------------AGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETG 201
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  282 PAGPKGEPGSPGENGAPGQMGPRGLPGERGrpgapgpagpageRGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDL 361
Cdd:NF038329  202 PAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-------------DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEA 268
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  362 GAPGPSGARGERGFPGErgvqgppgpagprgangaPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGPkgdRGD 441
Cdd:NF038329  269 GPDGPDGKDGERGPVGP------------------AGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGK---DGL 327
                         250
                  ....*....|..
gi 578830424  442 AGPKGADGSPGK 453
Cdd:NF038329  328 PGKDGKDGQPGK 339
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
602-806 7.66e-22

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 99.98  E-value: 7.66e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  602 GPRGETGPAgrpgevgppGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGP 681
Cdd:NF038329  117 GEKGEPGPA---------GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGP 187
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  682 SGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSPGaKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRG 761
Cdd:NF038329  188 AGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRG 266
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 578830424  762 ETGPAGPAGPVGPVGARGPAGPQGPRGDKGETGEQGDRGIKGHRG 806
Cdd:NF038329  267 EAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDG 311
VWC pfam00093
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ...
40-95 1.18e-21

von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.


Pssm-ID: 278520  Cd Length: 57  Bit Score: 89.02  E-value: 1.18e-21
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 578830424    40 CVQNGLRYHDRDVWKPEPCRICVCDNGKVLCDDVICdETKNCPGA--EVPEGECCPVC 95
Cdd:pfam00093    1 CVQNGVVYENGETWKPDLCTICTCDDGKVLCDKIIC-PPLDCPNPrlEIPPGECCPVC 57
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
416-725 4.36e-21

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 97.67  E-value: 4.36e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  416 GAPGLQGMPGERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRGltgpigppgpagapgDKGESGPSGPAGPTGARGapgd 495
Cdd:NF038329  117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERG---------------EKGPAGPQGEAGPQGPAG---- 177
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  496 rgepgppgpagfagppgADGQPGAKGEPGDAGAKGDAgppgpagpagppgpignvGAPGAKGARGSAGPPGATGFPGAAG 575
Cdd:NF038329  178 -----------------KDGEAGAKGPAGEKGPQGPR------------------GETGPAGEQGPAGPAGPDGEAGPAG 222
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  576 rvgppgpSGNAGPPGPPGPAGKEGGKGPRGETGPAgrpgevgppgppGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVV 655
Cdd:NF038329  223 -------EDGPAGPAGDGQQGPDGDPGPTGEDGPQ------------GPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPV 283
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  656 GLPGQRGERGFPGLPGPSGEPGKQGPSGAsgergppgpmgppglAGPPGESGREGAPGAEGSPGRDGSPG 725
Cdd:NF038329  284 GPAGKDGQNGKDGLPGKDGKDGQNGKDGL---------------PGKDGKDGQPGKDGLPGKDGKDGQPG 338
VWC smart00214
von Willebrand factor (vWF) type C domain;
40-95 1.49e-20

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214564  Cd Length: 59  Bit Score: 86.03  E-value: 1.49e-20
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*....
gi 578830424     40 CVQNGLRYHDRDVWKPEPCRICVCDNGK-VLCDDVICDETKNCPGAE--VPEGECCPVC 95
Cdd:smart00214    1 CVHNGRVYNDGETWKPDPCQICTCLDGTtVLCDPVECPPPPDCPNPErvKPPGECCPRC 59
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
593-806 4.53e-19

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 91.51  E-value: 4.53e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  593 GPAGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGP 672
Cdd:NF038329  126 GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGE 205
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  673 SGEPGKQGPSGASGERGPPGPMGPPGLAG--PPGESGREGAPGAEGSPGRDGSPGAKGDRGETgpagppgapgapgapgp 750
Cdd:NF038329  206 QGPAGPAGPDGEAGPAGEDGPAGPAGDGQqgPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEA----------------- 268
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 578830424  751 vGPAGKSGDRGETGPAGPAGPVGPVGARGPAGPQGPRGDKGETGEQGDRGIKGHRG 806
Cdd:NF038329  269 -GPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPG 323
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
474-786 7.83e-16

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 81.49  E-value: 7.83e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  474 DKGESGPSGPAGPTGARGAPgdrgepgppgpagfagppgadGQPGAKGEPGDAgakgdagppgpagpaGPPGPIGNVGAP 553
Cdd:NF038329  118 EKGEPGPAGPAGPAGEQGPR---------------------GDRGETGPAGPA---------------GPPGPQGERGEK 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  554 GAKGARGSAGPPGATGFPGAAGrvgppgpsgNAGPPGPPGPAGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGa 633
Cdd:NF038329  162 GPAGPQGEAGPQGPAGKDGEAG---------AKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG- 231
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  634 DGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGasgergppgpmgPPGLAGPPGESGREGAPG 713
Cdd:NF038329  232 DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERG------------PVGPAGKDGQNGKDGLPG 299
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578830424  714 AEGSPGRDGSPGAKGDRGEtgpagppgapgapgapgpvgpAGKSGDRGETGPAGPAGPVGPVGARGPAGPQGP 786
Cdd:NF038329  300 KDGKDGQNGKDGLPGKDGK---------------------DGQPGKDGLPGKDGKDGQPGKPAPKTPEVPQKP 351
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
707-848 1.47e-12

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 71.09  E-value: 1.47e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  707 GREGAPGAEGSPGRDGSPGAKGDRGETGPAGPPGAPGAPgapgpvgpagksGDRGETGPAGPAGPVGPVGARGPAGPQGP 786
Cdd:NF038329  117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQ------------GERGEKGPAGPQGEAGPQGPAGKDGEAGA 184
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 578830424  787 RGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPRGPPGsAGAPGKDG 848
Cdd:NF038329  185 KGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGP-DGDPGPTG 245
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
239-295 1.07e-08

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 52.50  E-value: 1.07e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 578830424   239 GRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGEN 295
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
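
The G-X-Y repeat described for this family (glycine in the first position of every triplet) can be checked directly on the aligned query segment 239-295 shown above. The following is a minimal sketch only: the helper names are illustrative, and it assumes the segment starts in frame on a glycine, which holds for this interval.

```python
def gxy_triplets(segment: str) -> list[str]:
    """Split a peptide segment into consecutive, complete three-residue groups."""
    usable = len(segment) - len(segment) % 3  # drop any trailing partial triplet
    return [segment[i:i + 3] for i in range(0, usable, 3)]


def glycine_first_fraction(segment: str) -> float:
    """Fraction of triplets whose first residue is glycine (G)."""
    triplets = gxy_triplets(segment)
    if not triplets:
        return 0.0
    return sum(t[0] == "G" for t in triplets) / len(triplets)


# Query residues 239-295, copied from the pfam01391 alignment above.
QUERY_239_295 = "GRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGEN"

triplets = gxy_triplets(QUERY_239_295)
fraction = glycine_first_fraction(QUERY_239_295)
```

Here the 57-residue segment splits into 19 triplets, each beginning with glycine, so `fraction` is 1.0.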
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
728-849 1.99e-08

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 57.99  E-value: 1.99e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  728 GDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPVGPVGARGPAGPQGPRGDKGETGEQGDRGIKGHRGF 807
Cdd:NF038329  117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 578830424  808 SGLQGPPGPPGSPGEQGPSGASGPAGPRGPPGSAGAP--GKDGL 849
Cdd:NF038329  197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGqqGPDGD 240
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
272-491 3.26e-06

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 51.18  E-value: 3.26e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  272 GLDGAKGDAGPAGPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGK 351
Cdd:COG5164     7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQN 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  352 PGEQGVPGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAG 431
Cdd:COG5164    87 QGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGSTTPPGDGGSTT 166
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  432 LPGPKGDRGDAGPKGADGSPGKDGVrgltgpigppgpaGAPGDKGESGPSGPAGPTGARG 491
Cdd:COG5164   167 PPGPGGSTTPPDDGGSTTPPNKGET-------------GTDIPTGGTPRQGPDGPVKKDD 213
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
551-794 2.61e-05

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 48.10  E-value: 2.61e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  551 GAPGAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGS 630
Cdd:COG5164     7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQN 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  631 PGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGP--SGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGR 708
Cdd:COG5164    87 QGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPpsGGSTTPPGDGGSTPPGPGSTGPGGSTTPPGDGGSTT 166
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  709 EGAPGAEGSPGRDGSPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGpvGPVGARGPAGPQGPRG 788
Cdd:COG5164   167 PPGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKT--GPKDQRPKTNPIERRG 244

                  ....*.
gi 578830424  789 DKGETG 794
Cdd:COG5164   245 PERPEA 250
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
232-292 7.10e-05

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 46.82  E-value: 7.10e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578830424  232 DGEAGKPGRPGERGPPGPQGARglpGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSP 292
Cdd:NF038329  283 VGPAGKDGQNGKDGLPGKDGKD---GQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKP 340
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
629-678 8.04e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.33  E-value: 8.04e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 578830424   629 GSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGK 678
Cdd:pfam01391    7 GPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
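The G-X-Y periodicity described for pfam01391 can be checked directly on the aligned query segment above (residues 629-678). The following is a minimal Python sketch and not part of the CDD output; the names `gxy_triplets` and `glycine_fraction` are illustrative assumptions, and the sequence is copied from the query row of the alignment.

```python
# Query residues 629-678, copied from the pfam01391 alignment row above.
QUERY_629_678 = "GSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGK"


def gxy_triplets(seq: str) -> list[str]:
    """Split a sequence into consecutive complete triplets.

    Any trailing partial triplet (one or two residues) is dropped.
    """
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def glycine_fraction(seq: str) -> float:
    """Fraction of complete triplets whose first residue is glycine (G)."""
    triplets = gxy_triplets(seq)
    if not triplets:
        return 0.0
    return sum(t[0] == "G" for t in triplets) / len(triplets)


if __name__ == "__main__":
    print(len(gxy_triplets(QUERY_629_678)), glycine_fraction(QUERY_629_678))
```

The sketch assumes the segment starts in frame, that is, with the glycine of a G-X-Y triplet; a segment starting at another position would need to be shifted by one or two residues first.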
GGGWT_bact NF040941
fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, ...
952-989 8.88e-04

fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, describes a conserved domain found in eukaryotic proteins such as fibrinogen beta and gamma chains, ficolin, and angiopoietin. This model describes a small homology domain, about 46 amino acids long, found in the PF00147 homology region of those proteins but also as a much shorter homology domain in bacterial proteins that may lack homology to those proteins, or to each other, outside this region. The signature motif, at the C-terminus of this domain, is YCDxTTDGGGWxLV.


Pssm-ID: 468872 [Multi-domain]  Cd Length: 46  Bit Score: 38.31  E-value: 8.88e-04
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 578830424  952 TCRDLKMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMET 989
Cdd:NF040941    1 SCWEILQAGPSAPSGVYWIDPDGMGGLAPFQVYCDMTT 38
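The signature motif quoted in the NF040941 description, YCDxTTDGGGWxLV, can be turned into a regular expression by treating each x as any single residue. A minimal Python sketch follows; the helper name `motif_to_regex` and the synthetic peptide are illustrative assumptions and not part of the CDD output. The alignment above covers only model positions 1-38 of 46, so the motif, which sits at the C-terminus of the model, is not expected in the aligned query segment.

```python
import re

SIGNATURE = "YCDxTTDGGGWxLV"  # motif as written in the NF040941 description
QUERY_952_989 = "TCRDLKMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMET"  # aligned query segment


def motif_to_regex(motif: str) -> re.Pattern:
    """Compile a motif in which a lowercase 'x' stands for any single residue."""
    pattern = "".join("[A-Z]" if ch == "x" else re.escape(ch) for ch in motif)
    return re.compile(pattern)


if __name__ == "__main__":
    rx = motif_to_regex(SIGNATURE)
    synthetic = "AAYCDMTTDGGGWTLVAA"  # hypothetical peptide containing the motif
    print(rx.search(synthetic) is not None)
    print(rx.search(QUERY_952_989) is not None)
```

An exact-pattern search like this is far stricter than the position-specific scoring used by CDD, so the absence of a regex match says nothing about the reported hit itself.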
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
352-576 1.51e-03

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 42.71  E-value: 1.51e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  352 PGEQGVPGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAG 431
Cdd:COG5164     6 PGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQ 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  432 LPG---PKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPgDKGESGPSGPAGPTGA-RGAPGDRGEPGPPGPAGF 507
Cdd:COG5164    86 NQGgtrPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPP-SGGSTTPPGDGGSTPPgPGSTGPGGSTTPPGDGGS 164
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 578830424  508 AGPPGADGQPGAKGEPGDAGAKGDAGPPGPAGPAGPPGPIGNVGAPGAKGARGSAGPPGATGFPGAAGR 576
Cdd:COG5164   165 TTPPGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTGPKDQ 233
PRK12678 PRK12678
transcription termination factor Rho; Provisional
602-805 8.06e-03

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 40.27  E-value: 8.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  602 GPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAG-APGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQG 680
Cdd:PRK12678   64 AAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAkAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAAR 143
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  681 PSGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDR 760
Cdd:PRK12678  144 KAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDG 223
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 578830424  761 GETGPAGPAGPVGPVGARGPAGPQGPRGDKGETGEQGDRGIKGHR 805
Cdd:PRK12678  224 GDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRD 268
 
COLFI smart00038
Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
922-1158 4.73e-140

Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 197483  Cd Length: 232  Bit Score: 422.26  E-value: 4.73e-140
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424    922 DLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDLKMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPSV 1001
Cdd:smart00038    1 DEEVFASLKSLNNQIEQLKSPTGSRKNPARTCKDLKLCHPEWKSGEYWVDPNQGCIRDAIKVFCNFETGETCVSPSPSSI 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424   1002 AQKNWYISKNPkdkrHVWFGESMTDGFQFEYGGQGSDPaDVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQTGNLKKA 1081
Cdd:smart00038   81 PRKTWYSGKSK----HVWFGETMNGGFKFSYGDSEGPP-VGVVQLTFLRLLSTEAHQNITYHCKNSVAYMDEATGNLKKA 155
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 578830424   1082 LLLQGSNEIEIRAEGNSRFTYSVTVDGCTSHTGAWGKTVIEYKTTKTSRLPIIDVAPLDVGAPDQEFGFDVGPVCFL 1158
Cdd:smart00038  156 LRLRGSNDVELSAEGNSKFTYEVLEDGCQKRTGKWGKTVIEYRTKKTERLPIVDIAPSDIGGPDQEFGVEIGPVCFS 232
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
233-458 3.17e-30

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 125.40  E-value: 3.17e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  233 GEAGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGENGAPGQMGPRGLPGErgr 312
Cdd:NF038329  120 GEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP--- 196
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  313 pgapgpagpageRGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDlgapgpsGARGERGFPGERGVQGPPGPAGPRG 392
Cdd:NF038329  197 ------------RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-------GQQGPDGDPGPTGEDGPQGPDGPAG 257
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 578830424  393 ANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRG 458
Cdd:NF038329  258 KDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPG 323
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
273-518 1.50e-26

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 114.23  E-value: 1.50e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  273 LDGAKGDAGPAGPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGPAGERGEQGPAGSPGFQGlpgpagPPGEAGKP 352
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDG------EAGAKGPA 188
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  353 GEQGVPGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGaPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGL 432
Cdd:NF038329  189 GEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGE 267
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  433 PGPKGDRGDAGPKGADGSPGKDGVRGltgpigppgpagapgDKGESGPSGPAGPTGARGAPGDRGEPGPPGPAGFAGPPG 512
Cdd:NF038329  268 AGPDGPDGKDGERGPVGPAGKDGQNG---------------KDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDG 332

                  ....*.
gi 578830424  513 ADGQPG 518
Cdd:NF038329  333 KDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
202-453 2.88e-23

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 104.22  E-value: 2.88e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  202 QGPPGEPGEPGAsgpmgprgppgppgkNGDDGEAGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAG 281
Cdd:NF038329  137 RGDRGETGPAGP---------------AGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETG 201
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  282 PAGPKGEPGSPGENGAPGQMGPRGLPGERGrpgapgpagpageRGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDL 361
Cdd:NF038329  202 PAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-------------DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEA 268
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  362 GAPGPSGARGERGFPGErgvqgppgpagprgangaPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGPkgdRGD 441
Cdd:NF038329  269 GPDGPDGKDGERGPVGP------------------AGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGK---DGL 327
                         250
                  ....*....|..
gi 578830424  442 AGPKGADGSPGK 453
Cdd:NF038329  328 PGKDGKDGQPGK 339
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
602-806 7.66e-22

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 99.98  E-value: 7.66e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  602 GPRGETGPAgrpgevgppGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGP 681
Cdd:NF038329  117 GEKGEPGPA---------GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGP 187
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  682 SGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSPGaKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRG 761
Cdd:NF038329  188 AGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRG 266
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 578830424  762 ETGPAGPAGPVGPVGARGPAGPQGPRGDKGETGEQGDRGIKGHRG 806
Cdd:NF038329  267 EAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDG 311
VWC pfam00093
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ...
40-95 1.18e-21

von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.


Pssm-ID: 278520  Cd Length: 57  Bit Score: 89.02  E-value: 1.18e-21
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 578830424    40 CVQNGLRYHDRDVWKPEPCRICVCDNGKVLCDDVICdETKNCPGA--EVPEGECCPVC 95
Cdd:pfam00093    1 CVQNGVVYENGETWKPDLCTICTCDDGKVLCDKIIC-PPLDCPNPrlEIPPGECCPVC 57
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
416-725 4.36e-21

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 97.67  E-value: 4.36e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  416 GAPGLQGMPGERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRGltgpigppgpagapgDKGESGPSGPAGPTGARGapgd 495
Cdd:NF038329  117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERG---------------EKGPAGPQGEAGPQGPAG---- 177
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  496 rgepgppgpagfagppgADGQPGAKGEPGDAGAKGDAgppgpagpagppgpignvGAPGAKGARGSAGPPGATGFPGAAG 575
Cdd:NF038329  178 -----------------KDGEAGAKGPAGEKGPQGPR------------------GETGPAGEQGPAGPAGPDGEAGPAG 222
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  576 rvgppgpSGNAGPPGPPGPAGKEGGKGPRGETGPAgrpgevgppgppGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVV 655
Cdd:NF038329  223 -------EDGPAGPAGDGQQGPDGDPGPTGEDGPQ------------GPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPV 283
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  656 GLPGQRGERGFPGLPGPSGEPGKQGPSGAsgergppgpmgppglAGPPGESGREGAPGAEGSPGRDGSPG 725
Cdd:NF038329  284 GPAGKDGQNGKDGLPGKDGKDGQNGKDGL---------------PGKDGKDGQPGKDGLPGKDGKDGQPG 338
VWC smart00214
von Willebrand factor (vWF) type C domain;
40-95 1.49e-20

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214564  Cd Length: 59  Bit Score: 86.03  E-value: 1.49e-20
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*....
gi 578830424     40 CVQNGLRYHDRDVWKPEPCRICVCDNGK-VLCDDVICDETKNCPGAE--VPEGECCPVC 95
Cdd:smart00214    1 CVHNGRVYNDGETWKPDPCQICTCLDGTtVLCDPVECPPPPDCPNPErvKPPGECCPRC 59
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
593-806 4.53e-19

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 91.51  E-value: 4.53e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  593 GPAGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGP 672
Cdd:NF038329  126 GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGE 205
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  673 SGEPGKQGPSGASGERGPPGPMGPPGLAG--PPGESGREGAPGAEGSPGRDGSPGAKGDRGETgpagppgapgapgapgp 750
Cdd:NF038329  206 QGPAGPAGPDGEAGPAGEDGPAGPAGDGQqgPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEA----------------- 268
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 578830424  751 vGPAGKSGDRGETGPAGPAGPVGPVGARGPAGPQGPRGDKGETGEQGDRGIKGHRG 806
Cdd:NF038329  269 -GPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPG 323
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
474-786 7.83e-16

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 81.49  E-value: 7.83e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  474 DKGESGPSGPAGPTGARGAPgdrgepgppgpagfagppgadGQPGAKGEPGDAgakgdagppgpagpaGPPGPIGNVGAP 553
Cdd:NF038329  118 EKGEPGPAGPAGPAGEQGPR---------------------GDRGETGPAGPA---------------GPPGPQGERGEK 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  554 GAKGARGSAGPPGATGFPGAAGrvgppgpsgNAGPPGPPGPAGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGa 633
Cdd:NF038329  162 GPAGPQGEAGPQGPAGKDGEAG---------AKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG- 231
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  634 DGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGasgergppgpmgPPGLAGPPGESGREGAPG 713
Cdd:NF038329  232 DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERG------------PVGPAGKDGQNGKDGLPG 299
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578830424  714 AEGSPGRDGSPGAKGDRGEtgpagppgapgapgapgpvgpAGKSGDRGETGPAGPAGPVGPVGARGPAGPQGP 786
Cdd:NF038329  300 KDGKDGQNGKDGLPGKDGK---------------------DGQPGKDGLPGKDGKDGQPGKPAPKTPEVPQKP 351
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
707-848 1.47e-12

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 71.09  E-value: 1.47e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  707 GREGAPGAEGSPGRDGSPGAKGDRGETGPAGPPGAPGAPgapgpvgpagksGDRGETGPAGPAGPVGPVGARGPAGPQGP 786
Cdd:NF038329  117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQ------------GERGEKGPAGPQGEAGPQGPAGKDGEAGA 184
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 578830424  787 RGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPRGPPGsAGAPGKDG 848
Cdd:NF038329  185 KGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGP-DGDPGPTG 245
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
239-295 1.07e-08

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple-helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 52.50  E-value: 1.07e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 578830424   239 GRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGEN 295
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
728-849 1.99e-08

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 57.99  E-value: 1.99e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  728 GDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPVGPVGARGPAGPQGPRGDKGETGEQGDRGIKGHRGF 807
Cdd:NF038329  117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 578830424  808 SGLQGPPGPPGSPGEQGPSGASGPAGPRGPPGSAGAP--GKDGL 849
Cdd:NF038329  197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGqqGPDGD 240
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
272-491 3.26e-06

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 51.18  E-value: 3.26e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  272 GLDGAKGDAGPAGPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGK 351
Cdd:COG5164     7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQN 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  352 PGEQGVPGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAG 431
Cdd:COG5164    87 QGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGSTTPPGDGGSTT 166
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  432 LPGPKGDRGDAGPKGADGSPGKDGVrgltgpigppgpaGAPGDKGESGPSGPAGPTGARG 491
Cdd:COG5164   167 PPGPGGSTTPPDDGGSTTPPNKGET-------------GTDIPTGGTPRQGPDGPVKKDD 213
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
551-794 2.61e-05

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 48.10  E-value: 2.61e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  551 GAPGAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGS 630
Cdd:COG5164     7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQN 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  631 PGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGP--SGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGR 708
Cdd:COG5164    87 QGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPpsGGSTTPPGDGGSTPPGPGSTGPGGSTTPPGDGGSTT 166
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  709 EGAPGAEGSPGRDGSPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGpvGPVGARGPAGPQGPRG 788
Cdd:COG5164   167 PPGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKT--GPKDQRPKTNPIERRG 244

                  ....*.
gi 578830424  789 DKGETG 794
Cdd:COG5164   245 PERPEA 250
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
413-460 1.47e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple-helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.55  E-value: 1.47e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 578830424   413 GSQGAPGLQGMPGERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRGLT 460
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPP 48
VWC_out smart00215
von Willebrand factor (vWF) type C domain;
40-102 4.41e-03

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214565  Cd Length: 67  Bit Score: 36.77  E-value: 4.41e-03
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578830424     40 CVQNGLRYHDRDVWKpEPCRICVCDNGKVLCDDVicdetkNCPGAEVPEGECCPVCPDGSESP 102
Cdd:smart00215    1 CWNNGSYYPPGAKWD-DDCNRCTCLNGRVSCTKV------WCGPKPCLLHNLSGECPLGQGCV 56
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
631-877 6.96e-03

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 40.40  E-value: 6.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  631 PGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREG 710
Cdd:COG5164     6 PGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQ 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  711 APGAEGSPGRDGSPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAgksgDRGETGPAGPAGPVGP-VGARGPAGPQGPRGD 789
Cdd:COG5164    86 NQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPP----SGGSTTPPGDGGSTPPgPGSTGPGGSTTPPGD 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  790 KGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPRGPPGSAGAPGKDGLNGLPGPIGPPGPRGRTGDAG 869
Cdd:COG5164   162 GGSTTPPGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTGPKDQRPKTNPIE 241

                  ....*...
gi 578830424  870 PVGPPGPP 877
Cdd:COG5164   242 RRGPERPE 249
PRK12678 PRK12678
transcription termination factor Rho; Provisional
602-805 8.06e-03

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 40.27  E-value: 8.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  602 GPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAG-APGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQG 680
Cdd:PRK12678   64 AAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAkAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAAR 143
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578830424  681 PSGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDR 760
Cdd:PRK12678  144 KAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDG 223
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 578830424  761 GETGPAGPAGPVGPVGARGPAGPQGPRGDKGETGEQGDRGIKGHR 805
Cdd:PRK12678  224 GDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRD 268
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
380-435 9.81e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in the formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple-helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 35.55  E-value: 9.81e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 578830424   380 GVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGP 435
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
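The per-hit summary lines in this report can be parsed and compared against the E-value threshold listed above. This is a minimal Python sketch, not an NCBI-provided parser: the regular expression, the `parse_summary` helper, and the choice of an inclusive comparison (`<=`) are all assumptions made for illustration.

```python
import re

# Summary lines copied from four of the hits in this report.
summary_lines = [
    "Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.55  E-value: 1.47e-04",
    "Pssm-ID: 468872 [Multi-domain]  Cd Length: 46  Bit Score: 38.31  E-value: 8.88e-04",
    "Pssm-ID: 214565  Cd Length: 67  Bit Score: 36.77  E-value: 4.41e-03",
    "Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 40.27  E-value: 8.06e-03",
]

# Hypothetical pattern for the summary-line layout seen above.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+).*?"
    r"Cd Length:\s*(?P<length>\d+)\s+"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s+"
    r"E-value:\s*(?P<evalue>\S+)"
)

THRESHOLD = 0.01  # E-value threshold from the preset options


def parse_summary(line: str) -> dict:
    """Extract the numeric fields from one summary line."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError(f"unrecognized summary line: {line!r}")
    return {
        "pssm": int(m["pssm"]),
        "length": int(m["length"]),
        "bits": float(m["bits"]),
        "evalue": float(m["evalue"]),
    }


hits = [parse_summary(line) for line in summary_lines]
# Assumption: a hit passes when its E-value is at or below the threshold.
passing = [h for h in hits if h["evalue"] <= THRESHOLD]
print(f"{len(passing)} of {len(hits)} hits pass the threshold")
```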
