NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|767945408|ref|XP_011513660|]
View 

collagen alpha-1(XXVIII) chain isoform X1 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
VWA pfam00092
von Willebrand factor type A domain;
798-974 8.41e-42

von Willebrand factor type A domain;


:

Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 150.89  E-value: 8.41e-42
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   798 ELVFVIDSSESVGPENFQIIKNFVKTMADRVALDLATARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGT-Y 876
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLDIGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTtN 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   877 TATALQ-AANDMFED---ARPGVKKVALVITDGQTDSRDkekLTEVVKNASDTNVEIFVIGvVKKNDPnfeifhKEMNLI 952
Cdd:pfam00092   81 TGKALKyALENLFSSaagARPGAPKVVVLLTDGRSQDGD---PEEVARELKSAGVTVFAVG-VGNADD------EELRKI 150
                          170       180
                   ....*....|....*....|....
gi 767945408   953 ATDP--EHVYQFDDFFTLQDTLKQ 974
Cdd:pfam00092  151 ASEPgeGHVFTVSDFEALEDLQDQ 174
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
272-582 1.33e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 138.50  E-value: 1.33e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  272 GNAQKGEAGERGPGGIPGYKGDKGERGECGKPGIKGDKGSPGPYGPKGPRGIQgitgppgdpgpkgfqgnkgepgppgpy 351
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQ--------------------------- 167
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  352 gspgapgiGQQGIKGERGQEGRPGAPGPigVGEPGQPGPRGPEGVPGERGLPGEgfPGPKGEKGSEGPTGPQGLQGLSIK 431
Cdd:NF038329  168 --------GEAGPQGPAGKDGEAGAKGP--AGEKGPQGPRGETGPAGEQGPAGP--AGPDGEAGPAGEDGPAGPAGDGQQ 235
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  432 GEKGDIGPVGPQGPMGIPGigsqgeqgiqgpigppgpqgpagqgLPGSKGEVGQMGPTGPRGPVG-IGVQGPKGEPGSIG 510
Cdd:NF038329  236 GPDGDPGPTGEDGPQGPDG-------------------------PAGKDGPRGDRGEAGPDGPDGkDGERGPVGPAGKDG 290
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767945408  511 LPGQPGVPGEDGAAGKKGEAGLPgargpegppgkgqpgpkgdeGKKGSKGNQGQRGLPGPEGPKGEPGIMGP 582
Cdd:NF038329  291 QNGKDGLPGKDGKDGQNGKDGLP--------------------GKDGKDGQPGKDGLPGKDGKDGQPGKPAP 342
Kunitz-type super family cl00101
Kunitz/Bovine pancreatic trypsin inhibitor (BPTI) domain; This family contains the Kunitz ...
1072-1122 4.53e-32

Kunitz/Bovine pancreatic trypsin inhibitor (BPTI) domain; This family contains the Kunitz domain which is a common structural fold found in a family of reversible serine protease inhibitors. This domain is thought to have evolved over 500 million years and is ubiquitous in all kingdoms of life and has been incorporated into many different genes. In general, each domain is encoded by a single exon. Some genes encode proteins with a single Kunitz domain, e.g. bovine pancreatic trypsin inhibitor (BPTI), trophoblast Kunitz domain protein (TKDP), amyloid beta-protein precursor (ABPP), as well as Kunitz-type venom peptides such as dendrotoxin. Genes that encode multiple Kunitz domains include hepatocyte growth factor activator inhibitors HAI1 and HAI2 (two domains), tissue factor pathway inhibitor TFPI1 and TFPI2 (three domains) and Caenorhabditis elegans papilin (eleven domains). In addition, the Kunitz domain has been integrated into multi-domain proteins, e.g. the collagen alpha3(VI), alpha1(VII) and alpha1(XXVIII) chains, WFIKKN1 (containing WAP, Follistatin/Kazal, Immunoglobulin, two Kunitz and NTR domains) and papilin. Furthermore, each domain within a multi-Kunitz domain protein may exhibit different protease activity, such as for the three tandemly repeated domains within both tissue factor pathway inhibitors 1 and 2. The Kunitz domain is a representative of alpha/beta proteins with irregular secondary structure stabilized by three disulfide bonds and presenting three peptide loops that can be varied without introducing much destabilization to the scaffold. Protease inhibitors meet the scaffold criteria in that they are small, stable and capable of evolving the binding activity of exposed peptide loops through targeted randomization to construct combinatorial libraries. Kunitz domain-based scaffolds have been successfully utilized to construct and select a library of protease inhibitors with the potential for therapeutic application.


The actual alignment was detected with superfamily member cd22628:

Pssm-ID: 444694  Cd Length: 51  Bit Score: 118.54  E-value: 4.53e-32
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22628     1 CLEPLDPGPCREYVVKWYYDKQANSCAQFWYGGCEGNRNRFETEEECRKTC 51
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
478-759 1.13e-27

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 117.70  E-value: 1.13e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  478 GSKGEVGQMGPTGPRGPVGI-GVQGPKGEPGSIGLPGQPGVPGEDGAAGKKGEAGLPGARgpegppgkgqpgpkgdeGKK 556
Cdd:NF038329  117 GEKGEPGPAGPAGPAGEQGPrGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPA-----------------GKD 179
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  557 GSKGNQGQRGLPGPEGPKGEPGIMGPFGMPGtsipgPPGPKGDRGGPGIPGFKGEPGLSIRGPKGVQGPRGPVGAPGLKG 636
Cdd:NF038329  180 GEAGAKGPAGEKGPQGPRGETGPAGEQGPAG-----PAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDG 254
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  637 dgypgvpgprglpgppgpmglrGVGDTGAKGEPGVRGPpgpsgprgvgtQGPKGDTGQKGLPGPPgppgygsqGIKGEQG 716
Cdd:NF038329  255 ----------------------PAGKDGPRGDRGEAGP-----------DGPDGKDGERGPVGPA--------GKDGQNG 293
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*..
gi 767945408  717 PQGFPGPKGTMGH----GLPGQKGEHGERGDVGKKGDKGEIGEPGSP 759
Cdd:NF038329  294 KDGLPGKDGKDGQngkdGLPGKDGKDGQPGKDGLPGKDGKDGQPGKP 340
vWFA_subfamily_ECM cd01450
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
47-209 6.05e-27

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains


:

Pssm-ID: 238727 [Multi-domain]  Cd Length: 161  Bit Score: 108.15  E-value: 6.05e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   47 IDIVFIVDSSESSKIALFDKQKDFVDSLSDKIfqltpgRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKSMNL 126
Cdd:cd01450     1 LDIVFLLDGSESVGPENFEKVKDFIEKLVEKL------DIGPDKTRVGLVQYSDDVRVEFSLNDYKSKDDLLKAVKNLKY 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  127 I-GQGTFSYYAISNATRLL--KREGRKDGVKVVLLMTDGIDHpKNPDVQSISEDARISGISFITIALSTvVNEAKLRLIS 203
Cdd:cd01450    75 LgGGGTNTGKALQYALEQLfsESNARENVPKVIIVLTDGRSD-DGGDPKEAAAKLKDEGIKVFVVGVGP-ADEEELREIA 152

                  ....*.
gi 767945408  204 GDSSSE 209
Cdd:cd01450   153 SCPSER 158
 
Name Accession Description Interval E-value
VWA pfam00092
von Willebrand factor type A domain;
798-974 8.41e-42

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 150.89  E-value: 8.41e-42
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   798 ELVFVIDSSESVGPENFQIIKNFVKTMADRVALDLATARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGT-Y 876
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLDIGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTtN 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   877 TATALQ-AANDMFED---ARPGVKKVALVITDGQTDSRDkekLTEVVKNASDTNVEIFVIGvVKKNDPnfeifhKEMNLI 952
Cdd:pfam00092   81 TGKALKyALENLFSSaagARPGAPKVVVLLTDGRSQDGD---PEEVARELKSAGVTVFAVG-VGNADD------EELRKI 150
                          170       180
                   ....*....|....*....|....
gi 767945408   953 ATDP--EHVYQFDDFFTLQDTLKQ 974
Cdd:pfam00092  151 ASEPgeGHVFTVSDFEALEDLQDQ 174
vWFA_subfamily_ECM cd01450
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
797-960 4.25e-39

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains


Pssm-ID: 238727 [Multi-domain]  Cd Length: 161  Bit Score: 142.82  E-value: 4.25e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  797 LELVFVIDSSESVGPENFQIIKNFVKTMADRVALDLATARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYL-GEGT 875
Cdd:cd01450     1 LDIVFLLDGSESVGPENFEKVKDFIEKLVEKLDIGPDKTRVGLVQYSDDVRVEFSLNDYKSKDDLLKAVKNLKYLgGGGT 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  876 YTATALQAANDMF---EDARPGVKKVALVITDGQtdSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDpnfeifhKEMNLI 952
Cdd:cd01450    81 NTGKALQYALEQLfseSNARENVPKVIIVLTDGR--SDDGGDPKEAAAKLKDEGIKVFVVGVGPADE-------EELREI 151
                         170
                  ....*....|
gi 767945408  953 ATDP--EHVY 960
Cdd:cd01450   152 ASCPseRHVF 161
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
799-972 1.33e-37

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 139.13  E-value: 1.33e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408    799 LVFVIDSSESVGPENFQIIKNFVKTMADRVALDLATARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQY-LGEGTYT 877
Cdd:smart00327    2 VVFLLDGSGSMGGNRFELAKEFVLKLVEQLDIGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYkLGGGTNL 81
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408    878 ATALQAANDMFED----ARPGVKKVALVITDGQTDSRDKEKLtEVVKNASDTNVEIFVIGVvkKNDPNFEifhkEMNLIA 953
Cdd:smart00327   82 GAALQYALENLFSksagSRRGAPKVVILITDGESNDGPKDLL-KAAKELKRSGVKVFVVGV--GNDVDEE----ELKKLA 154
                           170       180
                    ....*....|....*....|.
gi 767945408    954 TDP--EHVYQFDDFFTLQDTL 972
Cdd:smart00327  155 SAPggVYVFLPELLDLLIDLL 175
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
272-582 1.33e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 138.50  E-value: 1.33e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  272 GNAQKGEAGERGPGGIPGYKGDKGERGECGKPGIKGDKGSPGPYGPKGPRGIQgitgppgdpgpkgfqgnkgepgppgpy 351
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQ--------------------------- 167
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  352 gspgapgiGQQGIKGERGQEGRPGAPGPigVGEPGQPGPRGPEGVPGERGLPGEgfPGPKGEKGSEGPTGPQGLQGLSIK 431
Cdd:NF038329  168 --------GEAGPQGPAGKDGEAGAKGP--AGEKGPQGPRGETGPAGEQGPAGP--AGPDGEAGPAGEDGPAGPAGDGQQ 235
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  432 GEKGDIGPVGPQGPMGIPGigsqgeqgiqgpigppgpqgpagqgLPGSKGEVGQMGPTGPRGPVG-IGVQGPKGEPGSIG 510
Cdd:NF038329  236 GPDGDPGPTGEDGPQGPDG-------------------------PAGKDGPRGDRGEAGPDGPDGkDGERGPVGPAGKDG 290
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767945408  511 LPGQPGVPGEDGAAGKKGEAGLPgargpegppgkgqpgpkgdeGKKGSKGNQGQRGLPGPEGPKGEPGIMGP 582
Cdd:NF038329  291 QNGKDGLPGKDGKDGQNGKDGLP--------------------GKDGKDGQPGKDGLPGKDGKDGQPGKPAP 342
Kunitz_collagen_alpha1_XXVIII cd22628
Kunitz-type domain from the alpha1 chain of type XXVIII collagen, and similar proteins; This ...
1072-1122 4.53e-32

Kunitz-type domain from the alpha1 chain of type XXVIII collagen, and similar proteins; This model includes the Kunitz-type domain from the alpha1 chain of type XXVIII collagen (collagen alpha-1(XXVIII) chain) and similar proteins. The zebrafish has four collagen XXVIII genes all of which are differentially expressed in the liver, thymus, muscle, intestine and skin; only the alpha1 chain contains the Kunitz domain which is often proteolytically processed. Mammals only contain the alpha1 collagen chain, expressed mostly in dorsal root ganglia and peripheral nerves. The Kunitz domain is found at the C-terminus, and is most related to Kunitz domains of papilin and alpha3(VI) collagen. This domain is similar to that of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor) that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438671  Cd Length: 51  Bit Score: 118.54  E-value: 4.53e-32
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22628     1 CLEPLDPGPCREYVVKWYYDKQANSCAQFWYGGCEGNRNRFETEEECRKTC 51
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
423-637 7.68e-30

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 124.25  E-value: 7.68e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  423 QGLQGLSIKGEKGDIGPVGPQGPMGIPGI-GSQGEQGIQGPIGPPGPQGPagqglPGSKGEVGQMGPTGPRGPVgiGVQG 501
Cdd:NF038329  108 EGLQQLKGDGEKGEPGPAGPAGPAGEQGPrGDRGETGPAGPAGPPGPQGE-----RGEKGPAGPQGEAGPQGPA--GKDG 180
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  502 PKGEPGSIGLPGQPGVPGEDGAAGKKGEAGLPGARGPEGPPGKGQPGPKGDEGKKGSKGNQGQRGLPGPEGPKGEPGIMG 581
Cdd:NF038329  181 EAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDG 260
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 767945408  582 PFGMPG-TSIPGPPGPKGDRGGPGIPGFKGEPGLsiRGPKGVQGPRGPVGAPGLKGD 637
Cdd:NF038329  261 PRGDRGeAGPDGPDGKDGERGPVGPAGKDGQNGK--DGLPGKDGKDGQNGKDGLPGK 315
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
478-759 1.13e-27

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 117.70  E-value: 1.13e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  478 GSKGEVGQMGPTGPRGPVGI-GVQGPKGEPGSIGLPGQPGVPGEDGAAGKKGEAGLPGARgpegppgkgqpgpkgdeGKK 556
Cdd:NF038329  117 GEKGEPGPAGPAGPAGEQGPrGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPA-----------------GKD 179
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  557 GSKGNQGQRGLPGPEGPKGEPGIMGPFGMPGtsipgPPGPKGDRGGPGIPGFKGEPGLSIRGPKGVQGPRGPVGAPGLKG 636
Cdd:NF038329  180 GEAGAKGPAGEKGPQGPRGETGPAGEQGPAG-----PAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDG 254
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  637 dgypgvpgprglpgppgpmglrGVGDTGAKGEPGVRGPpgpsgprgvgtQGPKGDTGQKGLPGPPgppgygsqGIKGEQG 716
Cdd:NF038329  255 ----------------------PAGKDGPRGDRGEAGP-----------DGPDGKDGERGPVGPA--------GKDGQNG 293
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*..
gi 767945408  717 PQGFPGPKGTMGH----GLPGQKGEHGERGDVGKKGDKGEIGEPGSP 759
Cdd:NF038329  294 KDGLPGKDGKDGQngkdGLPGKDGKDGQPGKDGLPGKDGKDGQPGKP 340
vWFA_subfamily_ECM cd01450
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
47-209 6.05e-27

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains


Pssm-ID: 238727 [Multi-domain]  Cd Length: 161  Bit Score: 108.15  E-value: 6.05e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   47 IDIVFIVDSSESSKIALFDKQKDFVDSLSDKIfqltpgRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKSMNL 126
Cdd:cd01450     1 LDIVFLLDGSESVGPENFEKVKDFIEKLVEKL------DIGPDKTRVGLVQYSDDVRVEFSLNDYKSKDDLLKAVKNLKY 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  127 I-GQGTFSYYAISNATRLL--KREGRKDGVKVVLLMTDGIDHpKNPDVQSISEDARISGISFITIALSTvVNEAKLRLIS 203
Cdd:cd01450    75 LgGGGTNTGKALQYALEQLfsESNARENVPKVIIVLTDGRSD-DGGDPKEAAAKLKDEGIKVFVVGVGP-ADEEELREIA 152

                  ....*.
gi 767945408  204 GDSSSE 209
Cdd:cd01450   153 SCPSER 158
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
260-533 1.81e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 111.15  E-value: 1.81e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  260 GIKGERGPKGNPGNA-QKGEAGERGPGGIPGYKGDKGERGECGKPGIKGDKGSPGPYGPKGPRGIQGI------TGPPGD 332
Cdd:NF038329  117 GEKGEPGPAGPAGPAgEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAkgpageKGPQGP 196
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  333 PGPKGFQGNKGEPGPPGPYgspgapgigqqgikGERGQEGRPGAPGPIGVGEPGQPGPRGPEGVPGERGLPGEgfPGPKG 412
Cdd:NF038329  197 RGETGPAGEQGPAGPAGPD--------------GEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGP--AGKDG 260
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  413 EKGSEGPTGPQGLqglsiKGEKGDIGPVGPQGPmgipgigsqgeqgiqgpigppgpqgpagqglPGSKGEVGQMGPTGPR 492
Cdd:NF038329  261 PRGDRGEAGPDGP-----DGKDGERGPVGPAGK-------------------------------DGQNGKDGLPGKDGKD 304
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|.
gi 767945408  493 GPvgigvQGPKGEPGSIGLPGQPGVPGEDGAAGKKGEAGLP 533
Cdd:NF038329  305 GQ-----NGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKP 340
VWA pfam00092
von Willebrand factor type A domain;
48-226 1.02e-24

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 101.97  E-value: 1.02e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408    48 DIVFIVDSSESSKIALFDKQKDFVDSLSDkifQLTPGrslEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKSMNLI 127
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVE---SLDIG---PDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYL 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   128 GQGTFSY-YAISNATRLL---KREGRKDGVKVVLLMTDGidHPKNPDVQSISEDARISGISFITIALSTVVNEAkLRLIS 203
Cdd:pfam00092   75 GGGTTNTgKALKYALENLfssAAGARPGAPKVVVLLTDG--RSQDGDPEEVARELKSAGVTVFAVGVGNADDEE-LRKIA 151
                          170       180
                   ....*....|....*....|...
gi 767945408   204 GDSSSEPTLLLSDPTLVDKIQDR 226
Cdd:pfam00092  152 SEPGEGHVFTVSDFEALEDLQDQ 174
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
48-227 8.00e-23

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 96.75  E-value: 8.00e-23
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408     48 DIVFIVDSSESSKIALFDKQKDFVDSLsdkifqLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKSMNLI 127
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKL------VEQLDIGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYK 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408    128 -GQGTFSYYAISNATRLLKRE---GRKDGVKVVLLMTDGIDHPKNPDVQSISEDARISGISFITIALSTVVNEAKLRLIS 203
Cdd:smart00327   75 lGGGTNLGAALQYALENLFSKsagSRRGAPKVVILITDGESNDGPKDLLKAAKELKRSGVKVFVVGVGNDVDEEELKKLA 154
                           170       180
                    ....*....|....*....|....
gi 767945408    204 GDSSSEPTLLlsdPTLVDKIQDRL 227
Cdd:smart00327  155 SAPGGVYVFL---PELLDLLIDLL 175
KU smart00131
BPTI/Kunitz family of serine protease inhibitors; Serine protease inhibitors. One member of ...
1070-1122 3.04e-20

BPTI/Kunitz family of serine protease inhibitors; Serine protease inhibitors. One member of the family is encoded by an alternatively-spliced form of Alzheimer's amyloid beta-protein.


Pssm-ID: 197529  Cd Length: 53  Bit Score: 85.01  E-value: 3.04e-20
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 767945408   1070 PRCLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:smart00131    1 DVCLLPPDTGPCGGSIPRYYYDPETGTCEPFTYGGCGGNANNFESLEECERTC 53
Kunitz_BPTI pfam00014
Kunitz/Bovine pancreatic trypsin inhibitor domain; Indicative of a protease inhibitor, usually ...
1071-1122 1.84e-19

Kunitz/Bovine pancreatic trypsin inhibitor domain; Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure. Certain family members are similar to the tick anticoagulant peptide (TAP). This is a highly selective inhibitor of factor Xa in the blood coagulation pathways. TAP molecules are highly dipolar, and are arranged to form a twisted two- stranded antiparallel beta-sheet followed by an alpha helix.


Pssm-ID: 425421  Cd Length: 53  Bit Score: 82.69  E-value: 1.84e-19
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 767945408  1071 RCLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:pfam00014    1 ICSLPPDSGPCKASIPRWYYNPTTGTCEPFTYGGCGGNANNFESLEECESTC 52
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
793-934 2.25e-15

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 77.29  E-value: 2.25e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  793 KETPLELVFVIDSSESVGPEN-FQIIKNFVKTMADRValdLATARIGIINYSHKVEKVANLKqfSSKDDFKLAVDNMQyL 871
Cdd:COG1240    89 PQRGRDVVLVVDASGSMAAENrLEAAKGALLDFLDDY---RPRDRVGLVAFGGEAEVLLPLT--RDREALKRALDELP-P 162
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767945408  872 GEGTYTATALQAANDMFEDARPGVKKVALVITDGQtDSRDKEKLTEVVKNASDTNVEIFVIGV 934
Cdd:COG1240   163 GGGTPLGDALALALELLKRADPARRKVIVLLTDGR-DNAGRIDPLEAAELAAAAGIRIYTIGV 224
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
257-445 7.84e-15

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 78.41  E-value: 7.84e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  257 GNPGIKGERGPKGNPGNAQKGEAGERGPGGIPGYKGDKGERGECGKPGIKGDKGSPGPYGPKGPRGIQGITGPPGDPGPK 336
Cdd:NF038329  210 GPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKD 289
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  337 GFQGNKgepgppgpygspgapgiGQQGIKGERGQEGRPGAPGPigVGEPGQPGPRGPEGVPGERGLPGEgfPGPKGEKGS 416
Cdd:NF038329  290 GQNGKD-----------------GLPGKDGKDGQNGKDGLPGK--DGKDGQPGKDGLPGKDGKDGQPGK--PAPKTPEVP 348
                         170       180       190
                  ....*....|....*....|....*....|
gi 767945408  417 EGP-TGPQGLQGLSIKGEKGDIGPvGPQGP 445
Cdd:NF038329  349 QKPdTAPHTPKTPQIPGQSKDVTP-APQNP 377
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
47-203 1.96e-12

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 68.81  E-value: 1.96e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   47 IDIVFIVDSSES----SKIalfDKQKDFVDSLSDkifQLTPGRsleydiKLAALQFSSSVQIDPPFSSwkDLQTFKQKVK 122
Cdd:COG1240    93 RDVVLVVDASGSmaaeNRL---EAAKGALLDFLD---DYRPRD------RVGLVAFGGEAEVLLPLTR--DREALKRALD 158
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  123 SMNlIGQGTFSYYAISNATRLLKREgRKDGVKVVLLMTDGIDHPKNPDVQSISEDARISGISFITIALST-VVNEAKLRL 201
Cdd:COG1240   159 ELP-PGGGTPLGDALALALELLKRA-DPARRKVIVLLTDGRDNAGRIDPLEAAELAAAAGIRIYTIGVGTeAVDEGLLRE 236

                  ..
gi 767945408  202 IS 203
Cdd:COG1240   237 IA 238
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
360-419 8.46e-09

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 52.50  E-value: 8.46e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   360 GQQGIKGERGQEGRPGAPGPIgvGEPGQPGPRGPEGVPGERGLPgeGFPGPKGEKGSEGP 419
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPP--GPPGPPGPPGEPGPPGPPGPP--GPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
554-612 3.36e-08

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 50.96  E-value: 3.36e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 767945408   554 GKKGSKGNQGQRGLPGPEGPKGEPGIMGPFGMPGTsiPGPPGPKGDRGGPGIPGFKGEP 612
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGP--PGPPGPPGPPGPPGAPGAPGPP 57
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
257-510 5.89e-08

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 56.58  E-value: 5.89e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  257 GNPGIKGERGPKGNPG-NAQKGEAGERGPGGIPGYKGDKGERGECGKPGIKGDKGSPGPYGPKGPRGIQGITGPPGDPGP 335
Cdd:COG5164    19 TPAGSQGSTKPAQNQGsTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQGGTRPAGNTGG 98
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  336 KGFQGNKGEPGPPGPYGSPGAPGIGQQGIKGERGQEGRPGAPG--PIGVGEPGQPGPRGPEGVPGERGLPGEGfpGPKGE 413
Cdd:COG5164    99 TTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGstPPGPGSTGPGGSTTPPGDGGSTTPPGPG--GSTTP 176
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  414 KGSEGPTGPQGLQGLSIKGEKGDIGPVGPQGPMGIPGIGSQGEQGIQGPIGPPGPQGPAGQGLPGSKGEVGQMGPTGPRG 493
Cdd:COG5164   177 PDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTGPKDQRPKTNPIERRGPERPEAAALPAE 256
                         250
                  ....*....|....*..
gi 767945408  494 PVGIGVQGPKGEPGSIG 510
Cdd:COG5164   257 LTALEAENRAANPEPAT 273
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
489-774 1.51e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 45.79  E-value: 1.51e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  489 TGPRGPVGIGVQGPKGE---PGSIGLPGQPGVPGEDGAAGKKGEAGLPGargpegppgkgqpgpkgDEGKKGSKGNQGQR 565
Cdd:COG5164     1 TGLYGPGKTGPSDPGGVttpAGSQGSTKPAQNQGSTRPAGNTGGTRPAQ-----------------NQGSTTPAGNTGGT 63
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  566 GLPGPEGPKGEPGIMGPFGMPGTsiPGPPGPKGDRGGPGIPGFKGEPGLSirGPKGVQGPRGPVGAPGlKGDGYPGVPGP 645
Cdd:COG5164    64 RPAGNQGATGPAQNQGGTTPAQN--QGGTRPAGNTGGTTPAGDGGATGPP--DDGGATGPPDDGGSTT-PPSGGSTTPPG 138
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  646 RGLPGPPGPMGLRGVGDTGAKGEPGVRGPPgpsgprgvgtqGPKGDTGQKGLPGPPGPPGYGSQGIKGEQGPQGFPGPKG 725
Cdd:COG5164   139 DGGSTPPGPGSTGPGGSTTPPGDGGSTTPP-----------GPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDG 207
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|
gi 767945408  726 tmGHGLPGQKGEHGERGDV-GKKGDKGEIGEPGSPGKQGLQGPKGDLGLT 774
Cdd:COG5164   208 --PVKKDDKNGKGNPPDDRgGKTGPKDQRPKTNPIERRGPERPEAAALPA 255
PHA03169 PHA03169
hypothetical protein; Provisional
256-420 1.90e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 41.88  E-value: 1.90e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  256 HGNPGIKGERGPKGN-------PGNAQKGEAGERGPGGIPGYKGDKGERGECGKPGIKGDKGSPGPYGPKGPRGIQGitg 328
Cdd:PHA03169   81 HGEKEERGQGGPSGSgsesvgsPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNP--- 157
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  329 PPGDPGPKGFQGNKGEPGPPGPYGSPGAPGIGQQGIKGERGQEGRPGAPGPIGVGEPGQPGPRGPEGVPGERGLPGEGFP 408
Cdd:PHA03169  158 SPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEP 237
                         170
                  ....*....|..
gi 767945408  409 GPKGEKGSEGPT 420
Cdd:PHA03169  238 TEPEREGPPFPG 249
 
Name Accession Description Interval E-value
VWA pfam00092
von Willebrand factor type A domain;
798-974 8.41e-42

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 150.89  E-value: 8.41e-42
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   798 ELVFVIDSSESVGPENFQIIKNFVKTMADRVALDLATARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGT-Y 876
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLDIGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTtN 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   877 TATALQ-AANDMFED---ARPGVKKVALVITDGQTDSRDkekLTEVVKNASDTNVEIFVIGvVKKNDPnfeifhKEMNLI 952
Cdd:pfam00092   81 TGKALKyALENLFSSaagARPGAPKVVVLLTDGRSQDGD---PEEVARELKSAGVTVFAVG-VGNADD------EELRKI 150
                          170       180
                   ....*....|....*....|....
gi 767945408   953 ATDP--EHVYQFDDFFTLQDTLKQ 974
Cdd:pfam00092  151 ASEPgeGHVFTVSDFEALEDLQDQ 174
vWFA_subfamily_ECM cd01450
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
797-960 4.25e-39

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains


Pssm-ID: 238727 [Multi-domain]  Cd Length: 161  Bit Score: 142.82  E-value: 4.25e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  797 LELVFVIDSSESVGPENFQIIKNFVKTMADRVALDLATARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYL-GEGT 875
Cdd:cd01450     1 LDIVFLLDGSESVGPENFEKVKDFIEKLVEKLDIGPDKTRVGLVQYSDDVRVEFSLNDYKSKDDLLKAVKNLKYLgGGGT 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  876 YTATALQAANDMF---EDARPGVKKVALVITDGQtdSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDpnfeifhKEMNLI 952
Cdd:cd01450    81 NTGKALQYALEQLfseSNARENVPKVIIVLTDGR--SDDGGDPKEAAAKLKDEGIKVFVVGVGPADE-------EELREI 151
                         170
                  ....*....|
gi 767945408  953 ATDP--EHVY 960
Cdd:cd01450   152 ASCPseRHVF 161
vWA_collagen cd01472
von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This ...
799-965 1.51e-38

von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.


Pssm-ID: 238749 [Multi-domain]  Cd Length: 164  Bit Score: 141.21  E-value: 1.51e-38
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  799 LVFVIDSSESVGPENFQIIKNFVKTMADRVALDLATARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTA 878
Cdd:cd01472     3 IVFLVDGSESIGLSNFNLVKDFVKRVVERLDIGPDGVRVGVVQYSDDPRTEFYLNTYRSKDDVLEAVKNLRYIGGGTNTG 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  879 TALQ-AANDMFE---DARPGVKKVALVITDGQtdSRDkeKLTEVVKNASDTNVEIFVIGvVKKNDPNfeifhkEMNLIAT 954
Cdd:cd01472    83 KALKyVRENLFTeasGSREGVPKVLVVITDGK--SQD--DVEEPAVELKQAGIEVFAVG-VKNADEE------ELKQIAS 151
                         170
                  ....*....|...
gi 767945408  955 DP--EHVYQFDDF 965
Cdd:cd01472   152 DPkeLYVFNVADF 164
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
799-972 1.33e-37

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 139.13  E-value: 1.33e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408    799 LVFVIDSSESVGPENFQIIKNFVKTMADRVALDLATARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQY-LGEGTYT 877
Cdd:smart00327    2 VVFLLDGSGSMGGNRFELAKEFVLKLVEQLDIGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYkLGGGTNL 81
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408    878 ATALQAANDMFED----ARPGVKKVALVITDGQTDSRDKEKLtEVVKNASDTNVEIFVIGVvkKNDPNFEifhkEMNLIA 953
Cdd:smart00327   82 GAALQYALENLFSksagSRRGAPKVVILITDGESNDGPKDLL-KAAKELKRSGVKVFVVGV--GNDVDEE----ELKKLA 154
                           170       180
                    ....*....|....*....|.
gi 767945408    954 TDP--EHVYQFDDFFTLQDTL 972
Cdd:smart00327  155 SAPggVYVFLPELLDLLIDLL 175
vWA_Matrilin cd01475
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and ...
796-985 1.87e-36

VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.


Pssm-ID: 238752 [Multi-domain]  Cd Length: 224  Bit Score: 137.52  E-value: 1.87e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  796 PLELVFVIDSSESVGPENFQIIKNFVKTMADRVALDLATARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGT 875
Cdd:cd01475     2 PTDLVFLIDSSRSVRPENFELVKQFLNQIIDSLDVGPDATRVGLVQYSSTVKQEFPLGRFKSKADLKRAVRRMEYLETGT 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  876 YTATALQ-AANDMF---EDARPG---VKKVALVITDGqtdsRDKEKLTEVVKNASDTNVEIFVIGVVKKNDpnfeifhKE 948
Cdd:cd01475    82 MTGLAIQyAMNNAFseaEGARPGserVPRVGIVVTDG----RPQDDVSEVAAKARALGIEMFAVGVGRADE-------EE 150
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 767945408  949 MNLIATDP--EHVYQFDDFFTLqDTLKQKLFQKICEDFD 985
Cdd:cd01475   151 LREIASEPlaDHVFYVEDFSTI-EELTKKFQGKICVVPD 188
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
272-582 1.33e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 138.50  E-value: 1.33e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  272 GNAQKGEAGERGPGGIPGYKGDKGERGECGKPGIKGDKGSPGPYGPKGPRGIQgitgppgdpgpkgfqgnkgepgppgpy 351
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQ--------------------------- 167
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  352 gspgapgiGQQGIKGERGQEGRPGAPGPigVGEPGQPGPRGPEGVPGERGLPGEgfPGPKGEKGSEGPTGPQGLQGLSIK 431
Cdd:NF038329  168 --------GEAGPQGPAGKDGEAGAKGP--AGEKGPQGPRGETGPAGEQGPAGP--AGPDGEAGPAGEDGPAGPAGDGQQ 235
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  432 GEKGDIGPVGPQGPMGIPGigsqgeqgiqgpigppgpqgpagqgLPGSKGEVGQMGPTGPRGPVG-IGVQGPKGEPGSIG 510
Cdd:NF038329  236 GPDGDPGPTGEDGPQGPDG-------------------------PAGKDGPRGDRGEAGPDGPDGkDGERGPVGPAGKDG 290
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767945408  511 LPGQPGVPGEDGAAGKKGEAGLPgargpegppgkgqpgpkgdeGKKGSKGNQGQRGLPGPEGPKGEPGIMGP 582
Cdd:NF038329  291 QNGKDGLPGKDGKDGQNGKDGLP--------------------GKDGKDGQPGKDGLPGKDGKDGQPGKPAP 342
Kunitz_collagen_alpha1_XXVIII cd22628
Kunitz-type domain from the alpha1 chain of type XXVIII collagen, and similar proteins; This ...
1072-1122 4.53e-32

Kunitz-type domain from the alpha1 chain of type XXVIII collagen, and similar proteins; This model includes the Kunitz-type domain from the alpha1 chain of type XXVIII collagen (collagen alpha-1(XXVIII) chain) and similar proteins. The zebrafish has four collagen XXVIII genes all of which are differentially expressed in the liver, thymus, muscle, intestine and skin; only the alpha1 chain contains the Kunitz domain which is often proteolytically processed. Mammals only contain the alpha1 collagen chain, expressed mostly in dorsal root ganglia and peripheral nerves. The Kunitz domain is found at the C-terminus, and is most related to Kunitz domains of papilin and alpha3(VI) collagen. This domain is similar to that of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor) that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438671  Cd Length: 51  Bit Score: 118.54  E-value: 4.53e-32
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22628     1 CLEPLDPGPCREYVVKWYYDKQANSCAQFWYGGCEGNRNRFETEEECRKTC 51
vWA_collagen_alpha_1-VI-type cd01480
VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable ...
796-956 1.02e-31

VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238757 [Multi-domain]  Cd Length: 186  Bit Score: 122.49  E-value: 1.02e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  796 PLELVFVIDSSESVGPENFQIIKNFVKTMADRVALDLA------TARIGIINYSHKVEKVANLKQF-SSKDDFKLAVDNM 868
Cdd:cd01480     2 PVDITFVLDSSESVGLQNFDITKNFVKRVAERFLKDYYrkdpagSWRVGVVQYSDQQEVEAGFLRDiRNYTSLKEAVDNL 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  869 QYLGEGTYTATALQAAND-MFEDARPGVKKVALVITDGQTDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPNfeifhk 947
Cdd:cd01480    82 EYIGGGTFTDCALKYATEqLLEGSHQKENKFLLVITDGHSDGSPDGGIEKAVNEADHLGIKIFFVAVGSQNEEP------ 155

                  ....*....
gi 767945408  948 eMNLIATDP 956
Cdd:cd01480   156 -LSRIACDG 163
vWA_integrins_alpha_subunit cd01469
Integrins are a class of adhesion receptors that link the extracellular matrix to the ...
797-970 1.96e-30

Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.


Pssm-ID: 238746 [Multi-domain]  Cd Length: 177  Bit Score: 118.61  E-value: 1.96e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  797 LELVFVIDSSESVGPENFQIIKNFVKTMADRVALDLATARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTY 876
Cdd:cd01469     1 MDIVFVLDGSGSIYPDDFQKVKNFLSTVMKKLDIGPTKTQFGLVQYSESFRTEFTLNEYRTKEEPLSLVKHISQLLGLTN 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  877 TATALQ-AANDMFED---ARPGVKKVALVITDGqtDSRDKEKLTEVVKNASDTNVEIFVIGVVKK-NDPNfeiFHKEMNL 951
Cdd:cd01469    81 TATAIQyVVTELFSEsngARKDATKVLVVITDG--ESHDDPLLKDVIPQAEREGIIRYAIGVGGHfQREN---SREELKT 155
                         170       180
                  ....*....|....*....|.
gi 767945408  952 IATDP--EHVYQFDDFFTLQD 970
Cdd:cd01469   156 IASKPpeEHFFNVTDFAALKD 176
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
423-637 7.68e-30

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 124.25  E-value: 7.68e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  423 QGLQGLSIKGEKGDIGPVGPQGPMGIPGI-GSQGEQGIQGPIGPPGPQGPagqglPGSKGEVGQMGPTGPRGPVgiGVQG 501
Cdd:NF038329  108 EGLQQLKGDGEKGEPGPAGPAGPAGEQGPrGDRGETGPAGPAGPPGPQGE-----RGEKGPAGPQGEAGPQGPA--GKDG 180
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  502 PKGEPGSIGLPGQPGVPGEDGAAGKKGEAGLPGARGPEGPPGKGQPGPKGDEGKKGSKGNQGQRGLPGPEGPKGEPGIMG 581
Cdd:NF038329  181 EAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDG 260
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 767945408  582 PFGMPG-TSIPGPPGPKGDRGGPGIPGFKGEPGLsiRGPKGVQGPRGPVGAPGLKGD 637
Cdd:NF038329  261 PRGDRGeAGPDGPDGKDGERGPVGPAGKDGQNGK--DGLPGKDGKDGQNGKDGLPGK 315
vWFA cd00198
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
799-962 2.86e-29

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.


Pssm-ID: 238119 [Multi-domain]  Cd Length: 161  Bit Score: 114.59  E-value: 2.86e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  799 LVFVIDSSESVGPENFQIIKNFVKTMADRVALDLATARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQY-LGEGTYT 877
Cdd:cd00198     3 IVFLLDVSGSMGGEKLDKAKEALKALVSSLSASPPGDRVGLVTFGSNARVVLPLTTDTDKADLLEAIDALKKgLGGGTNI 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  878 ATALQAANDMFEDA-RPGVKKVALVITDGQTDSrDKEKLTEVVKNASDTNVEIFVIGVvkKNDPNFEIFHKemnlIATDP 956
Cdd:cd00198    83 GAALRLALELLKSAkRPNARRVIILLTDGEPND-GPELLAEAARELRKLGITVYTIGI--GDDANEDELKE----IADKT 155

                  ....*.
gi 767945408  957 EHVYQF 962
Cdd:cd00198   156 TGGAVF 161
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
478-759 1.13e-27

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 117.70  E-value: 1.13e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  478 GSKGEVGQMGPTGPRGPVGI-GVQGPKGEPGSIGLPGQPGVPGEDGAAGKKGEAGLPGARgpegppgkgqpgpkgdeGKK 556
Cdd:NF038329  117 GEKGEPGPAGPAGPAGEQGPrGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPA-----------------GKD 179
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  557 GSKGNQGQRGLPGPEGPKGEPGIMGPFGMPGtsipgPPGPKGDRGGPGIPGFKGEPGLSIRGPKGVQGPRGPVGAPGLKG 636
Cdd:NF038329  180 GEAGAKGPAGEKGPQGPRGETGPAGEQGPAG-----PAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDG 254
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  637 dgypgvpgprglpgppgpmglrGVGDTGAKGEPGVRGPpgpsgprgvgtQGPKGDTGQKGLPGPPgppgygsqGIKGEQG 716
Cdd:NF038329  255 ----------------------PAGKDGPRGDRGEAGP-----------DGPDGKDGERGPVGPA--------GKDGQNG 293
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*..
gi 767945408  717 PQGFPGPKGTMGH----GLPGQKGEHGERGDVGKKGDKGEIGEPGSP 759
Cdd:NF038329  294 KDGLPGKDGKDGQngkdGLPGKDGKDGQPGKDGLPGKDGKDGQPGKP 340
vWFA_subfamily_ECM cd01450
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
47-209 6.05e-27

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains


Pssm-ID: 238727 [Multi-domain]  Cd Length: 161  Bit Score: 108.15  E-value: 6.05e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   47 IDIVFIVDSSESSKIALFDKQKDFVDSLSDKIfqltpgRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKSMNL 126
Cdd:cd01450     1 LDIVFLLDGSESVGPENFEKVKDFIEKLVEKL------DIGPDKTRVGLVQYSDDVRVEFSLNDYKSKDDLLKAVKNLKY 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  127 I-GQGTFSYYAISNATRLL--KREGRKDGVKVVLLMTDGIDHpKNPDVQSISEDARISGISFITIALSTvVNEAKLRLIS 203
Cdd:cd01450    75 LgGGGTNTGKALQYALEQLfsESNARENVPKVIIVLTDGRSD-DGGDPKEAAAKLKDEGIKVFVVGVGP-ADEEELREIA 152

                  ....*.
gi 767945408  204 GDSSSE 209
Cdd:cd01450   153 SCPSER 158
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
798-965 1.79e-25

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 103.91  E-value: 1.79e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  798 ELVFVIDSSESVGPENFQIIKNFVKTMADrvALDLATA--RIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGT 875
Cdd:cd01482     2 DIVFLVDGSWSIGRSNFNLVRSFLSSVVE--AFEIGPDgvQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNT 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  876 YTATALQ--AANDMFEDA--RPGVKKVALVITDGQTdSRDKEKLTEVVKNASdtnVEIFVIGvVKKNDPNfeifhkEMNL 951
Cdd:cd01482    80 RTGKALThvREKNFTPDAgaRPGVPKVVILITDGKS-QDDVELPARVLRNLG---VNVFAVG-VKDADES------ELKM 148
                         170
                  ....*....|....*.
gi 767945408  952 IATDP--EHVYQFDDF 965
Cdd:cd01482   149 IASKPseTHVFNVADF 164
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
260-533 1.81e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 111.15  E-value: 1.81e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  260 GIKGERGPKGNPGNA-QKGEAGERGPGGIPGYKGDKGERGECGKPGIKGDKGSPGPYGPKGPRGIQGI------TGPPGD 332
Cdd:NF038329  117 GEKGEPGPAGPAGPAgEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAkgpageKGPQGP 196
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  333 PGPKGFQGNKGEPGPPGPYgspgapgigqqgikGERGQEGRPGAPGPIGVGEPGQPGPRGPEGVPGERGLPGEgfPGPKG 412
Cdd:NF038329  197 RGETGPAGEQGPAGPAGPD--------------GEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGP--AGKDG 260
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  413 EKGSEGPTGPQGLqglsiKGEKGDIGPVGPQGPmgipgigsqgeqgiqgpigppgpqgpagqglPGSKGEVGQMGPTGPR 492
Cdd:NF038329  261 PRGDRGEAGPDGP-----DGKDGERGPVGPAGK-------------------------------DGQNGKDGLPGKDGKD 304
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|.
gi 767945408  493 GPvgigvQGPKGEPGSIGLPGQPGVPGEDGAAGKKGEAGLP 533
Cdd:NF038329  305 GQ-----NGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKP 340
VWA pfam00092
von Willebrand factor type A domain;
48-226 1.02e-24

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 101.97  E-value: 1.02e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408    48 DIVFIVDSSESSKIALFDKQKDFVDSLSDkifQLTPGrslEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKSMNLI 127
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVE---SLDIG---PDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYL 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   128 GQGTFSY-YAISNATRLL---KREGRKDGVKVVLLMTDGidHPKNPDVQSISEDARISGISFITIALSTVVNEAkLRLIS 203
Cdd:pfam00092   75 GGGTTNTgKALKYALENLfssAAGARPGAPKVVVLLTDG--RSQDGDPEEVARELKSAGVTVFAVGVGNADDEE-LRKIA 151
                          170       180
                   ....*....|....*....|...
gi 767945408   204 GDSSSEPTLLLSDPTLVDKIQDR 226
Cdd:pfam00092  152 SEPGEGHVFTVSDFEALEDLQDQ 174
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
48-227 8.00e-23

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 96.75  E-value: 8.00e-23
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408     48 DIVFIVDSSESSKIALFDKQKDFVDSLsdkifqLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKSMNLI 127
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKL------VEQLDIGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYK 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408    128 -GQGTFSYYAISNATRLLKRE---GRKDGVKVVLLMTDGIDHPKNPDVQSISEDARISGISFITIALSTVVNEAKLRLIS 203
Cdd:smart00327   75 lGGGTNLGAALQYALENLFSKsagSRRGAPKVVILITDGESNDGPKDLLKAAKELKRSGVKVFVVGVGNDVDEEELKKLA 154
                           170       180
                    ....*....|....*....|....
gi 767945408    204 GDSSSEPTLLlsdPTLVDKIQDRL 227
Cdd:smart00327  155 SAPGGVYVFL---PELLDLLIDLL 175
vWFA cd00198
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
47-208 1.87e-22

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.


Pssm-ID: 238119 [Multi-domain]  Cd Length: 161  Bit Score: 95.33  E-value: 1.87e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   47 IDIVFIVDSSESSKIALFDKQKDFVDSLSDKIfqltpgRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKSMNL 126
Cdd:cd00198     1 ADIVFLLDVSGSMGGEKLDKAKEALKALVSSL------SASPPGDRVGLVTFGSNARVVLPLTTDTDKADLLEAIDALKK 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  127 IGQGTFSYY-AISNATRLLKREGRKDGVKVVLLMTDGIDHPKNPDVQSISEDARISGISFITIALSTVVNEAKLRLISGD 205
Cdd:cd00198    75 GLGGGTNIGaALRLALELLKSAKRPNARRVIILLTDGEPNDGPELLAEAARELRKLGITVYTIGIGDDANEDELKEIADK 154

                  ...
gi 767945408  206 SSS 208
Cdd:cd00198   155 TTG 157
Kunitz_papilin cd22635
Kunitz domain of papilin, and similar proteins; This model includes the Kunitz domain found in ...
1081-1122 1.85e-21

Kunitz domain of papilin, and similar proteins; This model includes the Kunitz domain found in human and mouse papilin, and similar proteins. Papilin is an extracellular matrix glycoprotein that has been found in many organisms to be involved in thin matrix layers during gastrulation, matrix associated with wandering, phagocytic hemocytes, basement membranes and space-filling matrix during Drosophila development. It is a multidomain protein that primarily occurs in basement membranes. Papilins interact with several extracellular matrix components and ADAMTS enzymes, influences cell rearrangements and may modulate metalloproteinases during organogenesis. Papilins exist in mammals and invertebrates as a set of related, though not necessarily identical proteins. Mammalian papilin contains a single Kunitz domain, while other papilins such as that from Caenorhabditis elegans, contains multiple Kunitz domains. These domains are similar to Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor) that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438678  Cd Length: 52  Bit Score: 88.47  E-value: 1.85e-21
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 767945408 1081 CGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22635    11 CGDYVQRWYYDPATGACNRFWYGGCGGNANRFATEAECLRTC 52
Kunitz_collagen_alpha3_VI cd22629
Kunitz-type domain from the alpha3 chain of human type VI collagen, and similar proteins; This ...
1079-1122 1.92e-21

Kunitz-type domain from the alpha3 chain of human type VI collagen, and similar proteins; This model includes the Kunitz-type domain from the alpha3 chain of type VI collagen (collagen alpha 3(VI) chain), encoded by COL6A3 gene. Collagen VI is a widely expressed member of the triple helix-containing protein superfamily of collagens and forms beaded microfibrils that anchor large interstitial structures. Immediately after fibril formation, the Kunitz domain can be cleaved off. Mutations in the alpha1, alpha2, and alpha3 chains of collagen VI cause myopathies ranging from the severe Ullrich congenital muscular dystrophy to the milder Bethlem myopathy, including intermediate forms. Early onset isolated dystonia, a neurological disease, has been shown to be caused by mutations in the alpha3 chain. Findings also indicated potential associations between COL6A3 polymorphisms and lung cancer risk. This domain is similar to that of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438672  Cd Length: 53  Bit Score: 88.58  E-value: 1.92e-21
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 767945408 1079 GNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22629    10 GTCRDFVLKWYYDPETKSCARFWYGGCGGNENRFDSQEECEKVC 53
KU smart00131
BPTI/Kunitz family of serine protease inhibitors; Serine protease inhibitors. One member of ...
1070-1122 3.04e-20

BPTI/Kunitz family of serine protease inhibitors; Serine protease inhibitors. One member of the family is encoded by an alternatively-spliced form of Alzheimer's amyloid beta-protein.


Pssm-ID: 197529  Cd Length: 53  Bit Score: 85.01  E-value: 3.04e-20
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 767945408   1070 PRCLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:smart00131    1 DVCLLPPDTGPCGGSIPRYYYDPETGTCEPFTYGGCGGNANNFESLEECERTC 53
Kunitz-type cd00109
Kunitz/Bovine pancreatic trypsin inhibitor (BPTI) domain; This family contains the Kunitz ...
1072-1122 3.04e-20

Kunitz/Bovine pancreatic trypsin inhibitor (BPTI) domain; This family contains the Kunitz domain which is a common structural fold found in a family of reversible serine protease inhibitors. This domain is thought to have evolved over 500 million years and is ubiquitous in all kingdoms of life and has been incorporated into many different genes. In general, each domain is encoded by a single exon. Some genes encode proteins with a single Kunitz domain, e.g. bovine pancreatic trypsin inhibitor (BPTI), trophoblast Kunitz domain protein (TKDP), amyloid beta-protein precursor (ABPP), as well as Kunitz-type venom peptides such as dendrotoxin. Genes that encode multiple Kunitz domains include hepatocyte growth factor activator inhibitors HAI1 and HAI2 (two domains), tissue factor pathway inhibitor TFPI1 and TFPI2 (three domains) and Caenorhabditis elegans papilin (eleven domains). In addition, the Kunitz domain has been integrated into multi-domain proteins, e.g. the collagen alpha3(VI), alpha1(VII) and alpha1(XXVIII) chains, WFIKKN1 (containing WAP, Follistatin/Kazal, Immunoglobulin, two Kunitz and NTR domains) and papilin. Furthermore, each domain within a multi-Kunitz domain protein may exhibit different protease activity, such as for the three tandemly repeated domains within both tissue factor pathway inhibitors 1 and 2. The Kunitz domain is a representative of alpha/beta proteins with irregular secondary structure stabilized by three disulfide bonds and presenting three peptide loops that can be varied without introducing much destabilization to the scaffold. Protease inhibitors meet the scaffold criteria in that they are small, stable and capable of evolving the binding activity of exposed peptide loops through targeted randomization to construct combinatorial libraries. Kunitz domain-based scaffolds have been successfully utilized to construct and select a library of protease inhibitors with the potential for therapeutic application.


Pssm-ID: 438633  Cd Length: 51  Bit Score: 84.91  E-value: 3.04e-20
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd00109     1 CLLPPDPGPCRAYFPRWYYNSETGQCEEFIYGGCGGNANNFETKEECEATC 51
Kunitz_BPTI pfam00014
Kunitz/Bovine pancreatic trypsin inhibitor domain; Indicative of a protease inhibitor, usually ...
1071-1122 1.84e-19

Kunitz/Bovine pancreatic trypsin inhibitor domain; Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure. Certain family members are similar to the tick anticoagulant peptide (TAP). This is a highly selective inhibitor of factor Xa in the blood coagulation pathways. TAP molecules are highly dipolar, and are arranged to form a twisted two- stranded antiparallel beta-sheet followed by an alpha helix.


Pssm-ID: 425421  Cd Length: 53  Bit Score: 82.69  E-value: 1.84e-19
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 767945408  1071 RCLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:pfam00014    1 ICSLPPDSGPCKASIPRWYYNPTTGTCEPFTYGGCGGNANNFESLEECESTC 52
Kunitz_papilin_lacunin-like cd22639
Drosophila melanogaster Kunitz domain 1, Manduca sexta lacunin Kunitz domain 1, and simialr ...
1078-1123 1.88e-19

Drosophila melanogaster Kunitz domain 1, Manduca sexta lacunin Kunitz domain 1, and simialr proteins; This model includes Drosophila melanogaster Kunitz domain 1 of papilin and Manduca sexta Kunitz domain 1 of lacunin, and similar proteins. D. melanogaster papilin is an essential extracellular matrix (ECM) protein that influences cell rearrangements. It may act by modulating metalloproteinase action during organogenesis and is able to non-competitively inhibit procollagen N-proteinase, an ADAMTS metalloproteinase. M. sexta lacunin is a large multidomain ECM containing several domains including several Kunitz-type protease inhibitors, thrombospondin type I, immunoglobulin-like and others. It exerts multiple effects on a variety of cell behaviors associated with the complex phenomenon of epithelial morphogenesis. These domains are similar to Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor) that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438681  Cd Length: 52  Bit Score: 82.62  E-value: 1.88e-19
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 767945408 1078 PGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETCI 1123
Cdd:cd22639     7 RGPCRNYTVKWYFDMAYGGCSRFWYGGCGGNGNRFDTEEECKAVCV 52
Kunitz_collagen_alpha6_VI cd22630
Kunitz-type domain from the alpha6 chain of human type VI collagen, and similar proteins; This ...
1079-1124 2.39e-19

Kunitz-type domain from the alpha6 chain of human type VI collagen, and similar proteins; This model includes the Kunitz-type domain from the alpha6 chain of type VI collagen (collagen alpha 6(VI) chain), encoded by COL6A6 gene, and similar proteins. Collagen VI is a widely expressed member of the triple helix-containing protein superfamily of collagens and forms beaded microfibrils that anchor large interstitial structures. Immediately after fibril formation, the Kunitz domain can be cleaved off. This domain is similar to that of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor) that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438673  Cd Length: 55  Bit Score: 82.65  E-value: 2.39e-19
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 767945408 1079 GNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETCIQ 1124
Cdd:cd22630    10 GECQNYVLKWYYDQEQKECSQFWYGGCGGNKNRFETQEECEALCVK 55
vWA_collagen_alpha_1-VI-type cd01480
VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable ...
45-208 4.89e-19

VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238757 [Multi-domain]  Cd Length: 186  Bit Score: 86.29  E-value: 4.89e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   45 CFIDIVFIVDSSESSKIALFDKQKDFVDSLSDKI--FQLTPGRSLeyDIKLAALQFSSSVQIDPPF-SSWKDLQTFKQKV 121
Cdd:cd01480     1 GPVDITFVLDSSESVGLQNFDITKNFVKRVAERFlkDYYRKDPAG--SWRVGVVQYSDQQEVEAGFlRDIRNYTSLKEAV 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  122 KSMNLIGQGTFSYYAISNATRLLKREGRKDGVKVVLLMTDGiDHPKNPD--VQSISEDARISGISFITIALSTVVNEaKL 199
Cdd:cd01480    79 DNLEYIGGGTFTDCALKYATEQLLEGSHQKENKFLLVITDG-HSDGSPDggIEKAVNEADHLGIKIFFVAVGSQNEE-PL 156

                  ....*....
gi 767945408  200 RLISGDSSS 208
Cdd:cd01480   157 SRIACDGKS 165
Kunitz_collagen_alpha6_VI-like cd22631
Kunitz-type domain from the alpha6 chain of fish type VI collagen, and similar proteins; This ...
1072-1122 5.34e-18

Kunitz-type domain from the alpha6 chain of fish type VI collagen, and similar proteins; This model includes the Kunitz-type domain from the alpha6 chain of type VI collagen (collagen alpha 6(VI) chain) and similar proteins. Collagen VI is a widely expressed member of the triple helix-containing protein superfamily of collagens and forms beaded microfibrils that anchor large interstitial structures. Immediately after fibril formation, the Kunitz domain can be cleaved off. This domain is similar to that of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor) that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438674 [Multi-domain]  Cd Length: 51  Bit Score: 78.42  E-value: 5.34e-18
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22631     1 CLLGQDAGSCQNYTMMWFFDSKQGRCSRFWYGGCGGNANRFETQEECENLC 51
Kunitz_papilin_mig6-like cd22637
Drosophila melanogaster Kunitz domains 5, 6, 7, and Caenorhabditis elegans Kunitz domain 5 of ...
1072-1122 4.30e-17

Drosophila melanogaster Kunitz domains 5, 6, 7, and Caenorhabditis elegans Kunitz domain 5 of papilin, and similar domains; This model includes Kunitz domains from papilins with multiple Kunitz domains, such as Drosophila melanogaster Kunitz domains 5, 6, 7, and Caenorhabditis elegans Kunitz domain 5 of papilin, among others. Papilins are essential for embryonic development. D. melanogaster papilin is an essential extracellular matrix (ECM) protein that influences cell rearrangements. It may act by modulating metalloproteinases action during organogenesis and is able to non-competitively inhibit procollagen N-proteinase, an ADAMTS metalloproteinase. C. elegans papilin (also called abnormal cell migration protein 6) mig-6 encodes long (MIG-6L) and short (MIG-6S) isoforms of the extracellular matrix protein papilin, each required for distinct aspects of distal tip cell (DTC) migration and both isoforms have an N-terminal papilin cassette, lagrin repeats and six C-terminal Kunitz-type serine proteinase inhibitory domains. It plays a role in embryogenesis, the second phase of distal cell tip migration and is required for distribution of the metalloproteinase, mig-17, during organogenesis. These domains are similar to Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor) that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438679  Cd Length: 51  Bit Score: 75.86  E-value: 4.30e-17
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22637     1 CDQPKDTGPCDNWVLKWYYDSKKGSCRQFYYGGCGGNDNRFDTEEECEARC 51
VWA_2 pfam13519
von Willebrand factor type A domain;
799-902 2.03e-16

von Willebrand factor type A domain;


Pssm-ID: 463909 [Multi-domain]  Cd Length: 103  Bit Score: 75.79  E-value: 2.03e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   799 LVFVIDSSES-----VGPENFQIIKNFVKTMADRvaldLATARIGIINYSHKVEKVANLKqfSSKDDFKLAVDNMQYLGE 873
Cdd:pfam13519    1 LVFVLDTSGSmrngdYGPTRLEAAKDAVLALLKS----LPGDRVGLVTFGDGPEVLIPLT--KDRAKILRALRRLEPKGG 74
                           90       100
                   ....*....|....*....|....*....
gi 767945408   874 GTYTATALQAANDMFEDARPGVKKVALVI 902
Cdd:pfam13519   75 GTNLAAALQLARAALKHRRKNQPRRIVLI 103
vWA_collagen cd01472
von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This ...
47-209 2.03e-16

von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.


Pssm-ID: 238749 [Multi-domain]  Cd Length: 164  Bit Score: 78.04  E-value: 2.03e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   47 IDIVFIVDSSESSKIALFDKQKDFVDSLSDkifQLTPGRSleyDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKSMNL 126
Cdd:cd01472     1 ADIVFLVDGSESIGLSNFNLVKDFVKRVVE---RLDIGPD---GVRVGVVQYSDDPRTEFYLNTYRSKDDVLEAVKNLRY 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  127 IGQGTFSYYAISNATRLL--KREGRKDGV-KVVLLMTDGidhpKNPDvqSISEDA---RISGISFITIALSTVVNEaKLR 200
Cdd:cd01472    75 IGGGTNTGKALKYVRENLftEASGSREGVpKVLVVITDG----KSQD--DVEEPAvelKQAGIEVFAVGVKNADEE-ELK 147

                  ....*....
gi 767945408  201 LISGDSSSE 209
Cdd:cd01472   148 QIASDPKEL 156
vWA_collagen_alpha3-VI-like cd01481
VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable ...
798-965 2.53e-16

VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238758  Cd Length: 165  Bit Score: 77.75  E-value: 2.53e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  798 ELVFVIDSSESVGPENFQIIKNFVKTMADRVALDLATARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYL-GEGTY 876
Cdd:cd01481     2 DIVFLIDGSDNVGSGNFPAIRDFIERIVQSLDVGPDKIRVAVVQFSDTPRPEFYLNTHSTKADVLGAVRRLRLRgGSQLN 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  877 TATALQ-AANDMFEDA-----RPGVKKVALVITDGQTDSrDKEKLTEVVKNASdtnVEIFVIGvVKKNDPNfeifhkEMN 950
Cdd:cd01481    82 TGSALDyVVKNLFTKSagsriEEGVPQFLVLITGGKSQD-DVERPAVALKRAG---IVPFAIG-ARNADLA------ELQ 150
                         170
                  ....*....|....*
gi 767945408  951 LIATDPEHVYQFDDF 965
Cdd:cd01481   151 QIAFDPSFVFQVSDF 165
Kunitz_collagen_alpha1_VII cd22627
Kunitz-type domain from the alpha1 chain of type VII collagen, and similar proteins; This ...
1069-1122 9.17e-16

Kunitz-type domain from the alpha1 chain of type VII collagen, and similar proteins; This model includes the Kunitz-type domain from the alpha1 chain of type VII collagen (collagen alpha-1(VII) chain also called long-chain collagen or LC collagen) and similar proteins. LC collagen, encoded by the COL7A1 gene, is a stratified squamous epithelial basement membrane protein that forms anchoring fibrils which may contribute to epithelial basement membrane organization and adherence by interacting with extracellular matrix (ECM) proteins such as type IV collagen. So far, over 800 COL7A1 mutations have been reported, including missense, nonsense, splicing, insertion, and deletion mutations which to varying degrees leads to deficiency of type VII collagen. Epidermolysis bullosa acquisita (EBA) is an autoimmune acquired blistering skin disease resulting from autoantibodies to type VII collagen. The COL7A1 protein contains a Kunitz domain, the deactivation of which induces tumorigenesis. This domain is similar to that of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438670  Cd Length: 53  Bit Score: 72.28  E-value: 9.17e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 767945408 1069 DPrCLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22627     1 DP-CLLPMDEGSCSDYTLLWYYHQKAGECRPFVYGGCGGNANRFSSKEDCELRC 53
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
793-934 2.25e-15

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 77.29  E-value: 2.25e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  793 KETPLELVFVIDSSESVGPEN-FQIIKNFVKTMADRValdLATARIGIINYSHKVEKVANLKqfSSKDDFKLAVDNMQyL 871
Cdd:COG1240    89 PQRGRDVVLVVDASGSMAAENrLEAAKGALLDFLDDY---RPRDRVGLVAFGGEAEVLLPLT--RDREALKRALDELP-P 162
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767945408  872 GEGTYTATALQAANDMFEDARPGVKKVALVITDGQtDSRDKEKLTEVVKNASDTNVEIFVIGV 934
Cdd:COG1240   163 GGGTPLGDALALALELLKRADPARRKVIVLLTDGR-DNAGRIDPLEAAELAAAAGIRIYTIGV 224
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
257-445 7.84e-15

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 78.41  E-value: 7.84e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  257 GNPGIKGERGPKGNPGNAQKGEAGERGPGGIPGYKGDKGERGECGKPGIKGDKGSPGPYGPKGPRGIQGITGPPGDPGPK 336
Cdd:NF038329  210 GPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKD 289
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  337 GFQGNKgepgppgpygspgapgiGQQGIKGERGQEGRPGAPGPigVGEPGQPGPRGPEGVPGERGLPGEgfPGPKGEKGS 416
Cdd:NF038329  290 GQNGKD-----------------GLPGKDGKDGQNGKDGLPGK--DGKDGQPGKDGLPGKDGKDGQPGK--PAPKTPEVP 348
                         170       180       190
                  ....*....|....*....|....*....|
gi 767945408  417 EGP-TGPQGLQGLSIKGEKGDIGPvGPQGP 445
Cdd:NF038329  349 QKPdTAPHTPKTPQIPGQSKDVTP-APQNP 377
Kunitz_amblin-like cd22638
Caenorhabditis elegans Kunitz domain 11 of papilin (also called abnormal cell migration ...
1072-1122 1.80e-14

Caenorhabditis elegans Kunitz domain 11 of papilin (also called abnormal cell migration protein 6 or mig-6), Amblyomma hebraeum amblin domain 1, and similar proteins; This model includes Caenorhabditis elegans Kunitz domain 11 of papilin (also called abnormal cell migration protein 6 or mig-6) and domain 1 of Amblyomma hebraeum amblin, and similar proteins. C. elegans papilin (also called abnormal cell migration protein 6) mig-6 encodes long (MIG-6L) and short (MIG-6S) isoforms of the extracellular matrix protein papilin, each required for distinct aspects of distal tip cell (DTC) migration and both isoforms have an N-terminal papilin cassette, lagrin repeats and six C-terminal Kunitz-type serine proteinase inhibitory domains. It plays a role in embryogenesis, the second phase of distal cell tip migration and is required for distribution of the metalloproteinase, mig-17, during organogenesis. Amblin contains two Kunitz-like domains and specifically inhibits thrombin. These domains are similar to Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438680  Cd Length: 51  Bit Score: 68.57  E-value: 1.80e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22638     1 CTLKPETGPCRAYIEKWYYDPSTQSCKTFIYGGCGGNGNRFDSEEDCQETC 51
Kunitz_WFIKKN_1-like cd22605
first Kunitz domain of WAP, Kazal, immunoglobulin, Kunitz and NTR domain-containing proteins; ...
1072-1122 4.84e-14

first Kunitz domain of WAP, Kazal, immunoglobulin, Kunitz and NTR domain-containing proteins; This subfamily includes WAP, Kazal, immunoglobulin, Kunitz and NTR domain-containing protein 1 (WFIKKN1, WFKN1), WFIKKN2 (WFKN2), and similar proteins. WFIKKN proteins are protease inhibitors that contain two distinct Kunitz-type protease inhibitor domains. They may have serine protease- and metalloprotease-inhibitor activity. This model represents the first Kunitz domain that is similar to Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor) that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438648  Cd Length: 52  Bit Score: 67.39  E-value: 4.84e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22605     2 CLKEPDREDCGEEQVRWYFDAKRGNCFTFTYGGCDGNRNHFETYEECRLAC 52
VWA_integrin_invertebrates cd01476
VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have ...
48-205 1.00e-13

VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.


Pssm-ID: 238753 [Multi-domain]  Cd Length: 163  Bit Score: 70.12  E-value: 1.00e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   48 DIVFIVDSSESSKiALFDKQKDFVDSLSDKIfQLTPgrslEYDiKLAALQFSSSVQ--IDPPFSSWKDLQTFKQKVKSMN 125
Cdd:cd01476     2 DLLFVLDSSGSVR-GKFEKYKKYIERIVEGL-EIGP----TAT-RVALITYSGRGRqrVRFNLPKHNDGEELLEKVDNLR 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  126 LIGQGTFSYYAISNATRLLKR-EGRKDGV-KVVLLMTDGIDHpknPDVQSISEDARiSGISFITIALST----VVNEAKL 199
Cdd:cd01476    75 FIGGTTATGAAIEVALQQLDPsEGRREGIpKVVVVLTDGRSH---DDPEKQARILR-AVPNIETFAVGTgdpgTVDTEEL 150

                  ....*.
gi 767945408  200 RLISGD 205
Cdd:cd01476   151 HSITGN 156
VWA_integrin_invertebrates cd01476
VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have ...
797-960 1.07e-13

VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.


Pssm-ID: 238753 [Multi-domain]  Cd Length: 163  Bit Score: 70.12  E-value: 1.07e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  797 LELVFVIDSSESVGPEnFQIIKNFVKTMADRVALDLATARIGIINYS-HKVEKVA-NLKQFSSKDDFKLAVDNMQYLGEG 874
Cdd:cd01476     1 LDLLFVLDSSGSVRGK-FEKYKKYIERIVEGLEIGPTATRVALITYSgRGRQRVRfNLPKHNDGEELLEKVDNLRFIGGT 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  875 TYTATALQAANDMFED---ARPGVKKVALVITDGQTdSRDKEKLTEVVKnaSDTNVEIFVIGVVKKNDPNfeifHKEMNL 951
Cdd:cd01476    80 TATGAAIEVALQQLDPsegRREGIPKVVVVLTDGRS-HDDPEKQARILR--AVPNIETFAVGTGDPGTVD----TEELHS 152

                  ....*....
gi 767945408  952 IATDPEHVY 960
Cdd:cd01476   153 ITGNEDHIF 161
Kunitz_HAI1_2-like cd22624
Kunitz domain 2 of hepatocyte growth factor activator inhibitor-1 (HAI1); This model includes ...
1071-1122 1.43e-13

Kunitz domain 2 of hepatocyte growth factor activator inhibitor-1 (HAI1); This model includes Kunitz domain 2 (KD2) of hepatocyte growth factor activator inhibitor type 1 (HAI-1 or HAI1, also known as Kunitz-type protease inhibitor 1), a membrane-bound multidomain protein essential to the integrity of the basement membrane during placental development. HAI-1 contains an extracellular region and several internal domains that include two Kunitz domains separated in sequence but spatially closed to each other, and their interdomain interactions have evolved to stimulate the inhibitory activity of an integrated Kunitz. While the Kunitz domain 1 (KD1) is the major inhibitory domain of HAI-1 and involved in auto-inhibition of the extracellular region via steric blockage of its active site in the HAI-1 compact tertiary structure, studies show that deletion of HAI-1 Kunitz domain 2 (KD2) and the extracellular region enhanced inhibition of matriptase. HAI-1 KD2 has been shown to have potent inhibitory activity against trypsin, but it cannot inhibit hepatocyte growth factor activator (HGFA), and matriptase. HAI-1 is also important in maintaining postnatal homeostasis in many tissues, including keratinization of the epidermis, hair development, colonic epithelium integrity, proliferation and cell fate of neural progenitor cells, and tissue injury and repair. The interaction between HAI-1 and matriptase is critical for tissue morphogenesis and cellular biology. HAI-1:matriptase ratio imbalance results in tumorigenesis; slight overexpression of matriptase relative to HAI-1 causes spontaneous squamous cell carcinoma, a phenotype that can be effectively reversed back to wild type by additional expression of HAI-1, indicating the need for a tight functional relationship between the two to maintain homeostasis. The structure of KD2 is similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438667  Cd Length: 61  Bit Score: 66.39  E-value: 1.43e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 767945408 1071 RCLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22624     1 RCTEPPVTGPCRASFTRWYYDPLSRKCHRFTYGGCDGNENNFETEDECMETC 52
Kunitz_conkunitzin cd22593
conkunitzin-S1 and -S2, and similar proteins; This model includes Kunitz-type conkunitzin-S1 ...
1072-1122 9.44e-13

conkunitzin-S1 and -S2, and similar proteins; This model includes Kunitz-type conkunitzin-S1 (Cs1) and -S2 (Cs2). Conkunitzins are pore-modulating toxins that block voltage-dependent potassium channels (Kvs) by exploiting inherent slow inactivation to block K+ channels. Cs1 binds to the channel turrets and disrupts the structural water hydrogen-bonding network, exposing the peripheral water pockets of ion channels and triggering an asymmetric collapse of the pore. Conus bullatus conkunitzin-B1, expressed in the venom duct, specifically blocks voltage-activated potassium channels (Kv) of the Shaker family. Members of this subfamily contain 2 disulfide bonds instead of the 3 present in most Kunitz domain proteins.


Pssm-ID: 438636  Cd Length: 51  Bit Score: 63.78  E-value: 9.44e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22593     1 CSLPLDEGSGNSSLTRWYYDPKKGQCKPFTYKGKGGNENNFLTKEDCEETC 51
Kunitz_TFPI2_1-like cd22616
Kunitz domain 1 (KD1) of tissue factor pathway inhibitor 2 (TFPI2) and similar proteins; This ...
1072-1122 1.27e-12

Kunitz domain 1 (KD1) of tissue factor pathway inhibitor 2 (TFPI2) and similar proteins; This model represents the Kunitz-type domain 1 (KD1) of tissue factor pathway inhibitor 2 (TFPI2 or TFPI-2) and similar proteins. TFPI2 exhibits inhibitory activity primarily toward trypsin, plasmin, and factor VIIa (FVIIa)/tissue factor (TF) via its KD1. It is believed to be the major inhibitor of plasmin in the extracellular matrix (ECM) but has little inhibitory activity toward urokinase-type plasminogen activator, tissue-type plasminogen activator, or thrombin. TFPI2 specifically inhibits the proteases via the P1 arginine residue in KD1. The TFPI2 domains KD2 and KD3 appear to have no discernible inhibitory activity and may serve to bind to nearby proteins to localize TFPI2 in the ECM. Structure studies of KD1 complexed with proteases may help in the development of specific and potent KD1 domain protein that may have a large pharmacologic impact in preventing tumor metastasis, retinal degeneration, and degradation of collagen in the ECM. The structure of this domain is similar to that of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438659  Cd Length: 57  Bit Score: 63.41  E-value: 1.27e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22616     5 CLLPPDEGPCRALIPRYYYDRYTQTCREFSYGGCEGNANNFESLEDCEKTC 55
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
47-203 1.96e-12

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 68.81  E-value: 1.96e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   47 IDIVFIVDSSES----SKIalfDKQKDFVDSLSDkifQLTPGRsleydiKLAALQFSSSVQIDPPFSSwkDLQTFKQKVK 122
Cdd:COG1240    93 RDVVLVVDASGSmaaeNRL---EAAKGALLDFLD---DYRPRD------RVGLVAFGGEAEVLLPLTR--DREALKRALD 158
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  123 SMNlIGQGTFSYYAISNATRLLKREgRKDGVKVVLLMTDGIDHPKNPDVQSISEDARISGISFITIALST-VVNEAKLRL 201
Cdd:COG1240   159 ELP-PGGGTPLGDALALALELLKRA-DPARRKVIVLLTDGRDNAGRIDPLEAAELAAAAGIRIYTIGVGTeAVDEGLLRE 236

                  ..
gi 767945408  202 IS 203
Cdd:COG1240   237 IA 238
vWA_complement_factors cd01470
Complement factors B and C2 are two critical proteases for complement activation. They both ...
802-971 2.13e-12

Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains.


Pssm-ID: 238747 [Multi-domain]  Cd Length: 198  Bit Score: 67.31  E-value: 2.13e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  802 VIDSSESVGPENFQIIKNFVKTMADRVALDLATARIGIINYSHKVEKVANLKQFSS--KDDFKLAVDNMQY----LGEGT 875
Cdd:cd01470     6 ALDASDSIGEEDFDEAKNAIKTLIEKISSYEVSPRYEIISYASDPKEIVSIRDFNSndADDVIKRLEDFNYddhgDKTGT 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  876 YTATALQAANDM-------FEDARPGVKKVALVITDGQT-----------DSRDKEKLTEVVKNASDTNVEIFVIGVVKk 937
Cdd:cd01470    86 NTAAALKKVYERmalekvrNKEAFNETRHVIILFTDGKSnmggsplptvdKIKNLVYKNNKSDNPREDYLDVYVFGVGD- 164
                         170       180       190
                  ....*....|....*....|....*....|....
gi 767945408  938 ndpnfEIFHKEMNLIATDPEHVYQfddFFTLQDT 971
Cdd:cd01470   165 -----DVNKEELNDLASKKDNERH---FFKLKDY 190
vWA_Matrilin cd01475
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and ...
47-243 2.41e-12

VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.


Pssm-ID: 238752 [Multi-domain]  Cd Length: 224  Bit Score: 67.80  E-value: 2.41e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   47 IDIVFIVDSSESSKIALFDKQKDFVDSLSDKIfqltpgrsleyDIKLAA-----LQFSSSVQIDPPFSSWKDLQTFKQKV 121
Cdd:cd01475     3 TDLVFLIDSSRSVRPENFELVKQFLNQIIDSL-----------DVGPDAtrvglVQYSSTVKQEFPLGRFKSKADLKRAV 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  122 KSMNLIGQGTFSYYAISNATRLL------KREGRKDGVKVVLLMTDGidHPKNpDVQSISEDARISGISFITIALSTVVn 195
Cdd:cd01475    72 RRMEYLETGTMTGLAIQYAMNNAfseaegARPGSERVPRVGIVVTDG--RPQD-DVSEVAAKARALGIEMFAVGVGRAD- 147
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
gi 767945408  196 EAKLRLISGDSSSEPTLLLSDPTLVDKIQDRLDilfEKKCE-RKICECE 243
Cdd:cd01475   148 EEELREIASEPLADHVFYVEDFSTIEELTKKFQ---GKICVvPDLCATL 193
YfbK COG2304
Secreted protein containing bacterial Ig-like domain and vWFA domain [General function ...
795-934 2.55e-12

Secreted protein containing bacterial Ig-like domain and vWFA domain [General function prediction only];


Pssm-ID: 441879 [Multi-domain]  Cd Length: 289  Bit Score: 68.97  E-value: 2.55e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  795 TPLELVFVIDSSESVGPENFQIIKNFVKTMADRvaLDlATARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQyLGEG 874
Cdd:COG2304    90 PPLNLVFVIDVSGSMSGDKLELAKEAAKLLVDQ--LR-PGDRVSIVTFAGDARVLLPPTPATDRAKILAAIDRLQ-AGGG 165
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767945408  875 TYTATALQAANDMFEDA-RPGVKKVALVITDGQTDS--RDKEKLTEVVKNASDTNVEIFVIGV 934
Cdd:COG2304   166 TALGAGLELAYELARKHfIPGRVNRVILLTDGDANVgiTDPEELLKLAEEAREEGITLTTLGV 228
vWA_micronemal_protein cd01471
Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a ...
797-934 1.71e-11

Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.


Pssm-ID: 238748 [Multi-domain]  Cd Length: 186  Bit Score: 64.33  E-value: 1.71e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  797 LELVFVIDSSESVGPEN-FQIIKNFVKTMADRVALDLATARIGIINYSHKVEKVANLKQFSSKD-DFKL----AVDNMQY 870
Cdd:cd01471     1 LDLYLLVDGSGSIGYSNwVTHVVPFLHTFVQNLNISPDEINLYLVTFSTNAKELIRLSSPNSTNkDLALnairALLSLYY 80
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767945408  871 LGEGTYTATALQAANDMFED---ARPGVKKVALVITDGQTDSrdKEKLTEVVKNASDTNVEIFVIGV 934
Cdd:cd01471    81 PNGSTNTTSALLVVEKHLFDtrgNRENAPQLVIIMTDGIPDS--KFRTLKEARKLRERGVIIAVLGV 145
Kunitz_dendrotoxin cd22595
dendrotoxins I, K, B and similar proteins; This group includes toxins isolated from snake ...
1072-1123 3.36e-11

dendrotoxins I, K, B and similar proteins; This group includes toxins isolated from snake venoms, such as dendrotoxins (DTXs) I, K and B, mambaquaretin-1 (MQ-1) and calcicludine. The dendrotoxins have little or no anti-protease activity but have been shown to block certain subtypes of voltage dependent potassium channels in neurons. Dendroaspis angusticeps (green mamba) alpha-dendrotoxin is a neurotoxin that enhances acetylcholine release at neuromuscular junctions. Studies with cloned K(+) channels show that this toxin blocks Kv1.1, Kv1.2 and Kv1.6 channels in the nanomolar range, whereas Dendroaspis polylepis (black mamba) dendrotoxin K preferentially blocks Kv1.1 channels. Also, structural analogs of dendrotoxins have facilitated defining the molecular recognition properties of different types of K(+) channels, and therefore, dendrotoxins are widely used as probes for studying the function of K(+) channels in physiology and pathophysiology. The structures of these toxins are similar to that of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438638  Cd Length: 56  Bit Score: 59.38  E-value: 3.36e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETCI 1123
Cdd:cd22595     4 CKLPVRPGPCKAFISAFYYNWKAKKCHPFTYSGCGGNANRFKTIEECRRTCV 55
Kunitz_TFPI1_2-like cd22614
Kunitz protease inhibitor (KPI) domain 2 (KPI-2 or K2) of tissue factor pathway inhibitor ...
1072-1122 5.02e-11

Kunitz protease inhibitor (KPI) domain 2 (KPI-2 or K2) of tissue factor pathway inhibitor (TFPI); This model represents the second Kunitz-type domain (K2 or KPI-2) of tissue factor pathway inhibitor (TFPI or TFPI1), also known as extrinsic pathway inhibitor (EPI) or lipoprotein-associated coagulation inhibitor (LACI). TFPI down-regulates the extrinsic coagulation pathway via inhibition of activated factor X (FXa or Xa) and FVIIa (VIIa). It inhibits activated FXa via a "slow-tight binding mechanism", i.e. rapid formation of a loose FXa-TFPI complex that then slowly isomerizes to a tight FXa-TFPI* complex. Subsequent inhibition of FVIIa is facilitated by the presence of tissue factor (TF) and FXa, which together rapidly and efficiently form a quaternary FXa-TFPI-TF-FVIIa complex in which the activity of FXa and FVIIa are inhibited. TFPI consists of 3 Kunitz-type protease inhibitor (KPI) domains in a tandem arrangement; the K2 domain is exposed on functionally active TFPI pools in circulation in blood, in platelets, and attached to the endothelium. While the K1 (or KPI-1) domain of TFPI has been shown to bind and inhibit FVIIa, the K2 domain inhibits FXa by binding directly to the active site and forming a FXa:TFPI complex. A close interaction between the TFPI K2 domain and the FXa active site is essential for the FXa inhibitory action of TFPI and for the formation of an inactive TF/FVIIa/FXa/TFPI complex which then prevents FXa generation. Thus, blockage of K2 would prevent TFPI binding to both FXa and FVIIa/TF, and fully abolish TFPI inhibition of the coagulation cascade. The structure of the K2 domain is similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438657  Cd Length: 56  Bit Score: 58.86  E-value: 5.02e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22614     5 CFLEEDPGICRGLITRYFYNNQSKQCERFKYGGCLGNQNNFESLEECQNTC 55
Kunitz_eppin cd22611
Kunitz domain of epididymal protease inhibitor eppin and similar proteins; This subfamily ...
1078-1124 5.60e-11

Kunitz domain of epididymal protease inhibitor eppin and similar proteins; This subfamily includes the Kunitz inhibitor domain protein eppin (also called Cancer/testis antigen 71 or CT71, epididymal protease inhibitor, protease inhibitor WAP7, serine protease inhibitor-like with Kunitz and WAP domains 1, or WAP four-disulfide core domain protein 7) as well as WAP four-disulfide core domain proteins 6A and 6B in mice, and similar proteins. Eppin is a serine protease inhibitor that plays an essential role in male reproduction and fertility. It modulates the hydrolysis of seminal fluid protein semenogelin 1 (SEMG1) by the serine protease kallikrein-related peptidase 3 (KLK3, PSA), provides antimicrobial protection for spermatozoa in the ejaculate coagulum, and binds SEMG1, thereby inhibiting sperm motility. Thus, eppin could potentially be used as a target for male contraception. These domains are similar to Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor) that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438654  Cd Length: 57  Bit Score: 58.95  E-value: 5.60e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 767945408 1078 PGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETCIQ 1124
Cdd:cd22611     9 SGPCMAYFPRWWYDKETNTCSKFIYGGCQGNNNNFQSEAICQNICKK 55
Kunitz_SmCI_3-like cd22603
third Kunitz domain of Carboxypeptidase Inhibitor SmCI and similar domains; This group ...
1072-1122 7.55e-11

third Kunitz domain of Carboxypeptidase Inhibitor SmCI and similar domains; This group includes Sabellastarte magnifica carboxypeptidase inhibitor (SmCI), Bombyx mori cocoon shell-associated trypsin inhibitor (CSTI), Bombus terrestris Kunitz-type serine protease inhibitor Bt-KTI, and similar domains. SmCI is a tri-domain BPTI-Kunitz inhibitor capable of inhibiting serine proteases and A-like metallocarboxypeptidases. While the BPTI-Kunitz family of proteins includes voltage gated channel blockers and inhibitors of serine proteases, SmCI is the only BPTI-Kunitz protein capable of inhibiting metallocarboxypeptidases. Binding studies show that SmCI is able to bind three trypsin molecules under saturating conditions, but only one elastase interacts with the inhibitor. Additionally, SmCI can bind serine proteases and carboxypeptidases at the same time (at least in the ratio 1:1:1), thus becoming the first protease inhibitor that simultaneously blocks these two mechanistic classes of enzymes. CSTI and Bt-KTI are single Kunitz domain proteins that inhibit trypsin; in addition, Bt-KTI also inhibits plasmin. This model contains the third Kunitz domain of SmCI which has a structure similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438646  Cd Length: 53  Bit Score: 58.21  E-value: 7.55e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22603     3 CLLPSETGPCKGSFPRYYYDKETGKCKEFIYGGCQGNANNFETKEECERAC 53
Kunitz_bikunin_2-like cd22597
second Kunitz domain of bikunin and similar proteins; This subfamily includes the C-terminal ...
1072-1122 1.06e-10

second Kunitz domain of bikunin and similar proteins; This subfamily includes the C-terminal domain of bikunin (also known as inter-alpha-trypsin inhibitor light chain (ITI-LC) or urinary trypsin inhibitor), a plasma protease inhibitor, that is associated with inflammation and stabilizes the extracellular matrix. Bikunin is encoded together with alpha-1-microglobulin (A1M) by an alpha-1-microglobulin/bikunin precursor (AMBP) gene that is tightly controlled by several hepatocyte-enriched nuclear (HEN) factors, and cleaved by a furin-like protease that releases the two mature molecules. Bikunin is a Kunitz-type serine protease inhibitor, found in vertebrate serum and urine, modified by a chondroitin sulfate (CS) chain. The structures of these toxins are similar to that of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds. Bikunin contains two Kunitz domains; this model represents the second repeat.


Pssm-ID: 438640  Cd Length: 55  Bit Score: 58.16  E-value: 1.06e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22597     4 CRLPIVPGPCKGFVDLWAFDAVQGKCVPFSYGGCQGNGNKFYSEKECEEYC 54
Kunitz_boophilin_2-like cd22600
second Kunitz domain of Rhipicephalus microplus boophilin and similar proteins; This group ...
1072-1123 2.25e-10

second Kunitz domain of Rhipicephalus microplus boophilin and similar proteins; This group includes venom serine protease inhibitors such as Rhipicephalus microplus and Ixodes scapularis boofilin, among others. Boophilin prevents blood clot formation to allow successful feeding and digestion through its inhibition activity of thrombin and other host anticoagulating factors like kallikrein, coagulation factor VII, or plasmin; it interacts with the host thrombin and trypsin. The structures of these domains are similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds. Rhipicephalus microplus boophilin contains two Kunitz domains; this model represents the second repeat.


Pssm-ID: 438643  Cd Length: 54  Bit Score: 57.05  E-value: 2.25e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETCI 1123
Cdd:cd22600     2 CKPAAESGLCAAYLERWFFNVTTGACETFVYGGCGGNANNYKSQEECELACL 53
TerY COG4245
Uncharacterized conserved protein YegL, contains vWA domain of TerY type [Function unknown];
793-966 3.56e-10

Uncharacterized conserved protein YegL, contains vWA domain of TerY type [Function unknown];


Pssm-ID: 443387 [Multi-domain]  Cd Length: 196  Bit Score: 60.71  E-value: 3.56e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  793 KETPLELVFVIDSSESVGPENFQIIKNFVKTMADRVALD---LATARIGIINYSHKVEkvaNLKQFSSKDDFKLavDNMQ 869
Cdd:COG4245     2 PMRRLPVYLLLDTSGSMSGEPIEALNEGLQALIDELRQDpyaLETVEVSVITFDGEAK---VLLPLTDLEDFQP--PDLS 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  870 YLGeGTYTATALQAANDMFED-------ARPGVKKVALV-ITDGQ-TDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDP 940
Cdd:COG4245    77 ASG-GTPLGAALELLLDLIERrvqkytaEGKGDWRPVVFlITDGEpTDSDWEAALQRLKDGEAAKKANIFAIGVGPDADT 155
                         170       180
                  ....*....|....*....|....*...
gi 767945408  941 NF--EIFHKEMNLIATDPEhvyQFDDFF 966
Cdd:COG4245   156 EVlkQLTDPVRALDALDGL---DFREFF 180
Kunitz_PPTI-like cd22608
Pseudocerastes persicus trypsin inhibitor (PPTI), Kunitz-type serine protease inhibitor ...
1078-1122 4.65e-10

Pseudocerastes persicus trypsin inhibitor (PPTI), Kunitz-type serine protease inhibitor bitisilin, and similar proteins; This group contains Pseudocerastes persicus trypsin inhibitor (PPTI), Bitis gabonica Kunitz-type serine protease inhibitor bitisilin-1 (BG-11), -2 (BG-15) and -3 (two-Kunitz protease inhibitor), Oxyuranus scutellatus scutellatus taicatoxin, and serine protease inhibitor component (TSPI, also called venom protease inhibitor 1 or venom protease inhibitor 2), among others. PPTI from P. persicus venom shows inhibitory effect against trypsin proteolytic activity and has similarities to dendrotoxins (DTXs), with corresponding functionally important residues. Studies have shown the ability of PPTI to inhibit voltage-gated potassium channels, and consequently have dual functionality. Bitilisins 1, 2, and 3 are serine protease inhibitors expressed in snake venom glands; bitsilin-3 consists of two Kunitz protease inhibitor domains. Taicatoxin inhibits trypsin, tissue kallikrein, elastase, plasmin and factor Xa, and is also known to block the voltage-dependent L-type calcium channels from the heart, and the small conductance calcium-activated potassium channels (KCa) in chromaffin cells and in the brain. The structures of these Kunitz-type proteins are similar to other Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438651  Cd Length: 54  Bit Score: 56.15  E-value: 4.65e-10
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 767945408 1078 PGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22608    10 PGPCKAYIPRFYYNSASNKCQQFIYGGCKGNANNFETKDECRYTC 54
vWA_integrins_alpha_subunit cd01469
Integrins are a class of adhesion receptors that link the extracellular matrix to the ...
47-216 9.07e-10

Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.


Pssm-ID: 238746 [Multi-domain]  Cd Length: 177  Bit Score: 58.91  E-value: 9.07e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   47 IDIVFIVDSSESSKIALFDKQKDFVDSLSDKiFQLTPGrsleyDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKSMNL 126
Cdd:cd01469     1 MDIVFVLDGSGSIYPDDFQKVKNFLSTVMKK-LDIGPT-----KTQFGLVQYSESFRTEFTLNEYRTKEEPLSLVKHISQ 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  127 IGQGTFSYYAISNATRLLKRE---GRKDGVKVVLLMTDGIDH--PKNPDVQSISEDARIsgISFItIAL------STVVN 195
Cdd:cd01469    75 LLGLTNTATAIQYVVTELFSEsngARKDATKVLVVITDGESHddPLLKDVIPQAEREGI--IRYA-IGVgghfqrENSRE 151
                         170       180
                  ....*....|....*....|.
gi 767945408  196 EakLRLISGDSSSEPTLLLSD 216
Cdd:cd01469   152 E--LKTIASKPPEEHFFNVTD 170
Kunitz_BmTI-like cd22604
Kunitz-type serine protease inhibitor 6 (BmTI-6), A (BmTI-A), and similar proteins; This group ...
1068-1122 9.10e-10

Kunitz-type serine protease inhibitor 6 (BmTI-6), A (BmTI-A), and similar proteins; This group includes Kunitz-type serine protease inhibitors 6 (BmTI-6) and A (BmTI-A), both of which inhibit bovine trypsin, bovine chymotrypsin, human plasmin, human plasma kallikrein and human neutrophil elastase, but not bovine thrombin, human factor Xa or porcine pancreatic kallikrein. They may play a role in blocking blood coagulation during the larvae fixation on cattle. This subfamily also includes Rhipicephalus microplus protease inhibitor carrapatin. These proteins are similar to Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor) that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438647 [Multi-domain]  Cd Length: 56  Bit Score: 55.53  E-value: 9.10e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 767945408 1068 EDPRCLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22604     2 FEKQCSPTADSGPCFAYFPMWWYNVKTGQCEEFIYGGCQGNDNRYETEEECEKTC 56
Kunitz_HAI2_1-like cd22621
Kunitz-type domain 1 (KD1) of hepatocyte growth factor activator inhibitor type 2 (HAI-2), and ...
1071-1122 1.25e-09

Kunitz-type domain 1 (KD1) of hepatocyte growth factor activator inhibitor type 2 (HAI-2), and similar proteins; This model includes the Kunitz domain 1 (KD1) of hepatocyte growth factor activator inhibitor type 2 (HAI-2 or HAI2, also known as placental bikunin or Kunitz-type protease inhibitor 2). HAI-2 is composed of two Kunitz domains that strongly inhibit many serine proteases with sub-nanomolar affinities. HAI-2 Kunitz domain 1 (KD1) has been found to be the domain responsible for inhibition of hepatocyte growth factor (HGF) activator; activated HGF/scatter factor (HGF/SF) binds to its receptor tyrosine kinase MET to induce dimerization and initiate phosphorylation cascades leading to comprehensive cellular changes that, in the deregulated context of cancer, drive malignant transformation and progression. HAI-2 has been found to be a natural tumor suppressor in renal cell carcinoma, breast cancer and prostate cancer; its loss leads to tumor growth and progression in part due to increased MET signaling. HAI-2 is also a specific substrate for mesotrypsin, which is up-regulated with progression in prostate cancers and shown to contribute to invasion and metastasis; these activities of mesotrypsin may in part be mediated through cleavage and inactivation of HAI-2, resulting in increases in HGF/SF activation and MET signaling. HAI-2 is a physiological inhibitor of hepsin and matriptase, two type II transmembrane serine proteases that, like HGF activator, can convert latent pro-HGF/SF into the two-chain active signaling heterodimer. The structures of these KD1 domains are similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438664  Cd Length: 53  Bit Score: 54.79  E-value: 1.25e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 767945408 1071 RCLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22621     2 FCHLPKVVGRCRASFPRWWYNATSQSCQEFIFGGCKGNLNNFLSEQECLQKC 53
Kunitz_HAI1_1-like cd22623
Kunitz domain 1 of hepatocyte growth factor activator inhibitor-1 (HAI-1); This model includes ...
1071-1122 1.57e-09

Kunitz domain 1 of hepatocyte growth factor activator inhibitor-1 (HAI-1); This model includes Kunitz domain 1 (KD1) of hepatocyte growth factor activator inhibitor type 1 (HAI1 or HAI-1, also known as Kunitz-type protease inhibitor 1), a membrane-bound multidomain protein essential to the integrity of the basement membrane during placental development. HAI-1 contains an extracellular region and several internal domains that include two Kunitz domains separated in sequence but spatially closed to each other, and their interdomain interactions have evolved to stimulate the inhibitory activity of an integrated Kunitz. KD1, the major inhibitory domain of HAI-1, is involved in auto-inhibition of the extracellular region via steric blockage of its active site in the HAI-1 compact tertiary structure; presence of the target protease causes changes in the HAI-1 structure to an extended conformation. HAI-1 has been shown to inhibit several serine proteases such as matripase, hepsin, trypsin, hepatocyte growth factor activator (HGFA), and prostasin. It is also important in maintaining postnatal homeostasis in many tissues, including keratinization of the epidermis, hair development, colonic epithelium integrity, proliferation and cell fate of neural progenitor cells, and tissue injury and repair. The interaction between HAI-1 and matriptase is critical for tissue morphogenesis and cellular biology. HAI-1:matriptase ratio imbalance results in tumorigenesis; slight overexpression of matriptase relative to HAI-1 causes spontaneous squamous cell carcinoma, a phenotype that can be effectively reversed back to wild type by additional expression of HAI-1, indicating the need for a tight functional relationship between the two to maintain homeostasis. The structures of these domains are similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438666  Cd Length: 59  Bit Score: 54.86  E-value: 1.57e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 767945408 1071 RCLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22623     5 YCLAPKKVGPCRGSFPRWHYNAASGKCEEFVFGGCKGNKNNYLSEEECLSAC 56
Kunitz_bikunin_1-like cd22596
first Kunitz domain of bikunin and similar proteins; This subfamily includes the N-terminal ...
1072-1122 2.00e-09

first Kunitz domain of bikunin and similar proteins; This subfamily includes the N-terminal domain of bikunin (also known as inter-alpha-trypsin inhibitor light chain (ITI-LC) or urinary trypsin inhibitor), a plasma protease inhibitor, that is associated with inflammation and stabilizes the extracellular matrix. It is encoded together with alpha-1-microglobulin (A1M) by an alpha-1-microglobulin/bikunin precursor (AMBP) gene that is tightly controlled by several hepatocyte-enriched nuclear (HEN) factors, and cleaved by a furin-like protease that releases the two mature molecules. Bikunin is a Kunitz-type serine protease inhibitor, found in vertebrate serum and urine, modified by a chondroitin sulfate (CS) chain. The structures of these toxins are similar to that of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds. Bikunin contains two Kunitz domains; this model represents the first repeat.


Pssm-ID: 438639  Cd Length: 54  Bit Score: 54.18  E-value: 2.00e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22596     3 CKLPPDAGPCFGMIQRYFYNSSSMACQTFNYGGCLGNQNNFVTEKECLQTC 53
Kunitz_actitoxin-like cd22633
Kunitz-type actitoxins such as Anemonia viridis U-actitoxin-Avd3l, and similar proteins; This ...
1069-1122 3.07e-09

Kunitz-type actitoxins such as Anemonia viridis U-actitoxin-Avd3l, and similar proteins; This model includes the Kunitz-type actitoxins such as Anemonia viridis U-actitoxin-Avd3l (also called U-AITX-Avd3l or AsKC9), Anthopleura elegantissima KappaPI-actitoxin-Ael3a (also called KappaPI-AITX-Ael3a or Kunitz-type serine protease inhibitor APEKTx1) and Anthopleura aff. xanthogrammica PI-actitoxin-Axm2b (also called PI-AITX-Axm2b or Kunitz-type proteinase inhibitor AXPI-II). U-AITX-Avd3l and KappaPI-AITX-Ael3a are dual-function toxins that inhibit both the serine protease trypsin and voltage-gated potassium channels Kv1.2/KCNA2. These proteins are similar to Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor) that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438676  Cd Length: 55  Bit Score: 53.69  E-value: 3.07e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 767945408 1069 DPRCLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22633     2 NSICLLPKDVGGCRARFPRYYYNSSTRRCEKFRYGGCGGNANNFHTLEECEKVC 55
Kunitz_SmCI_1-like cd22601
first Kunitz domain of Carboxypeptidase Inhibitor SmCI and similar domains; This group ...
1078-1123 4.17e-09

first Kunitz domain of Carboxypeptidase Inhibitor SmCI and similar domains; This group includes Sabellastarte magnifica carboxypeptidase inhibitor (SmCI), a tri-domain BPTI-Kunitz inhibitor capable of inhibiting serine proteases and A-like metallocarboxypeptidases. While the BPTI-Kunitz family of proteins includes voltage gated channel blockers and inhibitors of serine proteases, SmCI is the only BPTI-Kunitz protein capable of inhibiting metallocarboxypeptidases. Binding studies show that SmCI is able to bind three trypsin molecules under saturating conditions, but only one elastase interacts with the inhibitor. Additionally, SmCI can bind serine proteases and carboxypeptidases at the same time (at least in the ratio 1:1:1), thus becoming the first protease inhibitor that simultaneously blocks these two mechanistic classes of enzymes. This model contains the first Kunitz domain of SmCI, which has a structure similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438644  Cd Length: 55  Bit Score: 53.66  E-value: 4.17e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 767945408 1078 PGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETCI 1123
Cdd:cd22601    10 RGPCTAYIPRWFYNKTTKKCEKFVYGGCQGNKNRFETKDDCLANCG 55
Kunitz_ixolaris_2 cd22626
Kunitz-type domain 2 (K2) of Ixolaris, and similar proteins; This model includes the second ...
1072-1122 5.52e-09

Kunitz-type domain 2 (K2) of Ixolaris, and similar proteins; This model includes the second Kunitz-type domain (K2) of ixolaris from the venomous organism Conus striatus. Ixolaris is a potent tick salivary anticoagulant that binds coagulation factor Xa (FXa) and zymogen FX, and forms a quaternary tissue factor (TF)/FVIIa/FX(a)/Ixolaris inhibitory complex. It blocks TF-induced coagulation and PAR2 (proteinase-activated receptor 2) signaling, and prevents thrombosis, tumor growth, and immune activation. Ixolaris consists of 2 Kunitz domains (K1 and K2), both of which recognize the heparin-binding (pro)exosite (HBE) on FX. This model contains K2, an extraordinarily dynamic domain that encompasses several residues involved in FX binding. Its backbone plasticity is critical for ixolaris biological activity. This domain contains 2 disulfide bonds instead of the 3 typical of Kunitz domain proteins.


Pssm-ID: 438669  Cd Length: 51  Bit Score: 52.85  E-value: 5.52e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22626     1 CSLELDYGVGKAYIPRWYFNTSNARCEMFIFGGIGGNKNNFETLEECKKTC 51
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
360-419 8.46e-09

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 52.50  E-value: 8.46e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   360 GQQGIKGERGQEGRPGAPGPIgvGEPGQPGPRGPEGVPGERGLPgeGFPGPKGEKGSEGP 419
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPP--GPPGPPGPPGEPGPPGPPGPP--GPPGPPGAPGAPGP 56
Kunitz_SCI-I-like cd22634
chymotrypsin inhibitor SCI-I_III-like; This model includes the Kunitz-type chymotrypsin ...
1068-1122 9.56e-09

chymotrypsin inhibitor SCI-I_III-like; This model includes the Kunitz-type chymotrypsin inhibitors SCI-III and SCI-I, and similar proteins in insects. SCI-III and SCI-I inhibit chymotrypsin, avoiding the accidental chymotrypsin-mediated activation of prophenoloxidase. This enzyme is required by the insect immune system to produce melanin which is used to engulf foreign objects. This subfamily also includes Kunitz-type male accessory gland peptide with protease inhibitory activity, synthesized and secreted by male accessory glands of Drosophila funebris; it may play a role as an acrosin inhibitor involved in reproduction. These proteins are similar to Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor) that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438677  Cd Length: 57  Bit Score: 52.51  E-value: 9.56e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 767945408 1068 EDPRCLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22634     3 GQPHSLGGGDGISCFAYIPSWSYNPDKNECEEFIYGGCGGNDNRFSTKAECEQKC 57
vWA_micronemal_protein cd01471
Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a ...
47-209 1.33e-08

Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.


Pssm-ID: 238748 [Multi-domain]  Cd Length: 186  Bit Score: 55.85  E-value: 1.33e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   47 IDIVFIVDSSESSKIA-LFDKQKDFVDSLSDKIfQLTPGrsleyDIKLAALQFSSSV--QID--PPFSSWKDLQTFK-QK 120
Cdd:cd01471     1 LDLYLLVDGSGSIGYSnWVTHVVPFLHTFVQNL-NISPD-----EINLYLVTFSTNAkeLIRlsSPNSTNKDLALNAiRA 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  121 VKSMNLIGQGTFSYYAISNATRLLK--REGRKDGVKVVLLMTDGIdhpKNPDVQSISEDARIS--GISFITIALSTVVNE 196
Cdd:cd01471    75 LLSLYYPNGSTNTTSALLVVEKHLFdtRGNRENAPQLVIIMTDGI---PDSKFRTLKEARKLRerGVIIAVLGVGQGVNH 151
                         170
                  ....*....|...
gi 767945408  197 AKLRLISGDSSSE 209
Cdd:cd01471   152 EENRSLVGCDPDD 164
Kunitz_TFPI2_2-like cd22617
Kunitz domain 2 (KD2) of tissue factor pathway inhibitor 2 (TFPI2) and similar proteins; This ...
1071-1122 1.52e-08

Kunitz domain 2 (KD2) of tissue factor pathway inhibitor 2 (TFPI2) and similar proteins; This model represents the Kunitz-type domain 2 (KD2) of tissue factor pathway inhibitor 2 (TFPI2 or TFPI-2) and similar proteins. TFPI2 exhibits inhibitory activity primarily toward trypsin, plasmin, and factor VIIa (FVIIa)/tissue factor (TF) via its KD1. It is believed to be the major inhibitor of plasmin in the extracellular matrix (ECM) but has little inhibitory activity toward urokinase-type plasminogen activator, tissue-type plasminogen activator, or thrombin. While TFPI2 specifically inhibits the proteases via the P1 arginine residue in KD1, domains KD2 and KD3 appear to have no discernible inhibitory activity and may serve to bind to nearby proteins to localize TFPI2 in the ECM. This domain is similar to that of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor) that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438660  Cd Length: 54  Bit Score: 52.00  E-value: 1.52e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 767945408 1071 RCLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22617     3 VCREVPDEGPCRALITRYFYNMTSMRCEEFTYGGCYGNGNNFRDKSSCISAC 54
Kunitz_TFPI1_TFPI2_3-like cd22615
Kunitz protease inhibitor (KPI) domain 3 (KPI-3 or K3) of tissue factor pathway inhibitor ...
1072-1122 1.54e-08

Kunitz protease inhibitor (KPI) domain 3 (KPI-3 or K3) of tissue factor pathway inhibitor (TFPI) and TFPI2, and similar proteins; This model represents the third Kunitz-type domain (K3 or KPI-3) of tissue factor pathway inhibitor (TFPI or TFPI1), also known as extrinsic pathway inhibitor (EPI) or lipoprotein-associated coagulation inhibitor (LACI), and of TFPI2 (or TFPI-2). TFPI1 down-regulates the extrinsic coagulation pathway via inhibition of activated factor X (FXa or Xa) and FVIIa (VIIa). It inhibits activated FXa via a "slow-tight binding mechanism", i.e. rapid formation of a loose FXa-TFPI1 complex that then slowly isomerizes to a tight FXa-TFPI1* complex. Subsequent inhibition of FVIIa is facilitated by the presence of tissue factor (TF) and FXa, which together rapidly and efficiently form a quaternary FXa-TFPI1-TF-FVIIa complex in which the activity of FXa and FVIIa are inhibited. TFPI1 consists of 3 Kunitz-type protease inhibitor (KPI) domains in a tandem arrangement; while the K1 domain of TFPI has been shown to bind and inhibit FVIIa and the K2 domain similarly inhibits FXa, the K3 domain has no known inhibitory function. However, Protein S, which functions as a cofactor for TFPI to efficiently enhance TFPI inhibition of FXa and FXa activated TF-VIIa, is dependent on direct interactions with two important residues within K3, a Glutamate and an Arginine. This model also includes TFPI2 Kunitz domain 3 (KD3). TFPI2 exhibits inhibitory activity primarily toward trypsin, plasmin, and factor VIIa (FVIIa)/tissue factor (TF) via its KD1. It is believed to be the major inhibitor of plasmin in the extracellular matrix (ECM) but has little inhibitory activity toward urokinase-type plasminogen activator, tissue-type plasminogen activator, or thrombin. While TFPI2 specifically inhibits the proteases via the P1 arginine residue in KD1, domains KD2 and KD3 appear to have no discernible inhibitory activity and may serve to bind to nearby proteins to localize TFPI2 in the ECM. The structure of this domain is similar to that of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438658  Cd Length: 54  Bit Score: 51.91  E-value: 1.54e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22615     4 CLSPKDEGLCSASVTRYYYNSATKTCEPFNYTGCGGNNNNFTSKKDCLRVC 54
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
369-422 2.39e-08

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 51.34  E-value: 2.39e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 767945408   369 GQEGRPGAPGPIG-VGEPGQPGPRGPEGVPGERGLPGE-GFPGPKGEKGSEGPTGP 422
Cdd:pfam01391    1 GPPGPPGPPGPPGpPGPPGPPGPPGPPGPPGEPGPPGPpGPPGPPGPPGAPGAPGP 56
Kunitz_ELP-like cd22632
early lactation protein (ELP), colostrum trypsin inhibitor (CTI), and similar proteins; This ...
1079-1122 2.61e-08

early lactation protein (ELP), colostrum trypsin inhibitor (CTI), and similar proteins; This model includes the Kunitz-type proteins, colostrum trypsin inhibitor (CTI, also called colostrum BPI) and early lactation protein (ELP). In marsupials, the ELP gene is expressed in the mammary gland and the protein is secreted into milk during early lactation. Mature ELP shares approximately 55.4% similarity with the colostrum-specific bovine CTI protein. Marsupial ELP and eutherian CTI both have a single Kunitz domain and are secreted only during the early lactation phases, suggesting that this protein may have an important role in the immunologically immature young of these species. These proteins are similar to Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor) that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438675  Cd Length: 55  Bit Score: 51.28  E-value: 2.61e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 767945408 1079 GNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22632    11 GPCRSNILRYFYNSTSRECEPFIYGGCNGNANNFETVEMCLRTC 54
Kunitz_WFIKKN_2-like cd22606
second Kunitz domain of WAP, Kazal, immunoglobulin, Kunitz and NTR domain-containing proteins; ...
1079-1122 3.11e-08

second Kunitz domain of WAP, Kazal, immunoglobulin, Kunitz and NTR domain-containing proteins; This subfamily includes WAP, Kazal, immunoglobulin, Kunitz and NTR domain-containing protein 1 (WFIKKN1, WFKN1), WFIKKN2 (WFKN2), and similar proteins. WFIKKN proteins are protease inhibitors that contain two distinct Kunitz-type protease inhibitor domains. They may have serine protease- and metalloprotease-inhibitor activity. This model represents the second Kunitz (KU2) domain, which has been shown to inhibit trypsin, but not chymotrypsin, elastase, plasmin, pancreatic kallikrein, lung tryptase, plasma kallikrein, thrombin, urokinase or tissue plasminogen activator. However, the inhibition constant of this domain for bovine trypsin is about five orders of magnitudes lower than that of bovine pancreatic trypsin inhibitor (BPTI) for trypsin. This could be due to unfavorable side-chain conformation of a tryptophan at P2' site which is incompatible with a trypsin complex; typical trypsin inhibitors of the Kunitz family feature a tyrosine residue or other less bulky residues at this site. The structure of KU2 is similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438649  Cd Length: 53  Bit Score: 50.82  E-value: 3.11e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 767945408 1079 GNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22606     9 GPCKAWEPRWAYNSLLKQCQSFVYGGCEGNENNFESKEACEDAC 52
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
554-612 3.36e-08

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 50.96  E-value: 3.36e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 767945408   554 GKKGSKGNQGQRGLPGPEGPKGEPGIMGPFGMPGTsiPGPPGPKGDRGGPGIPGFKGEP 612
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGP--PGPPGPPGPPGPPGAPGAPGPP 57
Kunitz_textilinin-like cd22594
venom Kunitz-type proteins such as textilinin, BF9 and PILP; This group includes toxins ...
1078-1122 4.45e-08

venom Kunitz-type proteins such as textilinin, BF9 and PILP; This group includes toxins isolated from snake venoms, such as textilinin, vestiginin, spermatin, mulgin, venom basic protease inhibitor IX (BF9), and protease inhibitor-like protein (PILP), among others. Pseudonaja textilis textilinin-1 is a Kunitz-type serine protease inhibitor that binds to and blocks the activity of a range of serine proteases, including plasmin and trypsin. Ability of testilinin to inhibit plasmin, a protease involved in fibrinolysis, raises the possibility that it may be used as an alternative to aprotinin (Trasylol), which is a systemic antibleeding agent in surgery. Also included is the Bungarus fasciatus fraction IX (BF9), a chymotrypsin inhibitor that binds chymotrypsin but not trypsin. Protease inhibitor-like proteins PILP-1 and PILP-2 show weak binding and inhibition of matrix metalloproteinase-2 (MMP-2) and show an activity in inhibiting migration and invasion of neuroblastoma; they do not inhibit chymotrypsin or trypsin. The structures of these toxins are similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438637  Cd Length: 56  Bit Score: 50.39  E-value: 4.45e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 767945408 1078 PGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22594    11 PGPCNAYKPAFYYNPASHKCLEFIYGGCGGNANNFKTIDECHRTC 55
Kunitz_TFPI1_1-like cd22613
Kunitz protease inhibitor (KPI) domain 1 (KPI-1 or K1) of tissue factor pathway inhibitor ...
1079-1123 4.85e-08

Kunitz protease inhibitor (KPI) domain 1 (KPI-1 or K1) of tissue factor pathway inhibitor (TFPI); This model represents the first Kunitz-type domain (K1 or KPI-1) of tissue factor pathway inhibitor (TFPI or TFPI1), also known as extrinsic pathway inhibitor (EPI) or lipoprotein-associated coagulation inhibitor (LACI). TFPI down-regulates the extrinsic coagulation pathway via inhibition of activated factor X (FXa or Xa) and FVIIa (VIIa). It inhibits activated FXa via a "slow-tight binding mechanism", i.e. rapid formation of a loose FXa-TFPI complex that then slowly isomerizes to a tight FXa-TFPI* complex. Subsequent inhibition of FVIIa is facilitated by the presence of tissue factor (TF) and FXa, which together rapidly and efficiently form a quaternary FXa-TFPI-TF-FVIIa complex in which the activity of FXa and FVIIa are inhibited. TFPI consists of 3 Kunitz-type protease inhibitor (KPI) domains in a tandem arrangement; The K1 domain of TFPI has been shown to bind and inhibit FVIIa while the K2 domain similarly inhibits FXa. Small peptide blocking inhibition of FXa and TF-FVIIa by TFPI shows that domain K1 is not only important for FVIIa inhibition but also for FXa inhibition, i.e. for the transition of the loose to the tight FXa-TFPI complex. The structure of the K1 domain is similar to those of other Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438656  Cd Length: 55  Bit Score: 50.43  E-value: 4.85e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 767945408 1079 GNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETCI 1123
Cdd:cd22613    11 GPCKAIMKRFFFNIFTRQCEEFIYGGCEGNENRFETLEECKKTCI 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
481-536 5.13e-08

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 50.57  E-value: 5.13e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 767945408   481 GEVGQMGPTGPRGPVGI-GVQGPKGEPGSIGLPGQPGVPGEDGAAGKKGEAGLPGAR 536
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPpGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
257-510 5.89e-08

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 56.58  E-value: 5.89e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  257 GNPGIKGERGPKGNPG-NAQKGEAGERGPGGIPGYKGDKGERGECGKPGIKGDKGSPGPYGPKGPRGIQGITGPPGDPGP 335
Cdd:COG5164    19 TPAGSQGSTKPAQNQGsTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQGGTRPAGNTGG 98
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  336 KGFQGNKGEPGPPGPYGSPGAPGIGQQGIKGERGQEGRPGAPG--PIGVGEPGQPGPRGPEGVPGERGLPGEGfpGPKGE 413
Cdd:COG5164    99 TTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGstPPGPGSTGPGGSTTPPGDGGSTTPPGPG--GSTTP 176
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  414 KGSEGPTGPQGLQGLSIKGEKGDIGPVGPQGPMGIPGIGSQGEQGIQGPIGPPGPQGPAGQGLPGSKGEVGQMGPTGPRG 493
Cdd:COG5164   177 PDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTGPKDQRPKTNPIERRGPERPEAAALPAE 256
                         250
                  ....*....|....*..
gi 767945408  494 PVGIGVQGPKGEPGSIG 510
Cdd:COG5164   257 LTALEAENRAANPEPAT 273
Kunitz_HAI2_2-like cd22622
Kunitz-type domain 2 (KD2) of hepatocyte growth factor activator inhibitor type 2 (HAI-2), and ...
1079-1122 6.05e-08

Kunitz-type domain 2 (KD2) of hepatocyte growth factor activator inhibitor type 2 (HAI-2), and similar proteins; This model includes Kunitz domain 2 (KD2) of hepatocyte growth factor activator inhibitor type 2 (HAI-2 or HAI2, also known as placental bikunin or Kunitz-type protease inhibitor 2). HAI-2 is composed of two Kunitz domains that strongly inhibit many serine proteases with sub-nanomolar affinities. It has been found to be a natural tumor suppressor in renal cell carcinoma, breast cancer and prostate cancer, the loss of which leads to tumor growth and progression attributable at least in part to increased MET signaling. HAI-2 is a specific substrate of mesotrypsin which is up-regulated with progression in prostate cancers and shown to contribute to invasion and metastasis; these activities of mesotrypsin may in part be mediated through cleavage and inactivation of HAI-2, resulting in increases in hetatocyte growth factor/scatter factor (HGF/SF) activation and MET signaling. HAI-2 is a physiological inhibitor of hepsin and matriptase, two type II transmembrane serine proteases that, like HGF activator, can convert latent pro-HGF/SF into the two-chain active signaling heterodimer. KD2 is similar to KD1, whose structure is similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438665  Cd Length: 53  Bit Score: 50.05  E-value: 6.05e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 767945408 1079 GNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22622    10 GPCRAAFPRWYYDPESQSCKEFIYGGCRGNKNNYLSEEECMDRC 53
vWA_subgroup cd01465
VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood ...
797-934 6.54e-08

VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains. Not much is known about the function of the VWA domain in these proteins. The members do have a conserved MIDAS motif. The biochemical function however is not known.


Pssm-ID: 238742 [Multi-domain]  Cd Length: 170  Bit Score: 53.43  E-value: 6.54e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  797 LELVFVIDSSESVGPENFQIIKNFVKTMADRVAldlATARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGeGTY 876
Cdd:cd01465     1 LNLVFVIDRSGSMDGPKLPLVKSALKLLVDQLR---PDDRLAIVTYDGAAETVLPATPVRDKAAILAAIDRLTAGG-STA 76
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767945408  877 TATALQAANDMFEDAR-PGVKKVALVITDGQ--TDSRDKEKLTEVVKNASDTNVEIFVIGV 934
Cdd:cd01465    77 GGAGIQLGYQEAQKHFvPGGVNRILLATDGDfnVGETDPDELARLVAQKRESGITLSTLGF 137
ViaA COG2425
Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain ...
799-942 6.94e-08

Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain [Function unknown];


Pssm-ID: 441973 [Multi-domain]  Cd Length: 263  Bit Score: 55.07  E-value: 6.94e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  799 LVFVIDSSESVGPENFQIiknfvktmADRVALDLATA-----RIGIINYSHKVEKVAnlkQFSSKDDFKLAVDNMQYL-- 871
Cdd:COG2425   121 VVLCVDTSGSMAGSKEAA--------AKAAALALLRAlrpnrRFGVILFDTEVVEDL---PLTADDGLEDAIEFLSGLfa 189
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767945408  872 GEGTYTATALQAANDMFEDARPGvKKVALVITDGQTDSRDKEKLTEVvkNASDTNVEIFVIGVVKKNDPNF 942
Cdd:COG2425   190 GGGTDIAPALRAALELLEEPDYR-NADIVLITDGEAGVSPEELLREV--RAKESGVRLFTVAIGDAGNPGL 257
Kunitz_B2B cd22619
Kunitz-type serine protease inhibitor subunit of beta 2-bungarotoxin, and similar proteins; ...
1070-1122 7.50e-08

Kunitz-type serine protease inhibitor subunit of beta 2-bungarotoxin, and similar proteins; This model includes the Kunitz inhibitor subunit of beta 2-bungarotoxin, a presynaptic neurotoxin of the Bungarus multicinctus venom. Beta-bungarotoxin is a heterodimeric neurotoxin consisting of a phospholipase subunit linked by a disulfide bond to the Kunitz protease inhibitor subunit; the latter subunit is homologous to venom basic protease inhibitors but has no protease inhibitor activity and is non-toxic. The beta-bungarotoxin Kunitz subunit serves to guide the toxin to its site of action on the presynaptic membrane by virtue of a high-affinity interaction with a specific subclass of voltage-sensitive potassium channels. This subfamily also includes Kunitz-type serine protease inhibitor homolog beta-bungarotoxin B1 chain and protease inhibitor-like protein 1 (PILP-1). The B1 chain also has no protease inhibitor activity but blocks voltage-gated potassium channels, while PILP-1 inhibits trypsin. The structures of these domains are similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438662  Cd Length: 58  Bit Score: 49.86  E-value: 7.50e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 767945408 1070 PRCLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22619     5 PDCDKPPDTKRCKRVVRAFYYNPSAKTCLQFVYGGCNGNGNHFKSKALCRCHC 57
ViaA COG2425
Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain ...
48-203 8.72e-08

Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain [Function unknown];


Pssm-ID: 441973 [Multi-domain]  Cd Length: 263  Bit Score: 54.69  E-value: 8.72e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   48 DIVFIVDSSES---SKIAlfdkqkdFVDSLSDKIFQ-LTPGRsleydiKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123
Cdd:COG2425   120 PVVLCVDTSGSmagSKEA-------AAKAAALALLRaLRPNR------RFGVILFDTEVVEDLPLTADDGLEDAIEFLSG 186
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  124 MNLIGqGTFSYYAISNATRLLKREGRKDgvKVVLLMTDGIDHpkNPDVQSISE-DARISGISFITIALSTVVNEAKLRLI 202
Cdd:COG2425   187 LFAGG-GTDIAPALRAALELLEEPDYRN--ADIVLITDGEAG--VSPEELLREvRAKESGVRLFTVAIGDAGNPGLLEAL 261

                  .
gi 767945408  203 S 203
Cdd:COG2425   262 A 262
Kunitz_ABPP-like cd22607
Kunitz domain found in the amyloid-beta precursor protein (ABPP) subfamily; This subfamily ...
1072-1122 1.04e-07

Kunitz domain found in the amyloid-beta precursor protein (ABPP) subfamily; This subfamily includes the amyloid-beta precursor protein (ABPP, also called APP, APPI, Alzheimer disease amyloid protein, amyloid-beta A4 protein, cerebral vascular amyloid peptide (CVAP), protease nexin II (PN2)), as well as amyloid-like protein 2 (APLP2, also called amyloid protein homolog or APPH), among others. ABPP/APPI is an inhibitor of serine proteases such as anionic and cationic trypsins. For example, APPI-4M is a variant that specifically inhibits Kallikrein (KLK)-related peptidase 6 (KLK6), which is highly upregulated in several types of cancer where its increased activity promotes cancer invasion and metastasis. Amyloid-like protein 2 (APLP2) inhibits trypsin, chymotrypsin, plasmin, factor XIA, and plasma and glandular kallikrein, and may play a role in the regulation of hemostasis. Proteins in this subfamily contain a single Kunitz domain, with a structure similar to those of other Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438650  Cd Length: 52  Bit Score: 49.35  E-value: 1.04e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22607     2 CSEQAETGPCRAMMPRWYFDVTEGKCAPFIYGGCGGNRNNFESEEYCMAVC 52
vWA_F09G8-8_type cd01477
VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood ...
788-951 1.39e-07

VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains. The members of this subgroup lack the MIDAS motif. This subgroup is found only in C. elegans and the members identified thus far are always found fused to a C-Lectin type domain. Biochemical function thus far has not be attributed to any of the members of this subgroup.


Pssm-ID: 238754 [Multi-domain]  Cd Length: 193  Bit Score: 53.19  E-value: 1.39e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  788 CGPKCKETPLELVFVIDSSESVGPENFQIIKNFVKTM---ADRVALDLA---TARIGIINYSHKVEKVANLKQFSSKDDF 861
Cdd:cd01477    11 CGSDIKNLWLDIVFVVDNSKGMTQGGLWQVRATISSLfgsSSQIGTDYDdprSTRVGLVTYNSNATVVADLNDLQSFDDL 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  862 KLAVD---NMQYLGEGTYTATALQAANDMF----EDARPGVKKVALVIT-----DGQTDSRDkeklteVVKNASDTNVEI 929
Cdd:cd01477    91 YSQIQgslTDVSSTNASYLDTGLQAAEQMLaagkRTSRENYKKVVIVFAsdyndEGSNDPRP------IAARLKSTGIAI 164
                         170       180
                  ....*....|....*....|....*..
gi 767945408  930 FVIGVVKKNDPNF-----EIFHKEMNL 951
Cdd:cd01477   165 ITVAFTQDESSNLldklgKIASPGMNF 191
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
584-636 1.48e-07

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 49.03  E-value: 1.48e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 767945408   584 GMPGTsiPGPPGPKGDRGGPGIPGFKGEPGLsiRGPKGVQGPRGPVGAPGLKG 636
Cdd:pfam01391    1 GPPGP--PGPPGPPGPPGPPGPPGPPGPPGP--PGEPGPPGPPGPPGPPGPPG 49
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
383-442 2.39e-07

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 48.64  E-value: 2.39e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   383 GEPGQPGPRGPEGVPGERGLPGEgfPGPKGEKGSEGPTGPQGLqglsiKGEKGDIGPVGP 442
Cdd:pfam01391    4 GPPGPPGPPGPPGPPGPPGPPGP--PGPPGEPGPPGPPGPPGP-----PGPPGAPGAPGP 56
Kunitz_TKDP-like cd22609
trophoblast Kunitz domain protein (TKDP) and similar proteins; This model contains the ...
1072-1122 4.01e-07

trophoblast Kunitz domain protein (TKDP) and similar proteins; This model contains the trophoblast Kunitz domain protein 1 (TKDP-1) and splice variant TKDP-4, among others, which are Kunitz inhibitor domain proteins. TKDP-1 is expressed in the trophectoderm which forms the outer epithelial layer of the trophoblast, and may play a role in mediating maternal-conceptus interactions in the immediate preimplantation period. However, it does not appear to have proteinase inhibitory activity. These domains are similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor) that shows an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438652  Cd Length: 52  Bit Score: 47.83  E-value: 4.01e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22609     2 CLEPKVVGVCKASMTRYFYNAQTGHCEQFVYGGCGGNRNNFLTLEDCMKTC 52
Kunitz_boophilin_1-like cd22599
first Kunitz domain of Rhipicephalus microplus boophilin and similar proteins; This group ...
1077-1123 4.25e-07

first Kunitz domain of Rhipicephalus microplus boophilin and similar proteins; This group includes venom serine protease inhibitors such as Rhipicephalus microplus and Ixodes scapularis boofilin, among others. Boophilin prevents blood clot formation to allow successful feeding and digestion through its inhibition activity of thrombin and other host anticoagulating factors like kallikrein, coagulation factor VII, or plasmin; it interacts with the host thrombin and trypsin. The structures of these domains are similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds. Rhipicephalus microplus boophilin contains two Kunitz domains; this model represents the first repeat.


Pssm-ID: 438642  Cd Length: 61  Bit Score: 47.85  E-value: 4.25e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 767945408 1077 KPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETCI 1123
Cdd:cd22599    11 DEGICRALIPRFYFNTETGQCTEFIYGGCGGNENNFETIEECEKACG 57
Kunitz_huwentoxin cd22598
Kunitz-type toxin huwentoxin-XI; This model contains Kunitz-type serine protease inhibitor ...
1079-1122 4.92e-07

Kunitz-type toxin huwentoxin-XI; This model contains Kunitz-type serine protease inhibitor huwentoxin-XI, including U15-theraphotoxin-Hs1g (also called U15-TRTX-Hs1g or Huwentoxin HW11c39), and kappaPI-theraphotoxin-Hs1a (also called KappaPI-TRTX-Hs1a or Huwentoxin-HW11g8). Huwentoxin-XI is a bifunctional toxin that inhibits both serine proteases (trypsin) and voltage-gated potassium channels (Kv) via surfaces displayed on opposite faces of the toxin. The structures of these domains are similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438641  Cd Length: 53  Bit Score: 47.68  E-value: 4.92e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 767945408 1079 GNCGEYVVRWYYDKQvnSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22598    10 GRCKASFERWYFNGR--TCAKFIYGGCGGNDNKFPTQEACMKRC 51
vWA_ATR cd01474
ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, ...
798-966 7.60e-07

ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells.


Pssm-ID: 238751 [Multi-domain]  Cd Length: 185  Bit Score: 50.59  E-value: 7.60e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  798 ELVFVIDSSESVGPENFQIIkNFVKTMADRvaLDLATARIGIINYSHKVEKVANLKQFSSKDDFKLAV-DNMQYLGEgTY 876
Cdd:cd01474     6 DLYFVLDKSGSVAANWIEIY-DFVEQLVDR--FNSPGLRFSFITFSTRATKILPLTDDSSAIIKGLEVlKKVTPSGQ-TY 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  877 TATALQAAND-MFEDARPGVKKVALVI--TDGQTDSRDKEKLTEVVKNASDTNVEIFVIGVvkkndpnFEIFHKEMNLIA 953
Cdd:cd01474    82 IHEGLENANEqIFNRNGGGRETVSVIIalTDGQLLLNGHKYPEHEAKLSRKLGAIVYCVGV-------TDFLKSQLINIA 154
                         170
                  ....*....|...
gi 767945408  954 TDPEHVYQFDDFF 966
Cdd:cd01474   155 DSKEYVFPVTSGF 167
Kunitz_SHPI cd22618
Stichodactyla helianthus Kunitz inhibitor protein ShPI-1, Heteractis crispa protease inhibitor ...
1072-1122 8.90e-07

Stichodactyla helianthus Kunitz inhibitor protein ShPI-1, Heteractis crispa protease inhibitor stichotoxin-Hcr2e, and similar proteins; This model includes Kunitz inhibitor protein ShPI-1, the major protease inhibitor from the sea anemone Stichodactyla helianthus, as well as protease inhibitor stichotoxin-Hcr2e (also called PI- stichotoxin-Hcr2e, PI-SHTX-Hcr2e, or Kunitz-type serine protease inhibitor InhVJ) and HCRG1 from Heteractis crispa. ShPI-1 has an unusually broad specificity toward several serine proteases, including trypsin, chymotrypsin, human neutrophil elastase, kallikrein and plasmin, and can also bind aspartic and cysteine proteases, such as pepsin and papain, respectively. PI-SHTX-Hcr2e and HCRG1 inhibit trypsin and chymotrypsin, but do not inhibit the serine proteases plasmin, thrombin, kallikrein, the cysteine proteinase papain, and the aspartic protease pepsin. The structures of these domains are similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438661  Cd Length: 53  Bit Score: 46.76  E-value: 8.90e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22618     2 CSEPKVVGPCKAYFPRFYFDSETGKCTPFIYGGCGGNGNNFETLHACRAIC 52
Kunitz_SmCI_2-like cd22602
second Kunitz domain of Carboxypeptidase Inhibitor SmCI and similar domains; This group ...
1077-1122 1.22e-06

second Kunitz domain of Carboxypeptidase Inhibitor SmCI and similar domains; This group includes Sabellastarte magnifica carboxypeptidase inhibitor (SmCI), a tri-domain BPTI-Kunitz inhibitor capable of inhibiting serine proteases and A-like metallocarboxypeptidases. While the BPTI-Kunitz family of proteins includes voltage gated channel blockers and inhibitors of serine proteases, SmCI is the only BPTI-Kunitz protein capable of inhibiting metallocarboxypeptidases. Binding studies show that SmCI is able to bind three trypsin molecules under saturating conditions, but only one elastase interacts with the inhibitor. Additionally, SmCI can bind serine proteases and carboxypeptidases at the same time (at least in the ratio 1:1:1), thus becoming the first protease inhibitor that simultaneously blocks these two mechanistic classes of enzymes. This model contains the second Kunitz domain of SmCI, which has a structure similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438645  Cd Length: 51  Bit Score: 46.38  E-value: 1.22e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 767945408 1077 KPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22602     6 KVGPCRVSARRWFHNPETEKCEVFIYGGCHGNANRFATETECQEVC 51
Kunitz_BPTI cd22592
bovine pancreatic trypsin inhibitor; This model contains bovine pancreatic trypsin inhibitor ...
1072-1122 1.86e-06

bovine pancreatic trypsin inhibitor; This model contains bovine pancreatic trypsin inhibitor (BPTI, also known as pancreatic Kunitz inhibitor, aprotinin, or trypsin-kallikrein inhibitor), a small protein that inhibits the action of the trypsin, and is thus a member of the serine protease family of inhibitors. This class of enzymes contains conserved cysteine residues that form 3 disulfide bonds to stabilize the three-dimensional structure. BPTI has a relatively broad specificity, inhibiting trypsin as well as chymotrypsin, and elastase-like serine (pro)enzymes capable of very different primary specificity. It reacts rapidly with serine proteases to form stable complexes, but the enzyme:inhibitor complex formation may involve several intermediates corresponding to discrete reaction steps. Furthermore, BPTI inhibits the nitric oxide synthase type-I and -II action, and impairs K+ transport by Ca2+-activated K+ channels. Clinically, BPTI is used in certain surgical interventions, such as cardiopulmonary surgery and orthotopic liver transplantation since it significantly reduces hemorrhagic complications.


Pssm-ID: 438635  Cd Length: 52  Bit Score: 45.71  E-value: 1.86e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 767945408 1072 CLEALKPGNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22592     2 CLEPPYTGPCKARIIRYFYNAKSGLCETFVYGGCRAKRNNFLSAEDCMRTC 52
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
263-321 3.03e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 45.56  E-value: 3.03e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 767945408   263 GERGPKGNPGnaQKGEAGERGPGGIPGYKGDKGERGECGKPGIKGDKGSPGPYGPKGPR 321
Cdd:pfam01391    1 GPPGPPGPPG--PPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
vWA_interalpha_trypsin_inhibitor cd01461
vWA_interalpha trypsin inhibitor (ITI): ITI is a glycoprotein composed of three polypeptides- ...
795-942 3.88e-06

vWA_interalpha trypsin inhibitor (ITI): ITI is a glycoprotein composed of three polypeptides- two heavy chains and one light chain (bikunin). Bikunin confers the protease-inhibitor function while the heavy chains are involved in rendering stability to the extracellular matrix by binding to hyaluronic acid. The heavy chains carry the VWA domain with a conserved MIDAS motif. Although the exact role of the VWA domains remains unknown, it has been speculated to be involved in mediating protein-protein interactions with the components of the extracellular matrix.


Pssm-ID: 238738 [Multi-domain]  Cd Length: 171  Bit Score: 48.37  E-value: 3.88e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  795 TPLELVFVIDSSESVGPENFQIIKNfvktmADRVAL-DL-ATARIGIINYSHKVEKVANLKQFSSKDDFKLAVD---NMQ 869
Cdd:cd01461     1 LPKEVVFVIDTSGSMSGTKIEQTKE-----ALLTALkDLpPGDYFNIIGFSDTVEEFSPSSVSATAENVAAAIEyvnRLQ 75
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767945408  870 YLGeGTYTATALQAANDMFEDARPGVKKVALViTDGQTDsrDKEKLTEVVKNASDTNVEIFVIGVvkKNDPNF 942
Cdd:cd01461    76 ALG-GTNMNDALEAALELLNSSPGSVPQIILL-TDGEVT--NESQILKNVREALSGRIRLFTFGI--GSDVNT 142
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
277-325 4.44e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 44.79  E-value: 4.44e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 767945408   277 GEAGERGPGGIPGYKGDKGERGECGKPGIKGDKGSPGPYGPKGPRGIQG 325
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPG 49
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
48-208 4.49e-06

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 48.05  E-value: 4.49e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   48 DIVFIVDSSESSKIALFDKQKDFVDSLSdKIFQLTPGRsleydIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKSMNLI 127
Cdd:cd01482     2 DIVFLVDGSWSIGRSNFNLVRSFLSSVV-EAFEIGPDG-----VQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYK 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  128 GQGTFSYYAISNA--TRLLKREG-RKDGVKVVLLMTDGidhPKNPDVQSISEDARISGISFITIALSTVVnEAKLRLISG 204
Cdd:cd01482    76 GGNTRTGKALTHVreKNFTPDAGaRPGVPKVVILITDG---KSQDDVELPARVLRNLGVNVFAVGVKDAD-ESELKMIAS 151

                  ....
gi 767945408  205 DSSS 208
Cdd:cd01482   152 KPSE 155
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
478-638 1.13e-05

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 49.26  E-value: 1.13e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  478 GSKGEVGQMGPTGPRGPVG-------IGVQGPKGEPGSIGLPGQPGVPGEDGAAGKKGEAGLPGArgPEGPPGKGQPGPK 550
Cdd:COG5164    16 GVTTPAGSQGSTKPAQNQGstrpagnTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGG--TTPAQNQGGTRPA 93
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  551 GDEGKKGSKGNQGQRGLPGPEGPKGEPGIMGPFGMPGTSIPGPPGPKGDR-GGPGIPGFKGepGLSIRGPKGVQGPRGPV 629
Cdd:COG5164    94 GNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTpPGPGSTGPGG--STTPPGDGGSTTPPGPG 171

                  ....*....
gi 767945408  630 GAPGLKGDG 638
Cdd:COG5164   172 GSTTPPDDG 180
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
484-538 1.44e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.64  E-value: 1.44e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 767945408   484 GQMGPTGPRGPvgigvQGPKGEPGSIGLPGQPGVPGEDGAAGKKGEAGLPGARGP 538
Cdd:pfam01391    1 GPPGPPGPPGP-----PGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGA 50
VWA_2 pfam13519
von Willebrand factor type A domain;
49-158 1.82e-05

von Willebrand factor type A domain;


Pssm-ID: 463909 [Multi-domain]  Cd Length: 103  Bit Score: 44.59  E-value: 1.82e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408    49 IVFIVDSSES-----SKIALFDKQKDFVDSLsdkiFQLTPGRsleydiKLAALQFSSSVQIDPPFSswKDLQTFKQKVKS 123
Cdd:pfam13519    1 LVFVLDTSGSmrngdYGPTRLEAAKDAVLAL----LKSLPGD------RVGLVTFGDGPEVLIPLT--KDRAKILRALRR 68
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 767945408   124 MNLIGQGTFSYYAISNATRLLKREGRKDGVKVVLL 158
Cdd:pfam13519   69 LEPKGGGTNLAAALQLARAALKHRRKNQPRRIVLI 103
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
525-600 2.65e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.87  E-value: 2.65e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767945408   525 GKKGEAGLPGARgpegppgkgqpgpkgdeGKKGSKGNQGQRGLPGPEGPKGEPGIMGPFGMPGTsiPGPPGPKGDR 600
Cdd:pfam01391    1 GPPGPPGPPGPP-----------------GPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGP--PGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
498-577 3.29e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.48  E-value: 3.29e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   498 GVQGPKGEPGSIGLPGQPGVPGEDGAAGKKGEAGLPGARGPegppgkgqpgpkgdegkkgskgnQGQRGLPGPEGPKGEP 577
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGP-----------------------PGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
716-772 4.08e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.10  E-value: 4.08e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 767945408   716 GPQGFPGPKGtmghgLPGQKGEHGERGDVGKKGDKGEIGEPGSPGKQGLQGPKGDLG 772
Cdd:pfam01391    1 GPPGPPGPPG-----PPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPG 52
vWA_CTRP cd01473
CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an ...
799-935 4.62e-05

CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60 amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions.


Pssm-ID: 238750 [Multi-domain]  Cd Length: 192  Bit Score: 45.39  E-value: 4.62e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  799 LVFVIDSSESVGPENF---------QIIKNFVktmadrvaLDLATARIGIINYSHKVEKVANLKQFSSKDDFKL-----A 864
Cdd:cd01473     3 LTLILDESASIGYSNWrkdvipfteKIINNLN--------ISKDKVHVGILLFAEKNRDVVPFSDEERYDKNELlkkinD 74
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 767945408  865 VDNMQYLGEGTYTATALQAANDMF---EDARPGVKKVALVITDGQTDSRDKEKLTEVVKNASDTNVEIFVIGVV 935
Cdd:cd01473    75 LKNSYRSGGETYIVEALKYGLKNYtkhGNRRKDAPKVTMLFTDGNDTSASKKELQDISLLYKEENVKLLVVGVG 148
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
280-330 1.11e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.94  E-value: 1.11e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 767945408   280 GERGPGGIPGYKGDKGERGECGKPGIKGDKGSPGPYGPKGPRGIQGITGPP 330
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAP 51
Kunitz_ornithodorin_C-like cd22612
C-terminal Kunitz domain of inhibitor ornithodorin and similar proteins; The Kunitz inhibitor ...
1078-1122 1.17e-04

C-terminal Kunitz domain of inhibitor ornithodorin and similar proteins; The Kunitz inhibitor ornithodorin is a highly selective and potent thrombin inhibitor isolated from blood sucking soft tick Ornithodoros moubata. Ornithodorin is a two-domain protein that resembles the tick anticoagulant peptide (TAP) isolated from the same organism, especially the N-terminal domain; this model contains the C-terminal domain. While the N-terminal domain binds to the active site of thrombin, this C-terminal domain binds at the fibrinogen recognition exosite. The structure of this domain is similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438655  Cd Length: 49  Bit Score: 40.73  E-value: 1.17e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 767945408 1078 PGNCGEYVVRWYYDKQVNSCARFWFsGCNGSGNRFNSEKECQETC 1122
Cdd:cd22612     6 PTSCAEGAEITYYDSDSRTCKVLAA-GCPSGENAFESEIECQVAC 49
vWA_CTRP cd01473
CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an ...
48-204 1.47e-04

CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60 amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions.


Pssm-ID: 238750 [Multi-domain]  Cd Length: 192  Bit Score: 43.85  E-value: 1.47e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   48 DIVFIVDssESSKIALFDKQKDFVDSLSDKIFQLTPGrslEYDIKLAALQFSSSVQIDPPFS---SWKDLQTFKQKVKSM 124
Cdd:cd01473     2 DLTLILD--ESASIGYSNWRKDVIPFTEKIINNLNIS---KDKVHVGILLFAEKNRDVVPFSdeeRYDKNELLKKINDLK 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  125 NLIGQGTFSY------YAISNATRLLKRegRKDGVKVVLLMTDGID-HPKNPDVQSISEDARISGISFITIALSTvVNEA 197
Cdd:cd01473    77 NSYRSGGETYivealkYGLKNYTKHGNR--RKDAPKVTMLFTDGNDtSASKKELQDISLLYKEENVKLLVVGVGA-ASEN 153

                  ....*..
gi 767945408  198 KLRLISG 204
Cdd:cd01473   154 KLKLLAG 160
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
489-774 1.51e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 45.79  E-value: 1.51e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  489 TGPRGPVGIGVQGPKGE---PGSIGLPGQPGVPGEDGAAGKKGEAGLPGargpegppgkgqpgpkgDEGKKGSKGNQGQR 565
Cdd:COG5164     1 TGLYGPGKTGPSDPGGVttpAGSQGSTKPAQNQGSTRPAGNTGGTRPAQ-----------------NQGSTTPAGNTGGT 63
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  566 GLPGPEGPKGEPGIMGPFGMPGTsiPGPPGPKGDRGGPGIPGFKGEPGLSirGPKGVQGPRGPVGAPGlKGDGYPGVPGP 645
Cdd:COG5164    64 RPAGNQGATGPAQNQGGTTPAQN--QGGTRPAGNTGGTTPAGDGGATGPP--DDGGATGPPDDGGSTT-PPSGGSTTPPG 138
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  646 RGLPGPPGPMGLRGVGDTGAKGEPGVRGPPgpsgprgvgtqGPKGDTGQKGLPGPPGPPGYGSQGIKGEQGPQGFPGPKG 725
Cdd:COG5164   139 DGGSTPPGPGSTGPGGSTTPPGDGGSTTPP-----------GPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDG 207
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|
gi 767945408  726 tmGHGLPGQKGEHGERGDV-GKKGDKGEIGEPGSPGKQGLQGPKGDLGLT 774
Cdd:COG5164   208 --PVKKDDKNGKGNPPDDRgGKTGPKDQRPKTNPIERRGPERPEAAALPA 255
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
386-450 1.68e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.55  E-value: 1.68e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 767945408   386 GQPGPRGPEGVPGERGlpgegFPGPKGEKGSEGPTGPqglqglsikgekgdIGPVGPQGPMGIPG 450
Cdd:pfam01391    1 GPPGPPGPPGPPGPPG-----PPGPPGPPGPPGPPGE--------------PGPPGPPGPPGPPG 46
VWA_3 pfam13768
von Willebrand factor type A domain;
47-203 5.97e-04

von Willebrand factor type A domain;


Pssm-ID: 372716 [Multi-domain]  Cd Length: 155  Bit Score: 41.61  E-value: 5.97e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408    47 IDIVFIVDSSESSKialfDKQKDFVDSLSDKIFQLTPGrsleydIKLAALQFSSSVQidPPFSSWK-----DLQTFKQKV 121
Cdd:pfam13768    1 GDVVIVVDVSSSMS----GEPKLQKDALSVALRQLPTG------DKFAVLGFGTLPR--PLFPGWRvvsprSLQEAFQFI 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   122 KSMNLIGQGTFSYYAISNATRLLKREGRkdgVKVVLLMTDGIDHPKNPDVQSISEDARiSGISFITIALSTVVNEAKLRL 201
Cdd:pfam13768   69 KTLQPPLGGSDLLGALKEAVRAPASPGY---IRHVLLLTDGSPMQGETRVSDLISRAP-GKIRFFAYGLGASISAPMLQL 144

                   ..
gi 767945408   202 IS 203
Cdd:pfam13768  145 LA 146
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
257-633 6.23e-04

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 43.84  E-value: 6.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   257 GNPGIKGERGPKGNPGNAQKGEAGERGPGGIPGYKGDKGERGECGKPGikgdkGSPGPYGPKGPRGIQGitgppgdpgpk 336
Cdd:pfam09606  106 PGGPMGQQMGGPGTASNLLASLGRPQMPMGGAGFPSQMSRVGRMQPGG-----QAGGMMQPSSGQPGSG----------- 169
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   337 gfQGNKGEPGPPGPYGSPGAPGIGQQGIKGErGQEGRPGAPGPIGvgePGQPGPRGPEGVPGERGL-PGEGFPGPKGEKG 415
Cdd:pfam09606  170 --TPNQMGPNGGPGQGQAGGMNGGQQGPMGG-QMPPQMGVPGMPG---PADAGAQMGQQAQANGGMnPQQMGGAPNQVAM 243
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   416 SEGPTGP------------------QGLQGLSIKGEKGDIGPVGPQGPMGIPGIGSQGeqgiqgpigppgpqgpagqgLP 477
Cdd:pfam09606  244 QQQQPQQqgqqsqlgmginqmqqmpQGVGGGAGQGGPGQPMGPPGQQPGAMPNVMSIG--------------------DQ 303
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   478 GSKGEVGQMGPTGprgpvGIGVQGPKGEPGSIGLPGQPGvpGEDGAAGKKGEAGLPGARGPEGPPGKGQPGPKGDEGKKG 557
Cdd:pfam09606  304 NNYQQQQTRQQQQ-----QQGGNHPAAHQQQMNQSVGQG--GQVVALGGLNHLETWNPGNFGGLGANPMQRGQPGMMSSP 376
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   558 SKGNQGQ-RGLPGPEGPKGEPGIMGPFGMPGTSIPGPPGPKGDRGGPG-IPGFKGEPGLSIRGPK--GVQGPRGPVGAPG 633
Cdd:pfam09606  377 SPVPGQQvRQVTPNQFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPAlIPSPSPQMSQQPAQQRtiGQDSPGGSLNTPG 456
vWA_BatA_type cd01467
VWA BatA type: Von Willebrand factor type A (vWA) domain was originally found in the blood ...
798-944 1.12e-03

VWA BatA type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses. In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains. Members of this subgroup are bacterial in origin. They are typified by the presence of a MIDAS motif.


Pssm-ID: 238744 [Multi-domain]  Cd Length: 180  Bit Score: 41.16  E-value: 1.12e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  798 ELVFVIDSSESVGPENF----------QIIKNFVKTM-ADRVAL----DLATARIGIINySHKVEKvANLKQFSSKDdfk 862
Cdd:cd01467     4 DIMIALDVSGSMLAQDFvkpsrleaakEVLSDFIDRReNDRIGLvvfaGAAFTQAPLTL-DRESLK-ELLEDIKIGL--- 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  863 lavdnmqyLGEGTYTATALQAANDMFEDARPgVKKVALVITDGQTDSRDKEKLTeVVKNASDTNVEIFVIGV-----VKK 937
Cdd:cd01467    79 --------AGQGTAIGDAIGLAIKRLKNSEA-KERVIVLLTDGENNAGEIDPAT-AAELAKNKGVRIYTIGVgksgsGPK 148

                  ....*..
gi 767945408  938 NDPNFEI 944
Cdd:cd01467   149 PDGSTIL 155
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
360-403 1.78e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.47  E-value: 1.78e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 767945408   360 GQQGIKGERGQEGRPGAPGPIG-VGEPGQPGPRGPEGVPGERGLP 403
Cdd:pfam01391   13 GPPGPPGPPGPPGPPGPPGEPGpPGPPGPPGPPGPPGAPGAPGPP 57
PHA03169 PHA03169
hypothetical protein; Provisional
256-420 1.90e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 41.88  E-value: 1.90e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  256 HGNPGIKGERGPKGN-------PGNAQKGEAGERGPGGIPGYKGDKGERGECGKPGIKGDKGSPGPYGPKGPRGIQGitg 328
Cdd:PHA03169   81 HGEKEERGQGGPSGSgsesvgsPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNP--- 157
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408  329 PPGDPGPKGFQGNKGEPGPPGPYGSPGAPGIGQQGIKGERGQEGRPGAPGPIGVGEPGQPGPRGPEGVPGERGLPGEGFP 408
Cdd:PHA03169  158 SPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEP 237
                         170
                  ....*....|..
gi 767945408  409 GPKGEKGSEGPT 420
Cdd:PHA03169  238 TEPEREGPPFPG 249
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
256-303 3.32e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.70  E-value: 3.32e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 767945408   256 HGNPGIKGERGPKGNPGNAqkGEAGERGPGGIPGYKGDKGERGECGKP 303
Cdd:pfam01391   12 PGPPGPPGPPGPPGPPGPP--GEPGPPGPPGPPGPPGPPGAPGAPGPP 57
vWA_collagen_alpha3-VI-like cd01481
VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable ...
48-128 3.41e-03

VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238758  Cd Length: 165  Bit Score: 39.61  E-value: 3.41e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767945408   48 DIVFIVDSSESSKIALFDKQKDFVDSLsdkIFQLTPGRSLeydIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKSMNLI 127
Cdd:cd01481     2 DIVFLIDGSDNVGSGNFPAIRDFIERI---VQSLDVGPDK---IRVAVVQFSDTPRPEFYLNTHSTKADVLGAVRRLRLR 75

                  .
gi 767945408  128 G 128
Cdd:cd01481    76 G 76
PHA03264 PHA03264
envelope glycoprotein D; Provisional
365-422 4.06e-03

envelope glycoprotein D; Provisional


Pssm-ID: 223029 [Multi-domain]  Cd Length: 416  Bit Score: 41.14  E-value: 4.06e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 767945408  365 KGERGQEgRPGAPGPIGVGEPGQPGPRGPEGVPGERGLPGEGFPGPKGEKGSEGPTGP 422
Cdd:PHA03264  286 KPEPGPV-EDGAPGRETGGEGEGPEPAGRDGAAGGEPKPGPPRPAPDADRPEGWPSLE 342
Kunitz_MitTx cd22610
Micrurus tener tener Kunitz-type neurotoxin MitTx-alpha; Micrurus tener tener Kunitz-type ...
1068-1122 4.53e-03

Micrurus tener tener Kunitz-type neurotoxin MitTx-alpha; Micrurus tener tener Kunitz-type neurotoxin MitTx-alpha is a subunit of the pain-inducing, heterodimeric polypeptide toxin that activates acid sensing ion channel a (ASIC1a) at nanomolar concentrations in a pH-independent manner. Acid sensing ion channels (ASICs) are sodium-selective, voltage-independent and amiloride-blockable ion channels that detect extracellular protons produced during inflammation or ischemic injury, and belong to the superfamily of degenerin/epithelial sodium channels. Subtype ASICa is expressed by primary afferent sensory neurons and is activated by MitTx. MitTx consists of two, non-covalently associated alpha and beta subunits that resemble Kunitz and phospholipase-A2 proteins, respectively, and together they function as a potent and selective ASIC1a agonist. The MitTx-alpha structures is similar to those of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438653  Cd Length: 59  Bit Score: 36.54  E-value: 4.53e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 767945408 1068 EDPRCLEalkpgNCGEYVVRWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22610     8 EDPPFFQ-----KCGAFVDSYYFNRSRITCVHFFYGQCDVNQNHFTTMSECNRVC 57
Kunitz_ixolaris_1 cd22625
Kunitz-type domain 1 (K1) of Ixolaris, and similar proteins; This model includes the first ...
1087-1122 9.34e-03

Kunitz-type domain 1 (K1) of Ixolaris, and similar proteins; This model includes the first Kunitz-type domain (K1) of ixolaris from the venomous organism Conus striatus. Ixolaris is a potent tick salivary anticoagulant that binds coagulation factor Xa (FXa) and zymogen FX, and forms a quaternary tissue factor (TF)/FVIIa/FX(a)/Ixolaris inhibitory complex. It blocks TF-induced coagulation and PAR2 (proteinase-activated receptor 2) signaling, and prevents thrombosis, tumor growth, and immune activation. Ixolaris consists of 2 Kunitz domains (K1 and K2), both of which recognize the heparin-binding (pro)exosite (HBE) on FX. While K2 is an extraordinarily dynamic domain that encompasses several residues involved in FX binding, K1 domain keeps as a rigid platform supporting the conformational dynamic of the K2 domain, forming a salt bridge with FXa. The structure of this domain is similar to that of Kunitz-type proteinase inhibitors such as BPTI (bovine pancreatic trypsin inhibitor), showing an alpha/beta fold with irregular secondary structure stabilized by three disulfide bonds.


Pssm-ID: 438668  Cd Length: 53  Bit Score: 35.70  E-value: 9.34e-03
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 767945408 1087 RWYYDKQVNSCARFWFSGCNGSGNRFNSEKECQETC 1122
Cdd:cd22625    18 RYGYNKKTQQCEEFLGTECGGGGNSFEEAKECWSSC 53
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH