NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|156071472|ref|NP_001758|]
View 

T-cell surface antigen CD2 isoform 2 precursor [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
IgV_CD2_like_N cd05775
N-terminal immunoglobulin (Ig)-like domain of T-cell surface antigen CD2, and similar domains; ...
29-127 6.31e-25

N-terminal immunoglobulin (Ig)-like domain of T-cell surface antigen CD2, and similar domains; The members here are composed of the N-terminal immunoglobulin (Ig)-like domain (or domain 1) of T-cell surface antigen Clusters of Differentiation (CD) 2 and similar proteins. CD2 is a T-cell specific surface glycoprotein and is critically important for mediating adhesion between T cells and antigen-presenting cells or between cytolytic T cells and target cells. CD2 is located on chromosome 1 at 1p13 in humans and on chromosome 3 in mice. CD2 contains an extracellular domain with two or Ig-like domains, a single transmembrane segment, and a cytoplasmic region rich in proline and basic residues.


:

Pssm-ID: 409431  Cd Length: 98  Bit Score: 96.65  E-value: 6.31e-25
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  29 NALETWGALGQDINLDIPSFQmsDDIDDIKWEKTSDKkkIAQFRK-EKETFKE--KDTYKLFKN-GTLKIKHLKTDDQDI 104
Cdd:cd05775    1 SSGEVYGALGGNVTLTISSLQ--DDIDEIKWKKTKDK--IVEWENnIGPTYFGsfKDRVLLDKEsGSLTIKNLTKEDSGT 76
                         90       100
                 ....*....|....*....|...
gi 156071472 105 YKVSIYDTKGKnVLEKIFDLKIQ 127
Cdd:cd05775   77 YELEITSTNGK-VLSSKFTLEVL 98
C2-set pfam05790
Immunoglobulin C2-set domain;
135-205 7.89e-16

Immunoglobulin C2-set domain;


:

Pssm-ID: 399065  Cd Length: 80  Bit Score: 71.61  E-value: 7.89e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  135 ISWTCINTTLTCEVMNGT-------DPELNLYQDGKH-LKLSQRVITHKWTTSLSAKFKCTA-GNKVSKESSVEPVSCPE 205
Cdd:pfam05790   1 VTVSCSNNLLTCEVLELTlpkgskmDPSLKLKGQEAKsLETKKLESTFQPTTEDSGTWVCLAsDNDQKKLESVIEVLVLE 80
Amelogenin super family cl33250
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
265-338 6.53e-07

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


The actual alignment was detected with superfamily member smart00818:

Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 48.63  E-value: 6.53e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 156071472   265 PHQIPASTPQNPATSQHPP-PPPGHRSQAPS--HRPPPPGHrVQHQPQKRPPA-PSGTQVHQQKGPPLPRPRVQPKPP 338
Cdd:smart00818  55 HHHIPVLPAQQPVVPQQPLmPVPGQHSMTPTqhHQPNLPQP-AQQPFQPQPLQpPQPQQPMQPQPPVHPIPPLPPQPP 131
 
Name Accession Description Interval E-value
IgV_CD2_like_N cd05775
N-terminal immunoglobulin (Ig)-like domain of T-cell surface antigen CD2, and similar domains; ...
29-127 6.31e-25

N-terminal immunoglobulin (Ig)-like domain of T-cell surface antigen CD2, and similar domains; The members here are composed of the N-terminal immunoglobulin (Ig)-like domain (or domain 1) of T-cell surface antigen Clusters of Differentiation (CD) 2 and similar proteins. CD2 is a T-cell specific surface glycoprotein and is critically important for mediating adhesion between T cells and antigen-presenting cells or between cytolytic T cells and target cells. CD2 is located on chromosome 1 at 1p13 in humans and on chromosome 3 in mice. CD2 contains an extracellular domain with two or Ig-like domains, a single transmembrane segment, and a cytoplasmic region rich in proline and basic residues.


Pssm-ID: 409431  Cd Length: 98  Bit Score: 96.65  E-value: 6.31e-25
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  29 NALETWGALGQDINLDIPSFQmsDDIDDIKWEKTSDKkkIAQFRK-EKETFKE--KDTYKLFKN-GTLKIKHLKTDDQDI 104
Cdd:cd05775    1 SSGEVYGALGGNVTLTISSLQ--DDIDEIKWKKTKDK--IVEWENnIGPTYFGsfKDRVLLDKEsGSLTIKNLTKEDSGT 76
                         90       100
                 ....*....|....*....|...
gi 156071472 105 YKVSIYDTKGKnVLEKIFDLKIQ 127
Cdd:cd05775   77 YELEITSTNGK-VLSSKFTLEVL 98
C2-set pfam05790
Immunoglobulin C2-set domain;
135-205 7.89e-16

Immunoglobulin C2-set domain;


Pssm-ID: 399065  Cd Length: 80  Bit Score: 71.61  E-value: 7.89e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  135 ISWTCINTTLTCEVMNGT-------DPELNLYQDGKH-LKLSQRVITHKWTTSLSAKFKCTA-GNKVSKESSVEPVSCPE 205
Cdd:pfam05790   1 VTVSCSNNLLTCEVLELTlpkgskmDPSLKLKGQEAKsLETKKLESTFQPTTEDSGTWVCLAsDNDQKKLESVIEVLVLE 80
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
265-338 6.53e-07

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 48.63  E-value: 6.53e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 156071472   265 PHQIPASTPQNPATSQHPP-PPPGHRSQAPS--HRPPPPGHrVQHQPQKRPPA-PSGTQVHQQKGPPLPRPRVQPKPP 338
Cdd:smart00818  55 HHHIPVLPAQQPVVPQQPLmPVPGQHSMTPTqhHQPNLPQP-AQQPFQPQPLQpPQPQQPMQPQPPVHPIPPLPPQPP 131
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
264-338 8.06e-07

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 48.50  E-value: 8.06e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  264 KPHQIPASTPQNPATSQHPPPPPGHRSQAP------SHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPKP 337
Cdd:pfam15240  76 PPPQGGKQKPQGPPPQGGPRPPPGKPQGPPpqggnqQQGPPPPGKPQGPPPQGGGPPPQGGNQQGPPPPPPGNPQGPPQR 155

                  .
gi 156071472  338 P 338
Cdd:pfam15240 156 P 156
PHA03247 PHA03247
large tegument protein UL36; Provisional
252-348 4.19e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 4.19e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  252 RAHRVATE--ERGRKPHQIPASTPQNPATSQHPPPPpghRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGP--- 326
Cdd:PHA03247 2882 PVRRLARPavSRSTESFALPPDQPERPPQPQAPPPP---QPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPsga 2958
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 156071472  327 --------------PLPRPRV-QPKPPHGAAENSLSP 348
Cdd:PHA03247 2959 vpqpwlgalvpgrvAVPRFRVpQPAPSREAPASSTPP 2995
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
265-350 1.84e-04

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 42.97  E-value: 1.84e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 265 PHQIPASTPQNPatsqhPPPPPGHRSQAPSHRPPPpghrvqhqPQKRPPAPsgtqvhqqkgPPLPRPRVQPKPPHGAAEN 344
Cdd:NF040983  86 PNKVPPPPPPPP-----PPPPPPPTPPPPPPPPPP--------PPPPSPPP----------PPPPSPPPSPPPPTTTPPT 142

                 ....*.
gi 156071472 345 SLSPSS 350
Cdd:NF040983 143 RTTPST 148
COG3416 COG3416
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
262-317 1.70e-03

Uncharacterized conserved protein, DUF2076 domain [Function unknown];


Pssm-ID: 442642 [Multi-domain]  Cd Length: 237  Bit Score: 39.24  E-value: 1.70e-03
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 156071472 262 GRKPHQIPASTPQNPATSQHPPPPPGHRSQAPShRPPPPGHRVQHQPQKRPPAPSG 317
Cdd:COG3416   91 GGGQRPPPAPQPSQPGPQQQPAPPSGPWGQAAP-QQPGYGQPQYGQPAAGPSGGGG 145
 
Name Accession Description Interval E-value
IgV_CD2_like_N cd05775
N-terminal immunoglobulin (Ig)-like domain of T-cell surface antigen CD2, and similar domains; ...
29-127 6.31e-25

N-terminal immunoglobulin (Ig)-like domain of T-cell surface antigen CD2, and similar domains; The members here are composed of the N-terminal immunoglobulin (Ig)-like domain (or domain 1) of T-cell surface antigen Clusters of Differentiation (CD) 2 and similar proteins. CD2 is a T-cell specific surface glycoprotein and is critically important for mediating adhesion between T cells and antigen-presenting cells or between cytolytic T cells and target cells. CD2 is located on chromosome 1 at 1p13 in humans and on chromosome 3 in mice. CD2 contains an extracellular domain with two or Ig-like domains, a single transmembrane segment, and a cytoplasmic region rich in proline and basic residues.


Pssm-ID: 409431  Cd Length: 98  Bit Score: 96.65  E-value: 6.31e-25
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  29 NALETWGALGQDINLDIPSFQmsDDIDDIKWEKTSDKkkIAQFRK-EKETFKE--KDTYKLFKN-GTLKIKHLKTDDQDI 104
Cdd:cd05775    1 SSGEVYGALGGNVTLTISSLQ--DDIDEIKWKKTKDK--IVEWENnIGPTYFGsfKDRVLLDKEsGSLTIKNLTKEDSGT 76
                         90       100
                 ....*....|....*....|...
gi 156071472 105 YKVSIYDTKGKnVLEKIFDLKIQ 127
Cdd:cd05775   77 YELEITSTNGK-VLSSKFTLEVL 98
C2-set pfam05790
Immunoglobulin C2-set domain;
135-205 7.89e-16

Immunoglobulin C2-set domain;


Pssm-ID: 399065  Cd Length: 80  Bit Score: 71.61  E-value: 7.89e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  135 ISWTCINTTLTCEVMNGT-------DPELNLYQDGKH-LKLSQRVITHKWTTSLSAKFKCTA-GNKVSKESSVEPVSCPE 205
Cdd:pfam05790   1 VTVSCSNNLLTCEVLELTlpkgskmDPSLKLKGQEAKsLETKKLESTFQPTTEDSGTWVCLAsDNDQKKLESVIEVLVLE 80
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
265-338 6.53e-07

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 48.63  E-value: 6.53e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 156071472   265 PHQIPASTPQNPATSQHPP-PPPGHRSQAPS--HRPPPPGHrVQHQPQKRPPA-PSGTQVHQQKGPPLPRPRVQPKPP 338
Cdd:smart00818  55 HHHIPVLPAQQPVVPQQPLmPVPGQHSMTPTqhHQPNLPQP-AQQPFQPQPLQpPQPQQPMQPQPPVHPIPPLPPQPP 131
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
264-338 8.06e-07

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 48.50  E-value: 8.06e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  264 KPHQIPASTPQNPATSQHPPPPPGHRSQAP------SHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPKP 337
Cdd:pfam15240  76 PPPQGGKQKPQGPPPQGGPRPPPGKPQGPPpqggnqQQGPPPPGKPQGPPPQGGGPPPQGGNQQGPPPPPPGNPQGPPQR 155

                  .
gi 156071472  338 P 338
Cdd:pfam15240 156 P 156
PHA03247 PHA03247
large tegument protein UL36; Provisional
252-348 4.19e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 4.19e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  252 RAHRVATE--ERGRKPHQIPASTPQNPATSQHPPPPpghRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGP--- 326
Cdd:PHA03247 2882 PVRRLARPavSRSTESFALPPDQPERPPQPQAPPPP---QPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPsga 2958
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 156071472  327 --------------PLPRPRV-QPKPPHGAAENSLSP 348
Cdd:PHA03247 2959 vpqpwlgalvpgrvAVPRFRVpQPAPSREAPASSTPP 2995
PHA03247 PHA03247
large tegument protein UL36; Provisional
261-345 4.30e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 4.30e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  261 RGRKPhqiPASTPQNPATSQHPP--------PPPGHRSQA----PSHRPPPPGHRVQHQPQKRPPAPSGTQvHQQKGPPL 328
Cdd:PHA03247 2863 RRRPP---SRSPAAKPAAPARPPvrrlarpaVSRSTESFAlppdQPERPPQPQAPPPPQPQPQPPPPPQPQ-PPPPPPPR 2938
                          90
                  ....*....|....*...
gi 156071472  329 PRPRVQPKP-PHGAAENS 345
Cdd:PHA03247 2939 PQPPLAPTTdPAGAGEPS 2956
PHA03247 PHA03247
large tegument protein UL36; Provisional
241-349 4.66e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 4.66e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  241 RSRRNDEELETRAHRVATEERGRKPHQiPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQV 320
Cdd:PHA03247 2585 RARRPDAPPQSARPRAPVDDRGDPRGP-APPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSR 2663
                          90       100
                  ....*....|....*....|....*....
gi 156071472  321 HQQKGPPlPRPRVQPKPPHGAAENSLSPS 349
Cdd:PHA03247 2664 PRRARRL-GRAAQASSPPQRPRRRAARPT 2691
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
262-340 2.70e-05

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 43.87  E-value: 2.70e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  262 GRKPHQIPASTPQN----PATSQHPPPPPGHrsQAPSHRPPPPGHRVQHQPQKRPPAPSGTqvhQQKGPPLPRPRVQPKP 337
Cdd:pfam15240  52 GGFPPQPPASDDPPgpppPGGPQQPPPQGGK--QKPQGPPPQGGPRPPPGKPQGPPPQGGN---QQQGPPPPGKPQGPPP 126

                  ...
gi 156071472  338 PHG 340
Cdd:pfam15240 127 QGG 129
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
267-339 3.06e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 45.80  E-value: 3.06e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 156071472  267 QIPASTPQNPATSQHPP-PPPGHRSQAPSHRPPPPGHrvQHQPQKRPPAPsgtQVHQQKGPP---LPRPRVQPKPPH 339
Cdd:pfam09770 206 QAKKPAQQPAPAPAQPPaAPPAQQAQQQQQFPPQIQQ--QQQPQQQPQQP---QQHPGQGHPvtiLQRPQSPQPDPA 277
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
247-342 3.17e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 45.80  E-value: 3.17e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  247 EELET--RAH-RVATEERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPghrVQHQPQKRPPAPSGTQVHQQ 323
Cdd:pfam09770 197 EEVEAamRAQaKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQP---QQHPGQGHPVTILQRPQSPQ 273
                          90
                  ....*....|....*....
gi 156071472  324 KGPPLPRPRVQPKPPHGAA 342
Cdd:pfam09770 274 PDPAQPSIQPQAQQFHQQP 292
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
265-351 4.58e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 45.53  E-value: 4.58e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  265 PHQIPAS---TPQNPATSQHPPPP----PGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPP------LPRP 331
Cdd:pfam03154 290 QHPVPPQpfpLTPQSSQSQVPPGPspaaPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPttpipqLPNP 369
                          90       100
                  ....*....|....*....|
gi 156071472  332 RVQPKPPHGAAENSLSPSSN 351
Cdd:pfam03154 370 QSHKHPPHLSGPSPFQMNSN 389
PHA03247 PHA03247
large tegument protein UL36; Provisional
255-342 5.63e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 5.63e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  255 RVATEERGRKPHQIP--ASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPR 332
Cdd:PHA03247 2660 RVSRPRRARRLGRAAqaSSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAP 2739
                          90
                  ....*....|
gi 156071472  333 VQPKPPHGAA 342
Cdd:PHA03247 2740 APPAVPAGPA 2749
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
265-339 5.85e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 45.03  E-value: 5.85e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 156071472  265 PHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPKPPH 339
Cdd:pfam09770 222 PAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVP 296
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
265-335 8.75e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 44.26  E-value: 8.75e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 156071472  265 PHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPpgHRVQHQ--PQKRPPAPSGTQVHQQKGPPLPRPRVQP 335
Cdd:pfam09770 229 QAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPV--TILQRPqsPQPDPAQPSIQPQAQQFHQQPPPVPVQP 299
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
253-344 8.84e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.21  E-value: 8.84e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 253 AHRVATEERGRKPHQIPASTPQnPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPR 332
Cdd:PRK07764 419 AAAAPAPAAAPQPAPAPAPAPA-PPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAP 497
                         90
                 ....*....|..
gi 156071472 333 VQPKPPHGAAEN 344
Cdd:PRK07764 498 AAPAAPAGADDA 509
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
265-341 8.90e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 44.26  E-value: 8.90e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  265 PHQIPASTPQNPatSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQ------------QKGPPLPRPR 332
Cdd:pfam09770 246 PQQQPQQPQQHP--GQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQilqnpnrlsaarVGYPQNPQPG 323

                  ....*....
gi 156071472  333 VQPKPPHGA 341
Cdd:pfam09770 324 VQPAPAHQA 332
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
264-339 1.41e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 43.87  E-value: 1.41e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 156071472  264 KPHQiPASTPQNPATSQHPPPPPGHRSQA---PsHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPKPPH 339
Cdd:pfam09770 275 DPAQ-PSIQPQAQQFHQQPPPVPVQPTQIlqnP-NRLSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPIITHPQQ 351
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
239-345 1.59e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 43.62  E-value: 1.59e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  239 KQRSRRNDEELETRAHRVATEERGRKPHQIPASTPQNPATSQHP----------PPPPGHRSQAPSHRPPPPGHRVQHQP 308
Cdd:PHA03307  809 ADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSEsskskpaaagGRARGKNGRRRPRPPEPRARPGAAAP 888
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 156071472  309 QKRPPAPSGTQVHQQKGPPLPRPRVQPKPPHGAAENS 345
Cdd:PHA03307  889 PKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDPRG 925
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
265-350 1.84e-04

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 42.97  E-value: 1.84e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 265 PHQIPASTPQNPatsqhPPPPPGHRSQAPSHRPPPpghrvqhqPQKRPPAPsgtqvhqqkgPPLPRPRVQPKPPHGAAEN 344
Cdd:NF040983  86 PNKVPPPPPPPP-----PPPPPPPTPPPPPPPPPP--------PPPPSPPP----------PPPPSPPPSPPPPTTTPPT 142

                 ....*.
gi 156071472 345 SLSPSS 350
Cdd:NF040983 143 RTTPST 148
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
268-338 1.98e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 1.98e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  268 IPASTPQNPATSQHPPPP---PGHRSQAPSHRPPPP-------GHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPKP 337
Cdd:pfam03154 278 MPHSLQTGPSHMQHPVPPqpfPLTPQSSQSQVPPGPspaapgqSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKP 357

                  .
gi 156071472  338 P 338
Cdd:pfam03154 358 P 358
PRK10927 PRK10927
cell division protein FtsN;
250-337 2.42e-04

cell division protein FtsN;


Pssm-ID: 236797 [Multi-domain]  Cd Length: 319  Bit Score: 42.36  E-value: 2.42e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 250 ETRAHRVATEERGRKPHQIPASTP--QNPATSQHPPPPPGHRSQAP--SHRPPPPGHRVQHQPQK---RPPAPSGTQVHQ 322
Cdd:PRK10927 144 QTPEQRQQTLQRQRQAQQLAEQQRlaQQSRTTEQSWQQQTRTSQAApvQAQPRQSKPASTQQPYQdllQTPAHTTAQSKP 223
                         90
                 ....*....|....*
gi 156071472 323 QKGPPLPRPRVQPKP 337
Cdd:PRK10927 224 QQAAPVTRAADAPKP 238
PHA03247 PHA03247
large tegument protein UL36; Provisional
251-338 2.64e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 2.64e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  251 TRAHRV---ATEERGRKPHQIPASTPqnpATSQHPPPPPGhrsqapSHRPPPPGhrvqhqPQKRPPAPSGTQVHQQKGPP 327
Cdd:PHA03247 2584 SRARRPdapPQSARPRAPVDDRGDPR---GPAPPSPLPPD------THAPDPPP------PSPSPAANEPDPHPPPTVPP 2648
                          90
                  ....*....|.
gi 156071472  328 LPRPRVQPKPP 338
Cdd:PHA03247 2649 PERPRDDPAPG 2659
Gag_spuma pfam03276
Spumavirus gag protein;
265-342 3.59e-04

Spumavirus gag protein;


Pssm-ID: 460872 [Multi-domain]  Cd Length: 614  Bit Score: 42.43  E-value: 3.59e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  265 PHQIPASTPQN--PATSQHPPP------PPGHR-SQAPSHRP-PPPGHRVQHQPQKR---PPAPSGTQVHQQKGPPLPRP 331
Cdd:pfam03276 218 PGNIARSLGDDimPSLGDAGMPqprfafHPGNPfAEAEGHPFaEAEGERPRDIPRAPridAPSAPAIPAIQPIAPPMIPP 297
                          90
                  ....*....|..
gi 156071472  332 R-VQPKPPHGAA 342
Cdd:pfam03276 298 IgAPIPIPHGAS 309
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
257-350 3.74e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.28  E-value: 3.74e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 257 ATEERGRKPHQIPASTPQNPATSQHPPPPPghRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPK 336
Cdd:PRK07764 396 AAAPSAAAAAPAAAPAPAAAAPAAAAAPAP--AAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAA 473
                         90
                 ....*....|....
gi 156071472 337 PPHGAAENSLSPSS 350
Cdd:PRK07764 474 PEPTAAPAPAPPAA 487
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
253-350 4.14e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.01  E-value: 4.14e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 253 AHRVATEERGRKPHQIPASTPQNPATSQhPPPPPGHRSQAPSHRPPPPGHRVqhqPQKRPPAPSGTQVHQQKGPPLPrPR 332
Cdd:PRK14951 391 AAPVAQAAAAPAPAAAPAAAASAPAAPP-AAAPPAPVAAPAAAAPAAAPAAA---PAAVALAPAPPAQAAPETVAIP-VR 465
                         90
                 ....*....|....*...
gi 156071472 333 VQPKPPHGAAENSLSPSS 350
Cdd:PRK14951 466 VAPEPAVASAAPAPAAAP 483
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
265-339 4.95e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.06  E-value: 4.95e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  265 PHQIPASTPQNPATSQHPPPPPGHRSQAPSHRP---PPPGHRVQHQPQKRPPAPSGTQVHQQ-----KGPPLPRPrVQPK 336
Cdd:pfam03154 207 PPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPqrlPSPHPPLQPMTQPPPPSQVSPQPLPQpslhgQMPPMPHS-LQTG 285

                  ...
gi 156071472  337 PPH 339
Cdd:pfam03154 286 PSH 288
PHA03247 PHA03247
large tegument protein UL36; Provisional
236-350 5.87e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 5.87e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  236 KRKKQRSRrndeeletraHRVATEERGRKPHQIPASTPQNPATSQHPPPPPGhrsqaPSHRPPPPGHRVqhqPQKRPPAP 315
Cdd:PHA03247  382 TRKRRSAR----------HAATPFARGPGGDDQTRPAAPVPASVPTPAPTPV-----PASAPPPPATPL---PSAEPGSD 443
                          90       100       110
                  ....*....|....*....|....*....|....*
gi 156071472  316 SGTQVhqqkgPPLPRPRVQPKPPHGAAENSLSPSS 350
Cdd:PHA03247  444 DGPAP-----PPERQPPAPATEPAPDDPDDATRKA 473
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
264-349 9.59e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.29  E-value: 9.59e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  264 KPHQIPASTPQNPATSQHPP------PPPGHRSQA-PSHRPPP------PGHRVQHQPQKRPPAPSGTQVHQQKGPPLPR 330
Cdd:pfam03154 250 QPMTQPPPPSQVSPQPLPQPslhgqmPPMPHSLQTgPSHMQHPvppqpfPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPP 329
                          90
                  ....*....|....*....
gi 156071472  331 PRVQPKPPHGAAENSLSPS 349
Cdd:pfam03154 330 SQSQLQSQQPPREQPLPPA 348
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
260-350 1.02e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.12  E-value: 1.02e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 260 ERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRP-----PAPSGTQVHQQKGPPLPRPRVQ 334
Cdd:PRK07764 610 EEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDAsdggdGWPAKAGGAAPAAPPPAPAPAA 689
                         90
                 ....*....|....*.
gi 156071472 335 PKPPHGAAENSLSPSS 350
Cdd:PRK07764 690 PAAPAGAAPAQPAPAP 705
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
253-343 1.04e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.12  E-value: 1.04e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 253 AHRVATEERgRKPHQIPASTPQNPATSQHPPPP----PGHRSQAPSHRPPPPGHRVQH--QPQKRPPAPSGTQVHQQKGP 326
Cdd:PRK07764 375 LARLERLER-RLGVAGGAGAPAAAAPSAAAAAPaaapAPAAAAPAAAAAPAPAAAPQPapAPAPAPAPPSPAGNAPAGGA 453
                         90
                 ....*....|....*..
gi 156071472 327 PLPRPRVQPKPPHGAAE 343
Cdd:PRK07764 454 PSPPPAAAPSAQPAPAP 470
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
273-338 1.05e-03

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 39.25  E-value: 1.05e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 156071472  273 PQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQkrPPAPSgtqvhQQKGPPL--PRPRVQPKPP 338
Cdd:pfam15240 106 PQGGNQQQGPPPPGKPQGPPPQGGGPPPQGGNQQGPP--PPPPG-----NPQGPPQrpPQPGNPQGPP 166
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
257-338 1.25e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.92  E-value: 1.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  257 ATEERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRP--------PAPSGTQVHQQKGPPL 328
Cdd:PHA03307  328 STSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGrptrrrarAAVAGRARRRDATGRF 407
                          90
                  ....*....|
gi 156071472  329 PRPRVQPKPP 338
Cdd:PHA03307  408 PAGRPRPSPL 417
PRK10263 PRK10263
DNA translocase FtsK; Provisional
250-348 1.52e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 40.45  E-value: 1.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  250 ETRAHRVATEERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKrpPAPSGTQVHQQKGPPLP 329
Cdd:PRK10263  752 VQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVA--PQPQYQQPQQPVAPQPQ 829
                          90
                  ....*....|....*....
gi 156071472  330 RPRVQPKPPHGAAENSLSP 348
Cdd:PRK10263  830 YQQPQQPVAPQPQDTLLHP 848
PHA03247 PHA03247
large tegument protein UL36; Provisional
269-349 1.55e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.69  E-value: 1.55e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  269 PASTPQNPATSQHPPPP---------PGH--RSQAPSHRPP-----PPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPR 332
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPslplggsvaPGGdvRRRPPSRSPAakpaaPARPPVRRLARPAVSRSTESFALPPDQPERPPQP 2911
                          90
                  ....*....|....*..
gi 156071472  333 VQPKPPHGAAENSLSPS 349
Cdd:PHA03247 2912 QAPPPPQPQPQPPPPPQ 2928
IgV_CEACAM_like cd05741
Immunoglobulin (Ig)-like domain of carcinoembryonic antigen (CEA) related cell adhesion ...
35-116 1.56e-03

Immunoglobulin (Ig)-like domain of carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM) and similar proteins; The members here are composed of the immunoglobulin (Ig)-like domain in carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM) and related domains. The CEA family is a group of anchored or secreted glycoproteins, expressed by epithelial cells, leukocytes, endothelial cells and placenta. The CEA family is divided into the CEACAM and pregnancy-specific glycoprotein (PSG) subfamilies. This group represents the CEACAM subfamily. CEACAM1 has many important cellular functions: it is a cell adhesion molecule and a signaling molecule that regulates the growth of tumor cells, an angiogenic factor, and a receptor for bacterial and viral pathogens, including mouse hepatitis virus (MHV). In mice, four isoforms of CEACAM1 generated by alternative splicing have either two (D1, D4) or four (D1-D4) Ig-like domains on the cell surface. This family corresponds to the D1 Ig-like domain. Also belonging to this group is the N-terminal immunoglobulin (Ig)-like domain of the signaling lymphocyte activation molecule (SLAM) family, CD84-like family. The SLAM family is a group of immune-cell specific receptors that can regulate both adaptive and innate immune responses. SLAM family proteins are organized as an extracellular domain with having two or four Ig-like domains, a single transmembrane segment, and a cytoplasmic region having Tyr-based motifs. The extracellular domain is organized as a membrane-distal Ig variable (IgV) domain that is responsible for ligand recognition and a membrane-proximal truncated Ig constant-2 (IgC2) domain.


Pssm-ID: 409403  Cd Length: 102  Bit Score: 37.50  E-value: 1.56e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  35 GALGQDINLDIPSFQMSDDidDIKWEKTSD---KKKIAQFRKEKETF----KEKDTYKLFKNGTLKIKHLKTDDQDIYKV 107
Cdd:cd05741    7 GAEGKNVLLLVPNLQTPLK--SVSWYKGKQvsrNDEIAEYENSSDEFragsAFSGREYIYTNGSLLIQNITLSDTGFYTL 84

                 ....*....
gi 156071472 108 SIYDTKGKN 116
Cdd:cd05741   85 ESTNIGGKT 93
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
269-349 1.63e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.35  E-value: 1.63e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 269 PASTPQNPATSQHPPPPPGhrSQAPSHRPPPPGHRVQHQPQKrPPAPSGTQVHQQKGPPLPRPRVQPKPPhGAAENSLSP 348
Cdd:PRK07764 425 PAAAPQPAPAPAPAPAPPS--PAGNAPAGGAPSPPPAAAPSA-QPAPAPAAAPEPTAAPAPAPPAAPAPA-AAPAAPAAP 500

                 .
gi 156071472 349 S 349
Cdd:PRK07764 501 A 501
COG3416 COG3416
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
262-317 1.70e-03

Uncharacterized conserved protein, DUF2076 domain [Function unknown];


Pssm-ID: 442642 [Multi-domain]  Cd Length: 237  Bit Score: 39.24  E-value: 1.70e-03
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 156071472 262 GRKPHQIPASTPQNPATSQHPPPPPGHRSQAPShRPPPPGHRVQHQPQKRPPAPSG 317
Cdd:COG3416   91 GGGQRPPPAPQPSQPGPQQQPAPPSGPWGQAAP-QQPGYGQPQYGQPAAGPSGGGG 145
PHA03247 PHA03247
large tegument protein UL36; Provisional
252-350 1.76e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.31  E-value: 1.76e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  252 RAHRVATEERGRKPHQIPASTPQNPATSQHPP---PPPGHRSQAPSHRPPP--PGHRVQHQPQKRPPAPSGTQVHQQKGP 326
Cdd:PHA03247  344 RQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHPkraSLPTRKRRSARHAATPfaRGPGGDDQTRPAAPVPASVPTPAPTPV 423
                          90       100
                  ....*....|....*....|....*
gi 156071472  327 PLPRPRVQPKP-PHGAAENSLSPSS 350
Cdd:PHA03247  424 PASAPPPPATPlPSAEPGSDDGPAP 448
PHA03247 PHA03247
large tegument protein UL36; Provisional
237-340 1.97e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.31  E-value: 1.97e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  237 RKKQRSRRNDEELETRAHRVATEERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAP----SHRPPPPGHRVQHQPQKRP 312
Cdd:PHA03247  382 TRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAepgsDDGPAPPPERQPPAPATEP 461
                          90       100
                  ....*....|....*....|....*...
gi 156071472  313 PAPSGTQVHQQKGPPLpRPRVQPKPPHG 340
Cdd:PHA03247  462 APDDPDDATRKALDAL-RERRPPEPPGA 488
PHA03378 PHA03378
EBNA-3B; Provisional
257-337 2.47e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 40.05  E-value: 2.47e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 257 ATEERGRKPHQIPASTPQNPATSQHPPPPPGhrsqAPSHRPPP---PGHRVQHQPQKRPPAPsgtQVHQQKGP-PLPRPR 332
Cdd:PHA03378 729 AAPGRARPPAAAPGRARPPAAAPGRARPPAA----APGRARPPaaaPGAPTPQPPPQAPPAP---QQRPRGAPtPQPPPQ 801

                 ....*
gi 156071472 333 VQPKP 337
Cdd:PHA03378 802 AGPTS 806
PHA03419 PHA03419
E4 protein; Provisional
266-349 2.47e-03

E4 protein; Provisional


Pssm-ID: 223079 [Multi-domain]  Cd Length: 200  Bit Score: 38.78  E-value: 2.47e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 266 HQIPASTPQNPATSQHP-------PPPPGHRSQAP-SHRP-----PPPGHR--VQHQP---------QKRPPAPSGTQVH 321
Cdd:PHA03419  43 RQLETGYPFCPPTTPHPssqpppcPPSPGHPPQTNdTHEKdlalqPPPGGKkkEKKKKetekpaqggEKPDQGPEAKGEG 122
                         90       100
                 ....*....|....*....|....*...
gi 156071472 322 QQKGPPLPRPRVQPKPPHGAAENSLSPS 349
Cdd:PHA03419 123 EGHEPEDPPPEDTPPPPGGEGEVEGGPS 150
Ig_2 pfam13895
Immunoglobulin domain; This domain contains immunoglobulin-like domains.
141-197 2.80e-03

Immunoglobulin domain; This domain contains immunoglobulin-like domains.


Pssm-ID: 464026 [Multi-domain]  Cd Length: 79  Bit Score: 36.22  E-value: 2.80e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 156071472  141 NTTLTCEVMNGTDPELNLYQDGKHLKLSQRVITHKWTTSLSAKFKCTAGNKVSKESS 197
Cdd:pfam13895  16 PVTLTCSAPGNPPPSYTWYKDGSAISSSPNFFTLSVSAEDSGTYTCVARNGRGGKVS 72
Jun pfam03957
Jun-like transcription factor;
268-316 2.84e-03

Jun-like transcription factor;


Pssm-ID: 461108 [Multi-domain]  Cd Length: 231  Bit Score: 38.74  E-value: 2.84e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 156071472  268 IPASTPQNPATSQHPPPPPGhRSQAPSHRPPPPGHRVQHQPQKRPPAPS 316
Cdd:pfam03957 179 APAQPPQPVSYAAEPPPFAV-PVQHPPPGRPPRLQALKEEPQTVPEVPS 226
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
270-343 2.97e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 39.34  E-value: 2.97e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 156071472 270 ASTPQNPATSQHPPPPPG--HRSQAPSHRPPPPGHRVQHQPQKRPPAPSgtqvhqqkGPPLPRPRVQPKPPHGAAE 343
Cdd:PRK14965 381 APAPPSAAWGAPTPAAPAapPPAAAPPVPPAAPARPAAARPAPAPAPPA--------AAAPPARSADPAAAASAGD 448
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
268-350 3.11e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 39.41  E-value: 3.11e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 268 IPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQ--PQKRPPAPSGTQVHQQKGPPLPRPRVQPKPPHGAAENS 345
Cdd:PRK14950 360 LVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPkePVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVD 439

                 ....*
gi 156071472 346 LSPSS 350
Cdd:PRK14950 440 EKPKY 444
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
257-342 3.25e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 39.58  E-value: 3.25e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 257 ATEERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPL-----PRP 331
Cdd:PRK07764 620 APAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAapagaAPA 699
                         90
                 ....*....|.
gi 156071472 332 RVQPKPPHGAA 342
Cdd:PRK07764 700 QPAPAPAATPP 710
DUF6264 pfam19779
Family of unknown function (DUF6264); This family of putative integral membrane proteins is ...
257-317 3.29e-03

Family of unknown function (DUF6264); This family of putative integral membrane proteins is functionally uncharacterized. This family of proteins is found in bacteria. Proteins in this family are typically between 179 and 218 amino acids in length.


Pssm-ID: 466182  Cd Length: 182  Bit Score: 38.01  E-value: 3.29e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 156071472  257 ATEERGRKPHQIPASTpQNPATSQHPPPPPGHRSQAPSHRP---PPPGHRVQHQPQKRPPAPSG 317
Cdd:pfam19779   8 APPGWQRAPIGDPAAA-AAAAPPAAPAPAAPAPPAAPAAPPaapPPPGAPAPGAPAAARRARRW 70
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
266-338 3.60e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 37.85  E-value: 3.60e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 156071472   266 HQIPASTPQNPATsqHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPsgTQVHQQKGPPLPRPRVQPKPP 338
Cdd:smart00818  37 HQIIPVSQQHPPT--HTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTP--TQHHQPNLPQPAQQPFQPQPL 105
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
261-338 4.23e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 39.08  E-value: 4.23e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 261 RGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQ--QKGPPLPRPRVQPKPP 338
Cdd:PRK07994 360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQllAARQQLQRAQGATKAK 439
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
247-338 4.54e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 39.00  E-value: 4.54e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  247 EELETRAHRVATEERGRKPHqiPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGP 326
Cdd:PHA03307  103 EGSPTPPGPSSPDPPPPTPP--PASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPE 180
                          90
                  ....*....|..
gi 156071472  327 PLPRPRVQPKPP 338
Cdd:PHA03307  181 ETARAPSSPPAE 192
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
265-350 5.29e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 38.81  E-value: 5.29e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 265 PHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPKPPHGAAEN 344
Cdd:PRK07764 650 PEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGAS 729

                 ....*.
gi 156071472 345 SLSPSS 350
Cdd:PRK07764 730 APSPAA 735
PHA03378 PHA03378
EBNA-3B; Provisional
251-337 5.31e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 38.89  E-value: 5.31e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 251 TRAHRVATEERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPshRPPPPGHRVQHQPQKRPPAPSGTQvhQQKGPPLP- 329
Cdd:PHA03378 712 GRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRA--RPPAAAPGRARPPAAAPGAPTPQP--PPQAPPAPq 787

                 ....*....
gi 156071472 330 -RPRVQPKP 337
Cdd:PHA03378 788 qRPRGAPTP 796
PHA03378 PHA03378
EBNA-3B; Provisional
251-338 5.36e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 38.89  E-value: 5.36e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472 251 TRAHRVATEERGRKPHQIPASTPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVqhqPQKRP-PAPSGTQvhQQKGPPLP 329
Cdd:PHA03378 732 GRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPA---PQQRPrGAPTPQP--PPQAGPTS 806

                 ....*....
gi 156071472 330 RPRVQPKPP 338
Cdd:PHA03378 807 MQLMPRAAP 815
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
265-341 5.55e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 38.86  E-value: 5.55e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  265 PHQIPAST---PQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQ--HQPQKRPPAPSGTQVHQQKG----PPLPRPRVQP 335
Cdd:pfam09770 258 GQGHPVTIlqrPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQilQNPNRLSAARVGYPQNPQPGvqpaPAHQAHRQQG 337

                  ....*.
gi 156071472  336 KPPHGA 341
Cdd:pfam09770 338 SFGRQA 343
PRK12757 PRK12757
cell division protein FtsN; Provisional
265-338 5.70e-03

cell division protein FtsN; Provisional


Pssm-ID: 237191 [Multi-domain]  Cd Length: 256  Bit Score: 38.10  E-value: 5.70e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 156071472 265 PHQIPASTPQNPATSQHPPPPPghRSQAPSHRPPPPGHRVQHQPQKRPPapsgtqVHQQKGPPLPRPRVQPKPP 338
Cdd:PRK12757 111 TPQVPRSTVQIQQQAQQQQPPA--TTAQPQPVTPPRQTTAPVQPQTPAP------VRTQPAAPVTQAVEAPKVE 176
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
272-338 5.97e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 38.52  E-value: 5.97e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 156071472 272 TPQNPATSQHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAP----SGTQVHQQKGPPLPRPRVQPKPP 338
Cdd:PTZ00449 596 KPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPqrpsSPERPEGPKIIKSPKPPKSPKPP 666
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
252-349 6.84e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 38.59  E-value: 6.84e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  252 RAHRVATEERGRKPHQIPASTPQNPA-------TSQHPP---PPPGH-RSQAPSHRPPPPGHRVQHQPQKRPPAPSGtqv 320
Cdd:pfam03154 370 QSHKHPPHLSGPSPFQMNSNLPPPPAlkplsslSTHHPPsahPPPLQlMPQSQQLPPPPAQPPVLTQSQSLPPPAAS--- 446
                          90       100       110
                  ....*....|....*....|....*....|...
gi 156071472  321 HQQKGPPLPRPRVQPKPPH----GAAENSLSPS 349
Cdd:pfam03154 447 HPPTSGLHQVPSQSPFPQHpfvpGGPPPITPPS 479
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
262-351 7.07e-03

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteriztic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. the family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus mutations were found in this protein.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 37.87  E-value: 7.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 156071472  262 GRKPHQIPAS-TPQNPATS--QHPPPPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSgtqvhqqkgPPLPRPRVQPKPP 338
Cdd:pfam15279 158 GRPPGSPPMSmTPRGLLGKpqQHPPPSPLPAFMEPSSMPPPFLRPPPSIPQPNSPLSN---------PMLPGIGPPPKPP 228
                          90
                  ....*....|...
gi 156071472  339 HgaaenSLSPSSN 351
Cdd:pfam15279 229 R-----NLGPPSN 236
PHA03420 PHA03420
E4 protein; Provisional
277-349 8.60e-03

E4 protein; Provisional


Pssm-ID: 177648  Cd Length: 137  Bit Score: 36.13  E-value: 8.60e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 156071472 277 ATSQHPP-PPPGHRSQAPSHRPPPPGHRVQHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPKPPHGAAENSLSPS 349
Cdd:PHA03420  28 AENQHIQrEGGNHRTWDPADYLDRPHHHPHRHQQDDHHLQDRQHLPQQHLQRPHHPLSPQCQQNVTGKPVLQQE 101
PRK01297 PRK01297
ATP-dependent RNA helicase RhlB; Provisional
277-337 9.97e-03

ATP-dependent RNA helicase RhlB; Provisional


Pssm-ID: 234938 [Multi-domain]  Cd Length: 475  Bit Score: 37.58  E-value: 9.97e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 156071472 277 ATSQHPPPPPGHRSQAPSHRPPPPghrvqHQPQKRPPAPSGTQVHQQKGPPLPRPRVQPKP 337
Cdd:PRK01297  12 GEAEQPAPAPPSPAAAPAPPPPAK-----TAAPATKAAAPAAAAPRAEKPKKDKPRRERKP 67
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH