NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2462599775|ref|XP_054207238|]
View 

slit homolog 2 protein isoform X2 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1163-1296 7.88e-37

Laminin G domain;


:

Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 135.54  E-value: 7.88e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  1163 NITLQIATDEDSGILLY---KGDKDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGG 1239
Cdd:smart00282    1 SISFSFRTTSPNGLLLYagsKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462599775  1240 NPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYINSE 1296
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLP-----EDLKLPPLPVTPGFRGCIRNLKVNGK 132
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
56-211 4.12e-23

Leucine-rich repeat (LRR) protein [Transcription];


:

Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 103.86  E-value: 4.12e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   56 NTERLDLNGNNITRITKtDFAGLRHLRVLQLMENKISTIERgAFQDLKELERLRLNRNHLQLFPELLfLGTAKLYRLDLS 135
Cdd:COG4886    114 NLESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPEEL-GNLTNLKELDLS 190
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462599775  136 ENQIQAIPrKAFRGAVDIKNLQLDYNQISCIEDgAFRALRDLEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 211
Cdd:COG4886    191 NNQITDLP-EPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQL 262
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
524-841 3.09e-19

Leucine-rich repeat (LRR) protein [Transcription];


:

Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 92.30  E-value: 3.09e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  524 SNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGafegasgvneilltsnrlenvqh 603
Cdd:COG4886     75 LLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNEELSNLTNLESLDLSGNQLTDLPEE----------------------- 131
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  604 kmFKGLESLKTLMLRSNRITCVGnDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLlanpfncncylawlgewl 683
Cdd:COG4886    132 --LANLTNLKELDLSNNQLTDLP-EPLGNLTNLKSLDLSNNQLTDL-PEELGNLTNLKELDL------------------ 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  684 rkkrivTGNPrcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcldtvvrcsnkglkvlpkgiprdVTEL 763
Cdd:COG4886    190 ------SNNQ-------ITDLP--------------------EPLGNLTN--------------------------LEEL 210
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462599775  764 YLDGNQFTLVPKELSNYKHLTLIILSYNRLRCIPprTFDGLKSLRLLSLHGNDISVVPEGAfnDLSALSHLAIGANPL 841
Cdd:COG4886    211 DLSGNQLTDLPEPLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
306-671 8.82e-18

Leucine-rich repeat (LRR) protein [Transcription];


:

Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 87.68  E-value: 8.82e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  306 ITEIRLEQNTikvippgAFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLFEglfslqllllna 385
Cdd:COG4886     98 LTELDLSGNE-------ELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDLPEPLGN------------ 157
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  386 nkinclrvdafqdLHNLNLLSLYDNKLQTIAKGtFSPLRaiqtmhlaqnpficdcHLKWLadYLHTNPIETsgarctspr 465
Cdd:COG4886    158 -------------LTNLKSLDLSNNQLTDLPEE-LGNLT----------------NLKEL--DLSNNQITD--------- 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  466 rlankrigqikskkfrcsakeqyfIPGTEDYRSKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPEHIPQYTA--ELR 543
Cdd:COG4886    197 ------------------------LPEPLGNLTNL------------------EELDLSGNQLTDLPEPLANLTNleTLD 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  544 LNNNEFTVLEAtgiFKKLPQLRKINFSNNKITDIEEGAfegasgvneilltsnrlenvqhkmfkGLESLKTLMLRSNRIT 623
Cdd:COG4886    235 LSNNQLTDLPE---LGNLTNLEELDLSNNQLTDLPPLA--------------------------NLTNLKTLDLSNNQLT 285
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 2462599775  624 CVGNDSFIGLSSVRLLSLYDNQITTVAPGAFDTLHSLSTLNLLANPFN 671
Cdd:COG4886    286 DLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGL 333
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
812-993 2.34e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.34e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  812 LHGNDISVVPEGAFNDLSALSHLAIGANPLYCDCNMQWLSDWVKSE---YKEPGIARCAGPGEMADKLLLTTPSKKFTCq 888
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKgvkVRQPEAALCAGPGALAGQPLLGIPLLDSGC- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  889 gpvDVNILAkcnpCLSNPCKNDGTCNSDPVDFYRCTcPYGFKGQDCDVpihACISNPCKHGGTchlkeGEEDgfWCICAD 968
Cdd:TIGR00864   81 ---DEEYVA----CLKDNSSGGGAARSELVIFSAAH-EGLFQPEACNA---FCFSAGHGLAAL-----GEQG--ECLCGA 142
                          170       180
                   ....*....|....*....|....*
gi 2462599775  969 GFEGENCEVNVDDCEDNDCENNSTC 993
Cdd:TIGR00864  143 AQPSEANFACESLCSGPPPPPAAAC 167
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1056-1092 2.74e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.02  E-value: 2.74e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 2462599775 1056 DFDDCQD-NKCKNGAHCTDAVNGYTCICPEGYSGLFCE 1092
Cdd:cd00054      1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
978-1014 4.75e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.63  E-value: 4.75e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 2462599775  978 NVDDCED-NDCENNSTCVDGINNYTCLCPPEYTGELCE 1014
Cdd:cd00054      1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
LRRNT smart00013
Leucine rich repeat N-terminal domain;
27-58 5.73e-06

Leucine rich repeat N-terminal domain;


:

Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 44.23  E-value: 5.73e-06
                            10        20        30
                    ....*....|....*....|....*....|..
gi 2462599775    27 ACPAQCSCSGSTVDCHGLALRSVPRNIPRNTE 58
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTT 32
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
182-260 9.61e-06

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 50.85  E-value: 9.61e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  182 LNNNNITRLSVASFNHMPKLRTFRLHSNNLYCDCHLAWLSDWLRQ------RPRVglyTQCMGPSHLRGHNVAEVQKREF 255
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEkgvkvrQPEA---ALCAGPGALAGQPLLGIPLLDS 78

                   ....*
gi 2462599775  256 VCSDE 260
Cdd:TIGR00864   79 GCDEE 83
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1018-1053 1.29e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.29e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 2462599775 1018 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYVGEHC 1053
Cdd:cd00054      3 DECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
LRRNT smart00013
Leucine rich repeat N-terminal domain;
276-308 3.01e-05

Leucine rich repeat N-terminal domain;


:

Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 42.30  E-value: 3.01e-05
                            10        20        30
                    ....*....|....*....|....*....|...
gi 2462599775   276 HCPAACTCSNNIVDCRGKGLTEIPTNLPETITE 308
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1110-1137 5.98e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.08  E-value: 5.98e-03
                           10        20
                   ....*....|....*....|....*...
gi 2462599775 1110 CQNGAQCIVRINEPICQCLPGYQGEKCE 1137
Cdd:cd00054     11 CQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
 
Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1163-1296 7.88e-37

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 135.54  E-value: 7.88e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  1163 NITLQIATDEDSGILLY---KGDKDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGG 1239
Cdd:smart00282    1 SISFSFRTTSPNGLLLYagsKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462599775  1240 NPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYINSE 1296
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLP-----EDLKLPPLPVTPGFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
1141-1294 8.54e-33

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 124.84  E-value: 8.54e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775 1141 SVNFiNKESYLQIP-SAKVRPQTNITLQIATDEDSGILLYKGDK---DHIAVELYRGRVRASYDTGSHPASaIYSVETIN 1216
Cdd:cd00110      1 GVSF-SGSSYVRLPtLPAPRTRLSISFSFRTTSPNGLLLYAGSQnggDFLALELEDGRLVLRYDLGSGSLV-LSSKTPLN 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462599775 1217 DGNFHIVELLALDQSLSLSVDGGNPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYIN 1294
Cdd:cd00110     79 DGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLP-----EDLKSPGLPVSPGFVGCIRDLKVN 151
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
1170-1296 8.50e-32

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 120.99  E-value: 8.50e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775 1170 TDEDSGILLYKGD--KDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGGNPKIITNL 1247
Cdd:pfam02210    3 TRQPNGLLLYAGGggSDFLALELVNGRLVLRYDLGSGPESLLSSGKNLNDGQWHSVRVERNGNTLTLSVDGQTVVSSLPP 82
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2462599775 1248 SKQSTLNFDSPLYVGGMPGKSNVASLRQAPGqngtsFHGCIRNLYINSE 1296
Cdd:pfam02210   83 GESLLLNLNGPLYLGGLPPLLLLPALPVRAG-----FVGCIRDVRVNGE 126
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
56-211 4.12e-23

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 103.86  E-value: 4.12e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   56 NTERLDLNGNNITRITKtDFAGLRHLRVLQLMENKISTIERgAFQDLKELERLRLNRNHLQLFPELLfLGTAKLYRLDLS 135
Cdd:COG4886    114 NLESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPEEL-GNLTNLKELDLS 190
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462599775  136 ENQIQAIPrKAFRGAVDIKNLQLDYNQISCIEDgAFRALRDLEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 211
Cdd:COG4886    191 NNQITDLP-EPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQL 262
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
524-841 3.09e-19

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 92.30  E-value: 3.09e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  524 SNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGafegasgvneilltsnrlenvqh 603
Cdd:COG4886     75 LLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNEELSNLTNLESLDLSGNQLTDLPEE----------------------- 131
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  604 kmFKGLESLKTLMLRSNRITCVGnDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLlanpfncncylawlgewl 683
Cdd:COG4886    132 --LANLTNLKELDLSNNQLTDLP-EPLGNLTNLKSLDLSNNQLTDL-PEELGNLTNLKELDL------------------ 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  684 rkkrivTGNPrcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcldtvvrcsnkglkvlpkgiprdVTEL 763
Cdd:COG4886    190 ------SNNQ-------ITDLP--------------------EPLGNLTN--------------------------LEEL 210
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462599775  764 YLDGNQFTLVPKELSNYKHLTLIILSYNRLRCIPprTFDGLKSLRLLSLHGNDISVVPEGAfnDLSALSHLAIGANPL 841
Cdd:COG4886    211 DLSGNQLTDLPEPLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
306-671 8.82e-18

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 87.68  E-value: 8.82e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  306 ITEIRLEQNTikvippgAFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLFEglfslqllllna 385
Cdd:COG4886     98 LTELDLSGNE-------ELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDLPEPLGN------------ 157
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  386 nkinclrvdafqdLHNLNLLSLYDNKLQTIAKGtFSPLRaiqtmhlaqnpficdcHLKWLadYLHTNPIETsgarctspr 465
Cdd:COG4886    158 -------------LTNLKSLDLSNNQLTDLPEE-LGNLT----------------NLKEL--DLSNNQITD--------- 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  466 rlankrigqikskkfrcsakeqyfIPGTEDYRSKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPEHIPQYTA--ELR 543
Cdd:COG4886    197 ------------------------LPEPLGNLTNL------------------EELDLSGNQLTDLPEPLANLTNleTLD 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  544 LNNNEFTVLEAtgiFKKLPQLRKINFSNNKITDIEEGAfegasgvneilltsnrlenvqhkmfkGLESLKTLMLRSNRIT 623
Cdd:COG4886    235 LSNNQLTDLPE---LGNLTNLEELDLSNNQLTDLPPLA--------------------------NLTNLKTLDLSNNQLT 285
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 2462599775  624 CVGNDSFIGLSSVRLLSLYDNQITTVAPGAFDTLHSLSTLNLLANPFN 671
Cdd:COG4886    286 DLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGL 333
LRR_8 pfam13855
Leucine rich repeat;
55-115 4.39e-14

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 67.93  E-value: 4.39e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462599775   55 RNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNHL 115
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
586-646 1.67e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 63.70  E-value: 1.67e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462599775  586 SGVNEILLTSNRLENVQHKMFKGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQI 646
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
307-364 2.60e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.93  E-value: 2.60e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462599775  307 TEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKI 364
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
812-993 2.34e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.34e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  812 LHGNDISVVPEGAFNDLSALSHLAIGANPLYCDCNMQWLSDWVKSE---YKEPGIARCAGPGEMADKLLLTTPSKKFTCq 888
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKgvkVRQPEAALCAGPGALAGQPLLGIPLLDSGC- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  889 gpvDVNILAkcnpCLSNPCKNDGTCNSDPVDFYRCTcPYGFKGQDCDVpihACISNPCKHGGTchlkeGEEDgfWCICAD 968
Cdd:TIGR00864   81 ---DEEYVA----CLKDNSSGGGAARSELVIFSAAH-EGLFQPEACNA---FCFSAGHGLAAL-----GEQG--ECLCGA 142
                          170       180
                   ....*....|....*....|....*
gi 2462599775  969 GFEGENCEVNVDDCEDNDCENNSTC 993
Cdd:TIGR00864  143 AQPSEANFACESLCSGPPPPPAAAC 167
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
641-717 1.07e-10

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 67.03  E-value: 1.07e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  641 LYDNQITTVAPGAFDTLHSLSTLNLLANPFNCNCYLAWLGEWLRKKRIVTGNPR---CQKPYFLKEIPIQDVAIQDFTCD 717
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEaalCAGPGALAGQPLLGIPLLDSGCD 81
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
60-211 1.14e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 62.88  E-value: 1.14e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   60 LDLNGNNITRITktDFAGLRHLRVLQLMENKISTIErgAFQDLKELERLRLNRNHLQlfpELLFLGT-AKLYRLDLSENQ 138
Cdd:cd21340      7 LYLNDKNITKID--NLSLCKNLKVLYLYDNKITKIE--NLEFLTNLTHLYLQNNQIE---KIENLENlVNLKKLYLGGNR 79
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462599775  139 IQAIprKAFRGAVDIKNLQLDYNQIS-----CIEDGAFRALRD-LEVLTLNNNNITrlSVASFNHMPKLRTFRLHSNNL 211
Cdd:cd21340     80 ISVV--EGLENLTNLEELHIENQRLPpgeklTFDPRSLAALSNsLRVLNISGNNID--SLEPLAPLRNLEQLDASNNQI 154
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
55-211 3.47e-08

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 58.32  E-value: 3.47e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   55 RNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNhlQLFPELL-FLGTAKLYRLD 133
Cdd:PLN00113   404 RSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNNLQGRINSRKWDMPSLQMLSLARN--KFFGGLPdSFGSKRLENLD 481
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462599775  134 LSENQIQ-AIPRKaFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 211
Cdd:PLN00113   482 LSRNQFSgAVPRK-LGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQL 559
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1056-1092 2.74e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.02  E-value: 2.74e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 2462599775 1056 DFDDCQD-NKCKNGAHCTDAVNGYTCICPEGYSGLFCE 1092
Cdd:cd00054      1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
978-1014 4.75e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.63  E-value: 4.75e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 2462599775  978 NVDDCED-NDCENNSTCVDGINNYTCLCPPEYTGELCE 1014
Cdd:cd00054      1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
740-834 6.53e-07

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 51.71  E-value: 6.53e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  740 TVVRCSNKGLKVLPK-GIPRDVTELYLDGNQFTLVPKeLSNYKHLTLIILSYNRLRCIPPrtFDGLKSLRLLSLHGNDIS 818
Cdd:cd21340      5 THLYLNDKNITKIDNlSLCKNLKVLYLYDNKITKIEN-LEFLTNLTHLYLQNNQIEKIEN--LENLVNLKKLYLGGNRIS 81
                           90
                   ....*....|....*.
gi 2462599775  819 VVpEGaFNDLSALSHL 834
Cdd:cd21340     82 VV-EG-LENLTNLEEL 95
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
291-371 2.68e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.01  E-value: 2.68e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  291 RGKGLTEIPTNLPETITEIRLEQNTIKVIP---PGAfspykkLRRIDLSNNQISELAPDAFQGLRSLNslvLYGNKITEL 367
Cdd:PRK15370   228 NSNQLTSIPATLPDTIQEMELSINRITELPerlPSA------LQSLDLFHNKISCLPENLPEELRYLS---VYDNSIRTL 298

                   ....
gi 2462599775  368 PKSL 371
Cdd:PRK15370   299 PAHL 302
LRRCT smart00082
Leucine rich repeat C-terminal domain;
839-888 4.52e-06

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 45.11  E-value: 4.52e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 2462599775   839 NPLYCDCNMQWLSDWVKSE--YKEPGIARCAGPGEMADKLLLTTPSkKFTCQ 888
Cdd:smart00082    1 NPFICDCELRWLLRWLQANehLQDPVDLRCASPSSLRGPLLELLHS-EFKCP 51
LRRNT smart00013
Leucine rich repeat N-terminal domain;
27-58 5.73e-06

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 44.23  E-value: 5.73e-06
                            10        20        30
                    ....*....|....*....|....*....|..
gi 2462599775    27 ACPAQCSCSGSTVDCHGLALRSVPRNIPRNTE 58
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTT 32
EGF_CA smart00179
Calcium-binding EGF-like domain;
1056-1092 8.37e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 44.16  E-value: 8.37e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 2462599775  1056 DFDDCQ-DNKCKNGAHCTDAVNGYTCICPEGYS-GLFCE 1092
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
182-260 9.61e-06

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 50.85  E-value: 9.61e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  182 LNNNNITRLSVASFNHMPKLRTFRLHSNNLYCDCHLAWLSDWLRQ------RPRVglyTQCMGPSHLRGHNVAEVQKREF 255
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEkgvkvrQPEA---ALCAGPGALAGQPLLGIPLLDS 78

                   ....*
gi 2462599775  256 VCSDE 260
Cdd:TIGR00864   79 GCDEE 83
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1060-1088 1.05e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 43.53  E-value: 1.05e-05
                           10        20
                   ....*....|....*....|....*....
gi 2462599775 1060 CQDNKCKNGAHCTDAVNGYTCICPEGYSG 1088
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTG 29
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1018-1053 1.29e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.29e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 2462599775 1018 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYVGEHC 1053
Cdd:cd00054      3 DECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
LRRNT smart00013
Leucine rich repeat N-terminal domain;
731-762 1.33e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.46  E-value: 1.33e-05
                            10        20        30
                    ....*....|....*....|....*....|..
gi 2462599775   731 CPTECTCLDTVVRCSNKGLKVLPKGIPRDVTE 762
Cdd:smart00013    2 CPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF_CA smart00179
Calcium-binding EGF-like domain;
978-1014 1.75e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 43.00  E-value: 1.75e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 2462599775   978 NVDDCE-DNDCENNSTCVDGINNYTCLCPPEYT-GELCE 1014
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
LRRCT smart00082
Leucine rich repeat C-terminal domain;
209-258 2.52e-05

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 43.19  E-value: 2.52e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 2462599775   209 NNLYCDCHLAWLSDWLRQRPRV--GLYTQCMGPSHLRGhNVAEVQKREFVCS 258
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHLqdPVDLRCASPSSLRG-PLLELLHSEFKCP 51
LRRNT smart00013
Leucine rich repeat N-terminal domain;
276-308 3.01e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 42.30  E-value: 3.01e-05
                            10        20        30
                    ....*....|....*....|....*....|...
gi 2462599775   276 HCPAACTCSNNIVDCRGKGLTEIPTNLPETITE 308
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
749-841 1.08e-04

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 47.00  E-value: 1.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  749 LKVLPKGIPRDVTELYLDGNQFTLVPKELSNykHLTLIILSYNRLRCIPPRTFDGlksLRLLSLHGNDISVVPEgafNDL 828
Cdd:PRK15370   232 LTSIPATLPDTIQEMELSINRITELPERLPS--ALQSLDLFHNKISCLPENLPEE---LRYLSVYDNSIRTLPA---HLP 303
                           90
                   ....*....|...
gi 2462599775  829 SALSHLAIGANPL 841
Cdd:PRK15370   304 SGITHLNVQSNSL 316
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
982-1011 1.21e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 40.44  E-value: 1.21e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 2462599775  982 CEDNDCENNSTCVDGINNYTCLCPPEYTGE 1011
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
27-54 1.74e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.92  E-value: 1.74e-04
                           10        20
                   ....*....|....*....|....*...
gi 2462599775   27 ACPAQCSCSGSTVDCHGLALRSVPRNIP 54
Cdd:pfam01462    1 ACPVPCHCSATVVNCSDRGLTAVPRDLP 28
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
407-470 2.25e-04

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 46.23  E-value: 2.25e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462599775  407 LYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDCHLKWLADYLHTNPIET---SGARCTSPRRLANK 470
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVrqpEAALCAGPGALAGQ 68
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
277-303 2.78e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.53  E-value: 2.78e-04
                           10        20
                   ....*....|....*....|....*..
gi 2462599775  277 CPAACTCSNNIVDCRGKGLTEIPTNLP 303
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1023-1052 2.90e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 39.29  E-value: 2.90e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 2462599775 1023 DLNPCQHDSKCILTPKGFKCDCTPGYVGEH 1052
Cdd:pfam00008    2 APNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
55-371 3.01e-04

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 45.61  E-value: 3.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   55 RNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENKISTierGAFQDL---KELERLRLNRNHLQ-LFPELLfLGTAKLY 130
Cdd:PLN00113   308 QNLEILHLFSNNFTGKIPVALTSLPRLQVLQLWSNKFSG---EIPKNLgkhNNLTVLDLSTNNLTgEIPEGL-CSSGNLF 383
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  131 RLDLSENQIQAIPRKAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNIT-RLSVASFNhMPKLRTFRLHSN 209
Cdd:PLN00113   384 KLILFSNSLEGEIPKSLGACRSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNNLQgRINSRKWD-MPSLQMLSLARN 462
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  210 NLYcdchlawlsdwlrqrprvGLYTQCMGPSHLRGHNVAEVQKREFVcsdeeegHQSFMapscsvlhcpaactcsnnivd 289
Cdd:PLN00113   463 KFF------------------GGLPDSFGSKRLENLDLSRNQFSGAV-------PRKLG--------------------- 496
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  290 crgkglteiptNLPEtITEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKIT-ELP 368
Cdd:PLN00113   497 -----------SLSE-LMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSgEIP 564

                   ...
gi 2462599775  369 KSL 371
Cdd:PLN00113   565 KNL 567
LRRCT smart00082
Leucine rich repeat C-terminal domain;
434-464 1.06e-03

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 38.57  E-value: 1.06e-03
                            10        20        30
                    ....*....|....*....|....*....|...
gi 2462599775   434 NPFICDCHLKWLADYLHTNPI--ETSGARCTSP 464
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHlqDPVDLRCASP 33
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
944-976 1.60e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.62  E-value: 1.60e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 2462599775  944 NPCKHGGTCHLKEGeedGFWCICADGFEGENCE 976
Cdd:cd00054      9 NPCQNGGTCVNTVG---SYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
941-974 2.98e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.59  E-value: 2.98e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 2462599775  941 CISNPCKHGGTCHLKEGeedGFWCICADGFEGEN 974
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPG---GYTCICPEGYTGKR 31
EGF_CA smart00179
Calcium-binding EGF-like domain;
1018-1054 3.82e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 36.46  E-value: 3.82e-03
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 2462599775  1018 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYV-GEHCD 1054
Cdd:smart00179    3 DECASG-NPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1110-1137 5.98e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.08  E-value: 5.98e-03
                           10        20
                   ....*....|....*....|....*...
gi 2462599775 1110 CQNGAQCIVRINEPICQCLPGYQGEKCE 1137
Cdd:cd00054     11 CQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1105-1134 6.73e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 35.44  E-value: 6.73e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2462599775 1105 CDNFDCQNGAQCIVRINEPICQCLPGYQGE 1134
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
 
Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1163-1296 7.88e-37

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 135.54  E-value: 7.88e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  1163 NITLQIATDEDSGILLY---KGDKDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGG 1239
Cdd:smart00282    1 SISFSFRTTSPNGLLLYagsKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462599775  1240 NPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYINSE 1296
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLP-----EDLKLPPLPVTPGFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
1141-1294 8.54e-33

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 124.84  E-value: 8.54e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775 1141 SVNFiNKESYLQIP-SAKVRPQTNITLQIATDEDSGILLYKGDK---DHIAVELYRGRVRASYDTGSHPASaIYSVETIN 1216
Cdd:cd00110      1 GVSF-SGSSYVRLPtLPAPRTRLSISFSFRTTSPNGLLLYAGSQnggDFLALELEDGRLVLRYDLGSGSLV-LSSKTPLN 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462599775 1217 DGNFHIVELLALDQSLSLSVDGGNPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYIN 1294
Cdd:cd00110     79 DGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLP-----EDLKSPGLPVSPGFVGCIRDLKVN 151
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
1170-1296 8.50e-32

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 120.99  E-value: 8.50e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775 1170 TDEDSGILLYKGD--KDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGGNPKIITNL 1247
Cdd:pfam02210    3 TRQPNGLLLYAGGggSDFLALELVNGRLVLRYDLGSGPESLLSSGKNLNDGQWHSVRVERNGNTLTLSVDGQTVVSSLPP 82
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2462599775 1248 SKQSTLNFDSPLYVGGMPGKSNVASLRQAPGqngtsFHGCIRNLYINSE 1296
Cdd:pfam02210   83 GESLLLNLNGPLYLGGLPPLLLLPALPVRAG-----FVGCIRDVRVNGE 126
Laminin_G_1 pfam00054
Laminin G domain;
1168-1299 1.88e-28

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 111.64  E-value: 1.88e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775 1168 IATDEDSGILLYKGDKDH---IAVELYRGRVRASYDTGSHPASaIYSVETINDGNFHIVELLALDQSLSLSVDGG-NPKI 1243
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTErdfLALELRDGRLEVSYDLGSGAAV-VRSGDKLNDGKWHSVELERNGRSGTLSVDGEaRPTG 79
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462599775 1244 ITNLSKQSTLNFDSPLYVGGMPgkSNVASLRQAPgqNGTSFHGCIRNLYINSELQD 1299
Cdd:pfam00054   80 ESPLGATTDLDVDGPLYVGGLP--SLGVKKRRLA--ISPSFDGCIRDVIVNGKPLD 131
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
56-211 4.12e-23

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 103.86  E-value: 4.12e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   56 NTERLDLNGNNITRITKtDFAGLRHLRVLQLMENKISTIERgAFQDLKELERLRLNRNHLQLFPELLfLGTAKLYRLDLS 135
Cdd:COG4886    114 NLESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPEEL-GNLTNLKELDLS 190
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462599775  136 ENQIQAIPrKAFRGAVDIKNLQLDYNQISCIEDgAFRALRDLEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 211
Cdd:COG4886    191 NNQITDLP-EPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQL 262
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
52-452 4.34e-21

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 97.70  E-value: 4.34e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   52 NIPRNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENKistiergAFQDLKELERLRLNRNHLQLFPELLFLGTaKLYR 131
Cdd:COG4886     69 LSLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNE-------ELSNLTNLESLDLSGNQLTDLPEELANLT-NLKE 140
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  132 LDLSENQIQAIPrKAFRGAVDIKNLQLDYNQISCIeDGAFRALRDLEVLTLNNNNITRLSvASFNHMPKLRTFRLHSNNl 211
Cdd:COG4886    141 LDLSNNQLTDLP-EPLGNLTNLKSLDLSNNQLTDL-PEELGNLTNLKELDLSNNQITDLP-EPLGNLTNLEELDLSGNQ- 216
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  212 ycdchlawlsdwlrqrprvglytqcmgpshlrghnvaevqkrefvcsdeeeghqsfmapscsvlhcpaactcsnnivdcr 291
Cdd:COG4886        --------------------------------------------------------------------------------
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  292 gkgLTEIPTNLPE--TITEIRLEQNTIKVIPpgAFSPYKKLRRIDLSNNQISELAPDAfqGLRSLNSLVLYGNKITELP- 368
Cdd:COG4886    217 ---LTDLPEPLANltNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQLTDLKl 289
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  369 KSLFEGLFSLQLLLLNANKINCLRVDAFQDLHNLNLLSLYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDCHLKWLADY 448
Cdd:COG4886    290 KELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTL 369

                   ....
gi 2462599775  449 LHTN 452
Cdd:COG4886    370 GLLG 373
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
524-841 3.09e-19

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 92.30  E-value: 3.09e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  524 SNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGafegasgvneilltsnrlenvqh 603
Cdd:COG4886     75 LLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNEELSNLTNLESLDLSGNQLTDLPEE----------------------- 131
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  604 kmFKGLESLKTLMLRSNRITCVGnDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLlanpfncncylawlgewl 683
Cdd:COG4886    132 --LANLTNLKELDLSNNQLTDLP-EPLGNLTNLKSLDLSNNQLTDL-PEELGNLTNLKELDL------------------ 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  684 rkkrivTGNPrcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcldtvvrcsnkglkvlpkgiprdVTEL 763
Cdd:COG4886    190 ------SNNQ-------ITDLP--------------------EPLGNLTN--------------------------LEEL 210
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462599775  764 YLDGNQFTLVPKELSNYKHLTLIILSYNRLRCIPprTFDGLKSLRLLSLHGNDISVVPEGAfnDLSALSHLAIGANPL 841
Cdd:COG4886    211 DLSGNQLTDLPEPLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
38-195 3.33e-19

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 91.92  E-value: 3.33e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   38 TVDCHGLALRSVPRNIPRNT--ERLDLNGNNITRITKtDFAGLRHLRVLQLMENKISTIErGAFQDLKELERLRLNRNHL 115
Cdd:COG4886    140 ELDLSNNQLTDLPEPLGNLTnlKSLDLSNNQLTDLPE-ELGNLTNLKELDLSNNQITDLP-EPLGNLTNLEELDLSGNQL 217
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  116 QLFPELLFlGTAKLYRLDLSENQIQAIPrkAFRGAVDIKNLQLDYNQISCIEDGAfrALRDLEVLTLNNNNITRLSVASF 195
Cdd:COG4886    218 TDLPEPLA-NLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQLTDLKLKEL 292
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
306-671 8.82e-18

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 87.68  E-value: 8.82e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  306 ITEIRLEQNTikvippgAFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLFEglfslqllllna 385
Cdd:COG4886     98 LTELDLSGNE-------ELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDLPEPLGN------------ 157
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  386 nkinclrvdafqdLHNLNLLSLYDNKLQTIAKGtFSPLRaiqtmhlaqnpficdcHLKWLadYLHTNPIETsgarctspr 465
Cdd:COG4886    158 -------------LTNLKSLDLSNNQLTDLPEE-LGNLT----------------NLKEL--DLSNNQITD--------- 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  466 rlankrigqikskkfrcsakeqyfIPGTEDYRSKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPEHIPQYTA--ELR 543
Cdd:COG4886    197 ------------------------LPEPLGNLTNL------------------EELDLSGNQLTDLPEPLANLTNleTLD 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  544 LNNNEFTVLEAtgiFKKLPQLRKINFSNNKITDIEEGAfegasgvneilltsnrlenvqhkmfkGLESLKTLMLRSNRIT 623
Cdd:COG4886    235 LSNNQLTDLPE---LGNLTNLEELDLSNNQLTDLPPLA--------------------------NLTNLKTLDLSNNQLT 285
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 2462599775  624 CVGNDSFIGLSSVRLLSLYDNQITTVAPGAFDTLHSLSTLNLLANPFN 671
Cdd:COG4886    286 DLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGL 333
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
287-456 1.44e-16

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 83.83  E-value: 1.44e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  287 IVDCRGKGLTEIPTNLPE--TITEIRLEQNTIKVIPPgAFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKI 364
Cdd:COG4886    140 ELDLSNNQLTDLPEPLGNltNLKSLDLSNNQLTDLPE-ELGNLTNLKELDLSNNQITDL-PEPLGNLTNLEELDLSGNQL 217
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  365 TELPKSLfeglfslqllllnankinclrvdafQDLHNLNLLSLYDNKLQTIAKgtFSPLRAIQTMHLAQN-----PFICD 439
Cdd:COG4886    218 TDLPEPL-------------------------ANLTNLETLDLSNNQLTDLPE--LGNLTNLEELDLSNNqltdlPPLAN 270
                          170
                   ....*....|....*...
gi 2462599775  440 CH-LKWLadYLHTNPIET 456
Cdd:COG4886    271 LTnLKTL--DLSNNQLTD 286
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
38-192 1.04e-15

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 81.13  E-value: 1.04e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   38 TVDCHGLALRSVPRNIPR--NTERLDLNGNNITRITKTdFAGLRHLRVLQLMENKISTIERgAFQDLKELERLRLNRNHL 115
Cdd:COG4886    163 SLDLSNNQLTDLPEELGNltNLKELDLSNNQITDLPEP-LGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQL 240
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462599775  116 QLFPELLFLgtAKLYRLDLSENQIQAIPrkAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSV 192
Cdd:COG4886    241 TDLPELGNL--TNLEELDLSNNQLTDLP--PLANLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNLLEL 313
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
519-841 1.52e-15

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 80.75  E-value: 1.52e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  519 TTVDCSNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL 598
Cdd:COG4886      3 LLLLSLTLKLLLLLLLELLTTLILLLLLLLLLLALLLLSLLSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLL 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  599 ENVQ--HKMFKGLESLKTLMLRsnritcvGNDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLLANPfncncyl 676
Cdd:COG4886     83 SLLLlgLTDLGDLTNLTELDLS-------GNEELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQ------- 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  677 awlgewlrkkrivtgnprcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcLdTVVRCSNKGLKVLPKGI 756
Cdd:COG4886    148 ------------------------LTDLP--------------------EPLGNLTN----L-KSLDLSNNQLTDLPEEL 178
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  757 PR--DVTELYLDGNQFTLVPKELSNYKHLTLIILSYNRLRCIPPrTFDGLKSLRLLSLHGNDISVVPEgaFNDLSALSHL 834
Cdd:COG4886    179 GNltNLKELDLSNNQITDLPEPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLPE--LGNLTNLEEL 255

                   ....*..
gi 2462599775  835 AIGANPL 841
Cdd:COG4886    256 DLSNNQL 262
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
306-684 4.64e-15

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 79.21  E-value: 4.64e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  306 ITEIRLEQNTIKVIPPgAFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLfeglfslqllllna 385
Cdd:COG4886    115 LESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDL-PEPLGNLTNLKSLDLSNNQLTDLPEEL-------------- 178
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  386 nkinclrvdafQDLHNLNLLSLYDNKLQTIAKgTFSPLRAIQTMHLAQNPFicdchlkwladylhtNPIETSGARCTspr 465
Cdd:COG4886    179 -----------GNLTNLKELDLSNNQITDLPE-PLGNLTNLEELDLSGNQL---------------TDLPEPLANLT--- 228
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  466 rlankrigqikskkfrcsakeqyfipgtedyrsKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPE--HIPQYTaELR 543
Cdd:COG4886    229 ---------------------------------NL------------------ETLDLSNNQLTDLPElgNLTNLE-ELD 256
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  544 LNNNEFTVLEATGifkKLPQLRKINFSNNKITDIEEGAFEGASGVNeiLLTSNRLENVQHKMFKGLESLKTLMLRSNRIT 623
Cdd:COG4886    257 LSNNQLTDLPPLA---NLTNLKTLDLSNNQLTDLKLKELELLLGLN--SLLLLLLLLNLLELLILLLLLTTLLLLLLLLK 331
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462599775  624 CVGNDSFIGLSSVRLLSLYDNQITTVAPGAFDTLHSLSTLNLLANPFNCNCYLAWLGEWLR 684
Cdd:COG4886    332 GLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGLLGLLEATLLTLALLLLTLLLL 392
LRR_8 pfam13855
Leucine rich repeat;
55-115 4.39e-14

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 67.93  E-value: 4.39e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462599775   55 RNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNHL 115
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
586-646 1.67e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 63.70  E-value: 1.67e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462599775  586 SGVNEILLTSNRLENVQHKMFKGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQI 646
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
611-670 2.00e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 63.31  E-value: 2.00e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  611 SLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQITTVAPGAFDTLHSLSTLNLLANPF 670
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
307-364 2.60e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.93  E-value: 2.60e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462599775  307 TEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKI 364
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
154-211 8.56e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 61.77  E-value: 8.56e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462599775  154 KNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 211
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
812-993 2.34e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.34e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  812 LHGNDISVVPEGAFNDLSALSHLAIGANPLYCDCNMQWLSDWVKSE---YKEPGIARCAGPGEMADKLLLTTPSKKFTCq 888
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKgvkVRQPEAALCAGPGALAGQPLLGIPLLDSGC- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  889 gpvDVNILAkcnpCLSNPCKNDGTCNSDPVDFYRCTcPYGFKGQDCDVpihACISNPCKHGGTchlkeGEEDgfWCICAD 968
Cdd:TIGR00864   81 ---DEEYVA----CLKDNSSGGGAARSELVIFSAAH-EGLFQPEACNA---FCFSAGHGLAAL-----GEQG--ECLCGA 142
                          170       180
                   ....*....|....*....|....*
gi 2462599775  969 GFEGENCEVNVDDCEDNDCENNSTC 993
Cdd:TIGR00864  143 AQPSEANFACESLCSGPPPPPAAAC 167
LRR_8 pfam13855
Leucine rich repeat;
760-817 5.02e-11

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 59.46  E-value: 5.02e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2462599775  760 VTELYLDGNQFTLVPKE-LSNYKHLTLIILSYNRLRCIPPRTFDGLKSLRLLSLHGNDI 817
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGaFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
128-187 9.38e-11

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 58.69  E-value: 9.38e-11
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  128 KLYRLDLSENQIQAIPRKAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNI 187
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
641-717 1.07e-10

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 67.03  E-value: 1.07e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  641 LYDNQITTVAPGAFDTLHSLSTLNLLANPFNCNCYLAWLGEWLRKKRIVTGNPR---CQKPYFLKEIPIQDVAIQDFTCD 717
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEaalCAGPGALAGQPLLGIPLLDSGCD 81
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
60-211 1.14e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 62.88  E-value: 1.14e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   60 LDLNGNNITRITktDFAGLRHLRVLQLMENKISTIErgAFQDLKELERLRLNRNHLQlfpELLFLGT-AKLYRLDLSENQ 138
Cdd:cd21340      7 LYLNDKNITKID--NLSLCKNLKVLYLYDNKITKIE--NLEFLTNLTHLYLQNNQIE---KIENLENlVNLKKLYLGGNR 79
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462599775  139 IQAIprKAFRGAVDIKNLQLDYNQIS-----CIEDGAFRALRD-LEVLTLNNNNITrlSVASFNHMPKLRTFRLHSNNL 211
Cdd:cd21340     80 ISVV--EGLENLTNLEELHIENQRLPpgeklTFDPRSLAALSNsLRVLNISGNNID--SLEPLAPLRNLEQLDASNNQI 154
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
55-209 1.89e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 62.50  E-value: 1.89e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   55 RNTERLDLNGNNITRItkTDFAGLRHLRVLQLMENKISTIErgAFQDLKELERLRLNRNHLQLFPELLF-----LGTAK- 128
Cdd:cd21340     46 TNLTHLYLQNNQIEKI--ENLENLVNLKKLYLGGNRISVVE--GLENLTNLEELHIENQRLPPGEKLTFdprslAALSNs 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  129 LYRLDLSENQIQaiprkafrgavDIKNLQldynqisciedgafrALRDLEVLTLNNNNITRLSVAS--FNHMPKLRTFRL 206
Cdd:cd21340    122 LRVLNISGNNID-----------SLEPLA---------------PLRNLEQLDASNNQISDLEELLdlLSSWPSLRELDL 175

                   ...
gi 2462599775  207 HSN 209
Cdd:cd21340    176 TGN 178
LRR_8 pfam13855
Leucine rich repeat;
782-841 1.93e-10

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 57.92  E-value: 1.93e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  782 HLTLIILSYNRLRCIPPRTFDGLKSLRLLSLHGNDISVVPEGAFNDLSALSHLAIGANPL 841
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
329-412 4.05e-09

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 54.07  E-value: 4.05e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  329 KLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKITELPKslfeglfslqllllnankinclrvDAFQDLHNLNLLSLY 408
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSP------------------------GAFSGLPSLRYLDLS 57

                   ....
gi 2462599775  409 DNKL 412
Cdd:pfam13855   58 GNRL 61
LRR_8 pfam13855
Leucine rich repeat;
541-598 4.09e-09

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 54.07  E-value: 4.09e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462599775  541 ELRLNNNEFTVLEAtGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL 598
Cdd:pfam13855    5 SLDLSNNRLTSLDD-GAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
79-139 4.79e-09

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 53.68  E-value: 4.79e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462599775   79 RHLRVLQLMENKISTIERGAFQDLKELERLRLNRNHLQLFPELLFLGTAKLYRLDLSENQI 139
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
55-211 3.47e-08

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 58.32  E-value: 3.47e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   55 RNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNhlQLFPELL-FLGTAKLYRLD 133
Cdd:PLN00113   404 RSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNNLQGRINSRKWDMPSLQMLSLARN--KFFGGLPdSFGSKRLENLD 481
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462599775  134 LSENQIQ-AIPRKaFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 211
Cdd:PLN00113   482 LSRNQFSgAVPRK-LGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQL 559
LRR_8 pfam13855
Leucine rich repeat;
384-436 4.64e-08

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 50.99  E-value: 4.64e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2462599775  384 NANKINCLRVDAFQDLHNLNLLSLYDNKLQTIAKGTFSPLRAIQTMHLAQNPF 436
Cdd:pfam13855    9 SNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1056-1092 2.74e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.02  E-value: 2.74e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 2462599775 1056 DFDDCQD-NKCKNGAHCTDAVNGYTCICPEGYSGLFCE 1092
Cdd:cd00054      1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
978-1014 4.75e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.63  E-value: 4.75e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 2462599775  978 NVDDCED-NDCENNSTCVDGINNYTCLCPPEYTGELCE 1014
Cdd:cd00054      1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
740-834 6.53e-07

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 51.71  E-value: 6.53e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  740 TVVRCSNKGLKVLPK-GIPRDVTELYLDGNQFTLVPKeLSNYKHLTLIILSYNRLRCIPPrtFDGLKSLRLLSLHGNDIS 818
Cdd:cd21340      5 THLYLNDKNITKIDNlSLCKNLKVLYLYDNKITKIEN-LEFLTNLTHLYLQNNQIEKIEN--LENLVNLKKLYLGGNRIS 81
                           90
                   ....*....|....*.
gi 2462599775  819 VVpEGaFNDLSALSHL 834
Cdd:cd21340     82 VV-EG-LENLTNLEEL 95
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
291-371 2.68e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.01  E-value: 2.68e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  291 RGKGLTEIPTNLPETITEIRLEQNTIKVIP---PGAfspykkLRRIDLSNNQISELAPDAFQGLRSLNslvLYGNKITEL 367
Cdd:PRK15370   228 NSNQLTSIPATLPDTIQEMELSINRITELPerlPSA------LQSLDLFHNKISCLPENLPEELRYLS---VYDNSIRTL 298

                   ....
gi 2462599775  368 PKSL 371
Cdd:PRK15370   299 PAHL 302
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
294-415 3.32e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.01  E-value: 3.32e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  294 GLTEIPTNLPETITEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRslnslvLYGNKITELPKSLfe 373
Cdd:PRK15370   189 GLTTIPACIPEQITTLILDNNELKSLPENLQGNIKTLYANSNQLTSIPATLPDTIQEME------LSINRITELPERL-- 260
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 2462599775  374 gLFSLQLLLLNANKINCLRvDAFQDlhNLNLLSLYDNKLQTI 415
Cdd:PRK15370   261 -PSALQSLDLFHNKISCLP-ENLPE--ELRYLSVYDNSIRTL 298
LRRCT smart00082
Leucine rich repeat C-terminal domain;
839-888 4.52e-06

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 45.11  E-value: 4.52e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 2462599775   839 NPLYCDCNMQWLSDWVKSE--YKEPGIARCAGPGEMADKLLLTTPSkKFTCQ 888
Cdd:smart00082    1 NPFICDCELRWLLRWLQANehLQDPVDLRCASPSSLRGPLLELLHS-EFKCP 51
LRRNT smart00013
Leucine rich repeat N-terminal domain;
27-58 5.73e-06

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 44.23  E-value: 5.73e-06
                            10        20        30
                    ....*....|....*....|....*....|..
gi 2462599775    27 ACPAQCSCSGSTVDCHGLALRSVPRNIPRNTE 58
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTT 32
EGF_CA smart00179
Calcium-binding EGF-like domain;
1056-1092 8.37e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 44.16  E-value: 8.37e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 2462599775  1056 DFDDCQ-DNKCKNGAHCTDAVNGYTCICPEGYS-GLFCE 1092
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
182-260 9.61e-06

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 50.85  E-value: 9.61e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  182 LNNNNITRLSVASFNHMPKLRTFRLHSNNLYCDCHLAWLSDWLRQ------RPRVglyTQCMGPSHLRGHNVAEVQKREF 255
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEkgvkvrQPEA---ALCAGPGALAGQPLLGIPLLDS 78

                   ....*
gi 2462599775  256 VCSDE 260
Cdd:TIGR00864   79 GCDEE 83
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1060-1088 1.05e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 43.53  E-value: 1.05e-05
                           10        20
                   ....*....|....*....|....*....
gi 2462599775 1060 CQDNKCKNGAHCTDAVNGYTCICPEGYSG 1088
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTG 29
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1018-1053 1.29e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.29e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 2462599775 1018 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYVGEHC 1053
Cdd:cd00054      3 DECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
LRRNT smart00013
Leucine rich repeat N-terminal domain;
731-762 1.33e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.46  E-value: 1.33e-05
                            10        20        30
                    ....*....|....*....|....*....|..
gi 2462599775   731 CPTECTCLDTVVRCSNKGLKVLPKGIPRDVTE 762
Cdd:smart00013    2 CPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF_CA smart00179
Calcium-binding EGF-like domain;
978-1014 1.75e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 43.00  E-value: 1.75e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 2462599775   978 NVDDCE-DNDCENNSTCVDGINNYTCLCPPEYT-GELCE 1014
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
LRRCT smart00082
Leucine rich repeat C-terminal domain;
209-258 2.52e-05

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 43.19  E-value: 2.52e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 2462599775   209 NNLYCDCHLAWLSDWLRQRPRV--GLYTQCMGPSHLRGhNVAEVQKREFVCS 258
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHLqdPVDLRCASPSSLRG-PLLELLHSEFKCP 51
LRRNT smart00013
Leucine rich repeat N-terminal domain;
276-308 3.01e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 42.30  E-value: 3.01e-05
                            10        20        30
                    ....*....|....*....|....*....|...
gi 2462599775   276 HCPAACTCSNNIVDCRGKGLTEIPTNLPETITE 308
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
43-214 4.29e-05

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 47.35  E-value: 4.29e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   43 GLALRSVPRNIPRNT--ERLDLNGNNITRITKTDFAGLRH---LRVLQLMENKIS-TIER---GAFQDLKE-LERLRLNR 112
Cdd:cd00116     67 PRGLQSLLQGLTKGCglQELDLSDNALGPDGCGVLESLLRsssLQELKLNNNGLGdRGLRllaKGLKDLPPaLEKLVLGR 146
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  113 NHL------QLFPELLFLGtaKLYRLDLSENQI--QAIPR--KAFRGAVDIKNLQLDYNQISCIED----GAFRALRDLE 178
Cdd:cd00116    147 NRLegasceALAKALRANR--DLKELNLANNGIgdAGIRAlaEGLKANCNLEVLDLNNNGLTDEGAsalaETLASLKSLE 224
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 2462599775  179 VLTLNNNNIT-----RLSVASFNHMPKLRTFRLHSNNLYCD 214
Cdd:cd00116    225 VLNLGDNNLTdagaaALASALLSPNISLLTLSLSCNDITDD 265
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
524-669 7.90e-05

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 47.09  E-value: 7.90e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  524 SNQKLNKIPEHIPQYTA--ELRLNNNEFTVLEATGIFKKL---PQLRKINFSNNKITDieegafEGASGVNEILLTSNRL 598
Cdd:COG5238    193 GDEGIEELAEALTQNTTvtTLWLKRNPIGDEGAEILAEALkgnKSLTTLDLSNNQIGD------EGVIALAEALKNNTTV 266
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  599 E-------NVQH-------KMFKGLESLKTLMLRSNRItcvGNDSFIGL-------SSVRLLSLYDNQITTV-APGAFDT 656
Cdd:COG5238    267 EtlylsgnQIGAegaialaKALQGNTTLTSLDLSVNRI---GDEGAIALaeglqgnKTLHTLNLAYNGIGAQgAIALAKA 343
                          170
                   ....*....|....*.
gi 2462599775  657 LH---SLSTLNLLANP 669
Cdd:COG5238    344 LQentTLHSLDLSDNQ 359
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
749-841 1.08e-04

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 47.00  E-value: 1.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  749 LKVLPKGIPRDVTELYLDGNQFTLVPKELSNykHLTLIILSYNRLRCIPPRTFDGlksLRLLSLHGNDISVVPEgafNDL 828
Cdd:PRK15370   232 LTSIPATLPDTIQEMELSINRITELPERLPS--ALQSLDLFHNKISCLPENLPEE---LRYLSVYDNSIRTLPA---HLP 303
                           90
                   ....*....|...
gi 2462599775  829 SALSHLAIGANPL 841
Cdd:PRK15370   304 SGITHLNVQSNSL 316
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
982-1011 1.21e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 40.44  E-value: 1.21e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 2462599775  982 CEDNDCENNSTCVDGINNYTCLCPPEYTGE 1011
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
LRR_8 pfam13855
Leucine rich repeat;
177-211 1.38e-04

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 41.36  E-value: 1.38e-04
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 2462599775  177 LEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 211
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLL 37
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
27-54 1.74e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.92  E-value: 1.74e-04
                           10        20
                   ....*....|....*....|....*...
gi 2462599775   27 ACPAQCSCSGSTVDCHGLALRSVPRNIP 54
Cdd:pfam01462    1 ACPVPCHCSATVVNCSDRGLTAVPRDLP 28
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
731-757 1.90e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.92  E-value: 1.90e-04
                           10        20
                   ....*....|....*....|....*..
gi 2462599775  731 CPTECTCLDTVVRCSNKGLKVLPKGIP 757
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
407-470 2.25e-04

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 46.23  E-value: 2.25e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462599775  407 LYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDCHLKWLADYLHTNPIET---SGARCTSPRRLANK 470
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVrqpEAALCAGPGALAGQ 68
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
277-303 2.78e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.53  E-value: 2.78e-04
                           10        20
                   ....*....|....*....|....*..
gi 2462599775  277 CPAACTCSNNIVDCRGKGLTEIPTNLP 303
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1023-1052 2.90e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 39.29  E-value: 2.90e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 2462599775 1023 DLNPCQHDSKCILTPKGFKCDCTPGYVGEH 1052
Cdd:pfam00008    2 APNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
55-371 3.01e-04

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 45.61  E-value: 3.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   55 RNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENKISTierGAFQDL---KELERLRLNRNHLQ-LFPELLfLGTAKLY 130
Cdd:PLN00113   308 QNLEILHLFSNNFTGKIPVALTSLPRLQVLQLWSNKFSG---EIPKNLgkhNNLTVLDLSTNNLTgEIPEGL-CSSGNLF 383
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  131 RLDLSENQIQAIPRKAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNIT-RLSVASFNhMPKLRTFRLHSN 209
Cdd:PLN00113   384 KLILFSNSLEGEIPKSLGACRSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNNLQgRINSRKWD-MPSLQMLSLARN 462
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  210 NLYcdchlawlsdwlrqrprvGLYTQCMGPSHLRGHNVAEVQKREFVcsdeeegHQSFMapscsvlhcpaactcsnnivd 289
Cdd:PLN00113   463 KFF------------------GGLPDSFGSKRLENLDLSRNQFSGAV-------PRKLG--------------------- 496
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  290 crgkglteiptNLPEtITEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKIT-ELP 368
Cdd:PLN00113   497 -----------SLSE-LMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSgEIP 564

                   ...
gi 2462599775  369 KSL 371
Cdd:PLN00113   565 KNL 567
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
1065-1086 3.17e-04

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteriztic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neurogulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin ad integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 39.24  E-value: 3.17e-04
                           10        20
                   ....*....|....*....|..
gi 2462599775 1065 CKNGAHCTDAVNGYTCICPEGY 1086
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
177-211 5.61e-04

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 39.15  E-value: 5.61e-04
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 2462599775  177 LEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 211
Cdd:pfam12799    3 LEVLDLSNNQITDIP--PLAKLPNLETLDLSGNNK 35
LRRCT smart00082
Leucine rich repeat C-terminal domain;
668-717 6.04e-04

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 38.95  E-value: 6.04e-04
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 2462599775   668 NPFNCNCYLAWLGEWLRKKRIV--TGNPRCQKPYFLKEiPIQDVAIQDFTCD 717
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHLqdPVDLRCASPSSLRG-PLLELLHSEFKCP 51
LRRNT smart00013
Leucine rich repeat N-terminal domain;
509-539 6.51e-04

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 38.45  E-value: 6.51e-04
                            10        20        30
                    ....*....|....*....|....*....|.
gi 2462599775   509 ACPEKCRCEGTTVDCSNQKLNKIPEHIPQYT 539
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDT 31
LRRCT smart00082
Leucine rich repeat C-terminal domain;
434-464 1.06e-03

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 38.57  E-value: 1.06e-03
                            10        20        30
                    ....*....|....*....|....*....|...
gi 2462599775   434 NPFICDCHLKWLADYLHTNPI--ETSGARCTSP 464
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHlqDPVDLRCASP 33
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
1061-1088 1.06e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 37.84  E-value: 1.06e-03
                           10        20
                   ....*....|....*....|....*...
gi 2462599775 1061 QDNKCKNGAHCTDAVNGYTCICPEGYSG 1088
Cdd:cd00053      4 ASNPCSNGGTCVNTPGSYRCVCPPGYTG 31
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
35-190 1.15e-03

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 43.24  E-value: 1.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   35 SGSTVDCHGL-ALRSVPRNiPRNTERLDLNGNNIT-----RITKTdFAGLRHLRVLQLMENKIStiERGA------FQDL 102
Cdd:COG5238    244 SNNQIGDEGViALAEALKN-NTTVETLYLSGNQIGaegaiALAKA-LQGNTTLTSLDLSVNRIG--DEGAialaegLQGN 319
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  103 KELERLRLNRNHLQLfPELLFLGTA-----KLYRLDLSENQIQAIPRKAF----RGAVDIKNLQLDYNQISciEDGAfRA 173
Cdd:COG5238    320 KTLHTLNLAYNGIGA-QGAIALAKAlqentTLHSLDLSDNQIGDEGAIALakylEGNTTLRELNLGKNNIG--KQGA-EA 395
                          170
                   ....*....|....*..
gi 2462599775  174 LRDLevltLNNNNITRL 190
Cdd:COG5238    396 LIDA----LQTNRLHTL 408
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
944-976 1.60e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.62  E-value: 1.60e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 2462599775  944 NPCKHGGTCHLKEGeedGFWCICADGFEGENCE 976
Cdd:cd00054      9 NPCQNGGTCVNTVG---SYRCSCPPGYTGRNCE 38
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
328-368 1.67e-03

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 37.61  E-value: 1.67e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 2462599775  328 KKLRRIDLSNNQISELapDAFQGLRSLNSLVLYGN-KITELP 368
Cdd:pfam12799    1 PNLEVLDLSNNQITDI--PPLAKLPNLETLDLSGNnKITDLS 40
PLN03150 PLN03150
hypothetical protein; Provisional
746-818 1.71e-03

hypothetical protein; Provisional


Pssm-ID: 178695 [Multi-domain]  Cd Length: 623  Bit Score: 42.88  E-value: 1.71e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462599775  746 NKGLK-VLPKGIP--RDVTELYLDGNQFT-LVPKELSNYKHLTLIILSYNRLRCIPPRTFDGLKSLRLLSLHGNDIS 818
Cdd:PLN03150   427 NQGLRgFIPNDISklRHLQSINLSGNSIRgNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNGNSLS 503
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
527-670 1.93e-03

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 41.31  E-value: 1.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  527 KLNKIP--EHIPQYTaELRLNNNEFTVLEAtgiFKKLPQLRKINFSNNKITDIEegAFEGASGVNEILLTSNRLENVQHK 604
Cdd:cd21340     35 KITKIEnlEFLTNLT-HLYLQNNQIEKIEN---LENLVNLKKLYLGGNRISVVE--GLENLTNLEELHIENQRLPPGEKL 108
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462599775  605 MF-----KGL-ESLKTLMLRSNRITCVgnDSFIGLSSVRLLSLYDNQITTVAP--GAFDTLHSLSTLNLLANPF 670
Cdd:cd21340    109 TFdprslAALsNSLRVLNISGNNIDSL--EPLAPLRNLEQLDASNNQISDLEEllDLLSSWPSLRELDLTGNPV 180
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
983-1014 2.56e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 37.07  E-value: 2.56e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 2462599775  983 EDNDCENNSTCVDGINNYTCLCPPEYTGEL-CE 1014
Cdd:cd00053      4 ASNPCSNGGTCVNTPGSYRCVCPPGYTGDRsCE 36
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
941-974 2.98e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.59  E-value: 2.98e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 2462599775  941 CISNPCKHGGTCHLKEGeedGFWCICADGFEGEN 974
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPG---GYTCICPEGYTGKR 31
LRR_TYP smart00369
Leucine-rich repeats, typical (most populated) subfamily;
329-350 3.57e-03

Leucine-rich repeats, typical (most populated) subfamily;


Pssm-ID: 197687 [Multi-domain]  Cd Length: 24  Bit Score: 36.18  E-value: 3.57e-03
                            10        20
                    ....*....|....*....|..
gi 2462599775   329 KLRRIDLSNNQISELAPDAFQG 350
Cdd:smart00369    3 NLRELDLSNNQLSSLPPGAFQG 24
LRR smart00370
Leucine-rich repeats, outliers;
329-350 3.57e-03

Leucine-rich repeats, outliers;


Pssm-ID: 197688 [Multi-domain]  Cd Length: 24  Bit Score: 36.18  E-value: 3.57e-03
                            10        20
                    ....*....|....*....|..
gi 2462599775   329 KLRRIDLSNNQISELAPDAFQG 350
Cdd:smart00370    3 NLRELDLSNNQLSSLPPGAFQG 24
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
284-453 3.64e-03

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 41.70  E-value: 3.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  284 SNNIVDCRGKGLTEIPTnLPETITEIRLEQNTIKviPPGA------FSPYKKLRRIDLSNNQIS-----ELApDAFQGLR 352
Cdd:COG5238    189 CNQIGDEGIEELAEALT-QNTTVTTLWLKRNPIG--DEGAeilaeaLKGNKSLTTLDLSNNQIGdegviALA-EALKNNT 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  353 SLNSLVLYGNKITE-----LPKSLfEGLFSLQLLLLNANKINCLRVDAFQDL----HNLNLLSLYDNKLQT-----IAKg 418
Cdd:COG5238    265 TVETLYLSGNQIGAegaiaLAKAL-QGNTTLTSLDLSVNRIGDEGAIALAEGlqgnKTLHTLNLAYNGIGAqgaiaLAK- 342
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 2462599775  419 TFSPLRAIQTMHLAQNPfICDCHLKWLADYLHTNP 453
Cdd:COG5238    343 ALQENTTLHSLDLSDNQ-IGDEGAIALAKYLEGNT 376
EGF_CA smart00179
Calcium-binding EGF-like domain;
1018-1054 3.82e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 36.46  E-value: 3.82e-03
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 2462599775  1018 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYV-GEHCD 1054
Cdd:smart00179    3 DECASG-NPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
987-1008 3.90e-03

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteriztic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neurogulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin ad integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 36.16  E-value: 3.90e-03
                           10        20
                   ....*....|....*....|..
gi 2462599775  987 CENNSTCVDGINNYTCLCPPEY 1008
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
1025-1052 3.94e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 36.30  E-value: 3.94e-03
                           10        20
                   ....*....|....*....|....*...
gi 2462599775 1025 NPCQHDSKCILTPKGFKCDCTPGYVGEH 1052
Cdd:cd00053      6 NPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
LRR_5 pfam13306
BspA type Leucine rich repeat region (6 copies); This family includes a number of leucine rich ...
293-374 4.08e-03

BspA type Leucine rich repeat region (6 copies); This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.


Pssm-ID: 463839 [Multi-domain]  Cd Length: 127  Bit Score: 39.07  E-value: 4.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  293 KGLTEIptNLPETITEIR--------LE----QNTIKVIPPGAFSPYKKLRRIDLSNNqISELAPDAFQGLrSLNSLVLy 360
Cdd:pfam13306   34 TSLKSI--TLPSSLTSIGsyafyncsLTsitiPSSLTSIGEYAFSNCSNLKSITLPSN-LTSIGSYAFSNC-SLKSITI- 108
                           90
                   ....*....|....
gi 2462599775  361 GNKITELPKSLFEG 374
Cdd:pfam13306  109 PSSVTTIGSYAFSN 122
LRR_9 pfam14580
Leucine-rich repeat;
544-623 5.42e-03

Leucine-rich repeat;


Pssm-ID: 405295 [Multi-domain]  Cd Length: 175  Bit Score: 39.36  E-value: 5.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  544 LNNNEFTVLEAtgiFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL-ENVQHKMFKGLESLKTLMLRSNRI 622
Cdd:pfam14580   49 FSDNEIRKLDG---FPLLRRLKTLLLNNNRICRIGEGLGEALPNLTELILTNNNLqELGDLDPLASLKKLTFLSLLRNPV 125

                   .
gi 2462599775  623 T 623
Cdd:pfam14580  126 T 126
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1110-1137 5.98e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.08  E-value: 5.98e-03
                           10        20
                   ....*....|....*....|....*...
gi 2462599775 1110 CQNGAQCIVRINEPICQCLPGYQGEKCE 1137
Cdd:cd00054     11 CQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
321-671 6.44e-03

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 41.37  E-value: 6.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  321 PGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVL-YGNKITELPKSLFEgLFSLQLLLLNANKINCLRVDAFQDL 399
Cdd:PLN00113   205 PRELGQMKSLKWIYLGYNNLSGEIPYEIGGLTSLNHLDLvYNNLTGPIPSSLGN-LKNLQYLFLYQNKLSGPIPPSIFSL 283
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  400 HNLNLLSLYDNKLQTIAKGTFSPLRAIQTMHLAQNPFicdchlkwladylhTNPIetSGARCTSPRRlankRIGQIKSKK 479
Cdd:PLN00113   284 QKLISLDLSDNSLSGEIPELVIQLQNLEILHLFSNNF--------------TGKI--PVALTSLPRL----QVLQLWSNK 343
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  480 FrcsakeqyfipgtedyrsklSGDCFADLACPEKCrcegTTVDCSNQKLN-KIPE--------------------HIP-- 536
Cdd:PLN00113   344 F--------------------SGEIPKNLGKHNNL----TVLDLSTNNLTgEIPEglcssgnlfklilfsnslegEIPks 399
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  537 ----QYTAELRLNNNEFTVlEATGIFKKLPQLRKINFSNNKIT-DIEEGAFEGASgVNEILLTSNRLENVQHKMFkGLES 611
Cdd:PLN00113   400 lgacRSLRRVRLQDNSFSG-ELPSEFTKLPLVYFLDISNNNLQgRINSRKWDMPS-LQMLSLARNKFFGGLPDSF-GSKR 476
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  612 LKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQITTVAPGAFDTLHSLSTLNLLANPFN 671
Cdd:PLN00113   477 LENLDLSRNQFSGAVPRKLGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLS 536
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
902-931 6.54e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 35.82  E-value: 6.54e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2462599775  902 CLSNPCKNDGTCNSDPVDfYRCTCPYGFKG 931
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGG-YTCICPEGYTG 29
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
60-412 6.55e-03

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 40.99  E-value: 6.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775   60 LDLNGNNITRITKTDFAGLRHLRVLQLMENKIS-TIERGAFqDLKELERLRLNRNHLQ-LFPELLflgtAKLYRLDL--- 134
Cdd:PLN00113   241 LDLVYNNLTGPIPSSLGNLKNLQYLFLYQNKLSgPIPPSIF-SLQKLISLDLSDNSLSgEIPELV----IQLQNLEIlhl 315
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  135 -SENQIQAIPRkAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNIT-RL--SVASFNHMPKLRTFrlhSNN 210
Cdd:PLN00113   316 fSNNFTGKIPV-ALTSLPRLQVLQLWSNKFSGEIPKNLGKHNNLTVLDLSTNNLTgEIpeGLCSSGNLFKLILF---SNS 391
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  211 LYCDCHLAwLSDwLRQRPRVGLYTqcmgpSHLRGHNVAEVQKREFVcsdeeeghqSFMAPScsvlhcpaactcSNNIVDC 290
Cdd:PLN00113   392 LEGEIPKS-LGA-CRSLRRVRLQD-----NSFSGELPSEFTKLPLV---------YFLDIS------------NNNLQGR 443
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462599775  291 RGKGLTEIPTnlpetITEIRLEQNTIKVIPPGAFSPyKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKIT-ELPK 369
Cdd:PLN00113   444 INSRKWDMPS-----LQMLSLARNKFFGGLPDSFGS-KRLENLDLSRNQFSGAVPRKLGSLSELMQLKLSENKLSgEIPD 517
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 2462599775  370 SLfEGLFSLQLLLLNANKINCLRVDAFQDLHNLNLLSLYDNKL 412
Cdd:PLN00113   518 EL-SSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQL 559
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1105-1134 6.73e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 35.44  E-value: 6.73e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2462599775 1105 CDNFDCQNGAQCIVRINEPICQCLPGYQGE 1134
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
541-579 8.34e-03

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 35.68  E-value: 8.34e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 2462599775  541 ELRLNNNEFTVLEAtgiFKKLPQLRKINFS-NNKITDIEE 579
Cdd:pfam12799    5 VLDLSNNQITDIPP---LAKLPNLETLDLSgNNKITDLSD 41
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
1110-1131 9.44e-03

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteriztic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neurogulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin ad integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 35.00  E-value: 9.44e-03
                           10        20
                   ....*....|....*....|..
gi 2462599775 1110 CQNGAQCIVRINEPICQCLPGY 1131
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
900-934 9.50e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 35.31  E-value: 9.50e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 2462599775  900 NPCLS-NPCKNDGTCNSDPVDfYRCTCPYGFKGQDC 934
Cdd:cd00054      3 DECASgNPCQNGGTCVNTVGS-YRCSCPPGYTGRNC 37
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH