Conserved domains on [gi|2217306237|ref|XP_047290167|]
polycystin-1 isoform X6 [Homo sapiens]

Protein Classification

CLECT and PLAT_polycystin domain-containing protein (domain architecture ID 13202464)

protein containing domains CLECT, GPS, PLAT_polycystin, and PKD_channel

List of domain hits

Name Accession Description Interval E-value
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
27-2703 0e+00

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 4838.88  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237   27 DISNNKISTLEEGIFANLFNLSEINLSGNPFECDCGLAWLPRWAEEQQVRVVQPEAATCAGPGSLAGQPLLGIPLLDSGC 106
Cdd:TIGR00864    1 DISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEAALCAGPGALAGQPLLGIPLLDSGC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  107 GEEYVACLPDNSSGTVAAVS----FSAAHEGLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSASFACLSLCSGPP 182
Cdd:TIGR00864   81 DEEYVACLKDNSSGGGAARSelviFSAAHEGLFQPEACNAFCFSAGHGLAALGEQGECLCGAAQPSEANFACESLCSGPP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  183 PPPAPTCRGPTLLQHVFPASPGATLVGPHGPLASGQLAAFHIAAPLPVTATRWDFGDGSAEVDAAGP----AASHRYVLP 258
Cdd:TIGR00864  161 PPPAAACRGPQLLEHIFPALPGAPIQGPHGPIASGQLAAFHAAAPLAPTAMRWDFGDGSAEVDAAGAggttAASHKYGHP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  259 GRYHVTAVLALGAGSALLGTDVQVEAAPAALELVCPSSVQSDESLDLSIQNRGGSGLEAAYSIVALGEEPARAVHPLCPS 338
Cdd:TIGR00864  241 GRYHVSAMGALGAGKALAGGDVQVEAAPAALELHCPSLVQADESLDLSIQNRGGSDLDAAWKITAHGEEPAKASHPHCPK 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  339 DTEIFPGNGHCYRLVVEKAAWLQAQEQCQAWAGAALAMVDSPAVQRFLVSRVTRSLD--VWIGFSTVQGVEVGPAPQGEA 416
Cdd:TIGR00864  321 DGEIFEENGHCFQIVPEEAAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSLDrgVWIGFSDVNGAEKGPAHQGEA 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  417 FSLESCQNWLPGEPHPATAEHCVRLGPTGWCNTDLCSAPHSYVCELQPGGPVQDAENLLVGAPSGDLQGPLTPLAQQDGL 496
Cdd:TIGR00864  401 FEAEECEEGLAGEPHPARAEHCVRLDPRGQCNSDLCNAPHAYVCELNPGGPVPDAENFAMGAASFDLHGLLQALAAMDGL 480
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  497 SAP-HEPVEVMVFPGLRLSREAFLTTAEFGTQELRRPAQLRLQVYRL---LSTAGTPENGSEPESRSPDNRTQLAPACMP 572
Cdd:TIGR00864  481 PAPpHEGVEVLLFPALRFSRAAFLSSAEFGTQELRRPAHILFQIYRLrcrLPGAGGPACGPEAECRPPDNRSADAPACMK 560
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  573 GGRWCPGANICLPLDASCHPQACANGCTSGPGLPGAP----YALWREFLFSVPAGPPAQYSlllpvcqvlacvlspcvpr 648
Cdd:TIGR00864  561 GEQWCPFAHICLPLDAPCHPQACANGCSQGHGLPGAArmplYALQREFLFSLPAGPAAHVL------------------- 621
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  649 satvlpvlvtstvgrlleVTLHGQDVLMLPGDLVGLQHDAGPGALLHCSPAPGHPGPRAPYLSANASSWLPH-------- 720
Cdd:TIGR00864  622 ------------------LQDHGEDLLMLPGDLIALQHDAGPAALIHCQPAPGHPGPRAPVFAANASEWFGHnntpvppd 683
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  721 ----------------LPAQLEGTWA----CPACALRLLAATEQLTVLLGLRPNPGLRLPGRYEVRAEVGNGVSRHNLSC 780
Cdd:TIGR00864  684 nlagdgadplpdpeldLKALLEGTRAswleCAACAIRLLAAGEQETRLLGAELNAGLPLPGLYELLAESAKGSDLHNASC 763
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  781 SFDVVSPVAGLRVIYPAPRDGRLYVPTNGSALVLQVDSGANATATARWPGGSVSARFENVCPALVA-------TFVPGCP 853
Cdd:TIGR00864  764 SFDVLPPLAGLRVIHPAPQDGRLFLESNGSALLLQVDSGANAEAKAFWPGGNSSARFENVCPAEFAsrlchpsTFEGGCA 843
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  854 WETNDTLFSVVALPWLSEGEHV----VDVVVENSASRANLSLRVTAEEPICGLRATPSPEARVLQGVLVRYSPVVEAGSD 929
Cdd:TIGR00864  844 EEAEDSLFAVLALNWLKEGEHTgpvqVDLMAENNASEANLSLLVQAEEPICGLRAQPHPAARVLMESLVRYSASVEAGSD 923
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  930 MVFRWTINDKQSLTFQNVVFNVIYQSAAVFKLSpedaamavLTASNHVSNVTVNYNVTVERMNRMQGLQVSTVPAVLSPN 1009
Cdd:TIGR00864  924 MTFKWTIDDKPFFTFQNTVFNVIYQHAAVFKLS--------LTAMNHVSNLTEDFNVTVDRLNPMQGLQVKGVPAVLPPG 995
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1010 ATLALTAGVLVDSAVEVAFLWTFGDGEQALHQFQPPYNESFPVPDPSVAQVLVEHNVMHTYAAPGEYLLTVLASNAFENL 1089
Cdd:TIGR00864  996 ATLALTAGVLIDMAVEAAFLWSFGDGEQALFEFKPPYNESFPCPDPSPAQVLLEHNVMHIYAAPGEYLATVLASNAFENI 1075
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1090 TQQVPVSVRASLPSVAVGVSDGVLVAGRPVTFYPHPLPSPGGVLYTWDFGDGSPVLTQSQPAANHTYASRGTYHVRLEVN 1169
Cdd:TIGR00864 1076 SQQINMSVRAILPRVAIGTEDGLLLAGKPADFEAHPLPSPGGIHYEWDFGDGSALLQGRQPAAAHTFAKRGPFHVCLEVN 1155
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1170 NTVSGAAAQADVRVFEELRGLSVDMSLAVEQGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTV 1249
Cdd:TIGR00864 1156 NTISGAAACADMFAFEEIEGLSADMSLATELGAATTVRAALQSGDNITWTFDMGDGKSLSGPEATVEHKYAKAGNCTVNI 1235
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1250 GAASPAGHLARSLHVLVFVLEVLRVEPAACIPTQPDARLTAYVTGNPAHYLFDWTFGDGSSNTTVRGCPTVTHNFTRSGT 1329
Cdd:TIGR00864 1236 GAANAAGHGARIIHVEVFVFEVAGIEPAACIGEHADANFRARVSGNAAHYLFDWSFGDGSPNETHHGCPGISHNFRGNGT 1315
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1330 FPLALVLSSRVNRAHYFTSICVEPEVGNVTLQPERQFVQLGDEAWLVACAWPPFPYRYTWDFGTEEAAPTRARGPEVTFI 1409
Cdd:TIGR00864 1316 FPLALTISSGVNKAHFFTQICVEPELGKISLQAEKQFFALGDEAQFQACAEPEFNYRYEWDFGGEEAAPLPAAGAEVTFI 1395
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1410 YRDPGSYLVTVTASNNISAANDSALVEVQEPVLVTSIKVNGSLG--LELQQPYLFSAVGRGRPASYLWDLGDGGWLEGPE 1487
Cdd:TIGR00864 1396 YNDPGCYLVTVAASNNISAANDSALIEVLEPVGATSFKHNGSHGnnLELGQPYLFSAFGRARNASYLWDFGDGGLLEGPE 1475
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1488 VTHAYNSTGDFTVRVAGWNEVSRSEAWLNVTVKRRVRGLVVNASRTVVPLNGSVSFSTSLEAGSDVRYSWVLCDRCTPIP 1567
Cdd:TIGR00864 1476 ILHAFNSPGDFNIRLAAANEVGKNEATLNVAVKARVRGLTINASLTNVPLNGSVHFEAHLDAGDDVRFSWILCDHCTPIF 1555
                         1610      1620      1630      1640      1650      1660      1670      1680
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1568 GGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVYVLQLIEGLQVVGG-------------GRYFPTNHTVQLQAVVRDG 1634
Cdd:TIGR00864 1556 GGNTIFYTFRSVGTFNIIVTAENDVGAAQASIFLFVLQEIEGLQILGEtaegggggvqeldGCYFETNHTVQFHAGFKDG 1635
                         1690      1700      1710      1720      1730      1740      1750      1760
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1635 TNVSYSWTAWRD---RGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWADCTMDFVEPVGWLMVAASPNPAAVNTSV 1711
Cdd:TIGR00864 1636 TNLSFSWNAILDnepDGPAFAGSGKGAKLNPLEAGPCDIFLQAANLLGQATADCTIDFLEPAGNLMLAASDNPAAVNALI 1715
                         1770      1780      1790      1800      1810      1820      1830      1840
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1712 TLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPGLHLVTMTAGNPLGSANATVEVDVQVPVSGLSIRASEPGGS- 1790
Cdd:TIGR00864 1716 NLSAELAEGSGLQYRWFLEEGDDLETSEPFMSHSFPSAGLHLVTMKAFNELGSANASEEVDVQEPISGLKIRAADAGEQn 1795
                         1850      1860      1870      1880      1890      1900      1910      1920
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1791 FVAAGSSVPFWGQLATGTNVSWCWAVPGGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSATYNLTAEEPIVGLVLWAS 1870
Cdd:TIGR00864 1796 FFAADSSVCFQGELATGTNVSWCWAIDGGSSKMGKHACMTFPDAGTFAIRLNASNAVSGKSASREFFAEEPIFGLELKAS 1875
                         1930      1940      1950      1960      1970      1980      1990      2000
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1871 SKVVAPGQLVHFQILLAAGSAVTFRLQVGGANPEVL-PGPRFSHSFPRVGDHVVSVRGKNHVSWAQAQVRIVVLEAVSGL 1949
Cdd:TIGR00864 1876 KKIAAIGEKVEFQILLAAGSAVNFRLQIGGAAPEVLqPGPRFSHSFPRVDDHMVNLRAKNEVSCAQANLHIEVLEAVRGL 1955
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1950 QVPNCCEPGIATGTERNFTARVQRGSRVAYAWYFSLQKVQGDSLVILSGRDVTYTPVAAGLLEIQVRAFNALGSENRTLV 2029
Cdd:TIGR00864 1956 QIPDCCAAGIATGEEKNFTANVQRGKPVAFAWTFDLHHLHGDSLVIHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQ 2035
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2030 LEVQDAVQYVALQSGP--CFTNRSAQFEAATSPSPRRVAYHWDFGDGSPGQDTDEPRAEHSYLRPGDYRVQVNASNLVSF 2107
Cdd:TIGR00864 2036 LEAQDALMDAALQAGPqdCFTNKMAQFEAATSPKPNFMACHWDFGDGSAGQDTDEPRAEHEYLHPGDYRVQVNASNLVSF 2115
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2108 FVAQATVTVQVLACREPEVDVVLPLQVLMRRSQRNYLEAHVDLRDCVTYQTEYRWEVYRTASCQRPGRPARVALPG---- 2183
Cdd:TIGR00864 2116 FSAHAEINVQVLACEEPEVDVVLALQLAIRRSQPNLLEAHVDLKDCLRYGAEYLWEILRAASCDNDGHFARGALNGatrs 2195
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2184 ---------VDVSRPRLVLPRLALPVGHYCFVFVVSFGDTPLTQSIQANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGS 2254
Cdd:TIGR00864 2196 fpviplpaeVDVQRLQLSLPKLALAAGHYCFVFSLSFEDTPLKKAACANLGVAAARLMPIIEGGSYRVWSDTQDLQLDAE 2275
                         2330      2340      2350      2360      2370      2380      2390      2400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2255 ESYDPNLEDGDQTPLSFHWACVASTQREAGGCALNFGPRG-SSTVTIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLI 2333
Cdd:TIGR00864 2276 ESYDPNLDDDDQSLLHFHWACQASSKGEAGCCALNFGLGGkGPTLGIPGEELAAGIEYTFKLSIGKAGMKEEATNQTVLI 2355
                         2410      2420      2430      2440      2450      2460      2470      2480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2334 RSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGRCLNCSSGSKRGRWAARTFSNKTLVLDETTTSTGSAGMRLVLRRGVL 2413
Cdd:TIGR00864 2356 QSGHIPIVSLECVSCKAQALYEVSQNSYVYLEGRCLNCQSGFHRGRWAARTFQNDTLVLDESSTSTGSAGMNLVLRQGVL 2435
                         2490      2500      2510      2520      2530      2540      2550      2560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2414 RDGEGYTFTLTVLGRSGEEEGCASIRLSPNRPPLGGSCRLFPLG--------------AVHALTTKVHFECTGWHDAEDA 2479
Cdd:TIGR00864 2436 HDGEGYNFTLHVLDDSGDEEGAASIRLHHNMPPDGGECHLFPGGetgqehgdkedevwAIEALLDKVHFECSGWHDAEDA 2515
                         2570      2580      2590      2600      2610      2620      2630      2640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2480 GAPLVYALLLRRCRQGHCEEFCVYKGSLSSYGAVLPPGFR-PHFEVGLAVVVQDQLGAAVVALNRSLAITLPEPNGSATG 2558
Cdd:TIGR00864 2516 EAPLLYALLLNRCRDDHCEEFCVYKGSLPEHGAFLPPGFRsAHFEVGLAITVEDHLGAAIRALNKSIAITLPDPNGEASG 2595
                         2650      2660      2670      2680      2690      2700      2710      2720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2559 LTVWLHGLTASVLPGLLRQADPQHVIEYSLALVTVLNEYERALDVAAEPKHERQHRAQIRKNITETLVSLRVHTVDDIQQ 2638
Cdd:TIGR00864 2596 LPHWLHDLIASKLKGLLDQADFQHVIELSLALITVLNEYEQALDSAAEPKHERGHRAQIRKNITEALTALDLHTVDDIQQ 2675
                         2730      2740      2750      2760      2770      2780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217306237 2639 IAAALAQCMGPSRELVCRSCLKQTLHKLEAMMLILQAETTAGTVTPTAIGDSILNITGDLIHLAS 2703
Cdd:TIGR00864 2676 IAAALAQCMAPSREFICEECLKQTLHKLEAMLEILQADTKAGIVTPTAIADNILNIMGDLIHLAS 2740
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
3093-3212 9.28e-58

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans, that are widely expressed in various cell types and whose biological functions remain poorly defined. In humans, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause of autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane-bound proteins.


Pssm-ID: 238850  Cd Length: 120  Bit Score: 196.34  E-value: 9.28e-58
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3093 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGD--RAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAW 3170
Cdd:cd01752      1 YLYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPekPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSW 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 2217306237 3171 FLQHVIVRDLQTARSAFFLVNDWLSVETEanGGLVEKEVLAA 3212
Cdd:cd01752     81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120
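Each alignment row on this page follows the same layout: a label, a start coordinate, the aligned residues, and an end coordinate. The following is a minimal sketch of how such rows could be parsed and sanity-checked, using the last block of the cd01752 alignment above. The helper names (`parse_row`, `residue_count`, `identity`) are hypothetical and not part of any NCBI tool. The sketch assumes that `-` marks a gap and compares residues case-insensitively, which is a simplification for lowercase residues.

```python
import re

# Row layout assumed from this page: label, start, aligned residues, end.
ROW = re.compile(
    r"^(?P<label>.+?)\s+(?P<start>\d+)\s+(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)$"
)


def parse_row(line):
    """Split one alignment row into (label, start, residues, end)."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return m["label"].strip(), int(m["start"]), m["seq"], int(m["end"])


def residue_count(seq):
    """Number of residues in an aligned string, ignoring '-' gap columns."""
    return sum(1 for c in seq if c != "-")


def identity(query, model):
    """Return (identical, aligned) over columns where neither row has a gap."""
    identical = aligned = 0
    for a, b in zip(query, model):
        if a == "-" or b == "-":
            continue
        aligned += 1
        if a.upper() == b.upper():  # assumption: letter case is ignored
            identical += 1
    return identical, aligned


# Last block of the cd01752 alignment, copied from the rows above.
query_line = "gi 2217306237 3171 FLQHVIVRDLQTARSAFFLVNDWLSVETEanGGLVEKEVLAA 3212"
model_line = "Cdd:cd01752     81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120"

_, q_start, q_seq, q_end = parse_row(query_line)
_, s_start, s_seq, s_end = parse_row(model_line)

# The coordinate span of each row should equal its non-gap residue count.
assert q_end - q_start + 1 == residue_count(q_seq)
assert s_end - s_start + 1 == residue_count(s_seq)

identical, aligned = identity(q_seq, s_seq)
print(f"{identical}/{aligned} aligned columns are identical")
```

The same two coordinate checks can be run over every block of an alignment to catch rows damaged during copy and paste.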
Polycystin_dom super family cl48672
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
3688-3866 5.32e-56

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain displays a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


The actual alignment was detected with superfamily member pfam20519:

Pssm-ID: 466668  Cd Length: 199  Bit Score: 194.56  E-value: 5.32e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3688 AFLAITRSEELWPWMAHVLLPYVHGNQS------------SPELGPPRLRQVRLQEA--LYPDPPGPRVHTCSAAGGFST 3753
Cdd:pfam20519    1 GLLTVTDLDDIWDWLSSVLLPALHSNKTpsglpgsfiayeSLLLGVPRLRQLRVRNSscLVHDKFVREINECHAGYSPPS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3754 SDY----DVGWESPHNGSGTW-AYSAPDLL-GAWSWGSCAVYDSGGYVQELGLSLEESRDRLRFLQLHNWLDNRSRAVFL 3827
Cdd:pfam20519   81 EDRklysALPYKPVHYGSKYWfIYTPPGLLmGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVFV 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 2217306237 3828 ELTRYSPAVGLHAAVTLRLEFPAAGRALAALSVRPFALR 3866
Cdd:pfam20519  161 DFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQSVKLL 199
PKD_channel super family cl37568
Polycystin cation channel; This family contains the cation channel region from group II of ...
3867-4088 9.39e-44

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


The actual alignment was detected with superfamily member pfam08016:

Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 160.52  E-value: 9.39e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3867 RLSAGLSLPLLT-SVCLLLFAVHFAVAEARTWHREGR------WRVLRLgawarwLLVALTAATALVRLAQLGAADRQWT 3939
Cdd:pfam08016    1 RYVTNRSLFILLcEIVFVVFFLYFVVEEILKIRKHRPsylrsvWNLLDL------AIVILSVVLIVLNIYRDFLADRLIK 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3940 rFVRGRPRRFTSFDQVAQLSSAARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELLGVTLGLVVLGVAYAQLA 4019
Cdd:pfam08016   75 -SVEASPVTFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFG 153
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217306237 4020 ILLVSSCVDSLWSVAQALLVLCPgtglsTLCPAESWH--------LSPLLCVGLWALRLWGALRLGAVILRWRYHAL 4088
Cdd:pfam08016  154 YLLFGTQAPNFSNFVKSILTLFR-----TILGDFGYNeifsgnrvLGPLLFLTFVFLVIFILLNLFLAIINDSYVEV 225
GPS super family cl02559
GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for ...
2986-3035 3.20e-10

GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs and is the site of autoproteolysis, hence the name GPS. The GPS motif is a conserved sequence of ~40 amino acids containing canonical cysteine and tryptophan residues, and is the most highly conserved part of the domain. Most, if not all, cell-adhesion GPCRs undergo autoproteolysis in the GPS between a conserved aliphatic residue (usually a leucine) and a threonine, serine, or cysteine residue. In higher eukaryotes this motif is found embedded in the C-terminal beta-stranded part of a GAIN domain - GPCR-Autoproteolysis INducing (GAIN). The GAIN-GPS domain adopts a fold in which the GPS motif, at the C-terminus, forms five beta-strands that are tightly integrated into the overall GAIN domain. The GPS motif, evolutionarily conserved from Tetrahymena to mammals, is the only extracellular domain shared by all human cell-adhesion GPCRs and PKD proteins, and is the locus of multiple human disease mutations. The GAIN-GPS domain is both necessary and sufficient functionally for autoproteolysis, suggesting an autoproteolytic mechanism whereby the overall GAIN domain fine-tunes the chemical environment in the GPS to catalyze peptide bond hydrolysis. In the cell-adhesion GPCRs and PKD proteins, the GPS motif is always located at the end of their long N-terminal extracellular regions, immediately before the first transmembrane helix of the respective protein.


The actual alignment was detected with superfamily member smart00303:

Pssm-ID: 470616  Cd Length: 49  Bit Score: 58.17  E-value: 3.20e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 2217306237  2986 YTSLCQYFSEEDMVWRTEGLLPLEETSpRQAVCLTRHLTAFGASLFVPPS 3035
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNG-THTTCSCNHLTTFAVLMDVPPI 49
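The hit list above is ordered by E-value rather than by position, so the domain order along the protein is not immediately obvious. The sketch below, under stated assumptions, re-sorts the five hits by query start, checks that no two intervals overlap, and reports how much of each model the alignment covers. The `Hit` class and helper names are hypothetical. The query intervals and Cd Length values are copied from this page, and the model coordinates are read from the first and last rows of each alignment.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    name: str       # hit name as listed on this page
    member: str     # model the alignment was actually detected with
    q_start: int    # first aligned query residue
    q_end: int      # last aligned query residue
    m_start: int    # first aligned model position
    m_end: int      # last aligned model position
    cd_length: int  # "Cd Length" reported for the model


# Values transcribed from the hit list and alignment rows on this page.
HITS = [
    Hit("PCC", "TIGR00864", 27, 2703, 1, 2740, 2740),
    Hit("PLAT_polycystin", "cd01752", 3093, 3212, 1, 120, 120),
    Hit("Polycystin_dom", "pfam20519", 3688, 3866, 1, 199, 199),
    Hit("PKD_channel", "pfam08016", 3867, 4088, 1, 225, 225),
    Hit("GPS", "smart00303", 2986, 3035, 1, 49, 49),
]


def model_coverage(hit):
    """Fraction of the model's length spanned by the alignment."""
    return (hit.m_end - hit.m_start + 1) / hit.cd_length


# N-terminal to C-terminal order along the query protein.
ordered = sorted(HITS, key=lambda h: h.q_start)

# Adjacent hits overlap if one ends at or after the start of the next.
overlaps = [
    (a.name, b.name)
    for a, b in zip(ordered, ordered[1:])
    if a.q_end >= b.q_start
]

for h in ordered:
    span = h.q_end - h.q_start + 1
    print(f"{h.name:16s} {h.q_start:5d}-{h.q_end:<5d} "
          f"span={span:5d} model coverage={model_coverage(h):.0%}")
print("overlapping pairs:", overlaps)
```

Sorting by `q_start` instead of E-value is a deliberate choice here: it mirrors the left-to-right layout of a domain architecture diagram.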
 
gi 2217306237 1950 QVPNCCEPGIATGTERNFTARVQRGSRVAYAWYFSLQKVQGDSLVILSGRDVTYTPVAAGLLEIQVRAFNALGSENRTLV 2029
Cdd:TIGR00864 1956 QIPDCCAAGIATGEEKNFTANVQRGKPVAFAWTFDLHHLHGDSLVIHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQ 2035
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2030 LEVQDAVQYVALQSGP--CFTNRSAQFEAATSPSPRRVAYHWDFGDGSPGQDTDEPRAEHSYLRPGDYRVQVNASNLVSF 2107
Cdd:TIGR00864 2036 LEAQDALMDAALQAGPqdCFTNKMAQFEAATSPKPNFMACHWDFGDGSAGQDTDEPRAEHEYLHPGDYRVQVNASNLVSF 2115
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2108 FVAQATVTVQVLACREPEVDVVLPLQVLMRRSQRNYLEAHVDLRDCVTYQTEYRWEVYRTASCQRPGRPARVALPG---- 2183
Cdd:TIGR00864 2116 FSAHAEINVQVLACEEPEVDVVLALQLAIRRSQPNLLEAHVDLKDCLRYGAEYLWEILRAASCDNDGHFARGALNGatrs 2195
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2184 ---------VDVSRPRLVLPRLALPVGHYCFVFVVSFGDTPLTQSIQANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGS 2254
Cdd:TIGR00864 2196 fpviplpaeVDVQRLQLSLPKLALAAGHYCFVFSLSFEDTPLKKAACANLGVAAARLMPIIEGGSYRVWSDTQDLQLDAE 2275
                         2330      2340      2350      2360      2370      2380      2390      2400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2255 ESYDPNLEDGDQTPLSFHWACVASTQREAGGCALNFGPRG-SSTVTIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLI 2333
Cdd:TIGR00864 2276 ESYDPNLDDDDQSLLHFHWACQASSKGEAGCCALNFGLGGkGPTLGIPGEELAAGIEYTFKLSIGKAGMKEEATNQTVLI 2355
                         2410      2420      2430      2440      2450      2460      2470      2480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2334 RSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGRCLNCSSGSKRGRWAARTFSNKTLVLDETTTSTGSAGMRLVLRRGVL 2413
Cdd:TIGR00864 2356 QSGHIPIVSLECVSCKAQALYEVSQNSYVYLEGRCLNCQSGFHRGRWAARTFQNDTLVLDESSTSTGSAGMNLVLRQGVL 2435
                         2490      2500      2510      2520      2530      2540      2550      2560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2414 RDGEGYTFTLTVLGRSGEEEGCASIRLSPNRPPLGGSCRLFPLG--------------AVHALTTKVHFECTGWHDAEDA 2479
Cdd:TIGR00864 2436 HDGEGYNFTLHVLDDSGDEEGAASIRLHHNMPPDGGECHLFPGGetgqehgdkedevwAIEALLDKVHFECSGWHDAEDA 2515
                         2570      2580      2590      2600      2610      2620      2630      2640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2480 GAPLVYALLLRRCRQGHCEEFCVYKGSLSSYGAVLPPGFR-PHFEVGLAVVVQDQLGAAVVALNRSLAITLPEPNGSATG 2558
Cdd:TIGR00864 2516 EAPLLYALLLNRCRDDHCEEFCVYKGSLPEHGAFLPPGFRsAHFEVGLAITVEDHLGAAIRALNKSIAITLPDPNGEASG 2595
                         2650      2660      2670      2680      2690      2700      2710      2720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2559 LTVWLHGLTASVLPGLLRQADPQHVIEYSLALVTVLNEYERALDVAAEPKHERQHRAQIRKNITETLVSLRVHTVDDIQQ 2638
Cdd:TIGR00864 2596 LPHWLHDLIASKLKGLLDQADFQHVIELSLALITVLNEYEQALDSAAEPKHERGHRAQIRKNITEALTALDLHTVDDIQQ 2675
                         2730      2740      2750      2760      2770      2780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217306237 2639 IAAALAQCMGPSRELVCRSCLKQTLHKLEAMMLILQAETTAGTVTPTAIGDSILNITGDLIHLAS 2703
Cdd:TIGR00864 2676 IAAALAQCMAPSREFICEECLKQTLHKLEAMLEILQADTKAGIVTPTAIADNILNIMGDLIHLAS 2740
REJ pfam02010
REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor ...
2146-2589 1.17e-132

REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1 and in the sperm receptor for egg jelly (Swiss:Q26627). The function of this domain is unknown. The domain is 600 amino acids long, so it is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains.


Pssm-ID: 366875 [Multi-domain]  Cd Length: 448  Bit Score: 424.99  E-value: 1.17e-132
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2146 AHVDLRDCV-TYQTEYRWEVYRTASC---QRPGRPARVALPGVDvsrprlvLPRLALPVGHYCFVFVVSFGDTP-LTQSI 2220
Cdd:pfam02010    1 ASVELNGCFsAYTIDYLWSVFTVSSNlnlQTISSPKDLVLPQLT-------IPSGTLPYGTYVFTLTVSLSSTPsLAGTD 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2221 QANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGSESYDPNLEDGDQTPLSFHWACVAST------QREAGGCA-----LN 2289
Cdd:pfam02010   74 IITVTVQPSPLVAVIDGGSSRVVGYNQDLTLDGSESYDPDVDPGSSSGLTYLWSCRRSSsgdnplLNNDPVCFsdqneGT 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2290 FGPRGSSTVTIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRssYVYLEGRCL 2369
Cdd:pfam02010  154 LLQSTSSSLTIPASTLQANVTYTFKLTVSKGSRNSASTTQTILVVDGNPPIIILSCISNCNRKNNPVDR--LVLLASTCL 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2370 NCSSGSKRG--RWAARTFSNKTLVLD--ETTTSTGSAGMRLVLRRGVLRDGEGYTFTLTVLGRSGEEEGCASIRLSPNRP 2445
Cdd:pfam02010  232 NCSSDLSDVtyRWLSLGSENTSLVLDqlNSQTSTGRSGPYLVIKAGVLQSGVSYRFTLIVTVYPGLVSGLASISFITNAP 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2446 PLGGSCRLFPLGAvHALTTKVHFECTGWHDAEDagaPLVYALLLRRCRQGHCEEFCVYKGSLS-SYGAVLPPGFRPH-FE 2523
Cdd:pfam02010  312 PTGGTCSVTPTEG-TALETKFTVTCQGWTDDDL---PLTYQFGDISFREASEEWFLLYEGSSQiSISTFLPPGLPANdYQ 387
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306237 2524 VGLAVVVQDQLGAAvVALNRSLAITLPEPNGSatglTVWLHGLTASVLPGLLRQADPQHVIEYSLA 2589
Cdd:pfam02010  388 VTVVVVVYDSLGAA-TSVSLTITVTPPSSSDE----LLYFLLGTTSDLSALLQSGDPQQAAQLILA 448
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
3093-3212 9.28e-58

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, and mammals including humans, that are widely expressed in various cell types and whose biological functions remain poorly defined. In humans, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to cause autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane-bound proteins.


Pssm-ID: 238850  Cd Length: 120  Bit Score: 196.34  E-value: 9.28e-58
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3093 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGD--RAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAW 3170
Cdd:cd01752      1 YLYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPekPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSW 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 2217306237 3171 FLQHVIVRDLQTARSAFFLVNDWLSVETEanGGLVEKEVLAA 3212
Cdd:cd01752     81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120
Polycystin_dom pfam20519
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
3688-3866 5.32e-56

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain displays a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


Pssm-ID: 466668  Cd Length: 199  Bit Score: 194.56  E-value: 5.32e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3688 AFLAITRSEELWPWMAHVLLPYVHGNQS------------SPELGPPRLRQVRLQEA--LYPDPPGPRVHTCSAAGGFST 3753
Cdd:pfam20519    1 GLLTVTDLDDIWDWLSSVLLPALHSNKTpsglpgsfiayeSLLLGVPRLRQLRVRNSscLVHDKFVREINECHAGYSPPS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3754 SDY----DVGWESPHNGSGTW-AYSAPDLL-GAWSWGSCAVYDSGGYVQELGLSLEESRDRLRFLQLHNWLDNRSRAVFL 3827
Cdd:pfam20519   81 EDRklysALPYKPVHYGSKYWfIYTPPGLLmGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVFV 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 2217306237 3828 ELTRYSPAVGLHAAVTLRLEFPAAGRALAALSVRPFALR 3866
Cdd:pfam20519  161 DFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQSVKLL 199
PKD_channel pfam08016
Polycystin cation channel; This family contains the cation channel region from group II of ...
3867-4088 9.39e-44

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 160.52  E-value: 9.39e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3867 RLSAGLSLPLLT-SVCLLLFAVHFAVAEARTWHREGR------WRVLRLgawarwLLVALTAATALVRLAQLGAADRQWT 3939
Cdd:pfam08016    1 RYVTNRSLFILLcEIVFVVFFLYFVVEEILKIRKHRPsylrsvWNLLDL------AIVILSVVLIVLNIYRDFLADRLIK 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3940 rFVRGRPRRFTSFDQVAQLSSAARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELLGVTLGLVVLGVAYAQLA 4019
Cdd:pfam08016   75 -SVEASPVTFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFG 153
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217306237 4020 ILLVSSCVDSLWSVAQALLVLCPgtglsTLCPAESWH--------LSPLLCVGLWALRLWGALRLGAVILRWRYHAL 4088
Cdd:pfam08016  154 YLLFGTQAPNFSNFVKSILTLFR-----TILGDFGYNeifsgnrvLGPLLFLTFVFLVIFILLNLFLAIINDSYVEV 225
WSC smart00321
present in yeast cell wall integrity and stress response component proteins; Domain present in ...
107-201 2.41e-24

present in yeast cell wall integrity and stress response component proteins; Domain present in WSC proteins, polycystin and fungal exoglucanase


Pssm-ID: 214616 [Multi-domain]  Cd Length: 95  Bit Score: 99.85  E-value: 2.41e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237   107 GEEYVACLPDNSSGTVAAVSFSAAHegLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSA-----SFACLSLCSGp 181
Cdd:smart00321    1 GATYVGCYSDNSSRTLAAVSSYAYH--NMSVEACSNFCFSAGYALAALENGNECYCGDSLPSTSvsasdSSQCSTTCSG- 77
                            90       100
                    ....*....|....*....|
gi 2217306237   182 ppPPAPTCRGPTLLQHVFPA 201
Cdd:smart00321   78 --YPAEVCGGPNRLSVYVLA 95
PLAT pfam01477
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. ...
3095-3198 2.82e-23

PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.


Pssm-ID: 396180  Cd Length: 115  Bit Score: 97.50  E-value: 2.82e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3095 YEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGDR-AFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAWFLQ 3173
Cdd:pfam01477    1 YQVKVVTGDELGAGTDADVYISLYGKVGESAQLEITLDNpDFERGAEDSFEIDTDWDVGAILKINLHWDNNGLSDEWFLK 80
                           90       100
                   ....*....|....*....|....*.
gi 2217306237 3174 HVIV-RDLQTARSAFFLVNDWLSVET 3198
Cdd:pfam01477   81 SITVeVPGETGGKYTFPCNSWVYGSK 106
LH2 smart00308
Lipoxygenase homology 2 (beta barrel) domain;
3093-3195 1.94e-19


Pssm-ID: 214608 [Multi-domain]  Cd Length: 105  Bit Score: 86.16  E-value: 1.94e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  3093 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLD--GDRAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKglSPAW 3170
Cdd:smart00308    1 GKYKVTVTTGGLDFAGTTASVSLSLVGAEGDGKESKLDylFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEHR--HPEW 78
                            90       100
                    ....*....|....*....|....*
gi 2217306237  3171 FLQHVIVRDLQTARSAFFLVNDWLS 3195
Cdd:smart00308   79 FLKSITVKDLPTGGKYHFPCNSWVY 103
CLECT cd00037
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ...
348-461 1.00e-16

C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platelet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent; including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations. 
Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.


Pssm-ID: 153057 [Multi-domain]  Cd Length: 116  Bit Score: 78.82  E-value: 1.00e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  348 HCYRLVVEKAAWLQAQEQCQAWaGAALAMVDSPAVQRFLVSRVTRSL--DVWIGFSTVQGVEVGPAPQGEAFSleSCQNW 425
Cdd:cd00037      1 SCYKFSTEKLTWEEAQEYCRSL-GGHLASIHSEEENDFLASLLKKSSssDVWIGLNDLSSEGTWKWSDGSPLV--DYTNW 77
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 2217306237  426 LPGEPHPATAEHCVRL--GPTGWCNTDLCSAPHSYVCE 461
Cdd:cd00037     78 APGEPNPGGSEDCVVLssSSDGKWNDVSCSSKLPFICE 115
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1444-1750 1.06e-12


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 72.78  E-value: 1.06e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1444 TSIKVNGSLGLELQqpylFSAVGRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRVAGWNEV-SRSEAWLNVTVKRR 1522
Cdd:COG3291      2 TATPTSGCAPLTVQ----FTDTSSGNATSYEWDFGDGTTSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAP 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1523 VRGLVVNASRTVVPLNGSVSFSTSLEAGSDVrYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVY 1602
Cdd:COG3291     78 NPGVTTVTTSTTVTTLANTANGGATTVVAGS-TVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVT 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1603 VLQLIEGLQVVGGGRYFPTNHTVQLQAVVRDGTNVSYSWTAWRDRGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAW 1682
Cdd:COG3291    157 TSVSTTDVTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTLT 236
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306237 1683 ADCTMDFVEPVGWLMVAASPNPAAVNTSVTLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPG 1750
Cdd:COG3291    237 GISTGDAGTPGTNTVTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVSTTAD 304
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
2986-3035 3.20e-10

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 58.17  E-value: 3.20e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 2217306237  2986 YTSLCQYFSEEDMVWRTEGLLPLEETSpRQAVCLTRHLTAFGASLFVPPS 3035
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNG-THTTCSCNHLTTFAVLMDVPPI 49
PHA03247 PHA03247
large tegument protein UL36; Provisional
426-758 2.65e-04


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 2.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  426 LPGEPHPATAEHCVrlgPTGWCnTDLCSAPHSYVCELQPGGPVQDAENLLVGAPSGDLQGPLTPLA-----------QQD 494
Cdd:PHA03247  2555 LPPAAPPAAPDRSV---PPPRP-APRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPlppdthapdppPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  495 GLSAPHEPVEVMVFPGL------------------RLSREAFLTTAEFGTQELRRPAqLRLQVYRLLSTAGTPENGSEPE 556
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPpperprddpapgrvsrprRARRLGRAAQASSPPQRPRRRA-ARPTVGSLTSLADPPPPPPTPE 2709
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  557 SRsPDNRTQLAPAcMPGGRWCPGANICLPLDASCHPQACANGCTSGPGLPGAPYAlwreflfsvPAGPPaqyslllpvcq 636
Cdd:PHA03247  2710 PA-PHALVSATPL-PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT---------TAGPP----------- 2767
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  637 vlacvlSPCVPRS-ATVLPVLVTSTVGRLLEVTLHGQDVLMLPGDLVGLQHDAGPGALLHCSPAPGHPGPRAPYLSANAS 715
Cdd:PHA03247  2768 ------APAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP 2841
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 2217306237  716 SWLPHLPAQLEGTWACPACALRLLAATEQLTVLLGLRPNPGLR 758
Cdd:PHA03247  2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVR 2884
 
Name Accession Description Interval E-value
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
27-2703 0e+00

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 4838.88  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237   27 DISNNKISTLEEGIFANLFNLSEINLSGNPFECDCGLAWLPRWAEEQQVRVVQPEAATCAGPGSLAGQPLLGIPLLDSGC 106
Cdd:TIGR00864    1 DISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEAALCAGPGALAGQPLLGIPLLDSGC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  107 GEEYVACLPDNSSGTVAAVS----FSAAHEGLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSASFACLSLCSGPP 182
Cdd:TIGR00864   81 DEEYVACLKDNSSGGGAARSelviFSAAHEGLFQPEACNAFCFSAGHGLAALGEQGECLCGAAQPSEANFACESLCSGPP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  183 PPPAPTCRGPTLLQHVFPASPGATLVGPHGPLASGQLAAFHIAAPLPVTATRWDFGDGSAEVDAAGP----AASHRYVLP 258
Cdd:TIGR00864  161 PPPAAACRGPQLLEHIFPALPGAPIQGPHGPIASGQLAAFHAAAPLAPTAMRWDFGDGSAEVDAAGAggttAASHKYGHP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  259 GRYHVTAVLALGAGSALLGTDVQVEAAPAALELVCPSSVQSDESLDLSIQNRGGSGLEAAYSIVALGEEPARAVHPLCPS 338
Cdd:TIGR00864  241 GRYHVSAMGALGAGKALAGGDVQVEAAPAALELHCPSLVQADESLDLSIQNRGGSDLDAAWKITAHGEEPAKASHPHCPK 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  339 DTEIFPGNGHCYRLVVEKAAWLQAQEQCQAWAGAALAMVDSPAVQRFLVSRVTRSLD--VWIGFSTVQGVEVGPAPQGEA 416
Cdd:TIGR00864  321 DGEIFEENGHCFQIVPEEAAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSLDrgVWIGFSDVNGAEKGPAHQGEA 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  417 FSLESCQNWLPGEPHPATAEHCVRLGPTGWCNTDLCSAPHSYVCELQPGGPVQDAENLLVGAPSGDLQGPLTPLAQQDGL 496
Cdd:TIGR00864  401 FEAEECEEGLAGEPHPARAEHCVRLDPRGQCNSDLCNAPHAYVCELNPGGPVPDAENFAMGAASFDLHGLLQALAAMDGL 480
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  497 SAP-HEPVEVMVFPGLRLSREAFLTTAEFGTQELRRPAQLRLQVYRL---LSTAGTPENGSEPESRSPDNRTQLAPACMP 572
Cdd:TIGR00864  481 PAPpHEGVEVLLFPALRFSRAAFLSSAEFGTQELRRPAHILFQIYRLrcrLPGAGGPACGPEAECRPPDNRSADAPACMK 560
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  573 GGRWCPGANICLPLDASCHPQACANGCTSGPGLPGAP----YALWREFLFSVPAGPPAQYSlllpvcqvlacvlspcvpr 648
Cdd:TIGR00864  561 GEQWCPFAHICLPLDAPCHPQACANGCSQGHGLPGAArmplYALQREFLFSLPAGPAAHVL------------------- 621
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  649 satvlpvlvtstvgrlleVTLHGQDVLMLPGDLVGLQHDAGPGALLHCSPAPGHPGPRAPYLSANASSWLPH-------- 720
Cdd:TIGR00864  622 ------------------LQDHGEDLLMLPGDLIALQHDAGPAALIHCQPAPGHPGPRAPVFAANASEWFGHnntpvppd 683
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  721 ----------------LPAQLEGTWA----CPACALRLLAATEQLTVLLGLRPNPGLRLPGRYEVRAEVGNGVSRHNLSC 780
Cdd:TIGR00864  684 nlagdgadplpdpeldLKALLEGTRAswleCAACAIRLLAAGEQETRLLGAELNAGLPLPGLYELLAESAKGSDLHNASC 763
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  781 SFDVVSPVAGLRVIYPAPRDGRLYVPTNGSALVLQVDSGANATATARWPGGSVSARFENVCPALVA-------TFVPGCP 853
Cdd:TIGR00864  764 SFDVLPPLAGLRVIHPAPQDGRLFLESNGSALLLQVDSGANAEAKAFWPGGNSSARFENVCPAEFAsrlchpsTFEGGCA 843
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  854 WETNDTLFSVVALPWLSEGEHV----VDVVVENSASRANLSLRVTAEEPICGLRATPSPEARVLQGVLVRYSPVVEAGSD 929
Cdd:TIGR00864  844 EEAEDSLFAVLALNWLKEGEHTgpvqVDLMAENNASEANLSLLVQAEEPICGLRAQPHPAARVLMESLVRYSASVEAGSD 923
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  930 MVFRWTINDKQSLTFQNVVFNVIYQSAAVFKLSpedaamavLTASNHVSNVTVNYNVTVERMNRMQGLQVSTVPAVLSPN 1009
Cdd:TIGR00864  924 MTFKWTIDDKPFFTFQNTVFNVIYQHAAVFKLS--------LTAMNHVSNLTEDFNVTVDRLNPMQGLQVKGVPAVLPPG 995
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1010 ATLALTAGVLVDSAVEVAFLWTFGDGEQALHQFQPPYNESFPVPDPSVAQVLVEHNVMHTYAAPGEYLLTVLASNAFENL 1089
Cdd:TIGR00864  996 ATLALTAGVLIDMAVEAAFLWSFGDGEQALFEFKPPYNESFPCPDPSPAQVLLEHNVMHIYAAPGEYLATVLASNAFENI 1075
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1090 TQQVPVSVRASLPSVAVGVSDGVLVAGRPVTFYPHPLPSPGGVLYTWDFGDGSPVLTQSQPAANHTYASRGTYHVRLEVN 1169
Cdd:TIGR00864 1076 SQQINMSVRAILPRVAIGTEDGLLLAGKPADFEAHPLPSPGGIHYEWDFGDGSALLQGRQPAAAHTFAKRGPFHVCLEVN 1155
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1170 NTVSGAAAQADVRVFEELRGLSVDMSLAVEQGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTV 1249
Cdd:TIGR00864 1156 NTISGAAACADMFAFEEIEGLSADMSLATELGAATTVRAALQSGDNITWTFDMGDGKSLSGPEATVEHKYAKAGNCTVNI 1235
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1250 GAASPAGHLARSLHVLVFVLEVLRVEPAACIPTQPDARLTAYVTGNPAHYLFDWTFGDGSSNTTVRGCPTVTHNFTRSGT 1329
Cdd:TIGR00864 1236 GAANAAGHGARIIHVEVFVFEVAGIEPAACIGEHADANFRARVSGNAAHYLFDWSFGDGSPNETHHGCPGISHNFRGNGT 1315
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1330 FPLALVLSSRVNRAHYFTSICVEPEVGNVTLQPERQFVQLGDEAWLVACAWPPFPYRYTWDFGTEEAAPTRARGPEVTFI 1409
Cdd:TIGR00864 1316 FPLALTISSGVNKAHFFTQICVEPELGKISLQAEKQFFALGDEAQFQACAEPEFNYRYEWDFGGEEAAPLPAAGAEVTFI 1395
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1410 YRDPGSYLVTVTASNNISAANDSALVEVQEPVLVTSIKVNGSLG--LELQQPYLFSAVGRGRPASYLWDLGDGGWLEGPE 1487
Cdd:TIGR00864 1396 YNDPGCYLVTVAASNNISAANDSALIEVLEPVGATSFKHNGSHGnnLELGQPYLFSAFGRARNASYLWDFGDGGLLEGPE 1475
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1488 VTHAYNSTGDFTVRVAGWNEVSRSEAWLNVTVKRRVRGLVVNASRTVVPLNGSVSFSTSLEAGSDVRYSWVLCDRCTPIP 1567
Cdd:TIGR00864 1476 ILHAFNSPGDFNIRLAAANEVGKNEATLNVAVKARVRGLTINASLTNVPLNGSVHFEAHLDAGDDVRFSWILCDHCTPIF 1555
                         1610      1620      1630      1640      1650      1660      1670      1680
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1568 GGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVYVLQLIEGLQVVGG-------------GRYFPTNHTVQLQAVVRDG 1634
Cdd:TIGR00864 1556 GGNTIFYTFRSVGTFNIIVTAENDVGAAQASIFLFVLQEIEGLQILGEtaegggggvqeldGCYFETNHTVQFHAGFKDG 1635
                         1690      1700      1710      1720      1730      1740      1750      1760
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1635 TNVSYSWTAWRD---RGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWADCTMDFVEPVGWLMVAASPNPAAVNTSV 1711
Cdd:TIGR00864 1636 TNLSFSWNAILDnepDGPAFAGSGKGAKLNPLEAGPCDIFLQAANLLGQATADCTIDFLEPAGNLMLAASDNPAAVNALI 1715
                         1770      1780      1790      1800      1810      1820      1830      1840
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1712 TLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPGLHLVTMTAGNPLGSANATVEVDVQVPVSGLSIRASEPGGS- 1790
Cdd:TIGR00864 1716 NLSAELAEGSGLQYRWFLEEGDDLETSEPFMSHSFPSAGLHLVTMKAFNELGSANASEEVDVQEPISGLKIRAADAGEQn 1795
                         1850      1860      1870      1880      1890      1900      1910      1920
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1791 FVAAGSSVPFWGQLATGTNVSWCWAVPGGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSATYNLTAEEPIVGLVLWAS 1870
Cdd:TIGR00864 1796 FFAADSSVCFQGELATGTNVSWCWAIDGGSSKMGKHACMTFPDAGTFAIRLNASNAVSGKSASREFFAEEPIFGLELKAS 1875
                         1930      1940      1950      1960      1970      1980      1990      2000
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1871 SKVVAPGQLVHFQILLAAGSAVTFRLQVGGANPEVL-PGPRFSHSFPRVGDHVVSVRGKNHVSWAQAQVRIVVLEAVSGL 1949
Cdd:TIGR00864 1876 KKIAAIGEKVEFQILLAAGSAVNFRLQIGGAAPEVLqPGPRFSHSFPRVDDHMVNLRAKNEVSCAQANLHIEVLEAVRGL 1955
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1950 QVPNCCEPGIATGTERNFTARVQRGSRVAYAWYFSLQKVQGDSLVILSGRDVTYTPVAAGLLEIQVRAFNALGSENRTLV 2029
Cdd:TIGR00864 1956 QIPDCCAAGIATGEEKNFTANVQRGKPVAFAWTFDLHHLHGDSLVIHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQ 2035
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2030 LEVQDAVQYVALQSGP--CFTNRSAQFEAATSPSPRRVAYHWDFGDGSPGQDTDEPRAEHSYLRPGDYRVQVNASNLVSF 2107
Cdd:TIGR00864 2036 LEAQDALMDAALQAGPqdCFTNKMAQFEAATSPKPNFMACHWDFGDGSAGQDTDEPRAEHEYLHPGDYRVQVNASNLVSF 2115
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2108 FVAQATVTVQVLACREPEVDVVLPLQVLMRRSQRNYLEAHVDLRDCVTYQTEYRWEVYRTASCQRPGRPARVALPG---- 2183
Cdd:TIGR00864 2116 FSAHAEINVQVLACEEPEVDVVLALQLAIRRSQPNLLEAHVDLKDCLRYGAEYLWEILRAASCDNDGHFARGALNGatrs 2195
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2184 ---------VDVSRPRLVLPRLALPVGHYCFVFVVSFGDTPLTQSIQANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGS 2254
Cdd:TIGR00864 2196 fpviplpaeVDVQRLQLSLPKLALAAGHYCFVFSLSFEDTPLKKAACANLGVAAARLMPIIEGGSYRVWSDTQDLQLDAE 2275
                         2330      2340      2350      2360      2370      2380      2390      2400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2255 ESYDPNLEDGDQTPLSFHWACVASTQREAGGCALNFGPRG-SSTVTIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLI 2333
Cdd:TIGR00864 2276 ESYDPNLDDDDQSLLHFHWACQASSKGEAGCCALNFGLGGkGPTLGIPGEELAAGIEYTFKLSIGKAGMKEEATNQTVLI 2355
                         2410      2420      2430      2440      2450      2460      2470      2480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2334 RSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGRCLNCSSGSKRGRWAARTFSNKTLVLDETTTSTGSAGMRLVLRRGVL 2413
Cdd:TIGR00864 2356 QSGHIPIVSLECVSCKAQALYEVSQNSYVYLEGRCLNCQSGFHRGRWAARTFQNDTLVLDESSTSTGSAGMNLVLRQGVL 2435
                         2490      2500      2510      2520      2530      2540      2550      2560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2414 RDGEGYTFTLTVLGRSGEEEGCASIRLSPNRPPLGGSCRLFPLG--------------AVHALTTKVHFECTGWHDAEDA 2479
Cdd:TIGR00864 2436 HDGEGYNFTLHVLDDSGDEEGAASIRLHHNMPPDGGECHLFPGGetgqehgdkedevwAIEALLDKVHFECSGWHDAEDA 2515
                         2570      2580      2590      2600      2610      2620      2630      2640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2480 GAPLVYALLLRRCRQGHCEEFCVYKGSLSSYGAVLPPGFR-PHFEVGLAVVVQDQLGAAVVALNRSLAITLPEPNGSATG 2558
Cdd:TIGR00864 2516 EAPLLYALLLNRCRDDHCEEFCVYKGSLPEHGAFLPPGFRsAHFEVGLAITVEDHLGAAIRALNKSIAITLPDPNGEASG 2595
                         2650      2660      2670      2680      2690      2700      2710      2720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2559 LTVWLHGLTASVLPGLLRQADPQHVIEYSLALVTVLNEYERALDVAAEPKHERQHRAQIRKNITETLVSLRVHTVDDIQQ 2638
Cdd:TIGR00864 2596 LPHWLHDLIASKLKGLLDQADFQHVIELSLALITVLNEYEQALDSAAEPKHERGHRAQIRKNITEALTALDLHTVDDIQQ 2675
                         2730      2740      2750      2760      2770      2780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217306237 2639 IAAALAQCMGPSRELVCRSCLKQTLHKLEAMMLILQAETTAGTVTPTAIGDSILNITGDLIHLAS 2703
Cdd:TIGR00864 2676 IAAALAQCMAPSREFICEECLKQTLHKLEAMLEILQADTKAGIVTPTAIADNILNIMGDLIHLAS 2740
REJ pfam02010
REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor ...
2146-2589 1.17e-132

REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly Swiss:Q26627. The function of this domain is unknown. The domain is 600 amino acids long, so it is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains.


Pssm-ID: 366875 [Multi-domain]  Cd Length: 448  Bit Score: 424.99  E-value: 1.17e-132
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2146 AHVDLRDCV-TYQTEYRWEVYRTASC---QRPGRPARVALPGVDvsrprlvLPRLALPVGHYCFVFVVSFGDTP-LTQSI 2220
Cdd:pfam02010    1 ASVELNGCFsAYTIDYLWSVFTVSSNlnlQTISSPKDLVLPQLT-------IPSGTLPYGTYVFTLTVSLSSTPsLAGTD 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2221 QANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGSESYDPNLEDGDQTPLSFHWACVAST------QREAGGCA-----LN 2289
Cdd:pfam02010   74 IITVTVQPSPLVAVIDGGSSRVVGYNQDLTLDGSESYDPDVDPGSSSGLTYLWSCRRSSsgdnplLNNDPVCFsdqneGT 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2290 FGPRGSSTVTIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRssYVYLEGRCL 2369
Cdd:pfam02010  154 LLQSTSSSLTIPASTLQANVTYTFKLTVSKGSRNSASTTQTILVVDGNPPIIILSCISNCNRKNNPVDR--LVLLASTCL 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2370 NCSSGSKRG--RWAARTFSNKTLVLD--ETTTSTGSAGMRLVLRRGVLRDGEGYTFTLTVLGRSGEEEGCASIRLSPNRP 2445
Cdd:pfam02010  232 NCSSDLSDVtyRWLSLGSENTSLVLDqlNSQTSTGRSGPYLVIKAGVLQSGVSYRFTLIVTVYPGLVSGLASISFITNAP 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 2446 PLGGSCRLFPLGAvHALTTKVHFECTGWHDAEDagaPLVYALLLRRCRQGHCEEFCVYKGSLS-SYGAVLPPGFRPH-FE 2523
Cdd:pfam02010  312 PTGGTCSVTPTEG-TALETKFTVTCQGWTDDDL---PLTYQFGDISFREASEEWFLLYEGSSQiSISTFLPPGLPANdYQ 387
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306237 2524 VGLAVVVQDQLGAAvVALNRSLAITLPEPNGSatglTVWLHGLTASVLPGLLRQADPQHVIEYSLA 2589
Cdd:pfam02010  388 VTVVVVVYDSLGAA-TSVSLTITVTPPSSSDE----LLYFLLGTTSDLSALLQSGDPQQAAQLILA 448
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
3093-3212 9.28e-58

PLAT/LH2 domain of polycystin-1-like proteins. Polycystins are a large family of multidomain membrane proteins found in fish, invertebrates, and mammals, including humans; they are widely expressed in various cell types, and their biological functions remain poorly defined. In humans, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to cause autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane-bound proteins.


Pssm-ID: 238850  Cd Length: 120  Bit Score: 196.34  E-value: 9.28e-58
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3093 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGD--RAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAW 3170
Cdd:cd01752      1 YLYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPekPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSW 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 2217306237 3171 FLQHVIVRDLQTARSAFFLVNDWLSVETEanGGLVEKEVLAA 3212
Cdd:cd01752     81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120
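Each hit in this listing carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". As a minimal sketch, assuming that plain-text layout holds for every hit, the line could be parsed as follows. The names `STATS_RE` and `parse_stats` are hypothetical helpers for illustration and are not part of any NCBI tool.

```python
import re

# Hypothetical pattern for one per-hit statistics line as it appears in this
# plain-text listing; the "[Multi-domain]" tag is optional.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?P<multi>\s*\[Multi-domain\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>\S+)"
)

def parse_stats(line):
    """Return the statistics of one hit as a dict, or None if the line does not match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("multi") is not None,
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "evalue": float(m.group("evalue")),  # "0e+00" parses to 0.0
    }

if __name__ == "__main__":
    print(parse_stats("Pssm-ID: 238850  Cd Length: 120  Bit Score: 196.34  E-value: 9.28e-58"))
```

Returning `None` for non-matching lines lets a caller scan the whole listing line by line and keep only the statistics records.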
Polycystin_dom pfam20519
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
3688-3866 5.32e-56

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain displays a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


Pssm-ID: 466668  Cd Length: 199  Bit Score: 194.56  E-value: 5.32e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3688 AFLAITRSEELWPWMAHVLLPYVHGNQS------------SPELGPPRLRQVRLQEA--LYPDPPGPRVHTCSAAGGFST 3753
Cdd:pfam20519    1 GLLTVTDLDDIWDWLSSVLLPALHSNKTpsglpgsfiayeSLLLGVPRLRQLRVRNSscLVHDKFVREINECHAGYSPPS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3754 SDY----DVGWESPHNGSGTW-AYSAPDLL-GAWSWGSCAVYDSGGYVQELGLSLEESRDRLRFLQLHNWLDNRSRAVFL 3827
Cdd:pfam20519   81 EDRklysALPYKPVHYGSKYWfIYTPPGLLmGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVFV 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 2217306237 3828 ELTRYSPAVGLHAAVTLRLEFPAAGRALAALSVRPFALR 3866
Cdd:pfam20519  161 DFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQSVKLL 199
PKD_channel pfam08016
Polycystin cation channel; This family contains the cation channel region from group II of ...
3867-4088 9.39e-44

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 160.52  E-value: 9.39e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3867 RLSAGLSLPLLT-SVCLLLFAVHFAVAEARTWHREGR------WRVLRLgawarwLLVALTAATALVRLAQLGAADRQWT 3939
Cdd:pfam08016    1 RYVTNRSLFILLcEIVFVVFFLYFVVEEILKIRKHRPsylrsvWNLLDL------AIVILSVVLIVLNIYRDFLADRLIK 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3940 rFVRGRPRRFTSFDQVAQLSSAARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELLGVTLGLVVLGVAYAQLA 4019
Cdd:pfam08016   75 -SVEASPVTFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFG 153
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217306237 4020 ILLVSSCVDSLWSVAQALLVLCPgtglsTLCPAESWH--------LSPLLCVGLWALRLWGALRLGAVILRWRYHAL 4088
Cdd:pfam08016  154 YLLFGTQAPNFSNFVKSILTLFR-----TILGDFGYNeifsgnrvLGPLLFLTFVFLVIFILLNLFLAIINDSYVEV 225
PLAT_repeat cd01756
PLAT/LH2 domain repeats of a family of proteins with unknown function. In general, PLAT/LH2 ...
3093-3212 6.21e-28

PLAT/LH2 domain repeats of a family of proteins with unknown function. In general, PLAT/LH2 consists of an eight-stranded beta-barrel, and its proposed function is to mediate interaction with lipids or membrane-bound proteins.


Pssm-ID: 238854  Cd Length: 120  Bit Score: 111.11  E-value: 6.21e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3093 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLD---GDRAFHRNSLDIFRIATPhSLGSVWKIRVWHDNKGLSPA 3169
Cdd:cd01756      1 VTYEVTVKTGDVKGAGTDANVFITLYGENGDTGKRKLKksnNKNKFERGQTDKFTVEAV-DLGKLKKIRIGHDNSGLGAG 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 2217306237 3170 WFLQHVIVRDLQTARSAFFLVNDWLSveTEANGGLVEKEVLAA 3212
Cdd:cd01756     80 WFLDKVEIREPGTGDEYTFPCNRWLD--KDEDDGQIVRELYPS 120
WSC smart00321
present in yeast cell wall integrity and stress response component proteins; Domain present in ...
107-201 2.41e-24

present in yeast cell wall integrity and stress response component proteins; Domain present in WSC proteins, polycystin and fungal exoglucanase


Pssm-ID: 214616 [Multi-domain]  Cd Length: 95  Bit Score: 99.85  E-value: 2.41e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237   107 GEEYVACLPDNSSGTVAAVSFSAAHegLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSA-----SFACLSLCSGp 181
Cdd:smart00321    1 GATYVGCYSDNSSRTLAAVSSYAYH--NMSVEACSNFCFSAGYALAALENGNECYCGDSLPSTSvsasdSSQCSTTCSG- 77
                            90       100
                    ....*....|....*....|
gi 2217306237   182 ppPPAPTCRGPTLLQHVFPA 201
Cdd:smart00321   78 --YPAEVCGGPNRLSVYVLA 95
PLAT pfam01477
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. ...
3095-3198 2.82e-23

PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.


Pssm-ID: 396180  Cd Length: 115  Bit Score: 97.50  E-value: 2.82e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3095 YEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGDR-AFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAWFLQ 3173
Cdd:pfam01477    1 YQVKVVTGDELGAGTDADVYISLYGKVGESAQLEITLDNpDFERGAEDSFEIDTDWDVGAILKINLHWDNNGLSDEWFLK 80
                           90       100
                   ....*....|....*....|....*.
gi 2217306237 3174 HVIV-RDLQTARSAFFLVNDWLSVET 3198
Cdd:pfam01477   81 SITVeVPGETGGKYTFPCNSWVYGSK 106
LH2 smart00308
Lipoxygenase homology 2 (beta barrel) domain;
3093-3195 1.94e-19

Lipoxygenase homology 2 (beta barrel) domain;


Pssm-ID: 214608 [Multi-domain]  Cd Length: 105  Bit Score: 86.16  E-value: 1.94e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  3093 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLD--GDRAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKglSPAW 3170
Cdd:smart00308    1 GKYKVTVTTGGLDFAGTTASVSLSLVGAEGDGKESKLDylFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEHR--HPEW 78
                            90       100
                    ....*....|....*....|....*
gi 2217306237  3171 FLQHVIVRDLQTARSAFFLVNDWLS 3195
Cdd:smart00308   79 FLKSITVKDLPTGGKYHFPCNSWVY 103
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1698-1767 1.43e-17

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 79.74  E-value: 1.43e-17
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1698 VAASPNPAAVNTSVTLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPGLHLVTMTAGNPLGSANA 1767
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
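The paired rows in each alignment block (the query row starting with "gi" and the model row starting with "Cdd:") can be compared column by column. The following is a minimal sketch of a percent-identity calculation over the residue strings only, with the row labels and coordinates already stripped. It assumes that lowercase letters and "-" mark positions outside the aligned region, which appears to be the convention in this listing; `percent_identity` is a hypothetical helper name.

```python
def percent_identity(query_row, subject_row):
    """Percent of aligned columns in which both rows carry the same residue.

    Only columns where both rows hold an uppercase letter are counted as
    aligned; gaps ('-') and lowercase (assumed unaligned) residues are skipped.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query_row, subject_row):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    if aligned == 0:
        return 0.0
    return 100.0 * identical / aligned

if __name__ == "__main__":
    # Short fragment taken from the WSC alignment in this listing.
    print(percent_identity("AAHegLLQ", "AYH--NMS"))
```

For a multi-block alignment, the residue strings of successive blocks would be concatenated per row before calling the function.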
CLECT smart00034
C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function ...
336-461 1.50e-17

C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function as calcium-dependent carbohydrate binding modules.


Pssm-ID: 214480 [Multi-domain]  Cd Length: 124  Bit Score: 81.49  E-value: 1.50e-17
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237   336 CPSDTEIFpgNGHCYRLVVEKAAWLQAQEQCQAwAGAALAMVDSPAVQRFL---VSRVTRSLDVWIGFSTVQGVEVGPAP 412
Cdd:smart00034    1 CPSGWISY--GGKCYKFSTEKKTWEDAQAFCQS-LGGHLASIHSEAENDFVaslLKNSGSSDYYWIGLSDPDSNGSWQWS 77
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|
gi 2217306237   413 QGeaFSLESCQNWLPGEPhPATAEHCVRLGPTGWC-NTDLCSAPHSYVCE 461
Cdd:smart00034   78 DG--SGPVSYSNWAPGEP-NNSSGDCVVLSTSGGKwNDVSCTSKLPFVCE 124
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1103-1184 7.12e-17

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 77.88  E-value: 7.12e-17
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  1103 SVAVGVSDGVLVAGRPVTFYPHPLPSPGGVLYTWDFGDGSpvlTQSQPAANHTYASRGTYHVRLEVNNTVSGAAAQADVR 1182
Cdd:smart00089    1 VADVSASPTVGVAGESVTFTATSSDDGSIVSYTWDFGDGT---SSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVV 77

                    ..
gi 2217306237  1183 VF 1184
Cdd:smart00089   78 VQ 79
CLECT cd00037
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ...
348-461 1.00e-16

C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platelet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent; including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations. 
Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.


Pssm-ID: 153057 [Multi-domain]  Cd Length: 116  Bit Score: 78.82  E-value: 1.00e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  348 HCYRLVVEKAAWLQAQEQCQAWaGAALAMVDSPAVQRFLVSRVTRSL--DVWIGFSTVQGVEVGPAPQGEAFSleSCQNW 425
Cdd:cd00037      1 SCYKFSTEKLTWEEAQEYCRSL-GGHLASIHSEEENDFLASLLKKSSssDVWIGLNDLSSEGTWKWSDGSPLV--DYTNW 77
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 2217306237  426 LPGEPHPATAEHCVRL--GPTGWCNTDLCSAPHSYVCE 461
Cdd:cd00037     78 APGEPNPGGSEDCVVLssSSDGKWNDVSCSSKLPFICE 115
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1867-1936 1.48e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 76.66  E-value: 1.48e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1867 LWASSKVVAPGQLVHFQILLAAGSAVTFRLQVGGANPEVLPGPRFSHSFPRVGDHVVSVRGKNHVSWAQA 1936
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1612-1683 2.06e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 76.27  E-value: 2.06e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217306237 1612 VVGGGRYFPTNHTVQLQAVVRDGTNVSYSWTAWRdrGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWA 1683
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGD--SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PLAT cd00113
PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. ...
3094-3195 2.21e-16

PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. It consists of an eight-stranded beta-barrel. The domain can be found in various domain architectures, as in the case of lipoxygenases, alpha toxin, lipases and polycystin, but also as a single domain or as repeats. The putative function of this domain is to facilitate access to sequestered membrane or micelle bound substrates.


Pssm-ID: 238061  Cd Length: 116  Bit Score: 77.76  E-value: 2.21e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3094 KYEILVKTGWGRGSGTTAHVGIMLYGVD-SRSGHRHLDGDRAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAWFL 3172
Cdd:cd00113      2 RYTVTIKTGDKKGAGTDSNISLALYGENgNSSDIPILDGPGSFERGSTDTFQIDLKLDIGDITKVYLRRDGSGLSDGWYC 81
                           90       100
                   ....*....|....*....|...
gi 2217306237 3173 QHVIVRDLQTARSAFFLVNDWLS 3195
Cdd:cd00113     82 ESITVQALGTKKVYTFPVNRWVL 104
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1528-1597 3.02e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 75.89  E-value: 3.02e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1528 VNASRTVVPLNGSVSFSTSLEAGSDVRYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQD 1597
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
2043-2110 4.00e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 75.50  E-value: 4.00e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306237 2043 SGPCFTNRSAQFEAaTSPSPRRVAYHWDFGDgSPGQDTDEPRAEHSYLRPGDYRVQVNASNLVSFFVA 2110
Cdd:pfam00801    5 GTVVAAGQPVTFTA-TLADGSNVTYTWDFGD-SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1106-1177 4.96e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 75.12  E-value: 4.96e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217306237 1106 VGVSDGVLVAGRPVTFYPHpLPSPGGVLYTWDFGDgSPVLTQSQPAANHTYASRGTYHVRLEVNNTVSGAAA 1177
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTAT-LADGSNVTYTWDFGD-SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1360-1431 9.81e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 74.35  E-value: 9.81e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217306237 1360 LQPERQFVQLGDEAWLVACAWPPFPYRYTWDFGteEAAPTRARGPEVTFIYRDPGSYLVTVTASNNISAAND 1431
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFG--DSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
2035-2117 3.15e-15

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 73.25  E-value: 3.15e-15
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  2035 AVQYVALQSGPcfTNRSAQFEAATSPSPRRVAYHWDFGDGSpgqDTDEPRAEHSYLRPGDYRVQVNASNLVSFFVAQATV 2114
Cdd:smart00089    2 ADVSASPTVGV--AGESVTFTATSSDDGSIVSYTWDFGDGT---SSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTV 76

                    ...
gi 2217306237  2115 TVQ 2117
Cdd:smart00089   77 VVQ 79
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1195-1260 7.91e-15

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 72.03  E-value: 7.91e-15
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306237 1195 SLAVEQGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTVGAASPAGHLAR 1260
Cdd:pfam00801    5 GTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1787-1852 9.61e-15

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 71.65  E-value: 9.61e-15
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306237 1787 PGGSFVAAGSSVPFWGQLATGTNVSWCWAVPG--GSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSA 1852
Cdd:pfam00801    3 ASGTVVAAGQPVTFTATLADGSNVTYTWDFGDspGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1104-1178 3.70e-14

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 70.22  E-value: 3.70e-14
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306237 1104 VAVGVSDGVLV-AGRPVTFYPHPLPSPGGVLYTWDFGDGSpVLTQSQPAANHTYASRGTYHVRLEVNNTVSGAAAQ 1178
Cdd:cd00146      1 PTASVSAPPVAeLGASVTFSASDSSGGSIVSYKWDFGDGE-VSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTK 75
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1525-1603 1.19e-13

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; The polycystic kidney disease 1 protein contains 14 of these repeats, which are also present in other proteins such as microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 69.02  E-value: 1.19e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  1525 GLVVNASRTVVPLNGSVSFS-TSLEAGSDVRYSWVLCDRctPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVYV 1603
Cdd:smart00089    1 VADVSASPTVGVAGESVTFTaTSSDDGSIVSYTWDFGDG--TSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVV 78
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1443-1520 2.94e-13

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; The polycystic kidney disease 1 protein contains 14 of these repeats, which are also present in other proteins such as microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 67.86  E-value: 2.94e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  1443 VTSIKVNGSLGLeLQQPYLFSAV--GRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRVAGWNEVSRSEAWLNVTVK 1520
Cdd:smart00089    1 VADVSASPTVGV-AGESVTFTATssDDGSIVSYTWDFGDGTSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1444-1750 1.06e-12

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 72.78  E-value: 1.06e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1444 TSIKVNGSLGLELQqpylFSAVGRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRVAGWNEV-SRSEAWLNVTVKRR 1522
Cdd:COG3291      2 TATPTSGCAPLTVQ----FTDTSSGNATSYEWDFGDGTTSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAP 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1523 VRGLVVNASRTVVPLNGSVSFSTSLEAGSDVrYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVY 1602
Cdd:COG3291     78 NPGVTTVTTSTTVTTLANTANGGATTVVAGS-TVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVT 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1603 VLQLIEGLQVVGGGRYFPTNHTVQLQAVVRDGTNVSYSWTAWRDRGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAW 1682
Cdd:COG3291    157 TSVSTTDVTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTLT 236
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306237 1683 ADCTMDFVEPVGWLMVAASPNPAAVNTSVTLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPG 1750
Cdd:COG3291    237 GISTGDAGTPGTNTVTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVSTTAD 304
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1448-1513 2.36e-12

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 64.72  E-value: 2.36e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217306237 1448 VNGSLGLELQQPYLFSA-VGRGRPASYLWDLGD--GGWLEGPEVTHAYNSTGDFTVRVAGWNEVSRSEA 1513
Cdd:pfam00801    2 SASGTVVAAGQPVTFTAtLADGSNVTYTWDFGDspGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1700-1774 2.51e-12

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; The polycystic kidney disease 1 protein contains 14 of these repeats, which are also present in other proteins such as microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 65.17  E-value: 2.51e-12
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306237  1700 ASPNPAAVNTSVTLSAELAG-GSGVVYTWSLEEGLSWetSEPFTTHSFPTPGLHLVTMTAGNPLGSANATVEVDVQ 1774
Cdd:smart00089    6 ASPTVGVAGESVTFTATSSDdGSIVSYTWDFGDGTSS--TGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1794-1853 3.75e-12

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; The polycystic kidney disease 1 protein contains 14 of these repeats, which are also present in other proteins such as microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 64.39  E-value: 3.75e-12
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217306237  1794 AGSSVPFWGQLAT-GTNVSWCWAVPGGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSAT 1853
Cdd:smart00089   13 AGESVTFTATSSDdGSIVSYTWDFGDGTSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASAT 73
WSC pfam01822
WSC domain; This domain is involved in carbohydrate binding.
110-180 6.23e-12

WSC domain; This domain is involved in carbohydrate binding.


Pssm-ID: 460348  Cd Length: 82  Bit Score: 64.02  E-value: 6.23e-12
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217306237  110 YVACLPDNSsGTVAAVSFSAAHEGLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSASFA----CLSLCSG 180
Cdd:pfam01822    1 YLGCYSDGT-GGRRLLLGSSGDYDDMTPEKCIAFCSAAGYTYAGLEYGGECYCGNSLPSGSALAdssdCNTPCPG 74
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1955-2026 1.96e-11

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 62.02  E-value: 1.96e-11
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217306237 1955 CEPGIATGTERNFTARVQRGSRVAYAWYFslqkvqGDSLV-ILSGRDVTYTPVAAGLLEIQVRAFNALGSENR 2026
Cdd:pfam00801    4 SGTVVAAGQPVTFTATLADGSNVTYTWDF------GDSPGtSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
2048-2118 2.08e-11

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 62.51  E-value: 2.08e-11
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217306237 2048 TNRSAQFEAATSPSPRRVAYHWDFGDGSpGQDTDEPRAEHSYLRPGDYRVQVNASNLVSfFVAQATVTVQV 2118
Cdd:cd00146     13 LGASVTFSASDSSGGSIVSYKWDFGDGE-VSSSGEPTVTHTYTKPGTYTVTLTVTNAVG-SSSTKTTTVVV 81
LRRCT smart00082
Leucine rich repeat C-terminal domain;
55-107 2.24e-11

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 61.29  E-value: 2.24e-11
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 2217306237    55 NPFECDCGLAWLPRWAeEQQVRVVQPEAATCAGPGSLAGqPLLGIPLLDSGCG 107
Cdd:smart00082    1 NPFICDCELRWLLRWL-QANEHLQDPVDLRCASPSSLRG-PLLELLHSEFKCP 51
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1527-1603 5.30e-11

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 61.36  E-value: 5.30e-11
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217306237 1527 VVNASRTVVPLNGSVSFS-TSLEAGSDVRYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQ-DSIFVYV 1603
Cdd:cd00146      3 ASVSAPPVAELGASVTFSaSDSSGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSStKTTTVVV 81
CLECT_DC-SIGN_like cd03590
C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific ...
336-461 5.34e-11

C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR); This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in humans. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues as well as on pathogens, including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV, enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor that binds and internalizes desialylated glycoproteins having terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit-mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2,6GalNAc and Sia alpha2,6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glycoconjugates containing mannose and related sugars, mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled coil of alpha helices.


Pssm-ID: 153060 [Multi-domain]  Cd Length: 126  Bit Score: 62.71  E-value: 5.34e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  336 CPSDTEIFpgNGHCYRLVVEKAAWLQAQEQCQAwAGAALAMVDSPAVQRFLVSRVTRSLDVWIGFStVQGVE-----V-G 409
Cdd:cd03590      1 CPTNWKSF--QSSCYFFSTEKKSWEESRQFCED-MGAHLVIINSQEEQEFISKILSGNRSYWIGLS-DEETEgewkwVdG 76
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306237  410 PAPqgeafsLESCQNWLPGEP--HPATAEHCVRLGPT--GWcNTDLCSAPHSYVCE 461
Cdd:cd03590     77 TPL------NSSKTFWHPGEPnnWGGGGEDCAELVYDsgGW-NDVPCNLEYRWICE 125
LRR_8 pfam13855
Leucine rich repeat;
3-57 8.86e-11

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 59.85  E-value: 8.86e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217306237    3 DVSHNLLRALDVGLLANLSALAELDISNNKISTLEEGIFANLFNLSEINLSGNPF 57
Cdd:pfam13855    7 DLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
207-274 1.05e-10

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 60.09  E-value: 1.05e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217306237  207 LVGPHGPLASGQLAAFHIA-APLPVTATRWDFGDgSAEVDAAGPAASHRYVLPGRYHVTAVLALGAGSA 274
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATlADGSNVTYTWDFGD-SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSA 68
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1271-1352 2.03e-10

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; The polycystic kidney disease 1 protein contains 14 of these repeats, which are also present in other proteins such as microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 59.77  E-value: 2.03e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  1271 VLRVEPAACIPTQP-DARLTAYVTGNPAHYLFDWTFGDGssnTTVRGcPTVTHNFTRSGTFPLALVLSSRVNRAHYFTSI 1349
Cdd:smart00089    1 VADVSASPTVGVAGeSVTFTATSSDDGSIVSYTWDFGDG---TSSTG-PTVTHTYTKPGTYTVTLTVTNAVGSASATVTV 76

                    ...
gi 2217306237  1350 CVE 1352
Cdd:smart00089   77 VVQ 79
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
2986-3035 3.20e-10

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 58.17  E-value: 3.20e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 2217306237  2986 YTSLCQYFSEEDMVWRTEGLLPLEETSpRQAVCLTRHLTAFGASLFVPPS 3035
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNG-THTTCSCNHLTTFAVLMDVPPI 49
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1382-1438 4.89e-10

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; The polycystic kidney disease 1 protein contains 14 of these repeats, which are also present in other proteins such as microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 58.62  E-value: 4.89e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 2217306237  1382 PFPYRYTWDFGTEeaapTRARGPEVTFIYRDPGSYLVTVTASNNISAANDSALVEVQ 1438
Cdd:smart00089   27 GSIVSYTWDFGDG----TSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1620-1687 3.84e-09

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; The polycystic kidney disease 1 protein contains 14 of these repeats, which are also present in other proteins such as microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 55.92  E-value: 3.84e-09
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217306237  1620 PTNHTVQLQAVVR-DGTNVSYSWtawrDRGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWADCTM 1687
Cdd:smart00089   12 VAGESVTFTATSSdDGSIVSYTW----DFGDGTSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTV 76
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1197-1256 5.17e-09

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 55.58  E-value: 5.17e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217306237 1197 AVEQGAPVVVSAAVQ-TGDNITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTVGAASPAG 1256
Cdd:cd00146     10 VAELGASVTFSASDSsGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVG 70
CLECT_REG-1_like cd03594
C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and ...
336-461 6.57e-09

C-type lectin-like domain (CTLD) of the type found in human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, struthiocalcin-1 (SCA-1), and -2 (SCA-2); CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. REG-1 is a proliferating factor that participates in various kinds of tissue regeneration, including pancreatic beta-cell regeneration, regeneration of intestinal mucosa, regeneration of motor neurons, and perhaps regeneration of damaged heart tissue. REG-1 may play a role in the pathophysiology of Alzheimer's disease and in the development of gastric cancers. Its expression is correlated with reduced survival from early-stage colorectal cancer. REG-1 also binds and aggregates several bacterial strains from the intestinal flora, and it has been suggested that it is involved in the control of the intestinal bacterial ecosystem. Rat lithostathine has calcium carbonate crystal inhibitor activity in vitro. REG-IV is upregulated in pancreatic, gastric, hepatocellular, and prostate adenocarcinomas. REG-IV activates the EGF receptor/Akt/AP-1 signaling pathway in colorectal carcinoma. Ansocalcin and SCA-1 and -2 are found at high concentration in the calcified eggshell layer of goose and ostrich, respectively, and tend to form aggregates. Ansocalcin nucleates calcite crystal aggregates in vitro.


Pssm-ID: 153064 [Multi-domain]  Cd Length: 129  Bit Score: 57.00  E-value: 6.57e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  336 CPSDTeiFPGNGHCYRLVVEKAAWLQAQEQCQAW-AGAALAMVDSPAVQRFLVSRV----TRSLDVWIGFSTVQGVEVGP 410
Cdd:cd03594      1 CPKGW--LPYKGNCYGYFRQPLSWSDAELFCQKYgPGAHLASIHSPAEAAAIASLIssyqKAYQPVWIGLHDPQQSRGWE 78
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217306237  411 APQGEAFSLEScqnWLPGEPHPaTAEHCVRL-GPTG---WcNTDLCSAPHSYVCE 461
Cdd:cd03594     79 WSDGSKLDYRS---WDRNPPYA-RGGYCAELsRSTGflkW-NDANCEERNPFICK 128
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1201-1267 8.99e-09

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; The polycystic kidney disease 1 protein contains 14 of these repeats, which are also present in other proteins such as microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 55.15  E-value: 8.99e-09
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306237  1201 GAPVVVSAAVQT-GDNITWTFDMGDGTVLSGPeaTVEHVYLRAQNCTVTVGAASPAGHLARSLHVLVF 1267
Cdd:smart00089   14 GESVTFTATSSDdGSIVSYTWDFGDGTSSTGP--TVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1115-1325 2.28e-08

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 59.30  E-value: 2.28e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1115 AGRPVTFYPHplpSPGGVL-YTWDFGDGSpvlTQSQPAANHTYASRGTYHVRLEVNNTV-SGAAAQADVRVFEELRGLSV 1192
Cdd:COG3291     10 APLTVQFTDT---SSGNATsYEWDFGDGT---TSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAPNPGVTT 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1193 DMSLAVEQGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTVGAASPAGHLARSLHVLVFVLEVL 1272
Cdd:COG3291     84 VTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTSVSTTD 163
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2217306237 1273 RVEPAACIPTQPDARLTAYVTGNPAHYLFDWTFGDGSSNTTVRGCPTVTHNFT 1325
Cdd:COG3291    164 VTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATS 216
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1455-1519 2.86e-08

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 53.65  E-value: 2.86e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1455 ELQQPYLFSAV--GRGRPASYLWDLGDGGWL--EGPEVTHAYNSTGDFTVRVAGWNEVSRSEA-WLNVTV 1519
Cdd:cd00146     12 ELGASVTFSASdsSGGSIVSYKWDFGDGEVSssGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTkTTTVVV 81
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1782-1853 3.54e-08

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 53.27  E-value: 3.54e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217306237 1782 IRASEPGGSFVAAGSSVPFWGQLATG---TNVSWCWAVPGGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSAT 1853
Cdd:cd00146      1 PTASVSAPPVAELGASVTFSASDSSGgsiVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTK 75
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
999-1091 4.40e-08

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 52.77  E-value: 4.40e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  999 VSTVPAVLSPNATLALTAGVLvdSAVEVAFLWTFGDgeqalhqfqppynesfpvpdpSVAQVLVEHNVMHTYAAPGEYLL 1078
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLA--DGSNVTYTWDFGD---------------------SPGTSGSGPTVTHTYLSPGTYTV 57
                           90
                   ....*....|...
gi 2217306237 1079 TVLASNAFENLTQ 1091
Cdd:pfam00801   58 TLTASNAVGSANA 70
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1698-1771 8.13e-08

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 52.50  E-value: 8.13e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217306237 1698 VAASPNPAA-VNTSVTLSAE-LAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPGLHLVTMTAGNPLGSAN---ATVEV 1771
Cdd:cd00146      3 ASVSAPPVAeLGASVTFSASdSSGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSStktTTVVV 81
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
996-1097 9.29e-08

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; The polycystic kidney disease 1 protein contains 14 of these repeats, which are also present in other proteins such as microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 52.07  E-value: 9.29e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237   996 GLQVSTVPAVLSPNATLALTAGVLVDSAVeVAFLWTFGDGeqalhqfqppynesfpvpdpsvaQVLVEHNVMHTYAAPGE 1075
Cdd:smart00089    1 VADVSASPTVGVAGESVTFTATSSDDGSI-VSYTWDFGDG-----------------------TSSTGPTVTHTYTKPGT 56
                            90       100
                    ....*....|....*....|..
gi 2217306237  1076 YLLTVLASNAFENLTQQVPVSV 1097
Cdd:smart00089   57 YTVTLTVTNAVGSASATVTVVV 78
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
3-58 1.29e-07

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 57.25  E-value: 1.29e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306237    3 DVSHNLLRALDVGLlANLSALAELDISNNKISTLEEgiFANLFNLSEINLSGNPFE 58
Cdd:COG4886    211 DLSGNQLTDLPEPL-ANLTNLETLDLSNNQLTDLPE--LGNLTNLEELDLSNNQLT 263
PLAT_plant_stress cd01754
PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of ...
3095-3202 1.42e-07

PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of its members are stress induced. In general, PLAT/LH2 consists of an eight-stranded beta-barrel, and its proposed function is to mediate interaction with lipids or membrane-bound proteins.


Pssm-ID: 238852  Cd Length: 129  Bit Score: 53.31  E-value: 1.42e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3095 YEILVKTGWGRGSGTTAHVGIMLYGVDSRS---------GHRHLDGDRAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKG 3165
Cdd:cd01754      3 YTIYVQTGSIWKAGTDSRISLQIYDADGPGlrianleawGGLMGAGHDYFERGNLDRFSGRGPCLPSPPCWMNLTSDGTG 82
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 2217306237 3166 LSPAWFLQHVIVRDL-QTARSA--FFLVNDWLSVETEANG 3202
Cdd:cd01754     83 NHPGWYVNYVEVTQAgQHAPCMqhLFAVEQWLATDESPYM 122
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1127-1183 1.81e-07

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 51.50  E-value: 1.81e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306237 1127 PSPGGVL-YTWDFGDGSpvlTQSQPAANHTYASRGTYHVRLEVNNTVSGAAAQADVRV 1183
Cdd:pfam18911   29 DPDGDILsYRWDFGDGT---TATGANVSHTYAAPGTYTVTLTVTDDSGASNSTATDTV 83
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1282-1355 2.38e-07

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 50.96  E-value: 2.38e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217306237 1282 TQPDARLTAYVTGNPAHYLFDWTFGDGssNTTVRGCPTVTHNFTRSGTFPLALVLSSRVNRAhyfTSICVEPEV 1355
Cdd:cd00146     13 LGASVTFSASDSSGGSIVSYKWDFGDG--EVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSS---STKTTTVVV 81
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1279-1344 2.77e-07

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 50.46  E-value: 2.77e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217306237 1279 CIPTQPDARLTAYV-TGNPAHYLfdWTFGDgsSNTTVRGCPTVTHNFTRSGTFPLALVLSSRVNRAH 1344
Cdd:pfam00801    7 VVAAGQPVTFTATLaDGSNVTYT--WDFGD--SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSAN 69
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1863-1942 3.55e-07

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 50.53  E-value: 3.55e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  1863 VGLVLWASSKVVApGQLVHFQI-LLAAGSAVTFRLQVGgaNPEVLPGPRFSHSFPRVGDHVVSVRGKNHVSWAQAQVRIV 1941
Cdd:smart00089    1 VADVSASPTVGVA-GESVTFTAtSSDDGSIVSYTWDFG--DGTSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVV 77

                    .
gi 2217306237  1942 V 1942
Cdd:smart00089   78 V 78
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
3-58 7.32e-07

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 54.94  E-value: 7.32e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306237    3 DVSHNLLRALDVglLANLSALAELDISNNKISTLEEgiFANLFNLSEINLSGNPFE 58
Cdd:COG4886    234 DLSNNQLTDLPE--LGNLTNLEELDLSNNQLTDLPP--LANLTNLKTLDLSNNQLT 285
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
3-58 8.27e-07

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 54.94  E-value: 8.27e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306237    3 DVSHNLLRALDVGLlANLSALAELDISNNKISTLEEGIfANLFNLSEINLSGNPFE 58
Cdd:COG4886    188 DLSNNQITDLPEPL-GNLTNLEELDLSGNQLTDLPEPL-ANLTNLETLDLSNNQLT 241
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1384-1437 9.37e-07

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 49.42  E-value: 9.37e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217306237 1384 PYRYTWDFGTEEAAPTRarGPEVTFIYRDPGSYLVTVTASNNISAAN-DSALVEV 1437
Cdd:cd00146     29 IVSYKWDFGDGEVSSSG--EPTVTHTYTKPGTYTVTLTVTNAVGSSStKTTTVVV 81
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
3-57 1.20e-06

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increase PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 52.48  E-value: 1.20e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2217306237    3 DVSHNLLRALDVglLANLSALAELDISNNKISTLEE--GIFANLFNLSEINLSGNPF 57
Cdd:cd21340    126 NISGNNIDSLEP--LAPLRNLEQLDASNNQISDLEEllDLLSSWPSLRELDLTGNPV 180
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1700-1877 1.93e-06

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 53.14  E-value: 1.93e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1700 ASPNPAAVNTSVTLSAeLAGGSGVVYTWSLEEGLSweTSEPFTTHSFPTPGLHLVTMTAGNPLGSANA-----TVEVDVQ 1774
Cdd:COG3291      3 ATPTSGCAPLTVQFTD-TSSGNATSYEWDFGDGTT--STEANPSHTYTTPGTYTVTLTVTDAAGCSDTttktiTVGAPNP 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1775 VPVSGLSIRASEPGGSFVAAGSSVPFWGQLATGTNVSWCWAVPGGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSATY 1854
Cdd:COG3291     80 GVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTSV 159
                          170       180
                   ....*....|....*....|...
gi 2217306237 1855 NLTAEEPIVGLVLWASSKVVAPG 1877
Cdd:COG3291    160 STTDVTSDGTTSASTNPSVTTDT 182
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1462-1502 2.35e-06

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 48.42  E-value: 2.35e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2217306237 1462 FSAVGRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRV 1502
Cdd:pfam18911   26 ASDDPDGDILSYRWDFGDGTTATGANVSHTYAAPGTYTVTL 66
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
3-58 2.61e-06

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 53.40  E-value: 2.61e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306237    3 DVSHNLLRALDVGLlANLSALAELDISNNKISTLEEGIfANLFNLSEINLSGNPFE 58
Cdd:COG4886    165 DLSNNQLTDLPEEL-GNLTNLKELDLSNNQITDLPEPL-GNLTNLEELDLSGNQLT 218
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
3-58 2.73e-06

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 53.01  E-value: 2.73e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306237    3 DVSHNLLRALDVGLlANLSALAELDISNNKISTLEEGIfANLFNLSEINLSGNPFE 58
Cdd:COG4886    142 DLSNNQLTDLPEPL-GNLTNLKSLDLSNNQLTDLPEEL-GNLTNLKELDLSNNQIT 195
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1383-1684 4.26e-06

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 52.36  E-value: 4.26e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1383 FPYRYTWDFGTeeaaPTRARGPEVTFIYRDPGSYLVTVTASNNI-SAANDSALVEVQEPVLVTSIKVNGSlglelqqpYL 1461
Cdd:COG3291     23 NATSYEWDFGD----GTTSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAPNPGVTTVTTST--------TV 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1462 FSAVGRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRVAGWNEVSRSEAWLNVTVKRRVRGLVVNASRTVVPLNGSV 1541
Cdd:COG3291     91 TTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTSVSTTDVTSDGTT 170
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1542 SFSTSLEAGSDVRYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVYVLQLIEGLQVVGGGRYFPT 1621
Cdd:COG3291    171 SASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTLTGISTGDAGTPGTNT 250
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217306237 1622 NHTVQLQAVVRDGTNVSYSWTAWRDRGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWAD 1684
Cdd:COG3291    251 VTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVSTTADVTGGTATLA 313
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
206-278 5.78e-06

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 47.11  E-value: 5.78e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306237  206 TLVGPHGPLAS-GQLAAFHIAAPLP--VTATRWDFGDGSAEVdAAGPAASHRYVLPGRYHVTAVLALGAGSALLGT 278
Cdd:cd00146      2 TASVSAPPVAElGASVTFSASDSSGgsIVSYKWDFGDGEVSS-SGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTKT 76
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
3-58 1.11e-05

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 51.09  E-value: 1.11e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306237    3 DVSHNLLRALDVGLlANLSALAELDISNNKISTLEEGIfANLFNLSEINLSGNPFE 58
Cdd:COG4886    119 DLSGNQLTDLPEEL-ANLTNLKELDLSNNQLTDLPEPL-GNLTNLKSLDLSNNQLT 172
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1000-1359 1.69e-05

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 50.44  E-value: 1.69e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1000 STVPAVLSPNATLALTAgvlVDSAVEVAFLWTFGDGEQAlhqfqppynesfpvpdpsvaqvlVEHNVMHTYAAPGEYLLT 1079
Cdd:COG3291      2 TATPTSGCAPLTVQFTD---TSSGNATSYEWDFGDGTTS-----------------------TEANPSHTYTTPGTYTVT 55
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1080 VLASNAF-ENLTQQVPVSVRASLPSVAVGVSDGVLVAGRPVTFYPHPLPSPGGVLYTWDFGDGSPVLTQSQPAANHTYAS 1158
Cdd:COG3291     56 LTVTDAAgCSDTTTKTITVGAPNPGVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTT 135
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1159 RGTYHVRLEVNNTVSGAAAQADVRVFEELRGLSVDmslaveqGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEHV 1238
Cdd:COG3291    136 TGTDTGLTGSTGTASDTATVTTSVSTTDVTSDGTT-------SASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTA 208
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1239 YLRAQNCTVTVGAASPAGHLARSLHVLVFVLEVLRVEPAACIPTQPDARLTAYVTGNPAHYLFDWTFGDGSSNTTVRGCP 1318
Cdd:COG3291    209 GVTTGATSGTSGTGSATSGVAVTDVTLTGISTGDAGTPGTNTVTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGL 288
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|.
gi 2217306237 1319 TVTHNFTRSGTFPLALVLSSRVNRAHYFTSICVEPEVGNVT 1359
Cdd:COG3291    289 GTTTAITPGNVSTTADVTGGTATLAVSSTLTTNDTTGSSST 329
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
230-274 2.20e-05

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 45.72  E-value: 2.20e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 2217306237  230 VTATRWDFGDGSAevdAAGPAASHRYVLPGRYHVTAVLALGAGSA 274
Cdd:pfam18911   34 ILSYRWDFGDGTT---ATGANVSHTYAAPGTYTVTLTVTDDSGAS 75
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1386-1437 2.34e-05

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 45.34  E-value: 2.34e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2217306237 1386 RYTWDFGTEeaapTRARGPEVTFIYRDPGSYLVTVTASNNISAANDSALVEV 1437
Cdd:pfam18911   36 SYRWDFGDG----TTATGANVSHTYAAPGTYTVTLTVTDDSGASNSTATDTV 83
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1957-2033 2.76e-05

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 45.13  E-value: 2.76e-05
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306237  1957 PGIATGTERNFTARVQR-GSRVAYAWYFslqkvqGDSLViLSGRDVTYTPVAAGLLEIQVRAFNALGSENRTLVLEVQ 2033
Cdd:smart00089    9 TVGVAGESVTFTATSSDdGSIVSYTWDF------GDGTS-STGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
231-283 2.76e-05

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 45.13  E-value: 2.76e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 2217306237   231 TATRWDFGDGSaevDAAGPAASHRYVLPGRYHVTAVLALGAGSALLGTDVQVE 283
Cdd:smart00089   30 VSYTWDFGDGT---SSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1866-1942 3.93e-05

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 44.79  E-value: 3.93e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306237 1866 VLWASSKVVAPGQLVHFQI-LLAAGSAVTFRLQVGGANPEVLPGPRFSHSFPRVGDHVVSVRGKNHVSWAQAQVRIVV 1942
Cdd:cd00146      3 ASVSAPPVAELGASVTFSAsDSSGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTKTTTVV 80
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
2051-2117 4.21e-05

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 48.90  E-value: 4.21e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306237 2051 SAQFEAATSPSPrrVAYHWDFGDGSpgqDTDEPRAEHSYLRPGDYRVQVNASNLV-SFFVAQATVTVQ 2117
Cdd:COG3291     13 TVQFTDTSSGNA--TSYEWDFGDGT---TSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVG 75
CLECT_NK_receptors_like cd03593
C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs); ...
336-461 5.04e-05

C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs); CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL, in addition to inhibiting NK cell function, inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cell's secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress-induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins. Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and is involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and is thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g. Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.


Pssm-ID: 153063  Cd Length: 116  Bit Score: 45.40  E-value: 5.04e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  336 CPSDTEIFpgNGHCYRLVVEKAAWLQAQEQCQAwAGAALAMVDSPAVQRFLvSRVTRSLDVWIGFSTVQGVEVGPAPQGE 415
Cdd:cd03593      1 CPKDWICY--GNKCYYFSMEKKTWNESKEACSS-KNSSLLKIDDEEELEFL-QSQIGSSSYWIGLSREKSEKPWKWIDGS 76
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 2217306237  416 AFSlescqNWLpgEPHPATAE-HCVRLGPTGwCNTDLCSAPHSYVCE 461
Cdd:cd03593     77 PLN-----NLF--NIRGSTKSgNCAYLSSTG-IYSEDCSTKKRWICE 115
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
9-58 5.28e-05

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 49.16  E-value: 5.28e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 2217306237    9 LRALDVGLLANLSALAELDISNNKISTLEEGIfANLFNLSEINLSGNPFE 58
Cdd:COG4886    101 LDLSGNEELSNLTNLESLDLSGNQLTDLPEEL-ANLTNLKELDLSNNQLT 149
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
2051-2103 8.17e-05

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 43.80  E-value: 8.17e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217306237 2051 SAQFEAATS--PSPRRVAYHWDFGDGSPGqdtDEPRAEHSYLRPGDYRVQVNASN 2103
Cdd:pfam18911   19 TVTFDASASddPDGDILSYRWDFGDGTTA---TGANVSHTYAAPGTYTVTLTVTD 70
CLECT_CEL-1_like cd03589
C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and ...
336-462 1.37e-04

C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina; CLECT_CEL-1_like: C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CEL-1 CTLD binds three calcium ions and has a high specificity for N-acetylgalactosamine (GalNAc). CEL-1 exhibits strong cytotoxicity which is inhibited by GalNAc. This protein may play a role as a toxin defending against predation. Echinoidin is found in the coelomic fluid of the sea urchin and is specific for GalBeta1-3GalNAc. Echinoidin has a cell adhesive activity towards human cancer cells which is not mediated through the CTLD. Both CEL-1 and Echinoidin are multimeric proteins comprised of multiple dimers linked by disulfide bonds.


Pssm-ID: 153059  Cd Length: 137  Bit Score: 44.66  E-value: 1.37e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  336 CPSDTEIFpgNGHCYRLVVEKAAWLQAQEQCQAWAG----AALAMVDSPAVQRFL------VSRVTRSLDVWIGFStvQG 405
Cdd:cd03589      1 CPTFWTAF--GGYCYRFFGDRLTWEEAELRCRSFSIpgliAHLVSIHSQEENDFVydlfesSRGPDTPYGLWIGLH--DR 76
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217306237  406 VEVGPA--PQGEAFSLEscqNWLPGEPHPA-TAEHCVRLGPTG-----WcNTDLCSAPHSYVCEL 462
Cdd:cd03589     77 TSEGPFewTDGSPVDFT---KWAGGQPDNYgGNEDCVQMWRRGdagqsW-NDMPCDAVFPYICKM 137
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ...
3731-4256 1.42e-04

Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 48.33  E-value: 1.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3731 EALYPDPPGPRV--------HTCSAAGGFSTSDYDVGWESPHNGSGTWAYSAPDLLGAWSWGSCAVYDSGGYVQELGLSL 3802
Cdd:COG3321    850 SALYPGRGRRRVplptypfqREDAAAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAALALAAAALAAL 929
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3803 EESRDRLRFLQLHNWLDNRSRAVFLELTRYSPAVGLHAAVTLRLEFPAAGRALAALSVRPFALRRLSAGLSLPLLTSVCL 3882
Cdd:COG3321    930 LALVALAAAAAALLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAALALLAAAA 1009
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3883 LLFAVHFAVAEARTWHREGRWRVLRLGAWARWLLVALTAATALVRLAQLGAADRQWTRFVRGRPRRFTSFDQVAQLSSAA 3962
Cdd:COG3321   1010 LLLAAAAAAAALLALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELALAAAALALAAA 1089
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 3963 RGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELLGVTLGLVVLGVAYAQLAILLVSSCVDSLWSVAQALLVLCP 4042
Cdd:COG3321   1090 LAAAALALALAALAAALLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAALAAALAAALLA 1169
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 4043 GTGLSTLCPAESWHLSPLLCVGLWALRLWGALRLGAVILRWRYHALRGELYRPAWEPQDYEMVELFLRRLRLWMGLSKVK 4122
Cdd:COG3321   1170 AAALLLALALALAAALAAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALALLALAAAAAAVAALA 1249
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 4123 EFRHKVRFEGMEPLPSRSSRGSKVSPDVPPPSAGSDASHPSTSSSQLDGLSVSLGRLGTRCEPEPSRLQAVFEALLTQFD 4202
Cdd:COG3321   1250 AAAAALLAALAALALLAAAAGLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAAAALAAALLAAA 1329
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217306237 4203 RLNQA-TEDVYQLEQQLHSLQGRRSSRAPAGSSRGPSPGLRPALPSRLARASRGV 4256
Cdd:COG3321   1330 LAALAaAVAAALALAAAAAAAAAAAAAAAAAAALAAAAGAAAAAAALALAALAAA 1384
PHA03247 PHA03247
large tegument protein UL36; Provisional
426-758 2.65e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 2.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  426 LPGEPHPATAEHCVrlgPTGWCnTDLCSAPHSYVCELQPGGPVQDAENLLVGAPSGDLQGPLTPLA-----------QQD 494
Cdd:PHA03247  2555 LPPAAPPAAPDRSV---PPPRP-APRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPlppdthapdppPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  495 GLSAPHEPVEVMVFPGL------------------RLSREAFLTTAEFGTQELRRPAqLRLQVYRLLSTAGTPENGSEPE 556
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPpperprddpapgrvsrprRARRLGRAAQASSPPQRPRRRA-ARPTVGSLTSLADPPPPPPTPE 2709
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  557 SRsPDNRTQLAPAcMPGGRWCPGANICLPLDASCHPQACANGCTSGPGLPGAPYAlwreflfsvPAGPPaqyslllpvcq 636
Cdd:PHA03247  2710 PA-PHALVSATPL-PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT---------TAGPP----------- 2767
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237  637 vlacvlSPCVPRS-ATVLPVLVTSTVGRLLEVTLHGQDVLMLPGDLVGLQHDAGPGALLHCSPAPGHPGPRAPYLSANAS 715
Cdd:PHA03247  2768 ------APAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP 2841
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 2217306237  716 SWLPHLPAQLEGTWACPACALRLLAATEQLTVLLGLRPNPGLR 758
Cdd:PHA03247  2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVR 2884
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1618-1690 6.87e-04

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 41.33  E-value: 6.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1618 YFPTNHTVQLQAVVR-------DGTNVSYSWTaWRDrGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWADCTMDFV 1690
Cdd:cd00146      4 SVSAPPVAELGASVTfsasdssGGSIVSYKWD-FGD-GEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTKTTTVVV 81
LRR_8 pfam13855
Leucine rich repeat;
23-55 1.23e-03

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 39.82  E-value: 1.23e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 2217306237   23 LAELDISNNKISTLEEGIFANLFNLSEINLSGN 55
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNN 35
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1698-1775 2.19e-03

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 39.95  E-value: 2.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306237 1698 VAASPNPAAVNTSVTLSAE---LAGGSGVVYTWSLEEGLSWETSEPftTHSFPTPGLHLVTMTAGNPLGSANATVEVDVQ 1774
Cdd:pfam18911    7 DAGGDRIVAEGETVTFDASasdDPDGDILSYRWDFGDGTTATGANV--SHTYAAPGTYTVTLTVTDDSGASNSTATDTVT 84

                   .
gi 2217306237 1775 V 1775
Cdd:pfam18911   85 V 85
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
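
The per-hit summary lines in this listing ("Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...") can be read programmatically and compared against the E-value threshold stated above. The following is only a minimal sketch in Python: the names `HIT_RE`, `parse_hit_summary`, and `passes_threshold` are hypothetical helpers written for this illustration and are not part of any NCBI tool, and treating the threshold as inclusive (E-value <= 0.01) is an assumption.

```python
import re

# Hypothetical pattern for the summary lines as they appear in this listing.
# The "[Multi-domain]" flag is optional and is absent on some hits.
HIT_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    m = HIT_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is reported when its E-value is <= the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    sample = "Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 39.82  E-value: 1.23e-03"
    hit = parse_hit_summary(sample)
    print(hit, passes_threshold(hit))
```

Applied to the summary lines in this section, every hit should satisfy the stated 0.01 threshold, since the listing only reports hits that passed the search filter.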

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.