NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2217306235|ref|XP_047290166|]
View 

polycystin-1 isoform X5 [Homo sapiens]

Protein Classification

CLECT and PLAT_polycystin domain-containing protein( domain architecture ID 13202464)

protein containing domains CLECT, GPS, PLAT_polycystin, and PKD_channel

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
46-2722 0e+00

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 4838.88  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235   46 DISNNKISTLEEGIFANLFNLSEINLSGNPFECDCGLAWLPRWAEEQQVRVVQPEAATCAGPGSLAGQPLLGIPLLDSGC 125
Cdd:TIGR00864    1 DISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEAALCAGPGALAGQPLLGIPLLDSGC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  126 GEEYVACLPDNSSGTVAAVS----FSAAHEGLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSASFACLSLCSGPP 201
Cdd:TIGR00864   81 DEEYVACLKDNSSGGGAARSelviFSAAHEGLFQPEACNAFCFSAGHGLAALGEQGECLCGAAQPSEANFACESLCSGPP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  202 PPPAPTCRGPTLLQHVFPASPGATLVGPHGPLASGQLAAFHIAAPLPVTATRWDFGDGSAEVDAAGP----AASHRYVLP 277
Cdd:TIGR00864  161 PPPAAACRGPQLLEHIFPALPGAPIQGPHGPIASGQLAAFHAAAPLAPTAMRWDFGDGSAEVDAAGAggttAASHKYGHP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  278 GRYHVTAVLALGAGSALLGTDVQVEAAPAALELVCPSSVQSDESLDLSIQNRGGSGLEAAYSIVALGEEPARAVHPLCPS 357
Cdd:TIGR00864  241 GRYHVSAMGALGAGKALAGGDVQVEAAPAALELHCPSLVQADESLDLSIQNRGGSDLDAAWKITAHGEEPAKASHPHCPK 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  358 DTEIFPGNGHCYRLVVEKAAWLQAQEQCQAWAGAALAMVDSPAVQRFLVSRVTRSLD--VWIGFSTVQGVEVGPAPQGEA 435
Cdd:TIGR00864  321 DGEIFEENGHCFQIVPEEAAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSLDrgVWIGFSDVNGAEKGPAHQGEA 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  436 FSLESCQNWLPGEPHPATAEHCVRLGPTGWCNTDLCSAPHSYVCELQPGGPVQDAENLLVGAPSGDLQGPLTPLAQQDGL 515
Cdd:TIGR00864  401 FEAEECEEGLAGEPHPARAEHCVRLDPRGQCNSDLCNAPHAYVCELNPGGPVPDAENFAMGAASFDLHGLLQALAAMDGL 480
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  516 SAP-HEPVEVMVFPGLRLSREAFLTTAEFGTQELRRPAQLRLQVYRL---LSTAGTPENGSEPESRSPDNRTQLAPACMP 591
Cdd:TIGR00864  481 PAPpHEGVEVLLFPALRFSRAAFLSSAEFGTQELRRPAHILFQIYRLrcrLPGAGGPACGPEAECRPPDNRSADAPACMK 560
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  592 GGRWCPGANICLPLDASCHPQACANGCTSGPGLPGAP----YALWREFLFSVPAGPPAQYSlllpvcqvlacvlspcvpr 667
Cdd:TIGR00864  561 GEQWCPFAHICLPLDAPCHPQACANGCSQGHGLPGAArmplYALQREFLFSLPAGPAAHVL------------------- 621
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  668 satvlpvlvtstvgrlleVTLHGQDVLMLPGDLVGLQHDAGPGALLHCSPAPGHPGPRAPYLSANASSWLPH-------- 739
Cdd:TIGR00864  622 ------------------LQDHGEDLLMLPGDLIALQHDAGPAALIHCQPAPGHPGPRAPVFAANASEWFGHnntpvppd 683
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  740 ----------------LPAQLEGTWA----CPACALRLLAATEQLTVLLGLRPNPGLRLPGRYEVRAEVGNGVSRHNLSC 799
Cdd:TIGR00864  684 nlagdgadplpdpeldLKALLEGTRAswleCAACAIRLLAAGEQETRLLGAELNAGLPLPGLYELLAESAKGSDLHNASC 763
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  800 SFDVVSPVAGLRVIYPAPRDGRLYVPTNGSALVLQVDSGANATATARWPGGSVSARFENVCPALVA-------TFVPGCP 872
Cdd:TIGR00864  764 SFDVLPPLAGLRVIHPAPQDGRLFLESNGSALLLQVDSGANAEAKAFWPGGNSSARFENVCPAEFAsrlchpsTFEGGCA 843
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  873 WETNDTLFSVVALPWLSEGEHV----VDVVVENSASRANLSLRVTAEEPICGLRATPSPEARVLQGVLVRYSPVVEAGSD 948
Cdd:TIGR00864  844 EEAEDSLFAVLALNWLKEGEHTgpvqVDLMAENNASEANLSLLVQAEEPICGLRAQPHPAARVLMESLVRYSASVEAGSD 923
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  949 MVFRWTINDKQSLTFQNVVFNVIYQSAAVFKLSpedaamavLTASNHVSNVTVNYNVTVERMNRMQGLQVSTVPAVLSPN 1028
Cdd:TIGR00864  924 MTFKWTIDDKPFFTFQNTVFNVIYQHAAVFKLS--------LTAMNHVSNLTEDFNVTVDRLNPMQGLQVKGVPAVLPPG 995
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1029 ATLALTAGVLVDSAVEVAFLWTFGDGEQALHQFQPPYNESFPVPDPSVAQVLVEHNVMHTYAAPGEYLLTVLASNAFENL 1108
Cdd:TIGR00864  996 ATLALTAGVLIDMAVEAAFLWSFGDGEQALFEFKPPYNESFPCPDPSPAQVLLEHNVMHIYAAPGEYLATVLASNAFENI 1075
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1109 TQQVPVSVRASLPSVAVGVSDGVLVAGRPVTFYPHPLPSPGGVLYTWDFGDGSPVLTQSQPAANHTYASRGTYHVRLEVN 1188
Cdd:TIGR00864 1076 SQQINMSVRAILPRVAIGTEDGLLLAGKPADFEAHPLPSPGGIHYEWDFGDGSALLQGRQPAAAHTFAKRGPFHVCLEVN 1155
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1189 NTVSGAAAQADVRVFEELRGLSVDMSLAVEQGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTV 1268
Cdd:TIGR00864 1156 NTISGAAACADMFAFEEIEGLSADMSLATELGAATTVRAALQSGDNITWTFDMGDGKSLSGPEATVEHKYAKAGNCTVNI 1235
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1269 GAASPAGHLARSLHVLVFVLEVLRVEPAACIPTQPDARLTAYVTGNPAHYLFDWTFGDGSSNTTVRGCPTVTHNFTRSGT 1348
Cdd:TIGR00864 1236 GAANAAGHGARIIHVEVFVFEVAGIEPAACIGEHADANFRARVSGNAAHYLFDWSFGDGSPNETHHGCPGISHNFRGNGT 1315
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1349 FPLALVLSSRVNRAHYFTSICVEPEVGNVTLQPERQFVQLGDEAWLVACAWPPFPYRYTWDFGTEEAAPTRARGPEVTFI 1428
Cdd:TIGR00864 1316 FPLALTISSGVNKAHFFTQICVEPELGKISLQAEKQFFALGDEAQFQACAEPEFNYRYEWDFGGEEAAPLPAAGAEVTFI 1395
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1429 YRDPGSYLVTVTASNNISAANDSALVEVQEPVLVTSIKVNGSLG--LELQQPYLFSAVGRGRPASYLWDLGDGGWLEGPE 1506
Cdd:TIGR00864 1396 YNDPGCYLVTVAASNNISAANDSALIEVLEPVGATSFKHNGSHGnnLELGQPYLFSAFGRARNASYLWDFGDGGLLEGPE 1475
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1507 VTHAYNSTGDFTVRVAGWNEVSRSEAWLNVTVKRRVRGLVVNASRTVVPLNGSVSFSTSLEAGSDVRYSWVLCDRCTPIP 1586
Cdd:TIGR00864 1476 ILHAFNSPGDFNIRLAAANEVGKNEATLNVAVKARVRGLTINASLTNVPLNGSVHFEAHLDAGDDVRFSWILCDHCTPIF 1555
                         1610      1620      1630      1640      1650      1660      1670      1680
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1587 GGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVYVLQLIEGLQVVGG-------------GRYFPTNHTVQLQAVVRDG 1653
Cdd:TIGR00864 1556 GGNTIFYTFRSVGTFNIIVTAENDVGAAQASIFLFVLQEIEGLQILGEtaegggggvqeldGCYFETNHTVQFHAGFKDG 1635
                         1690      1700      1710      1720      1730      1740      1750      1760
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1654 TNVSYSWTAWRD---RGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWADCTMDFVEPVGWLMVAASPNPAAVNTSV 1730
Cdd:TIGR00864 1636 TNLSFSWNAILDnepDGPAFAGSGKGAKLNPLEAGPCDIFLQAANLLGQATADCTIDFLEPAGNLMLAASDNPAAVNALI 1715
                         1770      1780      1790      1800      1810      1820      1830      1840
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1731 TLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPGLHLVTMTAGNPLGSANATVEVDVQVPVSGLSIRASEPGGS- 1809
Cdd:TIGR00864 1716 NLSAELAEGSGLQYRWFLEEGDDLETSEPFMSHSFPSAGLHLVTMKAFNELGSANASEEVDVQEPISGLKIRAADAGEQn 1795
                         1850      1860      1870      1880      1890      1900      1910      1920
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1810 FVAAGSSVPFWGQLATGTNVSWCWAVPGGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSATYNLTAEEPIVGLVLWAS 1889
Cdd:TIGR00864 1796 FFAADSSVCFQGELATGTNVSWCWAIDGGSSKMGKHACMTFPDAGTFAIRLNASNAVSGKSASREFFAEEPIFGLELKAS 1875
                         1930      1940      1950      1960      1970      1980      1990      2000
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1890 SKVVAPGQLVHFQILLAAGSAVTFRLQVGGANPEVL-PGPRFSHSFPRVGDHVVSVRGKNHVSWAQAQVRIVVLEAVSGL 1968
Cdd:TIGR00864 1876 KKIAAIGEKVEFQILLAAGSAVNFRLQIGGAAPEVLqPGPRFSHSFPRVDDHMVNLRAKNEVSCAQANLHIEVLEAVRGL 1955
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1969 QVPNCCEPGIATGTERNFTARVQRGSRVAYAWYFSLQKVQGDSLVILSGRDVTYTPVAAGLLEIQVRAFNALGSENRTLV 2048
Cdd:TIGR00864 1956 QIPDCCAAGIATGEEKNFTANVQRGKPVAFAWTFDLHHLHGDSLVIHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQ 2035
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2049 LEVQDAVQYVALQSGP--CFTNRSAQFEAATSPSPRRVAYHWDFGDGSPGQDTDEPRAEHSYLRPGDYRVQVNASNLVSF 2126
Cdd:TIGR00864 2036 LEAQDALMDAALQAGPqdCFTNKMAQFEAATSPKPNFMACHWDFGDGSAGQDTDEPRAEHEYLHPGDYRVQVNASNLVSF 2115
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2127 FVAQATVTVQVLACREPEVDVVLPLQVLMRRSQRNYLEAHVDLRDCVTYQTEYRWEVYRTASCQRPGRPARVALPG---- 2202
Cdd:TIGR00864 2116 FSAHAEINVQVLACEEPEVDVVLALQLAIRRSQPNLLEAHVDLKDCLRYGAEYLWEILRAASCDNDGHFARGALNGatrs 2195
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2203 ---------VDVSRPRLVLPRLALPVGHYCFVFVVSFGDTPLTQSIQANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGS 2273
Cdd:TIGR00864 2196 fpviplpaeVDVQRLQLSLPKLALAAGHYCFVFSLSFEDTPLKKAACANLGVAAARLMPIIEGGSYRVWSDTQDLQLDAE 2275
                         2330      2340      2350      2360      2370      2380      2390      2400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2274 ESYDPNLEDGDQTPLSFHWACVASTQREAGGCALNFGPRG-SSTVTIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLI 2352
Cdd:TIGR00864 2276 ESYDPNLDDDDQSLLHFHWACQASSKGEAGCCALNFGLGGkGPTLGIPGEELAAGIEYTFKLSIGKAGMKEEATNQTVLI 2355
                         2410      2420      2430      2440      2450      2460      2470      2480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2353 RSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGRCLNCSSGSKRGRWAARTFSNKTLVLDETTTSTGSAGMRLVLRRGVL 2432
Cdd:TIGR00864 2356 QSGHIPIVSLECVSCKAQALYEVSQNSYVYLEGRCLNCQSGFHRGRWAARTFQNDTLVLDESSTSTGSAGMNLVLRQGVL 2435
                         2490      2500      2510      2520      2530      2540      2550      2560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2433 RDGEGYTFTLTVLGRSGEEEGCASIRLSPNRPPLGGSCRLFPLG--------------AVHALTTKVHFECTGWHDAEDA 2498
Cdd:TIGR00864 2436 HDGEGYNFTLHVLDDSGDEEGAASIRLHHNMPPDGGECHLFPGGetgqehgdkedevwAIEALLDKVHFECSGWHDAEDA 2515
                         2570      2580      2590      2600      2610      2620      2630      2640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2499 GAPLVYALLLRRCRQGHCEEFCVYKGSLSSYGAVLPPGFR-PHFEVGLAVVVQDQLGAAVVALNRSLAITLPEPNGSATG 2577
Cdd:TIGR00864 2516 EAPLLYALLLNRCRDDHCEEFCVYKGSLPEHGAFLPPGFRsAHFEVGLAITVEDHLGAAIRALNKSIAITLPDPNGEASG 2595
                         2650      2660      2670      2680      2690      2700      2710      2720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2578 LTVWLHGLTASVLPGLLRQADPQHVIEYSLALVTVLNEYERALDVAAEPKHERQHRAQIRKNITETLVSLRVHTVDDIQQ 2657
Cdd:TIGR00864 2596 LPHWLHDLIASKLKGLLDQADFQHVIELSLALITVLNEYEQALDSAAEPKHERGHRAQIRKNITEALTALDLHTVDDIQQ 2675
                         2730      2740      2750      2760      2770      2780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217306235 2658 IAAALAQCMGPSRELVCRSCLKQTLHKLEAMMLILQAETTAGTVTPTAIGDSILNITGDLIHLAS 2722
Cdd:TIGR00864 2676 IAAALAQCMAPSREFICEECLKQTLHKLEAMLEILQADTKAGIVTPTAIADNILNIMGDLIHLAS 2740
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
3112-3231 8.54e-58

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans that are widely expressed in various cell types and whose biological functions remain poorly defined. In human, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause for autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane bound proteins.


:

Pssm-ID: 238850  Cd Length: 120  Bit Score: 196.34  E-value: 8.54e-58
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3112 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGD--RAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAW 3189
Cdd:cd01752      1 YLYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPekPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSW 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 2217306235 3190 FLQHVIVRDLQTARSAFFLVNDWLSVETEanGGLVEKEVLAA 3231
Cdd:cd01752     81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120
Polycystin_dom super family cl48672
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
3707-3885 5.14e-56

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain display a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


The actual alignment was detected with superfamily member pfam20519:

Pssm-ID: 466668  Cd Length: 199  Bit Score: 194.56  E-value: 5.14e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3707 AFLAITRSEELWPWMAHVLLPYVHGNQS------------SPELGPPRLRQVRLQEA--LYPDPPGPRVHTCSAAGGFST 3772
Cdd:pfam20519    1 GLLTVTDLDDIWDWLSSVLLPALHSNKTpsglpgsfiayeSLLLGVPRLRQLRVRNSscLVHDKFVREINECHAGYSPPS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3773 SDY----DVGWESPHNGSGTW-AYSAPDLL-GAWSWGSCAVYDSGGYVQELGLSLEESRDRLRFLQLHNWLDNRSRAVFL 3846
Cdd:pfam20519   81 EDRklysALPYKPVHYGSKYWfIYTPPGLLmGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVFV 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 2217306235 3847 ELTRYSPAVGLHAAVTLRLEFPAAGRALAALSVRPFALR 3885
Cdd:pfam20519  161 DFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQSVKLL 199
PKD_channel super family cl37568
Polycystin cation channel; This family contains the cation channel region from group II of ...
3886-4107 7.71e-44

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


The actual alignment was detected with superfamily member pfam08016:

Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 160.52  E-value: 7.71e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3886 RLSAGLSLPLLT-SVCLLLFAVHFAVAEARTWHREGR------WRVLRLgawarwLLVALTAATALVRLAQLGAADRQWT 3958
Cdd:pfam08016    1 RYVTNRSLFILLcEIVFVVFFLYFVVEEILKIRKHRPsylrsvWNLLDL------AIVILSVVLIVLNIYRDFLADRLIK 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3959 rFVRGRPRRFTSFDQVAQLSSAARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELLGVTLGLVVLGVAYAQLA 4038
Cdd:pfam08016   75 -SVEASPVTFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFG 153
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217306235 4039 ILLVSSCVDSLWSVAQALLVLCPgtglsTLCPAESWH--------LSPLLCVGLWALRLWGALRLGAVILRWRYHAL 4107
Cdd:pfam08016  154 YLLFGTQAPNFSNFVKSILTLFR-----TILGDFGYNeifsgnrvLGPLLFLTFVFLVIFILLNLFLAIINDSYVEV 225
GPS super family cl02559
GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for ...
3005-3054 3.18e-10

GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for auto-proteolysis, so is thus named, GPS. The GPS motif is a conserved sequence of ~40 amino acids containing canonical cysteine and tryptophan residues, and is the most highly conserved part of the domain. In most, if not all, cell-adhesion GPCRs these undergo autoproteolysis in the GPS between a conserved aliphatic residue (usually a leucine) and a threonine, serine, or cysteine residue. In higher eukaryotes this motif is found embedded in the C-terminal beta-stranded part of a GAIN domain - GPCR-Autoproteolysis INducing (GAIN). The GAIN-GPS domain adopts a fold in which the GPS motif, at the C-terminus, forms five beta-strands that are tightly integrated into the overall GAIN domain. The GPS motif, evolutionarily conserved from tetrahymena to mammals, is the only extracellular domain shared by all human cell-adhesion GPCRs and PKD proteins, and is the locus of multiple human disease mutations. The GAIN-GPS domain is both necessary and sufficient functionally for autoproteolysis, suggesting an autoproteolytic mechanism whereby the overall GAIN domain fine-tunes the chemical environment in the GPS to catalyze peptide bond hydrolysis. In the cell-adhesion GPCRs and PKD proteins, the GPS motif is always located at the end of their long N-terminal extracellular regions, immediately before the first transmembrane helix of the respective protein.


The actual alignment was detected with superfamily member smart00303:

Pssm-ID: 470616  Cd Length: 49  Bit Score: 58.17  E-value: 3.18e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 2217306235  3005 YTSLCQYFSEEDMVWRTEGLLPLEETSpRQAVCLTRHLTAFGASLFVPPS 3054
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNG-THTTCSCNHLTTFAVLMDVPPI 49
 
Name Accession Description Interval E-value
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
46-2722 0e+00

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 4838.88  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235   46 DISNNKISTLEEGIFANLFNLSEINLSGNPFECDCGLAWLPRWAEEQQVRVVQPEAATCAGPGSLAGQPLLGIPLLDSGC 125
Cdd:TIGR00864    1 DISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEAALCAGPGALAGQPLLGIPLLDSGC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  126 GEEYVACLPDNSSGTVAAVS----FSAAHEGLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSASFACLSLCSGPP 201
Cdd:TIGR00864   81 DEEYVACLKDNSSGGGAARSelviFSAAHEGLFQPEACNAFCFSAGHGLAALGEQGECLCGAAQPSEANFACESLCSGPP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  202 PPPAPTCRGPTLLQHVFPASPGATLVGPHGPLASGQLAAFHIAAPLPVTATRWDFGDGSAEVDAAGP----AASHRYVLP 277
Cdd:TIGR00864  161 PPPAAACRGPQLLEHIFPALPGAPIQGPHGPIASGQLAAFHAAAPLAPTAMRWDFGDGSAEVDAAGAggttAASHKYGHP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  278 GRYHVTAVLALGAGSALLGTDVQVEAAPAALELVCPSSVQSDESLDLSIQNRGGSGLEAAYSIVALGEEPARAVHPLCPS 357
Cdd:TIGR00864  241 GRYHVSAMGALGAGKALAGGDVQVEAAPAALELHCPSLVQADESLDLSIQNRGGSDLDAAWKITAHGEEPAKASHPHCPK 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  358 DTEIFPGNGHCYRLVVEKAAWLQAQEQCQAWAGAALAMVDSPAVQRFLVSRVTRSLD--VWIGFSTVQGVEVGPAPQGEA 435
Cdd:TIGR00864  321 DGEIFEENGHCFQIVPEEAAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSLDrgVWIGFSDVNGAEKGPAHQGEA 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  436 FSLESCQNWLPGEPHPATAEHCVRLGPTGWCNTDLCSAPHSYVCELQPGGPVQDAENLLVGAPSGDLQGPLTPLAQQDGL 515
Cdd:TIGR00864  401 FEAEECEEGLAGEPHPARAEHCVRLDPRGQCNSDLCNAPHAYVCELNPGGPVPDAENFAMGAASFDLHGLLQALAAMDGL 480
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  516 SAP-HEPVEVMVFPGLRLSREAFLTTAEFGTQELRRPAQLRLQVYRL---LSTAGTPENGSEPESRSPDNRTQLAPACMP 591
Cdd:TIGR00864  481 PAPpHEGVEVLLFPALRFSRAAFLSSAEFGTQELRRPAHILFQIYRLrcrLPGAGGPACGPEAECRPPDNRSADAPACMK 560
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  592 GGRWCPGANICLPLDASCHPQACANGCTSGPGLPGAP----YALWREFLFSVPAGPPAQYSlllpvcqvlacvlspcvpr 667
Cdd:TIGR00864  561 GEQWCPFAHICLPLDAPCHPQACANGCSQGHGLPGAArmplYALQREFLFSLPAGPAAHVL------------------- 621
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  668 satvlpvlvtstvgrlleVTLHGQDVLMLPGDLVGLQHDAGPGALLHCSPAPGHPGPRAPYLSANASSWLPH-------- 739
Cdd:TIGR00864  622 ------------------LQDHGEDLLMLPGDLIALQHDAGPAALIHCQPAPGHPGPRAPVFAANASEWFGHnntpvppd 683
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  740 ----------------LPAQLEGTWA----CPACALRLLAATEQLTVLLGLRPNPGLRLPGRYEVRAEVGNGVSRHNLSC 799
Cdd:TIGR00864  684 nlagdgadplpdpeldLKALLEGTRAswleCAACAIRLLAAGEQETRLLGAELNAGLPLPGLYELLAESAKGSDLHNASC 763
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  800 SFDVVSPVAGLRVIYPAPRDGRLYVPTNGSALVLQVDSGANATATARWPGGSVSARFENVCPALVA-------TFVPGCP 872
Cdd:TIGR00864  764 SFDVLPPLAGLRVIHPAPQDGRLFLESNGSALLLQVDSGANAEAKAFWPGGNSSARFENVCPAEFAsrlchpsTFEGGCA 843
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  873 WETNDTLFSVVALPWLSEGEHV----VDVVVENSASRANLSLRVTAEEPICGLRATPSPEARVLQGVLVRYSPVVEAGSD 948
Cdd:TIGR00864  844 EEAEDSLFAVLALNWLKEGEHTgpvqVDLMAENNASEANLSLLVQAEEPICGLRAQPHPAARVLMESLVRYSASVEAGSD 923
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  949 MVFRWTINDKQSLTFQNVVFNVIYQSAAVFKLSpedaamavLTASNHVSNVTVNYNVTVERMNRMQGLQVSTVPAVLSPN 1028
Cdd:TIGR00864  924 MTFKWTIDDKPFFTFQNTVFNVIYQHAAVFKLS--------LTAMNHVSNLTEDFNVTVDRLNPMQGLQVKGVPAVLPPG 995
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1029 ATLALTAGVLVDSAVEVAFLWTFGDGEQALHQFQPPYNESFPVPDPSVAQVLVEHNVMHTYAAPGEYLLTVLASNAFENL 1108
Cdd:TIGR00864  996 ATLALTAGVLIDMAVEAAFLWSFGDGEQALFEFKPPYNESFPCPDPSPAQVLLEHNVMHIYAAPGEYLATVLASNAFENI 1075
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1109 TQQVPVSVRASLPSVAVGVSDGVLVAGRPVTFYPHPLPSPGGVLYTWDFGDGSPVLTQSQPAANHTYASRGTYHVRLEVN 1188
Cdd:TIGR00864 1076 SQQINMSVRAILPRVAIGTEDGLLLAGKPADFEAHPLPSPGGIHYEWDFGDGSALLQGRQPAAAHTFAKRGPFHVCLEVN 1155
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1189 NTVSGAAAQADVRVFEELRGLSVDMSLAVEQGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTV 1268
Cdd:TIGR00864 1156 NTISGAAACADMFAFEEIEGLSADMSLATELGAATTVRAALQSGDNITWTFDMGDGKSLSGPEATVEHKYAKAGNCTVNI 1235
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1269 GAASPAGHLARSLHVLVFVLEVLRVEPAACIPTQPDARLTAYVTGNPAHYLFDWTFGDGSSNTTVRGCPTVTHNFTRSGT 1348
Cdd:TIGR00864 1236 GAANAAGHGARIIHVEVFVFEVAGIEPAACIGEHADANFRARVSGNAAHYLFDWSFGDGSPNETHHGCPGISHNFRGNGT 1315
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1349 FPLALVLSSRVNRAHYFTSICVEPEVGNVTLQPERQFVQLGDEAWLVACAWPPFPYRYTWDFGTEEAAPTRARGPEVTFI 1428
Cdd:TIGR00864 1316 FPLALTISSGVNKAHFFTQICVEPELGKISLQAEKQFFALGDEAQFQACAEPEFNYRYEWDFGGEEAAPLPAAGAEVTFI 1395
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1429 YRDPGSYLVTVTASNNISAANDSALVEVQEPVLVTSIKVNGSLG--LELQQPYLFSAVGRGRPASYLWDLGDGGWLEGPE 1506
Cdd:TIGR00864 1396 YNDPGCYLVTVAASNNISAANDSALIEVLEPVGATSFKHNGSHGnnLELGQPYLFSAFGRARNASYLWDFGDGGLLEGPE 1475
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1507 VTHAYNSTGDFTVRVAGWNEVSRSEAWLNVTVKRRVRGLVVNASRTVVPLNGSVSFSTSLEAGSDVRYSWVLCDRCTPIP 1586
Cdd:TIGR00864 1476 ILHAFNSPGDFNIRLAAANEVGKNEATLNVAVKARVRGLTINASLTNVPLNGSVHFEAHLDAGDDVRFSWILCDHCTPIF 1555
                         1610      1620      1630      1640      1650      1660      1670      1680
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1587 GGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVYVLQLIEGLQVVGG-------------GRYFPTNHTVQLQAVVRDG 1653
Cdd:TIGR00864 1556 GGNTIFYTFRSVGTFNIIVTAENDVGAAQASIFLFVLQEIEGLQILGEtaegggggvqeldGCYFETNHTVQFHAGFKDG 1635
                         1690      1700      1710      1720      1730      1740      1750      1760
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1654 TNVSYSWTAWRD---RGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWADCTMDFVEPVGWLMVAASPNPAAVNTSV 1730
Cdd:TIGR00864 1636 TNLSFSWNAILDnepDGPAFAGSGKGAKLNPLEAGPCDIFLQAANLLGQATADCTIDFLEPAGNLMLAASDNPAAVNALI 1715
                         1770      1780      1790      1800      1810      1820      1830      1840
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1731 TLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPGLHLVTMTAGNPLGSANATVEVDVQVPVSGLSIRASEPGGS- 1809
Cdd:TIGR00864 1716 NLSAELAEGSGLQYRWFLEEGDDLETSEPFMSHSFPSAGLHLVTMKAFNELGSANASEEVDVQEPISGLKIRAADAGEQn 1795
                         1850      1860      1870      1880      1890      1900      1910      1920
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1810 FVAAGSSVPFWGQLATGTNVSWCWAVPGGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSATYNLTAEEPIVGLVLWAS 1889
Cdd:TIGR00864 1796 FFAADSSVCFQGELATGTNVSWCWAIDGGSSKMGKHACMTFPDAGTFAIRLNASNAVSGKSASREFFAEEPIFGLELKAS 1875
                         1930      1940      1950      1960      1970      1980      1990      2000
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1890 SKVVAPGQLVHFQILLAAGSAVTFRLQVGGANPEVL-PGPRFSHSFPRVGDHVVSVRGKNHVSWAQAQVRIVVLEAVSGL 1968
Cdd:TIGR00864 1876 KKIAAIGEKVEFQILLAAGSAVNFRLQIGGAAPEVLqPGPRFSHSFPRVDDHMVNLRAKNEVSCAQANLHIEVLEAVRGL 1955
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1969 QVPNCCEPGIATGTERNFTARVQRGSRVAYAWYFSLQKVQGDSLVILSGRDVTYTPVAAGLLEIQVRAFNALGSENRTLV 2048
Cdd:TIGR00864 1956 QIPDCCAAGIATGEEKNFTANVQRGKPVAFAWTFDLHHLHGDSLVIHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQ 2035
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2049 LEVQDAVQYVALQSGP--CFTNRSAQFEAATSPSPRRVAYHWDFGDGSPGQDTDEPRAEHSYLRPGDYRVQVNASNLVSF 2126
Cdd:TIGR00864 2036 LEAQDALMDAALQAGPqdCFTNKMAQFEAATSPKPNFMACHWDFGDGSAGQDTDEPRAEHEYLHPGDYRVQVNASNLVSF 2115
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2127 FVAQATVTVQVLACREPEVDVVLPLQVLMRRSQRNYLEAHVDLRDCVTYQTEYRWEVYRTASCQRPGRPARVALPG---- 2202
Cdd:TIGR00864 2116 FSAHAEINVQVLACEEPEVDVVLALQLAIRRSQPNLLEAHVDLKDCLRYGAEYLWEILRAASCDNDGHFARGALNGatrs 2195
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2203 ---------VDVSRPRLVLPRLALPVGHYCFVFVVSFGDTPLTQSIQANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGS 2273
Cdd:TIGR00864 2196 fpviplpaeVDVQRLQLSLPKLALAAGHYCFVFSLSFEDTPLKKAACANLGVAAARLMPIIEGGSYRVWSDTQDLQLDAE 2275
                         2330      2340      2350      2360      2370      2380      2390      2400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2274 ESYDPNLEDGDQTPLSFHWACVASTQREAGGCALNFGPRG-SSTVTIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLI 2352
Cdd:TIGR00864 2276 ESYDPNLDDDDQSLLHFHWACQASSKGEAGCCALNFGLGGkGPTLGIPGEELAAGIEYTFKLSIGKAGMKEEATNQTVLI 2355
                         2410      2420      2430      2440      2450      2460      2470      2480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2353 RSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGRCLNCSSGSKRGRWAARTFSNKTLVLDETTTSTGSAGMRLVLRRGVL 2432
Cdd:TIGR00864 2356 QSGHIPIVSLECVSCKAQALYEVSQNSYVYLEGRCLNCQSGFHRGRWAARTFQNDTLVLDESSTSTGSAGMNLVLRQGVL 2435
                         2490      2500      2510      2520      2530      2540      2550      2560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2433 RDGEGYTFTLTVLGRSGEEEGCASIRLSPNRPPLGGSCRLFPLG--------------AVHALTTKVHFECTGWHDAEDA 2498
Cdd:TIGR00864 2436 HDGEGYNFTLHVLDDSGDEEGAASIRLHHNMPPDGGECHLFPGGetgqehgdkedevwAIEALLDKVHFECSGWHDAEDA 2515
                         2570      2580      2590      2600      2610      2620      2630      2640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2499 GAPLVYALLLRRCRQGHCEEFCVYKGSLSSYGAVLPPGFR-PHFEVGLAVVVQDQLGAAVVALNRSLAITLPEPNGSATG 2577
Cdd:TIGR00864 2516 EAPLLYALLLNRCRDDHCEEFCVYKGSLPEHGAFLPPGFRsAHFEVGLAITVEDHLGAAIRALNKSIAITLPDPNGEASG 2595
                         2650      2660      2670      2680      2690      2700      2710      2720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2578 LTVWLHGLTASVLPGLLRQADPQHVIEYSLALVTVLNEYERALDVAAEPKHERQHRAQIRKNITETLVSLRVHTVDDIQQ 2657
Cdd:TIGR00864 2596 LPHWLHDLIASKLKGLLDQADFQHVIELSLALITVLNEYEQALDSAAEPKHERGHRAQIRKNITEALTALDLHTVDDIQQ 2675
                         2730      2740      2750      2760      2770      2780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217306235 2658 IAAALAQCMGPSRELVCRSCLKQTLHKLEAMMLILQAETTAGTVTPTAIGDSILNITGDLIHLAS 2722
Cdd:TIGR00864 2676 IAAALAQCMAPSREFICEECLKQTLHKLEAMLEILQADTKAGIVTPTAIADNILNIMGDLIHLAS 2740
REJ pfam02010
REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor ...
2165-2608 1.08e-132

REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly Swiss:Q26627. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains.


Pssm-ID: 366875 [Multi-domain]  Cd Length: 448  Bit Score: 425.38  E-value: 1.08e-132
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2165 AHVDLRDCV-TYQTEYRWEVYRTASC---QRPGRPARVALPGVDvsrprlvLPRLALPVGHYCFVFVVSFGDTP-LTQSI 2239
Cdd:pfam02010    1 ASVELNGCFsAYTIDYLWSVFTVSSNlnlQTISSPKDLVLPQLT-------IPSGTLPYGTYVFTLTVSLSSTPsLAGTD 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2240 QANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGSESYDPNLEDGDQTPLSFHWACVAST------QREAGGCA-----LN 2308
Cdd:pfam02010   74 IITVTVQPSPLVAVIDGGSSRVVGYNQDLTLDGSESYDPDVDPGSSSGLTYLWSCRRSSsgdnplLNNDPVCFsdqneGT 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2309 FGPRGSSTVTIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRssYVYLEGRCL 2388
Cdd:pfam02010  154 LLQSTSSSLTIPASTLQANVTYTFKLTVSKGSRNSASTTQTILVVDGNPPIIILSCISNCNRKNNPVDR--LVLLASTCL 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2389 NCSSGSKRG--RWAARTFSNKTLVLD--ETTTSTGSAGMRLVLRRGVLRDGEGYTFTLTVLGRSGEEEGCASIRLSPNRP 2464
Cdd:pfam02010  232 NCSSDLSDVtyRWLSLGSENTSLVLDqlNSQTSTGRSGPYLVIKAGVLQSGVSYRFTLIVTVYPGLVSGLASISFITNAP 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2465 PLGGSCRLFPLGAvHALTTKVHFECTGWHDAEDagaPLVYALLLRRCRQGHCEEFCVYKGSLS-SYGAVLPPGFRPH-FE 2542
Cdd:pfam02010  312 PTGGTCSVTPTEG-TALETKFTVTCQGWTDDDL---PLTYQFGDISFREASEEWFLLYEGSSQiSISTFLPPGLPANdYQ 387
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306235 2543 VGLAVVVQDQLGAAvVALNRSLAITLPEPNGSatglTVWLHGLTASVLPGLLRQADPQHVIEYSLA 2608
Cdd:pfam02010  388 VTVVVVVYDSLGAA-TSVSLTITVTPPSSSDE----LLYFLLGTTSDLSALLQSGDPQQAAQLILA 448
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
3112-3231 8.54e-58

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans that are widely expressed in various cell types and whose biological functions remain poorly defined. In human, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause for autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238850  Cd Length: 120  Bit Score: 196.34  E-value: 8.54e-58
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3112 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGD--RAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAW 3189
Cdd:cd01752      1 YLYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPekPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSW 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 2217306235 3190 FLQHVIVRDLQTARSAFFLVNDWLSVETEanGGLVEKEVLAA 3231
Cdd:cd01752     81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120
Polycystin_dom pfam20519
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
3707-3885 5.14e-56

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain display a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


Pssm-ID: 466668  Cd Length: 199  Bit Score: 194.56  E-value: 5.14e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3707 AFLAITRSEELWPWMAHVLLPYVHGNQS------------SPELGPPRLRQVRLQEA--LYPDPPGPRVHTCSAAGGFST 3772
Cdd:pfam20519    1 GLLTVTDLDDIWDWLSSVLLPALHSNKTpsglpgsfiayeSLLLGVPRLRQLRVRNSscLVHDKFVREINECHAGYSPPS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3773 SDY----DVGWESPHNGSGTW-AYSAPDLL-GAWSWGSCAVYDSGGYVQELGLSLEESRDRLRFLQLHNWLDNRSRAVFL 3846
Cdd:pfam20519   81 EDRklysALPYKPVHYGSKYWfIYTPPGLLmGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVFV 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 2217306235 3847 ELTRYSPAVGLHAAVTLRLEFPAAGRALAALSVRPFALR 3885
Cdd:pfam20519  161 DFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQSVKLL 199
PKD_channel pfam08016
Polycystin cation channel; This family contains the cation channel region from group II of ...
3886-4107 7.71e-44

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 160.52  E-value: 7.71e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3886 RLSAGLSLPLLT-SVCLLLFAVHFAVAEARTWHREGR------WRVLRLgawarwLLVALTAATALVRLAQLGAADRQWT 3958
Cdd:pfam08016    1 RYVTNRSLFILLcEIVFVVFFLYFVVEEILKIRKHRPsylrsvWNLLDL------AIVILSVVLIVLNIYRDFLADRLIK 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3959 rFVRGRPRRFTSFDQVAQLSSAARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELLGVTLGLVVLGVAYAQLA 4038
Cdd:pfam08016   75 -SVEASPVTFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFG 153
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217306235 4039 ILLVSSCVDSLWSVAQALLVLCPgtglsTLCPAESWH--------LSPLLCVGLWALRLWGALRLGAVILRWRYHAL 4107
Cdd:pfam08016  154 YLLFGTQAPNFSNFVKSILTLFR-----TILGDFGYNeifsgnrvLGPLLFLTFVFLVIFILLNLFLAIINDSYVEV 225
WSC smart00321
present in yeast cell wall integrity and stress response component proteins; Domain present in ...
126-220 2.52e-24

present in yeast cell wall integrity and stress response component proteins; Domain present in WSC proteins, polycystin and fungal exoglucanase


Pssm-ID: 214616 [Multi-domain]  Cd Length: 95  Bit Score: 99.85  E-value: 2.52e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235   126 GEEYVACLPDNSSGTVAAVSFSAAHegLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSA-----SFACLSLCSGp 200
Cdd:smart00321    1 GATYVGCYSDNSSRTLAAVSSYAYH--NMSVEACSNFCFSAGYALAALENGNECYCGDSLPSTSvsasdSSQCSTTCSG- 77
                            90       100
                    ....*....|....*....|
gi 2217306235   201 ppPPAPTCRGPTLLQHVFPA 220
Cdd:smart00321   78 --YPAEVCGGPNRLSVYVLA 95
PLAT pfam01477
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. ...
3114-3217 2.83e-23

PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.


Pssm-ID: 396180  Cd Length: 115  Bit Score: 97.50  E-value: 2.83e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3114 YEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGDR-AFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAWFLQ 3192
Cdd:pfam01477    1 YQVKVVTGDELGAGTDADVYISLYGKVGESAQLEITLDNpDFERGAEDSFEIDTDWDVGAILKINLHWDNNGLSDEWFLK 80
                           90       100
                   ....*....|....*....|....*.
gi 2217306235 3193 HVIV-RDLQTARSAFFLVNDWLSVET 3217
Cdd:pfam01477   81 SITVeVPGETGGKYTFPCNSWVYGSK 106
LH2 smart00308
Lipoxygenase homology 2 (beta barrel) domain;
3112-3214 1.95e-19

Lipoxygenase homology 2 (beta barrel) domain;


Pssm-ID: 214608 [Multi-domain]  Cd Length: 105  Bit Score: 86.16  E-value: 1.95e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  3112 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLD--GDRAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKglSPAW 3189
Cdd:smart00308    1 GKYKVTVTTGGLDFAGTTASVSLSLVGAEGDGKESKLDylFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEHR--HPEW 78
                            90       100
                    ....*....|....*....|....*
gi 2217306235  3190 FLQHVIVRDLQTARSAFFLVNDWLS 3214
Cdd:smart00308   79 FLKSITVKDLPTGGKYHFPCNSWVY 103
CLECT cd00037
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ...
367-480 1.01e-16

C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platlet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent; including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations. Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.


Pssm-ID: 153057 [Multi-domain]  Cd Length: 116  Bit Score: 78.82  E-value: 1.01e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  367 HCYRLVVEKAAWLQAQEQCQAWaGAALAMVDSPAVQRFLVSRVTRSL--DVWIGFSTVQGVEVGPAPQGEAFSleSCQNW 444
Cdd:cd00037      1 SCYKFSTEKLTWEEAQEYCRSL-GGHLASIHSEEENDFLASLLKKSSssDVWIGLNDLSSEGTWKWSDGSPLV--DYTNW 77
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 2217306235  445 LPGEPHPATAEHCVRL--GPTGWCNTDLCSAPHSYVCE 480
Cdd:cd00037     78 APGEPNPGGSEDCVVLssSSDGKWNDVSCSSKLPFICE 115
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1463-1769 9.91e-13

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 72.78  E-value: 9.91e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1463 TSIKVNGSLGLELQqpylFSAVGRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRVAGWNEV-SRSEAWLNVTVKRR 1541
Cdd:COG3291      2 TATPTSGCAPLTVQ----FTDTSSGNATSYEWDFGDGTTSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAP 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1542 VRGLVVNASRTVVPLNGSVSFSTSLEAGSDVrYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVY 1621
Cdd:COG3291     78 NPGVTTVTTSTTVTTLANTANGGATTVVAGS-TVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVT 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1622 VLQLIEGLQVVGGGRYFPTNHTVQLQAVVRDGTNVSYSWTAWRDRGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAW 1701
Cdd:COG3291    157 TSVSTTDVTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTLT 236
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306235 1702 ADCTMDFVEPVGWLMVAASPNPAAVNTSVTLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPG 1769
Cdd:COG3291    237 GISTGDAGTPGTNTVTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVSTTAD 304
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
3005-3054 3.18e-10

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 58.17  E-value: 3.18e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 2217306235  3005 YTSLCQYFSEEDMVWRTEGLLPLEETSpRQAVCLTRHLTAFGASLFVPPS 3054
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNG-THTTCSCNHLTTFAVLMDVPPI 49
PHA03247 PHA03247
large tegument protein UL36; Provisional
445-777 2.99e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 2.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  445 LPGEPHPATAEHCVrlgPTGWCnTDLCSAPHSYVCELQPGGPVQDAENLLVGAPSGDLQGPLTPLA-----------QQD 513
Cdd:PHA03247  2555 LPPAAPPAAPDRSV---PPPRP-APRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPlppdthapdppPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  514 GLSAPHEPVEVMVFPGL------------------RLSREAFLTTAEFGTQELRRPAqLRLQVYRLLSTAGTPENGSEPE 575
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPpperprddpapgrvsrprRARRLGRAAQASSPPQRPRRRA-ARPTVGSLTSLADPPPPPPTPE 2709
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  576 SRsPDNRTQLAPAcMPGGRWCPGANICLPLDASCHPQACANGCTSGPGLPGAPYAlwreflfsvPAGPPaqyslllpvcq 655
Cdd:PHA03247  2710 PA-PHALVSATPL-PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT---------TAGPP----------- 2767
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  656 vlacvlSPCVPRS-ATVLPVLVTSTVGRLLEVTLHGQDVLMLPGDLVGLQHDAGPGALLHCSPAPGHPGPRAPYLSANAS 734
Cdd:PHA03247  2768 ------APAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP 2841
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 2217306235  735 SWLPHLPAQLEGTWACPACALRLLAATEQLTVLLGLRPNPGLR 777
Cdd:PHA03247  2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVR 2884
 
Name Accession Description Interval E-value
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
46-2722 0e+00

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 4838.88  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235   46 DISNNKISTLEEGIFANLFNLSEINLSGNPFECDCGLAWLPRWAEEQQVRVVQPEAATCAGPGSLAGQPLLGIPLLDSGC 125
Cdd:TIGR00864    1 DISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEAALCAGPGALAGQPLLGIPLLDSGC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  126 GEEYVACLPDNSSGTVAAVS----FSAAHEGLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSASFACLSLCSGPP 201
Cdd:TIGR00864   81 DEEYVACLKDNSSGGGAARSelviFSAAHEGLFQPEACNAFCFSAGHGLAALGEQGECLCGAAQPSEANFACESLCSGPP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  202 PPPAPTCRGPTLLQHVFPASPGATLVGPHGPLASGQLAAFHIAAPLPVTATRWDFGDGSAEVDAAGP----AASHRYVLP 277
Cdd:TIGR00864  161 PPPAAACRGPQLLEHIFPALPGAPIQGPHGPIASGQLAAFHAAAPLAPTAMRWDFGDGSAEVDAAGAggttAASHKYGHP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  278 GRYHVTAVLALGAGSALLGTDVQVEAAPAALELVCPSSVQSDESLDLSIQNRGGSGLEAAYSIVALGEEPARAVHPLCPS 357
Cdd:TIGR00864  241 GRYHVSAMGALGAGKALAGGDVQVEAAPAALELHCPSLVQADESLDLSIQNRGGSDLDAAWKITAHGEEPAKASHPHCPK 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  358 DTEIFPGNGHCYRLVVEKAAWLQAQEQCQAWAGAALAMVDSPAVQRFLVSRVTRSLD--VWIGFSTVQGVEVGPAPQGEA 435
Cdd:TIGR00864  321 DGEIFEENGHCFQIVPEEAAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSLDrgVWIGFSDVNGAEKGPAHQGEA 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  436 FSLESCQNWLPGEPHPATAEHCVRLGPTGWCNTDLCSAPHSYVCELQPGGPVQDAENLLVGAPSGDLQGPLTPLAQQDGL 515
Cdd:TIGR00864  401 FEAEECEEGLAGEPHPARAEHCVRLDPRGQCNSDLCNAPHAYVCELNPGGPVPDAENFAMGAASFDLHGLLQALAAMDGL 480
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  516 SAP-HEPVEVMVFPGLRLSREAFLTTAEFGTQELRRPAQLRLQVYRL---LSTAGTPENGSEPESRSPDNRTQLAPACMP 591
Cdd:TIGR00864  481 PAPpHEGVEVLLFPALRFSRAAFLSSAEFGTQELRRPAHILFQIYRLrcrLPGAGGPACGPEAECRPPDNRSADAPACMK 560
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  592 GGRWCPGANICLPLDASCHPQACANGCTSGPGLPGAP----YALWREFLFSVPAGPPAQYSlllpvcqvlacvlspcvpr 667
Cdd:TIGR00864  561 GEQWCPFAHICLPLDAPCHPQACANGCSQGHGLPGAArmplYALQREFLFSLPAGPAAHVL------------------- 621
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  668 satvlpvlvtstvgrlleVTLHGQDVLMLPGDLVGLQHDAGPGALLHCSPAPGHPGPRAPYLSANASSWLPH-------- 739
Cdd:TIGR00864  622 ------------------LQDHGEDLLMLPGDLIALQHDAGPAALIHCQPAPGHPGPRAPVFAANASEWFGHnntpvppd 683
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  740 ----------------LPAQLEGTWA----CPACALRLLAATEQLTVLLGLRPNPGLRLPGRYEVRAEVGNGVSRHNLSC 799
Cdd:TIGR00864  684 nlagdgadplpdpeldLKALLEGTRAswleCAACAIRLLAAGEQETRLLGAELNAGLPLPGLYELLAESAKGSDLHNASC 763
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  800 SFDVVSPVAGLRVIYPAPRDGRLYVPTNGSALVLQVDSGANATATARWPGGSVSARFENVCPALVA-------TFVPGCP 872
Cdd:TIGR00864  764 SFDVLPPLAGLRVIHPAPQDGRLFLESNGSALLLQVDSGANAEAKAFWPGGNSSARFENVCPAEFAsrlchpsTFEGGCA 843
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  873 WETNDTLFSVVALPWLSEGEHV----VDVVVENSASRANLSLRVTAEEPICGLRATPSPEARVLQGVLVRYSPVVEAGSD 948
Cdd:TIGR00864  844 EEAEDSLFAVLALNWLKEGEHTgpvqVDLMAENNASEANLSLLVQAEEPICGLRAQPHPAARVLMESLVRYSASVEAGSD 923
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  949 MVFRWTINDKQSLTFQNVVFNVIYQSAAVFKLSpedaamavLTASNHVSNVTVNYNVTVERMNRMQGLQVSTVPAVLSPN 1028
Cdd:TIGR00864  924 MTFKWTIDDKPFFTFQNTVFNVIYQHAAVFKLS--------LTAMNHVSNLTEDFNVTVDRLNPMQGLQVKGVPAVLPPG 995
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1029 ATLALTAGVLVDSAVEVAFLWTFGDGEQALHQFQPPYNESFPVPDPSVAQVLVEHNVMHTYAAPGEYLLTVLASNAFENL 1108
Cdd:TIGR00864  996 ATLALTAGVLIDMAVEAAFLWSFGDGEQALFEFKPPYNESFPCPDPSPAQVLLEHNVMHIYAAPGEYLATVLASNAFENI 1075
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1109 TQQVPVSVRASLPSVAVGVSDGVLVAGRPVTFYPHPLPSPGGVLYTWDFGDGSPVLTQSQPAANHTYASRGTYHVRLEVN 1188
Cdd:TIGR00864 1076 SQQINMSVRAILPRVAIGTEDGLLLAGKPADFEAHPLPSPGGIHYEWDFGDGSALLQGRQPAAAHTFAKRGPFHVCLEVN 1155
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1189 NTVSGAAAQADVRVFEELRGLSVDMSLAVEQGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTV 1268
Cdd:TIGR00864 1156 NTISGAAACADMFAFEEIEGLSADMSLATELGAATTVRAALQSGDNITWTFDMGDGKSLSGPEATVEHKYAKAGNCTVNI 1235
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1269 GAASPAGHLARSLHVLVFVLEVLRVEPAACIPTQPDARLTAYVTGNPAHYLFDWTFGDGSSNTTVRGCPTVTHNFTRSGT 1348
Cdd:TIGR00864 1236 GAANAAGHGARIIHVEVFVFEVAGIEPAACIGEHADANFRARVSGNAAHYLFDWSFGDGSPNETHHGCPGISHNFRGNGT 1315
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1349 FPLALVLSSRVNRAHYFTSICVEPEVGNVTLQPERQFVQLGDEAWLVACAWPPFPYRYTWDFGTEEAAPTRARGPEVTFI 1428
Cdd:TIGR00864 1316 FPLALTISSGVNKAHFFTQICVEPELGKISLQAEKQFFALGDEAQFQACAEPEFNYRYEWDFGGEEAAPLPAAGAEVTFI 1395
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1429 YRDPGSYLVTVTASNNISAANDSALVEVQEPVLVTSIKVNGSLG--LELQQPYLFSAVGRGRPASYLWDLGDGGWLEGPE 1506
Cdd:TIGR00864 1396 YNDPGCYLVTVAASNNISAANDSALIEVLEPVGATSFKHNGSHGnnLELGQPYLFSAFGRARNASYLWDFGDGGLLEGPE 1475
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1507 VTHAYNSTGDFTVRVAGWNEVSRSEAWLNVTVKRRVRGLVVNASRTVVPLNGSVSFSTSLEAGSDVRYSWVLCDRCTPIP 1586
Cdd:TIGR00864 1476 ILHAFNSPGDFNIRLAAANEVGKNEATLNVAVKARVRGLTINASLTNVPLNGSVHFEAHLDAGDDVRFSWILCDHCTPIF 1555
                         1610      1620      1630      1640      1650      1660      1670      1680
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1587 GGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVYVLQLIEGLQVVGG-------------GRYFPTNHTVQLQAVVRDG 1653
Cdd:TIGR00864 1556 GGNTIFYTFRSVGTFNIIVTAENDVGAAQASIFLFVLQEIEGLQILGEtaegggggvqeldGCYFETNHTVQFHAGFKDG 1635
                         1690      1700      1710      1720      1730      1740      1750      1760
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1654 TNVSYSWTAWRD---RGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWADCTMDFVEPVGWLMVAASPNPAAVNTSV 1730
Cdd:TIGR00864 1636 TNLSFSWNAILDnepDGPAFAGSGKGAKLNPLEAGPCDIFLQAANLLGQATADCTIDFLEPAGNLMLAASDNPAAVNALI 1715
                         1770      1780      1790      1800      1810      1820      1830      1840
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1731 TLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPGLHLVTMTAGNPLGSANATVEVDVQVPVSGLSIRASEPGGS- 1809
Cdd:TIGR00864 1716 NLSAELAEGSGLQYRWFLEEGDDLETSEPFMSHSFPSAGLHLVTMKAFNELGSANASEEVDVQEPISGLKIRAADAGEQn 1795
                         1850      1860      1870      1880      1890      1900      1910      1920
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1810 FVAAGSSVPFWGQLATGTNVSWCWAVPGGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSATYNLTAEEPIVGLVLWAS 1889
Cdd:TIGR00864 1796 FFAADSSVCFQGELATGTNVSWCWAIDGGSSKMGKHACMTFPDAGTFAIRLNASNAVSGKSASREFFAEEPIFGLELKAS 1875
                         1930      1940      1950      1960      1970      1980      1990      2000
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1890 SKVVAPGQLVHFQILLAAGSAVTFRLQVGGANPEVL-PGPRFSHSFPRVGDHVVSVRGKNHVSWAQAQVRIVVLEAVSGL 1968
Cdd:TIGR00864 1876 KKIAAIGEKVEFQILLAAGSAVNFRLQIGGAAPEVLqPGPRFSHSFPRVDDHMVNLRAKNEVSCAQANLHIEVLEAVRGL 1955
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1969 QVPNCCEPGIATGTERNFTARVQRGSRVAYAWYFSLQKVQGDSLVILSGRDVTYTPVAAGLLEIQVRAFNALGSENRTLV 2048
Cdd:TIGR00864 1956 QIPDCCAAGIATGEEKNFTANVQRGKPVAFAWTFDLHHLHGDSLVIHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQ 2035
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2049 LEVQDAVQYVALQSGP--CFTNRSAQFEAATSPSPRRVAYHWDFGDGSPGQDTDEPRAEHSYLRPGDYRVQVNASNLVSF 2126
Cdd:TIGR00864 2036 LEAQDALMDAALQAGPqdCFTNKMAQFEAATSPKPNFMACHWDFGDGSAGQDTDEPRAEHEYLHPGDYRVQVNASNLVSF 2115
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2127 FVAQATVTVQVLACREPEVDVVLPLQVLMRRSQRNYLEAHVDLRDCVTYQTEYRWEVYRTASCQRPGRPARVALPG---- 2202
Cdd:TIGR00864 2116 FSAHAEINVQVLACEEPEVDVVLALQLAIRRSQPNLLEAHVDLKDCLRYGAEYLWEILRAASCDNDGHFARGALNGatrs 2195
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2203 ---------VDVSRPRLVLPRLALPVGHYCFVFVVSFGDTPLTQSIQANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGS 2273
Cdd:TIGR00864 2196 fpviplpaeVDVQRLQLSLPKLALAAGHYCFVFSLSFEDTPLKKAACANLGVAAARLMPIIEGGSYRVWSDTQDLQLDAE 2275
                         2330      2340      2350      2360      2370      2380      2390      2400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2274 ESYDPNLEDGDQTPLSFHWACVASTQREAGGCALNFGPRG-SSTVTIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLI 2352
Cdd:TIGR00864 2276 ESYDPNLDDDDQSLLHFHWACQASSKGEAGCCALNFGLGGkGPTLGIPGEELAAGIEYTFKLSIGKAGMKEEATNQTVLI 2355
                         2410      2420      2430      2440      2450      2460      2470      2480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2353 RSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGRCLNCSSGSKRGRWAARTFSNKTLVLDETTTSTGSAGMRLVLRRGVL 2432
Cdd:TIGR00864 2356 QSGHIPIVSLECVSCKAQALYEVSQNSYVYLEGRCLNCQSGFHRGRWAARTFQNDTLVLDESSTSTGSAGMNLVLRQGVL 2435
                         2490      2500      2510      2520      2530      2540      2550      2560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2433 RDGEGYTFTLTVLGRSGEEEGCASIRLSPNRPPLGGSCRLFPLG--------------AVHALTTKVHFECTGWHDAEDA 2498
Cdd:TIGR00864 2436 HDGEGYNFTLHVLDDSGDEEGAASIRLHHNMPPDGGECHLFPGGetgqehgdkedevwAIEALLDKVHFECSGWHDAEDA 2515
                         2570      2580      2590      2600      2610      2620      2630      2640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2499 GAPLVYALLLRRCRQGHCEEFCVYKGSLSSYGAVLPPGFR-PHFEVGLAVVVQDQLGAAVVALNRSLAITLPEPNGSATG 2577
Cdd:TIGR00864 2516 EAPLLYALLLNRCRDDHCEEFCVYKGSLPEHGAFLPPGFRsAHFEVGLAITVEDHLGAAIRALNKSIAITLPDPNGEASG 2595
                         2650      2660      2670      2680      2690      2700      2710      2720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2578 LTVWLHGLTASVLPGLLRQADPQHVIEYSLALVTVLNEYERALDVAAEPKHERQHRAQIRKNITETLVSLRVHTVDDIQQ 2657
Cdd:TIGR00864 2596 LPHWLHDLIASKLKGLLDQADFQHVIELSLALITVLNEYEQALDSAAEPKHERGHRAQIRKNITEALTALDLHTVDDIQQ 2675
                         2730      2740      2750      2760      2770      2780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217306235 2658 IAAALAQCMGPSRELVCRSCLKQTLHKLEAMMLILQAETTAGTVTPTAIGDSILNITGDLIHLAS 2722
Cdd:TIGR00864 2676 IAAALAQCMAPSREFICEECLKQTLHKLEAMLEILQADTKAGIVTPTAIADNILNIMGDLIHLAS 2740
REJ pfam02010
REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor ...
2165-2608 1.08e-132

REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly Swiss:Q26627. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains.


Pssm-ID: 366875 [Multi-domain]  Cd Length: 448  Bit Score: 425.38  E-value: 1.08e-132
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2165 AHVDLRDCV-TYQTEYRWEVYRTASC---QRPGRPARVALPGVDvsrprlvLPRLALPVGHYCFVFVVSFGDTP-LTQSI 2239
Cdd:pfam02010    1 ASVELNGCFsAYTIDYLWSVFTVSSNlnlQTISSPKDLVLPQLT-------IPSGTLPYGTYVFTLTVSLSSTPsLAGTD 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2240 QANVTVAPERLVPIIEGGSYRVWSDTRDLVLDGSESYDPNLEDGDQTPLSFHWACVAST------QREAGGCA-----LN 2308
Cdd:pfam02010   74 IITVTVQPSPLVAVIDGGSSRVVGYNQDLTLDGSESYDPDVDPGSSSGLTYLWSCRRSSsgdnplLNNDPVCFsdqneGT 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2309 FGPRGSSTVTIPRERLAAGVEYTFSLTVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRssYVYLEGRCL 2388
Cdd:pfam02010  154 LLQSTSSSLTIPASTLQANVTYTFKLTVSKGSRNSASTTQTILVVDGNPPIIILSCISNCNRKNNPVDR--LVLLASTCL 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2389 NCSSGSKRG--RWAARTFSNKTLVLD--ETTTSTGSAGMRLVLRRGVLRDGEGYTFTLTVLGRSGEEEGCASIRLSPNRP 2464
Cdd:pfam02010  232 NCSSDLSDVtyRWLSLGSENTSLVLDqlNSQTSTGRSGPYLVIKAGVLQSGVSYRFTLIVTVYPGLVSGLASISFITNAP 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 2465 PLGGSCRLFPLGAvHALTTKVHFECTGWHDAEDagaPLVYALLLRRCRQGHCEEFCVYKGSLS-SYGAVLPPGFRPH-FE 2542
Cdd:pfam02010  312 PTGGTCSVTPTEG-TALETKFTVTCQGWTDDDL---PLTYQFGDISFREASEEWFLLYEGSSQiSISTFLPPGLPANdYQ 387
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306235 2543 VGLAVVVQDQLGAAvVALNRSLAITLPEPNGSatglTVWLHGLTASVLPGLLRQADPQHVIEYSLA 2608
Cdd:pfam02010  388 VTVVVVVYDSLGAA-TSVSLTITVTPPSSSDE----LLYFLLGTTSDLSALLQSGDPQQAAQLILA 448
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
3112-3231 8.54e-58

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans that are widely expressed in various cell types and whose biological functions remain poorly defined. In human, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause for autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238850  Cd Length: 120  Bit Score: 196.34  E-value: 8.54e-58
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3112 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGD--RAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAW 3189
Cdd:cd01752      1 YLYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPekPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSW 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 2217306235 3190 FLQHVIVRDLQTARSAFFLVNDWLSVETEanGGLVEKEVLAA 3231
Cdd:cd01752     81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120
Polycystin_dom pfam20519
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
3707-3885 5.14e-56

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain display a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


Pssm-ID: 466668  Cd Length: 199  Bit Score: 194.56  E-value: 5.14e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3707 AFLAITRSEELWPWMAHVLLPYVHGNQS------------SPELGPPRLRQVRLQEA--LYPDPPGPRVHTCSAAGGFST 3772
Cdd:pfam20519    1 GLLTVTDLDDIWDWLSSVLLPALHSNKTpsglpgsfiayeSLLLGVPRLRQLRVRNSscLVHDKFVREINECHAGYSPPS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3773 SDY----DVGWESPHNGSGTW-AYSAPDLL-GAWSWGSCAVYDSGGYVQELGLSLEESRDRLRFLQLHNWLDNRSRAVFL 3846
Cdd:pfam20519   81 EDRklysALPYKPVHYGSKYWfIYTPPGLLmGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVFV 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 2217306235 3847 ELTRYSPAVGLHAAVTLRLEFPAAGRALAALSVRPFALR 3885
Cdd:pfam20519  161 DFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQSVKLL 199
PKD_channel pfam08016
Polycystin cation channel; This family contains the cation channel region from group II of ...
3886-4107 7.71e-44

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 160.52  E-value: 7.71e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3886 RLSAGLSLPLLT-SVCLLLFAVHFAVAEARTWHREGR------WRVLRLgawarwLLVALTAATALVRLAQLGAADRQWT 3958
Cdd:pfam08016    1 RYVTNRSLFILLcEIVFVVFFLYFVVEEILKIRKHRPsylrsvWNLLDL------AIVILSVVLIVLNIYRDFLADRLIK 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3959 rFVRGRPRRFTSFDQVAQLSSAARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELLGVTLGLVVLGVAYAQLA 4038
Cdd:pfam08016   75 -SVEASPVTFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFG 153
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217306235 4039 ILLVSSCVDSLWSVAQALLVLCPgtglsTLCPAESWH--------LSPLLCVGLWALRLWGALRLGAVILRWRYHAL 4107
Cdd:pfam08016  154 YLLFGTQAPNFSNFVKSILTLFR-----TILGDFGYNeifsgnrvLGPLLFLTFVFLVIFILLNLFLAIINDSYVEV 225
PLAT_repeat cd01756
PLAT/LH2 domain repeats of family of proteins with unknown function. In general, PLAT/LH2 ...
3112-3231 6.24e-28

PLAT/LH2 domain repeats of family of proteins with unknown function. In general, PLAT/LH2 consists of an eight stranded beta-barrel and it's proposed function is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238854  Cd Length: 120  Bit Score: 111.11  E-value: 6.24e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3112 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLD---GDRAFHRNSLDIFRIATPhSLGSVWKIRVWHDNKGLSPA 3188
Cdd:cd01756      1 VTYEVTVKTGDVKGAGTDANVFITLYGENGDTGKRKLKksnNKNKFERGQTDKFTVEAV-DLGKLKKIRIGHDNSGLGAG 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 2217306235 3189 WFLQHVIVRDLQTARSAFFLVNDWLSveTEANGGLVEKEVLAA 3231
Cdd:cd01756     80 WFLDKVEIREPGTGDEYTFPCNRWLD--KDEDDGQIVRELYPS 120
WSC smart00321
present in yeast cell wall integrity and stress response component proteins; Domain present in ...
126-220 2.52e-24

present in yeast cell wall integrity and stress response component proteins; Domain present in WSC proteins, polycystin and fungal exoglucanase


Pssm-ID: 214616 [Multi-domain]  Cd Length: 95  Bit Score: 99.85  E-value: 2.52e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235   126 GEEYVACLPDNSSGTVAAVSFSAAHegLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSA-----SFACLSLCSGp 200
Cdd:smart00321    1 GATYVGCYSDNSSRTLAAVSSYAYH--NMSVEACSNFCFSAGYALAALENGNECYCGDSLPSTSvsasdSSQCSTTCSG- 77
                            90       100
                    ....*....|....*....|
gi 2217306235   201 ppPPAPTCRGPTLLQHVFPA 220
Cdd:smart00321   78 --YPAEVCGGPNRLSVYVLA 95
PLAT pfam01477
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. ...
3114-3217 2.83e-23

PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.


Pssm-ID: 396180  Cd Length: 115  Bit Score: 97.50  E-value: 2.83e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3114 YEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLDGDR-AFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAWFLQ 3192
Cdd:pfam01477    1 YQVKVVTGDELGAGTDADVYISLYGKVGESAQLEITLDNpDFERGAEDSFEIDTDWDVGAILKINLHWDNNGLSDEWFLK 80
                           90       100
                   ....*....|....*....|....*.
gi 2217306235 3193 HVIV-RDLQTARSAFFLVNDWLSVET 3217
Cdd:pfam01477   81 SITVeVPGETGGKYTFPCNSWVYGSK 106
LH2 smart00308
Lipoxygenase homology 2 (beta barrel) domain;
3112-3214 1.95e-19

Lipoxygenase homology 2 (beta barrel) domain;


Pssm-ID: 214608 [Multi-domain]  Cd Length: 105  Bit Score: 86.16  E-value: 1.95e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  3112 FKYEILVKTGWGRGSGTTAHVGIMLYGVDSRSGHRHLD--GDRAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKglSPAW 3189
Cdd:smart00308    1 GKYKVTVTTGGLDFAGTTASVSLSLVGAEGDGKESKLDylFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEHR--HPEW 78
                            90       100
                    ....*....|....*....|....*
gi 2217306235  3190 FLQHVIVRDLQTARSAFFLVNDWLS 3214
Cdd:smart00308   79 FLKSITVKDLPTGGKYHFPCNSWVY 103
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1717-1786 1.30e-17

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 79.74  E-value: 1.30e-17
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1717 VAASPNPAAVNTSVTLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPGLHLVTMTAGNPLGSANA 1786
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
CLECT smart00034
C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function ...
355-480 1.51e-17

C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function as calcium-dependent carbohydrate binding modules.


Pssm-ID: 214480 [Multi-domain]  Cd Length: 124  Bit Score: 81.49  E-value: 1.51e-17
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235   355 CPSDTEIFpgNGHCYRLVVEKAAWLQAQEQCQAwAGAALAMVDSPAVQRFL---VSRVTRSLDVWIGFSTVQGVEVGPAP 431
Cdd:smart00034    1 CPSGWISY--GGKCYKFSTEKKTWEDAQAFCQS-LGGHLASIHSEAENDFVaslLKNSGSSDYYWIGLSDPDSNGSWQWS 77
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|
gi 2217306235   432 QGeaFSLESCQNWLPGEPhPATAEHCVRLGPTGWC-NTDLCSAPHSYVCE 480
Cdd:smart00034   78 DG--SGPVSYSNWAPGEP-NNSSGDCVVLSTSGGKwNDVSCTSKLPFVCE 124
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1122-1203 6.62e-17

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 78.26  E-value: 6.62e-17
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  1122 SVAVGVSDGVLVAGRPVTFYPHPLPSPGGVLYTWDFGDGSpvlTQSQPAANHTYASRGTYHVRLEVNNTVSGAAAQADVR 1201
Cdd:smart00089    1 VADVSASPTVGVAGESVTFTATSSDDGSIVSYTWDFGDGT---SSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVV 77

                    ..
gi 2217306235  1202 VF 1203
Cdd:smart00089   78 VQ 79
CLECT cd00037
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ...
367-480 1.01e-16

C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platlet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent; including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations. Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.


Pssm-ID: 153057 [Multi-domain]  Cd Length: 116  Bit Score: 78.82  E-value: 1.01e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  367 HCYRLVVEKAAWLQAQEQCQAWaGAALAMVDSPAVQRFLVSRVTRSL--DVWIGFSTVQGVEVGPAPQGEAFSleSCQNW 444
Cdd:cd00037      1 SCYKFSTEKLTWEEAQEYCRSL-GGHLASIHSEEENDFLASLLKKSSssDVWIGLNDLSSEGTWKWSDGSPLV--DYTNW 77
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 2217306235  445 LPGEPHPATAEHCVRL--GPTGWCNTDLCSAPHSYVCE 480
Cdd:cd00037     78 APGEPNPGGSEDCVVLssSSDGKWNDVSCSSKLPFICE 115
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1886-1955 1.43e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 76.66  E-value: 1.43e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1886 LWASSKVVAPGQLVHFQILLAAGSAVTFRLQVGGANPEVLPGPRFSHSFPRVGDHVVSVRGKNHVSWAQA 1955
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1631-1702 1.92e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 76.27  E-value: 1.92e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217306235 1631 VVGGGRYFPTNHTVQLQAVVRDGTNVSYSWTAWRdrGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWA 1702
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGD--SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PLAT cd00113
PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. ...
3113-3214 2.22e-16

PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. It consists of an eight stranded beta-barrel. The domain can be found in various domain architectures, in case of lipoxygenases, alpha toxin, lipases and polycystin, but also as a single domain or as repeats.The putative function of this domain is to facilitate access to sequestered membrane or micelle bound substrates.


Pssm-ID: 238061  Cd Length: 116  Bit Score: 77.76  E-value: 2.22e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3113 KYEILVKTGWGRGSGTTAHVGIMLYGVD-SRSGHRHLDGDRAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKGLSPAWFL 3191
Cdd:cd00113      2 RYTVTIKTGDKKGAGTDSNISLALYGENgNSSDIPILDGPGSFERGSTDTFQIDLKLDIGDITKVYLRRDGSGLSDGWYC 81
                           90       100
                   ....*....|....*....|...
gi 2217306235 3192 QHVIVRDLQTARSAFFLVNDWLS 3214
Cdd:cd00113     82 ESITVQALGTKKVYTFPVNRWVL 104
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1547-1616 2.86e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 75.89  E-value: 2.86e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1547 VNASRTVVPLNGSVSFSTSLEAGSDVRYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQD 1616
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
2062-2129 3.83e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 75.50  E-value: 3.83e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306235 2062 SGPCFTNRSAQFEAaTSPSPRRVAYHWDFGDgSPGQDTDEPRAEHSYLRPGDYRVQVNASNLVSFFVA 2129
Cdd:pfam00801    5 GTVVAAGQPVTFTA-TLADGSNVTYTWDFGD-SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1125-1196 4.65e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 75.50  E-value: 4.65e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217306235 1125 VGVSDGVLVAGRPVTFYPHpLPSPGGVLYTWDFGDgSPVLTQSQPAANHTYASRGTYHVRLEVNNTVSGAAA 1196
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTAT-LADGSNVTYTWDFGD-SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1379-1450 9.48e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 74.35  E-value: 9.48e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217306235 1379 LQPERQFVQLGDEAWLVACAWPPFPYRYTWDFGteEAAPTRARGPEVTFIYRDPGSYLVTVTASNNISAAND 1450
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFG--DSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
2054-2136 2.96e-15

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 73.25  E-value: 2.96e-15
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  2054 AVQYVALQSGPcfTNRSAQFEAATSPSPRRVAYHWDFGDGSpgqDTDEPRAEHSYLRPGDYRVQVNASNLVSFFVAQATV 2133
Cdd:smart00089    2 ADVSASPTVGV--AGESVTFTATSSDDGSIVSYTWDFGDGT---SSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTV 76

                    ...
gi 2217306235  2134 TVQ 2136
Cdd:smart00089   77 VVQ 79
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1214-1279 7.42e-15

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 72.03  E-value: 7.42e-15
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306235 1214 SLAVEQGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTVGAASPAGHLAR 1279
Cdd:pfam00801    5 GTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1806-1871 9.38e-15

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 71.65  E-value: 9.38e-15
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306235 1806 PGGSFVAAGSSVPFWGQLATGTNVSWCWAVPG--GSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSA 1871
Cdd:pfam00801    3 ASGTVVAAGQPVTFTATLADGSNVTYTWDFGDspGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1123-1197 3.54e-14

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 70.22  E-value: 3.54e-14
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306235 1123 VAVGVSDGVLV-AGRPVTFYPHPLPSPGGVLYTWDFGDGSpVLTQSQPAANHTYASRGTYHVRLEVNNTVSGAAAQ 1197
Cdd:cd00146      1 PTASVSAPPVAeLGASVTFSASDSSGGSIVSYKWDFGDGE-VSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTK 75
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1544-1622 1.11e-13

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 69.02  E-value: 1.11e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  1544 GLVVNASRTVVPLNGSVSFS-TSLEAGSDVRYSWVLCDRctPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVYV 1622
Cdd:smart00089    1 VADVSASPTVGVAGESVTFTaTSSDDGSIVSYTWDFGDG--TSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVV 78
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1462-1539 2.73e-13

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 67.86  E-value: 2.73e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  1462 VTSIKVNGSLGLeLQQPYLFSAV--GRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRVAGWNEVSRSEAWLNVTVK 1539
Cdd:smart00089    1 VADVSASPTVGV-AGESVTFTATssDDGSIVSYTWDFGDGTSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1463-1769 9.91e-13

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 72.78  E-value: 9.91e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1463 TSIKVNGSLGLELQqpylFSAVGRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRVAGWNEV-SRSEAWLNVTVKRR 1541
Cdd:COG3291      2 TATPTSGCAPLTVQ----FTDTSSGNATSYEWDFGDGTTSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAP 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1542 VRGLVVNASRTVVPLNGSVSFSTSLEAGSDVrYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVY 1621
Cdd:COG3291     78 NPGVTTVTTSTTVTTLANTANGGATTVVAGS-TVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVT 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1622 VLQLIEGLQVVGGGRYFPTNHTVQLQAVVRDGTNVSYSWTAWRDRGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAW 1701
Cdd:COG3291    157 TSVSTTDVTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTLT 236
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306235 1702 ADCTMDFVEPVGWLMVAASPNPAAVNTSVTLSAELAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPG 1769
Cdd:COG3291    237 GISTGDAGTPGTNTVTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVSTTAD 304
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1467-1532 2.24e-12

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 64.72  E-value: 2.24e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217306235 1467 VNGSLGLELQQPYLFSA-VGRGRPASYLWDLGD--GGWLEGPEVTHAYNSTGDFTVRVAGWNEVSRSEA 1532
Cdd:pfam00801    2 SASGTVVAAGQPVTFTAtLADGSNVTYTWDFGDspGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1719-1793 2.33e-12

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 65.17  E-value: 2.33e-12
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306235  1719 ASPNPAAVNTSVTLSAELAG-GSGVVYTWSLEEGLSWetSEPFTTHSFPTPGLHLVTMTAGNPLGSANATVEVDVQ 1793
Cdd:smart00089    6 ASPTVGVAGESVTFTATSSDdGSIVSYTWDFGDGTSS--TGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1813-1872 3.69e-12

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 64.78  E-value: 3.69e-12
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217306235  1813 AGSSVPFWGQLAT-GTNVSWCWAVPGGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSAT 1872
Cdd:smart00089   13 AGESVTFTATSSDdGSIVSYTWDFGDGTSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASAT 73
WSC pfam01822
WSC domain; This domain is involved in carbohydrate binding.
129-199 6.57e-12

WSC domain; This domain is involved in carbohydrate binding.


Pssm-ID: 460348  Cd Length: 82  Bit Score: 64.02  E-value: 6.57e-12
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217306235  129 YVACLPDNSsGTVAAVSFSAAHEGLLQPEACSAFCFSTGQGLAALSEQGWCLCGAAQPSSASFA----CLSLCSG 199
Cdd:pfam01822    1 YLGCYSDGT-GGRRLLLGSSGDYDDMTPEKCIAFCSAAGYTYAGLEYGGECYCGNSLPSGSALAdssdCNTPCPG 74
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1974-2045 1.79e-11

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 62.40  E-value: 1.79e-11
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217306235 1974 CEPGIATGTERNFTARVQRGSRVAYAWYFslqkvqGDSLV-ILSGRDVTYTPVAAGLLEIQVRAFNALGSENR 2045
Cdd:pfam00801    4 SGTVVAAGQPVTFTATLADGSNVTYTWDF------GDSPGtSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
2067-2137 1.97e-11

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 62.51  E-value: 1.97e-11
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217306235 2067 TNRSAQFEAATSPSPRRVAYHWDFGDGSpGQDTDEPRAEHSYLRPGDYRVQVNASNLVSfFVAQATVTVQV 2137
Cdd:cd00146     13 LGASVTFSASDSSGGSIVSYKWDFGDGE-VSSSGEPTVTHTYTKPGTYTVTLTVTNAVG-SSSTKTTTVVV 81
LRRCT smart00082
Leucine rich repeat C-terminal domain;
74-126 2.34e-11

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 61.29  E-value: 2.34e-11
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 2217306235    74 NPFECDCGLAWLPRWAeEQQVRVVQPEAATCAGPGSLAGqPLLGIPLLDSGCG 126
Cdd:smart00082    1 NPFICDCELRWLLRWL-QANEHLQDPVDLRCASPSSLRG-PLLELLHSEFKCP 51
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1546-1622 5.12e-11

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 61.36  E-value: 5.12e-11
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217306235 1546 VVNASRTVVPLNGSVSFS-TSLEAGSDVRYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQ-DSIFVYV 1622
Cdd:cd00146      3 ASVSAPPVAELGASVTFSaSDSSGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSStKTTTVVV 81
CLECT_DC-SIGN_like cd03590
C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific ...
355-480 5.37e-11

C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR); CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.


Pssm-ID: 153060 [Multi-domain]  Cd Length: 126  Bit Score: 62.71  E-value: 5.37e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  355 CPSDTEIFpgNGHCYRLVVEKAAWLQAQEQCQAwAGAALAMVDSPAVQRFLVSRVTRSLDVWIGFStVQGVE-----V-G 428
Cdd:cd03590      1 CPTNWKSF--QSSCYFFSTEKKSWEESRQFCED-MGAHLVIINSQEEQEFISKILSGNRSYWIGLS-DEETEgewkwVdG 76
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306235  429 PAPqgeafsLESCQNWLPGEP--HPATAEHCVRLGPT--GWcNTDLCSAPHSYVCE 480
Cdd:cd03590     77 TPL------NSSKTFWHPGEPnnWGGGGEDCAELVYDsgGW-NDVPCNLEYRWICE 125
LRR_8 pfam13855
Leucine rich repeat;
22-76 8.90e-11

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 59.85  E-value: 8.90e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217306235   22 DVSHNLLRALDVGLLANLSALAELDISNNKISTLEEGIFANLFNLSEINLSGNPF 76
Cdd:pfam13855    7 DLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
226-293 1.01e-10

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 60.09  E-value: 1.01e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217306235  226 LVGPHGPLASGQLAAFHIA-APLPVTATRWDFGDgSAEVDAAGPAASHRYVLPGRYHVTAVLALGAGSA 293
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATlADGSNVTYTWDFGD-SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSA 68
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1290-1371 1.89e-10

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 59.77  E-value: 1.89e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  1290 VLRVEPAACIPTQP-DARLTAYVTGNPAHYLFDWTFGDGssnTTVRGcPTVTHNFTRSGTFPLALVLSSRVNRAHYFTSI 1368
Cdd:smart00089    1 VADVSASPTVGVAGeSVTFTATSSDDGSIVSYTWDFGDG---TSSTG-PTVTHTYTKPGTYTVTLTVTNAVGSASATVTV 76

                    ...
gi 2217306235  1369 CVE 1371
Cdd:smart00089   77 VVQ 79
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
3005-3054 3.18e-10

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 58.17  E-value: 3.18e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 2217306235  3005 YTSLCQYFSEEDMVWRTEGLLPLEETSpRQAVCLTRHLTAFGASLFVPPS 3054
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNG-THTTCSCNHLTTFAVLMDVPPI 49
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1401-1457 4.54e-10

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 58.62  E-value: 4.54e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 2217306235  1401 PFPYRYTWDFGTEeaapTRARGPEVTFIYRDPGSYLVTVTASNNISAANDSALVEVQ 1457
Cdd:smart00089   27 GSIVSYTWDFGDG----TSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1639-1706 3.64e-09

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 55.92  E-value: 3.64e-09
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217306235  1639 PTNHTVQLQAVVR-DGTNVSYSWtawrDRGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWADCTM 1706
Cdd:smart00089   12 VAGESVTFTATSSdDGSIVSYTW----DFGDGTSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTV 76
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1216-1275 4.99e-09

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 55.97  E-value: 4.99e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217306235 1216 AVEQGAPVVVSAAVQ-TGDNITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTVGAASPAG 1275
Cdd:cd00146     10 VAELGASVTFSASDSsGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVG 70
CLECT_REG-1_like cd03594
C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and ...
355-480 6.54e-09

C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2); CLECT_REG-1_like: C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. REG-1 is a proliferating factor which participates in various kinds of tissue regeneration including pancreatic beta-cell regeneration, regeneration of intestinal mucosa, regeneration of motor neurons, and perhaps in tissue regeneration of damaged heart. REG-1 may play a role on the pathophysiology of Alzheimer's disease and in the development of gastric cancers. Its expression is correlated with reduced survival from early-stage colorectal cancer. REG-1 also binds and aggregates several bacterial strains from the intestinal flora and it has been suggested that it is involved in the control of the intestinal bacterial ecosystem. Rat lithostathine has calcium carbonate crystal inhibitor activity in vitro. REG-IV is unregulated in pancreatic, gastric, hepatocellular, and prostrate adenocarcinomas. REG-IV activates the EGF receptor/Akt/AP-1 signaling pathway in colorectal carcinoma. Ansocalcin, SCA-1 and -2 are found at high concentration in the calcified egg shell layer of goose and ostrich, respectively and tend to form aggregates. Ansocalcin nucleates calcite crystal aggregates in vitro.


Pssm-ID: 153064 [Multi-domain]  Cd Length: 129  Bit Score: 57.00  E-value: 6.54e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  355 CPSDTeiFPGNGHCYRLVVEKAAWLQAQEQCQAW-AGAALAMVDSPAVQRFLVSRV----TRSLDVWIGFSTVQGVEVGP 429
Cdd:cd03594      1 CPKGW--LPYKGNCYGYFRQPLSWSDAELFCQKYgPGAHLASIHSPAEAAAIASLIssyqKAYQPVWIGLHDPQQSRGWE 78
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217306235  430 APQGEAFSLEScqnWLPGEPHPaTAEHCVRL-GPTG---WcNTDLCSAPHSYVCE 480
Cdd:cd03594     79 WSDGSKLDYRS---WDRNPPYA-RGGYCAELsRSTGflkW-NDANCEERNPFICK 128
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1220-1286 8.43e-09

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 55.15  E-value: 8.43e-09
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306235  1220 GAPVVVSAAVQT-GDNITWTFDMGDGTVLSGPeaTVEHVYLRAQNCTVTVGAASPAGHLARSLHVLVF 1286
Cdd:smart00089   14 GESVTFTATSSDdGSIVSYTWDFGDGTSSTGP--TVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1134-1344 1.99e-08

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 59.68  E-value: 1.99e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1134 AGRPVTFYPHplpSPGGVL-YTWDFGDGSpvlTQSQPAANHTYASRGTYHVRLEVNNTV-SGAAAQADVRVFEELRGLSV 1211
Cdd:COG3291     10 APLTVQFTDT---SSGNATsYEWDFGDGT---TSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAPNPGVTT 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1212 DMSLAVEQGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEHVYLRAQNCTVTVGAASPAGHLARSLHVLVFVLEVL 1291
Cdd:COG3291     84 VTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTSVSTTD 163
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2217306235 1292 RVEPAACIPTQPDARLTAYVTGNPAHYLFDWTFGDGSSNTTVRGCPTVTHNFT 1344
Cdd:COG3291    164 VTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATS 216
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1474-1538 2.87e-08

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 53.65  E-value: 2.87e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1474 ELQQPYLFSAV--GRGRPASYLWDLGDGGWL--EGPEVTHAYNSTGDFTVRVAGWNEVSRSEA-WLNVTV 1538
Cdd:cd00146     12 ELGASVTFSASdsSGGSIVSYKWDFGDGEVSssGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTkTTTVVV 81
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1801-1872 3.39e-08

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 53.27  E-value: 3.39e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217306235 1801 IRASEPGGSFVAAGSSVPFWGQLATG---TNVSWCWAVPGGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSAT 1872
Cdd:cd00146      1 PTASVSAPPVAELGASVTFSASDSSGgsiVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTK 75
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1018-1110 4.16e-08

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 52.77  E-value: 4.16e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1018 VSTVPAVLSPNATLALTAGVLvdSAVEVAFLWTFGDgeqalhqfqppynesfpvpdpSVAQVLVEHNVMHTYAAPGEYLL 1097
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLA--DGSNVTYTWDFGD---------------------SPGTSGSGPTVTHTYLSPGTYTV 57
                           90
                   ....*....|...
gi 2217306235 1098 TVLASNAFENLTQ 1110
Cdd:pfam00801   58 TLTASNAVGSANA 70
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1717-1790 7.78e-08

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 52.50  E-value: 7.78e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217306235 1717 VAASPNPAA-VNTSVTLSAE-LAGGSGVVYTWSLEEGLSWETSEPFTTHSFPTPGLHLVTMTAGNPLGSAN---ATVEV 1790
Cdd:cd00146      3 ASVSAPPVAeLGASVTFSASdSSGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSStktTTVVV 81
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1015-1116 8.89e-08

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 52.07  E-value: 8.89e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  1015 GLQVSTVPAVLSPNATLALTAGVLVDSAVeVAFLWTFGDGeqalhqfqppynesfpvpdpsvaQVLVEHNVMHTYAAPGE 1094
Cdd:smart00089    1 VADVSASPTVGVAGESVTFTATSSDDGSI-VSYTWDFGDG-----------------------TSSTGPTVTHTYTKPGT 56
                            90       100
                    ....*....|....*....|..
gi 2217306235  1095 YLLTVLASNAFENLTQQVPVSV 1116
Cdd:smart00089   57 YTVTLTVTNAVGSASATVTVVV 78
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
22-77 1.30e-07

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 57.25  E-value: 1.30e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306235   22 DVSHNLLRALDVGLlANLSALAELDISNNKISTLEEgiFANLFNLSEINLSGNPFE 77
Cdd:COG4886    211 DLSGNQLTDLPEPL-ANLTNLETLDLSNNQLTDLPE--LGNLTNLEELDLSNNQLT 263
PLAT_plant_stress cd01754
PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of ...
3114-3221 1.41e-07

PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of its members are stress induced. In general, PLAT/LH2 consists of an eight stranded beta-barrel and it's proposed function is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238852  Cd Length: 129  Bit Score: 53.31  E-value: 1.41e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3114 YEILVKTGWGRGSGTTAHVGIMLYGVDSRS---------GHRHLDGDRAFHRNSLDIFRIATPHSLGSVWKIRVWHDNKG 3184
Cdd:cd01754      3 YTIYVQTGSIWKAGTDSRISLQIYDADGPGlrianleawGGLMGAGHDYFERGNLDRFSGRGPCLPSPPCWMNLTSDGTG 82
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 2217306235 3185 LSPAWFLQHVIVRDL-QTARSA--FFLVNDWLSVETEANG 3221
Cdd:cd01754     83 NHPGWYVNYVEVTQAgQHAPCMqhLFAVEQWLATDESPYM 122
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1146-1202 1.79e-07

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 51.50  E-value: 1.79e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306235 1146 PSPGGVL-YTWDFGDGSpvlTQSQPAANHTYASRGTYHVRLEVNNTVSGAAAQADVRV 1202
Cdd:pfam18911   29 DPDGDILsYRWDFGDGT---TATGANVSHTYAAPGTYTVTLTVTDDSGASNSTATDTV 83
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1301-1374 2.35e-07

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 50.96  E-value: 2.35e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217306235 1301 TQPDARLTAYVTGNPAHYLFDWTFGDGssNTTVRGCPTVTHNFTRSGTFPLALVLSSRVNRAhyfTSICVEPEV 1374
Cdd:cd00146     13 LGASVTFSASDSSGGSIVSYKWDFGDG--EVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSS---STKTTTVVV 81
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1298-1363 2.62e-07

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 50.46  E-value: 2.62e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217306235 1298 CIPTQPDARLTAYV-TGNPAHYLfdWTFGDgsSNTTVRGCPTVTHNFTRSGTFPLALVLSSRVNRAH 1363
Cdd:pfam00801    7 VVAAGQPVTFTATLaDGSNVTYT--WDFGD--SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSAN 69
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1882-1961 3.36e-07

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 50.53  E-value: 3.36e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  1882 VGLVLWASSKVVApGQLVHFQI-LLAAGSAVTFRLQVGgaNPEVLPGPRFSHSFPRVGDHVVSVRGKNHVSWAQAQVRIV 1960
Cdd:smart00089    1 VADVSASPTVGVA-GESVTFTAtSSDDGSIVSYTWDFG--DGTSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVV 77

                    .
gi 2217306235  1961 V 1961
Cdd:smart00089   78 V 78
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
22-77 7.36e-07

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 54.94  E-value: 7.36e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306235   22 DVSHNLLRALDVglLANLSALAELDISNNKISTLEEgiFANLFNLSEINLSGNPFE 77
Cdd:COG4886    234 DLSNNQLTDLPE--LGNLTNLEELDLSNNQLTDLPP--LANLTNLKTLDLSNNQLT 285
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
22-77 8.30e-07

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 54.94  E-value: 8.30e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306235   22 DVSHNLLRALDVGLlANLSALAELDISNNKISTLEEGIfANLFNLSEINLSGNPFE 77
Cdd:COG4886    188 DLSNNQITDLPEPL-GNLTNLEELDLSGNQLTDLPEPL-ANLTNLETLDLSNNQLT 241
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1403-1456 9.05e-07

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 49.42  E-value: 9.05e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217306235 1403 PYRYTWDFGTEEAAPTRarGPEVTFIYRDPGSYLVTVTASNNISAAN-DSALVEV 1456
Cdd:cd00146     29 IVSYKWDFGDGEVSSSG--EPTVTHTYTKPGTYTVTLTVTNAVGSSStKTTTVVV 81
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
22-76 1.21e-06

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 52.48  E-value: 1.21e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2217306235   22 DVSHNLLRALDVglLANLSALAELDISNNKISTLEE--GIFANLFNLSEINLSGNPF 76
Cdd:cd21340    126 NISGNNIDSLEP--LAPLRNLEQLDASNNQISDLEEllDLLSSWPSLRELDLTGNPV 180
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1719-1896 1.74e-06

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 53.52  E-value: 1.74e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1719 ASPNPAAVNTSVTLSAeLAGGSGVVYTWSLEEGLSweTSEPFTTHSFPTPGLHLVTMTAGNPLGSANA-----TVEVDVQ 1793
Cdd:COG3291      3 ATPTSGCAPLTVQFTD-TSSGNATSYEWDFGDGTT--STEANPSHTYTTPGTYTVTLTVTDAAGCSDTttktiTVGAPNP 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1794 VPVSGLSIRASEPGGSFVAAGSSVPFWGQLATGTNVSWCWAVPGGSSKRGPHVTMVFPDAGTFSIRLNASNAVSWVSATY 1873
Cdd:COG3291     80 GVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTSV 159
                          170       180
                   ....*....|....*....|...
gi 2217306235 1874 NLTAEEPIVGLVLWASSKVVAPG 1896
Cdd:COG3291    160 STTDVTSDGTTSASTNPSVTTDT 182
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1481-1521 2.34e-06

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 48.42  E-value: 2.34e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2217306235 1481 FSAVGRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRV 1521
Cdd:pfam18911   26 ASDDPDGDILSYRWDFGDGTTATGANVSHTYAAPGTYTVTL 66
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
22-77 2.63e-06

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 53.40  E-value: 2.63e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306235   22 DVSHNLLRALDVGLlANLSALAELDISNNKISTLEEGIfANLFNLSEINLSGNPFE 77
Cdd:COG4886    165 DLSNNQLTDLPEEL-GNLTNLKELDLSNNQITDLPEPL-GNLTNLEELDLSGNQLT 218
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
22-77 2.74e-06

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 53.01  E-value: 2.74e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306235   22 DVSHNLLRALDVGLlANLSALAELDISNNKISTLEEGIfANLFNLSEINLSGNPFE 77
Cdd:COG4886    142 DLSNNQLTDLPEPL-GNLTNLKSLDLSNNQLTDLPEEL-GNLTNLKELDLSNNQIT 195
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1402-1703 3.91e-06

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 52.36  E-value: 3.91e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1402 FPYRYTWDFGTeeaaPTRARGPEVTFIYRDPGSYLVTVTASNNI-SAANDSALVEVQEPVLVTSIKVNGSlglelqqpYL 1480
Cdd:COG3291     23 NATSYEWDFGD----GTTSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAPNPGVTTVTTST--------TV 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1481 FSAVGRGRPASYLWDLGDGGWLEGPEVTHAYNSTGDFTVRVAGWNEVSRSEAWLNVTVKRRVRGLVVNASRTVVPLNGSV 1560
Cdd:COG3291     91 TTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTSVSTTDVTSDGTT 170
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1561 SFSTSLEAGSDVRYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFVYVLQLIEGLQVVGGGRYFPT 1640
Cdd:COG3291    171 SASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTLTGISTGDAGTPGTNT 250
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217306235 1641 NHTVQLQAVVRDGTNVSYSWTAWRDRGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWAD 1703
Cdd:COG3291    251 VTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVSTTADVTGGTATLA 313
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
225-297 5.47e-06

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 47.11  E-value: 5.47e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306235  225 TLVGPHGPLAS-GQLAAFHIAAPLP--VTATRWDFGDGSAEVdAAGPAASHRYVLPGRYHVTAVLALGAGSALLGT 297
Cdd:cd00146      2 TASVSAPPVAElGASVTFSASDSSGgsIVSYKWDFGDGEVSS-SGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTKT 76
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
22-77 1.11e-05

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 51.09  E-value: 1.11e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217306235   22 DVSHNLLRALDVGLlANLSALAELDISNNKISTLEEGIfANLFNLSEINLSGNPFE 77
Cdd:COG4886    119 DLSGNQLTDLPEEL-ANLTNLKELDLSNNQLTDLPEPL-GNLTNLKSLDLSNNQLT 172
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1019-1378 1.52e-05

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 50.44  E-value: 1.52e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1019 STVPAVLSPNATLALTAgvlVDSAVEVAFLWTFGDGEQAlhqfqppynesfpvpdpsvaqvlVEHNVMHTYAAPGEYLLT 1098
Cdd:COG3291      2 TATPTSGCAPLTVQFTD---TSSGNATSYEWDFGDGTTS-----------------------TEANPSHTYTTPGTYTVT 55
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1099 VLASNAF-ENLTQQVPVSVRASLPSVAVGVSDGVLVAGRPVTFYPHPLPSPGGVLYTWDFGDGSPVLTQSQPAANHTYAS 1177
Cdd:COG3291     56 LTVTDAAgCSDTTTKTITVGAPNPGVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTT 135
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1178 RGTYHVRLEVNNTVSGAAAQADVRVFEELRGLSVDmslaveqGAPVVVSAAVQTGDNITWTFDMGDGTVLSGPEATVEHV 1257
Cdd:COG3291    136 TGTDTGLTGSTGTASDTATVTTSVSTTDVTSDGTT-------SASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTA 208
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1258 YLRAQNCTVTVGAASPAGHLARSLHVLVFVLEVLRVEPAACIPTQPDARLTAYVTGNPAHYLFDWTFGDGSSNTTVRGCP 1337
Cdd:COG3291    209 GVTTGATSGTSGTGSATSGVAVTDVTLTGISTGDAGTPGTNTVTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGL 288
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|.
gi 2217306235 1338 TVTHNFTRSGTFPLALVLSSRVNRAHYFTSICVEPEVGNVT 1378
Cdd:COG3291    289 GTTTAITPGNVSTTADVTGGTATLAVSSTLTTNDTTGSSST 329
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
249-293 2.17e-05

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 45.72  E-value: 2.17e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 2217306235  249 VTATRWDFGDGSAevdAAGPAASHRYVLPGRYHVTAVLALGAGSA 293
Cdd:pfam18911   34 ILSYRWDFGDGTT---ATGANVSHTYAAPGTYTVTLTVTDDSGAS 75
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1405-1456 2.32e-05

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 45.34  E-value: 2.32e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2217306235 1405 RYTWDFGTEeaapTRARGPEVTFIYRDPGSYLVTVTASNNISAANDSALVEV 1456
Cdd:pfam18911   36 SYRWDFGDG----TTATGANVSHTYAAPGTYTVTLTVTDDSGASNSTATDTV 83
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
250-302 2.62e-05

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 45.13  E-value: 2.62e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 2217306235   250 TATRWDFGDGSaevDAAGPAASHRYVLPGRYHVTAVLALGAGSALLGTDVQVE 302
Cdd:smart00089   30 VSYTWDFGDGT---SSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1976-2052 2.67e-05

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 45.13  E-value: 2.67e-05
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306235  1976 PGIATGTERNFTARVQR-GSRVAYAWYFslqkvqGDSLViLSGRDVTYTPVAAGLLEIQVRAFNALGSENRTLVLEVQ 2052
Cdd:smart00089    9 TVGVAGESVTFTATSSDdGSIVSYTWDF------GDGTS-STGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
2070-2136 3.74e-05

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 49.28  E-value: 3.74e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306235 2070 SAQFEAATSPSPrrVAYHWDFGDGSpgqDTDEPRAEHSYLRPGDYRVQVNASNLV-SFFVAQATVTVQ 2136
Cdd:COG3291     13 TVQFTDTSSGNA--TSYEWDFGDGT---TSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVG 75
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1885-1961 3.76e-05

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 44.79  E-value: 3.76e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217306235 1885 VLWASSKVVAPGQLVHFQI-LLAAGSAVTFRLQVGGANPEVLPGPRFSHSFPRVGDHVVSVRGKNHVSWAQAQVRIVV 1961
Cdd:cd00146      3 ASVSAPPVAELGASVTFSAsDSSGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTKTTTVV 80
CLECT_NK_receptors_like cd03593
C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs); ...
355-480 5.06e-05

C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs); CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.


Pssm-ID: 153063  Cd Length: 116  Bit Score: 45.40  E-value: 5.06e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  355 CPSDTEIFpgNGHCYRLVVEKAAWLQAQEQCQAwAGAALAMVDSPAVQRFLvSRVTRSLDVWIGFSTVQGVEVGPAPQGE 434
Cdd:cd03593      1 CPKDWICY--GNKCYYFSMEKKTWNESKEACSS-KNSSLLKIDDEEELEFL-QSQIGSSSYWIGLSREKSEKPWKWIDGS 76
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 2217306235  435 AFSlescqNWLpgEPHPATAE-HCVRLGPTGwCNTDLCSAPHSYVCE 480
Cdd:cd03593     77 PLN-----NLF--NIRGSTKSgNCAYLSSTG-IYSEDCSTKKRWICE 115
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
28-77 5.31e-05

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 49.16  E-value: 5.31e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 2217306235   28 LRALDVGLLANLSALAELDISNNKISTLEEGIfANLFNLSEINLSGNPFE 77
Cdd:COG4886    101 LDLSGNEELSNLTNLESLDLSGNQLTDLPEEL-ANLTNLKELDLSNNQLT 149
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
2070-2122 8.13e-05

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 43.80  E-value: 8.13e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217306235 2070 SAQFEAATS--PSPRRVAYHWDFGDGSPGqdtDEPRAEHSYLRPGDYRVQVNASN 2122
Cdd:pfam18911   19 TVTFDASASddPDGDILSYRWDFGDGTTA---TGANVSHTYAAPGTYTVTLTVTD 70
CLECT_CEL-1_like cd03589
C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and ...
355-481 1.34e-04

C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina; CLECT_CEL-1_like: C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CEL-1 CTLD binds three calcium ions and has a high specificity for N-acteylgalactosamine (GalNAc). CEL-1 exhibits strong cytotoxicity which is inhibited by GalNAc. This protein may play a role as a toxin defending against predation. Echinoidin is found in the coelomic fluid of the sea urchin and is specific for GalBeta1-3GalNAc. Echinoidin has a cell adhesive activity towards human cancer cells which is not mediated through the CTLD. Both CEL-1 and Echinoidin are multimeric proteins comprised of multiple dimers linked by disulfide bonds.


Pssm-ID: 153059  Cd Length: 137  Bit Score: 44.66  E-value: 1.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  355 CPSDTEIFpgNGHCYRLVVEKAAWLQAQEQCQAWAG----AALAMVDSPAVQRFL------VSRVTRSLDVWIGFStvQG 424
Cdd:cd03589      1 CPTFWTAF--GGYCYRFFGDRLTWEEAELRCRSFSIpgliAHLVSIHSQEENDFVydlfesSRGPDTPYGLWIGLH--DR 76
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217306235  425 VEVGPA--PQGEAFSLEscqNWLPGEPHPA-TAEHCVRLGPTG-----WcNTDLCSAPHSYVCEL 481
Cdd:cd03589     77 TSEGPFewTDGSPVDFT---KWAGGQPDNYgGNEDCVQMWRRGdagqsW-NDMPCDAVFPYICKM 137
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ...
3750-4275 1.45e-04

Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 48.33  E-value: 1.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3750 EALYPDPPGPRV--------HTCSAAGGFSTSDYDVGWESPHNGSGTWAYSAPDLLGAWSWGSCAVYDSGGYVQELGLSL 3821
Cdd:COG3321    850 SALYPGRGRRRVplptypfqREDAAAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAALALAAAALAAL 929
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3822 EESRDRLRFLQLHNWLDNRSRAVFLELTRYSPAVGLHAAVTLRLEFPAAGRALAALSVRPFALRRLSAGLSLPLLTSVCL 3901
Cdd:COG3321    930 LALVALAAAAAALLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAALALLAAAA 1009
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3902 LLFAVHFAVAEARTWHREGRWRVLRLGAWARWLLVALTAATALVRLAQLGAADRQWTRFVRGRPRRFTSFDQVAQLSSAA 3981
Cdd:COG3321   1010 LLLAAAAAAAALLALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELALAAAALALAAA 1089
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 3982 RGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELLGVTLGLVVLGVAYAQLAILLVSSCVDSLWSVAQALLVLCP 4061
Cdd:COG3321   1090 LAAAALALALAALAAALLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAALAAALAAALLA 1169
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 4062 GTGLSTLCPAESWHLSPLLCVGLWALRLWGALRLGAVILRWRYHALRGELYRPAWEPQDYEMVELFLRRLRLWMGLSKVK 4141
Cdd:COG3321   1170 AAALLLALALALAAALAAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALALLALAAAAAAVAALA 1249
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 4142 EFRHKVRFEGMEPLPSRSSRGSKVSPDVPPPSAGSDASHPSTSSSQLDGLSVSLGRLGTRCEPEPSRLQAVFEALLTQFD 4221
Cdd:COG3321   1250 AAAAALLAALAALALLAAAAGLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAAAALAAALLAAA 1329
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217306235 4222 RLNQA-TEDVYQLEQQLHSLQGRRSSRAPAGSSRGPSPGLRPALPSRLARASRGV 4275
Cdd:COG3321   1330 LAALAaAVAAALALAAAAAAAAAAAAAAAAAAALAAAAGAAAAAAALALAALAAA 1384
PHA03247 PHA03247
large tegument protein UL36; Provisional
445-777 2.99e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 2.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  445 LPGEPHPATAEHCVrlgPTGWCnTDLCSAPHSYVCELQPGGPVQDAENLLVGAPSGDLQGPLTPLA-----------QQD 513
Cdd:PHA03247  2555 LPPAAPPAAPDRSV---PPPRP-APRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPlppdthapdppPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  514 GLSAPHEPVEVMVFPGL------------------RLSREAFLTTAEFGTQELRRPAqLRLQVYRLLSTAGTPENGSEPE 575
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPpperprddpapgrvsrprRARRLGRAAQASSPPQRPRRRA-ARPTVGSLTSLADPPPPPPTPE 2709
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  576 SRsPDNRTQLAPAcMPGGRWCPGANICLPLDASCHPQACANGCTSGPGLPGAPYAlwreflfsvPAGPPaqyslllpvcq 655
Cdd:PHA03247  2710 PA-PHALVSATPL-PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT---------TAGPP----------- 2767
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235  656 vlacvlSPCVPRS-ATVLPVLVTSTVGRLLEVTLHGQDVLMLPGDLVGLQHDAGPGALLHCSPAPGHPGPRAPYLSANAS 734
Cdd:PHA03247  2768 ------APAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP 2841
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 2217306235  735 SWLPHLPAQLEGTWACPACALRLLAATEQLTVLLGLRPNPGLR 777
Cdd:PHA03247  2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVR 2884
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1637-1709 6.63e-04

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 41.33  E-value: 6.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1637 YFPTNHTVQLQAVVR-------DGTNVSYSWTaWRDrGPALAGSGKGFSLTVLEAGTYHVQLRATNMLGSAWADCTMDFV 1709
Cdd:cd00146      4 SVSAPPVAELGASVTfsasdssGGSIVSYKWD-FGD-GEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTKTTTVVV 81
LRR_8 pfam13855
Leucine rich repeat;
42-74 1.23e-03

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 39.82  E-value: 1.23e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 2217306235   42 LAELDISNNKISTLEEGIFANLFNLSEINLSGN 74
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNN 35
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1717-1794 2.20e-03

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 39.95  E-value: 2.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217306235 1717 VAASPNPAAVNTSVTLSAE---LAGGSGVVYTWSLEEGLSWETSEPftTHSFPTPGLHLVTMTAGNPLGSANATVEVDVQ 1793
Cdd:pfam18911    7 DAGGDRIVAEGETVTFDASasdDPDGDILSYRWDFGDGTTATGANV--SHTYAAPGTYTVTLTVTDDSGASNSTATDTVT 84

                   .
gi 2217306235 1794 V 1794
Cdd:pfam18911   85 V 85
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH