|
Name |
Accession |
Description |
Interval |
E-value |
| Ten_N |
pfam06484 |
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ... |
10-374 |
0e+00 |
|
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).
Pssm-ID: 461932 [Multi-domain] Cd Length: 367 Bit Score: 645.88 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 10 SLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGNRVTDLVHRESDEFSRQGTNFTLAELGICEP 89
Cdd:pfam06484 1 SLTKRRRDKERRYTSSSADSEECRVPTQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 90 SP-HRSGYCSDMGILHQGYSLSTGSDADSDTEGGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTDSDNENKSDDDNG 168
Cdd:pfam06484 81 SPrHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENG 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 169 RPIPPTSSSSLLPsaqlpSSHNPPP---VSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPPNHHSQ 245
Cdd:pfam06484 161 PPIPPSSSSSSPV-----EQHSPPPpslNENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPNFQNH 235
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 246 STLRPPLPP-PHNHT-LSHHHSSANSLNRNSLTNRRSQIHAP-APAPNDLATTPESVQLQDSWVLNSNVPLETRHFLFKT 322
Cdd:pfam06484 236 SRLRTPPPPlPPPHKqNQHHHPSINSLNRSSLTNRRNPSPAPtASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLFKT 315
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|..
gi 154090989 323 SSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWK 374
Cdd:pfam06484 316 GTGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1230-1560 |
7.67e-47 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 172.33 E-value: 7.67e-47
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1230 PVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNSPGHKYY----LAVDPvTGSLYVSDTNSRRIYRV 1303
Cdd:cd14953 25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRKI 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1304 kslsgakDLAGNSEVVAGTGEqclpfdeARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 1381
Cdd:cd14953 104 -------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG 169
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1382 sndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSLSKLA 1459
Cdd:cd14953 170 ----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSGDGGA 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1460 IHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncicYSGDDAYATDAILNSPSSL 1539
Cdd:cd14953 237 TAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGV 302
|
330 340
....*....|....*....|.
gi 154090989 1540 AVAPDGTIYIADLGNIRIRAV 1560
Cdd:cd14953 303 AVDAAGNLYVADTGNNRIRKI 323
|
|
| Tox-GHH |
pfam15636 |
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ... |
2680-2757 |
4.19e-37 |
|
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.
Pssm-ID: 464783 Cd Length: 78 Bit Score: 135.05 E-value: 4.19e-37
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 154090989 2680 EEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2757
Cdd:pfam15636 1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
|
|
| RhsA |
COG3209 |
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... |
1521-2458 |
1.61e-31 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 136.04 E-value: 1.61e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1521 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASP---GEQELYVFNADGIHQYTVS 1597
Cdd:COG3209 105 LTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAgggASAYGLTLGGAAAGPATGV 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1598 LVTGEYLYNFTYSADNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKAVSTQNLELGLMTYDGNTG 1677
Cdd:COG3209 185 GTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTG 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1678 LLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNS 1757
Cdd:COG3209 265 AGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGG 344
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1758 YQLCNNGTLRVMYANGMAVSFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLL 1837
Cdd:COG3209 345 TTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAG 424
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1838 SIDYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFA 1917
Cdd:COG3209 425 ALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDD 504
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1918 DGKVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLHAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFD 1989
Cdd:COG3209 505 TLGGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGT 584
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1990 YSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKVGPLVDKQIYRF 2069
Cdd:COG3209 585 TGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTG 664
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 2070 SEEGMINARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGR 2149
Cdd:COG3209 665 TGTGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTT 741
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 2150 IKEVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH----- 2223
Cdd:COG3209 742 GTLTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitv 821
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 2224 LLNPGNSARLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKT 2303
Cdd:COG3209 822 GSGGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RT 890
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 2304 NLGHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVFSINGLMIKQLQYTA 2383
Cdd:COG3209 891 DGGTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDP 950
|
890 900 910 920 930 940 950
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 154090989 2384 YGEIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwrNVGKEPAPfNLYMFKNNNPLSN 2458
Cdd:COG3209 951 FGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVNY 1019
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1230-1560 |
2.87e-12 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 69.66 E-value: 2.87e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1230 PVALAVGIDGSLFVGDF--NYIRRIFPsrnvtsilelRNKEFK-HSNSPGHKYY-LAVDPvTGSLYVSDTNSRRIYRVks 1305
Cdd:COG4257 19 PRDVAVDPDGAVWFTDQggGRIGRLDP----------ATGEFTeYPLGGGSGPHgIAVDP-DGNLWFTDNGNNRIGRI-- 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1306 lsGAKDlaGNSEVVAGTGEQCLPFdearcgdggkavdatlmsprGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTLlgs 1382
Cdd:COG4257 86 --DPKT--GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEF--- 138
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1383 ndltavrPLSCDSSMdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRI-TENHQVSIIAGrpmhcqvpgidyslskla 1459
Cdd:COG4257 139 -------PLPTGGAG---------PYGIAVDP-DGNLWVtdFGANAIGRIdPDTGTLTEYAL------------------ 183
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1460 iHSALESASAIAISHTGVLYITETDEKKINRLRqvTTNGEIcllagaasdcdckndvncicysgdDAYATDAILNSPSSL 1539
Cdd:COG4257 184 -PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTV------------------------TEYPLPGGGARPYGV 236
|
330 340
....*....|....*....|.
gi 154090989 1540 AVAPDGTIYIADLGNIRIRAV 1560
Cdd:COG4257 237 AVDGDGRVWFAESGANRIVRF 257
|
|
| Rhs_assc_core |
TIGR03696 |
RHS repeat-associated core domain; This model represents a conserved unique core sequence ... |
2381-2458 |
7.16e-10 |
|
RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.
Pssm-ID: 274730 [Multi-domain] Cd Length: 77 Bit Score: 57.51 E-value: 7.16e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 2381 YTAYGEIYYDSNPDFQmVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwrnvgkePA----PFNLYMFKNNNPL 2456
Cdd:TIGR03696 1 YDPYGEVLSESGAAPN-PLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD----------PIglggGLNLYAYVGNNPV 69
|
..
gi 154090989 2457 SN 2458
Cdd:TIGR03696 70 NW 71
|
|
| acid_disulf_rpt |
NF033662 |
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ... |
833-863 |
4.00e-08 |
|
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.
Pssm-ID: 411265 [Multi-domain] Cd Length: 32 Bit Score: 50.98 E-value: 4.00e-08
10 20 30
....*....|....*....|....*....|.
gi 154090989 833 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 863
Cdd:NF033662 2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
|
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
644-789 |
5.65e-07 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 51.30 E-value: 5.65e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 644 SCGGHGS-CIDGNCVCaagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 713
Cdd:NF041328 13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 714 gtylpdsgLCSCDPNWMGpdcsvVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcECR 787
Cdd:NF041328 75 --------DTASDPAHCG-----ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SCR 140
|
..
gi 154090989 788 EG 789
Cdd:NF041328 141 GG 142
|
|
| DUF5885 |
pfam19232 |
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ... |
575-734 |
6.48e-07 |
|
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.
Pssm-ID: 437064 Cd Length: 265 Bit Score: 53.47 E-value: 6.48e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 575 DCPRNCHGNGECVSGLCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 624
Cdd:pfam19232 11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 625 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCAAG--YKGEH-CEEV--------DCL 673
Cdd:pfam19232 90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 674 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDSGLCSC 725
Cdd:pfam19232 167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
|
250 260
....*....|....*....|....
gi 154090989 726 ------------DPN---WMGPDC 734
Cdd:pfam19232 242 nidfsghnscgdDNNctsWTGPRC 265
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
1672-1708 |
6.41e-05 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 42.20 E-value: 6.41e-05
10 20 30
....*....|....*....|....*....|....*..
gi 154090989 1672 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTG 1708
Cdd:pfam05593 1 YDAA-GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
|
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
580-602 |
7.34e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 38.87 E-value: 7.34e-04
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
671-700 |
2.60e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 37.62 E-value: 2.60e-03
10 20 30
....*....|....*....|....*....|....*
gi 154090989 671 DCLDPT-CSSHGVCVNGE----CLCSPGWGGLNCE 700
Cdd:cd00054 4 ECASGNpCQNGGTCVNTVgsyrCSCPPGYTGRNCE 38
|
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
797-830 |
4.13e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 37.23 E-value: 4.13e-03
10 20 30
....*....|....*....|....*....|....*.
gi 154090989 797 IDGC--PDLCNGNGRCTLGQNSWQCVCQTGWRGPGC 830
Cdd:cd00054 2 IDECasGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
|
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
738-810 |
4.23e-03 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 40.13 E-value: 4.23e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 738 CSVDCGTHGVCIGGACRCEEGWT--GAACDQRV--------CHPRCIEHGTCKDGKCE--CREGWngEHCTiDGCPDLCN 805
Cdd:NF041328 45 CGVACGAGQTCVAGACGCGPGTVacGGACVDTAsdpahcgaCGAACAPGQVCEGGACReaCSEGL--TRCG-GACVDLAT 121
|
....*
gi 154090989 806 GNGRC 810
Cdd:NF041328 122 DPLHC 126
|
|
| NHL_like_5 |
cd14963 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1533-1624 |
8.60e-03 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271333 [Multi-domain] Cd Length: 268 Bit Score: 40.74 E-value: 8.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1533 LNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYE----AASPG-----EQELYVFNADGiHQYTVSLVTGEY 1603
Cdd:cd14963 55 FKYPYGIAVDSDGNIYVADLYNGRIQVFDPDGKFLKYFPEKKdrvkLISPAglaidDGKLYVSDVKK-HKVIVFDLEGKL 133
|
90 100
....*....|....*....|....*....
gi 154090989 1604 LYNF--------TYSADNDVTelIDNNGN 1624
Cdd:cd14963 134 LLEFgkpgsepgELSYPNGIA--VDEDGN 160
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Ten_N |
pfam06484 |
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ... |
10-374 |
0e+00 |
|
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).
Pssm-ID: 461932 [Multi-domain] Cd Length: 367 Bit Score: 645.88 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 10 SLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGNRVTDLVHRESDEFSRQGTNFTLAELGICEP 89
Cdd:pfam06484 1 SLTKRRRDKERRYTSSSADSEECRVPTQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 90 SP-HRSGYCSDMGILHQGYSLSTGSDADSDTEGGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTDSDNENKSDDDNG 168
Cdd:pfam06484 81 SPrHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENG 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 169 RPIPPTSSSSLLPsaqlpSSHNPPP---VSCQMPLLDSNTSHQIMDTNPDEEFSPNSYLLRACSGPQQASSSGPPNHHSQ 245
Cdd:pfam06484 161 PPIPPSSSSSSPV-----EQHSPPPpslNENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPNFQNH 235
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 246 STLRPPLPP-PHNHT-LSHHHSSANSLNRNSLTNRRSQIHAP-APAPNDLATTPESVQLQDSWVLNSNVPLETRHFLFKT 322
Cdd:pfam06484 236 SRLRTPPPPlPPPHKqNQHHHPSINSLNRSSLTNRRNPSPAPtASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLFKT 315
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|..
gi 154090989 323 SSGSTPLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWK 374
Cdd:pfam06484 316 GTGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1230-1560 |
7.67e-47 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 172.33 E-value: 7.67e-47
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1230 PVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNSPGHKYY----LAVDPvTGSLYVSDTNSRRIYRV 1303
Cdd:cd14953 25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRKI 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1304 kslsgakDLAGNSEVVAGTGEqclpfdeARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 1381
Cdd:cd14953 104 -------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG 169
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1382 sndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSLSKLA 1459
Cdd:cd14953 170 ----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSGDGGA 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1460 IHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncicYSGDDAYATDAILNSPSSL 1539
Cdd:cd14953 237 TAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGV 302
|
330 340
....*....|....*....|.
gi 154090989 1540 AVAPDGTIYIADLGNIRIRAV 1560
Cdd:cd14953 303 AVDAAGNLYVADTGNNRIRKI 323
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1281-1561 |
1.83e-40 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 153.84 E-value: 1.83e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1281 LAVDPvTGSLYVSDTNSRRIYRVkslsgakDLAGNSEVVAGTGEqclpfdEARCGDGGKAvdATLMSPRGIAVDKNGLMY 1360
Cdd:cd14953 28 VAVDA-AGNLYVADRGNHRIRKI-------TPDGVVTTVAGTGT------AGFADGGGAA--AQFNTPSGVAVDAAGNLY 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1361 FVDAT--MIRKVDQNGIISTLLGsndlTAVRPLSCDSSMDVAQvrLEWPTDLAVNPMDNsLYVLE--NNVILRITENHQV 1436
Cdd:cd14953 92 VADTGnhRIRKITPDGVVSTLAG----TGTAGFSDDGGATAAQ--FNYPTGVAVDAAGN-LYVADtgNHRIRKITPDGVV 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1437 SIIAGRPmhcqVPGidYSLSKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndv 1516
Cdd:cd14953 165 TTVAGTG----GAG--YAGDGPATAAQFNNPTGVAVDAAGNLYVADRGN---HRIRKITPDGVVTTVAGTGTA------- 228
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 154090989 1517 ncicYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVS 1561
Cdd:cd14953 229 ----GFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGNHRIRKIT 269
|
|
| Tox-GHH |
pfam15636 |
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ... |
2680-2757 |
4.19e-37 |
|
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.
Pssm-ID: 464783 Cd Length: 78 Bit Score: 135.05 E-value: 4.19e-37
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 154090989 2680 EEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2757
Cdd:pfam15636 1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1318-1561 |
5.75e-32 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 128.80 E-value: 5.75e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1318 VVAGTGeqclpfdeARCGDGGKAVDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLL-----GSNDLTAvrp 1390
Cdd:cd14953 3 TVAGSG--------TAGFSGGGGTAARFNSPSGVAVDAAGNLYVADRGnhRIRKITPDGVVTTVAgtgtaGFADGGG--- 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1391 lscdssmdvAQVRLEWPTDLAVNPMDNsLYV--LENNVILRITENHQVSIIAGRPmhcqVPGidYSLSKLAIHSALESAS 1468
Cdd:cd14953 72 ---------AAAQFNTPSGVAVDAAGN-LYVadTGNHRIRKITPDGVVSTLAGTG----TAG--FSDDGGATAAQFNYPT 135
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1469 AIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncicYSGDDAYATDAILNSPSSLAVAPDGTIY 1548
Cdd:cd14953 136 GVAVDAAGNLYVADTGN---HRIRKITPDGVVTTVAGTGGA-----------GYAGDGPATAAQFNNPTGVAVDAAGNLY 201
|
250
....*....|...
gi 154090989 1549 IADLGNIRIRAVS 1561
Cdd:cd14953 202 VADRGNHRIRKIT 214
|
|
| RhsA |
COG3209 |
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... |
1521-2458 |
1.61e-31 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 136.04 E-value: 1.61e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1521 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASP---GEQELYVFNADGIHQYTVS 1597
Cdd:COG3209 105 LTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAgggASAYGLTLGGAAAGPATGV 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1598 LVTGEYLYNFTYSADNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKAVSTQNLELGLMTYDGNTG 1677
Cdd:COG3209 185 GTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTG 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1678 LLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNS 1757
Cdd:COG3209 265 AGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGG 344
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1758 YQLCNNGTLRVMYANGMAVSFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLL 1837
Cdd:COG3209 345 TTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAG 424
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1838 SIDYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFA 1917
Cdd:COG3209 425 ALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDD 504
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1918 DGKVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLHAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFD 1989
Cdd:COG3209 505 TLGGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGT 584
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1990 YSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKVGPLVDKQIYRF 2069
Cdd:COG3209 585 TGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTG 664
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 2070 SEEGMINARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGR 2149
Cdd:COG3209 665 TGTGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTT 741
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 2150 IKEVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH----- 2223
Cdd:COG3209 742 GTLTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitv 821
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 2224 LLNPGNSARLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKT 2303
Cdd:COG3209 822 GSGGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RT 890
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 2304 NLGHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVFSINGLMIKQLQYTA 2383
Cdd:COG3209 891 DGGTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDP 950
|
890 900 910 920 930 940 950
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 154090989 2384 YGEIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwrNVGKEPAPfNLYMFKNNNPLSN 2458
Cdd:COG3209 951 FGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVNY 1019
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1281-1579 |
1.23e-17 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 85.45 E-value: 1.23e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1281 LAVDPvTGSLYVSDTNSRRIYRVkslsgakDLAGNSEVVAGTGeqclpfdearcGDGgkavDATLMSPRGIAVDKNGLMY 1360
Cdd:cd05819 13 IAVDS-SGNIYVADTGNNRIQVF-------DPDGNFITSFGSF-----------GSG----DGQFNEPAGVAVDSDGNLY 69
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1361 FVDAT--MIRKVDQNGIISTLLGSNDLTavrplscdssmdvaQVRLEWPTDLAVNPMDNsLYVL--ENNVILRITENHQV 1436
Cdd:cd05819 70 VADTGnhRIQKFDPDGNFLASFGGSGDG--------------DGEFNGPRGIAVDSSGN-IYVAdtGNHRIQKFDPDGEF 134
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1437 SIIAGrpmhcqvpgidyslSKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGaasdcdckndv 1516
Cdd:cd05819 135 LTTFG--------------SGGSGPGQFNGPTGVAVDSDGNIYVADTGN---HRIQVFDPDGNFLTTFG----------- 186
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 154090989 1517 ncicysgdDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPG 1579
Cdd:cd05819 187 --------STGTGPGQFNYPTGIAVDSDGNIYVADSGNNRVQVFDPDGAGFGGNGNFLGSDGQ 241
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1200-1370 |
1.97e-17 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 86.04 E-value: 1.97e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1200 IITSIMGNGRRRSiscpSCNGLAEGNKLLAPVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILELRNKEFKHS----- 1272
Cdd:cd14953 163 VVTTVAGTGGAGY----AGDGPATAAQFNNPTGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFSGDggata 238
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1273 ---NSPghkYYLAVDPvTGSLYVSDTNSRRIYRVkslsgakDLAGNSEVVAGTGeQCLPfdearcGDGGKAVDATLMSPR 1349
Cdd:cd14953 239 aqlNNP---TGVAVDA-AGNLYVADSGNHRIRKI-------TPAGVVTTVAGGG-AGFS------GDGGPATSAQFNNPT 300
|
170 180
....*....|....*....|...
gi 154090989 1350 GIAVDKNGLMYFVDAT--MIRKV 1370
Cdd:cd14953 301 GVAVDAAGNLYVADTGnnRIRKI 323
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1225-1558 |
2.37e-16 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 81.60 E-value: 2.37e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1225 NKLLAPVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNSPghkYYLAVDPvTGSLYVSDTNSRRIYR 1302
Cdd:cd05819 5 GELNNPQGIAVDSSGNIYVADTgnNRIQVFDPDGNFITSFGSFGSGDGQFNEP---AGVAVDS-DGNLYVADTGNHRIQK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1303 VkslsgakDLAGNSEVVAGTGeqclpfdearcGDGgkavDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLL 1380
Cdd:cd05819 81 F-------DPDGNFLASFGGS-----------GDG----DGEFNGPRGIAVDSSGNIYVADTGnhRIQKFDPDGEFLTTF 138
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1381 GSNdltavrplscdsSMDVAQvrLEWPTDLAVNPmDNSLYVLE--NNVILRITENHQVSIIAGRPmhCQVPGidyslskl 1458
Cdd:cd05819 139 GSG------------GSGPGQ--FNGPTGVAVDS-DGNIYVADtgNHRIQVFDPDGNFLTTFGST--GTGPG-------- 193
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1459 aihsALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGaasdcdckndvncicysgdDAYATDAILNSPSS 1538
Cdd:cd05819 194 ----QFNYPTGIAVDSDGNIYVADSGN---NRVQVFDPDGAGFGGNG-------------------NFLGSDGQFNRPSG 247
|
330 340
....*....|....*....|
gi 154090989 1539 LAVAPDGTIYIADLGNIRIR 1558
Cdd:cd05819 248 LAVDSDGNLYVADTGNNRIQ 267
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1219-1430 |
3.15e-15 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 78.51 E-value: 3.15e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1219 NGLAEGNkLLAPVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSIL---ELRNKEFkhsNSPghkYYLAVDPvTGSLYVS 1293
Cdd:cd05819 94 SGDGDGE-FNGPRGIAVDSSGNIYVADTgnHRIQKFDPDGEFLTTFgsgGSGPGQF---NGP---TGVAVDS-DGNIYVA 165
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1294 DTNSRRIYRVKSlsgakdlagNSEVVAGTGEQClpfdearcgdggkAVDATLMSPRGIAVDKNGLMYFVDATM--IRKVD 1371
Cdd:cd05819 166 DTGNHRIQVFDP---------DGNFLTTFGSTG-------------TGPGQFNYPTGIAVDSDGNIYVADSGNnrVQVFD 223
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 154090989 1372 QNGIISTLLGSNdltavrplscdssmDVAQVRLEWPTDLAVNPmDNSLYVLE--NNVILRI 1430
Cdd:cd05819 224 PDGAGFGGNGNF--------------LGSDGQFNRPSGLAVDS-DGNLYVADtgNNRIQVF 269
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1342-1573 |
7.81e-15 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 77.36 E-value: 7.81e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1342 DATLMSPRGIAVDKNGLMYFVDATM--IRKVDQNGIISTLLGSNDltavrplscdssmdVAQVRLEWPTDLAVNPmDNSL 1419
Cdd:cd05819 4 PGELNNPQGIAVDSSGNIYVADTGNnrIQVFDPDGNFITSFGSFG--------------SGDGQFNEPAGVAVDS-DGNL 68
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1420 YVL--ENNVILRITENHQVSIIAGRPmhcqvpGIDYSlsklaihsALESASAIAISHTGVLYITETDEkkiNRLRQVTTN 1497
Cdd:cd05819 69 YVAdtGNHRIQKFDPDGNFLASFGGS------GDGDG--------EFNGPRGIAVDSSGNIYVADTGN---HRIQKFDPD 131
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 154090989 1498 GEIcllagaasdcdckndVNCICYSGddayATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQY 1573
Cdd:cd05819 132 GEF---------------LTTFGSGG----SGPGQFNGPTGVAVDSDGNIYVADTGNHRIQVFDPDGNFLTTFGST 188
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1226-1491 |
1.10e-13 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 73.89 E-value: 1.10e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1226 KLLAPVALAVGIDGSLFVGDFNYIR-RIFPS----RNVTSILELRNKEFkhsNSPghkYYLAVDPvTGSLYVSDTNSRRI 1300
Cdd:cd05819 53 QFNEPAGVAVDSDGNLYVADTGNHRiQKFDPdgnfLASFGGSGDGDGEF---NGP---RGIAVDS-SGNIYVADTGNHRI 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1301 YRVkslsgakDLAGNSEVVAGtgeqclpfdearcgdGGKAVDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIIST 1378
Cdd:cd05819 126 QKF-------DPDGEFLTTFG---------------SGGSGPGQFNGPTGVAVDSDGNIYVADTGnhRIQVFDPDGNFLT 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1379 LLGSNdltavrplscdssmDVAQVRLEWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGrpmhcqvpgidyslS 1456
Cdd:cd05819 184 TFGST--------------GTGPGQFNYPTGIAVDSDGN-IYVADsgNNRVQVFDPDGAGFGGNG--------------N 234
|
250 260 270
....*....|....*....|....*....|....*
gi 154090989 1457 KLAIHSALESASAIAISHTGVLYITETDEKKINRL 1491
Cdd:cd05819 235 FLGSDGQFNRPSGLAVDSDGNLYVADTGNNRIQVF 269
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1230-1560 |
2.87e-12 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 69.66 E-value: 2.87e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1230 PVALAVGIDGSLFVGDF--NYIRRIFPsrnvtsilelRNKEFK-HSNSPGHKYY-LAVDPvTGSLYVSDTNSRRIYRVks 1305
Cdd:COG4257 19 PRDVAVDPDGAVWFTDQggGRIGRLDP----------ATGEFTeYPLGGGSGPHgIAVDP-DGNLWFTDNGNNRIGRI-- 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1306 lsGAKDlaGNSEVVAGTGEQCLPFdearcgdggkavdatlmsprGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTLlgs 1382
Cdd:COG4257 86 --DPKT--GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEF--- 138
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1383 ndltavrPLSCDSSMdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRI-TENHQVSIIAGrpmhcqvpgidyslskla 1459
Cdd:COG4257 139 -------PLPTGGAG---------PYGIAVDP-DGNLWVtdFGANAIGRIdPDTGTLTEYAL------------------ 183
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1460 iHSALESASAIAISHTGVLYITETDEKKINRLRqvTTNGEIcllagaasdcdckndvncicysgdDAYATDAILNSPSSL 1539
Cdd:COG4257 184 -PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTV------------------------TEYPLPGGGARPYGV 236
|
330 340
....*....|....*....|.
gi 154090989 1540 AVAPDGTIYIADLGNIRIRAV 1560
Cdd:COG4257 237 AVDGDGRVWFAESGANRIVRF 257
|
|
| Rhs_assc_core |
TIGR03696 |
RHS repeat-associated core domain; This model represents a conserved unique core sequence ... |
2381-2458 |
7.16e-10 |
|
RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.
Pssm-ID: 274730 [Multi-domain] Cd Length: 77 Bit Score: 57.51 E-value: 7.16e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 2381 YTAYGEIYYDSNPDFQmVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwrnvgkePA----PFNLYMFKNNNPL 2456
Cdd:TIGR03696 1 YDPYGEVLSESGAAPN-PLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD----------PIglggGLNLYAYVGNNPV 69
|
..
gi 154090989 2457 SN 2458
Cdd:TIGR03696 70 NW 71
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1267-1590 |
1.84e-09 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 61.19 E-value: 1.84e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1267 KEFKHSNSPGHKYYLAVDPvTGSLYVSDTNSRRIYRVkslsgakDLAgnsevvagTGEqclpFDEARCGDGGkavdatlm 1346
Cdd:COG4257 8 TEYPVPAPGSGPRDVAVDP-DGAVWFTDQGGGRIGRL-------DPA--------TGE----FTEYPLGGGS-------- 59
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1347 SPRGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTLLGSNDLTAvrplscdssmdvaqvrlewPTDLAVNPmDNSLYV-- 1421
Cdd:COG4257 60 GPHGIAVDPDGNLWFTDNGnnRIGRIDpKTGEITTFALPGGGSN-------------------PHGIAFDP-DGNLWFtd 119
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1422 LENNVILRIT-ENHQVSIIAGRPMHCQvpgidyslsklaihsalesASAIAISHTGVLYITETdekKINRLRQVTT-NGE 1499
Cdd:COG4257 120 QGGNRIGRLDpATGEVTEFPLPTGGAG-------------------PYGIAVDPDGNLWVTDF---GANAIGRIDPdTGT 177
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1500 IcllagaasdcdckndvncicysgdDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSknkPVLNAFNQYeAASPG 1579
Cdd:COG4257 178 L------------------------TEYALPTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD---PKTGTVTEY-PLPGG 229
|
330
....*....|...
gi 154090989 1580 EQELY--VFNADG 1590
Cdd:COG4257 230 GARPYgvAVDGDG 242
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1225-1500 |
1.90e-08 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 58.11 E-value: 1.90e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1225 NKLLAPVALAVGIDGSLFVGD--FNYIRRIFPSRNVTSILELRNKEfkhsNSPghkYYLAVDPvTGSLYVSDTNSRRIYR 1302
Cdd:COG4257 56 GGGSGPHGIAVDPDGNLWFTDngNNRIGRIDPKTGEITTFALPGGG----SNP---HGIAFDP-DGNLWFTDQGGNRIGR 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1303 VkslsgakDLAGNsEVVAGTgeqcLPFDEARcgdggkavdatlmsPRGIAVDKNGLMYFVD--ATMIRKVD-QNGIISTL 1379
Cdd:COG4257 128 L-------DPATG-EVTEFP----LPTGGAG--------------PYGIAVDPDGNLWVTDfgANAIGRIDpDTGTLTEY 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1380 LGSNDLTAvrplscdssmdvaqvrlewPTDLAVNPmDNSLYVLE--NNVILRITENhqvsiiagrpmhcqvpgiDYSLSK 1457
Cdd:COG4257 182 ALPTPGAG-------------------PRGLAVDP-DGNLWVADtgSGRIGRFDPK------------------TGTVTE 223
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 154090989 1458 LAIHSALESASAIAISHTGVLYITETDekkINRLRQVTTNGEI 1500
Cdd:COG4257 224 YPLPGGGARPYGVAVDGDGRVWFAESG---ANRIVRFDPDTEL 263
|
|
| acid_disulf_rpt |
NF033662 |
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ... |
833-863 |
4.00e-08 |
|
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.
Pssm-ID: 411265 [Multi-domain] Cd Length: 32 Bit Score: 50.98 E-value: 4.00e-08
10 20 30
....*....|....*....|....*....|.
gi 154090989 833 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 863
Cdd:NF033662 2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
|
|
| NHL_PKND_like |
cd14952 |
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ... |
1281-1557 |
6.60e-08 |
|
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271322 [Multi-domain] Cd Length: 247 Bit Score: 56.06 E-value: 6.60e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1281 LAVDPvTGSLYVSDTNSRRIYRvkslsgakdLAgnsevvAGTGEQC-LPFDEarcgdggkavdatLMSPRGIAVDKNGLM 1359
Cdd:cd14952 15 VAVDA-AGNVYVADSGNNRVLK---------LA------AGSTTQTvLPFTG-------------LYQPQGVAVDAAGTV 65
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1360 YFVDAtmirkvDQNGIISTLLGSNDLTAVrPLScdssmdvaqvRLEWPTDLAVNPMDNsLYVLE--NNVILRITenhqvs 1437
Cdd:cd14952 66 YVTDF------GNNRVLKLAAGSTTQTVL-PFT----------GLNDPTGVAVDAAGN-VYVADtgNNRVLKLA------ 121
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1438 iiAGRPMHCQVPGIDyslsklaihsaLESASAIAISHTGVLYITETDEKKINRLRQVTT-------NGeicLLAGAASDC 1510
Cdd:cd14952 122 --AGSNTQTVLPFTG-----------LSNPDGVAVDGAGNVYVTDTGNNRVLKLAAGSTtqtvlpfTG---LNSPSGVAV 185
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 154090989 1511 DCKNDVncicYSGD---------DAYATDAI------LNSPSSLAVAPDGTIYIADLGNIRI 1557
Cdd:cd14952 186 DTAGNV----YVTDhgnnrvlklAAGSTTPTvlpftgLNGPLGVAVDAAGNVYVADRGNDRV 243
|
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
644-789 |
5.65e-07 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 51.30 E-value: 5.65e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 644 SCGGHGS-CIDGNCVCaagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 713
Cdd:NF041328 13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 714 gtylpdsgLCSCDPNWMGpdcsvVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcECR 787
Cdd:NF041328 75 --------DTASDPAHCG-----ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SCR 140
|
..
gi 154090989 788 EG 789
Cdd:NF041328 141 GG 142
|
|
| DUF5885 |
pfam19232 |
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ... |
575-734 |
6.48e-07 |
|
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.
Pssm-ID: 437064 Cd Length: 265 Bit Score: 53.47 E-value: 6.48e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 575 DCPRNCHGNGECVSGLCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 624
Cdd:pfam19232 11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 625 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCAAG--YKGEH-CEEV--------DCL 673
Cdd:pfam19232 90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 674 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDSGLCSC 725
Cdd:pfam19232 167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
|
250 260
....*....|....*....|....
gi 154090989 726 ------------DPN---WMGPDC 734
Cdd:pfam19232 242 nidfsghnscgdDNNctsWTGPRC 265
|
|
| NHL_like_2 |
cd14957 |
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ... |
1345-1624 |
6.60e-06 |
|
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271327 [Multi-domain] Cd Length: 280 Bit Score: 50.34 E-value: 6.60e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1345 LMSPRGIAVDKNGLMYFVDA--TMIRKVDQNGIISTLLGSNDLTavrplscdssmdvaQVRLEWPTDLAVNPMDNsLYVL 1422
Cdd:cd14957 17 FNTPRGIAVDSAGNIYVADTgnNRIQVFTSSGVYSYSIGSGGTG--------------SGQFNSPYGIAVDSNGN-IYVA 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1423 EnnvilriTENHQVSII--AGrpmhcqvpGIDYSL-SKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGE 1499
Cdd:cd14957 82 D-------TDNNRIQVFnsSG--------VYQYSIgTGGSGDGQFNGPYGIAVDSNGNIYVADTGN---HRIQVFTSSGT 143
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1500 ICllagaasdcdckndvncicYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRavsknkpvlnafnqyeaaspg 1579
Cdd:cd14957 144 FS-------------------YSIGSGGTGPGQFNGPQGIAVDSDGNIYVADTGNHRIQ--------------------- 183
|
250 260 270 280
....*....|....*....|....*....|....*....|....*.
gi 154090989 1580 eqelyVFNADGIHQYTV-SLVTGEYLYNFTYsadnDVTelIDNNGN 1624
Cdd:cd14957 184 -----VFTSSGTFQYTFgSSGSGPGQFSDPY----GIA--VDSDGN 218
|
|
| NHL_like_2 |
cd14957 |
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ... |
1230-1558 |
1.93e-05 |
|
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271327 [Multi-domain] Cd Length: 280 Bit Score: 49.19 E-value: 1.93e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1230 PVALAVGIDGSLFVGDFNYIR-RIF-PSRNVTSIL---ELRNKEFkhsNSPghkYYLAVDPvTGSLYVSDTNSRRIyRVK 1304
Cdd:cd14957 20 PRGIAVDSAGNIYVADTGNNRiQVFtSSGVYSYSIgsgGTGSGQF---NSP---YGIAVDS-NGNIYVADTDNNRI-QVF 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1305 SLSGAKDLAgnsevVAGTGEQCLPFDEarcgdggkavdatlmsPRGIAVDKNGLMYFVDA--TMIRKVDQNGIISTLLGS 1382
Cdd:cd14957 92 NSSGVYQYS-----IGTGGSGDGQFNG----------------PYGIAVDSNGNIYVADTgnHRIQVFTSSGTFSYSIGS 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1383 ndltavrplscdSSMDVAQVRLewPTDLAVNPMDNsLYVLENNvilriteNHQVSII--AGRPmhcqvpgiDYSL-SKLA 1459
Cdd:cd14957 151 ------------GGTGPGQFNG--PQGIAVDSDGN-IYVADTG-------NHRIQVFtsSGTF--------QYTFgSSGS 200
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1460 IHSALESASAIAISHTGVLYITETDEKKInrlrQVTTNgeicllagaasdcdckndvncicySGDDAYA------TDAIL 1533
Cdd:cd14957 201 GPGQFSDPYGIAVDSDGNIYVADTGNHRI----QVFTS------------------------SGAYQYSigtsgsGNGQF 252
|
330 340
....*....|....*....|....*
gi 154090989 1534 NSPSSLAVAPDGTIYIADLGNIRIR 1558
Cdd:cd14957 253 NYPYGIAVDNDGKIYVADSNNNRIQ 277
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1522-1563 |
2.02e-05 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 49.45 E-value: 2.02e-05
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 154090989 1522 SGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKN 1563
Cdd:cd14953 11 GFSGGGGTAARFNSPSGVAVDAAGNLYVADRGNHRIRKITPD 52
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
1672-1708 |
6.41e-05 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 42.20 E-value: 6.41e-05
10 20 30
....*....|....*....|....*....|....*..
gi 154090989 1672 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTG 1708
Cdd:pfam05593 1 YDAA-GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
|
|
| NHL_PKND_like |
cd14952 |
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ... |
1227-1421 |
7.45e-05 |
|
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271322 [Multi-domain] Cd Length: 247 Bit Score: 46.82 E-value: 7.45e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1227 LLAPVALAVGIDGSLFVGDFNYIR--RIFPSRNVTSILELRNKEFKHSnspghkyyLAVDPVtGSLYVSDTNSRRIYRVK 1304
Cdd:cd14952 51 LYQPQGVAVDAAGTVYVTDFGNNRvlKLAAGSTTQTVLPFTGLNDPTG--------VAVDAA-GNVYVADTGNNRVLKLA 121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1305 S------------LSGAKDLA------------GNSEVV---AGTGEQC-LPFDEarcgdggkavdatLMSPRGIAVDKN 1356
Cdd:cd14952 122 AgsntqtvlpftgLSNPDGVAvdgagnvyvtdtGNNRVLklaAGSTTQTvLPFTG-------------LNSPSGVAVDTA 188
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 154090989 1357 GLMYFVDAtmirkvDQNGIISTLLGSNDLTAVrPLScdssmdvaqvRLEWPTDLAVNPmDNSLYV 1421
Cdd:cd14952 189 GNVYVTDH------GNNRVLKLAAGSTTPTVL-PFT----------GLNGPLGVAVDA-AGNVYV 235
|
|
| YvrE |
COG3386 |
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase ... |
1233-1393 |
3.84e-04 |
|
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the Pathway/BioSystem: Non-phosphorylated Entner-Doudoroff pathway
Pssm-ID: 442613 [Multi-domain] Cd Length: 266 Bit Score: 44.88 E-value: 3.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1233 LAVGIDGSLFVGDFNY------IRRIFPSRNVTSILElrnkEFKHSNSpghkyyLAVDPVTGSLYVSDTNSRRIYRVksl 1306
Cdd:COG3386 98 GVVDPDGRLYFTDMGEylptgaLYRVDPDGSLRVLAD----GLTFPNG------IAFSPDGRTLYVADTGAGRIYRF--- 164
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1307 sgakDLAGNSEVVAGTgeqclPFDEARCGDGGkavdatlmsPRGIAVDKNGLMY--FVDATMIRKVDQNGiisTLLGSND 1384
Cdd:COG3386 165 ----DLDADGTLGNRR-----VFADLPDGPGG---------PDGLAVDADGNLWvaLWGGGGVVRFDPDG---ELLGRIE 223
|
....*....
gi 154090989 1385 LTAVRPLSC 1393
Cdd:COG3386 224 LPERRPTNV 232
|
|
| Keratin_B2 |
pfam01500 |
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ... |
667-784 |
4.68e-04 |
|
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.
Pssm-ID: 366678 [Multi-domain] Cd Length: 161 Bit Score: 43.24 E-value: 4.68e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 667 CEEVDCLDPTCSSHGVCvnGECLCSPGWGGLNCeLARVQCPDQCSGHGTYLPDSGLCSCDPNWMGPDCSVVCSVDCGTHG 746
Cdd:pfam01500 4 CGTSFCGFPTCSTGGTC--GSGCCQPCCCQSSC-CRPSCCQTSCCQPTTFQSSCCRPTCQPCCQTSCCQPTCCQTSSCQT 80
|
90 100 110
....*....|....*....|....*....|....*...
gi 154090989 747 VCIGGACRCEEGWTGAACDQRVCHPRCIEHGTCKDGKC 784
Cdd:pfam01500 81 GCGGIGYGQEGSSGAVSSRTRWCRPDCRVEGTCLPPCC 118
|
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
580-602 |
7.34e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 38.87 E-value: 7.34e-04
|
| DSL |
pfam01414 |
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ... |
755-797 |
1.83e-03 |
|
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.
Pssm-ID: 460202 Cd Length: 46 Bit Score: 38.37 E-value: 1.83e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 154090989 755 CEEGWTGAACDqRVCHPR--CIEHGTC-KDGKCECREGWNGEHCTI 797
Cdd:pfam01414 1 CDENYYGSTCS-KFCRPRddKFGHYTCdANGNKVCLPGWTGPYCDK 45
|
|
| NHL_like_5 |
cd14963 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1220-1384 |
2.50e-03 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271333 [Multi-domain] Cd Length: 268 Bit Score: 42.28 E-value: 2.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1220 GLAEGnKLLAPVALAVGIDGSLFVGDFnYIRRI------------FPSRnvtsilelrnKEFKHSNSPGHkyyLAVDpvT 1287
Cdd:cd14963 49 GTGPG-EFKYPYGIAVDSDGNIYVADL-YNGRIqvfdpdgkflkyFPEK----------KDRVKLISPAG---LAID--D 111
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1288 GSLYVSDTNSRRIYrvkslsgakdlagnseVVAGTGEQCLPFDEARCGDGgkavdaTLMSPRGIAVDKNGLMYFVDATMI 1367
Cdd:cd14963 112 GKLYVSDVKKHKVI----------------VFDLEGKLLLEFGKPGSEPG------ELSYPNGIAVDEDGNIYVADSGNG 169
|
170 180
....*....|....*....|
gi 154090989 1368 R-KV-DQNG-IISTLLGSND 1384
Cdd:cd14963 170 RiQVfDKNGkFIKELNGSPD 189
|
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
645-667 |
2.58e-03 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 37.33 E-value: 2.58e-03
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
671-700 |
2.60e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 37.62 E-value: 2.60e-03
10 20 30
....*....|....*....|....*....|....*
gi 154090989 671 DCLDPT-CSSHGVCVNGE----CLCSPGWGGLNCE 700
Cdd:cd00054 4 ECASGNpCQNGGTCVNTVgsyrCSCPPGYTGRNCE 38
|
|
| DSL |
pfam01414 |
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ... |
725-766 |
2.85e-03 |
|
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.
Pssm-ID: 460202 Cd Length: 46 Bit Score: 37.60 E-value: 2.85e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 154090989 725 CDPNWMGPDCSVVCSV--DCGTHGVC-IGGACRCEEGWTGAACDQ 766
Cdd:pfam01414 1 CDENYYGSTCSKFCRPrdDKFGHYTCdANGNKVCLPGWTGPYCDK 45
|
|
| NHL_like_6 |
cd14962 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1279-1483 |
3.15e-03 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271332 [Multi-domain] Cd Length: 271 Bit Score: 42.19 E-value: 3.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1279 YYLAVDPvTGSLYVSDTNSRRIYRvkslsgakdlagnsevvagtgeqclpFDEARcgdG-----GKAVDATLMSPRGIAV 1353
Cdd:cd14962 15 YGVAADG-RGRIYVADTGRGAVFV--------------------------FDLPN---GkvfviGNAGPNRFVSPIGVAI 64
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1354 DKNGLMYFVDAT--MIRKVDQNGIISTLLGSNDLtavrplscdssmdvaQVRlewPTDLAVNPMDNSLYVLEnnvilriT 1431
Cdd:cd14962 65 DANGNLYVSDAElgKVFVFDRDGKFLRAIGAGAL---------------FKR---PTGIAVDPAGKRLYVVD-------T 119
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 154090989 1432 ENHQVSIIAGRPMHCQVPGIDYSlsklaIHSALESASAIAISHTGVLYITET 1483
Cdd:cd14962 120 LAHKVKVFDLDGRLLFDIGKRGS-----GPGEFNLPTDLAVDRDGNLYVTDT 166
|
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
710-734 |
3.57e-03 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 36.94 E-value: 3.57e-03
|
| YD_repeat_2x |
TIGR01643 |
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... |
1672-1714 |
3.66e-03 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 37.18 E-value: 3.66e-03
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 154090989 1672 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLH 1714
Cdd:TIGR01643 1 YDAA-GRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
|
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
638-668 |
3.71e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 37.23 E-value: 3.71e-03
10 20 30
....*....|....*....|....*....|....*.
gi 154090989 638 NQCIDPS-CGGHGSCIDG----NCVCAAGYKGEHCE 668
Cdd:cd00054 3 DECASGNpCQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
|
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
797-830 |
4.13e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 37.23 E-value: 4.13e-03
10 20 30
....*....|....*....|....*....|....*.
gi 154090989 797 IDGC--PDLCNGNGRCTLGQNSWQCVCQTGWRGPGC 830
Cdd:cd00054 2 IDECasGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
|
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
738-810 |
4.23e-03 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 40.13 E-value: 4.23e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 738 CSVDCGTHGVCIGGACRCEEGWT--GAACDQRV--------CHPRCIEHGTCKDGKCE--CREGWngEHCTiDGCPDLCN 805
Cdd:NF041328 45 CGVACGAGQTCVAGACGCGPGTVacGGACVDTAsdpahcgaCGAACAPGQVCEGGACReaCSEGL--TRCG-GACVDLAT 121
|
....*
gi 154090989 806 GNGRC 810
Cdd:NF041328 122 DPLHC 126
|
|
| SOBP |
pfam15279 |
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ... |
185-349 |
5.76e-03 |
|
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteriztic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. the family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus mutations were found in this protein.
Pssm-ID: 464609 [Multi-domain] Cd Length: 325 Bit Score: 41.72 E-value: 5.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 185 LPSSHNPPPVScQMPLLDSNTSHQIMDTNPDEEF----SPNSYLLRACSGPQQ---ASSSGPPNHHSQSTLRPPLPPPhn 257
Cdd:pfam15279 127 APKPHEPPSLP-PPPLPPKKGRRHRPGLHPPLGRppgsPPMSMTPRGLLGKPQqhpPPSPLPAFMEPSSMPPPFLRPP-- 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 258 htlsHHHSSANSlnrnSLTNRRSQIHAPAPAP-NDLATTPEsvqlqdswvlnsnvPLEtRHFLFKTSSGSTPLFSSSSPG 336
Cdd:pfam15279 204 ----PSIPQPNS----PLSNPMLPGIGPPPKPpRNLGPPSN--------------PMH-RPPFSPHHPPPPPTPPGPPPG 260
|
170
....*....|...
gi 154090989 337 YPLTSGTVYTPPP 349
Cdd:pfam15279 261 LPPPPPRGFTPPF 273
|
|
| NHL_PKND_like |
cd14952 |
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ... |
1230-1363 |
6.86e-03 |
|
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271322 [Multi-domain] Cd Length: 247 Bit Score: 41.04 E-value: 6.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1230 PVALAVGIDGSLFVGDF--NYIRRIFPSRNVTSILElrnkeFKHSNSPGHkyyLAVDPvTGSLYVSDTNSRRIYRvksls 1307
Cdd:cd14952 138 PDGVAVDGAGNVYVTDTgnNRVLKLAAGSTTQTVLP-----FTGLNSPSG---VAVDT-AGNVYVTDHGNNRVLK----- 203
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*..
gi 154090989 1308 gakdLAgnsevvAGTGEQC-LPFDEarcgdggkavdatLMSPRGIAVDKNGLMYFVD 1363
Cdd:cd14952 204 ----LA------AGSTTPTvLPFTG-------------LNGPLGVAVDAAGNVYVAD 237
|
|
| NHL-2_like |
cd14951 |
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ... |
1475-1560 |
8.30e-03 |
|
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271321 [Multi-domain] Cd Length: 334 Bit Score: 41.02 E-value: 8.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1475 TGVLYITETDEKKINRL----RQVTTngeiclLAGaasdcdckndvncicySGDDAYA-TDAILNSPSSLAVAPDGTIYI 1549
Cdd:cd14951 206 DGSVYVADTYNHKIKRVdpatGEVST------LAG----------------TGKAGYKdLEAQFSEPSGLVVDGDGRLYV 263
|
90
....*....|.
gi 154090989 1550 ADLGNIRIRAV 1560
Cdd:cd14951 264 ADTNNHRIRRL 274
|
|
| NHL_like_5 |
cd14963 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1533-1624 |
8.60e-03 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271333 [Multi-domain] Cd Length: 268 Bit Score: 40.74 E-value: 8.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154090989 1533 LNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYE----AASPG-----EQELYVFNADGiHQYTVSLVTGEY 1603
Cdd:cd14963 55 FKYPYGIAVDSDGNIYVADLYNGRIQVFDPDGKFLKYFPEKKdrvkLISPAglaidDGKLYVSDVKK-HKVIVFDLEGKL 133
|
90 100
....*....|....*....|....*....
gi 154090989 1604 LYNF--------TYSADNDVTelIDNNGN 1624
Cdd:cd14963 134 LLEFgkpgsepgELSYPNGIA--VDEDGN 160
|
|
|