NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|569009290|ref|XP_006541555|]
View 

teneurin-1 isoform X3 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Ten_N super family cl24184
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
23-317 1.26e-78

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


The actual alignment was detected with superfamily member pfam06484:

Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 265.69  E-value: 1.26e-78
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290    23 YTSSSDESEDGRKPRQ-SFNSRETLHEYNQELRRNYNSQSRK----------RKDVEKSTQEIEFCETPPTLCSGYHTDM 91
Cdd:pfam06484   13 YTSSSADSEECRVPTQkSYSSSETLKAFDHDSRMLYGNRVKDmvhkeadefsRQGQNFSLRELGICEPSPRHGLAYCTEM 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290    92 hSVSRHGYQLEMGSDVDTETEGAASPDHALRMWIRGMKSEHSSCLSSRANSALSLTDTDHERKSDGENGFKFSPVCCDME 171
Cdd:pfam06484   93 -GLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENGPPIPPSSSSSS 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   172 APAD--------------------SAQDMQSSPHNQFTFRPLPPPPPPPHACTCARKPP--------------------- 210
Cdd:pfam06484  172 PVEQhsppppslnenqrpllgnnaSHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPnfqnhsrlrtpppplppphkq 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   211 -----PTVDSLQRRSMTTRSQPS----PAAPAPPTSTQDSVHLHNSWVLNSNIPLETRHFLFKHGSGSSAIFSAASQNYP 281
Cdd:pfam06484  252 nqhhhPSINSLNRSSLTNRRNPSpaptASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLFKTGTGTTPLFCTASPGYP 331
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 569009290   282 LTSNTVYSPPPRPLPRSTFSRPAFTFNKPYRCCNWK 317
Cdd:pfam06484  332 LTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
NHL super family cl18310
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1196-1520 9.87e-39

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


The actual alignment was detected with superfamily member cd14953:

Pssm-ID: 302697 [Multi-domain]  Cd Length: 323  Bit Score: 148.83  E-value: 9.87e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1196 PVALASGPDGSVYVGDF--NFVRRIFPSGNsVSILELS--------TSPAHKYY----LAMDPmSESLYLSDTNTRKVYK 1261
Cdd:cd14953    25 PSGVAVDAAGNLYVADRgnHRIRKITPDGV-VTTVAGTgtagfadgGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRK 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1262 LkslvetkDLSKNFEVVAGTGDQclpfdqsHCGDGGKASEASLNSPRGITVDRHGFIYFVDGT--MIRRIDENAVITTVI 1339
Cdd:cd14953   103 I-------TPDGVVSTLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVA 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1340 GsnglTSTQPLSCD-SGmdiTQVRLEWPTDLAVNPMDNsLYVLD--NNIVLQISENRRVRIIAGRPIHCQVPGIDhflvs 1416
Cdd:cd14953   169 G----TGGAGYAGDgPA---TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTGTAGFSGDGG----- 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1417 kvAIHSTLESARAISVSHSGLLFIAETDErkvNRIQQVTTNGEISIIAGAPTDcdckidpncdcFSGDGGYAKDAKMKAP 1496
Cdd:cd14953   236 --ATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNP 299
                         330       340
                  ....*....|....*....|....
gi 569009290 1497 SSLAVSPDGTLYVADLGNVRIRTI 1520
Cdd:cd14953   300 TGVAVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2640-2717 1.52e-35

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


:

Pssm-ID: 464783  Cd Length: 78  Bit Score: 130.42  E-value: 1.52e-35
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 569009290  2640 EEKNHVLEMARQRAVAQAWTQEQRRLQEGEEGTRVWTEGEKQQLLGTGRVQGYDGYFVLSVEQYLELSDSANNIHFMR 2717
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1483-2416 7.33e-31

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


:

Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 133.73  E-value: 7.33e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1483 GDGGYAKDAKMKAPSSLAVSPDGTLYVADLGNVRIRTISKNQAHLNDMNLYEIASPADQELYQFTVNGTHLHTMNLITRD 1562
Cdd:COG3209   119 VSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGTGAVTLATGLAGS 198
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1563 YVYNFTYNAEGDLGAITSSNGNSVHIRRDAGGMPLWLVVPGGQVYWLTISSNGVLKRVSAQGYNLALMTYPGNTGLLATK 1642
Cdd:COG3209   199 ALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGASGAGLDAST 278
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1643 SNENGWTTVYEYDPEGHLTNATFPTGEVSSFHSDLEKLTKVALDTSNRENVLMSTNLTATSTIYILKQEN---TQSTYRV 1719
Cdd:COG3209   279 GTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTvggGGSLTLG 358
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1720 SPDGSLRVTFASGMEINLSSEPHILAGAVNPTLGKCNISLPGEHNANLIEWRQRKEQNKGNVSAFERRLRAHNRNLLSID 1799
Cdd:COG3209   359 GYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTG 438
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1800 FDHMTRTGKIYDDHRKFTLRILYDQTGRPILWSPVSRYNEVNITYSPSGLVTFIQRGTWNEKMEYDQSGKIISRTWADGK 1879
Cdd:COG3209   439 GGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARG 518
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1880 IWSYTYLEKSVMLLLHSQRRYIFEYDQSDCLLSVTMPSMVRHSLQTMLSVGYYRNIYTPPDSSTSFIQDYSRDGRLLQTL 1959
Cdd:COG3209   519 LVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTT 598
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1960 HLGTGRRVLYKYTKQARLSEILYDTTQVTLTYEESSGVIKTIHLMHDGFicTIRYRQTGPLIGRQIFRFSEEGLVNARFD 2039
Cdd:COG3209   599 TTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGT--GVTTTGTTTTRATGTTGTGTGVTAGLTTL 676
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 2040 YSYNNFRVTSMQAVINETPLPIDLYRYVDVSGRTEQFGKFSVINYDLNQVITTTVMKHTKIFNANGQVIEVQYEILKAIA 2119
Cdd:COG3209   677 ATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAG 756
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 2120 YWmTIQYDNMGRMVICDIRVGVDANITRYFYEYDADGQLQTVSVNDKIQWRYSYDLNGNINLLSHGNSARLTPL-----R 2194
Cdd:COG3209   757 AL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGGGTDLqdrtyT 835
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 2195 YDLRDRITRLgeiqykmdEDGFLRQRGNDIFEYNSNGLLQKAYNkvSGWTVQYYYDGLGRRVASKSSLGQHLQFFYADLa 2274
Cdd:COG3209   836 YDAAGNITSI--------TDALRAGTLTQTYTYDALGRLTSATD--PGTTESYTYDANGNLTSRTDGGTTTYTYDALGR- 904
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 2275 nPIRVTHlynhTSAEITSLYYDLQGHliamelssgeeyyvaCDNMGTPLAVFSSRGQVIKEILYTPYGDIYHDTYPDFEV 2354
Cdd:COG3209   905 -LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGAAAN 964
                         890       900       910       920       930       940
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 569009290 2355 IIGFHGGLYDFLTKLVHLGQRDYDVVAGRWTTPNhhiwkqlnllpkP------FNLYSFENNYPVGKI 2416
Cdd:COG3209   965 PLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD------------PiglaggLNLYAYVGNNPVNYV 1020
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
802-827 2.42e-09

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


:

Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 54.44  E-value: 2.42e-09
                          10        20
                  ....*....|....*....|....*.
gi 569009290  802 CGDNLDNDGDGLTDCVDPDCCQQSNC 827
Cdd:NF033662    7 CSDGIDNDGDGLTDCADPDCAGNPVC 32
DUF5885 super family cl44670
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
529-720 4.44e-06

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


The actual alignment was detected with superfamily member pfam19232:

Pssm-ID: 437064  Cd Length: 265  Bit Score: 50.78  E-value: 4.44e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   529 DDCSTNCNGNGECISGHCH-----------------CFPGFLGPdcaRDSCpvlCGG----NGE----------YEKGHC 577
Cdd:pfam19232   10 DDCTPPCGGTQVCIDRQCKdntlacttdaqcgtcmtCVAGACTP---KASC---CGGvtcgAGQtcdaktntcvYVKGYC 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   578 VC-RNGWKGPECDVPEEQCIDPTCFGHGT---CIMGV-----------------CICV------------PGYKGEICEE 624
Cdd:pfam19232   84 SAdHPCPSGSACDTAKNACIAQPPYGPDSgkgCVRGFgawiweldpatnsgvwrCRCAngslynsahecsPLADQTLCAA 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   625 EDcLDPMC------------------------SSHGICVK----GECHCSTGWGGVNCETplpicQEQCSGHGTFLLDTG 676
Cdd:pfam19232  164 EN-LDPNAlvpassvpafaaygwgnqpvlinkSTAGAAVPsplaGVCPCKPGWAGGSCTE-----DRTCNGRGTWNETTG 237
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 569009290   677 VCSCDPKWTGSDcstelctmECGSHGVCSRgicqceegWVGPTC 720
Cdd:pfam19232  238 QCACNIDFSGHN--------SCGDDNNCTS--------WTGPRC 265
DSL super family cl19567
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
711-754 2.17e-04

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


The actual alignment was detected with superfamily member pfam01414:

Pssm-ID: 473190  Cd Length: 46  Bit Score: 41.07  E-value: 2.17e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 569009290   711 CEEGWVGPTCEeRSC--------HSHCAEHGQCkdgkcECSPGWEGDHCTIA 754
Cdd:pfam01414    1 CDENYYGSTCS-KFCrprddkfgHYTCDANGNK-----VCLPGWTGPYCDKP 46
 
Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
23-317 1.26e-78

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 265.69  E-value: 1.26e-78
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290    23 YTSSSDESEDGRKPRQ-SFNSRETLHEYNQELRRNYNSQSRK----------RKDVEKSTQEIEFCETPPTLCSGYHTDM 91
Cdd:pfam06484   13 YTSSSADSEECRVPTQkSYSSSETLKAFDHDSRMLYGNRVKDmvhkeadefsRQGQNFSLRELGICEPSPRHGLAYCTEM 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290    92 hSVSRHGYQLEMGSDVDTETEGAASPDHALRMWIRGMKSEHSSCLSSRANSALSLTDTDHERKSDGENGFKFSPVCCDME 171
Cdd:pfam06484   93 -GLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENGPPIPPSSSSSS 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   172 APAD--------------------SAQDMQSSPHNQFTFRPLPPPPPPPHACTCARKPP--------------------- 210
Cdd:pfam06484  172 PVEQhsppppslnenqrpllgnnaSHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPnfqnhsrlrtpppplppphkq 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   211 -----PTVDSLQRRSMTTRSQPS----PAAPAPPTSTQDSVHLHNSWVLNSNIPLETRHFLFKHGSGSSAIFSAASQNYP 281
Cdd:pfam06484  252 nqhhhPSINSLNRSSLTNRRNPSpaptASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLFKTGTGTTPLFCTASPGYP 331
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 569009290   282 LTSNTVYSPPPRPLPRSTFSRPAFTFNKPYRCCNWK 317
Cdd:pfam06484  332 LTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1196-1520 9.87e-39

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 148.83  E-value: 9.87e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1196 PVALASGPDGSVYVGDF--NFVRRIFPSGNsVSILELS--------TSPAHKYY----LAMDPmSESLYLSDTNTRKVYK 1261
Cdd:cd14953    25 PSGVAVDAAGNLYVADRgnHRIRKITPDGV-VTTVAGTgtagfadgGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRK 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1262 LkslvetkDLSKNFEVVAGTGDQclpfdqsHCGDGGKASEASLNSPRGITVDRHGFIYFVDGT--MIRRIDENAVITTVI 1339
Cdd:cd14953   103 I-------TPDGVVSTLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVA 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1340 GsnglTSTQPLSCD-SGmdiTQVRLEWPTDLAVNPMDNsLYVLD--NNIVLQISENRRVRIIAGRPIHCQVPGIDhflvs 1416
Cdd:cd14953   169 G----TGGAGYAGDgPA---TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTGTAGFSGDGG----- 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1417 kvAIHSTLESARAISVSHSGLLFIAETDErkvNRIQQVTTNGEISIIAGAPTDcdckidpncdcFSGDGGYAKDAKMKAP 1496
Cdd:cd14953   236 --ATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNP 299
                         330       340
                  ....*....|....*....|....
gi 569009290 1497 SSLAVSPDGTLYVADLGNVRIRTI 1520
Cdd:cd14953   300 TGVAVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2640-2717 1.52e-35

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 130.42  E-value: 1.52e-35
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 569009290  2640 EEKNHVLEMARQRAVAQAWTQEQRRLQEGEEGTRVWTEGEKQQLLGTGRVQGYDGYFVLSVEQYLELSDSANNIHFMR 2717
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1483-2416 7.33e-31

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 133.73  E-value: 7.33e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1483 GDGGYAKDAKMKAPSSLAVSPDGTLYVADLGNVRIRTISKNQAHLNDMNLYEIASPADQELYQFTVNGTHLHTMNLITRD 1562
Cdd:COG3209   119 VSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGTGAVTLATGLAGS 198
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1563 YVYNFTYNAEGDLGAITSSNGNSVHIRRDAGGMPLWLVVPGGQVYWLTISSNGVLKRVSAQGYNLALMTYPGNTGLLATK 1642
Cdd:COG3209   199 ALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGASGAGLDAST 278
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1643 SNENGWTTVYEYDPEGHLTNATFPTGEVSSFHSDLEKLTKVALDTSNRENVLMSTNLTATSTIYILKQEN---TQSTYRV 1719
Cdd:COG3209   279 GTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTvggGGSLTLG 358
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1720 SPDGSLRVTFASGMEINLSSEPHILAGAVNPTLGKCNISLPGEHNANLIEWRQRKEQNKGNVSAFERRLRAHNRNLLSID 1799
Cdd:COG3209   359 GYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTG 438
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1800 FDHMTRTGKIYDDHRKFTLRILYDQTGRPILWSPVSRYNEVNITYSPSGLVTFIQRGTWNEKMEYDQSGKIISRTWADGK 1879
Cdd:COG3209   439 GGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARG 518
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1880 IWSYTYLEKSVMLLLHSQRRYIFEYDQSDCLLSVTMPSMVRHSLQTMLSVGYYRNIYTPPDSSTSFIQDYSRDGRLLQTL 1959
Cdd:COG3209   519 LVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTT 598
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1960 HLGTGRRVLYKYTKQARLSEILYDTTQVTLTYEESSGVIKTIHLMHDGFicTIRYRQTGPLIGRQIFRFSEEGLVNARFD 2039
Cdd:COG3209   599 TTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGT--GVTTTGTTTTRATGTTGTGTGVTAGLTTL 676
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 2040 YSYNNFRVTSMQAVINETPLPIDLYRYVDVSGRTEQFGKFSVINYDLNQVITTTVMKHTKIFNANGQVIEVQYEILKAIA 2119
Cdd:COG3209   677 ATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAG 756
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 2120 YWmTIQYDNMGRMVICDIRVGVDANITRYFYEYDADGQLQTVSVNDKIQWRYSYDLNGNINLLSHGNSARLTPL-----R 2194
Cdd:COG3209   757 AL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGGGTDLqdrtyT 835
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 2195 YDLRDRITRLgeiqykmdEDGFLRQRGNDIFEYNSNGLLQKAYNkvSGWTVQYYYDGLGRRVASKSSLGQHLQFFYADLa 2274
Cdd:COG3209   836 YDAAGNITSI--------TDALRAGTLTQTYTYDALGRLTSATD--PGTTESYTYDANGNLTSRTDGGTTTYTYDALGR- 904
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 2275 nPIRVTHlynhTSAEITSLYYDLQGHliamelssgeeyyvaCDNMGTPLAVFSSRGQVIKEILYTPYGDIYHDTYPDFEV 2354
Cdd:COG3209   905 -LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGAAAN 964
                         890       900       910       920       930       940
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 569009290 2355 IIGFHGGLYDFLTKLVHLGQRDYDVVAGRWTTPNhhiwkqlnllpkP------FNLYSFENNYPVGKI 2416
Cdd:COG3209   965 PLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD------------PiglaggLNLYAYVGNNPVNYV 1020
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1196-1518 2.26e-09

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 60.80  E-value: 2.26e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1196 PVALASGPDGSVYVGDF--NFVRRIFPSGNSVSILELsTSPAHKYYLAMDPmSESLYLSDTNTRKVYKLkslveTKDlSK 1273
Cdd:COG4257    19 PRDVAVDPDGAVWFTDQggGRIGRLDPATGEFTEYPL-GGGSGPHGIAVDP-DGNLWFTDNGNNRIGRI-----DPK-TG 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1274 NFEVVAGTGDQCLPFdqshcgdggkaseaslnsprGITVDRHGFIYFVDGT--MIRRIDenavittviGSNGLTSTQPLS 1351
Cdd:COG4257    91 EITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLD---------PATGEVTEFPLP 141
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1352 CDSGMditqvrlewPTDLAVNPmDNSLYVLDNnivlqisENRRVRIIagrpihcqvpGIDHFLVSKVAIHSTLESARAIS 1431
Cdd:COG4257   142 TGGAG---------PYGIAVDP-DGNLWVTDF-------GANAIGRI----------DPDTGTLTEYALPTPGAGPRGLA 194
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1432 VSHSGLLFIAETDERKVNRIQqvTTNGEISIIAGAPTDcdckidpncdcfsgdggyakdakmKAPSSLAVSPDGTLYVAD 1511
Cdd:COG4257   195 VDPDGNLWVADTGSGRIGRFD--PKTGTVTEYPLPGGG------------------------ARPYGVAVDGDGRVWFAE 248

                  ....*..
gi 569009290 1512 LGNVRIR 1518
Cdd:COG4257   249 SGANRIV 255
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
802-827 2.42e-09

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 54.44  E-value: 2.42e-09
                          10        20
                  ....*....|....*....|....*.
gi 569009290  802 CGDNLDNDGDGLTDCVDPDCCQQSNC 827
Cdd:NF033662    7 CSDGIDNDGDGLTDCADPDCAGNPVC 32
DUF5885 pfam19232
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
529-720 4.44e-06

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


Pssm-ID: 437064  Cd Length: 265  Bit Score: 50.78  E-value: 4.44e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   529 DDCSTNCNGNGECISGHCH-----------------CFPGFLGPdcaRDSCpvlCGG----NGE----------YEKGHC 577
Cdd:pfam19232   10 DDCTPPCGGTQVCIDRQCKdntlacttdaqcgtcmtCVAGACTP---KASC---CGGvtcgAGQtcdaktntcvYVKGYC 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   578 VC-RNGWKGPECDVPEEQCIDPTCFGHGT---CIMGV-----------------CICV------------PGYKGEICEE 624
Cdd:pfam19232   84 SAdHPCPSGSACDTAKNACIAQPPYGPDSgkgCVRGFgawiweldpatnsgvwrCRCAngslynsahecsPLADQTLCAA 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   625 EDcLDPMC------------------------SSHGICVK----GECHCSTGWGGVNCETplpicQEQCSGHGTFLLDTG 676
Cdd:pfam19232  164 EN-LDPNAlvpassvpafaaygwgnqpvlinkSTAGAAVPsplaGVCPCKPGWAGGSCTE-----DRTCNGRGTWNETTG 237
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 569009290   677 VCSCDPKWTGSDcstelctmECGSHGVCSRgicqceegWVGPTC 720
Cdd:pfam19232  238 QCACNIDFSGHN--------SCGDDNNCTS--------WTGPRC 265
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2338-2416 1.63e-05

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 45.18  E-value: 1.63e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290  2338 YTPYGDIYHDTyPDFEVIIGFHGGLYDFLTKLVHLGQRDYDVVAGRWTTPNhhiwkqlnllpkPF------NLYSFENNY 2411
Cdd:TIGR03696    1 YDPYGEVLSES-GAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD------------PIglggglNLYAYVGNN 67

                   ....*
gi 569009290  2412 PVGKI 2416
Cdd:TIGR03696   68 PVNWV 72
RHS_core NF041261
RHS element core protein;
1567-1682 3.43e-05

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 49.62  E-value: 3.43e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1567 FTYNAEGDLGAITSSNGNSVHIRRDAGGMPLwlvvpggqvywltISSNGVLKRvsAQGYNLAlmtypgntGLLATKSNEN 1646
Cdd:NF041261  602 YEYNAAGDLTAVITPDGNRSETQYDAWGKAV-------------STTQGGLTR--SMEYDAA--------GRITTLTNEN 658
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 569009290 1647 GWTTVYEYDPEGHLTNATFPTGEVSSFHSDLE-KLTK 1682
Cdd:NF041261  659 GSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTgKLTQ 695
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
711-754 2.17e-04

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 41.07  E-value: 2.17e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 569009290   711 CEEGWVGPTCEeRSC--------HSHCAEHGQCkdgkcECSPGWEGDHCTIA 754
Cdd:pfam01414    1 CDENYYGSTCS-KFCrprddkfgHYTCDANGNK-----VCLPGWTGPYCDKP 46
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
651-794 2.96e-03

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 40.51  E-value: 2.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290  651 GVNCETPLPICQEQCsghgtflLDTgvcSCDPkwtgSDCSTelCTMECGSHGVCSRGICQCEEGWV--GPTC-EERS--- 724
Cdd:NF041328   18 GAVCPEGLSVCGGAC-------VDL---RSDP----SNCGA--CGVACGAGQTCVAGACGCGPGTVacGGACvDTASdpa 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 569009290  725 ----CHSHCAEHGQCKDGKCecspgwegdhctiahyldavRDGCP-GLCFGNGRCT-LDQNGWHCvcqvGWSGTGC 794
Cdd:NF041328   82 hcgaCGAACAPGQVCEGGAC--------------------REACSeGLTRCGGACVdLATDPLHC----GACGVAC 133
 
Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
23-317 1.26e-78

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 265.69  E-value: 1.26e-78
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290    23 YTSSSDESEDGRKPRQ-SFNSRETLHEYNQELRRNYNSQSRK----------RKDVEKSTQEIEFCETPPTLCSGYHTDM 91
Cdd:pfam06484   13 YTSSSADSEECRVPTQkSYSSSETLKAFDHDSRMLYGNRVKDmvhkeadefsRQGQNFSLRELGICEPSPRHGLAYCTEM 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290    92 hSVSRHGYQLEMGSDVDTETEGAASPDHALRMWIRGMKSEHSSCLSSRANSALSLTDTDHERKSDGENGFKFSPVCCDME 171
Cdd:pfam06484   93 -GLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENGPPIPPSSSSSS 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   172 APAD--------------------SAQDMQSSPHNQFTFRPLPPPPPPPHACTCARKPP--------------------- 210
Cdd:pfam06484  172 PVEQhsppppslnenqrpllgnnaSHPILDSDPDEEFSPNSYLVRTGSGPQSAPSEQPPnfqnhsrlrtpppplppphkq 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   211 -----PTVDSLQRRSMTTRSQPS----PAAPAPPTSTQDSVHLHNSWVLNSNIPLETRHFLFKHGSGSSAIFSAASQNYP 281
Cdd:pfam06484  252 nqhhhPSINSLNRSSLTNRRNPSpaptASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLFKTGTGTTPLFCTASPGYP 331
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 569009290   282 LTSNTVYSPPPRPLPRSTFSRPAFTFNKPYRCCNWK 317
Cdd:pfam06484  332 LTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1196-1520 9.87e-39

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 148.83  E-value: 9.87e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1196 PVALASGPDGSVYVGDF--NFVRRIFPSGNsVSILELS--------TSPAHKYY----LAMDPmSESLYLSDTNTRKVYK 1261
Cdd:cd14953    25 PSGVAVDAAGNLYVADRgnHRIRKITPDGV-VTTVAGTgtagfadgGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRK 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1262 LkslvetkDLSKNFEVVAGTGDQclpfdqsHCGDGGKASEASLNSPRGITVDRHGFIYFVDGT--MIRRIDENAVITTVI 1339
Cdd:cd14953   103 I-------TPDGVVSTLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVA 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1340 GsnglTSTQPLSCD-SGmdiTQVRLEWPTDLAVNPMDNsLYVLD--NNIVLQISENRRVRIIAGRPIHCQVPGIDhflvs 1416
Cdd:cd14953   169 G----TGGAGYAGDgPA---TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTGTAGFSGDGG----- 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1417 kvAIHSTLESARAISVSHSGLLFIAETDErkvNRIQQVTTNGEISIIAGAPTDcdckidpncdcFSGDGGYAKDAKMKAP 1496
Cdd:cd14953   236 --ATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNP 299
                         330       340
                  ....*....|....*....|....
gi 569009290 1497 SSLAVSPDGTLYVADLGNVRIRTI 1520
Cdd:cd14953   300 TGVAVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2640-2717 1.52e-35

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 130.42  E-value: 1.52e-35
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 569009290  2640 EEKNHVLEMARQRAVAQAWTQEQRRLQEGEEGTRVWTEGEKQQLLGTGRVQGYDGYFVLSVEQYLELSDSANNIHFMR 2717
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1240-1521 6.05e-34

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 134.58  E-value: 6.05e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1240 LAMDPmSESLYLSDTNTRKVYKL--KSLVETkdlsknfevVAGTGDQclpfdqshcG-DGGKASEASLNSPRGITVDRHG 1316
Cdd:cd14953    28 VAVDA-AGNLYVADRGNHRIRKItpDGVVTT---------VAGTGTA---------GfADGGGAAAQFNTPSGVAVDAAG 88
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1317 FIYFVDGT--MIRRIDENAVITTVIGsnglTSTQPLSCDSGMdiTQVRLEWPTDLAVNPMDNsLYVLD--NNIVLQISEN 1392
Cdd:cd14953    89 NLYVADTGnhRIRKITPDGVVSTLAG----TGTAGFSDDGGA--TAAQFNYPTGVAVDAAGN-LYVADtgNHRIRKITPD 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1393 RRVRIIAGRPihcqVPGidhFLVSKVAIHSTLESARAISVSHSGLLFIAETDErkvNRIQQVTTNGEISIIAGAPTDCdc 1472
Cdd:cd14953   162 GVVTTVAGTG----GAG---YAGDGPATAAQFNNPTGVAVDAAGNLYVADRGN---HRIRKITPDGVVTTVAGTGTAG-- 229
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*....
gi 569009290 1473 kidpncdcFSGDGGyAKDAKMKAPSSLAVSPDGTLYVADLGNVRIRTIS 1521
Cdd:cd14953   230 --------FSGDGG-ATAAQLNNPTGVAVDAAGNLYVADSGNHRIRKIT 269
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1483-2416 7.33e-31

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 133.73  E-value: 7.33e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1483 GDGGYAKDAKMKAPSSLAVSPDGTLYVADLGNVRIRTISKNQAHLNDMNLYEIASPADQELYQFTVNGTHLHTMNLITRD 1562
Cdd:COG3209   119 VSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGTGAVTLATGLAGS 198
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1563 YVYNFTYNAEGDLGAITSSNGNSVHIRRDAGGMPLWLVVPGGQVYWLTISSNGVLKRVSAQGYNLALMTYPGNTGLLATK 1642
Cdd:COG3209   199 ALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGASGAGLDAST 278
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1643 SNENGWTTVYEYDPEGHLTNATFPTGEVSSFHSDLEKLTKVALDTSNRENVLMSTNLTATSTIYILKQEN---TQSTYRV 1719
Cdd:COG3209   279 GTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTvggGGSLTLG 358
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1720 SPDGSLRVTFASGMEINLSSEPHILAGAVNPTLGKCNISLPGEHNANLIEWRQRKEQNKGNVSAFERRLRAHNRNLLSID 1799
Cdd:COG3209   359 GYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTG 438
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1800 FDHMTRTGKIYDDHRKFTLRILYDQTGRPILWSPVSRYNEVNITYSPSGLVTFIQRGTWNEKMEYDQSGKIISRTWADGK 1879
Cdd:COG3209   439 GGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARG 518
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1880 IWSYTYLEKSVMLLLHSQRRYIFEYDQSDCLLSVTMPSMVRHSLQTMLSVGYYRNIYTPPDSSTSFIQDYSRDGRLLQTL 1959
Cdd:COG3209   519 LVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTT 598
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1960 HLGTGRRVLYKYTKQARLSEILYDTTQVTLTYEESSGVIKTIHLMHDGFicTIRYRQTGPLIGRQIFRFSEEGLVNARFD 2039
Cdd:COG3209   599 TTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGT--GVTTTGTTTTRATGTTGTGTGVTAGLTTL 676
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 2040 YSYNNFRVTSMQAVINETPLPIDLYRYVDVSGRTEQFGKFSVINYDLNQVITTTVMKHTKIFNANGQVIEVQYEILKAIA 2119
Cdd:COG3209   677 ATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAG 756
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 2120 YWmTIQYDNMGRMVICDIRVGVDANITRYFYEYDADGQLQTVSVNDKIQWRYSYDLNGNINLLSHGNSARLTPL-----R 2194
Cdd:COG3209   757 AL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGGGTDLqdrtyT 835
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 2195 YDLRDRITRLgeiqykmdEDGFLRQRGNDIFEYNSNGLLQKAYNkvSGWTVQYYYDGLGRRVASKSSLGQHLQFFYADLa 2274
Cdd:COG3209   836 YDAAGNITSI--------TDALRAGTLTQTYTYDALGRLTSATD--PGTTESYTYDANGNLTSRTDGGTTTYTYDALGR- 904
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 2275 nPIRVTHlynhTSAEITSLYYDLQGHliamelssgeeyyvaCDNMGTPLAVFSSRGQVIKEILYTPYGDIYHDTYPDFEV 2354
Cdd:COG3209   905 -LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGAAAN 964
                         890       900       910       920       930       940
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 569009290 2355 IIGFHGGLYDFLTKLVHLGQRDYDVVAGRWTTPNhhiwkqlnllpkP------FNLYSFENNYPVGKI 2416
Cdd:COG3209   965 PLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD------------PiglaggLNLYAYVGNNPVNYV 1020
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1277-1521 9.62e-30

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 122.64  E-value: 9.62e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1277 VVAGTGDqclpfdqSHCGDGGKASeASLNSPRGITVDRHGFIYFVDGT--MIRRIDENAVITTVIGsnglTSTQPLSCDS 1354
Cdd:cd14953     3 TVAGSGT-------AGFSGGGGTA-ARFNSPSGVAVDAAGNLYVADRGnhRIRKITPDGVVTTVAG----TGTAGFADGG 70
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1355 GmdiTQVRLEWPTDLAVNPMDNsLYVLD--NNIVLQISENRRVRIIAGRpihcqvpGIDHFLVSKVAIHSTLESARAISV 1432
Cdd:cd14953    71 G---AAAQFNTPSGVAVDAAGN-LYVADtgNHRIRKITPDGVVSTLAGT-------GTAGFSDDGGATAAQFNYPTGVAV 139
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1433 SHSGLLFIAETderKVNRIQQVTTNGEISIIAGAPTDcdckidpncdcFSGDGGYAKDAKMKAPSSLAVSPDGTLYVADL 1512
Cdd:cd14953   140 DAAGNLYVADT---GNHRIRKITPDGVVTTVAGTGGA-----------GYAGDGPATAAQFNNPTGVAVDAAGNLYVADR 205

                  ....*....
gi 569009290 1513 GNVRIRTIS 1521
Cdd:cd14953   206 GNHRIRKIT 214
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1190-1520 4.08e-22

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 98.54  E-value: 4.08e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1190 NNKLFAPVALASGPDGSVYVGDF--NFVRRIFPSGNSV-SILELSTSPAHKYY---LAMDPmSESLYLSDTNTRKVYKLk 1263
Cdd:cd05819     4 PGELNNPQGIAVDSSGNIYVADTgnNRIQVFDPDGNFItSFGSFGSGDGQFNEpagVAVDS-DGNLYVADTGNHRIQKF- 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1264 slvetkDLSKNFEVVAGTGDQclpfdqshcGDGGkaseasLNSPRGITVDRHGFIYFVDgTM---IRRIDENAVITTVIG 1340
Cdd:cd05819    82 ------DPDGNFLASFGGSGD---------GDGE------FNGPRGIAVDSSGNIYVAD-TGnhrIQKFDPDGEFLTTFG 139
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1341 SNGLTSTQplscdsgmditqvrLEWPTDLAVNPmDNSLYVLDnnivlqiSENRRVRIIA--GRPIhcqvpgidhFLV-SK 1417
Cdd:cd05819   140 SGGSGPGQ--------------FNGPTGVAVDS-DGNIYVAD-------TGNHRIQVFDpdGNFL---------TTFgST 188
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1418 VAIHSTLESARAISVSHSGLLFIAETDErkvNRIQqvttngeisiiagaptdcdcKIDPNCDCFSGDGGYA-KDAKMKAP 1496
Cdd:cd05819   189 GTGPGQFNYPTGIAVDSDGNIYVADSGN---NRVQ--------------------VFDPDGAGFGGNGNFLgSDGQFNRP 245
                         330       340
                  ....*....|....*....|....
gi 569009290 1497 SSLAVSPDGTLYVADLGNVRIRTI 1520
Cdd:cd05819   246 SGLAVDSDGNLYVADTGNNRIQVF 269
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1304-1528 1.66e-18

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 88.14  E-value: 1.66e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1304 LNSPRGITVDRHGFIYFVDGTM--IRRIDENAVITTVIGSNGLTSTQplscdsgmditqvrLEWPTDLAVNPmDNSLYVL 1381
Cdd:cd05819     7 LNNPQGIAVDSSGNIYVADTGNnrIQVFDPDGNFITSFGSFGSGDGQ--------------FNEPAGVAVDS-DGNLYVA 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1382 DnnivlqiSENRRVRII--AGRPI-HCQVPGIDhflvskvaiHSTLESARAISVSHSGLLFIAETDErkvNRIQQVTTNG 1458
Cdd:cd05819    72 D-------TGNHRIQKFdpDGNFLaSFGGSGDG---------DGEFNGPRGIAVDSSGNIYVADTGN---HRIQKFDPDG 132
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1459 EISIIAGAPTDCDCK--------IDPN-----CDC-------FSGDGGY--------AKDAKMKAPSSLAVSPDGTLYVA 1510
Cdd:cd05819   133 EFLTTFGSGGSGPGQfngptgvaVDSDgniyvADTgnhriqvFDPDGNFlttfgstgTGPGQFNYPTGIAVDSDGNIYVA 212
                         250
                  ....*....|....*...
gi 569009290 1511 DLGNVRIRTISKNQAHLN 1528
Cdd:cd05819   213 DSGNNRVQVFDPDGAGFG 230
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1166-1389 1.66e-16

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 83.35  E-value: 1.66e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1166 VIATIMGNGHQRSVActncNGPAHNNKLFAPVALASGPDGSVYVGDF--NFVRRIFPSGNsvsilelstspahkyylamd 1243
Cdd:cd14953   163 VVTTVAGTGGAGYAG----DGPATAAQFNNPTGVAVDAAGNLYVADRgnHRIRKITPDGV-------------------- 218
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1244 pmseslylsdtntrkvyklkslVETkdlsknfevVAGTGDQclPFdqshcGDGGKASEASLNSPRGITVDRHGFIYFVD- 1322
Cdd:cd14953   219 ----------------------VTT---------VAGTGTA--GF-----SGDGGATAAQLNNPTGVAVDAAGNLYVADs 260
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 569009290 1323 --GTmIRRIDENAVITTVIGSnglTSTQPLSCDSGmdiTQVRLEWPTDLAVNPmDNSLYVLD--NNIVLQI 1389
Cdd:cd14953   261 gnHR-IRKITPAGVVTTVAGG---GAGFSGDGGPA---TSAQFNNPTGVAVDA-AGNLYVADtgNNRIRKI 323
NHL_like_3 cd14956
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1296-1517 1.09e-09

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271326 [Multi-domain]  Cd Length: 274  Bit Score: 61.91  E-value: 1.09e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1296 GGKASEAslNSPRGITVDRHGFIYFVD--GTMIRRIDENaviTTVIGSNGLTSTQPLScdsgmditqvrLEWPTDLAVNP 1373
Cdd:cd14956   100 GSGPGQF--NAPRGVAVDADGNLYVADfgNQRIQKFDPD---GSFLRQWGGTGIEPGS-----------FNYPRGVAVDP 163
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1374 mDNSLYVLDnnivlqiSENRRVriiagrpihcQVPGIDHFLVSKVAIHST----LESARAISVSHSGLLFIAETDErkvN 1449
Cdd:cd14956   164 -DGTLYVAD-------TYNDRI----------QVFDNDGAFLRKWGGRGTgpgqFNYPYGIAIDPDGNVFVADFGN---N 222
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 569009290 1450 RIQQVTTNGEISIIAGAPTdcdckidpncdcfSGDGgyakdaKMKAPSSLAVSPDGTLYVADLGNVRI 1517
Cdd:cd14956   223 RIQKFTADGTFLTSWGSPG-------------TGPG------QFKNPWGVVVDADGTVYVADSNNNRV 271
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1196-1518 2.26e-09

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 60.80  E-value: 2.26e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1196 PVALASGPDGSVYVGDF--NFVRRIFPSGNSVSILELsTSPAHKYYLAMDPmSESLYLSDTNTRKVYKLkslveTKDlSK 1273
Cdd:COG4257    19 PRDVAVDPDGAVWFTDQggGRIGRLDPATGEFTEYPL-GGGSGPHGIAVDP-DGNLWFTDNGNNRIGRI-----DPK-TG 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1274 NFEVVAGTGDQCLPFdqshcgdggkaseaslnsprGITVDRHGFIYFVDGT--MIRRIDenavittviGSNGLTSTQPLS 1351
Cdd:COG4257    91 EITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLD---------PATGEVTEFPLP 141
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1352 CDSGMditqvrlewPTDLAVNPmDNSLYVLDNnivlqisENRRVRIIagrpihcqvpGIDHFLVSKVAIHSTLESARAIS 1431
Cdd:COG4257   142 TGGAG---------PYGIAVDP-DGNLWVTDF-------GANAIGRI----------DPDTGTLTEYALPTPGAGPRGLA 194
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1432 VSHSGLLFIAETDERKVNRIQqvTTNGEISIIAGAPTDcdckidpncdcfsgdggyakdakmKAPSSLAVSPDGTLYVAD 1511
Cdd:COG4257   195 VDPDGNLWVADTGSGRIGRFD--PKTGTVTEYPLPGGG------------------------ARPYGVAVDGDGRVWFAE 248

                  ....*..
gi 569009290 1512 LGNVRIR 1518
Cdd:COG4257   249 SGANRIV 255
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
802-827 2.42e-09

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 54.44  E-value: 2.42e-09
                          10        20
                  ....*....|....*....|....*.
gi 569009290  802 CGDNLDNDGDGLTDCVDPDCCQQSNC 827
Cdd:NF033662    7 CSDGIDNDGDGLTDCADPDCAGNPVC 32
NHL_like_3 cd14956
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1307-1517 2.48e-09

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271326 [Multi-domain]  Cd Length: 274  Bit Score: 60.76  E-value: 2.48e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1307 PRGITVDRHGFIYFVDGT--MIRRIDENAVITTVIGSNGLTSTQplscdsgmditqvrLEWPTDLAVNPmDNSLYVLD-- 1382
Cdd:cd14956    62 PRGLAVDKDGWLYVADYWgdRIQVFTLTGELQTIGGSSGSGPGQ--------------FNAPRGVAVDA-DGNLYVADfg 126
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1383 NNIVLQISENRR-VRIIAGRPIHcqvPGidHFLvskvaihstleSARAISVSHSGLLFIAETderKVNRIQQVTTNGEIS 1461
Cdd:cd14956   127 NQRIQKFDPDGSfLRQWGGTGIE---PG--SFN-----------YPRGVAVDPDGTLYVADT---YNDRIQVFDNDGAFL 187
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 569009290 1462 IIAGAPtdcdckidpncdcFSGDGgyakdaKMKAPSSLAVSPDGTLYVADLGNVRI 1517
Cdd:cd14956   188 RKWGGR-------------GTGPG------QFNYPYGIAIDPDGNVFVADFGNNRI 224
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1196-1517 2.21e-08

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 57.60  E-value: 2.21e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1196 PVALASGPDGSVYVGDfnfvrrifpSGNSVsILEL---STSPA--------HKYYLAMDPmSESLYLSDTNTRKVYKLks 1264
Cdd:cd14952    12 PGGVAVDAAGNVYVAD---------SGNNR-VLKLaagSTTQTvlpftglyQPQGVAVDA-AGTVYVTDFGNNRVLKL-- 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1265 lvetkdlsknfevVAGTGDQC-LPFdqshcgdggkaseASLNSPRGITVDRHGFIYFVDGTmirridENAVITTVIGSNg 1343
Cdd:cd14952    79 -------------AAGSTTQTvLPF-------------TGLNDPTGVAVDAAGNVYVADTG------NNRVLKLAAGSN- 125
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1344 ltstqplscdsgmdiTQVRLEW-----PTDLAVNPMDNsLYVLDnnivlqiSENRRVRiiagrpihcqvpgidhflvsKV 1418
Cdd:cd14952   126 ---------------TQTVLPFtglsnPDGVAVDGAGN-VYVTD-------TGNNRVL--------------------KL 162
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1419 AIHST---------LESARAISVSHSGLLFIAETDErkvNRIQQVTtngeisiiAGAPTdcdckidPNCDCFSGdggyak 1489
Cdd:cd14952   163 AAGSTtqtvlpftgLNSPSGVAVDTAGNVYVTDHGN---NRVLKLA--------AGSTT-------PTVLPFTG------ 218
                         330       340
                  ....*....|....*....|....*...
gi 569009290 1490 dakMKAPSSLAVSPDGTLYVADLGNVRI 1517
Cdd:cd14952   219 ---LNGPLGVAVDAAGNVYVADRGNDRV 243
NHL_like_5 cd14963
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1192-1398 7.81e-08

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271333 [Multi-domain]  Cd Length: 268  Bit Score: 56.15  E-value: 7.81e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1192 KLFAPVALASGPDGSVYVGDF------------NFVRRIFPSGNSVSIlelsTSPAHkyyLAMDpmSESLYLSDTNTRKV 1259
Cdd:cd14963    54 EFKYPYGIAVDSDGNIYVADLyngriqvfdpdgKFLKYFPEKKDRVKL----ISPAG---LAID--DGKLYVSDVKKHKV 124
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1260 Y-------KLKSLVETKD-------------LSKNFEVVAGTGDQ-CLPFDQSHCG----DGGKASEASLNSPRGITVDR 1314
Cdd:cd14963   125 IvfdlegkLLLEFGKPGSepgelsypngiavDEDGNIYVADSGNGrIQVFDKNGKFikelNGSPDGKSGFVNPRGIAVDP 204
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1315 HGFIYFVDgTMIRRI---DENAVITTVIGSNGLtstqplscdsgmDITQVRLewPTDLAVNPmDNSLYVLDnnivlqiSE 1391
Cdd:cd14963   205 DGNLYVVD-NLSHRVyvfDEQGKELFTFGGRGK------------DDGQFNL--PNGLFIDD-DGRLYVTD-------RE 261

                  ....*..
gi 569009290 1392 NRRVRII 1398
Cdd:cd14963   262 NNRVAVY 268
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1195-1464 2.45e-07

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 54.64  E-value: 2.45e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1195 APVALASGPDGSVYVGDFNF--VRRIFPSGNSVSILELSTSPAHKYYLAMDPmSESLYLSDTNTRKVYKLkslvetkDLS 1272
Cdd:COG4257    60 GPHGIAVDPDGNLWFTDNGNnrIGRIDPKTGEITTFALPGGGSNPHGIAFDP-DGNLWFTDQGGNRIGRL-------DPA 131
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1273 KN-FEVVAGTGDQclpfdqshcgdggkaseaslNSPRGITVDRHGFIYFVD--GTMIRRID-ENAVITTVIGSNGLTStq 1348
Cdd:COG4257   132 TGeVTEFPLPTGG--------------------AGPYGIAVDPDGNLWVTDfgANAIGRIDpDTGTLTEYALPTPGAG-- 189
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1349 plscdsgmditqvrlewPTDLAVNPmDNSLYVLDnnivlqiSENRRVRII---AGRpihcqvpgidhflVSKVAIHSTLE 1425
Cdd:COG4257   190 -----------------PRGLAVDP-DGNLWVAD-------TGSGRIGRFdpkTGT-------------VTEYPLPGGGA 231
                         250       260       270
                  ....*....|....*....|....*....|....*....
gi 569009290 1426 SARAISVSHSGLLFIAETDerkVNRIQQVTTNGEISIIA 1464
Cdd:COG4257   232 RPYGVAVDGDGRVWFAESG---ANRIVRFDPDTELTEYV 267
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1301-1520 5.74e-07

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 53.48  E-value: 5.74e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1301 EASLNSPRGITVDRHGFIYFVDGT--MIRRID-ENAVITTVIGSNGLTStqplscdsgmditqvrlewPTDLAVNPmDNS 1377
Cdd:COG4257    55 LGGGSGPHGIAVDPDGNLWFTDNGnnRIGRIDpKTGEITTFALPGGGSN-------------------PHGIAFDP-DGN 114
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1378 LYVLDNNivlqiseNRRVRII---AGRpihcqvpgidhflVSKVAIHSTLESARAISVSHSGLLFIAEtdeRKVNRIQQV 1454
Cdd:COG4257   115 LWFTDQG-------GNRIGRLdpaTGE-------------VTEFPLPTGGAGPYGIAVDPDGNLWVTD---FGANAIGRI 171
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 569009290 1455 TT-NGEISIIAGaPTdcdckidpncdcfsgdggyakdaKMKAPSSLAVSPDGTLYVADLGNVRIRTI 1520
Cdd:COG4257   172 DPdTGTLTEYAL-PT-----------------------PGAGPRGLAVDPDGNLWVADTGSGRIGRF 214
NHL_TRIM71_like cd14954
NHL repeat domain of the tripartite motif-containing protein 71 (TRIM71) and related proteins; ...
1304-1517 6.99e-07

NHL repeat domain of the tripartite motif-containing protein 71 (TRIM71) and related proteins; The E3 ubiquitin-protein ligase TRIM71 (LIN-41) is a RING-finger domain containing protein that has been associated with a variety of activities. The NHL repeat domain appears responsible for targeting TRIM71 to mRNAs, and TRIM71 appears responsible for translational repression and mRNA decay. Together with BRAT, TRIM71 may be part of a family of mRNA repressors that regulate proliferation and differentiation. TRIM has been shown to negatively regulate stability of Lin28B, which inhibits the pre-let-7 miRNA precursor from maturing by recruiting the terminal uriyltransferase TUT4. This family also contains the Caenorhabditis elegans NHL repeat containing 1 (NHL-1), a RING-finger-containing protein that was shown to interact with E2 ubiquitin conjugating enzymes in two-hybrid screens. Its domain architecture resembles that of the E3 ubiquitin protein ligases TRIM2, TRIM32, and TRIM71. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271324 [Multi-domain]  Cd Length: 285  Bit Score: 53.71  E-value: 6.99e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1304 LNSPRGITVDRHGFIYFVD--GTMIRRIDENAVITTVIGSNGLTSTQplscdsgmditqvrLEWPTDLAVNPMDNsLYVL 1381
Cdd:cd14954    70 FDRPAGVAVNSRGRIIVADkdNHRIQVFDLNGRFLLKFGERGTKNGQ--------------FNYPWGVAVDSEGR-IYVS 134
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1382 DnnivlqiSENRRVRIIA--GRPIH---CQVPGIDHFlvskvaihstlESARAISVSHSGLLFIAETDErkvNRIQQVTT 1456
Cdd:cd14954   135 D-------TRNHRVQVFDsdGQFIRkfgFEGAGPGQL-----------DSPRGVAVNPDGNIVVSDFNN---HRLQVFDP 193
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1457 NGE-ISIIAGAPTDCDC-------KIDPN-----CDC-------FSGDGGYAK--------DAKMKAPSSLAVSPDGTLY 1508
Cdd:cd14954   194 DGQfLRFFGSEGSGNGQfkrprgvAVDDEgniivADSgnhrvqvFSPDGEFLCsfgtegngEGQFDRPSGVAVTPDGRIV 273

                  ....*....
gi 569009290 1509 VADLGNVRI 1517
Cdd:cd14954   274 VVDRGNHRI 282
NHL_like_6 cd14962
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1196-1517 8.68e-07

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271332 [Multi-domain]  Cd Length: 271  Bit Score: 52.97  E-value: 8.68e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1196 PVALASGPDGSVYVGDfnfvrrifPSGNSVSILELstsPAHKYY---------------LAMDPmSESLYLSDTNTRKVY 1260
Cdd:cd14962    14 PYGVAADGRGRIYVAD--------TGRGAVFVFDL---PNGKVFvignagpnrfvspigVAIDA-NGNLYVSDAELGKVF 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1261 KLkslvetkDLSKNFEVVAGTGDQclpfdqshcgdggkaseasLNSPRGITVDRHG-FIYFVD--GTMIRRIDENAVITT 1337
Cdd:cd14962    82 VF-------DRDGKFLRAIGAGAL-------------------FKRPTGIAVDPAGkRLYVVDtlAHKVKVFDLDGRLLF 135
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1338 VIGSNGltstqplscdSGmditQVRLEWPTDLAVNPMDNsLYVLDnnivlqiSENRRVRII--AGRPIHC-----QVPGi 1410
Cdd:cd14962   136 DIGKRG----------SG----PGEFNLPTDLAVDRDGN-LYVTD-------TMNFRVQIFdaDGKFLRSfgergDGPG- 192
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1411 dhflvskvaihsTLESARAISVSHSGLLFIAETderKVNRIQQVTTNGEISIIAGAPtdcdckidpncdcFSGDGGYAkd 1490
Cdd:cd14962   193 ------------SFARPKGIAVDSEGNIYVVDA---AFDNVQIFNPEGELLLTVGGP-------------GSGPGEFY-- 242
                         330       340
                  ....*....|....*....|....*..
gi 569009290 1491 akmkAPSSLAVSPDGTLYVADLGNVRI 1517
Cdd:cd14962   243 ----LPSGIAIDKDDRIYVVDQFNRRI 265
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1297-1520 1.84e-06

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 52.58  E-value: 1.84e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1297 GKASEASLNSPRGITVDRHGFIYFVDgT---MIRRID-ENAVITTVIGsnglTSTQPLSCDSGMDITQVRLEWPTDLAVN 1372
Cdd:cd14951    11 GSFAEASFNEPQGLALLPGNILYVAD-TenhALRKIDlETGTVTTLAG----TGEQGRDGEGGGPGREQPLSSPWDVAWG 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1373 PMDNSLYV------------LDNNIVLQISENRRVRIIAGRPIH----CQVPGI----DHFL------------------ 1414
Cdd:cd14951    86 PEDDILYIamagthqiwaydLDTGTCRVFAGSGNEGNRNGPYPHeawfAQPSGLslagWGELfvadsessairavslkdg 165
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1415 VSKVAIHSTL---------------ESAR-----AISVSHSGLLFIAETDERKVNRIQQVTtnGEISIIAGaptdcdcki 1474
Cdd:cd14951   166 GVKTLVGGTRvgtglfdfgdrdgpgAEALlqhplGVAALPDGSVYVADTYNHKIKRVDPAT--GEVSTLAG--------- 234
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*..
gi 569009290 1475 dpncdcfSGDGGYAKDAKMKA-PSSLAVSPDGTLYVADLGNVRIRTI 1520
Cdd:cd14951   235 -------TGKAGYKDLEAQFSePSGLVVDGDGRLYVADTNNHRIRRL 274
DUF5885 pfam19232
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
529-720 4.44e-06

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


Pssm-ID: 437064  Cd Length: 265  Bit Score: 50.78  E-value: 4.44e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   529 DDCSTNCNGNGECISGHCH-----------------CFPGFLGPdcaRDSCpvlCGG----NGE----------YEKGHC 577
Cdd:pfam19232   10 DDCTPPCGGTQVCIDRQCKdntlacttdaqcgtcmtCVAGACTP---KASC---CGGvtcgAGQtcdaktntcvYVKGYC 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   578 VC-RNGWKGPECDVPEEQCIDPTCFGHGT---CIMGV-----------------CICV------------PGYKGEICEE 624
Cdd:pfam19232   84 SAdHPCPSGSACDTAKNACIAQPPYGPDSgkgCVRGFgawiweldpatnsgvwrCRCAngslynsahecsPLADQTLCAA 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   625 EDcLDPMC------------------------SSHGICVK----GECHCSTGWGGVNCETplpicQEQCSGHGTFLLDTG 676
Cdd:pfam19232  164 EN-LDPNAlvpassvpafaaygwgnqpvlinkSTAGAAVPsplaGVCPCKPGWAGGSCTE-----DRTCNGRGTWNETTG 237
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 569009290   677 VCSCDPKWTGSDcstelctmECGSHGVCSRgicqceegWVGPTC 720
Cdd:pfam19232  238 QCACNIDFSGHN--------SCGDDNNCTS--------WTGPRC 265
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2338-2416 1.63e-05

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 45.18  E-value: 1.63e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290  2338 YTPYGDIYHDTyPDFEVIIGFHGGLYDFLTKLVHLGQRDYDVVAGRWTTPNhhiwkqlnllpkPF------NLYSFENNY 2411
Cdd:TIGR03696    1 YDPYGEVLSES-GAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD------------PIglggglNLYAYVGNN 67

                   ....*
gi 569009290  2412 PVGKI 2416
Cdd:TIGR03696   68 PVNWV 72
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
1844-1884 1.82e-05

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 43.73  E-value: 1.82e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 569009290  1844 YSPSG-LVTFIQRGTWNEKMEYDQSGKIISRTWADGKIWSYT 1884
Cdd:TIGR01643    1 YDAAGrLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1191-1262 2.94e-05

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 48.48  E-value: 2.94e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 569009290 1191 NKLFAPVALASGPDGSVYVGDF--NFVRRIFPSGNSVSILELSTSPAHKYYLAMDPMSeSLYLSDTNTRKVYKL 1262
Cdd:COG4257   185 TPGAGPRGLAVDPDGNLWVADTgsGRIGRFDPKTGTVTEYPLPGGGARPYGVAVDGDG-RVWFAESGANRIVRF 257
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1481-1523 3.11e-05

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 48.68  E-value: 3.11e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 569009290 1481 FSGDGGyaKDAKMKAPSSLAVSPDGTLYVADLGNVRIRTISKN 1523
Cdd:cd14953    12 FSGGGG--TAARFNSPSGVAVDAAGNLYVADRGNHRIRKITPD 52
RHS_core NF041261
RHS element core protein;
1567-1682 3.43e-05

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 49.62  E-value: 3.43e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1567 FTYNAEGDLGAITSSNGNSVHIRRDAGGMPLwlvvpggqvywltISSNGVLKRvsAQGYNLAlmtypgntGLLATKSNEN 1646
Cdd:NF041261  602 YEYNAAGDLTAVITPDGNRSETQYDAWGKAV-------------STTQGGLTR--SMEYDAA--------GRITTLTNEN 658
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 569009290 1647 GWTTVYEYDPEGHLTNATFPTGEVSSFHSDLE-KLTK 1682
Cdd:NF041261  659 GSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTgKLTQ 695
NHL_like_3 cd14956
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1366-1528 2.06e-04

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271326 [Multi-domain]  Cd Length: 274  Bit Score: 45.74  E-value: 2.06e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1366 PTDLAVNPMDNSLYVLDNNIVLQI--SENRRVRIIAGRPihcqvPGIDHFlvskvaihstlESARAISVSHSGLLFIAET 1443
Cdd:cd14956    15 PRGIAVDADDNVYVADARNGRIQVfdKDGTFLRRFGTTG-----DGPGQF-----------GRPRGLAVDKDGWLYVADY 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1444 DErkvNRIQQVTTNGEISIIAGAPTdcdckidpncdcfSGDGGYAkdakmkAPSSLAVSPDGTLYVADLGNVRIRTISKN 1523
Cdd:cd14956    79 WG---DRIQVFTLTGELQTIGGSSG-------------SGPGQFN------APRGVAVDADGNLYVADFGNQRIQKFDPD 136

                  ....*
gi 569009290 1524 QAHLN 1528
Cdd:cd14956   137 GSFLR 141
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
711-754 2.17e-04

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 41.07  E-value: 2.17e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 569009290   711 CEEGWVGPTCEeRSC--------HSHCAEHGQCkdgkcECSPGWEGDHCTIA 754
Cdd:pfam01414    1 CDENYYGSTCS-KFCrprddkfgHYTCDANGNK-----VCLPGWTGPYCDKP 46
YvrE COG3386
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase ...
1199-1339 2.25e-04

Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the Pathway/BioSystem: Non-phosphorylated Entner-Doudoroff pathway


Pssm-ID: 442613 [Multi-domain]  Cd Length: 266  Bit Score: 45.65  E-value: 2.25e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1199 LASGPDGSVYVGDFNFVR------RIFPSGnSVSIL--ELSTSPAhkyyLAMDPMSESLYLSDTNTRKVYKLkSLVETKD 1270
Cdd:COG3386    98 GVVDPDGRLYFTDMGEYLptgalyRVDPDG-SLRVLadGLTFPNG----IAFSPDGRTLYVADTGAGRIYRF-DLDADGT 171
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 569009290 1271 LSkNFEVVAgtgdqclpfdQSHCGDGGkaseaslnsPRGITVDRHGFIY--FVDGTMIRRIDENAVITTVI 1339
Cdd:COG3386   172 LG-NRRVFA----------DLPDGPGG---------PDGLAVDADGNLWvaLWGGGGVVRFDPDGELLGRI 222
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1173-1397 1.11e-03

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 43.41  E-value: 1.11e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1173 NGHQRSVACTNCNGPAHNNklfaPVALASGPDGSVYVGD-FNFVRRIFPS--GNSVSILELSTSPAHKYYL---AMDPmS 1246
Cdd:cd14957    95 GVYQYSIGTGGSGDGQFNG----PYGIAVDSNGNIYVADtGNHRIQVFTSsgTFSYSIGSGGTGPGQFNGPqgiAVDS-D 169
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1247 ESLYLSDTNTRKVYKLKSlvetkdlSKNFEVVAGTGDQclpfdqshcGDGGkaseasLNSPRGITVDRHGFIYFVDgTMI 1326
Cdd:cd14957   170 GNIYVADTGNHRIQVFTS-------SGTFQYTFGSSGS---------GPGQ------FSDPYGIAVDSDGNIYVAD-TGN 226
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 569009290 1327 RRI---DENAVITTVIGSNGLTSTQplscdsgmditqvrLEWPTDLAVNPmDNSLYVLDNNivlqiseNRRVRI 1397
Cdd:cd14957   227 HRIqvfTSSGAYQYSIGTSGSGNGQ--------------FNYPYGIAVDN-DGKIYVADSN-------NNRIQV 278
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
665-689 1.82e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 37.71  E-value: 1.82e-03
                           10        20
                   ....*....|....*....|....*
gi 569009290   665 CSGHGTFLLDTGVCSCDPKWTGSDC 689
Cdd:pfam07974    2 CSGRGTCVNQCGKCVCDSGYQGATC 26
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
697-720 2.01e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 37.71  E-value: 2.01e-03
                           10        20
                   ....*....|....*....|....*.
gi 569009290   697 ECGSHGVCSR--GICQCEEGWVGPTC 720
Cdd:pfam07974    1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1489-1523 2.66e-03

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 42.31  E-value: 2.66e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 569009290 1489 KDAKMKAPSSLAVSPDGTLYVADLGNVRIRTISKN 1523
Cdd:cd05819     3 GPGELNNPQGIAVDSSGNIYVADTGNNRIQVFDPD 37
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
651-794 2.96e-03

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 40.51  E-value: 2.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290  651 GVNCETPLPICQEQCsghgtflLDTgvcSCDPkwtgSDCSTelCTMECGSHGVCSRGICQCEEGWV--GPTC-EERS--- 724
Cdd:NF041328   18 GAVCPEGLSVCGGAC-------VDL---RSDP----SNCGA--CGVACGAGQTCVAGACGCGPGTVacGGACvDTASdpa 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 569009290  725 ----CHSHCAEHGQCKDGKCecspgwegdhctiahyldavRDGCP-GLCFGNGRCT-LDQNGWHCvcqvGWSGTGC 794
Cdd:NF041328   82 hcgaCGAACAPGQVCEGGAC--------------------REACSeGLTRCGGACVdLATDPLHC----GACGVAC 133
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
535-557 3.69e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.94  E-value: 3.69e-03
                           10        20
                   ....*....|....*....|....*
gi 569009290   535 CNGNGECIS--GHCHCFPGFLGPDC 557
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
729-751 4.72e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.56  E-value: 4.72e-03
                           10        20
                   ....*....|....*....|....*
gi 569009290   729 CAEHGQCKD--GKCECSPGWEGDHC 751
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
NHL_like_6 cd14962
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1186-1348 5.92e-03

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271332 [Multi-domain]  Cd Length: 271  Bit Score: 41.42  E-value: 5.92e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1186 GPAHNNKLFAPVALASGPDGSVYVGD--------FN----FVRRIfpsGNSvsilELSTSPAHkyyLAMDPMSESLYLSD 1253
Cdd:cd14962    49 GNAGPNRFVSPIGVAIDANGNLYVSDaelgkvfvFDrdgkFLRAI---GAG----ALFKRPTG---IAVDPAGKRLYVVD 118
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290 1254 TNTRKVYKLkslvetkDLSKNFEVVAGTgdqclpfdqshcgDGGKASEasLNSPRGITVDRHGFIYFVDgTMIRRI---D 1330
Cdd:cd14962   119 TLAHKVKVF-------DLDGRLLFDIGK-------------RGSGPGE--FNLPTDLAVDRDGNLYVTD-TMNFRVqifD 175
                         170
                  ....*....|....*...
gi 569009290 1331 ENAVITTVIGSNGLTSTQ 1348
Cdd:cd14962   176 ADGKFLRSFGERGDGPGS 193
Keratin_B2 pfam01500
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ...
595-731 7.88e-03

Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.


Pssm-ID: 366678 [Multi-domain]  Cd Length: 161  Bit Score: 39.77  E-value: 7.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 569009290   595 CIDPTCFGHGTCIMGVCicvpgykGEICEEEDCLDPMCSSHGIC--VKGECHCSTGWGGVNCETPL--PIC------QEQ 664
Cdd:pfam01500    9 CGFPTCSTGGTCGSGCC-------QPCCCQSSCCRPSCCQTSCCqpTTFQSSCCRPTCQPCCQTSCcqPTCcqtsscQTG 81
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 569009290   665 CSGHGTFLL-DTGVCSCDPKWTGSDCSTE-LCTMECGSHGVCSRGICQ--------CEEGWVGPTCEERSCHSHCAE 731
Cdd:pfam01500   82 CGGIGYGQEgSSGAVSSRTRWCRPDCRVEgTCLPPCCVVSCTPPTCCQlhhaqascCRPSYCGQSCCRPACCCQCSE 158
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH