NCBI Conserved Domain Search

Conserved domains on [gi|1609559026|ref|NP_001356390|]

View

nuclear factor 1 B-type isoform 8 [Homo sapiens]

Protein Classification

nuclear factor I( domain architecture ID 12106891)

nuclear factor I (NFI) is a CCAAT-box-binding protein active in transcription and DNA replication

Graphical summary

Zoom to residue level

show extra options »

Show site features Horizontal zoom: ×

List of domain hits

Name

Accession

Description

Interval

E-value

CTF_NFI

pfam00859

CTF/NF-I family transcription modulation region;

209-489

1.63e-119

CTF/NF-I family transcription modulation region;

Pssm-ID: 459967 [Multi-domain] Cd Length: 288 Bit Score: 354.60 E-value: 1.63e-119

                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026 209 EDSFVKSGVFNVSELVRVSRTPITQGTGVNFPIGEIPSqPYYHDMNSGVNLQRSLSSPPS--SKRPKTISIDENMEPSPT 286
Cdd:pfam00859   1 QDSFVTSGVFSVTELVRVSRTPVATGTGPNFSLGELQG-PLYYDLNPGVGLRRSLPSTSSsgSKRHKSGSMEDDVDTSPG 79

                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026 287 GDFYPSPSSPAAGSRTW-HERDQDMSSPTTMKKPEKPLFSSASPQDSSPRLSTFPQHHHPGIpgVAHSVIStRTPPPPSP 365
Cdd:pfam00859  80 GDYYRSPSSPASSSRNWpHDVEGGMSSPVKKKKPDKSDFSSPSPQDSSPRLMAFTQHHRPVI--AVHSGIS-RSPHPSSA 156

                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026 366 LPFPTQAILpPAPSSYFSHPTIRYPPHLnPQDTLKNYVP--SYDPSSPQTSQPNGSGQvvGKVPGHF--TPVLAPSPHPS 441
Cdd:pfam00859 157 LHFPSSSIL-QQPSSYFPHPAIRYPPHL-PQDPLKDLVSlaCYDPSSQQPSQPNGSGQ--GKVPGHFisTQMLAPPPHPP 232

                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1609559026 442 AVRPVTLSMtDTKPITTSTEA---------YTASGTSQANRYVGLSPRDPSFLHQQQ 489
Cdd:pfam00859 233 VARPVPLPM-DTKPITTSTEGgassptsptYSAPGTPPANRFVGLGPRDPGFLYQAQ 288

NfI_DNAbd_pre-N

pfam10524

Nuclear factor I protein pre-N-terminus; The Nuclear factor I (NFI) family of site-specific ...

10-47

3.78e-19

Nuclear factor I protein pre-N-terminus; The Nuclear factor I (NFI) family of site-specific DNA-binding proteins (also known as CTF or CAAT box transcription factor) functions both in viral DNA replication and in the regulation of gene expression in higher organizms. The N-terminal 200 residues contains the DNA-binding and dimerization domain, but also has an 8-47 residue highly conserved region 5' of this, whose function is not known. Deletion of the N-terminal 200 amino acids removes the DNA-binding activity, dimerization-ability and the stimulation of adenovirus DNA replication.

Pssm-ID: 463134 Cd Length: 41 Bit Score: 80.73 E-value: 3.78e-19

                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1609559026  10 QDEFHPFIEALLPHVRAIAYTWFNLQARKRKYFKKHEK 47
Cdd:pfam10524   4 QEDFHPFIEALLPYVKAFAYTWFNLQAAKRRHYKKHDK 41

MH1 super family

cl45991

MH1 domain; The MH1 (MAD homology 1) domain is found at the amino terminus of MAD related ...

69-173

9.91e-18

MH1 domain; The MH1 (MAD homology 1) domain is found at the amino terminus of MAD related proteins such as Smads. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain shows that a highly conserved 11 residue beta hairpin is used to bind the DNA consensus sequence GNCN in the major groove, shown to be vital for the transcriptional activation of target genes. Not all examples of MH1 can bind to DNA however. Smad2 cannot bind DNA and has a large insertion within the hairpin that presumably abolishes DNA binding. A basic helix (H2) in MH1 with the nuclear localization signal KKLKK has been shown to be essential for Smad3 nuclear import. Smads also use the MH1 domain to interact with transcription factors such as Jun, TFE3, Sp1, and Runx.

The actual alignment was detected with superfamily member pfam03165:

Pssm-ID: 460833 Cd Length: 103 Bit Score: 78.57 E-value: 9.91e-18

                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026  69 KQKWASRLLAKLRKDIrqEYREDFVLTVTGK---KHPCCVLSN--------PDQKGKIRRIDClrqadKVWRL-DLVMVI 136
Cdd:pfam03165   1 LKKAVESLLKKLKKKI--QQLEELELAVESRgdpPTGCVTIPRsldgrlqvAGRKGLPHVIYC-----RLWRWpDLQSQH 73

                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 1609559026 137 LFKGIPLESTDGErlMKSPHCtnpalCVQPHHITVSV 173
Cdd:pfam03165  74 ELKAIPTCETAFE--SKKDEV-----CINPYHYSRVE 103

Name

Accession

Description

Interval

E-value

CTF_NFI

pfam00859

CTF/NF-I family transcription modulation region;

209-489

1.63e-119

CTF/NF-I family transcription modulation region;

Pssm-ID: 459967 [Multi-domain] Cd Length: 288 Bit Score: 354.60 E-value: 1.63e-119

                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026 209 EDSFVKSGVFNVSELVRVSRTPITQGTGVNFPIGEIPSqPYYHDMNSGVNLQRSLSSPPS--SKRPKTISIDENMEPSPT 286
Cdd:pfam00859   1 QDSFVTSGVFSVTELVRVSRTPVATGTGPNFSLGELQG-PLYYDLNPGVGLRRSLPSTSSsgSKRHKSGSMEDDVDTSPG 79

                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026 287 GDFYPSPSSPAAGSRTW-HERDQDMSSPTTMKKPEKPLFSSASPQDSSPRLSTFPQHHHPGIpgVAHSVIStRTPPPPSP 365
Cdd:pfam00859  80 GDYYRSPSSPASSSRNWpHDVEGGMSSPVKKKKPDKSDFSSPSPQDSSPRLMAFTQHHRPVI--AVHSGIS-RSPHPSSA 156

                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026 366 LPFPTQAILpPAPSSYFSHPTIRYPPHLnPQDTLKNYVP--SYDPSSPQTSQPNGSGQvvGKVPGHF--TPVLAPSPHPS 441
Cdd:pfam00859 157 LHFPSSSIL-QQPSSYFPHPAIRYPPHL-PQDPLKDLVSlaCYDPSSQQPSQPNGSGQ--GKVPGHFisTQMLAPPPHPP 232

                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1609559026 442 AVRPVTLSMtDTKPITTSTEA---------YTASGTSQANRYVGLSPRDPSFLHQQQ 489
Cdd:pfam00859 233 VARPVPLPM-DTKPITTSTEGgassptsptYSAPGTPPANRFVGLGPRDPGFLYQAQ 288

NfI_DNAbd_pre-N

pfam10524

Nuclear factor I protein pre-N-terminus; The Nuclear factor I (NFI) family of site-specific ...

10-47

3.78e-19

Pssm-ID: 463134 Cd Length: 41 Bit Score: 80.73 E-value: 3.78e-19

                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1609559026  10 QDEFHPFIEALLPHVRAIAYTWFNLQARKRKYFKKHEK 47
Cdd:pfam10524   4 QEDFHPFIEALLPYVKAFAYTWFNLQAAKRRHYKKHDK 41

MH1

pfam03165

MH1 domain; The MH1 (MAD homology 1) domain is found at the amino terminus of MAD related ...

69-173

9.91e-18

Pssm-ID: 460833 Cd Length: 103 Bit Score: 78.57 E-value: 9.91e-18

                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026  69 KQKWASRLLAKLRKDIrqEYREDFVLTVTGK---KHPCCVLSN--------PDQKGKIRRIDClrqadKVWRL-DLVMVI 136
Cdd:pfam03165   1 LKKAVESLLKKLKKKI--QQLEELELAVESRgdpPTGCVTIPRsldgrlqvAGRKGLPHVIYC-----RLWRWpDLQSQH 73

                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 1609559026 137 LFKGIPLESTDGErlMKSPHCtnpalCVQPHHITVSV 173
Cdd:pfam03165  74 ELKAIPTCETAFE--SKKDEV-----CINPYHYSRVE 103

DWA

smart00523

Domain A in dwarfin family proteins;

68-176

5.46e-17

Domain A in dwarfin family proteins;

Pssm-ID: 214708 Cd Length: 109 Bit Score: 76.65 E-value: 5.46e-17

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026   68 IKQKWASRLLAKLRKDIRQEYREDFVLTVTGKKHPC--CVLSNPDQKGKirridcLRQADKVWRLDLVMVILFKGIPLES 145
Cdd:smart00523   1 VEEKWAKKATESLLKKLKKKQLEELLQAVESKGGPPtrCVLIPRSLDGR------LQVAHRKGLPHVLYCRLFRWPDLQS 74

                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1609559026  146 tdGERLMKSPHC------TNPALCVQPHHITVSVKEL 176
Cdd:smart00523  75 --PHELKALPTCehafesKSDEVCCNPYHYSRVERPE 109

PTZ00449

104 kDa microneme/rhoptry antigen; Provisional

265-438

6.54e-04

104 kDa microneme/rhoptry antigen; Provisional

Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 42.75 E-value: 6.54e-04

                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026 265 SPPSSKRPKTISIDEnmepSPTGDFYP-SPSSPAAGSR-TWHERDQDMSSPTTMKKPEKPlfssASPQDSSPRLSTFPQH 342
Cdd:PTZ00449  608 RPKSPKLPELLDIPK----SPKRPESPkSPKRPPPPQRpSSPERPEGPKIIKSPKPPKSP----KPPFDPKFKEKFYDDY 679

                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026 343 hhpgIPGVAHSVISTRTPPPPSPLPFPTQAILPPAPSSYFSHPtiRYPPHLNPQDTLKNYVPSYDPSSPQTSQPNGSGQV 422
Cdd:PTZ00449  680 ----LDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP--RPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPP 753

                         170
                  ....*....|....*..
gi 1609559026 423 VGK-VPGHFTPVLAPSP 438
Cdd:PTZ00449  754 EEErTFFHETPADTPLP 770

Name

Accession

Description

Interval

E-value

CTF_NFI

pfam00859

CTF/NF-I family transcription modulation region;

209-489

1.63e-119

CTF/NF-I family transcription modulation region;

Pssm-ID: 459967 [Multi-domain] Cd Length: 288 Bit Score: 354.60 E-value: 1.63e-119

                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026 209 EDSFVKSGVFNVSELVRVSRTPITQGTGVNFPIGEIPSqPYYHDMNSGVNLQRSLSSPPS--SKRPKTISIDENMEPSPT 286
Cdd:pfam00859   1 QDSFVTSGVFSVTELVRVSRTPVATGTGPNFSLGELQG-PLYYDLNPGVGLRRSLPSTSSsgSKRHKSGSMEDDVDTSPG 79

                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026 287 GDFYPSPSSPAAGSRTW-HERDQDMSSPTTMKKPEKPLFSSASPQDSSPRLSTFPQHHHPGIpgVAHSVIStRTPPPPSP 365
Cdd:pfam00859  80 GDYYRSPSSPASSSRNWpHDVEGGMSSPVKKKKPDKSDFSSPSPQDSSPRLMAFTQHHRPVI--AVHSGIS-RSPHPSSA 156

                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026 366 LPFPTQAILpPAPSSYFSHPTIRYPPHLnPQDTLKNYVP--SYDPSSPQTSQPNGSGQvvGKVPGHF--TPVLAPSPHPS 441
Cdd:pfam00859 157 LHFPSSSIL-QQPSSYFPHPAIRYPPHL-PQDPLKDLVSlaCYDPSSQQPSQPNGSGQ--GKVPGHFisTQMLAPPPHPP 232

                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1609559026 442 AVRPVTLSMtDTKPITTSTEA---------YTASGTSQANRYVGLSPRDPSFLHQQQ 489
Cdd:pfam00859 233 VARPVPLPM-DTKPITTSTEGgassptsptYSAPGTPPANRFVGLGPRDPGFLYQAQ 288

NfI_DNAbd_pre-N

pfam10524

Nuclear factor I protein pre-N-terminus; The Nuclear factor I (NFI) family of site-specific ...

10-47

3.78e-19

Pssm-ID: 463134 Cd Length: 41 Bit Score: 80.73 E-value: 3.78e-19

                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1609559026  10 QDEFHPFIEALLPHVRAIAYTWFNLQARKRKYFKKHEK 47
Cdd:pfam10524   4 QEDFHPFIEALLPYVKAFAYTWFNLQAAKRRHYKKHDK 41

MH1

pfam03165

MH1 domain; The MH1 (MAD homology 1) domain is found at the amino terminus of MAD related ...

69-173

9.91e-18

Pssm-ID: 460833 Cd Length: 103 Bit Score: 78.57 E-value: 9.91e-18

                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026  69 KQKWASRLLAKLRKDIrqEYREDFVLTVTGK---KHPCCVLSN--------PDQKGKIRRIDClrqadKVWRL-DLVMVI 136
Cdd:pfam03165   1 LKKAVESLLKKLKKKI--QQLEELELAVESRgdpPTGCVTIPRsldgrlqvAGRKGLPHVIYC-----RLWRWpDLQSQH 73

                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 1609559026 137 LFKGIPLESTDGErlMKSPHCtnpalCVQPHHITVSV 173
Cdd:pfam03165  74 ELKAIPTCETAFE--SKKDEV-----CINPYHYSRVE 103

DWA

smart00523

Domain A in dwarfin family proteins;

68-176

5.46e-17

Domain A in dwarfin family proteins;

Pssm-ID: 214708 Cd Length: 109 Bit Score: 76.65 E-value: 5.46e-17

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026   68 IKQKWASRLLAKLRKDIRQEYREDFVLTVTGKKHPC--CVLSNPDQKGKirridcLRQADKVWRLDLVMVILFKGIPLES 145
Cdd:smart00523   1 VEEKWAKKATESLLKKLKKKQLEELLQAVESKGGPPtrCVLIPRSLDGR------LQVAHRKGLPHVLYCRLFRWPDLQS 74

                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1609559026  146 tdGERLMKSPHC------TNPALCVQPHHITVSVKEL 176
Cdd:smart00523  75 --PHELKALPTCehafesKSDEVCCNPYHYSRVERPE 109

PTZ00449

104 kDa microneme/rhoptry antigen; Provisional

265-438

6.54e-04

104 kDa microneme/rhoptry antigen; Provisional

Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 42.75 E-value: 6.54e-04

                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026 265 SPPSSKRPKTISIDEnmepSPTGDFYP-SPSSPAAGSR-TWHERDQDMSSPTTMKKPEKPlfssASPQDSSPRLSTFPQH 342
Cdd:PTZ00449  608 RPKSPKLPELLDIPK----SPKRPESPkSPKRPPPPQRpSSPERPEGPKIIKSPKPPKSP----KPPFDPKFKEKFYDDY 679

                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026 343 hhpgIPGVAHSVISTRTPPPPSPLPFPTQAILPPAPSSYFSHPtiRYPPHLNPQDTLKNYVPSYDPSSPQTSQPNGSGQV 422
Cdd:PTZ00449  680 ----LDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP--RPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPP 753

                         170
                  ....*....|....*..
gi 1609559026 423 VGK-VPGHFTPVLAPSP 438
Cdd:PTZ00449  754 EEErTFFHETPADTPLP 770

PHA03247

large tegument protein UL36; Provisional

264-482

9.13e-04

large tegument protein UL36; Provisional

Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 9.13e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026  264 SSPPSSKRPKTISIDENmepSPTGDFYPSPSSPA-------AGSRTWHERDQDMSSPTTMKKPEKPLFSSASPQDSSPRL 336
Cdd:PHA03247  2590 DAPPQSARPRAPVDDRG---DPRGPAPPSPLPPDthapdppPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR 2666

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026  337 STFPqhhhpgipgvahsvisTRTPPPPSPLPFPTQAILPP--APSSYFSHPTiryPPHLNPQdtlknyvPSYDPSSPQTS 414
Cdd:PHA03247  2667 ARRL----------------GRAAQASSPPQRPRRRAARPtvGSLTSLADPP---PPPPTPE-------PAPHALVSATP 2720

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1609559026  415 QPNGSGQVVGKVPghfTPVLAPSPHPSAVRPVTlSMTDTKPITTSTEAYTASGTSQANRYVGLSPRDP 482
Cdd:PHA03247  2721 LPPGPAAARQASP---ALPAAPAPPAVPAGPAT-PGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT 2784

Enamelin

pfam15362

Enamelin; ENAMELIN is involved in the mineralization and structural organization of enamel. It ...

289-329

1.25e-03

Enamelin; ENAMELIN is involved in the mineralization and structural organization of enamel. It is necessary for the extension of enamel during the secretory stage of dental enamel formation. The proteins are expressed in teeth, particularly in odontoblasts, ameloblasts and cementoblasts.

Pssm-ID: 464672 [Multi-domain] Cd Length: 907 Bit Score: 41.74 E-value: 1.25e-03

                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 1609559026 289 FYPSPSSPAAGSRTWHERDQdmsSPTTMKKPEKPLFSSASP 329
Cdd:pfam15362 393 YDPRENSPYLRSNTWDERDD---SPNTMGQPENPLYPMNTP 430

Herpes_BLLF1

pfam05109

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...

255-472

1.69e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.

Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 41.44 E-value: 1.69e-03

                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026 255 SGVNLQRSLSSPPSSKrpKTISIDENMEPSPTGD------FYPSPSSPAAGSRTwheRDQDMSSPTTMKKPEKPLFSSAS 328
Cdd:pfam05109 450 SSTHVPTNLTAPASTG--PTVSTADVTSPTPAGTtsgaspVTPSPSPRDNGTES---KAPDMTSPTSAVTTPTPNATSPT 524

                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1609559026 329 PQDSSPRlstfPQHHHPGIPGVAhsvistrtppppsplpfPTQAILPPAPSSYFSHPTIRYPphlNPQDTLKNYVPSYDP 408
Cdd:pfam05109 525 PAVTTPT----PNATSPTLGKTS-----------------PTSAVTTPTPNATSPTPAVTTP---TPNATIPTLGKTSPT 580

                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1609559026 409 SSPQTSQPNGSGQVVGKVpghfTPVLAPSPHpsavrpvTLSMTDTKPITTS--TEAYTASGTSQAN 472
Cdd:pfam05109 581 SAVTTPTPNATSPTVGET----SPQANTTNH-------TLGGTSSTPVVTSppKNATSAVTTGQHN 635

Blast search parameters

Data Source:	Precalculated data, version = cdd.v.3.21
Preset Options:	Database: CDSEARCH/cdd Low complexity filter: no Composition Based Adjustment: yes E-value threshold: 0.01