Conserved domains on [gi|1720385419|ref|XP_030104713|]

cleavage and polyadenylation specificity factor subunit 1 isoform X2 [Mus musculus]

Protein Classification

CPSF1 family protein (domain architecture ID 13419888)

CPSF1 family protein similar to Arabidopsis thaliana cleavage and polyadenylation specificity factor subunit 1 (CPSF1), the RNA recognition subunit of CPSF. It plays a key role in pre-mRNA 3'-end formation, recognizing the AAUAAA signal sequence and interacting with poly(A) polymerase and other factors to bring about cleavage and poly(A) addition.

List of domain hits

Name  Accession  Description  Interval  E-value
CPSF_A  pfam03178  CPSF A subunit region  737-1072  7.65e-97

CPSF A subunit region; This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs. The function of the aligned region is unknown but may be involved in RNA/DNA binding.


Pssm-ID: 427182  Cd Length: 319  Bit Score: 310.29  E-value: 7.65e-97
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  737 AFSIQLISPVSWEAIPnaRIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCR-GRILIMDVIEVvpep 815
Cdd:pfam03178    1 ASCIRLVDPITKEVID--TLELEENEAVLSVKSVNLEDSSTTKGKEEYLVVGTAFDLGEDPAARsGRILVFEIIEV---- 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  816 gqPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSL-RASELTGMAFIDTQLYIHQMISVKNFILAADVMKS 894
Cdd:pfam03178   75 --PETNRKLKLVHKTEVKGAVTALAEFQGRLLAGQGQKLRVYDLgEDKSLLPKAFLDTGVYVVDLKVFGNRIIVGDLMKS 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  895 ISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNaqlGFLVSDRDRNLMVYMYLPEAKESFGG-MRLLRRADFHVGAHVN 973
Cdd:pfam03178  153 VTFVGYDEEPYRLIEFARDTQPRWVTAAEFLDGD---TVLVADKFGNLHVLRYDPDVPESLDGdPRLLVRAEFHLGETVT 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  974 TFWRTPC-RGAAEGPSKKSVVWenkhitwfATLDGGIGLLLP-MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVD 1051
Cdd:pfam03178  230 SFRKGSLvPGGSESPSSPQLLY--------GTLDGSIGLLVPfISEEDYRFLQSLQQQLRDELPHLGGLDHRAFRSYYTP 301
                          330       340
                   ....*....|....*....|.
gi 1720385419 1052 RRilqnAVRNVLDGELLNRYL 1072
Cdd:pfam03178  302 PR----TVKGVIDGDLLERFL 318
SFT1 super family  cl34923  Pre-mRNA cleavage and polyadenylation specificity factor [RNA processing and modification]  48-1087  2.20e-89

The actual alignment was detected with superfamily member COG5161:

Pssm-ID: 227490 [Multi-domain]  Cd Length: 1319  Bit Score: 314.60  E-value: 2.20e-89
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419   48 LFLGSRLGNSLLLKYTEKLqepPASSVREAadkeeppskkKRVEPAVGWTGGKTvpQDEVDEIEV---YGSEAQSGTQLA 124
Cdd:COG5161    404 FFGGVGDSNSRVLRIKSLL---PTIETRAS----------EGVGPLEGGNDEEM--DDEYSAPENklfGNKEQEVRRQDE 468
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  125 TYSFEVCDSMLNIGPCANAAVGEPAFLSEEFQNSPEPdLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVi 204
Cdd:COG5161    469 PYDAELFNALSNAGPITDFAVGKVDVEKGLPIPNIGL-LNLVVTKGSDSEAALAVEGTSLEPCICTVSSFIPLEIVWSQ- 546
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  205 apvrkeeeetpkaesteqepSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTSGFATQGPTVFAGNIGDNRYIVQ 284
Cdd:COG5161    547 --------------------KIRGYLRCSRALDFYILSRVSDSRIFRWSEEFLLEVSGEYTRDVNTLLFVEFGEENRVVQ 606
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  285 VSPLGIRLLEG-VNQLHFIPVDLGApIVQCAVADPYVVIMSAEGHVTMFLLKSDSyggrhhrLALHKPPLhhqskVIALC 363
Cdd:COG5161    607 VTPSYLLRYDQdLRMLGRVEFASRA-VEARSVRDPLILVVRDSGKILTFYDREKN-------MRLFKIDL-----VTCLA 673
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  364 LYRDVSGMFTTESRLGgarDELGGRSGSEAEGLgSETSPTVDDEeemlygdssalFSPSkeearrsSQPPADRDPAPFKA 443
Cdd:COG5161    674 DAKNKSFVLSDSNSLG---IFDIGKRISQLEPC-LVKGLPYAIQ-----------FSPE-------ASPAMDLAGEEDGD 731
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  444 DPthwcllVRENGTMEIYQLPDwrLVFlvknfpvgqrvlvdssfgqpttqgevrkeeatrqgELPLVKEVLLVALGSRQS 523
Cdd:COG5161    732 DQ------LTEISMSLTYNLID--MLF-----------------------------------RLPSIGNYMVAYLGLDLK 768
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  524 RPYL-LVHVDQELLIYEAFPhdsqlgqgnlkvrfkkvPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIY 602
Cdd:COG5161    769 EEYLfDNSLSSEIVFYKTHL-----------------PRHVSFNLNVTRNDLAITGAPDNADIKAFSSVGRIDMVFIKAV 831
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  603 GYSGVFICGPSPHWLLVTGRGALRLHPMGiDGPIDSFAPFHNvncpRGFLYFNRQGELRISVLPAYLSYDA-PWPVRKIP 681
Cdd:COG5161    832 GHSFMFVTGKGPFLCRSRYTSSSKAFHRG-NIPLVSVIPLSK----RGYLMVDNVLGVRASQYVFDNGYVGnKNPVKRTP 906
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  682 LRCTAHYVAYHVESKVYAVATstntpCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPnaRIELEEW 761
Cdd:COG5161    907 KHKTLQKLVYHCAGRYMVVGS-----CEEAGFSPKGEDGESGIPVDTNVPHAEGYRFYVDLYSPKSWEVID--TYEFDEN 979
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  762 EHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 841
Cdd:COG5161    980 EYVFHIKYLILDDMQGTKGKSPYILVGTTFIEGEDRPARGRLHVLEIISVVPSPGSPFTDCKLKVLGIEETKGTVVRVCE 1059
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  842 CNGHLVSAIGQKIFLWSL-RASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVY 920
Cdd:COG5161   1060 VRGKIALCQGQKVMVRKIdRSSGIIPVGFYDLHIFTSSIKVVKNLLLAGDIYQGLSFFGFQSEPYRMHLISSSEPLRNAT 1139
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  921 SVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGA---HVNTFWRTPCRGAAEGPSKKSVvwenk 997
Cdd:COG5161   1140 STEFLVTGNELYFLCCDAKGNIHGLTYSPNNPISMSGARLVKRSSFTLHSaeiKMNLLPRNSEFGAGFKKNFIMV----- 1214
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  998 hitwFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRmLHVDRRILQNAVRNVLDGELLNRYLYLSTM 1077
Cdd:COG5161   1215 ----YSRSDGMLIHVVPISDAHYRRLLGIQTAIMARLKSVGGLNPRDYR-LNSDIHLHSLSLRSPLDLHIINLFSYFDMS 1289
                         1050
                   ....*....|
gi 1720385419 1078 ERSELAKKIG 1087
Cdd:COG5161   1290 TRESVASKAG 1299
 
MMS1_N  pfam10433  Mono-functional DNA-alkylating methyl methanesulfonate N-term  8-336  6.32e-15

Mono-functional DNA-alkylating methyl methanesulfonate N-term; MMS1 is a protein that protects against replication-dependent DNA damage in Saccharomyces cerevisiae. MMS1 belongs to the DDB1 family of cullin 4 adaptors and the two proteins are homologous. MMS1 bridges the interaction of MMS22 and Crt10 with Cul8/Rtt101. Cul8/Rtt101 is a cullin protein involved in the regulation of DNA replication subsequent to DNA damage. The N-terminal region of MMS1 and the C-terminal region of MMS22 are required for the MMS1-MMS22 interaction. The human HIV-1 virion-associated protein Vpr assembles with DDB1 through interaction with DCAF1 (chromatin assembly factor) to form an E3 ubiquitin ligase that targets cellular substrates for proteasome-mediated degradation and subsequent G2 arrest.


Pssm-ID: 463091  Cd Length: 486  Bit Score: 78.85  E-value: 6.32e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419    8 GEIYVLTLITD------GMRSVRAFHFDKAaasvltTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASsvreaadke 81
Cdd:pfam10433  223 GDLYLLTIENDednvvtSIKIGYFGTTSVA------SALVILDNGFLFVASEFGDSQLYQIDARGDDDLSN--------- 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419   82 eppskkkrvepavgwtggktvpqdevdeievygseaqsgtqlatysFEVCDSMLNIGPCANAAVgepAFLSEEfqnspeP 161
Cdd:pfam10433  288 ----------------------------------------------LELVQTFSNWAPILDFVV---MDLGGE------D 312
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  162 DLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPG--CYDMWTViapvrkeeeetpkaesteqePSAPKAEEDGrrhgFL 239
Cdd:pfam10433  313 TARIYTCSGAGKRGSLRSLRHGVGAEELAVSEEPGspITGVWTL--------------------KSSPEDEYDD----YL 368
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720385419  240 ILSREDSTMILQ-TGQEIMELDT-SGFATQGPTVFAGNIGDNRyIVQVSPLGIRLLEGVNQLHFIPVDLGAPIVQCAVAD 317
Cdd:pfam10433  369 VVSFVNETRVLSiDGDGVEEVDEdSGFLLSVPTLAAGNLGDGR-LLQVTPNGIRLIDSDKRISEWKPPGGKSITAAAANG 447
                          330
                   ....*....|....*....
gi 1720385419  318 PYVVIMSAEGHVTMFLLKS 336
Cdd:pfam10433  448 RQVLLALSGGELVYFEIST 466
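
The alignment blocks above pair a query row (gi 1720385419) with a domain-model row (Cdd:...). As a rough illustration of how such a block can be read, the following Python sketch counts identical residues over aligned columns. It assumes that dashes mark gaps and that lowercase letters mark unaligned residues; the function name aligned_identity is hypothetical and is not part of any NCBI tool. The two strings are copied from the last block of the CPSF_A (pfam03178) alignment.

```python
def aligned_identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for one alignment block."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        # Assumption: '-' is a gap and lowercase residues are unaligned,
        # so such columns are skipped rather than scored.
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Residues 1052-1072 of the query against pfam03178 positions 302-318.
query_row = "RRilqnAVRNVLDGELLNRYL"
model_row = "PR----TVKGVIDGDLLERFL"

identical, aligned = aligned_identity(query_row, model_row)
print(f"{identical} of {aligned} aligned columns are identical")
```

Applying the same function to every block of a hit and summing the counts gives an overall identity figure. That figure is only a crude summary; it is not the bit score or E-value reported on the page.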
 
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
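
To illustrate how the E-value threshold in the preset options relates to the reported hits, here is a minimal Python sketch. It assumes that a hit is reported when its E-value is at or below the threshold. The HITS list is transcribed by hand from this page, and passing_hits is a hypothetical helper, not an NCBI API.

```python
# (name, accession, interval, E-value) as listed in the domain hits above.
HITS = [
    ("CPSF_A", "pfam03178", "737-1072", 7.65e-97),
    ("SFT1", "COG5161", "48-1087", 2.20e-89),
    ("MMS1_N", "pfam10433", "8-336", 6.32e-15),
]

E_VALUE_THRESHOLD = 0.01  # from the preset options


def passing_hits(hits, threshold):
    """Keep hits whose E-value is at or below the threshold (assumed rule)."""
    return [hit for hit in hits if hit[3] <= threshold]


for name, accession, interval, e_value in passing_hits(HITS, E_VALUE_THRESHOLD):
    print(f"{name:8s} {accession:10s} {interval:9s} {e_value:.2e}")
```

All three listed hits fall far below 0.01. Tightening the threshold in the sketch, for example to 1e-50, would leave only CPSF_A and SFT1.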
