NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1370469953|ref|XP_024306305|]
View 

pre-mRNA-processing-splicing factor 8 isoform X1 [Homo sapiens]

Protein Classification

PRP8 family protein( domain architecture ID 11474092)

PRP8 family protein similar to Homo sapiens pre-mRNA-processing-splicing factor 8 that plays role in pre-mRNA splicing as core component of precatalytic, catalytic and postcatalytic spliceosomal complexes, both of the predominant U2-type spliceosome and the minor U12-type spliceosome

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PRP8 COG5178
U5 snRNP spliceosome subunit [RNA processing and modification];
1-2335 0e+00

U5 snRNP spliceosome subunit [RNA processing and modification];


:

Pssm-ID: 227505 [Multi-domain]  Cd Length: 2365  Bit Score: 3576.09  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953    1 MAGVFP-YRGPGNPVPG-------PLAPLPDYMSEEK--------------LQEKARKWQQLQ-AKRYAEKRKFGFVDAQ 57
Cdd:COG5178      1 MASLPPgNPPPPPPPPGfeppsqpPPPPPPGVNVKKRsrkqlsivgdilghSGNPIYSLRVSDkPVKLGNKAKTLHVLTL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953   58 KEDMPPEHVRKIIRDHGDMTNRKFRHDKRVYLGALKYMPHAVLKLLENMPMPWEQIRDVPVLYHITGAISFVNEIPWVIE 137
Cdd:COG5178     81 KAPIPPEHLRKIQSPCSDMPSVLTKVDKRSYLGALKYLPHAVLKLLENMPSPWEDVSEVKVLYHCHGAITFVNEVPRVIE 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  138 PVYISQWGSMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQLELDPEEDAPVLDWFYDHQPLRDSR 217
Cdd:COG5178    161 PQLFAQWGLCWSPMRREKRDRYSFKRKRFPPFDDLEPPLSKSQWVLGVEPLMPINIRLDRMDDEHVRDWVYTSRDLEDHP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  218 KyVNGSTYQRWQFTLPMMSTLYRLANQLLTDLVDDNYFYLFDLKAFFTSKALNMAIPGGPKFEPLVRDINLQDEDWNEFN 297
Cdd:COG5178    241 S-VNGAMYRRWKYMLPAMHNLLRLMPMLWESIRDVNYVYLFSGLSFFVAKALNVAIPGGPKFEPLYSRESAEFEDENEFN 319
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  298 DINKIIIRQPIRTEYKIAFPYLYNNLPHHVHLTWYHTPNVVFIKT-EDPDLPAFYFDPLINPISHR----HSVKSQEPLP 372
Cdd:COG5178    320 GIVRIIRRPPIDDEYPVAFPGLYNSRPRSVAVECYGSPECRDVFLdEDEDYPANFKDPLINPILGVqldnHPYDGKGSNE 399
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  373 D----DDEEFELPEFvEPFLKDTPLYTDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQ 448
Cdd:COG5178    400 EscvmERKLFSEPIF-YPYLYNESTEVRTTERAHLLLKNPFPFNKGKGRAERAQDVPLDKPWLLGHCLQERPVKVPVSYQ 478
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  449 KLLKYYVLNALKHRPPKAQKKRYLFRSFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVK 528
Cdd:COG5178    479 KLLKNYVRNMLHKTRPRPHTNTHLLKELKNTKYFQRTEIDWVEAGLQLCRQGHNMLSLLIHRKGLTYLHLDYNFNLKPTK 558
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  529 TLTTKERKKSRFGNAFHLCREVLRLTKLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDL 608
Cdd:COG5178    559 TLTTKERKKSRVGNSFHLMREMLKFIKLIVDIHVQFRLGNIDAYQLADGVHYILNHVGQLTGIYRYKYKLMKQIRACKDW 638
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  609 KHLIYYRFNTGpVGKGPGCGFWAAGWRVWLFFMRGITPLLERWLGNLLARQFEGRhSKGVAKTVTKQRVESHFDLELRAA 688
Cdd:COG5178    639 KHLIYYAFNEG-VGKGPGCGFWGPQWRVWLFFLRGHIPLLERYIGNLVTRQFEGR-SDYNPKPLTKQRSDSGYDLELRRQ 716
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  689 VMHDILDMMPEGIKQNKARTILQHLSEAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVD 768
Cdd:COG5178    717 VMADILSMLPEGIRQTKVRTILQHLSEAWRCWKANIPWHVPGEPAPILEVIRRYIKSKADLWTSSAHFNRERISRGAGVG 796
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  769 KTVCKKNLGRLTRLYLKAEQERQHNYLKDGPYITAEEAVAVYTTTVHWLESRRFSPIPFPPLSYKHDTKLLILALERLKE 848
Cdd:COG5178    797 KTKEKKNLGRLTRLWVKLEQERQVDSAKVGPKSTKEEAKRIGKITVLWLESRMFEPIPFPPLRYKEDTKILVLALEYLKS 876
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  849 AYSVKSRLNQSQREELGLIEQAYDNPHEALSRIKRHLLTQRAFKEVGIEFMDLYSHLVPVYDVEPLEKITDAYLDQYLWY 928
Cdd:COG5178    877 KYTGKIRLNESTREELALLEKAYDNPHDTLFRIKKSLLTQRSFKEVGITLMRHYDGAIPVYSVDPVEKIVDAYLDQYLWY 956
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  929 EADKRRLFPPWIKPADTEPPPLLVYKWCQGINNLQDVWETSEGECNVMLESRFEKMYEKIDLTLLNRLLRLIVDHNIADY 1008
Cdd:COG5178    957 EADRRNLFPEWIKPSDSEMPPLLVYKWCQGINNLKAAWDTSNGERLVLYETKLEGIMEKVDNTLLNRLLKLVLDPNLADY 1036
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1009 MTAKNNVVINYKDMNHTNSYGIIRGLQFASFIVQYYGLVMDLLVLGLHRASEMAGPPQMPNDFLSFQDIATEAAHPIRLF 1088
Cdd:COG5178   1037 IIAKNNVVVVYKDMSHTNHYGLIRGLQFSSFIYQFYGLVVDLLVLGLQRATEIAGPADAPNVFMDFKSRATETSHPIRLY 1116
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1089 CRYIDRIHIFFRFTADEARDLIQRYLTEHPDPNNENIVGYNN--KKCWPRDARMRLMKHDVNLGRAVFWDIKNRLPRSVT 1166
Cdd:COG5178   1117 TRYMDDIYIVFRFQRKEEDSLLEDYLRENPDPEEANNERYRNyfKGCWPDDCRMRLGPLDVNLGRAVFWEILRRCPHSLT 1196
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1167 TVQWENSFVSVYSKDNPNLLFNMCGFECRILPKCRTSyEEFTHKDGVWNLQNEVTKERTAQCFLRVDDESMQRFHNRVRQ 1246
Cdd:COG5178   1197 ATRWEPSFGSVYSKINPNLLFSMVGFEVRILPKIRKI-EERSLSSGVWRLGDGRTKQRTAHANLAVSEGGIEMFESRIRH 1275
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1247 ILMASGSTTFTKIVNKWNTALIGLMTYFREAVVNTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVVFYTPKELGGLG 1326
Cdd:COG5178   1276 ILMTSGSTTFTKVATKWNTQLIALVTYYREAICDTKGLLDKLVKAERLIQNRVKKGLNSKMPVRFPPAVFYAPKELGGLG 1355
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1327 MLSMGHVLIPQSDLRWSKQTDVGITHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKRQEAIAQNRRLTLE 1406
Cdd:COG5178   1356 MLSVGHILIPHSDLEWSKQTDTGITHFRSGMTTNGERLIPAAMRYISRWEYEFEDSQRVWAEYARKRQEAGQQNRRLTLE 1435
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1407 DLEDSWDRGIPRINTLFQKDRHTLAYDKGWRVRTDFKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILE 1486
Cdd:COG5178   1436 DLEMSWDRGIPRISTLFQRDRHTLAYDRGFRMRSEFKQYSLKPNNPFWWTDAKHDGKLWSLNRYRLDVIQALGGVEGILE 1515
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1487 HTLFKGTYFPTWEGLFWEKASGFEESMKWKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKI 1566
Cdd:COG5178   1516 HTLFKATGFRSWEGLFWEKASGFEESMKFKKLTNAQRMGLSQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGILMHGKI 1595
                         1610      1620      1630      1640      1650      1660      1670      1680
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1567 PTLKISLIQIFRAHLWQKIHESIVMDLCQVFDQELDALEIETVQKETIHPRKSYKMNSSCADILLFASYKWNVSRPSLLA 1646
Cdd:COG5178   1596 PTLKISLIQIFRNHLWQKIHESVVGDLCQVLDKELDVLQIETVQKETVHPRKSYKMNSSCADILLSGAYDWCVSSPSLLL 1675
                         1690      1700      1710      1720      1730      1740      1750      1760
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1647 DSKDVMDSTTTQKYWIDIQLRWGDYDSHDIERYARAKFLDYTTDNMSIYPSPTGVLIAIDLAYNLHSAYGNWFPGSKPLI 1726
Cdd:COG5178   1676 EERDGGSNVRTNKLWIDVQLRWGDYDSHDIHRYARAKFLDYTTDPQSMYPSPTGVVIGIDLCYNMWSAYGNWNEGLKPLI 1755
                         1770      1780      1790      1800      1810      1820      1830      1840
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1727 QQAMAKIMKANPALYVLRERIRKGLQLYSSEPTEPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGA 1806
Cdd:COG5178   1756 QSSMERIMKANPALYVLRERIRKGLQLYTSEPQEQYLSSSNYAELFSNSIDLFVDDTNVYRVTLHKTFEGNLTTKPINGA 1835
                         1850      1860      1870      1880      1890      1900      1910      1920
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1807 IFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEVAALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKG 1886
Cdd:COG5178   1836 IFVLNPATGNLFLKVIHTSVWAGQKRLIQLAKWKTAEEVFALGRSLPVEEQPKQIIVTRKSMLDPLEVHILDFPNISIRT 1915
                         1930      1940      1950      1960      1970      1980      1990      2000
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1887 SELQLPFQACLKVEKFGDLILKATEPQMVLFNLYDDWLKTISSYTAFSRLILILRALHVNNDRAKVILKPDKTTITEPHH 1966
Cdd:COG5178   1916 CELALPFSAVMGIDKIRDLILRATEPQMVLFNLYDDWLQETSSYTAFSRLLLVLRALDVNEERVKEILRPDKSIITKINH 1995
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1967 IWPTLTDEEWIKVEVQLKDLILADYGKKNNVNVASLTQSEIRDIILGMEISAPSQQRQQIAEIEKQTKE--QSQLTATQT 2044
Cdd:COG5178   1996 LWPGFSDSQWIKKEIQLRDLILDRYCSKHNINPSGLTQSEVRDIILGFRISAPSGARQETAETEKQNSEkaLSRPTNVST 2075
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 2045 RTVNKHGDEIITSTTSNYETQTFSSKTEWRVRAISAANLHLRTNHIYVSSDDIKETG-YTYILPKNVLKKFICISDLRAQ 2123
Cdd:COG5178   2076 KTINGWGREYVVLDGMIYEGEKFSSKEEWRSEAIRTGPLELRTKNIYVTADENEESIqQMYRLPLNLLEKFMRISDPHVQ 2155
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 2124 IAGYLYGVSPPDNPQVKEIRCIVMVPQWGTHQTVH----LPGQLPQHEYLkemEPLGWIHTQPNESPQLSPQDVTTHAKI 2199
Cdd:COG5178   2156 VAGLVYGKSGSDNPQIKEILSFGLVPQLGSLSGVQsssfVPHDLPGDEDL---EILGWIHTQDDELPYLEVAGVLTHRKK 2232
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 2200 MADnPSWDgekTIIITCSFTPGSCTLTAYKLTPSGYEWGRQNTDKGNN-PKGYLPSHYERVQMLLSDRFLGFFMVPAQSS 2278
Cdd:COG5178   2233 IVD-PEWD---AVTLTVSYLPGSISLRAYVVKKEGCNWGSKNMDINSDeAIGVEPVLGKDCQLLLSDRIQGVFYVPEEEV 2308
                         2330      2340      2350      2360      2370
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1370469953 2279 WNYNFMGVRHDPNMKYELQLANPKEFYHEVHRPSHFLNFALLQEGEVYSADREDLYA 2335
Cdd:COG5178   2309 WNYNFAGPFFDDRLEYTWKIGMPLGFYDGFHRPGHFSRFYELRAGGRLEEWQEDAFA 2365
 
Name Accession Description Interval E-value
PRP8 COG5178
U5 snRNP spliceosome subunit [RNA processing and modification];
1-2335 0e+00

U5 snRNP spliceosome subunit [RNA processing and modification];


Pssm-ID: 227505 [Multi-domain]  Cd Length: 2365  Bit Score: 3576.09  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953    1 MAGVFP-YRGPGNPVPG-------PLAPLPDYMSEEK--------------LQEKARKWQQLQ-AKRYAEKRKFGFVDAQ 57
Cdd:COG5178      1 MASLPPgNPPPPPPPPGfeppsqpPPPPPPGVNVKKRsrkqlsivgdilghSGNPIYSLRVSDkPVKLGNKAKTLHVLTL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953   58 KEDMPPEHVRKIIRDHGDMTNRKFRHDKRVYLGALKYMPHAVLKLLENMPMPWEQIRDVPVLYHITGAISFVNEIPWVIE 137
Cdd:COG5178     81 KAPIPPEHLRKIQSPCSDMPSVLTKVDKRSYLGALKYLPHAVLKLLENMPSPWEDVSEVKVLYHCHGAITFVNEVPRVIE 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  138 PVYISQWGSMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQLELDPEEDAPVLDWFYDHQPLRDSR 217
Cdd:COG5178    161 PQLFAQWGLCWSPMRREKRDRYSFKRKRFPPFDDLEPPLSKSQWVLGVEPLMPINIRLDRMDDEHVRDWVYTSRDLEDHP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  218 KyVNGSTYQRWQFTLPMMSTLYRLANQLLTDLVDDNYFYLFDLKAFFTSKALNMAIPGGPKFEPLVRDINLQDEDWNEFN 297
Cdd:COG5178    241 S-VNGAMYRRWKYMLPAMHNLLRLMPMLWESIRDVNYVYLFSGLSFFVAKALNVAIPGGPKFEPLYSRESAEFEDENEFN 319
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  298 DINKIIIRQPIRTEYKIAFPYLYNNLPHHVHLTWYHTPNVVFIKT-EDPDLPAFYFDPLINPISHR----HSVKSQEPLP 372
Cdd:COG5178    320 GIVRIIRRPPIDDEYPVAFPGLYNSRPRSVAVECYGSPECRDVFLdEDEDYPANFKDPLINPILGVqldnHPYDGKGSNE 399
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  373 D----DDEEFELPEFvEPFLKDTPLYTDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQ 448
Cdd:COG5178    400 EscvmERKLFSEPIF-YPYLYNESTEVRTTERAHLLLKNPFPFNKGKGRAERAQDVPLDKPWLLGHCLQERPVKVPVSYQ 478
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  449 KLLKYYVLNALKHRPPKAQKKRYLFRSFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVK 528
Cdd:COG5178    479 KLLKNYVRNMLHKTRPRPHTNTHLLKELKNTKYFQRTEIDWVEAGLQLCRQGHNMLSLLIHRKGLTYLHLDYNFNLKPTK 558
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  529 TLTTKERKKSRFGNAFHLCREVLRLTKLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDL 608
Cdd:COG5178    559 TLTTKERKKSRVGNSFHLMREMLKFIKLIVDIHVQFRLGNIDAYQLADGVHYILNHVGQLTGIYRYKYKLMKQIRACKDW 638
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  609 KHLIYYRFNTGpVGKGPGCGFWAAGWRVWLFFMRGITPLLERWLGNLLARQFEGRhSKGVAKTVTKQRVESHFDLELRAA 688
Cdd:COG5178    639 KHLIYYAFNEG-VGKGPGCGFWGPQWRVWLFFLRGHIPLLERYIGNLVTRQFEGR-SDYNPKPLTKQRSDSGYDLELRRQ 716
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  689 VMHDILDMMPEGIKQNKARTILQHLSEAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVD 768
Cdd:COG5178    717 VMADILSMLPEGIRQTKVRTILQHLSEAWRCWKANIPWHVPGEPAPILEVIRRYIKSKADLWTSSAHFNRERISRGAGVG 796
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  769 KTVCKKNLGRLTRLYLKAEQERQHNYLKDGPYITAEEAVAVYTTTVHWLESRRFSPIPFPPLSYKHDTKLLILALERLKE 848
Cdd:COG5178    797 KTKEKKNLGRLTRLWVKLEQERQVDSAKVGPKSTKEEAKRIGKITVLWLESRMFEPIPFPPLRYKEDTKILVLALEYLKS 876
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  849 AYSVKSRLNQSQREELGLIEQAYDNPHEALSRIKRHLLTQRAFKEVGIEFMDLYSHLVPVYDVEPLEKITDAYLDQYLWY 928
Cdd:COG5178    877 KYTGKIRLNESTREELALLEKAYDNPHDTLFRIKKSLLTQRSFKEVGITLMRHYDGAIPVYSVDPVEKIVDAYLDQYLWY 956
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  929 EADKRRLFPPWIKPADTEPPPLLVYKWCQGINNLQDVWETSEGECNVMLESRFEKMYEKIDLTLLNRLLRLIVDHNIADY 1008
Cdd:COG5178    957 EADRRNLFPEWIKPSDSEMPPLLVYKWCQGINNLKAAWDTSNGERLVLYETKLEGIMEKVDNTLLNRLLKLVLDPNLADY 1036
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1009 MTAKNNVVINYKDMNHTNSYGIIRGLQFASFIVQYYGLVMDLLVLGLHRASEMAGPPQMPNDFLSFQDIATEAAHPIRLF 1088
Cdd:COG5178   1037 IIAKNNVVVVYKDMSHTNHYGLIRGLQFSSFIYQFYGLVVDLLVLGLQRATEIAGPADAPNVFMDFKSRATETSHPIRLY 1116
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1089 CRYIDRIHIFFRFTADEARDLIQRYLTEHPDPNNENIVGYNN--KKCWPRDARMRLMKHDVNLGRAVFWDIKNRLPRSVT 1166
Cdd:COG5178   1117 TRYMDDIYIVFRFQRKEEDSLLEDYLRENPDPEEANNERYRNyfKGCWPDDCRMRLGPLDVNLGRAVFWEILRRCPHSLT 1196
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1167 TVQWENSFVSVYSKDNPNLLFNMCGFECRILPKCRTSyEEFTHKDGVWNLQNEVTKERTAQCFLRVDDESMQRFHNRVRQ 1246
Cdd:COG5178   1197 ATRWEPSFGSVYSKINPNLLFSMVGFEVRILPKIRKI-EERSLSSGVWRLGDGRTKQRTAHANLAVSEGGIEMFESRIRH 1275
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1247 ILMASGSTTFTKIVNKWNTALIGLMTYFREAVVNTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVVFYTPKELGGLG 1326
Cdd:COG5178   1276 ILMTSGSTTFTKVATKWNTQLIALVTYYREAICDTKGLLDKLVKAERLIQNRVKKGLNSKMPVRFPPAVFYAPKELGGLG 1355
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1327 MLSMGHVLIPQSDLRWSKQTDVGITHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKRQEAIAQNRRLTLE 1406
Cdd:COG5178   1356 MLSVGHILIPHSDLEWSKQTDTGITHFRSGMTTNGERLIPAAMRYISRWEYEFEDSQRVWAEYARKRQEAGQQNRRLTLE 1435
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1407 DLEDSWDRGIPRINTLFQKDRHTLAYDKGWRVRTDFKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILE 1486
Cdd:COG5178   1436 DLEMSWDRGIPRISTLFQRDRHTLAYDRGFRMRSEFKQYSLKPNNPFWWTDAKHDGKLWSLNRYRLDVIQALGGVEGILE 1515
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1487 HTLFKGTYFPTWEGLFWEKASGFEESMKWKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKI 1566
Cdd:COG5178   1516 HTLFKATGFRSWEGLFWEKASGFEESMKFKKLTNAQRMGLSQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGILMHGKI 1595
                         1610      1620      1630      1640      1650      1660      1670      1680
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1567 PTLKISLIQIFRAHLWQKIHESIVMDLCQVFDQELDALEIETVQKETIHPRKSYKMNSSCADILLFASYKWNVSRPSLLA 1646
Cdd:COG5178   1596 PTLKISLIQIFRNHLWQKIHESVVGDLCQVLDKELDVLQIETVQKETVHPRKSYKMNSSCADILLSGAYDWCVSSPSLLL 1675
                         1690      1700      1710      1720      1730      1740      1750      1760
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1647 DSKDVMDSTTTQKYWIDIQLRWGDYDSHDIERYARAKFLDYTTDNMSIYPSPTGVLIAIDLAYNLHSAYGNWFPGSKPLI 1726
Cdd:COG5178   1676 EERDGGSNVRTNKLWIDVQLRWGDYDSHDIHRYARAKFLDYTTDPQSMYPSPTGVVIGIDLCYNMWSAYGNWNEGLKPLI 1755
                         1770      1780      1790      1800      1810      1820      1830      1840
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1727 QQAMAKIMKANPALYVLRERIRKGLQLYSSEPTEPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGA 1806
Cdd:COG5178   1756 QSSMERIMKANPALYVLRERIRKGLQLYTSEPQEQYLSSSNYAELFSNSIDLFVDDTNVYRVTLHKTFEGNLTTKPINGA 1835
                         1850      1860      1870      1880      1890      1900      1910      1920
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1807 IFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEVAALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKG 1886
Cdd:COG5178   1836 IFVLNPATGNLFLKVIHTSVWAGQKRLIQLAKWKTAEEVFALGRSLPVEEQPKQIIVTRKSMLDPLEVHILDFPNISIRT 1915
                         1930      1940      1950      1960      1970      1980      1990      2000
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1887 SELQLPFQACLKVEKFGDLILKATEPQMVLFNLYDDWLKTISSYTAFSRLILILRALHVNNDRAKVILKPDKTTITEPHH 1966
Cdd:COG5178   1916 CELALPFSAVMGIDKIRDLILRATEPQMVLFNLYDDWLQETSSYTAFSRLLLVLRALDVNEERVKEILRPDKSIITKINH 1995
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1967 IWPTLTDEEWIKVEVQLKDLILADYGKKNNVNVASLTQSEIRDIILGMEISAPSQQRQQIAEIEKQTKE--QSQLTATQT 2044
Cdd:COG5178   1996 LWPGFSDSQWIKKEIQLRDLILDRYCSKHNINPSGLTQSEVRDIILGFRISAPSGARQETAETEKQNSEkaLSRPTNVST 2075
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 2045 RTVNKHGDEIITSTTSNYETQTFSSKTEWRVRAISAANLHLRTNHIYVSSDDIKETG-YTYILPKNVLKKFICISDLRAQ 2123
Cdd:COG5178   2076 KTINGWGREYVVLDGMIYEGEKFSSKEEWRSEAIRTGPLELRTKNIYVTADENEESIqQMYRLPLNLLEKFMRISDPHVQ 2155
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 2124 IAGYLYGVSPPDNPQVKEIRCIVMVPQWGTHQTVH----LPGQLPQHEYLkemEPLGWIHTQPNESPQLSPQDVTTHAKI 2199
Cdd:COG5178   2156 VAGLVYGKSGSDNPQIKEILSFGLVPQLGSLSGVQsssfVPHDLPGDEDL---EILGWIHTQDDELPYLEVAGVLTHRKK 2232
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 2200 MADnPSWDgekTIIITCSFTPGSCTLTAYKLTPSGYEWGRQNTDKGNN-PKGYLPSHYERVQMLLSDRFLGFFMVPAQSS 2278
Cdd:COG5178   2233 IVD-PEWD---AVTLTVSYLPGSISLRAYVVKKEGCNWGSKNMDINSDeAIGVEPVLGKDCQLLLSDRIQGVFYVPEEEV 2308
                         2330      2340      2350      2360      2370
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1370469953 2279 WNYNFMGVRHDPNMKYELQLANPKEFYHEVHRPSHFLNFALLQEGEVYSADREDLYA 2335
Cdd:COG5178   2309 WNYNFAGPFFDDRLEYTWKIGMPLGFYDGFHRPGHFSRFYELRAGGRLEEWQEDAFA 2365
PROCN pfam08083
PROCN (NUC071) domain; The PROCN domain is the central domain in pre-mRNA splicing factors of ...
395-796 0e+00

PROCN (NUC071) domain; The PROCN domain is the central domain in pre-mRNA splicing factors of PRO8 family.


Pssm-ID: 400431  Cd Length: 402  Bit Score: 873.25  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  395 TDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQKLLKYYVLNALKHRPPKAQKKRYLFR 474
Cdd:pfam08083    1 NENTKDGISLYWAPYPFNRRSGRTKRAQDVPLIKHWYREHPPSNYPVKVRVSYQKLLKNYVLNELHHRKQKKHKKKNLLK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  475 SFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAFHLCREVLRLT 554
Cdd:pfam08083   81 SLKNTKFFQQTTIDWVEAGLQVCRQGHNMLNLLIHRKGLTYLHLDYNFNLKPTKTLTTKERKKSRFGNAFHLIRELLRLT 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  555 KLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDLKHLIYYRFNTGPVGKGPGCGFWAAGW 634
Cdd:pfam08083  161 KLLVDAHVQYRLGNIDAYQLADGLQYIFNHVGQLTGMYRYKYKVMRQIRACKDLKHVIYSRFNTGEVGKGPGCGFWAPSW 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  635 RVWLFFMRGITPLLERWLGNLLARQFEGRHSKGVAKTVTKQRVESHFDLELRAAVMHDILDMMPEGIKQNKARTILQHLS 714
Cdd:pfam08083  241 RVWIFFLRGIIPLLERWLGNLLARQFEGRKSKDVAKTITKQRVESYYDLELRASVMHDILDMMPEGVKQNKARTILQHLS 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  715 EAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVDKTVCKKNLGRLTRLYLKAEQERQHNY 794
Cdd:pfam08083  321 EAWRCWKANIPWKVPGLPEPIENIILRYVKAKADWWTSVAHYNRERIKRGATVDKTVAKKNLGRLTRLWLKAEQERQANY 400

                   ..
gi 1370469953  795 LK 796
Cdd:pfam08083  401 LK 402
RNase_H_like_Prp8_IV cd13838
Ribonuclease-like Prp8 domain IV core; This family contains Prp8 domain IV, which adopts a ...
1766-2016 0e+00

Ribonuclease-like Prp8 domain IV core; This family contains Prp8 domain IV, which adopts a RNase H like fold within its core structure but with little sequence similarity. Prp8, a spliceosome protein, interacts directly with the splice sites and branch regions of precursor-mRNAs and spliceosomal RNAs associated with catalysis of the two steps of splicing. Catalysis of RNA cleavage by RNase H-like proteins involves a two-metal mechanism in which adjacently-bound divalent magnesium ions promote hydrolysis by activation of a water nucleophile and stabilization of the transition-state. However, the Prp8 domain IV contains only one of the canonical metal-binding sites and the coordinating side chains are spatially conserved with respect to Mg2+-coordinating residues within the RNase H fold.


Pssm-ID: 260013  Cd Length: 251  Bit Score: 575.45  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1766 QNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEV 1845
Cdd:cd13838      1 QNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1846 AALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKFGDLILKATEPQMVLFNLYDDWLK 1925
Cdd:cd13838     81 AALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKIEKFGDLILKATEPQMVLFNLYDDWLK 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1926 TISSYTAFSRLILILRALHVNNDRAKVILKPDKTTITEPHHIWPTLTDEEWIKVEVQLKDLILADYGKKNNVNVASLTQS 2005
Cdd:cd13838    161 TISSYTAFSRLILILRALHVNNEKAKIILKPDKTVITEPHHIWPTLSDEEWIKVEVQLKDLILADYGKKNNVNVASLTQS 240
                          250
                   ....*....|.
gi 1370469953 2006 EIRDIILGMEI 2016
Cdd:cd13838    241 EIRDIILGMEI 251
JAB_MPN smart00232
JAB/MPN domain; Domain in Jun kinase activation domain binding protein and proteasomal ...
2102-2233 3.57e-28

JAB/MPN domain; Domain in Jun kinase activation domain binding protein and proteasomal subunits. Domain at Mpr1p and Pad1p N-termini. Domain of unknown function.


Pssm-ID: 214573  Cd Length: 135  Bit Score: 111.70  E-value: 3.57e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  2102 YTYILPKNVLKKFIciSDLRAQIAGYLYGVSPPDNPQVKEIRCIVMVPQWGTHQTV-HLPGQLPQHEYLK---EMEPLGW 2177
Cdd:smart00232    4 VHPLVPLNILKHAI--RDGPEEVCGVLLGKSNKDRPEVKEVFAVPNEPQDDSVQEYdEDYSHLMDEELKKvnkDLEIVGW 81
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 1370469953  2178 IHTQPNESPQLSPQDVTTHAKIMADNPSWDGeKTIIITCSFTpGSCTLTAYKLTPS 2233
Cdd:smart00232   82 YHSHPDESPFPSEVDVATHESYQAPWPISVV-LIVDPIKSFQ-GRLSLRAFRLTPE 135
 
Name Accession Description Interval E-value
PRP8 COG5178
U5 snRNP spliceosome subunit [RNA processing and modification];
1-2335 0e+00

U5 snRNP spliceosome subunit [RNA processing and modification];


Pssm-ID: 227505 [Multi-domain]  Cd Length: 2365  Bit Score: 3576.09  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953    1 MAGVFP-YRGPGNPVPG-------PLAPLPDYMSEEK--------------LQEKARKWQQLQ-AKRYAEKRKFGFVDAQ 57
Cdd:COG5178      1 MASLPPgNPPPPPPPPGfeppsqpPPPPPPGVNVKKRsrkqlsivgdilghSGNPIYSLRVSDkPVKLGNKAKTLHVLTL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953   58 KEDMPPEHVRKIIRDHGDMTNRKFRHDKRVYLGALKYMPHAVLKLLENMPMPWEQIRDVPVLYHITGAISFVNEIPWVIE 137
Cdd:COG5178     81 KAPIPPEHLRKIQSPCSDMPSVLTKVDKRSYLGALKYLPHAVLKLLENMPSPWEDVSEVKVLYHCHGAITFVNEVPRVIE 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  138 PVYISQWGSMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQLELDPEEDAPVLDWFYDHQPLRDSR 217
Cdd:COG5178    161 PQLFAQWGLCWSPMRREKRDRYSFKRKRFPPFDDLEPPLSKSQWVLGVEPLMPINIRLDRMDDEHVRDWVYTSRDLEDHP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  218 KyVNGSTYQRWQFTLPMMSTLYRLANQLLTDLVDDNYFYLFDLKAFFTSKALNMAIPGGPKFEPLVRDINLQDEDWNEFN 297
Cdd:COG5178    241 S-VNGAMYRRWKYMLPAMHNLLRLMPMLWESIRDVNYVYLFSGLSFFVAKALNVAIPGGPKFEPLYSRESAEFEDENEFN 319
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  298 DINKIIIRQPIRTEYKIAFPYLYNNLPHHVHLTWYHTPNVVFIKT-EDPDLPAFYFDPLINPISHR----HSVKSQEPLP 372
Cdd:COG5178    320 GIVRIIRRPPIDDEYPVAFPGLYNSRPRSVAVECYGSPECRDVFLdEDEDYPANFKDPLINPILGVqldnHPYDGKGSNE 399
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  373 D----DDEEFELPEFvEPFLKDTPLYTDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQ 448
Cdd:COG5178    400 EscvmERKLFSEPIF-YPYLYNESTEVRTTERAHLLLKNPFPFNKGKGRAERAQDVPLDKPWLLGHCLQERPVKVPVSYQ 478
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  449 KLLKYYVLNALKHRPPKAQKKRYLFRSFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVK 528
Cdd:COG5178    479 KLLKNYVRNMLHKTRPRPHTNTHLLKELKNTKYFQRTEIDWVEAGLQLCRQGHNMLSLLIHRKGLTYLHLDYNFNLKPTK 558
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  529 TLTTKERKKSRFGNAFHLCREVLRLTKLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDL 608
Cdd:COG5178    559 TLTTKERKKSRVGNSFHLMREMLKFIKLIVDIHVQFRLGNIDAYQLADGVHYILNHVGQLTGIYRYKYKLMKQIRACKDW 638
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  609 KHLIYYRFNTGpVGKGPGCGFWAAGWRVWLFFMRGITPLLERWLGNLLARQFEGRhSKGVAKTVTKQRVESHFDLELRAA 688
Cdd:COG5178    639 KHLIYYAFNEG-VGKGPGCGFWGPQWRVWLFFLRGHIPLLERYIGNLVTRQFEGR-SDYNPKPLTKQRSDSGYDLELRRQ 716
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  689 VMHDILDMMPEGIKQNKARTILQHLSEAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVD 768
Cdd:COG5178    717 VMADILSMLPEGIRQTKVRTILQHLSEAWRCWKANIPWHVPGEPAPILEVIRRYIKSKADLWTSSAHFNRERISRGAGVG 796
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  769 KTVCKKNLGRLTRLYLKAEQERQHNYLKDGPYITAEEAVAVYTTTVHWLESRRFSPIPFPPLSYKHDTKLLILALERLKE 848
Cdd:COG5178    797 KTKEKKNLGRLTRLWVKLEQERQVDSAKVGPKSTKEEAKRIGKITVLWLESRMFEPIPFPPLRYKEDTKILVLALEYLKS 876
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  849 AYSVKSRLNQSQREELGLIEQAYDNPHEALSRIKRHLLTQRAFKEVGIEFMDLYSHLVPVYDVEPLEKITDAYLDQYLWY 928
Cdd:COG5178    877 KYTGKIRLNESTREELALLEKAYDNPHDTLFRIKKSLLTQRSFKEVGITLMRHYDGAIPVYSVDPVEKIVDAYLDQYLWY 956
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  929 EADKRRLFPPWIKPADTEPPPLLVYKWCQGINNLQDVWETSEGECNVMLESRFEKMYEKIDLTLLNRLLRLIVDHNIADY 1008
Cdd:COG5178    957 EADRRNLFPEWIKPSDSEMPPLLVYKWCQGINNLKAAWDTSNGERLVLYETKLEGIMEKVDNTLLNRLLKLVLDPNLADY 1036
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1009 MTAKNNVVINYKDMNHTNSYGIIRGLQFASFIVQYYGLVMDLLVLGLHRASEMAGPPQMPNDFLSFQDIATEAAHPIRLF 1088
Cdd:COG5178   1037 IIAKNNVVVVYKDMSHTNHYGLIRGLQFSSFIYQFYGLVVDLLVLGLQRATEIAGPADAPNVFMDFKSRATETSHPIRLY 1116
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1089 CRYIDRIHIFFRFTADEARDLIQRYLTEHPDPNNENIVGYNN--KKCWPRDARMRLMKHDVNLGRAVFWDIKNRLPRSVT 1166
Cdd:COG5178   1117 TRYMDDIYIVFRFQRKEEDSLLEDYLRENPDPEEANNERYRNyfKGCWPDDCRMRLGPLDVNLGRAVFWEILRRCPHSLT 1196
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1167 TVQWENSFVSVYSKDNPNLLFNMCGFECRILPKCRTSyEEFTHKDGVWNLQNEVTKERTAQCFLRVDDESMQRFHNRVRQ 1246
Cdd:COG5178   1197 ATRWEPSFGSVYSKINPNLLFSMVGFEVRILPKIRKI-EERSLSSGVWRLGDGRTKQRTAHANLAVSEGGIEMFESRIRH 1275
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1247 ILMASGSTTFTKIVNKWNTALIGLMTYFREAVVNTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVVFYTPKELGGLG 1326
Cdd:COG5178   1276 ILMTSGSTTFTKVATKWNTQLIALVTYYREAICDTKGLLDKLVKAERLIQNRVKKGLNSKMPVRFPPAVFYAPKELGGLG 1355
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1327 MLSMGHVLIPQSDLRWSKQTDVGITHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKRQEAIAQNRRLTLE 1406
Cdd:COG5178   1356 MLSVGHILIPHSDLEWSKQTDTGITHFRSGMTTNGERLIPAAMRYISRWEYEFEDSQRVWAEYARKRQEAGQQNRRLTLE 1435
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1407 DLEDSWDRGIPRINTLFQKDRHTLAYDKGWRVRTDFKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILE 1486
Cdd:COG5178   1436 DLEMSWDRGIPRISTLFQRDRHTLAYDRGFRMRSEFKQYSLKPNNPFWWTDAKHDGKLWSLNRYRLDVIQALGGVEGILE 1515
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1487 HTLFKGTYFPTWEGLFWEKASGFEESMKWKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKI 1566
Cdd:COG5178   1516 HTLFKATGFRSWEGLFWEKASGFEESMKFKKLTNAQRMGLSQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGILMHGKI 1595
                         1610      1620      1630      1640      1650      1660      1670      1680
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1567 PTLKISLIQIFRAHLWQKIHESIVMDLCQVFDQELDALEIETVQKETIHPRKSYKMNSSCADILLFASYKWNVSRPSLLA 1646
Cdd:COG5178   1596 PTLKISLIQIFRNHLWQKIHESVVGDLCQVLDKELDVLQIETVQKETVHPRKSYKMNSSCADILLSGAYDWCVSSPSLLL 1675
                         1690      1700      1710      1720      1730      1740      1750      1760
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1647 DSKDVMDSTTTQKYWIDIQLRWGDYDSHDIERYARAKFLDYTTDNMSIYPSPTGVLIAIDLAYNLHSAYGNWFPGSKPLI 1726
Cdd:COG5178   1676 EERDGGSNVRTNKLWIDVQLRWGDYDSHDIHRYARAKFLDYTTDPQSMYPSPTGVVIGIDLCYNMWSAYGNWNEGLKPLI 1755
                         1770      1780      1790      1800      1810      1820      1830      1840
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1727 QQAMAKIMKANPALYVLRERIRKGLQLYSSEPTEPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGA 1806
Cdd:COG5178   1756 QSSMERIMKANPALYVLRERIRKGLQLYTSEPQEQYLSSSNYAELFSNSIDLFVDDTNVYRVTLHKTFEGNLTTKPINGA 1835
                         1850      1860      1870      1880      1890      1900      1910      1920
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1807 IFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEVAALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKG 1886
Cdd:COG5178   1836 IFVLNPATGNLFLKVIHTSVWAGQKRLIQLAKWKTAEEVFALGRSLPVEEQPKQIIVTRKSMLDPLEVHILDFPNISIRT 1915
                         1930      1940      1950      1960      1970      1980      1990      2000
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1887 SELQLPFQACLKVEKFGDLILKATEPQMVLFNLYDDWLKTISSYTAFSRLILILRALHVNNDRAKVILKPDKTTITEPHH 1966
Cdd:COG5178   1916 CELALPFSAVMGIDKIRDLILRATEPQMVLFNLYDDWLQETSSYTAFSRLLLVLRALDVNEERVKEILRPDKSIITKINH 1995
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1967 IWPTLTDEEWIKVEVQLKDLILADYGKKNNVNVASLTQSEIRDIILGMEISAPSQQRQQIAEIEKQTKE--QSQLTATQT 2044
Cdd:COG5178   1996 LWPGFSDSQWIKKEIQLRDLILDRYCSKHNINPSGLTQSEVRDIILGFRISAPSGARQETAETEKQNSEkaLSRPTNVST 2075
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 2045 RTVNKHGDEIITSTTSNYETQTFSSKTEWRVRAISAANLHLRTNHIYVSSDDIKETG-YTYILPKNVLKKFICISDLRAQ 2123
Cdd:COG5178   2076 KTINGWGREYVVLDGMIYEGEKFSSKEEWRSEAIRTGPLELRTKNIYVTADENEESIqQMYRLPLNLLEKFMRISDPHVQ 2155
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 2124 IAGYLYGVSPPDNPQVKEIRCIVMVPQWGTHQTVH----LPGQLPQHEYLkemEPLGWIHTQPNESPQLSPQDVTTHAKI 2199
Cdd:COG5178   2156 VAGLVYGKSGSDNPQIKEILSFGLVPQLGSLSGVQsssfVPHDLPGDEDL---EILGWIHTQDDELPYLEVAGVLTHRKK 2232
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 2200 MADnPSWDgekTIIITCSFTPGSCTLTAYKLTPSGYEWGRQNTDKGNN-PKGYLPSHYERVQMLLSDRFLGFFMVPAQSS 2278
Cdd:COG5178   2233 IVD-PEWD---AVTLTVSYLPGSISLRAYVVKKEGCNWGSKNMDINSDeAIGVEPVLGKDCQLLLSDRIQGVFYVPEEEV 2308
                         2330      2340      2350      2360      2370
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1370469953 2279 WNYNFMGVRHDPNMKYELQLANPKEFYHEVHRPSHFLNFALLQEGEVYSADREDLYA 2335
Cdd:COG5178   2309 WNYNFAGPFFDDRLEYTWKIGMPLGFYDGFHRPGHFSRFYELRAGGRLEEWQEDAFA 2365
PROCN pfam08083
PROCN (NUC071) domain; The PROCN domain is the central domain in pre-mRNA splicing factors of ...
395-796 0e+00

PROCN (NUC071) domain; The PROCN domain is the central domain in pre-mRNA splicing factors of PRO8 family.


Pssm-ID: 400431  Cd Length: 402  Bit Score: 873.25  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  395 TDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQKLLKYYVLNALKHRPPKAQKKRYLFR 474
Cdd:pfam08083    1 NENTKDGISLYWAPYPFNRRSGRTKRAQDVPLIKHWYREHPPSNYPVKVRVSYQKLLKNYVLNELHHRKQKKHKKKNLLK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  475 SFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAFHLCREVLRLT 554
Cdd:pfam08083   81 SLKNTKFFQQTTIDWVEAGLQVCRQGHNMLNLLIHRKGLTYLHLDYNFNLKPTKTLTTKERKKSRFGNAFHLIRELLRLT 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  555 KLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDLKHLIYYRFNTGPVGKGPGCGFWAAGW 634
Cdd:pfam08083  161 KLLVDAHVQYRLGNIDAYQLADGLQYIFNHVGQLTGMYRYKYKVMRQIRACKDLKHVIYSRFNTGEVGKGPGCGFWAPSW 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  635 RVWLFFMRGITPLLERWLGNLLARQFEGRHSKGVAKTVTKQRVESHFDLELRAAVMHDILDMMPEGIKQNKARTILQHLS 714
Cdd:pfam08083  241 RVWIFFLRGIIPLLERWLGNLLARQFEGRKSKDVAKTITKQRVESYYDLELRASVMHDILDMMPEGVKQNKARTILQHLS 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  715 EAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVDKTVCKKNLGRLTRLYLKAEQERQHNY 794
Cdd:pfam08083  321 EAWRCWKANIPWKVPGLPEPIENIILRYVKAKADWWTSVAHYNRERIKRGATVDKTVAKKNLGRLTRLWLKAEQERQANY 400

                   ..
gi 1370469953  795 LK 796
Cdd:pfam08083  401 LK 402
RNase_H_like_Prp8_IV cd13838
Ribonuclease-like Prp8 domain IV core; This family contains Prp8 domain IV, which adopts a ...
1766-2016 0e+00

Ribonuclease-like Prp8 domain IV core; This family contains Prp8 domain IV, which adopts a RNase H like fold within its core structure but with little sequence similarity. Prp8, a spliceosome protein, interacts directly with the splice sites and branch regions of precursor-mRNAs and spliceosomal RNAs associated with catalysis of the two steps of splicing. Catalysis of RNA cleavage by RNase H-like proteins involves a two-metal mechanism in which adjacently-bound divalent magnesium ions promote hydrolysis by activation of a water nucleophile and stabilization of the transition-state. However, the Prp8 domain IV contains only one of the canonical metal-binding sites and the coordinating side chains are spatially conserved with respect to Mg2+-coordinating residues within the RNase H fold.


Pssm-ID: 260013  Cd Length: 251  Bit Score: 575.45  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1766 QNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEV 1845
Cdd:cd13838      1 QNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1846 AALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKFGDLILKATEPQMVLFNLYDDWLK 1925
Cdd:cd13838     81 AALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKIEKFGDLILKATEPQMVLFNLYDDWLK 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1926 TISSYTAFSRLILILRALHVNNDRAKVILKPDKTTITEPHHIWPTLTDEEWIKVEVQLKDLILADYGKKNNVNVASLTQS 2005
Cdd:cd13838    161 TISSYTAFSRLILILRALHVNNEKAKIILKPDKTVITEPHHIWPTLSDEEWIKVEVQLKDLILADYGKKNNVNVASLTQS 240
                          250
                   ....*....|.
gi 1370469953 2006 EIRDIILGMEI 2016
Cdd:cd13838    241 EIRDIILGMEI 251
MPN_PRP8 cd08056
Mpr1p, Pad1p N-terminal (MPN) domains without isopeptidase activity found in splicing factor ...
2067-2317 2.31e-179

Mpr1p, Pad1p N-terminal (MPN) domains without isopeptidase activity found in splicing factor Prp8; Members of this family are found in pre-mRNA-processing factor 8 (Prp8) which is a critical splicing factor, interacting with several other spliceosomal proteins, snRNAs, and the pre-mRNA, thus organizing and stabilizing the spliceosome catalytic core. Prp8 is one of the largest and most highly conserved of nuclear proteins, occupying a central position in the catalytic core of the spliceosome. Its C-terminal domain exhibits a JAB1/MPN-like core similar to deubiquitinating enzymes, but does not show catalytic isopeptidase activity, possibly because the putative isopeptidase center is covered by insertions and terminal appendices that are grafted onto this core, thus impairing the metal binding site. It is proposed that this domain is a protein interaction domain instead of a Zn(2+)-dependent metalloenzyme as proposed for some MPN proteins. The DEAD-box protein Brr2 and the GTPase Snu114 bind to the Prp8 C-terminus, a region where mutations in human Prp8 (hPrp8) cause a severe form of the genetic disorder retinitis pigmentosa, RP13, which leads to progressive photoreceptor degeneration in the retina and eventual blindness. At the N-terminus of Prp8, there are several domains, including a highly variable nuclear localization signal (NLS) motif rich in prolines, a conserved RNA recognition motif (RRM), and U5 and U6 snRNA binding sites.


Pssm-ID: 163687 [Multi-domain]  Cd Length: 252  Bit Score: 546.84  E-value: 2.31e-179
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 2067 FSSKTEWRVRAISAANLHLRTNHIYVSSDDIKETGYTYILPKNVLKKFICISDLRAQIAGYLYGVSPPDNPQVKEIRCIV 2146
Cdd:cd08056      1 FSSKTDWRVRAIAATNLHLRTKNIYVSSDDIKETGYTYILPKNLLKKFISISDLRTQIAGYLYGKSPPDNPQVKEIRCIV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 2147 MVPQWGTHQTVHLPGQLPQHEYLKEMEPLGWIHTQPNESPQLSPQDVTTHAKIMADNPSWDGEKTIIITCSFTPGSCTLT 2226
Cdd:cd08056     81 LVPQLGTHQTVTLPQQLPQHEYLEDLEPLGWIHTQPNELPQLSPQDVTTHAKILADNPSWDGEKTVILTCSFTPGSCSLT 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 2227 AYKLTPSGYEWGRQNTDKGNN-PKGYLPSHYERVQMLLSDRFLGFFMVPAQSSWNYNFMGVRHDPNMKYELQLANPKEFY 2305
Cdd:cd08056    161 AYKLTPEGYEWGKQNKDLGNNtPKGYSPSFYEKVQLLLSDRFLGFFLVPEDGVWNYNFMGAKHSPNMKYDLKLDIPKEFY 240
                          250
                   ....*....|..
gi 1370469953 2306 HEVHRPSHFLNF 2317
Cdd:cd08056    241 HELHRPTHFLQF 252
PRP8_domainIV pfam12134
PRP8 domain IV core; This domain is found in eukaryotes, and is about 20 amino acids in length. ...
1760-1989 1.25e-164

PRP8 domain IV core; This domain is found in eukaryotes, and is about 20 amino acids in length. It is found associated with pfam10597, pfam10596, pfam10598, pfam08083, pfam08082, pfam01398, pfam08084. There is a conserved LILR sequence motif. The domain is a selenomethionine domain in a subunit of the spliceosome. The function of PRP8 domain IV is believed to be interaction with the splicosomal core.


Pssm-ID: 432353  Cd Length: 230  Bit Score: 504.97  E-value: 1.25e-164
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1760 EPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKW 1839
Cdd:pfam12134    1 EPFLNSQNYAELFSNQTQWFVDDTNVYRVTVHKTFEGNLTTKPVNGAVFILNPKTGQLFLKIIHTSVWAGQKRLSQLAKW 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1840 KTAEEVAALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKFGDLILKATEPQMVLFNL 1919
Cdd:pfam12134   81 KTAEEVAALVRSLPPEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIRGSELHLPFPAALKIEKLGDLVLKATEPQMVLFNI 160
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1920 YDDWLKTISSYTAFSRLILILRALHVNNDRAKVILKPDKTTITEPHHIWPTLTDEEWIKVEVQLKDLILA 1989
Cdd:pfam12134  161 YDDWLKSISPYTAFSRLILILRALHINPEKTKMILRPDPTVVTKPHHLWPTLSDQQWIEVEIQLRDLILS 230
PRO8NT pfam08082
PRO8NT (NUC069), PrP8 N-terminal domain; The PRO8NT domain is found at the N-terminus of ...
58-209 8.61e-116

PRO8NT (NUC069), PrP8 N-terminal domain; The PRO8NT domain is found at the N-terminus of pre-mRNA splicing factors of PRO8 family. The NLS or nuclear localization signal for these spliceosome proteins begins at the start and runs for 60 residues. N-terminal to this domain is a highly variable proline-rich region.


Pssm-ID: 462361 [Multi-domain]  Cd Length: 152  Bit Score: 362.77  E-value: 8.61e-116
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953   58 KEDMPPEHVRKIIRDHGDMTNRKFRHDKRVYLGALKYMPHAVLKLLENMPMPWEQIRDVPVLYHITGAISFVNEIPWVIE 137
Cdd:pfam08082    1 KEDMPPEHLRKIIKDHGDMSSKKFRSDKRSYLGALKYMPHAVLKLLENMPMPWEEVREVKVLYHVTGAITFVNEIPRVIE 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1370469953  138 PVYISQWGSMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQLELDPEEDAPVLDWFYD 209
Cdd:pfam08082   81 PVYIAQWSTMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQMELDEEEDAAVIDWFYD 152
U6-snRNA_bdg pfam10596
U6-snRNA interacting domain of PrP8; This domain incorporates the interacting site for the ...
1442-1600 2.99e-108

U6-snRNA interacting domain of PrP8; This domain incorporates the interacting site for the U6-snRNA as part of the U4/U6.U5 tri-snRNPs complex of the spliceosome, and is the prime candidate for the role of cofactor for the spliceosome's RNA core. The essential spliceosomal protein Prp8 interacts with U5 and U6 snRNAs and with specific pre-mRNA sequences that participate in catalysis. This close association with crucial RNA sequences, together with extensive genetic evidence, suggests that Prp8 could directly affect the function of the catalytic core, perhaps acting as a splicing cofactor.


Pssm-ID: 431383  Cd Length: 159  Bit Score: 341.77  E-value: 2.99e-108
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1442 FKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEESMKWKKLTNA 1521
Cdd:pfam10596    1 FKKYQLLKYNPFWWTHARHDGKLWNLERYRTDIIQALGGVEGILEHTLFKATGFPSWEGLFWDKASGFEESLKYKKLTNA 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1370469953 1522 QRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKIPTLKISLIQIFRAHLWQKIHESIVMDLCQVFDQE 1600
Cdd:pfam10596   81 QRQGLSQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGVFMHGKLPTLKISLIQIFRGHLWQKIHESVVMDLCQVLDQE 159
U5_2-snRNA_bdg pfam10597
U5-snRNA binding site 2 of PrP8; The essential spliceosomal protein Prp8 interacts with U5 and ...
1210-1343 1.73e-89

U5-snRNA binding site 2 of PrP8; The essential spliceosomal protein Prp8 interacts with U5 and U6 snRNAs and with specific pre-mRNA sequences that participate in catalysis. This close association with crucial RNA sequences, together with extensive genetic evidence, suggests that Prp8 could directly affect the function of the catalytic core, perhaps acting as a splicing cofactor.


Pssm-ID: 402297  Cd Length: 134  Bit Score: 286.97  E-value: 1.73e-89
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 1210 KDGVWNLQNEVTKERTAQCFLRVDDESMQRFHNRVRQILMASGSTTFTKIVNKWNTALIGLMTYFREAVVNTQELLDLLV 1289
Cdd:pfam10597    1 NEGVWDLINESTKERTAKAFLQVSEESINNFRNRIRQILMSSGSTTFTKIANKWNTALIALFTYFREAIVSTEELLDILV 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1370469953 1290 KCENKIQTRIKIGLNSKMPSRFPPVVFYTPKELGGLGMLSMGHVLIPQSDLRWS 1343
Cdd:pfam10597   81 KCETRVQNRVKLGLNSKMPSRFPPAVFYTPKELGGLGMLSAGHVLIPASDLRWS 134
PROCT pfam08084
PROCT (NUC072) domain; The PROCT domain is the C-terminal domain in pre-mRNA splicing factors ...
2212-2321 1.21e-67

PROCT (NUC072) domain; The PROCT domain is the C-terminal domain in pre-mRNA splicing factors of PRO8 family.


Pssm-ID: 462362  Cd Length: 111  Bit Score: 223.19  E-value: 1.21e-67
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 2212 IIITCSFTPGSCTLTAYKLTPSGYEWGRQNTDK-GNNPKGYLPSHYERVQMLLSDRFLGFFMVPAQSSWNYNFMGVRHDP 2290
Cdd:pfam08084    1 ITLTVSFTPGSVSLSAYTLTPEGYEWGRQNKDLiSDNPQGFSPSFSEKVQLLLSDRILGFFLVPEDGVWNYSFMGASFNP 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1370469953 2291 NMKYELQLANPKEFYHEVHRPSHFLNFALLQ 2321
Cdd:pfam08084   81 NMKYDLKLDIPLEFYDELHRPTHFLNFNELE 111
RRM_4 pfam10598
RNA recognition motif of the spliceosomal PrP8; The large RNA-protein complex of the ...
986-1077 5.14e-53

RNA recognition motif of the spliceosomal PrP8; The large RNA-protein complex of the spliceosome catalyzes pre-mRNA splicing. One of the most conserved core proteins is PrP8 which occupies a central position in the catalytic core of the spliceosome, and has been implicated in several crucial molecular rearrangements that occur there, and has recently come under the spotlight for its role in the inherited human disease, Retinitis Pigmentosa. The RNA-recognition motif of PrP8 is highly conserved and provides a possible RNA binding centre for the 5-prime SS, BP, or 3-prime SS of pre-mRNA which are known to contact with Prp8. The most conserved regions of an RRM are defined as the RNP1 and RNP2 sequences. Recognition of RNA targets can also be modulated by a number of other factors, most notably the two loops beta1-alpha1, beta2-beta3 and the amino acid residues C-terminal to the RNP2 domain.


Pssm-ID: 463163  Cd Length: 92  Bit Score: 180.90  E-value: 5.14e-53
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  986 EKIDLTLLNRLLRLIVDHNIADYMTAKNNVVINYKDMNHTNSYGIIRGLQFASFIVQYYGLVMDLLVLGLHRASEMAGPP 1065
Cdd:pfam10598    1 EKIDLTLLNRLLRLIMDHNLADYITAKNNVVLTFKDMSHVNSYGLIRGLQFSSFVYQYYGLVLDLLILGLQRASEIAGPP 80
                           90
                   ....*....|..
gi 1370469953 1066 QMPNDFLSFQDI 1077
Cdd:pfam10598   81 QMPNEFLQFKDI 92
JAB_MPN smart00232
JAB/MPN domain; Domain in Jun kinase activation domain binding protein and proteasomal ...
2102-2233 3.57e-28

JAB/MPN domain; Domain in Jun kinase activation domain binding protein and proteasomal subunits. Domain at Mpr1p and Pad1p N-termini. Domain of unknown function.


Pssm-ID: 214573  Cd Length: 135  Bit Score: 111.70  E-value: 3.57e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953  2102 YTYILPKNVLKKFIciSDLRAQIAGYLYGVSPPDNPQVKEIRCIVMVPQWGTHQTV-HLPGQLPQHEYLK---EMEPLGW 2177
Cdd:smart00232    4 VHPLVPLNILKHAI--RDGPEEVCGVLLGKSNKDRPEVKEVFAVPNEPQDDSVQEYdEDYSHLMDEELKKvnkDLEIVGW 81
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 1370469953  2178 IHTQPNESPQLSPQDVTTHAKIMADNPSWDGeKTIIITCSFTpGSCTLTAYKLTPS 2233
Cdd:smart00232   82 YHSHPDESPFPSEVDVATHESYQAPWPISVV-LIVDPIKSFQ-GRLSLRAFRLTPE 135
JAB pfam01398
JAB1/Mov34/MPN/PAD-1 ubiquitin protease; Members of this family are found in proteasome ...
2099-2205 6.12e-23

JAB1/Mov34/MPN/PAD-1 ubiquitin protease; Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain, JABP1 domain or JAMM domain. These are metalloenzymes that function as the ubiquitin isopeptidase/ deubiquitinase in the ubiquitin-based signalling and protein turnover pathways in eukaryotes. Versions of the domain in prokaryotic cognates of the ubiquitin-modification pathway are shown to have a similar role, and the archael protein from Haloferax volcanii is found to cleave ubiquitin-like small archaeal modifier proteins (SAMP1/2) from protein conjugates.


Pssm-ID: 396120  Cd Length: 117  Bit Score: 95.87  E-value: 6.12e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 2099 ETGYTYILPKNVLKKFICISDLRAQI----AGYLYGVSPPDNpqVKEIRCIVMVPQWGTHQTVHLPGQLPQ-----HEYL 2169
Cdd:pfam01398    1 SSVRTVIIHPLVLLKILDHANRGGKIgeevMGVLLGKLEGDG--TIEITNSFALPQEETEDDVNAVALDQEymenmHEML 78
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 1370469953 2170 KE----MEPLGWIHTQPNEsPQLSPQDVTTHAKIMADNPS 2205
Cdd:pfam01398   79 KKvnrkEEVVGWYHTHPGL-CWLSSVDVHTHALYQRMIPE 117
MPN cd07767
Mpr1p, Pad1p N-terminal (MPN) domains; MPN (also known as Mov34, PAD-1, JAMM, JAB, MPN+) ...
2111-2221 3.28e-22

Mpr1p, Pad1p N-terminal (MPN) domains; MPN (also known as Mov34, PAD-1, JAMM, JAB, MPN+) domains are found in the N-terminal termini of proteins with a variety of functions; they are components of the proteasome regulatory subunits, the signalosome (CSN), eukaryotic translation initiation factor 3 (eIF3) complexes, and regulators of transcription factors. These domains are isopeptidases that release ubiquitin from ubiquitinated proteins (thus having deubiquitinating (DUB) activity) that are tagged for degradation. Catalytically active MPN domains contain a metalloprotease signature known as the JAB1/MPN/Mov34 metalloenzyme (JAMM) motif. For example, Rpn11 (also known as POH1 or PSMD14), a subunit of the 19S proteasome lid is involved in the ATP-dependent degradation of ubiquitinated proteins, contains the conserved JAMM motif involved in zinc ion coordination. Poh1 is a regulator of c-Jun, an important regulator of cell proliferation, differentiation, survival and death. JAB1 is a component of the COP9 signalosome (CSN), a regulatory particle of the ubiquitin (Ub)/26S proteasome system occurring in all eukaryotic cells; it cleaves the ubiquitin-like protein NEDD8 from the cullin subunit of the SCF (Skp1, Cullins, F-box proteins) family of E3 ubiquitin ligases. AMSH (associated molecule with the SH3 domain of STAM, also known as STAMBP), a member of JAMM/MPN+ deubiquitinases (DUBs), specifically cleaves Lys 63-linked polyubiquitin (poly-Ub) chains, thus facilitating the recycling and subsequent trafficking of receptors to the cell surface. Similarly, BRCC36, part of the nuclear complex that includes BRCA1 protein and is targeted to DNA damage foci after irradiation, specifically disassembles K63-linked polyUb. BRCC36 is aberrantly expressed in sporadic breast tumors, indicative of a potential role in the pathogenesis of the disease. Some variants of the JAB1/MPN domains lack key residues in their JAMM motif and are unable to coordinate a metal ion. Comparisons of key catalytic and metal binding residues explain why the MPN-containing proteins Mov34/PSMD7, Rpn8, CSN6, Prp8p, and the translation initiation factor 3 subunits f (p47) and h (p40) do not show catalytic isopeptidase activity. It has been proposed that the MPN domain in these proteins has a primarily structural function.


Pssm-ID: 163686 [Multi-domain]  Cd Length: 116  Bit Score: 93.73  E-value: 3.28e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370469953 2111 LKKFI--CISDLRAQIAGYLYGVSPPDnpqVKEIRCIVMVPQWGTHQTVHLP--GQLPQHEYLKEMEPLGWIHTQPNESP 2186
Cdd:cd07767      1 LKMFLdaAKSINGKEVIGLLYGSKTKK---VLDVDEVIAVPFDEGDKDDNVWflMYLDFKKLNAGLRIVGWYHTHPKPSC 77
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 1370469953 2187 QLSPQDVTTHAKIMAdnpswDGEKTIIITCSFTPG 2221
Cdd:cd07767     78 FLSPNDLATHELFQR-----YFPEKVMIIVDVKPK 107
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH