NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|663854069|ref|NP_001287712|]
View 

protein Largen isoform 1 [Homo sapiens]

Protein Classification

Largen family protein( domain architecture ID 12172577)

Largen family protein containing a DUF4589 domain, similar to Homo sapiens protein Largen (also called proline-rich protein 16), a regulator of cell size that promotes cell size increase independently of mTOR and Hippo signaling pathways

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
DUF4589 pfam15252
Domain of unknown function (DUF4589); This protein family is a domain of unknown function. The ...
53-304 2.22e-62

Domain of unknown function (DUF4589); This protein family is a domain of unknown function. The precise function of the protein domain remains to be elucidated. This family of proteins is found in eukaryotes and are typically between 215 and 293 amino acids in length. The protein contains two conserved sequence motifs: SSS and KST.


:

Pssm-ID: 464592  Cd Length: 232  Bit Score: 197.33  E-value: 2.22e-62
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 663854069   53 EVVDQIDTLTSDLQLEDEmTDSSKTDTLNSS---SSGTTASSLEKIKVQANAPLIKPPAhpsaILTVLRKPNPPPpppRL 129
Cdd:pfam15252   1 EVVSQIDKLTSDFQLELE-PDDWTTATLSSTsssDKGGGPFDLGKLDFMTADILSDSWE----FCSFLDKSTPSP---RL 72
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 663854069  130 TPVKCEDPkrvvptanpVKTNGTLLRNGGLPggpnkIPNGDICCIPNSNLDKAPVQLLMHRPEKDRCPqaGPRERVRFNE 209
Cdd:pfam15252  73 TPPESEDP---------GKGPGYRLMNGGLP-----IPNGPRIETPDSSSEEAFSSAPLLRHEKQRTP--GTRERVRFSD 136
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 663854069  210 KVQYHGYCPDCDTRYNIKNRevhlHSEPVH---PPGKIPHQGPPLPPTPHLPPFPLENGGmGISHSNSFP---PIRPATV 283
Cdd:pfam15252 137 KVLYHALCCDDDERYDEDNR----HEEPEDgasLPLDPPHCCPSSSPPPPPLPPFLNPSF-PPVPPCVKPrpsPLKPGRR 211
                         250       260
                  ....*....|....*....|.
gi 663854069  284 PPPTAPKPQKTILRKSTTTTV 304
Cdd:pfam15252 212 GKTTRNSSTQTVSDKSTQTTL 232
 
Name Accession Description Interval E-value
DUF4589 pfam15252
Domain of unknown function (DUF4589); This protein family is a domain of unknown function. The ...
53-304 2.22e-62

Domain of unknown function (DUF4589); This protein family is a domain of unknown function. The precise function of the protein domain remains to be elucidated. This family of proteins is found in eukaryotes and are typically between 215 and 293 amino acids in length. The protein contains two conserved sequence motifs: SSS and KST.


Pssm-ID: 464592  Cd Length: 232  Bit Score: 197.33  E-value: 2.22e-62
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 663854069   53 EVVDQIDTLTSDLQLEDEmTDSSKTDTLNSS---SSGTTASSLEKIKVQANAPLIKPPAhpsaILTVLRKPNPPPpppRL 129
Cdd:pfam15252   1 EVVSQIDKLTSDFQLELE-PDDWTTATLSSTsssDKGGGPFDLGKLDFMTADILSDSWE----FCSFLDKSTPSP---RL 72
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 663854069  130 TPVKCEDPkrvvptanpVKTNGTLLRNGGLPggpnkIPNGDICCIPNSNLDKAPVQLLMHRPEKDRCPqaGPRERVRFNE 209
Cdd:pfam15252  73 TPPESEDP---------GKGPGYRLMNGGLP-----IPNGPRIETPDSSSEEAFSSAPLLRHEKQRTP--GTRERVRFSD 136
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 663854069  210 KVQYHGYCPDCDTRYNIKNRevhlHSEPVH---PPGKIPHQGPPLPPTPHLPPFPLENGGmGISHSNSFP---PIRPATV 283
Cdd:pfam15252 137 KVLYHALCCDDDERYDEDNR----HEEPEDgasLPLDPPHCCPSSSPPPPPLPPFLNPSF-PPVPPCVKPrpsPLKPGRR 211
                         250       260
                  ....*....|....*....|.
gi 663854069  284 PPPTAPKPQKTILRKSTTTTV 304
Cdd:pfam15252 212 GKTTRNSSTQTVSDKSTQTTL 232
 
Name Accession Description Interval E-value
DUF4589 pfam15252
Domain of unknown function (DUF4589); This protein family is a domain of unknown function. The ...
53-304 2.22e-62

Domain of unknown function (DUF4589); This protein family is a domain of unknown function. The precise function of the protein domain remains to be elucidated. This family of proteins is found in eukaryotes and are typically between 215 and 293 amino acids in length. The protein contains two conserved sequence motifs: SSS and KST.


Pssm-ID: 464592  Cd Length: 232  Bit Score: 197.33  E-value: 2.22e-62
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 663854069   53 EVVDQIDTLTSDLQLEDEmTDSSKTDTLNSS---SSGTTASSLEKIKVQANAPLIKPPAhpsaILTVLRKPNPPPpppRL 129
Cdd:pfam15252   1 EVVSQIDKLTSDFQLELE-PDDWTTATLSSTsssDKGGGPFDLGKLDFMTADILSDSWE----FCSFLDKSTPSP---RL 72
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 663854069  130 TPVKCEDPkrvvptanpVKTNGTLLRNGGLPggpnkIPNGDICCIPNSNLDKAPVQLLMHRPEKDRCPqaGPRERVRFNE 209
Cdd:pfam15252  73 TPPESEDP---------GKGPGYRLMNGGLP-----IPNGPRIETPDSSSEEAFSSAPLLRHEKQRTP--GTRERVRFSD 136
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 663854069  210 KVQYHGYCPDCDTRYNIKNRevhlHSEPVH---PPGKIPHQGPPLPPTPHLPPFPLENGGmGISHSNSFP---PIRPATV 283
Cdd:pfam15252 137 KVLYHALCCDDDERYDEDNR----HEEPEDgasLPLDPPHCCPSSSPPPPPLPPFLNPSF-PPVPPCVKPrpsPLKPGRR 211
                         250       260
                  ....*....|....*....|.
gi 663854069  284 PPPTAPKPQKTILRKSTTTTV 304
Cdd:pfam15252 212 GKTTRNSSTQTVSDKSTQTTL 232
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH