MGP Database

MGP001981

Record overview

MGPD ID: MGP001981
Gene ID: 4585
Species: Homo sapiens (Human)
Gene Name: mucin 4, cell surface associated
Gene Symbol: MUC4
Synonyms: ASGP; MUC-4; HSA276359
Alternate names: mucin-4; ascites sialoglycoprotein; mucin 4, tracheobronchial; pancreatic adenocarcinoma mucin; testis mucin; tracheobronchial mucin
Chromosome: 3
Map Location: 3q29
Summary: The major constituents of mucus, the viscous secretion that covers epithelial surfaces such as those in the trachea, colon, and cervix, are highly glycosylated proteins called mucins. These glycoproteins play important roles in the protection of the epithelial cells and have been implicated in epithelial renewal and differentiation. This gene encodes an integral membrane glycoprotein found on the cell surface, although secreted isoforms may exist. At least two dozen transcript variants of this gene have been found, although for many of them the full-length transcript has not been determined or they are found only in tumor tissues. This gene contains a region in the coding sequence which has a variable number (>100) of 48 nt tandem repeats. [provided by RefSeq, Jul 2008]
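The Summary's repeat figure can be put in protein terms with simple codon arithmetic. The sketch below is illustrative only: the names `REPEAT_UNIT_NT` and `repeat_region_length` are hypothetical, and the copy number of 100 is just a round example value, since the record says the number is variable.

```python
# Illustrative arithmetic only; names and the example copy number are assumptions.
REPEAT_UNIT_NT = 48   # repeat unit length in nucleotides, from the Summary
NT_PER_CODON = 3      # standard genetic code: one amino acid per 3 nt

# 48 is a multiple of 3, so each repeat unit stays in frame.
assert REPEAT_UNIT_NT % NT_PER_CODON == 0
residues_per_repeat = REPEAT_UNIT_NT // NT_PER_CODON


def repeat_region_length(n_repeats: int) -> int:
    """Amino acids contributed by n_repeats copies of the tandem repeat."""
    return n_repeats * residues_per_repeat


print(residues_per_repeat)        # 16 residues per repeat unit
print(repeat_region_length(100))  # 1600 residues for an example of 100 copies
```

Each 48 nt unit therefore corresponds to 16 amino acids, which is why alleles with different repeat counts encode proteins of noticeably different length.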

Proteins

mucin-4 isoform a precursor
RefSeq ID: NP_060876
Protein GI: 257471027
UniProt ID: Q99102
mRNA ID: NM_018406
Length: 5412
RefSeq Status:
MKGARWRRVPWVSLSCLCLCLLPHVVPGTTEDTLITGSKTAAPVTSTGSTTATLEGQSTAASSRTSNQDISASSQNHQTKSTETTSKAQTDTLTQMMTST
LFSSPSVHNVMETAPPDEMTTSFPSSVTNTLMMTSKTITMTTSTDSTLGNTEETSTAGTESSTPVTSAVSITAGQEGQSRTTSWRTSIQDTSASSQNHWT
RSTQTTRESQTSTLTHRTTSTPSFSPSVHNVTGTVSQKTSPSGETATSSLCSVTNTSMMTSEKITVTTSTGSTLGNPGETSSVPVTGSLMPVTSAALVTF
DPEGQSPATFSRTSTQDTTAFSKNHQTQSVETTRVSQINTLNTLTPVTTSTVLSSPSGFNPSGTVSQETFPSGETTTSSPSSVSNTFLVTSKVFRMPTSR
DSTLGNTEETSLSVSGTISAITSKVSTIWWSDTLSTALSPSSLPPKISTAFHTQQSEGAETTGRPHERSSFSPGVSQEIFTLHETTTWPSSFSSKGHTTW
SQTELPSTSTGAATRLVTGNPSTGTAGTIPRVPSKVSAIGEPGEPTTYSSHSTTLPKTTGAGAQTQWTQETGTTGEALLSSPSYSVTQMIKTATSPSSSP
MLDRHTSQQITTAPSTNHSTIHSTSTSPQESPAVSQRGHTQAPQTTQESQTTRSVSPMTDTKTVTTPGSSFTASGHSPSEIVPQDAPTISAATTFAPAPT
GDGHTTQAPTTALQAAPSSHDATLGPSGGTSLSKTGALTLANSVVSTPGGPEGQWTSASASTSPDTAAAMTHTHQAESTEASGQTQTSEPASSGSRTTSA
GTATPSSSGASGTTPSGSEGISTSGETTRFSSNPSRDSHTTQSTTELLSASASHGAIPVSTGMASSIVPGTFHPTLSEASTAGRPTGQSSPTSPSASPQE
TAAISRMAQTQRTRTSRGSDTISLASQATDTFSTVPPTPPSITSTGLTSPQTETHTLSPSGSGKTFTTALISNATPLPVTYASSASTGHTTPLHVTDASS
VSTGHATPLPVTSPSSVSTGHTTPLPVTDTSSESTGHVTPLPVTSFSSASTGDSTPLPVTDTSSASTGHVTPLPVTSLSSASTGDTTPLPVTDTSSASTG
HATSLPVTDTSSVSTGHTTPLPVTDTSSASTGHATSLPVTDTSSVSTGHTTPLHVTDASSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATP
LLVTDTSSASTGHATPLPVTDASSVSTDHATSLPVTIPSAASTGHTTPLPVTDTSSASTGQATSLLVTDTSSVSTGDTTPLPVTSTSSASTGHVTPLHVT
SPSSASTGHATPLPVTSLSSASTGDTMPLPVTSPSSASTGDTTPLPVTDASSVSTGHTTPLHVTDASSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSS
ASTGHATPLLVTDTSSASTGHATPLPVTDASSVSTDHATSLPVTIPSAASTGHTTPLPVTDTSSASTGQATSLLVTDTSSVSTGDTTPLPVTSTSSASTG
HVTPLHVTSPSSASTGHATPLPVTSLSSASTGDTMPLPVTSPSSASTGDTTPLPVTDASSVSTGHTTPLPVTSPSSASTGHTTPLPVTDTSSASKGDTTP
LPVTSPSSASTGHTTPLPVTDTSSASTGDTTPLPVTNASSLSTGHATPLHVTSPSSASTGHATPLPVTSTSSASTGHATPLPVTGLSSATTDDTTRLPVT
DVSSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHASPLLVTDASSASTGQATPLPVTDTSSVSTAHATPLPVTGLSSASTDDTTRLPVTDVSS
ASTGQAIPLPVTSPSSASTGDTTPLPVTDASSASTGDTTSLPVTIPSSASSGHTTSLPVTDASSVSTGHATSLLVTDASSVSTGDTTPLPVTDTNSASTG
DTTPLHVTDASSVSTGHATSLPVTSLSSASTGDTTPLPVTSPSSASSGHTTPLPVTDASSVPTGHATSLPVTDASSVSTGHATPLPVTDASSVSTGHATP
LPVTDTSSVSTGQATPLPVTSLSSASTGDTTPLPVTDTSSASTGQDTPLPVTSLSSVSTGDTTPLPVTNPSSASTGHATPLLVTDASSISTGHATSLLVT
DASSVSTGHATALHDTDASSLSTGDTTPLPVTSPSSTSTGDTTPLPVTETSSVSTGHATSLPVTDTSSASTGHATSLPVTDTSSASTGHATPLPVTDTSS
ASTGQATPLPVTSPSSASTGHAIPLLVTDTSSASTGQATPLPVTSLSSASTGDTTPLPVTDASSVSTGHATSLPVTSLSSVSTGDTTPLPVTSPSSASTG
HATPLHVTDASSASTGHATPLPVTSLSSASTGDTTPLPVTSPSSASTGHATPLHVTDASSVSTGDTTPLPVTSSSSASSGHTTPLPVTDASSASTGDTTP
LPVTDTSSASTGHATHLPVTGLSSASTGDTTRLPVTNVSSASTGHATPLPVTSTSSASTGDTTPLPGTDTSSVSTGHTTPLLVTDASSVSTGDTTRLPVT
SPSSASTGHTTPLPVTDTPSASTGDTTPLPVTNASSLSTRHATSLHVTSPSSASTGHATSLPVTDTSAASTGHATPLPVTSTSSASTGDTTPLPVTDTYS
ASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATPLLVTDASSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATSLPVTDTSSASTG
DTTSLPVTDTSSAYTGDTTSLPVTDTSSSSTGDTTPLLVTETSSVSTGDTTPLPVTDTSSASTGHATPLPVTNTSSVSTGHATPLHVTSPSSASTGHTTP
LPVTDASSVSTGHATSLPVTDASSVFTGHATSLPVTIPSSASSGHTTPLPVTDASSVSTGHATSLPVTDASSVSTGHATPLPVTDASSVSTGHATPLPLT
SLSSVSTGDTTPLPVTDTSSASTGQATPLPVTSLSSVSTGDTTPLPVTDTSSASTGHATSLPVTDTSSASTGHATPLPDTDTSSASTGHATLLPVTDTSS
ASIGHATSLPVTDTSSISTGHATPLHVTSPSSASTGHATPLPVTDTSSASTGHANPLHVTSPSSASTGHATPLPVTDTSSASTGHATPLPVTSLSSVSTG
DTTPLPVTSPSSASTGHTTPLPVTDTSSASTGQATALPVTSTSSASTGDTTPLPVTDTSSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATP
LLVTDASSASTGQATPLPVTSLSSVSTGDTTPLPVTSPSSASTGHATSLPVTDTSSASTGDTTSLPVTDTSSAYTGDTTSLPVTDTSSSSTGDTTPLLVT
ETSSVSTGHATPLLVTDASSASTGHATPLHVTSPSSASTGDTTPVPVTDTSSVSTGHATPLPVTGLSSASTGDTTRLPVTDISSASTGQATPLPVTNTSS
VSTGDTMPLPVTSPSSASTGHATPLPVTSTSSASTGHATPVPVTSTSSASTGHTTPLPVTDTSSASTGDTTPLPVTSPSSASTGHTTPLHVTIPSSASTG
DTSTLPVTGASSASTGHATPLPVTDTSSVSTGHATPLPVTSLSSVSTGDTTPLPVTDASSASTGQATPLPVTSLSSVSTGDTTPLLVTDASSVSTGHATP
LPVTDTSSASTGDTTRLPVTDTSSASTGQATPLPVTSLSSVSTGDTTPLLVTDASSVSTGHATPLPVTDTSSASTGDTTRLPVTDTSSASTGQATPLPVT
IPSSSSSGHTTPLPVTSTSSVSTGHVTPLHVTSPSSASTGHVTPLPVTSTSSASTGHATPLLVTDASSVSTGHATPLPVTDASSASTGDTTPLPVTDTSS
ASTGQATPLPVTSLSSVSTGDTTPLPVTDASSASTGHATPLPVTIPSSVSTGDTMPLPVTSPSSASTGHATPLPVTGLSSASTGDTTPLPVTDTSSASTR
HATPLPVTDTSSASTDDTTRLPVTDVSSASTGHATPLPVTSTSSASTGDTTPLPVTDTSSVSTGHATSLPVTSRSSASTGHATPLPVTDTSSVSTGHATP
LPVTSTSSVSTGHATPLPVTSPSSASTGHATPVPVTSTSSASTGDTTPLPVTNASSLSTGHATPLHVTSPSSASRGDTSTLPVTDASSASTGHATPLPLT
SLSSVSTGDTTPLPVTDTSSASTGQATPLPVTSLSSVSTGDTTPLPVTIPSSASSGHTTSLPVTDASSVSTGHGTPLPVTSTSSASTGDTTPLPVTDTSS
ASTGHATPLPVTDTSSASTGHATPLPVTSLSSVSTGHATPLAVSSATSASTVSSDSPLKMETPGMTTPSLKTDGGRRTATSPPPTTSQTIISTIPSTAMH
TRSTAAPIPILPERGVSLFPYGAGAGDLEFVRRTVDFTSPLFKPATGFPLGSSLRDSLYFTDNGQIIFPESDYQIFSYPNPLPTGFTGRDPVALVAPFWD
DADFSTGRGTTFYQEYETFYGEHSLLVQQAESWIRKMTNNGGYKARWALKVTWVNAHAYPAQWTLGSNTYQAILSTDGSRSYALFLYQSGGMQWDVAQRS
GNPVLMGFSSGDGYFENSPLMSQPVWERYRPDRFLNSNSGLQGLQFYRLHREERPNYRLECLQWLKSQPRWPSWGWNQVSCPCSWQQGRRDLRFQPVSIG
RWGLGSRQLCSFTSWRGGVCCSYGPWGEFREGWHVQRPWQLAQELEPQSWCCRWNDKPYLCALYQQRRPHVGCATYRPPQPAWMFGDPHITTLDGVSYTF
NGLGDFLLVGAQDGNSSFLLQGRTAQTGSAQATNFIAFAAQYRSSSLGPVTVQWLLEPHDAIRVLLDNQTVTFQPDHEDGGGQETFNATGVLLSRNGSEV
SASFDGWATVSVIALSNILHASASLPPEYQNRTEGLLGVWNNNPEDDFRMPNGSTIPPGSPEEMLFHFGMTWQINGTGLLGKRNDQLPSNFTPVFYSQLQ
KNSSWAEHLISNCDGDSSCIYDTLALRNASIGLHTREVSKNYEQANATLNQYPPSINGGRVIEAYKGQTTLIQYTSNAEDANFTLRDSCTDLELFENGTL
LWTPKSLEPFTLEILARSAKIGLASALQPRTVVCHCNAESQCLYNQTSRVGNSSLEVAGCKCDGGTFGRYCEGSEDACEEPCFPSVHCVPGKGCEACPPN
LTGDGRHCAALGSSFLCQNQSCPVNYCYNQGHCYISQTLGCQPMCTCPPAFTDSRCFLAGNNFSPTVNLELPLRVIQLLLSEEENASMAEVNASVAYRLG
TLDMRAFLRNSQVERIDSAAPASGSPIQHWMVISEFQYRPRGPVIDFLNNQLLAAVVEAFLYHVPRRSEEPRNDVVFQPISGEDVRDVTALNVSTLKAYF
RCDGYKGYDLVYSPQSGFTCVSPCSRGYCDHGGQCQHLPSGPRCSCVSFSIYTAWGEHCEHLSMKLDAFFGIFFGALGGLLLLGVGTFVVLRFWGCSGAR
FSYFLNSAEALP
 
sig_peptide: 1..28
inference: COORDINATES: ab initio prediction:SignalP:4.0
calculated_mol_wt: 3178
peptide sequence: 
MKGARWRRVPWVSLSCLCLCLLPHVVPG
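The `sig_peptide: 1..28` annotation uses 1-based, inclusive coordinates, which differ from the 0-based, end-exclusive slices of most programming languages. A minimal sketch of the conversion follows, assuming a simple `start..end` location string; the helper `extract_feature` is hypothetical, and only the first 40 residues of isoform a are copied from the record to keep the example short.

```python
# First 40 residues of isoform a, copied from the record (full length is 5412).
ISOFORM_A_PREFIX = "MKGARWRRVPWVSLSCLCLCLLPHVVPGTTEDTLITGSKT"
# Signal peptide sequence as listed in the record.
SIGNAL_PEPTIDE = "MKGARWRRVPWVSLSCLCLCLLPHVVPG"


def extract_feature(sequence: str, location: str) -> str:
    """Return the residues covered by a 1-based inclusive 'start..end' location.

    Hypothetical helper: it handles only the plain 'start..end' form seen here.
    """
    start, end = (int(part) for part in location.split(".."))
    # Shift the start by one for 0-based indexing; the inclusive end needs no
    # shift because Python slices exclude the end index.
    return sequence[start - 1:end]


signal = extract_feature(ISOFORM_A_PREFIX, "1..28")
after_signal = ISOFORM_A_PREFIX[28:]  # residues that follow the signal peptide

print(signal == SIGNAL_PEPTIDE)  # True
print(len(signal))               # 28
print(after_signal)              # TTEDTLITGSKT
```

The same slice applied to isoforms d and e returns the identical 28-residue peptide, since all three sequences in this record share that N-terminal segment.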
 
mucin-4 isoform d precursor
RefSeq ID: NP_004523
Protein GI: 112382233
UniProt ID: Q99102
mRNA ID: NM_004532
Length: 1176
RefSeq Status:
MKGARWRRVPWVSLSCLCLCLLPHVVPGMTTPSLKTDGGRRTATSPPPTTSQTIISTIPSTAMHTRSTAAPIPILPERGVSLFPYGAGAGDLEFVRRTVD
FTSPLFKPATGFPLGSSLRDSLYFTDNGQIIFPESDYQIFSYPNPLPTGFTGRDPVALVAPFWDDADFSTGRGTTFYQEYETFYGEHSLLVQQAESWIRK
MTNNGGYKARWALKVTWVNAHAYPAQWTLGSNTYQAILSTDGSRSYALFLYQSGGMQWDVAQRSGNPVLMGFSSGDGYFENSPLMSQPVWERYRPDRFLN
SNSGLQGLQFYRLHREERPNYRLECLQWLKSQPRWPSWGWNQVSCPCSWQQGRRDLRFQPVSIGRWGLGSRQLCSFTSWRGGVCCSYGPWGEFREGWHVQ
RPWQLAQELEPQSWCCRWNDKPYLCALYQQRRPHVGCATYRPPQPAWMFGDPHITTLDGVSYTFNGLGDFLLVGAQDGNSSFLLQGRTAQTGSAQATNFI
AFAAQYRSSSLGPVTVQWLLEPHDAIRVLLDNQTVTFQPDHEDGGGQETFNATGVLLSRNGSEVSASFDGWATVSVIALSNILHASASLPPEYQNRTEGL
LGVWNNNPEDDFRMPNGSTIPPGSPEEMLFHFGMTWQINGTGLLGKRNDQLPSNFTPVFYSQLQKNSSWAEHLISNCDGDSSCIYDTLALRNASIGLHTR
EVSKNYEQANATLNQYPPSINGGRVIEAYKGQTTLIQYTSNAEDANFTLRDSCTDLELFENGTLLWTPKSLEPFTLEILARSAKIGLASALQPRTVVCHC
NAESQCLYNQTSRVGNSSLEVAGCKCDGGTFGRYCEGSEDACEEPCFPSVHCVPGKGCEACPPNLTGDGRHCAALGSSFLCQNQSCPVNYCYNQGHCYIS
QTLGCQPMCTCPPAFTDSRCFLAGNNFSPTVNLELPLRVIQLLLSEEENASMAEVNASVAYRLGTLDMRAFLRNSQVERIDSAAPASGSPIQHWMVISEF
QYRPRGPVIDFLNNQLLAAVVEAFLYHVPRRSEEPRNDVVFQPISGEDVRDVTALNVSTLKAYFRCDGYKGYDLVYSPQSGFTCVSPCSRGYCDHGGQCQ
HLPSGPRCSCVSFSIYTAWGEHCEHLSMKLDAFFGIFFGALGGLLLLGVGTFVVLRFWGCSGARFSYFLNSAEALP
 
sig_peptide: 1..28
inference: COORDINATES: ab initio prediction:SignalP:4.0
calculated_mol_wt: 3178
peptide sequence: 
MKGARWRRVPWVSLSCLCLCLLPHVVPG

 
mucin-4 isoform e precursor
RefSeq ID: NP_612154
Protein GI: 112382231
UniProt ID: Q99102
mRNA ID: NM_138297
Length: 1125
RefSeq Status:
MKGARWRRVPWVSLSCLCLCLLPHVVPGVSLFPYGAGAGDLEFVRRTVDFTSPLFKPATGFPLGSSLRDSLYFTDNGQIIFPESDYQIFSYPNPLPTGFT
GRDPVALVAPFWDDADFSTGRGTTFYQEYETFYGEHSLLVQQAESWIRKMTNNGGYKARWALKVTWVNAHAYPAQWTLGSNTYQAILSTDGSRSYALFLY
QSGGMQWDVAQRSGNPVLMGFSSGDGYFENSPLMSQPVWERYRPDRFLNSNSGLQGLQFYRLHREERPNYRLECLQWLKSQPRWPSWGWNQVSCPCSWQQ
GRRDLRFQPVSIGRWGLGSRQLCSFTSWRGGVCCSYGPWGEFREGWHVQRPWQLAQELEPQSWCCRWNDKPYLCALYQQRRPHVGCATYRPPQPAWMFGD
PHITTLDGVSYTFNGLGDFLLVGAQDGNSSFLLQGRTAQTGSAQATNFIAFAAQYRSSSLGPVTVQWLLEPHDAIRVLLDNQTVTFQPDHEDGGGQETFN
ATGVLLSRNGSEVSASFDGWATVSVIALSNILHASASLPPEYQNRTEGLLGVWNNNPEDDFRMPNGSTIPPGSPEEMLFHFGMTWQINGTGLLGKRNDQL
PSNFTPVFYSQLQKNSSWAEHLISNCDGDSSCIYDTLALRNASIGLHTREVSKNYEQANATLNQYPPSINGGRVIEAYKGQTTLIQYTSNAEDANFTLRD
SCTDLELFENGTLLWTPKSLEPFTLEILARSAKIGLASALQPRTVVCHCNAESQCLYNQTSRVGNSSLEVAGCKCDGGTFGRYCEGSEDACEEPCFPSVH
CVPGKGCEACPPNLTGDGRHCAALGSSFLCQNQSCPVNYCYNQGHCYISQTLGCQPMCTCPPAFTDSRCFLAGNNFSPTVNLELPLRVIQLLLSEEENAS
MAEVNASVAYRLGTLDMRAFLRNSQVERIDSAAPASGSPIQHWMVISEFQYRPRGPVIDFLNNQLLAAVVEAFLYHVPRRSEEPRNDVVFQPISGEDVRD
VTALNVSTLKAYFRCDGYKGYDLVYSPQSGFTCVSPCSRGYCDHGGQCQHLPSGPRCSCVSFSIYTAWGEHCEHLSMKLDAFFGIFFGALGGLLLLGVGT
FVVLRFWGCSGARFSYFLNSAEALP
 
sig_peptide: 1..28
inference: COORDINATES: ab initio prediction:SignalP:4.0
calculated_mol_wt: 3178
peptide sequence: 
MKGARWRRVPWVSLSCLCLCLLPHVVPG

 