DP00588: Genome polyprotein [Isoform Genome polyprotein]FASTA viewXML view

General information
DisProt:DP00588
Name:Genome polyprotein [Isoform Genome polyprotein]
Synonym(s):POLG_HCVH
Core protein p21 [cleavage product 1]
Capsid protein C
p21
Core protein p19 [cleavage product 2]
Envelope glycoprotein E1 [cleavage product 3]
gp32
gp35
Envelope glycoprotein E2 [cleavage product 4]
NS1
gp68
gp70
p7 [cleavage product 5]
Protease NS2-3 [cleavage product 6]
p23
Serine protease/NTPase/helicase NS3 [cleavage product 7]
Hepacivirin
NS3P
p70
Non-structural protein 4A [cleavage product 8]
NS4A
p8
Non-structural protein 4B [cleavage product 9]
NS4B
p27
Non-structural protein 5A [cleavage product 10]
NS5A
p56
RNA-directed RNA polymerase [cleavage product 11]
NS5B
p68
First appeared in release:Release 4.9 (01/06/2009)
UniProt:P27958
UniGene: 
SwissProt: POLG_HCVH
TrEMBL:  
NCBI (GI): 130461
Source organism:Hepatitis C virus genotype 1a (isolate H) (HCV)
Sequence length:3011
Percent disordered:-1%
Homologues: 


Native sequence

        10         20         30         40         50         60
         |          |          |          |          |          |
MSTNPKPQRK TKRNTNRRPQ DVKFPGGGQI VGGVYLLPRR GPRLGVRATR KTSERSQPRG - 60
RRQPIPKARR PEGRTWAQPG YPWPLYGNEG CGWAGWLLSP RGSRPSWGPT DPRRRSRNLG - 120
KVIDTLTCGF ADLMGYIPLV GAPLGGAARA LAHGVRVLED GVNYATGNLP GCSFSIFLLA - 180
LLSCLTVPAS AYQVRNSSGL YHVTNDCPNS SVVYEAADAI LHTPGCVPCV REGNASRCWV - 240
AVTPTVATRD GKLPTTQLRR HIDLLVGSAT LCSALYVGDL CGSVFLVGQL FTFSPRHHWT - 300
TQDCNCSIYP GHITGHRMAW NMMMNWSPTA ALVVAQLLRI PQAIMDMIAG AHWGVLAGIK - 360
YFSMVGNWAK VLVVLLLFAG VDAETHVTGG NAGRTTAGLV GLLTPGAKQN IQLINTNGSW - 420
HINSTALNCN ESLNTGWLAG LFYQHKFNSS GCPERLASCR RLTDFAQGWG PISYANGSGL - 480
DERPYCWHYP PRPCGIVPAK SVCGPVYCFT PSPVVVGTTD RSGAPTYSWG ANDTDVFVLN - 540
NTRPPLGNWF GCTWMNSTGF TKVCGAPPCV IGGVGNNTLL CPTDCFRKYP EATYSRCGSG - 600
PRITPRCMVD YPYRLWHYPC TINYTIFKVR MYVGGVEHRL EAACNWTRGE RCDLEDRDRS - 660
ELSPLLLSTT QWQVLPCSFT TLPALSTGLI HLHQNIVDVQ YLYGVGSSIA SWAIKWEYVV - 720
LLFLLLADAR VCSCLWMMLL ISQAEAALEN LVILNAASLA GTHGLVSFLV FFCFAWYLKG - 780
RWVPGAVYAL YGMWPLLLLL LALPQRAYAL DTEVAASCGG VVLVGLMALT LSPYYKRYIS - 840
WCMWWLQYFL TRVEAQLHVW VPPLNVRGGR DAVILLTCVV HPALVFDITK LLLAIFGPLW - 900
ILQASLLKVP YFVRVQGLLR ICALARKIAG GHYVQMAIIK LGALTGTCVY NHLAPLRDWA - 960
HNGLRDLAVA VEPVVFSRME TKLITWGADT AACGDIINGL PVSARRGQEI LLGPADGMVS - 1020
KGWRLLAPIT AYAQQTRGLL GCIITSLTGR DKNQVEGEVQ IVSTATQTFL ATCINGVCWT - 1080
VYHGAGTRTI ASPKGPVIQT YTNVDQDLVG WPAPQGSRSL TPCTCGSSDL YLVTRHADVI - 1140
PVRRRGDSRG SLLSPRPISY LKGSSGGPLL CPTGHAVGLF RAAVCTRGVA KAVDFIPVEN - 1200
LETTMRSPVF TDNSSPPAVP QSFQVAHLHA PTGSGKSTKV PAAYAAKGYK VLVLNPSVAA - 1260
TLGFGAYMSK AHGVDPNIRT GVRTITTGSP ITYSTYGKFL ADAGCSGGAY DIIICDECHS - 1320
TDATSISGIG TVLDQAETAG ARLVVLATAT PPGSVTVSHP NIEEVALSTT GEIPFYGKAI - 1380
PLEVIKGGRH LIFCHSKKKC DELAAKLVAL GINAVAYYRG LDVSVIPTSG DVVVVSTDAL - 1440
MTGFTGDFDS VIDCNTCVTQ TVDFSLDPTF TIETTTLPQD AVSRTQRRGR TGRGKPGIYR - 1500
FVAPGERPSG MFDSSVLCEC YDAGCAWYEL TPAETTVRLR AYMNTPGLPV CQDHLGFWEG - 1560
VFTGLTHIDA HFLSQTKQSG ENFPYLVAYQ ATVCARAQAP PPSWDQMRKC LIRLKPTLHG - 1620
PTPLLYRLGA VQNEVTLTHP ITKYIMTCMS ADLEVVTSTW VLVGGVLAAL AAYCLSTGCV - 1680
VIVGRIVLSG KPAIIPDREV LYQEFDEMEE CSQHLPYIEQ GMMLAEQFKQ KALGLLQTAS - 1740
RHAEVITPAV QTNWQKLEVF WAKHMWNFIS GIQYLAGLST LPGNPAIASL MAFTAAVTSP - 1800
LTTGQTLLFN ILGGWVAAQL AAPGAATAFV GAGLAGAALD SVGLGKVLVD ILAGYGAGVA - 1860
GALVAFKIMS GEVPSTEDLV NLLPAILSPG ALAVGVVFAS ILRRRVGPGE GAVQWMNRLI - 1920
AFASRGNHVS PTHYVPESDA AARVTAILSS LTVTQLLRRL HQWISSECTT PCSGSWLRDI - 1980
WDWICEVLSD FKTWLKAKLM PQLPGIPFVS CQRGYRGVWR GDGIMHTRCH CGAEITGHVK - 2040
NGTMRIVGPR TCKNMWSGTF FINAYTTGPC TPLPAPNYKF ALWRVSAEEY VEIRRVGDFH - 2100
YVSGMTTDNL KCPCQIPSPE FFTELDGVRL HRFAPPCKPL LREEVSFRVG LHEYPVGSQL - 2160
PCEPEPDVAV LTSMLTDPSH ITAEAAGRRL ARGSPPSMAS SSASQLSAPS LKATCTANHD - 2220
SPDAELIEAN LLWRQEMGGN ITRVESENKV VILDSFDPLV AEEDEREVSV PAEILRKSRR - 2280
FAPALPVWAR PDYNPLLVET WKKPDYEPPV VHGCPLPPPR SPPVPPPRKK RTVVLTESTL - 2340
PTALAELATK SFGSSSTSGI TGDNTTTSSE PAPSGCPPDS DVESYSSMPP LEGEPGDPDL - 2400
SDGSWSTVSS GADTEDVVCC SMSYSWTGAL VTPCAAEEQK LPINALSNSL LRHHNLVYST - 2460
TSRSACQRKK KVTFDRLQVL DSHYQDVLKE VKAAASKVKA NLLSVEEACS LAPPHSAKSK - 2520
FGYGAKDVRC HARKAVAHIN SVWKDLLEDS VTPIDTTIMA KNEVFCVQPE KGGRKPARLI - 2580
VFPDLGVRVC EKMALYDVVS KLPLAVMGSS YGFQYSPGQR VEFLVQAWKS KKTPMGLSYD - 2640
TRCFDSTVTE SDIRTEEAIY QCCDLDPQAR VAIKSLTERL YVGGPLTNSR GENCGYRRCR - 2700
ASRVLTTSCG NTLTRYIKAR AACRAAGLQD CTMLVCGDDL VVICESAGVQ EDAASLRAFT - 2760
EAMTRYSAPP GDPPQPEYDL ELITSCSSNV SVAHDGAGKR VYYLTRDPTT PLARAAWETA - 2820
RHTPVNSWLG NIIMFAPTLW ARMILMTHFF SVLIARDQLE QALNCEIYGA CYSIEPLDLP - 2880
PIIQRLHGLS AFSLHSYSPG EINRVAACLR KLGVPPLRAW RHRAWSVRAR LLARGGKAAI - 2940
CGKYLFNWAV RTKLKLTPIT AAGRLDLSGW FTAGYSGGDI YHSVSHARPR WFWFCLLLLA - 3000
AGVGIYLLPN R



Functional narrative    

Core protein packages viral RNA to form a viral nucleocapsid, and promotes virion budding. Modulates viral translation initiation by interacting with HCV IRES and 40S ribosomal subunit. Also regulates many host cellular functions such as signaling pathways and apoptosis. Prevents the establishment of cellular antiviral state by blocking the interferon-alpha/beta (IFN-alpha/beta) and IFN-gamma signaling pathways and by inducing human STAT1 degradation. Thought to play a role in virus-mediated cell transformation leading to hepatocellular carcinomas. Interacts with, and activates STAT3 leading to cellular transformation. May repress the promoter of p53, and sequester CREB3 and SP110 isoform 3/Sp110b in the cytoplasm. Also represses cell cycle negative regulating factor CDKN1A, thereby interrupting an important check point of normal cell cycle regulation. Targets transcription factors involved in the regulation of inflammatory responses and in the immune response: suppresses NK-kappaB activation, and activates AP-1. Could mediate apoptotic pathways through association with TNF-type receptors TNFRSF1A and LTBR, although its effect on death receptor-induced apoptosis remains controversial. Enhances TRAIL mediated apoptosis, suggesting that it might play a role in immune-mediated liver cell injury. Seric core protein is able to bind C1QR1 at the T-cell surface, resulting in down-regulation of T-lymphocytes proliferation. May transactivate human MYC, Rous sarcoma virus LTR, and SV40 promoters. May suppress the human FOS and HIV-1 LTR activity. Alters lipid metabolism by interacting with hepatocellular proteins involved in lipid accumulation and storage. Core protein induces up-regulation of FAS promoter activity, and thereby probably contributes to the increased triglyceride accumulation in hepatocytes (steatosis). E1 and E2 glycoproteins form a heterodimer that is involved in virus attachment to the host cell, virion internalization through clathrin-dependent endocytosis and fusion with host membrane. E1/E2 heterodimer binds to human LDLR, CD81 and SCARB1/SR-BI receptors, but this binding is not sufficient for infection, some additional liver specific cofactors may be needed. The fusion function may possibly be carried by E1. E2 inhibits human EIF2AK2/PKR activation, preventing the establishment of an antiviral state. E2 is a viral ligand for CD209/DC-SIGN and CLEC4M/DC-SIGNR, which are respectively found on dendritic cells (DCs), and on liver sinusoidal endothelial cells and macrophage-like cells of lymph node sinuses. These interactions allow capture of circulating HCV particles by these cells and subsequent transmission to permissive cells. DCs act as sentinels in various tissues where they entrap pathogens and convey them to local lymphoid tissue or lymph node for establishment of immunity. Capture of circulating HCV particles by these SIGN+ cells may facilitate virus infection of proximal hepatocytes and lymphocyte subpopulations and may be essential for the establishment of persistent infection. P7 seems to be a heptameric ion channel protein (viroporin) and is inhibited by the antiviral drug amantadine By similarity. Also inhibited by long-alkyl-chain iminosugar derivatives. Essential for infectivity. Protease NS2-3 is a cysteine protease responsible for the autocatalytic cleavage of NS2-NS3. Seems to undergo self-inactivation following maturation. NS3 displays three enzymatic activities: serine protease, NTPase and RNA helicase. NS3 serine protease, in association with NS4A, is responsible for the cleavages of NS3-NS4A, NS4A-NS4B, NS4B-NS5A and NS5A-NS5B. NS3/NS4A complex also prevents phosphorylation of human IRF3, thus preventing the establishment of dsRNA induced antiviral state. NS3 RNA helicase binds to RNA and unwinds dsRNA in the 3' to 5' direction, and likely RNA stable secondary structure in the template strand. Cleaves and inhibits the host antiviral protein MAVS. NS4B induces a specific membrane alteration that serves as a scaffold for the virus replication complex. This membrane alteration gives rise to the so-called ER-derived membranous web that contains the replication complex. NS4B polymerization or in protein-protein interactions activity may contribute to its function in membranous web formation. NS5A is a component of the replication complex involved in RNA-binding. Its interaction with Human VAPB may target the viral replication complex to vesicles. Down-regulates viral IRES translation initiation. Mediates interferon resistance, presumably by interacting with and inhibiting human EIF2AK2/PKR. Seems to inhibit apoptosis by interacting with BIN1 and FKBP8. The hyperphosphorylated form of NS5A is an inhibitor of viral replication. NS5B is a RNA-dependent RNA polymerase that plays an essential role in the virus replication. (UniProt)



Map of ordered and disordered regions







Note: 'Mouse' over a region to see the start and stop residues. Click on a region to see detailed information.


Comments


Previous entry DP00588 has been split into polyprotein (DP00588) and cleavage products (DP00588_C0XX).



NOTE on Organism: DP00588 is organism Hepatitis C virus genotype 1a (isolate H) (HCV), aka HCVH; DP00615 is Hepatitis C virus genotype 1b (isolate Con1) (HCV) aka HCV1b or HCVCO. These entries share 85% identity.


If you have any comments or wish to provide additional references to this protein or its disordered region(s), please click here to e-mail us.


Disprot-footer
Contact us