US20050170337A1 - Oligomerization of hepatitis delta antigen - Google Patents
Oligomerization of hepatitis delta antigen Download PDFInfo
- Publication number
- US20050170337A1 US20050170337A1 US11/038,652 US3865205A US2005170337A1 US 20050170337 A1 US20050170337 A1 US 20050170337A1 US 3865205 A US3865205 A US 3865205A US 2005170337 A1 US2005170337 A1 US 2005170337A1
- Authority
- US
- United States
- Prior art keywords
- hdag
- nucleic acid
- acid molecule
- binding
- binding moiety
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 239000000427 antigen Substances 0.000 title description 54
- 108091007433 antigens Proteins 0.000 title description 54
- 102000036639 antigens Human genes 0.000 title description 54
- 208000037262 Hepatitis delta Diseases 0.000 title description 40
- 208000005331 Hepatitis D Diseases 0.000 title description 13
- 208000029570 hepatitis D virus infection Diseases 0.000 title description 13
- 238000006384 oligomerization reaction Methods 0.000 title description 11
- 230000027455 binding Effects 0.000 claims abstract description 194
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 165
- 210000004027 cell Anatomy 0.000 claims abstract description 145
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 140
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 133
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 133
- 239000013598 vector Substances 0.000 claims abstract description 80
- 230000004927 fusion Effects 0.000 claims abstract description 77
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 70
- 238000000034 method Methods 0.000 claims abstract description 63
- 239000012634 fragment Substances 0.000 claims abstract description 41
- 230000014509 gene expression Effects 0.000 claims abstract description 36
- 108090000623 proteins and genes Proteins 0.000 claims description 147
- 150000001413 amino acids Chemical class 0.000 claims description 74
- 239000003446 ligand Substances 0.000 claims description 51
- 229920001184 polypeptide Polymers 0.000 claims description 43
- 239000002773 nucleotide Substances 0.000 claims description 32
- 125000003729 nucleotide group Chemical group 0.000 claims description 32
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 30
- 230000006870 function Effects 0.000 claims description 22
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 20
- 238000010367 cloning Methods 0.000 claims description 18
- 239000012636 effector Substances 0.000 claims description 15
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 13
- 108010077850 Nuclear Localization Signals Proteins 0.000 claims description 12
- 230000000295 complement effect Effects 0.000 claims description 10
- 230000035772 mutation Effects 0.000 claims description 10
- 230000008878 coupling Effects 0.000 claims description 9
- 238000010168 coupling process Methods 0.000 claims description 9
- 238000005859 coupling reaction Methods 0.000 claims description 9
- 238000006243 chemical reaction Methods 0.000 claims description 8
- 238000004519 manufacturing process Methods 0.000 claims description 8
- 239000000758 substrate Substances 0.000 claims description 8
- 102000004190 Enzymes Human genes 0.000 claims description 7
- 108090000790 Enzymes Proteins 0.000 claims description 7
- 108091034117 Oligonucleotide Proteins 0.000 claims description 7
- 230000037361 pathway Effects 0.000 claims description 6
- 108010001857 Cell Surface Receptors Proteins 0.000 claims description 5
- 101710163270 Nuclease Proteins 0.000 claims description 5
- 230000002708 enhancing effect Effects 0.000 claims description 5
- 238000002764 solid phase assay Methods 0.000 claims description 5
- 102000002260 Alkaline Phosphatase Human genes 0.000 claims description 3
- 108020004774 Alkaline Phosphatase Proteins 0.000 claims description 3
- 238000012286 ELISA Assay Methods 0.000 claims description 3
- 108010043121 Green Fluorescent Proteins Proteins 0.000 claims description 3
- 102000004144 Green Fluorescent Proteins Human genes 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 3
- 239000005090 green fluorescent protein Substances 0.000 claims description 3
- 102000006240 membrane receptors Human genes 0.000 claims 1
- 230000015572 biosynthetic process Effects 0.000 abstract description 16
- 210000004899 c-terminal region Anatomy 0.000 abstract description 15
- 108020001507 fusion proteins Proteins 0.000 abstract description 15
- 102000037865 fusion proteins Human genes 0.000 abstract description 15
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 abstract description 5
- 238000012412 chemical coupling Methods 0.000 abstract 1
- 235000018102 proteins Nutrition 0.000 description 102
- 102000004169 proteins and genes Human genes 0.000 description 102
- 235000001014 amino acid Nutrition 0.000 description 73
- 229940024606 amino acid Drugs 0.000 description 73
- 102000005962 receptors Human genes 0.000 description 39
- 108020003175 receptors Proteins 0.000 description 39
- 239000000539 dimer Substances 0.000 description 38
- 230000003993 interaction Effects 0.000 description 31
- 239000000178 monomer Substances 0.000 description 30
- 241000724709 Hepatitis delta virus Species 0.000 description 29
- 125000005647 linker group Chemical group 0.000 description 29
- 230000010076 replication Effects 0.000 description 27
- 230000001413 cellular effect Effects 0.000 description 25
- 230000002209 hydrophobic effect Effects 0.000 description 23
- 239000013078 crystal Substances 0.000 description 20
- 241000588724 Escherichia coli Species 0.000 description 18
- 239000013612 plasmid Substances 0.000 description 17
- 102000018697 Membrane Proteins Human genes 0.000 description 16
- 108010052285 Membrane Proteins Proteins 0.000 description 16
- 239000003814 drug Substances 0.000 description 16
- 239000000126 substance Substances 0.000 description 16
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 15
- 239000003795 chemical substances by application Substances 0.000 description 15
- 108020000999 Viral RNA Proteins 0.000 description 14
- 229940079593 drug Drugs 0.000 description 14
- 108020004414 DNA Proteins 0.000 description 13
- 125000001165 hydrophobic group Chemical group 0.000 description 13
- 238000006467 substitution reaction Methods 0.000 description 13
- 108020004705 Codon Proteins 0.000 description 12
- 241000700605 Viruses Species 0.000 description 12
- 125000004429 atom Chemical group 0.000 description 12
- 229910052739 hydrogen Inorganic materials 0.000 description 12
- 239000001257 hydrogen Substances 0.000 description 12
- 239000000243 solution Substances 0.000 description 11
- 230000001225 therapeutic effect Effects 0.000 description 11
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 10
- 241000894006 Bacteria Species 0.000 description 10
- 239000003550 marker Substances 0.000 description 10
- 239000000816 peptidomimetic Substances 0.000 description 10
- 238000003752 polymerase chain reaction Methods 0.000 description 10
- 230000003612 virological effect Effects 0.000 description 10
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 9
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 9
- 241000282414 Homo sapiens Species 0.000 description 9
- 108700026244 Open Reading Frames Proteins 0.000 description 9
- 101710149279 Small delta antigen Proteins 0.000 description 9
- 230000002378 acidificating effect Effects 0.000 description 9
- -1 adhesion molecules Proteins 0.000 description 9
- 230000000694 effects Effects 0.000 description 9
- 238000009396 hybridization Methods 0.000 description 9
- 102000004127 Cytokines Human genes 0.000 description 8
- 239000007995 HEPES buffer Substances 0.000 description 8
- 241001465754 Metazoa Species 0.000 description 8
- 235000018417 cysteine Nutrition 0.000 description 8
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 8
- 239000000203 mixture Substances 0.000 description 8
- 239000011780 sodium chloride Substances 0.000 description 8
- 238000012360 testing method Methods 0.000 description 8
- 238000001890 transfection Methods 0.000 description 8
- 108090000695 Cytokines Proteins 0.000 description 7
- 241000700721 Hepatitis B virus Species 0.000 description 7
- 108010029485 Protein Isoforms Proteins 0.000 description 7
- 102000001708 Protein Isoforms Human genes 0.000 description 7
- 150000001412 amines Chemical class 0.000 description 7
- 239000003623 enhancer Substances 0.000 description 7
- 210000003527 eukaryotic cell Anatomy 0.000 description 7
- 150000002632 lipids Chemical class 0.000 description 7
- 230000007246 mechanism Effects 0.000 description 7
- 239000000047 product Substances 0.000 description 7
- 241000894007 species Species 0.000 description 7
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 6
- 108090000565 Capsid Proteins Proteins 0.000 description 6
- 102100023321 Ceruloplasmin Human genes 0.000 description 6
- 102000053602 DNA Human genes 0.000 description 6
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 6
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 6
- 238000012217 deletion Methods 0.000 description 6
- 230000037430 deletion Effects 0.000 description 6
- 229940088598 enzyme Drugs 0.000 description 6
- 239000003102 growth factor Substances 0.000 description 6
- 229910052757 nitrogen Inorganic materials 0.000 description 6
- 239000002904 solvent Substances 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- 239000013603 viral vector Substances 0.000 description 6
- 102000014914 Carrier Proteins Human genes 0.000 description 5
- 101100109292 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) arg-13 gene Proteins 0.000 description 5
- 101100545229 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) ZDS2 gene Proteins 0.000 description 5
- 108700005078 Synthetic Genes Proteins 0.000 description 5
- 101100167209 Ustilago maydis (strain 521 / FGSC 9021) CHS8 gene Proteins 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 238000004132 cross linking Methods 0.000 description 5
- 125000000524 functional group Chemical group 0.000 description 5
- 230000002163 immunogen Effects 0.000 description 5
- 238000001727 in vivo Methods 0.000 description 5
- 238000004949 mass spectrometry Methods 0.000 description 5
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 5
- 102000040430 polynucleotide Human genes 0.000 description 5
- 108091033319 polynucleotide Proteins 0.000 description 5
- 239000002157 polynucleotide Substances 0.000 description 5
- 108020001580 protein domains Proteins 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 102000000844 Cell Surface Receptors Human genes 0.000 description 4
- 102000019034 Chemokines Human genes 0.000 description 4
- 108010012236 Chemokines Proteins 0.000 description 4
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 4
- 206010028980 Neoplasm Diseases 0.000 description 4
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 4
- 108020005067 RNA Splice Sites Proteins 0.000 description 4
- 230000004570 RNA-binding Effects 0.000 description 4
- 108020004682 Single-Stranded DNA Proteins 0.000 description 4
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 4
- 238000007792 addition Methods 0.000 description 4
- 150000001408 amides Chemical group 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- 150000001450 anions Chemical class 0.000 description 4
- DCBDOYDVQJVXOH-UHFFFAOYSA-N azane;1h-indole Chemical compound N.C1=CC=C2NC=CC2=C1 DCBDOYDVQJVXOH-UHFFFAOYSA-N 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 230000001588 bifunctional effect Effects 0.000 description 4
- 108091008324 binding proteins Proteins 0.000 description 4
- 150000001768 cations Chemical class 0.000 description 4
- 150000001875 compounds Chemical group 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 230000001177 retroviral effect Effects 0.000 description 4
- 238000002864 sequence alignment Methods 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- 150000003431 steroids Chemical class 0.000 description 4
- 235000000346 sugar Nutrition 0.000 description 4
- 238000002560 therapeutic procedure Methods 0.000 description 4
- 241001515965 unidentified phage Species 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 3
- 108050006400 Cyclin Proteins 0.000 description 3
- 241000701022 Cytomegalovirus Species 0.000 description 3
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- 101710158865 Large delta antigen Proteins 0.000 description 3
- 239000012124 Opti-MEM Substances 0.000 description 3
- 102100036691 Proliferating cell nuclear antigen Human genes 0.000 description 3
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 3
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical group [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 3
- 108091023040 Transcription factor Proteins 0.000 description 3
- 102000040945 Transcription factor Human genes 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 150000001298 alcohols Chemical class 0.000 description 3
- 125000001931 aliphatic group Chemical group 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 3
- 150000001720 carbohydrates Chemical class 0.000 description 3
- 235000014633 carbohydrates Nutrition 0.000 description 3
- 229910052799 carbon Inorganic materials 0.000 description 3
- 150000005829 chemical entities Chemical class 0.000 description 3
- 238000002983 circular dichroism Methods 0.000 description 3
- 238000013480 data collection Methods 0.000 description 3
- 238000006471 dimerization reaction Methods 0.000 description 3
- 238000009472 formulation Methods 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 238000003875 gradient-accelerated spectroscopy Methods 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- 208000002672 hepatitis B Diseases 0.000 description 3
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 238000002955 isolation Methods 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 238000001840 matrix-assisted laser desorption--ionisation time-of-flight mass spectrometry Methods 0.000 description 3
- 239000012528 membrane Substances 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000008520 organization Effects 0.000 description 3
- 229910052760 oxygen Inorganic materials 0.000 description 3
- 239000001301 oxygen Substances 0.000 description 3
- 102000035123 post-translationally modified proteins Human genes 0.000 description 3
- 108091005626 post-translationally modified proteins Proteins 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 238000003127 radioimmunoassay Methods 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 3
- 150000003384 small molecules Chemical class 0.000 description 3
- 210000001082 somatic cell Anatomy 0.000 description 3
- 230000000087 stabilizing effect Effects 0.000 description 3
- 210000000130 stem cell Anatomy 0.000 description 3
- 229910052717 sulfur Chemical group 0.000 description 3
- 239000011593 sulfur Chemical group 0.000 description 3
- 125000003396 thiol group Chemical group [H]S* 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 125000001493 tyrosinyl group Chemical group [H]OC1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 3
- 241000701447 unidentified baculovirus Species 0.000 description 3
- 239000003981 vehicle Substances 0.000 description 3
- 230000029812 viral genome replication Effects 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 108010078791 Carrier Proteins Proteins 0.000 description 2
- 108090000994 Catalytic RNA Proteins 0.000 description 2
- 102000053642 Catalytic RNA Human genes 0.000 description 2
- 241000282693 Cercopithecidae Species 0.000 description 2
- 102000009410 Chemokine receptor Human genes 0.000 description 2
- 108050000299 Chemokine receptor Proteins 0.000 description 2
- 108700010070 Codon Usage Proteins 0.000 description 2
- 208000003322 Coinfection Diseases 0.000 description 2
- 206010010144 Completed suicide Diseases 0.000 description 2
- 102100025698 Cytosolic carboxypeptidase 4 Human genes 0.000 description 2
- 206010059866 Drug resistance Diseases 0.000 description 2
- 101800003838 Epidermal growth factor Proteins 0.000 description 2
- 102400001368 Epidermal growth factor Human genes 0.000 description 2
- 108010087819 Fc receptors Proteins 0.000 description 2
- 102000009109 Fc receptors Human genes 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 101000932590 Homo sapiens Cytosolic carboxypeptidase 4 Proteins 0.000 description 2
- 108060003951 Immunoglobulin Proteins 0.000 description 2
- 102000014150 Interferons Human genes 0.000 description 2
- 108010050904 Interferons Proteins 0.000 description 2
- 102000015696 Interleukins Human genes 0.000 description 2
- 108010063738 Interleukins Proteins 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- 102000000853 LDL receptors Human genes 0.000 description 2
- 108010001831 LDL receptors Proteins 0.000 description 2
- 241000713666 Lentivirus Species 0.000 description 2
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 2
- 101001033003 Mus musculus Granzyme F Proteins 0.000 description 2
- 108091007494 Nucleic acid- binding domains Proteins 0.000 description 2
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 2
- 241001494479 Pecora Species 0.000 description 2
- 241000235648 Pichia Species 0.000 description 2
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 2
- 102000009572 RNA Polymerase II Human genes 0.000 description 2
- 108010009460 RNA Polymerase II Proteins 0.000 description 2
- 241000235070 Saccharomyces Species 0.000 description 2
- 101100149312 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SER1 gene Proteins 0.000 description 2
- 101100113084 Schizosaccharomyces pombe (strain 972 / ATCC 24843) mcs2 gene Proteins 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 2
- 239000012505 Superdex™ Substances 0.000 description 2
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 2
- 241000711975 Vesicular stomatitis virus Species 0.000 description 2
- 108010067390 Viral Proteins Proteins 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 125000003277 amino group Chemical group 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- 230000000890 antigenic effect Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- UUQMNUMQCIQDMZ-UHFFFAOYSA-N betahistine Chemical compound CNCCC1=CC=CC=N1 UUQMNUMQCIQDMZ-UHFFFAOYSA-N 0.000 description 2
- 239000011230 binding agent Substances 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 238000010382 chemical cross-linking Methods 0.000 description 2
- 230000001684 chronic effect Effects 0.000 description 2
- 238000000978 circular dichroism spectroscopy Methods 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 238000002425 crystallisation Methods 0.000 description 2
- 230000008025 crystallization Effects 0.000 description 2
- 108010057085 cytokine receptors Proteins 0.000 description 2
- 229940127089 cytotoxic agent Drugs 0.000 description 2
- 239000002254 cytotoxic agent Substances 0.000 description 2
- 231100000599 cytotoxic agent Toxicity 0.000 description 2
- 238000010494 dissociation reaction Methods 0.000 description 2
- 230000005593 dissociations Effects 0.000 description 2
- VYFYYTLLBUKUHU-UHFFFAOYSA-N dopamine Chemical compound NCCC1=CC=C(O)C(O)=C1 VYFYYTLLBUKUHU-UHFFFAOYSA-N 0.000 description 2
- 239000003937 drug carrier Substances 0.000 description 2
- 241001493065 dsRNA viruses Species 0.000 description 2
- 238000000132 electrospray ionisation Methods 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 210000002308 embryonic cell Anatomy 0.000 description 2
- 239000000839 emulsion Substances 0.000 description 2
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 2
- 229940116977 epidermal growth factor Drugs 0.000 description 2
- 150000002148 esters Chemical class 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 210000002950 fibroblast Anatomy 0.000 description 2
- 238000001415 gene therapy Methods 0.000 description 2
- 150000004676 glycans Chemical class 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 2
- 208000006454 hepatitis Diseases 0.000 description 2
- 231100000283 hepatitis Toxicity 0.000 description 2
- 150000001261 hydroxy acids Chemical class 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 102000018358 immunoglobulin Human genes 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 2
- 229940047124 interferons Drugs 0.000 description 2
- 229940047122 interleukins Drugs 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 238000007918 intramuscular administration Methods 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 235000005772 leucine Nutrition 0.000 description 2
- 208000019423 liver disease Diseases 0.000 description 2
- 239000008176 lyophilized powder Substances 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 2
- 230000030648 nucleus localization Effects 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 238000004806 packaging method and process Methods 0.000 description 2
- 238000007911 parenteral administration Methods 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 2
- 108091005981 phosphorylated proteins Proteins 0.000 description 2
- 229910052697 platinum Inorganic materials 0.000 description 2
- 229920000728 polyester Polymers 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 229920001282 polysaccharide Polymers 0.000 description 2
- 239000005017 polysaccharide Substances 0.000 description 2
- 239000003755 preservative agent Substances 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 239000011541 reaction mixture Substances 0.000 description 2
- 210000003705 ribosome Anatomy 0.000 description 2
- 108091092562 ribozyme Proteins 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 239000000377 silicon dioxide Substances 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N silicon dioxide Inorganic materials O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- 238000002922 simulated annealing Methods 0.000 description 2
- 230000000392 somatic effect Effects 0.000 description 2
- 238000000527 sonication Methods 0.000 description 2
- 238000007920 subcutaneous administration Methods 0.000 description 2
- 125000000472 sulfonyl group Chemical group *S(*)(=O)=O 0.000 description 2
- 239000013589 supplement Substances 0.000 description 2
- 239000000829 suppository Substances 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 239000003826 tablet Substances 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 229940124597 therapeutic agent Drugs 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 102000035160 transmembrane proteins Human genes 0.000 description 2
- 108091005703 transmembrane proteins Proteins 0.000 description 2
- 102000003390 tumor necrosis factor Human genes 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- VBEQCZHXXJYVRD-GACYYNSASA-N uroanthelone Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C(C)C)[C@@H](C)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CS)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O)C(C)C)[C@@H](C)CC)C1=CC=C(O)C=C1 VBEQCZHXXJYVRD-GACYYNSASA-N 0.000 description 2
- 229960005486 vaccine Drugs 0.000 description 2
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- JFJNVIPVOCESGZ-UHFFFAOYSA-N 2,3-dipyridin-2-ylpyridine Chemical compound N1=CC=CC=C1C1=CC=CN=C1C1=CC=CC=N1 JFJNVIPVOCESGZ-UHFFFAOYSA-N 0.000 description 1
- MHOFGBJTSNWTDT-UHFFFAOYSA-M 2-[n-ethyl-4-[(6-methoxy-3-methyl-1,3-benzothiazol-3-ium-2-yl)diazenyl]anilino]ethanol;methyl sulfate Chemical compound COS([O-])(=O)=O.C1=CC(N(CCO)CC)=CC=C1N=NC1=[N+](C)C2=CC=C(OC)C=C2S1 MHOFGBJTSNWTDT-UHFFFAOYSA-M 0.000 description 1
- 101710169336 5'-deoxyadenosine deaminase Proteins 0.000 description 1
- 102100036664 Adenosine deaminase Human genes 0.000 description 1
- 108060003345 Adrenergic Receptor Proteins 0.000 description 1
- 102000017910 Adrenergic receptor Human genes 0.000 description 1
- 241000710929 Alphavirus Species 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- 229920000856 Amylose Polymers 0.000 description 1
- 108020004491 Antisense DNA Proteins 0.000 description 1
- 102000018655 Apolipoproteins C Human genes 0.000 description 1
- 108010027070 Apolipoproteins C Proteins 0.000 description 1
- 102000013918 Apolipoproteins E Human genes 0.000 description 1
- 108010025628 Apolipoproteins E Proteins 0.000 description 1
- 101100107610 Arabidopsis thaliana ABCF4 gene Proteins 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 241000714230 Avian leukemia virus Species 0.000 description 1
- 241001485018 Baboon endogenous virus Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- LSNNMFCWUKXFEE-UHFFFAOYSA-M Bisulfite Chemical compound OS([O-])=O LSNNMFCWUKXFEE-UHFFFAOYSA-M 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 241000714266 Bovine leukemia virus Species 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- 108090000312 Calcium Channels Proteins 0.000 description 1
- 102000003922 Calcium Channels Human genes 0.000 description 1
- 102100025064 Cellular tumor antigen p53 Human genes 0.000 description 1
- 108091006146 Channels Proteins 0.000 description 1
- 108091028075 Circular RNA Proteins 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 102100022641 Coagulation factor IX Human genes 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 238000011537 Coomassie blue staining Methods 0.000 description 1
- 241000711573 Coronaviridae Species 0.000 description 1
- 102100025287 Cytochrome b Human genes 0.000 description 1
- 108010075028 Cytochromes b Proteins 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 241000450599 DNA viruses Species 0.000 description 1
- 241001533413 Deltavirus Species 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- 102000001301 EGF receptor Human genes 0.000 description 1
- 108060006698 EGF receptor Proteins 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 108010092408 Eosinophil Peroxidase Proteins 0.000 description 1
- 102100028471 Eosinophil peroxidase Human genes 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 241001524679 Escherichia virus M13 Species 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 1
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 1
- 108010076282 Factor IX Proteins 0.000 description 1
- 108010054218 Factor VIII Proteins 0.000 description 1
- 102000001690 Factor VIII Human genes 0.000 description 1
- 241000714165 Feline leukemia virus Species 0.000 description 1
- 241000714174 Feline sarcoma virus Species 0.000 description 1
- 241000710831 Flavivirus Species 0.000 description 1
- 208000000666 Fowlpox Diseases 0.000 description 1
- IECPWNUMDGFDKC-UHFFFAOYSA-N Fusicsaeure Natural products C12C(O)CC3C(=C(CCC=C(C)C)C(O)=O)C(OC(C)=O)CC3(C)C1(C)CCC1C2(C)CCC(O)C1C IECPWNUMDGFDKC-UHFFFAOYSA-N 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- 241000713813 Gibbon ape leukemia virus Species 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 102000004547 Glucosylceramidase Human genes 0.000 description 1
- 108010017544 Glucosylceramidase Proteins 0.000 description 1
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Natural products NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 108010017080 Granulocyte Colony-Stimulating Factor Proteins 0.000 description 1
- 102000004269 Granulocyte Colony-Stimulating Factor Human genes 0.000 description 1
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 1
- 102000004457 Granulocyte-Macrophage Colony-Stimulating Factor Human genes 0.000 description 1
- 241000941423 Grom virus Species 0.000 description 1
- 101000600434 Homo sapiens Putative uncharacterized protein encoded by MIR7-3HG Proteins 0.000 description 1
- 101000716102 Homo sapiens T-cell surface glycoprotein CD4 Proteins 0.000 description 1
- 101000946843 Homo sapiens T-cell surface glycoprotein CD8 alpha chain Proteins 0.000 description 1
- 101000611183 Homo sapiens Tumor necrosis factor Proteins 0.000 description 1
- 108091006905 Human Serum Albumin Proteins 0.000 description 1
- 102000008100 Human Serum Albumin Human genes 0.000 description 1
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 1
- 241000725303 Human immunodeficiency virus Species 0.000 description 1
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 1
- 102000018071 Immunoglobulin Fc Fragments Human genes 0.000 description 1
- 108010091135 Immunoglobulin Fc Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000004877 Insulin Human genes 0.000 description 1
- 108090001061 Insulin Proteins 0.000 description 1
- 108010001127 Insulin Receptor Proteins 0.000 description 1
- 102000003746 Insulin Receptor Human genes 0.000 description 1
- 108010064593 Intercellular Adhesion Molecule-1 Proteins 0.000 description 1
- 102100037877 Intercellular adhesion molecule 1 Human genes 0.000 description 1
- 102000001617 Interferon Receptors Human genes 0.000 description 1
- 108010054267 Interferon Receptors Proteins 0.000 description 1
- 108090000862 Ion Channels Proteins 0.000 description 1
- 102000004310 Ion Channels Human genes 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- 125000000174 L-prolyl group Chemical group [H]N1C([H])([H])C([H])([H])C([H])([H])[C@@]1([H])C(*)=O 0.000 description 1
- 125000000510 L-tryptophano group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[C@@]([H])(C(O[H])=O)N([H])[*])C2=C1[H] 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 108010046938 Macrophage Colony-Stimulating Factor Proteins 0.000 description 1
- 102000007651 Macrophage Colony-Stimulating Factor Human genes 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 241000713821 Mason-Pfizer monkey virus Species 0.000 description 1
- 201000005505 Measles Diseases 0.000 description 1
- UMQAIKKJIZYHQC-UHFFFAOYSA-M Milling yellow 3G Chemical compound ClC=1C=CC(=C(C=1)S(=O)(=O)[O-])N1N=C(C(=C1O)N=NC1=CC=C(C=C1)OS(=O)(=O)C1=CC=C(C=C1)C)C.[Na+] UMQAIKKJIZYHQC-UHFFFAOYSA-M 0.000 description 1
- 241000713862 Moloney murine sarcoma virus Species 0.000 description 1
- 241000713333 Mouse mammary tumor virus Species 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 241000714177 Murine leukemia virus Species 0.000 description 1
- 101100059320 Mus musculus Ccdc85b gene Proteins 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 241000714209 Norwalk virus Species 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 241000702244 Orthoreovirus Species 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 108091008606 PDGF receptors Proteins 0.000 description 1
- 108020002230 Pancreatic Ribonuclease Proteins 0.000 description 1
- 102000005891 Pancreatic ribonuclease Human genes 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108010001014 Plasminogen Activators Proteins 0.000 description 1
- 102000001938 Plasminogen Activators Human genes 0.000 description 1
- 102000011653 Platelet-Derived Growth Factor Receptors Human genes 0.000 description 1
- 239000004952 Polyamide Substances 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 239000004372 Polyvinyl alcohol Substances 0.000 description 1
- 102000004257 Potassium Channel Human genes 0.000 description 1
- 102000029797 Prion Human genes 0.000 description 1
- 108091000054 Prion Proteins 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 241000125945 Protoparvovirus Species 0.000 description 1
- 102100037401 Putative uncharacterized protein encoded by MIR7-3HG Human genes 0.000 description 1
- 238000010357 RNA editing Methods 0.000 description 1
- 230000026279 RNA modification Effects 0.000 description 1
- 239000012980 RPMI-1640 medium Substances 0.000 description 1
- 206010037742 Rabies Diseases 0.000 description 1
- 241000711798 Rabies lyssavirus Species 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 101710119735 Replication termination protein Proteins 0.000 description 1
- 241000714474 Rous sarcoma virus Species 0.000 description 1
- 101100068078 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GCN4 gene Proteins 0.000 description 1
- 101100379247 Salmo trutta apoa1 gene Proteins 0.000 description 1
- 206010039491 Sarcoma Diseases 0.000 description 1
- 241001515849 Satellite Viruses Species 0.000 description 1
- 102100030053 Secreted frizzled-related protein 3 Human genes 0.000 description 1
- 241000713311 Simian immunodeficiency virus Species 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- 241000713675 Spumavirus Species 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 101000874347 Streptococcus agalactiae IgA FC receptor Proteins 0.000 description 1
- 108010023197 Streptokinase Proteins 0.000 description 1
- 230000024932 T cell mediated immunity Effects 0.000 description 1
- 108091008874 T cell receptors Proteins 0.000 description 1
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 1
- 208000000389 T-cell leukemia Diseases 0.000 description 1
- 208000028530 T-cell lymphoblastic leukemia/lymphoma Diseases 0.000 description 1
- 102100036011 T-cell surface glycoprotein CD4 Human genes 0.000 description 1
- 102100034922 T-cell surface glycoprotein CD8 alpha chain Human genes 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 108010001244 Tli polymerase Proteins 0.000 description 1
- 108060008683 Tumor Necrosis Factor Receptor Proteins 0.000 description 1
- 102000044209 Tumor Suppressor Genes Human genes 0.000 description 1
- 108700025716 Tumor Suppressor Genes Proteins 0.000 description 1
- 102100040247 Tumor necrosis factor Human genes 0.000 description 1
- 206010046865 Vaccinia virus infection Diseases 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 108010020277 WD repeat containing planar cell polarity effector Proteins 0.000 description 1
- 102000006757 Wnt Receptors Human genes 0.000 description 1
- 108010047118 Wnt Receptors Proteins 0.000 description 1
- 241000714205 Woolly monkey sarcoma virus Species 0.000 description 1
- 108010084455 Zeocin Proteins 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 108091005646 acetylated proteins Proteins 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108091005598 amidated proteins Proteins 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- QGZKDVFQNNGYKY-UHFFFAOYSA-O ammonium group Chemical group [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 1
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 1
- 239000003708 ampul Substances 0.000 description 1
- 238000013103 analytical ultracentrifugation Methods 0.000 description 1
- 239000002870 angiogenesis inducing agent Substances 0.000 description 1
- 230000001772 anti-angiogenic effect Effects 0.000 description 1
- 239000003816 antisense DNA Substances 0.000 description 1
- 239000003443 antiviral agent Substances 0.000 description 1
- 229940121357 antivirals Drugs 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 210000001130 astrocyte Anatomy 0.000 description 1
- 239000012298 atmosphere Substances 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 239000012752 auxiliary agent Substances 0.000 description 1
- 208000004668 avian leukosis Diseases 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 102000039554 bZIP family Human genes 0.000 description 1
- 108091067354 bZIP family Proteins 0.000 description 1
- 238000005452 bending Methods 0.000 description 1
- SQVRNKJHWKZAKO-UHFFFAOYSA-N beta-N-Acetyl-D-neuraminic acid Natural products CC(=O)NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO SQVRNKJHWKZAKO-UHFFFAOYSA-N 0.000 description 1
- 239000003833 bile salt Substances 0.000 description 1
- 229940093761 bile salts Drugs 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 229920001222 biopolymer Polymers 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000000601 blood cell Anatomy 0.000 description 1
- 229960000182 blood factors Drugs 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 1
- 210000004413 cardiac myocyte Anatomy 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 230000022534 cell killing Effects 0.000 description 1
- 125000003636 chemical group Chemical group 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000004040 coloring Methods 0.000 description 1
- 239000003431 cross linking reagent Substances 0.000 description 1
- JHIVVAPYMSGYDF-UHFFFAOYSA-N cyclohexanone Chemical compound O=C1CCCCC1 JHIVVAPYMSGYDF-UHFFFAOYSA-N 0.000 description 1
- 102000003675 cytokine receptors Human genes 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 108091005591 demethylated proteins Proteins 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 210000004443 dendritic cell Anatomy 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000013400 design of experiment Methods 0.000 description 1
- 230000000368 destabilizing effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 238000011026 diafiltration Methods 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- 235000014113 dietary fatty acids Nutrition 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- 102000038379 digestive enzymes Human genes 0.000 description 1
- 108091007734 digestive enzymes Proteins 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 229960003638 dopamine Drugs 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000005421 electrostatic potential Methods 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000002532 enzyme inhibitor Substances 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- 210000003743 erythrocyte Anatomy 0.000 description 1
- 210000002744 extracellular matrix Anatomy 0.000 description 1
- 229960004222 factor ix Drugs 0.000 description 1
- 229960000301 factor viii Drugs 0.000 description 1
- 229930195729 fatty acid Natural products 0.000 description 1
- 239000000194 fatty acid Substances 0.000 description 1
- 239000012894 fetal calf serum Substances 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- 235000013355 food flavoring agent Nutrition 0.000 description 1
- 235000003599 food sweetener Nutrition 0.000 description 1
- 208000020217 fulminant viral hepatitis Diseases 0.000 description 1
- 229960004675 fusidic acid Drugs 0.000 description 1
- IECPWNUMDGFDKC-MZJAQBGESA-N fusidic acid Chemical compound O[C@@H]([C@@H]12)C[C@H]3\C(=C(/CCC=C(C)C)C(O)=O)[C@@H](OC(C)=O)C[C@]3(C)[C@@]2(C)CC[C@@H]2[C@]1(C)CC[C@@H](O)[C@H]2C IECPWNUMDGFDKC-MZJAQBGESA-N 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 102000034356 gene-regulatory proteins Human genes 0.000 description 1
- 108091006104 gene-regulatory proteins Proteins 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- 108091005608 glycosylated proteins Proteins 0.000 description 1
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 1
- 150000002357 guanidines Chemical class 0.000 description 1
- 238000012188 high-throughput screening assay Methods 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 230000028996 humoral immune response Effects 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 229920003063 hydroxymethyl cellulose Polymers 0.000 description 1
- 229940031574 hydroxymethyl cellulose Drugs 0.000 description 1
- 238000010166 immunofluorescence Methods 0.000 description 1
- 238000010820 immunofluorescence microscopy Methods 0.000 description 1
- 239000007943 implant Substances 0.000 description 1
- 238000002513 implantation Methods 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 208000037797 influenza A Diseases 0.000 description 1
- 239000012194 insect media Substances 0.000 description 1
- 229940125396 insulin Drugs 0.000 description 1
- 102000006495 integrins Human genes 0.000 description 1
- 108010044426 integrins Proteins 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 150000002614 leucines Chemical class 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 230000033001 locomotion Effects 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 210000002540 macrophage Anatomy 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 238000001471 micro-filtration Methods 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 229940031348 multivalent vaccine Drugs 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 210000004498 neuroglial cell Anatomy 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 1
- 239000002687 nonaqueous vehicle Substances 0.000 description 1
- 239000000346 nonvolatile oil Substances 0.000 description 1
- 230000005937 nuclear translocation Effects 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 229920001542 oligosaccharide Polymers 0.000 description 1
- 150000002482 oligosaccharides Chemical class 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 230000002688 persistence Effects 0.000 description 1
- 239000000825 pharmaceutical preparation Substances 0.000 description 1
- 230000003285 pharmacodynamic effect Effects 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- CWCMIVBLVUHDHK-ZSNHEYEWSA-N phleomycin D1 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC[C@@H](N=1)C=1SC=C(N=1)C(=O)NCCCCNC(N)=N)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C CWCMIVBLVUHDHK-ZSNHEYEWSA-N 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 150000003904 phospholipids Chemical class 0.000 description 1
- 125000005496 phosphonium group Chemical group 0.000 description 1
- 229940127126 plasminogen activator Drugs 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 231100000572 poisoning Toxicity 0.000 description 1
- 230000000607 poisoning effect Effects 0.000 description 1
- 229920000747 poly(lactic acid) Polymers 0.000 description 1
- 229920000548 poly(silane) polymer Polymers 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920002647 polyamide Polymers 0.000 description 1
- 108010094020 polyglycine Proteins 0.000 description 1
- 229920000232 polyglycine polymer Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 229920002451 polyvinyl alcohol Polymers 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 108020001213 potassium channel Proteins 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 230000005180 public health Effects 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 108010043277 recombinant soluble CD4 Proteins 0.000 description 1
- 230000007115 recruitment Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 238000004007 reversed phase HPLC Methods 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 102220240346 rs764757062 Human genes 0.000 description 1
- 239000012266 salt solution Substances 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- SQVRNKJHWKZAKO-OQPLDHBCSA-N sialic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)OC1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-OQPLDHBCSA-N 0.000 description 1
- 238000007086 side reaction Methods 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- RMAQACBXLXPBSY-UHFFFAOYSA-N silicic acid Chemical compound O[Si](O)(O)O RMAQACBXLXPBSY-UHFFFAOYSA-N 0.000 description 1
- 235000012239 silicon dioxide Nutrition 0.000 description 1
- 210000000329 smooth muscle myocyte Anatomy 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008174 sterile solution Substances 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 229960005202 streptokinase Drugs 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 239000003765 sweetening agent Substances 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 239000000454 talc Substances 0.000 description 1
- 229910052623 talc Inorganic materials 0.000 description 1
- 235000012222 talc Nutrition 0.000 description 1
- ISIJQEHRDSCQIU-UHFFFAOYSA-N tert-butyl 2,7-diazaspiro[4.5]decane-7-carboxylate Chemical compound C1N(C(=O)OC(C)(C)C)CCCC11CNCC1 ISIJQEHRDSCQIU-UHFFFAOYSA-N 0.000 description 1
- 150000003573 thiols Chemical class 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 102000003298 tumor necrosis factor receptor Human genes 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241000712461 unidentified influenza virus Species 0.000 description 1
- 208000007089 vaccinia Diseases 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 229920002554 vinyl polymer Polymers 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2299/00—Coordinates from 3D structures of peptides, e.g. proteins or enzymes
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/10011—Arenaviridae
- C12N2760/10111—Deltavirus, e.g. hepatitis delta virus
- C12N2760/10122—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
Definitions
- the hepatitis D virus is a small satellite virus of hepatitis B virus (HBV). Coinfection with HBV and HDV causes severe and sometimes fatal liver disease in humans.
- the HDV genome encodes a single known protein, the hepatitis delta antigen (HDAg).
- This invention is based on the discovery of the high resolution crystal structure of a synthetic peptide corresponding to residues 12-60 of the hepatitis delta antigen (HDAg).
- This peptide includes a coiled-coil region believed to be important for dimerization of HDAg.
- the peptide forms an antiparallel coiled coil with hydrophobic residues near the termini of each peptide forming an extensive hydrophobic core with residues C-terminal to the coiled-coil domain in the dimer protein.
- the crystal structure shows how HDAg forms dimers, but, surprisingly, also shows the dimers forming an octameric structure that forms a large 50 ⁇ ring lined with basic sidechains.
- the dimers associate further to form octamers through residues in the coiled-coil domain that are not involved in a heptad repeat, as well as through residues C-terminal to this region.
- the crystal structure of the peptide and cross-linking hydrodynamic studies which show that the full-length recombinant protein also forms octamers suggest that the structure of the delta antigen represents a previously unseen organization of a viral nucleocapsid protein. This N-terminal octamer can serve as a convenient high-valency framework for linking a variety of functional peptides and domains.
- the invention includes HDAg proteins, including derivatives, mutants and fragments, and nucleic acid molecules encoding HDAg.
- Derivatives of HDAg protein include fusion molecules.
- the fusion molecule comprises HDAg and at least one binding moiety bound, for example, to the HDAg through the C terminus, N terminus and/or other amino acid.
- the binding moiety can be selected from the group consisting of an antigen, an antibody, a ligand, a receptor, an enzyme, a ligand interaction peptide, a chemical, an effector, an oligonucleotide, a signal amplification peptide, an enhancer recognition protein, a promoter binding protein, a label, a growth factor, a cytokine, a nuclease, a small organic molecule, a test substance, a cytotoxic agent, a substrate, a solid substrate, a drug, or a fragment thereof.
- the fusion molecules of the invention can also comprise two binding moieties which are binding partners.
- the fusion molecule can be a fusion protein.
- the HDAg and the binding moiety can be chemically linked or the HDAg and the binding moiety can be expressed as a single unit.
- the invention also relates to coiled-coil oligomers comprising at least two such fusion molecules.
- the coiled-coil oligomer can be an octamer.
- the two fusion molecules can be the same or different.
- a nucleic acid molecule can comprise a nucleotide sequence depicted in FIG. 9 , nucleotides 37-150 of FIG. 9 , nucleotides 37-186 of FIG. 9 , FIG. 10 , nucleotides 1421-1566 of FIG. 10 , nucleotides 1457-1566 of FIG. 10 , FIG. 15 or FIG. 16 .
- the nucleic acid molecule can also comprise a nucleotide sequence which encodes a polypeptide comprising an amino acid sequence depicted in a row of FIG. 1 , amino acids 12-48 of a row of FIG. 1 , the top row of FIG. 3C , FIG.
- amino acids 12-48 of a row of FIG. 9 , FIG. 10 , amino acids 12-88 of FIG. 10 , FIG. 11 or FIG. 17 are also included. Also included are complementary strands of these sequences, DNA sequences that hybridize to the sequences, RNA sequences transcribed from the sequences, or a fragment or mutation thereof, which encodes a coiled-coil oligomer.
- An isolated nucleic acid molecule can be a fusion molecule described herein.
- the invention also includes fusion genes comprising an HDAg nucleic acid molecule operably linked to a nucleic acid molecule encoding a heterologous (non-HDAg) peptide.
- a polypeptide comprises an amino acid sequence encoded by an HDAg nucleic acid molecule.
- the molecules can comprise a polypeptide having an amino acid sequence selected from the group consisting of an amino acid sequence depicted in a row of FIG. 1 , amino acids 12-48 of a row of FIG. 1 , amino acids 12-60 of a row of FIG. 1 , the top row of FIG. 3C , FIG. 9 , amino acids 12-48 of FIG. 9 , amino acids 12-60 of FIG. 9 , FIG. 10 , FIG. 11 and FIG.
- the peptide can be a derivative peptide wherein a serine residue is substituted with cysteine.
- the molecules can comprise a polypeptide comprising an amino acid sequence of amino acids 12-88 of HDAg, or a fragment or derivative thereof which forms a coiled-coil oligomer and nuclear localization signal.
- the polypeptides can be encoded by fusion genes comprising HDAg. It is possible that the molecule can be larger than the 12-48 or 12-60 or 12-88 amino acids, for example. It may be desirable to make a 12-65 or 10-93 peptide, for example.
- the invention also includes vectors which can express HDAg.
- the vectors can comprise a nucleic acid molecule which encodes a subunit of an HDAg coiled-coil octamer.
- the nucleic acid molecule can comprise a sequence listed above.
- the nucleic acid molecule can encode a fusion molecule.
- the vector can be a nucleic acid molecule encoding HDAg and at least one multiple cloning site. The multiple cloning site(s) can be located 3′ to the nucleic acid molecule encoding HDAg or 5′ to the nucleic acid molecule encoding HDAg.
- the vector can further comprise a nucleic acid molecule encoding a nuclear localization signal.
- a vector can further comprise a nucleic acid molecule which encodes a heterologous gene.
- the vector can express a fusion molecule of HDAg wherein a first heterologous gene encodes a first binding moiety and a second heterologous gene encodes a second binding moiety.
- the invention also encompasses host cells which comprise a nucleic acid molecule which encodes a molecule of HDAg, including a fusion molecule.
- the invention also includes methods of manufacturing such a host cell comprising a nucleic acid molecule encoding a fusion molecule comprising HDAg and at least one binding moiety, by introducing a vector of the invention into the host cell.
- the invention also relates to methods of using the molecules, i.e., peptides, nucleic acids, and vectors of the invention.
- One method comprises expressing a high valency display of at least one binding moiety comprising introducing into a cell with a vector comprising a nucleic acid molecule encoding HDAg and a nucleic acid molecule encoding the binding entity and culturing the cell under conditions sufficient to permit expression of the binding moiety and HDAg.
- the invention also encompasses a method of enhancing interaction between binding partners comprising contacting a fusion molecule of HDAg with a second binding moiety wherein the first and second moieties are binding partners.
- the fusion molecule can present the first and second moieties.
- the interaction between ligands can occur in solution, on membranes or on surfaces.
- the fusion molecule can be a subunit of a coiled-coil oligomer, e.g., an octamer, and the first and second moieties are bound to the oligomer.
- fusion of a first cell and a second cell is enhanced.
- the invention also includes a method for delivering molecules to a cell comprising contacting them with an HDAg fusion molecule.
- the binding moiety is an oligonucleotide.
- the oligonucleotide can hybridize to a nucleic acid molecule in the cell.
- the fusion molecule can further comprise a double-stranded nuclease.
- the fusion molecule comprises a first binding moiety and a second binding moiety wherein the first binding moiety interacts with a binding partner and the second binding moiety functions as an effector.
- the first binding moiety can interact with a cell surface receptor on a cell and the second binding moiety can kill the cell.
- the invention also includes a method of amplifying a signal in a solid phase assay comprising coupling an HDAg octamer with at least one copy of a domain which interacts with a ligand and at least two copies of a label.
- the label can be, for example, alkaline phosphatase, a radiolabel, streptadavin, and green fluorescent protein.
- the solid phase assay is an ELISA assay.
- the invention also encompasses a method of facilitating exchange of substrates and products comprising coupling an HDAg oligomer to at least two enzymes which function in a linked pathway.
- the invention also encompasses a method of enhancing a reaction between at least two binding partners comprising coupling the binding partners to an HDAg oligomer.
- the method of enhancing a reaction between two binding partners comprises coupling one binding partner to an HDAg oligomer and contacting the oligomer to a second binding partner.
- FIG. 1 depicts sequence alignment of 11 serotypes of hepatitis delta antigen (HDAg) between amino acids 12 to 60.
- Asterisks, * indicate residues which make up the a and d positions in the heptad repeat in the predicted coiled-coil region.
- Bold pink and purple indicate residues involved in the hydrophobic interactions in the dimer between the two termini.
- the “ds” indicate residues involved in the dimer-dimer interface.
- FIG. 2 depicts the final atomic model superimposed upon a portion of the final 1.8 A resolution 2F O -F C map.
- the map is contoured at 1.2 ⁇ and shows the residues involved in the interaction between monomers at the A and C regions. Orientation is similar to that in FIG. 4 . Yellow indicates carbon, red indicates oxygen and blue indicates nitrogen. The figure was produced using BOBSCRIPT.
- FIG. 3A depicts C ⁇ trace of the peptide ⁇ 12-60(Y). A region pink, B region green, and C region purple. The individual helix takes a sharp bend at proline 49 (Pro49).
- FIG. 3B is a ribbon diagram of the view in FIG. 3A rotated 90° along the horizontal axis. The sidechains have been added and the C region of the peptide (residues 50-60(Y)) has been removed for clarity. Sidechains are colored as follows: hydrophobic gray, polar yellow, acidic red and basic blue.
- FIG. 3C is the amino acid sequence of the long helix formed from residues 12 to 48 displayed in the antiparallel orientation of the peptide.
- FIG. 4 depicts the monomer-monomer interactions.
- the regions are colored as follows: A region pink, B region green, and C region purple.
- the white row of X's indicates a hydrogen bond between the sidechain of Glu45 (E45) and the indole nitrogen of Trp20 (W20).
- This figure was produced using RIBBONS, Carson, M. & Bugg, C. E., J. Mol. Graphics, 4:121-122 (1986).
- FIG. 5 depicts the interactions of dimers in the P2 1 2 1 2 unit cell.
- the unit cell is outlined in black and the directions of the A and B axes are shown.
- the two independent copies of the dimer in the asymmetric unit are colored orange and blue.
- the view is looking down the crystallographic 2-fold axis. This figure was produced using the program MOLSCRIPT (Kraulis, P. J. J. Appl. Cryst. 24:946-950 (1991)).
- FIG. 6 depicts the dimer-dimer interface.
- FIG. 6A illustrates that the dimer-dimer interface is composed of a four-helix bundle made of the N and C termini of two dimers, one from across the crystallographic two-fold axis. One (unlabeled) dimer is colored yellow and the other (labeled) dimer is colored according to the scheme used in FIG. 1 .
- FIG. 6B depicts the view in FIG. 6A rotated 90° around the y axis. This figure was created with RIBBONS (Carson, M. & Bugg, C. E., J. Mol. Graph. 4:121-122 (1986)).
- RIBBONS Carlson, M. & Bugg, C. E., J. Mol. Graph. 4:121-122 (1986)
- FIG. 7B illustrates the hole formed by the octamer.
- FIG. 8 graphically illustrates MALDI-TOF mass spectrometry analysis of recombinant small delta antigen (r-HDAg-S) ( FIG. 8A ), and the glutaraldehyde cross-linked protein ( FIG. 8B ).
- FIG. 9 depicts the sequence of a synthetic gene for optimized expression of HDAg-S in E. coli.
- the synthetic gene has been modified such that the codon usage which is unusual in the natural gene is assistant with the known preferences for codon usage in E. coli.
- the underlined sequences correspond to the eight primers used in the first round of PCR.
- the primers used in the second round are indicated with a dotted underline.
- the amino acid sequence is shown above the DNA sequence by the one-letter amino acid code.
- the restriction sites used in cloning are shown in italics.
- FIG. 10 depicts the complete sequence of human HDV cDNA, and the predicted amino-acid sequence of human HDV delta antigen.
- FIG. 11 depicts synthetic peptides from the multimer-forming domain of HDAg.
- A Structural organization of HDAg. The lightly stippled region is the multimer-forming domain (amino acid residues 12-60), the solid regions are the RNA-binding domains, and the heavily stippled region is the C-terminal extension of large HDAg. Hydrophobic residues contributing to the heptad repeat are shown in boldface type,
- B Amino acid sequences of three HDAg peptides.
- FIG. 12 depicts use of the delta antigen as a scaffold.
- FIG. 12A depicts an oligomerization domain and spheres which represent potential effectors/ligands/nucleic acid binding domains etc. that have been fused to the C-terminus of the oligomerization domain of the delta antigen.
- FIG. 12B depicts binding domain which is a multimer, specifically a tetramer.
- a similar fusion of up to eight effector/legands/nucleic acid binding domains could be made at the N-terminus, and constructs with fusion domains at up to eight N-termini and up to eight C-termini could also be made.
- FIG. 13 depicts plasmids comprising HDAg for expression in bacteria.
- Dg refers to a delta antigen sequence (with or without nuclear localization sequence);
- MCS1 refers to a multiple cloning site for insertion of a heterologous gene at N-terminal end of delta antigen;
- MCS2 refers to a multiple cloning site for insertion of a heterologous gene a C-terminal end of delta antigen;
- Ori refers to the origin of replication for the bacteria;
- drug refers to a drug marker;
- f1 ori refers to origin of replication of single-stranded DNA by a bacteriophage;
- promoter refers to bacterial promoter.
- FIG. 13A depicts MCS1 3′ to HDAg;
- FIG. 13B depicts MCS1 5′ to HDAg;
- FIG. 13C depicts MCS1 3′ and MCS2 5′ to HDAg.
- FIG. 14 depicts a plasmid comprising HDAg for expression in eukaryotic cells.
- Dg refers to a delta antigen sequence (with or without nuclear localization sequence).
- MCS1 refers to plasmids comprising replication of plasma in bacteria;
- Ori refers to origin of expression or replication of the plasmid in bacterial cells;
- drug1 refers to a drug marker (e.g. ampicillin, kanamycin) for propagation of plasmid in E. coli;
- drug2 refers to a drug marker or eukaryotic drug resistance (e.g.
- f1 ori refers to origin of replication of single-stranded DNA by bacteriophage
- promoter refers to eukaryotic promoter
- FIG. 15 is a comparison of the wildtype nucleotide sequence of HDAg-S and the sequence of the synthetic HDAg gene for optimized expression in E. coli.
- FIG. 16 is the nucleotide sequence of the synthetic open reading frame (ORF) for the synthetic HDAg.
- FIG. 17 is a comparison of the protein amino acid sequence encoded by the wildtype ORF and the synthetic ORF, showing complete (100%) identity.
- FIG. 18 depicts the nucleotide sequences of the primers used for the two polymerase chain reactions (PCR) to create the synthetic gene.
- Primer1-primer8 were used in the first round of PCR and primer9-primer10 were used in the second round of PCR.
- the present invention relates to the discovery of the oligomeric structure of the hepatitis delta antigen (HDAg) which serves as a convenient high-valency framework for linking a variety of binding partners, including functional peptides and domains.
- the structure of the antigen includes a doughnut-shaped octamer comprising N-terminal antiparallel coiled-coil domains and stabilizing C-terminal domains.
- the invention includes HDAg proteins, including derivatives, mutants and fragments, and nucleic acid molecules encoding HDAg. It also includes an altered HDAg gene for the capsid protein wherein the codons conform to use preferences for E. coli.
- fusion molecules e.g., fusion proteins, in which one or more binding moieties are attached to one or both termini of a monomer and coiled-coil oligomers (e.g., octamers) formed from the monomers.
- Coiled-coil oligomers of the present invention can comprise one or more fusion molecules as described herein.
- the binding moieties can be, for example the same (homologous) or different (heterologous) binding partners.
- the invention also includes vectors and cassette expression systems which can be used to produce the fusion molecules.
- the vectors comprise HDAg and one or more binding moieties which are operably linked to HDAg.
- the invention also relates to cells comprising HDAg nucleic acid, e.g. cells transformed with such vectors, and to methods of producing such cells.
- the invention also includes therapeutic and diagnostic methods involving HDAg.
- HDAg includes both the large and small delta antigens (HDAg-S and HDAg-L).
- HDAg encompasses native (“wild type”) proteins and also includes derivatives, mutations, and functional protein or polypeptide fragments of the native protein and/or proteins or polypeptides where one or more amino acids have been deleted, added or substituted.
- HDAg can be isolated and/or purified, or it can be recombinant or prepared by synthetic techniques described herein or known to those of skill in the art.
- HDAg proteins or fragments can be isolated from the cell of origin or produced synthetically or recombinantly. In a preferred embodiment, the protein is isolated to the substantial absence of conspecific proteins.
- a conspecific protein is a protein other than HDAg which can be obtained from the cell of origin for the protein or its nucleic acid.
- the proteins (and nucleic acids) described herein can be preferably isolated, by known methods, to a purity of at least about 50% by weight, more preferably at least about 75% and most preferably to substantial homogeneity. “Substantial homogeneity” refers to the substantial absence of conspecific proteins.
- An HDAg peptide can, e.g., include all or a portion of the amino acids depicted FIGS. 1, 3C , 10 , 11 or 17 .
- An HDAg peptide can be encoded by an isolated and/or purified or recombinant nucleic acid molecule or a fusion gene (nucleic acid molecule) such as those described herein.
- an isolated and purified polypeptide has an amino acid sequence depicted in a row of FIG. 1 , amino acids 12-48 of a row of FIG. 1 , amino acids 12-60 of a row of FIG. 1 , a row of FIG. 3C , FIG. 9 , amino acids 12-48 of FIG. 9 , amino acids 12-60 of FIG. 9 , FIG. 10 , FIG. 11 , FIG. 17 or a fragment or derivative thereof, which forms a coiled-coil oligomer.
- an isolated and purified polypeptide has the amino acid sequence of amino acids 12-88 of HDAg, or a fragment or derivative thereof, which forms a coiled-coil oligomer and nuclear localization signal.
- “Homology” is defined herein as sequence identity.
- the protein or polypeptide shares at least about 50% sequence identity or homology and more preferably at least about 75% identify or at least about 90% identity with the corresponding sequences of the native protein, for example, with FIG. 10 .
- the phrase “substantially the same sequence” is intended to include sequences which bind the viral protein and possess a high percentage of (e.g., at least 90%, preferably at least about 95%) amino acid sequence identity with the native sequence.
- a derivative, e.g., a mutant or variant can possess substantially the same amino acid sequence as the native protein.
- amino acid sequence substitutions
- the modifications to the amino acid sequence can be conserved or non-conserved, natural or unnatural amino acids.
- the residues that function to form or stabilize the coiled-coil domain or binding sites thereof can be substituted, e.g., conservatively, or they can be maintained.
- Amino acids of the native sequence for substitution, deletion or conservation can be identified, for example, by a sequence alignment between proteins from different serotypes from related species or other related proteins.
- the amino acids which are deleted, added or substituted are amino acids which are not “conserved” between serotypes or species, for example, the amino acids so identified in the sequence alignment exemplified in FIG. 1 .
- conserveed amino acids may also be substituted.
- a suitable derivative or mutant of the HDAg protein is a protein possessing a consensus sequence of the originating species.
- the derivative does not contain substitutes of the residues of Arg13, Leu17, Trp20, Arg24, Trp50 or Leu51. In another embodiment, it contains only conservative substitutions in this region. Hydrophobic residues, for example Ile16, Leu17, Trp20, Trp50 and Leu51 can be maintained, or they can be replaced with other hydrophobic amino acids, for example, those from the group consisting of Trp, Phe, Ala, Pro, Leu, Met, Ile and Val. In another example, the residues Glu31, Lys38, Trp20 and Glu45 are not substituted or are substituted conservatively. In addition, Arg13 and Arg24 can be maintained (not substituted) or substituted conservatively.
- the residues of FIG. 1 which are involved in hydrophobic interactions are substituted with other hydrophobic residues.
- hydrophobic residues can be substituted for other hydrophobic residues
- polar residues can be substituted for other polar residues
- acidic residues can be substituted for other acidic residues
- basic residues for other basic residues.
- the residues labeled in FIG. 3A can be maintained (not substituted) or can be replaced with amino acids with similar characteristics.
- the amino acids at the ‘a’ and ‘d’ positions of the heptad repeat for example, those indicated with an asterisk in FIG. 1 or those listed in bold in FIG.
- 3C can be conserved (maintained), or they can be substituted conservatively, e.g., replaced with hydrophobic amino acids.
- the residues involved with the dimer-dimer interface e.g., residues marked with a ‘d’ in FIG. 1 or residues labeled in FIG. 6
- the residues indicated in FIG. 4 can be maintained.
- the derivatives, e.g. mutant and wild-type peptides can crystallize isomorphously.
- at least one serine residue, e.g., Ser22 is replaced with a cysteine.
- Trp20 is replaced with Ala20.
- substitutions based on amino acid characteristics can be made.
- the polar amino acid residues can be substituted and the hydrophobic amino acids can be maintained.
- the nonhydrophobic residues can be substituted and the hydrophobic residues can be maintained.
- a derivative can comprise substitutions of any or all of the amino acid residues in the following positions: 14, 15, 18, 19, 22, 24, 25, 26, 28, 29, 31, 32, 33, 35, 36, 38, 39, 40, 42, 43, 45, 46 and 47.
- the nonhydrophobic residues can be substituted such that acidic amino acids are alternated with basic amino acids.
- hydrophilic residues in the C terminal region can be substituted to optimize stability of the helix, for example by presenting one or more amino acids which form disulfide bonds, strong ionic bonds or cross-linked moieties, with the corresponding amino acid of another subunit.
- residues 53, 55 and 60 are substituted.
- derivatives e.g., coiled-coil subunits, which improve (e.g., optimize) stability of the coiled-coil structure or which improve cross-linkage involving the structure, or which improve the ability of the structure to be immobilized on a solid substrate.
- a peptide is synthesized that corresponds to residues 12-60 of HDAg, and includes a C-terminal tyrosine, enabling the peptide to be labeled, e.g., with I 25 , for use in a radioimmunoassay.
- the peptide is ⁇ 12-60(Y).
- Yet other derivatives are proteins which consist essentially of the amino acid sequence of a given protein (e.g., possess the relevant sequence and, optionally, other amino acids residing at the termini which do not significantly alter or detract from the properties of the protein).
- a “functional” fragment, derivative, mutant, or allelic variant is of sufficient length and/or structure as to possess one or more biological activities of the protein.
- a biological activity of the protein is formation of a coiled-coil oligomer, e.g., an octamer, for example, an octamer doughnut-shaped structure.
- the protein derivative is conserved within the coiled-coil regions but is lacking in or mutated within one or more other regions (e.g., sequences not within the coiled-coil.
- suitable fragments include peptides lacking fragments which encode or stabilize the coiled coil, for example, amino acids 12-48, or the peptides depicted in FIG. 11 .
- One example includes fragments which lack all or part of the region C-terminal to the proline bend (e.g. C-Terminal to Pro49).
- Another fragment includes the coiled coil and nuclear localization signal (e.g., amino acids 12-88); or solely the nuclear localization signal (amino acids 68-88).
- Yet another example includes HDAg which encodes the coiled-coil region but is lacking all or a portion of the nuclear localization signal. In one embodiment, all or a portion of one or both termini of a monomer is absent or mutated.
- the C region of the peptide (e.g., residues 50-60) can be mutated or all or a portion can be eliminated.
- derivatives includes peptides which possess amino acid modifications or additions which are characterized by a functional group which can react with a compound substituted by a “binding moiety”, such as those described above or with a cross-linking agent.
- Yet other biologically activities are the ability of HDAg-S to function as a trans activator of replication and the ability of HDAg-L to act as an inhibitor of replication.
- the biological activity of the protein is antigenic or immunogenic activity.
- Fragments of the protein can possess at least about 10 amino acids from the 12-48 amino acid region, preferably at least about 20 amino acids. In other embodiments, the fragment possesses essentially all of the amino acids of the full-length protein (e.g., at least about 85%, or at least about 95%).
- HDAg includes monomers and oligomers, e.g., dimers and octamers, comprising the monomers as subunits.
- the invention encompasses HDAg coiled-coil oligomers, e.g. octamers.
- Fusion molecules are intended to be included within the definition of HDAg derivatives and can be made by linking one or more binding moieties, e.g., chemicals or peptides, to the HDAg protein or fragment, for example, through a covalent bond or preferably a peptide bond or cysteine group.
- binding moieties e.g., chemicals or peptides
- derivatives, such as fusion proteins can comprise the amino acid sequence of the HDAg protein or fragment and a binding moeity such as a given protein, e.g., a native protein.
- a “binding moiety,” as the term is defined herein, includes a chemical entity which is bound to HDAg.
- the binding can be via a covalent bond (e.g. through a cysteine group), ionic bonding, hydrogen bonding or other mechanism.
- the binding moiety and the HDAg can be expressed as a single unit.
- the binding moiety can be a peptide (including post-translationally modified proteins, such as amidated, demethylated, glycosylated or phosphorylated proteins), sugar, lipid, steroid, nucleic acid, small molecule, anion or cation, drug, chemical or combination thereof which binds the specified binding partner (e.g., a target molecule).
- the binding will possess a high affinity.
- binding moieties include an antigen, an antibody, a ligand, a receptor, an enzyme, a (ligand) interaction peptide, a chemical, an effector, an oligonucleotide, a signal amplification peptide, an enhancer recognition protein, a promoter binding protein, a label, a growth factor, a cytokine, a nuclease, a small organic molecule, a test substance, a cytotoxic agent, a substrate, a solid substrate, a drug or a fragment thereof.
- binding moieties can be the same (homologous) or different (heterologous).
- the binding moieties can be binding partners. Examples of binding partners include, but are not limited to, antigen-antibody and ligand-receptor.
- First and second binding moieties can also include the following pairs: enzyme1-enzyme2, (ligand) interaction peptide-effector peptide (or chemical), oligonucleotide-nuclease, interaction agent (e.g. peptide signal amplification agent, (e.g. peptide), enhancer recognition agent (e.g. protein)-promoter-binding agent (e.g. protein), enhancer recognition agent-promoter binding agent, ligand-label, test substance-label, targeting agent—effector agent, drug (or hormone) -label, or any other combination.
- enzyme1-enzyme2 ligand interaction peptide-effector peptide (or chemical), oligonucleotide-nuclea
- a first binding moiety binds, a target molecule on a target cell (e.g. a surface protein) and the binding partner is the surface protein or target cell.
- a target cell e.g. a surface protein
- the “target cell” is defined as the cell which is intended to be contacted by the fusion cell.
- the target cell is of animal origin and can be a stem cell or somatic cell.
- Suitable animal cells for use on the claimed invention can be of, for example, mammalian and avian origin. Examples,of mammalian cells include human, bovine, ovine, porcine, murine, rabbit cells.
- the cell may be an embryonic cell, bone marrow stem cell or other progenitor cell.
- the cell can be, for example, an epithelial cell, fibroblast, smooth muscle cell, blood cell (including a hematopoietic cell, red blood cell, T-cell, B-cell, etc.), tumor cell, cardiac muscle cell, macrophage, dendritic cell, neuronal cell (e.g., a glial cell or astrocyte), or pathogen-infected cell (e.g., those infected by bacteria, viruses, virusoids, parasites, or prions).
- an epithelial cell including a hematopoietic cell, red blood cell, T-cell, B-cell, etc.
- tumor cell e.g., a glial cell or astrocyte
- neuronal cell e.g., a glial cell or astrocyte
- pathogen-infected cell e.g., those infected by bacteria, viruses, virusoids, parasites, or prions.
- cells isolated from a specific tissue are categorized as a “cell-type.”
- the cells can be obtained commercially or from a depository or obtained directly from an animal, such as by biopsy.
- the cell need not be isolated at all from the animal where, for example, it is desirable to deliver the vector to the animal in gene therapy.
- Cells can typically be characterized by markers expressed at their surface that are termed “surface markers”. These surface markers include surface proteins or target molecules, such as cellular receptors, adhesion molecules, transporter proteins, components of the extracellular matrix and the like. These markers, proteins and molecules also include specific carbohydrates and/or lipid moieties, for example, conjugated to proteins. In one embodiment, a binding moiety on a fusion molecule can bind to one or more surface proteins on the target cell. Surface proteins can be tissue- or cell-type specific (e.g. as in surface markers) or can be found on the surface of many cells. Typically, the surface marker, protein or molecule is a transmembrane protein with one or more domains which extend to the exterior of the cell (e.g. the extracellular domain).
- surface markers include surface proteins or target molecules, such as cellular receptors, adhesion molecules, transporter proteins, components of the extracellular matrix and the like. These markers, proteins and molecules also include specific carbohydrates and/or lipid moieties, for example, conjugated to proteins.
- the surface protein selected for the invention is preferably specific to the tissue.
- specific to the tissue it is meant that the protein be present on the targeted cell-type but not present (or present at a significantly lower concentration) on a substantial number of other cell-types. While it can be desirable, and even preferred, to select a surface protein which is unique to the target cell, it is not required for the claimed invention. It is to be appreciated, however, that specific delivery may not be required where the cell or cells are contacted with the viral vector in pure or substantially pure form, such as can be the case in an in vitro gene transfer. As such, the surface protein or targeted protein for the first binding moiety may be present on many different cell-types, specific or even unique to the targeted cell-type.
- the surface protein can be a cellular receptor or other protein, preferably a cellular receptor.
- cellular receptors include receptors for cytokines, growth factors, and include, in particular epidermal growth factor receptors, platelet derived growth factor receptors, interferon receptors, insulin receptors, proteins with seven transmembrane domains including chemokine receptors and frizzled related proteins (Wnt receptors), immunoglobulin-related proteins including MHC proteins, CD4, CD8, ICAM-1, etc., tumor necrosis factor-related proteins including the type I and type II TNF receptors, Fas, DR3, DR4, CAR1, etc., low density lipoprotein receptor, integrins, and, in some instances, the Fc receptor.
- cytokines include epidermal growth factor receptors, platelet derived growth factor receptors, interferon receptors, insulin receptors, proteins with seven transmembrane domains including chemokine receptors and frizzled related proteins (Wnt receptors), immunoglobulin-related proteins
- the binding moiety can be selected or derived from native ligands or binding partners to the surface protein of the target cell.
- the binding moiety can be a polypeptide comprising at least the receptor-binding portion of the native ligand.
- a “native ligand” or “native binding partner” is defined herein as the molecule naturally produced by, for example, the animal or species which binds to the surface protein in nature.
- the binding moiety is a polypeptide or protein.
- the native ligand of a cytokine receptor can be the native cytokine.
- the binding moiety can comprise a binding fragment of an antibody, such as the variable region or a single chain antibody.
- a binding moiety comprises a binding fragment of an antibody
- many antibodies to surface proteins are known or are commercially available, as are the amino acid sequences which are responsible for binding.
- novel antibodies can be prepared by methods known in the art, such as by Harlow and Lane, “Antibodies, A Laboratory Manual,” Cold Spring Harbor Laboratory (1988).
- the binding fragment can comprise an antibody fragment, for example, the constant region or, the variable region (e.g., Fc fragment or FAb′ fragment).
- a binding moiety can be a polypeptide ligand to a cellular receptor.
- preferred ligands are growth factors, epidermal growth factor, interleukins, GM-CSF, G-CSF, M-CSF, EPO, TNF, interferons, and chemokines.
- the receptor is a transferon receptor.
- the binding moiety can have an amino acid sequence which is the same or substantially the same as an amino acid sequence of at least the receptor-binding portion of a native ligand for the cellular receptor. Similar to cellular receptors, many of the corresponding ligands have been identified, sequenced and characterized, including the portions thereof which bind to the receptor. The binding moiety can, therefore, include the same or substantially the same sequence of the entire native ligand. Alternatively, binding moiety comprises the receptor binding portion of the native ligand, eliminating, in some cases, the effector function of the ligand.
- the binding moiety is selected or derived from native ligands or binding partners to a cellular surface molecule of a target cell.
- a “cellular surface molecule” as defined herein can be a peptide (including post-translationally modified proteins, such as amidated, demethylated, methylated, prenylated, palmitoylated, glycosylated, myristylated, acetylated or phosphorylated proteins), sugar, lipid, steroid, anion or cation, or a combination thereof which binds the first binding moiety.
- the binding of the cellular surface molecule to the binding moiety of the bifunctional molecule will be of high affinity. Examples of high affinity have a dissociation constant of 10 ⁇ 5 M (preferably 10 ⁇ 8 M) or better.
- the cellular surface molecule need not be ⁇ specific” for the target cell.
- the cellular surface molecule is specific for a desired viral vector.
- specific delivery of Influenza A viral vectors can employ sialic acid cellular surface molecules for entry into a target cell whereas targeting of VSV viral vectors can employ a phospholipid as the surface molecule.
- the cellular surface molecule for the first binding moiety can be present on many different cell-types, specific or even unique to the target cell.
- the effector function can be desirable, thereby stimulating or modulating the cellular activity of the target cell which can enhance therapy.
- a therapy can be desirable is in the delivery of a negative selection marker or suicide protein to a tumor where the target cell is a lymphokine and the ligand is a cytokine. Where the lymphokine is stimulated, the cell, can also possess therapeutic value in the recruitment of an endogenous immune response against the tumor, thereby increasing the therapeutic benefit of the therapy.
- substantially the same sequence is intended to include sequences which bind the surface protein and possess a high percentage of (e.g., at least about 90%, preferably at least about 95%) sequence identity with the native sequence.
- the modifications to the sequence can be conserved or non-conserved, natural and unnatural, amino acids and are preferably outside of the binding domain. Amino acids of the native sequence for substitution, deletion, or conservation can be identified, for example, by a sequence alignment between proteins from related species or other related proteins.
- a second binding moiety which is a chemical entity which binds to HDAg.
- the binding can be via a covalent bond, ionic bonding, hydrogen bonding or other mechanism.
- the second binding moiety can be the same or different from the first.
- it can be a peptide, sugar, lipid, steroid, nucleic acid, small molecule, anion or cation, or combination thereof which binds the HDAg.
- the second binding moiety of the fusion protein is also a polypeptide.
- One embodiment of the second binding moiety comprises an antigen-binding fragment of an antibody which recognizes and binds to an antigen.
- Ligand receptors which are cellular receptors can be transmembrane proteins comprising intracellular, transmembrane (characterized by highly hydrophobic regions in the sequence) and extracellular domains.
- the second binding moiety can comprise the native extracellular domain of a receptor molecule.
- Fusion proteins can be made conveniently through known methods, e.g. recombinantly.
- the binding moieties can be directly bonded to HDAg or can be bonded to HDAg through a linking moiety. Where one or both of the moieties are polypeptides, a peptide bond or peptide linker may be preferred, thereby obtaining a “fusion protein”.
- the “fusion protein” of the HDAg and one or more moieties can be expressed by a single nucleic acid construct in series.
- One or more moieties and HDAg alternatively be linked directly or indirectly other than via a peptide bond or peptide linker, thereby obtaining a “conjugate”.
- the bond can be covalent, as in a peptide bond, ionic bond or hydrogen bond.
- a binding moiety can be bonded to the N terminus of HDAg via the C terminus, or vice versa or both. It is acknowledged that one fusion protein may possess greater activity than a second fusion protein due to conformational or steric considerations.
- the binding moieties can be, for example, monomers, dimers and tetramers.
- binding moieties are not polypeptides, they can be joined via chemical reaction through functional groups present on each moiety which, under the appropriate conditions, will react with each other.
- functional groups for example, acid groups (or activated derivatives thereof) can be reacted with amines, alcohols or thiols to form amide or ester bonds, as is known in the art.
- a linking moiety is employed to link the binding moieties, e.g. binding partners, to HDAg.
- the linker can preferably be a flexible linker and sufficient in length to separate the moieties in space, thereby not restricting the ability of the fusion molecule to bind independently and maintain the proper conformation.
- the linker moiety will generally be a peptide, polypeptide, or a “pseudopeptide”.
- a “pseudopeptide” is a bifunctional linker which contains at least one non-amino acid and reacts to form a peptide bond, or other bond, with the terminal amine or carboxyl group of the moiety.
- a peptide characterized by substitution of the terminal amine for a carboxyl group can function to react with the amine terminus of each moiety.
- Such as linker is considered to be a “pseudopeptide.”
- a peptide characterized by substitution of the terminal carboxyl for an amine group can function to react with the carboxyl terminus of each moiety.
- the linker will be a peptide linker which will link the amine terminus of a moiety to the carboxyl terminus of HDAg or vice versa.
- One advantage to such a molecule is the ability to express the fusion protein in a recombinant host cell with a single nucleic acid construct.
- Peptide linkers can be obtained from immunoglobulin hinge regions, such as a proline-rich region. Also, linkers can be characterized by little steric hindrance, thereby permitting maximal independent movement of the two moieties, such as with a polyglycine linker. Alternatively, the linker selected to be reactive to or inert to cellular proteases can be desirable. In another embodiment, the linker can be selected to avoid or minimize an immune response against the fusion molecule.
- the length of the linker also is not particularly critical. Typically, the length of the linker can be between about 2 and about 20 amino acids. As can be seen, the selection of the particular linking group is not critical to the invention.
- the linker can be a bifunctional compound which will react with other functional groups on the binding moieties or HDAg, such as in the reaction of acids and amines or alcohols (as present in peptides, carbohydrates and lipids, for example) in the formation of amides or esters.
- a preferred combination of the above first and second binding moieties includes one binding partner, e.g. a polypeptide ligand to a cell-type specific cellular receptor linked, via a peptide linker, through a terminus of the ligand to the terminus of HDAg.
- a second binding partner e.g. a extracellular domain of a cellular receptor or a mutant thereof, can be linked to the same or different HDAg subunit in the same manner.
- the C terminus of a binding polypeptide is linked to the N terminus of HDAg via the polypeptide linker or the N terminus of the first binding polypeptide is linked to the C terminus of the HDAg via the polypeptide linker.
- peptidomimetics molecules which are not polypeptides, but which mimic aspects of their structures to bind to the same site
- polysaccharides can be prepared that have the same functional groups as the polypeptides of the invention, and which interact with binding partners in a similar manner.
- the peptidomimetic can comprise at least two components, a binding entity or entities and a backbone or supporting structure entity.
- the binding entities of the peptidomimetic are the chemical atoms or groups which will react or complex (as in the formation of a hydrogen or covalent bond) with a binding partner.
- the binding entities in a peptidomimetic are the same as the polypeptide moieties.
- the binding entities can be an atom or chemical group which will react with the binding partner in the same or similar manner as the polypeptide.
- binding entities suitable for use in designing a peptidomimetic for a basic amino acid in a polypeptide are nitrogen containing groups, such as amines, ammoniums, guanidines and amides or phosphoniums.
- binding entities suitable for use in designing a peptidomimetic for an acidic amino acid in a polypeptide can be, for example, carboxyl, lower alkyl carboxylic acid ester, sulfonic acid, a lower alkyl sulfonic acid ester or a phosphorous acid or ester thereof.
- the supporting structure is the chemical entity that, when bound to the binding moiety or moieties, provides the three dimensional configuration of the peptidomimetic.
- the supporting structure can be organic or inorganic. Examples of organic supporting structures include polysaccharides and polymers (such as, polyvinyl alcohol or polylactide). It is preferred that the supporting structure possess substantially the same size and dimensions as the polypeptide backbone or supporting structure. This can be determined by calculating or measuring the size of the atoms and bonds of the polypeptide and peptidomimetic. For example, the nitrogen of the peptide bond can be substituted with oxygen or sulfur, thereby forming a polyester backbone.
- the carbonyl of the peptide bond can be substituted with a sulfonyl group or sulfonyl group, thereby forming a polyamide.
- Reverse amides of the peptide can be made (e.g., substituting one or more —CONH— groups for a —NHCO— group).
- the peptide backbone can be substituted with a polysilane backbone.
- peptidomimetic compounds can be manufactured by art-known and art-recognized methods.
- a polyester corresponding to a given peptide can be prepared by the substituting a hydroxyl group for each corresponding amine group on the amino acids, thereby preparing a hydroxyacid and sequentially esterifying the hydroxyacids, optionally blocking the basic side chains and acids to minimize side reactions. Determining an appropriate chemical synthesis route can generally be readily identified upon determining the chemical structure using no more than routine skill.
- the fusion molecules can be manufactured according to methods generally known in the art.
- the fusion molecule can be manufactured employing known organic synthesis methods useful for reacting a functional or reactive group on the moiety with a functional or reactive group on the other moiety or, preferably, a linker.
- synthesis derivation or inactivation of the functional group(s) required for binding to the moiety's binding partner should be avoided.
- Appropriate syntheses are highly dependent upon the chemical nature of the binding moiety and, generally, can be selected from an organic chemistry text, such as March et al. Advanced Organic Chemistry, 3rd Edition (1985) John E. Wiley & Sons, Inc., New York, N.Y., or other known methods.
- the fusion molecule can be a conjugate or a fusion protein and manufactured according to known methods. Where a fusion protein is desired, the molecule can be manufactured according to known methods of recombinant DNA technology.
- the fusion protein can be expressed by a nucleic acid molecule comprising sequences which code for both moieties, such as by a fusion gene (nucleic acid molecule).
- the invention further relates to nucleic acid molecules, including fusion genes, which encode HDAg fragments, mutants and derivatives.
- Recombinant or isolated nucleic acid molecules of the invention encode an HDAg protein (including the e.g., native proteins, fragments, derivatives, mutants and allelic variants) as defined herein.
- a nucleic acid molecule of the present invention can be double-stranded or single-stranded and can be a DNA molecule, such as cDNA or genomic DNA, or an RNA molecule.
- the nucleic acid molecule can be placed in a construct, which can be inserted into a vector.
- the nucleic acid molecule can include one or more exons, with or without, as appropriate, introns.
- the nucleic acid molecule contains a single open reading frame which encodes HDAg and one or more binding moieties and, optionally, a signal sequence and/or a polypeptide linker, when present.
- the nucleic acid molecule contains a first exon which begins with an ATG, encodes a binding moiety, and optionally the polypeptide linker, and ends with a splice donor site.
- the construct would also contain an HDAg-coding nucleic acid sequence and would further would contain an intron followed by a second exon which begins with a splice acceptor site and, optionally, a polypeptide linker, coding sequences for a second binding moiety and ending with a stop codon.
- Alternative combinations of these elements would be apparent to the person of skill in the art.
- the nucleic acid molecule can include sequences which encode HDAg, and one or more moieties, as well as one or more of the following optional sequences, in a functional relationship: regulatory sequences (as will be discussed in more detail below) a start codon, a signal or leader sequence, splice donor sites, splice acceptor sites, introns, a stop codon, transcription termination sequences, 5′ and 3′ untranslated regions, polyadenylation sequences, negative and/or positive selective markers, and replication sequences.
- regulatory sequences as will be discussed in more detail below
- the coding regions of the nucleic acid molecule code for HDAg and the binding moeity or moieties and any polypeptide linkers present.
- the binding moiety is a native ligand or cellular surface protein (e.g. a cellular receptor), or a binding fragment thereof
- the nucleic acid molecule coding regions can correspond to the native sequences which encode a binding moiety. Because many amino acids are encoded by a plurality of codons, the coding sequence can be mutated to result in the same amino acid sequence. This may be advantageous where a codon is preferred by the selected host cell.
- the HDAg gene can be altered such that the codons conform to the known codon use preferences for E. coli. See FIG. 9 and FIGS. 15-17 .
- the gene can be inserted into a convenient expression vector which allows production of several forms of the capsid protein including residues 1-84 (terminated in the middle domain), the short isoform and the long isoform. Dingle et al., J. Virol, (1998). All three forms express well.
- the nucleic acid molecule comprises the or corresponding coding nucleotide sequence of FIG. 9, 10 , 15 - 16 , or substantially the same sequences thereof, or the complement thereof.
- the nucleic acid molecule does not possess the nucleotide sequence of GenBank Accession #M28267.
- the nucleic acid molecule can be, for example, isolated and/or purified or recombinant.
- the nucleic acid molecule comprises the nucleotide sequence depicted in FIG. 9 , nucleotides 37-150 of FIG. 9 , nucleotides 37-186 of FIG. 9 , FIG. 10 , nucleotides 1421-1566 of FIG. 10 or nucleotides 1457-1566 of FIG. 10 , FIG. 15 , FIG. 16 ; or a fragment or mutation thereof, which encodes a coiled-coil oligomer.
- the nucleic acid molecule comprises a nucleotide sequence encoding a polypeptide comprising an amino acid sequence depicted in a row of FIG. 1 , amino acids 12-48 of a row of FIG. 1 , FIG.
- FIG. 9 amino acids 12-48 of a row of FIG. 9 , FIG. 10 , amino acids 12-88 of FIG. 10 , FIG. 9 , or ⁇ 12-60(Y).
- complementary strands of these sequences DNA sequences that hybridize to these sequences and RNA sequences transcribed from these sequences.
- fragments or mutations thereof, which encode a coiled-coil oligomer are also encompassed in the invention.
- the nucleic acid molecule encodes a polypeptide of a fusion molecule described herein.
- fusion nucleic acid molecules comprising an HDAg nucleic acid molecule operably linked to a heterologous nucleic acid molecule (“heterologous gene”), which encodes a peptide which is not HDAg and not derived therefrom (i.e., “heterologous protein”).
- heterologous gene which encodes a peptide which is not HDAg and not derived therefrom
- the binding moiety is a mutation or variant of a native sequence, as provided above, generally, the nucleic acid sequence can be mutated correspondingly. It may also be preferred for ease of manufacture of the nucleic acid sequence to maintain as much of the native sequence as possible.
- the nucleic acid molecule shares at least about 50% sequence identity with the corresponding native sequence such as the coding region, for example, the coiled-coil region, e.g., amino acids 12-48 or amino acids 12-60.
- sequence identity is at least about 65%, more preferably, 75%.
- percent sequence identity is at least about 90%, and still more preferably, at least about 95%.
- Recombinant nucleic acid molecules meeting these criteria comprise nucleic acids having sequences identical to sequences of naturally occurring genes, including polymorphic or allelic variants, and portions (fragments) thereof, or variants of the naturally occurring genes.
- Such variants include mutants differing by the addition, deletion or substitution of one or more residues, modified nucleic acids in which one or more residues are modified (e.g., DNA or RNA analogs), and mutants comprising one or more modified residues.
- nucleic acid molecules coding for suitable binding moieties are known in the art and can be obtained from, for example, GENBANK. Alternatively, other sequences can be employed, such as homologs of known genes.
- hybridization e.g., under high stringency conditions or moderate stringency conditions.
- “Stringency conditions” for hybridization is a term of art which refers to the conditions of temperature and buffer concentration which permit hybridization of a particular nucleic acid to a second nucleic acid in which the first nucleic acid may be perfectly complementary to the second, or the first and second may share some degree of complementarity which is less than perfect.
- certain high stringency conditions can be used which distinguish perfectly complementary nucleic acids from those of less complementarity.
- “High stringency conditions” and “moderate stringency conditions” for nucleic acid hybridizations are explained on pages 2.10.1-2.10.16 (see particularly 2.10.8-11) and pages 6.3.1-6 in Current Protocols in Molecular Biology (Ausubel, F. M. et al., eds., Vol. 1, containing supplements up through Supplement 29, 1995), the teachings of which are hereby incorporated by reference.
- the exact conditions which determine the stringency of hybridization depend not only on ionic strength, temperature and the concentration of destabilizing agents such as formamide, but also on factors such as the length of the nucleic acid sequence, base composition, percent mismatch between hybridizing sequences and the frequency of occurrence of subsets of that sequence within other non-identical sequences. Thus, high or moderate stringency conditions can be determined empirically.
- hybridization conditions By varying hybridization conditions from a level of stringency at which no hybridization occurs to a level at which hybridization is first observed, conditions which will allow a given sequence to hybridize (e.g. selectively) with the most similar sequences in the sample can be determined.
- washing conditions are-described in Krause, M. H. and S. A. Aaronson, Methods in Enzymology, 200:546-556 (1991). Also, see especially page 2.10.11 in Current Protocols in Molecular Biology (supra), which describes how to determine washing conditions for moderate or low stringency conditions. Washing is the step in which conditions are usually set so as to determine a minimum level of complementarity of the hybrids. Generally, starting from the lowest temperature at which only homologous hybridization occurs, each ° C. by which the final wash temperature is reduced (holding SSC concentration constant) allows an increase by 1% in the maximum extent of mismatching among the sequences that hybridize. Generally, doubling the concentration of SSC results in an increase in T m of ⁇ 17° C.
- washing temperature can be determined empirically for high, moderate or low stringency, depending on the level of mismatch sought.
- the following table provides an example of each condition of stringency. % Allowed ° C. Stringency mismatch Temperature % Formamide High 6.6 52 50 Medium 13 45 50 Low 27 45 22
- Selective isolation or “selective hybridization”, is defined herein as embracing the isolation of a sufficiently few number of molecules (preferably one) as to readily permit the identification of the nucleic acid of interest.
- the nucleic acid molecule also preferably comprises regulatory sequences. Regulatory sequences include cis-acting elements that control transcription and regulation such as, promoter sequences, enhancers, ribosomal binding sites, and transcription binding sites. Selection of the promoter will generally depend upon the desired route for expressing the protein. For example, where the molecule will be introduced (e.g. transformed) into a cell by a viral vector, e.g. a plasmid, preferred promoter sequences include viral, such as retroviral or adenoviral, promoters. Examples of suitable promoters include the cytomegalovirus immediate-early promoter, the retroviral LTR, SV40, and TK promoter.
- the selected promoter is recognized by the host cell.
- the construct is a cassette expression system.
- a suitable promoter which can be used can include the native promoter for the binding moiety which appears first in the construct.
- the elements which comprise the nucleic acid molecule can be isolated from nature, modified from native sequences or manufactured de novo, as described, for example, in the above-referenced texts. The elements can then be isolated and fused together by methods known in the art, such as exploiting and manufacturing compatible cloning or restriction sites.
- the nucleic acid molecules can be inserted into a construct, e.g. a vector, such as a plasmid or cassette expression system, which can, optionally, replicate and/or integrate into a recombinant host cell, by known methods.
- a construct e.g. a vector, such as a plasmid or cassette expression system, which can, optionally, replicate and/or integrate into a recombinant host cell, by known methods.
- the vectors of the present invention comprise a nucleic acid molecule which encodes HDAg (e.g. an HDAg monomer).
- the monomer can be a subunit of an HDAg coiled-coil oligomer, e.g. an octamer.
- the oligomer can comprise an HDAg polypeptide as described herein.
- the nucleic acid molecule thus includes any of the nucleic acid molecules described herein, for example, a native (wild type) nucleic acid, or a fragment, mutant or derivative.
- nucleic acids encoding full-length HDAg e.g. HDAg-S or HDAg-L
- a fragment or derivative thereof e.g.
- Preferred vectors comprise a nucleic acid molecule comprising nucleotide sequence depicted in FIG. 9 , nucleotides 37-150 of FIG. 9 , nucleotides 37-186 of FIG. 9 , FIG. 10 , nucleotides 1421-1566 of FIG. 10 or nucleotides 1457-1566, FIG. 10 , FIG. 15 and FIG. 16 .
- Preferred vectors also comprise a nucleic acid comprising a nucleotide sequence encoding a polypeptide comprising an amino acid sequence depicted in a row of FIG. 1 , amino acids 12-48 of a row of FIG. 1 , the top row of FIG. 3 c, FIG. 9 , amino acids 12-48 of a row of FIG. 9 , FIG. 10 , amino acids 12-88 of FIG. 10 , FIG. 11 , FIG. 17 or 8 12-60(Y).
- vectors comprise nucleic acids comprising sequences which are the complementary strands of the above, DNA sequences which hybridize to these sequences, RNA sequences transcribed from these sequences, and fragments and mutations thereof, which encode a coiled-coil oligomer, e.g. an octamer.
- Vectors can also comprise fusion molecules comprising HDAg and at least one binding moiety, as described herein.
- the vector additionally comprises at least one multiple cloning site.
- a multiple cloning site comprises a cleavage sites for commonly used restriction sites to facilitate incorporation of foreign (non-HDAg) gene, e.g. a cassette.
- a multiple cloning site can be located 3′ or 5′ to the nucleic acid molecule encoding HDAg.
- There can be multiple cloning sites for example, there can be a multiple, cloning site 3′ of the HDAg nucleic acid molecule and another multiple cloning site 5′ to the HDAg nucleic acid molecule.
- the multiple cloning site can be located in a flanking region.
- the vector can further comprise nucleic acid encoding a nuclear localization signal, e.g. an HDAg nuclear localization signal, for example, amino acids 68-88 of HDAg, as shown in FIG. 9 .
- the vector can also comprise an HDAg nucleic acid molecule comprising a sequence encoding a coiled coil and a nuclear localization signal.
- the vectors of the invention can be used for the expression of a fusion molecule as described herein.
- a heterologous (non-HDAg) nucleic acid molecule e.g., a gene, encoding a binding moiety of a fusion molecule can be inserted into a vector comprising HDAg, e.g., into a multiple cloning site.
- a vector can comprise one heterologous gene or more than one heterologous gene.
- the genes can be the same or different.
- a first heterologous gene can encode a first binding moiety and a second heterologous gene can encode a second binding moiety.
- the first and the second binding moieties can be binding partners as described herein (for example, single chain antibody and antigen, ligand and receptor, components of a linked pathway, etc).
- the heterologous gene or genes and the nucleic acid encoding HDAg can be operably linked, e.g., in the same open reading frame. Where HDAg nucleic acid and a heterologous gene encoding a binding moiety are operably linked, they are expressed as a single protein unit, i.e., a fusion molecule.
- a vector comprises an HDAg nucleic acid molecule (e.g. a nucleic acid cassette) encoding a monomer capable of being a unit of a coiled-coil octamerization scaffold and a heterologous gene encoding a binding moiety, wherein the expressed binding moiety is bound to one terminal of the monomer, e.g. the N terminus or the C terminus. Where there are two expressed heterologous genes, each end of the monomer can be bound to an expressed binding moiety.
- an HDAg nucleic acid molecule e.g. a nucleic acid cassette
- Vectors can additionally comprise a nucleic acid molecule encoding a nuclear localization signal, which can transport protein expressed by the vector to the nucleus of a cell.
- the vectors described herein can express nucleic acid, e.g. a fusion gene, in a host cell, e.g. a procaryotic or eukaryotic cell.
- the vector can be expressed in a bacteria cell, for example, Escherischia, e.g. E. coli.
- the nucleic acid in the vector can also be expressed in Bacillus. It can also be expressed in baculoviruses, pichia expressions systems, and animal tissue or cells, for example insect, mammal, e.g., a human, or yeast (such as Saccharomyces ). Examples of specific cells include somatic or embryonic cells, HeLa cells, human 293 cells, monkey COS-7 cells, etc.
- the vector can comprise a number of other components.
- the vector can comprise a marker, for example a positive or negative selection marker, e.g. ampicillin or kanamycin.
- the vector can comprise two markers, wherein the first marker is capable of detecting propagation of a vector (e.g. a plasmid) in a bacterial cell and a second marker is capable of detecting propagation of the vector in a eukaryotic cell.
- the vector can comprise an origin of replication for bacteria and an origin of replication that is capable of mediating production of a single-stranded DNA by a bacteriophage, such as f1 phage or M13 phage.
- the vector can comprise a promoter.
- the promoter is a viral promoter, such as a retroviral or adenoviral promoter.
- suitable promoters include T7, lac, trc, tac, CMV, SV40, the cytomegalovirus immediate-early promoter, the retroviral LTR and the TK promoter.
- the promoter can be selected for high-level expression.
- a promoter can be selected for optimal expression in bacteria, (e.g. T7, lac, trc, tac etc.) or in a eukaryotic cell (e.g. CMV or SV40).
- the vector can also comprise enhancers, ribosomal binding sites and transcription binding sites.
- a vector comprises HDAg nucleic acid (with or without a nuclear local signal), a heterologous gene, a marker, an origin of replication for a host cell, an origin of replication capable of mediating production of single-stranded DNA by a bacteriophage, a promoter, and a ribosome binding site.
- the origin of replication is selected for maintaining plasmid expression is E. coli.
- a vector for overexpression of hepatitis delta antigen in E. coli for example, a vector produced by the method as herein described in Example 2, below, e.g. a vector comprising a nucleic acid molecule sequence optimized for expression of HDAg in E. coli.
- the gene can be inserted into a vector which allows production of several forms of the capsid protein including residues 1-84 (terminated in the middle domain), the short isoform and the long isoform.
- the vector is pR5 ⁇ V5.
- Another preferred embodiment is a cassette expression system which allows any expressed sequence or sequences (e.g.
- the HDAg gene is mutated such that in the expressed peptide, a serine residue of HDAg is replaced with (substituted by) a cysteine to allow for convenient chemical cross-linking of the octamerization domain, e.g., to an inert support matrix (e.g. polyethylene glycol), to a synthetic peptide, to an oligosaccharide, to a small organic molecule, or to lipids.
- FIG. 12 depicts a representation of a construct containing eight appended protein domains on an HDAg octameric framework.
- the vector can be viral.
- Viral vectors include baculovirus, retrovirus, adenovirus, parvovirus (e.g., adeno-associated viruses), coronavirus, negative strand RNA viruses such as orthomyxovirus (e.g., influenza virus), rhabdovirus (e.g., rabies and vesicular stomatitis virus), paramyxovirus (e.g.
- RNA viruses such as picomavirus and alphavirus
- double stranded DNA viruses including adenovirus, herpesvirus (e.g., Herpes Simplex virus types 1 and 2, Epstein-Barr virus, cytomegalovirus), and poxvirus (e.g., vaccinia, fowlpox and canarypox).
- herpesvirus e.g., Herpes Simplex virus types 1 and 2, Epstein-Barr virus, cytomegalovirus
- poxvirus e.g., vaccinia, fowlpox and canarypox
- Other viruses include Norwalk virus, togavirus, flavivirus, reoviruses, papovavirus, hepadnavirus, and hepatitis virus, for example.
- retroviruses include: avian leukosis-sarcoma, mammalian C-type, B-type viruses, D-type viruses, HTLV-BLV group, lentivirus, spumavirus.
- Other examples include murine leukemia viruses, murine sarcoma viruses, mouse mammary tumor virus, bovine leukemia virus, feline leukemia virus, feline sarcoma virus, avian leukemia virus, human T-cell leukemia virus, baboon endogenous virus, Gibbon ape leukemia virus, Mason Pfizer monkey virus, simian immunodeficiency virus, simian sarcoma virus, Rous sarcoma virus and lentiviruses.
- a nucleic acid molecule described herein can be introduced (incorporated or inserted) into the host cell, by known methods. Such cells, comprising such nucleic acid molecules, are encompassed in the invention.
- the host cell can be a eukaryotic or prokaryotic cell and includes, for example, baculoviruses, Pichia expression systems, yeast (such as Saccharomyces ), bacteria (such as, Escherichia or Bacillus ), animal cells or tissue, including insect or mammalian cells (such as somatic or embryonic human cells, Chinese hamster ovary cells, HeLa cells, human 293 cells and monkey COS-7 cells, etc.).
- transfecting or transforming cells examples include calcium phosphate precipitation, electroporation, microinjection, infection, lipofection and direct uptake. Methods for preparing such recombinant host cells are described in more detail in Sambrook et al., “Molecular Cloning: A Laboratory Manual,” Second Edition (1989) and Ausubel, et al. “Current Protocols in Molecular Biology,” (1992), for example.
- the host cell is then maintained under suitable conditions for expression and recovering the molecule, e.g. a fusion molecule.
- the cells are maintained in a suitable buffer and/or growth medium or nutrient source for growth of the cells and expression of the gene product(s).
- the growth media are not critical to the invention, are generally known in the art and include sources of carbon, nitrogen and sulfur. Examples include Dulbeccos modified eagles media (DMEM), RPMI-1640, M199 and Grace's insect media.
- DMEM Dulbeccos modified eagles media
- RPMI-1640 RPMI-1640
- M199 and Grace's insect media.
- the selection of a buffer is not critical to the invention.
- the pH which can be selected is generally one tolerated by or optimal for growth for the host cell.
- the cell is maintained under a suitable temperature and atmosphere.
- Anaerobic host cells are generally maintained under anaerobic conditions.
- the host cell is aerobic and the host cell is maintained under atmospheric conditions or other suitable conditions for growth.
- the temperature should also be selected so that the host cell tolerates the process and can be for example, between about 300 and 40° C. for mammilian cells and between 20 and 40° C. for bacteria, yeast and insect cells.
- the recombinant molecules, including fusion molecules, produced by the processes described herein can be isolated and purified by known means.
- suitable purification and isolation processes are generally known and include ammonium sulfate precipitation, dialysis, gel filtration, immunoaffinity, chromatography, electrophoresis, ultrafiltration, microfiltration or diafiltration.
- the fusion molecule can incorporate commonly used sequence tags e.g. his, tag or fla to facilitate purification via ligand affinity chromatograph.
- the fusion molecule is preferably purified substantially prior to use, particularly where the protein will be employed as an in vivo therapeutic, although the degree of purity is not necessarily critical where the molecule is to be used in vitro.
- the bifunctional molecule can be isolated to about 50% purity (by weight), more preferably to about 80% by weight or about 95% by-weight. It is most preferred to employ a molecule which is essentially pure (e.g., about 99% by weight or to homogeneity).
- Fusion molecules which are prepared according to the above method can be used directly in the disclosed methods or can be screened for an activity prior to use.
- the fusion molecule (or mixtures of fusion molecules) can be contacted with, for example, the binding partner of a binding moiety of the fusion molecule under conditions suitable for binding and then assayed for binding.
- a fusion molecule comprising a ligand can be screened for the ability to bind the ligand's receptor, or the binding protein of the ligand's receptor, in vitro, by contacting the receptor (or portion thereof) and the fusion molecule under conditions suitable for binding and detecting binding.
- the HDAg molecules of the invention are useful in a variety of methods.
- the N-terminal octamer may serve as a convenient high valency framework for linking, presenting or delivering a variety of binding moieties, e.g. as described above.
- the molecules are useful in the delivery of one or more therapeutic agents, such as drugs, proteins or polynucleotides (e.g., genes) or products thereof to a patient.
- the polynucleotide or the product thereof can be a therapeutic agent.
- therapeutic polynucleotide includes RNA (e.g., ribozymes) and antisense DNA that prevents or interferes with the expression of an undesired protein in the target cell.
- the polynucleotide can also encode a heterologous therapeutic protein.
- a heterologous protein or polynucleotide is one which is not HDAg.
- therapeutic proteins include antigens or immunogens such as a polyvalent vaccine, cytokines, tumor necrosis factor, interferons, interleukins, adenosine deaminase, insulin, T-cell receptors, soluble CD4, epidermal growth factor, human growth factor, blood factors, such as Factor VIII, Factor IX, cytochrome b, glucocerebrosidase, ApoE, ApoC, ApoAI, the LDL receptor, negative selection markers or “suicide proteins”, such as thymidine kinase (including the HSV, CMV, VZV TK), anti-angiogenic factors, Fc receptors, plasminogen activators, such as t-PA, u-PA and streptokinase, dopamine, MHC, tumor suppressor
- the ligand and/or receptor can be peptides (including post-translationally modified proteins) and/or small molecules (including sugars, steroids, lipids, anions or cations).
- the ligands and ligand-cell specific receptors can be known or unknown. Where the ligand is known and the receptor is unknown, ligand-cell specific receptors can be identified, for example, by screening for host cells transfected with nucleotides encoding potential receptors.
- the ligands can be secreted (such as chemokines) or non-secreted (such as the extracellular domains of chemokines receptors) proteins.
- a library of host cells displaying putative ligand-cell surface receptors can be obtained by transfecting suitable host cells with nucleic acid constructs, including but not limited to cDNA or genomic libraries, under appropriate regulatory control to result in the expression of cell-surface receptors on the host cell.
- the fusion molecule with ligand is added to the population of host cells under conditions suitable for introduction. Introduction can be detected, for example, with a label.
- a similar approach can be used to select unknown ligands in the case where the ligand-cell specific receptors are known and the ligand is unknown.
- a library of fusion molecules with putative ligands e.g., chemokines
- a similar approach can be used to identify unknown ligands, test substances, drugs wherein the cell surface receptor is known, where the fusion molecule comprises a binding moiety which is a receptor which binds a surface molecule.
- the host cell expresses a distinct ligand or a collection of recombinant ligands. Ligand-receptor binding can be detected following introduction of the molecule to the host cell.
- an antigen or immunogen can be expressed heterologously (e.g., by recombinant insertion of a nucleic acid sequence which encodes the antigen or immunogen (including antigenic or immunogenic fragments) into a vector comprising HDAg).
- the antigen or immunogen and HDAg can be expressed in a live attenuated, pseudotyped virus vaccine, for example.
- the methods can be used to generate humoral and cellular immune responses, e.g. via expression of heterologous pathogen-derived proteins or fragments thereof in specific target cells.
- the dosage administered (e.g., the effective amount) will, of course, vary depending upon known factors such as the pharmacodynamic characteristics of the particular agent, e.g., the therapeutic binding entity, and its mode and route of administration; age, health, and weight of the recipient; nature and extent of symptoms, kind of concurrent treatment, frequency of treatment, and the effect desired.
- Effective doses can be determined by those of skill in the art.
- An effective dose of an agent is an amount sufficient to relieve the individual of the symptoms of the disorder which the agent is intended to treat.
- Methods of introduction of the agent at the site of treatment include, but are not limited to, intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous, oral, intranasal, gene therapy, cellular implantation or particle bombardment.
- Other suitable methods include or employ biodegradable devices and slow release polymeric devices.
- parenteral administration e.g., intravenous, subcutaneous, or intramuscular, would ordinarily be used to optimize absorption.
- injectable, sterile solutions preferably oily or aqueous solutions, as well as suspensions, emulsions, or implants, including suppositories.
- the molecule comprising the agent can be administered in a solution, suspension, emulsion or lyophilized powder in association with a pharmaceutically acceptable parenteral vehicle.
- parenteral vehicle examples include water, saline, Ringer's solution, dextrose solution, and 5% human serum albumin.
- Liposomes and nonaqueous vehicles such as fixed oils can also be used.
- the vehicle or lyophilized powder can contain additives that maintain isotonicity (e.g., sodium chloride, mannitol) and chemical stability (e.g., buffers and preservatives).
- the formulation is sterilized by commonly used techniques. Suitable pharmaceutical carriers are described in the most recent edition of Remington's Pharmaceutical Sciences, A. Osol, a standard reference text in this field of art. Ampules are convenient unit dosages.
- Formulations for transdermal or transmucosal administration generally include penetrants such as fusidic acid or bile salts in combination with detergents or surface-active agents. The formulation can then be manufactured as aerosols, suppositories, or patches.
- Oral agents may be administered if formulated as to be protected from digestive enzymes. If administered orally, the SCR-P will be administered in a therapeutic composition which may also include an appropriate carrier (e.g., a physiologically compatible carrier), a flavoring agent and a sweetener.
- an appropriate carrier e.g., a physiologically compatible carrier
- Suitable pharmaceutical carriers include, but are not limited to water, salt solutions, alcohols, polyethylene glycols, gelatin, carbohydrates such as lactose, amylose or starch, magnesium stearate, talc, silicic acid, viscous parafin, fatty acid esters, hydroxymethylcellulose, polyvinyl pyrolidone, etc.
- the pharmaceutical preparations can be sterilized and, if desired, mixed with auxiliary agents, e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, coloring, and/or aromatic substances and the like which do not deleteriously react with the active compounds. They can also be combined where desired with other active agents, e.g., enzyme inhibitors, to reduce metabolic degradation.
- auxiliary agents e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure
- HDAg molecules e.g. fusion molecules
- vectors e.g. cassette expression systems
- nucleic acid molecules such as the vectors described herein
- the vector comprising all or part of a nucleic acid sequence of FIG. 9 or FIG. 15 (synthetic) can be used to overexpress thee hepatitis delta antigen in bacteria.
- multiple copies of a binding moiety can be expressed using the vectors described herein, e.g., by inserting a nucleic acid cassette encoding the moiety into the vector, transforming a host cell with the vector and culturing the cell under conditions sufficient for expression of the moiety. Up to sixteen copies of the moiety (eight at the C terminus of the HDAg monomer and eight at the N terminus) can be made in this way.
- the vector is one depicted in a Figure selected from the group consisting of FIG. 13A, 13B , 13 C and 14 .
- a vector or molecule described herein can be used for a high valency expression of a binding moiety, for example, a peptide or protein domain, e.g., an antigen.
- the vector can be used to express a high valency display of at least two different peptides or protein domains, to enhance interaction between ligands. The interaction between ligands can occur in solution, on membranes or on surfaces.
- interaction e.g., fusion
- interaction between two cells (or two cell types) can be mediated or enhanced by, for example, associating an HDAg octamer expressing multiple copies of a domain which interacts with a ligand on the surface of cell type one and multiple copies of a domain which interacts with a ligand on the surface of cell type two.
- This method could work for embodiments involving more than two cells or cell types as well.
- a fusion molecule e.g., an octamer construct of the present invention, which can be coupled to a surface, for example, by chemical cross-linking or by inclusion of at least one copy of a second domain which interacts with a ligand displayed on a surface, can be used to display multiple copies of a domain on a surface.
- the octamer can be used to express different enzymes from a linked pathway on a single framework, e.g., for facilitating rapid exchange of substrates and products between enzymes from a single pathway. The enzymes implicated Krebs in the can be cycle.
- the octamer can also be used to link a first binding moiety, e.g. a peptide or domain mediating (specifying) an interaction (an “interaction” domain) and a second binding moiety, which mediates an effect (an “effector” domain).
- a first binding moiety e.g. a peptide or domain mediating (specifying) an interaction (an “interaction” domain) and a second binding moiety, which mediates an effect (an “effector” domain).
- the effector domain is a chemical, e.g. a drug, linked to the octamer via a free —SH group on the octamer.
- the interaction domain mediates interaction with a specific receptor on a cell surface, and the effector domain generates a specific function, such as cell killing.
- the octamer can contain one or more copies of an domain for interaction with a ligand, and one or more copies of a domain which is a label (e.g. alkaline phosphatase, radiolabel, streptodevice, and green fluorescent protein) for amplifying a signal in a solid phase assay, e.g. an ELISA assay.
- a label e.g. alkaline phosphatase, radiolabel, streptodevice, and green fluorescent protein
- an oligomer construct can be used to couple an oligonucleotide which interacts (e.g. hybridizes) with a nucleic acid molecule in the cell (e.g. a specific complementary DNA or RNA sequence) and one or more copies of an effector to target specific DNA or RNA sequences for cleavage.
- the effector is a double-stranded nuclease.
- the octamer is a mutant with a free sulfhydryl (—SH) group.
- An octamer construct can be used to couple multiple copies of a ligand in such a way that the interaction of the octamer with a cell triggers signaling or internalization by pathways which depend on the multimerization of a receptor or ligand on the cell surface.
- the vectors can be used to promote interaction between intracellular components in a signal transduction pathway, for example components which are upstream or downstream from each other.
- system can be used to mediate efficient (drive) gene expression by coupling (e.g. covalently) an enhancer recognition protein and at least one promoter binding protein.
- the system can be used as a high valency trap to identify the vector can be used as a diagnostic.
- the binding moiety is Sp120 and the molecule is used to test for the presence of Human Immunodeficiency Virus.
- at least one binding moiety is a drug which has a therapeutic effect when administered to an animal.
- a molecule of the present invention is used to screen test substances for an effect.
- an agent can be administered to a patient, wherein the agent will inhibit formation of the coiled coil HDAg oligomer.
- Peptides e.g.
- the inventions described herein are based upon the discovery that the HDAg protein oligomerizes to a coiled-coil olctamer.
- the HDAg protein is derived from the Hepatitis D virus which bind to peptides or protein domains.
- At least one binding moiety is a drug which has a therapeutic effect when administered to an animal.
- a molecule of the present invention is used to screen test substances for an effect.
- an agent can be administered to a patient, wherein the agent will inhibit formation of the coiled coil HDAg oligomer.
- the inventions described herein are based on the discovery that the HDAg protein oligomerizes to a coiled-coil octamer.
- the HDAg protein is derived from the Hepatitis D virus.
- Hepatitis B virus infection alone generally causes mild, sometimes chronic, hepatitis
- coinfection of hepatitis D virus (HDV) with hepatitis B virus (HBV) causes severe, and often fatal, liver disease in humans, and is the most common cause of fulminant viral hepatitis, Hoofnagle, J. H., J. Am. Med. Assoc., 261:1321-1325 (1989).
- the virus is an obligatory subviral satellite of HBV, requiring the hepatitis B surface antigen (HBsAg) for assembly and cell-to-cell transmission.
- HBsAg hepatitis B surface antigen
- Hepatitis delta encodes all of the information required to direct replication of its RNA genome by the host RNA Pol II. Efficient transmission of hepatitis delta virus requires that the viral RNA and the capsid protein be encapsidated within the hepatitis B virus surface antigen.
- the viral genome is a 1.7 kilobase single-stranded circular RNA, which is approximately 70% complementary to itself, Wang, K. S.
- HDAg exists in two isoforms. Early in the life cycle of the virus, HDAg is expressed as a 195-amino acid protein, the small hepatitis delta antigen (s-HDAg), which functions as a transactivator of HDV RNA replication. This form predominates early in infection. Kuo, M. Y. P, et al., J. Virol. 63:1945-1950 (1989). Later in the life cycle of the virus, there is an RNA editing event that changes the UAG stop codon of the HDAg-S to a UGG codon, encoding a tryptophan.
- s-HDAg small hepatitis delta antigen
- HDAg-L is a potent inhibitor (dominant repressor) of HDV replication, Chao, M. et al., J. Virol. 64:5066-5069 (1990), Glenn, J. S. & M. J. Virol. 65:2357-2361 (1991), and is also involved in packaging the viral RNA, Chang, F. L. et al. Proc. Natl. Acad. Sci.
- the N-terminal third of the small delta antigen contains a putative coiled-coil sequence, Xia, Y. P. & Lai, M. M. C., J. Virol. 66:6641-6848 (1992), Wang, J. G. & Lemon, S. M., J. Virol. 67:446-454 (1993), Chang, M. F. et al., J. Virol, 67:2529-2536 (1993), comprising heptad repeats, which is followed by a linker domain which contains bipartite nuclear localization signal.
- HDAg contains two arginine-rich motifs that have been shown to bind to the viral RNA.
- the C-terminal segment of s-HDAg is proline- and glycine-rich.
- L-HDAg is prenylated at the extreme C terminus and it is believed that this part of the molecule interacts with HBsAg and the membranes of the endoplasmic reticulum. Hwang, S. B. & Lai, M.
- the coiled-coil domain has been shown to be required for a number of the functions of both small and large delta antigens. Mutations that destroy or alter the coiled-coil domain either greatly reduce or totally eliminate the ability of the HDAg-S to function as a trans activator of replication, Chang, M. F. et al., J. Virol., 68:646-653 (1994), Chang, M. F. et al., J. Virol. 62:2403-2410 (1998), Lin, J. H. et al., J. Virol. 64:4051-4058 (1990), Chao, M. et al., J. Virol. 65:4057-4062 (1991), Xia, Y. P.
- HDAg-L is believed to disrupt the homo-oligomeric small antigen multimers, essentially poisoning the HDAg-S complex.
- the protein is not a polymerase, and RNA amplification is thought to be mediated by host cell RNA polymerase II, MacNaughton, T. G. et al., Virology 184:387-390 (1991), Fu., T. B. & Taylor, J. et al., J. Virol. 67:6965-6972 (1993).
- the peptide sequence was conceptually divided into three segments based on the presence of two potential helix breakers Gly23 (G23) and Pro49 (P49); segments A (residues 12-24), B (residues 25-49), and C (residues 50-60) ( FIG. 1 ).
- G23 G23
- Pro49 Pro49
- segments A residues 12-24
- B residues 25-49
- C residues 50-60
- FIG. 1 The full-length peptide ⁇ 12-60(Y) and two shorter peptides that corresponded to regions A+B and B+C were synthesized.
- Described herein for the first time is the crystal structure of the peptide ⁇ 12-60(Y) to 1.8 ⁇ resolution.
- the structure reveals that the capsid protein dimerizes as an unusual antiparallel coiled coil.
- the dimers further oligomerize to form an octamer.
- the octamer forms an open, square planar structure with an antiparallel dimer forming each side of the square.
- Crosslinking and hydrodynamic studies suggest that both the peptide and the full-length short isoform exist as stable octamers in solution.
- the structure of the peptide lends new insights into the mechanism by which HDAg dimerizes and further associates into higher ordered structures. The structure also explains why residues C-terminal to the predicted coiled-coil domain, and the helix-breaking proline residues are important for the stabilization of the coiled-coil structure.
- the peptide structure has important consequences for the in vivo oligomerization of HDAg.
- the unique octameric structure which is observed in the crystal structure also suggests that the N-terminus of the molecule may have a previously undetermined function.
- the peptide structure shows that hydrophobic residues from the N terminus of one monomer (region A), not involved in the heptad repeat, interact with residues outside of the predicted coiled-coil domain near the C terminus of the other monomer (region C) to form a hydrophobic core Trp20 (W20), Leu24 (L24), Trp50 (W50), Leu51 (L51) sandwiched between Arg13 (R13) and Arg24 (R24). This may stabilize the structure by keeping the ends of the helix from fraying.
- An additional stabilizing feature is a hydrogen bond between the sidechain of Glu45 (E45) and the indole nitrogen of Trp20 (W20).
- hydrophobic residues as well as the glutamic acid residue, are highly conserved in the 10 different strains of HDV identified to date ( FIG. 1 ). In fact, they are more conserved than those residues in the heptad repeat making up the hydrophobic core of the long helix ( FIG. 1 ).
- HDAg-L does disrupt the conformation of the oligomer of HDAg-S it probably does not do so directly through the multimerization domain, given that the large and small delta antigen share the same sequence within this region. Rather, it is possible that this ⁇ 7 or ⁇ n ⁇ m structure can no longer interact with host factors. Also, since the C terminus of the L-HDAg interacts with the endoplasmic reticulum (ER) membrane and with HBsAg for assembly, it could redirect the complex elsewhere in the cell, preventing the nuclear translocation of s-HDAg which is required for HDV replication.
- ER endoplasmic reticulum
- the octamer that is formed by the peptide is reminiscent of proteins that form clamps around DNA, such as PCNA. Talluru, S. R., et al., Cell 79:1233-1243 (1994).
- the 50 ⁇ hole formed by the octameric structure is lined with basic side chains, suggesting that the N terminus of the protein not only may act as a dimerization/oligomerization domain, but also that it may function either as a clamp around the viral RNA or other nucleic acid or perhaps even function as a spool for nucleic acid.
- the large size of the hole may be necessary to accommodate the viral RNA which is only 70% self complementary, and would possess a number of regions of bulged out single-stranded sequence, increasing the radius of gyration of the RNA as well as bending the RNA.
- the octameric structure also implies that there may be as many as four RNA-binding domains on each side of the octamer.
- This portion of the molecule may also bind another protein, especially one that is acidic, such as the recently discovered delta antigen interacting protein A (dipA), a cellular protein which has been found to interact with the HDAg Brazas, R. et al., Science 274:90-94 (1996) and; based on its amino acid sequence, would have an isoelectric point of 4.9.
- dipA delta antigen interacting protein A
- the hepatitis delta antigen (HDAg) the sole protein made by the hepatitis delta virus (HDV), is essential for viral replication in vivo. Oligomerization of the protein is necessary for both the transactivating function of the small delta antigen (HDAg-S) and the trans dominant inhibitory effect of the large delta antigen (HDAg-L).
- the structure of the peptide ⁇ 12-60(Y) that corresponds to the predicted coiled-coil domain of the hepatitis delta antigen HDAg suggests that delta antigen HDAg not only dimerizes through an antiparallel coiled coil, but also forms octamers.
- the coiled coil is stabilized by hydrophobic residues C terminal to the coiled-coil domain. These C-terminal residues interact with hydrophobic residues in the N terminus of the coiled-coil region.
- the hydrophobic core of the dimer is extended by further hydrophobic interactions at the interface between dimers in the octameric structure.
- these unique interactions at the termini of the monomer and dimer interfaces might provide a good target for antivirals against HDV, since disruption of oligomerization can prevent replication in vivo.
- the surprising octameric structure of the peptide suggests that the capsid of the delta antigen (HDAg) will look very different from the known structures of other viral nucleocapsid proteins.
- the octameric structure also suggests important implications for binding of HDAg to the viral RNA, since as many as four of the arginine-rich RNA-binding domains might be needed for binding to the viral RNA.
- the very basic hole in the octamer suggests that this portion of the molecule may act as a sort of “clamp” around an acidic molecule, such as viral RNA, another nucleic acid or a cellular factor.
- HDAg in viral replication is unclear.
- the protein may only function as a shuttle, binding to the viral RNA and transporting it into the nucleus of the infected cell. It is possible that HDAg functions to recruit host cell transcriptional machinery to the viral RNA.
- the discovery of the structure enables the design of experiments to determine whether the N terminus of the molecule has RNA-binding capabilities and investigate the mechanism of oligomerization and inhibition of small antigen by the large antigen.
- a systematic examination of the amino acids involved in dimerization and oligomerization would allow the determination of the mechanism by which HDAg-L inhibits HDAg-S.
- the unique interactions at the termini of the coiled-coil region provide a new framework to be exploited in the de novo design of stable antiparallel coiled coils.
- PEPTIDE SYNTHESIS The ⁇ 12-60(Y) peptide was obtained by Erickson, B. and Lemon, S. M. and was synthesized and purified as described previously in Rozzelle, J. E. et al., Proc. Natl. Acad. Sci. USA, 92:382-386 (1995), incorporated herein by reference in its entirety.
- the peptide ( FIG. 11B ) was assembled by fluorenylmethoxycarbonyl chemistry and purified by reversed-phase HPLC. It was N ⁇ -acetylated and C ⁇ -amidated. Crude peptide in 0.05% trifluoroacetic acid was separated on an octyl-silica column [C 8 , Applied Biosystems, 250 mm ⁇ 10 mm (i.d.), 300- ⁇ pore size] by elution at 3 ml/min over 50 min with a linear gradient of 20-42% acetonitrile in 0.05% trifluoroacetic acid.
- Peptide ⁇ 12-60(Y) was eluted at 36% acetonitrile (monitored at 230 nm). The homogeneity of the individual fractions was determined on an analytical octyl-silica column. The expected mass of the peptide was confirmed by electrospray ionization (ESI) mass spectrometry: peptide ⁇ 12-60(Y), m/z 6034.1 ⁇ 1.2 (calcd. 6033.7).
- ESI electrospray ionization
- the peptide from the 12-60 region of HDAg was synthesized ( FIG. 11B ).
- Peptide ⁇ 12-60(Y) included segments A, B, and C.
- Segment B contains three heptads in which the first and fourth heptad positions are occupied by five leucines and one isoleucine, and is probably part of an ⁇ -helical coiled coil.
- a tyrosine residue, (Y) was added, Lys 60 , to the C terminus of ⁇ 12-60(Y) to permit radioiodination.
- CD SPECTROSCOPY The ⁇ -helicity and the temperature at the midpoint of thermal denaturation (T m ) of the peptides were determined by CD spectroscopy. All three peptides had high ⁇ -helicity in PBS at 5° C.
- the ratio of the mean residue ellipticity of the negative bands near 222 nm and 208 nm ([ ⁇ 222 ])/([ ⁇ 208 ]) is an indicator of coiled-coil formation. Values close to 1.0 indicate an ⁇ -helical coiled coil and values near 0.8 indicate isolated ⁇ -helices. At 5° C., this ratio was 0.98 for ⁇ 12-60(Y).
- this ratio was 0.94 for ⁇ 12-60(Y), consistent with persistence of a coiled-coil structure. In contrast, at 37° C. this ratio was only 0.79 for ⁇ 2-49 and 0.76 for 625-60(Y), inconsistent with a coiled-coil structure.
- EXPRESSION PLASMIDS pR5 ⁇ V5 was constructed for the high-level expression of HDAg-S in Escherichia coli.
- the protein sequence of the American strain with the HDAg-S (GenBank accession no. M28267) was back-translated with the program BACKTRANSLATE. (This program was from the Wisconsin Package, versions 9.0 [Genetics Computer Group, Madison, Wis.], with E. coli codon frequencies obtained from gopher://weeds.mgh.harvard.edu:70/Oftp%3Aweeds.mgh.harvard.edu@/pub/codon/eco.cod.) With the sequence obtained as shown in FIG. 9 and FIG.
- the plasmid pR5 ⁇ V5 was constructed by a two-step PCR method, as described previously. Casimuro, D. R. et al., Biochemistry, 26:6640-6648(1995), with the exception that Vent polymerase (New England BioLabs) was used instead of Taq polymerase. Eight overlapping synthetic primers were synthesized ( FIG. 9 and FIG. 18 ). Changes in the back-translated sequences were made so that the overlaps of the PCR primers would have approximately the same melting temperature. Primers were electrophoresed into a 10% sequencing gel, visualized by UV shadowing, and excised from the gel. The primers were then purified with a Waters Sep-Pak column.
- the first PCR contained 4 pmol of each of the eight primers in a 100- ⁇ l reaction mixture. Ten microliters of the first PCR was added to a second reaction mixture that contained an upstream primer (5′-GGGCATATGAGCCGTAGCGA) and a downstream (5′-GCGCCATGGTTTACGGAAAG) primer designed to amplify the desired full-length product. Both reactions involved a hot start at 94° C. followed by 30 cycles of 1 min. at 94° C., 1 min at 57° C., and 1 min at 72° C., with a final 5-min extension at 72° C.
- the PCR product from the second reaction was cloned into the vector pCR-Blunt (Invitrogen), which allows selection based on disruption of a toxic gene. Plasmids isolated from colonies were checked for the insert by restriction digest mapping. The open reading frame of HDAg-S was subdloned into expression vector pRSETb (Invitrogen). The sequence of the resultant plasmid, pR5 ⁇ V5, was verified by dye termination sequencing.
- PROTEIN PURIFICATION Recombinant HDAg-S ( ⁇ Ag-S) was expressed and purified as follows. Plasmid pR5 ⁇ V5 was transformed into BL21 (DE3)pLysS cells (Novagen). A single colony was used to inoculate a 100-ml overnight culture. Ten milliters of this overnight culture was used to inoculate a 1-liter culture. At an optical density of between 0.4 and 0.6, the cells were induced with 3 ml of 100 mM IPTG (isopropyl- ⁇ -D-thiogalactopyranoside). Cell growth was continued for 3 h, and then cells were pelleted at 5,000 ⁇ g for 10 min. The cells were resuspended in 15 ml of 50 mM HEPES (pH 7.5)—250 mM NaCl—1 mM MgC 12 and stored at ⁇ 20° C. until needed.
- the frozen cells (45 ml corresponding to three 1-liter cultures) were thawed, and one Complete Protease Inhibitor tablet (Boehringer Mannheim) was added, along with RNase A and DNase I, to a final concentration of 50 ⁇ g/ml. Cells were lysed by sonication and pelleted at 10,000 ⁇ g for 30 min.
- the lysate was diluted threefold with 50 mM HEPES buffer (pH 7.5) and then applied to a 10 ⁇ 1.5-cm Fast SP Sepharose column (Pharmacia) equilibrated with 50 mM HEPES buffer (pH 7.5) and eluted with a salt gradient from 0 to 1 M NaCl in 50 mM HEPES (pH 7.5).
- the fractions containing ⁇ Ag-S were applied to a Superdex S-200 column (Pharmacia) equilibrated with 50 mM HEPES (pH 7.5), 500 mM NaCl, and 5% glycerol.
- the HDAg-S obtained was >85% pure as judged by Coomassie blue staining of a sodium dodecyl sulfate gel.
- Proteins with a histidine tag were purified as follows. Proteins expressed in E. coli were affinity purified with a Talon column according to the recommendations of the manufacturer (Clontech). Proteins expressed in mammalian cells were purified by the Invitrogen Xpress System. In both cases, the fractions containing the purified protein were identified by sodium dodecyl sulfate gel electrophoresis, pooled, dialyzed, and concentrated.
- HDAg-S or HDV RNA was omitted.
- cDNA transfections 500 ng of plasmid DNA was used.
- the transfection mixture was changed to Dulbecco's modified Eagle's medium supplemented with 10% fetal calf serum.
- cells were reseeded into a 30-mm-diameter dish containing a glass coverslip; at 8 days, the cells were examined by immunofluorescence microscopy.
- Eight days length was chosen for three reasons: (I) to avoid detection of the transfected HDAg-S; (ii) In the immunofluorescence assays, 8 days corresponded to the peak signal for a cell undergoing RNP-iniated replication; (iii) At 8 days, HDAg-L, created as a consequence of both RNA editing and genome replication, could be readily detected (Luo et al. J. Virol., 64:1021-1027 (1990)). In contrast, for Northern analyses, genome replication was detectable as early as 2 days.
- the peptide ⁇ 12-60(Y) was dissolved in 50 mM acetate, pH 4.8, 50 mM NaCl and brought to a concentration of 15 mg/ml.
- the crystals of the ⁇ 12-60(Y) peptide were grown at 22° C. by the vapor diffusion method.
- the peptide (2 ⁇ l of a 15 mg/ml solution) was mixed with 2 ⁇ l of the reservoir solution containing 100 mM sodium acetate, pH 4.8, and 100 mM sodium citrate, pH 5.6, on a coverslip and then inverted over the reservoir solution. Crystals appeared within 3-4 days, and grew as large as 0.5 ⁇ 0.3 ⁇ 0.3 mm.
- a peptide was synthesized with serine 22 replaced by a cysteine, ⁇ S22C12-60(Y).
- the 8S22C12-60(Y) peptide was reacted with an excess of platinum terpyridine, dialyzed overnight against water, and then freeze-dried.
- the peptide was then reconstituted at 15 mg/ml in 50 mM acetate, 50 mM NaCl, 5 mM DTT, pH 4.8, and crystallized by the same conditions as that of the wild type-peptide. This peptide crystallized isomorphously with the ⁇ 12-60(Y).
- the coverslips containing the crystals were inverted and cryosolvent (reservoir solution containing 30% glycerol) was slowly mixed with the drops and continuously replaced until no mixing was observed.
- the crystals were mounted in nylon loops and frozen directly in the nitrogen stream. Crystals used at Brookhaven were stored in liquid nitrogen until the time of data collection.
- Two native data sets were collected at Beamline X12C at the National Synchrotron Light Source at Brookhaven National Lab using X-rays of wavelength 1.15 ⁇ (Table 1).
- the heavy atom data set was collected on a Siemens rotating anode with a multiwire detector (Table 1). Data from the native crystals was processed using DENZO, Otwinowski, Z.
- STRUCTURE DETERMINATION AND MODEL BUILDING The positions of the heavy atom sites were determined using SHELXS-86 (Sheldrick, G. M., Acta. Cryst. A. 46:467-473 (1990)). The positions of the heavy atom sites were refined using MLPHARE (Otwinowski, Z., Proceedings of the CCP 4 Study Weekend, 80-86 SERC Daresbury Laboratory, Warrington, UK (1991)), and initial SIRAS phases were calculated. The data was then subjected to a round of solvent flattening with histogram matching using DM (Zhang, K. Y. J. & Main, P. Acta. Cryst. A. 46:41-46 (1990).
- a map was calculated which clearly showed the position of the two dimers in the asymmetric unit, and an initial model was built into the initial SIRAS map using the program O (Jones, T. A. et al. Acta Cryst. A., 47:110-119 (1990)).
- the structure was refined using X-PLOR v 3.8.9, Brunger, A. T., Yale University Press, New Haven, Conn. (1992). Rounds of positional refinement, followed by simulated annealing and B-factor refinement, were carried out with rebuilding of the structure using O between cycles of refinement.
- omit maps which excluded 10 residues at a time, were used to check the progress of refinement.
- PROTEIN EXPRESSION AND PURIFICATION The pR5 ⁇ V5 plasmid, Dingle, K. et al. J. Virol., 72(6):4783-4788 (1998) which contains a synthetic gene for the small delta antigen, HDAg-S, was transformed into BL21 (DE3)pLysS cells (Novagen) and purified as described previously in Example 2. See also Dingle, K. et al., supra. Briefly, 45 ml of frozen cells, corresponding to three 1 L cultures, were thawed and one protease inhibitor tablet (Boehringer Mannheim) was added, as well as RNAse A and DNAse I to a final concentration of 50 ⁇ g/ml.
- protease inhibitor tablet Boehringer Mannheim
- the sample was then applied to a Superdex S-200 column (Pharmacia) equilibrated with 50 mM Hepes, pH 7.5, 500 mM NaCl and 5% glycerol. The elution of the protein from the column was monitored by UV absorbance at 280 nm.
- the solvent-flattened map was easily interpretable, and clearly showed two dimers in the asymmetric unit.
- Rounds of positional refinement, simulated annealing, temperature factor refinement using X-PLOR (Bruinger, A. T., Yale University Press, New Haven, Conn. (1992)), and manual rebuilding using O (Jones, T. A. et al., Acta Cryst. A. 47:110-19 (1990)) led to the current model (Table 2, FIG. 2 ).
- the current model has an R factor of 22.5% and a free R factor of 27% with good geometry (r.m.s.d. bond 0.007 ⁇ and r.m.s.d. bond angles 1.0°).
- the four monomers in the asymmetric unit superimpose well onto one another, with an average r.m.s.d. for mainchain atoms of 0.81 ⁇ and for all non-hydrogen atoms 1.51 ⁇ .
- the main differences in the monomers are those residues involved in crystal packing interactions.
- Each monomer is composed of a long, N-terminal helix, approximately 60 ⁇ in length, interrupted by a sharp bend at proline 49 (Pro49), and continuing on into another short helix.
- the long helices of each of two monomers wrap around each other forming an antiparallel coiled coil ( FIG. 3 a, FIG. 3 b ), which straightens out at the N terminus.
- Only one of the four possible salt bridges between Glu31 (E31) and Lys38 (K38) is seen.
- the charged groups are slightly farther apart (3.8 ⁇ , 4.2 ⁇ and 4.4 ⁇ versus 2.9 ⁇ ) and the sidechains are hydrogen bonded to nearby solvent molecules.
- the sidechain of Glu45 (E45) is hydrogen bonded to the indole nitrogen of Trp20 (W20).
- Trp20 (W20) does not. Even though the C ⁇ -C ⁇ vector of Trp20 (W20) points out of the interface as would be expected for a sidechain in the a position of a heptad repeat, the sidechain of Trp20 is flipped away from the core of the coiled coil and into a hydrophobic region formed between segment A (residues 12-24) of one monomer, and segment C (50-60) of its partner within the peptide dimer. The dimer shows primarily hydrophobic interactions between residues in the A and C regions.
- Ile16 (I16), Leu17 (L17), Trp20 (W20), Trp50 (W50), and Leu51 (L51) are the sidechains primarily involved in this hydrophobic region, which is capped by the aliphatic portion of the sidechains of Arg13 (R13) and Arg24 (R24) ( FIG. 4 ).
- the primary non-hydrophobic, monomer-monomer interactions near this region involve the formation of a hydrogen bond between Trp20 (W20) and Glu45 (E45) ( FIG. 4 ).
- the heptad repeat is also unusual in that it contains a glycine at position 23.
- each dimer associates with three other dimers to form a doughnut-like octamer ( FIG. 5 ).
- the octameric complex forms a pseudo-centered (C222) cell.
- the octamer is widely open with a central “hole”, 50 ⁇ in diameter.
- the open structure of the octamer is reminiscent of several other proteins, including Proliferating cell nuclear antigen (PCNA), in which the hole that is formed is believed to encircle DNA (Talluru, S. R. et al., Cell, 79:1233-1243 (1994)). It is this octameric structure which is the translational repeating unit in the crystal ( FIG. 5 ).
- PCNA Proliferating cell nuclear antigen
- the dimer-dimer interface is a four-helix bundle formed across the crystallographic two-fold axis.
- the interface of the two dimers consists of hydrophobic residues in region A of the coiled-coil domain Leu17 (L17) and Val21 (V21) but also includes residues C-terminal to the coiled-coil domain, region C, between residues 50 to 60 Trp50 (W50), Ile54 (I54), Ile57 (I57) and Ile58 (I58) ( FIG. 6 ).
- Trp50 is involved in both the formation of the dimer as well as the octamer.
- Formation of the octamer buries an additional 800 ⁇ 2 of surface area per monomer, which means that approximately 40% of the total surface area of each monomer is buried.
- the 50 ⁇ diameter hole framed by the four dimers is lined with basic sidechains ( FIGS. 7 a and b ).
- the hole is large enough to accommodate an RNA molecule.
- Residues Lys26 (K26) and Lys38 (K38) which had been modeled in as alanine were changed to lysine for this calculation.
- the electrostatic surface was calculated using GRASP (Nicholls, A. Columbia University, New York, N.Y.(1992)), and rendered using RASTER3D (Merritt, E. A. & Murphy, M. E. P., Acta.
- R anom ⁇ ⁇ ⁇ ⁇ I ⁇ + ( h , k , l ) > - ⁇ I ⁇ - ( h , k , l ) > ⁇ / ⁇ ⁇ ( ⁇ I ⁇ + ( h , k , l ) > + ⁇ I ⁇ - ( h , k , l ) > )
- ⁇ I+/ ⁇ (h,k,l) > represents the statistically weighted average intensity of symmetry-equivalent reflections.
- ⁇ ⁇ R iso ⁇ ( h , k , l ) ⁇ ⁇ ( F PH - F P ) ⁇ / ⁇ ( F P ) .
- # ⁇ R cullis ⁇ ( h , k , l ) ⁇ ⁇ ⁇ F PH ⁇ - ⁇ F P + F H ⁇ ⁇ / ⁇ ( h , k , l ) ⁇ ⁇ F PH - F P ⁇ ; number in parentheses represents R cullis for centric reflections.
- ⁇ ⁇ R free ⁇ ( h , k , l ) ⁇ ⁇ ( ⁇ F o ⁇ - ⁇ F c ⁇ ) ⁇ / ⁇ ( F o ) , for a test set composed of 10% of the data selected randomly.
- r-HDAg-S was prepared as described above.
- the samples for mass spectrometry were prepared as follows: the r-HDAg-S was dialyzed overnight against water.
- Cross-linked protein was prepared by the addition of 5 ⁇ l of 0.5% glutaraldehyde to 40 ⁇ l of rHDAg-S for 5 minutes, and quenched by the addition of 5 ⁇ l of 1 M ammonium acetate.
- Mass spectrometry was performed in the BCMP Biopolymer facility on a Persceptive Biosystems Voyager-DE mass spectrometer.
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Virology (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
This invention relates to HDAg peptides, including mutants, derivatives fragments and fusion molecules, including fusion proteins, coiled-coil oligomers, nucleic acid molecules, vectors comprising HDAg nucleic acid molecules, cells comprising said molecules, methods of multivalent expression and association of binding moieties of HDAg fusion molecules, and methods of use involving the molecules. The molecules are particularly useful as a framework for multivalent display via formation of C-terminal and/or N-terminal fusion proteins or via chemical coupling to chemically reactive sidechains, e.g. cysteine residues.
Description
- This application is a divisional of U.S. application Ser. No. 09/347,175, filed Jul. 1, 1999, which claims priority to U.S. Provisional Application No. 60/091,609 filed Jul. 2, 1998. The entire teachings of the above applications are incorporated herein by reference.
- The invention was made with Government support from the National Institutes of Health under grant A132480 and R01-HL37974 and the U.S. Public Health Service (GM42031) and the Giovanni Armorisse Harvard Center for Structural Biology. The Government has certain rights in the invention.
- The hepatitis D virus (HDV) is a small satellite virus of hepatitis B virus (HBV). Coinfection with HBV and HDV causes severe and sometimes fatal liver disease in humans. The HDV genome encodes a single known protein, the hepatitis delta antigen (HDAg).
- This invention is based on the discovery of the high resolution crystal structure of a synthetic peptide corresponding to residues 12-60 of the hepatitis delta antigen (HDAg). This peptide includes a coiled-coil region believed to be important for dimerization of HDAg. The peptide forms an antiparallel coiled coil with hydrophobic residues near the termini of each peptide forming an extensive hydrophobic core with residues C-terminal to the coiled-coil domain in the dimer protein. The crystal structure shows how HDAg forms dimers, but, surprisingly, also shows the dimers forming an octameric structure that forms a large 50 Å ring lined with basic sidechains.
- The dimers associate further to form octamers through residues in the coiled-coil domain that are not involved in a heptad repeat, as well as through residues C-terminal to this region. The crystal structure of the peptide and cross-linking hydrodynamic studies which show that the full-length recombinant protein also forms octamers suggest that the structure of the delta antigen represents a previously unseen organization of a viral nucleocapsid protein. This N-terminal octamer can serve as a convenient high-valency framework for linking a variety of functional peptides and domains.
- The invention includes HDAg proteins, including derivatives, mutants and fragments, and nucleic acid molecules encoding HDAg. Derivatives of HDAg protein include fusion molecules. In one embodiment, the fusion molecule comprises HDAg and at least one binding moiety bound, for example, to the HDAg through the C terminus, N terminus and/or other amino acid. The binding moiety can be selected from the group consisting of an antigen, an antibody, a ligand, a receptor, an enzyme, a ligand interaction peptide, a chemical, an effector, an oligonucleotide, a signal amplification peptide, an enhancer recognition protein, a promoter binding protein, a label, a growth factor, a cytokine, a nuclease, a small organic molecule, a test substance, a cytotoxic agent, a substrate, a solid substrate, a drug, or a fragment thereof. The fusion molecules of the invention can also comprise two binding moieties which are binding partners. The fusion molecule can be a fusion protein. The HDAg and the binding moiety can be chemically linked or the HDAg and the binding moiety can be expressed as a single unit.
- The invention also relates to coiled-coil oligomers comprising at least two such fusion molecules. The coiled-coil oligomer can be an octamer. In the coiled-coil oligomer, the two fusion molecules can be the same or different.
- The invention also relates to nucleic acid molecules. For example, a nucleic acid molecule can comprise a nucleotide sequence depicted in
FIG. 9 , nucleotides 37-150 ofFIG. 9 , nucleotides 37-186 ofFIG. 9 ,FIG. 10 , nucleotides 1421-1566 ofFIG. 10 , nucleotides 1457-1566 ofFIG. 10 ,FIG. 15 orFIG. 16 . The nucleic acid molecule can also comprise a nucleotide sequence which encodes a polypeptide comprising an amino acid sequence depicted in a row ofFIG. 1 , amino acids 12-48 of a row ofFIG. 1 , the top row ofFIG. 3C ,FIG. 9 , amino acids 12-48 of a row ofFIG. 9 ,FIG. 10 , amino acids 12-88 ofFIG. 10 ,FIG. 11 orFIG. 17 . Also included are complementary strands of these sequences, DNA sequences that hybridize to the sequences, RNA sequences transcribed from the sequences, or a fragment or mutation thereof, which encodes a coiled-coil oligomer. - An isolated nucleic acid molecule can be a fusion molecule described herein. The invention also includes fusion genes comprising an HDAg nucleic acid molecule operably linked to a nucleic acid molecule encoding a heterologous (non-HDAg) peptide.
- Also encompassed in the scope of the invention are isolated, purified and/or recombinant peptides and molecules comprising peptides. In one embodiment, a polypeptide comprises an amino acid sequence encoded by an HDAg nucleic acid molecule. The molecules can comprise a polypeptide having an amino acid sequence selected from the group consisting of an amino acid sequence depicted in a row of
FIG. 1 , amino acids 12-48 of a row ofFIG. 1 , amino acids 12-60 of a row ofFIG. 1 , the top row ofFIG. 3C ,FIG. 9 , amino acids 12-48 ofFIG. 9 , amino acids 12-60 ofFIG. 9 ,FIG. 10 ,FIG. 11 andFIG. 17 , or a fragment or derivative thereof which forms a coiled-coil oligomer. The peptide can be a derivative peptide wherein a serine residue is substituted with cysteine. The molecules can comprise a polypeptide comprising an amino acid sequence of amino acids 12-88 of HDAg, or a fragment or derivative thereof which forms a coiled-coil oligomer and nuclear localization signal. The polypeptides can be encoded by fusion genes comprising HDAg. It is possible that the molecule can be larger than the 12-48 or 12-60 or 12-88 amino acids, for example. It may be desirable to make a 12-65 or 10-93 peptide, for example. - The invention also includes vectors which can express HDAg. The vectors can comprise a nucleic acid molecule which encodes a subunit of an HDAg coiled-coil octamer. The nucleic acid molecule can comprise a sequence listed above. The nucleic acid molecule can encode a fusion molecule. The vector can be a nucleic acid molecule encoding HDAg and at least one multiple cloning site. The multiple cloning site(s) can be located 3′ to the nucleic acid molecule encoding HDAg or 5′ to the nucleic acid molecule encoding HDAg. There can be two or more multiple coding sites, wherein at least one multiple coding site is located in a
flanking region 3′ to the nucleic acid molecular encoding HDAg and/or at least one multiple coding site is located in aflanking region 5′ to the nucleic acid molecule encoding HDAg. The vector can further comprise a nucleic acid molecule encoding a nuclear localization signal. A vector can further comprise a nucleic acid molecule which encodes a heterologous gene. The vector can express a fusion molecule of HDAg wherein a first heterologous gene encodes a first binding moiety and a second heterologous gene encodes a second binding moiety. - The invention also encompasses host cells which comprise a nucleic acid molecule which encodes a molecule of HDAg, including a fusion molecule.
- The invention also includes methods of manufacturing such a host cell comprising a nucleic acid molecule encoding a fusion molecule comprising HDAg and at least one binding moiety, by introducing a vector of the invention into the host cell.
- The invention also relates to methods of using the molecules, i.e., peptides, nucleic acids, and vectors of the invention. One method comprises expressing a high valency display of at least one binding moiety comprising introducing into a cell with a vector comprising a nucleic acid molecule encoding HDAg and a nucleic acid molecule encoding the binding entity and culturing the cell under conditions sufficient to permit expression of the binding moiety and HDAg.
- The invention also encompasses a method of enhancing interaction between binding partners comprising contacting a fusion molecule of HDAg with a second binding moiety wherein the first and second moieties are binding partners. The fusion molecule can present the first and second moieties. The interaction between ligands can occur in solution, on membranes or on surfaces. The fusion molecule can be a subunit of a coiled-coil oligomer, e.g., an octamer, and the first and second moieties are bound to the oligomer. In one embodiment, fusion of a first cell and a second cell is enhanced.
- The invention also includes a method for delivering molecules to a cell comprising contacting them with an HDAg fusion molecule. In one embodiment, the binding moiety is an oligonucleotide. The oligonucleotide can hybridize to a nucleic acid molecule in the cell. The fusion molecule can further comprise a double-stranded nuclease. In one embodiment, the fusion molecule comprises a first binding moiety and a second binding moiety wherein the first binding moiety interacts with a binding partner and the second binding moiety functions as an effector. The first binding moiety can interact with a cell surface receptor on a cell and the second binding moiety can kill the cell.
- The invention also includes a method of amplifying a signal in a solid phase assay comprising coupling an HDAg octamer with at least one copy of a domain which interacts with a ligand and at least two copies of a label. The label can be, for example, alkaline phosphatase, a radiolabel, streptadavin, and green fluorescent protein. In one embodiment, the solid phase assay is an ELISA assay. The invention also encompasses a method of facilitating exchange of substrates and products comprising coupling an HDAg oligomer to at least two enzymes which function in a linked pathway.
- The invention also encompasses a method of enhancing a reaction between at least two binding partners comprising coupling the binding partners to an HDAg oligomer. In a different embodiment, the method of enhancing a reaction between two binding partners comprises coupling one binding partner to an HDAg oligomer and contacting the oligomer to a second binding partner.
- The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings will be provided by the Office upon request and payment of the necessary fee.
-
FIG. 1 depicts sequence alignment of 11 serotypes of hepatitis delta antigen (HDAg) betweenamino acids 12 to 60. Asterisks, *, indicate residues which make up the a and d positions in the heptad repeat in the predicted coiled-coil region. Bold pink and purple indicate residues involved in the hydrophobic interactions in the dimer between the two termini. The “ds” indicate residues involved in the dimer-dimer interface. A region (pink), B (green), C (purple). -
FIG. 2 depicts the final atomic model superimposed upon a portion of the final 1.8 A resolution 2FO-FC map. The map is contoured at 1.2σ and shows the residues involved in the interaction between monomers at the A and C regions. Orientation is similar to that inFIG. 4 . Yellow indicates carbon, red indicates oxygen and blue indicates nitrogen. The figure was produced using BOBSCRIPT. -
FIG. 3A depicts Cα trace of the peptide δ12-60(Y). A region pink, B region green, and C region purple. The individual helix takes a sharp bend at proline 49 (Pro49).FIG. 3B is a ribbon diagram of the view inFIG. 3A rotated 90° along the horizontal axis. The sidechains have been added and the C region of the peptide (residues 50-60(Y)) has been removed for clarity. Sidechains are colored as follows: hydrophobic gray, polar yellow, acidic red and basic blue.FIG. 3C is the amino acid sequence of the long helix formed fromresidues 12 to 48 displayed in the antiparallel orientation of the peptide. The letters above the amino acid sequence represent the heptad repeat (abcdefg)n where the a and d residues tend to be hydrophobic. The residues involved in the heptad repeat at the a and d positions are shown in bold. -
FIG. 4 depicts the monomer-monomer interactions. The regions are colored as follows: A region pink, B region green, and C region purple. The white row of X's indicates a hydrogen bond between the sidechain of Glu45 (E45) and the indole nitrogen of Trp20 (W20). This figure was produced using RIBBONS, Carson, M. & Bugg, C. E., J. Mol. Graphics, 4:121-122 (1986). -
FIG. 5 depicts the interactions of dimers in theP2 1212 unit cell. The unit cell is outlined in black and the directions of the A and B axes are shown. The two independent copies of the dimer in the asymmetric unit are colored orange and blue. The view is looking down the crystallographic 2-fold axis. This figure was produced using the program MOLSCRIPT (Kraulis, P. J. J. Appl. Cryst. 24:946-950 (1991)). -
FIG. 6 depicts the dimer-dimer interface.FIG. 6A illustrates that the dimer-dimer interface is composed of a four-helix bundle made of the N and C termini of two dimers, one from across the crystallographic two-fold axis. One (unlabeled) dimer is colored yellow and the other (labeled) dimer is colored according to the scheme used inFIG. 1 .FIG. 6B depicts the view inFIG. 6A rotated 90° around the y axis. This figure was created with RIBBONS (Carson, M. & Bugg, C. E., J. Mol. Graph. 4:121-122 (1986)). -
FIG. 7A is a GRASP electrostatic potential surface of the octameric δ12-60(Y) peptide contoured at=10 kT/e (positive potential blue) and=10 kT/e (negative potential in red) (K is Boltzman's constant and T is temp ° K. The edges and the lining of the large 50 Å hole are basic.FIG. 7B illustrates the hole formed by the octamer. -
FIG. 8 graphically illustrates MALDI-TOF mass spectrometry analysis of recombinant small delta antigen (r-HDAg-S) (FIG. 8A ), and the glutaraldehyde cross-linked protein (FIG. 8B ). -
FIG. 9 depicts the sequence of a synthetic gene for optimized expression of HDAg-S in E. coli. The synthetic gene has been modified such that the codon usage which is unusual in the natural gene is assistant with the known preferences for codon usage in E. coli. The underlined sequences correspond to the eight primers used in the first round of PCR. The primers used in the second round are indicated with a dotted underline. The amino acid sequence is shown above the DNA sequence by the one-letter amino acid code. The restriction sites used in cloning are shown in italics. -
FIG. 10 depicts the complete sequence of human HDV cDNA, and the predicted amino-acid sequence of human HDV delta antigen. -
FIG. 11 depicts synthetic peptides from the multimer-forming domain of HDAg. (A) Structural organization of HDAg. The lightly stippled region is the multimer-forming domain (amino acid residues 12-60), the solid regions are the RNA-binding domains, and the heavily stippled region is the C-terminal extension of large HDAg. Hydrophobic residues contributing to the heptad repeat are shown in boldface type, (B) Amino acid sequences of three HDAg peptides. -
FIG. 12 depicts use of the delta antigen as a scaffold. A construct containing eight appended protein domains on the octameric framework.FIG. 12A depicts an oligomerization domain and spheres which represent potential effectors/ligands/nucleic acid binding domains etc. that have been fused to the C-terminus of the oligomerization domain of the delta antigen.FIG. 12B depicts binding domain which is a multimer, specifically a tetramer. A similar fusion of up to eight effector/legands/nucleic acid binding domains could be made at the N-terminus, and constructs with fusion domains at up to eight N-termini and up to eight C-termini could also be made. -
FIG. 13 depicts plasmids comprising HDAg for expression in bacteria. “DAg” refers to a delta antigen sequence (with or without nuclear localization sequence); “MCS1” refers to a multiple cloning site for insertion of a heterologous gene at N-terminal end of delta antigen; “MCS2” refers to a multiple cloning site for insertion of a heterologous gene a C-terminal end of delta antigen; “Ori” refers to the origin of replication for the bacteria; “drug” refers to a drug marker; “f1 ori” refers to origin of replication of single-stranded DNA by a bacteriophage; “promoter” refers to bacterial promoter.FIG. 13A depictsMCS1 3′ to HDAg;FIG. 13B depictsMCS1 5′ to HDAg; andFIG. 13C depictsMCS1 3′ andMCS2 5′ to HDAg. -
FIG. 14 depicts a plasmid comprising HDAg for expression in eukaryotic cells. “DAg refers to a delta antigen sequence (with or without nuclear localization sequence). “MCS1” refers to plasmids comprising replication of plasma in bacteria; “Ori” refers to origin of expression or replication of the plasmid in bacterial cells; “drug1” refers to a drug marker (e.g. ampicillin, kanamycin) for propagation of plasmid in E. coli; “drug2” refers to a drug marker or eukaryotic drug resistance (e.g. neomycin, zeocin, hybromycin), for propagation of plasmid in a eukaryotic cell; “f1 ori” refers to origin of replication of single-stranded DNA by bacteriophage; “promoter” refers to eukaryotic promoter. -
FIG. 15 is a comparison of the wildtype nucleotide sequence of HDAg-S and the sequence of the synthetic HDAg gene for optimized expression in E. coli. -
FIG. 16 is the nucleotide sequence of the synthetic open reading frame (ORF) for the synthetic HDAg. -
FIG. 17 is a comparison of the protein amino acid sequence encoded by the wildtype ORF and the synthetic ORF, showing complete (100%) identity. -
FIG. 18 depicts the nucleotide sequences of the primers used for the two polymerase chain reactions (PCR) to create the synthetic gene. Primer1-primer8 were used in the first round of PCR and primer9-primer10 were used in the second round of PCR. - As set forth above, the present invention relates to the discovery of the oligomeric structure of the hepatitis delta antigen (HDAg) which serves as a convenient high-valency framework for linking a variety of binding partners, including functional peptides and domains. The structure of the antigen includes a doughnut-shaped octamer comprising N-terminal antiparallel coiled-coil domains and stabilizing C-terminal domains. The invention includes HDAg proteins, including derivatives, mutants and fragments, and nucleic acid molecules encoding HDAg. It also includes an altered HDAg gene for the capsid protein wherein the codons conform to use preferences for E. coli. Included in the derivatives are fusion molecules, e.g., fusion proteins, in which one or more binding moieties are attached to one or both termini of a monomer and coiled-coil oligomers (e.g., octamers) formed from the monomers. Coiled-coil oligomers of the present invention can comprise one or more fusion molecules as described herein. The binding moieties can be, for example the same (homologous) or different (heterologous) binding partners. The invention also includes vectors and cassette expression systems which can be used to produce the fusion molecules. The vectors comprise HDAg and one or more binding moieties which are operably linked to HDAg. The invention also relates to cells comprising HDAg nucleic acid, e.g. cells transformed with such vectors, and to methods of producing such cells. The invention also includes therapeutic and diagnostic methods involving HDAg.
- HDAg Peptides
- HDAg, as defined herein, includes both the large and small delta antigens (HDAg-S and HDAg-L). HDAg encompasses native (“wild type”) proteins and also includes derivatives, mutations, and functional protein or polypeptide fragments of the native protein and/or proteins or polypeptides where one or more amino acids have been deleted, added or substituted.
- HDAg can be isolated and/or purified, or it can be recombinant or prepared by synthetic techniques described herein or known to those of skill in the art. HDAg proteins or fragments can be isolated from the cell of origin or produced synthetically or recombinantly. In a preferred embodiment, the protein is isolated to the substantial absence of conspecific proteins. A conspecific protein is a protein other than HDAg which can be obtained from the cell of origin for the protein or its nucleic acid. The proteins (and nucleic acids) described herein can be preferably isolated, by known methods, to a purity of at least about 50% by weight, more preferably at least about 75% and most preferably to substantial homogeneity. “Substantial homogeneity” refers to the substantial absence of conspecific proteins.
- An HDAg peptide can, e.g., include all or a portion of the amino acids depicted
FIGS. 1, 3C , 10, 11 or 17. An HDAg peptide can be encoded by an isolated and/or purified or recombinant nucleic acid molecule or a fusion gene (nucleic acid molecule) such as those described herein. In a preferred embodiment, an isolated and purified polypeptide has an amino acid sequence depicted in a row ofFIG. 1 , amino acids 12-48 of a row ofFIG. 1 , amino acids 12-60 of a row ofFIG. 1 , a row ofFIG. 3C ,FIG. 9 , amino acids 12-48 ofFIG. 9 , amino acids 12-60 ofFIG. 9 ,FIG. 10 ,FIG. 11 ,FIG. 17 or a fragment or derivative thereof, which forms a coiled-coil oligomer. - In another embodiment, an isolated and purified polypeptide has the amino acid sequence of amino acids 12-88 of HDAg, or a fragment or derivative thereof, which forms a coiled-coil oligomer and nuclear localization signal.
- “Homology” is defined herein as sequence identity. Preferably, the protein or polypeptide shares at least about 50% sequence identity or homology and more preferably at least about 75% identify or at least about 90% identity with the corresponding sequences of the native protein, for example, with
FIG. 10 . The phrase “substantially the same sequence” is intended to include sequences which bind the viral protein and possess a high percentage of (e.g., at least 90%, preferably at least about 95%) amino acid sequence identity with the native sequence. For example, a derivative, e.g., a mutant or variant can possess substantially the same amino acid sequence as the native protein. - The modifications to the amino acid sequence (substitutions) can be conserved or non-conserved, natural or unnatural amino acids. The residues that function to form or stabilize the coiled-coil domain or binding sites thereof can be substituted, e.g., conservatively, or they can be maintained. Amino acids of the native sequence for substitution, deletion or conservation can be identified, for example, by a sequence alignment between proteins from different serotypes from related species or other related proteins. In one embodiment, the amino acids which are deleted, added or substituted are amino acids which are not “conserved” between serotypes or species, for example, the amino acids so identified in the sequence alignment exemplified in
FIG. 1 . Conserved amino acids may also be substituted. In one embodiment they are substituted conservatively, for example, substituted by structurally similar amino acids. The phrase “conservative amino acids substitutions” is intended to mean substitutions of amino acids which possess similar side chains (e.g., hydrophobic, hydrophilic, basic acidic, aromatic, and aliphatic) as is known in the art. See, for example, Hermanson, G.I. Bioconjugate Techniques, Academic Press, Inc. San Diego, Calif. (1996). Conservative substitutions include amino acid substitutions of one hydrophobic amino acid for another, for example within the following grouping: W, F, A, P, L, M, I, V. Acidic amino acids include E and D; basic amino acids include K, R, and H. Polar amino acids include S, T, N, Q and G and amide residues include Q and N. An example of a suitable derivative or mutant of the HDAg protein is a protein possessing a consensus sequence of the originating species. - In one embodiment, the derivative does not contain substitutes of the residues of Arg13, Leu17, Trp20, Arg24, Trp50 or Leu51. In another embodiment, it contains only conservative substitutions in this region. Hydrophobic residues, for example Ile16, Leu17, Trp20, Trp50 and Leu51 can be maintained, or they can be replaced with other hydrophobic amino acids, for example, those from the group consisting of Trp, Phe, Ala, Pro, Leu, Met, Ile and Val. In another example, the residues Glu31, Lys38, Trp20 and Glu45 are not substituted or are substituted conservatively. In addition, Arg13 and Arg24 can be maintained (not substituted) or substituted conservatively. In another embodiment, the residues of
FIG. 1 which are involved in hydrophobic interactions are substituted with other hydrophobic residues. InFIG. 3A , hydrophobic residues can be substituted for other hydrophobic residues, polar residues can be substituted for other polar residues, acidic residues can be substituted for other acidic residues, and/or basic residues for other basic residues. In one example, the residues labeled inFIG. 3A can be maintained (not substituted) or can be replaced with amino acids with similar characteristics. The amino acids at the ‘a’ and ‘d’ positions of the heptad repeat (for example, those indicated with an asterisk inFIG. 1 or those listed in bold inFIG. 3C ), can be conserved (maintained), or they can be substituted conservatively, e.g., replaced with hydrophobic amino acids. The residues involved with the dimer-dimer interface (e.g., residues marked with a ‘d’ inFIG. 1 or residues labeled inFIG. 6 ) can be maintained. The residues indicated inFIG. 4 can be maintained. The derivatives, e.g. mutant and wild-type peptides, can crystallize isomorphously. In one preferred embodiment, at least one serine residue, e.g., Ser22, is replaced with a cysteine. In another embodiment, Trp20 is replaced with Ala20. - A variety of substitutions based on amino acid characteristics can be made. For example, the polar amino acid residues can be substituted and the hydrophobic amino acids can be maintained. In addition the nonhydrophobic residues can be substituted and the hydrophobic residues can be maintained. In one embodiment, a derivative can comprise substitutions of any or all of the amino acid residues in the following positions: 14, 15, 18, 19, 22, 24, 25, 26, 28, 29, 31, 32, 33, 35, 36, 38, 39, 40, 42, 43, 45, 46 and 47. The nonhydrophobic residues can be substituted such that acidic amino acids are alternated with basic amino acids. The hydrophilic residues in the C terminal region can be substituted to optimize stability of the helix, for example by presenting one or more amino acids which form disulfide bonds, strong ionic bonds or cross-linked moieties, with the corresponding amino acid of another subunit. In one embodiment,
residues - Especially preferred are derivatives, e.g., coiled-coil subunits, which improve (e.g., optimize) stability of the coiled-coil structure or which improve cross-linkage involving the structure, or which improve the ability of the structure to be immobilized on a solid substrate.
- The term derivative is also intended to include proteins which have been labeled, such as with a radioactive or calorimetric label. Such derivatives are more readily detected in an assay. In one embodiment, a peptide is synthesized that corresponds to residues 12-60 of HDAg, and includes a C-terminal tyrosine, enabling the peptide to be labeled, e.g., with I25, for use in a radioimmunoassay. In one embodiment, the peptide is δ12-60(Y). Yet other derivatives are proteins which consist essentially of the amino acid sequence of a given protein (e.g., possess the relevant sequence and, optionally, other amino acids residing at the termini which do not significantly alter or detract from the properties of the protein).
- A “functional” fragment, derivative, mutant, or allelic variant is of sufficient length and/or structure as to possess one or more biological activities of the protein. One example of such a biological activity of the protein is formation of a coiled-coil oligomer, e.g., an octamer, for example, an octamer doughnut-shaped structure. In one embodiment, the protein derivative is conserved within the coiled-coil regions but is lacking in or mutated within one or more other regions (e.g., sequences not within the coiled-coil. Examples of suitable fragments include peptides lacking fragments which encode or stabilize the coiled coil, for example, amino acids 12-48, or the peptides depicted in
FIG. 11 . One example includes fragments which lack all or part of the region C-terminal to the proline bend (e.g. C-Terminal to Pro49). Another fragment includes the coiled coil and nuclear localization signal (e.g., amino acids 12-88); or solely the nuclear localization signal (amino acids 68-88). Xia et al., J. Virol. 66:914-21 (1992). Yet another example includes HDAg which encodes the coiled-coil region but is lacking all or a portion of the nuclear localization signal. In one embodiment, all or a portion of one or both termini of a monomer is absent or mutated. For example, the C region of the peptide (e.g., residues 50-60) can be mutated or all or a portion can be eliminated. Yet another example of derivatives includes peptides which possess amino acid modifications or additions which are characterized by a functional group which can react with a compound substituted by a “binding moiety”, such as those described above or with a cross-linking agent. - Yet other biologically activities are the ability of HDAg-S to function as a trans activator of replication and the ability of HDAg-L to act as an inhibitor of replication. In yet another embodiment, the biological activity of the protein is antigenic or immunogenic activity.
- Fragments of the protein can possess at least about 10 amino acids from the 12-48 amino acid region, preferably at least about 20 amino acids. In other embodiments, the fragment possesses essentially all of the amino acids of the full-length protein (e.g., at least about 85%, or at least about 95%).
- HDAg includes monomers and oligomers, e.g., dimers and octamers, comprising the monomers as subunits. The invention encompasses HDAg coiled-coil oligomers, e.g. octamers.
- Fusion Molecules
- Fusion molecules are intended to be included within the definition of HDAg derivatives and can be made by linking one or more binding moieties, e.g., chemicals or peptides, to the HDAg protein or fragment, for example, through a covalent bond or preferably a peptide bond or cysteine group. As such, derivatives, such as fusion proteins, can comprise the amino acid sequence of the HDAg protein or fragment and a binding moeity such as a given protein, e.g., a native protein.
- A “binding moiety,” as the term is defined herein, includes a chemical entity which is bound to HDAg. The binding can be via a covalent bond (e.g. through a cysteine group), ionic bonding, hydrogen bonding or other mechanism. The binding moiety and the HDAg can be expressed as a single unit. The binding moiety can be a peptide (including post-translationally modified proteins, such as amidated, demethylated, glycosylated or phosphorylated proteins), sugar, lipid, steroid, nucleic acid, small molecule, anion or cation, drug, chemical or combination thereof which binds the specified binding partner (e.g., a target molecule). Preferably, the binding will possess a high affinity. Examples of high affinity can have a dissociation constant of 10−5M (preferably 10−8M) or lower. Examples of binding moieties include an antigen, an antibody, a ligand, a receptor, an enzyme, a (ligand) interaction peptide, a chemical, an effector, an oligonucleotide, a signal amplification peptide, an enhancer recognition protein, a promoter binding protein, a label, a growth factor, a cytokine, a nuclease, a small organic molecule, a test substance, a cytotoxic agent, a substrate, a solid substrate, a drug or a fragment thereof.
- Where there are two or more binding moieties, the binding moieties can be the same (homologous) or different (heterologous). The binding moieties can be binding partners. Examples of binding partners include, but are not limited to, antigen-antibody and ligand-receptor. First and second binding moieties can also include the following pairs: enzyme1-enzyme2, (ligand) interaction peptide-effector peptide (or chemical), oligonucleotide-nuclease, interaction agent (e.g. peptide signal amplification agent, (e.g. peptide), enhancer recognition agent (e.g. protein)-promoter-binding agent (e.g. protein), enhancer recognition agent-promoter binding agent, ligand-label, test substance-label, targeting agent—effector agent, drug (or hormone) -label, or any other combination.
- In one embodiment, a first binding moiety binds, a target molecule on a target cell (e.g. a surface protein) and the binding partner is the surface protein or target cell. The “target cell” is defined as the cell which is intended to be contacted by the fusion cell. Typically, the target cell is of animal origin and can be a stem cell or somatic cell. Suitable animal cells for use on the claimed invention can be of, for example, mammalian and avian origin. Examples,of mammalian cells include human, bovine, ovine, porcine, murine, rabbit cells. The cell may be an embryonic cell, bone marrow stem cell or other progenitor cell. Where the cell is a somatic cell, the cell can be, for example, an epithelial cell, fibroblast, smooth muscle cell, blood cell (including a hematopoietic cell, red blood cell, T-cell, B-cell, etc.), tumor cell, cardiac muscle cell, macrophage, dendritic cell, neuronal cell (e.g., a glial cell or astrocyte), or pathogen-infected cell (e.g., those infected by bacteria, viruses, virusoids, parasites, or prions).
- Typically, cells isolated from a specific tissue (such as epithelium, fibroblast or hematopoietic cells) are categorized as a “cell-type.” The cells can be obtained commercially or from a depository or obtained directly from an animal, such as by biopsy. Alternatively, the cell need not be isolated at all from the animal where, for example, it is desirable to deliver the vector to the animal in gene therapy.
- Cells can typically be characterized by markers expressed at their surface that are termed “surface markers”. These surface markers include surface proteins or target molecules, such as cellular receptors, adhesion molecules, transporter proteins, components of the extracellular matrix and the like. These markers, proteins and molecules also include specific carbohydrates and/or lipid moieties, for example, conjugated to proteins. In one embodiment, a binding moiety on a fusion molecule can bind to one or more surface proteins on the target cell. Surface proteins can be tissue- or cell-type specific (e.g. as in surface markers) or can be found on the surface of many cells. Typically, the surface marker, protein or molecule is a transmembrane protein with one or more domains which extend to the exterior of the cell (e.g. the extracellular domain). Where cell-type specific delivery is desired (as in in vivo delivery of a drug), the surface protein selected for the invention is preferably specific to the tissue. By “specific” to the tissue, it is meant that the protein be present on the targeted cell-type but not present (or present at a significantly lower concentration) on a substantial number of other cell-types. While it can be desirable, and even preferred, to select a surface protein which is unique to the target cell, it is not required for the claimed invention. It is to be appreciated, however, that specific delivery may not be required where the cell or cells are contacted with the viral vector in pure or substantially pure form, such as can be the case in an in vitro gene transfer. As such, the surface protein or targeted protein for the first binding moiety may be present on many different cell-types, specific or even unique to the targeted cell-type.
- As set forth above, the surface protein can be a cellular receptor or other protein, preferably a cellular receptor. Examples of cellular receptors include receptors for cytokines, growth factors, and include, in particular epidermal growth factor receptors, platelet derived growth factor receptors, interferon receptors, insulin receptors, proteins with seven transmembrane domains including chemokine receptors and frizzled related proteins (Wnt receptors), immunoglobulin-related proteins including MHC proteins, CD4, CD8, ICAM-1, etc., tumor necrosis factor-related proteins including the type I and type II TNF receptors, Fas, DR3, DR4, CAR1, etc., low density lipoprotein receptor, integrins, and, in some instances, the Fc receptor.
- Other examples of surface proteins which can be used in the present invention include cell-bound tumor antigens. Many of these surface proteins are commercially available and/or have been characterized in the art, including the amino acid and nucleic acid sequences, which can be obtained from, for example, GENBANK, as well as the specific binding characteristics and domains. Cytokine and chemokine receptors are reviewed for example, in Miyama, et al. Ann. Rev. Immunol., 10:295-331 (1992), Murphy, Ann. Rev. Immunol. 12:593-633 (1994) and Miller et al. Critical Reviews in Immunol. 12:17-46 (1992).
- The binding moiety can be selected or derived from native ligands or binding partners to the surface protein of the target cell. In the case of a cellular receptor, for example, for a cytokine or growth factor, the binding moiety can be a polypeptide comprising at least the receptor-binding portion of the native ligand. A “native ligand” or “native binding partner” is defined herein as the molecule naturally produced by, for example, the animal or species which binds to the surface protein in nature. Preferably, the binding moiety is a polypeptide or protein. As such, the native ligand of a cytokine receptor can be the native cytokine. In another embodiment, the binding moiety can comprise a binding fragment of an antibody, such as the variable region or a single chain antibody.
- Where a binding moiety comprises a binding fragment of an antibody, many antibodies to surface proteins are known or are commercially available, as are the amino acid sequences which are responsible for binding. Alternatively, novel antibodies can be prepared by methods known in the art, such as by Harlow and Lane, “Antibodies, A Laboratory Manual,” Cold Spring Harbor Laboratory (1988). The binding fragment can comprise an antibody fragment, for example, the constant region or, the variable region (e.g., Fc fragment or FAb′ fragment).
- A binding moiety can be a polypeptide ligand to a cellular receptor. Examples of preferred ligands are growth factors, epidermal growth factor, interleukins, GM-CSF, G-CSF, M-CSF, EPO, TNF, interferons, and chemokines. In one embodiment, the receptor is a transferon receptor.
- The binding moiety can have an amino acid sequence which is the same or substantially the same as an amino acid sequence of at least the receptor-binding portion of a native ligand for the cellular receptor. Similar to cellular receptors, many of the corresponding ligands have been identified, sequenced and characterized, including the portions thereof which bind to the receptor. The binding moiety can, therefore, include the same or substantially the same sequence of the entire native ligand. Alternatively, binding moiety comprises the receptor binding portion of the native ligand, eliminating, in some cases, the effector function of the ligand.
- In another embodiment, the binding moiety is selected or derived from native ligands or binding partners to a cellular surface molecule of a target cell. A “cellular surface molecule” as defined herein can be a peptide (including post-translationally modified proteins, such as amidated, demethylated, methylated, prenylated, palmitoylated, glycosylated, myristylated, acetylated or phosphorylated proteins), sugar, lipid, steroid, anion or cation, or a combination thereof which binds the first binding moiety. Preferably, the binding of the cellular surface molecule to the binding moiety of the bifunctional molecule will be of high affinity. Examples of high affinity have a dissociation constant of 10−5M (preferably 10−8M) or better.
- The cellular surface molecule need not be ∂specific” for the target cell. However, the cellular surface molecule is specific for a desired viral vector. For example, specific delivery of Influenza A viral vectors can employ sialic acid cellular surface molecules for entry into a target cell whereas targeting of VSV viral vectors can employ a phospholipid as the surface molecule. As such, the cellular surface molecule for the first binding moiety can be present on many different cell-types, specific or even unique to the target cell.
- In other embodiments, the effector function can be desirable, thereby stimulating or modulating the cellular activity of the target cell which can enhance therapy. An example of where such a therapy can be desirable is in the delivery of a negative selection marker or suicide protein to a tumor where the target cell is a lymphokine and the ligand is a cytokine. Where the lymphokine is stimulated, the cell, can also possess therapeutic value in the recruitment of an endogenous immune response against the tumor, thereby increasing the therapeutic benefit of the therapy.
- The phrase “substantially the same sequence” is intended to include sequences which bind the surface protein and possess a high percentage of (e.g., at least about 90%, preferably at least about 95%) sequence identity with the native sequence. The modifications to the sequence can be conserved or non-conserved, natural and unnatural, amino acids and are preferably outside of the binding domain. Amino acids of the native sequence for substitution, deletion, or conservation can be identified, for example, by a sequence alignment between proteins from related species or other related proteins.
- In addition to the first binding moiety, there can be a second binding moiety which is a chemical entity which binds to HDAg. The binding can be via a covalent bond, ionic bonding, hydrogen bonding or other mechanism. The second binding moiety can be the same or different from the first. For example, it can be a peptide, sugar, lipid, steroid, nucleic acid, small molecule, anion or cation, or combination thereof which binds the HDAg. In one embodiment, the second binding moiety of the fusion protein is also a polypeptide. One embodiment of the second binding moiety comprises an antigen-binding fragment of an antibody which recognizes and binds to an antigen.
- Ligand receptors which are cellular receptors can be transmembrane proteins comprising intracellular, transmembrane (characterized by highly hydrophobic regions in the sequence) and extracellular domains. In one embodiment, the second binding moiety can comprise the native extracellular domain of a receptor molecule.
- Fusion proteins can be made conveniently through known methods, e.g. recombinantly. The binding moieties can be directly bonded to HDAg or can be bonded to HDAg through a linking moiety. Where one or both of the moieties are polypeptides, a peptide bond or peptide linker may be preferred, thereby obtaining a “fusion protein”. The “fusion protein” of the HDAg and one or more moieties can be expressed by a single nucleic acid construct in series. One or more moieties and HDAg alternatively be linked directly or indirectly other than via a peptide bond or peptide linker, thereby obtaining a “conjugate”.
- Where the moieties and HDAg are directly bonded to each other, the bond can be covalent, as in a peptide bond, ionic bond or hydrogen bond. Where the bond is a peptide bond, a binding moiety can be bonded to the N terminus of HDAg via the C terminus, or vice versa or both. It is acknowledged that one fusion protein may possess greater activity than a second fusion protein due to conformational or steric considerations. The binding moieties can be, for example, monomers, dimers and tetramers.
- Where one or more of the binding moieties are not polypeptides, they can be joined via chemical reaction through functional groups present on each moiety which, under the appropriate conditions, will react with each other. For example, acid groups (or activated derivatives thereof) can be reacted with amines, alcohols or thiols to form amide or ester bonds, as is known in the art.
- Alternatively and advantageously, a linking moiety is employed to link the binding moieties, e.g. binding partners, to HDAg. The linker can preferably be a flexible linker and sufficient in length to separate the moieties in space, thereby not restricting the ability of the fusion molecule to bind independently and maintain the proper conformation. Again, where both moieties are polypeptides, the linker moiety will generally be a peptide, polypeptide, or a “pseudopeptide”. A “pseudopeptide” is a bifunctional linker which contains at least one non-amino acid and reacts to form a peptide bond, or other bond, with the terminal amine or carboxyl group of the moiety. For example, a peptide characterized by substitution of the terminal amine for a carboxyl group can function to react with the amine terminus of each moiety. Such as linker is considered to be a “pseudopeptide.” Similarly, a peptide characterized by substitution of the terminal carboxyl for an amine group can function to react with the carboxyl terminus of each moiety.
- Generally, however, the linker will be a peptide linker which will link the amine terminus of a moiety to the carboxyl terminus of HDAg or vice versa. One advantage to such a molecule is the ability to express the fusion protein in a recombinant host cell with a single nucleic acid construct.
- Peptide linkers can be obtained from immunoglobulin hinge regions, such as a proline-rich region. Also, linkers can be characterized by little steric hindrance, thereby permitting maximal independent movement of the two moieties, such as with a polyglycine linker. Alternatively, the linker selected to be reactive to or inert to cellular proteases can be desirable. In another embodiment, the linker can be selected to avoid or minimize an immune response against the fusion molecule. The length of the linker also is not particularly critical. Typically, the length of the linker can be between about 2 and about 20 amino acids. As can be seen, the selection of the particular linking group is not critical to the invention.
- In yet another embodiment, the linker can be a bifunctional compound which will react with other functional groups on the binding moieties or HDAg, such as in the reaction of acids and amines or alcohols (as present in peptides, carbohydrates and lipids, for example) in the formation of amides or esters.
- A preferred combination of the above first and second binding moieties includes one binding partner, e.g. a polypeptide ligand to a cell-type specific cellular receptor linked, via a peptide linker, through a terminus of the ligand to the terminus of HDAg. A second binding partner, e.g. a extracellular domain of a cellular receptor or a mutant thereof, can be linked to the same or different HDAg subunit in the same manner.
- For example, the C terminus of a binding polypeptide is linked to the N terminus of HDAg via the polypeptide linker or the N terminus of the first binding polypeptide is linked to the C terminus of the HDAg via the polypeptide linker.
- In another aspect of the invention, peptidomimetics (molecules which are not polypeptides, but which mimic aspects of their structures to bind to the same site) that are based upon the above-described polypeptides, can also be used. For example, polysaccharides can be prepared that have the same functional groups as the polypeptides of the invention, and which interact with binding partners in a similar manner. Peptidomimetics can be designed, for example, by establishing the three=dimensional structure of the polypeptide in the environment in which it is bound or will bind to the binding partner. The peptidomimetic can comprise at least two components, a binding entity or entities and a backbone or supporting structure entity.
- The binding entities of the peptidomimetic are the chemical atoms or groups which will react or complex (as in the formation of a hydrogen or covalent bond) with a binding partner. In general, the binding entities in a peptidomimetic are the same as the polypeptide moieties. Alternatively, the binding entities can be an atom or chemical group which will react with the binding partner in the same or similar manner as the polypeptide. Examples of binding entities suitable for use in designing a peptidomimetic for a basic amino acid in a polypeptide are nitrogen containing groups, such as amines, ammoniums, guanidines and amides or phosphoniums. Examples of binding entities suitable for use in designing a peptidomimetic for an acidic amino acid in a polypeptide can be, for example, carboxyl, lower alkyl carboxylic acid ester, sulfonic acid, a lower alkyl sulfonic acid ester or a phosphorous acid or ester thereof.
- The supporting structure is the chemical entity that, when bound to the binding moiety or moieties, provides the three dimensional configuration of the peptidomimetic. The supporting structure can be organic or inorganic. Examples of organic supporting structures include polysaccharides and polymers (such as, polyvinyl alcohol or polylactide). It is preferred that the supporting structure possess substantially the same size and dimensions as the polypeptide backbone or supporting structure. This can be determined by calculating or measuring the size of the atoms and bonds of the polypeptide and peptidomimetic. For example, the nitrogen of the peptide bond can be substituted with oxygen or sulfur, thereby forming a polyester backbone. Likewise, the carbonyl of the peptide bond can be substituted with a sulfonyl group or sulfonyl group, thereby forming a polyamide. Reverse amides of the peptide can be made (e.g., substituting one or more —CONH— groups for a —NHCO— group). In addition, the peptide backbone can be substituted with a polysilane backbone.
- These peptidomimetic compounds can be manufactured by art-known and art-recognized methods. For example, a polyester corresponding to a given peptide can be prepared by the substituting a hydroxyl group for each corresponding amine group on the amino acids, thereby preparing a hydroxyacid and sequentially esterifying the hydroxyacids, optionally blocking the basic side chains and acids to minimize side reactions. Determining an appropriate chemical synthesis route can generally be readily identified upon determining the chemical structure using no more than routine skill.
- The fusion molecules can be manufactured according to methods generally known in the art. For example, where one or both of the binding moieties is a nonpeptide, the fusion molecule can be manufactured employing known organic synthesis methods useful for reacting a functional or reactive group on the moiety with a functional or reactive group on the other moiety or, preferably, a linker. In carrying out the synthesis, derivation or inactivation of the functional group(s) required for binding to the moiety's binding partner should be avoided. Appropriate syntheses are highly dependent upon the chemical nature of the binding moiety and, generally, can be selected from an organic chemistry text, such as March et al. Advanced Organic Chemistry, 3rd Edition (1985) John E. Wiley & Sons, Inc., New York, N.Y., or other known methods.
- Where the binding moieties are polypeptides, the fusion molecule can be a conjugate or a fusion protein and manufactured according to known methods. Where a fusion protein is desired, the molecule can be manufactured according to known methods of recombinant DNA technology. For example, the fusion protein can be expressed by a nucleic acid molecule comprising sequences which code for both moieties, such as by a fusion gene (nucleic acid molecule). Thus, the invention further relates to nucleic acid molecules, including fusion genes, which encode HDAg fragments, mutants and derivatives.
- Nucleic Acid Molecules
- Recombinant or isolated nucleic acid molecules of the invention, in one embodiment, encode an HDAg protein (including the e.g., native proteins, fragments, derivatives, mutants and allelic variants) as defined herein. A nucleic acid molecule of the present invention can be double-stranded or single-stranded and can be a DNA molecule, such as cDNA or genomic DNA, or an RNA molecule. The nucleic acid molecule can be placed in a construct, which can be inserted into a vector. As such, the nucleic acid molecule can include one or more exons, with or without, as appropriate, introns. In one embodiment, the nucleic acid molecule contains a single open reading frame which encodes HDAg and one or more binding moieties and, optionally, a signal sequence and/or a polypeptide linker, when present. By way of example in a multi-exon construct, the nucleic acid molecule contains a first exon which begins with an ATG, encodes a binding moiety, and optionally the polypeptide linker, and ends with a splice donor site. The construct would also contain an HDAg-coding nucleic acid sequence and would further would contain an intron followed by a second exon which begins with a splice acceptor site and, optionally, a polypeptide linker, coding sequences for a second binding moiety and ending with a stop codon. Alternative combinations of these elements would be apparent to the person of skill in the art.
- As such, the nucleic acid molecule can include sequences which encode HDAg, and one or more moieties, as well as one or more of the following optional sequences, in a functional relationship: regulatory sequences (as will be discussed in more detail below) a start codon, a signal or leader sequence, splice donor sites, splice acceptor sites, introns, a stop codon, transcription termination sequences, 5′ and 3′ untranslated regions, polyadenylation sequences, negative and/or positive selective markers, and replication sequences.
- The coding regions of the nucleic acid molecule code for HDAg and the binding moeity or moieties and any polypeptide linkers present. Where the binding moiety is a native ligand or cellular surface protein (e.g. a cellular receptor), or a binding fragment thereof, the nucleic acid molecule coding regions can correspond to the native sequences which encode a binding moiety. Because many amino acids are encoded by a plurality of codons, the coding sequence can be mutated to result in the same amino acid sequence. This may be advantageous where a codon is preferred by the selected host cell. In one embodiment, the HDAg gene can be altered such that the codons conform to the known codon use preferences for E. coli. See
FIG. 9 andFIGS. 15-17 . The gene can be inserted into a convenient expression vector which allows production of several forms of the capsid protein including residues 1-84 (terminated in the middle domain), the short isoform and the long isoform. Dingle et al., J. Virol, (1998). All three forms express well. Preferably, the nucleic acid molecule comprises the or corresponding coding nucleotide sequence ofFIG. 9, 10 , 15-16, or substantially the same sequences thereof, or the complement thereof. In another embodiment, the nucleic acid molecule does not possess the nucleotide sequence of GenBank Accession #M28267. The nucleic acid molecule can be, for example, isolated and/or purified or recombinant. - In a preferred embodiment, the nucleic acid molecule comprises the nucleotide sequence depicted in
FIG. 9 , nucleotides 37-150 ofFIG. 9 , nucleotides 37-186 ofFIG. 9 ,FIG. 10 , nucleotides 1421-1566 ofFIG. 10 or nucleotides 1457-1566 ofFIG. 10 ,FIG. 15 ,FIG. 16 ; or a fragment or mutation thereof, which encodes a coiled-coil oligomer. In another preferred embodiment, the nucleic acid molecule comprises a nucleotide sequence encoding a polypeptide comprising an amino acid sequence depicted in a row ofFIG. 1 , amino acids 12-48 of a row ofFIG. 1 ,FIG. 3C ,FIG. 9 , amino acids 12-48 of a row ofFIG. 9 ,FIG. 10 , amino acids 12-88 ofFIG. 10 ,FIG. 9 , or δ12-60(Y). Also encompassed in the invention are complementary strands of these sequences, DNA sequences that hybridize to these sequences and RNA sequences transcribed from these sequences. Also included are fragments or mutations thereof, which encode a coiled-coil oligomer. - In one embodiment, the nucleic acid molecule encodes a polypeptide of a fusion molecule described herein.
- Also included are fusion nucleic acid molecules (e.g. fusion genes) comprising an HDAg nucleic acid molecule operably linked to a heterologous nucleic acid molecule (“heterologous gene”), which encodes a peptide which is not HDAg and not derived therefrom (i.e., “heterologous protein”). Where the binding moiety is a mutation or variant of a native sequence, as provided above, generally, the nucleic acid sequence can be mutated correspondingly. It may also be preferred for ease of manufacture of the nucleic acid sequence to maintain as much of the native sequence as possible. In one embodiment, the nucleic acid molecule shares at least about 50% sequence identity with the corresponding native sequence such as the coding region, for example, the coiled-coil region, e.g., amino acids 12-48 or amino acids 12-60. In one embodiment, the sequence identity is at least about 65%, more preferably, 75%. In a more preferred embodiment, the percent sequence identity is at least about 90%, and still more preferably, at least about 95%.
- Recombinant nucleic acid molecules meeting these criteria comprise nucleic acids having sequences identical to sequences of naturally occurring genes, including polymorphic or allelic variants, and portions (fragments) thereof, or variants of the naturally occurring genes. Such variants include mutants differing by the addition, deletion or substitution of one or more residues, modified nucleic acids in which one or more residues are modified (e.g., DNA or RNA analogs), and mutants comprising one or more modified residues.
- Many nucleic acid molecules coding for suitable binding moieties are known in the art and can be obtained from, for example, GENBANK. Alternatively, other sequences can be employed, such as homologs of known genes.
- Such homologous nucleic acids, including DNA or RNA, can be detected and/or isolated by hybridization (e.g., under high stringency conditions or moderate stringency conditions). “Stringency conditions” for hybridization is a term of art which refers to the conditions of temperature and buffer concentration which permit hybridization of a particular nucleic acid to a second nucleic acid in which the first nucleic acid may be perfectly complementary to the second, or the first and second may share some degree of complementarity which is less than perfect. For example, certain high stringency conditions can be used which distinguish perfectly complementary nucleic acids from those of less complementarity. “High stringency conditions” and “moderate stringency conditions” for nucleic acid hybridizations are explained on pages 2.10.1-2.10.16 (see particularly 2.10.8-11) and pages 6.3.1-6 in Current Protocols in Molecular Biology (Ausubel, F. M. et al., eds., Vol. 1, containing supplements up through Supplement 29, 1995), the teachings of which are hereby incorporated by reference. The exact conditions which determine the stringency of hybridization depend not only on ionic strength, temperature and the concentration of destabilizing agents such as formamide, but also on factors such as the length of the nucleic acid sequence, base composition, percent mismatch between hybridizing sequences and the frequency of occurrence of subsets of that sequence within other non-identical sequences. Thus, high or moderate stringency conditions can be determined empirically.
- By varying hybridization conditions from a level of stringency at which no hybridization occurs to a level at which hybridization is first observed, conditions which will allow a given sequence to hybridize (e.g. selectively) with the most similar sequences in the sample can be determined.
- Exemplary conditions are-described in Krause, M. H. and S. A. Aaronson, Methods in Enzymology, 200:546-556 (1991). Also, see especially page 2.10.11 in Current Protocols in Molecular Biology (supra), which describes how to determine washing conditions for moderate or low stringency conditions. Washing is the step in which conditions are usually set so as to determine a minimum level of complementarity of the hybrids. Generally, starting from the lowest temperature at which only homologous hybridization occurs, each ° C. by which the final wash temperature is reduced (holding SSC concentration constant) allows an increase by 1% in the maximum extent of mismatching among the sequences that hybridize. Generally, doubling the concentration of SSC results in an increase in Tm of ˜17° C. Using these guidelines, the washing temperature can be determined empirically for high, moderate or low stringency, depending on the level of mismatch sought. The following table provides an example of each condition of stringency.
% Allowed ° C. Stringency mismatch Temperature % Formamide High 6.6 52 50 Medium 13 45 50 Low 27 45 22 - “Selective isolation”, or “selective hybridization”, is defined herein as embracing the isolation of a sufficiently few number of molecules (preferably one) as to readily permit the identification of the nucleic acid of interest.
- The nucleic acid molecule also preferably comprises regulatory sequences. Regulatory sequences include cis-acting elements that control transcription and regulation such as, promoter sequences, enhancers, ribosomal binding sites, and transcription binding sites. Selection of the promoter will generally depend upon the desired route for expressing the protein. For example, where the molecule will be introduced (e.g. transformed) into a cell by a viral vector, e.g. a plasmid, preferred promoter sequences include viral, such as retroviral or adenoviral, promoters. Examples of suitable promoters include the cytomegalovirus immediate-early promoter, the retroviral LTR, SV40, and TK promoter. Where the molecule is to be expressed in a recombinant eukaryotic or prokaryotic cell, the selected promoter is recognized by the host cell. In one embodiment the construct is a cassette expression system. A suitable promoter which can be used can include the native promoter for the binding moiety which appears first in the construct.
- The elements which comprise the nucleic acid molecule can be isolated from nature, modified from native sequences or manufactured de novo, as described, for example, in the above-referenced texts. The elements can then be isolated and fused together by methods known in the art, such as exploiting and manufacturing compatible cloning or restriction sites.
- Vectors and Host Cells
- The nucleic acid molecules can be inserted into a construct, e.g. a vector, such as a plasmid or cassette expression system, which can, optionally, replicate and/or integrate into a recombinant host cell, by known methods.
- The vectors of the present invention comprise a nucleic acid molecule which encodes HDAg (e.g. an HDAg monomer). The monomer can be a subunit of an HDAg coiled-coil oligomer, e.g. an octamer. The oligomer can comprise an HDAg polypeptide as described herein. The nucleic acid molecule thus includes any of the nucleic acid molecules described herein, for example, a native (wild type) nucleic acid, or a fragment, mutant or derivative. Especially preferred are nucleic acids encoding full-length HDAg (e.g. HDAg-S or HDAg-L) or a fragment or derivative thereof, (e.g. a functional fragment) capable of forming a coiled-coil octamer (e.g. an N-terminal coiled-coil octamer). Preferred vectors comprise a nucleic acid molecule comprising nucleotide sequence depicted in
FIG. 9 , nucleotides 37-150 ofFIG. 9 , nucleotides 37-186 ofFIG. 9 ,FIG. 10 , nucleotides 1421-1566 ofFIG. 10 or nucleotides 1457-1566,FIG. 10 ,FIG. 15 andFIG. 16 . Preferred vectors also comprise a nucleic acid comprising a nucleotide sequence encoding a polypeptide comprising an amino acid sequence depicted in a row ofFIG. 1 , amino acids 12-48 of a row ofFIG. 1 , the top row ofFIG. 3 c,FIG. 9 , amino acids 12-48 of a row ofFIG. 9 ,FIG. 10 , amino acids 12-88 ofFIG. 10 ,FIG. 11 ,FIG. 17 or 8 12-60(Y). Other preferred vectors comprise nucleic acids comprising sequences which are the complementary strands of the above, DNA sequences which hybridize to these sequences, RNA sequences transcribed from these sequences, and fragments and mutations thereof, which encode a coiled-coil oligomer, e.g. an octamer. Vectors can also comprise fusion molecules comprising HDAg and at least one binding moiety, as described herein. - In a preferred embodiment, the vector additionally comprises at least one multiple cloning site. A multiple cloning site comprises a cleavage sites for commonly used restriction sites to facilitate incorporation of foreign (non-HDAg) gene, e.g. a cassette. A multiple cloning site can be located 3′ or 5′ to the nucleic acid molecule encoding HDAg. There can be multiple cloning sites, for example, there can be a multiple,
cloning site 3′ of the HDAg nucleic acid molecule and anothermultiple cloning site 5′ to the HDAg nucleic acid molecule. The multiple cloning site can be located in a flanking region. - The vector can further comprise nucleic acid encoding a nuclear localization signal, e.g. an HDAg nuclear localization signal, for example, amino acids 68-88 of HDAg, as shown in
FIG. 9 . The vector can also comprise an HDAg nucleic acid molecule comprising a sequence encoding a coiled coil and a nuclear localization signal. - The vectors of the invention can be used for the expression of a fusion molecule as described herein. A heterologous (non-HDAg) nucleic acid molecule, e.g., a gene, encoding a binding moiety of a fusion molecule can be inserted into a vector comprising HDAg, e.g., into a multiple cloning site. A vector can comprise one heterologous gene or more than one heterologous gene. The genes can be the same or different. A first heterologous gene can encode a first binding moiety and a second heterologous gene can encode a second binding moiety. The first and the second binding moieties can be binding partners as described herein (for example, single chain antibody and antigen, ligand and receptor, components of a linked pathway, etc). The heterologous gene or genes and the nucleic acid encoding HDAg can be operably linked, e.g., in the same open reading frame. Where HDAg nucleic acid and a heterologous gene encoding a binding moiety are operably linked, they are expressed as a single protein unit, i.e., a fusion molecule.
- In one embodiment, a vector comprises an HDAg nucleic acid molecule (e.g. a nucleic acid cassette) encoding a monomer capable of being a unit of a coiled-coil octamerization scaffold and a heterologous gene encoding a binding moiety, wherein the expressed binding moiety is bound to one terminal of the monomer, e.g. the N terminus or the C terminus. Where there are two expressed heterologous genes, each end of the monomer can be bound to an expressed binding moiety.
- Vectors can additionally comprise a nucleic acid molecule encoding a nuclear localization signal, which can transport protein expressed by the vector to the nucleus of a cell.
- The vectors described herein can express nucleic acid, e.g. a fusion gene, in a host cell, e.g. a procaryotic or eukaryotic cell. In one embodiment, the vector can be expressed in a bacteria cell, for example, Escherischia, e.g. E. coli. The nucleic acid in the vector can also be expressed in Bacillus. It can also be expressed in baculoviruses, pichia expressions systems, and animal tissue or cells, for example insect, mammal, e.g., a human, or yeast (such as Saccharomyces). Examples of specific cells include somatic or embryonic cells, HeLa cells, human 293 cells, monkey COS-7 cells, etc.
- The vector can comprise a number of other components. For example, the vector can comprise a marker, for example a positive or negative selection marker, e.g. ampicillin or kanamycin. The vector can comprise two markers, wherein the first marker is capable of detecting propagation of a vector (e.g. a plasmid) in a bacterial cell and a second marker is capable of detecting propagation of the vector in a eukaryotic cell. The vector can comprise an origin of replication for bacteria and an origin of replication that is capable of mediating production of a single-stranded DNA by a bacteriophage, such as f1 phage or M13 phage.
- The vector can comprise a promoter. In one embodiment, the promoter is a viral promoter, such as a retroviral or adenoviral promoter. Examples of suitable promoters include T7, lac, trc, tac, CMV, SV40, the cytomegalovirus immediate-early promoter, the retroviral LTR and the TK promoter. In a preferred embodiment, the promoter can be selected for high-level expression. A promoter can be selected for optimal expression in bacteria, (e.g. T7, lac, trc, tac etc.) or in a eukaryotic cell (e.g. CMV or SV40).
- The vector can also comprise enhancers, ribosomal binding sites and transcription binding sites. In one embodiment is a vector depicted in
FIG. 13A , B or C orFIG. 14 . In one embodiment, a vector comprises HDAg nucleic acid (with or without a nuclear local signal), a heterologous gene, a marker, an origin of replication for a host cell, an origin of replication capable of mediating production of single-stranded DNA by a bacteriophage, a promoter, and a ribosome binding site. In one embodiment, the origin of replication is selected for maintaining plasmid expression is E. coli. - Especially preferred is a vector for overexpression of hepatitis delta antigen in E. coli, for example, a vector produced by the method as herein described in Example 2, below, e.g. a vector comprising a nucleic acid molecule sequence optimized for expression of HDAg in E. coli. The gene can be inserted into a vector which allows production of several forms of the capsid protein including residues 1-84 (terminated in the middle domain), the short isoform and the long isoform. In a preferred embodiment, the vector is pR5δV5. Another preferred embodiment is a cassette expression system which allows any expressed sequence or sequences (e.g. a binding moiety) to be appended to the N-terminus of C-terminus of the HDAg octamerization scaffold. In a preferred embodiment, the HDAg gene is mutated such that in the expressed peptide, a serine residue of HDAg is replaced with (substituted by) a cysteine to allow for convenient chemical cross-linking of the octamerization domain, e.g., to an inert support matrix (e.g. polyethylene glycol), to a synthetic peptide, to an oligosaccharide, to a small organic molecule, or to lipids.
FIG. 12 (A and B) depicts a representation of a construct containing eight appended protein domains on an HDAg octameric framework. - The vector can be viral. Viral vectors include baculovirus, retrovirus, adenovirus, parvovirus (e.g., adeno-associated viruses), coronavirus, negative strand RNA viruses such as orthomyxovirus (e.g., influenza virus), rhabdovirus (e.g., rabies and vesicular stomatitis virus), paramyxovirus (e.g. measles and Sendai), positive strand RNA viruses such as picomavirus and alphavirus, and double stranded DNA viruses including adenovirus, herpesvirus (e.g., Herpes
Simplex virus types - A nucleic acid molecule described herein can be introduced (incorporated or inserted) into the host cell, by known methods. Such cells, comprising such nucleic acid molecules, are encompassed in the invention. The host cell can be a eukaryotic or prokaryotic cell and includes, for example, baculoviruses, Pichia expression systems, yeast (such as Saccharomyces), bacteria (such as, Escherichia or Bacillus), animal cells or tissue, including insect or mammalian cells (such as somatic or embryonic human cells, Chinese hamster ovary cells, HeLa cells, human 293 cells and monkey COS-7 cells, etc.). Examples of suitable methods of transfecting or transforming cells include calcium phosphate precipitation, electroporation, microinjection, infection, lipofection and direct uptake. Methods for preparing such recombinant host cells are described in more detail in Sambrook et al., “Molecular Cloning: A Laboratory Manual,” Second Edition (1989) and Ausubel, et al. “Current Protocols in Molecular Biology,” (1992), for example.
- The host cell is then maintained under suitable conditions for expression and recovering the molecule, e.g. a fusion molecule. Generally, the cells are maintained in a suitable buffer and/or growth medium or nutrient source for growth of the cells and expression of the gene product(s). The growth media are not critical to the invention, are generally known in the art and include sources of carbon, nitrogen and sulfur. Examples include Dulbeccos modified eagles media (DMEM), RPMI-1640, M199 and Grace's insect media. Again, the selection of a buffer is not critical to the invention. The pH which can be selected is generally one tolerated by or optimal for growth for the host cell.
- The cell is maintained under a suitable temperature and atmosphere. Anaerobic host cells are generally maintained under anaerobic conditions. Alternatively, the host cell is aerobic and the host cell is maintained under atmospheric conditions or other suitable conditions for growth. The temperature should also be selected so that the host cell tolerates the process and can be for example, between about 300 and 40° C. for mammilian cells and between 20 and 40° C. for bacteria, yeast and insect cells.
- The recombinant molecules, including fusion molecules, produced by the processes described herein can be isolated and purified by known means. Examples of suitable purification and isolation processes are generally known and include ammonium sulfate precipitation, dialysis, gel filtration, immunoaffinity, chromatography, electrophoresis, ultrafiltration, microfiltration or diafiltration.
- In addition the fusion molecule can incorporate commonly used sequence tags e.g. his, tag or fla to facilitate purification via ligand affinity chromatograph. The fusion molecule is preferably purified substantially prior to use, particularly where the protein will be employed as an in vivo therapeutic, although the degree of purity is not necessarily critical where the molecule is to be used in vitro. In one embodiment, the bifunctional molecule can be isolated to about 50% purity (by weight), more preferably to about 80% by weight or about 95% by-weight. It is most preferred to employ a molecule which is essentially pure (e.g., about 99% by weight or to homogeneity).
- Fusion molecules which are prepared according to the above method can be used directly in the disclosed methods or can be screened for an activity prior to use. To screen the fusion molecule for activity, for example, in vitro, the fusion molecule (or mixtures of fusion molecules) can be contacted with, for example, the binding partner of a binding moiety of the fusion molecule under conditions suitable for binding and then assayed for binding. For example, a fusion molecule comprising a ligand can be screened for the ability to bind the ligand's receptor, or the binding protein of the ligand's receptor, in vitro, by contacting the receptor (or portion thereof) and the fusion molecule under conditions suitable for binding and detecting binding.
- Methods
- The HDAg molecules of the invention are useful in a variety of methods. The N-terminal octamer may serve as a convenient high valency framework for linking, presenting or delivering a variety of binding moieties, e.g. as described above. For example, the molecules are useful in the delivery of one or more therapeutic agents, such as drugs, proteins or polynucleotides (e.g., genes) or products thereof to a patient. The polynucleotide or the product thereof can be a therapeutic agent. In one embodiment, therapeutic polynucleotide includes RNA (e.g., ribozymes) and antisense DNA that prevents or interferes with the expression of an undesired protein in the target cell. The polynucleotide can also encode a heterologous therapeutic protein. A heterologous protein or polynucleotide is one which is not HDAg. Examples of therapeutic proteins include antigens or immunogens such as a polyvalent vaccine, cytokines, tumor necrosis factor, interferons, interleukins, adenosine deaminase, insulin, T-cell receptors, soluble CD4, epidermal growth factor, human growth factor, blood factors, such as Factor VIII, Factor IX, cytochrome b, glucocerebrosidase, ApoE, ApoC, ApoAI, the LDL receptor, negative selection markers or “suicide proteins”, such as thymidine kinase (including the HSV, CMV, VZV TK), anti-angiogenic factors, Fc receptors, plasminogen activators, such as t-PA, u-PA and streptokinase, dopamine, MHC, tumor suppressor genes such as p53 and Rb, monoclonal antibodies, antigen binding fragments or constant regions thereof, drug resistance genes, ion channels, such as a calcium channel or a potassium channel, and adrenergic receptors, etc.
- Also encompassed by the present invention are the use of HDAg fusion molecules for high through put screening assays, such as for detecting ligand and cell specific receptor binding pairs. The ligand and/or receptor can be peptides (including post-translationally modified proteins) and/or small molecules (including sugars, steroids, lipids, anions or cations). The ligands and ligand-cell specific receptors can be known or unknown. Where the ligand is known and the receptor is unknown, ligand-cell specific receptors can be identified, for example, by screening for host cells transfected with nucleotides encoding potential receptors. For example, the ligands can be secreted (such as chemokines) or non-secreted (such as the extracellular domains of chemokines receptors) proteins.
- A library of host cells displaying putative ligand-cell surface receptors can be obtained by transfecting suitable host cells with nucleic acid constructs, including but not limited to cDNA or genomic libraries, under appropriate regulatory control to result in the expression of cell-surface receptors on the host cell. The fusion molecule with ligand is added to the population of host cells under conditions suitable for introduction. Introduction can be detected, for example, with a label. A similar approach can be used to select unknown ligands in the case where the ligand-cell specific receptors are known and the ligand is unknown. In this embodiment, a library of fusion molecules with putative ligands (e.g., chemokines) can be obtained and contacted with one or more host cells displaying cell surface receptors.
- A similar approach can be used to identify unknown ligands, test substances, drugs wherein the cell surface receptor is known, where the fusion molecule comprises a binding moiety which is a receptor which binds a surface molecule. The host cell expresses a distinct ligand or a collection of recombinant ligands. Ligand-receptor binding can be detected following introduction of the molecule to the host cell.
- The invention is also particularly useful for vaccine delivery. In this embodiment, an antigen or immunogen can be expressed heterologously (e.g., by recombinant insertion of a nucleic acid sequence which encodes the antigen or immunogen (including antigenic or immunogenic fragments) into a vector comprising HDAg). Alternatively, the antigen or immunogen and HDAg can be expressed in a live attenuated, pseudotyped virus vaccine, for example. Generally, the methods can be used to generate humoral and cellular immune responses, e.g. via expression of heterologous pathogen-derived proteins or fragments thereof in specific target cells.
- The dosage administered (e.g., the effective amount) will, of course, vary depending upon known factors such as the pharmacodynamic characteristics of the particular agent, e.g., the therapeutic binding entity, and its mode and route of administration; age, health, and weight of the recipient; nature and extent of symptoms, kind of concurrent treatment, frequency of treatment, and the effect desired.
- It can be administered one to several times per day, depending on the mode of administration. Effective doses can be determined by those of skill in the art. An effective dose of an agent is an amount sufficient to relieve the individual of the symptoms of the disorder which the agent is intended to treat.
- Methods of introduction of the agent at the site of treatment include, but are not limited to, intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous, oral, intranasal, gene therapy, cellular implantation or particle bombardment. Other suitable methods include or employ biodegradable devices and slow release polymeric devices.
- Because proteins are subject to being digested when administered orally, parenteral administration, e.g., intravenous, subcutaneous, or intramuscular, would ordinarily be used to optimize absorption.
- For parenteral administration, particularly suitable are injectable, sterile solutions, preferably oily or aqueous solutions, as well as suspensions, emulsions, or implants, including suppositories. The molecule comprising the agent can be administered in a solution, suspension, emulsion or lyophilized powder in association with a pharmaceutically acceptable parenteral vehicle. Examples of such vehicles are water, saline, Ringer's solution, dextrose solution, and 5% human serum albumin. Liposomes and nonaqueous vehicles such as fixed oils can also be used. The vehicle or lyophilized powder can contain additives that maintain isotonicity (e.g., sodium chloride, mannitol) and chemical stability (e.g., buffers and preservatives). The formulation is sterilized by commonly used techniques. Suitable pharmaceutical carriers are described in the most recent edition of Remington's Pharmaceutical Sciences, A. Osol, a standard reference text in this field of art. Ampules are convenient unit dosages. Formulations for transdermal or transmucosal administration generally include penetrants such as fusidic acid or bile salts in combination with detergents or surface-active agents. The formulation can then be manufactured as aerosols, suppositories, or patches.
- Oral agents may be administered if formulated as to be protected from digestive enzymes. If administered orally, the SCR-P will be administered in a therapeutic composition which may also include an appropriate carrier (e.g., a physiologically compatible carrier), a flavoring agent and a sweetener.
- Suitable pharmaceutical carriers include, but are not limited to water, salt solutions, alcohols, polyethylene glycols, gelatin, carbohydrates such as lactose, amylose or starch, magnesium stearate, talc, silicic acid, viscous parafin, fatty acid esters, hydroxymethylcellulose, polyvinyl pyrolidone, etc. The pharmaceutical preparations can be sterilized and, if desired, mixed with auxiliary agents, e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, coloring, and/or aromatic substances and the like which do not deleteriously react with the active compounds. They can also be combined where desired with other active agents, e.g., enzyme inhibitors, to reduce metabolic degradation.
- Using procedures similar to those described above, HDAg molecules (e.g. fusion molecules) and vectors (e.g. cassette expression systems) comprising nucleic acid molecules, such as the vectors described herein, can be used for a variety of purposes. For example, the vector comprising all or part of a nucleic acid sequence of
FIG. 9 orFIG. 15 (synthetic) can be used to overexpress thee hepatitis delta antigen in bacteria. - In a preferred embodiment, multiple copies of a binding moiety (e.g. a peptide or domain) can be expressed using the vectors described herein, e.g., by inserting a nucleic acid cassette encoding the moiety into the vector, transforming a host cell with the vector and culturing the cell under conditions sufficient for expression of the moiety. Up to sixteen copies of the moiety (eight at the C terminus of the HDAg monomer and eight at the N terminus) can be made in this way. In a preferred embodiment, the vector is one depicted in a Figure selected from the group consisting of
FIG. 13A, 13B , 13C and 14. - In one embodiment, a vector or molecule described herein can be used for a high valency expression of a binding moiety, for example, a peptide or protein domain, e.g., an antigen. In another embodiment, the vector can be used to express a high valency display of at least two different peptides or protein domains, to enhance interaction between ligands. The interaction between ligands can occur in solution, on membranes or on surfaces.
- In another embodiment, interaction (e.g., fusion) between two cells (or two cell types) can be mediated or enhanced by, for example, associating an HDAg octamer expressing multiple copies of a domain which interacts with a ligand on the surface of cell type one and multiple copies of a domain which interacts with a ligand on the surface of cell type two. This method could work for embodiments involving more than two cells or cell types as well. A fusion molecule, e.g., an octamer construct of the present invention, which can be coupled to a surface, for example, by chemical cross-linking or by inclusion of at least one copy of a second domain which interacts with a ligand displayed on a surface, can be used to display multiple copies of a domain on a surface. In another embodiment, the octamer can be used to express different enzymes from a linked pathway on a single framework, e.g., for facilitating rapid exchange of substrates and products between enzymes from a single pathway. The enzymes implicated Krebs in the can be cycle.
- In another embodiment, the octamer can also be used to link a first binding moiety, e.g. a peptide or domain mediating (specifying) an interaction (an “interaction” domain) and a second binding moiety, which mediates an effect (an “effector” domain). In another embodiment, the effector domain is a chemical, e.g. a drug, linked to the octamer via a free —SH group on the octamer. In a preferred embodiment, the interaction domain mediates interaction with a specific receptor on a cell surface, and the effector domain generates a specific function, such as cell killing.
- In another embodiment, the octamer can contain one or more copies of an domain for interaction with a ligand, and one or more copies of a domain which is a label (e.g. alkaline phosphatase, radiolabel, streptodevice, and green fluorescent protein) for amplifying a signal in a solid phase assay, e.g. an ELISA assay.
- In another embodiment, an oligomer construct can be used to couple an oligonucleotide which interacts (e.g. hybridizes) with a nucleic acid molecule in the cell (e.g. a specific complementary DNA or RNA sequence) and one or more copies of an effector to target specific DNA or RNA sequences for cleavage. In a preferred embodiment, the effector is a double-stranded nuclease. In a preferred embodiment, the octamer is a mutant with a free sulfhydryl (—SH) group. An octamer construct can be used to couple multiple copies of a ligand in such a way that the interaction of the octamer with a cell triggers signaling or internalization by pathways which depend on the multimerization of a receptor or ligand on the cell surface. The vectors can be used to promote interaction between intracellular components in a signal transduction pathway, for example components which are upstream or downstream from each other.
- In another embodiment, the system can be used to mediate efficient (drive) gene expression by coupling (e.g. covalently) an enhancer recognition protein and at least one promoter binding protein.
- In another embodiment, the system can be used as a high valency trap to identify the vector can be used as a diagnostic. In one embodiment, the binding moiety is Sp120 and the molecule is used to test for the presence of Human Immunodeficiency Virus. In another embodiment, at least one binding moiety is a drug which has a therapeutic effect when administered to an animal. In another embodiment, a molecule of the present invention is used to screen test substances for an effect. In another embodiment, an agent can be administered to a patient, wherein the agent will inhibit formation of the coiled coil HDAg oligomer. Peptides (e.g. proteins) and nucleic acids, as discussed above, the inventions described herein are based upon the discovery that the HDAg protein oligomerizes to a coiled-coil olctamer. The HDAg protein is derived from the Hepatitis D virus which bind to peptides or protein domains.
- In another embodiment, at least one binding moiety is a drug which has a therapeutic effect when administered to an animal.
- In another embodiment, a molecule of the present invention is used to screen test substances for an effect.
- In another embodiment, an agent can be administered to a patient, wherein the agent will inhibit formation of the coiled coil HDAg oligomer.
- As discussed above, the inventions described herein are based on the discovery that the HDAg protein oligomerizes to a coiled-coil octamer. The HDAg protein is derived from the Hepatitis D virus.
- Whereas Hepatitis B virus infection alone generally causes mild, sometimes chronic, hepatitis, coinfection of hepatitis D virus (HDV) with hepatitis B virus (HBV) causes severe, and often fatal, liver disease in humans, and is the most common cause of fulminant viral hepatitis, Hoofnagle, J. H., J. Am. Med. Assoc., 261:1321-1325 (1989). The virus is an obligatory subviral satellite of HBV, requiring the hepatitis B surface antigen (HBsAg) for assembly and cell-to-cell transmission. Rizzetto, M. et al., Proc. Natl. Acad. Sci. USA, 77:6124-6128 (1980). However, the viral genome can replicate in the absence of HBV. Kuo, M. Y. P et al., J. Virol. 63:1945-1950 (1989). Hepatitis delta encodes all of the information required to direct replication of its RNA genome by the host RNA Pol II. Efficient transmission of hepatitis delta virus requires that the viral RNA and the capsid protein be encapsidated within the hepatitis B virus surface antigen. The viral genome is a 1.7 kilobase single-stranded circular RNA, which is approximately 70% complementary to itself, Wang, K. S. et al., Nature, 323:508-513 (1986), and forms a rod-like structure, Kos., A. et al., Nature, 323:558-560 (1986). The virus is believed to replicate by a double rolling-circle mechanism in infected cells, Taylor, J., Cell, 61:371-373 (1990). Both the genomic and antigenomic strands of the virus contain ribozymes, Wu, H. et al., Proc. Natl. Acad. Sci. USA 86:1831-1835 (1989), Wu, H. N. et al, Science 243:652-654 (1989), Sharmeen, L. et al., J. Virol., 62:2674-2679 (1988), Kuo, M. et al., J. Virol. 62:4439-4444, which are responsible for reducing multimeric viral genomes into unit length and for directing the religation of the linear genomes, Sharmeen, L. et al., J. Virol. 63:1428-1430 (1989). The antigenomic strand of the genome encodes the only viral protein known to be associated with HDV, the hepatitis delta antigen (HDAg) (also known as delta virus capsid protein). Wang, K. S. et al., Nature 323:508-513 (1986), Makino, S. et al. Nature, 329:343-346.
- HDAg exists in two isoforms. Early in the life cycle of the virus, HDAg is expressed as a 195-amino acid protein, the small hepatitis delta antigen (s-HDAg), which functions as a transactivator of HDV RNA replication. This form predominates early in infection. Kuo, M. Y. P, et al., J. Virol. 63:1945-1950 (1989). Later in the life cycle of the virus, there is an RNA editing event that changes the UAG stop codon of the HDAg-S to a UGG codon, encoding a tryptophan. This allows translation to proceed for an additional 19 amino acids, resulting in a 214-amino acid residue form of the protein, the large delta antigen (HDAg-L). The 19 amino acids include a stop signal which allows the large isoform to be farnesylated at its terminus. HDAg-L is a potent inhibitor (dominant repressor) of HDV replication, Chao, M. et al., J. Virol. 64:5066-5069 (1990), Glenn, J. S. & M. J. Virol. 65:2357-2361 (1991), and is also involved in packaging the viral RNA, Chang, F. L. et al. Proc. Natl. Acad. Sci. USA 88:8490-8494 (1991), Wang, C. J. et al., J. Virol. 65:6630-6636 (1991), Ryu, W. S., et al., J. Virol. 66:2310-2315 (1992), and coencapsidation, i.e., the copackaging of the small antigens into the viral particle. It also directs association with the hepatitis B antigen. Chang, F. L. et al. Proc. Natl. Acad. Sci. USA 88:8490-8494 (1991), Ryu, W. S. et al., J. Virol. 66:2310-2315 (1992), Chang, M. F. et al., J. Virol., 68:646-653 (1994). Both the large and small antigens are highly specific RNA-binding phosphoproteins. Chang, M. F. et al., J. Virol. 62:2403-2410, Lin, J. H. et al., J. Virol. 64:4051-4058 (1990) and have been shown to recognize specifically the viral rod-like structure of the HDV viral genomes, Chao, M. et al., J. Virol. 65:4057-4062 (1991). Crosslinking studies have shown that both proteins can exist as either homomultimers (all small antigen or all large antigen) or as heteromultimeric structures (a mixture of small and large antigen) Xia, Y. P. & Lai, M. M. C., J. Virol. 66:6641-6848 (1992), Wang, J. G. & Lemon, S. M., J. Virol. 67:446-454 (1993), Chang, M. F. et al., J. Virol, 67:2529-2536 (1993).
- There have been a number of structure-function studies of both the large and small delta antigens. The N-terminal third of the small delta antigen contains a putative coiled-coil sequence, Xia, Y. P. & Lai, M. M. C., J. Virol. 66:6641-6848 (1992), Wang, J. G. & Lemon, S. M., J. Virol. 67:446-454 (1993), Chang, M. F. et al., J. Virol, 67:2529-2536 (1993), comprising heptad repeats, which is followed by a linker domain which contains bipartite nuclear localization signal. Xia, Y. P. et al., J. Virol. 66:914-921 (1992). The middle portion of HDAg contains two arginine-rich motifs that have been shown to bind to the viral RNA. Lee, C. Z. et al., J. Virol., 67:2221-2227 (1993). The C-terminal segment of s-HDAg is proline- and glycine-rich. Lazinski, D. W. & Taylor, J. M. J. Virol., 67:2672-2680 (1993). L-HDAg is prenylated at the extreme C terminus and it is believed that this part of the molecule interacts with HBsAg and the membranes of the endoplasmic reticulum. Hwang, S. B. & Lai, M. M. C. J. Virol., 67:7659-7662 (1993), de Bruin, W. et al., Virus. Res. 31:27-37 (1994). There is also some evidence that common segments of the large and small antigens may have subtly different conformations. Hwang, S. B. & Lai, M. M. C. Virology 193:924-931 (1993), Hwang S. B. & Lai, M M C, J. Virol. 68:2958-2964 (1994).
- The coiled-coil domain has been shown to be required for a number of the functions of both small and large delta antigens. Mutations that destroy or alter the coiled-coil domain either greatly reduce or totally eliminate the ability of the HDAg-S to function as a trans activator of replication, Chang, M. F. et al., J. Virol., 68:646-653 (1994), Chang, M. F. et al., J. Virol. 62:2403-2410 (1998), Lin, J. H. et al., J. Virol. 64:4051-4058 (1990), Chao, M. et al., J. Virol. 65:4057-4062 (1991), Xia, Y. P. et al., J. Virol., 66:6641-6648 (1992). These same mutations also prevent the HDAg-L from inhibiting HDV RNA replication and inhibit its function in mediating the copackaging of the small antigen, Chang, M. F. et al., J. Virol., 68:646-653 (1994), Chang, M. F. et al., J. Virol. 62:2403-2410, Lin, J. H. et al., J. Virol. 64:4051-4058 (1990), Chao, M. et al., J. Virol. 65:4057-4062 (1991). Transfection of cells undergoing HDV replication with a plasmid containing just the N-terminal one-third of the delta antigen (which contains the coiled-coil domain) inhibited HDV replication, Xia, Y. P. & Lai, M. M. C., J. Virol., 66:6641-6648 (1992). However, removal of the coiled-coil domain does not prevent the delta antigen from binding the viral RNA, Lin, J. H. et al., J. Virol., 64:4051 -4058 (1990) nor does it prevent the HDAg-L from packaging the viral RNA, Chen, P. J., et al., J. Virol., 66:2853-2859 (1992). A “black sheep” model has been proposed for the mechanism of inhibition of the HDV replication. HDAg-L is believed to disrupt the homo-oligomeric small antigen multimers, essentially poisoning the HDAg-S complex. Xia, Y. P. & Lai, M. M. C., J. Virol., 66:6641-6648 (1992). While the precise role of HDAg-S in replication of HDV is unknown, the protein is not a polymerase, and RNA amplification is thought to be mediated by host cell RNA polymerase II, MacNaughton, T. G. et al., Virology 184:387-390 (1991), Fu., T. B. & Taylor, J. et al., J. Virol. 67:6965-6972 (1993).
- Biophysical studies were undertaken to examine the coiled-coil domain of HDAg. Rozzelle, J. E., Jr. et al., Proc. Natl. Acad. USA, 92:382-386 (1995). As described in Example 1, a peptide was synthesized that corresponded to
residues 12 to 60 of the δ12-60(Y). This region includes the N-terminal heptad repeats. The peptide also included a C-terminal tyrosine so that the peptide could be labeled with I125 for use in a radioimmunoassay. The peptide sequence was conceptually divided into three segments based on the presence of two potential helix breakers Gly23 (G23) and Pro49 (P49); segments A (residues 12-24), B (residues 25-49), and C (residues 50-60) (FIG. 1 ). The full-length peptide δ12-60(Y) and two shorter peptides that corresponded to regions A+B and B+C were synthesized. A number of biophysical experiments, including circular dichroism (CD), mass spectrometry, and analytical ultracentrifugation, clearly showed that the δ12-60(Y) peptide was largely helical and formed a coiled coil Rozzelle, J. E., Jr. et al., Proc. Natl. Acad. USA, 92:382-386 (1995). The shorter peptides formed much less stable structures and were considerably less helical than δ12-60(Y). Human polyclonal antibodies from hemophilic patients who were chronic carriers of HBV and HDV reacted with the δ12-60(Y) peptide, in both an ELISA and in a sandwich radioimmunoassay. Rozzelle, J. E., Jr. et al., Proc. Natl. Acad. USA, 92:382-386 (1995), Wang, J. G. et al. J. Virol. 64:1108-1116. Subsequent studies indicated that monoclonal antibodies against the peptide recognized a conformational epitope only presented by the full-length peptide and not the shorter, extensively overlapping peptides, Rozzelle, J. E., Jr. et al., Proc. Natl. Acad. USA, 92:382-386 (1995). - Described herein for the first time is the crystal structure of the peptide δ12-60(Y) to 1.8 Å resolution. The structure reveals that the capsid protein dimerizes as an unusual antiparallel coiled coil. In the crystal structure, the dimers further oligomerize to form an octamer. The octamer forms an open, square planar structure with an antiparallel dimer forming each side of the square. Crosslinking and hydrodynamic studies suggest that both the peptide and the full-length short isoform exist as stable octamers in solution.
- The structure of the peptide lends new insights into the mechanism by which HDAg dimerizes and further associates into higher ordered structures. The structure also explains why residues C-terminal to the predicted coiled-coil domain, and the helix-breaking proline residues are important for the stabilization of the coiled-coil structure. The peptide structure has important consequences for the in vivo oligomerization of HDAg. The unique octameric structure which is observed in the crystal structure also suggests that the N-terminus of the molecule may have a previously undetermined function.
- When the HDAg open reading frame was originally examined, amino acids from residue 13 to 47 were identified as possibly forming a coiled coil. Glutaraldehyde cross-linking studies of full-length HDAg, as well as of the peptide, confirmed the formation of dimers, tetramers and higher-ordered structures, Wang, J. G. & Lemon, S. M., J. Virol., 67:446-454 (1993), Rozzelle, J. E., Jr. et al., Proc. Natl. Acad. USA, 92:382-386 (1995). The crystal structure of the peptide clearly shows how monomers come together to form antiparallel dimers as well as a higher-ordered octameric structure. The structure of δ12-60(Y) also agrees well with previous circular dichroism studies of the peptide, which indicated that the two ends of the peptide (regions A and C) were important for the structural stability of the coiled coil. Rozzelle, J. E., Jr. et al., Proc. Natl. Acad. USA, 92:382-386 (1995). Shorter synthesized peptides that were missing either the A or C regions (A+B and B+C), were significantly less helical than the full-length peptide (A+B+C; 37%, 45% and 84% respectively at 37° C.). The peptide structure shows that hydrophobic residues from the N terminus of one monomer (region A), not involved in the heptad repeat, interact with residues outside of the predicted coiled-coil domain near the C terminus of the other monomer (region C) to form a hydrophobic core Trp20 (W20), Leu24 (L24), Trp50 (W50), Leu51 (L51) sandwiched between Arg13 (R13) and Arg24 (R24). This may stabilize the structure by keeping the ends of the helix from fraying. An additional stabilizing feature is a hydrogen bond between the sidechain of Glu45 (E45) and the indole nitrogen of Trp20 (W20). These hydrophobic residues, as well as the glutamic acid residue, are highly conserved in the 10 different strains of HDV identified to date (
FIG. 1 ). In fact, they are more conserved than those residues in the heptad repeat making up the hydrophobic core of the long helix (FIG. 1 ). - As described in Example 3, cross-linking studies of full-length recombinant small delta antigen (r-HDAg-S or r-δAg-S) also demonstrated that the recombinant protein forms octamers in solution. This indicates that the octamer form seen in the crystal may not be an artifact of crystallization, but rather may represent the true state of the oligomerization of the delta antigen. A study by Chang and colleagues found that a deletion in the HDAg-L, just C terminal to the coiled-coil domain (
residues 50 to 75), prevented the HDAg-S from being copackaged with the HDAg-L, Chang, M. F. J. Virol., 66:6019-6027 (1992). HDAg-L with this same deletion could not inhibit HDV replication, whereas a deletion in L-DHAg of residues 65 to 75 could. This suggested that the coiled-coil domain alone is not sufficient for the interaction between the large and small antigens, and that a subdomain betweenresidues 50 and 65 is also necessary for this interaction. The crystal structure of δ12-60(Y) indeed shows the importance of residues between 50 and 60 in the formation of the peptide oligomer. They are not only involved in stabilizing the δ12-60(Y) dimer Trp50 (W50) and Leu51 (L51) but are also involved in the formation of the dimer-dimer interface Trp50 (W50), Ile54 (I54), and Ile58 (I58). - Prior to the studies described here, the overall organization of the HDAg oligomer was unknown. The structure of the δ12-60(Y) peptide suggests a number of interesting considerations about the function of the coiled-coil domain of the hepatitis delta antigen. For example, Lai and coworkers, Xia, Y. P., et al., J. Virol. 66:6641-6648 (1992), inferring from previous data that showed that as little as 12% of HDAg-L is needed to inhibit 90% of viral activity, Chao, M., et al., J. Virol., 64:5066-5069 (1990), proposed that as little as one part of HDAg-L in eight parts of HDAg-S could inhibit viral replication. Their “black sheep model” proposed that the HDAg-L either disrupted the conformation of the oligomer of HDAg-S, therefore preventing it from binding to host factors, or that the presence of HDAg-L in the complex prevents the complex from interacting with host factors. This would seem in agreement with the peptide structure of octameric δ12-60(Y) and the results of the MALDI-TOF mass spectrometry analysis. If HDAg-L does disrupt the conformation of the oligomer of HDAg-S it probably does not do so directly through the multimerization domain, given that the large and small delta antigen share the same sequence within this region. Rather, it is possible that this α7 or αnβm structure can no longer interact with host factors. Also, since the C terminus of the L-HDAg interacts with the endoplasmic reticulum (ER) membrane and with HBsAg for assembly, it could redirect the complex elsewhere in the cell, preventing the nuclear translocation of s-HDAg which is required for HDV replication.
- Discovery of the organizational structure also provides information regarding possible undetermined functions of the N terminus. The octamer that is formed by the peptide is reminiscent of proteins that form clamps around DNA, such as PCNA. Talluru, S. R., et al., Cell 79:1233-1243 (1994). The 50 Å hole formed by the octameric structure is lined with basic side chains, suggesting that the N terminus of the protein not only may act as a dimerization/oligomerization domain, but also that it may function either as a clamp around the viral RNA or other nucleic acid or perhaps even function as a spool for nucleic acid. There is a report that peptides corresponding to the extreme N-terminal portion of the
HDAg residues 2 to 27 and 2 to 17 can bind the viral RNA Poisson, F., et al., J. Gen. Virol. 74:2473-2478 (1993), Poisson, F. et al., J. Virol Methods 55:381-389 (1995). Since the δ12-60(Y) structure is missingresidues 2 to 11, it is impossible to say what role they play in binding the viral RNA. Of the remaining residues, only Lys25 (K25) and Lys26 (K26), which point into the hole of the octamer, seem likely to play a role in binding RNA by potentially binding the phosphate backbone of the viral RNA. - The large size of the hole may be necessary to accommodate the viral RNA which is only 70% self complementary, and would possess a number of regions of bulged out single-stranded sequence, increasing the radius of gyration of the RNA as well as bending the RNA. Lilley, D. M. J., Proc. Natl Acad. Sci. USA, 92:7140-7142 (1995). The octameric structure also implies that there may be as many as four RNA-binding domains on each side of the octamer. This portion of the molecule may also bind another protein, especially one that is acidic, such as the recently discovered delta antigen interacting protein A (dipA), a cellular protein which has been found to interact with the HDAg Brazas, R. et al., Science 274:90-94 (1996) and; based on its amino acid sequence, would have an isoelectric point of 4.9.
- Many investigators have referred to the putative coiled-coil domain of the delta antigen as a leucine zipper-like region. Experiments involving mutations in this region were interpreted assuming the coiled-coil domain of the delta antigen would resemble the parallel coiled-coil of the bZIP family of transcription factors, such as GCN4. HDAg dimerizes through an antiparallel coiled-coil domain, rather than a standard parallel coiled coil.
- Although algorithms have been designed to determine the oligomerization state of a coiled coil, Woolfson, D. N., et al., Protein Sci. 4:1596-1607 (1995), Wolf, E., et al., Protein Sci. 6:1179-1189 (1997), they cannot determine the orientation of the predicted coiled coil. The discovery that this region forms an antiparallel coiled coil demonstrates that additional biochemical or genetic evidence, such as provided herein, is necessary to determine whether a predicted coiled-coil domain adopts a parallel or antiparallel conformation. Along with the structure of the δ12-60(Y) peptide, there are other examples of molecules that dimerize through antiparallel coiled-coil domains, such as the Escherichia coli regulatory protein AraC, Soisson, S. M., et al., Science 276:421-425 (1997) and the replication terminator protein from Bacillus subtilis, Bussiere, D. E. et al., Cell 80:651-660 (1995).
- The hepatitis delta antigen (HDAg), the sole protein made by the hepatitis delta virus (HDV), is essential for viral replication in vivo. Oligomerization of the protein is necessary for both the transactivating function of the small delta antigen (HDAg-S) and the trans dominant inhibitory effect of the large delta antigen (HDAg-L). The structure of the peptide δ12-60(Y) that corresponds to the predicted coiled-coil domain of the hepatitis delta antigen HDAg suggests that delta antigen HDAg not only dimerizes through an antiparallel coiled coil, but also forms octamers. Interestingly, the coiled coil is stabilized by hydrophobic residues C terminal to the coiled-coil domain. These C-terminal residues interact with hydrophobic residues in the N terminus of the coiled-coil region. The hydrophobic core of the dimer is extended by further hydrophobic interactions at the interface between dimers in the octameric structure. In contrast to the rather promiscuous interactions between the coiled-coil domain, these unique interactions at the termini of the monomer and dimer interfaces might provide a good target for antivirals against HDV, since disruption of oligomerization can prevent replication in vivo.
- The surprising octameric structure of the peptide suggests that the capsid of the delta antigen (HDAg) will look very different from the known structures of other viral nucleocapsid proteins. The octameric structure also suggests important implications for binding of HDAg to the viral RNA, since as many as four of the arginine-rich RNA-binding domains might be needed for binding to the viral RNA. The very basic hole in the octamer suggests that this portion of the molecule may act as a sort of “clamp” around an acidic molecule, such as viral RNA, another nucleic acid or a cellular factor.
- The exact function of HDAg in viral replication is unclear. The protein may only function as a shuttle, binding to the viral RNA and transporting it into the nucleus of the infected cell. It is possible that HDAg functions to recruit host cell transcriptional machinery to the viral RNA. The discovery of the structure enables the design of experiments to determine whether the N terminus of the molecule has RNA-binding capabilities and investigate the mechanism of oligomerization and inhibition of small antigen by the large antigen. A systematic examination of the amino acids involved in dimerization and oligomerization would allow the determination of the mechanism by which HDAg-L inhibits HDAg-S. Furthermore, the unique interactions at the termini of the coiled-coil region provide a new framework to be exploited in the de novo design of stable antiparallel coiled coils.
- The examples presented below are provided as further guidance and are not to be construed as limiting the invention in any way.
- Materials and Methods
- PEPTIDE SYNTHESIS: The δ12-60(Y) peptide was obtained by Erickson, B. and Lemon, S. M. and was synthesized and purified as described previously in Rozzelle, J. E. et al., Proc. Natl. Acad. Sci. USA, 92:382-386 (1995), incorporated herein by reference in its entirety.
- The peptide (
FIG. 11B ) was assembled by fluorenylmethoxycarbonyl chemistry and purified by reversed-phase HPLC. It was Nα-acetylated and Cα-amidated. Crude peptide in 0.05% trifluoroacetic acid was separated on an octyl-silica column [C8, Applied Biosystems, 250 mm×10 mm (i.d.), 300-Å pore size] by elution at 3 ml/min over 50 min with a linear gradient of 20-42% acetonitrile in 0.05% trifluoroacetic acid. Peptide δ12-60(Y) was eluted at 36% acetonitrile (monitored at 230 nm). The homogeneity of the individual fractions was determined on an analytical octyl-silica column. The expected mass of the peptide was confirmed by electrospray ionization (ESI) mass spectrometry: peptide δ12-60(Y), m/z 6034.1±1.2 (calcd. 6033.7). - The peptide from the 12-60 region of HDAg was synthesized (
FIG. 11B ). Peptide δ12-60(Y) included segments A, B, and C. Segment B contains three heptads in which the first and fourth heptad positions are occupied by five leucines and one isoleucine, and is probably part of an α-helical coiled coil. A tyrosine residue, (Y), was added, Lys60, to the C terminus of δ12-60(Y) to permit radioiodination. - CD SPECTROSCOPY. The α-helicity and the temperature at the midpoint of thermal denaturation (Tm) of the peptides were determined by CD spectroscopy. All three peptides had high α-helicity in PBS at 5° C. The ratio of the mean residue ellipticity of the negative bands near 222 nm and 208 nm ([θ222])/([θ208]) is an indicator of coiled-coil formation. Values close to 1.0 indicate an α-helical coiled coil and values near 0.8 indicate isolated α-helices. At 5° C., this ratio was 0.98 for δ12-60(Y). At 37° C., this ratio was 0.94 for δ12-60(Y), consistent with persistence of a coiled-coil structure. In contrast, at 37° C. this ratio was only 0.79 for δ2-49 and 0.76 for 625-60(Y), inconsistent with a coiled-coil structure.
- EXPRESSION PLASMIDS: pR5δV5 was constructed for the high-level expression of HDAg-S in Escherichia coli. The protein sequence of the American strain with the HDAg-S (GenBank accession no. M28267) was back-translated with the program BACKTRANSLATE. (This program was from the Wisconsin Package, versions 9.0 [Genetics Computer Group, Madison, Wis.], with E. coli codon frequencies obtained from gopher://weeds.mgh.harvard.edu:70/Oftp%3Aweeds.mgh.harvard.edu@/pub/codon/eco.cod.) With the sequence obtained as shown in
FIG. 9 andFIG. 18 , the plasmid pR5δV5 was constructed by a two-step PCR method, as described previously. Casimuro, D. R. et al., Biochemistry, 26:6640-6648(1995), with the exception that Vent polymerase (New England BioLabs) was used instead of Taq polymerase. Eight overlapping synthetic primers were synthesized (FIG. 9 andFIG. 18 ). Changes in the back-translated sequences were made so that the overlaps of the PCR primers would have approximately the same melting temperature. Primers were electrophoresed into a 10% sequencing gel, visualized by UV shadowing, and excised from the gel. The primers were then purified with a Waters Sep-Pak column. - The first PCR contained 4 pmol of each of the eight primers in a 100-μl reaction mixture. Ten microliters of the first PCR was added to a second reaction mixture that contained an upstream primer (5′-GGGCATATGAGCCGTAGCGA) and a downstream (5′-GCGCCATGGTTTACGGAAAG) primer designed to amplify the desired full-length product. Both reactions involved a hot start at 94° C. followed by 30 cycles of 1 min. at 94° C., 1 min at 57° C., and 1 min at 72° C., with a final 5-min extension at 72° C.
- The PCR product from the second reaction was cloned into the vector pCR-Blunt (Invitrogen), which allows selection based on disruption of a toxic gene. Plasmids isolated from colonies were checked for the insert by restriction digest mapping. The open reading frame of HDAg-S was subdloned into expression vector pRSETb (Invitrogen). The sequence of the resultant plasmid, pR5δV5, was verified by dye termination sequencing.
- PROTEIN PURIFICATION: Recombinant HDAg-S (δAg-S) was expressed and purified as follows. Plasmid pR5δV5 was transformed into BL21 (DE3)pLysS cells (Novagen). A single colony was used to inoculate a 100-ml overnight culture. Ten milliters of this overnight culture was used to inoculate a 1-liter culture. At an optical density of between 0.4 and 0.6, the cells were induced with 3 ml of 100 mM IPTG (isopropyl-β-D-thiogalactopyranoside). Cell growth was continued for 3 h, and then cells were pelleted at 5,000×g for 10 min. The cells were resuspended in 15 ml of 50 mM HEPES (pH 7.5)—250 mM NaCl—1 mM MgC12 and stored at −20° C. until needed.
- The frozen cells (45 ml corresponding to three 1-liter cultures) were thawed, and one Complete Protease Inhibitor tablet (Boehringer Mannheim) was added, along with RNase A and DNase I, to a final concentration of 50 μg/ml. Cells were lysed by sonication and pelleted at 10,000×g for 30 min. The lysate was diluted threefold with 50 mM HEPES buffer (pH 7.5) and then applied to a 10×1.5-cm Fast SP Sepharose column (Pharmacia) equilibrated with 50 mM HEPES buffer (pH 7.5) and eluted with a salt gradient from 0 to 1 M NaCl in 50 mM HEPES (pH 7.5). The fractions containing δAg-S were applied to a Superdex S-200 column (Pharmacia) equilibrated with 50 mM HEPES (pH 7.5), 500 mM NaCl, and 5% glycerol. The HDAg-S obtained was >85% pure as judged by Coomassie blue staining of a sodium dodecyl sulfate gel.
- Proteins with a histidine tag were purified as follows. Proteins expressed in E. coli were affinity purified with a Talon column according to the recommendations of the manufacturer (Clontech). Proteins expressed in mammalian cells were purified by the Invitrogen Xpress System. In both cases, the fractions containing the purified protein were identified by sodium dodecyl sulfate gel electrophoresis, pooled, dialyzed, and concentrated.
- TRANSFECTION: Plastic 16-mm-diameter tissue culture wells (Costar) were seeded with approximately 0.1×106 Huh7 cells, Nakabayoshi, H. et al. Cancer Res., 42:3858-3863 (1982). For transfections with assembled RNP, 0.25 to 900 ng of HDAg-S and 500 ng of genomic HDV RNA in 125 μl of Opti-MEM were combined with 2.7 μl of lipofectamine (2 mg/ml) in 125 μl of Opti-MEM, incubated for 30 min at room temperature, and applied to cells that had been washed with Opti-MEM (Hawley-Nelson, P. et al., Focus, 15:73-79(1993)). In control transfections, either HDAg-S or HDV RNA was omitted. For cDNA transfections, 500 ng of plasmid DNA was used. At 5 h after transfection, the transfection mixture was changed to Dulbecco's modified Eagle's medium supplemented with 10% fetal calf serum. At 4 days after transfection, cells were reseeded into a 30-mm-diameter dish containing a glass coverslip; at 8 days, the cells were examined by immunofluorescence microscopy. Eight days length was chosen for three reasons: (I) to avoid detection of the transfected HDAg-S; (ii) In the immunofluorescence assays, 8 days corresponded to the peak signal for a cell undergoing RNP-iniated replication; (iii) At 8 days, HDAg-L, created as a consequence of both RNA editing and genome replication, could be readily detected (Luo et al. J. Virol., 64:1021-1027 (1990)). In contrast, for Northern analyses, genome replication was detectable as early as 2 days.
- Results
- Initial studies with E. coli demonstrated poor expression of HDAg-S from the wild-type sequence. About 18% of the codons in the natural HDAg sequence are rarely used by E. coli. Attempted overexpression of codons that are rare in E. coli not only can inhibit expression but also can lead to misincorporation (Del Tito, B. J. et al., J. Bacterial, 177:7086-7091(1995) Therefore, a nucleotide sequence was designed which maintained the amino acid sequence, but increased the percentage of codons that were most favored for expression in E. coli from 26% to 85%. This optimized sequence (
FIG. 9 ) was used to construct expression plasmid pR5δV5. Thus, a 40-fold increase in expression was obtained and the recombinant protein was purified to >85% homogeneity. - Materials and Methods
- CRYSTALLIZATION AND DATA COLLECTION: The peptide δ12-60(Y) was dissolved in 50 mM acetate, pH 4.8, 50 mM NaCl and brought to a concentration of 15 mg/ml. The crystals of the δ12-60(Y) peptide were grown at 22° C. by the vapor diffusion method. The peptide (2 μl of a 15 mg/ml solution) was mixed with 2 μl of the reservoir solution containing 100 mM sodium acetate, pH 4.8, and 100 mM sodium citrate, pH 5.6, on a coverslip and then inverted over the reservoir solution. Crystals appeared within 3-4 days, and grew as large as 0.5×0.3×0.3 mm. Crystals belonged to
space group P2 1212 with unit cell parameters a=109.2 Å, b=85.3 Å, c=29.4 Å, α=β=γ=90°. When attempts to find a heavy atom derivative failed, a peptide was synthesized with serine 22 replaced by a cysteine, δS22C12-60(Y). The 8S22C12-60(Y) peptide was reacted with an excess of platinum terpyridine, dialyzed overnight against water, and then freeze-dried. The peptide was then reconstituted at 15 mg/ml in 50 mM acetate, 50 mM NaCl, 5 mM DTT, pH 4.8, and crystallized by the same conditions as that of the wild type-peptide. This peptide crystallized isomorphously with the δ12-60(Y). - The coverslips containing the crystals were inverted and cryosolvent (reservoir solution containing 30% glycerol) was slowly mixed with the drops and continuously replaced until no mixing was observed. The crystals were mounted in nylon loops and frozen directly in the nitrogen stream. Crystals used at Brookhaven were stored in liquid nitrogen until the time of data collection. Two native data sets were collected at Beamline X12C at the National Synchrotron Light Source at Brookhaven National Lab using X-rays of wavelength 1.15 Å (Table 1). The heavy atom data set was collected on a Siemens rotating anode with a multiwire detector (Table 1). Data from the native crystals was processed using DENZO, Otwinowski, Z. SERC Daresbury Laboratory, Warrington, UK:(56-62 (1993)) and SCALEPACK. Data from the heavy atom derivative was integrated using the program BUDDHA (Blum, M. et al., J. Appl. Cryst. 20:235-242 (1987)) and processed using ROTAVATA and AGROVATA from the CCP4 package (CPP4 Acta. Cryst. D., 50:760-763 (1994)). Structure factors from both data sets were calculated using TRUNCATE (CPP4, supra). Data from the native and derivative were scaled together using SCALEIT (CPP4, supra).
- STRUCTURE DETERMINATION AND MODEL BUILDING: The positions of the heavy atom sites were determined using SHELXS-86 (Sheldrick, G. M., Acta. Cryst. A. 46:467-473 (1990)). The positions of the heavy atom sites were refined using MLPHARE (Otwinowski, Z., Proceedings of the CCP4 Study Weekend, 80-86 SERC Daresbury Laboratory, Warrington, UK (1991)), and initial SIRAS phases were calculated. The data was then subjected to a round of solvent flattening with histogram matching using DM (Zhang, K. Y. J. & Main, P. Acta. Cryst. A. 46:41-46 (1990). A map was calculated which clearly showed the position of the two dimers in the asymmetric unit, and an initial model was built into the initial SIRAS map using the program O (Jones, T. A. et al. Acta Cryst. A., 47:110-119 (1990)). The structure was refined using X-PLORv3.8.9, Brunger, A. T., Yale University Press, New Haven, Conn. (1992). Rounds of positional refinement, followed by simulated annealing and B-factor refinement, were carried out with rebuilding of the structure using O between cycles of refinement. During the initial model building and refinement, omit maps, which excluded 10 residues at a time, were used to check the progress of refinement.
- SURFACE AND ELECTROSTATIC CALCULATIONS: Surface calculations were performed using the surface option in QUANTA version 4.0. Electrostatic calculations were performed with GRASP version 1.3.
- PROTEIN EXPRESSION AND PURIFICATION: The pR5δV5 plasmid, Dingle, K. et al. J. Virol., 72(6):4783-4788 (1998) which contains a synthetic gene for the small delta antigen, HDAg-S, was transformed into BL21 (DE3)pLysS cells (Novagen) and purified as described previously in Example 2. See also Dingle, K. et al., supra. Briefly, 45 ml of frozen cells, corresponding to three 1 L cultures, were thawed and one protease inhibitor tablet (Boehringer Mannheim) was added, as well as RNAse A and DNAse I to a final concentration of 50 μg/ml. Cells lysed by sonication were pelleted at 10,000×g for 30 minutes. The lysate was diluted three-fold with 50 mM HEPES buffer, pH 7.5, and then applied to a 10×1.5 cm Fast SP Sepharose (Pharmacia) column equilibrated with 50 mM HEPES buffer, pH 7.5, and eluted using a salt gradient from 0-1M NaCl in 50 mM HEPES, pH 7.5. The fractions containing recombinant small delta antigen (rδAg-S [r-HDAg-S]) were assayed using SDS-PAGE and pooled. The sample was then applied to a Superdex S-200 column (Pharmacia) equilibrated with 50 mM Hepes, pH 7.5, 500 mM NaCl and 5% glycerol. The elution of the protein from the column was monitored by UV absorbance at 280 nm.
- Results
- Attempts to find a heavy atom derivative using the peptide with the wild-type sequence of the American strain of HDAg failed. Thus, a new peptide was synthesized with a cysteine replacing serine 22 (Ser22) (this residue demonstrates considerable variation in different strains of HDV,
FIG. 1 ). The cysteine mutant and wild-type peptides crystallized isomorphously. The presence of cysteine 22 (Cys22) allowed the preparation of a platinum terpyridine derivative, facilitating the determination of the structure using SIRAS methods (Table 1). Retrospective examination of the model confirmed that the Pt was bound to the sulfur of cysteine 22 (Cys22). - The solvent-flattened map was easily interpretable, and clearly showed two dimers in the asymmetric unit. Rounds of positional refinement, simulated annealing, temperature factor refinement using X-PLOR (Bruinger, A. T., Yale University Press, New Haven, Conn. (1992)), and manual rebuilding using O (Jones, T. A. et al., Acta Cryst. A. 47:110-19 (1990)), led to the current model (Table 2,
FIG. 2 ). The current model has an R factor of 22.5% and a free R factor of 27% with good geometry (r.m.s.d. bond 0.007 Å and r.m.s.d. bond angles 1.0°). A number of sidechains exposed to the large solvent channel, as well as the first residue in the chain and the last residue in one of the chains, are disordered. The four monomers in the asymmetric unit superimpose well onto one another, with an average r.m.s.d. for mainchain atoms of 0.81 Å and for all non-hydrogen atoms 1.51 Å. The main differences in the monomers are those residues involved in crystal packing interactions. - The coordinates have been deposited in the Brookhaven Protein Data Bank (accession number 1A92).
- Each monomer is composed of a long, N-terminal helix, approximately 60 Å in length, interrupted by a sharp bend at proline 49 (Pro49), and continuing on into another short helix. The long helices of each of two monomers wrap around each other forming an antiparallel coiled coil (
FIG. 3 a,FIG. 3 b), which straightens out at the N terminus. Only one of the four possible salt bridges between Glu31 (E31) and Lys38 (K38) is seen. In the other three cases, the charged groups are slightly farther apart (3.8 Å, 4.2 Å and 4.4 Å versus 2.9 Å) and the sidechains are hydrogen bonded to nearby solvent molecules. The sidechain of Glu45 (E45) is hydrogen bonded to the indole nitrogen of Trp20 (W20). The sidechain of Asn48 (N48), which is located at the C terminus of the long helix, completes the hydrogen-bonding pattern of the helix by making a hydrogen bond back to the mainchain oxygen of Leu44 (L44). The formation of the dimer buries 2650 Å2 of surface area, approximately 26% of the total surface area. - Although the majority of residues in the heptad repeat (
FIG. 3 c) of the predicted coiled-coil region do pack as expected, Trp20 (W20) does not. Even though the Cα-Cβ vector of Trp20 (W20) points out of the interface as would be expected for a sidechain in the a position of a heptad repeat, the sidechain of Trp20 is flipped away from the core of the coiled coil and into a hydrophobic region formed between segment A (residues 12-24) of one monomer, and segment C (50-60) of its partner within the peptide dimer. The dimer shows primarily hydrophobic interactions between residues in the A and C regions. Ile16 (I16), Leu17 (L17), Trp20 (W20), Trp50 (W50), and Leu51 (L51) are the sidechains primarily involved in this hydrophobic region, which is capped by the aliphatic portion of the sidechains of Arg13 (R13) and Arg24 (R24) (FIG. 4 ). The primary non-hydrophobic, monomer-monomer interactions near this region involve the formation of a hydrogen bond between Trp20 (W20) and Glu45 (E45) (FIG. 4 ). The heptad repeat is also unusual in that it contains a glycine at position 23. If the monomers were oriented in a parallel fashion, a large hole in the middle of the hydrophobic core of the dimer would result. However, since the strands are arranged antiparallel, the large sidechain of Ile41 (I41) packs into the hole formed by Gly23 (G23). The dimer is stabilized by hydrophobic interactions other than the residues in the heptad repeat. Residues from the N-termini of each monomer, Ile16 (I16), Leu17 (L17), Trp20 (W20) from one monomer and Trp50 (W50), Leu51 (L51), and Ile54 (I54) from the other, form a hydrophobic core which is protected from solvent by the aliphatic portions of Arg13 (R13) and Arg24 (R24). There is also a hydrogen bond between the sidechain of Glu45 (E45) and the indole nitrogen of Trp20 (W20) (O—N distance 2.8 Å). - In the crystal, each dimer associates with three other dimers to form a doughnut-like octamer (
FIG. 5 ). The octameric complex forms a pseudo-centered (C222) cell. The octamer is widely open with a central “hole”, 50 Å in diameter. The open structure of the octamer is reminiscent of several other proteins, including Proliferating cell nuclear antigen (PCNA), in which the hole that is formed is believed to encircle DNA (Talluru, S. R. et al., Cell, 79:1233-1243 (1994)). It is this octameric structure which is the translational repeating unit in the crystal (FIG. 5 ). The dimer-dimer interface is a four-helix bundle formed across the crystallographic two-fold axis. The interface of the two dimers consists of hydrophobic residues in region A of the coiled-coil domain Leu17 (L17) and Val21 (V21) but also includes residues C-terminal to the coiled-coil domain, region C, betweenresidues 50 to 60 Trp50 (W50), Ile54 (I54), Ile57 (I57) and Ile58 (I58) (FIG. 6 ). Thus, hydrophobic residue from both helices pack in the interface, essentially extending the hydrophobic core mentioned above. Trp50 is involved in both the formation of the dimer as well as the octamer. Formation of the octamer buries an additional 800 Å2 of surface area per monomer, which means that approximately 40% of the total surface area of each monomer is buried. The 50 Å diameter hole framed by the four dimers is lined with basic sidechains (FIGS. 7 a and b). The hole is large enough to accommodate an RNA molecule. Residues Lys26 (K26) and Lys38 (K38) which had been modeled in as alanine were changed to lysine for this calculation. The electrostatic surface was calculated using GRASP (Nicholls, A. Columbia University, New York, N.Y.(1992)), and rendered using RASTER3D (Merritt, E. A. & Murphy, M. E. P., Acta. Cryst. D. 50:869-873 (1994)).TABLE 1 Data Collection Statistics Native* S22C Pt Spacegroup P2 1212 P2 1212Unit cell (a, b, c) 109.2, 85.3, 29.4 110.3, 86.3, 29.6 Temperature of data −160 −165 collection (° C.) Resolution (Å) 15-1.73 86-2.5 Number of reflections 221, 286 44, 362 Number of unique reflections 28, 279 10, 013 Completeness† (%) 94 (35) 97 I/σ† 51 (7) 6.0 Multiplicity 7.8 4.4 Rsym †‡§ (%) 4.2 (18) 6.7 [14.0] Riso ¶ (%) — 30.5 Rcullis# — 0.62 (0.52) Rcullis anon ¥ — 0.84 Phasing power** — 2.2 (1.7) *Data are from two crystals. †Numbers in parentheses represent values in the highest resolutions shell. where <I(h,k,l) > represents the sigma weighted average intensity of symmetry-equivalent reflections. §The number in square brackets represents where <I+/−(h,k,l) > represents the statistically weighted average intensity of symmetry-equivalent reflections. number in parentheses represents Rcullis for centric reflections. **Phasing power = <|FH|/| |FPH| − |FP + FH| |>; number in parentheses is the power for centric reflections -
TABLE 2 Refinement Statistics Resolution range (Å) 15-1.8 Rworking*(%) 22.5 Rfree †(%) 27.0 Non-hydrogen protein atoms 1785 Solvent atoms 114 Rms from ideal geometry bond lengths (Å) 0.007 bond angles (°) 1.0 dihedral angles (°) 16.9 impropers (°) 0.59 Average B factor overall (Å2) 29.3 mainchain 22.5 sidechain 32.4 solvent 34.7 for a working set composed of 90% of the data. for a test set composed of 10% of the data selected randomly. - Materials and Methods:
- r-HDAg-S was prepared as described above. The samples for mass spectrometry were prepared as follows: the r-HDAg-S was dialyzed overnight against water. Cross-linked protein was prepared by the addition of 5 μl of 0.5% glutaraldehyde to 40 μl of rHDAg-S for 5 minutes, and quenched by the addition of 5 μl of 1 M ammonium acetate. Mass spectrometry was performed in the BCMP Biopolymer facility on a Persceptive Biosystems Voyager-DE mass spectrometer.
- Results
- Previous studies have suggested that both the peptide and natural HDAg derived from infected liver form multimers in solution (Wang, J. G. & Lemon S. M. J. Virol, 67:446-454 (1993)), (Rozzelle, J. E., Jr. et al. Proc. Natl. Sci. USA, 92:382-386 (1995)). In order to investigate the significance of the octamer formed by the peptide, MALDI-TOF mass spectrometry was used to determine the mass of monomeric and oligomeric forms of recombinant small delta antigen, r-HDAg-S. The uncrosslinked protein has a mass of 2,1832 Da (
FIG. 8A ), which is the correct mass within 0.01% of the amino acid sequence of the American strain of the small delta antigen (HDAg-S) (Genbank accession #-M28267) minus the first methionine residue. The primary species of the cross-linked rHDAg-S had a mass of 176,282 Da (FIG. 8B ). The M+1 and M+2 peaks of the octamer were the only significant peaks in the spectrum. The ratio of the masses of the cross-linked species to the monomer is 8.1:1. - While this invention has been particularly shown and described with references to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the claims.
Claims (36)
1. An isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of:
a) a nucleotide sequence depicted in FIG. 9 , nucleotides 37-150 of FIG. 9 , nucleotides 37-186 of FIG. 9 , FIG. 10 , nucleotides 1421-1566 of FIG. 10 , nucleotides 1457-1566 of FIG. 10 , FIG. 15 and FIG. 16 ;
b) a complementary strand of the sequence of a);
c) DNA sequences that hybridize to the sequence of a) or b); and
d) RNA sequences transcribed from the sequences of a), b) or c), or a fragment or mutation thereof, which encodes a coiled-coil oligomer.
2. An isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of:
a) a nucleotide sequence encoding a polypeptide comprising an amino acid sequence depicted in a row of FIG. 1 , amino acids 12-48 of a row of FIG. 1 , the top row of FIG. 3C , FIG. 9 , amino acids 12-48 of a row of FIG. 9 , FIG. 10 , amino acids 12-88 of FIG. 10 , FIG. 11 and FIG. 17 ;
b) the complementary strand of the sequence of a);
c) RNA sequences transcribed from the sequences of a) or b), or a fragment or mutation thereof, which encodes a coiled-coil oligomer.
3. An isolated nucleic acid molecule encoding a fusion molecule comprising a fusion molecule comprising HDAg and at least one binding moiety.
4. A fusion gene comprising a nucleic acid molecule of claim 1 operably linked to a nucleic acid molecule encoding a heterologous peptide.
5. A fusion gene comprising a nucleic acid molecule of claim 2 operably linked to a nucleic acid molecule encoding a heterologous peptide.
6. A vector comprising a nucleic acid molecule which encodes a subunit of an HDAg coiled-coil octamer.
7. A vector comprising a nucleic acid molecule of claim 1 .
8. A vector comprising a nucleic acid molecule of claim 2 .
9. A vector comprising a nucleic acid molecule of claim 3 .
10. A vector comprising a nucleic acid molecule encoding a fusion molecule comprising HDAg and at least one binding moiety.
11. A vector comprising a nucleic acid molecule encoding HDAg and at least one multiple cloning site.
12. The vector of claim 11 wherein at least one multiple cloning site is located 3′ to the nucleic acid molecule encoding HDAg.
13. The vector of claim 11 wherein at least one multiple coding site is located 5′ to the nucleic acid molecule encoding HDAg.
14. The vector of claim 11 wherein there are at least two multiple coding sites, wherein at least one multiple coding site is located in a flanking region 3′ to the nucleic acid molecule encoding HDAg and at least one multiple coding site is located in a flanking region 5′ to the nucleic acid molecule encoding HDAg.
15. A vector comprising a nucleic acid molecule of claim 1 and at least one multiple cloning site.
16. A vector comprising a nucleic acid molecule of claim 2 and at least one multiple cloning site.
17. The vector of claim 16 further comprising a nucleic acid molecule encoding a nuclear localization signal.
18. The vector for expression of the fusion molecule comprising HDAg and at least one binding moiety wherein a first heterologous gene encodes a first binding moiety and a second heterologous gene encodes a second binding moiety.
19. A vector of claim 11 which further comprises a nucleic acid molecule which encodes a heterologous gene.
20. A host cell which comprises a nucleic acid molecule which encodes a fusion molecule comprising HDAg and at least one binding moiety.
21. A host cell which comprises a nucleic acid molecule of claim 1 .
22. A host cell which comprises a nucleic acid molecule of claim 2 .
23. A method of manufacturing a host cell comprising a nucleic acid molecule encoding a fusion molecule comprising HDAg and at least one binding moiety comprising introducing a vector of claim 10 into the host cell.
24. A method of expressing a high valency display of at least one binding moiety comprising introducing into a cell a vector comprising a nucleic acid molecule encoding HDAg and a nucleic acid molecule encoding the binding moiety and culturing the cell under conditions sufficient to permit expression of a fusion molecule comprising the binding moiety and HDAg.
25. A method for delivering molecules to a cell comprising contacting them with a fusion molecule comprising HDAg and at least one binding moiety.
26. The method of claim 25 wherein the binding moiety is an oligonucleotide.
27. The method of claim 25 wherein the oligonucleotide hybridizes to a nucleic acid molecule in the cell.
28. The method of claim 26 wherein said fusion molecule further comprises a double-stranded nuclease.
29. The method of claim 25 wherein the fusion molecule comprises a first binding moiety and a second binding moiety wherein the first binding moiety interacts with a binding partner and the second binding moiety functions as an effector.
30. The method of claim 29 wherein the first binding moiety interacts with a cell surface receptor and the second binding moiety can kill the cell.
31. A method of amplifying a signal in a solid phase assay comprising coupling an HDAg octamer with at least one copy of a domain which interacts with a ligand and at least two copies of a label.
32. A method of claim 31 wherein the label is selected from the group consisting of alkaline phosphatase, a radiolabel, streptadavin and green fluorescent protein.
33. The method of claim 31 wherein the solid phase assay is an ELISA assay.
34. A method of facilitating exchange of substrates and products comprising coupling an HDAg oligomer to at least two enzymes which function in a linked pathway.
35. A method of enhancing a reaction between binding partners comprising coupling the binding partners to an HDAg oligomer.
36. A method of enhancing a reaction between two binding partners comprising coupling one binding partner to an HDAg oligomer and contacting the oligomer to a second binding partner.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/038,652 US20050170337A1 (en) | 1998-07-02 | 2005-01-18 | Oligomerization of hepatitis delta antigen |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US9160998P | 1998-07-02 | 1998-07-02 | |
US09/347,175 US6844171B1 (en) | 1998-07-02 | 1999-07-01 | Oligomerization of hepatitis delta antigen |
US11/038,652 US20050170337A1 (en) | 1998-07-02 | 2005-01-18 | Oligomerization of hepatitis delta antigen |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/347,175 Division US6844171B1 (en) | 1998-07-02 | 1999-07-01 | Oligomerization of hepatitis delta antigen |
Publications (1)
Publication Number | Publication Date |
---|---|
US20050170337A1 true US20050170337A1 (en) | 2005-08-04 |
Family
ID=33566958
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/347,175 Expired - Fee Related US6844171B1 (en) | 1998-07-02 | 1999-07-01 | Oligomerization of hepatitis delta antigen |
US11/038,652 Abandoned US20050170337A1 (en) | 1998-07-02 | 2005-01-18 | Oligomerization of hepatitis delta antigen |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/347,175 Expired - Fee Related US6844171B1 (en) | 1998-07-02 | 1999-07-01 | Oligomerization of hepatitis delta antigen |
Country Status (1)
Country | Link |
---|---|
US (2) | US6844171B1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010092963A1 (en) * | 2009-02-10 | 2010-08-19 | 国立大学法人 琉球大学 | Drug transporter, and adjuvant and vaccine each utilizing same |
WO2016183366A3 (en) * | 2015-05-12 | 2016-12-29 | Protiva Biotherapeutics, Inc. | Compositions and methods for silencing expression of hepatitis d virus rna |
WO2017132332A1 (en) * | 2016-01-28 | 2017-08-03 | Svenska Vaccinfabriken Produktion Ab | Chimeric hepatitis d virus antigen and hepatitis b virus pre s1 genes for use alone or in vaccines contaning hepatitis b virus genes |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2830019B1 (en) * | 2001-09-24 | 2005-01-07 | Assist Publ Hopitaux De Paris | NUCLEIC ACID MOLECULES OF HDV, THEIR FRAGMENTS AND THEIR APLLICATIONS |
AU2002367566B2 (en) * | 2001-11-20 | 2008-10-16 | Duke University | Interfacial biomaterials |
US20130331297A1 (en) | 2010-07-16 | 2013-12-12 | Avantgen, Inc. | Novel peptides and uses thereof |
ES2671381T3 (en) | 2011-06-14 | 2018-06-06 | Globeimmune, Inc. | Immunotherapeutic compositions for the treatment or prevention of hepatitis delta virus infection |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE3788902T3 (en) * | 1986-06-17 | 2004-11-25 | Chiron Corp. (N.D.Ges.D. Staates Delaware), Emeryville | Hepatitis delta diagnostics and vaccines, their manufacture and use. |
IT1241003B (en) * | 1990-11-05 | 1993-12-27 | Sorin Biomedica Spa | PROCEDURE FOR THE PURIFICATION OF AN EQUIVALENT RECOMBINING POLYPEPTIDE HDIV, EQUIVALENT POLYPEPTIDE HDAG AND IMMUNOLOGICAL TESTS USING IT |
WO1996020953A2 (en) * | 1994-12-30 | 1996-07-11 | The University Of North Carolina At Chapel Hill | Synthetic multimeric peptide with delta hepatitis virus antigenic activity |
-
1999
- 1999-07-01 US US09/347,175 patent/US6844171B1/en not_active Expired - Fee Related
-
2005
- 2005-01-18 US US11/038,652 patent/US20050170337A1/en not_active Abandoned
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010092963A1 (en) * | 2009-02-10 | 2010-08-19 | 国立大学法人 琉球大学 | Drug transporter, and adjuvant and vaccine each utilizing same |
US8580274B2 (en) | 2009-02-10 | 2013-11-12 | University Of The Ryukyus | Drug transporter, and adjuvant and vaccine each utilizing same |
WO2016183366A3 (en) * | 2015-05-12 | 2016-12-29 | Protiva Biotherapeutics, Inc. | Compositions and methods for silencing expression of hepatitis d virus rna |
WO2017132332A1 (en) * | 2016-01-28 | 2017-08-03 | Svenska Vaccinfabriken Produktion Ab | Chimeric hepatitis d virus antigen and hepatitis b virus pre s1 genes for use alone or in vaccines contaning hepatitis b virus genes |
US10905760B2 (en) | 2016-01-28 | 2021-02-02 | Svenska Vaccinfabriken Produktion Ab | Chimeric hepatitis D virus antigen and hepatitis B virus pre S1 genes for use alone or in vaccines containing hepatitis B virus genes |
US11478544B2 (en) | 2016-01-28 | 2022-10-25 | Svenska Vaccinfabriken Produktion Ab | Chimeric hepatitis D virus antigen and hepatitis B virus pre S1 genes for use alone or in vaccines contaning hepatitis B virus genes |
Also Published As
Publication number | Publication date |
---|---|
US6844171B1 (en) | 2005-01-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8192953B2 (en) | Trimerising module | |
EP0652895B1 (en) | Compounds which inhibit hiv replication | |
JP4925540B2 (en) | 5-helix protein | |
Center et al. | Crystallization of a trimeric human T cell leukemia virus type 1 gp21 ectodomain fragment as a chimera with maltose‐binding protein | |
Naeve et al. | Fatty acids on the A/Japan/305/57 influenza virus hemagglutinin have a role in membrane fusion. | |
ES2307550T3 (en) | DI-U OLIGOMERO OF A DI-, TRI-, TETRA- OR PENTAMETER OF RECOMBINANT FUSION PROTEINS. | |
US6911205B2 (en) | Stabilized soluble glycoprotein trimers | |
Shu et al. | Trimerization specificity in HIV-1 gp41: analysis with a GCN4 leucine zipper model | |
Barnett et al. | Structure and mechanism of a coreceptor for infection by a pathogenic feline retrovirus | |
KR20230009445A (en) | Stabilized coronavirus spike protein fusion protein | |
JP2011515074A (en) | Expression purification of influenza virus PA and crystal structure of PA N-terminal and PA C-terminal and PB1 N-terminal polypeptide complex | |
Maerz et al. | Functional implications of the human T-lymphotropic virus type 1 transmembrane glycoprotein helical hairpin structure | |
US6844171B1 (en) | Oligomerization of hepatitis delta antigen | |
Kraft et al. | Structure of D-63 from Sulfolobus spindle-shaped virus 1: surface properties of the dimeric four-helix bundle suggest an adaptor protein function | |
Bringolf et al. | Dimerization efficiency of canine distemper virus matrix protein regulates membrane-budding activity | |
CN113061168B (en) | Truncated fever with thrombocytopenia syndrome virus Gn protein and application thereof | |
TW201343670A (en) | Stable peptide mimetics of the HIV-1 gp41 pre-hairpin intermediate | |
WO1998047916A1 (en) | Bifunctional polypeptides for cell-specific viral targeting | |
US20080213292A1 (en) | Soluble And Stabilized Trimeric Form Of Gp41 Polypeptides | |
US20010043931A1 (en) | Human respiratory syncytial virus | |
WO2005018666A1 (en) | Polypeptide multimers having antiviral activity | |
Baumann et al. | Identification of a potential modification site in human stromal cell‐derived factor‐1 | |
CN1980952A (en) | Multimerised HIV fusion inhibitors | |
CN114751992B (en) | A universal vaccine for coronavirus and its application | |
US20230357325A1 (en) | Composition and method to stabilize coronavirus spike glycoproteins in pre-fusion conformation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |