+

WO2004113540A1 - Expression of heterologous protein - Google Patents

Expression of heterologous protein Download PDF

Info

Publication number
WO2004113540A1
WO2004113540A1 PCT/GB2004/002432 GB2004002432W WO2004113540A1 WO 2004113540 A1 WO2004113540 A1 WO 2004113540A1 GB 2004002432 W GB2004002432 W GB 2004002432W WO 2004113540 A1 WO2004113540 A1 WO 2004113540A1
Authority
WO
WIPO (PCT)
Prior art keywords
nucleic acid
acid molecule
polypeptide
encodes
cell
Prior art date
Application number
PCT/GB2004/002432
Other languages
French (fr)
Inventor
Peter Eastmond
Original Assignee
The University Of York
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from GB0314102A external-priority patent/GB0314102D0/en
Priority claimed from GB0402293A external-priority patent/GB0402293D0/en
Application filed by The University Of York filed Critical The University Of York
Publication of WO2004113540A1 publication Critical patent/WO2004113540A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8242Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
    • C12N15/8257Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits for the production of primary gene products, e.g. pharmaceutical products, interferon
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/43504Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates
    • C07K14/43563Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates from insects
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/62DNA sequences coding for fusion proteins
    • C12N15/625DNA sequences coding for fusion proteins containing a sequence coding for a signal sequence
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8216Methods for controlling, regulating or enhancing expression of transgenes in plant cells
    • C12N15/8222Developmentally regulated expression systems, tissue, organ specific, temporal or spatial regulation
    • C12N15/823Reproductive tissue-specific promoters
    • C12N15/8234Seed-specific, e.g. embryo, endosperm
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1048Glycosyltransferases (2.4)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K2039/54Medicinal preparations containing antigens or antibodies characterised by the route of administration
    • A61K2039/541Mucosal route
    • A61K2039/542Mucosal route oral/gastrointestinal
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/01Fusion polypeptide containing a localisation/targetting motif
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/30Non-immunoglobulin-derived peptide or protein having an immunoglobulin constant or Fc region, or a fragment thereof, attached thereto
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/50Fusion polypeptide containing protease site

Definitions

  • the invention relates to the manufacture of recombinant proteins; vectors for use in the recombinant manufacture of protein and recombinant proteins that are adapted for purification.
  • recombinant proteins require a high standard of quality control since many of these proteins are used as pharmaceuticals, for example: growth hormone; leptin; erythropoietin; prolactin, interleukins; interferon's.
  • vaccines particularly subunit vaccines, (vaccines based on a defined antigen, for example gpl20 of HIV)
  • the production of recombinant protein in cell expression systems is based either on prokaryotic cell expression or eukaryotic cell expression. The latter is preferred when post-translation modifications to the protein are required.
  • Eukaryotic systems include the use of mammalian cells, e.g. Chinese Hamster Ovary cells; insect cells e.g. Spodoptera spp; fungal/yeast (e.g. Saccharomyces spp, Pichia spp, Aspergillus spp) and plants.
  • mammalian cells e.g. Chinese Hamster Ovary cells
  • insect cells e.g. Spodoptera spp
  • fungal/yeast e.g. Saccharomyces spp, Pichia spp, Aspergillus spp
  • heterologous recombinant proteins in an expression system relates to a variety of proteins used in both industrial and medical applications.
  • tumour suppressor polypeptides include tumour suppressor polypeptides (e.g. p53 polypeptide, the APC polypeptide, the DPC-4 polypeptide, the BRCA-1 polypeptide, the BRCA-2 polypeptide, the WT-1 polypeptide, the retinoblastoma polypeptide (Lee, et al. (1987) Nature 329:642), the MMAC-1 polypeptide, the adenomatous polyposis coli protein (United States Patent 5,783,666), the deleted in colon carcinoma (DCC) polypeptide, the MMSC-2 polypeptide, the NF-1 polypeptide, nasopharyngeal carcinoma tumour suppressor polypeptide (Cheng, et al. 1998.
  • tumour suppressor polypeptides e.g. p53 polypeptide, the APC polypeptide, the DPC-4 polypeptide, the BRCA-1 polypeptide, the BRCA-2 polypeptide, the WT-1 polypeptide, the retinoblastom
  • Therapeutic polypeptides also include "antigenic polypeptides" (e.g. tumour rejection antigens the MAGE, BAGE, GAGE and DAGE families of tumour rejection antigens, see Schulz et al Proc Natl Acad Sci USA, 1991, 88, pp991-993).
  • Antigenic polypeptides also includes polypeptide antigens used in the preparation of vaccines which provide protection against infectious agents.
  • viruses such as Human Immunodeficiency Virus (HIV1 & 2); Human T Cell Leukaemia Virus (HTLV 1 & 2); Ebola virus; Human Papilloma Virus (e.g. HPV-2, HPV-5, HPV-8 HPV-16, HPV-18, HPV-31, HPV-33, HPV-52, HPV-54 and HPV-56); papovavirus; rhinovirus; poliovirus; herpesvirus; adenovirus; Epstein Barr virus; influenza virus, hepatitis B and C viruses.
  • HIV1 & 2 Human Immunodeficiency Virus
  • HTLV 1 & 2 Human T Cell Leukaemia Virus
  • Ebola virus Human Papilloma Virus
  • HPV-2, HPV-5, HPV-8 HPV-16, HPV-18, HPV-31, HPV-33, HPV-52, HPV-54 and HPV-56 papovavirus
  • rhinovirus poliovirus
  • herpesvirus
  • Antigens derived from pathogenic bacteria such as Staphylococcus aureus; Staphylococcus epidermidis; Enterococcus faecalis; Mycobacterium tuberculsis; Streptococcus group B; Streptoccocus pneumoniae; Helicobacter pylori; Neisseria gonorrhea; Streptococcus group A; Borrelia burgdorferi; Coccidiodes immitis; Histoplasma sapsulatum; Neisseria meningitidis ; Shigella flexneri; Escherichia coli; Haemophilus influenzae.
  • Antigens derived from parasites such as Trypanosoma spp, Plasmodia spp, Schiztosoma spp; and pathogenic fungi such as Candida spp.
  • subunit vaccines e.g. vaccines in which the immunogen is a purified protein
  • the development of subunit vaccines has been the focus of considerable research in recent years and requires the isolation and purification of the candidate protein from a complex mixture of proteins.
  • the emergence of new pathogens and the growth of antibiotic resistance have created a need to develop new vaccines and to identify further candidate molecules useful in the development of subunit vaccines.
  • routes via which vaccines may be administered to an animal The traditional route is by transdermal injection of the vaccine containing an immunogen that invokes an immune response.
  • An alternative route that is not typically undertaken is an orally administered vaccine.
  • the immunogen contacts the immune system via the mammalian muscosal membrane which lines the small intestine through the activity of so called M cells.
  • Peyers Patches The M cells overlie regions of the intestine referred to as Peyers Patches and function to sample antigens contained in ingested material and are therefore essential components in mediating mucosal immunity.
  • Peyers Patches have been identified as major sites for the adherence and entry of enteric pathogens such as Escherichia, Salmonella, Yersina and Shigella.
  • Plant derived vaccines have been developed which utilise plant oil bodies which have been engineered to present an antigen to which an immune response is desired, see WO01/95934. Plants that bear oilseeds such as soybean, rapeseed and sunflower store triglycerides in their seeds which provide an energy store for germination and subsequent seedling development. Further examples of this are described in WO93/20216; WO96/21029 and WO98/21115.
  • Therapeutic polypeptides can be "cytotoxic polypeptides" (e.g. pseudomonas exotoxin, ricin toxin, diptheria toxin); or polypeptides which are "cytostatic polypeptides” (e.g. p21, the retinoblastoma polypeptide, the E2F-Rb polypeptide, cyclin dependent kinase inhibitors such as P 16, pi 5, pi 8 and pi 9, the growth arrest specific homeobox (GAX) polypeptide as described in Branellec, et al, see WO97/16459 and WO96/30385.
  • cytotoxic polypeptides e.g. pseudomonas exotoxin, ricin toxin, diptheria toxin
  • cytostatic polypeptides e.g. p21, the retinoblastoma polypeptide, the E2F-Rb polypeptide, cyclin dependent kinase inhibitors such as P 16, pi 5, pi 8 and
  • therapeutic antibodies are monoclonal antibodies or at least the active binding fragments thereof.
  • Monoclonal antibodies may be humanised or chimeric antibodies.
  • a chimeric antibody is produced by recombinant methods to contain the variable region of an antibody with an invariant or constant region of a human antibody.
  • a humanised antibody is produced by recombinant methods to combine the complementarity determining regions (CDRs) of an antibody with both the constant (C) regions and the framework regions from the variable (V) regions of a human antibody.
  • oilseeds such as soybean, rapeseed and sunflower store triglycerides in their seeds which provide an energy store for germination and subsequent seedling development.
  • the oil globules form oil bodies in the cytoplasm that are membrane bound organelles that have protein in the membrane surface.
  • An example of a protein found associated with oil bodies is an oleosm.
  • the oleosins are a family of proteins that have divergent amino and carboxyl termini with more highly conserved lipophilic central domain. These proteins are strongly associated with oil bodies and it is thought that the association is due in part to the lipophilic domain.
  • OBL1 and OBL 2 from castor bean that have different lipid specificities.
  • the sequence of OBL 1 and OBL 2 has revealed a lipophilic domain that lacks any homology to that found in the oleosin described in WO93/21320 and represents an alternative means to localise polypeptides which comprise this sequence to oil bodies and thereby facilitate their purification.
  • nucleic acid molecule that encodes a chimeric polypeptide wherein said nucleic acid molecule comprises a first part selected from the group consisting of: i) a nucleic acid molecule consisting of a nucleic acid sequence which encodes an amino acid sequence as represented in Figure 3; ii) a nucleic acid molecule which hybridises to the nucleic acid molecule in (i) and which specifically targets said polypeptide to an oil body; and iii) a nucleic acid molecule which differs from the nucleic acid molecules of (i) and (ii) due to the degeneracy in the genetic code; and a second part that encodes an heterologous polypeptide.
  • heterologous polypeptide is a polypeptide to which a targeting domain as herein disclosed has been fused which would not naturally include such a targeting domain.
  • said targeting domain is encoded by a nucleic acid molecule comprising a nucleic acid sequence which hybridises under stringent hybridisation conditions to a nucleic acid molecule which encodes an amino acid sequence as represented in Figure 3.
  • nucleic acid hybrids that are stable after washing in O.lx SSC, 0.1% SDS at 60°C. It is well known in the art that optimal hybridisation conditions can be calculated if the sequence of the nucleic acid is known.
  • hybridisation conditions uses 4 - 6 x SSPE (20x SSPE contains 175.3g NaCl, 88.2g NaH 2 PO 4 H 2 O and 7.4g EDTA dissolved to 1 litre and the pH adjusted to 7.4); 5-10x Denhardts solution (50x Denhardts solution contains 5g Ficoll (type 400, Pharmacia), 5g polyvinylpyrrolidone and 5g bovine serum albumen; lOO ⁇ g-l.Omg ml sonicated salmon/herring DNA; 0.1- 1.0% sodium dodecyl sulphate; optionally 40-60% deionised formamide.
  • Hybridisation temperature will vary depending on the GC content of the nucleic acid target sequence but will typically be between 42°- 65° C.
  • said nucleic acid molecule comprises a nucleic acid sequence that has at least 30% homology to the nucleic acid sequence which encodes an amino acid sequence as represented in Figure 3.
  • said homology is at least 40%; 50%; 60%; 70%; 80%; 90%; or at least 99% identity with the nucleic acid sequence that encodes the amino acid sequence as represented in Figure 3.
  • a chimeric polypeptide encoded by a nucleic acid molecule according to the invention.
  • Recombinant protein production relates to the synthesis of protein in expression systems (e.g. bacterial expression systems, yeast expression systems, insect expression systems, mammalian expression systems and plant expression systems).
  • Heterologous polypeptides include commercially important polypeptides, for example enzymes used in biocatalysis (e.g. restriction enzymes, enzymes used in industrial processing; e.g. amylases, proteases, nucleases, lipases) and therapeutic polypeptides as herein disclosed.
  • the large scale production of recombinant proteins requires a high standard of quality control since many of these proteins are used as pharmaceuticals, for example, interleukins, growth hormone, erythropoietin, interferon.
  • vaccines particularly subunit vaccines, (vaccines based on a defined antigen, for example gpl20 of HIV)
  • a defined antigen for example gpl20 of HIV
  • said second part encodes an enzyme
  • said second part encodes a pharmaceutical polypeptide.
  • said second part encodes an antigenic polypeptide.
  • said second part encodes an antigenic polypeptide associated with an infectious disease condition.
  • said second part encodes a polypeptide associated with a viral disease condition.
  • said viral disease is the result of an infection caused by a virus selected from the families consisting of: Picornaviridae;
  • Caliciviridae Caliciviridae; Togaviridae; Flaviridae; Coronaviridae; Rhabdoviridae; Filoviridae;
  • Paramyxoviridae Orthomyxoviridae; Bunyaviridae; Arenaviridae; Reoviridae; Bimavirdae; Retroviridae; Hepadnaviridae; Parvoviridae; Papoviridae; Adenoviridae;
  • Herpesviridae and Poxviridae.
  • said virus is selected from the families consisting of: Adenoviridae; Coronaviridae; Herpesviridae; Orthomyxoviridae; Picornaviridae; Poxviridae; Reoviridae; and Retroviridae.
  • said second part encodes a polypeptide associated with a bacterial disease condition.
  • said bacterial disease is caused by a bacterial pathogen selected from the group consisting of: Actinomyces; Aeromonas; Bacillus; Bacteroides; Bordetella; Brucella; Compylobacter; Capnocyclophaga; Chlamydia; Clostridium; Corynebacterium; Eikenella; Erysipelothrix; Escherichia; Fusobacterium; Hemophilus; Klebseilla; Legionella; Leptospira; Listeria; Mycobacterium; Mycoplasma; Neisseria; Nocardia; Pasteurella; Proteus; Pseudomonas; Rickettsia; Salmonella; Selenomonas; Shigelia; Staphylococcus; Streptococcus; Treponema; Vibro and Yersnia.
  • a bacterial pathogen selected from the group consisting of: Actinomyces; Aeromonas; Bacillus; Bac
  • said second part encodes a polypeptide associated with a fungal disease condition.
  • said fungal disease condition is caused by a fungal pathogen selected from the group consisting of: Aspercillus; Blastomyces; Candida; Cryptococcus; Histoplasma; Phycomyces; and Saccharomyces.
  • a fungal pathogen selected from the group consisting of: Aspercillus; Blastomyces; Candida; Cryptococcus; Histoplasma; Phycomyces; and Saccharomyces.
  • said second part encodes a polypeptide associated with a parasitic disease condition.
  • said parasitic disease condition is caused by a parasitic pathogen selected from the group consisting of: Trypanosoma; Leishmania; Schistosoma; Plasmodium; Entamoeba; Eimeria; and Trichomonas.
  • a parasitic pathogen selected from the group consisting of: Trypanosoma; Leishmania; Schistosoma; Plasmodium; Entamoeba; Eimeria; and Trichomonas.
  • said parasitic pathogen is selected from the group consisting of: Entamoeba; Eimeria; and Trichomonas.
  • said second part encodes a polypeptide associated with cancer.
  • said polypeptide is a tumour rejection antigen polypeptide.
  • said second part encodes a polypeptide that is an adhesive polypeptide, preferably said adhesive polypeptide is a mussel adhesive polypeptide.
  • Mussel-derived adhesives possess a variety of characteristics that make them of particular interest to material scientists.
  • the adhesives display a high bonding strength, versatility to a variety of bonding surfaces, resistance and durability, non- toxicity, waterproof and cure without high temperatures properties, characteristics which give these adhesives a range of potential uses and which no man-made adhesive on the market currently possesses. These properties have led to predictions of wide-ranging applications in engineering and medicine. For example a requirement for a natural, strong and waterproof adhesive can be found in the marine industry, where it can be used in ship-building. Medical uses, where such an adhesive, with its further advantages of being neither cytotoxic nor antigenic could be used in orthopaedic and dental applications.
  • nucleic acid molecule which encodes a polypeptide comprising the amino acid motif
  • said adhesive polypeptide comprises the amino acid sequence motif: A-K-P-S-Y-P-P-T-Y-K.
  • said adhesive polypeptide comprises of at least one motif which has the amino acid sequence A-K- P-S-Y-P-P-T-Y-K.
  • said adhesive polypeptide consists of a plurality of said motifs linked in tandem.
  • said adhesive polypeptide comprises at least 10 motifs, preferably greater than 10 motifs. More preferably still said adhesive polypeptide comprises at least 50 motifs, preferably between 50 and 100 motifs, more preferably between 60-85 motifs.
  • said hydroxylation is to an amino acid at position 6 and/or 7 and/or 9 of at least one motif of the amino acid sequence, X-K-X-X-Y-P-X-X-Y-K. More preferably still said hydroxylation is to amino acid position 6 and/or 7 and or 9 of at least one motif of the amino acid sequence, A-K-P- S-Y-P-P-T-Y-K.
  • said adhesive polypeptide further comprises a second motif wherein said second motif is represented by the amino acid sequence: X P X P X X X X Y K.
  • said second motif is selected from the group consisting of at least one of P P K P T P P T Y K; S P P P P V Y K Y K; or S P P P P T P V Y K, or combinations thereof.
  • nucleic acid molecule encodes a secretion signal, for example an expansin secretion signal.
  • said secretion signal comprises the amino acid sequence: MAFSYSPFSSLFLLPFFFVFTFADYGGWQSGHATFYGGGDASGT, or functional variant thereof.
  • said adhesive polypeptide comprises an endoplasmic reticulum targeting sequence, preferably said sequence is represented by the amino acid sequence HDEL, or functional variant thereof.
  • said second part encodes a spider polypeptide, preferably a spider silk polypeptide.
  • said spider silk polypeptide is encoded by a nucleic acid molecule comprising a nucleic acid sequence selected from the group consisting of: i) a nucleic acid sequence consisting of the sequence as represented in
  • Figure 4a or 4b ii) a nucleic acid sequence which encodes a polypeptide domain comprising the amino acid sequence AGRGQGGYGQGAGG and at least two motifs rich in polyalanine wherein said polyalanine motifs comprise at least 6 alanine amino acid residues; iii) a nucleic acid sequence which hybridises to the sequence presented in
  • said domain comprises a motif comprising a repetitive tripeptide sequence, GGX (glycine:glycine: X is any amino acid residue), and at least one SS(serine: serine)-polyalanine sequence.
  • said second part encodes a polypeptide with glucosylfransferase activity.
  • GTases are enzymes that post-translationally transfer glucosyl residues from an activated nucleotide sugar to monomeric and polymeric acceptor molecules such as other sugars, proteins, lipids and other organic substrates. These glucosylated molecules take part in diverse metabolic pathways and processes. The transfer of a glucosyl moiety can alter the acceptor's bioactivity, solubility and transport properties within the cell and throughout the plant.
  • One family of GTases in higher plants is defined by the presence of a C-terminal consensus sequence.
  • the GTases of this family function in the cytosol of plant cells and catalyse the transfer of glucose to small molecular weight substrates, such as phenylpropanoid derivatives, coumarins, flavonoids, other secondary metabolites and molecules known to act as plant hormones.
  • Other GTases transfers glucosyl residues to peptides and polypeptides.
  • said glucosylfransferase is encoded by a nucleic acid molecule selected from the group consisting of: i) a nucleic acid molecule selected from the group consisting of nucleic acid sequences as represented in Table 1; ii) a nucleic acid molecule which hybridise to the sequences represented in (ii) above; and which have glucosylfransferase activity; and iii) nucleic acid molecules consisting of nucleic acid sequences which are degenerate as a result of the genetic code to the sequences defined in (i) and (ii) above.
  • nucleic acid molecules hybridise under stringent hybridisation conditions to the nucleic acid molecules as represented in Table 1.
  • the GTase sequences disclosed in Table 1 are cDNA sequences to which the oil body targeting domain can be fused typically as an in-frame translational fusion.
  • said nucleic acid encodes a cleavage site to facilitate the purification of said second part from said first part.
  • said cleavage site is a proteolytic cleavage site.
  • a vector comprising a nucleic acid molecule that encodes a chimeric polypeptide according to the invention.
  • the nucleic acid in the vector is operably linked to an appropriate promoter or other regulatory elements for transcription in a host cell such as a prokaryotic, (e.g. bacterial), or eukaryotic (e.g. fungal, plant, mammalian or insect cell).
  • a host cell such as a prokaryotic, (e.g. bacterial), or eukaryotic (e.g. fungal, plant, mammalian or insect cell).
  • the vector may be a bi-functional expression vector which functions in multiple hosts.
  • this may contain its native promoter or other regulatory elements and in the case of cDNA this may be under the control of an appropriate promoter or other regulatory elements for expression in the host cell.
  • promoter is meant a nucleotide sequence upstream from the transcriptional initiation site and which contains all the regulatory regions required for transcription.
  • Suitable promoters include constitutive, tissue-specific, inducible, developmental or other promoters for expression in plant cells comprised in plants depending on design.
  • Such promoters include viral, fungal, bacterial, animal and plant-derived promoters capable of functioning in plant cells.
  • Constitutive promoters include, for example CaMV 35S promoter (Odell et al (1985) Nature 313, 9810-812); rice actin (McElroy et al (1990) Plant Cell 2: 163-171); ubiquitin (Christian et al . (1989) Plant Mol. Biol. 18 (675-689); pEMU (Last et al (1991) Theor Appl. Genet. 81: 581-588); MAS (Velten et al (1984) EMBO J. 3. 2723-2730); ALS promoter (U.S. Application Seriel No. 08/409,297), and the like.
  • Other constitutive promoters include those in U.S. Patent Nos. 5,608,149; 5,608,144; 5,604,121; 5,569,597; 5,466,785; 5,399,680, 5,268,463; and 5,608,142.
  • Chemical-regulated promoters can be used to modulate the expression of a gene in a plant through the application of an exogenous chemical regulator.
  • the promoter may be a chemical-inducible promoter, where application of the chemical induced gene expression, or a chemical-repressible promoter, where application of the chemical represses gene expression.
  • Chemical-inducible promoters are known in the art and include, but are not limited to, the maize In2-2 promoter, which is activated by benzenesulfonamide herbicide safeners, the maize GST promoter, which is activated by hydrophobic electrophilic compounds that are used as pre-emergent herbicides, and the tobacco PR-la promoter, which is activated by salicylic acid.
  • promoters of interest include steroid- responsive promoters (see, for example, the glucocorticoid-inducible promoter in Schena et al (1991) Proc. Natl. Acad. Sci. USA 88: 10421-10425 and McNellie et al. (1998) Plant J. 14(2): 247-257) and tetracycline-inducible and tetracycline- repressible promoters (see, for example, Gatz et al. (1991) Mol. Gen. Genet. 227: 229-237, and US Patent Nos. 5,814,618 and 5,789,156, herein incorporated by reference.
  • tissue-specific promoters can be utilised.
  • Tissue-specific promoters include those described by Yamamoto et al. (1997) Plant J. 12(2): 255-265; Kawamata et al (1997) Plant Cell Physiol. 38(7): 792-803; Hansen et al (1997) Mol. Gen. Genet. 254(3): 337-343; Russell et al. (1997) Transgenic Res. 6(2): 157-168; Rinehart et al (1996) Plant Physiol. 112(3): 1331-1341; Van Camp et al (1996) Plant Physiol. 112(2): 525-535; Canevascni et al (1996) Plant Physiol.
  • tissue specific promoter is a promoter which is active during the accumulation of oil in developing oil seeds, (for example see Broun et al. (1998) Plant J. 13(2): 201-210.
  • operably linked means joined as part of the same nucleic acid molecule, suitably positioned and oriented for transcription to be initiated from the promoter.
  • DNA operably linked to a promoter is "under transcriptional initiation regulation" of the promoter, hi a preferred embodiment the promoter is an inducible promoter or a developmentally regulated promoter.
  • vectors are nucleic acid constructs which operate as plant vectors. Specific procedures and vectors previously used with wide success upon plants are described by Guerineau and Mullineaux (1993) (Plant transformation and expression vectors. In: Plant Molecular Biology Labfax (Croy RRD ed) Oxford, BIOS Scientific Publishers, pp 121-148. Suitable vectors may include plant viral-derived vectors (see e.g. EP-A-194809).
  • Vectors may also include selectable genetic marker such as those that confer selectable phenotypes such as resistance to herbicides (e.g. kanamycin, hygromycin, phosphinotricin, chlorsulfuron, methotrexate, gentamycin, spectinomycin, imidazolinones and glyphosate).
  • selectable genetic marker such as those that confer selectable phenotypes such as resistance to herbicides (e.g. kanamycin, hygromycin, phosphinotricin, chlorsulfuron, methotrexate, gentamycin, spectinomycin, imidazolinones and glyphosate).
  • said vectors are vectors suitable for mammalian cell transfection or yeast cell transfection.
  • multi-copy vectors such as 2 ⁇ episomal vectors are preferred.
  • yeast CEN vectors and integrating vectors such as YIP vectors are suitable for transformation of yeast species such as Saccharomyces cerevisiae and Pichia spp.
  • a cell transfected or transformed with at least one nucleic acid molecule or vector according to the invention is provided.
  • said cell is eukaryotic cell.
  • said cell is a plant cell.
  • a plant comprising a cell according to the invention.
  • said plant is selected from: corn (Zea mays), canola (Brassica napus, Brassica rapa ssp.), flax (Linum usitatissimum), alfalfa (Medicago sativd), rice (Oryza sativa), rye (Secale cerale), sorghum (Sorghum bicolor, Sorghum vulgare), sunflower (Helianthus annus), wheat (Tritium aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis hypogaed), cotton (Gossypium hirsutum), sweet potato (Iopmoea batatus), cassava (Manihot esculenta), coffee (Cofea spp.), coconut (Cocos nucifer ⁇ ), pineapple (Anana comosus), citris tree (Cirrus' spp.)
  • plants of the present invention are crop plants (for example, cereals and pulses, maize, wheat, potatoes, tapioca, rice, sorghum, millet, cassava, barley, pea), and other root, tuber or seed crops.
  • Important seed crops are oil-seed rape, sugar beet, maize, sunflower, soybean,sorghum, and flax (linseed).
  • Horticultural plants to which the present invention may be applied may include lettuce, endive, and vegetable brassicas including cabbage, broccoli, and cauliflower.
  • the present invention may be applied in tobacco, cucurbits, carrot, strawberry, sunflower, tomato, pepper.
  • Grain plants that provide seeds of interest include oil-seed plants and leguminous plants. Seeds of interest include grain seeds, such as corn, wheat, barley, rice, sorghum, rye, etc.
  • said plant is an oil seed plant.
  • Oil seed plants include cotton, soybean, safflower, sunflower, Brassica, maize, alfalfa, palm, coconut, etc.
  • Leguminous plants include beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mungbean, lima bean, fava been, lentils, chickpea, etc.
  • said oil seed plant is safflower.
  • said cell is a prokaryotic cell.
  • a seed comprising a cell according to the invention.
  • a vaccine preparation comprising a cell, or a fraction of a cell, according to the invention.
  • said preparation is an oil body containing fraction.
  • said fraction includes a chimeric polypeptide according to the invention associated with oil bodies in said fraction.
  • said preparation includes at least one carrier.
  • a carrier is an immunogenic molecule which, when bound to a second molecule augments immune responses to the latter.
  • Some antigens are not intrinsically immunogenic (i.e. not immunogenic in their own right) yet may be capable of generating antibody responses when associated with a foreign protein molecule such as keyhole-limpet haemocyanin or tetanus toxoid.
  • Certain antigens which lack T-cell epitopes, such as polymers with a repeating B-cell epitope are intrinsically immunogenic to a limited extent. These are known as T-independent antigens.
  • Such antigens benefit from association with a carrier such as tetanus toxoid, under which circumstance they elicit much stronger antibody responses.
  • said preparation includes at least one adjuvant.
  • An adjuvant is a substance or procedure which augments specific immune responses to antigens by modulating the activity of immune cells. Examples of adjuvant include, by example only, Freund's adjuvant, moray deputies, liposome's. Adjuvant are distinct from carriers.
  • the vaccine preparations may conveniently be presented in unit dosage form and may be prepared by any of the methods well-known in the art of pharmacy. In general, the vaccine preparations are prepared by uniformly and intimately bringing the active polypeptide into association with a liquid stabiliser, a finely divided solid stabiliser, or both, and then, if necessary, shaping the product.
  • Solid dosage forms for oral administration include capsules, tablets, pills, powders and granules.
  • the active polypeptide is typically mixed with at least one inert, pharmaceutically acceptable recipient such as sodium citrate or declaim phosphate and/or one or more: a) fillers or extenders such as starches, lactose, sucrose, glucose, manifold and silica acid; b) binders such as carboxymethylcellulose, alginates, gelatin, polyvinylpyrrolidone, sucrose and acacia; c) humectants such as glycerol; d) disintegrating agents such as agar-agar, calcium carbonate, potato or tapioca starch, alginic acid, certain silicates and sodium carbonate; e) solution retarding agents such as paraffin; f) absorption accelerators such as quaternary ammonium compounds; g) wetting agents such as cetyl alcohol and glycerol monostearate; h) absorbents such as kaolin and bentonite clay and i) lubricants such as
  • Solid compositions of a similar type may also be employed as fillers in soft and hard-filled gelatin capsules using such excipients as lactose or milk sugar as well as high molecular weight polyethylene glycol, for example.
  • the solid dosage forms of tablets, dragees, capsules, pills, and granules can be prepared with coatings and shells such as enteric coatings and other coatings well known in the pharmaceutical formulating art. They may optionally contain opacifying agents and may also be of a composition such that they release the active ingredient(s) only, or preferentially, in a certain part of the intestinal tract, and/or in delayed fashion.
  • Examples of embedding compositions that can be used include polymeric substances and waxes.
  • a method to immunise an animal to at least the second part of the chimeric polypeptide according to the invention comprising administering an effective amount of a chimeric polypeptide or preparation according to the invention sufficient to stimulate an immune response to said second part.
  • said animal is human.
  • said animal is selected from the group consisting of: mouse; rat; hamster; goat; cow, horse, pig, dog, cat, or sheep.
  • said immune response is the production of antibodies to said chimeric polypeptide.
  • said immune response is the production of T-helper cells which recognise the second part of said chimeric polypeptide.
  • a preferred route of administration is oral.
  • intradermal, subcutaneous, intramuscular or intranasal immunisation is within the scope of the invention.
  • an antibody obtainable by the method according to the invention.
  • said antibody is a therapeutic antibody.
  • said antibody is a diagnostic antibody.
  • said diagnostic antibody is provided with a label or tag.
  • said antibody is a monoclonal antibody or binding fragment thereof.
  • said antibody is a humanised or chimeric antibody.
  • a chimeric antibody is produced by recombinant methods to contain the variable region of an antibody with an invariant or constant region of a human antibody.
  • a humanised antibody is produced by recombinant methods to combine the complementarity determining regions (CDRs) of an antibody with both the constant (C) regions and the framework regions from the variable (V) regions of a human antibody.
  • Chimeric antibodies are recombinant antibodies in which all of the V-regions of a mouse or rat antibody are combined with human antibody C-regions.
  • Humanised antibodies are recombinant hybrid antibodies which fuse the comphmentarity determining regions from a rodent antibody V-region with the framework regions from the human antibody V-regions. The C-regions from the human antibody are also used.
  • the complimentarity determining regions (CDRs) are the regions within the N- terminal domain of both the heavy and light chain of the antibody to where the majority of the variation of the V-region is restricted. These regions form loops at the surface of the antibody molecule. These loops provide the binding surface between the antibody and antigen.
  • Antibodies from non-human animals provoke an immune response to the foreign antibody and its removal from the circulation.
  • Both chimeric and humanised antibodies have reduced antigenicity when injected to a human subject because there is a reduced amount of rodent (i.e. foreign) antibody within the recombinant hybrid antibody, while the human antibody regions do not elicit an immune response. This results in a weaker immune response and a decrease in the clearance of the antibody. This is clearly desirable when using therapeutic antibodies in the treatment of human diseases.
  • Humanised antibodies are designed to have less "foreign" antibody regions and are therefore thought to be less immunogenic than chimeric antibodies.
  • scFv's single chain antibody variable region fragments
  • scFv's single chain antibody variable region fragments
  • a hybridoma exists for a specific monoclonal antibody it is well within the knowledge of the skilled person to isolate scFv's from mRNA extracted from said hybridoma via RT PCR.
  • phage display screening can be undertaken to identify clones expressing scFv's.
  • said fragments are "domain antibody fragments". Domain antibodies are the smallest binding part of an antibody (approximately 13kDa). Examples of this technology is disclosed in US6, 248, 516, US6, 291, 158, US6,127, 197 and EP0368684 which are all incorporated by reference in their entirety.
  • said antibodies are opsonic antibodies.
  • Phagocytosis is mediated by macrophages and polymorphic leukocytes and involves the ingestion and digestion of micro-organisms, damaged or dead cells, cell debris, insoluble particles and activated clotting factors.
  • Opsonins are agents which facilitate the phagocytosis of the above foreign bodies.
  • Opsonic antibodies are therefore antibodies which provide the same function.
  • Examples of opsonins are the Fc portion of an antibody or compliment C3. Antibodies raised by immunisation and in the form of an immune complex with antigen may bring about opsonisation via the fixation of complement on the antigen, or molecules in its immediate microenvironment.
  • a method for preparing a hybridoma cell-line producing monoclonal antibodies comprising the steps of: i) immunising an immunocompetent mammal with a chimeric polypeptide or preparation according to the invention; ii) fusing lymphocytes of the immunised immunocompetent mammal with myeloma cells to form hybridoma cells; iii) screening monoclonal antibodies produced by the hybridoma cells of step (ii) for binding activity to polypeptide of the invention; iv) culturing the hybridoma cells to proliferate and/or to secrete said monoclonal antibody; and v) recovering the monoclonal antibody from the culture supernatant.
  • said immunocompetent mammal is a rodent, for example a mouse, rat or hamster.
  • hybridoma cell-line obtainable by the method according to the invention.
  • a method to prepare a vaccine preparation according to the invention comprising; i) forming a cell extract from a plant cell culture or plant according to the invention; and ii) separating said extract into an oil body rich fraction.
  • said extract is incubated with a protease which cleaves said second part from said first part.
  • Figure la is the nucleic acid sequence of OLB1
  • Figure lb is the amino acid sequence ofOLBl
  • Figure 2a is the nucleic acid sequence of OLB 2;
  • Figure 2b is the amino acid sequence of OLB2;
  • Figure 3 is the amino acid sequence of the hydrophobic domain of OLB 1;
  • Figure 4a and 4b is the nucleic acid sequence of a spider silk protein
  • Table 1 shows the nucleic acid sequences of plant GTase cDNA's.
  • Oil bodies were isolated from the endosperm of imbibed castor beans and the membranes delipidated using the methods of Hills et al., (2). Peripheral proteins were removed from the oil bodies using urea buffer, according to Millichip et al, (3). The membrane fraction was solublized in SDS-loading buffer, heated at 70°C for 10 min and the polypeptides separated on a 14% SDS-PAGE gel as described by Laemmli (5) and stained using 0.2% (w/v) Coomassie blue R-250 in methanol: acetic acid: water (4:1:5 v/v/v).
  • Sequencing-grade modified porcine trypsin (Promega) was dissolved in the 50 mM acetic acid supplied by the manufacturer, then diluted 5-fold by adding 25 mM ammonium bicarbonate containing 0.1% (v.v) octyl- ⁇ -D-glucopyranoside (Sigma) to give a final trypsin concentration is 0.02 ⁇ g/ ⁇ L. Gel pieces were rehydrated by adding 10 ⁇ L of trypsin solution, and after 30 min enough trypsin solution was added to cover the gel. Digests were incubated overnight at 37°C.
  • Tryptic digests were applied directly to the MALDI target plate. Dried peptides were redissolved in 0.5 ⁇ L of formic acid followed by addition of 9.5 ⁇ L of water, and one or more 0.5 ⁇ L aliquots were spotted and dried onto the MALDI target plate. A matrix solution containing 10 mg/mL 4-hydroxy-c-cyano-cinnamic acid (Sigma) in 50% aqueous (v:v) acetonitrile containing 0.1%) TFA (v:v) was freshly prepared. Serial 2-fold dilutions of matrix solution were spotted onto the MALDI target plate, and the dilution that gave a uniform layer of crystals when viewed in the mass spectrometer was selected for use with samples. A 0.5 ⁇ L aliquot of matrix solution was pipetted onto the dried peptide spot. In some cases, the matrix was added directly to the sample droplet on the target plate before drying.
  • MALDI mass spectra were obtained using an Applied Biosystems 4700 Proteomics Analyzer (CTS version, Applied Biosystems, Foster City, CA, USA) in reflection mode with an accelerating voltage of 20 kV.
  • MS spectra were acquired with a total of 1000 laser pulses over a mass range of m z 800-4000.
  • Final mass spectra were the summation of 20 sub-spectra, each acquired with 50 laser pulses, and internally calibrated using the tryptic peptides at m/z 842.509 and 2211.104.
  • Monoisotopic masses were obtained from centroids of raw, unsmoothed data.
  • a Source 1 accelerating voltage of 8 kV, a collision energy of 1 kV, and a Source 2 accelerating voltage of 15 kV were used. Air was used as the collision gas at the instrument's 'medium' pressure setting with a recharge threshold of 9.9xl0 "7 torr, which produced a Source 2 pressure of about lxlO "6 torr.
  • the precursor mass window was set to +/-10 "Da", and the metastable suppressor was enabled. The default calibration was used for MS/MS spectra.
  • Mass spectral data obtained in batch mode were submitted to database searching using a locally-running copy of the Mascot program (Matrix Science Ltd., versionl.7).
  • Batch-acquired MS and MS/MS spectral data were submitted to a combined peptide mass fingerprint and MS/MS ion search through the Applied Biosystems GPS Explorer software interface (version 1.0) to Mascot.
  • Peptide sequence tags were generated from CID-MS/MS spectra by manual interpretation or using a de novo sequencing program supplied by Applied Biosystems.
  • RNA extraction, cDNA synthesis and PCR - Total RNA from various tissues was isolated using the RNeasy kit from Qiagen.
  • the synthesis of single stranded cDNA was carried out using SuperscriptTM II RNase H " reverse transcriptase from Invitrogen.
  • Degenerate primers corresponding to peptide sequences 1 and 2 were designed (5'-ttgatagtirtyagyttyaga and 5'-ctgtccraatgtrtaiarctt) and used to amplify a fragment of the acid lipase (RcOBLl) cDNA from imbibed seed endosperm.
  • Gene specific primers (5'-gaccacttggtatgggcatatgatgg and 5'-catgtcattgcagtaaaccaccctga) were then used to obtain the full-length cDNA sequence by 3'- and 5 '-RACE using the SMARTTM RACE cDNA Amplification kit from BD Biosciences, following the manufacturers protocols.
  • Primers to a castor actin-like gene (RcACT) (5'- cgttctctccttgtatgccagtggtc and 5'-gagctgctcttggcagtctcaagttc) were used as a constitutive control for RT-PCR experiments on various tissues.
  • a partial clone of RcOBL2 was obtained by performing PCR on cDNA from four-day old endosperm using different combinations of four pairs of degenerate primers designed using alignments of multiple OBL-like genes (5'- rtyagyttyagaggiachgarc or 5'-rtyagyttyagaggiachgayc or 5'-rtyagyttyagaggcachgarc or 5'- rtyagyttyagaggcachgayc with 5'-giaccatrtcrttrtrtabac or 5'-giaccatrtcrttrcartabac or 5'-giaciayrtcrtttrtabac or 5'-giaciayrtcrttrcartabac).
  • Gene specific primers (5'- taggtctgggcaacagaagtgacgctac and 5'-tgcccaaatgtgtatatgttcagcaacc) were then used to obtain the full-length RcOBLl cDNA sequence by 3'- and 5 '-RACE using the SMARTTM RACE cDNA Amplification kit from BD Biosciences, following the manufacturers protocols.
  • TAG triacylglycerol
  • CHAPS 3-cholamidopropyl- dimethylammonio-1-propanesulphonate
  • PEP propyl endopeptidase
  • TFA trifluoroacetic acid
  • MALDI matrix-assisted laser desorption/ionization
  • MS mass spectrometry
  • CE collision-induced dissociation
  • PCR polymerase chain reaction
  • RT-PCR reverse transcriptase-PCR
  • RACE-PCR rapid amplification if cDNA ends- PCR
  • PAGE polyacrylamide gel electrophoresis
  • MBP maltose binding protein.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Biomedical Technology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Molecular Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • Medicinal Chemistry (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Physics & Mathematics (AREA)
  • Immunology (AREA)
  • Cell Biology (AREA)
  • Public Health (AREA)
  • Animal Behavior & Ethology (AREA)
  • Epidemiology (AREA)
  • Veterinary Medicine (AREA)
  • Mycology (AREA)
  • Insects & Arthropods (AREA)
  • Tropical Medicine & Parasitology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Toxicology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Developmental Biology & Embryology (AREA)
  • Pregnancy & Childbirth (AREA)
  • Reproductive Health (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Peptides Or Proteins (AREA)

Abstract

The invention relates to the manufacture of recombinant proteins; vectors for use in the recombinant manufacture of protein and recombinant proteins that are adapted for purification.

Description

Expression of Heterologous Protein
The invention relates to the manufacture of recombinant proteins; vectors for use in the recombinant manufacture of protein and recombinant proteins that are adapted for purification.
The large scale production of recombinant proteins requires a high standard of quality control since many of these proteins are used as pharmaceuticals, for example: growth hormone; leptin; erythropoietin; prolactin, interleukins; interferon's. Moreover, the development of vaccines, particularly subunit vaccines, (vaccines based on a defined antigen, for example gpl20 of HIV), requires the production of large amounts of pure protein free from contaminating antigens which may provoke anaphylaxis. The production of recombinant protein in cell expression systems is based either on prokaryotic cell expression or eukaryotic cell expression. The latter is preferred when post-translation modifications to the protein are required. Eukaryotic systems include the use of mammalian cells, e.g. Chinese Hamster Ovary cells; insect cells e.g. Spodoptera spp; fungal/yeast (e.g. Saccharomyces spp, Pichia spp, Aspergillus spp) and plants.
The production of heterologous recombinant proteins in an expression system relates to a variety of proteins used in both industrial and medical applications.
For example, "therapeutic polypeptides" include tumour suppressor polypeptides (e.g. p53 polypeptide, the APC polypeptide, the DPC-4 polypeptide, the BRCA-1 polypeptide, the BRCA-2 polypeptide, the WT-1 polypeptide, the retinoblastoma polypeptide (Lee, et al. (1987) Nature 329:642), the MMAC-1 polypeptide, the adenomatous polyposis coli protein (United States Patent 5,783,666), the deleted in colon carcinoma (DCC) polypeptide, the MMSC-2 polypeptide, the NF-1 polypeptide, nasopharyngeal carcinoma tumour suppressor polypeptide (Cheng, et al. 1998. Proc. Nat. Acad. Sci. 95:3042-3047), the MTSl polypeptide, the CD 4 polypeptide, the NF-1 polypeptide, the NF2 polypeptide, and the VHL polypeptide. Therapeutic polypeptides also include "antigenic polypeptides" (e.g. tumour rejection antigens the MAGE, BAGE, GAGE and DAGE families of tumour rejection antigens, see Schulz et al Proc Natl Acad Sci USA, 1991, 88, pp991-993). Antigenic polypeptides also includes polypeptide antigens used in the preparation of vaccines which provide protection against infectious agents. For example, viruses such as Human Immunodeficiency Virus (HIV1 & 2); Human T Cell Leukaemia Virus (HTLV 1 & 2); Ebola virus; Human Papilloma Virus (e.g. HPV-2, HPV-5, HPV-8 HPV-16, HPV-18, HPV-31, HPV-33, HPV-52, HPV-54 and HPV-56); papovavirus; rhinovirus; poliovirus; herpesvirus; adenovirus; Epstein Barr virus; influenza virus, hepatitis B and C viruses. Antigens derived from pathogenic bacteria such as Staphylococcus aureus; Staphylococcus epidermidis; Enterococcus faecalis; Mycobacterium tuberculsis; Streptococcus group B; Streptoccocus pneumoniae; Helicobacter pylori; Neisseria gonorrhea; Streptococcus group A; Borrelia burgdorferi; Coccidiodes immitis; Histoplasma sapsulatum; Neisseria meningitidis ; Shigella flexneri; Escherichia coli; Haemophilus influenzae. Antigens derived from parasites such as Trypanosoma spp, Plasmodia spp, Schiztosoma spp; and pathogenic fungi such as Candida spp.
The development of subunit vaccines (e.g. vaccines in which the immunogen is a purified protein) has been the focus of considerable research in recent years and requires the isolation and purification of the candidate protein from a complex mixture of proteins. The emergence of new pathogens and the growth of antibiotic resistance have created a need to develop new vaccines and to identify further candidate molecules useful in the development of subunit vaccines. There are a number of routes via which vaccines may be administered to an animal. The traditional route is by transdermal injection of the vaccine containing an immunogen that invokes an immune response. An alternative route that is not typically undertaken is an orally administered vaccine. The immunogen contacts the immune system via the mammalian muscosal membrane which lines the small intestine through the activity of so called M cells. The M cells overlie regions of the intestine referred to as Peyers Patches and function to sample antigens contained in ingested material and are therefore essential components in mediating mucosal immunity. Peyers Patches have been identified as major sites for the adherence and entry of enteric pathogens such as Escherichia, Salmonella, Yersina and Shigella.
Plant derived vaccines have been developed which utilise plant oil bodies which have been engineered to present an antigen to which an immune response is desired, see WO01/95934. Plants that bear oilseeds such as soybean, rapeseed and sunflower store triglycerides in their seeds which provide an energy store for germination and subsequent seedling development. Further examples of this are described in WO93/20216; WO96/21029 and WO98/21115.
Therapeutic polypeptides can be "cytotoxic polypeptides" (e.g. pseudomonas exotoxin, ricin toxin, diptheria toxin); or polypeptides which are "cytostatic polypeptides" (e.g. p21, the retinoblastoma polypeptide, the E2F-Rb polypeptide, cyclin dependent kinase inhibitors such as P 16, pi 5, pi 8 and pi 9, the growth arrest specific homeobox (GAX) polypeptide as described in Branellec, et al, see WO97/16459 and WO96/30385.
Also included within the scope of therapeutic polypeptides are therapeutic antibodies. Preferably said antibodies are monoclonal antibodies or at least the active binding fragments thereof. Monoclonal antibodies may be humanised or chimeric antibodies. A chimeric antibody is produced by recombinant methods to contain the variable region of an antibody with an invariant or constant region of a human antibody. A humanised antibody is produced by recombinant methods to combine the complementarity determining regions (CDRs) of an antibody with both the constant (C) regions and the framework regions from the variable (V) regions of a human antibody.
Plants that bear oilseeds such as soybean, rapeseed and sunflower store triglycerides in their seeds which provide an energy store for germination and subsequent seedling development. In the seeds the oil globules form oil bodies in the cytoplasm that are membrane bound organelles that have protein in the membrane surface. An example of a protein found associated with oil bodies is an oleosm. The oleosins are a family of proteins that have divergent amino and carboxyl termini with more highly conserved lipophilic central domain. These proteins are strongly associated with oil bodies and it is thought that the association is due in part to the lipophilic domain.
We have cloned two plant lipases referred to as OBL1 and OBL 2 from castor bean that have different lipid specificities. The sequence of OBL 1 and OBL 2 has revealed a lipophilic domain that lacks any homology to that found in the oleosin described in WO93/21320 and represents an alternative means to localise polypeptides which comprise this sequence to oil bodies and thereby facilitate their purification.
According to an aspect of the invention there is provided a nucleic acid molecule that encodes a chimeric polypeptide wherein said nucleic acid molecule comprises a first part selected from the group consisting of: i) a nucleic acid molecule consisting of a nucleic acid sequence which encodes an amino acid sequence as represented in Figure 3; ii) a nucleic acid molecule which hybridises to the nucleic acid molecule in (i) and which specifically targets said polypeptide to an oil body; and iii) a nucleic acid molecule which differs from the nucleic acid molecules of (i) and (ii) due to the degeneracy in the genetic code; and a second part that encodes an heterologous polypeptide.
An heterologous polypeptide is a polypeptide to which a targeting domain as herein disclosed has been fused which would not naturally include such a targeting domain.
In a preferred embodiment of the invention said targeting domain is encoded by a nucleic acid molecule comprising a nucleic acid sequence which hybridises under stringent hybridisation conditions to a nucleic acid molecule which encodes an amino acid sequence as represented in Figure 3.
Stringent hybridisation/washing conditions are well known in the art. For example, nucleic acid hybrids that are stable after washing in O.lx SSC, 0.1% SDS at 60°C. It is well known in the art that optimal hybridisation conditions can be calculated if the sequence of the nucleic acid is known. Typically, hybridisation conditions uses 4 - 6 x SSPE (20x SSPE contains 175.3g NaCl, 88.2g NaH2PO4 H2O and 7.4g EDTA dissolved to 1 litre and the pH adjusted to 7.4); 5-10x Denhardts solution (50x Denhardts solution contains 5g Ficoll (type 400, Pharmacia), 5g polyvinylpyrrolidone and 5g bovine serum albumen; lOOμg-l.Omg ml sonicated salmon/herring DNA; 0.1- 1.0% sodium dodecyl sulphate; optionally 40-60% deionised formamide. Hybridisation temperature will vary depending on the GC content of the nucleic acid target sequence but will typically be between 42°- 65° C.
In a preferred embodiment of the invention said nucleic acid molecule comprises a nucleic acid sequence that has at least 30% homology to the nucleic acid sequence which encodes an amino acid sequence as represented in Figure 3. Preferably said homology is at least 40%; 50%; 60%; 70%; 80%; 90%; or at least 99% identity with the nucleic acid sequence that encodes the amino acid sequence as represented in Figure 3.
According to a further aspect of the invention there is provided a chimeric polypeptide encoded by a nucleic acid molecule according to the invention.
Recombinant protein production relates to the synthesis of protein in expression systems (e.g. bacterial expression systems, yeast expression systems, insect expression systems, mammalian expression systems and plant expression systems). Heterologous polypeptides include commercially important polypeptides, for example enzymes used in biocatalysis (e.g. restriction enzymes, enzymes used in industrial processing; e.g. amylases, proteases, nucleases, lipases) and therapeutic polypeptides as herein disclosed. The large scale production of recombinant proteins requires a high standard of quality control since many of these proteins are used as pharmaceuticals, for example, interleukins, growth hormone, erythropoietin, interferon. Moreover, the development of vaccines, particularly subunit vaccines, (vaccines based on a defined antigen, for example gpl20 of HIV), requires the production of large amounts of pure protein free from contaminating antigens which may provoke anaphylaxis.
In a preferred embodiment of the invention said second part encodes an enzyme.
In a further preferred embodiment of the invention said second part encodes a pharmaceutical polypeptide.
In a yet further preferred embodiment of the invention said second part encodes an antigenic polypeptide. Preferably said second part encodes an antigenic polypeptide associated with an infectious disease condition.
i a preferred embodiment of the invention said second part encodes a polypeptide associated with a viral disease condition.
hi a preferred embodiment of the invention said viral disease is the result of an infection caused by a virus selected from the families consisting of: Picornaviridae;
Caliciviridae; Togaviridae; Flaviridae; Coronaviridae; Rhabdoviridae; Filoviridae;
Paramyxoviridae; Orthomyxoviridae; Bunyaviridae; Arenaviridae; Reoviridae; Bimavirdae; Retroviridae; Hepadnaviridae; Parvoviridae; Papoviridae; Adenoviridae;
Herpesviridae; and Poxviridae.
In a preferred embodiment of the invention said virus is selected from the families consisting of: Adenoviridae; Coronaviridae; Herpesviridae; Orthomyxoviridae; Picornaviridae; Poxviridae; Reoviridae; and Retroviridae. In a further preferred embodiment of the invention said second part encodes a polypeptide associated with a bacterial disease condition.
In a preferred embodiment of the invention said bacterial disease is caused by a bacterial pathogen selected from the group consisting of: Actinomyces; Aeromonas; Bacillus; Bacteroides; Bordetella; Brucella; Compylobacter; Capnocyclophaga; Chlamydia; Clostridium; Corynebacterium; Eikenella; Erysipelothrix; Escherichia; Fusobacterium; Hemophilus; Klebseilla; Legionella; Leptospira; Listeria; Mycobacterium; Mycoplasma; Neisseria; Nocardia; Pasteurella; Proteus; Pseudomonas; Rickettsia; Salmonella; Selenomonas; Shigelia; Staphylococcus; Streptococcus; Treponema; Vibro and Yersnia.
In a further preferred embodiment of the invention said second part encodes a polypeptide associated with a fungal disease condition.
In a preferred embodiment of the invention said fungal disease condition is caused by a fungal pathogen selected from the group consisting of: Aspercillus; Blastomyces; Candida; Cryptococcus; Histoplasma; Phycomyces; and Saccharomyces.
In a further preferred embodiment of the invention said second part encodes a polypeptide associated with a parasitic disease condition.
In a preferred embodiment of the invention said parasitic disease condition is caused by a parasitic pathogen selected from the group consisting of: Trypanosoma; Leishmania; Schistosoma; Plasmodium; Entamoeba; Eimeria; and Trichomonas.
hi a preferred embodiment of the invention said parasitic pathogen is selected from the group consisting of: Entamoeba; Eimeria; and Trichomonas.
In an alternative preferred embodiment of the invention said second part encodes a polypeptide associated with cancer. In a preferred embodiment of the invention said polypeptide is a tumour rejection antigen polypeptide.
In an alternative preferred embodiment of the invention said second part encodes a polypeptide that is an adhesive polypeptide, preferably said adhesive polypeptide is a mussel adhesive polypeptide.
Mussel-derived adhesives possess a variety of characteristics that make them of particular interest to material scientists. The adhesives display a high bonding strength, versatility to a variety of bonding surfaces, resistance and durability, non- toxicity, waterproof and cure without high temperatures properties, characteristics which give these adhesives a range of potential uses and which no man-made adhesive on the market currently possesses. These properties have led to predictions of wide-ranging applications in engineering and medicine. For example a requirement for a natural, strong and waterproof adhesive can be found in the marine industry, where it can be used in ship-building. Medical uses, where such an adhesive, with its further advantages of being neither cytotoxic nor antigenic could be used in orthopaedic and dental applications.
In a preferred embodiment of the invention said mussel adhesive polypeptide is encoded by a nucleic acid molecule selected from the group consisting of:
i) a nucleic acid molecule which encodes a polypeptide comprising the amino acid motif
X-K-X-X-Y-P-X-X-Y-K wherein X is any amino acid residue; ii) a nucleic acid molecule which encodes a polypeptide wherein at least one of the residues in said motif is modified by hydroxylation; and iii) a nucleic acid molecule which encodes a polypeptide which has adhesive properties. In a preferred embodiment of the invention said adhesive polypeptide comprises the amino acid sequence motif: A-K-P-S-Y-P-P-T-Y-K. Preferably said adhesive polypeptide comprises of at least one motif which has the amino acid sequence A-K- P-S-Y-P-P-T-Y-K. Preferably said adhesive polypeptide consists of a plurality of said motifs linked in tandem.
In a preferred embodiment of the invention said adhesive polypeptide comprises at least 10 motifs, preferably greater than 10 motifs. More preferably still said adhesive polypeptide comprises at least 50 motifs, preferably between 50 and 100 motifs, more preferably between 60-85 motifs.
In a further preferred embodiment of the invention said hydroxylation is to an amino acid at position 6 and/or 7 and/or 9 of at least one motif of the amino acid sequence, X-K-X-X-Y-P-X-X-Y-K. More preferably still said hydroxylation is to amino acid position 6 and/or 7 and or 9 of at least one motif of the amino acid sequence, A-K-P- S-Y-P-P-T-Y-K.
In a further preferred embodiment of the invention said adhesive polypeptide further comprises a second motif wherein said second motif is represented by the amino acid sequence: X P X P X X X X Y K.
Preferably said second motif is selected from the group consisting of at least one of P P K P T P P T Y K; S P P P P V Y K Y K; or S P P P P T P V Y K, or combinations thereof.
In a further preferred embodiment of the invention said nucleic acid molecule encodes a secretion signal, for example an expansin secretion signal.
h a preferred embodiment of the invention said secretion signal comprises the amino acid sequence: MAFSYSPFSSLFLLPFFFVFTFADYGGWQSGHATFYGGGDASGT, or functional variant thereof.
In a yet further preferred embodiment of the invention said adhesive polypeptide comprises an endoplasmic reticulum targeting sequence, preferably said sequence is represented by the amino acid sequence HDEL, or functional variant thereof.
In a further preferred embodiment of the invention said second part encodes a spider polypeptide, preferably a spider silk polypeptide.
In our co-pending application WO04/016651, we describe a new spider silk protein. In addition to the tensile strength and elasticity of the silk, the additional characteristics of super contraction, insolubility and non-allergenicity make the use of silk in medical and industrial applications an attractive option. Medical applications include; artificial tendons/ligaments, wound-closure systems, such as vascular wound repair devices, haemostatic dressings, patches, glues and sutures. Industrial applications include; bullet-proof vests, light-weight body armour, cables, ropes and parachute cords, even fishing line. Other advantages in using spider silk rather than the synthetic alternatives such as Kevlar™ is that this fibre is derived from petrochemicals, which are processed under extreme, often polluting conditions and add to the waste accumulating in landfills.
In a preferred embodiment of the invention said spider silk polypeptide is encoded by a nucleic acid molecule comprising a nucleic acid sequence selected from the group consisting of: i) a nucleic acid sequence consisting of the sequence as represented in
Figure 4a or 4b; ii) a nucleic acid sequence which encodes a polypeptide domain comprising the amino acid sequence AGRGQGGYGQGAGG and at least two motifs rich in polyalanine wherein said polyalanine motifs comprise at least 6 alanine amino acid residues; iii) a nucleic acid sequence which hybridises to the sequence presented in
(i) and (ii) above and which encodes a silk polypeptide; and iv) a nucleic acid sequences that are degenerate as a result of the genetic code to the nucleic acid sequence defined in (i) - (iii) above.
In a preferred embodiment of the invention said domain comprises a motif comprising a repetitive tripeptide sequence, GGX (glycine:glycine: X is any amino acid residue), and at least one SS(serine: serine)-polyalanine sequence.
In a yet further preferred embodiment of the invention said second part encodes a polypeptide with glucosylfransferase activity.
Typically, GTases are enzymes that post-translationally transfer glucosyl residues from an activated nucleotide sugar to monomeric and polymeric acceptor molecules such as other sugars, proteins, lipids and other organic substrates. These glucosylated molecules take part in diverse metabolic pathways and processes. The transfer of a glucosyl moiety can alter the acceptor's bioactivity, solubility and transport properties within the cell and throughout the plant. One family of GTases in higher plants is defined by the presence of a C-terminal consensus sequence. The GTases of this family function in the cytosol of plant cells and catalyse the transfer of glucose to small molecular weight substrates, such as phenylpropanoid derivatives, coumarins, flavonoids, other secondary metabolites and molecules known to act as plant hormones. Other GTases transfers glucosyl residues to peptides and polypeptides.
In a preferred embodiment of the invention said glucosylfransferase is encoded by a nucleic acid molecule selected from the group consisting of: i) a nucleic acid molecule selected from the group consisting of nucleic acid sequences as represented in Table 1; ii) a nucleic acid molecule which hybridise to the sequences represented in (ii) above; and which have glucosylfransferase activity; and iii) nucleic acid molecules consisting of nucleic acid sequences which are degenerate as a result of the genetic code to the sequences defined in (i) and (ii) above.
I a preferred embodiment of the invention said nucleic acid molecules hybridise under stringent hybridisation conditions to the nucleic acid molecules as represented in Table 1.
The GTase sequences disclosed in Table 1 are cDNA sequences to which the oil body targeting domain can be fused typically as an in-frame translational fusion.
In a still further preferred embodiment of the invention said nucleic acid encodes a cleavage site to facilitate the purification of said second part from said first part. Preferably said cleavage site is a proteolytic cleavage site.
According to a further aspect of the invention there is provided a vector comprising a nucleic acid molecule that encodes a chimeric polypeptide according to the invention.
Preferably the nucleic acid in the vector is operably linked to an appropriate promoter or other regulatory elements for transcription in a host cell such as a prokaryotic, (e.g. bacterial), or eukaryotic (e.g. fungal, plant, mammalian or insect cell). The vector may be a bi-functional expression vector which functions in multiple hosts. In the example of nucleic acids encoding polypeptides according to the invention this may contain its native promoter or other regulatory elements and in the case of cDNA this may be under the control of an appropriate promoter or other regulatory elements for expression in the host cell.
By "promoter" is meant a nucleotide sequence upstream from the transcriptional initiation site and which contains all the regulatory regions required for transcription.
Suitable promoters include constitutive, tissue-specific, inducible, developmental or other promoters for expression in plant cells comprised in plants depending on design. Such promoters include viral, fungal, bacterial, animal and plant-derived promoters capable of functioning in plant cells.
Constitutive promoters include, for example CaMV 35S promoter (Odell et al (1985) Nature 313, 9810-812); rice actin (McElroy et al (1990) Plant Cell 2: 163-171); ubiquitin (Christian et al . (1989) Plant Mol. Biol. 18 (675-689); pEMU (Last et al (1991) Theor Appl. Genet. 81: 581-588); MAS (Velten et al (1984) EMBO J. 3. 2723-2730); ALS promoter (U.S. Application Seriel No. 08/409,297), and the like. Other constitutive promoters include those in U.S. Patent Nos. 5,608,149; 5,608,144; 5,604,121; 5,569,597; 5,466,785; 5,399,680, 5,268,463; and 5,608,142.
Chemical-regulated promoters can be used to modulate the expression of a gene in a plant through the application of an exogenous chemical regulator. Depending upon the objective, the promoter may be a chemical-inducible promoter, where application of the chemical induced gene expression, or a chemical-repressible promoter, where application of the chemical represses gene expression. Chemical-inducible promoters are known in the art and include, but are not limited to, the maize In2-2 promoter, which is activated by benzenesulfonamide herbicide safeners, the maize GST promoter, which is activated by hydrophobic electrophilic compounds that are used as pre-emergent herbicides, and the tobacco PR-la promoter, which is activated by salicylic acid. Other chemical-regulated promoters of interest include steroid- responsive promoters (see, for example, the glucocorticoid-inducible promoter in Schena et al (1991) Proc. Natl. Acad. Sci. USA 88: 10421-10425 and McNellie et al. (1998) Plant J. 14(2): 247-257) and tetracycline-inducible and tetracycline- repressible promoters (see, for example, Gatz et al. (1991) Mol. Gen. Genet. 227: 229-237, and US Patent Nos. 5,814,618 and 5,789,156, herein incorporated by reference. Where enhanced expression in particular tissues is desired, tissue-specific promoters can be utilised. Tissue-specific promoters include those described by Yamamoto et al. (1997) Plant J. 12(2): 255-265; Kawamata et al (1997) Plant Cell Physiol. 38(7): 792-803; Hansen et al (1997) Mol. Gen. Genet. 254(3): 337-343; Russell et al. (1997) Transgenic Res. 6(2): 157-168; Rinehart et al (1996) Plant Physiol. 112(3): 1331-1341; Van Camp et al (1996) Plant Physiol. 112(2): 525-535; Canevascni et al (1996) Plant Physiol. 112(2): 513-524; Yamamoto et al (1994) Plant Cell Physiol. 35(5): 773-778; Lam (1994) Results Probl. Cell Differ. 20: 181-196; Orozco et al (1993) Plant Mol. Biol. 23(6): 1129-1138; Mutsuoka et al (1993) Proc. Natl. Acad. Sci. USA 90(20): 9586-9590; and Guevara-Garcia et al (1993) Plant J. 4(3): 495-50.
In a preferred embodiment of the invention said tissue specific promoter is a promoter which is active during the accumulation of oil in developing oil seeds, (for example see Broun et al. (1998) Plant J. 13(2): 201-210.
"Operably linked" means joined as part of the same nucleic acid molecule, suitably positioned and oriented for transcription to be initiated from the promoter. DNA operably linked to a promoter is "under transcriptional initiation regulation" of the promoter, hi a preferred embodiment the promoter is an inducible promoter or a developmentally regulated promoter.
Particular vectors are nucleic acid constructs which operate as plant vectors. Specific procedures and vectors previously used with wide success upon plants are described by Guerineau and Mullineaux (1993) (Plant transformation and expression vectors. In: Plant Molecular Biology Labfax (Croy RRD ed) Oxford, BIOS Scientific Publishers, pp 121-148. Suitable vectors may include plant viral-derived vectors (see e.g. EP-A-194809).
Vectors may also include selectable genetic marker such as those that confer selectable phenotypes such as resistance to herbicides (e.g. kanamycin, hygromycin, phosphinotricin, chlorsulfuron, methotrexate, gentamycin, spectinomycin, imidazolinones and glyphosate).
Alternatively, or in addition, said vectors are vectors suitable for mammalian cell transfection or yeast cell transfection. In the latter example multi-copy vectors such as 2μ episomal vectors are preferred. Alternatively yeast CEN vectors and integrating vectors such as YIP vectors are suitable for transformation of yeast species such as Saccharomyces cerevisiae and Pichia spp.
According to a yet further aspect of the invention there is provided a method to produce a chimeric polypeptide according to the invention comprising the steps of:
i) providing a cell according to the invention and growth conditions conducive to the production of a chimeric polypeptide according to the invention; and optionally
ii) purifying said chimeric polypeptide from said cell or growth media.
According to a further aspect of the invention there is provided a cell transfected or transformed with at least one nucleic acid molecule or vector according to the invention.
In a preferred embodiment of the invention said cell is eukaryotic cell.
h a preferred embodiment of the invention said cell is a plant cell.
According to a further aspect of the invention there is provided a plant comprising a cell according to the invention.
In a preferred embodiment of the invention said plant is selected from: corn (Zea mays), canola (Brassica napus, Brassica rapa ssp.), flax (Linum usitatissimum), alfalfa (Medicago sativd), rice (Oryza sativa), rye (Secale cerale), sorghum (Sorghum bicolor, Sorghum vulgare), sunflower (Helianthus annus), wheat (Tritium aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis hypogaed), cotton (Gossypium hirsutum), sweet potato (Iopmoea batatus), cassava (Manihot esculenta), coffee (Cofea spp.), coconut (Cocos nuciferά), pineapple (Anana comosus), citris tree (Cirrus' spp.) cocoa (Theobroma cacao), tea (Camellia senensis), banana (Musa spp.), avacado (Persea americana), fig (Ficus casica), guava (Psidium guajava), mango (Mangifer indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia inter grifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), oats, barley, vegetables and ornamentals.
Preferably, plants of the present invention are crop plants (for example, cereals and pulses, maize, wheat, potatoes, tapioca, rice, sorghum, millet, cassava, barley, pea), and other root, tuber or seed crops. Important seed crops are oil-seed rape, sugar beet, maize, sunflower, soybean,sorghum, and flax (linseed). Horticultural plants to which the present invention may be applied may include lettuce, endive, and vegetable brassicas including cabbage, broccoli, and cauliflower. The present invention may be applied in tobacco, cucurbits, carrot, strawberry, sunflower, tomato, pepper. Grain plants that provide seeds of interest include oil-seed plants and leguminous plants. Seeds of interest include grain seeds, such as corn, wheat, barley, rice, sorghum, rye, etc.
In a preferred embodiment of the invention said plant is an oil seed plant.
Oil seed plants include cotton, soybean, safflower, sunflower, Brassica, maize, alfalfa, palm, coconut, etc. Leguminous plants include beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mungbean, lima bean, fava been, lentils, chickpea, etc. In a preferred embodiment of the invention said oil seed plant is safflower.
hi a further preferred embodiment of the invention said cell is a prokaryotic cell.
According to a further aspect of the invention there is provided a seed comprising a cell according to the invention.
According to a further aspect of the invention there is provided a vaccine preparation comprising a cell, or a fraction of a cell, according to the invention.
In a preferred embodiment of the invention said preparation is an oil body containing fraction. Preferably said fraction includes a chimeric polypeptide according to the invention associated with oil bodies in said fraction.
In a further preferred embodiment of the invention said preparation includes at least one carrier.
The term carrier is construed in the following manner. A carrier is an immunogenic molecule which, when bound to a second molecule augments immune responses to the latter. Some antigens are not intrinsically immunogenic (i.e. not immunogenic in their own right) yet may be capable of generating antibody responses when associated with a foreign protein molecule such as keyhole-limpet haemocyanin or tetanus toxoid. Certain antigens which lack T-cell epitopes, such as polymers with a repeating B-cell epitope are intrinsically immunogenic to a limited extent. These are known as T-independent antigens. Such antigens benefit from association with a carrier such as tetanus toxoid, under which circumstance they elicit much stronger antibody responses.
In a further preferred embodiment of the invention said preparation includes at least one adjuvant. An adjuvant is a substance or procedure which augments specific immune responses to antigens by modulating the activity of immune cells. Examples of adjuvant include, by example only, Freund's adjuvant, moray deputies, liposome's. Adjuvant are distinct from carriers.
The vaccine preparations may conveniently be presented in unit dosage form and may be prepared by any of the methods well-known in the art of pharmacy. In general, the vaccine preparations are prepared by uniformly and intimately bringing the active polypeptide into association with a liquid stabiliser, a finely divided solid stabiliser, or both, and then, if necessary, shaping the product. Solid dosage forms for oral administration include capsules, tablets, pills, powders and granules. In such solid dosage forms, the active polypeptide is typically mixed with at least one inert, pharmaceutically acceptable recipient such as sodium citrate or declaim phosphate and/or one or more: a) fillers or extenders such as starches, lactose, sucrose, glucose, manifold and silica acid; b) binders such as carboxymethylcellulose, alginates, gelatin, polyvinylpyrrolidone, sucrose and acacia; c) humectants such as glycerol; d) disintegrating agents such as agar-agar, calcium carbonate, potato or tapioca starch, alginic acid, certain silicates and sodium carbonate; e) solution retarding agents such as paraffin; f) absorption accelerators such as quaternary ammonium compounds; g) wetting agents such as cetyl alcohol and glycerol monostearate; h) absorbents such as kaolin and bentonite clay and i) lubricants such as talc, calcium stearate, magnesium stearate, solid polyethylene glycols, sodium lauryl sulfate and mixtures thereof, hi the case of capsules, tablets and pills, the dosage form may also comprise buffering agents. Solid compositions of a similar type may also be employed as fillers in soft and hard-filled gelatin capsules using such excipients as lactose or milk sugar as well as high molecular weight polyethylene glycol, for example. The solid dosage forms of tablets, dragees, capsules, pills, and granules can be prepared with coatings and shells such as enteric coatings and other coatings well known in the pharmaceutical formulating art. They may optionally contain opacifying agents and may also be of a composition such that they release the active ingredient(s) only, or preferentially, in a certain part of the intestinal tract, and/or in delayed fashion. Examples of embedding compositions that can be used include polymeric substances and waxes.
According to a further aspect of the invention there is provided a method to immunise an animal to at least the second part of the chimeric polypeptide according to the invention, comprising administering an effective amount of a chimeric polypeptide or preparation according to the invention sufficient to stimulate an immune response to said second part.
In a preferred method of the invention said animal is human.
In an alternative preferred method of the invention said animal is selected from the group consisting of: mouse; rat; hamster; goat; cow, horse, pig, dog, cat, or sheep.
In a further preferred method of the invention said immune response is the production of antibodies to said chimeric polypeptide.
In an alternative preferred method of the invention said immune response is the production of T-helper cells which recognise the second part of said chimeric polypeptide.
A preferred route of administration is oral. However, intradermal, subcutaneous, intramuscular or intranasal immunisation is within the scope of the invention.
According to a yet further aspect of the invention there is provided an antibody obtainable by the method according to the invention.
In a preferred embodiment of the invention said antibody is a therapeutic antibody.
In a further preferred embodiment of the invention said antibody is a diagnostic antibody. Preferably said diagnostic antibody is provided with a label or tag. In a preferred embodiment of the invention said antibody is a monoclonal antibody or binding fragment thereof. Preferably said antibody is a humanised or chimeric antibody.
A chimeric antibody is produced by recombinant methods to contain the variable region of an antibody with an invariant or constant region of a human antibody. A humanised antibody is produced by recombinant methods to combine the complementarity determining regions (CDRs) of an antibody with both the constant (C) regions and the framework regions from the variable (V) regions of a human antibody.
Chimeric antibodies are recombinant antibodies in which all of the V-regions of a mouse or rat antibody are combined with human antibody C-regions. Humanised antibodies are recombinant hybrid antibodies which fuse the comphmentarity determining regions from a rodent antibody V-region with the framework regions from the human antibody V-regions. The C-regions from the human antibody are also used. The complimentarity determining regions (CDRs) are the regions within the N- terminal domain of both the heavy and light chain of the antibody to where the majority of the variation of the V-region is restricted. These regions form loops at the surface of the antibody molecule. These loops provide the binding surface between the antibody and antigen.
Antibodies from non-human animals provoke an immune response to the foreign antibody and its removal from the circulation. Both chimeric and humanised antibodies have reduced antigenicity when injected to a human subject because there is a reduced amount of rodent (i.e. foreign) antibody within the recombinant hybrid antibody, while the human antibody regions do not elicit an immune response. This results in a weaker immune response and a decrease in the clearance of the antibody. This is clearly desirable when using therapeutic antibodies in the treatment of human diseases. Humanised antibodies are designed to have less "foreign" antibody regions and are therefore thought to be less immunogenic than chimeric antibodies.
It is also possible to create single variable regions, so called single chain antibody variable region fragments (scFv's). If a hybridoma exists for a specific monoclonal antibody it is well within the knowledge of the skilled person to isolate scFv's from mRNA extracted from said hybridoma via RT PCR. Alternatively, phage display screening can be undertaken to identify clones expressing scFv's. Alternatively said fragments are "domain antibody fragments". Domain antibodies are the smallest binding part of an antibody (approximately 13kDa). Examples of this technology is disclosed in US6, 248, 516, US6, 291, 158, US6,127, 197 and EP0368684 which are all incorporated by reference in their entirety.
In a further preferred embodiment of the invention said antibodies are opsonic antibodies.
Phagocytosis is mediated by macrophages and polymorphic leukocytes and involves the ingestion and digestion of micro-organisms, damaged or dead cells, cell debris, insoluble particles and activated clotting factors. Opsonins are agents which facilitate the phagocytosis of the above foreign bodies. Opsonic antibodies are therefore antibodies which provide the same function. Examples of opsonins are the Fc portion of an antibody or compliment C3. Antibodies raised by immunisation and in the form of an immune complex with antigen may bring about opsonisation via the fixation of complement on the antigen, or molecules in its immediate microenvironment.
In a further aspect of the invention there is provided a method for preparing a hybridoma cell-line producing monoclonal antibodies according to the invention comprising the steps of: i) immunising an immunocompetent mammal with a chimeric polypeptide or preparation according to the invention; ii) fusing lymphocytes of the immunised immunocompetent mammal with myeloma cells to form hybridoma cells; iii) screening monoclonal antibodies produced by the hybridoma cells of step (ii) for binding activity to polypeptide of the invention; iv) culturing the hybridoma cells to proliferate and/or to secrete said monoclonal antibody; and v) recovering the monoclonal antibody from the culture supernatant.
Preferably, said immunocompetent mammal is a rodent, for example a mouse, rat or hamster.
According to a further aspect of the invention there is provided a hybridoma cell-line obtainable by the method according to the invention.
According to a further aspect of the invention there is provided a method to prepare a vaccine preparation according to the invention comprising; i) forming a cell extract from a plant cell culture or plant according to the invention; and ii) separating said extract into an oil body rich fraction.
In a preferred embodiment of the invention said extract is incubated with a protease which cleaves said second part from said first part.
Throughout the description and claims of this specification, the word "comprise" and variations of this word, for example "comprising" and "comprises", means "including but not limited to", and is not intended to (and does not) exclude other moieties, additives, components, integers or steps.
An embodiment of the invention will be described by example only and with reference to the following figures; Figure la is the nucleic acid sequence of OLB1; Figure lb is the amino acid sequence ofOLBl;
Figure 2a is the nucleic acid sequence of OLB 2; Figure 2b is the amino acid sequence of OLB2;
Figure 3 is the amino acid sequence of the hydrophobic domain of OLB 1;
Figure 4a and 4b is the nucleic acid sequence of a spider silk protein; and
Table 1 shows the nucleic acid sequences of plant GTase cDNA's.
Materials and Methods
Materials- Castor beans (Ricinus communis var. gibsonii) were supplied by Chiltern Seeds (Ulverston, Cumbria, UK). The beans were soaked in running water for 1 d and germinated in the dark on moist vermiculite at 30°C. All reagents were obtained from Sigma-Aldrich Co. (Gillingham, UK), except for glycerol tri[l-14C]oleate that was from Amersham Biosciences UK Ltd. (Little Chalfont, UK) and Flavobacterium meningsepticum propyl endopeptidase (PEP) from Seikagaku America (Falmouth, USA).
Preparation of oil body membranes and proteolytic treatment — Oil bodies were isolated from the endosperm of imbibed castor beans and the membranes delipidated using the methods of Hills et al., (2). Peripheral proteins were removed from the oil bodies using urea buffer, according to Millichip et al, (3). The membrane fraction was solublized in SDS-loading buffer, heated at 70°C for 10 min and the polypeptides separated on a 14% SDS-PAGE gel as described by Laemmli (5) and stained using 0.2% (w/v) Coomassie blue R-250 in methanol: acetic acid: water (4:1:5 v/v/v). For proteolytic treatments purified urea washed oil bodies were incubated in 0.1M phosphate buffer (pH 7) containing 0.2 U ml"1 of PEP for lh at 37°C. The oil bodies were recovered by centrifugation. Tryptic digestion and peptide sequence analysis - fri-gel tryptic digestion was performed by washing the gel pieces in Eppendorf tubes three times with 50% (v:v) aqueous acetonitrile containing 25 mM ammonium bicarbonate and drying the gel pieces in a vacuum concentrator for 30 min. Sequencing-grade modified porcine trypsin (Promega) was dissolved in the 50 mM acetic acid supplied by the manufacturer, then diluted 5-fold by adding 25 mM ammonium bicarbonate containing 0.1% (v.v) octyl-β-D-glucopyranoside (Sigma) to give a final trypsin concentration is 0.02 μg/μL. Gel pieces were rehydrated by adding 10 μL of trypsin solution, and after 30 min enough trypsin solution was added to cover the gel. Digests were incubated overnight at 37°C.
Tryptic digests were applied directly to the MALDI target plate. Dried peptides were redissolved in 0.5 μL of formic acid followed by addition of 9.5 μL of water, and one or more 0.5 μL aliquots were spotted and dried onto the MALDI target plate. A matrix solution containing 10 mg/mL 4-hydroxy-c-cyano-cinnamic acid (Sigma) in 50% aqueous (v:v) acetonitrile containing 0.1%) TFA (v:v) was freshly prepared. Serial 2-fold dilutions of matrix solution were spotted onto the MALDI target plate, and the dilution that gave a uniform layer of crystals when viewed in the mass spectrometer was selected for use with samples. A 0.5 μL aliquot of matrix solution was pipetted onto the dried peptide spot. In some cases, the matrix was added directly to the sample droplet on the target plate before drying.
Positive-ion MALDI mass spectra were obtained using an Applied Biosystems 4700 Proteomics Analyzer (CTS version, Applied Biosystems, Foster City, CA, USA) in reflection mode with an accelerating voltage of 20 kV. MS spectra were acquired with a total of 1000 laser pulses over a mass range of m z 800-4000. Final mass spectra were the summation of 20 sub-spectra, each acquired with 50 laser pulses, and internally calibrated using the tryptic peptides at m/z 842.509 and 2211.104. Monoisotopic masses were obtained from centroids of raw, unsmoothed data. For CID-MS/MS, a Source 1 accelerating voltage of 8 kV, a collision energy of 1 kV, and a Source 2 accelerating voltage of 15 kV were used. Air was used as the collision gas at the instrument's 'medium' pressure setting with a recharge threshold of 9.9xl0"7 torr, which produced a Source 2 pressure of about lxlO"6 torr. The precursor mass window was set to +/-10 "Da", and the metastable suppressor was enabled. The default calibration was used for MS/MS spectra.
Mass spectral data obtained in batch mode were submitted to database searching using a locally-running copy of the Mascot program (Matrix Science Ltd., versionl.7). Batch-acquired MS and MS/MS spectral data were submitted to a combined peptide mass fingerprint and MS/MS ion search through the Applied Biosystems GPS Explorer software interface (version 1.0) to Mascot. Search criteria included: Maximum missed cleavages, 1; Variable modifications, Oxidation (M); Peptide tolerance, 25 ppm; MS/MS tolerance, 0.2 Da. Peptide sequence tags were generated from CID-MS/MS spectra by manual interpretation or using a de novo sequencing program supplied by Applied Biosystems.
RNA extraction, cDNA synthesis and PCR - Total RNA from various tissues was isolated using the RNeasy kit from Qiagen. The synthesis of single stranded cDNA was carried out using Superscript™ II RNase H" reverse transcriptase from Invitrogen. Degenerate primers corresponding to peptide sequences 1 and 2 were designed (5'-ttgatagtirtyagyttyaga and 5'-ctgtccraatgtrtaiarctt) and used to amplify a fragment of the acid lipase (RcOBLl) cDNA from imbibed seed endosperm. Gene specific primers (5'-gaccacttggtatgggcatatgatgg and 5'-catgtcattgcagtaaaccaccctga) were then used to obtain the full-length cDNA sequence by 3'- and 5 '-RACE using the SMART™ RACE cDNA Amplification kit from BD Biosciences, following the manufacturers protocols. Primers to a castor actin-like gene (RcACT) (5'- cgttctctccttgtatgccagtggtc and 5'-gagctgctcttggcagtctcaagttc) were used as a constitutive control for RT-PCR experiments on various tissues. A partial clone of RcOBL2 was obtained by performing PCR on cDNA from four-day old endosperm using different combinations of four pairs of degenerate primers designed using alignments of multiple OBL-like genes (5'- rtyagyttyagaggiachgarc or 5'-rtyagyttyagaggiachgayc or 5'-rtyagyttyagaggcachgarc or 5'- rtyagyttyagaggcachgayc with 5'-giaccatrtcrttrttrtabac or 5'-giaccatrtcrttrcartabac or 5'-giaciayrtcrttrttrtabac or 5'-giaciayrtcrttrcartabac). Gene specific primers (5'- taggtctgggcaacagaagtgacgctac and 5'-tgcccaaatgtgtatatgttcagcaacc) were then used to obtain the full-length RcOBLl cDNA sequence by 3'- and 5 '-RACE using the SMART™ RACE cDNA Amplification kit from BD Biosciences, following the manufacturers protocols.
ABBREVIATIONS
The abbreviations used are: TAG, triacylglycerol; CHAPS, 3-cholamidopropyl- dimethylammonio-1-propanesulphonate; PEP, propyl endopeptidase; TFA, trifluoroacetic acid; MALDI, matrix-assisted laser desorption/ionization; MS, mass spectrometry; CE), collision-induced dissociation; PCR, polymerase chain reaction; RT-PCR, reverse transcriptase-PCR; RACE-PCR, rapid amplification if cDNA ends- PCR; PAGE, polyacrylamide gel electrophoresis; MBP, maltose binding protein.
REFERENCES
1. Fuchs, C. Vine, N. and Hills, M. J. (1996) J. Plant Physiol. 149, 23-29.
2. Hills, M.J., Murphy, DJ. and Beevers, H. (1989) Plant Physiol. 89, 1006-1010.
3. Millichip, M.5 Tatham, A.S., Jackson, F., Griffiths, G., Shewry, P.R. and Stobart, K.A. (1996) Biochem. J. 314, 333-337.
4. Bradford, M.M. (1976) Anal Biochem. 72, 248-254. 5. Laemmli, U.K. (1970) Nature 227, 680-685. >UGT71B1 TABLE 1
ATGAAAGTAGAACTTGTGTTCATACCATCGCCGGGCGTTGGCCATATCCGAGCAACAACG GCGTTAGCAAAGCTTCTCGTTGCCAGCGACAACCGCCTCTCCGTCACTCTCATCGTCATT CCTTCACGAGTCTCCGACGACGCTTCTTCCTCCGTCTACACGAACTCCGAAGACCGTCTC CGCTACATCCTCCTCCCCGCCCGAGATCAAACTACTGATCTCGTATCTTACATCGACAGC CAGAAACCACAAGTAAGAGCCGTCGTGTCCAAGGTCGCTGGAGATGTTTCAACACGTTCA GACTCACGGCTAGCTGGGATTGTCGTAGACATGTTCTGCACGTCCATGATAGACATCGCC GATGAGTTTAACCTCTCGGCTTATATCTTCTACACGTCCAACGCTTCTTATCTCGGGCTA CAGTTCCACGTTCAATCTCTTTACGACGAGAAAGAACTCGACGTAAGTGAGTTCAAAGAT ACGGAGATGAAGTTTGACGTTCCAACTCTGACTCAGCCTTTTCCGGCAAAATGTTTGCCT TCAGTGATGCTAAACAAGAAATGGTTTCCTTACGTTTTGGGTCGAGCTAGAAGTTTTAGA GCAACGAAGGGTATTTTGGTAAATTCGGTGGCTGACATGGAACCTCAGGCGTTGAGTTTC TTTTCCGGTGGAAATGGGAATACAAATATCCCTCCGGTGTACGCGGTTGGGCCCATTATG GACTTAGAATCTAGCGGCGATGAAGAGAAGAGAAAGGAGATTTTACATTGGCTAAAAGAG CAACCGACGAAATCTGTAGTGTTTCTCTGTTTTGGGAGCATGGGAGGTTTCAGTGAGGAA CAAGCAAGAGAAATAGCTGTGGCGCTCGAGCGAAGCGGACACAGGTTTCTCTGGTCGCTT CGCCGCGCTTCTCCTGTTGGAAACAAGTCTAATCCTCCTCCCGGAGAATTCACGAACTTA GAGGAGATTCTTCCAAAAGGGTTTTTAGATCGGACGGTGGAGATAGGGAAGATCATAAGC TGGGCACCACAAGTAGATGTGTTGAATAGTCCTGCTATAGGAGCGTTCGTGACACATTGT GGATGGAACTCAATTCTCGAGAGTCTTTGGTTCGGTGTTCCGATGGCGGCGTGGCCTATC TATGCTGAGCAACAGTTTAACGCGTTTCATATGGTGGATGAGCTTGGTTTAGCGGCGGAG GTAAAGAAGGAGTACCGTAGAGATTTTCTGGTGGAGGAGCCGGAGATTGTGACGGCTGAT GAGATAGAGAGAGGGATCAAGTGTGCGATGGAGCAGGATAGCAAGATGAGGAAGAGGGTG ATGGAGATGAAGGATAAGCTCCACGTGGCGTTGGTGGACGGTGGATCTTCGAACTGTGCT CTAAAGAAGTTTGTTCAAGACGTGGTCGATAATGTTCCATAA
>UGT71B2
ATGAAACTGGAGCTGGTGTTCATACCATCACCTGGTGACGGACATCTCCGGCCATTAGTG GAGGTAGCTAAGCTTCATGTTGACCGTGACGACCATCTCTCCATCACCATCATCATCATC CCTCAGATGCATGGATTTAGTAGCAGTAACTCTTCTTCTTACATCGCTTCTCTCTCCTCT GATTCTGAAGAACGTCTTAGCTACAACGTTCTCTCCGTCCCTGATAAACCAGACTCCGAT GACACCAAACCACATTTTTTCGACTACATTGATAACTTCAAGCCGCAGGTCAAAGCCACG GTGGAAAAACTTACTGACCCGGGTCCACCAGATTCGCCGTCGCGTCTTGCTGGATTCGTG GTGGATATGTTTTGCATGATGATGATTGATGTCGCTAATGAGTTTGGTGTTCCCAGTTAC ATGTTTTACACATCCAACGCAACGTTTCTTGGATTGCAAGTTCATGTTGAATACCTTTAC GACGTTAAGAACTATGACGTTAGTGACCTCAAGGACTCGGACACTACTGAGCTGGAAGTT CCTTGTTTGACTCGTCCTTTACCGGTTAAGTGTTTCCCCTCGGTTCTATTAACCAAGGAG TGGTTACCGGTTATGTTTAGACAAACCAGAAGATTCCGAGAAACTAAAGGTATTTTGGTA AATACATTCGCTGAGCTTGAGCCTCAAGCTATGAAGTTTTTCTCCGGCGTAGATAGTCCT CTGCCTACGGTGTACACAGTTGGACCGGTTATGAATCTTAAAATCAACGGTCCAAATTCA TCTGACGATAAGCAATCGGAGATCCTACGGTGGCTAGACGAGCAGCCACGTAAATCCGTT GTTTTCCTCTGTTTCGGAAGCATGGGAGGTTTCCGTGAGGGCCAAGCTAAAGAAATCGCA ATCGCGCTTGAGCGAAGTGGTCACCGCTTTGTCTGGTCTCTTCGTCGTGCTCAACCAAAA GGATCGATAGGACCTCCCGAAGAATTTACGAATCTTGAGGAAATTCTCCCGGAAGGATTC TTGGAACGGACGGCAGAGATAGGAAAGATTGTAGGTTGGGCTCCACAAAGCGCCATTCTA GCAAATCCTGCGATCGGAGGGTTCGTGTCGCATTGTGGATGGAACTCGACGCTAGAGAGT CTATGGTTCGGAGTTCCGATGGCTACGTGGCCGCTTTACGCAGAGCAACAAGTTAACGCG TTCGAGATGGTTGAGGAGCTAGGGCTAGCGGTGGAGGTCCGAAATAGTTTCCGAGGAGAT TTCATGGCGGCGGATGATGAGTTGATGACGGCAGAGGAGATAGAGAGAGGGATCCGGTGT TTGATGGAGCAGGATAGTGACGTGAGGAGTAGAGTGAAGGAGATGAGCGAGAAGAGTCAC GTAGCTTTAATGGACGGTGGATCTTCGCACGTTGCTCTTCTAAAGTTTATTCAAGACGTC ACTAAGAATATCTCTTGA
>UGT71B5 ATGAAGATTGAGCTTGTGTTCATACCTTTGCCGGGGATTGGTCATCTCAGGCCAACCGTG AAGCTAGCGAAGCAACTCATAGGCAGCGAAAACCGTCTTTCGATCACCATAATCATCATC CCTTCAAGATTTGACGCCGGTGATGCATCCGCCTGTATCGCATCTCTCACCACGTTGTCT CAAGATGATCGCCTCCATTACGAATCCATATCCGTCGCAAAACAACCACCAACCTCCGAC CCGGATCCTGTTCCGGCTCAAGTGTACATAGAGAAACAAAAGACGAAAGTGAGAGATGCA GTCGCGGCGAGAATCGTCGATCCAACAAGAAAGCTCGCGGGATTCGTGGTGGACATGTTC TGTTCCTCGATGATCGATGTAGCTAACGAGTTTGGAGTTCCGTGTTATATGGTATACACA TCGAACGCTACGTTTTTAGGAACCATGCTTCACGTTCAACAAATGTACGATCAAAAGAAG TATGACGTCAGCGAGTTAGAAAACTCGGTCACCGAGTTGGAGTTTCCGTCTCTGACTCGT CCTTATCCAGTGAAGTGTCTTCCTCATATCCTCACTTCAAAGGAGTGGTTACCTCTCTCT CTAGCTCAAGCTAGGTGTTTCCGGAAGATGAAGGGTATTTTGGTAAATACAGTTGCTGAG CTTGAACCTCACGCTTTGAAAATGTTCAATATTAATGGTGACGATCTTCCTCAAGTTTAT CCTGTTGGACCAGTGTTGCATCTCGAAAACGGCAATGACGATGATGAGAAGCAATCGGAA ATTTTGCGGTGGCTCGACGAGCAACCGTCTAAATCTGTTGTGTTTCTCTGCTTTGGGAGC TTGGGAGGTTTCACTGAAGAACAAACAAGAGAAACCGCTGTGGCCCTAGATAGAAGCGGT CAGCGGTTTCTTTGGTGTCTTCGTCACGCATCGCCAAATATAAAAACAGATCGTCCCAGA GATTACACGAATCTTGAGGAGGTTTTACCGGAGGGGTTCTTGGAACGGACTTTGGATAGA GGGAAAGTGATTGGATGGGCACCACAAGTGGCGGTACTAGAGAAGCCGGCGATAGGAGGG TTTGTCACTCACTGCGGTTGGAACTCTATTTTAGAGAGCTTGTGGTTCGGTGTTCCAATG GTGACGTGGCCGCTATACGCGGAACAGAAGGTTAACGCGTTTGAGATGGTTGAGGAGCTG GGTTTGGCGGTGGAGATACGGAAGTACTTAAAAGGAGATTTGTTCGCCGGAGAGATGGAG ACGGTTACCGCGGAGGATATAGAGAGAGCCATTAGGCGTGTGATGGAGCAAGACAGTGAC GTTAGGAACAACGTGAAAGAGATGGCGGAGAAGTGCCACTTCGCGTTAATGGACGGTGGA TCTTCGAAGGCGGCTTTGGAAAAGTTTATTCAAGACGTGATAGAGAATATGGATTAA
>UGT71B6
ATGAAAATAGAGCTAGTATTCATTCCCTCTCCGGCAATTAGTCATCTCATGGCGACGGTA GAGATGGCGGAGCAACTAGTTGATAAAAACGACAACCTCTCTATCACCGTAATCATCATA TCTTTTAGTTCTAAAAATACATCCATGATCACCTCTCTTACATCCAACAACCGCCTCCGG TACGAAATAATCTCCGGAGGAGATCAACAACCAACGGAGCTCAAAGCAACTGATTCCCAC ATCCAAAGTCTAAAGCCACTGGTGAGAGACGCGGTTGCTAAACTCGTAGATTCCACTCTA CCAGACGCGCCTCGTCTTGCGGGATTCGTTGTTGACATGTACTGCACGTCGATGATCGAT GTCGCTAACGAATTTGGCGTCCCTAGTTACTTGTTTTACACCTCTAACGCTGGATTTCTT GGACTTTTGCTTCACATTCAGTTCATGTACGATGCAGAGGATATCTATGACATGAGCGAA TTAGAAGACTCTGACGTAGAGTTGGTGGTTCCGAGTTTGACTAGTCCTTATCCGTTGAAA TGTCTTCCTTACATTTTCAAATCAAAAGAGTGGCTCACTTTTTTTGTAACTCAAGCGAGA AGATTCAGAGAAACTAAGGGCATTTTGGTAAACACGGTTCCTGACTTGGAACCTCAAGCG TTGACGTTTCTTTCCAATGGTAACATTCCACGTGCTTACCCAGTAGGACCATTGTTGCAT CTCAAAAACGTAAATTGTGATTACGTGGACAAGAAGCAATCGGAGATTTTACGGTGGCTA GACGAGCAACCGCCAAGATCTGTAGTGTTCCTCTGTTTCGGGAGCATGGGAGGGTTCAGT GAGGAACAAGTGAGAGAAACCGCATTAGCTCTCGATCGAAGCGGCCACCGGTTTCTTTGG TCTCTCCGTCGTGCATCTCCGAATATATTGAGAGAGCCTCCCGGAGAATTCACAAACCTA GAGGAGATTCTCCCAGAAGGGTTTTTCGATCGGACGGCTAACAGAGGAAAGGTTATCGGA TGGGCTGAACAGGTGGCCATATTGGCGAAGCCGGCGATCGGAGGTTTTGTTTCTCACGGC GGATGGAATTCGACGTTGGAGAGTTTGTGGTTTGGTGTTCCGATGGCGATTTGGCCGCTT TACGCTGAACAGAAGTTTAACGCTTTCGAGATGGTGGAAGAGCTTGGTTTGGCTGTGGAG ATCAAGAAGCATTGGCGAGGAGATCTTTTGTTGGGGAGGTCGGAGATTGTGACGGCGGAG GAGATTGAGAAAGGAATCATATGTTTGATGGAGCAAGACAGTGACGTCAGGAAGAGAGTG AATGAGATCAGCGAGAAGTGCCACGTGGCTTTAATGGACGGTGGATCGTCAGAAACTGCT TTGAAAAGATTTATTCAAGACGTAACGGAGAATATTGCTTGGTCGGAAACTGAAAGCTAG
>UGT71B7
ATGAAATTTGAGCTTGTTTTCATCCCCTATCCCGGAATCGGTCATCTCCGATCAACGGTA GAAATGGCAAAGCTACTAGTGGACCGTGAAACTCGTCTCTCTATCTCCGTTATCATCCTT CCTTTCATTTCCGAAGGCGAAGTCGGTGCTTCCGATTACATCGCAGCCCTCTCCGCCTCA TCCAACAACCGCCTCCGCTACGAAGTTATCTCCGCCGTAGATCAACCAACCATCGAGATG ACGACAATTGAAATCCATATGAAGAACCAAGAACCAAAGGTGAGAAGCACCGTTGCAAAA CTCCTTGAAGACTATTCGTCTAAACCGGACTCGCCGAAGATCGCTGGCTTTGTTCTAGAC ATGTTTTGCACTTCGATGGTAGATGTAGCGAACGAGTTTGGTTTCCCGAGTTATATGTTT TACACCTCCAGTGCCGGGATTCTCTCAGTTACATATCATGTTCAAATGTTGTGCGATGAG AACAAGTACGATGTTAGTGAAAATGATTATGCAGACTCGGAAGCTGTGTTGAACTTTCCG AGTTTGAGTCGTCCTTATCCGGTGAAGTGTCTTCCTCACGCTCTGGCAGCTAATATGTGG CTCCCGGTGTTTGTAAACCAAGCGAGAAAGTTTAGGGAGATGAAAGGTATTTTGGTAAAT ACTGTTGCTGAGCTTGAACCTTATGTGTTAAAGTTTCTTTCTAGTAGTGATACTCCTCCT GTTTATCCTGTTGGACCATTGTTGCATCTTGAGAACCAACGTGATGATTCTAAGGACGAG AAACGGTTGGAGATTATACGGTGGTTGGATCAGCAACCACCAAGTTCGGTTGTGTTTCTC TGCTTTGGGAGCATGGGAGGCTTCGGTGAGGAACAAGTAAGAGAGATCGCAATCGCGTTA GAGCGAAGTGGGCACCGGTTTCTCTGGTCTCTTCGTCGCGCATCTCCGAATATATTCAAA GAACTTCCAGGAGAGTTTACTAATCTAGAGGAAGTTCTCCCGGAAGGATTCTTTGATCGA ACGAAAGATATAGGTAAAGTGATTGGATGGGCTCCACAAGTAGCCGTTCTTGCGAATCCG GCTATAGGAGGTTTCGTAACTCATTGCGGGTGGAATTCTACGCTAGAGAGTCTTTGGTTT GGTGTTCCAACAGCTGCATGGCCGTTATACGCAGAGCAGAAGTTCAATGCTTTCTTAATG GTGGAGGAGCTTGGATTGGCGGTGGAGATAAGGAAGTATTGGCGAGGTGAACATTTGGCG GGATTACCGACGGCTACTGTGACAGCGGAGGAGATAGAGAAAGCAATCATGTGTCTAATG GAACAAGATAGTGACGTGAGGAAAAGAGTGAAGGATATGAGCGAGAAATGCCATGTGGCT TTAATGGATGGTGGATCGTCGCGTACTGCGTTGCAAAAGTTTATTGAAGAGGTTGCGAAG AATATAGTTTCACTAGATAAGGAATTTGAGCATGTAGCTCTTAAATGA
>UGT71B8
ATGAACAAATTTGCGCTTGTCTTCGTACCATTTCCTATACTTGGTCATCTCAAATCAACC GCCGAGATGGCTAAGCTACTAGTGGAGCAAGAAACTCGCCTCTCTATCTCCATTATCATC CTTCCTCTTCTTTCCGGAGACGACGTCAGTGCTTCCGCTTATATCTCAGCTCTTTCCGCC GCATCCAACGACCGCCTTCACTATGAAGTGATCTCGGACGGAGATCAACCAACCGTCGGG TTACATGTCGATAACCACATCCCGATGGTGAAACGTACCGTTGCAAAACTCGTTGATGAC TACTCAAGGCGGCCGGACTCGCCGAGGCTCGCTGGTTTAGTTGTTGACATGTTTTGTATC TCGGTGATAGACGTGGCTAATGAGGTTAGTGTTCCGTGTTACTTGTTTTACACGTCAAAC GTTGGGATTCTTGCTCTTGGGTTACATATTCAGATGTTGTTTGATAAGAAGGAGTACAGT GTCAGTGAAACTGATTTTGAAGACTCGGAAGTTGTGTTGGATGTTCCGAGTTTGACTTGT CCTTATCCGGTGAAGTGTCTTCCTTATGGTTTGGCAACGAAAGAGTGGCTTCCTATGTAT CTAAATCAAGGTAGAAGATTCAGAGAGATGAAAGGTATTTTGGTAAATACTTTTGCTGAG CTTGAACCTTATGCGTTGGAGTCTCTTCACTCTAGTGGTGATACTCCTCGTGCTTATCCA GTGGGACCATTGTTGCATCTCGAGAACCATGTTGACGGTTCTAAAGACGAGAAGGGTTCG GACATTTTACGGTGGTTAGATGAACAACCACCTAAATCGGTAGTGTTCCTCTGCTTTGGA AGCATAGGAGGCTTTAACGAGGAACAAGCAAGAGAAATGGCCATTGCACTTGAGAGAAGT GGTCACCGCTTCTTGTGGTCTCTTCGCCGTGCATCTCGAGATATAGATAAGGAACTTCCC GGAGAATTCAAGAATCTTGAAGAAATTCTCCCGGAAGGATTCTTTGATCGGACAAAGGAT AAAGGAAAGGTGATCGGATGGGCTCCACAAGTAGCCGTGCTGGCTAAGCCAGCAATCGGA GGTTTTGTTACTCATTGCGGGTGGAACTCGATACTCGAGAGTCTTTGGTTCGGTGTTCCT ATAGCGCCATGGCCGTTATACGCTGAGCAGAAGTTTAATGCTTTCGTGATGGTGGAGGAG CTTGGTTTGGCAGTGAAGATAAGAAAGTATTGGCGAGGCGATCAGTTGGTGGGAACGGCG ACGGTCATAGTGACGGCAGAGGAGATAGAGAGAGGAATCAGATGTTTGATGGAGCAAGAT AGTGACGTGAGGAATAGAGTGAAGGAGATGAGTAAGAAATGTCACATGGCTTTAAAGGAT GGTGGCTCGTCTCAATCTGCTTTGAAATTATTTATTCAAGACGTTACGAAGTATATTGCT TGA
>UGT71C1
ATGGGGAAGCAAGAAGATGCAGAGCTCGTCATCATACCTTTCCCTTTCTCCGGACACATT CTCGCAACAATCGAACTCGCCAAACGTCTCATAAGTCAAGACAATCCTCGGATCCACACC ATCACCATCCTCTATTGGGGATTACCTTTTATTCCTCAAGCTGACACAATCGCTTTCCTC CGATCCCTAGTCAAAAATGAGCCTCGTATCCGTCTCGTTACGTTGCCCGAAGTCCAAGAC CCTCCACCAATGGAACTCTTTGTGGAATTTGCCGAATCTTACATTCTTGAATACGTCAAG AAAATGGTTCCCATCATCAGAGAAGCTCTCTCCACTCTCTTGTCTTCCCGCGATGAATCG GGTTCAGTTCGTGTGGCTGGATTGGTTCTTGACTTCTTCTGCGTCCCTATGATCGATGTA GGAAACGAGTTTAATCTCCCTTCTTACATTTTCTTGACGTGTAGCGCAGGGTTCTTGGGT ATGATGAAGTATCTTCCAGAGAGACACCGCGAAATCAAATCGGAATTCAACCGGAGCTTC AACGAGGAGTTGAATCTCATTCCTGGTTATGTCAACTCTGTTCCTACTAAGGTTTTGCCG TCAGGTCTATTCATGAAAGAGACCTACGAGCCTTGGGTCGAACTAGCAGAGAGGTTTCCT GAAGCTAAGGGTATTTTGGTTAATTCATACACAGCTCTCGAGCCAAACGGTTTTAAATAT TTCGATCGTTGTCCGGATAACTACCCAACCATTTACCCAATCGGGCCGATATTATGCTCC AACGACCGTCCGAATTTGGACTCATCGGAACGAGATCGGATCATAACTTGGCTAGATGAC CAACCCGAGTCATCGGTCGTGTTCCTCTGTTTCGGGAGCTTGAAGAATCTCAGCGCTACT CAGATCAACGAGATAGCTCAAGCCTTAGAGATCGTTGACTGCAAATTCATCTGGTCGTTT CGAACCAACCCGAAGGAGTACGCGAGCCCTTACGAGGCTCTACCACACGGGTTCATGGAC CGGGTCATGGATCAAGGCATTGTTTGTGGTTGGGCTCCTCAAGTTGAAATCCTAGCCCAT AAAGCTGTGGGAGGATTCGTATCTCATTGTGGTTGGAACTCGATATTGGAGAGTTTGGGT TTCGGCGTTCCAATCGCCACGTGGCCGATGTACGCGGAACAACAACTAAACGCGTTCACG ATGGTGAAGGAGCTTGGTTTAGCCTTGGAGATGCGGTTGGATTACGTGTCGGAAGATGGA GATATAGTGAAAGCTGATGAGATCGCAGGAACCGTTAGATCTTTAATGGACGGTGTGGAT GTGCCGAAGAGTAAAGTGAAGGAGATTGCTGAGGCGGGAAAAGAAGCTGTGGACGGTGGA TCTTCGTTTCTTGCGGTTAAAAGATTCATCGGTGACTTGATCGACGGCGTTTCTATAAGT AAGTAG
>UGT71C2
ATGGCGAAGCAGCAAGAAGCAGAGCTCATCTTCATCCCATTTCCAATCCCCGGACACATT CTCGCCACAATCGAACTCGCGAAACGTCTCATCAGTCACCAACCTAGTCGGATCCACACC ATCACCATCCTCCATTGGAGCTTACCTTTTCTTCCTCAATCTGACACTATCGCCTTCCTC AAATCCCTAATCGAAACAGAGTCTCGTATCCGTCTCATTACCTTACCCGATGTCCAAAAC CCTCCACCAATGGAGCTATTTGTGAAAGCTTCCGAATCTTACATTCTTGAATACGTCAAG AAAATGGTTCCTTTGGTCAGAAACGCTCTCTCCACTCTCTTGTCTTCTCGTGATGAATCG GATTCAGTTCATGTCGCCGGATTAGTTCTTGATTTCTTCTGTGTCCCTTTGATCGATGTC GGAAACGAGTTTAATCTCCCTTCTTACATCTTCTTGACGTGTAGCGCAAGTTTCTTGGGT ATGATGAAGTATCTTCTGGAGAGAAACCGCGAAACCAAACCGGAACTTAACCGGAGCTCT GACGAGGAAACAATATCAGTTCCTGGTTTTGTTAACTCCGTTCCGGTTAAAGTTTTGCCA CCGGGTTTGTTCACGACTGAGTCTTACGAAGCTTGGGTCGAAATGGCGGAAAGGTTCCCT GAAGCCAAGGGTATTTTGGTCAATTCATTTGAATCTCTAGAACGTAACGCTTTTGATTAT TTCGATCGTCGTCCGGATAATTACCCACCCGTTTACCCAATCGGGCCAATTCTATGCTCC AACGATCGTCCGAATTTGGATTTATCGGAACGAGACCGGATCTTGAAATGGCTCGATGAC CAACCCGAGTCATCTGTTGTGTTTCTCTGCTTCGGGAGCTTGAAGAGTCTCGCTGCGTCT CAGATTAAAGAGATCGCTCAAGCCTTAGAGCTCGTCGGAATCAGATTCGTCTGGTCGATT CGAACGGACCCGAAGGAGTACGCGAGCCCGAACGAGATTTTACCGGACGGGTTTATGAAC CGAGTCATGGGTTTGGGCCTTGTTTGTGGTTGGGCTCCTCAAGTTGAAATTCTGGCCCAT AAAGCAATTGGAGGGTTCGTGTCACACTGCGGTTGGAACTCGATATTGGAGAGTTTGCGT TTCGGAGTTCCAATTGCCACGTGGCCAATGTACGCGGAACAACAACTAAACGCGTTCACG ATTGTGAAGGAGCTTGGTTTGGCGTTGGAGATGCGGTTGGATTACGTGTCGGAATATGGA GAAATCGTGAAAGCTGATGAAATCGCAGGAGCCGTACGATCTTTGATGGACGGTGAGGAT GTGCCGAGGAGGAAACTGAAGGAGATTGCGGAGGCGGGAAAAGAGGCTGTGATGGACGGT GGATCTTCGTTTGTTGCGGTTAAAAGATTCATAGATGGGCTTTGA
>UGT71C3
ATGAAAGCAGAAGCAGAGATCATCTTCGTTACATATCCATCCCCTGGTCATCTTCTTGTC TCCATTGAATTCGCTAAATCTCTCATCAAACGTGATGATCGCATCCACACCATCACCATC CTCTACTGGGCTTTACCTCTCGCTCCTCAAGCCCACCTTTTCGCTAAGTCCCTCGTTGCT TCACAGCCTCGAATCCGTCTCCTTGCGTTGCCTGATGTTCAAAACCCTCCACCATTGGAA CTCTTCTTTAAAGCTCCCGAAGCTTATATTCTTGAGTCCACCAAGAAAACAGTTCCTTTA GTCAGAGACGCTCTCTCCACTCTAGTTTCTTCACGTAAAGAATCCGGTTCGGTTCGTGTA GTCGGTTTGGTTATCGATTTTTTTTGTGTTCCAATGATCGAAGTGGCAAACGAGCTTAAC CTTCCTTCTTACATCTTCCTAACGTGTAACGCTGGGTTTTTAAGTATGATGAAGTATCTC CCTGAGAGACATCGCATAACCACTTCTGAGCTAGATTTAAGCTCCGGCAACGTAGAACAT CCAATTCCTGGCTACGTCTGCTCCGTGCCGACGAAGGTTTTGCCTCCAGGTCTATTCGTG AGAGAGTCCTACGAGGCTTGGGTCGAGATTGCAGAGAAGTTCCCTGGAGCCAAGGGCATT TTGGTAAACTCAGTCACATGTCTTGAGCAGAATGCATTTGATTACTTCGCTCGTCTTGAT GAGAACTATCCTCCGGTTTACCCGGTCGGACCGGTTCTTAGTTTGAAGGATCGTCCGTCT CCAAATCTGGACGCATCGGACCGGGATCGGATCATGAGATGGCTCGAGGACCAGCCGGAG TCGTCAATTGTGTATATCTGCTTCGGAAGCCTCGGAATCATTGGCAAGCTGCAGATTGAA GAGATAGCTGAAGCCTTGGAACTCACCGGCCACAGGTTTCTTTGGTCAATACGTACAAAT CCGACGGAGAAAGCGAGCCCGTACGATCTGTTGCCGGAGGGATTTCTCGATCGGACGGCC AGTAAGGGATTGGTGTGTGATTGGGCCCCGCAAGTAGAAGTTCTGGCCCATAAAGCGCTC GGAGGATTCGTGTCTCACTGCGGTTGGAACTCTGTACTGGAGAGCTTATGGTTCGGTGTT CCGATCGCCACGTGGCCAATGTACGCTGAGCAACAGTTAAACGCATTCTCGATGGTGAAG GAGTTAGGGTTAGCCGTGGAGCTGCGTTTAGACTACGTTTCGGCGTACGGAGAGATAGTA AAAGCTGAGGAGATCGCGGGAGCCATACGATCATTGATGGACGGTGAGGATACGCCGAGG AAGAGAGTGAAGGAGATGGCGGAAGCGGCGAGGAATGCTTTGATGGACGGAGGATCTTCG TTTGTTGCGGTTAAACGATTTCTCGACGAGTTGATCGGCGGAGATGTTTAG
>UGT71C4
ATGGTGAAGGAAACAGAGCTAATCTTCATTCCAGTTCCATCCACAGGTCATATTCTCGTC CATATTGAATTCGCCAAGCGTCTCATCAATCTCGACCATCGGATCCACACCATCACTATT CTCAACTTATCCTCACCCTCTTCTCCTCACGCCTCCGTCTTCGCCAGATCTCTCATCGCT TCCCAGCCCAAAATCCGTCTCCACGACCTTCCCCCTAT.CCAAGATCCTCCTCCATTCGAT CTTTACCAAAGAGCTCCCGAAGCTTACATAGTAAAACTCATCAAGAAAAATACTCCTCTG ATAAAAGACGCCGTCTCCAGCATCGTCGCGTCGCGTCGTGGAGGCTCAGATTCGGTTCAA GTCGCCGGTTTGGTTCTCGATTTATTCTGCAATTCATTGGTAAAAGATGTTGGCAACGAG CTTAATCTTCCTTCTTACATATACCTTACGTGTAACGCTAGATACTTGGGGATGATGAAA TATATTCCGGATCGGCATCGGAAAATCGCATCTGAGTTCGATTTGAGCTCCGGCGATGAA GAATTGCCGGTTCCGGGATTCATAAACGCTATTCCGACGAAATTTATGCCGCCTGGATTG TTCAATAAGGAAGCTTACGAGGCTTACGTAGAGCTAGCGCCGAGATTCGCAGATGCGAAG GGTATTTTGGTTAATTCCTTCACGGAGCTTGAGCCGCACCCGTTTGACTATTTCTCTCAC CTGGAGAAATTCCCTCCGGTTTACCCGGTCGGACCGATTCTCAGCTTGAAAGATCGAGCG AGTCCGAACGAAGAAGCAGTCGATCGGGATCAGATCGTTGGGTGGCTCGATGATCAGCCG GAGTCATCGGTGGTGTTCCTCTGTTTCGGGAGCAGAGGAAGCGTTGATGAGCCGCAAGTG AAGGAGATAGCTCGAGCTTTGGAACTCGTCGGCTGCAGATTTCTTTGGTCAATTAGAACA AGCGGCGACGTCGAGACGAATCCTAACGATGTGTTGCCGGAGGGGTTCATGGGCCGAGTA GCAGGCCGAGGTTTGGTATGTGGTTGGGCTCCACAAGTGGAAGTGTTGGCCCATAAAGCA ATAGGAGGATTTGTGTCTCACTGTGGTTGGAACTCCACGCTTGAAAGCTTATGGTTCGGG GTTCCTGTCGCAACGTGGCCGATGTACGCAGAGCAACAGCTTAACGCCTTCACGCTGGTG AAAGAGCTTGGGCTTGCGGTGGACCTGCGGATGGATTACGTGTCGAGTCGTGGGGGTTTG GTGACTTGTGATGAGATAGCCAGAGCCGTACGATCTTTGATGGACGGTGGAGATGAGAAG AGAAAAAAGGTTAAGGAGATGGCTGATGCGGCAAGGAAGGCTTTGATGGATGGAGGATCG TCTTCTTTGGCAACTGCTCGATTCATCGCAGAATTGTTTGAAGATGGTTCGTCGTGCTAA
>UGT71C5
ATGAAGACAGCAGAGCTCATATTCGTTCCTCTGCCGGAGACCGGCCATCTCTTGTCAACG ATCGAGTTTGGAAAGCGTCTACTCAATCTAGACCGTCGGATTTCTATGATTACAATCCTC TCCATGAATCTTCCTTACGCTCCTCACGCCGACGCTTCTCTTGCTTCGCTAACAGCCTCC GAGCCTGGTATCCGAATCATCAGTCTCCCGGAGATCCACGATCCACCTCCGATCAAGCTT CTTGACACTTCCTCCGAGACTTACATCCTCGATTTCATCCATAAAAACATACCTTGTCTC AGAAAAACCATCCAAGATTTAGTCTCATCATCATCATCTTCCGGAGGTGGTAGTAGTCAT GTCGCCGGCTTGATTCTTGATTTCTTCTGCGTTGGTTTGATCGACATCGGCCGTGAGGTA AACCTTCCTTCCTATATCTTCATGACTTCCAACTTTGGTTTCTTAGGGGTTCTACAGTAT CTCCCGGAACGACAACGTTTGACTCCGTCGGAGTTCGATGAGAGCTCCGGCGAGGAAGAG TTACATATTCCGGCGTTTGTGAACCGTGTTCCCGCCAAGGTTCTGCCGCCAGGTGTGTTC GATAAACTCTCTTACGGGTCTCTGGTCAAAATCGGCGAGCGATTACATGAAGCCAAGGGT ATTTTGGTTAATTCATTTACCCAAGTGGAGCCTTATGCTGCTGAACATTTTTCTCAAGGA CGAGATTACCCTCACGTGTATCCTGTTGGGCCGGTTCTCAACTTAACGGGCCGTACAAAT CCGGGTCTAGCTTCGGCCCAATATAAAGAGATGATGAAGTGGCTTGACGAGCAACCAGAC TCGTCGGTTTTGTTCCTGTGTTTCGGGAGCATGGGAGTCTTCCCTGCACCTCAGATCACA GAGATTGCTCACGCGCTCGAGCTTATCGGGTGCAGGTTCATCTGGGCGATCCGTACGAAC ATGGCGGGAGATGGCGATCCTCAGGAGCCGCTTCCAGAAGGATTTGTCGATCGAACAATG GGCCGTGGAATTGTGTGTAGTTGGGCTCCACAAGTGGATATCTTGGCCCACAAGGCAACA GGTGGATTCGTTTCTCACTGCGGGTGGAATTCCGTCCAAGAGAGTCTATGGTACGGTGTA CCTATTGCAACGTGGCCAATGTATGCGGAGCAACAACTGAACGCATTTGAGATGGTGAAG GAGTTGGGCTTAGCAGTGGAGATAAGGCTTGACTACGTGGCGGATGGTGATAGGGTTACT TTGGAGATCGTGTCAGCCGATGAAATAGCCACAGCCGTCCGATCATTGATGGATAGTGAT AACCCCGTGAGAAAGAAGGTTATAGAAAAATCTTCAGTGGCGAGGAAAGCTGTTGGTGAT GGTGGGTCTTCTACGGTGGCCACATGTAATTTTATCAAAGATATTCTTGGGGATCACTTT TGA
>UGT71D1
ATGCGGAATGTAGAGCTCATCTTCATCCCCACACCAACCGTTGGTCATCTTGTTCCGTTT CTTGAATTTGCTAGGCGTCTCATTGAGCAAGATGATAGGATCCGTATCACAATCCTCTTG ATGAAACTACAAGGTCAGTCTCATCTAGACACTTATGTTAAATCAATTGCCTCCTCTCAA CCGTTTGTTAGATTCATTGATGTCCCTGAGTTAGAGGAGAAACCTACACTTGGTAGTACA CAATCTGTGGAAGCTTATGTGTATGATGTTATTGAGAGAAATATCCCTCTTGTGAGGAAT ATAGTCATGGATATTTTAACTTCTCTTGCATTGGATGGAGTTAAGGTCAAGGGATTAGTT GTTGACTTTTTCTGTCTCCCTATGATTGACGTTGCTAAAGATATAAGTCTCCCTTTCTAT GTGTTCTTGACTACAAATTCCGGGTTCTTAGCTATGATGCAGTATCTAGCAGATCGACAT AGTAGAGATACATCGGTTTTTGTAAGAAACTCGGAAGAAATGTTGTCGATACCTGGATTT GTAAACCCTGTCCCAGCCAATGTTCTGCCGTCAGCTCTGTTTGTTGAAGATGGTTATGAT GCTTACGTTAAGCTGGCCATATTGTTTACAAAGGCCAATGGAATCCTAGTGAATAGCTCC TTTGATATTGAGCCTTACTCTGTGAATCATTTTCTTCAAGAACAGAATTATCCTTCTGTT TATGCTGTTGGCCCCATATTTGACTTGAAAGCCCAGCCTCATCCAGAGCAGGACCTAACC CGTCGTGACGAGTTGATGAAATGGCTTGATGATCAACCCGAGGCATCGGTTGTATTCCTT TGTTTTGGGAGTATGGCAAGGTTAAGAGGTTCTCTAGTGAAGGAAATAGCTCATGGACTT GAGCTATGTCAATATAGATTCCTCTGGTCACTCCGTAAAGAAGAGGTGACAAAGGATGAT TTGCCAGAGGGGTTCCTTGACCGTGTCGATGGACGTGGAATGATATGTGGTTGGTCTCCT CAGGTAGAAATACTGGCCCATAAGGCAGTGGGAGGCTTTGTTTCTCACTGTGGATGGAAC TCAATAGTAGAGAGTTTGTGGTTTGGCGTGCCAATTGTGACATGGCCAATGTATGCAGAG CAACAACTCAATGCGTTTCTGATGGTGAAGGAACTGAAGCTAGCTGTGGAGCTGAAGCTT GATTACAGGGTACATAGTGATGAGATAGTAAACGCAAACGAGATAGAGACCGCTATTCGT TATGTAATGGACACGGATAATAATGTTGTGAGGAAACGAGTGATGGATATCTCGCAGATG ATCCAGAGAGCTACGAAGAATGGTGGATCTTCGTTTGCCGCAATTGAGAAATTCATATAT GACGTGATAGGAATTAAGCCCTAG
>UGT71D2
ATGAGGAATGCAGAGCTCATCTTCATCCCAACACCAACTGTTGGTCATCTTGTTCCGTTT CTTGAATTTGCTAGGCGTCTCATTGAGCAGGATGATAGAATCCGTATCACCTTCCTCTTG ATGAAGCAACAAGGTCAGTCTCATCTGGATTCCTATGTTAAGACAATTTCCTCGTCTCTG CCGTTTGTTAGATTTATTGATGTCCCTGAGTTAGAGGAGAAACCAACACTTGGTACACAG TCTGTGGAAGCCTATGTGTACGATTTTATTGAAACAAATGTCCCTCTTGTGCAAAATATA ATCATGGGTATCCTATCTTCTCCTGCATTTGATGGAGTTACGGTCAAGGGATTCGTTGCT GATTTTTTCTGTCTCCCGATGATTGATGTTGCAAAAGATGCAAGTCTTCCTTTTTATGTG TTCTTGACTTCAAATTCCGGATTCCTAGCTATGATGCAGTATCTGGCATATGGACATAAG AAAGATACCTCAGTTTTTGCAAGAAACTCTGAAGAAATGTTGTCAATTCCTGGATTTGTA AACCCTGTCCCAGCCAAAGTACTGCCGTCAGCTCTGTTTATTGAGGATGGTTATGATGCT GACGTTAAACTGGCTATATTGTTTACAAAGGCTAATGGAATCCTAGTGAATACCTCCTTT GATATTGAGCCTACCTCTCTGAATCATTTTCTTGGAGAAGAGAATTACCCTTCTGTTTAT GCTGTTGGCCCCATATTTAACCCGAAGGCCCATCCTCATCCAGATCAAGACCTCGCCTGT TGTGACGAGTCGATGAAATGGCTTGATGCTCAACCCGAGGCATCAGTTGTATTCCTTTGT TTTGGGAGTATGGGTAGCTTAAGAGGTCCTCTAGTGAAGGAAATAGCACATGGACTTGAG CTATGTCAGTATAGATTCCTCTGGTCACTCCGCACAGAAGAAGTGACAAATGATGATCTT TTGCCAGAGGGATTCATGGACCGTGTCAGTGGACGGGGAATGATATGCGGTTGGTCTCCT CAGGTGGAAATACTGGCCCATAAAGCAGTGGGAGGTTTTGTTTCTCATTGTGGATGGAAC TCAATAGTAGAGAGTTTATGGTTTGGTGTGCCAATTGTGACATGGCCAATGTATGCAGAG CAACAGCTCAATGCGTTTCTGATGGTGAAGGAACTGAAGCTCGCAGTGGAGCTGAAACTC GATTATAGTGTACATAGTGGTGAGATTGTAAGTGCAAACGAGATAGAGACAGCGATTTCT TGTGTAATGAACAAGGATAATAATGTTGTGAGGAAACGAGTGATGGATATCTCGCAGATG ATCCAGAGAGCTACGAAGAATGGTGGATCTTCGTTTGCCGCAATTGAGAAATTCATACAT GACGTGATAGGAACCAGGACTTAG
>UGT72B1
ATGGAGGAATCCAAAACACCTCACGTTGCGATCATACCAAGTCCGGGAATGGGTCATCTC ATACCACTCGTCGAGTTTGCTAAACGACTCGTCCATCTTCACGGCCTCACCGTTACCTTC GTCATCGCCGGCGAAGGTCCACCATCAAAAGCTCAGAGAACCGTCCTCGACTCTCTCCCT TCTTCAATCTCCTCCGTCTTTCTCCCTCCTGTTGATCTCACCGATCTCTCTTCGTCCACT CGCATCGAATCTCGGATCTCCCTCACCGTGACTCGTTCAAACCCGGAGCTCCGGAAAGTC TTCGACTCGTTCGTGGAGGGAGGTCGTTTGCCAACGGCGCTCGTCGTCGATCTCTTCGGT ACGGACGCTTTCGACGTGGCCGTAGAATTTCACGTGCCACCGTATATTTTCTACCCAACA ACGGCCAACGTCTTGTCGTTTTTTCTCCATTTGCCTAAACTAGACGAAACGGTGTCGTGT GAGTTCAGGGAATTAACCGAACCGCTTATGCTTCCTGGATGTGTACCGGTTGCCGGGAAA GATTTCCTTGACCCGGCCCAAGACCGGAAAGACGATGCATACAAATGGCTTCTCCATAAC ACCAAGAGGTACAAAGAAGCCGAAGGTATTCTTGTGAATACCTTCTTTGAGCTAGAGCCA AATGCTATAAAGGCCTTGCAAGAACCGGGTCTTGATAAACCACCGGTTTATCCGGTTGGA CCGTTGGTTAACATTGGTAAGCAAGAGGCTAAGCAAACCGAAGAGTCTGAATGTTTAAAG TGGTTGGATAACCAGCCGCTCGGTTCGGTTTTATATGTGTCCTTTGGTAGTGGCGGTACC CTCACATGTGAGCAGCTCAATGAGCTTGCTCTTGGTCTTGCAGATAGTGAGCAACGGTTT CTTTGGGTCATACGAAGTCCTAGTGGGATCGCTAATTCGTCGTATTTTGATTCACATAGC CAAACAGATCCATTGACATTTTTACCACCGGGATTTTTAGAGCGGACTAAAAAAAGAGGT TTTGTGATCCCTTTTTGGGCTCCACAAGCCCAAGTCTTGGCGCATCCATCCACGGGAGGA TTTTTAACTCATTGTGGATGGAATTCGACTCTAGAGAGTGTAGTAAGCGGTATTCCACTT ATAGCATGGCCATTATACGCAGAACAGAAGATGAATGCGGTTTTGTTGAGTGAAGATATT CGTGCGGCACTTAGGCCGCGTGCCGGGGACGATGGGTTAGTTAGAAGAGAAGAGGTGGCT AGAGTGGTAAAAGGATTGATGGAAGGTGAAGAAGGCAAAGGAGTGAGGAACAAGATGAAG GAGTTGAAGGAAGCAGCTTGTAGGGTGTTGAAGGATGATGGGACTTCGACAAAAGCACTT AGTCTTGTGGCCTTAAAGTGGAAAGCCCACAAAAAAGAGTTAGAGCAAAATGGCAACCAC TAA
>UGT72B2
ATGCAAAAAATGGCAGATGGAAACACTCCACATGTAGCAATCATACCAAGTCCCGGTATA GGTCACCTCATCCCACTCGTCGAGTTAGCAAAGCGACTCCTTGACAATCACGGTTTCACC GTCACTTTCATCATCCCCGGCGATTCTCCTCCGTCTAAGGCTCAAAGATCCGTTCTCAAC TCTCTCCCTTCCTCCATAGCCTCCGTCTTCCTCCCTCCCGCCGATCTTTCCGACGTTCCT TCGACAGCTCGAATCGAAACTCGGATATCGCTCACCGTGACTCGTTCCAACCCGGCGCTC CGGGAGCTTTTTGGCTCGTTATCGGCGGAGAAACGTCTCCCGGCGGTTCTCGTCGTCGAT CTATTTGGTACGGATGCGTTCGACGTGGCTGCTGAGTTCCACGTGTCGCCATACATTTTC TATGCATCAAATGCCAACGTCCTCACGTTTCTGCTTCACTTGCCGAAGCTAGACGAAACG GTGTCGTGTGAGTTTAGGGAATTAACCGAACCGGTTATTATTCCCGGTTGTGTCCCCATA ACCGGTAAGGATTTCGTCGATCCGTGTCAAGACCGAAAAGATGAATCATACAAATGGCTT CTACACAACGTCAAGAGATTCAAAGAAGCTGAAGGGATTCTAGTGAATTCCTTCGTCGAT TTAGAGCCAAACACTATAAAGATTGTACAAGAACCGGCTCCTGATAAACCACCGGTTTAC CTGATTGGGCCGTTGGTTAACTCGGGTTCACACGATGCTGACGTGAACGATGAGTACAAA TGTTTAAATTGGCTAGACAACCAACCATTCGGGTCGGTTCTATACGTATCCTTTGGAAGC GGCGGAACACTCACGTTTGAGCAGTTCATTGAGCTGGCTCTTGGCCTAGCGGAGAGTGGA AAACGGTTTCTTTGGGTCATACGAAGTCCGAGTGGGATAGCTAGTTCATCGTATTTCAAT CCACAAAGCCGAAATGATCCATTTTCGTTTTTACCACAAGGCTTCTTAGACCGAACCAAA GAAAAAGGTCTAGTGGTTGGGTCATGGGCTCCACAGGCTCAAATTCTGACTCATACATCT ATAGGTGGATTTTTAACTCATTGTGGATGGAATTCGAGTCTAGAAAGTATTGTAAACGGT GTACCGCTCATAGCATGGCCGTTATACGCGGAGCAAAAGATGAACGCATTGCTACTCGTG GATGTTGGTGCGGCTCTAAGAGCACGACTGGGTGAAGACGGGGTCGTAGGAAGGGAAGAA GTGGCGAGAGTGGTAAAAGGATTGATAGAAGGAGAAGAAGGGAATGCGGTAAGGAAAAAA ATGAAAGAGTTGAAAGAAGGATCTGTTAGAGTCTTAAGGGACGATGGATTCTCTACCAAA TCGCTTAATGAAGTTTCGTTGAAGTGGAAAGCCCACCAACGAAAGATCGACCAAGAACAG GAATCATTTCTATGA
>UGT72B3
ATGAGCATAGATATTTTTCAAGAAATAAGAATAAAGAAAATTCTACTCTTAATGGCGGAA GCAAACACTCCACACATAGCAATCATGCCGAGTCCCGGTATGGGTCACCTTATCCCATTC GTCGAGTTAGCAAAGCGACTCGTTCAGCACGACTGTTTCACCGTCACAATGATCATCTCC GGTGAAACTTCGCCGTCTAAGGCACAAAGATCCGTTCTCAACTCTCTCCCTTCCTCCATA GCCTCCGTATTTCTCCCTCCCGCCGATCTTTCCGATGTTCCCTCCACAGCGCGAATCGAA ACTCGGGCCATGCTCACCATGACTCGTTCCAATCCGGCGCTCCGGGAGCTTTTTGGCTCT TTATCAACGAAGAAAAGTCTCCCGGCGGTTCTCGTCGTCGATATGTTTGGTGCGGATGCG TTCGACGTGGCCGTTGACTTCCACGTGTCACCATACATTTTCTATGCATCCAATGCAAAC GTCTTGTCGTTTTTTCTTCACTTGCCGAAACTAGACAAAACGGTGTCGTGTGAGTTTAGG TACTTAACCGAACCGCTTAAGATTCCCGGCTGTGTCCCGATAACCGGTAAGGACTTTCTT GATACGGTTCAAGACCGAAACGACGACGCATACAAATTGCTTCTCCATAACACCAAGAGG TACAAAGAAGCTAAAGGGATTCTAGTGAATTCCTTCGTTGATTTAGAGTCGAATGCAATA AAGGCCTTACAAGAACCGGCTCCTGATAAACCAACGGTATACCCGATTGGGCCGCTGGTT AACACAAGTTCATCTAATGTTAACTTGGAAGACAAGTTCGGATGTTTAAGTTGGCTAGAC AACCAACCATTCGGCTCGGTTCTATACATATCATTTGGAAGCGGCGGAACACTTACATGT GAGCAGTTTAATGAGCTTGCTATTGGTCTTGCGGAGAGCGGAAAACGGTTTATTTGGGTC ATACGAAGTCCAAGCGAGATAGTTAGTTCGTCGTATTTCAATCCACACAGCGAGACAGAC CCCTTTTCGTTTTTACCAATTGGGTTCTTAGACCGAACCAAAGAGAAAGGTTTGGTGGTT CCATCATGGGCTCCACAGGTTCAAATCCTGGCTCATCCATCCACATGCGGGTTTTTAACA CACTGTGGATGGAATTCGACCTTAGAAAGCATTGTAAACGGTGTACCACTCATAGCGTGG CCTTTATTCGCGGAGCAAAAGATGAATACATTGCTACTCGTGGAGGATGTTGGAGCGGCT CTAAGAATCCATGCGGGTGAAGATGGGATTGTACGGAGGGAAGAAGTGGTGAGAGTGGTG AAGGCACTGATGGAAGGTGAAGAGGGAAAAGCCATAGGAAATAAAGTGAAGGAGTTGAAA GAAGGAGTTGTTAGAGTCTTGGGTGACGATGGATTGTCCAGCAAGTCATTTGGTGAAGTT TTGTTAAAGTGGAAAACGCACCAGCGAGATATCAACCAAGAGACGTCCCACTAA >UGT72C1
ATGGAACTTCACGGAGCTCTAGTGGCTAGTCCGGGCATGGGACATGCCGTACCCATCTTA GAACTCGGTAAACATCTCCTGAACCACCACGGGTTCGACCGTGTCACTGTCTTCCTAGTC ACAGACGATGTCTCACGTTCGAAATCCCTAATTGGAAAAACGTTGATGGAAGAAGATCCA AAATTTGTGATCAGGTTTATTCCACTCGATGTTTCGGGTCAAGATCTGAGTGGTTCACTA TTGACTAAACTAGCAGAGATGATGAGGAAGGCATTACCAGAGATCAAGTCTTCAGTCATG GAGTTAGAACCGCGGCCTAGGGTTTTCGTAGTTGACTTGTTGGGCACGGAAGCTTTAGAG GTGGCTAAGGAGCTTGGGATCATGAGAAAACATGTTCTGGTTACTACCAGTGCTTGGTTT CTAGCTTTTACGGTTTATATGGCGAGTCTTGACAAACAGGAGTTGTATAAGCAGTTGAGT AGCATAGGAGCATTGCTTATACCCGGATGCAGCCCGGTTAAGTTTGAGCGGGCTCAAGAT CCGAGAAAATATATTCGGGAACTCGCTGAGTCTCAGCGTATTGGGGATGAGGTGATAACC GCAGATGGGGTGTTTGTGAATACGTGGCACAGTCTGGAGCAAGTGACCATCGGGTCTTTC TTGGATCCAGAGAATCTCGGTCGGGTTATGAGAGGAGTGCCGGTTTATCCTGTTGGACCG CTGGTTAGACCAGCAGAACCAGGTTTGAAACATGGCGTGCTGGACTGGCTTGACTTACAA CCCAAAGAGTCAGTGGTTTATGTTCTTTTGGGAGTGGTGGGGGCACTAACCTTCGAGCAG ACAAACGAGCTGGCTTACGGTTTGGAGCTGACTGGCCACAGATTTGTTTGGGTAGTCAGA CCACCGGCTGAAGACGACCCATCGGCATCAATGTTCGACAAGACCAAGAATGAGACAGAA CCTCTCGATTTCTTACCCAACGGGTTTCTAGACCGAACCAAAGACATCGGTTTGGTGGTC CGTACATGGGCACCACAAGAAGAGATTCTGGCACACAAGTCAACAGGAGGGTTTGTGACT CACTGCGGATGGAACTCAGTTTTGGAGAGTATTGTGAATGGTGTGCCAATGGTAGCTTGG CCGTTGTACTCAGAGCAGAAGATGAACGCGAGGATGGTTTCTGGGGAGCTAAAGATTGCG TTGCAGATTAATGTTGCAGATGGGATTGTAAAGAAGGAGGTGATAGCTGAAATGGTGAAG AGAGTGATGGATGAAGAAGAAGGAAAAGAGATGAGAAAGAATGTTAAGGAACTGAAGAAG ACAGCAGAAGAAGCTCTCAACATGACTCACATTCCATCTGCTTACTTCACCTAA
>UGT72D1
ATGGACCAGCCTCACGCGCTTCTAGTGGCTAGCCCTGGCTTGGGTCACCTCATCCCTATC CTGGAGCTCGGCAACCGTCTCTCCTCCGTCCTAAACATCCACGTCACCATTCTCGCGGTC ACCTCCGGCTCCTCTTCACCGACAGAAACCGAAGCCATACATGCAGCCGCGGCTAGAACA ATCTGTCAAATTACGGAAATTCCCTCGGTGGATGTAGACAACCTCGTGGAGCCAGATGCT ACAATTTTCACTAAGATGGTGGTGAAGATGCGAGCCATGAAGCCCGCGGTACGAGATGCC GTGAAATTAATGAAACGAAAACCAACGGTCATGATTGTTGACTTTTTGGGTACGGAACTG ATGTCCGTAGCCGATGACGTAGGCATGACGGCTAAATACGTTTACGTTCCAACTCATGCG TGGTTCTTGGCAGTCATGGTGTACTTGCCGGTGTTAGATACGGTAGTGGAAGGTGAGTAT GTTGATATTAAGGAGCCTTTGAAGATACCGGGTTGTAAACCGGTCGGACCGAAGGAGCTG ATGGAAACGATGTTAGACCGGTCGGGCCAGCAATATAAAGAGTGTGTACGAGCTGGCTTA GAGGTACCTATGAGCGATGGTGTTTTGGTAAATACTTGGGAGGAGTTACAAGGAAACACT CTCGCTGCGCTTAGAGAGGACGAAGAATTGAGCCGGGTCATGAAAGTACCGGTTTATCCT ATTGGGCCAATTGTTAGGACTAACCAGCATGTAGACAAACCCAATAGTATATTCGAGTGG CTAGACGAGCAACGGGAAAGGTCAGTGGTGTTTGTGTGTTTAGGGAGCGGTGGAACGTTG ACGTTTGAGCAAACAGTGGAACTCGCTTTGGGTTTAGAGTTAAGTGGTCAAAGGTTCGTT TGGGTTCTACGTAGGCCCGCTTCATATCTCGGGGCGATCTCCAGCGATGATGAACAGGTA AGTGCCAGTCTACCTGAAGGTTTCTTGGACCGCACGCGTGGTGTGGGGATTGTGGTTACG CAATGGGCACCACAAGTTGAGATCTTGAGCCATAGATCGATCGGTGGGTTCTTGTCTCAC TGCGGTTGGAGTTCGGCTTTGGAAAGTTTGACTAAAGGAGTTCCGATCATCGCTTGGCCT CTTTATGCGGAGCAGTGGATGAATGCCACGTTATTGACTGAGGAGATCGGTGTGGCCGTT CGTACATCGGAGTTACCGTCGGAGAGAGTCATCGGAAGGGAAGAAGTGGCATCTCTGGTG AGAAAGATTATGGCGGAAGAGGATGAAGAAGGACAGAAAATTAGGGCTAAAGCTGAGGAG GTGAGGGTTAGCTCCGAACGAGCTTGGAGTAAAGACGGGTCATCTTATAATTCTCTATTC GAATGGGCAAAACGATGTTATCTTGTACCGTGA
>UGT72E1 ATGAAGATTACAAAACCACATGTGGCCATGTTCGCTAGCCCCGGAATGGGCCACATCATC CCGGTGATCGAGCTCGGAAAACGCTTAGCTGGTTCCCACGGCTTCGATGTCACCATTTTC GTCCTTGAAACCGACGCAGCCTCAGCTCAATCTCAATTCCTTAACTCACCAGGCTGCGAC GCGGCCCTTGTTGATATCGTTGGCCTCCCAACGCCCGATATCTCCGGTTTAGTCGACCCA TCAGCCTTTTTTGGGATCAAGCTCTTGGTCATGATGCGTGAGACCATTCCTACCATCCGG TCAAAGATAGAGGAGATGCAACACAAACCAACGGCTCTGATCGTAGACTTGTTTGGTTTG GACGCGATACCGCTCGGTGGTGAGTTCAACATGTTGACTTATATCTTCATCGCTTCAAAC GCACGTTTTCTCGCGGTGGCTTTGTTTTTCCCAACGTTGGACAAAGACATGGAAGAAGAG CACATAATCAAGAAGCAACCTATGGTTATGCCTGGATGTGAACCGGTTCGGTTTGAAGAT ACACTTGAAACATTCCTTGACCCAAACAGCCAACTCTACCGGGAATTTGTTCCTTTCGGT TCGGTTTTCCCAACGTGTGATGGTATTATTGTGAATACATGGGATGATATGGAGCCCAAA ACTTTGAAATCTCTTCAAGACCCAAAGCTCTTGGGTCGAATTGCTGGTGTACCGGTTTAT CCAATTGGTCCTTTGTCTAGACCGGTTGATCCATCTAAAACTAATCATCCGGTTTTGGAT TGGTTAAACAAACAGCCGGACGAGTCGGTACTTTACATTTCATTTGGAAGCGGTGGCTCT CTCTCGGCTAAACAACTAACCGAATTGGCTTGGGGACTTGAGATGAGTCAGCAACGGTTC GTTTGGGTGGTTCGACCCCCGGTGGACGGTTCAGCTTGCAGTGCATATTTATCCGCTAAC AGTGGTAAAATACGAGACGGTACACCTGATTATCTCCCGGAAGGTTTTGTTAGCCGGACT CATGAGAGAGGCTTTATGGTCTCTTCTTGGGCTCCCCAAGCGGAGATCTTGGCCCACCAA GCCGTAGGTGGGTTTCTAACTCACTGCGGTTGGAATTCGATTCTCGAGAGCGTCGTTGGT GGCGTTCCGATGATCGCGTGGCCACTTTTTGCGGAGCAGATGATGAACGCGACACTCCTC AACGAAGAGCTTGGCGTTGCCGTCCGCTCTAAGAAACTACCGTCGGAGGGAGTGATTACG AGGGCGGAGATCGAGGCGTTGGTGAGAAAGATCATGGTGGAGGAGGAAGGTGCTGAGATG AGAAAGAAGATAAAGAAGCTGAAAGAGACCGCTGCCGAATCGCTGAGTTGCGACGGTGGA GTGGCGCATGAATCGTTGTCAAGAATCGCCGACGAGAGCGAGCATCTTTTGGAGCGTGTC AGGTGCATGGCACGTGGTGCCTAG
>UGT72E2
ATGCATATCACAAAACCACACGCCGCCATGTTTTCCAGTCCCGGAATGGGCCATGTCATC CCGGTGATCGAGCTTGGAAAGCGTCTCTCCGCTAACAACGGCTTCCACGTCACCGTCTTC GTCCTCGAAACCGACGCAGCCTCCGCTCAATCCAAGTTCCTAAACTCAACCGGCGTCGAC ATCGTCAAACTTCCATCGCCGGACATTTATGGTTTAGTGGACCCCGACGACCATGTAGTG ACCAAGATCGGAGTCATTATGCGTGCAGCAGTTCCAGCCCTCCGATCCAAGATCGCTGCC ATGCATCAAAAGCCAACGGCTCTGATCGTTGACTTGTTTGGCACAGATGCGTTATGTCTC GCAAAGGAATTTAACATGTTGAGTTATGTGTTTATCCCTACCAACGCACGTTTTCTCGGA GTTTCGATTTATTATCCAAATTTGGACAAAGATATCAAGGAAGAGCACACAGTGCAAAGA AACCCACTCGCTATACCGGGGTGTGAACCGGTTAGGTTCGAAGATACTCTGGATGCATAT CTGGTTCCCGACGAACCGGTGTACCGGGATTTTGTTCGTCATGGTCTGGCTTACCCAAAA GCCGATGGAATTTTGGTAAATACATGGGAAGAGATGGAGCCCAAATCATTGAAGTCCCTT CTAAACCCAAAGCTCTTGGGCCGGGTTGCTCGTGTACCGGTCTATCCAATCGGTCCCTTA TGCAGACCGATACAATCATCCGAAACCGATCACCCGGTTTTGGATTGGTTAAACGAACAA CCGAACGAGTCGGTTCTCTATATCTCCTTCGGGAGTGGTGGTTGTCTATCGGCGAAACAG TTAACTGAATTGGCGTGGGGACTCGAGCAGAGCCAGCAACGGTTCGTATGGGTGGTTCGA CCACCGGTCGACGGTTCGTGTTGTAGCGAGTATGTCTCGGCTAACGGTGGTGGAACCGAA GACAACACGCCAGAGTATCTACCGGAAGGGTTCGTGAGTCGTACTAGTGATAGAGGTTTC GTGGTCCCCTCATGGGCCCCACAAGCTGAAATCCTGTCCCATCGGGCCGTTGGTGGGTTT TTGACCCATTGCGGTTGGAGCTCGACGTTGGAAAGCGTCGTTGGCGGCGTTCCGATGATC GCATGGCCACTTTTTGCCGAGCAGAATATGAATGCGGCGTTGCTCAGCGACGAACTGGGA ATCGCAGTCAGATTGGATGATCCAAAGGAGGATATTTCTAGGTGGAAGATTGAGGCGTTG GTGAGGAAGGTTATGACTGAGAAGGAAGGTGAAGCGATGAGAAGGAAAGTGAAGAAGTTG AGAGACTCGGCGGAGATGTCACTGAGCATTGACGGTGGTGGTTTGGCGCACGAGTCGCTT TGCAGAGTCACCAAGGAGTGTCAACGGTTTTTGGAACGTGTCGTGGACTTGTCACGTGGT GCTTAG >UGT72E3
ATGCATATCACAAAACCACACGCCGCCATGTTTTCCAGTCCCGGAATGGGCCATGTCCTC CCGGTGATCGAGCTAGCTAAGCGTCTCTCCGCTAACCACGGCTTCCACGTCACCGTCTTC GTCCTTGAAACTGACGCAGCCTCCGTTCAGTCCAAGCTCCTTAACTCAACCGGTGTTGAC ATCGTCAACCTTCCATCGCCCGACATTTCTGGCTTGGTAGACCCCAACGCCCATGTGGTG ACCAAGATCGGAGTCATTATGCGTGAAGCTGTTCCAACCCTCCGATCCAAGATCGTTGCC ATGCATCAAAACCCAACGGCTCTGATCATTGACTTGTTTGGCACAGATGCGTTATGTCTT GCAGCGGAGTTAAACATGTTGACTTATGTCTTTATCGCTTCCAACGCGCGTTATCTCGGA GTTTCGATATATTATCCAACTTTGGACGAAGTTATCAAAGAAGAGCACACAGTGCAACGA AAACCGCTCACTATACCGGGGTGTGAACCGGTTAGATTTGAAGATATTATGGATGCATAT CTGGTTCCGGACGAACCGGTGTACCACGATTTGGTTCGTCACTGTCTGGCCTACCCAAAA GCGGATGGAATCTTGGTGAATACATGGGAAGAGATGGAGCCCAAATCATTAAAGTCCCTT CAAGACCCGAAACTTTTGGGCCGGGTCGCTCGTGTACCGGTTTATCCGGTTGGTCCGTTA TGCAGACCGATACAATCATCCACGACCGATCACCCGGTTTTTGATTGGTTAAACAAACAA CCAAACGAGTCGGTTCTCTACATTTCCTTCGGGAGTGGTGGTTCTCTAACGGCTCAACAG TTAACCGAATTGGCGTGGGGGCTCGAGGAGAGCCAGCAACGGTTTATATGGGTGGTTCGA CCGCCCGTTGACGGCTCGTCTTGCAGTGATTATTTCTCGGCTAAAGGCGGTGTAACCAAA GACAACACGCCAGAGTATCTACCAGAAGGGTTCGTGACTCGTACTTGCGATAGAGGTTTC ATGATCCCATCATGGGCACCGCAAGCTGAAATCCTAGCCCATCAGGCCGTTGGTGGGTTT TTAACACATTGTGGTTGGAGCTCGACGTTGGAAAGCGTCCTTTGCGGCGTTCCAATGATA GCGTGGCCGCTTTTCGCCGAGCAGAATATGAACGCGGCGTTGCTTAGCGATGAACTGGGA ATCTCTGTTAGAGTGGATGATCCAAAGGAGGCGATTTCTAGGTCGAAGATTGAGGCGATG GTGAGGAAGGTTATGGCTGAGGACGAAGGTGAAGAGATGAGAAGGAAAGTGAAGAAGTTG AGAGACACGGCGGAGATGTCACTTAGTATTCACGGTGGTGGTTCGGCGCATGAGTCGCTT TGCAGAGTCACGAAGGAGTGTCAACGGTTTTTGGAATGTGTCGGGGACTTGGGACGTGGT GCTTAG
>UGT73B1
ATGGGAACTCCTGTCGAAGTCTCTAAGCTCCATTTCTTGCTCTTCCCTTTCATGGCTCAT GGCCATATGATACCAACTCTAGACATGGCTAAGCTCTTTGCCACCAAAGGAGCTAAATCC ACTATCCTCACTACACCTCTCAATGCCAAGCTCTTCTTCGAGAAACCCATCAAATCATTC AACCAAGACAACCCGGGACTCGAAGACATCACCATCCAGATCCTTAATTTCCCTTGCACA GAGCTTGGTTTGCCTGATGGCTGTGAGAATACTGATTTCATCTTCTCCACACCTGACCTA AACGTAGGTGACTTGAGTCAAAAGTTTTTACTCGCAATGAAATATTTCGAAGAGCCACTA GAGGAGCTCCTCGTGACAATGAGACCAGACTGTCTTGTCGGTAACATGTTCTTCCCTTGG TCCACTAAAGTTGCTGAGAAGTTCGGAGTACCGAGACTTGTGTTCCACGGCACAGGCTAC TTCTCTTTATGTGCTTCTCATTGCATAAGGCTCCCTAAGAATGTGGCAACAAGTTCTGAG CCCTTTGTGATTCCTGATCTCCCGGGAGACATTTTGATTACAGAGGAACAGGTCATGGAG ACAGAAGAAGAGTCTGTAATGGGGAGGTTTATGAAGGCAATAAGAGACTCAGAGAGAGAT AGCTTTGGCGTGTTGGTGAACAGCTTCTACGAGCTTGAACAGGCTTACTCAGATTATTTC AAGAGCTTTGTGGCGAAAAGAGCGTGGCATATCGGTCCGCTTTCCTTAGGAAATAGAAAG TTCGAGGAGAAAGCAGAAAGAGGCAAAAAGGCAAGCATTGATGAGCATGAATGTTTGAAA TGGCTCGACTCCAAGAAATGTGATTCAGTGATTTACATGGCCTTTGGAACCATGTCTAGC TTTAAAAACGAGCAGCTGATAGAGATTGCAGCTGGTTTAGATATGTCAGGACATGATTTT GTCTGGGTGGTTAACAGAAAAGGCAGCCAAGTTGAGAAGGAAGATTGGTTACCAGAGGGG TTTGAAGAGAAGACCAAGGGAAAAGGATTGATAATCCGAGGGTGGGCGCCACAAGTGCTG ATACTTGAGCACAAAGCAATTGGCGGATTTTTGACGCATTGTGGATGGAACTCGTTATTA GAAGGGGTGGCAGCGGGCCTGCCAATGGTGACATGGCCCGTGGGAGCCGAGCAGTTCTAC AACGAGAAATTGGTGACACAAGTGTTGAAAACAGGAGTGAGTGTGGGAGTGAAGAAGATG ATGCAAGTAGTTGGAGACTTCATTAGCAGAGAGAAAGTGGAGGGAGCGGTGAGGGAAGTG ATGGTTGGAGAAGAGAGGAGGAAACGGGCCAAGGAGTTAGCAGAAATGGCGAAAAATGCG GTGAAAGAAGGAGGATCTTCAGATCTAGAGGTAGATAGGTTGATGGAAGAGCTTACGTTA GTTAAACTGCAAAAAGAGAAGGTATAA
>UGT73B2
ATGGGTAGTGATCATCATCATCGAAAGCTCCACGTTATGTTCTTCCCTTTCATGGCTTAT GGTCACATGATACCAACTCTAGACATGGCTAAGCTTTTCTCTAGCAGAGGAGCCAAATCC ACAATCCTCACCACATCTCTCAACTCCAAGATCCTCCAAAAACCCATCGACACATTCAAG AATCTGAATCCGGGTCTCGAAATCGACATCCAGATCTTCAATTTCCCTTGCGTGGAGCTG GGGTTACCAGAAGGATGTGAAAACGTTGATTTCTTCACTTCAAACAACAATGATGATAAA AACGAGATGATCGTGAAATTCTTTTTCTCGACAAGGTTTTTCAAAGACCAGCTTGAGAAA CTCCTCGGGACAACGAGACCAGACTGTCTTATCGCCGACATGTTCTTCCCCTGGGCTACT GAAGCTGCTGGGAAGTTCAATGTGCCAAGACTTGTGTTCCACGGCACTGGCTACTTCTCT TTATGCGCTGGTTATTGCATCGGAGTGCATAAACCACAGAAGAGAGTGGCTTCAAGCTCT GAGCCATTTGTGATTCCCGAGCTCCCTGGGAACATTGTGATAACTGAAGAACAGATCATA GATGGCGATGGAGAATCCGACATGGGAAAGTTTATGACTGAAGTTAGGGAATCGGAAGTG AAGAGCTCAGGAGTTGTTTTGAATAGTTTCTACGAGCTAGAACATGATTACGCCGATTTT TACAAAAGTTGTGTACAAAAGAGAGCGTGGCATATCGGTCCGCTATCGGTTTACAACAGG GGATTTGAGGAGAAGGCTGAGAGAGGAAAGAAAGCGAACATTGATGAGGCTGAATGCCTC AAATGGCTTGACTCCAAGAAACCAAATTCAGTCATTTATGTTTCCTTTGGGAGCGTGGCT TTCTTCAAGAATGAACAGTTATTCGAGATCGCTGCAGGGTTAGAAGCTTCCGGTACAAGT TTCATTTGGGTTGTTAGGAAAACCAAAGTGATAGAGAAGAATGGTTACCAGAAGGGTTCG AAGAGAGGGTGAAAGGGAAAGGTATGATAATAAGAGGATGGGCAGCACAGGTGCTGATAC TTGACCACCAAGCAACCGGTGGGTTTGTGACCCATTGCGGCTGGAACTCGCTTCTTGAAG GAGTGGCTGCAGGGCTACCAATGGTGACATGGCCTGTAGGAGCGGAGCAATTCTACAATG AGAAATTGGTTACGCAAGTGCTCAGAACAGGAGTGAGCGTGGGAGCGAGCAAGCATATGA AAGTTATGATGGGAGATTTCATTAGCAGAGAGAAAGTGGATAAAGCGGTGAGGGAGGTTT TGGCTGGGGAAGCAGCAGAGGAGAGGCGGAGACGGGCAAAGAAGCTAGCGGCGATGGCTA AAGCTGCCGTGGAAGAAGGAGGGTCTTCCTTCAACGATCTAAACAGCTTCATGGAAGAGT TTAGTTCATAA
>UGT73B3
ATGAGTAGTGATCCTCATCGTAAGCTCCATGTTGTGTTCTTCCCTTTCATGGCTTATGGT CACATGATACCAACTCTAGACATGGCTAAGCTTTTCTCTAGCAGAGGAGCCAAATCTACA ATCCTCACCACACCTCTCAACTCCAAGATCTTCCAAAAACCCATCGAAAGATTCAAGAAC CTGAATCCGAGTTTCGAAATCGACATCCAGATCTTCGATTTCCCTTGCGTGGATCTCGGG TTACCAGAAGGATGCGAAAACGTCGATTTCTTCACCTCAAACAACAATGATGATAGACAG TATCTGACCTTGAAGTTCTTTAAGTCGACAAGGTTTTTCAAAGATCAGCTTGAGAAGCTC CTCGAGACAACGAGACCAGACTGTCTTATCGCCGACATGTTCTTCCCCTGGGCTACGGAA GCTGCTGAGAAGTTCAATGTGCCAAGACTTGTGTTCCACGGTACTGGCTACTTTTCTTTA TGCTCTGAATATTGCATCAGAGTGCATAACCCACAAAACATAGTAGCTTCAAGGTACGAG CCATTTGTGATTCCTGATCTCCCGGGGAACATAGTGATAACTCAAGAACAGATAGCAGAC CGTGACGAAGAAAGCGAGATGGGGAAGTTTATGATTGAGGTCAAAGAATCTGATGTGAAG AGCTCAGGTGTTATTGTAAACAGCTTCTACGAGCTTGAACCTGATTACGCCGACTTTTAC AAGAGTGTTGTACTGAAGAGAGCGTGGCATATCGGTCCGCTTTCGGTTTACAACAGAGGA TTTGAGGAGAAGGCTGAGAGAGGAAAGAAAGCAAGCATTAATGAGGTTGAATGCCTCAAA TGGCTTGACTCCAAGAAACCAGATTCAGTCATTTACATTTCTTTTGGGAGCGTGGCTTGC TTCAAGAACGAGCAGCTATTCGAGATCGCTGCAGGATTAGAAACTTCTGGAGCAAATTTC ATCTGGGTTGTTAGGAAAAACATAGGTATTGAAAAAGAAGAATGGTTACCAGAAGGGTTC GAAGAGAGGGTGAAAGGAAAAGGGATGATTATAAGAGGATGGGCACCACAGGTGCTCATA CTTGATCATCAAGCAACTTGTGGGTTTGTGACCCATTGCGGCTGGAACTCGCTTCTGGAA GGAGTGGCTGCAGGGCTACCAATGGTGACATGGCCTGTAGCAGCGGAGCAATTCTACAAT GAGAAATTGGTTACGCAAGTGCTCAGAACAGGAGTGAGCGTGGGAGCGAAAAAGAATGTA | AGAACTACGGGAGATTTCATTAGCAGAGAGAAAGTGGTTAAAGCGGTGAGGGAGGTGTTG GTTGGGGAAGAGGCGGATGAGAGGCGGGAGAGGGCAAAGAAGTTGGCAGAGATGGCTAAA GCTGCCGTGGAAGGAGGGTCTTCTTTCAACGATCTAAACAGCTTCATAGAAGAGTTTACC TCGTAA
>UGT73B4
ATGAACAGAGAGCAAATTCATATTTTGTTCTTCCCCTTCATGGCTCATGGCCACATGATT CCACTCTTAGACATGGCCAAGCTTTTCGCTAGAAGAGGAGCCAAATCAACTCTCCTCACA ACCCCAATAAATGCTAAGATCTTGGAGAAACCCATTGAAGCATTCAAAGTTCAAAATCCT GATCTCGAAATCGGAATCAAGATCCTCAATTTCCCTTGTGTAGAGCTTGGATTGCCAGAA GGATGCGAGAACCGTGACTTCATTAACTCATACCAAAAATCTGACTCATTTGACTTGTTC TTGAAGTTTCTTTTCTCTACCAAGTATATGAAACAGCAGTTGGAGAGTTTCATTGAAACA ACCAAACCGAGTGCTCTTGTAGCCGATATGTTCTTCCCTTGGGCAACAGAATCCGCGGAG AAGATCGGTGTTCCAAGACTTGTGTTCCACGGCACATCATCCTTTGCCTTGTGTTGTTCG TATAACATGAGGATTCATAAGCCACACAAGAAAGTCGCTTCGAGTTCTACTCCATTTGTA ATCCCTGGTCTCCCTGGAGACATAGTTATTACAGAAGACCAAGCCAATGTCACCAACGAA GAAACTCCATTCGGAAAGTTTTGGAAAGAAGTCAGGGAATCAGAGACCAGTAGCTTTGGT GTTTTGGTGAATAGCTTCTACGAGCTGGAATCATCTTATGCTGATTTTTACCGTAGTTTT GTGGCGAAAAAAGCGTGGCATATAGGTCCACTTTCACTATCCAACAGAGGGATTGCAGAG AAAGCCGGAAGAGGGAAAAAGGCAAACATTGATGAGCAAGAATGCCTCAAATGGCTTGAC TCTAAGACACCTGGCTCAGTAGTTTACTTGTCCTTTGGTAGCGGAACCGGCTTACCCAAC GAACAGCTGTTAGAGATTGCTTTCGGCCTTGAAGGCTCTGGACAAAATTTCATTTGGGTG GTTAGCAAAAATGAAAACCAAGGTGAAAATGAAGATTGGTTGCCTAAAGGGTTTGAAGAG AGGAATAAAGGAAAAGGGCTGATAATACGCGGATGGGCCCCGCAAGTGCTGATACTTGAC CACAAAGCAATCGGAGGATTTGTGACGCATTGCGGATGGAACTCGACTTTGGAGGGCATT GCCGCAGGGCTGCCTATGGTGACTTGGCCGATGGGGGCAGAACAGTTCTACAACGAGAAG TTATTGACAAAAGTGTTGAGAATAGGAGTGAACGTTGGAGCTACCGAGTTGGTGAAAAAA GGAAAGTTGATTAGTAGAGCACAAGTGGAGAAGGCAGTAAGGGAAGTGATTGGTGGTGAG AAGGCAGAGGAAAGGCGGCTAAGGGCTAAGGAGCTGGGCGAGATGGCTAAAGCCGCTGTG GAAGAAGGAGGGTCTTCTTATAATGATGTGAACAAGTTTATGGAAGAGCTGAATGGTAGA AAGTAG
>UGT73B5
ATGAACAGAGAAGTCTCTGAGAGAATTCATATTTTGTTCTTCCCCTTCATGGCTCAAGGC CACATGATTCCAATTTTGGACATGGCCAAGCTTTTCTCGAGGAGAGGAGCCAAGTCAACC CTTCTCACAACCCCAATCAACGCTAAGATCTTCGAGAAACCTATTGAAGCATTCAAAAAT CAAAACCCTGATCTCGAAATCGGAATCAAGATCTTCAATTTCCCTTGTGTAGAGCTTGGA TTGCCTGAAGGATGCGAGAACGCTGACTTTATCAACTCATACCAAAAATCTGACTCAGGT GACTTGTTCTTGAAGTTTCTTTTCTCTACCAAGTATATGAAACAACAGTTGGAGAGTTTC ATTGAAACAACCAAACCAAGTGCTCTTGTTGCCGATATGTTCTTCCCTTGGGCGACAGAA TCTGCTGAGAAGCTCGGTGTACCAAGACTTGTGTTCCACGGTACATCTTTCTTTTCTTTG TGTTGTTCGTATAACATGAGGATTCATAAGCCACACAAGAAAGTCGCTACGAGTTCTACT CCTTTTGTAATCCCTGGTCTCCCAGGAGACATAGTTATTACAGAAGACCAAGCCAATGTT GCCAAAGAAGAAACGCCAATGGGAAAGTTTATGAAAGAGGTTAGGGAATCAGAGACCAAT AGCTTTGGTGTATTGGTTAATAGCTTCTACGAGCTGGAATCAGCTTATGCTGATTTTTAT CGTAGTTTTGTGGCGAAAAGAGCTTGGCATATCGGTCCGCTTTCGCTATCTAACAGAGAG TTAGGAGAGAAAGCCAGAAGAGGGAAAAAGGCTAACATTGATGAGCAAGAATGCCTAAAA TGGCTGGACTCTAAGACACCTGGTTCAGTAGTTTACTTGTCCTTTGGGAGCGGAACTAAT TTCACCAACGACCAGCTGTTAGAGATCGCTTTTGGTCTTGAAGGTTCTGGACAAAGTTTC ATCTGGGTGGTTAGGAAAAATGAAAACCAAGGTGACAATGAAGAGTGGTTGCCTGAAGGG TTTAAAGAGAGGACAACAGGGAAAGGGCTAATAATACCTGGATGGGCGCCGCAAGTGCTG ATACTTGACCATAAAGCAATTGGAGGATTTGTGACTCATTGCGGATGGAACTCGGCTATA GAGGGCATTGCCGCGGGGCTGCCTATGGTAACATGGCCAATGGGGGCAGAACAGTTCTAC AATGAGAAGCTATTGACAAAAGTGTTGAGAATAGGAGTGAACGTTGGAGCTACCGAGTTG GTGAAAAAAGGAAAGTTGATTAGTAGAGCACAAGTGGAGAAGGCAGTAAGGGAAGTGATT GGTGGTGAGAAGGCAGAGGAAAGGCGGCTATGGGCTAAGAAGCTGGGCGAGATGGCTAAA GCCGCTGTGGAAGAAGGAGGGTCCTCTTATAATGATGTGAACAAGTTTATGGAAGAGCTG AATGGTAGAAAGTAG
>UGT73C1
ATGGCATCGGAATTTCGTCCTCCTCTTCATTTTGTTCTCTTCCCTTTCATGGCTCAAGGC CACATGATCCCAATGGTAGATATTGCAAGGCTCCTGGCTCAGCGCGGGGTGACTATAACC ATTGTCACTACACCTCAAAACGCAGGCCGGTTCAAGAACGTTCTTAGCCGGGCTATCCAA TCCGGCTTGCCCATCAATCTCGTGCAAGTAAAGTTTCCATCTCAAGAATCGGGTTCACCG GAAGGACAGGAGAATTTGGACTTGCTCGATTCATTGGGGGCTTCATTAACCTTCTTCAAA GCATTTAGCCTGCTCGAGGAACCAGTCGAGAAGCTCTTGAAAGAGATTCAACCTAGGCCA AACTGCATAATCGCTGACATGTGTTTGCCTTATACAAACAGAATTGCCAAGAATCTTGGT ATACCAAAAATCATCTTTCATGGCATGTGTTGCTTCAATCTTCTTTGTACGCACATAATG CACCAAAACCACGAGTTCTTGGAAACTATAGAGTCTGACAAGGAATACTTCCCCATTCCT AATTTCCCTGACAGAGTTGAGTTCACAAAATCTCAGCTTCCAATGGTATTAGTTGCTGGA GATTGGAAAGACTTCCTTGACGGAATGACAGAAGGGGATAACACTTCTTATGGTGTGATT GTTAACACGTTTGAAGAGCTCGAGCCAGCTTATGTTAGAGACTACAAGAAGGTTAAAGCG GGTAAGATATGGAGCATCGGACCGGTTTCCTTGTGCAACAAGTTAGGAGAAGACCAAGCT GAGAGGGGAAACAAGGCGGACATTGATCAAGACGAGTGTATTAAATGGCTTGATTCTAAA GAAGAAGGGTCGGTGCTATATGTTTGCCTTGGAAGTATATGCAATCTTCCTCTGTCTCAG CTCAAAGAGCTCGGCTTAGGCCTCGAGGAATCCCAAAGACCTTTCATTTGGGTCATAAGA GGTTGGGAGAAGTATAACGAGTTACTTGAATGGATCTCAGAGAGCGGTTATAAGGAAAGA ATCAAAGAAAGAGGCCTTCTCATAACAGGATGGTCGCCTCAAATGCTTATCCTTACACAT CCTGCCGTTGGAGGATTCTTGACACATTGTGGATGGAACTCTACTCTTGAAGGAATCACT TCAGGCGTTCCATTACTCACGTGGCCACTGTTTGGAGACCAATTCTGCAATGAGAAATTG GCGGTGCAGATACTAAAAGCCGGTGTGAGAGCTGGGGTTGAAGAGTCCATGAGATGGGGA GAAGAGGAGAAAATAGGAGTACTGGTGGATAAAGAAGGAGTAAAGAAGGCAGTGGAGGAA TTGATGGGTGATAGTAATGATGCTAAGGAGAGAAGAAAAAGAGTGAAAGAGCTTGGAGAA TTAGCTCACAAGGCTGTGGAAGAAGGAGGCTCTTCTCATTCCAACATCACATTCTTGCTA CAAGACATAATGCAATTAGAACAACCCAAGAAATGA
>UGT73C2
ATGGCTTTCGAGAAGACCCGCCAATTTCTTCCTCGGCTTCACTTTGTTCTCTTCCCTTTC ATGGCTCAAGGCCACATGATCCCCATGGTGGATATTGCAAGGATCTTGGCTCAGCGCGGG GTGACTATTACCATTGTCACGACGCCTCACAACGCAGCCAGGTTCAAAGATGTCCTAAAC CGGGCCATCCAGTCAGGCTTGCACATTAGGGTTGAGCATGTGAAGTTTCCTTTTCAAGAA GCTGGTTTGCAAGAAGGACAAGAGAATGTTGATTTTCTTGACTCAATGGAGTTAATGGTA CATTTCTTTAAAGCGGTTAACATGCTTGAAAATCCGGTCATGAAGCTCATGGAAGAGATG AAACCTAAACCAAGCTGCCTAATTTCTGATTTTTGTTTGCCTTATACAAGCAAAATCGCT AAGAGGTTCAATATCCCAAAGATCGTTTTCCATGGCGTGTCTTGCTTTTGTCTTTTGAGT ATGCATATTCTACACCGAAACCACAATATCTTACATGGTTTAAAGTCGGACAAAGAGTAT TTCTTGGTTCCTAGTTTTCCAGATAGAGTTGAATTTACAAAGCTTCAAGTTACTGTGAAA ACAAACTTTAGTGGAGATTGGAAAGAGATCATGGACGAACAGGTGGATGCTGATGACACG TCCTATGGTGTAATTGTCAACACATTTCAGGATTTGGAGTCTGCCTATGTGAAAAACTAC ACGGAGGCTAGGGCTGGTAAAGTATGGAGCATCGGTCCGGTTTCCTTGTGCAACAAGGTA GGAGAAGACAAAGCTGAGAGGGGAAACAAGGCAGCCATTGATCAAGACGAGTGTATTAAA TGGCTTGATTCTAAAGATGTAGAGTCGGTGCTGTATGTTTGCCTTGGAAGTATATGCAAT CTTCCTCTGGCTCAGCTTAGAGAGCTCGGGCTAGGCCTCGAGGCAACTAAAAGACCATTC ATTTGGGTCATAAGAGGTGGGGGAAAGTATCATGAACTAGCTGAGTGGATCTTAGAGAGC GGTTTTGAAGAAAGAACCAAAGAGAGAAGCCTTCTCATAAAAGGATGGTCGCCTCAAATG CTTATCCTTTCACACCCTGCCGTTGGAGGATTCCTGACACATTGTGGATGGAACTCAACT TTAGAAGGAATCACCTCAGGGGTTCCATTGATCACTTGGCCATTATTTGGAGACCAATTC TGCAACCAGAAACTGATCGTGCAGGTGCTAAAAGCAGGTGTAAGTGTTGGGGTTGAAGAG GTCATGAAATGGGGAGAAGAGGAGAGTATTGGAGTGTTAGTGGATAAAGAAGGAGTGAAG AAGGCAGTGGACGAAATAATGGGCGAGAGTGATGAAGCAAAAGAGAGAAGAAAAAGAGTC AGAGAGCTTGGAGAATTAGCTCACAAGGCTGTGGAAGAAGGAGGCTCTTCTCATTCTAAT ATCATATTTTTGCTACAAGATATAATGCAACAAGTAGAATCCAAGAGTTGA
>UGT73C3
ATGGCTACGGAAAAAACCCACCAATTTCATCCTTCTCTTCACTTTGTCCTCTTCCCTTTC ATGGCTCAAGGCCACATGATTCCCATGATTGATATTGCAAGACTCTTGGCTCAGCGTGGT GTGACCATAACAATTGTCACGACACCTCACAACGCAGCAAGGTTTAAGAATGTCCTAAAC CGAGCGATCGAGTCTGGCTTGGCCATCAACATACTGCATGTGAAGTTTCCATATCAAGAG TTTGGTTTGCCAGAAGGAAAAGAGAATATAGATTCGTTAGACTCAACGGAGTTGATGGTA CCTTTCTTCAAAGCGGTGAACTTGCTTGAAGATCCGGTCATGAAGCTCATGGAAGAGATG AAACCTAGACCTAGCTGTCTAATTTCTGATTGGTGTTTGCCTTATACAAGCATAATCGCC AAGAACTTCAATATACCAAAGATAGTTTTCCACGGCATGGGTTGCTTTAATCTTTTGTGT ATGCATGTTCTACGCAGAAACTTAGAGATCCTAGAGAATGTAAAGTCGGATGAAGAGTAT TTCTTGGTTCCTAGTTTTCCTGATAGAGTTGAATTTACAAAGCTTCAACTTCCTGTGAAA GCAAATGCAAGTGGAGATTGGAAAGAGATAATGGATGAAATGGTAAAAGCAGAATACACA TCCTATGGTGTGATCGTCAACACATTTCAGGAGTTGGAGCCACCTTATGTCAAAGACTAC AAAGAGGCAATGGATGGAAAAGTATGGTCCATTGGACCCGTTTCCTTGTGTAACAAGGCA GGTGCAGACAAAGCTGAGAGGGGAAGCAAGGCCGCCATTGATCAAGATGAGTGTCTTCAA TGGCTTGATTCTAAAGAAGAAGGTTCGGTGCTCTATGTTTGCCTTGGAAGTATATGTAAT CTTCCTTTGTCTCAGCTCAAGGAGCTGGGGCTAGGCCTTGAGGAATCTCGAAGATCTTTT ATTTGGGTCATAAGAGGTTCGGAAAAGTATAAAGAACTATTTGAGTGGATGTTGGAGAGC GGTTTTGAAGAAAGAATCAAAGAGAGAGGACTTCTCATTAAAGGGTGGGCACCTCAAGTC CTTATCCTTTCACATCCTTCCGTTGGAGGATTCCTGACACACTGTGGATGGAACTCGACT CTCGAAGGAATCACCTCAGGCATTCCACTGATCACTTGGCCGCTGTTTGGAGACCAATTC TGCAACCAAAAACTGGTCGTTCAAGTACTAAAAGCCGGTGTAAGTGCCGGGGTTGAAGAA GTCATGAAATGGGGAGAAGAAGATAAAATAGGAGTGTTAGTGGATAAAGAAGGAGTGAAA AAGGCTGTGGAAGAATTGATGGGTGATAGTGATGATGCAAAAGAGAGGAGAAGAAGAGTC AAAGAGCTTGGAGAATTAGCTCACAAAGCTGTGGAAAAAGGAGGCTCTTCTCATTCTAAC ATCACACTCTTGCTACAAGACATAATGCAACTAGCACAATTCAAGAATTGA
>UGT73C4
ATGGCTTCCGAAAAATCCCACAAAGTTCATCCTCCTCTTCACTTTATTCTTTTCCCTTTC ATGGCTCAGGGCCACATGATTCCCATGATTGATATAGCAAGGCTCTTGGCTCAGCGCGGT GCGACAGTAACTATTGTCACGACACGTTATAATGCAGGGAGGTTCGAGAATGTCTTAAGT CGTGCCATGGAGTCTGGTTTACCCATCAACATAGTGCATGTGAATTTTCCATATCAAGAA TTTGGTTTGCCAGAAGGAAAAGAGAATATAGATTCGTATGACTCAATGGAGCTGATGGTA CCTTTCTTTCAAGCAGTTAACATGCTCGAAGATCCGGTCATGAAGCTCATGGAAGAGATG AAACCTAGACCTAGCTGTATTATTTCTGATTTGCTCTTGCCTTATACAAGCAAAATCGCA AGGAAATTCAGTATACCAAAGATAGTTTTCCACGGCACGGGTTGCTTTAATCTTTTGTGT ATGCATGTTCTACGCAGAAACCTCGAGATCTTGAAGAACTTAAAGTCGGATAAAGATTAT TTCCTGGTTCCTAGTTTTCCTGATAGAGTTGAATTTACAAAGCCTCAAGTTCCAGTGGAA ACAACTGCAAGTGGAGATTGGAAAGCGTTCTTGGACGAAATGGTAGAAGCAGAATACACA TCCTATGGTGTGATCGTCAACACATTTCAGGAGTTGGAGCCTGCTTATGTCAAAGACTAC ACGAAGGCTAGGGCTGGAAAAGTATGGTCCATTGGACCTGTTTCCTTGTGCAACAAGGCA GGTGCTGATAAAGCTGAGAGGGGAAACCAGGCCGCCATTGATCAAGATGAGTGTCTTCAA TGGCTTGATTCTAAAGAAGATGGTTCGGTGTTATATGTTTGCCTTGGAAGTATCTGTAAT CTACCTTTGTCTCAGCTCAAGGAGCTGGGGCTAGGCCTTGAAAAATCCCAAAGATCTTTT ATTTGGGTCATAAGAGGTTGGGAAAAGTATAATGAACTATATGAGTGGATGATGGAGAGC GGTTTTGAAGAAAGAATCAAAGAGAGAGGACTTCTTATTAAAGGGTGGTCACCTCAAGTC CTTATCCTTTCACATCCTTCCGTTGGAGGATTCCTGACACACTGTGGATGGAACTCGACT CTCGAAGGAATCACCTCAGGCATTCCACTGATCACTTGGCCGCTGTTTGGAGACCAATTC TGCAACCAAAAACTGGTCGTTCAAGTACTAAAAGCCGGTGTAAGTGCCGGGGTTGAAGAA GTCATGAAATGGGGAGAAGAGGAGAAAATAGGAGTGTTAGTGGATAAAGAAGGAGTAAAG AAGGCAGTGGAAGAGTTAATGGGTGCGAGTGATGATGCAAAAGAGAGGAGAAGAAGAGTC AAAGAGCTTGGAGAATCAGCTCACAAGGCTGTGGAAGAAGGAGGCTCTTCTCATTCTAAC ATCACATACTTGCTACAAGACATAATGCAACAAGTGAAATCCAAGAACTGA
>UGT73C5
ATGGTTTCCGAAACAACCAAATCTTCTCCACTTCACTTTGTTCTCTTCCCTTTCATGGCT CAAGGCCACATGATTCCCATGGTTGATATTGCAAGGCTCTTGGCTCAGCGTGGTGTGATC ATAACAATTGTCACGACGCCTCACAATGCAGCGAGGTTCAAGAATGTCCTAAACCGTGCC ATTGAGTCTGGCTTGCCCATCAACTTAGTGCAAGTCAAGTTTCCATATCTAGAAGCTGGT TTGCAAGAAGGACAAGAGAATATCGATTCTCTTGACACAATGGAGCGGATGATACCTTTC TTTAAAGCGGTTAACTTTCTCGAAGAACCAGTCCAGAAGCTCATTGAAGAGATGAACCCT CGACCAAGCTGTCTAATTTCTGATTTTTGTTTGCCTTATACAAGCAAAATCGCCAAGAAG TTCAATATCCCAAAGATCCTCTTCCATGGCATGGGTTGCTTTTGTCTTCTGTGTATGCAT GTTTTACGCAAGAACCGTGAGATCTTGGACAATTTAAAGTCAGATAAGGAGCTTTTCACT GTTCCTGATTTTCCTGATAGAGTTGAATTCACAAGAACGCAAGTTCCGGTAGAAACATAT GTTCCAGCTGGAGACTGGAAAGATATCTTTGATGGTATGGTAGAAGCGAATGAGACATCT TATGGTGTGATCGTCAACTCATTTCAAGAGCTCGAGCCTGCTTATGCCAAAGACTACAAG GAGGTAAGGTCCGGTAAAGCATGGACCATTGGACCCGTTTCCTTGTGCAACAAGGTAGGA GCCGACAAAGCAGAGAGGGGAAACAAATCAGACATTGATCAAGATGAGTGCCTTAAATGG CTCGATTCTAAGAAACATGGCTCGGTGCTTTACGTTTGTCTTGGAAGTATCTGTAATCTT CCTTTGTCTCAACTCAAGGAGCTGGGACTAGGCCTAGAGGAATCCCAAAGACCTTTCATT TGGGTCATAAGAGGTTGGGAGAAGTACAAAGAGTTAGTTGAGTGGTTCTCGGAAAGCGGC TTTGAAGATAGAATCCAAGATAGAGGACTTCTCATCAAAGGATGGTCCCCTCAAATGCTT ATCCTTTCACATCCATCAGTTGGAGGGTTCCTAACACACTGTGGTTGGAACTCGACTCTT GAGGGGATAACTGCTGGTCTACCGCTACTTACATGGCCGCTATTCGCAGACCAATTCTGC AATGAGAAATTGGTCGTTGAGGTACTAAAAGCCGGTGTAAGATCCGGGGTTGAACAGCCT ATGAAATGGGGAGAAGAGGAGAAAATAGGAGTGTTGGTGGATAAAGAAGGAGTGAAGAAG GCAGTGGAAGAATTAATGGGTGAGAGTGATGATGCAAAAGAGAGAAGAAGAAGAGCCAAA GAGCTTGGAGATTCAGCTCACAAGGCTGTGGAAGAAGGAGGCTCTTCTCATTCTAACATC TCTTTCTTGCTACAAGACATAATGGAACTGGCAGAACCCAATAATTGA
>UGT73C6
ATGGCTTTCGAAAAAAACAACGAACCTTTTCCTCTTCACTTTGTTCTCTTCCCTTTCATG GCTCAAGGCCACATGATTCCCATGGTTGATATTGCAAGGCTCTTGGCTCAGCGAGGTGTG CTTATAACAATTGTCACGACGCCTCACAATGCAGCAAGGTTCAAGAATGTCCTAAACCGT GCCATTGAGTCTGGTTTGCCCATCAACCTAGTGCAAGTCAAGTTTCCATATCAAGAAGCT GGTCTGCAAGAAGGACAAGAAAATATGGATTTGCTTACCACGATGGAGCAGATAACATCT TTCTTTAAAGCGGTTAACTTACTCAAAGAACCAGTCCAGAACCTTATTGAAGAGATGAGC CCGCGACCAAGCTGTCTAATCTCTGATATGTGTTTGTCGTATACAAGCGAAATCGCCAAG AAGTTCAAAATACCAAAGATCCTCTTGCATGGCATGGGTTGCTTTTGTCTTCTGTGTGTT AACGTTCTGCGCAAGAACCGTGAGATCTTGGACAATTTAAAGTCTGATAAGGAGTACTTC ATTGTTCCTTATTTTCCTGATAGAGTTGAATTCACAAGACCTCAAGTTCCGGTGGAAACA TATGTTCCTGCAGGCTGGAAAGAGATCTTGGAGGATATGGTAGAAGCGGATAAGACATCT TATGGTGTTATAGTCAACTCATTTCAAGAGCTCGAACCTGCGTATGCCAAAGACTTCAAG GAGGCAAGGTCTGGTAAAGCATGGACCATTGGACCTGTTTCCTTGTGCAACAAGGTAGGA GTAGACAAAGCAGAGAGGGGAAACAAATCAGATATTGATCAAGATGAGTGCCTTGAATGG CTCGATTCTAAGGAACCGGGATCTGTGCTCTACGTTTGCCTTGGAAGTATTTGTAATCTT CCTCTGTCTCAGCTCCTTGAGCTGGGACTAGGCCTAGAGGAATCCCAAAGACCTTTCATC TGGGTCATAAGAGGTTGGGAGAAATACAAAGAGTTAGTTGAGTGGTTCTCGGAAAGCGGC TTTGAAGATAGAATCCAAGATAGAGGACTTCTCATCAAAGGATGGTCCCCTCAAATGCTT ATCCTTTCACATCCTTCTGTTGGAGGGTTCTTAACGCACTGCGGATGGAACTCGACTCTT GAGGGGATAACTGCTGGTCTACCAATGCTTACATGGCCACTATTTGCAGACCAATTCTGC AACGAGAAACTGGTCGTACAAATACTAAAAGTCGGTGTAAGTGCCGAGGTTAAAGAGGTC ATGAAATGGGGAGAAGAAGAGAAGATAGGAGTGTTGGTGGATAAAGAAGGAGTGAAGAAG GCAGTGGAAGAACTAATGGGTGAGAGTGATGATGCAAAAGAGAGAAGAAGAAGAGCCAAA GAGCTTGGAGAATCAGCTCACAAGGCTGTGGAAGAAGGAGGCTCCTCTCATTCTAATATC ACTTTCTTGCTACAAGACATAATGCAACTAGCACAGTCCAATAATTGA
>UGT73C7
ATGTGTTCTCATGATCCTCTTCACTTCGTCGTAATACCCTTTATGGCCCAAGGCCATATG ATCCCATTGGTCGACATCTCTAGGCTCTTGTCCCAGCGCCAAGGCGTGACTGTCTGCATC ATCACAACTACTCAAAATGTAGCCAAGATCAAGACTTCACTCTCATTTTCCTCTTTGTTT GCGACTATCAACATCGTTGAAGTTAAGTTTCTGTCTCAACAAACGGGTTTGCCAGAAGGG TGCGAGAGTTTAGATATGTTGGCTTCAATGGGCGATATGGTGAAGTTCTTTGATGCTGCC AACTCACTTGAGGAGCAAGTTGAGAAAGCTATGGAAGAGATGGTTCAGCCGCGGCCAAGC TGCATCATTGGAGACATGAGCCTTCCTTTCACTTCAAGACTTGCCAAGAAATTCAAGATC CCCAAACTTATCTTCCATGGGTTTTCTTGTTTCAGCCTCATGTCTATACAAGTGGTTCGA GAAAGCGGGATCTTGAAAATGATAGAATCAAACGACGAGTATTTTGATTTGCCCGGCTTG CCTGACAAAGTTGAGTTCACGAAACCTCAGGTCTCTGTGTTGCAACCTGTTGAAGGAAAT ATGAAAGAGAGTACGGCCAAGATTATTGAAGCTGATAATGACTCTTATGGTGTTATTGTG AACACTTTTGAAGAGTTAGAGGTTGATTATGCAAGAGAATATAGGAAAGCAAGGGCTGGA AAAGTTTGGTGCGTTGGACCTGTTTCCTTGTGCAATAGGTTAGGGTTAGACAAAGCTAAA AGAGGAGATAAGGCTTCTATTGGTCAAGACCAATGTCTTCAATGGCTTGACTCTCAAGAA ACTGGTTCAGTGCTCTACGTTTGCCTTGGAAGTCTATGTAATCTTCCCTTGGCTCAGCTC AAAGAGCTGGGACTAGGCCTTGAGGCATCTAATAAACCTTTCATATGGGTTATAAGAGAA TGGGGAAAATATGGAGATTTAGCAAATTGGATGCAACAAAGCGGATTTGAAGAGCGGATC AAAGATAGAGGACTGGTGATCAAAGGTTGGGCGCCGCAAGTTTTCATCCTCTCACACGCA TCCATTGGAGGGTTTTTGACTCACTGTGGATGGAACTCGACACTAGAAGGAATTACTGCA GGAGTTCCATTATTGACATGGCCTTTGTTTGCTGAACAATTCTTGAATGAGAAGTTAGTT GTGCAGATACTAAAAGCAGGGTTAAAGATAGGAGTAGAGAAATTGATGAAATATGGAAAA GAAGAGGAGATAGGAGCGATGGTGAGCAGAGAATGTGTGAGAAAAGCTGTGGATGAGCTA ATGGGTGATAGTGAAGAAGCAGAAGAGAGAAGAAGAAAAGTTACAGAACTTAGTGACTTG GCAAATAAGGCTTTGGAAAAAGGAGGATCTTCAGATTCTAATATCACATTGCTCATTCAA GATATTATGGAGCAATCACAAAATCAATTTTAA
>UGT73D1
ATGGAATCAAAAATAGTTTCAAAAGCCAAAAGACTTCACTTTGTTTTGATCCCTCTCATG GCTCAAGGGCATCTGATCCCCATGGTCGACATCTCCAAGATTCTTGCACGACAAGGCAAC ATCGTTACCATAGTTACAACCCCTCAAAATGCTTCTAGGTTTGCGAAGACAGTTGACCGA GCAAGATTAGAGTCGGGTCTCGAAATCAATGTCGTTAAATTTCCAATTCCTTACAAAGAA TTCGGTCTTCCCAAAGATTGTGAGACTCTGGACACTTTGCCCTCCAAAGACCTCCTACGA AGATTCTATGACGCTGTGGATAAACTCCAAGAGCCCATGGAACGGTTTGTTGAGCAACAA GATATCCCTCCAAGTTGCATAATCTCCGATAAATGCCTTTTTTGGACGTCAAGAACCGCA AAGAGGTTCAAAATCCCGAGGATCGTGTTCCATGGAATGTGTTGCTTCTCTCTTTTGAGT TCGCACAATATCCATCTTCATAGCCCGCACCTCTCGGTTTCTTCGGCCGTAGAGCCATTC CCTATACCAGGAATGCCACATAGGATTGAGATAGCTAGAGCTCAGTTACCTGGTGCTTTT GAGAAGTTAGCAAATATGGATGACGTTCGCGAGAAGATGCGTGAATCTGAATCAGAAGCC TTTGGGGTTATTGTTAATAGCTTCCAGGAATTGGAGCCTGGCTATGCAGAGGCCTACGCT GAGGCCATCAATAAGAAGGTATGGTTCGTTGGACCCGTTTCTTTATGCAACGACCGTATG GCTGACCTATTCGATAGAGGAAGTAATGGTAACATCGCAATAAGCGAGACCGAATGCTTG CAGTTTCTTGACTCGATGAGACCAAGGTCAGTCTTATATGTTTCTCTTGGTAGCCTCTGT CGACTAATACCTAATCAATTGATAGAACTAGGTTTAGGGTTAGAAGAATCGGGAAAACCC TTTATTTGGGTGATAAAGACCGAGGAAAAACACATGATTGAGCTAGACGAATGGCTAAAA CGCGAAAATTTTGAAGAGCGAGTTAGAGGAAGAGGGATAGTAATAAAGGGTTGGAGTCCT CAGGCTATGATACTCTCACATGGTTCAACCGGCGGGTTCTTGACTCATTGCGGTTGGAAT TCTACAATAGAAGCGATATGTTTTGGTGTACCAATGATCACATGGCCGTTGTTCGCTGAA CAATTTCTCAATGAGAAACTCATCGTGGAGGTTTTGAACATCGGGGTTAGGGTTGGGGTG GAGATTCCGGTGAGATGGGGAGACGAGGAGAGACTTGGAGTGTTGGTCAAGAAACCGAGT GTTGTGAAAGCTATAAAGCTTTTGATGGACCAAGATTGTCAACGTGTAGACGAAAATGAT GATGATAATGAATTCGTGAGACGAAGGAGACGTATTCAAGAACTTGCAGTAATGGCGAAA AAGGCTGTGGAAGAAAAGGGATCTTCGAGTATTAACGTTTCAATTTTAATCCAAGATGTT TTGGAGCAATTGAGTCTCGTGTAG
>UGT74B1
ATGGCGGAAACAACTCCCAAAGTGAAAGGCCACGTCGTAATCTTACCATACCCAGTTCAA GGCCACCTAAACCCAATGGTTCAATTCGCTAAACGTCTAGTCTCCAAAAACGTCAAAGTC ACAATCGCCACCACTACCTACACCGCCTCCTCAATCACAACACCATCACTCTCCGTCGAA CCAATCTCCGATGGATTCGATTTCATCCCCATAGGTATCCCCGGTTTCAGCGTCGATACT TACTCAGAATCCTTCAAGCTCAACGGATCCGAAACCCTAACTCTCCTAATCGAGAAATTC AAATCCACAGATTCACCAATCGATTGCTTAATCTACGATTCGTTTCTTCCTTGGGGACTT GAAGTTGCTAGATCTATGGAACTTTCAGCTGCTTCTTTCTTCACTAATAATCTCACTGTT TGTTCTGTGTTGCGTAAATTCTCTAACGGTGACTTTCCTCTTCCCGCTGATCGTAATTCG GCGCCGTTTCGTATCCGTGGCTTACCGTCTTTGAGCTACGATGAGTTACCTTCGTTTGTG GGACGTCATTGGTTGACTCATCCTGAGCATGGCAGAGTTCTTCTGAATCAGTTTCCTAAC CATGAAAATGCTGATTGGTTATTCGTTAATGGCTTTGAAGGGTTAGAAGAAACACAAGAT TGTGAAAATGGTGAGTCTGATGCAATGAAGGCGACGTTGATCGGACCGATGATTCCATCG GCTTATCTTGATGATCGGATGGAAGATGATAAAGACTATGGTGCGAGTCTGTTGAAACCG ATATCGAAGGAGTGTATGGAGTGGCTTGAGACTAAGCAGGCTCAGTCAGTAGCATTTGTT TCGTTTGGTTCGTTTGGGATTCTCTTTGAGAAGCAACTTGCAGAGGTAGCTATTGCGCTA CAAGAATCGGATTTGAACTTCTTGTGGGTGATTAAAGAAGCTCATATAGCGAAATTGCCT GAAGGGTTTGTGGAATCGACTAAAGATAGAGCCTTGTTGGTTTCTTGGTGTAACCAGCTT GAGGTTTTAGCTCATGAATCGATAGGTTGCTTTTTGACTCATTGTGGTTGGAACTCTACG TTGGAAGGGTTGAGTTTGGGAGTTCCGATGGTTGGTGTGCCTCAGTGGAGTGATCAGATG AATGATGCTAAGTTTGTGGAGGAAGTTTGGAAAGTTGGGTATAGAGCGAAAGAGGAAGCT GGGGAAGTAATCGTGAAGAGTGAAGAATTGGTGAGGTGTTTGAAAGGAGTGATGGAAGGA GAGAGTAGTGTGAAGATTAGAGAGAGTTCGAAGAAGTGGAAAGATTTGGCTGTGAAGGCA ATGAGTGAAGGAGGAAGCTCTGATCGAAGCATTAACGAGTTTATAGAGAGTTTAGGGAAG TAA
>UGT74C1
ATGAGTGAAGCAAAGAAGGGTCACGTACTGTTTTTTCCATATCCATTACAAGGCCACATT AACCCAATGATCCAACTCGCTAAACGCTTATCCAAAAAGGGCATCACCAGCACACTCATC ATCGCCTCCAAAGACCACCGTGAACCTTACACCTCCGACGACTACTCCATCACCGTCCAC ACCATCCACGACGGTTTCTTTCCACATGAACACCCTCACGCCAAGTTCGTAGATCTTGAC CGTTTCCACAACTCTACTTCTCGAAGCCTGACCGATTTCATCTCTAGTGCGAAGTTGTCG GACAATCCTCCAAAAGCTCTGATCTATGATCCATTTATGCCCTTTGCATTGGACATAGCC AAGGACTTGGATCTATACGTAGTGGCATATTTCACTCAACCATGGTTGGCTAGTCTTGTT TACTACCATATCAACGAAGGCACCTACGATGTTCCCGTTGATAGACACGAGAACCCAACA CTTGCATCGTTTCCTGGTTTCCCATTGTTAAGCCAAGATGATCTGCCTTCGTTCGCCTGC GAAAAAGGGTCGTACCCTCTTCTACACGAGTTTGTGGTTAGGCAATTCTCTAATTTATTG CAAGCTGATTGCATTCTCTGCAACACTTTTGATCAACTTGAACCAAAGGTAGTGAAATGG ATGAATGATCAATGGCCGGTGAAGAACATTGGACCGGTGGTTCCATCGAAGTTCTTGGAT AACCGGTTGCCAGAAGACAAAGATTACGAACTCGAGAACTCCAAGACAGAGCCAGACGAG TCTGTTTTGAAGTGGTTGGGAAACAGGCCGGCGAAGTCGGTGGTTTACGTGGCGTTTGGG ACATTGGTGGCTTTGAGCGAAAAACAGATGAAGGAAATTGCAATGGCGATTAGCCAAACC GGATATCACTTCTTGTGGTCTGTTAGAGAATCCGAGAGAAGCAAACTACCCTCTGGTTTT ATCGAAGAGGCAGAGGAGAAAGACTCTGGACTTGTGGCTAAGTGGGTTCCTCAGCTAGAG GTTTTAGCACATGAATCAATCGGGTGTTTCGTGTCACACTGTGGATGGAACTCGACATTG GAGGCACTATGCTTAGGGGTTCCAATGGTGGGCGTGCCTCAGTGGACTGATCAGCCCACA AATGCTAAGTTTATAGAGGATGTGTGGAAGATTGGGGTTAGAGTGAGGACCGATGGAGAA GGGCTTTCGAGTAAAGAAGAGATTGCGAGATGCATTGTTGAGGTCATGGAAGGAGAGAGA GGGAAAGAGATAAGGAAGAATGTTGAGAAGCTTAAGGTGTTGGCTCGCGAAGCTATCTCT GAAGGAGGTAGTTCCGACAAGAAGATTGATGAGTTTGTTGCTCTTTTGACTTAA
>UGT74D1
ATGGGAGAGAAAGCGAAAGCAAATGTGTTAGTCTTCTCATTTCCGATACAAGGTGACATA AACCCTCTCCTCCAATTCTCAAAACGCCTACTCTCTAAAAACGTCAACGTCACATTCCTC ACCACTTCCTCCACCCACAACTCCATCCTCCGCCGTGCCATCACCGGCGGAGCCACTGCT CTTCCTCTCTCTTTTGTCCCCATTGACGATGGATTCGAGGAAGATCACCCATCTACGGAC ACATCTCCCGACTACTTCGCAAAGTTCCAAGAAAACGTATCTCGAAGCCTCTCAGAGCTT ATCTCCTCGATGGACCCAAAACCAAACGCCGTCGTTTACGACTCGTGCCTGCCTTATGTC CTCGACGTTTGCCGGAAACATCCTGGCGTTGCTGCGGCGTCGTTTTTCACTCAGTCCTCC ACCGTGAACGCGACCTATATTCATTTCTTGCGTGGAGAGTTTAAGGAGTTTCAAAATGAT GTCGTTTTGCCTGCAATGCCTCCGCTGAAGGGTAATGACTTACCGGTGTTTCTGTACGAT AACAATCTCTGCCGGCCGTTGTTTGAGCTCATTAGTAGCCAGTTGGTGAATGTTGACGAC ATTGACTTCTTCTTGGTTAACTCTTTCGACGAACTCGAAGTCGAGGTGCTACAATGGATG AAAAACCAATGGCCGGTCAAGAACATAGGACCGATGATTCCATCAATGTACTTAGACAAA CGATTAGCAGGTGACAAAGACTACGGAATCAACCTCTTCAATGCCCAAGTCAACGAATGC CTTGATTGGCTTGACTCAAAACCGCCCGGTTCAGTGATCTACGTGTCTTTTGGAAGCTTG GCCGTCTTAAAAGACGATCAAATGATAGAAGTCGCGGCTGGTCTAAAACAAACTGGCCAT AACTTCTTATGGGTTGTTAGAGAAACTGAAACAAAGAAGCTTCCAAGCAATTACATAGAG GACATTTGTGACAAGGGATTGATAGTGAATTGGAGTCCTCAATTACAAGTTCTTGCACAT AAATCAATCGGTTGTTTCATGACTCATTGCGGGTGGAATTCGACTTTAGAGGCATTGAGC TTAGGAGTTGCTTTGATAGGAATGCCGGCTTATAGCGACCAGCCGAGTAATGCTAAGTTT ATTGAAGATGTGTGGAAGGTTGGGGTTAGGGTTAAGGCAGATCAAAATGGGTTTGTTCCG AAGGAAGAGATTGTGAGATGTGTTGGAGAAGTTATGGAAGATATGTCGGAGAAAGGGAAG GAGATTAGAAAAAATGCTCGGAGGTTGATGGAGTTTGCAAGGGAAGCTTTGTCTGATGGA GGAAATTCTGATAAGAATATTGATGAGTTTGTTGCTAAAATTGTGAGGTAA
>UGT74E1
ATGAGAGAAGGATCTCATGTTATTGTTTTGCCTTTCCCAGCACAAGGCCACATAACTCCA ATGTCCCAATTCTGTAAACGCTTAGCCTCAAAAAGTCTTAAGATCACTCTTGTCCTCGTC TCCGACAAGCCCTCTCCGCCGTACAAAACAGAGCACGACACAATCACTGTCGTCCCCATC TCCAATGGTTTCCAAGAAGGCCAGGAACGATCAGAAGACCTAGATGAGTACATGGAAAGA GTAGAATCCAGCATCAAAAACCGCTTACCGAAGTTGATAGAAGACATGAAACTATCGGGA AATCCTCCTAGGGCTCTTGTGTACGACTCCACCATGCCGTGGCTTCTGGATGTAGCTCAT AGTTATGGTTTGAGCGGTGCCGTGTTTTTCACGCAGCCTTGGCTTGTCTCAGCTATTTAC TATCATGTATTCAAGGGCTCGTTCTCTGTACCGTCTACAAAGTATGGTCACTCGACGTTA GCATCTTTCCCTTCGTTACCGATTCTGAATGCGAATGATTTGCCGTCTTTCCTCTGTGAA TCTTCCTCTTACCCATATATTCTAAGGACTGTGATCGATCAGCTCTCAAACATTGATCGA GTTGATATAGTTTTGTGCAACACTTTCGATAAATTGGAAGAAAAGTTGCTGAAATGGATT AAAAGCGTGTGGCCTGTCCTGAACATAGGACCAACTGTTCCATCAATGTATTTAGATAAG CGACTGGCTGAAGACAAAAACTACGGATTCAGCCTCTTCGGTGCGAAAATCGCTGAATGC ATGGAGTGGCTCAACTCAAAGCAGCCTAGTTCAGTTGTTTATGTATCATTTGGGAGCTTG GTGGTTCTAAAAAAAGATCAACTGATAGAACTAGCGGCGGGTCTGAAACAGAGCGGACAT TTCTTTTTGTGGGTTGTGAGAGAGACGGAGAGAAGAAAACTTCCAGAAAACTATATAGAG GAAATTGGTGAGAAAGGACTGACCGTGAGCTGGAGTCCACAACTTGAAGTTCTTACACAT AAATCGATCGGTTGTTTCGTGACACATTGTGGATGGAACTCGACGTTAGAGGGATTGAGT TTGGGAGTTCCAATGATTGGTATGCCTCATTGGGCAGATCAGCCTACAAATGCTAAGTTC ATGGAGGATGTGTGGAAAGTTGGAGTTAGGGTTAAAGCAGACAGTGATGGGTTCGTGAGA AGAGAAGAGTTTGTGAGACGTGTGGAAGAAGTTATGGAGGCAGAGCAAGGTAAAGAGATT AGAAAGAATGCTGAGAAATGGAAAGTGTTGGCTCAAGAGGCTGTTTCTGAAGGAGGTAGT TCTGATAAGAACATCAATGAGTTTGTTTCTATGTTTTGTTGA
>UGT74E2
ATGAGAGAAGGATCTCATCTTATCGTCTTGCCTTTCCCAGGACAAGGCCACATAACTCCA ATGTCCCAGTTCTGCAAACGCTTAGCCTCAAAAGGTCTTAAGCTCACTCTGGTCCTCGTC TCCGACAAACCCTCTCCTCCATACAAAACAGAGCACGACTCAATCACTGTCTTCCCCATC TCCAACGGCTTCCAAGAAGGCGAGGAACCATTACAAGACCTCGATGATTACATGGAAAGA GTAGAAACCAGCATCAAAAACACCTTACCGAAGTTGGTTGAAGACATGAAACTGTCGGGA AATCCACCTAGGGCTATCGTGTACGACTCCACCATGCCATGGCTTCTTGATGTAGCTCAT AGTTATGGATTGAGCGGTGCCGTGTTTTTCACGCAACCTTGGCTTGTCACAGCTATTTAC TACCATGTTTTCAAGGGTTCGTTCTCTGTACCGTCTACAAAGTACGGTCACTCGACATTA GCATCTTTCCCTTCGTTCCCGATGCTGACTGCAAATGATTTGCCGTCTTTCCTCTGCGAA TCGTCCTCATACCCGAATATACTGAGGATTGTGGTGGATCAGCTCTCAAACATTGATCGA GTCGACATAGTGTTGTGCAACACTTTCGATAAATTGGAGGAAAAGTTGTTGAAATGGGTC CAAAGCTTGTGGCCAGTCTTGAATATTGGACCAACGGTTCCATCGATGTATTTAGACAAA CGACTGTCTGAAGACAAGAACTACGGTTTTAGCCTCTTCAATGCGAAAGTCGCTGAATGC ATGGAGTGGCTAAACTCAAAGGAGCCTAATTCTGTTGTCTATTTATCATTCGGAAGTTTG GTGATTCTAAAAGAAGATCAAATGTTGGAACTCGCTGCGGGTCTGAAACAGAGCGGACGT TTCTTTCTGTGGGTTGTGAGAGAGACAGAGACACACAAACTTCCAAGAAACTATGTCGAG GAAATCGGTGAAAAAGGACTTATTGTAAGCTGGAGTCCTCAGCTTGACGTACTTGCACAT AAATCAATCGGTTGTTTCTTGACACACTGTGGATGGAACTCGACGTTAGAGGGATTGAGT TTGGGAGTTCCAATGATTGGTATGCCACACTGGACTGATCAGCCCACGAATGCTAAGTTC ATGCAGGATGTGTGGAAGGTTGGGGTAAGGGTTAAGGCAGAAGGTGATGGGTTTGTGAGA AGAGAAGAGATTATGAGAAGTGTGGAAGAAGTTATGGAGGGAGAGAAAGGGAAAGAGATT AGAAAGAATGCTGAGAAATGGAAAGTGTTGGCTCAAGAGGCAGTTTCTGAAGGAGGTAGC TCTGATAAGAGCATCAATGAGTTTGTTTCTATGTTTTGTTGA
>UGT74F1
ATGGAGAAGATGAGAGGACATGTATTAGCAGTGCCATTTCCAAGCCAAGGACACATCACC CCGATTCGCCAATTCTGCAAACGACTTCACTCCAAAGGTTTCAAAACCACTCACACTCTC ACCACTTTTATCTTCAACACAATCCACCTCGACCCATCTAGTCCTATCTCCATAGCCACA ATCTCCGATGGCTATGACCAGGGAGGGTTCTCATCAGCCGGTTCTGTCCCGGAGTACCTA CAAAACTTCAAAACCTTCGGCTCCAAAACCGTCGCTGATATCATCCGCAAACACCAGAGT ACTGATAACCCTATTACTTGTATCGTCTATGATTCTTTCATGCCTTGGGCGCTTGACCTT GCAATGGATTTTGGTCTAGCTGCGGCTCCTTTCTTCACGCAGTCTTGCGCCGTTAACTAT ATCAATTATCTTTCTTACATAAACAATGGTAGCTTGACACTTCCCATCAAGGATTTGCCT CTTCTTGAGCTCCAAGATTTGCCTACTTTCGTCACTCCTACTGGTTCACACCTTGCTTAC TTTGAGATGGTGCTTCAACAGTTCACCAACTTCGACAAAGCTGATTTCGTACTCGTTAAT TCCTTCCATGACCTCGACCTTCATGAAGAGGAGTTGTTGTCGAAAGTATGTCCTGTGTTG ACAATTGGTCCAACTGTTCCATCAATGTACTTAGACCAACAGATCAAATCAGACAACGAC TATGATCTGAACCTCTTTGACTTAAAAGAAGCTGCCTTATGCACTGACTGGCTAGACAAG AGGCCAGAAGGATCGGTAGTATATATAGCTTTTGGGAGCATGGCTAAACTGAGTAGTGAG CAGATGGAAGAGATTGCTTCGGCGATAAGCAACTTCAGCTACCTCTGGGTTGTCAGAGCT TCAGAGGAGTCAAAGCTCCCACCAGGGTTTCTTGAAACAGTGGATAAAGACAAGAGCTTG GTCTTGAAGTGGAGTCCTCAGCTTCAAGTTCTGTCAAACAAAGCCATCGGTTGTTTCATG ACTCACTGTGGCTGGAACTCAACCATGGAGGGTTTGAGTTTAGGGGTTCCCATGGTGGCT ATGCCTCAATGGACTGATCAACCAATGAATGCAAAGTATATACAAGATGTATGGAAGGTT GGGGTTCGTGTGAAAGCAGAGAAAGAAAGTGGCATTTGCAAAAGAGAGGAGATTGAGTTT AGCATCAAGGAAGTGATGGAAGGAGAGAAGAGCAAAGAGATGAAAGAGAATGCGGGAAAA TGGAGAGACTTGGCTGTGAAGTCACTCAGTGAAGGAGGTTCTACAGATATCAACATTAAC GAATTTGTATCAAAAATTCAAATCAAATAA
>UGT74F2
ATGGAGCATAAGAGAGGACATGTATTAGCAGTGCCGTACCCAACGCAAGGACACATCACA CCATTCCGCCAATTCTGCAAACGACTTCACTTCAAAGGTCTCAAAACCACTCTCGCTCTC ACCACTTTCGTCTTCAACTCCATCAATCCTGACCTATCCGGTCCAATCTCCATAGCCACC ATCTCCGATGGCTATGACCATGGGGGTTTCGAGACAGCTGACTCCATCGACGACTACCTC AAAGACTTTAAAACTTCCGGCTCGAAAACCATTGCAGACATCATCCAAAAACACCAGACT AGTGATAACCCCATCACTTGTATCGTCTATGATGCTTTCCTGCCTTGGGCACTTGACGTT GCTAGAGAGTTTGGTTTAGTTGCGACTCCTTTCTTTACGCAGCCTTGTGCTGTTAACTAT GTTTATTATCTTTCTTACATAAACAATGGAAGCTTGCAACTTCCCATTGAGGAATTGCCT TTTCTTGAGCTCCAAGATTTGCCTTCTTTCTTCTCTGTTTCTGGCTCTTATCCTGCTTAC TTTGAGATGGTGCTTCAACAGTTCATAAATTTCGAAAAAGCTGATTTQGTTCTCGTTAAT AGCTTCCAAGAGTTGGAACTGCATGAGAATGAATTGTGGTCGAAAGCTTGTCCTGTGTTG ACAATTGGTCCAACTATTCCATCAATTTACTTAGACCAACGTATCAAATCAGACACCGGC TATGATCTTAATCTCTTTGAATCGAAAGATGATTCCTTCTGCATTAACTGGCTCGACACA AGGCCACAAGGGTCGGTGGTGTACGTAGCATTCGGAAGCATGGCTCAGCTGACTAATGTG CAGATGGAGGAGCTTGCTTCAGCAGTAAGCAACTTCAGCTTCCTGTGGGTGGTCAGATCT TCAGAGGAGGAAAAACTCCCATCAGGGTTTCTTGAGACAGTGAATAAAGAAAAGAGCTTG GTCTTGAAATGGAGTCCTCAGCTTCAAGTTCTGTCAAACAAAGCCATCGGTTGTTTCTTG ACTCACTGTGGCTGGAACTCAACCATGGAGGCTTTGACCTTCGGGGTTCCCATGGTGGCA ATGCCCCAATGGACTGATCAACCGATGAACGCAAAGTACATACAAGATGTGTGGAAGGCT GGAGTTCGTGTGAAGACAGAGAAGGAGAGTGGGATTGCCAAGAGAGAGGAGATTGAGTTT AGCATTAAGGAAGTGATGGAAGGAGAGAGGAGCAAAGAGATGAAGAAGAACGTGAAGAAA TGGAGAGACTTGGCTGTCAAGTCACTCAATGAAGGAGGTTCTACGGATACTAACATTGAT ACATTTGTATCAAGGGTTCAGAGCAAATAG
>UGT75B1
ATGGCGCCACCGCATTTTCTACTGGTAACGTTTCCGGCGCAAGGTCACGTGAACCCATCT CTCCGTTTTGCTCGTCGGCTCATCAAAAGAACCGGCGCACGTGTCACTTTCGTCACTTGT GTCTCCGTCTTCCACAACTCCATGATCGCAAACCACAACAAAGTCGAAAATCTCTCTTTC CTTACTTTCTCCGACGGTTTCGACGATGGAGGCATTTCCACCTACGAAGACCGTCAGAAA AGGTCGGTGAATCTCAAGGTTAACGGCGATAAGGCACTATCGGATTTCATCGAAGCTACT AAGAATGGTGACTCTCCCGTGACTTGCTTGATCTACACGATTCTTCTCAATTGGGCTCCA AAAGTAGCACGTAGATTTCAACTTCCCTCCGCTCTTCTCTGGATCCAACCGGCTTTGGTT TTCAACATCTATTACACTCATTTCATGGGAAACAAGTCCGTTTTCGAGTTACCTAATCTG TCTTCTCTGGAAATCAGAGATCTTCCATCTTTCCTCACACCTTCCAACACAAACAAAGGC GCATACGATGCGTTTCAAGAAATGATGGAGTTTCTCATAAAAGAAACCAAACCGAAAATT CTCATCAACACTTTCGATTCGCTGGAACCAGAGGCCTTAACGGCTTTCCCGAATATCGAT ATGGTGGCGGTTGGTCCTTTACTTCCCACGGAGATTTTCTCAGGAAGCACCAACAAATCA GTTAAAGATCAAAGTAGTAGTTATACACTTTGGCTAGACTCGAAAACAGAGTCCTCTGTT ATTTACGTTTCCTTTGGAACAATGGTTGAGTTGTCCAAGAAACAGATAGAGGAACTAGCG AGAGCACTCATAGAAGGGAAACGACCGTTTTTGTGGGTTATAACTGATAAATCCAACAGA GAAACGAAAACAGAAGGAGAAGAAGAGACAGAGATTGAGAAGATAGCTGGATTCAGACAC GAGCTTGAAGAGGTTGGGATGATTGTGTCGTGGTGTTCGCAGATAGAGGTTTTAAGTCAC CGAGCCGTAGGTTGTTTTGTGACTCATTGTGGGTGGAGCTCGACGCTGGAGAGTTTGGTT CTTGGCGTTCCGGTTGTGGCGTTTCCGATGTGGTCGGATCAACCGACGAACGCGAAGCTA CTGGAAGAAAGTTGGAAGACTGGTGTGAGGGTAAGAGAGAACAAGGATGGTTTGGTGGAG AGAGGAGAGATCAGGAGGTGTTTGGAAGCCGTGATGGAGGAGAAGTCGGTGGAGTTGAGG GAAAACGCAAAGAAATGGAAGCGTTTAGCGATGGAAGCGGGTAGAGAAGGAGGATCTTCG GATAAGAACATGGAGGCTTTTGTGGAGGATATTTGTGGAGAATCTCTTATTCAAAACTTG TGTGAAGCAGAGGAGGTAAAAGTAAAGTAA
>UGT75B2
ATGGCGCAACCGCATTTTCTACTGGTAACGTTTCCGGCGCAAGGTCACGTGAACCCATCT CTCCGTTTTGCTCGTCGGCTCATCAAAACAACTGGCGCACGTGTAACTTTCGCCACGTGT CTCTCTGTCATTCACCGCTCTATGATCCCAAACCACAACAACGTCGAAAATCTCTCTTTC CTTACTTTCTCCGACGGATTCGACGACGGAGTCATCTCCAACACCGACGACGTCCAAAAC CGGTTGGTACACTTCGAACGTAATGGCGATAAAGCTCTATCGGATTTCATCGAAGCTAAT CAGAATGGTGACTCTCCCGTAAGTTGCTTGATCTACACGATTCTTCCCAACTGGGTTCCA AAAGTGGCGCGTAGATTTCATCTTCCCTCTGTTCATCTCTGGATCCAACCAGCCTTCGCT TTCGACATTTATTACAATTACTCTACAGGAAACAACTCCGTTTTCGAGTTCCCGAATCTA CCTTCTCTCGAAATCCGCGATCTGCCTTCTTTCCTCTCACCTTCCAACACGAACAAAGCC GCACAAGCAGTATATCAAGAACTGATGGATTTTCTCAAAGAAGAATCTAACCCGAAAATT CTCGTCAACACATTCGATTCGCTGGAGCCAGAGTTCTTAACAGCTATTCCGAATATAGAA ATGGTGGCAGTTGGTCCTTTACTTCCTGCGGAGATTTTCACTGGAAGCGAATCAGGTAAA GATTTATCAAGAGATCATCAAAGTAGTAGTTATACACTTTGGTTAGACTCGAAAACAGAG TCCTCTGTTATTTATGTTTCTTTTGGAACAATGGTTGAGTTGTCGAAGAAACAGATAGAG GAACTAGCGAGAGCACTCATAGAAGGGGGAAGACCGTTCTTGTGGGTTATAACTGATAAA CTCAACAGAGAAGCGAAAATAGAAGGAGAAGAAGAGACAGAGATTGAGAAGATAGCTGGT TTTAGACACGAGCTTGAAGAGGTTGGGATGATTGTCTCGTGGTGTTCGCAGATAGAGGTT TTGAGACACCGAGCCATAGGTTGTTTTTTGACTCATTGTGGGTGGAGCTCATCACTGGAG AGTTTGGTTCTCGGCGTTCCAGTGGTGGCGTTTCCGATGTGGTCGGATCAGCCAGCAAAT GCGAAGCTTTTGGAAGAAATATGGAAGACAGGTGTGAGGGTGAGAGAGAACTCGGAAGGT TTAGTAGAGAGAGGAGAGATAATGCGGTGTTTGGAAGCAGTGATGGAGGCGAAATCGGTG GAGCTGAGGGAAAACGCAGAGAAATGGAAGCGTTTAGCGACTGAAGCGGGTAGAGAAGGA GGATCTTCGGACAAGAATGTGGAAGCTTTTGTGAAGAGTCTGTTTTGA
>UGT75C1
ATGGCCACTTCCGTCAATGGTTCCCATCGTCGTCCACATTACTTGCTTGTAACATTCCCA GCGCAAGGTCACATCAACCCGGCGCTTCAACTAGCCAACCGCCTCATCCACCACGGTGCA ACCGTCACATACTCCACCGCAGTCTCTGCTCACCGACGTATGGGCGAGCCACCTTCCACA AAAGGTCTATCCTTCGCTTGGTTCACCGATGGATTCGACGACGGTCTCAAGTCATTCGAA GACCAGAAAATCTACATGTCCGAACTCAAACGATGTGGTTCAAACGCCCTGAGAGACATC ATCAAAGCCAATCTTGACGCCACCACCGAAACAGAGCCTATCACCGGGGTAATCTACTCT GTTCTCGTCCCGTGGGTTTCTACGGTAGCGCGTGAGTTTCACCTCCCAACTACACTTCTC TGGATTGAACCAGCTACTGTACTAGACATCTACTACTACTACTTCAACACCTCTTACAAA CATCTCTTCGACGTTGAACCGATTAAATTACCGAAACTGCCACTGATCACCACCGGTGAC CTCCCGTCGTTTCTTCAACCTTCGAAGGCATTACCGTCAGCTCTTGTGACTCTAAGAGAA CATATCGAAGCTCTCGAAACGGAATCAAACCCTAAGATTCTTGTTAACACATTCTCTGCT TTGGAACACGATGCTTTAACCTCTGTTGAGAAACTCAAGATGATCCCAATCGGACCGTTG GTTTCTTCCTCCGAGGGTAAAACCGATCTTTTCAAATCTTCCGACGAGGATTACACGAAA TGGTTAGACTCGAAGCTCGAGAGATCAGTGATTTACATTTCCTTAGGCACACACGCCGAT GATTTACCAGAGAAACACATGGAAGCGCTTACTCACGGCGTGTTAGCTACAAACAGACCG TTTTTATGGATCGTGAGGGAGAAAAATCCAGAAGAGAAGAAGAAGAATCGGTTTCTTGAA TTGATCAGAGGAAGTGATCGAGGATTGGTGGTGGGATGGTGTTCTCAGACAGCTGTTTTG GCGCATTGTGCTGTGGGATGTTTTGTGACTCATTGTGGTTGGAATTCGACGTTGGAGAGT TTAGAGAGTGGTGTTCCGGTGGTTGCGTTTCCGCAGTTTGCTGATCAGTGTACAACGGCG AAGCTTGTGGAGGATACGTGGAGGATTGGAGTGAAGGTGAAGGTTGGGGAGGAAGGAGAT GTGGATGGGGAGGAGATTAGAAGGTGTTTGGAGAAGGTGATGAGTGGTGGAGAAGAGGCG GAGGAGATGAGAGAGAATGCAGAGAAGTGGAAGGCGATGGCTGTTGATGCGGCAGCGGAA GGTGGACCGTCGGATTTGAATCTTAAAGGTTTTGTGGACGAGGATGAGTAG
>UGT75D1
ATGGCCAACAACAATTCCAACTCTCCCACCGGTCCACACTTTCTATTCGTAACATTTCCA GCCCAAGGTCACATCAACCCATCTCTCGAGCTAGCCAAACGCCTCGCCGGAACAATCTCT GGTGCTCGAGTCACCTTCGCCGCCTCAATCTCTGCCTACAACCGCCGCATGTTCTCTACA GAAAACGTCCCCGAAACCCTAATCTTCGCTACCTACTCCGATGGCCACGACGACGGTTTC AAATCCTCTGCTTACTCCGACAAATCTCGTCAAGACGCCACTGGAAACTTCATGTCTGAG ATGAGACGACGTGGCAAAGAGACACTAACCGAACTAATCGAAGATAACCGGAAACAAAAC AGGCCTTTTACTTGCGTGGTTTACACGATTCTCCTCACTTGGGTCGCTGAGCTAGCGCGT GAGTTTCATCTTCCTTCTGCTCTTCTTTGGGTCCAACCAGTAACAGTCTTCTCCATTTTT TACCATTACTTCAATGGCTACGAAGATGCAATCTCAGAGATGGCTAATACCCCCTCTAGT TCTATTAAATTACCTTCTCTGCCACTGCTTACTGTCCGTGATATTCCTTCTTTCATTGTC TCTTCCAATGTCTACGCGTTTCTTCTACCCGCGTTTCGAGAACAGATTGATTCACTGAAG GAAGAAATAAACCCTAAGATCCTCATCAACACTTTCCAAGAGCTTGAGCCAGAAGCCATG AGCTCGGTTCCAGATAATTTCAAGATTGTCCCTGTCGGTCCGTTACTAACGTTGAGAACG GATTTTTCGAGTCGCGGTGAATACATAGAGTGGTTGGATACTAAAGCGGATTCGTCTGTG CTTTATGTTTCGTTCGGGACGCTTGCCGTGTTGAGCAAGAAACAGCTTGTGGAGCTTTGT AAAGCGTTGATACAAAGTCGGAGACCATTCTTGTGGGTGATTACGGATAAGTCGTACAGA AATAAAGAAGATGAGCAAGAGAAGGAAGAAGATTGCATAAGTAGTTTCAGAGAAGAGCTC GATGAGATAGGAATGGTGGTTTCATGGTGTGATCAGTTTAGGGTTTTGAATCATAGATCG ATAGGTTGTTTCGTGACGCATTGCGGGTGGAACTCTACGCTGGAGAGCTTGGTTTCAGGA GTTCCGGTGGTGGCGTTTCCGCAATGGAATGATCAGATGATGAACGCGAAGCTTTTAGAA GATTGTTGGAAAACAGGTGTAAGAGTGATGGAGAAGAAGGAAGAAGAAGGAGTTGTGGTG GTGGATAGTGAGGAGATACGGCGGTGCATTGAGGAAGTTATGGAAGACAAGGCGGAGGAG TTTAGAGGAAATGCCACGAGGTGGAAGGATTTAGCGGCGGAGGCTGTGAGAGAAGGAGGC TCTTCCTTTAATCATCTCAAAGCTTTTGTCGATGAGCACATGTGA
>UGT76B1
ATGGAGACTAGAGAAACAAAACCAGTGATCTTTCTCTTCCCTTTCCCTTTACAAGGTCAC TTAAACCCAATGTTTCAGCTCGCCAACATCTTCTTCAACAGAGGCTTCTCCATCACTGTG ATCCACACTGAGTTCAACTCTCCAAACTCTTCCAATTTCCCTCATTTCACTTTCGTATCC ATCCCCGATAGCTTGTCTGAACCTGAATCCTATCCCGATGTCATCGAGATTCTCCATGAC CTCAATTCCAAGTGTGTTGCTCCTTTTGGTGATTGCTTAAAGAAGCTTATATCTGAAGAA CCAACAGCAGCTTGTGTGATTGTTGACGCTCTTTGGTACTTCACTCACGATTTAACCGAG AAATTCAATTTCCCGAGGATTGTTCTCCGAACCGTTAACCTCTCAGCTTTCGTCGCTTTC TCAAAGTTTCATGTTTTACGAGAGAAAGGGTATCTTTCTTTACAAGAGACTAAGGCAGAC TCACCGGTTCCGGAGCTTCCGTATCTTAGAATGAAGGATCTTCCATGGTTCCAGACAGAA GATCCAAGATCAGGGGATAAGTTACAGATAGGTGTGATGAAGTCACTAAAGTCTTCCTCA GGAATCATATTCAACGCCATTGAAGATCTTGAAACAGATCAGCTTGATGAAGCCCGCATA GAATTCCCAGTTCCACTCTTCTGTATTGGACCCTTTCACAGGTACGTTTCAGCTTCATCC AGTAGCTTACTTGCACACGACATGACTTGTCTCTCCTGGTTAGACAAGCAAGCAACAAAT TCCGTAATCTACGCAAGTCTTGGAAGCATTGCTTCGATCGATGAATCTGAATTCTTGGAG ATTGCTTGGGGTCTAAGAAACAGCAACCAACCTTTTCTATGGGTGGTTAGACCCGGTTTA ATCCACGGGAAAGAATGGATCGAGATTCTGCCTAAAGGGTTCATCGAAAATCTCGAGGGC CGGGGTAAAATAGTGAAATGGGCACCTCAGCCTGAAGTTTTAGCTCACCGTGCAACAGGC GGATTCTTAACACATTGTGGATGGAACTCAACACTTGAGGGCATATGTGAAGCTATACCA ATGATATGCAGACCATCTTTTGGGGACCAGAGGGTGAATGCTAGATACATTAACGATGTT TGGAAGATCGGATTGCATTTGGAAAACAAGGTAGAGAGACTAGTGATCGAAAACGCGGTT AGAACACTAATGACGAGCTCGGAAGGGGAAGAGATCCGCAAGAGGATTATGCCCATGAAG GAAACTGTTGAACAATGCCTTAAGCTTGGAGGTTCATCATTTCGGAATCTCGAAAACTTA ATTGCTTATATATTGTCTTTCTAA
>UGT76C1
ATGGAGAAGAGAAACGAGAGACAAGTGATTCTTTTTCCTCTACCATTACAAGGTTGCATA AACCCTATGCTTCAGCTAGCAAAGATCCTTTACTCAAGAGGTTTTTCGATCACCATCATC CACACGCGCTTCAACGCGCCCAAATCTTCAGACCATCCTCTCTTCACTTTCTTACAAATC CGCGACGGCTTGTCTGAATCTCAGACTCAATCTCGTGATCTTTTGCTTCAACTCACGCTT CTCAACAACAATTGTCAGATCCCATTTCGAGAGTGTTTGGCTAAACTCATTAAACCTAGT TCAGATTCAGGAACAGAGGATAGGAAAATTAGCTGTGTGATCGATGATTCCGGTTGGGTT TTCACACAATCCGTGGCGGAGAGTTTTAATCTTCCTCGATTTGTCCTCTGTGCTTATAAG TTCTCTTTCTTTCTCGGACATTTTCTTGTTCCTCAGATTCGTCGTGAAGGGTTTCTTCCA GTACCAGATTCGGAGGCAGATGATCTAGTTCCTGAGTTTCCACCGCTTCGAAAGAAAGAT CTTTCGAGAATTATGGGAACCAGCGCTCAGAGTAAGCCTCTAGATGCTTACTTGCTTAAG ATACTCGACGCGACGAAGCCAGCTTCAGGGATTATAGTTATGTCCTGCAAAGAGCTTGAC CATGATTCACTTGCTGAGTCCAACAAAGTTTTCAGCATTCCGATATTTCCCATTGGCCCT TTTCACATTCATGACGTCCCAGCCTCGTCTAGCAGCTTGTTAGAACCGGACCAGAGTTGC ATTCCATGGTTAGATATGCGTGAAACGAGATCAGTAGTCTACGTGAGCTTAGGGAGCATT GCGAGTCTTAACGAGTCTGACTTCTTGGAGATTGCTTGTGGACTAAGAAACACCAACCAA TCCTTCTTGTGGGTTGTCCGGCCTGGTTCAGTCCATGGCAGAGATTGGATCGAATCATTA CCTTCAGGGTTCATGGAAAGTCTCGATGGTAAAGGAAAGATAGTGAGATGGGCACCGCAG CTAGACGTTCTTGCGCATAGAGCCACGGGAGGGTTTTTGACTCATAATGGATGGAACTCG ACATTAGAGAGTATATGCGAAGGAGTACCTATGATCTGCTTGCCTTGTAAGTGGGACCAA TTTGTAAACGCGAGATTCATAAGCGAAGTTTGGAGGGTTGGGATTCACTTGGAAGGTCGG ATAGAGCGAAGAGAAATCGAGAGAGCTGTTATAAGACTAATGGTTGAGTCGAAAGGAGAA GAGATTCGAGGTAGAATCAAAGTCTTGCGAGACGAAGTAAGAAGGTCAGTTAAACAAGGA GGTTCGTCATATCGATCTTTAGATGAGTTGGTTGATCGTATATCAATCATCATCGAGCCA CTAGTGCCTACGTGA
>UGT76C2
ATGGAGGAGAAGAGAAATGGTCTGCGTGTGATTCTCTTCCCTCTTCCATTACAAGGTTGC ATCAACCCTATGCTTCAGCTCGCCAACATCCTTCACGTAAGAGGCTTCTCCATTACCGTG ATCCACACGCGCTTCAACGCGCCAAAAGCTTCAAGCCATCCTCTCTTCACTTTCTTACAG ATTCCTGATGGTTTGTCTGAAACGGAGATTCAAGATGGTGTTATGTCTTTGCTCGCGCAA ATCAACCTTAACGCTGAGTCTCCGTTTCGTGATTGCTTGCGTAAAGTGTTGCTGGAATCA AAAGAGTCAGAGAGGGTTACTTGTTTGATCGATGACTGTGGATGGCTCTTCACACAATCT GTTTCAGAGAGTTTGAAGCTTCCGAGGCTCGTTCTCTGTACTTTTAAAGCCACTTTCTTC AATGCTTATCCGAGTCTTCCACTTATCCGAACCAAGGGATATCTTCCAGTTTCAGAATCG GAAGCAGAGGACTCTGTTCCTGAGTTCCCGCCGCTTCAAAAGAGAGATCTTTCAAAGGTT TTCGGGGAGTTCGGAGAGAAACTCGATCCGTTCTTACATGC'TGTAGTCGAAACGACAATA AGATCTTCAGGGTTAATATACATGTCCTGCGAAGAGCTTGAGAAAGATTCGTTGACTCTT TCTAACGAAATTTTTAAAGTTCCGGTTTTTGCAATTGGTCCGTTTCACAGCTACTTCTCT GCTTCGTCAAGCAGCTTGTTCACACAAGACGAGACTTGCATTCTGTGGTTAGATGATCAA GAAGATAAATCTGTGATCTACGTTAGTCTAGGAAGCGTTGTGAACATAACGGAAACAGAG TTCTTGGAGATTGCGTGTGGTTTAAGCAATAGCAAACAGCCTTTCTTGTGGGTAGTACGA CCCGGTTCAGTACTCGGCGCGAAATGGATCGAACCGCTCTCTGAAGGGCTGGTTAGTAGC CTTGAAGAGAAAGGAAAGATTGTGAAATGGGCACCACAACAGGAGGTTCTTGCGCATCGT GCCACAGGAGGGTTTTTGACACACAATGGTTGGAACTCAACGCTAGAGAGTATATGCGAA GGGGTTCCTATGATCTGCCTACCAGGAGGTTGGGATCAAATGCTGAATTCAAGATTTGTT AGCGATATTTGGAAGATTGGAATTCACTTGGAAGGTCGGATTGAAAAAAAGGAGATTGAG AAAGCTGTGAGGGTGTTAATGGAGGAAAGTGAAGGAAATAAGATTCGTGAGAGAATGAAA GTTCTGAAAGATGAGGTCGAGAAATCGGTCAAACAAGGAGGCTCATCTTTTCAATCTATT GAGACTCTAGCTAATCATATACTATTGTTGTAA
>UGT76C3
ATGGATAAGAGTAATGGCCTACGAGTGATTCTGTTTCCACTTCCATTACAAGGATGCATC AACCCCATGATTCAGCTAGCGAAGATCCTCCACTCAAGAGGTTTCTCCATCACTGTGATC CACACGCGCTTCAATGCGCCAAAAGCTTCAAACCACCCTCTGTTCACCTTCTTACAGATC CCAGATGGCTTGTCTGAAACAGAGACAAGAACTCACGATATCACACTTCTCCTAACGCTT CTCAACCGAAGCTGTGAGTCTCCATTTCGTGAATGTTTGACTAAACTTTTGCAGTCTGCA GATTCAGAAACAGGGGAAGAGAAACAGAGGATTAGCTGTTTGATCGATGATTCTGGATGG ATATTCACACAGCCCGTTGCTCAGAGTTTCAATCTCCCGAGATTGGTCCTTAACACCTAC AAAGTCTCCTTCTTTCGGGACCATTTTGTTCTTCCTCAACTCCGTCGTGAAATGTATCTT CCATTACAAGATTCAGAACAAGGTGATGATCCAGTTGAGGAGTTTCCACCCCTTCGAAAG AAAGATCTTTTACAAATTCTTGATCAAGAATCGGAGCAACTAGACTCGTACTCCAATATG ATTTTGGAAACAACAAAAGCGTCTTCAGGTCTTATATTTGTATCCACATGTGAAGAGTTG GACCAAGACTCACTGAGTCAAGCACGTGAAGATTATCAAGTCCCAATCTTTACGATAGGA CCTTCTCATAGCTACTTCCCAGGCTCATCTAGTAGCTTGTTCACAGTGGACGAGACTTGC ATTCCATGGTTAGACAAGCAAGAAGACAAATCCGTGATTTACGTGAGTTTTGGGAGCATC TCGACCATTGGCGAAGCAGAATTCATGGAGATTGCTTGGGCTCTAAGAAACAGCGACCAA CCGTTCTTGTGGGTCGTACGGGGTGGTTCGGTAGTCCATGGTGCAGAATGGATCGAACAG CTTCATGAGAAAGGAAAGATAGTGAATTGGGCCCCACAACAAGAGGTTCTAAAGCATCAA GCCATTGGAGGATTCTTGACACACAATGGTTGGAACTCGACGGTTGAGAGTGTTTTTGAA GGCGTCCCTATGATATGTATGCCTTTTGTATGGGACCAATTGCTTAATGCAAGATTTGTT AGTGATGTATGGATGGTTGGGCTGCATCTAGAGGGTCGGATTGAGAGGAATGTGATTGAG GGAATGATAAGAAGATTATTTTCGGAAACTGAAGGAAAAGCGATCCGAGAGAGGATGGAA ATTCTTAAGGAGAATGTAGGAAGATCCGTTAAACCAAAAGGTTCGGCGTATCGATCGTTA CAACATTTGATTGATTATATAACATATTTCTAG
>UGT76C4
ATGGAGAAGAGTAATGGCCTGCGAGTGATTCTGTTTCCACTTCCATTACAAGGCTGCATC AACCCTATGATTCAGCTCGCCAAGATCCTCCACTCAAGAGGTTTTTCAATCACTGTGATC CACACTTGCTTCAACGCGCCAAAAGCTTCAAGCCATCCACTCTTCACCTTCATACAGATC CAAGATGGCTTGTCTGAAACAGAGACAAGAACTCGCGACGTCAAACTTCTCATAACACTT CTCAACCAAAATTGCGAGTCTCCGGTTCGTGAATGTTTGCGTAAACTGTTGCAATCTGCC AAGGAAGAGAAACAGAGGATTAGCTGTTTGATCAATGATTCTGGTTGGATCTTCACTCAA CACTTAGCCAAGAGTTTGAATCTCATGAGATTGGCCTTTAATACCTATAAGATCTCCTTC TTTCGAAGCCATTTTGTTCTTCCTCAGCTCCGGCGTGAAATGTTTCTTCCATTACAAGAT TCAGAACAAGATGATCCAGTTGAGAAGTTTCCACCGCTTAGAAAGAAAGATCTTTTACGG ATTCTTGAAGCAGATTCGGTGCAGGGAGACTCGTACTCGGATATGATTTTGGAAAAGACA AAGGCGTCTTCAGGTCTTATATTCATGTCCTGTGAAGAGTTGGACCAAGACTCACTGAGT CAATCACGTGAAGATTTTAAGGTTCCGATATTTGCGATAGGACCTTCTCATAGCCATTTT CCTGCTTCTTCTAGTAGCTTGTTCACACCGGACGAGACTTGCATCCCATGGTTAGACAGA CAAGAAGACAAATCCGTAATATACGTGAGTATTGGGAGCCTCGTGACCATCAACGAAACA GAGCTAATGGAGATTGCTTGGGGTCTAAGTAACAGCGACCAACCATTTTTATGGGTCGTC CGGGTTGGTTCAGTCAATGGCACGGAATGGATTGAAGCAATCCCGGAATATTTCATCAAA AGGCTTAATGAGAAGGGAAAGATAGTGAAATGGGCTCCACAACAAGAGGTTCTAAAGCAT CGAGCTATTGGAGGTTTCTTGACACATAATGGTTGGAACTCGACGGTTGAGAGTGTTTGT GAAGGCGTCCCTATGATCTGTTTGCCTTTTCGTTGGGACCAATTGTTAAATGCAAGATTT GTTAGTGATGTATGGATGGTTGGGATACATCTCGAGGGTCGGATTGAGAGGGATGAGATC GAGAGAGCGATAAGGAGATTATTGTTGGAAACTGAAGGAGAAGCCATCCGAGAGAGGATA CAACTTCTTAAGGAAAAAGTAGGAAGATCAGTTAAACAAAACGGTTCGGCATATCAATCT CTACAAAATTTGATTAATTATATATCATCTTTCTAG
>UGT76C5
ATGGAGAAGAGTAATGGCCTTCGAGTGATTCTGTTTCCACTTCCATTACAAGGCTGCATC AACCCCATGATTCAGCTCGCCAAGATCCTCCACTCAAGAGGTTTCTCCATCACTGTGATC CACACGTGCTTCAACGCGCCAAAAGCTTCAAGCCATCCTCTCTTCACCTTCTTAGAGATC CCAGATGGCTTGTCCGAAACAGAGAAAAGAACTAACAATACCAAACTTCTCCTAACGCTT CTCAACCGGAACTGTGAGTCTCCGTTTCGTGAATGTTTGAGTAAACTGTTGCAGTCTGCA GATTCAGAAACAGGGGAAGAGAAACAGAGGATTAGCTGTTTGATCGCTGATTCTGGATGG ATGTTCACACAACCCATTGCTCAGAGTTTGAAACTCCCAATATTGGTCCTCAGTGTGTTT ACAGTCTCCTTCTTTCGCTGCCAATTTGTTCTTCCTAAGCTTCGGCGTGAAGTGTATCTT CCACTTCAAGATTCAGAACAGGAGGATCTAGTTCAAGAGTTTCCGCCGCTTCGAAAGAAG GATATTGTACGTATTCTTGATGTAGAAACAGATATACTAGATCCATTCTTGGACAAAGTT CTACAAATGACAAAGGCGTCTTCAGGTCTTATATTCATGTCATGTGAAGAGTTGGACCAC GACTCAGTGAGTCAGGCACGTGAAGATTTCAAAATTCCTATCTTTGGGATTGGACCATCT CACAGCCACTTTCCAGCTACCTCTAGTAGCTTGTCCACACCCGACGAGACTTGCATTCCA TGGTTAGACAAACAAGAAGACAAATCCGTGATTTACGTCAGTTACGGGAGCATCGTGACC ATCAGCGAATCAGATTTAATAGAGATTGCTTGGGGTCTAAGAAACAGCGACCAACCCTTC TTGTTGGTCGTACGGGTTGGTTCAGTCCGTGGCAGAGAATGGATCGAGACAATCCCGGAA GAGATCATGGAAAAGCTTAATGAGAAGGGAAAGATAGTGAAATGGGCTCCGCAACAAGAC GTTCTAAAGCATCGAGCCATTGGGGGATTCCTGACACATAATGGTTGGAGCTCGACTGTT GAGAGTGTTTGTGAAGCAGTCCCTATGATCTGTTTGCCTTTTCGTTGGGACCAAATGCTA AATGCAAGATTTGTTAGCGATGTATGGATGGTCGGGATAAACCTAGAGGATCGGGTTGAA AGGAATGAGATCGAGGGAGCGATAAGGAGATTATTGGTGGAACCTGAAGGAGAAGCCATC CGAGAGAGGATAGAACATCTTAAGGAGAAAGTAGGACGATCGTTTCAACAAAACGGTTCC GCATATCAATCGTTACAAAATTTGATTGATTATATATCATCTTTTTAG
>UGT76D1
ATGGCAGAGATTCGCCAGAGAAGAGTGTTGATGGTCCCAGCACCGTTCCAAGGCCATTTA CCTTCGATGATGAATCTAGCGTCCTACCTTTCTTCCCAAGGCTTTTCAATCACAATCGTT AGAAACGAATTCAATTTCAAAGATATCTCCCATAATTTCCCTGGTATAAAATTCTTCACC ATCAAGGACGGCTTGTCAGAATCTGACGTGAAGTCTCTGGGTCTCCTTGAATTTGTCCTG GAGCTTAACTCTGTCTGTGAACCCCTATTGAAAGAGTTTCTAACCAACCATGATGATGTT GTTGACTTTATCATTTATGATGAATTTGTTTACTTCCCTCGACGTGTTGCGGAAGATATG AATCTGCCAAAGATGGTCTTTAGCCCTTCTTCCGCCGCTACCTCGATCAGCCGGTGTGTG CTTATGGAGAACCAATCAAATGGGTTACTTCCTCCACAAGACGCAAGATCTCAACTAGAA GAAACGGTGCCAGAGTTTCATCCCTTTCGTTTCAAAGATCTGCCTTTTACAGCTTATGGA TCTATGGAGAGATTAATGATACTTTACGAGAATGTAAGCAATAGAGCCTCATCTTCTGGC ATAATACACAACTCTTCGGATTGCTTAGAGAACTCATTCATAACAACTGCACAAGAGAAA TGGGGAGTTCCGGTATACCCGGTTGGTCCACTCCATATGACCAATTCCGCAATGTCATGT CCAAGTTTATTTGAAGAAGAAAGAAACTGTCTTGAATGGCTTGAGAAGCAAGAAACAAGC TCAGTGATCTACATAAGCATGGGGAGCTTGGCGATGACACAAGATATAGAGGCTGTGGAG ATGGCCATGGGATTTGTCCAGAGTAATCAACCCTTCTTGTGGGTGATCCGACCAGGCTCT ATAAACGGACAAGAATCTTTAGACTTCTTACCGGAACAGTTCAACCAAACGGTGACCGAT GGAAGAGGTTTTGTTGTGAAATGGGCCCCACAAAAAGAGGTATTAAGGCATAGAGCAGTG GGAGGGTTTTGGAACCATGGTGGATGGAACTCGTGCTTGGAGAGCATAAGCAGTGGTGTA CCAATGATTTGTAGGCCGTATTCTGGTGATCAGAGGGTGAATACTCGACTTATGTCACAT GTTTGGCAAACCGCGTATGAGATCGAAGGTGAATTGGAAAGAGGAGCTGTTGAGATGGCC GTGAGGAGGCTCATTGTGGATCAAGAAGGTCAGGAGATGAGAATGAGAGCCACCATATTG AAGGAAGAGGTTGAAGCCTCTGTCACAACCGAAGGCTCTTCTCACAATTCTTTAAACAAT TTGGTCCATGCAATAATGATGCAAATTGACGAACAATGA
>UGT76E1
ATGGAAGAACTAGGAGTGAAGAGAAGGATAGTATTGGTTCCAGTTCCAGCACAAGGTCAT GTAACTCCGATTATGCAACTCGGGAAGGCTCTTTACTCCAAGGGCTTCTCCATCACTGTT GTTCTCACACAGTATAATCGAGTTAGCTCATCCAAGGACTTCTCTGATTTTCATTTCCTC ACCATCCCAGGCAGCTTGACCGAGTCTGATCTCAAAAACCTTGGACCATTCAAGTTTCTC TTCAAGCTCAATCAAATTTGCGAGGCAAGCTTCAAGCAATGTATTGGTCAACTATTGCAG GAGCAAGGTAATGATATCGCTTGTGTCGTCTACGATGAGTACATGTACTTCTCCCAAGCT GCAGTTAAAGAGTTTCAACTTCCTAGCGTCCTCTTCAGCACGACAAGTGCTACTGCCTTT GTCTGTCGCTCTGTTTTGTCTAGAGTCAACGCAGAGTCATTCTTGCTTGACATGAAAGAT CCCAAAGTGTCAGACAAGGAATTTCCAGGGTTGCATCCGCTAAGGTACAAGGACCTGCCA ACTTCAGCATTTGGGCCATTAGAGAGTATACTCAAGGTTTACAGTGAGACTGTCAACATT CGAACAGCTTCGGCAGTTATCATCAACTCAACAAGCTGTCTAGAGAGCTCATCTTTGGCA TGGTTACAAAAACAACTGCAAGTTCCAGTGTATCCTATAGGCCCACTTCACATTGCAGCT TCAGCGCCTTCTAGTTTACTTGAAGAGGACAGGAGTTGCCTTGAGTGGTTGAACAAGCAA AAAATAGGCTCAGTGATTTACATAAGTTTGGGAAGCTTGGCTCTAATGGAAACTAAAGAC ATGTTGGAGATGGCTTGGGGTTTACGTAATAGCAACCAACCTTTCTTATGGGTGATCCGA CCGGGTTCTATTCCCGGCTCGGAATGGACAGAGTCTTTACCGGAGGAATTCAGTAGGTTG GTTTCAGAAAGAGGTTACATTGTGAAATGGGCACCACAGATAGAAGTTCTCAGACATCCT GCAGTGGGAGGGTTTTGGAGTCACTGCGGATGGAACTCGACCCTAGAGAGCATCGGGGAA GGAGTTCCGATGATCTGTAGGCCTTTTACGGGAGATCAGAAAGTCAATGCGAGGTACTTA GAGAGAGTTTGGAGAATTGGGGT'TCAATTGGAAGGAGAGCTGGATAAAGGAACAGTGGAG AGAGCTGTAGAGAGATTGATTATGGATGAAGAAGGAGCAGAAATGAGGAAGAGAGTTATC AACTTGAAAGAGAAGCTTCAAGCCTCTGTCAAGAGTAGAGGTTCCTCATTCAGCTCATTA GACAACTTTGTCAATTCCTTAAAAATGATGAATTTCATGTAG
>UGT76E11
ATGGAGGAAAAGCCGGCGGGCAGAAGAGTAGTGTTGGTTGCAGTTCCAGCTCAAGGACAT ATCTCTCCAATAATGCAACTTGCAAAAACACTTCACTTGAAGGGTTTCTCAATCACAATC GCTCAGACAAAGTTCAATTACTTTAGCCCTTCAGATGACTTCACTGATTTTCAGTTTGTC ACCATTCCAGAAAGCTTACCAGAGTCTGATTTTGAGGATCTCGGGCCAATAGAGTTTCTG CATAAGCTCAACAAAGAGTGTCAGGTGAGCTTCAAAGACTGTTTGGGTCAGTTGTTGCTG CAACAAGGTAATGAGATAGCCTGTGTTGTCTACGACGAGTTCATGTACTTTGCTGAAGCT GCAGCCAAAGAGTTTAAGCTTCCAAACGTCATTTTCAGCACCACAAGTGCCACGGCTTTT GTTTGCCGCTCTGCATTCGACAAACTTTATGCAAACAGTATCCTGACTCCCTTGAAAGAA CCCAAAGGACAACAAAACGAGCTAGTGCCAGAGTTTCATCCCCTGAGATGCAAAGACTTT CCGGTTTCACATTGGGCATCATTAGAAAGCATGATGGAGCTGTATAGGAATACAGTTGAC AAACGGACAGCTTCCTCGGTGATAATCAACACAGCGAGCTGTCTAGAGAGCTCATCTCTG TCTCGTCTGCAGCAACAGCTACAAATTCCAGTTTATCCTATAGGCCCTCTTCACCTGGTG GCATCAGCTTCTACGAGTCTTCTTGAAGAGAACAAGAGCTGTATTGAATGGTTGAACAAA CAAAAGAAAAACTCTGTGATATTCGTAAGCTTGGGAAGCTTAGCTTTGATGGAAATCAAT GAGGTGATAGAAACTGCTTTGGGATTGGATAGTAGCAAGCAACAGTTCTTGTGGGTCATT CGGCCAGGGTCAGTACGTGGTTCGGAATGGATAGAGAACTTGCCTAAGGAGTTTAGTAAG ATAATTTCGGGTCGAGGTTACATTGTGAAATGGGCTCCACAGAAGGAAGTACTTTCTCAT CCTGCAGTAGGAGGATTTTGGAGCCATTGCGGATGGAACTCGACACTAGAGAGCATCGGG GAAGGAGTTCCAATGATTTGCAAGCCGTTTTCCAGTGATCAAATGGTGAATGCGAGATAC TTGGAGTGTGTATGGAAAATTGGGATTCAAGTTGAGGGTGATCTAGACAGAGGAGCGGTC GAGAGAGCTGTGAGGAGGTTAATGGTGGAGGAAGAAGGGGAGGGGATGAGGAAGAGAGCT ATCAGTTTGAAAGAGCAACTTAGAGCCTCTGTTATAAGTGGAGGTTCTTCACACAACTCG CTAGAGGAGTTTGTACACTACATGAGGACTCTATGA >UGT76E12
ATGGAGGAAAAGCCTGCAAGGAGAAGCGTAGTGTTGGTTCCATTTCCAGCACAAGGACAT ATATCTCCAATGATGCAACTTGCCAAAACCCTTCACTTAAAGGGTTTCTCGATCACAGTT GTTCAGACTAAGTTCAATTACTTTAGCCCTTCAGATGACTTCACTCATGATTTTCAGTTC GTCACCATTCCAGAAAGCTTACCAGAGTCTGATTTCAAGAATCTCGGACCAATACAGTTT CTGTTTAAGCTCAACAAAGAGTGTAAGGTGAGCTTCAAGGACTGTTTGGGTCAGTTGGTG CTGCAACAAAGTAATGAGATCTCATGTGTCATCTACGATGAGTTCATGTACTTTGCTGAA GCTGCAGCCAAAGAGTGTAAGCTTCCAAACATCATTTTCAGCACAACAAGTGCCACGGCT TTCGCTTGCCGCTCTGTATTTGACAAACTATATGCAAACAATGTCCAAGCTCCCTTGAAA GAAACTAAAGGACAACAAGAAGAGCTAGTTCCGGAGTTTTATCCCTTGAGATATAAAGAC TTTCCAGTTTCACGGTTTGCATCATTAGAGAGCATAATGGAGGTGTATAGGAATACAGTT GACAAACGGACAGCTTCCTCGGTGATAATCAACACTGCGAGCTGTCTAGAGAGCTCATCT CTGTCTTTTCTGCAACAACAACAGCTACAAATTCCAGTGTATCCTATAGGCCCTCTTCAC ATGGTGGCCTCAGCTCCTACAAGTCTGCTTGAAGAGAACAAGAGCTGCATCGAATGGTTG AACAAACAAAAGGTAAACTCGGTGATATACATAAGCATGGGAAGCATAGCTTTAATGGAA ATCAACGAGATAATGGAAGTCGCGTCAGGATTGGCTGCTAGCAACCAACACTTCTTATGG GTGATCCGACCAGGGTCAATACCTGGTTCCGAGTGGATAGAGTCCATGCCTGAAGAGTTT AGTAAGATGGTTTTGGACCGAGGTTACATTGTGAAATGGGCTCCACAGAAGGAAGTACTT TCTCATCCTGCAGTAGGAGGGTTTTGGAGCCATTGTGGATGGAACTCGACACTAGAAAGC ATCGGCCAAGGAGTTCCAATGATCTGCAGGCCATTTTCGGGTGATCAAAAGGTGAACGCT AGATACTTGGAGTGTGTATGGAAAATTGGGATTCAAGTGGAGGGTGAGCTAGACAGAGGA GTGGTCGAGAGAGCTGTGAAGAGGTTAATGGTTGACGAAGAAGGAGAGGAGATGAGGAAG AGAGCTTTCAGTTTAAAAGAGCAACTTAGAGCCTCTGTTAAAAGTGGAGGCTCTTCACAC AACTCGCTAGAAGAGTTTGTACACTTCATAAGGACTCTATGA
>UGT76E2
ATGGAGGAAAAGCAAGTGAAGGAGACAAGGATAGTGTTGGTTCCAGTTCCAGCTCAAGGT CATGTAACTCCGATGATGCAACTAGGAAAAGCTCTTCACTCAAAGGGTTTCTCCATCACT GTTGTTCTGACACAGTCTAATCGAGTTAGCTCTTCCAAAGACTTCTCTGATTTCCATTTC CTCACCATCCCAGGCAGCTTAACTGAGTCTGATCTCCAAAACCTAGGACCACAAAAGTTT GTGCTCAAGCTCAATCAAATTTGTGAGGCAAGCTTCAAGCAGTGTATAGGTCAACTATTG CATGAACAATGTAATAATGATATTGCTTGTGTCGTCTACGATGAGTACATGTACTTCTCT CATGCTGCAGTAAAAGAGTTTCAACTTCCTAGTGTCGTCTTTAGCACGACAAGTGCTACT GCTTTTGTCTGTCGCTCTGTTTTGTCTAGAGTCAACGCAGAGTCGTTCTTGATCGACATG AAAGATCCTGAAACACAAGACAAAGTATTTCCAGGGTTGCATCCTCTGAGGTACAAGGAT CTACCAACTTCAGTATTTGGGCCAATAGAGAGTACGCTCAAGGTTTACAGTGAGACTGTG AACACTCGAACAGCTTCCGCTGTTATCATCAACTCAGCAAGCTGTTTAGAGAGCTCATCT TTGGCAAGGTTGCAACAACAACTGCAAGTTCCGGTGTATCCTATAGGCCCACTTCATATT ACAGCTTCAGCGCCTTCTAGTTTACTAGAAGAAGACAGGAGTTGCGTTGAGTGGTTGAAC AAGCAAAAATCAAATTCAGTTATTTACATAAGCTTGGGAAGCTTGGCTCTAATGGACACC AAAGACATGTTGGAGATGGCTTGGGGATTAAGTAATAGCAACCAACCTTTCTTATGGGTG GTCAGACCGGGCTCTATTCCGGGGTCAGAATGGACAGAGTCCTTACCAGAGGAATTCAAT AGGTTGGTTTCAGAAAGAGGTTACATTGTGAAATGGGCTCCGCAGATGGAAGTTCTCAGA CATCdTGCAGTAGGAGGGTTTTGGAGTCACTGTGGATGGAACTCAACAGTAGAGAGCATC GGGGAAGGAGTTCCGATGATATGTAGGCCTTTCACCGGGGATCAGAAAGTCAATGCGAGG TACTTAGAGAGAGTTTGGAGAATTGGGGTTCAATTGGAGGGAGATCTGGATAAAGAAACT GTGGAGAGAGCTGTAGAGTGGTTGCTTGTGGATGAAGAAGGAGCAGAAATGAGGAAGAGA GCCATTGACTTGAAAGAAAAGATTGAAACCTCTGTTAGAAGTGGAGGTTCCTCATGCAGC TCACTAGACGACTTTGTTAATTCCATGTGA >UGT76E3
ATGGAGAAAAGAGTAGAGAAGAGAAGGATAGTGTTGGTTCCACTTCCATTACTAGGACAT TTCACTCCGATGATGCAACTCGGCCAAGCCCTTATCTTGAAGGGATTCTCAATTATAGTT CCTCAGGGAGAATTCAATCGAGTAAACTCTTCGCAGAAGTTCCCTGGTTTTCAATTTATC ACCATACCAGATTCTGAACTCGAGGCAAATGGACCAGTCGGGTCTCTAACACAGCTCAAC AAAATTATGGAGGCAAGCTTCAAGGACTGTATAAGGCAGTTGTTGAAACAACAAGGCAAT GATATTGCATGTATCATCTACGACGAGTTCATGTATTTTTGTGGAGCCGTAGCTGAGGAG TTGAAGCTTCCCAATTTCATCTTCAGTACTCAAACTGCTACACATAAAGTTTGCTGCAAT GTTTTAAGCAAACTTAATGCCAAGAAGTACTTGATCGACATGGAAGAGCATGACGTGCAA AACAAGGTAGTGGAAAATATGCATCCATTAAGATACAAAGACTTACCAACTGCAACATTT GGAGAACTAGAACCTTTTTTGGAGCTCTGTAGAGATGTAGTCAACAAAAGAACAGCCTCT GCTGTTATCATCAACACCGTGACCTGTCTAGAGAGCTCGTCTCTCACAAGGCTGCAACAA GAACTCCAAATTCCGGTGTATCCATTAGGCCCTCTTCACATTACAGATTCATCGACAGGA TTTACTGTGCTGCAAGAGGATAGGAGCTGCGTTGAATGGCTGAACAAGCAGAAACCAAGG TCTGTCATATACATAAGTTTAGGAAGCATGGTTCTCATGGAAACCAAGGAGATGTTAGAG ATGGCTTGGGGAATGTTGAATAGCAACCAACCTTTCTTATGGGTCATCCGACCTGGATCT GTCTCAGGCTCCGAGGGGATAGAGTCATTGCCAGAGGAAGTCAGTAAGATGGTTTTAGAG AAAGGATACATTGTGAAATGGGCACCACAAATAGAAGTACTAGGACATCCCTCAGTGGGA GGCTTTTGGAGCCACTGTGGATGGAACTCAACACTCGAGAGCATTGTGGAAGGAGTTCCA ATGATTTGCAGGCCTTATCAAGGCGAGCAGATGTTAAATGCAATATATCTAGAGAGTGTA TGGAGAATAGGGATTCAGGTAGGAGGTGAACTGGAAAGAGGAGCCGTCGAGAGAGCTGTG AAGAGGTTGATTGTGGATAAAGAAGGTGCAAGCATGAGGGAGAGAACCCTTGTTTTAAAA GAGAAGCTCAAAGCCTCTATTAGAGGTGGAGGCTCCTCATGCAATGCATTAGATGAGCTT GTCAAGCACTTGAAGACAGAGTGA
>UGT76E4
ATGGAGAAAAGGGTAGAGAAGAGAAGGATTGTGTTAGTTCCGGTTGCTGCACAAGGACAT GTAACCCCAATGATGCAGCTTGGGAAAGCCCTTCAATCAAAGGGCTTCTTAATTACTGTT GCTCAGAGACAGTTCAATCAAATAGGCTCATCATTGCAACACTTTCCTGGTTTTGACTTT GTCACCATACCAGAAAGCTTACCTCAGTCTGAATCTAAGAAACTAGGACCAGCTGAGTAT CTTATGAATCTCAACAAAACAAGCGAGGCAAGCTTCAAGGAGTGTATAAGTCAGTTATCG ATGCAACAAGGCAATGATATAGCATGTATCATCTATGACAAGCTTATGTACTTCTGTGAA GCAGCAGCTAAGGAGTTTAAGATTCCTAGTGTTATCTTCAGCACTAGCAGTGCTACAATT CAAGTTTGCTACTGTGTTTTAAGTGAACTCAGTGCCGAGAAGTTCTTGATCGACATGAAA GATCCTGAAAAGCAAGATAAGGTGTTGGAAGGTTTGCATCCTTTAAGGTACAAAGACCTA CCAACTTCAGGATTTGGACCATTAGAGCCACTTTTGGAGATGTGTAGGGAAGTAGTTAAC AAAAGAACAGCTTCCGCTGTTATCATCAACACGGCGAGCTGTCTAGAGAGCTTGTCTCTG TCATGGCTGCAACAAGAACTTGGAATTCCAGTGTATCCATTAGGCCCTCTTCACATTACA GCTTCATCGCCGGGACCTAGTTTACTGCAAGAGGACATGAGCTGCATTGAATGGCTGAAC AAGCAGAAACCAAGGTCAGTCATATACATAAGCTTGGGAACCAAAGCTCACATGGAGACC AAGGAGATGTTAGAGATGGCCTGGGGATTGTTGAATAGCAACCAACCTTTCTTATGGGTC ATCCGACCTGGCTCTGTTGCAGGCTTCGAGTGGATAGAGTTATTACCAGAGGAAGTCATT AAGATGGTAACAGAAAGAGGATACATAGCGAAATGGGCACCGCAGATAGAAGTACTTGGA CATCCTGCAGTGGGAGGATTCTGGAGCCACTGTGGATGGAACTCAACACTCGAGAGTATT GTGGAAGGAGTCCCAATGATTTGCAGGCCTTTACAAGGCGAACAAAAGTTAAATGCGATG TATATAGAAAGTGTTTGGAAAATAGGGATTCAACTTGAAGGTGAAGTGGAAAGGGAAGGT GTAGAGAGAGCTGTGAAGAGGTTGATCATAGATGAAGAAGGTGCAGCCATGAGGGAGAGG GCTCTTGATTTAAAAGAGAAGCTCAATGCCTCGGTAAGAAGTGGAGGCTCCTCATACAAC GCACTGGATGAGCTTGTCAAGTTCTTGAATACAGAGTGA
>UGT76E5 ATGGAGAAAAATGCAGAGAAGAAAAGAATAGTGTTGGTTCCATTTCCATTACAAGGACAT ATCACTCCAATGATGCAACTTGGTCAAGCACTTAACCTGAAAGGCTTCTCGATTACCGTT GCTCTTGGAGATTCCAATCGAGTAAGTTCTACGCAACACTTCCCTGGTTTTCAATTTGTC ACAATACCTGAAACCATACCACTATCTCAACACGAGGCACTCGGAGTTGTCGAGTTTGTG GTTACGCTCAACAAAACAAGCGAGACAAGTTTCAAGGACTGTATAGCTCATTTGTTGCTG CAACATGGAAATGATATTGCTTGTATCATTTACGACGAGCTCATGTACTTCTCTGAAGCT ACAGCTAAGGATTTAAGGATTCCTAGTGTCATATTCACCACTGGTAGTGCTACAAATCAT GTTTGTTCTTGTATTTTAAGCAAACTCAACGCCGAGAAGTTCTTGATCGACATGAAAGAT CCTGAAGTGCAAAACATGGTGGTGGAAAATTTACATCCACTAAAATACAAAGACTTACCA ACTTCAGGAATGGGGCCGCTAGAGCGATTTTTGGAGATTTGTGCCGAAGTTGTCAACAAA AGAACAGCTTCCGCTGTTATAATCAATACGTCAAGTTGTCTAGAGAGCTCGTCTCTGTCA TGGCTGAAACAAGAACTCAGTATTCCAGTGTATCCATTAGGCCCTCTTCACATTACAACT TCAGCAAATTTTAGTTTACTTGAAGAGGACAGGAGCTGCATTGAATGGCTGAACAAGCAG AAACTGAGGTCAGTTATATACATAAGCGTAGGAAGCATAGCTCACATGGAAACCAAGGAA GTATTGGAGATGGCTTGGGGATTGTATAATAGCAACCAACCTTTTCTATGGGTAATCCGA CCCGGTACAGAGTCAATGCCAGTGGAAGTCAGTAAGATTGTCTCGGAAAGAGGATGCATT GTGAAATGGGCGCCACAGAATGAAGTACTTGTGCATCCTGCAGTGGGAGGTTTCTGGAGC CACTGTGGATGGAACTCAACACTCGAGAGTATTGTGGAAGGAGTTCCAATGATTTGCAGA CCGTTTAACGGTGAGCAGAAGTTAAACGCGATGTATATAGAAAGTGTTTGGAGAGTAGGG GTTCTGCTTCAAGGAGAAGTGGAGAGAGGATGTGTAGAGAGAGCTGTGAAGAGGTTGATT GTGGATGATGAAGGTGTAGGAATGAGGGAGAGAGCCCTTGTTTTAAAAGAGAAGCTCAAT GCCTCTGTAAGAAGTGGAGGCTCTTCATACAATGCATTGGATGAGCTCGTCCATTACTTG GAGGCAGAGTATAGAAATACTTGA
>UGT76E6
ATGGAGAAAATGGAAGAGAAGAAAAGGATAGTGTTAGTTCCGGTTCCAGCACAAAGACAT GTAACTCCAATGATGCAGCTTGGCACAGCCCTAAACATGAAGGGCTTCTCTATTACTGTT GTTGAAGGACAGTTCAATAAAGTAAGCTCATCTCAAAACTTTCCTGGTTTTCAATTTGTA ACCATACCAGATACAGAGAGCTTGCCAGAGTCTGTGCTCGAGAGACTCGGACCGGTCGAG TTTTTATTCGAGATCAACAAAACCAGTGAGGCAAGCTTCAAGGACTGTATAAGGCAGTCG TTGCTGCAACAAGGCAATGATATAGCATGTATCATCTACGACGAGTATATGTACTTCTGT GGAGCTGCAGCTAAGGAGTTCAACCTTCCTAGTGTAATATTCAGCACACAAAGTGCTACT AATCAAGTTTCCCGTTGCGTTTTAAGAAAACTCAGTGCCGAGAAGTTCTTGGTGGACATG GAAGGTATCCTGAAGTGCAGGAAACGTTGGTGGAAAATTTGCATCCATTAAGATACAAAG ACCTACCAACTTCAGGAGTTGGGCCACTAGATCGATTATTTGAGCTCTGTAGGGAAATAG TCAACAAAAGAACAGCTTCCGCTGTTATCATCAACACAGTGAGATGTCTAGAGAGCTCGT CTCTGAAACGTCTGCAACATGAACTCGGGATTCCGGTGTACGCATTAGGCCCTCTTCACA TTACAGTTTCAGCAGCTTCTAGTTTACTGGAAGAGGACAGGAGCTGCGTTGAATGGTTGA ACAAGCAAAAACCGAGGTCAGTCGTTTACATAAGCTTGGGGAGCGTAGTTCAAATGGAAA CCAAAGAAGTGTTAGAGATGGCTCGGGGTTTATTTAATAGCAACCAGCCTTTCTTATGGG TCATTCGGCCTGGCTCTATCGCAGGCTCCGAATGGATAGAGTCACTGCCAGAGGAAGTCA TTAAGATGGTCTCCGAAAGAGGGTATATTGTGAAATGGGCACCACAGATAGAAGTACTTG GACATCCTGCAGTGGGAGGATTCTGGAGCCACTGTGGATGGAACTCAACGCTTGAAAGCA TTGTGGAAGGAGTTCCAATGATATGCAGGCCCTTTCATGGCGAGCAAAAGTTAAACGCAC TGTGTTTAGAGAGTATTTGGAGAATAGGGTTTCAGGTGCAAGGTAAGGTAGAGAGGGGAG GGGTCGAGAGAGCTGTGAAGAGGTTGATAGTGGATGAAGAAGGTGCAGACATGAGAGAGA GAGCCCTTGTTTTAAAAGAGAATCTCAAAGCCTCTGTAAGAAATGGAGGCTCCTCATACA ACGCATTGGAGGAGATCGTTAACCTCATGTAG
>UGT76E7
ATGGAGGAGAAGCTCTCGAGGAGAAGAAGAGTAGTGTTGGTTCCAGTTCCAGCTCAAGGA CATATAACTCCAATGATACAACTTGCAAAAGCACTTCACTCAAAAGGCTTCTCTATTACA GTTGTTCAAACCAAGTTCAACTACTTAAACCCTTCAAATGATTTGTCTGATTTTCAGTTT GTAACCATCCCAGAGAACTTACCAGTGTCTGATCTTAAGAATCTAGGACCAGGACGGTTT CTGATTAAGCTAGCTAATGAGTGTTATGTTAGCTTTAAGGATTTGTTAGGTCAGTTGTTG GTTAATGAAGAAGAAGAGATCGCTTGTGTTATCTACGACGAGTTCATGTACTTTGTTGAA GTAGCAGTTAAAGAGTTTAAGCTTCGTAATGTTATTTTAAGTACTACAAGTGCAACGGCT TTTGTTTGTCGCTTTGTTATGTGTGAACTCTATGCTAAAGATGGTTTGGCTCAACTTAAA GAAGGCGGTGAGCGAGAAGTGGAGTTAGTACCGGAGTTGTATCCTATACGGTACAAAGAT TTACCAAGTTCGGTATTTGCATCTGTAGAATCTTCAGTGGAGTTGTTTAAGAATACATGT TATAAAGGGACAGCTTCCTCTGTGATAATCAACACAGTGAGGTGTCTAGAGATGTCATCT TTGGAGTGGCTTCAACAAGAACTTGAAATCCCGGTGTATTCTATAGGCCCGCTTCATATG GTGGTGTCAGCTCCTCCTACGAGTCTTTTAGAAGAGAACGAGAGCTGTATAGAATGGTTG AACAAACAAAAGCCGAGCTCGGTGATATACATAAGCTTGGGAAGTTTTACTTTGATGGAA ACTAAAGAAATGTTGGAGATGGCTTATGGGTTTGTTAGTAGTAACCAACACTTCTTGTGG GTGATTCGACCGGGATCTATATGTGGTTCTGAAATCTCTGAGGAAGAGTTGTTGAAGAAG ATGGTAATTACGGATCGAGGTTACATTGTGAAATGGGCGCCGCAAAAACAAGTGCTTGCA CATTCTGCGGTTGGAGCGTTCTGGAGTCATTGTGGATGGAACTCGACTTTAGAAAGTCTT GGTGAAGGAGTTCCATTGATATGTAGGCCTTTTACTACTGATCAAAAGGGGAATGCAAGG TACTTGGAGTGTGTGTGGAAAGTAGGAATTCAAGTGGAGGGTGAGCTAGAGAGAGGCGCA ATCGAGAGAGCTGTGAAGAGGTTAATGGTGGATGAAGAAGGAGAAGAGATGAAGAGAAGA GCTCTAAGTTTAAAAGAGAAACTCAAAGCCTCTGTTTTAGCTCAAGGTTCTTCACATAAA TCACTAGATGACTTCATCAAGACTCTGTGA
>UGT76E9
ATGGAGGAAAAGCAAGAGAGGAGGAGAAGGATCGTGTTGATTCCCGCTCCAGCACAAGGA CACATATCTCCGATGATGCAACTTGCAAGAGCCCTTCACTTAAAGGGCTTCTCCATTACA GTTGCTCAAACCAAGTTCAATTACTTGAAGCCTTCAAAAGACTTAGCTGATTTTCAGTTT ATCACCATCCCAGAGAGCTTACCAGCCTCGGATCTTAAGAATCTAGGACCAGTTTGGTTT CTTCTTAAACTCAATAAAGAGTGTGAGTTTAGCTTCAAGGAGTGTTTAGGTCAATTGTTG CTGCAAAAACAACTTATACCGGAAGAAGAGATCGCTTQTGTCATCTACGACGAGTTCATG TACTTTGCTGAAGCTGCAGCCAAAGAGTTTAACCTTCC'CAAAGTTATTTTCAGTACCGAA AATGCGACGGCTTTTGCTTGTCGCTCTGCCATGTGCAAACTCTATGCAAAAGATGGTTTG GCTCCCCTTAAAGAAGGATGTGGGCGAGAAGAGGAGCTAGTGCCAAAGTTGCATCCCCTT AGATACAAAGACCTACCAACTTCAGCATTTGCACCAGTAGAAGCCTCAGTGGAAGTGTTT AAAAGTTCATGTGATAAAGGGACAGCTTCCGCTATGATAATCAACACAGTGAGGTGTCTA GAGATATCATCCTTGGAGTGGCTTCAACAAGAACTTAAGATTCCGATATATCCTATAGGC CCTCTTCACATGGTTTCTTCAGCTCCTCCTACGAGTCTACTAGACGAGAATGAGAGTTGC ATTGATTGGCTGAACAAACAAAAGCCGAGCTCGGTGATTTACATAAGTTTGGGAAGCTTT ACTTTGTTGGAAACTAAAGAAGTGTTGGAAATGGCTTCGGGCTTGGTTAGTAGTAACCAA CACTTCTTGTGGGTGATTCGACCCGGGTCCATACTTGGTTCTGAATTGACTAATGAGGAA TTATTGAGTATGATGGAAATACCGGATCGAGGCTACATTGTGAAATGGGCTCCACAAAAG CAAGTGCTTGCACATTCTGCGGTTGGAGCATTTTGGAGTCATTGTGGATGGAACTCGACT CTAGAGAGCATGGGTGAAGGAGTTCCGATGATTTGTAGGCCTTTTACTACTGATCAAAAG GTAAATGCGCGGTATGTGGAGTGTGTCTGGAGAGTTGGGGTTCAAGTGGAGGGTGAACTA AAGAGAGGAGTAGTCGAGAGAGCTGTGAAGAGGTTACTGGTGGATGAAGAAGGAGAAGAG ATGAAGTTGAGAGCTCTCAGTTTGAAAGAGAAACTCAAAGTTTCTGTTCTACCGGGAGGT TCTTCACACAGTTCACTAGATGACTTAATCAAGACTCTATGA
>UGT76F1-
ATGGAAGAGAGAAAAGTGAAGAGAATTATCATGTTCCCTCTACCGTTTACAGGACACTTC AACCCTATGATCGAGCTTGCTGGAATATTCCACAACCGTGGCTTCTCCGTCACGATACTC CACACTTCTTTCAACTTCCCGGATCCTTCTCGCCATCCACAGTTTACTTTTCGAACTATC ACTCACAAAAACGAAGGAGAAGAAGACCCTCTCTCTCAATCAGAAACTTCTTCGGGTAAG GACCTCGTCGTCCTTATTAGTCTGCTGAAACAATACTACACCGAGCCGTCTCTTGCAGAG GAAGTAGGCGAAGGAGGGACGGTGTGTTGTTTGGTCTCCGACGCTCTATGGGGGAGGAAC ACGGAGATTGTAGCGAAAGAGATTGGAGTGTGTACAATGGTGATGAGGACTAGTGGTGCG GCAACGTTTTGTGCTTATACAGCTTTCCCTCTCCTTATAGATAAGGGTTACCTTCCTATA CAAGGTTCTAGATTAGATGAGCTAGTGACAGAGCTTCCACCTTTGAAAGTGAAGGATCTT CCTGTAATAAAAACGAAAGAGCCTGAGGGACTAAACCGAATACTTAACGACATGGTGGAA GGAGCCAAGTTATCTTCCGGAGTCGTATGGAACACATTTGAAGATCTTGAAAGACATTCA CTCATGGATTGTCGCAGCAAGTTACAAGTTCCGTTGTTCCCAATCGGACCGTTTCACAAA CATAGAACCGATCTTCCACCGAAGCCAAAGAACAAGGACAAGGACGATGATGAAATATTA ACCGATTGGCTTAACAAGCAAGCTCCGCAGTCTGTGGTCTATGTGAGTTTTGGAAGCCTT GCAGCTATAGAAGAGAATGAGTTTTTCGAAATTGCTTGGGGTCTAAGAAACAGCGAACTA CCATTCTTGTGGGTGGTTAGGCCCGGGATGGTCCGGGGAACCGAGTGGCTTGAGTCATTG CCTTGTGGGTTTTTGGAAAATATTGGTCATCAGGGAAAAATTGTGAAATGGGTGAATCAA CTAGAGACATTGGCCCATCCTGCGGTTGGAGCGTTTTGGACGCACTGTGGATGGAACTCA ACAATAGAGAGCATATGTGAAGGTGTTCCAATGATATGTACGCCGTGTTTCTCGGACCAG CATGTGAACGCGAGGTACATCGTTGATGTATGGCGAGTCGGGATGATGTTAGAGAGATGT AAGATGGAAAGGACGGAGATTGAGAAGGTAGTAACAAGTGTAATGATGGAGAATGGAGCT GGATTGACAGAGATGTGTTTGGAGTTGAAAGAGAAAGCTAATGTTTGCTTAAGTGAAGAT GGGTCTTCTTCCAAGTATCTAGACAAACTTGTCAGTCATGTCCTGTCTTTTGATTCCTCG GCTTTTGCAAGTTAA
>UGT76F2
ATGGAAGAGAGAAAAGGGAGGAGAATAATCATGTTCCCTCTTCCAT TCCAGGGCACTTC AACCCCATGATCGAGCTCGCTGGAATATTCCACCACCGTGGCTTCTCCGTGACGATCCTC CACACTTCCTACAACTTCCCCGATCCTTCTCGCCACCCACACTTCACTTTTCGAACCATC TCTCACAACAAAGAAGGAGAAGAAGATCCTCTGTCTCAGTCAGAAACTTCGAGTATGGAC CTAATCGTTCTCGTTCGTCGGCTGAAACAACGCTACGCCGAACCGTTTCGTAAGTCTGTG GCGGCGGAAGTAGGTGGAGGAGAGACGGTGTGTTGTTTGGTCTCCGACGCTATATGGGGG AAGAACACGGAGGTTGTAGCGGAAGAGATTGGAGTTCGTAGGGTGGTGTTGAGGACAGGT GGTGCGTCGTCGTTTTGTGCTTTTGCCGCTTTCCCTCTCCTTAGGGATAAGGGTTACCTC CCTATACAAGATTCTAGATTAGATGAGCCAGTGACAGAGCTTCCACCTTTGAAAGTGAAG GATCTTCCGGTAATGGAAACGAATGAGCCGGAGGAACTTTACCGGGTAGTTAACGACATG GTGGAAGGAGCCAAGTCTTCTTCAGGAGTCATATGGAACACATTTGAAGATCTTGAAAGA CTATCACTTATGAATTGTAGCAGCAAATTACAAGTTCCATTTTTCCCGATCGGACCGTTT CACAAATATAGCGAAGATCCTACACCGAAGACAGAGAACAAGGAAGATACCGATTGGCTC GACAAGCAAGACCCACAGTCGGTGGTCTATGCGAGTTTCGGAAGCCTTGCAGCTATAGAA GAGAAGGAGTTTCTCGAGATTGCTTGGGGTCTAAGAAACAGTGAACGACCGTTTTTGTGG GTGGTTAGGCCGGGGTCTGTCAGGGGGACCGAGTGGCTCGAGTCATTGCCTTTAGGGTTT ATGGAAAACATTGGAGATAAGGGAAAAATCGTGAAATGGGCGAATCAGTTAGAGGTATTG GCGCATCCTGCCATTGGAGCGTTTTGGACACATTGTGGATGGAACTCGACACTAGAGAGC ATATGTGAAGGTGTTCCTATGATATGTACGTCATGTTTCACGGACCAGCATGTGAACGCG AGATACATCGTTGATGTATGGCGAGTCGGGATGTTGTTAGAGAGAAGTAAGATGGAAAAG AAGGAGATTGAAAAGGTGCTAAGAAGTGTAATGATGGAGAAGGGAGATGGATTGAGGGAA AGGAGTTTGAAGTTGAAAGAGAGAGCTGATTTTTGCTTAAGTAAAGATGGGTCTTCTTCC AAGTATTTAGACAAACTTGTGAGTCATGTCCTGTCTTTTGATTCTTATGCTTTTGCAAGT TAA
>UGT78D1
ATGACCAAATTCTCCGAGCCAATCAGAGACTCCCACGTGGCAGTTCTCGCGTTTTTCCCC GTTGGCGCTCATGCCGGTCCTCTCTTAGCCGTCACTCGCCGTCTCGCCGCCGCTTCTCCC TCCACCATCTTTTCTTTCTTCAACACCGCAAGATCAAACGCGTCGTTGTTCTCCTCTGAT CATCCCGAGAACATCAAGGTCCACGACGTCTCTGACGGTGTTCCGGAGGGAACCATGCTC GGGAATCCACTGGAGATGGTCGAGCTGTTTCTCGAAGCGGCTCCACGTATTTTCCGGAGC GAAATCGCGGCGGCAGAGATAGAAGTTGGAAAGAAAGTGACATGCATGCTAACAGATGCC TTCTTCTGGTTCGCAGCGGACATAGCGGCTGAGCTGAACGCGACTTGGGTTGCCTTCTGG GCCGGCGGAGCAAACTCACTCTGTGCTCATCTCTACACTGATCTCATCAGAGAAACCATC GGTCTCAAAGATGTGAGTATGGAAGAGACATTAGGGTTTATACCAGGAATGGAGAATTAC AGAGTTAAAGATATACCAGAGGAAGTTGTATTTGAAGATTTGGACTCTGTTTTCCCAAAG GCTTTATACCAAATGAGTCTTGCTTTACCTCGTGCCTCTGCTGTTTTCATCAGTTCCTTT GAAGAGTTAGAACCTACATTGAACTATAACCTAAGATCCAAACTTAAACGTTTCTTGAAC ATCGCCCCTCTCACGTTATTATCTTCTACATCGGAGAAAGAGATGCGTGATCCTCATGGC TGCTTTGCTTGGATGGGGAAGAGATCAGCTGCTTCTGTAGCGTACATTAGCTTCGGCACC GTCATGGAACCTCCTCCTGAAGAGCTTGTGGCGATAGCACAAGGGTTGGAATCAAGCAAA GTGCCGTTTGTTTGGTCGCTGAAGGAGAAGAACATGGTTCATCTACCAAAAGGGTTTTTG GATCGGACAAGAGAGCAAGGGATAGTGGTTCCTTGGGCTCCACAAGTGGAACTGCTGAAA CACGAGGCAATGGGTGTGAATGTGACACATTGTGGATGGAACTCAGTGTTGGAGAGTGTG TCGGCAGGTGTACCGATGATCGGCAGACCGATTTTGGCGGATAATAGGCTCAACGGAAGA GCAGTGGAGGTTGTGTGGAAGGTTGGAGTGATGATGGATAATGGAGTCTTCACGAAAGAA GGATTTGAGAAGTGTTTGAATGATGTTTTTGTTCATGATGATGGTAAGACGATGAAGGCT AATGCCAAGAAGCTTAAAGAAAAACTCCAAGAAGATTTCTCCATGAAAGGAAGCTCTTTA GAGAATTTCAAAATATTGTTGGACGAAATTGTGAAAGTTTAG
>UGT78D2
ATGACCAAACCCTCCGACCCAACCAGAGACTCCCACGTGGCAGTTCTCGCTTTTCCTTTC GGCACTCATGCAGCTCCTCTCCTCACCGTCACGCGCCGCCTCGCCTCCGCCTCTCCTTCC ACCGTCTTCTCTTTCTTCAACACCGCACAATCCAACTCTTCGTTATTTTCCTCCGGTGAC GAAGCAGATCGTCCGGCGAACATCAGAGTATACGATATTGCCGACGGTGTTCCGGAGGGA TACGTGTTTAGCGGGAGACCACAGGAGGCGATCGAGCTGTTTCTTCAAGCTGCGCCGGAG AATTTCCGGAGAGAAATCGCGAAGGCGGAGACGGAGGTTGGTACGGAAGTGAAATGTTTG ATGACTGATGCGTTCTTCTGGTTCGCGGCTGATATGGCGACGGAGATAAATGCGTCGTGG ATTGCGTTTTGGACCGCCGGAGCAAACTCACTCTCTGCTCATCTCTACACAGATCTCATC AGAGAAACCATCGGTGTCAAAGAAGTAGGTGAGCGTATGGAGGAGACAATAGGGGTTATC TCAGGAATGGAGAAGATCAGAGTCAAAGATACACCAGAAGGAGTTGTGTTTGGGAATTTA GACTCTGTTTTCTCAAAGATGCTTCATCAAATGGGTCTTGCTTTGCCTCGTGCCACTGCT GTTTTCATCAATTCTTTTGAAGATTTGGATCCTACATTGACGAATAACCTCAGATCGAGA TTTAAACGATATCTGAACATCGGTCCTCTCGGGTTATTATCTTCTACATTGCAACAACTA GTGCAAGATCCTCACGGTTGTTTGGCTTGGATGGAGAAGAGATCTTCTGGTTCTGTGGCG TACATTAGCTTTGGTACGGTCATGACACCGCCTCCTGGAGAGCTTGCGGCGATAGCAGAA GGGTTGGAATCGAGTAAAGTGCCGTTTGTTTGGTCGCTTAAGGAGAAGAGCTTGGTTCAG TTACCAAAAGGGTTTTTGGATAGGACAAGAGAGCAAGGGATAGTGGTTCCATGGGCACCG CAAGTGGAACTGCTGAAACACGAAGCAACGGGTGTGTTTGTGACGCATTGTGGATGGAAC TCGGTGTTGGAGAGTGTATCGGGTGGTGTACCGATGATTTGCAGGCCATTTTTTGGGGAT CAGAGATTGAACGGAAGAGCGGTGGAGGTTGTGTGGGAGATTGGAATGACGATTATCAAT GGAGTCTTCACGAAAGATGGGTTTGAGAAGTGTTTGGATAAAGTTTTAGTTCAAGATGAT GGTAAGAAGATGAAATGTAATGCTAAGAAACTTAAAGAACTAGCTTACGAAGCTGTCTCT TCTAAAGGAAGGTCCTCTGAGAATTTCAGAGGATTGTTGGATGGAGTTGTAAACATTATT TGA
>UGT78D3
ATGGCCAAACCCTCGCAGCCAACGCGAGACTCCCACGTGGCAGTTCTCGTTTTCCCCTTC GGCACTCATGCAGCTCCTCTCCTCGCCGTCACGTGCCGTCTCGCCACCGCTGCTCCCTCC ACCGTCTTCTCCTTCTTCAGCACCGCACGATCCAACTCGTCGTTACTCTCCTCCGATATC CCCACAAACATTCGTGTCCACAACGTCGATGACGGTGTTCCTGAGGGATTCGTGTTGACG GGGAATCCAGAGCACGCTGTTGAGCTGTTTCTTGAAGCGGCGCCAGAGATTTTCCGAAGA GAAATCAAGGCGGCCGAGACCGAAGTTGGTAGGAAGTTCAAGTGCATCCTTACGGATGCG TTCCTCTGGTTAGCAGCGGAGACGGCGGCTGCGGAGATGAAAGCGTCGTGGGTTGCGTAC TATGGAGGCGGAGCAACCTCGCTCACTGCTCATCTCTACACAGATGCCATCAGAGAAAAC GTCGGTGTCAAAAGTAGGTGAGCGTATGGAGGAGACAATAGGGTTTATCTCAGGAATGGA GAAGATCAGAGTCAAAGACACACAAGAAGGCGTTGTGTTTGGGAACTTAGACTCTGTTTT CTCTAAAACGTTGCACCAAATGGGTCTTGCTTTACCTCGTGCCACTGCTGTTTTCATCAA TTCCTTTGAAGAATTGGATCCTACGTTTACAAATGATTTCAGATCGGAATTCAAACGTTA CCTAAACATCGGTCCTCTCGCTTTATTATCTTCTCCATCGCAAACATCAACGCTAGTGCA CGATCCTCACGGTTGCTTGGCTTGGATCGAGAAGCGGTCCACTGCTTCTGTAGCGTACAT TGCCTTTGGTAGAGTCGCGACACCGCCTCCTGTAGAGCTTGTGGCGATAGCACAAGGATT GGAATCGAGTAAAGTGCCTTTTGTTTGGTCGCTACAAGAGATGAAAATGACTCATTTACC AGAAGGCTTTTTGGATCGGACCAGAGAGCAAGGGATGGTGGTTCCATGGGCACCACAAGT GGAGCTGCTAAACCATGAAGCAATGGGTGTGTTTGTTTCGCATGGTGGGTGGAACTCAGT GTTGGAGAGTGTGTCGGCAGGTGTACCGATGATTTGTAGACCGATTTTCGGGGATCATGC AATCAATGCAAGATCTGTGGAAGCTGTGTGGGAGATCGGAGTGACGATTAGTAGTGGAGT CTTCACGAAGGATGGATTTGAGGAGAGTTTGGATCGGGTTTTGGTTCAAGATGATGGCAA GAAGATGAAGGTTAATGCTAAAAAGCTTGAAGAACTAGCACAAGAAGCTGTCTCTACCAA AGGAAGCTCCTTTGAGAATTTTGGAGGATTGTTGGACGAAGTTGTGAACTTTGGATAA
>UGT79B1
ATGGGTGTTTTTGGATCGAATGAATCGTCAAGCATGAGTATTGTGATGTATCCGTGGTTA GCCTTTGGTCACATGACTCCTTTTCTTCACCTATCCAACAAGCTCGCAGAGAAAGGTCAC AAGATTGTTTTCTTGCTTCCCAAGAAAGCACTAAACCAGCTTGAACCTCTTAATCTCTAC CCAAATCTCATCACTTTCCACACCATCTCTATCCCTCAGGTCAAAGGGCTCCCTCCGGGT GCGGAGACAAACTCCGACGTCCCTTTCTTCTTGACACATTTGCTTGCAGTTGCAATGGAC CAAACCCGGCCAGAGGTCGAGACCATTTTCCGTACAATCAAACCGGACTTGGTTTTCTAT GATTCTGCCCATTGGATACCGGAAATTGCTAAACCGATCGGTGCTAAAACCGTTTGCTTC AACATCGTTAGCGCTGCGTCAATCGCACTGTCTCTTGTCCCTTCTGCGGAGAGAGAGGTC ATTGATGGCAAGGAAATGTCAGGGGAGGAGTTAGCTAAGACGCCTCTAGGTTACCCATCT TCGAAAGTAGTCTTACGTCCGCACGAAGCAAAATCCCTGAGTTTCGTGTGGAGGAAGCAC GAGGCGATTGGCTCTTTCTTTGATGGGAAAGTTACCGCGATGAGAAACTGCGACGCAATC GCTATAAGGACTTGCCGTGAGACAGAAGGCAAATTCTGCGATTACATAAGTAGGCAGTAC AGTAAACCGGTTTACCTAACAGGACCGGTTCTCCCTGGATCCCAACCTAATCAGCCCTCC TTAGATCCTCAATGGGCGGAGTGGCTAGCCAAATTCAACCACGGTTCGGTTGTGTTCTGC GCTTTCGGTAGCCAACCCGTTGTAAACAAGATAGATCAGTTTCAAGAACTCTGTTTAGGT CTAGAATCAACTGGTTTTCCGTTTCTGGTTGCCATTAAGCCTCCTTCGGGTGTATCAACC GTCGAGGAAGCCTTACCGGAAGGATTCAAAGAGAGGGTTCAAGGACGTGGCGTTGTGTTT GGAGGTTGGATTCAGCAACCGTTGGTGTTGAACCATCCTTCAGTGGGTTGTTTTGTTAGC CATTGCGGGTTTGGGTCGATGTGGGAGTCGTTGATGAGTGATTGTCAGATCGTTTTGGTT CCGCAGCACGGAGAACAGATTTTGAACGCAAGGCTGATGACGGAGGAGATGGAGGTGGCG GTTGAAGTGGAGAGGGAAAAGAAAGGGTGGTTCTCGCGGCAAAGCTTGGAGAATGCTGTG AAGAGTGTGATGGAGGAAGGTAGTGAGATCGGTGAGAAAGTGAGGAAGAATCATGACAAG TGGAGATGTGTTTTGACTGACTCTGGTTTTTCAGATGGTTATATTGATAAGTTTGAACAA AATTTAATTGAACTTGTGAAGTCATGA
>UGT79B10
ATGGGCCAAACGTTTCACGCCTTTATGTTCCCATGGTTCGCTTTTGGTCATATGACTCCA TACTTGCATTTAGCCAACAAGTTAGCTGAGAGAGGTCACAGAATCACTTTCTTGATCCCC AAGAAAGCTCAGAAGCAGCTTGAACATCTCAATCTGTTTCCAGACAGCATCGTCTTTCAC TCTCTTACTATTCCTCATGTTGATGGTCTCCCCGCTGGAGCCGAGACTTTCTCGGATATC CCTATGCCATTGTGGAAGTTCTTGCCCCCAGCTATAGATCTCACACGCGATCAAGTTGAA GCAGCGGTTAGTGCCTTGAGTCCGGACCTGATCTTGTTCGATATTGCTTCATGGGTTCCA GAAGTGGCTAAAGAGTATAGAGTCAAGAGTATGTTGTACAACATCATATCAGCTACTTCT ATAGCTCATGACTTTGTCCCAGGTGGTGAACTTGGAGTTCCTCCACCTGGTTATCCTTCC TCAAAGTTGTTGTACCGCAAACACGATGCTCACGCCTTGTTGTCCTTCTCCGTCTACTAC AAGAGGTTTTCTCATCGGCTCATCACAGGTCTTATGAATTGTGATTTCATTTCGATAAGG ACATGCAAAGAAATCGAGGGTAAATTCTGCGAGTATCTTGAGCGTCAATACCATAAAAAG GTTTTCTTGACGGGTCCAATGCTTCCTGAGCCAAACAAAGGTAAACCACTGGAAGATCGA TGGAGTCATTGGCTGAACGGGTTTGAACAAGGCTCTGTAGTGTTCTGTGCATTGGGAAGT CAAGTCACTCTAGAGAAGGACCAGTTCCAAGAACTTTGTTTAGGAATAGAGCTTACAGGT TTACCGTTTTTTGTAGCTGTAACACCACCAAAAGGCGCAAAGACGATTCAAGATGCGTTA CCAGAAGGGTTCGAGGAGAGGGTGAAAGATCGTGGAGTGGTTTTGGGAGAATGGGTGCAA CAACCGTTATTATTGGCTCATCCATCAGTAGGCTGCTTCTTGAGTCATTGCGGATTCGGG TCAATGTGGGAATCTATAATGAGTGATTGCCAAATAGTTTTGCTTCCATTTTTGGCTGAT CAAGTTCTCAACACAAGATTGATGACCGAAGAACTCAAGGTTTCGGTTGAAGTGCAAAGA GAAGAAACAGGATGGTTCTCGAAGGAGAGCTTGAGTGTTGCTATCACATCTGTGATGGAC CAAGCTAGTGAGATCGGGAATCTGGTGAGAAGGAACCATTCCAAATTGAAGGAGGTTTTG GTTAGTGATGGATTATTAACCGGTTACACCGATAAATTTGTTGACACTTTGGAGAATCTT GTCAGCGAGACAAAGCGTGAATGA
>UGT79B11
ATGGGCCAAAAGATTCACGCTTTTATGTTCCCCTGGTTTGCTTTTGGTCATATGACTCCG TACTTGCATCTAGGCAACAAGTTAGCCGAGAAAGGTCATAGGGTTACTTTCTTGCTACCT AAGAAAGCTCAGAAACAATTGGAACATCAGAATCTATTTCCACACGGTATCGTCTTTCAT CCTCTTGTTATTCCTCATGTTGATGGCCTCCCTGCTGGTGCCGAGACAGCCTCGGATATC CCCATCTCGTTGGTGAAGTTCTTGTCTATAGCCATGGATCTTACACGCGATCAGATCGAA GCCGCGATTGGTGCCTTGAGACCGGACCTAATCTTGTTCGATTTAGCTCACTGGGTTCCG GAAATGGCTAAAGCGCTTAAAGTCAAGAGTATGTTGTATAACGTGATGTCAGCTACCTCT ATAGCTCACGACCTTGTCCCAGGTGGTGAACTTGGAGTTGCTCCACCTGGTTATCCTTCA TCAAAGGCGTTGTACCGCGAACACGATGCTCACGCCTTGTTAACCTTCTCCGGCTTCTAC AAGAGGTTTTATCACCGGTTCACCACAGGTCTTATGAATTGCGATTTCATTTCGATTCGG ACATGTGAAGAAATCGAAGGTAAATTTTGTGACTATATTGAGAGTCAATACAAGAAGAAG GTTCTTTTAACCGGTCCAATGCTTCCCGAGCCTGACAAGAGTAAACCACTTGAAGATCAA TGGAGTCATTGGCTGAGTGGGTTTGGACAAGGCTCTGTAGTGTTCTGTGCATTGGGAAGT CAAACCATTCTAGAGAAAAACCAATTCCAAGAACTCTGTTTAGGAATAGAGCTTACGGGT TTACCATTTCTTGTCGCGGTTAAGCCACCAAAAGGCGCAAACACAATTCATGAAGCGTTA CCAGAAGGGTTCGAGGAAAGGGTGAAGGGTCGTGGAATAGTTTGGGGAGAATGGGTGCAG CAACCATCCTGGCAACCATTGATATTGGCTCATCCATCAGTAGGTTGCTTTGTGAGCCAT TGCGGATTCGGGTCAATGTGGGAATCTTTAATGAGTGATTGTCAAATAGTCTTTATTCCA GTTTTGAATGATCAAGTTCTCACCACGAGAGTAATGACGGAGGAACTCGAGGTCTCCGTT GAGGTACAGAGAGAAGAAACAGGATGGTTCTCAAAAGAAAACTTGAGTGGTGCAATCATG TCTTTGATGGACCAAGACAGCGAGATAGGGAACCAAGTGAGGAGGAACCATTCTAAATTG AAGGAGACTTTGGCTAGTCCTGGATTATTAACCGGTTACACCGATAAATTTGTTGACACT TTGGAGAATCTAGTCAACGAACAAGGATACATATCTTGA
>UGT79B2
ATGGGTGGTTTGAAGTTTCATGTACTTATGTATCCATGGTTCGCAACAGGCCATATGACC CCGTTCCTTTTTCTTGCCAACAAATTGGCTGAGAAAGGTCATACGGTCACTTTTTTGATT CCCAAGAAAGCTCTGAAACAGTTGGAAAATCTCAATCTGTTTCCACACAACATTGTCTTT CGCTCTGTCACCGTCCCTCATGTGGATGGTCTCCCCGTTGGCACAGAGACAGTCTCTGAG ATCCCCGTGACATCAGCTGATCTCTTGATGTCTGCTATGGATCTCACACGTGATCAAGTT GAAGGTGTGGTCCGAGCCGTGGAACCGGACCTGATCTTCTTTGACTTCGCTCATTGGATT CCAGAGGTAGCTAGAGACTTTGGCCTTAAGACTGTAAAGTACGTCGTGGTATCTGCATCG ACTATAGCTAGTATGCTTGTTCCAGGTGGTGAGTTAGGTGTTCCTCCGCCGGGATATCCT TCATCGAAGGTGCTGCTTCGTAAACAAGATGCTTACACCATGAAGAATCTGGAGTCTACA AATACAATCAATGTCGGACCAAACTTATTGGAAAGAGTCACTACAAGTCTTATGAACTCT GATGTCATTGCGATAAGGACAGCCAGAGAAATCGAAGGAAACTTTTGCGACTATATCGAA AAACATTGCAGGAAAAAGGTTCTCTTGACAGGTCCGGTGTTCCCTGAGCCAGACAAGACT AGAGAGCTAGAGGAACGATGGGTTAAGTGGCTAAGTGGGTATGAACCAGACTCAGTGGTG TTTTGTGCGTTGGGCTCACAAGTCATTTTAGAGAAAGATCAATTCCAAGAACTCTGCTTA GGAATGGAGCTAACAGGTTCACCGTTTCTTGTAGCGGTTAAGCCACCTAGAGGCTCATCA ACGATTCAAGAAGCACTTCCTGAAGGATTCGAGGAGAGGGTTAAAGGAAGAGGAGTTGTT TGGGGAGAATGGGTTCAACAACCATTGCTATTGTCTCATCCATCAGTCGGGTGCTTTGTG AGCCATTGTGGGTTTGGATCAATGTGGGAGTCTTTGCTGAGTGATTGTCAGATAGTCTTG GTACCACAGTTGGGTGATCAGGTCCTCAACACAAGATTGCTGAGTGACGAACTCAAGGTT TCGGTTGAAGTGGCAAGAGAGGAAACAGGATGGTTCTCGAAAGAGAGCTTGTTCGATGCT ATCAATAGTGTGATGAAAAGGGACAGTGAGATCGGGAATCTGGTGAAGAAGAATCACACC AAGTGGAGGGAGACACTAACTAGTCCTGGACTTGTGACCGGTTATGTCGATAATTTCATA GAGTCATTGCAGGATCTTGTCTCTGGGACCAACCATGTTTCGAAGTAG
>UGT79B3
ATGGGTGGTTTGAAGTTTCATGTACTTATGTATCCATGGTTCGCAACAGGCCATATGACC CCGTTCCTTTTTCTTGCCAACAAATTGGCTGAGAAAGGTCATACGGTCACTTTCTTGCTT CCCAAGAAATCTCTGAAACAGTTGGAACATTTCAATCTGTTTCCACACAACATTGTCTTT CGCTCTGTCACCGTCCCTCATGTGGATGGTCTCCCCGTTGGCACAGAGACAGCCTCTGAG ATCCCTGTGACATCAACTGATCTCTTGATGTCTGCTATGGATCTCACACGTGATCAAGTT GAAGCTGTGGTCCGAGCCGTTGAACCGGACCTGATCTTCTTTGACTTTGCTCATTGGATT CCAGAAGTAGCTAGGGACTTCGGCCTTAAGACTGTAAAGTACGTCGTGGTGTCTGCATCG ACTATAGCTAGTATGCTTGTCCCAGGTGGTGAGTTAGGTGTTCCTCCACCGGGATATCCA TCATCAAAGGTGCTGCTTCGTAAACAAGATGCTTACACTATGAAGAAACTGGAGCCTACA AATACAATCGATGTCGGACCAAACCTCTTGGAACGAGTCACTACAAGTCTTATGAACTCT GATGTCATTGCGATAAGGACAGCCAGAGAAATCGAAGGAAACTTTTGCGACTATATAGAA AAACATTGCAGGAAAAAGGTTCTCTTGACAGGTCCGGTGTTCCCTGAGCCAGACAAGACT AGAGAGCTAGAGGAACGATGGGTTAAGTGGCTAAGTGGGTATGAACCAGACTCAGTGGTG TTTTGTGCACTGGGCTCACAAGTCATTTTAGAGAAAGATCAATTCCAAGAACTCTGCTTA GGAATGGAGCTAACAGGTTCACCGTTTCTTGTAGCGGTTAAGCCCCCTAGAGGCTCATCA ACGATTCAAGAAGCACTTCCTGAAGGATTCGAAGAGCGGGTTAAAGGAAGAGGCCTTGTT TGGGGAGGATGGGTTCAACAACCATTGATATTGTCTCATCCATCAGTCGGGTGCTTTGTG AGCCATTGTGGGTTTGGATCAATGTGGGAGTCTTTGCTGAGTGATTGTCAGATAGTCTTA GTACCACAGTTGGGTGATCAAGTCCTGAACACAAGATTGCTGAGTGACGAACTCAAGGTT TCGGTTGAAGTGGCAAGAGAGGAAACAGGATGGTTCTCGAAAGAGAGCTTGTGCGATGCT GTCAATAGTGTGATGAAAAGGGACAGCGAGCTCGGGAACCTGGTGAGGAAGAATCACACC AAGTGGAGGGAGACAGTAGCTAGTCCTGGACTAATGACTGGTTATGTCGATGCTTTCGTA GAGTCATTGCAGGATCTTGTCTCTGGGACCACCCATGACTGA
>UGT79B4
ATGGGGTCAAAGTTTCATGCTTTTCTTTATCCATGGTTTGGTTTTGGTCATATGATTCCG TATCTTCATCTAGCTAACAAATTAGCTGAAAAAGGTCATAGGGTTACTTTCTTGGCTCCC AAGAAAGCTCAGAAACAACTCGAACCTCTCAACTTGTTCCCAAACAGCATTCACTTCGAG AATGTTACTCTTCCTCATGTTGATGGTCTCCCTGTTGGCGCAGAGACAACCGCGGATCTC CGGAACTCATCTAAGAGAGTCCTCGCTGATGCCATGGATCTTCTACGCGAACAGATTGAA GTTAAGATTCGTTCTTTGAAACCTGACCTAATTTTCTTCGATTTTGTTGATTGGATTCCA CAAATGGCAAAAGAATTAGGAATCAAAAGTGTAAGTTACCAGATCATATCGGCAGCTTTT ATAGCTATGTTTTTCGCTCCTCGTGCTGAATTAGGTTCTCCTCCACCTGGGTTTCCTTCA TCAAAAGTAGCATTACGTGGACATGACGCTAACATCTATTCACTCTTCGCAAACACCCGC AAATTTCTCTTTGATCGAGTCACCACAGGCCTTAAGAACTGCGACGTCATTGCCATAAGG ACATGTGCAGAAATCGAAGGTAACTTATGTGATTTCATCGAAAGACAATGTCAGAGAAAA GTTCTCTTAACCGGTCCAATGTTCCTTGATCCACAAGGGAAGAGTGGTAAGCCGCTAGAA GATCGATGGAATAATTGGTTAAACGGATTTGAACCAAGCTCGGTAGTGTACTGTGCGTTT GGCACCCATTTCTTTTTCGAGATAGATCAATTTCAAGAACTCTGTTTAGGAATGGAGCTC ACGGGTCTACCTTTTTTGGTAGCGGTTATGCCACCGAGAGGGTCTTCAACGATTCAAGAA GCATTACCAGAAGGGTTCGAAGAACGGATTAAAGGGCGTGGAATTGTTTGGGGAGGATGG GTGGAACAACCTTTGATATTGTCTCATCCATCAATAGGTTGCTTTGTGAACCATTGCGGG TTCGGTTCAATGTGGGAGTCTTTGGTTAGTGATTGCCAGATTGTGTTTATTCCACAATTG GTTGATCAAGTTCTCACAACGAGATTGTTGACCGAAGAACTCGAGGTCTCCGTGAAAGTA AAGAGAGATGAAATTACTGGTTGGTTTTCGAAGGAGAGCTTGAGGGATACGGTCAAATCT GTGATGGATAAAAATAGTGAGATTGGGAATCTAGTGAGGAGGAATCATAAGAAACTGAAG GAAACTTTGGTTAGTCCTGGATTGTTGAGTAGTTATGCTGATAAGTTTGTTGACGAATTA GAGAATCATATCCACAGTAAGAATTGA
>UGT79B5
ATGGGATCAAAATTTCATGCTTTTATGTATCCATGGTTTGGTTTTGGTCATATGATTCCA TATCTTCATTTAGCCAACAAACTAGCTGAGAAAGGTCATAGGGTCACTTTCTTCCTCCCC AAGAAAGCTCATAAGCAGCTCCAACCTCTCAATCTGTTCCCAGACAGCATTGTCTTTGAG CCTCTTACTCTCCCTCCTGTCGATGGTCTCCCTTTTGGCGCCGAGACAGCCTCGGATCTC CCAAACTCAACTAAGAAACCCATATTCGTTGCCATGGATCTCTTACGCGATCAGATCGAA GCAAAGGTCCGTGCTTTGAAACCAGATCTAATCTTTTTCGATTTTGTTCATTGGGTTCCA GAAATGGCAGAAGAGTTTGGAATAAAGAGTGTCAATTACCAGATCATATCGGCAGCTTGT GTAGCTATGGTTCTTGCACCTAGGGCTGAATTAGGGTTTCCTCCGCCGGATTATCCTTTA TCCAAAGTGGCGTTACGTGGACATGAAGCTAACGTCTGTTCTCTCTTTGCGAATTCCCAT GAGCTTTTCGGTCTGATCACCAAAGGCCTTAAGAACTGTGACGTCGTTTCCATAAGGACC TGCGTGGAACTTGAAGGTAAGCTATGCGGTTTCATCGAAAAAGAATGTCAAAAGAAACTT CTCTTAACCGGTCCAATGCTCCCTGAACCGCAAAATAAGAGTGGTAAATTTCTAGAAGAC CGATGGAATCACTGGTTAAACGGATTTGAACCAGGGTCGGTAGTGTTTTGTGCGTTTGGC ACTCAATTCTTTTTCGAGAAGGATCAATTTCAAGAATTCTGTTTAGGAATGGAGCTAATG GGTCTACCGTTTTTAATATCGGTTATGCCGCCAAAAGGCTCACCAACGGTTCAAGAAGCG TTACCAAAAGGATTCGAAGAACGGGTTAAAAAGCATGGAATCGTTTGGGAAGGATGGTTG GAACAACCTTTGATATTGTCTCATCCATCAGTAGGTTGCTTTGTGAACCATTGTGGCTTT GGTTCAATGTGGGAGTCTTTGGTTAGTGATTGTCAGATTGTGTTTATTCCACAATTGGCA GATCAAGTTCTCATCACAAGATTGTTGACTGAAGAACTCGAAGTCTCTGTGAAAGTGCAG AGAGAAGATTCCGGATGGTTCTCGAAAGAGGACTTGAGAGATACTGTTAAATCTGTGATG GATATAGATAGTGAGATTGGGAACTTAGTGAAGAGGAATCATAAGAAATTGAAAGAGACT TTAGTTAGTCCTGGATTGTTAAGTGGTTATGCTGATAAGTTTGTAGAAGCATTGGAGATT GAAGTCAACAACACCAAATTTTCTTGA
>UGT79B6
ATGGGGTCAAAGTTTCATGCTTTTATGTTCCCATGGTTTGGTTTTGGTCACATGACTGCA TTTTTGCATCTGGCTAACAAACTAGCGGAGAAAGACCACAAAATAACTTTCTTGCTCCCC AAGAAAGCTCGAAAGCAACTTGAATCTCTCAATCTCTTCCCAGACTGCATTGTCTTTCAG ACTCTTACCATCCCATCTGTAGATGGCCTCCCTGATGGTGCTGAGACAACCTCGGATATC CCGATCTCGTTAGGCAGTTTTCTCGCCTCGGCTATGGATCGGACACGCATTCAGGTCAAA GAAGCAGTTTCTGTTGGTAAACCGGATCTGATTTTCTTCGATTTTGCTCACTGGATTCCG GAAATAGCTAGAGAGTATGGAGTCAAGAGTGTCAATTTCATAACGATTTCTGCAGCATGT GTAGCTATTTCGTTCGTCCCTGGTCGTAGTCAAGATGACTTGGGTAGTACTCCACCGGGA TACCCTTCCTCCAAGGTGTTGCTTCGGGGACACGAAACCAACAGTTTGTCGTTCCTCTCC TATCCGTTTGGAGATGGAACTAGTTTTTACGAACGGATCATGATAGGACTTAAGAACTGC GATGTCATTTCGATAAGGACATGCCAAGAAATGGAAGGAAAGTTCTGCGATTTCATCGAA AACCAATTTCAAAGAAAAGTTCTCTTGACAGGTCCAATGCTTCCTGAGCCGGACAATAGC AAACCGCTAGAAGATCAATGGCGTCAGTGGCTTAGCAAGTTCGATCCGGGATCAGTAATA TATTGTGCATTGGGCAGCCAAATCATTCTTGAAAAGGATCAATTCCAAGAACTCTGTTTA GGAATGGAGCTGACAGGTTTACCATTTCTTGTAGCGGTAAAGCCACCAAAAGGTTCATCG ACAATCCAAGAAGCCTTACCAAAAGGGTTTGAAGAGAGGGTTAAAGCACGTGGAGTGGTT TGGGGAGGATGGGTGCAGCAACCATTGATATTAGCTCATCCATCAATAGGCTGCTTTGTG AGCCATTGTGGTTTCGGGTCAATGTGGGAGGCTCTAGTGAATGACTGCCAAATAGTGTTT ATTCCACATTTGGGTGAGCAAATATTGAACACAAGACTGATGAGCGAGGAACTCAAGGTC TCGGTAGAGGTGAAAAGAGAGGAAACGGGATGGTTTTCGAAGGAGAGCTTGAGCGGTGCG GTCAGGTCTGTGATGGACAGAGATAGCGAGCTCGGGAATTGGGCGAGGAGGAACCACGTA AAGTGGAAGGAGTCTCTGCTTCGTCATGGACTAATGAGTGGTTATCTTAATAAGTTCGTA GAAGCATTGGAGAAACTAGTCCAAAATATAAATCTTGAATGA
>UGT79B7
ATGGAGCCAAAGTTTCATGCTTTTATGTTTCCATGGTTTGCTTTTGGTCATATGATTCCA TTTCTACATCTTGCAAACAAACTAGCTGAAAAAGGTCACCGAGTTACTTTCTTGCTACCT AAGAAAGCACAAAAACAGTTGGAACATCACAACTTGTTCCCAGACAGTATTGTCTTTCAC CCTCTCACAGTTCCTCCTGTCAATGGCCTCCCTGCTGGTGCCGAGACAACCTCGGATATC CCCATCTCGTTGGACAACCTCTTGTCCAAAGCCTTGGATCTCACTCGCGATCAGGTTGAA GCTGCGGTTCGTGCTTTGAGACCTGACTTGATCTTTTTCGATTTTGCTCAATGGATTCCA GATATGGCTAAAGAACATATGATCAAGAGTGTGAGTTACATCATTGTATCTGCGACAACA ATAGCTCATACACATGTCCCTGGAGGTAAATTAGGTGTTCGCCCACCGGGTTATCCGTCA TCAAAGGTGATGTTCCGTGAAAACGATGTTCATGCCTTAGCAACCTTATCGATATTTTAC AAGAGACTGTATCATCAGATCACTACAGGTCTTAAGAGCTGTGATGTCATTGCATTGAGG ACTTGCAAAGAAGTCGAAGGTATGTTCTGCGACTTTATATCGCGTCAATACCATAAGAAG GTTCTCTTGACTGGTCCAATGTTCCCTGAGCCAGACACAAGTAAACCACTAGAAGAACGC TGGAATCATTTTCTAAGCGGGTTCGCGCCGAAGTCAGTAGTGTTTTGTTCACGTGGCAGC CAAGTAATTCTTGAGAAAGATCAATTCCAAGAACTCTGTTTAGGGATGGAGCTAACAGGT TTACCATTTCTTTTAGCGGTAAAGCCACCAAGAGGATCATCAACGGTCCAAGAAGGGTTA CCAGAAGGGTTCGAGGAGCGGGTGAAAGATCGTGGTGTTGTTTGGGGAGGATGGGTGCAA CAACCTTTGATATTGGCTCATCCATCAATAGGTTGCTTTGTGAACCATTGTGGTCCCGGA ACAATATGGGAGTCTTTGGTGAGTGATTGCCAAATGGTTTTGATTCCATTTTTAAGTGAT CAAGTTCTCTTCACAAGATTGATGACCGAGGAATTCGAGGTCTCTGTAGAAGTGCCGAGG GAAAAAACAGGATGGTTTTCAAAGGAGAGCTTGAGCAATGCTATCAAATCTGTGATGGAT AAAGACAGTGACATTGGGAAGTTAGTGAGGAGTAACCACACCAAATTGAAGGAGATTTTA GTTAGTCCTGGATTATTGACTGGTTACGTTGATCACTTTGTAGAGGGATTGCAAGAGAAT TTGATTTGA
>UGT79B8
ATGGAGCCAACGTTCCATGCTTTTATGTTTCCCTGGTTTGCTTTTGGTCATATGATTCCT TTTCTACATCTTGCAAACAAACTAGCTGAGAAAGGTCATCAAATCACTTTCTTGCTACCT AAGAAAGCCCAAAAACAGTTGGAACATCACAATCTGTTCCCAGACAGTATTGTCTTTCAC CCTCTCACAATCCCTCATGTCAATGGCCTCCCTGCTGGTGCTGAGACAACCTCGGATATC TCAATCTCGATGGACAACTTACTGTCGGAAGCCTTGGATCTCACTCGCGATCAGGTTGAA GCTGCGGTTCGTGCTCTGAGACCGGACTTGATCTTTTTTGATTTTGCTCATTGGATTCCA GAAATTGCCAAAGAGCATATGATCAAGAGTGTGAGTTACATGATAGTATCTGCAACAACA ATAGCTTATACATTTGCCCCTGGTGGTGTATTAGGTGTTCCCCCACCAGGTTATCCTTCA TCAAAGGTGTTGTACCGTGAAAACGATGCTCATGCCTTAGCAACCTTATCTATCTTCTAC AAGAGACTTTATCATCAGATCACTACAGGTTTTAAGAGCTGTGACATCATTGCATTGAGG ACATGTAATGAAATCGAAGGTAAATTCTGCGACTATATATCAAGTCAATACCATAAGAAG GTTCTCTTGACTGGTCCAATGCTCCCTGAGCAAGACACAAGTAAACCACTAGAAGAACAG TTGAGTCATTTTCTGAGCAGGTTCCCACCGAGGTCAGTGGTGTTTTGTGCACTTGGTAGC CAGATCGTTCTTGAAAAGGATCAATTCCAAGAACTCTGCTTAGGGATGGAGCTGACAGGT TTACCGTTTCTTATAGCGGTAAAGCCACCGAGAGGATCATCGACGGTCGAAGAAGGGTTA CCAGAAGGGTTCCAGGAGCGGGTGAAAGGGCGTGGTGTGGTTTGGGGAGGATGGGTGCAA CAACCATTGATATTGGATCATCCGTCAATAGGCTGCTTTGTGAACCATTGTGGTCCGGGA ACAATATGGGAGTGTCTTATGACTGATTGTCAAATGGTTTTGCTTCCATTTTTAGGTGAT CAAGTTCTCTTCACAAGATTGATGACCGAGGAATTCAAGGTGTCTGTAGAAGTGTCGAGA GAAAAAACAGGATGGTTTTCAAAGGAGAGCTTGAGCGATGCGATCAAGTCTGTGATGGAT AAAGATAGCGACCTCGGAAAGCTAGTGAGGAGTAACCACGCCAAATTGAAGGAGACTCTT GGTAGTCATGGATTATTAACTGGTTACGTGGATAAATTTGTAGAGGAATTGCAAGAGTAT TTGATTTGA
>UGT79B9
ATGGGCCAAAATTTTCACGCTTTTATGTTCCCATGGTTCGCTTTTGGTCATATGACTCCA TACTTGCATCTAGCCAACAAGCTAGCTGCTAAAGGTCATAGGGTTACTTTCTTGCTGCCT AAGAAAGCTCAAAAACAGTTGGAACATCACAATCTGTTTCCAGACAGGATCATCTTTCAT TCTCTTACTATTCCCCATGTTGATGGCCTACCTGCTGGCGCGGAGACCGCCTCGGACATC CCCATCTCGTTGGGGAAGTTTCTTACCGCAGCCATGGATCTCACTCGCGATCAGGTCGAA GCCGCGGTTCGTGCTTTGAGACCAGACCTGATCTTTTTCGATACTGCTTATTGGGTTCCG GAAATGGCGAAAGAACACAGAGTCAAGAGTGTGATATACTTTGTGATATCAGCTAACTCC ATAGCTCATGAACTTGTACCAGGTGGTGAATTAGGAGTTCCTCCACCTGGCTATCCTTCG TCAAAAGTGTTGTACCGTGGACACGATGCTCACGCTTTGTTGACTTTTTCCATCTTCTAC GAGAGGCTTCATTACCGGATAACAACAGGTCTAAAGAATTGTGATTTTATCTCAATTAGG ACTTGTAAAGAAATCGAAGGTAAATTCTGCGACTATATAGAGCGTCAATACCAGAGGAAG GTTCTTTTGACAGGTCCAATGCTTCCAGAGCCAGATAACAGTAGACCACTCGAAGATCGA TGGAATCACTGGCTGAATCAGTTCAAACCCGGCTCGGTAATATATTGTGCATTGGGAAGT CAAATCACTCTAGAGAAGGATCAATTCCAAGAACTCTGTTTAGGAATGGAGCTCACTGGT TTACCGTTTCTCGTAGCGGTAAAACCACCAAAAGGCGCAAAGACGATCCAAGAAGCGTTG CCAGAAGGGTTTGAGGAGAGGGTGAAGAATCATGGAGTAGTTTGGGGAGAATGGGTGCAG CAACCATTGATATTGGCTCATCCATCAGTAGGCTGCTTTGTGACCCATTGTGGGTTTGGA TCAATGTGGGAGTCTCTAGTGAGTGATTGTCAAATAGTCTTGCTTCCATATTTGTGTGAT CAAATTCTCAACACTAGATTGATGAGTGAGGAACTCGAGGTTTCGGTGGAAGTGAAAAGA GAAGAAACAGGATGGTTCTCGAAAGAGAGCTTAAGTGTTGCGATCACCTCGGTGATGGAC AAAGATAGTGAGTTAGGGAATCTGGTGAGGAGGAACCACGCTAAATTAAAGGAGGTTTTG GTTAGTCCTGGATTATTAACCGGTTACACCGATGAATTTGTTGAAACTTTGCAGAATATA GTCAACGATACAAATCTTGAATGA
>UGT82A1
ATGAAAGTAACACAAAAGCCAAAGATAATATTCATCCCTTATCCGGCGCAAGGCCACGTC ACTCCGATGCTTCACCTTGCATCGGCTTTCCTCAGCCGTGGATTCTCCCCTGTCGTTATG ACTCCCGAGTCTATCCACCGTAGGATCTCGGCTACTAACGAGGATCTTGGGATCACGTTC TTGGCCTTATCTGACGGTCAAGATCGTCCGGACGCACCTCCCTCGGACTTCTTCTCGATA GAGAACTCAATGGAGAACATCATGCCACCACAGCTCGAACGGCTCCTACTAGAAGAAGAC TTGGATGTGGCTTGTGTTGTGGTTGATTTGCTGGCTTCGTGGGCTATAGGAGTGGCTGAT CGGTGTGGAGTTCCGGTCGCCGGATTCTGGCCGGTGATGTTCGCTGCTTACCGTTTGATC CAAGCAATACCGGAGCTAGTCCGAACAGGCTTAGTTTCCCAAAAAGGTTGTCCTCGTCAA CTAGAAAAAACAATAGTCCAGCCAGAGCAACCGCTCCTATCCGCAGAAGATCTACCGTGG CTGATCGGAACTCCCAAAGCTCAGAAAAAACGATTCAAGTTCTGGCAAAGAACTCTAGAA CGAACAAAAAGTCTCCGTTGGATCTTGACAAGCTCCTTTAAAGATGAATATGAAGATGTC GACAACCACAAAGCATCCTACAAAAAATCTAACGATTTAAACAAAGAAAACAATGGTCAA AACCCTCAAATCCTTCATTTAGGTCCATTGCATAACCAAGAAGCAACAAATAATATAACT ATAACCAAGACTAGTTTTTGGGAAGAAGACATGTCTTGTCTAGGTTGGCTTCAAGAACAA AACCCGAACTCAGTCATTTATATCTCATTTGGAAGTTGGGTTTCTCCTATAGGAGAATCA AATATTCAAACGTTGGCATTGGCGTTGGAAGCGTCAGGGAGACCTTTCCTTTGGGCGTTA AACCGAGTGTGGCAAGAGGGACTACCACCAGGTTTTGTGCATAGAGTCACAATTACCAAA AACCAAGGAAGGATCGTCTCATGGGCTCCGCAACTTGAAGTTCTTAGAAACGATTCTGTG GGATGTTACGTGACTCATTGTGGCTGGAACTCGACTATGGAGGCAGTGGCAAGTTCCCGG AGGCTACTATGTTATCCGGTGGCCGGAGACCAGTTTGTTAACTGTAAATACATCGTGGAC GTTTGGAAGATTGGAGTGAGATTGAGCGGGTTTGGAGAGAAGGAGGTTGAAGATGGACTA AGGAAAGTAATGGAGGATCAAGATATGGGTGAGAGATTGAGGAAGTTAAGAGACAGAGCA ATGGGGAATGAAGCTCGTTTGAGTTCGGAAATGAATTTTACATTTTTAAAAAACGAGCTT AATTAG
>UGT83A1
ATGGATAATAACTCAAATAAAAGAATGGGAAGGCCACATGTTGTGGTCATACCTTACCCT GCACAAGGTCATGTTCTTCCTCTAATAAGTTTCTCACGTTACCTTGCGAAACAAGGAATC CAAATTACATTCATAAACACCGAGTTTAACCATAACCGCATCATCAGTTCCTTACCCAAT TCACCTCATGAAGATTATGTTGGGGATCAGATCAATCTTGTTTCAATCCCTGACGGTTTA GAAGATTCACCAGAAGAGAGGAACATTCCAGGGAAGTTGTCGGAGTCTGTTTTGCGTTTT ATGCCTAAAAAAGTAGAGGAATTGATCGAGAGGATGATGGCAGAAACTAGCGGTGGTACG ATCATTAGCTGCGTTGTAGCGGATCAGAGCTTGGGATGGGCAATTGAAGTTGCAGCTAAG TTTGGGATCAGACGCACCGCGTTTTGTCCTGCTGCAGCTGCGTCTATGGTTCTTGGATTT AGTATTCAAAAACTTATCGATGATGGTCTCATAGATTCTGATGGGACTGTGAGAGTAAAT AAGACAATTCAACTATCTCCCGGGATGCCAAAGATGGAAACAGACAAGTTTGTGTGGGTT TGTCTGAAGAACAAAGAATCTCAGAAAAACATATTCCAACTTATGCTTCAAAACAATAAC TCGATCGAGTCAACGGATTGGTTGTTGTGTAACTCTGTCCATGAACTTGAAACTGCAGCA TTTGGATTGGGCCCGAATATAGTACCAATTGGGCCCATTGGTTGGGCTCATAGTCTTGAA GAGGGATCCACGTCACTAGGAAGCTTTTTACCTCATGACCGGGATTGTCTAGATTGGTTG GACCGGCAGATTCCCGGTTCGGTTATATATGTTGCCTTTGGGAGTTTTGGGGTCATGGGC AACCCTCAGTTAGAAGAGCTAGCAATTGGTCTAGAGCTTACCAAGAGGCCAGTTTTGTGG GTCACTGGTGATCAACAACCAATCAAACTTGGGTCGGATCGAGTCAAAGTGGTGAGATGG GCTCCACAACGGGAGGTCCTTTCTTCTGGAGCCATTGGGTGTTTTGTGAGCCATTGTGGA TGGAATTCAACTCTGGAAGGAGCCCAAAATGGCATACCATTTCTATGCATCCCTTATTTT GCAGACCAATTTATCAACAAAGCATATATATGCGATGTGTGGAAGATTGGATTAGGACTT GAAAGAGACGCACGAGGAGTGGTTCCGAGGTTAGAGGTTAAGAAGAAGATCGATGAGATC ATGAGAGACGGTGGAGAGTATGAAGAACGAGCTATGAAGGTTAAAGAGATTGTGATGAAA AGTGTTGCAAAAGATGGAATATCTTGTGAGAATCTTAATAAATTTGTCAACTGGATCAAA TCACAAGTGAATTGA
>UGT84A1
ATGGTGTTCGAAACTTGTCCATCTCCAAACCCAATTCATGTAATGCTCGTCTCGTTTCAA GGACAAGGCCACGTCAACCCTCTTCTTCGTCTCGGCAAGTTAATTGCTTCAAAGGGTTTA CTCGTTACCTTCGTTACAACGGAGCTTTGGGGCAAGAAAATGAGACAAGCCAACAAAATC GTTGACGGTGAACTTAAACCGGTTGGTTCCGGTTCAATCCGGTTTGAGTTCTTTGATGAA GAATGGGCAGAGGATGATGACCGGAGAGCTGATTTCTCTTTGTACATTGCTCACCTAGAG AGCGTTGGGATACGAGAAGTGTCTAAGCTTGTGAGAAGATACGAGGAAGCGAACGAGCCT GTCTCGTGTCTTATCAATAACCCGTTTATCCCATGGGTCTGCCACGTGGCGGAAGAGTTC AACATTCCTTGTGCGGTTCTCTGGGTTCAGTCTTGTGCTTGTTTCTCTGCTTATTACCAT TACCAAGATGGCTCTGTTTCATTCCCTACGGAAACAGAGCCTGAGCTCGATGTGAAGCTT CCTTGTGTTCCTGTCTTGAAGAACGACGAGATTCCTAGCTTTCTCCATCCTTCTTCTAGG TTCACGGGTTTTCGACAAGCGATTCTTGGGCAATTCAAGAATCTGAGCAAGTCCTTCTGT GTTCTAATCGATTCTTTTGACTCATTGGAACAAGAAGTTATCGATTACATGTCAAGTCTT TGTCCGGTTAAAACCGTTGGACCGCTTTTCAAAGTTGCTAGGACAGTTACTTCTGACGTA AGCGGTGACATTTGCAAATCAACAGATAAATGCCTCGAGTGGTTAGACTCGAGGCCTAAA TCGTCAGTTGTCTACATTTCGTTCGGGACAGTTGCATATTTGAAGCAAGAACAGATCGAA GAGATCGCTCACGGAGTTTTGAAGTCGGGTTTATCGTTCTTGTGGGTGATTAGACCTCCA CCACACGATCTGAAGGTCGAGACACATGTCTTGCCTCAAGAACTTAAAGAGAGTAGTGCT AAAGGTAAAGGGATGATTGTGGATTGGTGCCCACAAGAGCAAGTCTTGTCTCATCCTTCA GTGGCATGCTTCGTGACTCATTGTGGATGGAACTCGACAATGGAATCTTTGTCTTCAGGT GTTCCGGTGGTTTGTTGTCCGCAATGGGGAGATCAAGTGACTGATGCAGTGTATTTGATC GATGTTTTCAAGACCGGGGTTAGACTAGGCCGTGGAGCGACCGAGGAGAGGGTAGTGCCA AGGGAGGAAGTGGCGGAGAAGCTTTTGGAAGCGACAGTTGGGGAGAAGGCAGAGGAGTTG AGAAAGAACGCTTTGAAATGGAAGGCGGAGGCGGAAGCAGCGGTGGCTCCAGGAGGTTCG TCGGATAAGAATTTTAGGGAGTTTGTGGAGAAGTTAGGTGCGGGAGTAACGAAGACTAAA GATAATGGATACTAG
>UGT84A2
ATGGAGCTAGAATCTTCTCCTCCTCTACCTCCTCATGTGATGCTCGTATCTTTTCCAGGG CAAGGCCACGTTAATCCACTTCTTCGTCTTGGTAAGCTCTTAGCTTCAAAGGGTTTGCTC ATAACCTTCGTCACCACTGAGTCATGGGGCAAAAAGATGCGAATCTCCAACAAAATCCAA GACCGTGTCCTCAAACCGGTTGGTAAAGGCTATCTCCGGTATGATTTCTTCGACGACGGG CTTCCTGAAGACGACGAAGCTAGCAGAACCAACTTAACCATCCTCCGACCACATCTAGAG CTGGTCGGCAAAAGAGAGATCAAGAΆCCTTGTGAAΆCGTTACAAGGAΆGTAACGAAΆCAG CCCGTGACATGTCTTATCAACAACCCTTTCGTCTCTTGGGTCTGTGACGTGGCAGAAGAT CTTCAAATCCCTTGTGCTGTTCTTTGGGTTCAATCTTGTGCCTGCTTAGCTGCTTATTAC TATTACCACCACAACCTAGTTGACTTCCCGACCAAAACAGAACCCGAGATCGATGTCCAA ATCTCTGGCATGCCTCTCTTGAAACATGACGAGATCCCTTCTTTCATTCACCCTTCAAGT CCTCACTCCGCTTTGCGAGAAGTGATCATAGATCAGATTAAACGGCTTCACAAGACTTTC TCCATTTTCATCGACACTTTCAACTCATTGGAGAAAGACATCATTGACCACATGTCGACG CTCTCTCTCCCCGGTGTTATCAGACCGCTAGGACCACTCTACAAAATGGCTAAAACCGTA GCTTATGATGTCGTTAAAGTAAACATCTCTGAGCCAACGGATCCTTGCATGGAGTGGTTA GACTCGCAGCCAGTTTCCTCCGTTGTTTACATCTCATTCGGGACCGTTGCTTACTTGAAA CAAGAACAAATAGACGAGATCGCTTACGGTGTGTTAAACGCCGACGTTACGTTCTTGTGG GTGATTAGACAACAAGAGTTAGGTTTCAACAAAGAGAAACATGTTTTGCCGGAAGAAGTT AAAGGGAAAGGGAAGATCGTTGAATGGTGTTCACAAGAGAAAGTATTATCTCATCCTTCA GTGGCATGTTTCGTGACTCACTGTGGATGGAACTCAACGATGGAAGCTGTGTCTTCCGGA GTCCCGACGGTTTGTTTTCCTCAATGGGGAGATCAAGTCACGGACGCCGTTTACATGATC GATGTTTGGAAGACGGGAGTGAGGCTAAGCCGTGGAGAGGCGGAGGAGAGGTTAGTGCCG AGGGAGGAAGTTGCGGAGAGGTTGAGAGAGGTTACTAAAGGAGAGAAAGCGATCGAGTTG AAAAAGAATGCTTTGAAGTGGAAGGAAGAGGCGGAGGCGGCGGTTGCTCGCGGTGGTTCG TCGGATAGGAATCTTGAAAAGTTTGTGGAGAAGTTGGGTGCCAAACCTGTGGGGAAAGTA CAAAACGGGAGTCATAATCATGTCTTGGCTGGATCAATCAAAAGCTTTTAA
>UGT8 A3
ATGGACCCGTCTCGTCATACTCATGTGATGCTCGTATCTTTCCCCGGCCAAGGTCACGTA AACCCTCTACTTCGTCTCGGAAAGCTCATAGCCTCTAAAGGCTTACTCGTCACCTTTGTC ACCACAGAGAAGCCATGGGGCAAGAAGATGCGTCAAGCCAACAAGATTCAAGACGGTGTG CTCAAACCGGTCGGTCTAGGTTTCATCCGGTTTGAGTTCTTCTCTGACGGCTTCGCCGAC GACGATGAAAAAAGATTCGACTTCGATGCCTTCCGACCACACCTTGAAGCTGTCGGAAAA CAAGAGATCAAGAATCTCGTTAAGAGATATAACAAGGAGCCGGTGACGTGTCTCATAAAC AACGCTTTTGTCCCATGGGTATGTGATGTCGCCGAGGAGCTTCACATCCCTTCGGCTGTT CTATGGGTCCAGTCTTGTGCTTGTCTCACGGCTTATTACTATTACCACCACCGGTTAGTT AAGTTCCCGACCAAAACCGAGCCGGACATCAGCGTTGAAATCCCTTGCTTGCCATTGTTA AAGCATGACGAGATCCCAAGCTTTCTTCACCCTTCGTCTCCGTATACAGCTTTTGGAGAT ATCATTTTAGACCAGTTAAAGAGATTCGAAAACCACAAGTCTTTCTATCTTTTCATCGAC ACTTTTCGCGAACTAGAAAAAGACATCATGGACCACATGTCACAACTTTGTCCTCAAGCC ATCATCAGTCCTGTCGGTCCGCTCTTCAAGATGGCTCAAACCTTGAGTTCTGACGTTAAG GGAGATATATCCGAGCCAGCGAGTGACTGCATGGAATGGCTTGACTCAAGAGAACCATCC TCAGTCGTTTACATCTCCTTTGGGACTATAGCCAACTTGAAGCAAGAGCAGATGGAGGAG ATCGCTCATGGCGTTTTGAGCTCTGGCTTGTCGGTCTTATGGGTGGTTCGGCCTCCCATG GAAGGGACATTTGTAGAACCACATGTTTTGCCTCGAGAGCTCGAAGAAAAGGGTAAAATC GTGGAATGGTGTCCCCAAGAGAGAGTCTTGGCTCATCCTGCGATTGCTTGTTTCTTAAGT CACTGCGGATGGAACTCGACAATGGAGGCTTTAACTGCCGGAGTCCCCGTTGTTTGTTTT CCGCAATGGGGAGATCAAGTGACTGATGCGGTGTACTTGGCTGATGTTTTCAAGACAGGA GTGAGACTAGGCCGCGGAGCCGCTGAGGAGATGATTGTTTCGAGGGAGGTTGTAGCAGAG AAGCTGCTTGAGGCCACAGTTGGGGAAAAGGCGGTGGAGCTGAGAGAAAACGCTCGGAGG TGGAAGGCGGAGGCCGAGGCCGCCGTGGCGGACGGTGGATCATCTGATATGAACTTTAAA GAGTTTGTGGACAAGTTGGTTACGAAACATGTGACGAGAGAAGACAACGGAGAACACTAG
>UGT84A4
ATGGAGATGGAATCGTCGTTACCTCATGTGATGCTCGTATCATTCCCAGGGCAAGGTCAC ATAAGCCCTCTTCTTCGTCTCGGAAAGATCATTGCCTCTAAAGGCTTAATCGTCACCTTT GTAACCACAGAGGAACCATTGGGCAAGAAGATGCGTCAAGCCAACAATATTCAAGACGGT GTGCTCAAACCGGTCGGGCTAGGTTTTCTCCGGTTCGAGTTCTTCGAGGATGGATTTGTC TACAAAGAAGACTTTGATTTGTTACAAAAATCACTTGAAGTTTCCGGAAAACGAGAGATC AAGAATCTTGTCAAGAAATATGAGAAGCAACCAGTGAGATGTCTCATAAATAATGCCTTT GTTCCATGGGTTTGTGACATAGCCGAGGAGCTTCAAATCCCATCAGCTGTTCTTTGGGTC CAGTCTTGTGCTTGCCTCGCCGCTTATTACTATTACCACCACCAGTTAGTTAAGTTTCCG ACCGAAACCGAGCCGGAAATAACCGTTGACGTCCCTTTCAAGCCATTAACATTGAAGCAT GACGAGATCCCTAGCTTTCTTCACCCTTCCTCTCCGCTGTCCTCTATAGGAGGTACCATT TTAGAGCAGATCAAGCGACTTCACAAGCCTTTCTCTGTTCTCATCGAAACTTTTCAAGAA CTTGAAAAAGATACCATTGACCACATGTCCCAGCTCTGCCCTCAAGTCAACTTCAACCCC ATCGGTCCGCTTTTTACTATGGCTAAAACCATAAGGTCTGACATCAAGGGAGACATCTCC AAGCCAGATAGTGACTGCATAGAGTGGCTTGACTCGAGAGAACCATCCTCCGTTGTTTAC ATCTCTTTTGGGACTTTGGCTTTCTTGAAGCAAAACCAGATCGACGAGATTGCTCACGGC ATTCTCAACTCCGGGTTGTCCTGCTTATGGGTTTTGCGGCCTCCCTTAGAAGGCTTAGCC ATAGAACCGCATGTCTTGCCTCTAGAGCTTGAAGAGAAAGGGAAGATTGTGGAATGGTGT CAACAAGAGAAAGTTTTGGCTCATCCTGCGGTTGCTTGCTTCTTAAGTCACTGTGGATGG AACTCAACCATGGAGGCTTTAACTTCAGGAGTTCCCGTTATTTGTTTCCCGCAGTGGGGA GATCAGGTGACAAATGCGGTGTACATGATTGATGTTTTCAAGACAGGATTGAGACTCAGC CGTGGAGCTTCCGATGAGAGGATTGTTCCAAGGGAGGAGGTTGCTGAGCGACTGCTTGAG GCCACCGTTGGAGAGAAGGCGGTGGAGCTGAGAGAAAACGCTCGGAGGTGGAAGGAGGAG GCGGAGTCTGCCGTGGCTTACGGTGGAACATCGGAAAGGAATTTTCAAGAGTTTGTTGAC AAGTTGGTTGATGTCAAGACAATGACAAACATTAATAATGTCGTGTAA
>UGT84B1
ATGGGCAGTAGTGAGGGTCAAGAAACACATGTCCTAATGGTAACACTACCATTCCAAGGT CACATCAATCCAATGCTCAAACTCGCAAAACATCTCTCGTTATCATCAAAGAACCTACAC ATCAATCTCGCCACTATTGAGTCAGCCCGTGATCTCCTCTCCACCGTAGAAAAACCTCGT TATCCGGTGGACCTCGTGTTCTTCTCCGATGGTCTACCTAAAGAAGATCCAAAGGCCCCT GAAACTCTTTTGAAGTCATTGAATAAAGTCGGAGCCATGAACTTGTCTAAAATCATCGAA GAAAAGAGATACTCTTGTATCATCTCTTCGCCTTTTACTCCATGGGTTCCAGCTGTTGCA GCCTCTCATAACATCTCTTGTGCAATACTTTGGATCCAAGCTTGTGGAGCTTACTCGGTT TATTACCGTTACTACATGAAGACAAACTCTTTCCCTGATCTTGAAGATCTGAATCAAACG GTGGAGTTACCAGCTTTACCATTGTTGGAAGTTCGAGATCTTCCATCGTTTATGTTACCT TCTGGTGGTGCTCACTTCTATAATCTAATGGCGGAATTTGCAGATTGTTTGAGGTATGTG AAATGGGTTTTGGTTAATTCATTCTATGAACTCGAATCAGAGATAATCGAATCGATGGCT GATTTAAAACCTGTAATTCCAATTGGTCCTCTGGTTTCTCCATTTCTGTTGGGCGATGGT GAGGAGGAAACCCTAGACGGTAAAAACCTAGATTTTTGTAAATCTGATGATTGTTGTATG GAGTGGCTTGACAAGCAAGCTAGGTCTTCTGTTGTGTACATATCTTTCGGAAGTATGCTC GAAACATTGGAGAATCAGGTCGAGACCATAGCGAAGGCGCTGAAGAACAGAGGACTTCCA TTTCTTTGGGTGATAAGGCCAAAGGAGAAAGCCCAAAACGTTGCTGTTTTGCAGGAGATG GTGAAAGAAGGACAAGGGGTTGTTCTCGAGTGGAGTCCACAAGAGAAGATTTTGAGCCAC GAGGCAATCTCTTGTTTTGTCACGCATTGCGGCTGGAACTCGACTATGGAGACGGTGGTG GCTGGTGTTCCTGTGGTAGCGTACCCTAGCTGGACGGATCAGCCCATTGACGCGCGGTTG CTTGTTGATGTGTTTGGAATCGGAGTAAGGATGAGGAATGACAGTGTCGATGGCGAGCTT AAGGTCGAAGAAGTAGAAAGATGCATTGAGGCCGTGACGGAGGGACCCGCTGCCGTGGAT ATAAGAAGGAGAGCGGCGGAGCTAAAGCGCGTGGCGAGATTGGCGTTGGCACCTGGTGGA TCTTCGACACGGAATTTAGACTTGTTCATTAGTGATATCACAATCGCCTAA
>UGT84B2
ATGGGAAGTAATGAGGGTCAAGAAACACATGTCCTAATGGTAGCATTAGCATTCCAAGGT CATCTCAATCCAATGCTCAAATTCGCAAAACATCTCGCACGAACCAATCTACACTTCACT CTCGCCACCACTGAGCAAGCCCGTGACCTCCTCTCTTCCACCGCTGACGAACCTCATAGA CCGGTGGACCTCGCTTTCTTCTCAGACGGTCTACCTAAAGACGATCCAAGAGATCCCGAC ACTCTCGCAAAGTCATTGAAAAAAGATGGAGCCAAGAACTTGTCAAAAATCATCGAAGAA AAGAGATTTGATTGCATCATCTCTGTGCCTTTTACTCCCTGGGTTCCAGCTGTTGCAGCT GCACATAACATTCCTTGTGCAATCCTCTGGATCCAAGCTTGTGGAGCTTTTTCTGTTTAT TACCGTTATTACATGAAGACAAATCCTTTCCCCGACCTTGAAGATCTGAATCAAACAGTG GAGTTACCAGCTTTACCATTGTTGGAAGTCCGAGATCTCCCGTCATTGATGTTACCTTCT CAAGGAGCTAATGTCAATACCCTAATGGCGGAATTTGCAGATTGTTTGAAAGATGTGAAA TGGGTTTTGGTTAACTCGTTTTACGAACTCGAATCAGAGATCATCGAGTCTATGTCTGAT TTAAAACCTATAATCCCAATTGGTCCTCTTGTTTCTCCATTCCTGTTGGGAAATGATGAA GAAAAAACCCTAGATATGTGGAAAGTTGATGATTATTGTATGGAGTGGCTTGACAAGCAA GCTAGGTCTTCAGTTGTTTACATATCTTTCGGAAGCATACTCAAATCATTGGAGAATCAA GTTGAGACCATAGCAACGGCATTAAAAAACAGAGGAGTTCCATTTCTTTGGGTGATACGG CCGAAGGAGAAAGGCGAAAACGTCCAGGTTTTGCAGGAGATGGTTAAAGAAGGTAAAGGG GTTGTAACTGAATGGGGTCAACAAGAAAAGATATTGAGCCACATGGCGATTTCTTGCTTC ATCACGCATTGTGGATGGAACTCGACGATCGAGACGGTGGTGACTGGTGTTCCCGTGGTG GCGTATCCGACTTGGATAGATCAGCCGCTTGATGCGAGACTGCTTGTGGATGTGTTTGGA ATCGGAGTAAGGATGAAGAACGACGCTATCGATGGAGAGCTTAAGGTTGCAGAGGTGGAG AGATGCATTGAGGCCGTGACAGAGGGACCTGCCGCCGCGGATATGAGGAGGAGAGCGACG GAGCTGAAGCACGCCGCAAGATCGGCGATGTCACCTGGTGGATCTTCCGCTCAGAATTTA GACTCGTTCATTAGTGATATCCCAATCACTTGA
>UGT85A1
ATGGGATCTCAGATCATTCATAACTCACAAAAACCACATGTAGTTTGTGTTCCATATCCG GCTCAAGGCCACATCAACCCTATGATGAGAGTGGCTAAACTCCTCCACGCCAGAGGCTTC TACGTCACCTTCGTCAACACCGTCTACAACCACAATCGTTTCCTTCGTTCTCGTGGGTCC AATGCCCTAGATGGACTTCCTTCGTTCCGATTTGAGTCCATTGCTGACGGTCTACCAGAG ACAGACATGGATGCCACGCAGGACATCACAGCTCTTTGCGAGTCCACCATGAAGAACTGT CTCGCTCCGTTCAGAGAGCTTCTCCAGCGGATCAACGCTGGAGATAATGTTCCTCCGGTA AGCTGTATTGTATCTGACGGTTGTATGAGCTTTACTCTTGATGTTGCGGAGGAGCTTGGA GTCCCGGAGGTTCTTTTTTGGACAACCAGTGGCTGTGCGTTCCTGGCTTATCTACACTTT TATCTCTTCATCGAGAAGGGCTTATGTCCGCTAAAAGATGAGAGTTACTTGACGAAGGAG TACTTAGAAGACACGGTTATAGATTTTATACCAACCATGAAGAATGTGAAACTAAAGGAT ATTCCTAGCTTCATACGTACCACTAATCCTGATGATGTTATGATTAGTTTCGCCCTCCGC GAGACCGAGCGAGCCAAACGTGCTTCTGCTATCATTCTAAACACATTTGATGACCTTGAG CATGATGTTGTTCATGCTATGCAATCTATCTTACCTCCGGTTTATTCAGTTGGACCGCTT CATCTCTTAGCAAACCGGGAGATTGAAGAAGGTAGTGAGATTGGAATGATGAGTTCGAAT TTATGGAAAGAGGAGATGGAGTGTTTGGATTGGCTTGATACTAAGACTCAAAATAGTGTC ATTTATATCAACTTTGGGAGCATAACGGTTTTGAGTGTGAAGCAGCTTGTGGAGTTTGCT TGGGGTTTGGCGGGAAGTGGGAAAGAGTTTTTATGGGTGATCCGGCCAGATTTAGTAGCG GGAGAGGAGGCTATGGTTCCGCCGGACTTTTTAATGGAGACTAAAGACCGCAGTATGCTA GCGAGTTGGTGTCCTCAAGAGAAAGTACTTTCTCATCCTGCTATTGGAGGGTTTTTGACG CATTGCGGGTGGAACTCGATATTGGAAAGTCTTTCGTGTGGAGTTCCGATGGTGTGTTGG CCATTTTTTGCTGACCAGCAAATGAATTGTAAGTTTTGTTGTGACGAGTGGGATGTTGGG ATTGAGATAGGTGGAGATGTGAAGAGAGAGGAAGTTGAGGCGGTGGTTAGAGAGCTCATG GATGGAGAGAAGGGAAAGAAAATGAGAGAAAAGGCGGTAGAGTGGCAGCGCTTAGCCGAG AAAGCGACGGAACATAAACTTGGTTCTTCCGTTATGAATTTTGAGACGGTTGTTAGCAAG TTTCTTTTGGGACAAAAATCACAGGATTAA
>UGT85A2
ATGGGATCTCATGTCGCACAAAAACAACACGTAGTTTGCGTTCCTTATCCGGCTCAAGGC CACATCAACCCAATGATGAAAGTGGCTAAACTCCTTTACGCCAAAGGCTTCCATATTACC TTCGTCAACACCGTCTACAACCACAACCGTCTCCTCCGGTCCCGTGGGCCTAACGCCGTT GACGGGCTTCCTTCTTTCCGGTTTGAGTCCATCCCTGACGGTCTACCCGAGACTGACGTG GACGTCACTCAGGACATCCCTACTCTTTGCGAGTCCACAATGAAGCACTGTCTCGCTCCA TTCAAGGAGCTTCTCCGGCAGATCAACGCAAGGGATGATGTTCCTCCTGTGAGCTGTATC GTATCCGACGGTTGTATGAGCTTCACACTTGATGCTGCGGAGGAGCTCGGTGTCCCGGAG GTTCTTTTTTGGACAACTAGTGCTTGTGGCTTCTTGGCTTACCTTTACTACTATCGCTTC ATCGAGAAGGGATTATCACCAATAAAAGATGAGAGTTACTTAACCAAGGAACACTTGGAC ACAAAAATAGACTGGATACCATCGATGAAGAACCTAAGACTAAAAGACATCCCTAGCTTC ATCCGAACGACTAATCCTGACGACATCATGCTCAACTTTATCATCCGTGAGGCTGACCGA GCCAAACGCGCTTCAGCTATCATTCTCAACACGTTTGATGATCTCGAACACGACGTTATC CAATCTATGAAATCCATTGTACCTCCGGTTTATTCTATTGGACCGTTACATTTACTAGAG AAACAAGAGAGCGGCGAGTATAGTGAAATCGGACGGACAGGATCGAATCTTTGGAGAGAG GAGACTGAGTGTCTGGACTGGCTAAACACGAAAGCTAGAAACAGTGTTGTGTACGTTAAC TTCGGGAGTATAACTGTTTTGAGCGCAAAACAGCTTGTGGAGTTTGCATGGGGTTTGGCT GCAACGGGGAAAGAGTTTTTGTGGGTGATCCGGCCGGATTTAGTAGCCGGGGATGAGGCA ATGGTTCCACCGGAGTTTTTAACGGCTACGGCGGACCGGAGGATGTTGGCAAGTTGGTGT CCTCAAGAGAAAGTCCTTTCTCATCCGGCCATTGGAGGGTTCTTGACGCATTGCGGGTGG AACTCGACGTTGGAAAGTCTATGCGGTGGAGTTCCAATGGTGTGTTGGCCGTTTTTTGCA GAGCAACAAACTAATTGTAAGTTTTCTCGTGACGAATGGGAGGTTGGGATTGAGATTGGT GGAGATGTGAAGAGAGAAGAGGTTGAGGCGGTGGTTAGGGAGTTGATGGATGAAGAGAAG GGAAAGAATATGAGAGAGAAGGCGGAAGAGTGGCGGCGCTTGGCGAATGAAGCGACGGAG CATAAGCATGGTTCTTCTAAATTGAACTTTGAGATGCTCGTTAATAAGGTTCTTTTAGGG GAGTAG
>UGT85A3
ATGGGATCCCGTTTTGTTTCTAACGAACAAAAACCACACGTAGTTTGCGTGCCTTACCCA GCTCAAGGCCACATTAACCCTATGATGAAAGTGGCTAAACTCCTCCACGTCAAAGGCTTC CACGTCACCTTCGTCAACACCGTCTACAACCACAACCGTCTACTCCGATCCCGTGGGGCC AACGCACTCGATGGACTTCCTTCCTTCCAGTTCGAGTCAATACCTGACGGTCTTCCGGAG ACTGGCGTGGACGCCACGCAGGACATCCCTGCCCTTTCCGAGTCCACAACGAAAAACTGT CTCGTTCCGTTCAAGAAGCTTCTCCAGCGGATTGTCACGAGAGAGGATGTCCCTCCGGTG AGCTGTATTGTATCAGATGGTTCGATGAGCTTTACTCTTGACGTAGCGGAAGAGCTTGGT GTTCCGGAGATTCATTTTTGGACCACTAGTGCTTGTGGCTTCATGGCTTATCTACACTTT TATCTCTTCATCGAGAAGGGTTTATGTCCAGTAAAAGATGCGAGTTGCTTGACGAAGGAA TACTTGGACACAGTTATAGATTGGATACCGTCAATGAACAATGTAAAACTAAAAGACATT CCTAGTTTTATACGTACCACTAATCCTAACGACATAATGCTCAACTTCGTTGTCCGTGAG GCATGTCGAACCAAACGTGCCTCTGCTATCATTCTGAACACGTTTGATGACCTTGAACAT GACATAATCCAGTCTATGCAATCCATTTTACCACCGGTTTATCCAATCGGACCGCTTCAT CTCTTAGTAAACAGGGAGATTGAAGAAGATAGTGAGATTGGAAGGATGGGATCAAATCTA TGGAAAGAGGAGACTGAGTGCTTGGGATGGCTTAATACTAAGTCTCGAAATAGCGTTGTT TATGTTAACTTTGGGAGCATAACAATAATGACCACGGCACAGCTTTTGGAGTTTGCTTGG GGTTTGGCGGCAACGGGAAAGGAGTTTCTATGGGTGATGCGGCCGGATTCAGTAGCCGGA GAGGAGGCAGTGATTCCAAAAGAGTTTTTAGCGGAGACAGCTGATCGAAGAATGCTGACA AGTTGGTGTCCTCAGGAGAAAGTTCTTTCTCATCCGGCGGTCGGAGGGTTCTTGACCCAT TGCGGGTGGAATTCGACGTTAGAAAGTCTTTCATGCGGAGTTCCAATGGTATGTTGGCCA TTTTTTGCTGAGCAACAAACAAATTGTAAGTTTTCTTGTGATGAATGGGAGGTTGGTATT GAGATCGGTGGAGATGTCAAGAGGGGAGAGGTTGAGGCGGTGGTTAGAGAGCTCATGGAT GGAGAGAAAGGAAAGAAAATGAGAGAGAAGGCTGTAGAGTGGCGGCGCTTGGCCGAGAAA GCTACAAAGCTTCCGTGTGGTTCGTCGGTGATAAATTTTGAGACGATTGTCAACAAGGTT CTCTTGGGAAAGATCCCTAACACGTAA
>UGT85A4
ATGGAACAACATGGCGGTTCTAGCTCACAGAAACCTCACGCAATGTGCATACCTTATCCA GCACAAGGCCACATCAACCCAATGCTGAAACTAGCCAAGCTCCTCCACGCTAGAGGCTTC CACGTCACTTTCGTCAACACCGACTACAACCACCGCCGTATCCTCCAATCACGTGGCCCT CACGCTCTCAACGGTCTCCCCTCGTTTCGCTTCGAGACTATCCCCGACGGTCTTCCTTGG ACAGACGTCGACGCTAAGCAAGACATGCTCAAGCTTATTGACTCCACAATAAACAACTGT TTAGCTCCATTCAAAGACCTCATCCTCCGGTTAAACTCCGGTTCTGATATACCACCGGTT AGCTGTATCATCTCCGACGCTTCAATGAGCTTCACAATTGACGCAGCGGAGGAGCTTAAA ATTCCGGTAGTTCTCCTCTGGACCAACAGTGCTACTGCTTTAATCTTGTATCTCCATTAC CAAAAACTCATCGAGAAAGAGATAATTCCCCTCAAAGATTCGAGTGACTTGAAGAAGCAT TTAGAGACGGAGATTGATTGGATACCGTCGATGAAGAAGATTAAGCTTAAGGATTTTCCA GATTTCGTCACCACGACGAATCCTCAAGATCCGATGATTAGTTTCATCCTTCATGTAACC GGAAGAATCAAAAGAGCTTCTGCGATCTTCATCAACACTTTCGAAAAACTCGAGCATAAC GTTCTCTTATCTCTGCGATCTCTTCTCCCTCAGATCTACTCCGTTGGACCGTTCCAGATT CTGGAGAATCGCGAAATCGATAAGAACAGCGAAATCAGAAAGCTAGGATTGAATCTCTGG GAAGAAGAGACGGAGTCTTTGGATTGGCTAGATACTAAAGCTGAGAAAGCTGTGATTTAC GTCAACTTCGGGAGTCTAACGGTTTTGACTAGTGAGCAGATCTTAGAGTTCGCTTGGGGT TTAGCGAGGAGCGGGAAAGAGTTTCTCTGGGTGGTGAGATCTGGTATGGTCGACGGAGAT GATTCGATTCTTCCGGCGGAGTTTTTATCGGAGACGAAGAATCGAGGAATGTTAATTAAA GGATGGTGTTCTCAGGAGAAGGTACTTTCGCATCCGGCGATTGGAGGATTTTTGACTCAC TGTGGATGGAATTCGACGTTGGAGAGTTTGTACGCCGGTGTTCCGATGATCTGTTGGCCA TTTTTTGCTGATCAGTTGACGAATCGAAAGTTCTGTTGCGAGGATTGGGGGATTGGGATG GAGATCGGCGAGGAGGTGAAGAGGGAGAGAGTGGAGACGGTGGTTAAAGAGCTCATGGAC GGAGAGAAGGGAAAGAGGTTAAGAGAGAAGGTGGTGGAGTGGCGGCGCTTGGCGGAAGAA GCTTCGGCGCCACCGTTGGGATCATCGTACGTGAATTTTGAAACGGTGGTTAATAAAGTC CTTACATGTCACACGATTAGATCGACCTAA
>UGT85A5
ATGGCGTCTCATGCTGTTACAAGCGGACAAAAACCACACGTAGTTTGCATACCTTTCCCG GCTCAAGGCCACATCAATCCGATGCTCAAAGTGGCTAAACTCCTCTATGCCAGAGGCTTC CATGTTACCTTCGTCAACACTAACTACAACCATAACCGTCTCATCCGGTCACGTGGTCCC AACTCCCTTGATGGGCTTCCTTCTTTTCGGTTCGAGTCCATCCCTGACGGTCTACCGGAG GAAAACAAGGACGTCATGCAGGATGTCCCTACCCTTTGTGAGTCCACCATGAAAAACTGT CTAGCTCCTTTCAAGGAGCTTCTCCGGCGGATCAACACCACAAAGGATGTTCCTCCGGTA AGCTGTATTGTATCCGACGGTGTGATGAGCTTTACTCTTGATGCTGCAGAGGAGCTTGGA GTCCCGGATGTTCTTTTTTGGACACCAAGTGCTTGTGGCTTCTTGGCTTATCTACACTTC TATCGCTTCATCGAGAAGGGGTTATCACCAATAAAAGATGAAAGTTCTTTGGACACAAAA ATAAATTGGATACCATCGATGAAAAACCTAGGACTTAAAGACATCCCAAGCTTTATCCGT GCAACTAATACTGAAGACATAATGCTTAACTTTTTTGTCCATGAGGCTGACCGAGCCAAA CGCGCTTCCGCTATCATTGTCAACACATTCGATAGTCTTGAGCATGATGTCGTCCGTTCT ATTCAATCTATCATACCTCAAGTGTACACTATTGGACCGCTTCATCTATTTGTGAATCGG GATATCGACGAGGAAAGTGACATCGGACAGATAGGAACGAATATGTGGAGAGAGGAGATG GAGTGTTTGGATTGGCTTGATACTAAGTCTCCAAACAGTGTCGTTTATGTTAATTTCGGT AGCATAACAGTGATGAGTGCGAAACAACTCGTGGAGTTTGCTTGGGGTTTAGCAGCGACC AAAAAAGATTTTTTGTGGGTGATTAGGCCGGATTTAGTAGCCGGTGATGTGCCAATGCTT CCGCCGGACTTTCTAATAGAGACGGCTAACCGAAGGATGCTAGCGAGTTGGTGTCCTCAA GAAAAAGTTCTTTCTCATCCGGCAGTTGGAGGGTTCTTAACGCATAGTGGATGGAATTCG ACTTTGGAGAGTCTCTCCGGTGGAGTTCCAATGGTGTGTTGGCCGTTCTTTGCGGAACAG CAAACAAATTGTAAATATTGTTGTGATGAATGGGAAGTGGGGATGGAGATCGGTGGAGAT GTGAGGAGGGAGGAGGTTGAGGAGTTGGTTAGAGAACTCATGGACGGAGACAAAGGAAAG AAAATGAGGCAAAAGGCCGAAGAGTGGCAGCGCTTGGCTGAGGAAGCGACGAAGCCTATT TATGGTTCGTCGGAACTAAATTTTCAGATGGTCGTTGACAAGGTTCTTTTAGGGGAGTAG
>UGT85A7
ATGGAATCTCATGTTGTTCATAACGCACAAAAGCCACACGTAGTTTGCGTGCCTTACCCG GCTCAAGGCCACATCAATCCGATGCTGAAAGTGGCTAAACTCCTCTACGCTAAAGGCTTT CACGTCACCTTCGTTAACACTCTCTACAACCACAACCGTCTCCTCCGGTCCCGTGGTCCC AACGCGCTCGACGGGTTTCCTTCATTCCGGTTCGAGTCCATCCCTGACGGTCTACCGGAG ACTGATGGCGATAGGACGCAGCATACTCCTACCGTTTGCATGTCCATTGAGAAAAACTGT CTCGCTCCATTCAAAGAGATTCTGCGCCGGATCAACGATAAAGATGATGTTCCTCCAGTG AGTTGTATTGTATCGGACGGTGTGATGAGTTTTACTCTTGACGCAGCCGAGGAACTAGGT GTCCCAGAGGTTATTTTTTGGACCAATAGTGCTTGTGGTTTCATGACTATTCTACACTTT TATCTTTTCATCGAGAAGGGTCTATCTCCTTTTAAAGACGAAAGTTACATGTCAAAGGAG CATCTAGACACAGTTATAGATTGGATACCATCAATGAAGAATCTTAGGTTAAAGGACATC CCTAGCTATATACGTACCACAAATCCTGACAACATAATGCTTAATTTCCTCATTCGAGAA GTTGAGCGATCTAAACGCGCTAGTGCTATCATTCTCAACACGTTTGATGAACTCGAGCAT GATGTTATCCAGTCTATGCAATCTATTTTACCTCCGGTTTATTCTATTGGGCCACTCCAT CTCCTTGTGAAGGAAGAAATAAACGAGGCTAGTGAAATAGGACAGATGGGATTAAATTTG TGGAGAGAGGAGATGGAATGTTTGGATTGGCTCGATACAAAAACTCCAAACAGTGTTCTT TTTGTTAACTTTGGATGCATAACGGTGATGAGTGCAAAACAGCTTGAAGAATTTGCTTGG GGTTTGGCGGCAAGTAGGAAAGAGTTTTTATGGGTGATCCGTCCTAATTTAGTGGTGGGA GAGGCGATGGTGGTTCTTCCACAAGAGTTTTTAGCGGAGACGATAGACCGGAGAATGTTA GCTAGTTGGTGTCCTCAGGAGAAAGTTCTTTCTCATCCCGCGATAGGAGGGTTCTTGACG CATTGCGGGTGGAACTCAACATTGGAGAGTCTCGCTGGTGGTGTTCCGATGATATGTTGG CCATGTTTTTCGGAGCAACCGACGAATTGTAAGTTTTGTTGTGATGAGTGGGGAGTGGGT ATAGAGATTGGTAAAGATGTGAAGAGAGAGGAGGTCGAGACGGTGGTTAGAGAACTTATG GATGGAGAAAAGGGGAAAAAGCTGAGAGAAAAGGCGGAAGAGTGGCGGCGGTTGGCCGAG GAAGCGACGAGGTATAAACATGGTTCGTCGGTCATGAATCTTGAGACGCTTATACATAAA GTTTTCTTAGAAAATCTTAGATGA
>UGT86A1
ATGGAGAGAGCAAAGTCGAGGAAGCCTCATATCATGATGATACCATACCCACTTCAAGGT CACGTTATCCCTTTTGTCCACTTAGCCATCAAACTTGCTTCTCATGGCTTCACCATCACT TTCGTCAACACCGACTCCATCCACCACCACATCTCCACCGCTCACCAAGATGACGCCGGT GACATCTTCTCCGCCGCTCGCAGCTCCGGCCAGCACGACATACGTTACACCACCGTGAGC GACGGCTTCCCTTTAGACTTTGACCGGTCACTGAACCATGACCAGTTTTTCGAAGGCATT CTCCACGTCTTCTCTGCCCACGTGGATGATCTCATCGCCAAACTCTCCCGCCGTGATGAT CCTCCCGTGACTTGCTTGATCGCCGACACGTTTTATGTTTGGTCATCTATGATTTGCGAC AAGCACAACCTTGTAAATGTCTCGTTTTGGACCGAACCTGCCTTGGTCCTCAATCTCTAT TATCACATGGATCTCCTCATATCTAACGGTCATTTCAAATCTCTTGATAATCGTAAAGAC GTGATCGATTACGTACCAGGGGTTAAAGCAATAGAACCAAAGGACTTGATGTCATATCTT CAAGTAAGCGACAAAGACGTAGACACAAATACAGTAGTATACAGAATATTATTCAAGGCC TTTAAAGACGTCAAGAGAGCCGACTTCGTCGTATGCAACACGGTGCAAGAGCTCGAACCA GACTCTCTCTCGGCTCTACAAGCCAAACAACCGGTTTACGCTATCGGTCCGGTTTTCTCA ACTGATTCGGTAGTTCCCACAAGCTTATGGGCCGAGTCAGACTGTACCGAGTGGCTTAAG GGCCGGCCCACTGGGTCAGTTCTCTACGTCTCGTTTGGTAGCTATGCACATGTTGGTAAG AAGGAGATTGTTGAGATAGCTCATGGGCTTTTGCTTAGTGGGATTAGTTTCATTTGGGTT TTACGTCCGGATATAGTTGGATCCAACGTACCAGATTTTCTTCCAGCCGGGTTTGTGGAC CAAGCCCAAGATCGAGGTCTTGTGGTCCAATGGTGCTGCCAGATGGAAGTTATTTCAAAT CCGGCCGTGGGAGGGTTTTTCACACATTGTGGATGGAATTCAATTCTAGAGAGCGTTTGG TGTGGTTTGCCTTTGTTGTGTTATCCACTTTTGACAGATCAGTTCACGAATAGGAAGCTT GTGGTCGATGATTGGTGCATTGGGATTAATCTTTGTGAGAAGAAGACAATCACAAGGGAC CAAGTCTCAGCGAATGTTAAAAGATTGATGAATGGAGAAACTTCAAGTGAGCTAAGAAAC AATGTTGAAAAGGTTAAACGTCATCTCAAAGATGCGGTTACAACCGTTGGATCTTCGGAG ACGAATTTTAACTTGTTTGTTAGTGAGGTCCGAAATAGAATAGAAACTAAATTGTGTAAT GTAAATGGACTAGAAATAAGTCCATCAAACTAA
>UGT86A2
ATGGCGGACGTTAGAAACCCTACAAAAAATCATCATGGTCATCATCATCTTCATGCTCTC TTGATCCCATATCCATTTCAAGGGCATGTAAACCCATTTGTACACTTAGCCATCAAGCTC GCGTCACAGGGGATCACCGTCACTTTCGTCAACACTCATTACATCCACCACCAGATCACA AACGGCTCCGATGGAGATATTTTCGCTGGAGTTAGGTCAGAGTCTGGCCTTGACATAAGG TACGCGACGGTTTCCGATGGTTTACCGGTCGGATTTGACCGGTCGTTGAACCATGACACG TACCAATCGTCGCTGTTGCACGTGTTCTATGCGCATGTGGAAGAGCTTGTGGCGAGTCTT GTTGGAGGAGACGGCGGTGTGAATGTGATGATCGCCGACACATTCTTTGTTTGGCCGTCT GTGGTGGCTAGGAAGTTTGGTTTGGTTTGTGTCTCGTTTTGGACCGAAGCTGCTTTAGTA TTTTCACTTTATTACCATATGGATCTGCTTCGGATTCATGGCCATTTTGGTGCTCAAGAA ACCCGCAGCGATCTAATCGACTACATTCCCGGAGTCGCCGCAATTAACCCAAAAGACACG GCGTCGTATCTTCAAGAAACCGACACGTCATCAGTAGTTCATCAAATCATCTTCAAAGCA TTCGAAGACGTGAAAAAAGTCGATTTTGTACTCTGCAACACAATTCAGCAATTCGAAGAC AAAACAATCAAAGCCCTAAACACAAAAATCCCATTTTACGCAATCGGACCAATCATACCA TTCAATAACCAAACCGGTTCAGTCACAACCTCACTCTGGTCTGAATCAGATTGTACACAA TGGCTCAACACTAAACCAAAAAGCTCCGTACTTTATATCTCCTTTGGTAGTTACGCTCAT GTCACAAAGAAGGATCTTGTTGAGATAGCTCACGGGATTTTGTTGAGTAAAGTTAATTTC GTTTGGGTGGTGAGACCAGACATTGTTAGTTCAGACGAAACCAATCCATTACCAGAAGGG TTTGAAACAGAAGCTGGAGATCGTGGGATTGTAATACCATGGTGTTGTCAAATGACGGTT TTGTCACATGAGAGTGTTGGTGGGTTTTTGACACATTGTGGTTGGAACTCGATATTGGAG ACGATTTGGTGTGAGGTTCCTGTGTTGTGTTTTCCATTGTTGAGTGATCAGGTTACGAAT AGGAAGCTTGTGGTTGATGATTGGGAGATTGGGATTAATCTTTGTGAAGATAAGAGTGAT TTTGGTAGAGATGAAGTTGGGAGGAATATTAACCGTTTGATGTGTGGTGTTTCGAAAGAG AAGATCGGACGGGTTAAAATGAGTTTGGAAGGTGCGGTGAGAAACAGTGGATCTTCTTCG GAGATGAATTTAGGTTTGTTTATTGATGGACTTTTGTCTAAGGTTGGTTTATCTAATGGG AAAGCTTAA
>UGT87A1
ATGAATCCAATCAAACCTCAGCCACTCGGAGTCCGCCACGTGGTGGCCATGCCTTGGCCA GGAAGAGGCCACATCAACCCAATGTTAAACCTCTGCAAAAGCCTCGTCCGGCGAGACCCA AACCTCACCGTCACATTCGTCGTCACCGAAGAATGGCTCGGGTTCATCGGGTCCGACCCG AAACCTAACCGGATCCATTTCGCCACTCTCCCCAACATCATTCCCTCCGAGCTCGTCCGA GCCAACGACTTCATCGCCTTCATCGACGCCGTCCTCACCAGATTAGAAGAGCCGTTCGAA CAGCTACTTGACCGTCTAAACTCTCCTCCCACCGCAATCATCGCCGATACTTACATCATT TGGGCAGTACGTGTAGGCACAAAAAGGAATATTCCGGTGGCTTCTTTCTGGACTACGTCA GCCACGATTCTCTCCCTCTTCATTAACTCCGATCTTCTCGCAAGTCACGGCCATTTTCCG ATCGAACCATCAGAATCAAAACTAGACGAGATTGTTGATTACATCCCCGGTTTATCTCCG ACAAGACTCAGTGACTTACAGATCTTACACGGCTATAGTCATCAAGTCTTCAATATATTC AAAAAGTCTTTCGGTGAGCTTTATAAAGCTAAGTATCTTCTCTTCCCTTCTGCTTATGAG CTCGAACCAAAAGCCATTGACTTTTTCACTTCCAAGTTTGATTTCCCGGTTTACTCCACT GGTCCGTTAATACCCTTGGAAGAACTATCCGTTGGAAATGAGAATAGAGAACTTGATTAC TTTAAGTGGCTTGATGAGCAACCTGAAAGCTCTGTTCTTTACATATCTCAAGGGAGTTTT CTTTCAGTCTCCGAAGCTCAGATGGAGGAGATTGTTGTAGGAGTTAGAGAGGCTGGAGTT AAGTTCTTTTGGGTGGCTCGTGGGGGTGAGTTAAAGCTTAAGGAGGCTCTTGAAGGTAGC TTGGGTGTTGTGGTGAGCTGGTGTGATCAGCTACGTGTTTTGTGTCATGCGGCTATAGGC GGGTTTTGGACGCATTGCGGGTATAACTCGACATTGGAAGGGATATGTTCGGGAGTACCG TTGCTTACATTTCCTGTTTTTTGGGATCAGTTTCTGAATGCTAAGATGATTGTTGAGGAG TGGAGAGTTGGAATGGGGATCGAGAGGAAGAAGCAGATGGAGTTGTTGATAGTGAGTGAT GAGATCAAGGAATTGGTAAAAAGGTTTATGGATGGAGAGAGTGAAGAAGGGAAAGAGATG AGAAGAAGGACTTGTGATCTCAGTGAGATATGTCGTGGAGCGGTTGCGAAAGGTGGTTCT TCTGATGCTAACATCGATGCTTTCATTAAAGATATTACTAAGATCGTGTGA
>UGT87A2
ATGGATCCAAATGAATCTCCACCAAACCAATTTCGCCACGTGGTGGCCATGCCTTATCCA GGTCGAGGACACATCAACCCTATGATGAACCTCTGCAAACGCCTTGTCCGTCGATACCCT AACCTTCACGTCACCTTCGTCGTCACAGAAGAATGGCTCGGGTTTATTGGACCCGACCCG AAACCCGACCGGATCCATTTCTCCACTCTCCCTAATCTCATCCCTTCCGAGCTTGTCAGG GCCAAAGACTTCATAGGCTTCATTGATGCCGTCTACACAAGATTGGAAGAACCATTCGAG AAGCTTCTTGACAGCCTCAATTCACCACCTCCGAGTGTAATATTCGCCGACACTTACGTC ATTTGGGCTGTGCGAGTCGGCAGAAAAAGGAATATTCCGGTGGTTTCTCTCTGGACCATG TCAGCCACGATTCTCTCCTTCTTCCTCCACTCTGATCTACTCATAAGTCATGGCCATGCT CTGTTCGAACCATCAGAAGAAGAGGTTGTTGATTACGTCCCCGGTTTATCTCCGACGAAA CTCCGAGATTTGCCGCCGATATTTGACGGTTACAGCGACCGAGTCTTCAAGACAGCTAAG TTGTGTTTCGATGAACTACCAGGAGCTAGGTCTTTACTCTTCACCACCGCCTATGAGCTT GAACACAAAGCTATTGACGCTTTCACCTCCAAGCTCGATATCCCGGTCTACGCTATTGGT CCTTTAATACCTTTTGAAGAACTTTCTGTTCAAAATGATAACAAGGAACCTAATTACATC CAGTGGCTTGAGGAACAACCGGAAGGCTCTGTTCTTTACATATCTCAGGGAAGTTTTCTT TCGGTCTCGGAAGCTCAGATGGAGGAAATAGTGAAAGGACTGAGAGAAAGTGGAGTCCGG TTTCTTTGGGTGGCTCGTGGGGGCGAGTTAAAGCTTAAGGAGGCTCTTGAAGGTAGCTTA GGTGTAGTGGTGAGCTGGTGTGATCAGCTTCGGGTGCTGTGTCACAAAGCTGTAGGCGGG TTTTGGACTCATTGCGGGTTTAACTCGACATTGGAAGGGATATATTCAGGAGTACCAATG CTAGCGTTTCCGTTGTTTTGGGATCAGATTCTGAACGCTAAGATGATTGTTGAGGACTGG AGAGTCGGAATGAGGATCGAGAGGACGAAAAAGAATGAGTTGTTGATAGGGAGAGAGGAG ATCAAGGAAGTAGTGAAGAGGTTTATGGATAGAGAGAGTGAAGAAGGGAAAGAGATGAGA AGAAGGGCTTGTGACCTTAGTGAAATCAGTCGAGGAGCTGTTGCGAAAAGCGGTTCGTCT AATGTAAACATCGATGAGTTCGTTCGGCATATTACCAATACAAATTAA
>UGT88A1
ATGGGTGAAGAAGCTATAGTTCTGTATCCTGCACCACCAATAGGTCACTTAGTGTCCATG GTTGAGTTAGGTAAAACCATCCTCTCCAAAAACCCATCTCTCTCCATCCACATTATCTTA GTTCCACCGCCTTATCAGCCGGAATCAACCGCCACTTACATCTCCTCCGTCTCCTCCTCC TTCCCTTCAATAACCTTCCACCATCTTCCCGCCGTCACACCGTACTCCTCCTCCTCCACC TCTCGCCACCACCACGAATCTCTCCTCCTAGAGATCCTCTGTTTTAGCAACCCAAGTGTC CACCGAACTCTTTTCTCACTCTCTCGGAATTTCAATGTCCGAGCAATGATCATCGATTTC TTCTGCACCGCCGTTTTAGACATCACCGCTGACTTCACGTTCCCGGTTTACTTCTTCTAC ACCTCTGGAGCCGCATGTCTCGCCTTTTCCTTCTATCTCCCGACCATCGACGAAACAACC CCCGGAAAAAACCTCAAAGACATTCCTACAGTTCATATCCCCGGCGTTCCTCCGATGAAG GGCTCCGATATGCCTAAGGCGGTGCTCGAACGAGACGATGAGGTCTACGATGTTTTTATA ATGTTCGGTAAACAGCTCTCGAAGTCGTCAGGGATTATTATCAATACGTTTGATGCTTTA GAAAACAGAGCCATCAAGGCCATAACAGAGGAGCTCTGTTTTCGCAATATTTATCCAATT GGACCGCTCATTGTAAACGGAAGAATCGAAGATAGAAACGACAACAAGGCAGTTTCTTGT CTCAATTGGCTGGATTCGCAGCCGGAAAAGAGTGTTGTGTTTCTCTGTTTTGGAAGCTTA GGTTTGTTCTCAAAAGAACAGGTGATAGAGATTGCTGTTGGTTTAGAGAAAAGTGGGCAG AGATTCTTGTGGGTGGTCCGTAATCCACCCGAGTTAGAAAAGACAGAACTGGATTTGAAA TCACTCTTACCAGAAGGATTCTTAAGCCGAACCGAAGACAAAGGGATGGTCGTGAAATCA TGGGCTCCGCAAGTTCCGGTTCTGAATCATAAAGCAGTCGGGGGATTCGTCACTCATTGC GGTTGGAATTCAATTCTTGAAGCTGTTTGTGCTGGTGTGCCGATGGTGGCTTGGCCGTTG TACGCTGAGCAGAGGTTTAATAGAGTGATGATTGTGGATGAGATCAAGATTGCGATTTCG ATGAATGAATCAGAGACGGGTTTCGTGAGCTCTACAGAGGTGGAGAAACGAGTCCAAGAG ATAATTGGGGAGTGTCCGGTTAGGGAGCGAACCATGGCTATGAAGAACGCAGCCGAATTA GCCTTGACAGAAACTGGTTCGTCTCATACCGCATTAACTACTTTACTCCAGTCGTGGAGC CCAAAGTGA
>UGT89A2
ATGACGGAAGTGTTATTGTTGCCGGGAACTAAATCGGAGAATTCAAAACCACCGCACATA GTGGTGTTTCCATTCCCAGCACAAGGCCACTTACTTCCTCTACTTGACTTAACTCACCAA CTCTGCCTCCGTGGATTCAACGTCTCCGTCATCGTTACTCCCGGTAACCTTACTTACCTC TCTCCTCTTCTCTCCGCTCATCCCTCCTCCGTCACCTCCGTCGTTTTCCCTTTCCCTCCT CATCCTTCACTCTCTCCCGGCGTCGAAAACGTTAAAGACGTCGGAAATTCAGGAAATCTC CCGATCATGGCTTCTCTTCGTCAGCTACGAGAACCAATCATCAACTGGTTCCAATCTCAT CCGAATCCGCCTATCGCTCTCATCTCCGATTTCTTCCTCGGATGGACTCACGATCTCTGC AATCAAATCGGTATCCCCAGATTCGCTTTCTTCTCCATCAGCTTCTTCTTAGTTTCCGTT CTTCAATTTTGCTTCGAGAACATCGATCTAATCAAATCAACGGATCCGATTCATCTCCTT GATCTTCCTCGCGCTCCGATTTTCAAAGAAGAGCATCTTCCGTCTATAGTCCGACGAAGT CTCCAΆACTCCGTCACCGGATCTCGAATCAATCAAAGATTTCTCCATGAΆTTTGTTGAGC TACGGATCTGTTTTCAATTCTTCTGAGATTCTGGAAGATGATTATCTTCAGTACGTGAAA CAGAGGATGGGTCATGATCGGGTTTATGTTATTGGCCCGCTTTGTTCAATCGGGTCGGGT CTTAAATCGAATTCGGGTTCTGTAGACCCGAGTTTGCTGAGTTGGTTAGACGGATCCCCA AACGGGTCAGTTCTATACGTTTGTTTCGGAAGTCAAAAGGCGTTGACTAAAGACCAGTGT GATGCTTTGGCTCTAGGCTTAGAGAAAAGCATGACCCGGTTTGTTTGGGTGGTTAAGAAA GATCCGATACCCGACGGGTTTGAGGATCGGGTTTCCGGAAGGGGATTGGTGGTAAGAGGA TGGGTCTCCCAGCTGGCGGTGTTGCGACACGTGGCGGTTGGTGGATTTTTGAGCCATTGT GGATGGAACTCAGTGCTTGAAGGGATAACGAGTGGGGCTGTGATCTTGGGCTGGCCCATG GAGGCGGACCAGTTTGTGAACGCGAGGTTGCTTGTGGAGCATTTGGGTGTTGCGGTTAGG GTTTGCGAAGGTGGTGAAACTGTGCCTGACTCGGATGAGTTGGGTCGGGTCATAGCGGAA ACGATGGGTGAGGGAGGACGCGAGGTGGCTGCTCGGGCTGAGGAGATACGGCGGAAGACC GAGGCTGCCGTGACGGAGGCAAATGGAAGCTCCGTTGAAAATGTACAAAGACTTGTCAAA GAATTTGAAAAAGTCTAA
>UGT89B1
ATGAAAGTGAACGAGGAAAACAACAAGCCGACAAAGACCCATGTCTTAATCTTCCCATTT CCGGCGCAAGGTCACATGATTCCCCTCCTCGACTTCACCCACCGCCTTGCTCTCCGCGGC GGCGCCGCCTTAAAAATAACCGTCCTAGTCACTCCAAAAAACCTTCCTTTTCTCTCTCCG CTTCTCTCCGCCGTAGTTAACATCGAACCACTTATCCTCCCTTTTCCCTCCCACCCTTCA ATCCCCTCCGGCGTCGAAAACGTCCAAGACTTACCTCCTTCAGGCTTCCCTTTAATGATC CACGCGCTTGGTAATCTCCACGCGCCGCTTATCTCTTGGATTACTTCTCACCCTTCTCCT CCAGTAGCCATCGTATCTGATTTCTTCCTTGGTTGGACCAAAAACCTCGGAATCCCTCGT TTCGATTTCTCTCCCTCCGCTGCTATCACTTGCTGCATACTCAATACTCTCTGGATCGAA ATGCCCACCAAGATCAACGAAGATGACGATAACGAGATCCTCCACTTTCCCAAGATCCCG AATTGTCCAAAATACCGTTTTGATCAGATCTCCTCTCTTTACAGAAGTTACGTTCACGGA GATCCAGCTTGGGAGTTCATAAGAGACTCCTTTAGAGATAACGTGGCGAGTTGGGGACTC GTCGTGAACTCGTTCACCGCCATGGAAGGTGTTTATCTCGAACATCTTAAGCGAGAGATG GGCCATGATCGTGTATGGGCTGTAGGCCCAATTATTCCGTTATCTGGGGATAACCGTGGT GGCCCGACTTCTGTTTCTGTTGATCACGTGATGTCGTGGCTTGACGCACGTGAGGATAAC CACGTGGTGTACGTGTGCTTTGGAAGTCAAGTAGTTTTGACTAAAGAGCAGACTCTTGCA CTCGCCTCTGGGCTTGAGAAAAGCGGCGTCCATTTCATATGGGCCGTAAAGGAGCCCGTT GAGAAAGACTCAACACGTGGCAACATCCTGGACGGTTTCGACGATCGCGTGGCTGGGAGA GGTCTGGTGATCAGAGGATGGGCTCCACAAGTAGCTGTGCTACGTCACCGAGCCGTTGGC GCGTTTTTAACGCACTGTGGTTGGAACTCTGTGGTGGAGGCGGTTGTCGCCGGCGTTTTG ATGCTGACGTGGCCGATGAGAGCTGACCAGTACACTGACGCGTCTCTGGTGGTTGATGAG TTGAAAGTAGGTGTGCGTGCTTGCGAAGGACCTGACACGGTGCCTGACCCGGACGAGTTA GCTCGAGTTTTCGCTGATTCCGTGACCGGAAATCAAACGGAGAGGATCAAAGCCGTGGAG CTGAGGAAAGCAGCGTTGGATGCGATTCAAGAACGTGGGAGCTCAGTGAATGATTTAGAT GGATTTATCCAACATGTCGTTAGTTTAGGACTAAACAAATGA
>UGT89C1
ATGACAACAACAACAACGAAGAAGCCGCACGTTCTGGTGATACCGTTTCCACAATCCGGT CACATGGTTCCACATCTTGACCTCACGCATCAGATTCTTCTCCGTGGAGCCACCGTCACT GTCCTCGTCACACCCAAAAACTCTTCCTATCTCGATGCTCTCCGTTCTCTTCACTCCCCG GAACACTTCAAAACCCTAATCCTTCCTTTTCCTTCTCACCCTTGTATACCTTCCGGTGTC GAATCTCTCCAGCAACTTCCTCTCGAAGCTATAGTTCACATGTTTGATGCTCTCTCTCGT CTCCACGACCCTCTCGTTGACTTTCTCAGCCGTCAACCACCGTCGGATCTCCCCGACGCC ATCCTAGGAAGCTCATTTCTCAGCCCTTGGATTAACAAAGTAGCTGATGCTTTCTCTATT AAGTCCATTAGTTTCTTACCCATCAATGCTCATTCGATCTCCGTCATGTGGGCTCAAGAA GATAGAAGCTTCTTCAACGATCTCGAGACTGCCACAACGGAAAGCTACGGGCTCGTCATC AACAGTTTCTACGACCTCGAGCCTGAGTTTGTAGAAACTGTTAAAACACGTTTCCTGAAT CACCACCGTATATGGACCGTCGGACCGTTGCTCCCCTTTAAAGCTGGCGTTGACCGTGGC GGACAAAGCTCAATCCCGCCGGCGAAAGTCTCGGCTTGGTTAGATTCGTGCCCCGAGGAT AACTCCGTCGTATACGTCGGTTTTGGAAGCCAGATCCGGCTCACGGCGGAGCAAACAGCT GCTTTAGCGGCGGCGTTGGAGAAAAGCAGTGTGCGTTTCATATGGGCGGTGAGAGACGCA GCTAAGAAGGTGAACTCCAGCGATAACTCCGTTGAGGAAGATGTGATCCCGGCGGGATTT GAAGAGAGAGTGAAGGAGAAAGGACTCGTGATAAGAGGATGGGCCCCACAAACTATGATT CTTGAGCATCGAGCCGTTGGATCTTACCTAACTCATTTGGGTTGGGGTTCGGTTCTGGAA GGAATGGTCGGAGGAGTTATGTTGCTAGCGTGGCCGATGCAAGCAGACCATTTCTTTAAC ACGACGCTCATCGTTGATAAACTAAGAGCCGCAGTGCGAGTTGGAGAGAACAGAGACTCG GTTCCTGACTCGGACAAGCTCGCTAGGATTTTGGCTGAGTCGGCGAGAGAGGACTTGCCG GAGAGAGTTACGTTGATGAAGCTGAGGGAGAAAGCTATGGAGGCCATTAAAGAAGGTGGG AGCTCTTACAAGAACTTGGATGAGCTCGTTGCAGAGATGTGTTTGTAA
>UGT90A1
ATGTCCGTTTCAACACATCACCACCACGTGGTCCTCTTCCCTTTCATGTCAAAAGGCCAC ATCATCCCTCTCCTCCAATTCGGTCGTCTCCTCCTCCGTCACCACCGCAAAGAACCAACC ATCACCGTCACCGTTTTCACCACTCCCAAGAACCAACCTTTCATCTCAGACTTCCTCTCG GATACGCCGGAGATCAAAGTCATCTCTCTCCCTTTCCCGGAAAACATCACCGGAATCCCT CCCGGCGTCGAGAACACCGAAAAGCTCCCATCCATGTCACTTTTCGTCCCCTTCACACGC GCCACGAAGCTTCTCCAACCTTTCTTCGAAGAAACACTCAAGACTCTTCCAAAAGTTTCG TTCATGGTCTCTGATGGATTCCTCTGGTGGACATCGGAGTCTGCAGCTAAGTTCAACATT CCAAGATTTGTCTCCTACGGCATGAACTCTTACTCCGCCGCTGTCTCCATCTCTGTTTTC AAACACGAACTCTTTACCGAACCGGAAAGTAAATCTGATACCGAACCGGTCACTGTACCA GACTTTCCATGGATCAAGGTCAAGAAGTGTGATTTCGACCATGGCACTACCGAGCCGGAA GAATCAGGTGCAGCCCTCGAACTATCTATGGACCAAATCAAGTCGACCACCACAAGCCAT GGGTTTTTAGTCAATAGCTTCTACGAGCTCGAGTCAGCATTTGTTGATTACAACAACAAC TCTGGTGATAAACCAAAGTCGTGGTGTGTTGGGCCACTGTGTTTGACAGATCCTCCTAAA CAGGGGAGTGCTAAACCGGCTTGGATTCATTGGTTGGATCAGAAGCGAGAGGAAGGGCGT CCGGTTTTGTACGTGGCGTTTGGAACGCAGGCAGAGATATCGAACAAGCAGCTTATGGAA CTAGCTTTCGGCTTGGAAGATTCAAAGGTGAACTTTCTGTGGGTCACAAGAAAAGATGTG GAGGAGATTATTGGAGAAGGATTCAACGATAGAATAAGAGAGAGTGGGATGATAGTGAGA GATTGGGTGGACCAATGGGAGATATTGTCACATGAAAGTGTCAAAGGATTTTTGAGCCAT TGTGGGTGGAACTCAGCACAAGAGAGCATATGTGTCGGGGTCCCATTGTTGGCTTGGCCG ATGATGGCCGAGCAACCGCTCAATGCGAAGATGGTTGTGGAGGAGATAAAGGTGGGAGTA AGAGTTGAAACGGAAGATGGGAGTGTAAAAGGTTTTGTGACAAGAGAAGAACTAAGTGGA AAGATTAAAGAACTGATGGAAGGAGAAACGGGGAAAACCGCAAGAAAGAATGTAAAAGAA TATTCGAAAATGGCGAAAGCGGCTTTGGTCGAAGGGACTGGTTCGTCATGGAAGAATTTA GATATGATTCTTAAGGAGTTATGTAAGAGTAGAGATTCAAACGGTGCTAGTGAGTAG
>UGT90A2
ATGGAGTTAGAAAAAGTTCACGTGGTTTTGTTCCCATACTTGTCCAAAGGGCACATGATT CCTATGCTCCAATTAGCTCGTCTCCTCTTATCCCACTCCTTCGCCGGAGACATCTCCGTC ACCGTCTTCACCACTCCTTTGAACCGTCCTTTCATCGTTGACTCACTCTCCGGCACCAAA GCGACCATCGTCGACGTACCTTTCCCTGATAACGTCCCGGAGATCCCACCCGGCGTCGAG TGCACTGACAAACTCCCTGCTTTGTCGTCCTCCCTCTTCGTTCCTTTCACAAGAGCCACC AAGTCAATGCAGGCAGACTTTGAGCGAGAGCTCATGTCACTGCCACGTGTCAGTTTCATG GTCTCAGACGGTTTCTTGTGGTGGACGCAAGAGTCAGCTCGAAAGCTAGGGTTTCCTCGG CTTGTTTTCTTTGGTATGAATTGCGCTTCCACCGTTATATGTGACAGTGTTTTTCAAAAC CAGCTTCTATCTAATGTTAAGTCCGAGACGGAGCCAGTTTCTGTACCGGAGTTTCCGTGG ATTAAGGTTAGGAAATGTGATTTCGTTAAAGATATGTTTGATCCAAAAACCACCACAGAT CCTGGATTCAAGCTTATCCTAGATCAAGTCACGTCTATGAATCAAAGCCAAGGTATCATA TTCAATACATTTGACGACCTTGAACCCGTGTTTATTGATTTCTACAAGCGTAAACGCAAA CTCAAGCTTTGGGCAGTTGGACCGCTTTGTTACGTAAATAACTTGGCTTGGATGATGAAG TAGAAGAGAAGGTCAAACCTAGTTGGATGAAATGGCTAGATGAAAAGCGAGACAAGGGAT GCAATGTTCTGTATGTGGCTTTCGGGTCACAAGCCGAGATCTCGAGAGAACAACTAGAGG AGATTGCGTTAGGGTTGGAAGAATCGAAGGTGAACTTCTTGTGGGTGGTCAAAGGAAATG AAATAGGAAAAGGGTTTGAAGAGAGAGTGGGAGAAAGAGGAATGATGGTGAGAGATGAAT GGGTTGATCAGAGGAAGATATTAGAGCACGAGAGTGTTAGAGGGTTCTTGAGCCATTGTG GGTGGAATTCTCTGACGGAGAGCATTTGCTCGGAGGTTCCAATCTTGGCGTTTCCTTTAG CAGCGGAGCAACCTCTGAATGCGATTTTGGTGGTGGAAGAGCTGAGAGTGGCGGAGAGAG TGGTGGCGGCGAGTGAAGGGGTTGTGAGAAGAGAAGAGATTGCAGAGAAAGTGAAGGAGT TGATGGAGGGAGAGAAAGGGAAAGAGCTGAGGAGGAATGTCGAGGCATATGGTAAGATGG
CGAAGAΆGGCTTTGGAGGAAGGTATTGGTTCGTCTAGGAAGAATTTAGACAACCTTATCA ACGAGTTTTGTAACAATGGAACATGA
>UGT90A4
ATGGCCGTTTCATCGTCGCATCATGCGGTTCTCTTCCCTTACATGTCAAAAGGCCACACG ATTCCTCTCCTCCAATTCGCCCGTCTCCTCCTCCGTCACCGCCGTATCGTCTCCGTAGAC GACGAAGAACCAACCATTTCCGTCACCGTCTTCACCACCCCAAAAAACCAACCATTCGTC TCAAACTTCCTCTCTGACGTCGCATCATCTATCAAAGTAATCTCCCTCCCTTTCCCTGAA AACATCGCCGGAATCCCTCCCGGCGTCGAGAGCACCGACATGCTCCCTTCCATATCACTT TACGTGCCCTTCACGCGCGCAACCAAATCTCTCCAGCCTTTCTTCGAAGCAGAACTCAAG AATCTTGAGAAAGTTTCTTTCATGGTCTCCGATGGATTCTTATGGTGGACATCGGAATCC GCCGCTAAATTTGAGATCCCGAGACTTGCCTTCTACGGCATGAACTCCTACGCATCGGCT ATGTGCTCCGCCATTTCGGTACACGAGCTCTTTACCAAACCGGAAAGTGTTAAATCTGAT ACTGAACCGGTTACTGTACCGGATTTTCCATGGATATGTGTTAAGAAGTGTGAGTTCGAT CCGGTTTTGACCGAACCGGATCAATCGGATCCAGCGTTCGAGCTACTCATTGACCATCTT ATGTCCACCAAGAAAAGCCGTGGAGTTATAGTGAACAGCTTTTACGAGCTCGAGTCAACG TTCGTTGACTACCGGCTCCGTGATAACGATGAACCAAAACCGTGGTGTGTTGGGCCTTTG TGTTTGGTAAATCCTCCAAAACCGGAGAGTGATAAACCGGATTGGATTCATTGGTTGGAC CGGAAACTAGAGGAAAGATGTCCGGTTATGTATGTGGCGTTTGGAACGCAGGCTGAGATA TCGAACGAGCAGCTCAAGGAAATAGCATTAGGGTTGGAAGATTCCAAGGTCAATTTCTTG TGGGTCACGAGAAAGGACTTGGAAGAAGTAACTGGAGGATTAGGGTTCGAAAAGAGAGTG AAAGAGCATGGGATGATTGTGAGAGATTGGGTAGACCAATGGGAGATATTGTCACATAAA AGTGTCAAAGGGTTTTTGAGTCATTGTGGATGGAACTCGGCGCAAGAGAGTATTTGCGCT GGGGTTCCACTACTCGCTTGGCCAATGATGGCAGAGCAGCCACTCAATGCGAAGTTGGTA GTGGAGGAGCTAAAGATCGGAGTAAGAΆTCGAAACAGAAGATGTAAGTGTGAAΆGGATTC GTGACAAGAGAAGAACTTAGTCGAAAGGTTAAACAATTGATGGAGGGAGAGATGGGGAAG ACAACGATGAAGAATGTAAAAGAGTATGCGAAAATGGCGAAAAAAGCTATGGCTCAAGGG ACTGGTTCGTCTTGGAAGAGTTTGGATTCGCTTCTGGAAGAGCTTTGTAAGAGTAGAGAG CCAGACGGTGTTAATAAGTTGTCAAGTTCTGATGCTTAG
>UGT91A1
ATGACAAACTTCAAAGACAACGATGGAGATGGAACCAAACTCCACGTGGTAATGTTTCCA TGGTTAGCCTTTGGTCACATGGTTCCATACTTGGAGCTCTCTAAACTCATAGCTCAAAAG GGTCACAAAGTCTCTTTCATTTCCACTCCACGTAACATCGACCGTCTCCTCCCATGGTTA CCGGAAAATCTCTCCTCCGTCATTAACTTCGTCAAGCTATCACTTCCCGTCGGCGACAAC AAACTCCCGGAAGACGGTGAAGCTACCACAGACGTCCCTTTCGAACTCATACCTTACTTA AAAATCGCTTACGACGGGTTAAAAGTTCCGGTGACGGAGTTTCTTGAATCTTCGAAACCC GATTGGGTTCTTCAAGATTTCGCGGGGTTTTGGCTTCCTCCAATCTCTCGTCGTCTCGGA ATCAAAACCGGATTCTTTAGCGCTTTCAACGGCGCGACGCTCGGTATTCTTAAACCGCCG GGGTTCGAAGAGTACCGTACTTCGCCGGCGGATTTTATGAAGCCGCCTAAGTGGGTTCCG TTTGAAACTTCGGTAGCTTTCAAGTTATTTGAATGCAGGTTCATTTTCAAAGGATTTATG GCGGAAACCACCGAAGGGAATGTTCCCGACATCCACCGTGTCGGCGGCGTAATTGACGGC TGTGACGTCATCTTCGTACGGAGCTGTTACGAGTATGAAGCGGAGTGGTTAGGACTTACA CAAGAACTTCACCGGAAACCGGTTATACCGGTCGGAGTTTTGCCTCCAAAACCGGACGAA AAGTTTGAAGATACCGACACGTGGCTGTCTGTTAAAAAATGGTTGGACTCACGGAAAAGT AAGTCCATTGTCTACGTAGCTTTTGGTTCAGAAGCTAAACCGAGTCAAACGGAGCTAAAT GAGATCGCTCTCGGTTTAGAGCTTTCTGGTTTACCTTTCTTTTGGGTGTTAAAGACTCGT CGTGGTCCGTGGGATACCGAACCGGTCGAGCTTCCGGAAGGATTCGAAGAGCGTACAGCG GATAGAGGGATGGTGTGGAGAGGTTGGGTTGAGCAATTGCGTACATTGAGCCATGACTCG ATCGGTTTGGTTCTGACTCATCCCGGTTGGGGAACGATAATTGAAGCTATCCGGTTTGCT AAACCGATGGCAATGCTGGTTTTTGTGTATGACCAAGGATTGAATGCGAGAGTCATTGAA GAGAAGAAAATTGGGTATATGATCCCTCGAGACGAGACAGAAGGTTTCTTTACTAAAGAA AGTGTTGCGAATTCGCTAAGATTGGTAATGGTGGAAGAAGAAGGAAAGGTTTATAGAGAG AATGTGAAGGAGATGAAAGGAGTGTTTGGAGATATGGATAGACAAGATCGTTATGTGGAT TCATTCTTGGAATATCTTGTTACTAATCGTTAA
>UGT91B1
ATGGCCGAGCCAAAACCGAAGCTTCATGTTGCAGTGTTCCCATGGTTAGCTTTAGGTCAC ATGATTCCTTACTTGCAACTCTCAAAGCTCATAGCAAGGAAAGGCCATACTGTGTCCTTC ATCTCCACAGCTCGTAACATTTCACGTCTTCCCAATATATCCTCCGACCTTTCCGTGAAT TTCGTTTCTTTGCCGTTAAGTCAAACCGTCGACCATCTCCCAGAGAACGCTGAGGCCACC ACTGATGTCCCGGAGACTCACATAGCTTATCTGAAGAAAGCATTTGATGGGCTTTCTGAA GCTTTCACAGAGTTTTTAGAAGCTTCCAAACCAAACTGGATAGTGTATGATATCTTGCAC CATTGGGTCCCGCCTATCGCTGAGAAGCTCGGCGTGAGACGAGCCATCTTCTGCACGTTC AACGCAGCTTCCATCATCATCATCGGTGGGCCAGCATCAGTCATGATTCAAGGTCATGAC CCTCGAAAGACTGCTGAAGATCTTATCGTGCCTCCACCATGGGTCCCGTTTGAGACCAAC ATAGTTTACCGTCTCTTTGAAGCTAAGAGGATCATGGAGTATCCCACGGCAGGTGTAACT GGAGTTGAATTGAACGACAACTGTAGATTGGGTTTGGCTTACGTTGGCTCTGAGGTTATT GTGATTAGATCATGTATGGAACTCGAACCTGAGTGGATTCAATTGCTCAGTAAACTCCAA GGAAAGCCTGTGATTCCAATTGGTTTACTCCCGGCTACACCAATGGATGATGCAGATGAC GAGGGAACATGGTTAGACATCAGAGAATGGCTAGACAGACATCAAGCAAAGTCTGTGGTT TATGTAGCCTTAGGAACTGAAGTGACAATTAGTAACGAAGAGATTCAAGGTTTAGCTCAT GGGTTGGAGCTTTGCAGGTTACCTTTCTTTTGGACGCTAAGGAAGAGGACTAGAGCTTCT ATGCTACTACCTGATGGGTTCAAAGAGAGAGTCAAAGAGCGTGGAGTCATTTGGACCGAG TGGGTACCTCAGACCAAGATACTGAGCCATGGTTCAGTTGGTGGGTTTGTTACTCATTGT GGTTGGGGATCAGCTGTGGAAGGGCTTAGCTTTGGTGTCCCTTTGATCATGTTTCCATGT AACCTAGACCAGCCGCTAGTGGCTAGGTTGCTCAGTGGGATGAATATAGGCTTGGAGATT CCAAGGAATGAGCGAGACGGGCTGTTCACGAGTGCTTCTGTTGCAGAGACAATCAGACAT GTTGTTGTGGAAGAAGAAGGAAAGATCTACAGGAACAATGCTGCATCTCAGCAAAAGAAA ATATTCGGGAACAAGAGATTGCAAGATCAGTATGCGGATGGTTTTATCGAGTTTCTGGAG AATCCTATAGCAGGAGTGTAG
>UGT91C1
ATGGTCGACAAGAGAGAAGAAGTTATGCACGTAGCCATGTTTCCATGGCTAGCTATGGGT CATCTCCTTCCTTTTCTTCGTCTCTCCAAGTTACTAGCTCAAAAGGGTCACAAGATCTCT TTCATATCAACACCAAGAAACATCGAAAGACTTCCTAAATTACAATCAAACCTCGCCTCC TCCATCACCTTCGTCTCTTTCCCTCTCCCTCCCATCTCAGGCTTGCCTCCTTCTTCAGAA TCATCCATGGACGTTCCTTACAACAAGCAACAGTCTCTTAAAGCCGCTTTTGATCTTCTT CAGCCACCGTTGAAAGAGTTTCTCCGACGGTCTTCTCCGGATTGGATCATATACGACTAT GCTTCTCACTGGCTTCCTTCTATTGCGGCCGAGCTTGGAATCTCTAAGGCTTTCTTTAGT CTCTTTAACGCAGCTACTCTCTGTTTCATGGGACCGTCTTCGTCTTTGATTGAAGAAATT AGATCAACGCCGGAAGATTTCACGGTGGTGCCACCGTGGGTCCCGTTCAAGTCAAACATC GTGTTTCGTTATCATGAAGTTACTAGATACGTTGAGAAGACAGAGGAAGATGTAACCGGA GTCTCTGACTCAGTTCGGTTTGGTTACTCGATTGACGAAAGCGATGCGGTTTTTGTCCGT AGCTGTCCGGAGTTTGAACCGGAATGGTTTGGTTTACTAAAAGACCTGTACCGTAAACCG GTATTTCCAATCGGGTTTTTGCCTCCGGTTATTGAAGACGACGATGCCGTTGATACTACA TGGGTTCGTATAAAGAAGTGGCTCGACAAGCAACGGCTTAATTCAGTTGTTTACGTGTCA CTTGGCACCGAAGCGAGTCTTCGTCATGAGGAAGTAACTGAGCTAGCTCTTGGGTTAGAG AAGTCAGAGACACCGTTCTTTTGGGTCCTAAGGAACGAGCCAAAGATTCCAGATGGGTTC AAAACACGAGTCAAGGGACGTGGAATGGTTCATGTTGGTTGGGTTCCACAAGTGAAAATA CTTAGTCACGAGTCAGTAGGAGGGTTCTTGACACATTGTGGTTGGAACTCAGTGGTGGAA GGGTTAGGGTTTGGTAAAGTTCCAATCTTTTTTCCGGTGTTGAATGAGCAAGGACTTAAT ACGAGGTTGTTGCATGGGAAAGGACTTGGTGTTGAGGTTTCAAGAGATGAGAGAGATGGG TCGTTTGATTCTGACTCGGTCGCTGACTCGATTAGGTTGGTGATGATTGATGATGCTGGC GAGGAGATAAGGGCTAAGGCTAAAGTGATGAAGGATTTGTTTGGGAACATGGATGAGAAT ATTCGTTATGTTGACGAACTTGTTAGGTTTATGAGAAGTAAAGGATCATCATCATCATCA TGA
>UGT92A1
ATGGCGGAAGCTAAACCCAGAAATCTGAGAATCGTGATGTTCCCTTTCATGGGACAAGGC CATATCATCCCGTTTGTAGCTTTAGCCCTTCGTTTAGAGAAGATTATGATTATGAACAGA GCCAACAAAACCACCATCTCTATGATCAATACTCCTTCGAACATCCCCAAAATACGCTCC AATCTTCCACCTGAATCCTCCATAAGTCTCATAGAGTTACCTTTCAACAGCTCTGATCAT GGCCTTCCTCACGACGGCGAGAATTTCGATTCTCTTCCTTACTCTCTCGTCATCAGCCTT CTTGAAGCTTCTAGGTCGCTTCGTGAGCCCTTTCGAGACTTCATGACGAAGATCTTGAAG GAAGAAGGGCAGAGCTCGGTTATAGTGATCGGTGATTTCTTCTTGGGTTGGATCGGTAAG GTTTGCAAAGAGGTTGGTGTTTATTCAGTGATCTTTAGTGCTTCTGGTGCTTTTGGTTTA GGTTGTTATAGATCCATATGGTTAAACTTGCCACATAAAGAAACCAAACAAGATCAGTTT CTCTTAGATGATTTCCCTGAAGCAGGGGAGATTGAGAAAACTCAGTTGAATTCTTTCATG TTAGAAGCTGATGGAACCGATGATTGGTCTGTTTTCATGAAGAAGATTATACCTGGATGG TCTGACTTCGATGGATTCTTGTTCAACACGGTTGCTGAAATCGATCAGATGGGATTATCC TACTTCCGTAGAATAACCGGTGTTCCGGTTTGGCCAGTTGGGCCGGTTTTGAAGTCTCCG GATAAGAAGGTGGGATCGAGGTCGACAGAGGAAGCAGTGAAGTCATGGCTTGACTCAAAA CCGGACCATTCGGTTGTGTACGTATGTTTCGGTTCAATGAACTCGATTTTGCAAACGCAT ATGTTAGAATTGGCTATGGCATTAGAGAGTAGCGAGAAGAACTTCATATGGGTGGTGAGG CCGCCCATAGGTGTGGAGGTGAAGAGTGAGTTTGATGTGAAAGGGTATCTACCGGAAGGA TTTGAGGAAAGAATAACAAGATCGGAAAGAGGGTTACTTGTGAAGAAATGGGCACCACAA GTTGATATATTGTCACACAAGGCAACATGTGTGTTTTTGAGTCATTGCGGATGGAACTCG ATACTCGAATCACTTAGCCACGGTGTGCCACTGCTCGGATGGCCCATGGCAGCCGAGCAG TTCTTCAATTCCATATTGATGGAGAAACATATTGGGGTATCGGTTGAGGTGGCGCGTGGG AAGAGATGTGAGATCAAATGTGATGACATTGTTTCTAAGATCAAACTGGTGATGGAGGAG ACTGAAGTAGGGAAAGAGATTAGGAAGAAGGCTAGAGAGGTGAAGGAGTTAGTGAGGAGA GCAATGGTAGATGGAGTTAAAGGTTCCTCCGTCATTGGTTTGGAAGAGTTTCTTGACCAA GCAATGGTCAAGAAAGTGGAGAATTGA

Claims

Claims
1. A nucleic acid molecule that encodes a chimeric polypeptide wherein said nucleic acid molecule comprises a first part selected from the group consisting of:
i) a nucleic acid molecule consisting of a nucleic acid sequence which encodes an amino acid sequence as represented in Figure 3; ii) a nucleic acid molecule which hybridises to the nucleic acid molecule in (i) and which specifically targets said chimeric polypeptide to an oil body; and iii) a nucleic acid molecule which differs from the nucleic acid molecules of (i) and (ii) due to the degeneracy in the genetic code; and a second part that encodes an heterologous polypeptide.
2. A nucleic acid molecule according to Claim 1 wherein said second part encodes an enzyme.
3. A nucleic acid according to Claim 1 wherein said second part encodes a pharmaceutical polypeptide.
4. A nucleic acid molecule according to Claim 1 wherein said second part encodes an antigenic polypeptide.
5. A nucleic acid molecule according to Claim 1 wherein said second part encodes a mussel adhesive polypeptide encoded by a nucleic acid molecule selected from the group consisting of:
i) a nucleic acid molecule which encodes a polypeptide comprising the amino acid motif
X-K-X-X-Y-P-X-X-Y-K wherein X is any amino acid residue; ii) a nucleic acid molecule which encodes a polypeptide wherein at least one of the residues in said motif is modified by hydroxylation; and iii) a nucleic acid molecule which encodes a polypeptide which has adhesive properties.
6. A nucleic acid molecule according to Claim 1 wherein said second part encodes a spider silk polypeptide comprising a nucleic acid sequence selected from the group consisting of: i) a nucleic acid molecule consisting of the nucleic acid sequence as represented in Figure 4a or 4b; ii) a nucleic acid molecule which encodes a polypeptide domain comprising the amino acid sequence AGRGQGGYGQGAGG and at least two motifs rich in polyalanine wherein said polyalanine motifs comprise at least 6 alanine amino acid residues; iii) a nucleic acid molecule which hybridises to the sequence presented in
(i) and (ii) above and which encodes a silk polypeptide; and iv) a nucleic acid sequences that are degenerate as a result of the genetic code to the nucleic acid sequence defined in (i) - (iii) above.
7. A nucleic acid molecule according to Claim 1 wherein said second part encodes a polypeptide with glucosylfransferase activity and is encoded by a nucleic acid molecule selected from the group consisting of: i) a nucleic acid molecule selected from the group consisting of nucleic acid sequences as represented in Table 1; ii) a nucleic acid molecule which hybridise to the sequences represented in (ii) above; and which have glucosyltransferase activity; and iii) nucleic acid molecules consisting of nucleic acid sequences which are degenerate as a result of the genetic code to the sequences defined in (i) and (ii) above.
8. A nucleic acid molecule according to any of Claims 1-7 wherein said molecule encodes a cleavage site to facilitate the purification of said second part from said first part.
9. A nucleic acid molecule according to Claim 8 wherein said cleavage site is a proteolytic cleavage site.
10. A chimeric polypeptide encoded by a nucleic acid molecule according to any of Claims 1-9.
11. A vector comprising a nucleic acid molecule that encodes a chimeric polypeptide according to Claim 10.
12. A cell transfected or transformed with a nucleic acid molecule according to any of Claims 1-9 or a vector according to Claim 11.
13. A cell according to Claim 12 wherein said cell is a plant cell.
14. A method to produce a chimeric polypeptide according to Claim 10 comprising the steps of:
i) providing a cell according to Claim 12 or 13 and growth conditions conducive to the production of a chimeric polypeptide according to the invention; and optionally
ii) purifying said chimeric polypeptide from said cell or growth media.
15. A plant comprising a cell according to Claim 13.
16. A seed comprising a cell according to Claim 13.
17. A vessel comprising a cell according to Claim 12 or 13.
16. A vessel comprising a polypeptide according to Claim 10.
17. A vessel according to Claim 16 wherein said vessel is a bioreactor.
18. A vessel according to Claim 17 wherein said bioreactor is a fermentor.
19. A vaccine preparation comprising a cell, or a fraction of a cell, according to Claim 12 or 13 or a plant according to Claim 15.
20. A preparation according to Claim 19 wherein said preparation is an oil body containing fraction.
21. A preparation according to Claim 19 or 20 wherein said preparation includes at least one carrier.
22. A preparation according to Claim 20 or 21 wherein said preparation includes at least one adjuvant.
23. A method to immunise an animal to at least the second part of the chimeric polypeptide according to Claim 10, comprising administering an effective amount of a chimeric polypeptide or preparation according to any of Claims 19-22 sufficient to stimulate an immune response to said second part.
24. A method according to Claim 23 wherein said animal is human.
25. An antibody obtainable by the method according to the invention.
26. An antibody according to Claim 25 wherein said antibody is a monoclonal antibody or binding fragment thereof.
27. A method for preparing a hybridoma cell-line producing monoclonal antibodies according to Claim 26 comprising the steps of: i) immunising an immunocompetent mammal with a chimeric polypeptide or preparation according to any previous Claim; ii) fusing lymphocytes of the immunised immunocompetent mammal with myeloma cells to form hybridoma cells; iii) screening monoclonal antibodies produced by the hybridoma cells of step (ii) for binding activity to polypeptide of the invention; iv) culturing the hybridoma cells to proliferate and/or to secrete said monoclonal antibody; and v) recovering the monoclonal antibody from the culture supernatant.
28. A hybridoma cell-line obtainable by the method according to Claim 27.
29. A method to prepare a vaccine preparation according to any of Claims 19-22 comprising; i) forming a cell extract from a plant cell culture or plant according to the invention; and iii) separating said extract into an oil body rich fraction.
30. A method according to Claim 29 wherein said extract is incubated with a protease that cleaves said second part from said first part.
PCT/GB2004/002432 2003-06-18 2004-06-09 Expression of heterologous protein WO2004113540A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
GB0314102A GB0314102D0 (en) 2003-06-18 2003-06-18 Expression of heterologous protein
GB0314102.5 2003-06-18
GB0402293.5 2004-02-03
GB0402293A GB0402293D0 (en) 2004-02-03 2004-02-03 Plant derived vaccine

Publications (1)

Publication Number Publication Date
WO2004113540A1 true WO2004113540A1 (en) 2004-12-29

Family

ID=33542669

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2004/002432 WO2004113540A1 (en) 2003-06-18 2004-06-09 Expression of heterologous protein

Country Status (1)

Country Link
WO (1) WO2004113540A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005040363A1 (en) * 2003-10-13 2005-05-06 The University Of York Glucosyltransferases
WO2006134320A3 (en) * 2005-06-16 2007-02-22 Univ York Glycosyltransferase activity

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1993021320A1 (en) * 1991-02-22 1993-10-28 University Technologies International, Inc. Oil-body proteins as carriers of high-value peptides in plants
WO1998027115A1 (en) * 1996-12-16 1998-06-25 Sembiosys Genetics Inc. Oil bodies and associated proteins as affinity matrices

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1993021320A1 (en) * 1991-02-22 1993-10-28 University Technologies International, Inc. Oil-body proteins as carriers of high-value peptides in plants
WO1998027115A1 (en) * 1996-12-16 1998-06-25 Sembiosys Genetics Inc. Oil bodies and associated proteins as affinity matrices

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
ABELL BEN M ET AL: "Role of the proline knot motif in oleosin endoplasmic reticulum topology and oil body targeting", PLANT CELL, vol. 9, no. 8, 1997, pages 1481 - 1493, XP002303933, ISSN: 1040-4651 *
ALTAF A ET AL: "ACID LIPASE OF CASTOR BEAN LIPID BODIES: ISOLATION AND CHARACTERISATION", JOURNAL OF PLANT BIOCHEMISTRY AND BIOTECHNOLOGY, SOCIETY FOR PLANT BIOCHEMISTRY AND BIOTECHNOLOGY, NEW DEHLI, IN, vol. 6, no. 1, January 1997 (1997-01-01), pages 13 - 18, XP009037294, ISSN: 0971-7811 *
EASTMOND PETER J: "Cloning and characterization of the Acid lipase from castor beans.", THE JOURNAL OF BIOLOGICAL CHEMISTRY. 29 OCT 2004, vol. 279, no. 44, 29 October 2004 (2004-10-29), pages 45540 - 45545, XP002303934, ISSN: 0021-9258 *
FUCHS C ET AL: "PURIFICATION AND CHARACTERIZATION OF THE ACID LIPASE FROM THE ENDOSPERM OF CASTOR OIL SEEDS", JOURNAL OF PLANT PHYSIOLOGY, FISCHER, STUTTGART, DE, vol. 149, 1996, pages 23 - 29, XP001026860, ISSN: 0176-1617 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005040363A1 (en) * 2003-10-13 2005-05-06 The University Of York Glucosyltransferases
WO2006134320A3 (en) * 2005-06-16 2007-02-22 Univ York Glycosyltransferase activity

Similar Documents

Publication Publication Date Title
JP4610334B2 (en) Production of peptides and proteins by accumulating in protein granules from the endoplasmic reticulum of plants
Daniell et al. Optimization of codon composition and regulatory elements for expression of human insulin like growth factor-1 in transgenic chloroplasts and evaluation of structural identity and function
JP4864701B2 (en) Method for producing insulin in plants
KR102444024B1 (en) Recombinant vector for antigen production for diagnosis of African swine fever and use thereof
EP2367941B1 (en) Method for the transformation of plant cell mitochondria
US20150157731A1 (en) Transgenic plants expressing cobalamin binding proteins
WO2001042441A2 (en) Plastid transformation
WO2004113540A1 (en) Expression of heterologous protein
CA3064801A1 (en) Porcine epidemic diarrhea virus-like particles
BG64320B1 (en) RECOMBINANT PREVODENEOUS LIQUIDS AND POLYPEPTIDE DERIVATIVES, PRODUCED BY PLANTS, METHOD FOR RECEIVING AND USING THE
WO2006117927A1 (en) Transformed plant capable of producing polyglutamic acid
Siegert et al. Expression of the major mugwort pollen allergen Art v 1 in tobacco plants and cell cultures: problems and perspectives for allergen production in plants
CN107384948B (en) Method for highly expressing target protein in plant and method for producing composition for oral administration of medical protein for expressing plant
Ghorbaniparsa et al. Comparing Patatin Class I and Camv 35s Promoters in Expression of Human Calcitonin Gene in Potato (Solanum Tuberosum Cvs. Kardal And Marfona)
KR102726134B1 (en) Vaccinia virus recombinant A27L protein and kit for diagnosing orthopoxvirus antibody comprising the same
KR102726138B1 (en) Vaccinia virus recombinant A33R protein and kit for diagnosing orthopoxvirus antibody comprising the same
US9441232B2 (en) Pericarp tissue preferred regulatory region and method of using same
CN116457008A (en) Modified coronavirus structural proteins
de Oliveira Heterologous production of oligopeptides with antihypertensive effects in Lactuca sativa and Medicago truncatula
WO2005045046A1 (en) Expression of plastid-targeted polypeptides in plants
Class et al. Patent application title: METHODS OF DELIVERY OF MOLECULES TO CELLS USING A RICIN SUBUNIT AND COMPOSITIONS RELATING TO SAME Inventors: Carole L. Cramer (Jonesboro, AR, US) Maureen C. Dolan (Jonesboro, AR, US) Michael J. Reidy (Rockville, MD, US) Assignees: ARKANSAS STATE UNIVERSITY

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载