WO2004113540A1 - Expression of heterologous protein - Google Patents
Expression of heterologous protein Download PDFInfo
- Publication number
- WO2004113540A1 WO2004113540A1 PCT/GB2004/002432 GB2004002432W WO2004113540A1 WO 2004113540 A1 WO2004113540 A1 WO 2004113540A1 GB 2004002432 W GB2004002432 W GB 2004002432W WO 2004113540 A1 WO2004113540 A1 WO 2004113540A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- nucleic acid
- acid molecule
- polypeptide
- encodes
- cell
- Prior art date
Links
- 108090000623 proteins and genes Proteins 0.000 title abstract description 28
- 102000004169 proteins and genes Human genes 0.000 title abstract description 23
- 230000014509 gene expression Effects 0.000 title description 18
- 239000013598 vector Substances 0.000 claims abstract description 21
- 238000004519 manufacturing process Methods 0.000 claims abstract description 12
- 238000000746 purification Methods 0.000 claims abstract description 6
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 117
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 111
- 229920001184 polypeptide Polymers 0.000 claims description 108
- 150000007523 nucleic acids Chemical class 0.000 claims description 86
- 108020004707 nucleic acids Proteins 0.000 claims description 62
- 102000039446 nucleic acids Human genes 0.000 claims description 62
- 210000004027 cell Anatomy 0.000 claims description 43
- 238000000034 method Methods 0.000 claims description 24
- 238000002360 preparation method Methods 0.000 claims description 21
- 230000001070 adhesive effect Effects 0.000 claims description 20
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 20
- 239000000853 adhesive Substances 0.000 claims description 18
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 17
- 229960005486 vaccine Drugs 0.000 claims description 17
- 210000004408 hybridoma Anatomy 0.000 claims description 12
- 230000028993 immune response Effects 0.000 claims description 11
- 241001465754 Metazoa Species 0.000 claims description 7
- 230000027455 binding Effects 0.000 claims description 7
- 230000002068 genetic effect Effects 0.000 claims description 7
- 229920001872 Spider silk Polymers 0.000 claims description 6
- 239000002671 adjuvant Substances 0.000 claims description 6
- 230000000890 antigenic effect Effects 0.000 claims description 6
- 230000000694 effects Effects 0.000 claims description 6
- 239000012634 fragment Substances 0.000 claims description 6
- 102000004190 Enzymes Human genes 0.000 claims description 5
- 108090000790 Enzymes Proteins 0.000 claims description 5
- 241000124008 Mammalia Species 0.000 claims description 5
- 238000003776 cleavage reaction Methods 0.000 claims description 5
- 230000007017 scission Effects 0.000 claims description 5
- 241000237536 Mytilus edulis Species 0.000 claims description 4
- 230000033444 hydroxylation Effects 0.000 claims description 4
- 238000005805 hydroxylation reaction Methods 0.000 claims description 4
- 235000020638 mussel Nutrition 0.000 claims description 4
- 108010054442 polyalanine Proteins 0.000 claims description 4
- 108091005804 Peptidases Proteins 0.000 claims description 3
- 239000004365 Protease Substances 0.000 claims description 3
- 125000000539 amino acid group Chemical group 0.000 claims description 3
- 230000012010 growth Effects 0.000 claims description 3
- 238000012216 screening Methods 0.000 claims description 3
- 206010035226 Plasma cell myeloma Diseases 0.000 claims description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims description 2
- 235000004279 alanine Nutrition 0.000 claims description 2
- 238000004113 cell culture Methods 0.000 claims description 2
- 239000012228 culture supernatant Substances 0.000 claims description 2
- 238000012258 culturing Methods 0.000 claims description 2
- 239000001963 growth medium Substances 0.000 claims description 2
- 210000004698 lymphocyte Anatomy 0.000 claims description 2
- 201000000050 myeloid neoplasm Diseases 0.000 claims description 2
- 230000006337 proteolytic cleavage Effects 0.000 claims description 2
- 102000000340 Glucosyltransferases Human genes 0.000 claims 1
- 108010055629 Glucosyltransferases Proteins 0.000 claims 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 abstract description 9
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 abstract description 9
- 241000196324 Embryophyta Species 0.000 description 41
- 235000018102 proteins Nutrition 0.000 description 21
- 239000000427 antigen Substances 0.000 description 20
- 108091007433 antigens Proteins 0.000 description 20
- 102000036639 antigens Human genes 0.000 description 20
- 239000003921 oil Substances 0.000 description 20
- 239000002299 complementary DNA Substances 0.000 description 11
- 240000008042 Zea mays Species 0.000 description 8
- 230000002163 immunogen Effects 0.000 description 8
- 239000000243 solution Substances 0.000 description 8
- 230000001225 therapeutic effect Effects 0.000 description 8
- 235000010469 Glycine max Nutrition 0.000 description 7
- 244000068988 Glycine max Species 0.000 description 7
- 244000020551 Helianthus annuus Species 0.000 description 7
- 235000003222 Helianthus annuus Nutrition 0.000 description 7
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- 210000001519 tissue Anatomy 0.000 description 7
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 6
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- 206010028980 Neoplasm Diseases 0.000 description 6
- 240000006394 Sorghum bicolor Species 0.000 description 6
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 6
- 239000000499 gel Substances 0.000 description 6
- 238000009396 hybridization Methods 0.000 description 6
- 239000011159 matrix material Substances 0.000 description 6
- 238000003752 polymerase chain reaction Methods 0.000 description 6
- 108010047041 Complementarity Determining Regions Proteins 0.000 description 5
- 240000003183 Manihot esculenta Species 0.000 description 5
- 240000007594 Oryza sativa Species 0.000 description 5
- 235000007164 Oryza sativa Nutrition 0.000 description 5
- 235000004443 Ricinus communis Nutrition 0.000 description 5
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 5
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 5
- 241000700605 Viruses Species 0.000 description 5
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 5
- 235000009973 maize Nutrition 0.000 description 5
- 239000012528 membrane Substances 0.000 description 5
- 230000008685 targeting Effects 0.000 description 5
- 240000002791 Brassica napus Species 0.000 description 4
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 4
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 4
- 235000004431 Linum usitatissimum Nutrition 0.000 description 4
- 240000006240 Linum usitatissimum Species 0.000 description 4
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 description 4
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 4
- 244000061176 Nicotiana tabacum Species 0.000 description 4
- 240000000528 Ricinus communis Species 0.000 description 4
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 4
- 235000002595 Solanum tuberosum Nutrition 0.000 description 4
- 244000061456 Solanum tuberosum Species 0.000 description 4
- 102000004142 Trypsin Human genes 0.000 description 4
- 108090000631 Trypsin Proteins 0.000 description 4
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 4
- 229940088598 enzyme Drugs 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- 238000004949 mass spectrometry Methods 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 238000010188 recombinant method Methods 0.000 description 4
- 235000009566 rice Nutrition 0.000 description 4
- 238000001228 spectrum Methods 0.000 description 4
- 229940031626 subunit vaccine Drugs 0.000 description 4
- 238000004885 tandem mass spectrometry Methods 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 239000012588 trypsin Substances 0.000 description 4
- 230000003612 virological effect Effects 0.000 description 4
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 3
- 244000299507 Gossypium hirsutum Species 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- 235000007340 Hordeum vulgare Nutrition 0.000 description 3
- 240000005979 Hordeum vulgare Species 0.000 description 3
- 241000725303 Human immunodeficiency virus Species 0.000 description 3
- 102000004882 Lipase Human genes 0.000 description 3
- 108090001060 Lipase Proteins 0.000 description 3
- 241000219823 Medicago Species 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 3
- 244000046052 Phaseolus vulgaris Species 0.000 description 3
- 241000283984 Rodentia Species 0.000 description 3
- 241000209056 Secale Species 0.000 description 3
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 3
- 241000592344 Spermatophyta Species 0.000 description 3
- 241000194017 Streptococcus Species 0.000 description 3
- 235000021307 Triticum Nutrition 0.000 description 3
- 244000098338 Triticum aestivum Species 0.000 description 3
- 235000001014 amino acid Nutrition 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 239000002775 capsule Substances 0.000 description 3
- 235000013339 cereals Nutrition 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 230000002538 fungal effect Effects 0.000 description 3
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 3
- 239000004009 herbicide Substances 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 244000052769 pathogen Species 0.000 description 3
- 239000006187 pill Substances 0.000 description 3
- 235000021251 pulses Nutrition 0.000 description 3
- 238000012340 reverse transcriptase PCR Methods 0.000 description 3
- 230000028327 secretion Effects 0.000 description 3
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 3
- 239000007909 solid dosage form Substances 0.000 description 3
- 239000003826 tablet Substances 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 238000011282 treatment Methods 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- VBICKXHEKHSIBG-UHFFFAOYSA-N 1-monostearoylglycerol Chemical compound CCCCCCCCCCCCCCCCCC(=O)OCC(O)CO VBICKXHEKHSIBG-UHFFFAOYSA-N 0.000 description 2
- 108010043797 4-alpha-glucanotransferase Proteins 0.000 description 2
- 102100034540 Adenomatous polyposis coli protein Human genes 0.000 description 2
- 241000701242 Adenoviridae Species 0.000 description 2
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 2
- ATRRKUHOCOJYRX-UHFFFAOYSA-N Ammonium bicarbonate Chemical compound [NH4+].OC([O-])=O ATRRKUHOCOJYRX-UHFFFAOYSA-N 0.000 description 2
- 229910000013 Ammonium bicarbonate Inorganic materials 0.000 description 2
- 244000144725 Amygdalus communis Species 0.000 description 2
- 235000011437 Amygdalus communis Nutrition 0.000 description 2
- 244000226021 Anacardium occidentale Species 0.000 description 2
- 244000099147 Ananas comosus Species 0.000 description 2
- 206010002198 Anaphylactic reaction Diseases 0.000 description 2
- 244000105624 Arachis hypogaea Species 0.000 description 2
- 208000035143 Bacterial infection Diseases 0.000 description 2
- 241000219310 Beta vulgaris subsp. vulgaris Species 0.000 description 2
- 235000011331 Brassica Nutrition 0.000 description 2
- 241000219198 Brassica Species 0.000 description 2
- 235000006008 Brassica napus var napus Nutrition 0.000 description 2
- 235000011299 Brassica oleracea var botrytis Nutrition 0.000 description 2
- 240000003259 Brassica oleracea var. botrytis Species 0.000 description 2
- VTYYLEPIZMXCLO-UHFFFAOYSA-L Calcium carbonate Chemical compound [Ca+2].[O-]C([O-])=O VTYYLEPIZMXCLO-UHFFFAOYSA-L 0.000 description 2
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 2
- 235000009467 Carica papaya Nutrition 0.000 description 2
- 240000006432 Carica papaya Species 0.000 description 2
- 235000003255 Carthamus tinctorius Nutrition 0.000 description 2
- 244000020518 Carthamus tinctorius Species 0.000 description 2
- 244000060011 Cocos nucifera Species 0.000 description 2
- 235000013162 Cocos nucifera Nutrition 0.000 description 2
- 241000711573 Coronaviridae Species 0.000 description 2
- 229920000742 Cotton Polymers 0.000 description 2
- 241000699800 Cricetinae Species 0.000 description 2
- 108020004414 DNA Proteins 0.000 description 2
- 241000223924 Eimeria Species 0.000 description 2
- 102000005593 Endopeptidases Human genes 0.000 description 2
- 108010059378 Endopeptidases Proteins 0.000 description 2
- 241000224431 Entamoeba Species 0.000 description 2
- 102000003951 Erythropoietin Human genes 0.000 description 2
- 108090000394 Erythropoietin Proteins 0.000 description 2
- 241000588722 Escherichia Species 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- 108010051696 Growth Hormone Proteins 0.000 description 2
- 241000700586 Herpesviridae Species 0.000 description 2
- 241000228402 Histoplasma Species 0.000 description 2
- 102100037102 Homeobox protein MOX-2 Human genes 0.000 description 2
- 101710142888 Homeobox protein MOX-2 Proteins 0.000 description 2
- 102000014150 Interferons Human genes 0.000 description 2
- 108010050904 Interferons Proteins 0.000 description 2
- 102000015696 Interleukins Human genes 0.000 description 2
- 108010063738 Interleukins Proteins 0.000 description 2
- 239000004367 Lipase Substances 0.000 description 2
- 241000208467 Macadamia Species 0.000 description 2
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 2
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 2
- 241000186359 Mycobacterium Species 0.000 description 2
- 208000031888 Mycoses Diseases 0.000 description 2
- 240000007817 Olea europaea Species 0.000 description 2
- 108700022034 Opsonin Proteins Proteins 0.000 description 2
- 241000712464 Orthomyxoviridae Species 0.000 description 2
- 208000030852 Parasitic disease Diseases 0.000 description 2
- ZRWPUFFVAOMMNM-UHFFFAOYSA-N Patulin Chemical compound OC1OCC=C2OC(=O)C=C12 ZRWPUFFVAOMMNM-UHFFFAOYSA-N 0.000 description 2
- 206010057249 Phagocytosis Diseases 0.000 description 2
- 241000235648 Pichia Species 0.000 description 2
- 241000709664 Picornaviridae Species 0.000 description 2
- 235000010582 Pisum sativum Nutrition 0.000 description 2
- 240000004713 Pisum sativum Species 0.000 description 2
- 241000700625 Poxviridae Species 0.000 description 2
- 241000702247 Reoviridae Species 0.000 description 2
- 201000000582 Retinoblastoma Diseases 0.000 description 2
- 241000712907 Retroviridae Species 0.000 description 2
- 241000235070 Saccharomyces Species 0.000 description 2
- 241000607142 Salmonella Species 0.000 description 2
- 235000007238 Secale cereale Nutrition 0.000 description 2
- CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 102100038803 Somatotropin Human genes 0.000 description 2
- 229920002472 Starch Polymers 0.000 description 2
- 208000037065 Subacute sclerosing leukoencephalitis Diseases 0.000 description 2
- 206010042297 Subacute sclerosing panencephalitis Diseases 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- 235000021536 Sugar beet Nutrition 0.000 description 2
- 210000001744 T-lymphocyte Anatomy 0.000 description 2
- 244000299461 Theobroma cacao Species 0.000 description 2
- 235000009470 Theobroma cacao Nutrition 0.000 description 2
- 241000224526 Trichomonas Species 0.000 description 2
- 241000223104 Trypanosoma Species 0.000 description 2
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 2
- 108700010877 adenoviridae proteins Proteins 0.000 description 2
- 235000010443 alginic acid Nutrition 0.000 description 2
- 229920000615 alginic acid Polymers 0.000 description 2
- 235000012538 ammonium bicarbonate Nutrition 0.000 description 2
- 239000001099 ammonium carbonate Substances 0.000 description 2
- 238000012197 amplification kit Methods 0.000 description 2
- 230000036783 anaphylactic response Effects 0.000 description 2
- 208000003455 anaphylaxis Diseases 0.000 description 2
- 230000005875 antibody response Effects 0.000 description 2
- 208000022362 bacterial infectious disease Diseases 0.000 description 2
- 244000052616 bacterial pathogen Species 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 239000004202 carbamide Substances 0.000 description 2
- 238000000576 coating method Methods 0.000 description 2
- 238000001360 collision-induced dissociation Methods 0.000 description 2
- 238000001977 collision-induced dissociation tandem mass spectrometry Methods 0.000 description 2
- 235000005822 corn Nutrition 0.000 description 2
- 231100000433 cytotoxic Toxicity 0.000 description 2
- 230000001472 cytotoxic effect Effects 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 239000002552 dosage form Substances 0.000 description 2
- 238000001035 drying Methods 0.000 description 2
- 229940105423 erythropoietin Drugs 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 239000000945 filler Substances 0.000 description 2
- 235000004426 flaxseed Nutrition 0.000 description 2
- 244000053095 fungal pathogen Species 0.000 description 2
- 230000035784 germination Effects 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 239000008187 granular material Substances 0.000 description 2
- 239000000122 growth hormone Substances 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 238000002649 immunization Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 229940079322 interferon Drugs 0.000 description 2
- 229940047122 interleukins Drugs 0.000 description 2
- 239000008101 lactose Substances 0.000 description 2
- 238000011031 large-scale manufacturing process Methods 0.000 description 2
- 235000019421 lipase Nutrition 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 102100039604 mRNA guanylyltransferase Human genes 0.000 description 2
- 108010026228 mRNA guanylyltransferase Proteins 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 230000001662 opsonic effect Effects 0.000 description 2
- 230000003071 parasitic effect Effects 0.000 description 2
- 230000001717 pathogenic effect Effects 0.000 description 2
- 210000001986 peyer's patch Anatomy 0.000 description 2
- 230000008782 phagocytosis Effects 0.000 description 2
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 2
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 2
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 2
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 125000001436 propyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])[H] 0.000 description 2
- 230000002797 proteolythic effect Effects 0.000 description 2
- 238000003908 quality control method Methods 0.000 description 2
- YGSDEFSMJLZEOE-UHFFFAOYSA-N salicylic acid Chemical compound OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 description 2
- 230000021217 seedling development Effects 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 239000003381 stabilizer Substances 0.000 description 2
- 235000019698 starch Nutrition 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 229960000814 tetanus toxoid Drugs 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 150000003626 triacylglycerols Chemical class 0.000 description 2
- DCXXMTOCNZCJGO-UHFFFAOYSA-N tristearoylglycerol Chemical compound CCCCCCCCCCCCCCCCCC(=O)OCC(OC(=O)CCCCCCCCCCCCCCCCC)COC(=O)CCCCCCCCCCCCCCCCC DCXXMTOCNZCJGO-UHFFFAOYSA-N 0.000 description 2
- 235000013311 vegetables Nutrition 0.000 description 2
- -1 phosphinotricin Chemical compound 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- 102100034689 2-hydroxyacylsphingosine 1-beta-galactosyltransferase Human genes 0.000 description 1
- CAAMSDWKXXPUJR-UHFFFAOYSA-N 3,5-dihydro-4H-imidazol-4-one Chemical class O=C1CNC=N1 CAAMSDWKXXPUJR-UHFFFAOYSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- 241001133760 Acoelorraphe Species 0.000 description 1
- 241000186046 Actinomyces Species 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 108010038310 Adenomatous polyposis coli protein Proteins 0.000 description 1
- 241000607534 Aeromonas Species 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 239000005995 Aluminium silicate Substances 0.000 description 1
- 102000013142 Amylases Human genes 0.000 description 1
- 108010065511 Amylases Proteins 0.000 description 1
- 235000001274 Anacardium occidentale Nutrition 0.000 description 1
- 235000007119 Ananas comosus Nutrition 0.000 description 1
- 101100437168 Arabidopsis thaliana A3G2XYLT gene Proteins 0.000 description 1
- 101100208517 Arabidopsis thaliana UGT71B1 gene Proteins 0.000 description 1
- 101100208518 Arabidopsis thaliana UGT71B2 gene Proteins 0.000 description 1
- 101100208519 Arabidopsis thaliana UGT71B5 gene Proteins 0.000 description 1
- 101100208520 Arabidopsis thaliana UGT71B6 gene Proteins 0.000 description 1
- 101100208521 Arabidopsis thaliana UGT71B7 gene Proteins 0.000 description 1
- 101100208522 Arabidopsis thaliana UGT71B8 gene Proteins 0.000 description 1
- 101100208523 Arabidopsis thaliana UGT71C1 gene Proteins 0.000 description 1
- 101100208524 Arabidopsis thaliana UGT71C2 gene Proteins 0.000 description 1
- 101100208525 Arabidopsis thaliana UGT71C3 gene Proteins 0.000 description 1
- 101100208526 Arabidopsis thaliana UGT71C4 gene Proteins 0.000 description 1
- 101100208527 Arabidopsis thaliana UGT71C5 gene Proteins 0.000 description 1
- 101100208528 Arabidopsis thaliana UGT71D1 gene Proteins 0.000 description 1
- 101100208529 Arabidopsis thaliana UGT71D2 gene Proteins 0.000 description 1
- 101100155005 Arabidopsis thaliana UGT72B1 gene Proteins 0.000 description 1
- 101100155006 Arabidopsis thaliana UGT72B2 gene Proteins 0.000 description 1
- 101100155007 Arabidopsis thaliana UGT72B3 gene Proteins 0.000 description 1
- 101100155008 Arabidopsis thaliana UGT72C1 gene Proteins 0.000 description 1
- 101100155009 Arabidopsis thaliana UGT72D1 gene Proteins 0.000 description 1
- 101100155010 Arabidopsis thaliana UGT72E1 gene Proteins 0.000 description 1
- 101100155011 Arabidopsis thaliana UGT72E2 gene Proteins 0.000 description 1
- 101100155012 Arabidopsis thaliana UGT72E3 gene Proteins 0.000 description 1
- 101100483362 Arabidopsis thaliana UGT73B1 gene Proteins 0.000 description 1
- 101100483363 Arabidopsis thaliana UGT73B2 gene Proteins 0.000 description 1
- 101100483364 Arabidopsis thaliana UGT73B3 gene Proteins 0.000 description 1
- 101100483365 Arabidopsis thaliana UGT73B4 gene Proteins 0.000 description 1
- 101100483366 Arabidopsis thaliana UGT73B5 gene Proteins 0.000 description 1
- 101100483367 Arabidopsis thaliana UGT73C1 gene Proteins 0.000 description 1
- 101100483368 Arabidopsis thaliana UGT73C2 gene Proteins 0.000 description 1
- 101100483369 Arabidopsis thaliana UGT73C3 gene Proteins 0.000 description 1
- 101100483372 Arabidopsis thaliana UGT73C5 gene Proteins 0.000 description 1
- 101100483373 Arabidopsis thaliana UGT73C6 gene Proteins 0.000 description 1
- 101100483374 Arabidopsis thaliana UGT73C7 gene Proteins 0.000 description 1
- 101100483375 Arabidopsis thaliana UGT73D1 gene Proteins 0.000 description 1
- 101100483378 Arabidopsis thaliana UGT74B1 gene Proteins 0.000 description 1
- 101100427134 Arabidopsis thaliana UGT74C1 gene Proteins 0.000 description 1
- 101100427135 Arabidopsis thaliana UGT74D1 gene Proteins 0.000 description 1
- 101100427136 Arabidopsis thaliana UGT74E1 gene Proteins 0.000 description 1
- 101100427137 Arabidopsis thaliana UGT74E2 gene Proteins 0.000 description 1
- 101100427138 Arabidopsis thaliana UGT74F1 gene Proteins 0.000 description 1
- 101100427139 Arabidopsis thaliana UGT74F2 gene Proteins 0.000 description 1
- 101100427141 Arabidopsis thaliana UGT75B1 gene Proteins 0.000 description 1
- 101100427142 Arabidopsis thaliana UGT75B2 gene Proteins 0.000 description 1
- 101100427145 Arabidopsis thaliana UGT75D1 gene Proteins 0.000 description 1
- 101100427146 Arabidopsis thaliana UGT76B1 gene Proteins 0.000 description 1
- 101100427147 Arabidopsis thaliana UGT76C1 gene Proteins 0.000 description 1
- 101100427148 Arabidopsis thaliana UGT76C2 gene Proteins 0.000 description 1
- 101100427149 Arabidopsis thaliana UGT76C3 gene Proteins 0.000 description 1
- 101100427150 Arabidopsis thaliana UGT76C4 gene Proteins 0.000 description 1
- 101100427151 Arabidopsis thaliana UGT76C5 gene Proteins 0.000 description 1
- 101100427152 Arabidopsis thaliana UGT76D1 gene Proteins 0.000 description 1
- 101100539112 Arabidopsis thaliana UGT76E11 gene Proteins 0.000 description 1
- 101100048040 Arabidopsis thaliana UGT76E12 gene Proteins 0.000 description 1
- 101100262407 Arabidopsis thaliana UGT76E2 gene Proteins 0.000 description 1
- 101100262408 Arabidopsis thaliana UGT76E3 gene Proteins 0.000 description 1
- 101100262409 Arabidopsis thaliana UGT76E4 gene Proteins 0.000 description 1
- 101100262410 Arabidopsis thaliana UGT76E5 gene Proteins 0.000 description 1
- 101100262411 Arabidopsis thaliana UGT76E6 gene Proteins 0.000 description 1
- 101100262412 Arabidopsis thaliana UGT76E7 gene Proteins 0.000 description 1
- 101100262413 Arabidopsis thaliana UGT76E9 gene Proteins 0.000 description 1
- 101100262415 Arabidopsis thaliana UGT76F2 gene Proteins 0.000 description 1
- 101100262419 Arabidopsis thaliana UGT78D1 gene Proteins 0.000 description 1
- 101100262420 Arabidopsis thaliana UGT78D2 gene Proteins 0.000 description 1
- 101100262421 Arabidopsis thaliana UGT78D3 gene Proteins 0.000 description 1
- 101100539110 Arabidopsis thaliana UGT79B10 gene Proteins 0.000 description 1
- 101100539111 Arabidopsis thaliana UGT79B11 gene Proteins 0.000 description 1
- 101100539096 Arabidopsis thaliana UGT79B2 gene Proteins 0.000 description 1
- 101100539097 Arabidopsis thaliana UGT79B3 gene Proteins 0.000 description 1
- 101100539098 Arabidopsis thaliana UGT79B4 gene Proteins 0.000 description 1
- 101100539099 Arabidopsis thaliana UGT79B5 gene Proteins 0.000 description 1
- 101100539100 Arabidopsis thaliana UGT79B6 gene Proteins 0.000 description 1
- 101100539101 Arabidopsis thaliana UGT79B7 gene Proteins 0.000 description 1
- 101100539102 Arabidopsis thaliana UGT79B8 gene Proteins 0.000 description 1
- 101100539103 Arabidopsis thaliana UGT79B9 gene Proteins 0.000 description 1
- 101100048043 Arabidopsis thaliana UGT82A1 gene Proteins 0.000 description 1
- 101100048044 Arabidopsis thaliana UGT83A1 gene Proteins 0.000 description 1
- 101100048045 Arabidopsis thaliana UGT84A1 gene Proteins 0.000 description 1
- 101100048046 Arabidopsis thaliana UGT84A2 gene Proteins 0.000 description 1
- 101100048048 Arabidopsis thaliana UGT84A4 gene Proteins 0.000 description 1
- 101100048049 Arabidopsis thaliana UGT84B1 gene Proteins 0.000 description 1
- 101100048050 Arabidopsis thaliana UGT84B2 gene Proteins 0.000 description 1
- 101100048051 Arabidopsis thaliana UGT85A1 gene Proteins 0.000 description 1
- 101100048052 Arabidopsis thaliana UGT85A2 gene Proteins 0.000 description 1
- 101100048053 Arabidopsis thaliana UGT85A3 gene Proteins 0.000 description 1
- 101100048054 Arabidopsis thaliana UGT85A4 gene Proteins 0.000 description 1
- 101100048055 Arabidopsis thaliana UGT85A5 gene Proteins 0.000 description 1
- 101100048056 Arabidopsis thaliana UGT85A7 gene Proteins 0.000 description 1
- 101100371447 Arabidopsis thaliana UGT86A1 gene Proteins 0.000 description 1
- 101100371448 Arabidopsis thaliana UGT86A2 gene Proteins 0.000 description 1
- 101100371449 Arabidopsis thaliana UGT87A1 gene Proteins 0.000 description 1
- 101100371450 Arabidopsis thaliana UGT87A2 gene Proteins 0.000 description 1
- 101100371451 Arabidopsis thaliana UGT88A1 gene Proteins 0.000 description 1
- 101100371458 Arabidopsis thaliana UGT89A2 gene Proteins 0.000 description 1
- 101100371459 Arabidopsis thaliana UGT89B1 gene Proteins 0.000 description 1
- 101100371461 Arabidopsis thaliana UGT89C1 gene Proteins 0.000 description 1
- 101100371462 Arabidopsis thaliana UGT90A1 gene Proteins 0.000 description 1
- 101100371463 Arabidopsis thaliana UGT90A2 gene Proteins 0.000 description 1
- 101100371465 Arabidopsis thaliana UGT91A1 gene Proteins 0.000 description 1
- 101100101353 Arabidopsis thaliana UGT91B1 gene Proteins 0.000 description 1
- 101100101354 Arabidopsis thaliana UGT91C1 gene Proteins 0.000 description 1
- 101100101359 Arabidopsis thaliana UGT92A1 gene Proteins 0.000 description 1
- 235000003911 Arachis Nutrition 0.000 description 1
- 241000239290 Araneae Species 0.000 description 1
- 241000712892 Arenaviridae Species 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 235000007319 Avena orientalis Nutrition 0.000 description 1
- 244000075850 Avena orientalis Species 0.000 description 1
- 102100035526 B melanoma antigen 1 Human genes 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000606125 Bacteroides Species 0.000 description 1
- KHBQMWCZKVMBLN-UHFFFAOYSA-N Benzenesulfonamide Chemical compound NS(=O)(=O)C1=CC=CC=C1 KHBQMWCZKVMBLN-UHFFFAOYSA-N 0.000 description 1
- 235000021533 Beta vulgaris Nutrition 0.000 description 1
- 241000335053 Beta vulgaris Species 0.000 description 1
- 241000335423 Blastomyces Species 0.000 description 1
- 102000015081 Blood Coagulation Factors Human genes 0.000 description 1
- 108010039209 Blood Coagulation Factors Proteins 0.000 description 1
- 241000588807 Bordetella Species 0.000 description 1
- 241000589969 Borreliella burgdorferi Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 description 1
- 235000011293 Brassica napus Nutrition 0.000 description 1
- 240000000385 Brassica napus var. napus Species 0.000 description 1
- 240000007124 Brassica oleracea Species 0.000 description 1
- 235000003899 Brassica oleracea var acephala Nutrition 0.000 description 1
- 235000011301 Brassica oleracea var capitata Nutrition 0.000 description 1
- 235000017647 Brassica oleracea var italica Nutrition 0.000 description 1
- 235000001169 Brassica oleracea var oleracea Nutrition 0.000 description 1
- 240000008100 Brassica rapa Species 0.000 description 1
- 235000011292 Brassica rapa Nutrition 0.000 description 1
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 description 1
- 235000004936 Bromus mango Nutrition 0.000 description 1
- 241000589562 Brucella Species 0.000 description 1
- 241000714198 Caliciviridae Species 0.000 description 1
- 240000001548 Camellia japonica Species 0.000 description 1
- 244000045232 Canavalia ensiformis Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 235000002566 Capsicum Nutrition 0.000 description 1
- 229920002134 Carboxymethyl cellulose Polymers 0.000 description 1
- 235000013912 Ceratonia siliqua Nutrition 0.000 description 1
- 240000008886 Ceratonia siliqua Species 0.000 description 1
- 241000606161 Chlamydia Species 0.000 description 1
- 239000005496 Chlorsulfuron Substances 0.000 description 1
- 235000010523 Cicer arietinum Nutrition 0.000 description 1
- 244000045195 Cicer arietinum Species 0.000 description 1
- 240000006740 Cichorium endivia Species 0.000 description 1
- 241000193403 Clostridium Species 0.000 description 1
- 241000252203 Clupea harengus Species 0.000 description 1
- 241000737241 Cocos Species 0.000 description 1
- 206010009944 Colon cancer Diseases 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 241001337994 Cryptococcus <scale insect> Species 0.000 description 1
- 244000007835 Cyamopsis tetragonoloba Species 0.000 description 1
- 235000002767 Daucus carota Nutrition 0.000 description 1
- 244000000626 Daucus carota Species 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 102100039371 ER lumen protein-retaining receptor 1 Human genes 0.000 description 1
- 241001115402 Ebolavirus Species 0.000 description 1
- 241000588877 Eikenella Species 0.000 description 1
- 241000194032 Enterococcus faecalis Species 0.000 description 1
- 241000709661 Enterovirus Species 0.000 description 1
- 241000991587 Enterovirus C Species 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 241000186811 Erysipelothrix Species 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 108050000194 Expansin Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 241000218218 Ficus <angiosperm> Species 0.000 description 1
- 239000004606 Fillers/Extenders Substances 0.000 description 1
- 241000711950 Filoviridae Species 0.000 description 1
- 241000589565 Flavobacterium Species 0.000 description 1
- 235000016623 Fragaria vesca Nutrition 0.000 description 1
- 240000009088 Fragaria x ananassa Species 0.000 description 1
- 235000011363 Fragaria x ananassa Nutrition 0.000 description 1
- 241000605909 Fusobacterium Species 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- 241000206672 Gelidium Species 0.000 description 1
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 1
- 229930182566 Gentamicin Natural products 0.000 description 1
- 239000005562 Glyphosate Substances 0.000 description 1
- 235000009432 Gossypium hirsutum Nutrition 0.000 description 1
- 241000606768 Haemophilus influenzae Species 0.000 description 1
- 241000590002 Helicobacter pylori Species 0.000 description 1
- 241000700739 Hepadnaviridae Species 0.000 description 1
- 208000005176 Hepatitis C Diseases 0.000 description 1
- 239000004705 High-molecular-weight polyethylene Substances 0.000 description 1
- 101000946034 Homo sapiens 2-hydroxyacylsphingosine 1-beta-galactosyltransferase Proteins 0.000 description 1
- 101000874316 Homo sapiens B melanoma antigen 1 Proteins 0.000 description 1
- 101000812437 Homo sapiens ER lumen protein-retaining receptor 1 Proteins 0.000 description 1
- 241000598436 Human T-cell lymphotropic virus Species 0.000 description 1
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 1
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 1
- 241000701806 Human papillomavirus Species 0.000 description 1
- 241000341655 Human papillomavirus type 16 Species 0.000 description 1
- 241000709704 Human poliovirus 2 Species 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- 244000017020 Ipomoea batatas Species 0.000 description 1
- 235000002678 Ipomoea batatas Nutrition 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- 235000003228 Lactuca sativa Nutrition 0.000 description 1
- 240000008415 Lactuca sativa Species 0.000 description 1
- 241000589248 Legionella Species 0.000 description 1
- 208000007764 Legionnaires' Disease Diseases 0.000 description 1
- 241000222722 Leishmania <genus> Species 0.000 description 1
- 240000004322 Lens culinaris Species 0.000 description 1
- 235000014647 Lens culinaris subsp culinaris Nutrition 0.000 description 1
- 102000016267 Leptin Human genes 0.000 description 1
- 108010092277 Leptin Proteins 0.000 description 1
- 241000589902 Leptospira Species 0.000 description 1
- 240000007472 Leucaena leucocephala Species 0.000 description 1
- 235000010643 Leucaena leucocephala Nutrition 0.000 description 1
- 241000186781 Listeria Species 0.000 description 1
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 1
- 235000014826 Mangifera indica Nutrition 0.000 description 1
- 240000007228 Mangifera indica Species 0.000 description 1
- 235000004456 Manihot esculenta Nutrition 0.000 description 1
- 241000204801 Muraenidae Species 0.000 description 1
- 241000234295 Musa Species 0.000 description 1
- 240000005561 Musa balbisiana Species 0.000 description 1
- 235000018290 Musa x paradisiaca Nutrition 0.000 description 1
- 241000204031 Mycoplasma Species 0.000 description 1
- 208000002454 Nasopharyngeal Carcinoma Diseases 0.000 description 1
- 206010061306 Nasopharyngeal cancer Diseases 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 241000588652 Neisseria gonorrhoeae Species 0.000 description 1
- 241000588650 Neisseria meningitidis Species 0.000 description 1
- 241000187654 Nocardia Species 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 101150101077 OBL1 gene Proteins 0.000 description 1
- 235000002725 Olea europaea Nutrition 0.000 description 1
- 101710089395 Oleosin Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241000711504 Paramyxoviridae Species 0.000 description 1
- 241000701945 Parvoviridae Species 0.000 description 1
- 241000606860 Pasteurella Species 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 239000006002 Pepper Substances 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 241000150350 Peribunyaviridae Species 0.000 description 1
- 241000009328 Perro Species 0.000 description 1
- 244000025272 Persea americana Species 0.000 description 1
- 235000008673 Persea americana Nutrition 0.000 description 1
- 244000264897 Persea americana var. americana Species 0.000 description 1
- 235000010617 Phaseolus lunatus Nutrition 0.000 description 1
- 241000235400 Phycomyces Species 0.000 description 1
- 235000016761 Piper aduncum Nutrition 0.000 description 1
- 240000003889 Piper guineense Species 0.000 description 1
- 235000017804 Piper guineense Nutrition 0.000 description 1
- 235000008184 Piper nigrum Nutrition 0.000 description 1
- 241000224016 Plasmodium Species 0.000 description 1
- 102100024819 Prolactin Human genes 0.000 description 1
- 108010057464 Prolactin Proteins 0.000 description 1
- 241000588769 Proteus <enterobacteria> Species 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 101000762949 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) Exotoxin A Proteins 0.000 description 1
- 241000508269 Psidium Species 0.000 description 1
- 240000001679 Psidium guajava Species 0.000 description 1
- 235000013929 Psidium pyriferum Nutrition 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 241000711931 Rhabdoviridae Species 0.000 description 1
- 108010039491 Ricin Proteins 0.000 description 1
- 241000606701 Rickettsia Species 0.000 description 1
- 241000242678 Schistosoma Species 0.000 description 1
- 241000605036 Selenomonas Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000607768 Shigella Species 0.000 description 1
- 241000607762 Shigella flexneri Species 0.000 description 1
- 240000003768 Solanum lycopersicum Species 0.000 description 1
- 235000007230 Sorghum bicolor Nutrition 0.000 description 1
- 244000062793 Sorghum vulgare Species 0.000 description 1
- 241000256248 Spodoptera Species 0.000 description 1
- 235000009184 Spondias indica Nutrition 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 241000191967 Staphylococcus aureus Species 0.000 description 1
- 241000191963 Staphylococcus epidermidis Species 0.000 description 1
- SSZBUIDZHHWXNJ-UHFFFAOYSA-N Stearinsaeure-hexadecylester Natural products CCCCCCCCCCCCCCCCCC(=O)OCCCCCCCCCCCCCCCC SSZBUIDZHHWXNJ-UHFFFAOYSA-N 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- 241001122767 Theaceae Species 0.000 description 1
- RUJBDQSFYCKFAA-UHFFFAOYSA-N Tofisopam Chemical compound N=1N=C(C)C(CC)C2=CC(OC)=C(OC)C=C2C=1C1=CC=C(OC)C(OC)=C1 RUJBDQSFYCKFAA-UHFFFAOYSA-N 0.000 description 1
- 241000710924 Togaviridae Species 0.000 description 1
- 241000589886 Treponema Species 0.000 description 1
- 235000001484 Trigonella foenum graecum Nutrition 0.000 description 1
- 244000250129 Trigonella foenum graecum Species 0.000 description 1
- YZCKVEUIGOORGS-NJFSPNSNSA-N Tritium Chemical compound [3H] YZCKVEUIGOORGS-NJFSPNSNSA-N 0.000 description 1
- 101150114272 UGT73C4 gene Proteins 0.000 description 1
- 101150074360 UGT75C1 gene Proteins 0.000 description 1
- 101150040134 UGT76E1 gene Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 241000219977 Vigna Species 0.000 description 1
- 240000004922 Vigna radiata Species 0.000 description 1
- 235000010721 Vigna radiata var radiata Nutrition 0.000 description 1
- 235000011469 Vigna radiata var sublobata Nutrition 0.000 description 1
- 235000010726 Vigna sinensis Nutrition 0.000 description 1
- 235000007244 Zea mays Nutrition 0.000 description 1
- 239000002250 absorbent Substances 0.000 description 1
- 230000002745 absorbent Effects 0.000 description 1
- 239000003655 absorption accelerator Substances 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 235000010419 agar Nutrition 0.000 description 1
- 239000000783 alginic acid Substances 0.000 description 1
- 229960001126 alginic acid Drugs 0.000 description 1
- 150000004781 alginic acids Chemical class 0.000 description 1
- 235000020224 almond Nutrition 0.000 description 1
- 235000012211 aluminium silicate Nutrition 0.000 description 1
- 150000001413 amino acids Chemical group 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 235000019418 amylase Nutrition 0.000 description 1
- 229940025131 amylases Drugs 0.000 description 1
- 229940030225 antihemorrhagics Drugs 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 239000000440 bentonite Substances 0.000 description 1
- 229910000278 bentonite Inorganic materials 0.000 description 1
- SVPXDRXYRYOSEX-UHFFFAOYSA-N bentoquatam Chemical compound O.O=[Si]=O.O=[Al]O[Al]=O SVPXDRXYRYOSEX-UHFFFAOYSA-N 0.000 description 1
- ZYGHJZDHTFUPRJ-UHFFFAOYSA-N benzo-alpha-pyrone Natural products C1=CC=C2OC(=O)C=CC2=C1 ZYGHJZDHTFUPRJ-UHFFFAOYSA-N 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 239000003114 blood coagulation factor Substances 0.000 description 1
- 239000012888 bovine serum Substances 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 239000006172 buffering agent Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 229910000019 calcium carbonate Inorganic materials 0.000 description 1
- 235000010216 calcium carbonate Nutrition 0.000 description 1
- CJZGTCYPCWQAJB-UHFFFAOYSA-L calcium stearate Chemical compound [Ca+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O CJZGTCYPCWQAJB-UHFFFAOYSA-L 0.000 description 1
- 239000008116 calcium stearate Substances 0.000 description 1
- 235000013539 calcium stearate Nutrition 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 235000010948 carboxy methyl cellulose Nutrition 0.000 description 1
- 239000001768 carboxy methyl cellulose Substances 0.000 description 1
- 239000008112 carboxymethyl-cellulose Substances 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 235000020226 cashew nut Nutrition 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 229960000541 cetyl alcohol Drugs 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 235000003733 chicria Nutrition 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- VJYIFXVZLXQVHO-UHFFFAOYSA-N chlorsulfuron Chemical compound COC1=NC(C)=NC(NC(=O)NS(=O)(=O)C=2C(=CC=CC=2)Cl)=N1 VJYIFXVZLXQVHO-UHFFFAOYSA-N 0.000 description 1
- 239000004927 clay Substances 0.000 description 1
- 235000018597 common camellia Nutrition 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 230000008602 contraction Effects 0.000 description 1
- NKLPQNGYXWVELD-UHFFFAOYSA-M coomassie brilliant blue Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=2C=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=C1 NKLPQNGYXWVELD-UHFFFAOYSA-M 0.000 description 1
- 235000001671 coumarin Nutrition 0.000 description 1
- 150000004775 coumarins Chemical class 0.000 description 1
- 244000038559 crop plants Species 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 239000002875 cyclin dependent kinase inhibitor Substances 0.000 description 1
- 229940043378 cyclin-dependent kinase inhibitor Drugs 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 210000000172 cytosol Anatomy 0.000 description 1
- 239000000824 cytostatic agent Substances 0.000 description 1
- 230000001085 cytostatic effect Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000003795 desorption Methods 0.000 description 1
- IJKVHSBPTUYDLN-UHFFFAOYSA-N dihydroxy(oxo)silane Chemical compound O[Si](O)=O IJKVHSBPTUYDLN-UHFFFAOYSA-N 0.000 description 1
- 239000008298 dragée Substances 0.000 description 1
- 235000005489 dwarf bean Nutrition 0.000 description 1
- 244000013123 dwarf bean Species 0.000 description 1
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000002702 enteric coating Substances 0.000 description 1
- 238000009505 enteric coating Methods 0.000 description 1
- 244000000021 enteric pathogen Species 0.000 description 1
- 229940032049 enterococcus faecalis Drugs 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 229930003935 flavonoid Natural products 0.000 description 1
- 150000002215 flavonoids Chemical class 0.000 description 1
- 235000017173 flavonoids Nutrition 0.000 description 1
- 230000037406 food intake Effects 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 210000001035 gastrointestinal tract Anatomy 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 239000007903 gelatin capsule Substances 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 239000003292 glue Substances 0.000 description 1
- YQEMORVAKMFKLG-UHFFFAOYSA-N glycerine monostearate Natural products CCCCCCCCCCCCCCCCCC(=O)OC(CO)CO YQEMORVAKMFKLG-UHFFFAOYSA-N 0.000 description 1
- SVUQHVRAGMNPLW-UHFFFAOYSA-N glycerol monostearate Natural products CCCCCCCCCCCCCCCCC(=O)OCC(O)CO SVUQHVRAGMNPLW-UHFFFAOYSA-N 0.000 description 1
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 1
- 229940097068 glyphosate Drugs 0.000 description 1
- 229940047650 haemophilus influenzae Drugs 0.000 description 1
- 230000000025 haemostatic effect Effects 0.000 description 1
- 229940037467 helicobacter pylori Drugs 0.000 description 1
- 210000002443 helper t lymphocyte Anatomy 0.000 description 1
- 208000002672 hepatitis B Diseases 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- 235000019514 herring Nutrition 0.000 description 1
- BXWNKGSJHAJOGX-UHFFFAOYSA-N hexadecan-1-ol Chemical compound CCCCCCCCCCCCCCCCO BXWNKGSJHAJOGX-UHFFFAOYSA-N 0.000 description 1
- 239000003906 humectant Substances 0.000 description 1
- 210000002865 immune cell Anatomy 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 238000012994 industrial processing Methods 0.000 description 1
- 239000012678 infectious agent Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- NLYAJNPCOHFWQQ-UHFFFAOYSA-N kaolin Chemical compound O.O.O=[Al]O[Si](=O)O[Si](=O)O[Al]=O NLYAJNPCOHFWQQ-UHFFFAOYSA-N 0.000 description 1
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 1
- 229940039781 leptin Drugs 0.000 description 1
- NRYBAZVQPHGZNS-ZSOCWYAHSA-N leptin Chemical compound O=C([C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)CCSC)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CS)C(O)=O NRYBAZVQPHGZNS-ZSOCWYAHSA-N 0.000 description 1
- 208000032839 leukemia Diseases 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 210000003041 ligament Anatomy 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 210000002540 macrophage Anatomy 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 238000001819 mass spectrum Methods 0.000 description 1
- 238000001869 matrix assisted laser desorption--ionisation mass spectrum Methods 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 210000005060 membrane bound organelle Anatomy 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 238000012737 microarray-based gene expression Methods 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 235000019713 millet Nutrition 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000012243 multiplex automated genomic engineering Methods 0.000 description 1
- HEGSGKPQLMEBJL-UHFFFAOYSA-N n-octyl beta-D-glucopyranoside Natural products CCCCCCCCOC1OC(CO)C(O)C(O)C1O HEGSGKPQLMEBJL-UHFFFAOYSA-N 0.000 description 1
- 201000011216 nasopharynx carcinoma Diseases 0.000 description 1
- 231100000956 nontoxicity Toxicity 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- HEGSGKPQLMEBJL-RKQHYHRCSA-N octyl beta-D-glucopyranoside Chemical compound CCCCCCCCO[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O HEGSGKPQLMEBJL-RKQHYHRCSA-N 0.000 description 1
- 229940049964 oleate Drugs 0.000 description 1
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- FJKROLUGYXJWQN-UHFFFAOYSA-N papa-hydroxy-benzoic acid Natural products OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 description 1
- 239000012188 paraffin wax Substances 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 235000020232 peanut Nutrition 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 239000003348 petrochemical agent Substances 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 150000002995 phenylpropanoid derivatives Chemical class 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 239000003375 plant hormone Substances 0.000 description 1
- 239000010773 plant oil Substances 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 235000012015 potatoes Nutrition 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 229940097325 prolactin Drugs 0.000 description 1
- 150000003856 quaternary ammonium compounds Chemical class 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 239000003340 retarding agent Substances 0.000 description 1
- 229960004889 salicylic acid Drugs 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 229930000044 secondary metabolite Natural products 0.000 description 1
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 1
- 238000007493 shaping process Methods 0.000 description 1
- 150000004760 silicates Chemical class 0.000 description 1
- 210000000813 small intestine Anatomy 0.000 description 1
- 229910000029 sodium carbonate Inorganic materials 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- AJPJDKMHJJGVTQ-UHFFFAOYSA-M sodium dihydrogen phosphate Chemical compound [Na+].OP(O)([O-])=O AJPJDKMHJJGVTQ-UHFFFAOYSA-M 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- 239000008247 solid mixture Substances 0.000 description 1
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 1
- 229960000268 spectinomycin Drugs 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000000454 talc Substances 0.000 description 1
- 229910052623 talc Inorganic materials 0.000 description 1
- 235000012222 talc Nutrition 0.000 description 1
- 210000002435 tendon Anatomy 0.000 description 1
- 229960002501 tofisopam Drugs 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 235000001019 trigonella foenum-graecum Nutrition 0.000 description 1
- 229910052722 tritium Inorganic materials 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241000712461 unidentified influenza virus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 230000002792 vascular Effects 0.000 description 1
- 239000010455 vermiculite Substances 0.000 description 1
- 229910052902 vermiculite Inorganic materials 0.000 description 1
- 235000019354 vermiculite Nutrition 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
- 239000001993 wax Substances 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
- 230000037314 wound repair Effects 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8257—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits for the production of primary gene products, e.g. pharmaceutical products, interferon
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/43504—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates
- C07K14/43563—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates from insects
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
- C12N15/625—DNA sequences coding for fusion proteins containing a sequence coding for a signal sequence
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
- C12N15/8222—Developmentally regulated expression systems, tissue, organ specific, temporal or spatial regulation
- C12N15/823—Reproductive tissue-specific promoters
- C12N15/8234—Seed-specific, e.g. embryo, endosperm
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/54—Medicinal preparations containing antigens or antibodies characterised by the route of administration
- A61K2039/541—Mucosal route
- A61K2039/542—Mucosal route oral/gastrointestinal
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/30—Non-immunoglobulin-derived peptide or protein having an immunoglobulin constant or Fc region, or a fragment thereof, attached thereto
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/50—Fusion polypeptide containing protease site
Definitions
- the invention relates to the manufacture of recombinant proteins; vectors for use in the recombinant manufacture of protein and recombinant proteins that are adapted for purification.
- recombinant proteins require a high standard of quality control since many of these proteins are used as pharmaceuticals, for example: growth hormone; leptin; erythropoietin; prolactin, interleukins; interferon's.
- vaccines particularly subunit vaccines, (vaccines based on a defined antigen, for example gpl20 of HIV)
- the production of recombinant protein in cell expression systems is based either on prokaryotic cell expression or eukaryotic cell expression. The latter is preferred when post-translation modifications to the protein are required.
- Eukaryotic systems include the use of mammalian cells, e.g. Chinese Hamster Ovary cells; insect cells e.g. Spodoptera spp; fungal/yeast (e.g. Saccharomyces spp, Pichia spp, Aspergillus spp) and plants.
- mammalian cells e.g. Chinese Hamster Ovary cells
- insect cells e.g. Spodoptera spp
- fungal/yeast e.g. Saccharomyces spp, Pichia spp, Aspergillus spp
- heterologous recombinant proteins in an expression system relates to a variety of proteins used in both industrial and medical applications.
- tumour suppressor polypeptides include tumour suppressor polypeptides (e.g. p53 polypeptide, the APC polypeptide, the DPC-4 polypeptide, the BRCA-1 polypeptide, the BRCA-2 polypeptide, the WT-1 polypeptide, the retinoblastoma polypeptide (Lee, et al. (1987) Nature 329:642), the MMAC-1 polypeptide, the adenomatous polyposis coli protein (United States Patent 5,783,666), the deleted in colon carcinoma (DCC) polypeptide, the MMSC-2 polypeptide, the NF-1 polypeptide, nasopharyngeal carcinoma tumour suppressor polypeptide (Cheng, et al. 1998.
- tumour suppressor polypeptides e.g. p53 polypeptide, the APC polypeptide, the DPC-4 polypeptide, the BRCA-1 polypeptide, the BRCA-2 polypeptide, the WT-1 polypeptide, the retinoblastom
- Therapeutic polypeptides also include "antigenic polypeptides" (e.g. tumour rejection antigens the MAGE, BAGE, GAGE and DAGE families of tumour rejection antigens, see Schulz et al Proc Natl Acad Sci USA, 1991, 88, pp991-993).
- Antigenic polypeptides also includes polypeptide antigens used in the preparation of vaccines which provide protection against infectious agents.
- viruses such as Human Immunodeficiency Virus (HIV1 & 2); Human T Cell Leukaemia Virus (HTLV 1 & 2); Ebola virus; Human Papilloma Virus (e.g. HPV-2, HPV-5, HPV-8 HPV-16, HPV-18, HPV-31, HPV-33, HPV-52, HPV-54 and HPV-56); papovavirus; rhinovirus; poliovirus; herpesvirus; adenovirus; Epstein Barr virus; influenza virus, hepatitis B and C viruses.
- HIV1 & 2 Human Immunodeficiency Virus
- HTLV 1 & 2 Human T Cell Leukaemia Virus
- Ebola virus Human Papilloma Virus
- HPV-2, HPV-5, HPV-8 HPV-16, HPV-18, HPV-31, HPV-33, HPV-52, HPV-54 and HPV-56 papovavirus
- rhinovirus poliovirus
- herpesvirus
- Antigens derived from pathogenic bacteria such as Staphylococcus aureus; Staphylococcus epidermidis; Enterococcus faecalis; Mycobacterium tuberculsis; Streptococcus group B; Streptoccocus pneumoniae; Helicobacter pylori; Neisseria gonorrhea; Streptococcus group A; Borrelia burgdorferi; Coccidiodes immitis; Histoplasma sapsulatum; Neisseria meningitidis ; Shigella flexneri; Escherichia coli; Haemophilus influenzae.
- Antigens derived from parasites such as Trypanosoma spp, Plasmodia spp, Schiztosoma spp; and pathogenic fungi such as Candida spp.
- subunit vaccines e.g. vaccines in which the immunogen is a purified protein
- the development of subunit vaccines has been the focus of considerable research in recent years and requires the isolation and purification of the candidate protein from a complex mixture of proteins.
- the emergence of new pathogens and the growth of antibiotic resistance have created a need to develop new vaccines and to identify further candidate molecules useful in the development of subunit vaccines.
- routes via which vaccines may be administered to an animal The traditional route is by transdermal injection of the vaccine containing an immunogen that invokes an immune response.
- An alternative route that is not typically undertaken is an orally administered vaccine.
- the immunogen contacts the immune system via the mammalian muscosal membrane which lines the small intestine through the activity of so called M cells.
- Peyers Patches The M cells overlie regions of the intestine referred to as Peyers Patches and function to sample antigens contained in ingested material and are therefore essential components in mediating mucosal immunity.
- Peyers Patches have been identified as major sites for the adherence and entry of enteric pathogens such as Escherichia, Salmonella, Yersina and Shigella.
- Plant derived vaccines have been developed which utilise plant oil bodies which have been engineered to present an antigen to which an immune response is desired, see WO01/95934. Plants that bear oilseeds such as soybean, rapeseed and sunflower store triglycerides in their seeds which provide an energy store for germination and subsequent seedling development. Further examples of this are described in WO93/20216; WO96/21029 and WO98/21115.
- Therapeutic polypeptides can be "cytotoxic polypeptides" (e.g. pseudomonas exotoxin, ricin toxin, diptheria toxin); or polypeptides which are "cytostatic polypeptides” (e.g. p21, the retinoblastoma polypeptide, the E2F-Rb polypeptide, cyclin dependent kinase inhibitors such as P 16, pi 5, pi 8 and pi 9, the growth arrest specific homeobox (GAX) polypeptide as described in Branellec, et al, see WO97/16459 and WO96/30385.
- cytotoxic polypeptides e.g. pseudomonas exotoxin, ricin toxin, diptheria toxin
- cytostatic polypeptides e.g. p21, the retinoblastoma polypeptide, the E2F-Rb polypeptide, cyclin dependent kinase inhibitors such as P 16, pi 5, pi 8 and
- therapeutic antibodies are monoclonal antibodies or at least the active binding fragments thereof.
- Monoclonal antibodies may be humanised or chimeric antibodies.
- a chimeric antibody is produced by recombinant methods to contain the variable region of an antibody with an invariant or constant region of a human antibody.
- a humanised antibody is produced by recombinant methods to combine the complementarity determining regions (CDRs) of an antibody with both the constant (C) regions and the framework regions from the variable (V) regions of a human antibody.
- oilseeds such as soybean, rapeseed and sunflower store triglycerides in their seeds which provide an energy store for germination and subsequent seedling development.
- the oil globules form oil bodies in the cytoplasm that are membrane bound organelles that have protein in the membrane surface.
- An example of a protein found associated with oil bodies is an oleosm.
- the oleosins are a family of proteins that have divergent amino and carboxyl termini with more highly conserved lipophilic central domain. These proteins are strongly associated with oil bodies and it is thought that the association is due in part to the lipophilic domain.
- OBL1 and OBL 2 from castor bean that have different lipid specificities.
- the sequence of OBL 1 and OBL 2 has revealed a lipophilic domain that lacks any homology to that found in the oleosin described in WO93/21320 and represents an alternative means to localise polypeptides which comprise this sequence to oil bodies and thereby facilitate their purification.
- nucleic acid molecule that encodes a chimeric polypeptide wherein said nucleic acid molecule comprises a first part selected from the group consisting of: i) a nucleic acid molecule consisting of a nucleic acid sequence which encodes an amino acid sequence as represented in Figure 3; ii) a nucleic acid molecule which hybridises to the nucleic acid molecule in (i) and which specifically targets said polypeptide to an oil body; and iii) a nucleic acid molecule which differs from the nucleic acid molecules of (i) and (ii) due to the degeneracy in the genetic code; and a second part that encodes an heterologous polypeptide.
- heterologous polypeptide is a polypeptide to which a targeting domain as herein disclosed has been fused which would not naturally include such a targeting domain.
- said targeting domain is encoded by a nucleic acid molecule comprising a nucleic acid sequence which hybridises under stringent hybridisation conditions to a nucleic acid molecule which encodes an amino acid sequence as represented in Figure 3.
- nucleic acid hybrids that are stable after washing in O.lx SSC, 0.1% SDS at 60°C. It is well known in the art that optimal hybridisation conditions can be calculated if the sequence of the nucleic acid is known.
- hybridisation conditions uses 4 - 6 x SSPE (20x SSPE contains 175.3g NaCl, 88.2g NaH 2 PO 4 H 2 O and 7.4g EDTA dissolved to 1 litre and the pH adjusted to 7.4); 5-10x Denhardts solution (50x Denhardts solution contains 5g Ficoll (type 400, Pharmacia), 5g polyvinylpyrrolidone and 5g bovine serum albumen; lOO ⁇ g-l.Omg ml sonicated salmon/herring DNA; 0.1- 1.0% sodium dodecyl sulphate; optionally 40-60% deionised formamide.
- Hybridisation temperature will vary depending on the GC content of the nucleic acid target sequence but will typically be between 42°- 65° C.
- said nucleic acid molecule comprises a nucleic acid sequence that has at least 30% homology to the nucleic acid sequence which encodes an amino acid sequence as represented in Figure 3.
- said homology is at least 40%; 50%; 60%; 70%; 80%; 90%; or at least 99% identity with the nucleic acid sequence that encodes the amino acid sequence as represented in Figure 3.
- a chimeric polypeptide encoded by a nucleic acid molecule according to the invention.
- Recombinant protein production relates to the synthesis of protein in expression systems (e.g. bacterial expression systems, yeast expression systems, insect expression systems, mammalian expression systems and plant expression systems).
- Heterologous polypeptides include commercially important polypeptides, for example enzymes used in biocatalysis (e.g. restriction enzymes, enzymes used in industrial processing; e.g. amylases, proteases, nucleases, lipases) and therapeutic polypeptides as herein disclosed.
- the large scale production of recombinant proteins requires a high standard of quality control since many of these proteins are used as pharmaceuticals, for example, interleukins, growth hormone, erythropoietin, interferon.
- vaccines particularly subunit vaccines, (vaccines based on a defined antigen, for example gpl20 of HIV)
- a defined antigen for example gpl20 of HIV
- said second part encodes an enzyme
- said second part encodes a pharmaceutical polypeptide.
- said second part encodes an antigenic polypeptide.
- said second part encodes an antigenic polypeptide associated with an infectious disease condition.
- said second part encodes a polypeptide associated with a viral disease condition.
- said viral disease is the result of an infection caused by a virus selected from the families consisting of: Picornaviridae;
- Caliciviridae Caliciviridae; Togaviridae; Flaviridae; Coronaviridae; Rhabdoviridae; Filoviridae;
- Paramyxoviridae Orthomyxoviridae; Bunyaviridae; Arenaviridae; Reoviridae; Bimavirdae; Retroviridae; Hepadnaviridae; Parvoviridae; Papoviridae; Adenoviridae;
- Herpesviridae and Poxviridae.
- said virus is selected from the families consisting of: Adenoviridae; Coronaviridae; Herpesviridae; Orthomyxoviridae; Picornaviridae; Poxviridae; Reoviridae; and Retroviridae.
- said second part encodes a polypeptide associated with a bacterial disease condition.
- said bacterial disease is caused by a bacterial pathogen selected from the group consisting of: Actinomyces; Aeromonas; Bacillus; Bacteroides; Bordetella; Brucella; Compylobacter; Capnocyclophaga; Chlamydia; Clostridium; Corynebacterium; Eikenella; Erysipelothrix; Escherichia; Fusobacterium; Hemophilus; Klebseilla; Legionella; Leptospira; Listeria; Mycobacterium; Mycoplasma; Neisseria; Nocardia; Pasteurella; Proteus; Pseudomonas; Rickettsia; Salmonella; Selenomonas; Shigelia; Staphylococcus; Streptococcus; Treponema; Vibro and Yersnia.
- a bacterial pathogen selected from the group consisting of: Actinomyces; Aeromonas; Bacillus; Bac
- said second part encodes a polypeptide associated with a fungal disease condition.
- said fungal disease condition is caused by a fungal pathogen selected from the group consisting of: Aspercillus; Blastomyces; Candida; Cryptococcus; Histoplasma; Phycomyces; and Saccharomyces.
- a fungal pathogen selected from the group consisting of: Aspercillus; Blastomyces; Candida; Cryptococcus; Histoplasma; Phycomyces; and Saccharomyces.
- said second part encodes a polypeptide associated with a parasitic disease condition.
- said parasitic disease condition is caused by a parasitic pathogen selected from the group consisting of: Trypanosoma; Leishmania; Schistosoma; Plasmodium; Entamoeba; Eimeria; and Trichomonas.
- a parasitic pathogen selected from the group consisting of: Trypanosoma; Leishmania; Schistosoma; Plasmodium; Entamoeba; Eimeria; and Trichomonas.
- said parasitic pathogen is selected from the group consisting of: Entamoeba; Eimeria; and Trichomonas.
- said second part encodes a polypeptide associated with cancer.
- said polypeptide is a tumour rejection antigen polypeptide.
- said second part encodes a polypeptide that is an adhesive polypeptide, preferably said adhesive polypeptide is a mussel adhesive polypeptide.
- Mussel-derived adhesives possess a variety of characteristics that make them of particular interest to material scientists.
- the adhesives display a high bonding strength, versatility to a variety of bonding surfaces, resistance and durability, non- toxicity, waterproof and cure without high temperatures properties, characteristics which give these adhesives a range of potential uses and which no man-made adhesive on the market currently possesses. These properties have led to predictions of wide-ranging applications in engineering and medicine. For example a requirement for a natural, strong and waterproof adhesive can be found in the marine industry, where it can be used in ship-building. Medical uses, where such an adhesive, with its further advantages of being neither cytotoxic nor antigenic could be used in orthopaedic and dental applications.
- nucleic acid molecule which encodes a polypeptide comprising the amino acid motif
- said adhesive polypeptide comprises the amino acid sequence motif: A-K-P-S-Y-P-P-T-Y-K.
- said adhesive polypeptide comprises of at least one motif which has the amino acid sequence A-K- P-S-Y-P-P-T-Y-K.
- said adhesive polypeptide consists of a plurality of said motifs linked in tandem.
- said adhesive polypeptide comprises at least 10 motifs, preferably greater than 10 motifs. More preferably still said adhesive polypeptide comprises at least 50 motifs, preferably between 50 and 100 motifs, more preferably between 60-85 motifs.
- said hydroxylation is to an amino acid at position 6 and/or 7 and/or 9 of at least one motif of the amino acid sequence, X-K-X-X-Y-P-X-X-Y-K. More preferably still said hydroxylation is to amino acid position 6 and/or 7 and or 9 of at least one motif of the amino acid sequence, A-K-P- S-Y-P-P-T-Y-K.
- said adhesive polypeptide further comprises a second motif wherein said second motif is represented by the amino acid sequence: X P X P X X X X Y K.
- said second motif is selected from the group consisting of at least one of P P K P T P P T Y K; S P P P P V Y K Y K; or S P P P P T P V Y K, or combinations thereof.
- nucleic acid molecule encodes a secretion signal, for example an expansin secretion signal.
- said secretion signal comprises the amino acid sequence: MAFSYSPFSSLFLLPFFFVFTFADYGGWQSGHATFYGGGDASGT, or functional variant thereof.
- said adhesive polypeptide comprises an endoplasmic reticulum targeting sequence, preferably said sequence is represented by the amino acid sequence HDEL, or functional variant thereof.
- said second part encodes a spider polypeptide, preferably a spider silk polypeptide.
- said spider silk polypeptide is encoded by a nucleic acid molecule comprising a nucleic acid sequence selected from the group consisting of: i) a nucleic acid sequence consisting of the sequence as represented in
- Figure 4a or 4b ii) a nucleic acid sequence which encodes a polypeptide domain comprising the amino acid sequence AGRGQGGYGQGAGG and at least two motifs rich in polyalanine wherein said polyalanine motifs comprise at least 6 alanine amino acid residues; iii) a nucleic acid sequence which hybridises to the sequence presented in
- said domain comprises a motif comprising a repetitive tripeptide sequence, GGX (glycine:glycine: X is any amino acid residue), and at least one SS(serine: serine)-polyalanine sequence.
- said second part encodes a polypeptide with glucosylfransferase activity.
- GTases are enzymes that post-translationally transfer glucosyl residues from an activated nucleotide sugar to monomeric and polymeric acceptor molecules such as other sugars, proteins, lipids and other organic substrates. These glucosylated molecules take part in diverse metabolic pathways and processes. The transfer of a glucosyl moiety can alter the acceptor's bioactivity, solubility and transport properties within the cell and throughout the plant.
- One family of GTases in higher plants is defined by the presence of a C-terminal consensus sequence.
- the GTases of this family function in the cytosol of plant cells and catalyse the transfer of glucose to small molecular weight substrates, such as phenylpropanoid derivatives, coumarins, flavonoids, other secondary metabolites and molecules known to act as plant hormones.
- Other GTases transfers glucosyl residues to peptides and polypeptides.
- said glucosylfransferase is encoded by a nucleic acid molecule selected from the group consisting of: i) a nucleic acid molecule selected from the group consisting of nucleic acid sequences as represented in Table 1; ii) a nucleic acid molecule which hybridise to the sequences represented in (ii) above; and which have glucosylfransferase activity; and iii) nucleic acid molecules consisting of nucleic acid sequences which are degenerate as a result of the genetic code to the sequences defined in (i) and (ii) above.
- nucleic acid molecules hybridise under stringent hybridisation conditions to the nucleic acid molecules as represented in Table 1.
- the GTase sequences disclosed in Table 1 are cDNA sequences to which the oil body targeting domain can be fused typically as an in-frame translational fusion.
- said nucleic acid encodes a cleavage site to facilitate the purification of said second part from said first part.
- said cleavage site is a proteolytic cleavage site.
- a vector comprising a nucleic acid molecule that encodes a chimeric polypeptide according to the invention.
- the nucleic acid in the vector is operably linked to an appropriate promoter or other regulatory elements for transcription in a host cell such as a prokaryotic, (e.g. bacterial), or eukaryotic (e.g. fungal, plant, mammalian or insect cell).
- a host cell such as a prokaryotic, (e.g. bacterial), or eukaryotic (e.g. fungal, plant, mammalian or insect cell).
- the vector may be a bi-functional expression vector which functions in multiple hosts.
- this may contain its native promoter or other regulatory elements and in the case of cDNA this may be under the control of an appropriate promoter or other regulatory elements for expression in the host cell.
- promoter is meant a nucleotide sequence upstream from the transcriptional initiation site and which contains all the regulatory regions required for transcription.
- Suitable promoters include constitutive, tissue-specific, inducible, developmental or other promoters for expression in plant cells comprised in plants depending on design.
- Such promoters include viral, fungal, bacterial, animal and plant-derived promoters capable of functioning in plant cells.
- Constitutive promoters include, for example CaMV 35S promoter (Odell et al (1985) Nature 313, 9810-812); rice actin (McElroy et al (1990) Plant Cell 2: 163-171); ubiquitin (Christian et al . (1989) Plant Mol. Biol. 18 (675-689); pEMU (Last et al (1991) Theor Appl. Genet. 81: 581-588); MAS (Velten et al (1984) EMBO J. 3. 2723-2730); ALS promoter (U.S. Application Seriel No. 08/409,297), and the like.
- Other constitutive promoters include those in U.S. Patent Nos. 5,608,149; 5,608,144; 5,604,121; 5,569,597; 5,466,785; 5,399,680, 5,268,463; and 5,608,142.
- Chemical-regulated promoters can be used to modulate the expression of a gene in a plant through the application of an exogenous chemical regulator.
- the promoter may be a chemical-inducible promoter, where application of the chemical induced gene expression, or a chemical-repressible promoter, where application of the chemical represses gene expression.
- Chemical-inducible promoters are known in the art and include, but are not limited to, the maize In2-2 promoter, which is activated by benzenesulfonamide herbicide safeners, the maize GST promoter, which is activated by hydrophobic electrophilic compounds that are used as pre-emergent herbicides, and the tobacco PR-la promoter, which is activated by salicylic acid.
- promoters of interest include steroid- responsive promoters (see, for example, the glucocorticoid-inducible promoter in Schena et al (1991) Proc. Natl. Acad. Sci. USA 88: 10421-10425 and McNellie et al. (1998) Plant J. 14(2): 247-257) and tetracycline-inducible and tetracycline- repressible promoters (see, for example, Gatz et al. (1991) Mol. Gen. Genet. 227: 229-237, and US Patent Nos. 5,814,618 and 5,789,156, herein incorporated by reference.
- tissue-specific promoters can be utilised.
- Tissue-specific promoters include those described by Yamamoto et al. (1997) Plant J. 12(2): 255-265; Kawamata et al (1997) Plant Cell Physiol. 38(7): 792-803; Hansen et al (1997) Mol. Gen. Genet. 254(3): 337-343; Russell et al. (1997) Transgenic Res. 6(2): 157-168; Rinehart et al (1996) Plant Physiol. 112(3): 1331-1341; Van Camp et al (1996) Plant Physiol. 112(2): 525-535; Canevascni et al (1996) Plant Physiol.
- tissue specific promoter is a promoter which is active during the accumulation of oil in developing oil seeds, (for example see Broun et al. (1998) Plant J. 13(2): 201-210.
- operably linked means joined as part of the same nucleic acid molecule, suitably positioned and oriented for transcription to be initiated from the promoter.
- DNA operably linked to a promoter is "under transcriptional initiation regulation" of the promoter, hi a preferred embodiment the promoter is an inducible promoter or a developmentally regulated promoter.
- vectors are nucleic acid constructs which operate as plant vectors. Specific procedures and vectors previously used with wide success upon plants are described by Guerineau and Mullineaux (1993) (Plant transformation and expression vectors. In: Plant Molecular Biology Labfax (Croy RRD ed) Oxford, BIOS Scientific Publishers, pp 121-148. Suitable vectors may include plant viral-derived vectors (see e.g. EP-A-194809).
- Vectors may also include selectable genetic marker such as those that confer selectable phenotypes such as resistance to herbicides (e.g. kanamycin, hygromycin, phosphinotricin, chlorsulfuron, methotrexate, gentamycin, spectinomycin, imidazolinones and glyphosate).
- selectable genetic marker such as those that confer selectable phenotypes such as resistance to herbicides (e.g. kanamycin, hygromycin, phosphinotricin, chlorsulfuron, methotrexate, gentamycin, spectinomycin, imidazolinones and glyphosate).
- said vectors are vectors suitable for mammalian cell transfection or yeast cell transfection.
- multi-copy vectors such as 2 ⁇ episomal vectors are preferred.
- yeast CEN vectors and integrating vectors such as YIP vectors are suitable for transformation of yeast species such as Saccharomyces cerevisiae and Pichia spp.
- a cell transfected or transformed with at least one nucleic acid molecule or vector according to the invention is provided.
- said cell is eukaryotic cell.
- said cell is a plant cell.
- a plant comprising a cell according to the invention.
- said plant is selected from: corn (Zea mays), canola (Brassica napus, Brassica rapa ssp.), flax (Linum usitatissimum), alfalfa (Medicago sativd), rice (Oryza sativa), rye (Secale cerale), sorghum (Sorghum bicolor, Sorghum vulgare), sunflower (Helianthus annus), wheat (Tritium aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis hypogaed), cotton (Gossypium hirsutum), sweet potato (Iopmoea batatus), cassava (Manihot esculenta), coffee (Cofea spp.), coconut (Cocos nucifer ⁇ ), pineapple (Anana comosus), citris tree (Cirrus' spp.)
- plants of the present invention are crop plants (for example, cereals and pulses, maize, wheat, potatoes, tapioca, rice, sorghum, millet, cassava, barley, pea), and other root, tuber or seed crops.
- Important seed crops are oil-seed rape, sugar beet, maize, sunflower, soybean,sorghum, and flax (linseed).
- Horticultural plants to which the present invention may be applied may include lettuce, endive, and vegetable brassicas including cabbage, broccoli, and cauliflower.
- the present invention may be applied in tobacco, cucurbits, carrot, strawberry, sunflower, tomato, pepper.
- Grain plants that provide seeds of interest include oil-seed plants and leguminous plants. Seeds of interest include grain seeds, such as corn, wheat, barley, rice, sorghum, rye, etc.
- said plant is an oil seed plant.
- Oil seed plants include cotton, soybean, safflower, sunflower, Brassica, maize, alfalfa, palm, coconut, etc.
- Leguminous plants include beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mungbean, lima bean, fava been, lentils, chickpea, etc.
- said oil seed plant is safflower.
- said cell is a prokaryotic cell.
- a seed comprising a cell according to the invention.
- a vaccine preparation comprising a cell, or a fraction of a cell, according to the invention.
- said preparation is an oil body containing fraction.
- said fraction includes a chimeric polypeptide according to the invention associated with oil bodies in said fraction.
- said preparation includes at least one carrier.
- a carrier is an immunogenic molecule which, when bound to a second molecule augments immune responses to the latter.
- Some antigens are not intrinsically immunogenic (i.e. not immunogenic in their own right) yet may be capable of generating antibody responses when associated with a foreign protein molecule such as keyhole-limpet haemocyanin or tetanus toxoid.
- Certain antigens which lack T-cell epitopes, such as polymers with a repeating B-cell epitope are intrinsically immunogenic to a limited extent. These are known as T-independent antigens.
- Such antigens benefit from association with a carrier such as tetanus toxoid, under which circumstance they elicit much stronger antibody responses.
- said preparation includes at least one adjuvant.
- An adjuvant is a substance or procedure which augments specific immune responses to antigens by modulating the activity of immune cells. Examples of adjuvant include, by example only, Freund's adjuvant, moray deputies, liposome's. Adjuvant are distinct from carriers.
- the vaccine preparations may conveniently be presented in unit dosage form and may be prepared by any of the methods well-known in the art of pharmacy. In general, the vaccine preparations are prepared by uniformly and intimately bringing the active polypeptide into association with a liquid stabiliser, a finely divided solid stabiliser, or both, and then, if necessary, shaping the product.
- Solid dosage forms for oral administration include capsules, tablets, pills, powders and granules.
- the active polypeptide is typically mixed with at least one inert, pharmaceutically acceptable recipient such as sodium citrate or declaim phosphate and/or one or more: a) fillers or extenders such as starches, lactose, sucrose, glucose, manifold and silica acid; b) binders such as carboxymethylcellulose, alginates, gelatin, polyvinylpyrrolidone, sucrose and acacia; c) humectants such as glycerol; d) disintegrating agents such as agar-agar, calcium carbonate, potato or tapioca starch, alginic acid, certain silicates and sodium carbonate; e) solution retarding agents such as paraffin; f) absorption accelerators such as quaternary ammonium compounds; g) wetting agents such as cetyl alcohol and glycerol monostearate; h) absorbents such as kaolin and bentonite clay and i) lubricants such as
- Solid compositions of a similar type may also be employed as fillers in soft and hard-filled gelatin capsules using such excipients as lactose or milk sugar as well as high molecular weight polyethylene glycol, for example.
- the solid dosage forms of tablets, dragees, capsules, pills, and granules can be prepared with coatings and shells such as enteric coatings and other coatings well known in the pharmaceutical formulating art. They may optionally contain opacifying agents and may also be of a composition such that they release the active ingredient(s) only, or preferentially, in a certain part of the intestinal tract, and/or in delayed fashion.
- Examples of embedding compositions that can be used include polymeric substances and waxes.
- a method to immunise an animal to at least the second part of the chimeric polypeptide according to the invention comprising administering an effective amount of a chimeric polypeptide or preparation according to the invention sufficient to stimulate an immune response to said second part.
- said animal is human.
- said animal is selected from the group consisting of: mouse; rat; hamster; goat; cow, horse, pig, dog, cat, or sheep.
- said immune response is the production of antibodies to said chimeric polypeptide.
- said immune response is the production of T-helper cells which recognise the second part of said chimeric polypeptide.
- a preferred route of administration is oral.
- intradermal, subcutaneous, intramuscular or intranasal immunisation is within the scope of the invention.
- an antibody obtainable by the method according to the invention.
- said antibody is a therapeutic antibody.
- said antibody is a diagnostic antibody.
- said diagnostic antibody is provided with a label or tag.
- said antibody is a monoclonal antibody or binding fragment thereof.
- said antibody is a humanised or chimeric antibody.
- a chimeric antibody is produced by recombinant methods to contain the variable region of an antibody with an invariant or constant region of a human antibody.
- a humanised antibody is produced by recombinant methods to combine the complementarity determining regions (CDRs) of an antibody with both the constant (C) regions and the framework regions from the variable (V) regions of a human antibody.
- Chimeric antibodies are recombinant antibodies in which all of the V-regions of a mouse or rat antibody are combined with human antibody C-regions.
- Humanised antibodies are recombinant hybrid antibodies which fuse the comphmentarity determining regions from a rodent antibody V-region with the framework regions from the human antibody V-regions. The C-regions from the human antibody are also used.
- the complimentarity determining regions (CDRs) are the regions within the N- terminal domain of both the heavy and light chain of the antibody to where the majority of the variation of the V-region is restricted. These regions form loops at the surface of the antibody molecule. These loops provide the binding surface between the antibody and antigen.
- Antibodies from non-human animals provoke an immune response to the foreign antibody and its removal from the circulation.
- Both chimeric and humanised antibodies have reduced antigenicity when injected to a human subject because there is a reduced amount of rodent (i.e. foreign) antibody within the recombinant hybrid antibody, while the human antibody regions do not elicit an immune response. This results in a weaker immune response and a decrease in the clearance of the antibody. This is clearly desirable when using therapeutic antibodies in the treatment of human diseases.
- Humanised antibodies are designed to have less "foreign" antibody regions and are therefore thought to be less immunogenic than chimeric antibodies.
- scFv's single chain antibody variable region fragments
- scFv's single chain antibody variable region fragments
- a hybridoma exists for a specific monoclonal antibody it is well within the knowledge of the skilled person to isolate scFv's from mRNA extracted from said hybridoma via RT PCR.
- phage display screening can be undertaken to identify clones expressing scFv's.
- said fragments are "domain antibody fragments". Domain antibodies are the smallest binding part of an antibody (approximately 13kDa). Examples of this technology is disclosed in US6, 248, 516, US6, 291, 158, US6,127, 197 and EP0368684 which are all incorporated by reference in their entirety.
- said antibodies are opsonic antibodies.
- Phagocytosis is mediated by macrophages and polymorphic leukocytes and involves the ingestion and digestion of micro-organisms, damaged or dead cells, cell debris, insoluble particles and activated clotting factors.
- Opsonins are agents which facilitate the phagocytosis of the above foreign bodies.
- Opsonic antibodies are therefore antibodies which provide the same function.
- Examples of opsonins are the Fc portion of an antibody or compliment C3. Antibodies raised by immunisation and in the form of an immune complex with antigen may bring about opsonisation via the fixation of complement on the antigen, or molecules in its immediate microenvironment.
- a method for preparing a hybridoma cell-line producing monoclonal antibodies comprising the steps of: i) immunising an immunocompetent mammal with a chimeric polypeptide or preparation according to the invention; ii) fusing lymphocytes of the immunised immunocompetent mammal with myeloma cells to form hybridoma cells; iii) screening monoclonal antibodies produced by the hybridoma cells of step (ii) for binding activity to polypeptide of the invention; iv) culturing the hybridoma cells to proliferate and/or to secrete said monoclonal antibody; and v) recovering the monoclonal antibody from the culture supernatant.
- said immunocompetent mammal is a rodent, for example a mouse, rat or hamster.
- hybridoma cell-line obtainable by the method according to the invention.
- a method to prepare a vaccine preparation according to the invention comprising; i) forming a cell extract from a plant cell culture or plant according to the invention; and ii) separating said extract into an oil body rich fraction.
- said extract is incubated with a protease which cleaves said second part from said first part.
- Figure la is the nucleic acid sequence of OLB1
- Figure lb is the amino acid sequence ofOLBl
- Figure 2a is the nucleic acid sequence of OLB 2;
- Figure 2b is the amino acid sequence of OLB2;
- Figure 3 is the amino acid sequence of the hydrophobic domain of OLB 1;
- Figure 4a and 4b is the nucleic acid sequence of a spider silk protein
- Table 1 shows the nucleic acid sequences of plant GTase cDNA's.
- Oil bodies were isolated from the endosperm of imbibed castor beans and the membranes delipidated using the methods of Hills et al., (2). Peripheral proteins were removed from the oil bodies using urea buffer, according to Millichip et al, (3). The membrane fraction was solublized in SDS-loading buffer, heated at 70°C for 10 min and the polypeptides separated on a 14% SDS-PAGE gel as described by Laemmli (5) and stained using 0.2% (w/v) Coomassie blue R-250 in methanol: acetic acid: water (4:1:5 v/v/v).
- Sequencing-grade modified porcine trypsin (Promega) was dissolved in the 50 mM acetic acid supplied by the manufacturer, then diluted 5-fold by adding 25 mM ammonium bicarbonate containing 0.1% (v.v) octyl- ⁇ -D-glucopyranoside (Sigma) to give a final trypsin concentration is 0.02 ⁇ g/ ⁇ L. Gel pieces were rehydrated by adding 10 ⁇ L of trypsin solution, and after 30 min enough trypsin solution was added to cover the gel. Digests were incubated overnight at 37°C.
- Tryptic digests were applied directly to the MALDI target plate. Dried peptides were redissolved in 0.5 ⁇ L of formic acid followed by addition of 9.5 ⁇ L of water, and one or more 0.5 ⁇ L aliquots were spotted and dried onto the MALDI target plate. A matrix solution containing 10 mg/mL 4-hydroxy-c-cyano-cinnamic acid (Sigma) in 50% aqueous (v:v) acetonitrile containing 0.1%) TFA (v:v) was freshly prepared. Serial 2-fold dilutions of matrix solution were spotted onto the MALDI target plate, and the dilution that gave a uniform layer of crystals when viewed in the mass spectrometer was selected for use with samples. A 0.5 ⁇ L aliquot of matrix solution was pipetted onto the dried peptide spot. In some cases, the matrix was added directly to the sample droplet on the target plate before drying.
- MALDI mass spectra were obtained using an Applied Biosystems 4700 Proteomics Analyzer (CTS version, Applied Biosystems, Foster City, CA, USA) in reflection mode with an accelerating voltage of 20 kV.
- MS spectra were acquired with a total of 1000 laser pulses over a mass range of m z 800-4000.
- Final mass spectra were the summation of 20 sub-spectra, each acquired with 50 laser pulses, and internally calibrated using the tryptic peptides at m/z 842.509 and 2211.104.
- Monoisotopic masses were obtained from centroids of raw, unsmoothed data.
- a Source 1 accelerating voltage of 8 kV, a collision energy of 1 kV, and a Source 2 accelerating voltage of 15 kV were used. Air was used as the collision gas at the instrument's 'medium' pressure setting with a recharge threshold of 9.9xl0 "7 torr, which produced a Source 2 pressure of about lxlO "6 torr.
- the precursor mass window was set to +/-10 "Da", and the metastable suppressor was enabled. The default calibration was used for MS/MS spectra.
- Mass spectral data obtained in batch mode were submitted to database searching using a locally-running copy of the Mascot program (Matrix Science Ltd., versionl.7).
- Batch-acquired MS and MS/MS spectral data were submitted to a combined peptide mass fingerprint and MS/MS ion search through the Applied Biosystems GPS Explorer software interface (version 1.0) to Mascot.
- Peptide sequence tags were generated from CID-MS/MS spectra by manual interpretation or using a de novo sequencing program supplied by Applied Biosystems.
- RNA extraction, cDNA synthesis and PCR - Total RNA from various tissues was isolated using the RNeasy kit from Qiagen.
- the synthesis of single stranded cDNA was carried out using SuperscriptTM II RNase H " reverse transcriptase from Invitrogen.
- Degenerate primers corresponding to peptide sequences 1 and 2 were designed (5'-ttgatagtirtyagyttyaga and 5'-ctgtccraatgtrtaiarctt) and used to amplify a fragment of the acid lipase (RcOBLl) cDNA from imbibed seed endosperm.
- Gene specific primers (5'-gaccacttggtatgggcatatgatgg and 5'-catgtcattgcagtaaaccaccctga) were then used to obtain the full-length cDNA sequence by 3'- and 5 '-RACE using the SMARTTM RACE cDNA Amplification kit from BD Biosciences, following the manufacturers protocols.
- Primers to a castor actin-like gene (RcACT) (5'- cgttctctccttgtatgccagtggtc and 5'-gagctgctcttggcagtctcaagttc) were used as a constitutive control for RT-PCR experiments on various tissues.
- a partial clone of RcOBL2 was obtained by performing PCR on cDNA from four-day old endosperm using different combinations of four pairs of degenerate primers designed using alignments of multiple OBL-like genes (5'- rtyagyttyagaggiachgarc or 5'-rtyagyttyagaggiachgayc or 5'-rtyagyttyagaggcachgarc or 5'- rtyagyttyagaggcachgayc with 5'-giaccatrtcrttrtrtabac or 5'-giaccatrtcrttrcartabac or 5'-giaciayrtcrtttrtabac or 5'-giaciayrtcrttrcartabac).
- Gene specific primers (5'- taggtctgggcaacagaagtgacgctac and 5'-tgcccaaatgtgtatatgttcagcaacc) were then used to obtain the full-length RcOBLl cDNA sequence by 3'- and 5 '-RACE using the SMARTTM RACE cDNA Amplification kit from BD Biosciences, following the manufacturers protocols.
- TAG triacylglycerol
- CHAPS 3-cholamidopropyl- dimethylammonio-1-propanesulphonate
- PEP propyl endopeptidase
- TFA trifluoroacetic acid
- MALDI matrix-assisted laser desorption/ionization
- MS mass spectrometry
- CE collision-induced dissociation
- PCR polymerase chain reaction
- RT-PCR reverse transcriptase-PCR
- RACE-PCR rapid amplification if cDNA ends- PCR
- PAGE polyacrylamide gel electrophoresis
- MBP maltose binding protein.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Biomedical Technology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Pharmacology & Pharmacy (AREA)
- Physics & Mathematics (AREA)
- Immunology (AREA)
- Cell Biology (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Epidemiology (AREA)
- Veterinary Medicine (AREA)
- Mycology (AREA)
- Insects & Arthropods (AREA)
- Tropical Medicine & Parasitology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Toxicology (AREA)
- Gastroenterology & Hepatology (AREA)
- Developmental Biology & Embryology (AREA)
- Pregnancy & Childbirth (AREA)
- Reproductive Health (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
The invention relates to the manufacture of recombinant proteins; vectors for use in the recombinant manufacture of protein and recombinant proteins that are adapted for purification.
Description
Expression of Heterologous Protein
The invention relates to the manufacture of recombinant proteins; vectors for use in the recombinant manufacture of protein and recombinant proteins that are adapted for purification.
The large scale production of recombinant proteins requires a high standard of quality control since many of these proteins are used as pharmaceuticals, for example: growth hormone; leptin; erythropoietin; prolactin, interleukins; interferon's. Moreover, the development of vaccines, particularly subunit vaccines, (vaccines based on a defined antigen, for example gpl20 of HIV), requires the production of large amounts of pure protein free from contaminating antigens which may provoke anaphylaxis. The production of recombinant protein in cell expression systems is based either on prokaryotic cell expression or eukaryotic cell expression. The latter is preferred when post-translation modifications to the protein are required. Eukaryotic systems include the use of mammalian cells, e.g. Chinese Hamster Ovary cells; insect cells e.g. Spodoptera spp; fungal/yeast (e.g. Saccharomyces spp, Pichia spp, Aspergillus spp) and plants.
The production of heterologous recombinant proteins in an expression system relates to a variety of proteins used in both industrial and medical applications.
For example, "therapeutic polypeptides" include tumour suppressor polypeptides (e.g. p53 polypeptide, the APC polypeptide, the DPC-4 polypeptide, the BRCA-1 polypeptide, the BRCA-2 polypeptide, the WT-1 polypeptide, the retinoblastoma polypeptide (Lee, et al. (1987) Nature 329:642), the MMAC-1 polypeptide, the adenomatous polyposis coli protein (United States Patent 5,783,666), the deleted in colon carcinoma (DCC) polypeptide, the MMSC-2 polypeptide, the NF-1 polypeptide, nasopharyngeal carcinoma tumour suppressor polypeptide (Cheng, et al. 1998. Proc. Nat. Acad. Sci. 95:3042-3047), the MTSl polypeptide, the CD 4 polypeptide, the NF-1 polypeptide, the NF2 polypeptide, and the VHL polypeptide.
Therapeutic polypeptides also include "antigenic polypeptides" (e.g. tumour rejection antigens the MAGE, BAGE, GAGE and DAGE families of tumour rejection antigens, see Schulz et al Proc Natl Acad Sci USA, 1991, 88, pp991-993). Antigenic polypeptides also includes polypeptide antigens used in the preparation of vaccines which provide protection against infectious agents. For example, viruses such as Human Immunodeficiency Virus (HIV1 & 2); Human T Cell Leukaemia Virus (HTLV 1 & 2); Ebola virus; Human Papilloma Virus (e.g. HPV-2, HPV-5, HPV-8 HPV-16, HPV-18, HPV-31, HPV-33, HPV-52, HPV-54 and HPV-56); papovavirus; rhinovirus; poliovirus; herpesvirus; adenovirus; Epstein Barr virus; influenza virus, hepatitis B and C viruses. Antigens derived from pathogenic bacteria such as Staphylococcus aureus; Staphylococcus epidermidis; Enterococcus faecalis; Mycobacterium tuberculsis; Streptococcus group B; Streptoccocus pneumoniae; Helicobacter pylori; Neisseria gonorrhea; Streptococcus group A; Borrelia burgdorferi; Coccidiodes immitis; Histoplasma sapsulatum; Neisseria meningitidis ; Shigella flexneri; Escherichia coli; Haemophilus influenzae. Antigens derived from parasites such as Trypanosoma spp, Plasmodia spp, Schiztosoma spp; and pathogenic fungi such as Candida spp.
The development of subunit vaccines (e.g. vaccines in which the immunogen is a purified protein) has been the focus of considerable research in recent years and requires the isolation and purification of the candidate protein from a complex mixture of proteins. The emergence of new pathogens and the growth of antibiotic resistance have created a need to develop new vaccines and to identify further candidate molecules useful in the development of subunit vaccines. There are a number of routes via which vaccines may be administered to an animal. The traditional route is by transdermal injection of the vaccine containing an immunogen that invokes an immune response. An alternative route that is not typically undertaken is an orally administered vaccine. The immunogen contacts the immune system via the mammalian muscosal membrane which lines the small intestine through the activity of so called M cells. The M cells overlie regions of the intestine
referred to as Peyers Patches and function to sample antigens contained in ingested material and are therefore essential components in mediating mucosal immunity. Peyers Patches have been identified as major sites for the adherence and entry of enteric pathogens such as Escherichia, Salmonella, Yersina and Shigella.
Plant derived vaccines have been developed which utilise plant oil bodies which have been engineered to present an antigen to which an immune response is desired, see WO01/95934. Plants that bear oilseeds such as soybean, rapeseed and sunflower store triglycerides in their seeds which provide an energy store for germination and subsequent seedling development. Further examples of this are described in WO93/20216; WO96/21029 and WO98/21115.
Therapeutic polypeptides can be "cytotoxic polypeptides" (e.g. pseudomonas exotoxin, ricin toxin, diptheria toxin); or polypeptides which are "cytostatic polypeptides" (e.g. p21, the retinoblastoma polypeptide, the E2F-Rb polypeptide, cyclin dependent kinase inhibitors such as P 16, pi 5, pi 8 and pi 9, the growth arrest specific homeobox (GAX) polypeptide as described in Branellec, et al, see WO97/16459 and WO96/30385.
Also included within the scope of therapeutic polypeptides are therapeutic antibodies. Preferably said antibodies are monoclonal antibodies or at least the active binding fragments thereof. Monoclonal antibodies may be humanised or chimeric antibodies. A chimeric antibody is produced by recombinant methods to contain the variable region of an antibody with an invariant or constant region of a human antibody. A humanised antibody is produced by recombinant methods to combine the complementarity determining regions (CDRs) of an antibody with both the constant (C) regions and the framework regions from the variable (V) regions of a human antibody.
Plants that bear oilseeds such as soybean, rapeseed and sunflower store triglycerides in their seeds which provide an energy store for germination and subsequent seedling
development. In the seeds the oil globules form oil bodies in the cytoplasm that are membrane bound organelles that have protein in the membrane surface. An example of a protein found associated with oil bodies is an oleosm. The oleosins are a family of proteins that have divergent amino and carboxyl termini with more highly conserved lipophilic central domain. These proteins are strongly associated with oil bodies and it is thought that the association is due in part to the lipophilic domain.
We have cloned two plant lipases referred to as OBL1 and OBL 2 from castor bean that have different lipid specificities. The sequence of OBL 1 and OBL 2 has revealed a lipophilic domain that lacks any homology to that found in the oleosin described in WO93/21320 and represents an alternative means to localise polypeptides which comprise this sequence to oil bodies and thereby facilitate their purification.
According to an aspect of the invention there is provided a nucleic acid molecule that encodes a chimeric polypeptide wherein said nucleic acid molecule comprises a first part selected from the group consisting of: i) a nucleic acid molecule consisting of a nucleic acid sequence which encodes an amino acid sequence as represented in Figure 3; ii) a nucleic acid molecule which hybridises to the nucleic acid molecule in (i) and which specifically targets said polypeptide to an oil body; and iii) a nucleic acid molecule which differs from the nucleic acid molecules of (i) and (ii) due to the degeneracy in the genetic code; and a second part that encodes an heterologous polypeptide.
An heterologous polypeptide is a polypeptide to which a targeting domain as herein disclosed has been fused which would not naturally include such a targeting domain.
In a preferred embodiment of the invention said targeting domain is encoded by a nucleic acid molecule comprising a nucleic acid sequence which hybridises under
stringent hybridisation conditions to a nucleic acid molecule which encodes an amino acid sequence as represented in Figure 3.
Stringent hybridisation/washing conditions are well known in the art. For example, nucleic acid hybrids that are stable after washing in O.lx SSC, 0.1% SDS at 60°C. It is well known in the art that optimal hybridisation conditions can be calculated if the sequence of the nucleic acid is known. Typically, hybridisation conditions uses 4 - 6 x SSPE (20x SSPE contains 175.3g NaCl, 88.2g NaH2PO4 H2O and 7.4g EDTA dissolved to 1 litre and the pH adjusted to 7.4); 5-10x Denhardts solution (50x Denhardts solution contains 5g Ficoll (type 400, Pharmacia), 5g polyvinylpyrrolidone and 5g bovine serum albumen; lOOμg-l.Omg ml sonicated salmon/herring DNA; 0.1- 1.0% sodium dodecyl sulphate; optionally 40-60% deionised formamide. Hybridisation temperature will vary depending on the GC content of the nucleic acid target sequence but will typically be between 42°- 65° C.
In a preferred embodiment of the invention said nucleic acid molecule comprises a nucleic acid sequence that has at least 30% homology to the nucleic acid sequence which encodes an amino acid sequence as represented in Figure 3. Preferably said homology is at least 40%; 50%; 60%; 70%; 80%; 90%; or at least 99% identity with the nucleic acid sequence that encodes the amino acid sequence as represented in Figure 3.
According to a further aspect of the invention there is provided a chimeric polypeptide encoded by a nucleic acid molecule according to the invention.
Recombinant protein production relates to the synthesis of protein in expression systems (e.g. bacterial expression systems, yeast expression systems, insect expression systems, mammalian expression systems and plant expression systems). Heterologous polypeptides include commercially important polypeptides, for example enzymes used in biocatalysis (e.g. restriction enzymes, enzymes used in industrial processing; e.g. amylases, proteases, nucleases, lipases) and therapeutic
polypeptides as herein disclosed. The large scale production of recombinant proteins requires a high standard of quality control since many of these proteins are used as pharmaceuticals, for example, interleukins, growth hormone, erythropoietin, interferon. Moreover, the development of vaccines, particularly subunit vaccines, (vaccines based on a defined antigen, for example gpl20 of HIV), requires the production of large amounts of pure protein free from contaminating antigens which may provoke anaphylaxis.
In a preferred embodiment of the invention said second part encodes an enzyme.
In a further preferred embodiment of the invention said second part encodes a pharmaceutical polypeptide.
In a yet further preferred embodiment of the invention said second part encodes an antigenic polypeptide. Preferably said second part encodes an antigenic polypeptide associated with an infectious disease condition.
i a preferred embodiment of the invention said second part encodes a polypeptide associated with a viral disease condition.
hi a preferred embodiment of the invention said viral disease is the result of an infection caused by a virus selected from the families consisting of: Picornaviridae;
Caliciviridae; Togaviridae; Flaviridae; Coronaviridae; Rhabdoviridae; Filoviridae;
Paramyxoviridae; Orthomyxoviridae; Bunyaviridae; Arenaviridae; Reoviridae; Bimavirdae; Retroviridae; Hepadnaviridae; Parvoviridae; Papoviridae; Adenoviridae;
Herpesviridae; and Poxviridae.
In a preferred embodiment of the invention said virus is selected from the families consisting of: Adenoviridae; Coronaviridae; Herpesviridae; Orthomyxoviridae; Picornaviridae; Poxviridae; Reoviridae; and Retroviridae.
In a further preferred embodiment of the invention said second part encodes a polypeptide associated with a bacterial disease condition.
In a preferred embodiment of the invention said bacterial disease is caused by a bacterial pathogen selected from the group consisting of: Actinomyces; Aeromonas; Bacillus; Bacteroides; Bordetella; Brucella; Compylobacter; Capnocyclophaga; Chlamydia; Clostridium; Corynebacterium; Eikenella; Erysipelothrix; Escherichia; Fusobacterium; Hemophilus; Klebseilla; Legionella; Leptospira; Listeria; Mycobacterium; Mycoplasma; Neisseria; Nocardia; Pasteurella; Proteus; Pseudomonas; Rickettsia; Salmonella; Selenomonas; Shigelia; Staphylococcus; Streptococcus; Treponema; Vibro and Yersnia.
In a further preferred embodiment of the invention said second part encodes a polypeptide associated with a fungal disease condition.
In a preferred embodiment of the invention said fungal disease condition is caused by a fungal pathogen selected from the group consisting of: Aspercillus; Blastomyces; Candida; Cryptococcus; Histoplasma; Phycomyces; and Saccharomyces.
In a further preferred embodiment of the invention said second part encodes a polypeptide associated with a parasitic disease condition.
In a preferred embodiment of the invention said parasitic disease condition is caused by a parasitic pathogen selected from the group consisting of: Trypanosoma; Leishmania; Schistosoma; Plasmodium; Entamoeba; Eimeria; and Trichomonas.
hi a preferred embodiment of the invention said parasitic pathogen is selected from the group consisting of: Entamoeba; Eimeria; and Trichomonas.
In an alternative preferred embodiment of the invention said second part encodes a polypeptide associated with cancer.
In a preferred embodiment of the invention said polypeptide is a tumour rejection antigen polypeptide.
In an alternative preferred embodiment of the invention said second part encodes a polypeptide that is an adhesive polypeptide, preferably said adhesive polypeptide is a mussel adhesive polypeptide.
Mussel-derived adhesives possess a variety of characteristics that make them of particular interest to material scientists. The adhesives display a high bonding strength, versatility to a variety of bonding surfaces, resistance and durability, non- toxicity, waterproof and cure without high temperatures properties, characteristics which give these adhesives a range of potential uses and which no man-made adhesive on the market currently possesses. These properties have led to predictions of wide-ranging applications in engineering and medicine. For example a requirement for a natural, strong and waterproof adhesive can be found in the marine industry, where it can be used in ship-building. Medical uses, where such an adhesive, with its further advantages of being neither cytotoxic nor antigenic could be used in orthopaedic and dental applications.
In a preferred embodiment of the invention said mussel adhesive polypeptide is encoded by a nucleic acid molecule selected from the group consisting of:
i) a nucleic acid molecule which encodes a polypeptide comprising the amino acid motif
X-K-X-X-Y-P-X-X-Y-K wherein X is any amino acid residue; ii) a nucleic acid molecule which encodes a polypeptide wherein at least one of the residues in said motif is modified by hydroxylation; and iii) a nucleic acid molecule which encodes a polypeptide which has adhesive properties.
In a preferred embodiment of the invention said adhesive polypeptide comprises the amino acid sequence motif: A-K-P-S-Y-P-P-T-Y-K. Preferably said adhesive polypeptide comprises of at least one motif which has the amino acid sequence A-K- P-S-Y-P-P-T-Y-K. Preferably said adhesive polypeptide consists of a plurality of said motifs linked in tandem.
In a preferred embodiment of the invention said adhesive polypeptide comprises at least 10 motifs, preferably greater than 10 motifs. More preferably still said adhesive polypeptide comprises at least 50 motifs, preferably between 50 and 100 motifs, more preferably between 60-85 motifs.
In a further preferred embodiment of the invention said hydroxylation is to an amino acid at position 6 and/or 7 and/or 9 of at least one motif of the amino acid sequence, X-K-X-X-Y-P-X-X-Y-K. More preferably still said hydroxylation is to amino acid position 6 and/or 7 and or 9 of at least one motif of the amino acid sequence, A-K-P- S-Y-P-P-T-Y-K.
In a further preferred embodiment of the invention said adhesive polypeptide further comprises a second motif wherein said second motif is represented by the amino acid sequence: X P X P X X X X Y K.
Preferably said second motif is selected from the group consisting of at least one of P P K P T P P T Y K; S P P P P V Y K Y K; or S P P P P T P V Y K, or combinations thereof.
In a further preferred embodiment of the invention said nucleic acid molecule encodes a secretion signal, for example an expansin secretion signal.
h a preferred embodiment of the invention said secretion signal comprises the amino acid sequence: MAFSYSPFSSLFLLPFFFVFTFADYGGWQSGHATFYGGGDASGT,
or functional variant thereof.
In a yet further preferred embodiment of the invention said adhesive polypeptide comprises an endoplasmic reticulum targeting sequence, preferably said sequence is represented by the amino acid sequence HDEL, or functional variant thereof.
In a further preferred embodiment of the invention said second part encodes a spider polypeptide, preferably a spider silk polypeptide.
In our co-pending application WO04/016651, we describe a new spider silk protein. In addition to the tensile strength and elasticity of the silk, the additional characteristics of super contraction, insolubility and non-allergenicity make the use of silk in medical and industrial applications an attractive option. Medical applications include; artificial tendons/ligaments, wound-closure systems, such as vascular wound repair devices, haemostatic dressings, patches, glues and sutures. Industrial applications include; bullet-proof vests, light-weight body armour, cables, ropes and parachute cords, even fishing line. Other advantages in using spider silk rather than the synthetic alternatives such as Kevlar™ is that this fibre is derived from petrochemicals, which are processed under extreme, often polluting conditions and add to the waste accumulating in landfills.
In a preferred embodiment of the invention said spider silk polypeptide is encoded by a nucleic acid molecule comprising a nucleic acid sequence selected from the group consisting of: i) a nucleic acid sequence consisting of the sequence as represented in
Figure 4a or 4b; ii) a nucleic acid sequence which encodes a polypeptide domain comprising the amino acid sequence AGRGQGGYGQGAGG and at least two motifs rich in polyalanine wherein said polyalanine motifs comprise at least 6 alanine amino acid residues;
iii) a nucleic acid sequence which hybridises to the sequence presented in
(i) and (ii) above and which encodes a silk polypeptide; and iv) a nucleic acid sequences that are degenerate as a result of the genetic code to the nucleic acid sequence defined in (i) - (iii) above.
In a preferred embodiment of the invention said domain comprises a motif comprising a repetitive tripeptide sequence, GGX (glycine:glycine: X is any amino acid residue), and at least one SS(serine: serine)-polyalanine sequence.
In a yet further preferred embodiment of the invention said second part encodes a polypeptide with glucosylfransferase activity.
Typically, GTases are enzymes that post-translationally transfer glucosyl residues from an activated nucleotide sugar to monomeric and polymeric acceptor molecules such as other sugars, proteins, lipids and other organic substrates. These glucosylated molecules take part in diverse metabolic pathways and processes. The transfer of a glucosyl moiety can alter the acceptor's bioactivity, solubility and transport properties within the cell and throughout the plant. One family of GTases in higher plants is defined by the presence of a C-terminal consensus sequence. The GTases of this family function in the cytosol of plant cells and catalyse the transfer of glucose to small molecular weight substrates, such as phenylpropanoid derivatives, coumarins, flavonoids, other secondary metabolites and molecules known to act as plant hormones. Other GTases transfers glucosyl residues to peptides and polypeptides.
In a preferred embodiment of the invention said glucosylfransferase is encoded by a nucleic acid molecule selected from the group consisting of: i) a nucleic acid molecule selected from the group consisting of nucleic acid sequences as represented in Table 1; ii) a nucleic acid molecule which hybridise to the sequences represented in (ii) above; and which have glucosylfransferase activity; and
iii) nucleic acid molecules consisting of nucleic acid sequences which are degenerate as a result of the genetic code to the sequences defined in (i) and (ii) above.
I a preferred embodiment of the invention said nucleic acid molecules hybridise under stringent hybridisation conditions to the nucleic acid molecules as represented in Table 1.
The GTase sequences disclosed in Table 1 are cDNA sequences to which the oil body targeting domain can be fused typically as an in-frame translational fusion.
In a still further preferred embodiment of the invention said nucleic acid encodes a cleavage site to facilitate the purification of said second part from said first part. Preferably said cleavage site is a proteolytic cleavage site.
According to a further aspect of the invention there is provided a vector comprising a nucleic acid molecule that encodes a chimeric polypeptide according to the invention.
Preferably the nucleic acid in the vector is operably linked to an appropriate promoter or other regulatory elements for transcription in a host cell such as a prokaryotic, (e.g. bacterial), or eukaryotic (e.g. fungal, plant, mammalian or insect cell). The vector may be a bi-functional expression vector which functions in multiple hosts. In the example of nucleic acids encoding polypeptides according to the invention this may contain its native promoter or other regulatory elements and in the case of cDNA this may be under the control of an appropriate promoter or other regulatory elements for expression in the host cell.
By "promoter" is meant a nucleotide sequence upstream from the transcriptional initiation site and which contains all the regulatory regions required for transcription.
Suitable promoters include constitutive, tissue-specific, inducible, developmental or other promoters for expression in plant cells comprised in plants depending on
design. Such promoters include viral, fungal, bacterial, animal and plant-derived promoters capable of functioning in plant cells.
Constitutive promoters include, for example CaMV 35S promoter (Odell et al (1985) Nature 313, 9810-812); rice actin (McElroy et al (1990) Plant Cell 2: 163-171); ubiquitin (Christian et al . (1989) Plant Mol. Biol. 18 (675-689); pEMU (Last et al (1991) Theor Appl. Genet. 81: 581-588); MAS (Velten et al (1984) EMBO J. 3. 2723-2730); ALS promoter (U.S. Application Seriel No. 08/409,297), and the like. Other constitutive promoters include those in U.S. Patent Nos. 5,608,149; 5,608,144; 5,604,121; 5,569,597; 5,466,785; 5,399,680, 5,268,463; and 5,608,142.
Chemical-regulated promoters can be used to modulate the expression of a gene in a plant through the application of an exogenous chemical regulator. Depending upon the objective, the promoter may be a chemical-inducible promoter, where application of the chemical induced gene expression, or a chemical-repressible promoter, where application of the chemical represses gene expression. Chemical-inducible promoters are known in the art and include, but are not limited to, the maize In2-2 promoter, which is activated by benzenesulfonamide herbicide safeners, the maize GST promoter, which is activated by hydrophobic electrophilic compounds that are used as pre-emergent herbicides, and the tobacco PR-la promoter, which is activated by salicylic acid. Other chemical-regulated promoters of interest include steroid- responsive promoters (see, for example, the glucocorticoid-inducible promoter in Schena et al (1991) Proc. Natl. Acad. Sci. USA 88: 10421-10425 and McNellie et al. (1998) Plant J. 14(2): 247-257) and tetracycline-inducible and tetracycline- repressible promoters (see, for example, Gatz et al. (1991) Mol. Gen. Genet. 227: 229-237, and US Patent Nos. 5,814,618 and 5,789,156, herein incorporated by reference.
Where enhanced expression in particular tissues is desired, tissue-specific promoters can be utilised. Tissue-specific promoters include those described by Yamamoto et al. (1997) Plant J. 12(2): 255-265; Kawamata et al (1997) Plant Cell Physiol. 38(7): 792-803; Hansen et al (1997) Mol. Gen. Genet. 254(3): 337-343; Russell et al. (1997) Transgenic Res. 6(2): 157-168; Rinehart et al (1996) Plant Physiol. 112(3): 1331-1341; Van Camp et al (1996) Plant Physiol. 112(2): 525-535; Canevascni et al (1996) Plant Physiol. 112(2): 513-524; Yamamoto et al (1994) Plant Cell Physiol. 35(5): 773-778; Lam (1994) Results Probl. Cell Differ. 20: 181-196; Orozco et al (1993) Plant Mol. Biol. 23(6): 1129-1138; Mutsuoka et al (1993) Proc. Natl. Acad. Sci. USA 90(20): 9586-9590; and Guevara-Garcia et al (1993) Plant J. 4(3): 495-50.
In a preferred embodiment of the invention said tissue specific promoter is a promoter which is active during the accumulation of oil in developing oil seeds, (for example see Broun et al. (1998) Plant J. 13(2): 201-210.
"Operably linked" means joined as part of the same nucleic acid molecule, suitably positioned and oriented for transcription to be initiated from the promoter. DNA operably linked to a promoter is "under transcriptional initiation regulation" of the promoter, hi a preferred embodiment the promoter is an inducible promoter or a developmentally regulated promoter.
Particular vectors are nucleic acid constructs which operate as plant vectors. Specific procedures and vectors previously used with wide success upon plants are described by Guerineau and Mullineaux (1993) (Plant transformation and expression vectors. In: Plant Molecular Biology Labfax (Croy RRD ed) Oxford, BIOS Scientific Publishers, pp 121-148. Suitable vectors may include plant viral-derived vectors (see e.g. EP-A-194809).
Vectors may also include selectable genetic marker such as those that confer selectable phenotypes such as resistance to herbicides (e.g. kanamycin, hygromycin,
phosphinotricin, chlorsulfuron, methotrexate, gentamycin, spectinomycin, imidazolinones and glyphosate).
Alternatively, or in addition, said vectors are vectors suitable for mammalian cell transfection or yeast cell transfection. In the latter example multi-copy vectors such as 2μ episomal vectors are preferred. Alternatively yeast CEN vectors and integrating vectors such as YIP vectors are suitable for transformation of yeast species such as Saccharomyces cerevisiae and Pichia spp.
According to a yet further aspect of the invention there is provided a method to produce a chimeric polypeptide according to the invention comprising the steps of:
i) providing a cell according to the invention and growth conditions conducive to the production of a chimeric polypeptide according to the invention; and optionally
ii) purifying said chimeric polypeptide from said cell or growth media.
According to a further aspect of the invention there is provided a cell transfected or transformed with at least one nucleic acid molecule or vector according to the invention.
In a preferred embodiment of the invention said cell is eukaryotic cell.
h a preferred embodiment of the invention said cell is a plant cell.
According to a further aspect of the invention there is provided a plant comprising a cell according to the invention.
In a preferred embodiment of the invention said plant is selected from: corn (Zea mays), canola (Brassica napus, Brassica rapa ssp.), flax (Linum usitatissimum),
alfalfa (Medicago sativd), rice (Oryza sativa), rye (Secale cerale), sorghum (Sorghum bicolor, Sorghum vulgare), sunflower (Helianthus annus), wheat (Tritium aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis hypogaed), cotton (Gossypium hirsutum), sweet potato (Iopmoea batatus), cassava (Manihot esculenta), coffee (Cofea spp.), coconut (Cocos nuciferά), pineapple (Anana comosus), citris tree (Cirrus' spp.) cocoa (Theobroma cacao), tea (Camellia senensis), banana (Musa spp.), avacado (Persea americana), fig (Ficus casica), guava (Psidium guajava), mango (Mangifer indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia inter grifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), oats, barley, vegetables and ornamentals.
Preferably, plants of the present invention are crop plants (for example, cereals and pulses, maize, wheat, potatoes, tapioca, rice, sorghum, millet, cassava, barley, pea), and other root, tuber or seed crops. Important seed crops are oil-seed rape, sugar beet, maize, sunflower, soybean,sorghum, and flax (linseed). Horticultural plants to which the present invention may be applied may include lettuce, endive, and vegetable brassicas including cabbage, broccoli, and cauliflower. The present invention may be applied in tobacco, cucurbits, carrot, strawberry, sunflower, tomato, pepper. Grain plants that provide seeds of interest include oil-seed plants and leguminous plants. Seeds of interest include grain seeds, such as corn, wheat, barley, rice, sorghum, rye, etc.
In a preferred embodiment of the invention said plant is an oil seed plant.
Oil seed plants include cotton, soybean, safflower, sunflower, Brassica, maize, alfalfa, palm, coconut, etc. Leguminous plants include beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mungbean, lima bean, fava been, lentils, chickpea, etc.
In a preferred embodiment of the invention said oil seed plant is safflower.
hi a further preferred embodiment of the invention said cell is a prokaryotic cell.
According to a further aspect of the invention there is provided a seed comprising a cell according to the invention.
According to a further aspect of the invention there is provided a vaccine preparation comprising a cell, or a fraction of a cell, according to the invention.
In a preferred embodiment of the invention said preparation is an oil body containing fraction. Preferably said fraction includes a chimeric polypeptide according to the invention associated with oil bodies in said fraction.
In a further preferred embodiment of the invention said preparation includes at least one carrier.
The term carrier is construed in the following manner. A carrier is an immunogenic molecule which, when bound to a second molecule augments immune responses to the latter. Some antigens are not intrinsically immunogenic (i.e. not immunogenic in their own right) yet may be capable of generating antibody responses when associated with a foreign protein molecule such as keyhole-limpet haemocyanin or tetanus toxoid. Certain antigens which lack T-cell epitopes, such as polymers with a repeating B-cell epitope are intrinsically immunogenic to a limited extent. These are known as T-independent antigens. Such antigens benefit from association with a carrier such as tetanus toxoid, under which circumstance they elicit much stronger antibody responses.
In a further preferred embodiment of the invention said preparation includes at least one adjuvant.
An adjuvant is a substance or procedure which augments specific immune responses to antigens by modulating the activity of immune cells. Examples of adjuvant include, by example only, Freund's adjuvant, moray deputies, liposome's. Adjuvant are distinct from carriers.
The vaccine preparations may conveniently be presented in unit dosage form and may be prepared by any of the methods well-known in the art of pharmacy. In general, the vaccine preparations are prepared by uniformly and intimately bringing the active polypeptide into association with a liquid stabiliser, a finely divided solid stabiliser, or both, and then, if necessary, shaping the product. Solid dosage forms for oral administration include capsules, tablets, pills, powders and granules. In such solid dosage forms, the active polypeptide is typically mixed with at least one inert, pharmaceutically acceptable recipient such as sodium citrate or declaim phosphate and/or one or more: a) fillers or extenders such as starches, lactose, sucrose, glucose, manifold and silica acid; b) binders such as carboxymethylcellulose, alginates, gelatin, polyvinylpyrrolidone, sucrose and acacia; c) humectants such as glycerol; d) disintegrating agents such as agar-agar, calcium carbonate, potato or tapioca starch, alginic acid, certain silicates and sodium carbonate; e) solution retarding agents such as paraffin; f) absorption accelerators such as quaternary ammonium compounds; g) wetting agents such as cetyl alcohol and glycerol monostearate; h) absorbents such as kaolin and bentonite clay and i) lubricants such as talc, calcium stearate, magnesium stearate, solid polyethylene glycols, sodium lauryl sulfate and mixtures thereof, hi the case of capsules, tablets and pills, the dosage form may also comprise buffering agents. Solid compositions of a similar type may also be employed as fillers in soft and hard-filled gelatin capsules using such excipients as lactose or milk sugar as well as high molecular weight polyethylene glycol, for example. The solid dosage forms of tablets, dragees, capsules, pills, and granules can be prepared with coatings and shells such as enteric coatings and other coatings well known in the pharmaceutical formulating art. They may optionally contain opacifying agents and may also be of a composition such that they release the active ingredient(s) only, or preferentially, in a
certain part of the intestinal tract, and/or in delayed fashion. Examples of embedding compositions that can be used include polymeric substances and waxes.
According to a further aspect of the invention there is provided a method to immunise an animal to at least the second part of the chimeric polypeptide according to the invention, comprising administering an effective amount of a chimeric polypeptide or preparation according to the invention sufficient to stimulate an immune response to said second part.
In a preferred method of the invention said animal is human.
In an alternative preferred method of the invention said animal is selected from the group consisting of: mouse; rat; hamster; goat; cow, horse, pig, dog, cat, or sheep.
In a further preferred method of the invention said immune response is the production of antibodies to said chimeric polypeptide.
In an alternative preferred method of the invention said immune response is the production of T-helper cells which recognise the second part of said chimeric polypeptide.
A preferred route of administration is oral. However, intradermal, subcutaneous, intramuscular or intranasal immunisation is within the scope of the invention.
According to a yet further aspect of the invention there is provided an antibody obtainable by the method according to the invention.
In a preferred embodiment of the invention said antibody is a therapeutic antibody.
In a further preferred embodiment of the invention said antibody is a diagnostic antibody. Preferably said diagnostic antibody is provided with a label or tag.
In a preferred embodiment of the invention said antibody is a monoclonal antibody or binding fragment thereof. Preferably said antibody is a humanised or chimeric antibody.
A chimeric antibody is produced by recombinant methods to contain the variable region of an antibody with an invariant or constant region of a human antibody. A humanised antibody is produced by recombinant methods to combine the complementarity determining regions (CDRs) of an antibody with both the constant (C) regions and the framework regions from the variable (V) regions of a human antibody.
Chimeric antibodies are recombinant antibodies in which all of the V-regions of a mouse or rat antibody are combined with human antibody C-regions. Humanised antibodies are recombinant hybrid antibodies which fuse the comphmentarity determining regions from a rodent antibody V-region with the framework regions from the human antibody V-regions. The C-regions from the human antibody are also used. The complimentarity determining regions (CDRs) are the regions within the N- terminal domain of both the heavy and light chain of the antibody to where the majority of the variation of the V-region is restricted. These regions form loops at the surface of the antibody molecule. These loops provide the binding surface between the antibody and antigen.
Antibodies from non-human animals provoke an immune response to the foreign antibody and its removal from the circulation. Both chimeric and humanised antibodies have reduced antigenicity when injected to a human subject because there is a reduced amount of rodent (i.e. foreign) antibody within the recombinant hybrid antibody, while the human antibody regions do not elicit an immune response. This results in a weaker immune response and a decrease in the clearance of the antibody. This is clearly desirable when using therapeutic antibodies in the treatment of human
diseases. Humanised antibodies are designed to have less "foreign" antibody regions and are therefore thought to be less immunogenic than chimeric antibodies.
It is also possible to create single variable regions, so called single chain antibody variable region fragments (scFv's). If a hybridoma exists for a specific monoclonal antibody it is well within the knowledge of the skilled person to isolate scFv's from mRNA extracted from said hybridoma via RT PCR. Alternatively, phage display screening can be undertaken to identify clones expressing scFv's. Alternatively said fragments are "domain antibody fragments". Domain antibodies are the smallest binding part of an antibody (approximately 13kDa). Examples of this technology is disclosed in US6, 248, 516, US6, 291, 158, US6,127, 197 and EP0368684 which are all incorporated by reference in their entirety.
In a further preferred embodiment of the invention said antibodies are opsonic antibodies.
Phagocytosis is mediated by macrophages and polymorphic leukocytes and involves the ingestion and digestion of micro-organisms, damaged or dead cells, cell debris, insoluble particles and activated clotting factors. Opsonins are agents which facilitate the phagocytosis of the above foreign bodies. Opsonic antibodies are therefore antibodies which provide the same function. Examples of opsonins are the Fc portion of an antibody or compliment C3. Antibodies raised by immunisation and in the form of an immune complex with antigen may bring about opsonisation via the fixation of complement on the antigen, or molecules in its immediate microenvironment.
In a further aspect of the invention there is provided a method for preparing a hybridoma cell-line producing monoclonal antibodies according to the invention comprising the steps of: i) immunising an immunocompetent mammal with a chimeric polypeptide or preparation according to the invention;
ii) fusing lymphocytes of the immunised immunocompetent mammal with myeloma cells to form hybridoma cells; iii) screening monoclonal antibodies produced by the hybridoma cells of step (ii) for binding activity to polypeptide of the invention; iv) culturing the hybridoma cells to proliferate and/or to secrete said monoclonal antibody; and v) recovering the monoclonal antibody from the culture supernatant.
Preferably, said immunocompetent mammal is a rodent, for example a mouse, rat or hamster.
According to a further aspect of the invention there is provided a hybridoma cell-line obtainable by the method according to the invention.
According to a further aspect of the invention there is provided a method to prepare a vaccine preparation according to the invention comprising; i) forming a cell extract from a plant cell culture or plant according to the invention; and ii) separating said extract into an oil body rich fraction.
In a preferred embodiment of the invention said extract is incubated with a protease which cleaves said second part from said first part.
Throughout the description and claims of this specification, the word "comprise" and variations of this word, for example "comprising" and "comprises", means "including but not limited to", and is not intended to (and does not) exclude other moieties, additives, components, integers or steps.
An embodiment of the invention will be described by example only and with reference to the following figures;
Figure la is the nucleic acid sequence of OLB1; Figure lb is the amino acid sequence ofOLBl;
Figure 2a is the nucleic acid sequence of OLB 2; Figure 2b is the amino acid sequence of OLB2;
Figure 3 is the amino acid sequence of the hydrophobic domain of OLB 1;
Figure 4a and 4b is the nucleic acid sequence of a spider silk protein; and
Table 1 shows the nucleic acid sequences of plant GTase cDNA's.
Materials and Methods
Materials- Castor beans (Ricinus communis var. gibsonii) were supplied by Chiltern Seeds (Ulverston, Cumbria, UK). The beans were soaked in running water for 1 d and germinated in the dark on moist vermiculite at 30°C. All reagents were obtained from Sigma-Aldrich Co. (Gillingham, UK), except for glycerol tri[l-14C]oleate that was from Amersham Biosciences UK Ltd. (Little Chalfont, UK) and Flavobacterium meningsepticum propyl endopeptidase (PEP) from Seikagaku America (Falmouth, USA).
Preparation of oil body membranes and proteolytic treatment — Oil bodies were isolated from the endosperm of imbibed castor beans and the membranes delipidated using the methods of Hills et al., (2). Peripheral proteins were removed from the oil bodies using urea buffer, according to Millichip et al, (3). The membrane fraction was solublized in SDS-loading buffer, heated at 70°C for 10 min and the polypeptides separated on a 14% SDS-PAGE gel as described by Laemmli (5) and stained using 0.2% (w/v) Coomassie blue R-250 in methanol: acetic acid: water (4:1:5 v/v/v). For proteolytic treatments purified urea washed oil bodies were incubated in 0.1M phosphate buffer (pH 7) containing 0.2 U ml"1 of PEP for lh at 37°C. The oil bodies were recovered by centrifugation.
Tryptic digestion and peptide sequence analysis - fri-gel tryptic digestion was performed by washing the gel pieces in Eppendorf tubes three times with 50% (v:v) aqueous acetonitrile containing 25 mM ammonium bicarbonate and drying the gel pieces in a vacuum concentrator for 30 min. Sequencing-grade modified porcine trypsin (Promega) was dissolved in the 50 mM acetic acid supplied by the manufacturer, then diluted 5-fold by adding 25 mM ammonium bicarbonate containing 0.1% (v.v) octyl-β-D-glucopyranoside (Sigma) to give a final trypsin concentration is 0.02 μg/μL. Gel pieces were rehydrated by adding 10 μL of trypsin solution, and after 30 min enough trypsin solution was added to cover the gel. Digests were incubated overnight at 37°C.
Tryptic digests were applied directly to the MALDI target plate. Dried peptides were redissolved in 0.5 μL of formic acid followed by addition of 9.5 μL of water, and one or more 0.5 μL aliquots were spotted and dried onto the MALDI target plate. A matrix solution containing 10 mg/mL 4-hydroxy-c-cyano-cinnamic acid (Sigma) in 50% aqueous (v:v) acetonitrile containing 0.1%) TFA (v:v) was freshly prepared. Serial 2-fold dilutions of matrix solution were spotted onto the MALDI target plate, and the dilution that gave a uniform layer of crystals when viewed in the mass spectrometer was selected for use with samples. A 0.5 μL aliquot of matrix solution was pipetted onto the dried peptide spot. In some cases, the matrix was added directly to the sample droplet on the target plate before drying.
Positive-ion MALDI mass spectra were obtained using an Applied Biosystems 4700 Proteomics Analyzer (CTS version, Applied Biosystems, Foster City, CA, USA) in reflection mode with an accelerating voltage of 20 kV. MS spectra were acquired with a total of 1000 laser pulses over a mass range of m z 800-4000. Final mass spectra were the summation of 20 sub-spectra, each acquired with 50 laser pulses, and internally calibrated using the tryptic peptides at m/z 842.509 and 2211.104. Monoisotopic masses were obtained from centroids of raw, unsmoothed data.
For CID-MS/MS, a Source 1 accelerating voltage of 8 kV, a collision energy of 1 kV, and a Source 2 accelerating voltage of 15 kV were used. Air was used as the collision gas at the instrument's 'medium' pressure setting with a recharge threshold of 9.9xl0"7 torr, which produced a Source 2 pressure of about lxlO"6 torr. The precursor mass window was set to +/-10 "Da", and the metastable suppressor was enabled. The default calibration was used for MS/MS spectra.
Mass spectral data obtained in batch mode were submitted to database searching using a locally-running copy of the Mascot program (Matrix Science Ltd., versionl.7). Batch-acquired MS and MS/MS spectral data were submitted to a combined peptide mass fingerprint and MS/MS ion search through the Applied Biosystems GPS Explorer software interface (version 1.0) to Mascot. Search criteria included: Maximum missed cleavages, 1; Variable modifications, Oxidation (M); Peptide tolerance, 25 ppm; MS/MS tolerance, 0.2 Da. Peptide sequence tags were generated from CID-MS/MS spectra by manual interpretation or using a de novo sequencing program supplied by Applied Biosystems.
RNA extraction, cDNA synthesis and PCR - Total RNA from various tissues was isolated using the RNeasy kit from Qiagen. The synthesis of single stranded cDNA was carried out using Superscript™ II RNase H" reverse transcriptase from Invitrogen. Degenerate primers corresponding to peptide sequences 1 and 2 were designed (5'-ttgatagtirtyagyttyaga and 5'-ctgtccraatgtrtaiarctt) and used to amplify a fragment of the acid lipase (RcOBLl) cDNA from imbibed seed endosperm. Gene specific primers (5'-gaccacttggtatgggcatatgatgg and 5'-catgtcattgcagtaaaccaccctga) were then used to obtain the full-length cDNA sequence by 3'- and 5 '-RACE using the SMART™ RACE cDNA Amplification kit from BD Biosciences, following the manufacturers protocols. Primers to a castor actin-like gene (RcACT) (5'- cgttctctccttgtatgccagtggtc and 5'-gagctgctcttggcagtctcaagttc) were used as a constitutive control for RT-PCR experiments on various tissues.
A partial clone of RcOBL2 was obtained by performing PCR on cDNA from four-day old endosperm using different combinations of four pairs of degenerate primers designed using alignments of multiple OBL-like genes (5'- rtyagyttyagaggiachgarc or 5'-rtyagyttyagaggiachgayc or 5'-rtyagyttyagaggcachgarc or 5'- rtyagyttyagaggcachgayc with 5'-giaccatrtcrttrttrtabac or 5'-giaccatrtcrttrcartabac or 5'-giaciayrtcrttrttrtabac or 5'-giaciayrtcrttrcartabac). Gene specific primers (5'- taggtctgggcaacagaagtgacgctac and 5'-tgcccaaatgtgtatatgttcagcaacc) were then used to obtain the full-length RcOBLl cDNA sequence by 3'- and 5 '-RACE using the SMART™ RACE cDNA Amplification kit from BD Biosciences, following the manufacturers protocols.
ABBREVIATIONS
The abbreviations used are: TAG, triacylglycerol; CHAPS, 3-cholamidopropyl- dimethylammonio-1-propanesulphonate; PEP, propyl endopeptidase; TFA, trifluoroacetic acid; MALDI, matrix-assisted laser desorption/ionization; MS, mass spectrometry; CE), collision-induced dissociation; PCR, polymerase chain reaction; RT-PCR, reverse transcriptase-PCR; RACE-PCR, rapid amplification if cDNA ends- PCR; PAGE, polyacrylamide gel electrophoresis; MBP, maltose binding protein.
REFERENCES
1. Fuchs, C. Vine, N. and Hills, M. J. (1996) J. Plant Physiol. 149, 23-29.
2. Hills, M.J., Murphy, DJ. and Beevers, H. (1989) Plant Physiol. 89, 1006-1010.
3. Millichip, M.5 Tatham, A.S., Jackson, F., Griffiths, G., Shewry, P.R. and Stobart, K.A. (1996) Biochem. J. 314, 333-337.
4. Bradford, M.M. (1976) Anal Biochem. 72, 248-254. 5. Laemmli, U.K. (1970) Nature 227, 680-685.
>UGT71B1 TABLE 1
ATGAAAGTAGAACTTGTGTTCATACCATCGCCGGGCGTTGGCCATATCCGAGCAACAACG GCGTTAGCAAAGCTTCTCGTTGCCAGCGACAACCGCCTCTCCGTCACTCTCATCGTCATT CCTTCACGAGTCTCCGACGACGCTTCTTCCTCCGTCTACACGAACTCCGAAGACCGTCTC CGCTACATCCTCCTCCCCGCCCGAGATCAAACTACTGATCTCGTATCTTACATCGACAGC CAGAAACCACAAGTAAGAGCCGTCGTGTCCAAGGTCGCTGGAGATGTTTCAACACGTTCA GACTCACGGCTAGCTGGGATTGTCGTAGACATGTTCTGCACGTCCATGATAGACATCGCC GATGAGTTTAACCTCTCGGCTTATATCTTCTACACGTCCAACGCTTCTTATCTCGGGCTA CAGTTCCACGTTCAATCTCTTTACGACGAGAAAGAACTCGACGTAAGTGAGTTCAAAGAT ACGGAGATGAAGTTTGACGTTCCAACTCTGACTCAGCCTTTTCCGGCAAAATGTTTGCCT TCAGTGATGCTAAACAAGAAATGGTTTCCTTACGTTTTGGGTCGAGCTAGAAGTTTTAGA GCAACGAAGGGTATTTTGGTAAATTCGGTGGCTGACATGGAACCTCAGGCGTTGAGTTTC TTTTCCGGTGGAAATGGGAATACAAATATCCCTCCGGTGTACGCGGTTGGGCCCATTATG GACTTAGAATCTAGCGGCGATGAAGAGAAGAGAAAGGAGATTTTACATTGGCTAAAAGAG CAACCGACGAAATCTGTAGTGTTTCTCTGTTTTGGGAGCATGGGAGGTTTCAGTGAGGAA CAAGCAAGAGAAATAGCTGTGGCGCTCGAGCGAAGCGGACACAGGTTTCTCTGGTCGCTT CGCCGCGCTTCTCCTGTTGGAAACAAGTCTAATCCTCCTCCCGGAGAATTCACGAACTTA GAGGAGATTCTTCCAAAAGGGTTTTTAGATCGGACGGTGGAGATAGGGAAGATCATAAGC TGGGCACCACAAGTAGATGTGTTGAATAGTCCTGCTATAGGAGCGTTCGTGACACATTGT GGATGGAACTCAATTCTCGAGAGTCTTTGGTTCGGTGTTCCGATGGCGGCGTGGCCTATC TATGCTGAGCAACAGTTTAACGCGTTTCATATGGTGGATGAGCTTGGTTTAGCGGCGGAG GTAAAGAAGGAGTACCGTAGAGATTTTCTGGTGGAGGAGCCGGAGATTGTGACGGCTGAT GAGATAGAGAGAGGGATCAAGTGTGCGATGGAGCAGGATAGCAAGATGAGGAAGAGGGTG ATGGAGATGAAGGATAAGCTCCACGTGGCGTTGGTGGACGGTGGATCTTCGAACTGTGCT CTAAAGAAGTTTGTTCAAGACGTGGTCGATAATGTTCCATAA
>UGT71B2
ATGAAACTGGAGCTGGTGTTCATACCATCACCTGGTGACGGACATCTCCGGCCATTAGTG GAGGTAGCTAAGCTTCATGTTGACCGTGACGACCATCTCTCCATCACCATCATCATCATC CCTCAGATGCATGGATTTAGTAGCAGTAACTCTTCTTCTTACATCGCTTCTCTCTCCTCT GATTCTGAAGAACGTCTTAGCTACAACGTTCTCTCCGTCCCTGATAAACCAGACTCCGAT GACACCAAACCACATTTTTTCGACTACATTGATAACTTCAAGCCGCAGGTCAAAGCCACG GTGGAAAAACTTACTGACCCGGGTCCACCAGATTCGCCGTCGCGTCTTGCTGGATTCGTG GTGGATATGTTTTGCATGATGATGATTGATGTCGCTAATGAGTTTGGTGTTCCCAGTTAC ATGTTTTACACATCCAACGCAACGTTTCTTGGATTGCAAGTTCATGTTGAATACCTTTAC GACGTTAAGAACTATGACGTTAGTGACCTCAAGGACTCGGACACTACTGAGCTGGAAGTT CCTTGTTTGACTCGTCCTTTACCGGTTAAGTGTTTCCCCTCGGTTCTATTAACCAAGGAG TGGTTACCGGTTATGTTTAGACAAACCAGAAGATTCCGAGAAACTAAAGGTATTTTGGTA AATACATTCGCTGAGCTTGAGCCTCAAGCTATGAAGTTTTTCTCCGGCGTAGATAGTCCT CTGCCTACGGTGTACACAGTTGGACCGGTTATGAATCTTAAAATCAACGGTCCAAATTCA TCTGACGATAAGCAATCGGAGATCCTACGGTGGCTAGACGAGCAGCCACGTAAATCCGTT GTTTTCCTCTGTTTCGGAAGCATGGGAGGTTTCCGTGAGGGCCAAGCTAAAGAAATCGCA ATCGCGCTTGAGCGAAGTGGTCACCGCTTTGTCTGGTCTCTTCGTCGTGCTCAACCAAAA GGATCGATAGGACCTCCCGAAGAATTTACGAATCTTGAGGAAATTCTCCCGGAAGGATTC TTGGAACGGACGGCAGAGATAGGAAAGATTGTAGGTTGGGCTCCACAAAGCGCCATTCTA GCAAATCCTGCGATCGGAGGGTTCGTGTCGCATTGTGGATGGAACTCGACGCTAGAGAGT CTATGGTTCGGAGTTCCGATGGCTACGTGGCCGCTTTACGCAGAGCAACAAGTTAACGCG TTCGAGATGGTTGAGGAGCTAGGGCTAGCGGTGGAGGTCCGAAATAGTTTCCGAGGAGAT TTCATGGCGGCGGATGATGAGTTGATGACGGCAGAGGAGATAGAGAGAGGGATCCGGTGT TTGATGGAGCAGGATAGTGACGTGAGGAGTAGAGTGAAGGAGATGAGCGAGAAGAGTCAC GTAGCTTTAATGGACGGTGGATCTTCGCACGTTGCTCTTCTAAAGTTTATTCAAGACGTC ACTAAGAATATCTCTTGA
>UGT71B5
ATGAAGATTGAGCTTGTGTTCATACCTTTGCCGGGGATTGGTCATCTCAGGCCAACCGTG AAGCTAGCGAAGCAACTCATAGGCAGCGAAAACCGTCTTTCGATCACCATAATCATCATC CCTTCAAGATTTGACGCCGGTGATGCATCCGCCTGTATCGCATCTCTCACCACGTTGTCT CAAGATGATCGCCTCCATTACGAATCCATATCCGTCGCAAAACAACCACCAACCTCCGAC CCGGATCCTGTTCCGGCTCAAGTGTACATAGAGAAACAAAAGACGAAAGTGAGAGATGCA GTCGCGGCGAGAATCGTCGATCCAACAAGAAAGCTCGCGGGATTCGTGGTGGACATGTTC TGTTCCTCGATGATCGATGTAGCTAACGAGTTTGGAGTTCCGTGTTATATGGTATACACA TCGAACGCTACGTTTTTAGGAACCATGCTTCACGTTCAACAAATGTACGATCAAAAGAAG TATGACGTCAGCGAGTTAGAAAACTCGGTCACCGAGTTGGAGTTTCCGTCTCTGACTCGT CCTTATCCAGTGAAGTGTCTTCCTCATATCCTCACTTCAAAGGAGTGGTTACCTCTCTCT CTAGCTCAAGCTAGGTGTTTCCGGAAGATGAAGGGTATTTTGGTAAATACAGTTGCTGAG CTTGAACCTCACGCTTTGAAAATGTTCAATATTAATGGTGACGATCTTCCTCAAGTTTAT CCTGTTGGACCAGTGTTGCATCTCGAAAACGGCAATGACGATGATGAGAAGCAATCGGAA ATTTTGCGGTGGCTCGACGAGCAACCGTCTAAATCTGTTGTGTTTCTCTGCTTTGGGAGC TTGGGAGGTTTCACTGAAGAACAAACAAGAGAAACCGCTGTGGCCCTAGATAGAAGCGGT CAGCGGTTTCTTTGGTGTCTTCGTCACGCATCGCCAAATATAAAAACAGATCGTCCCAGA GATTACACGAATCTTGAGGAGGTTTTACCGGAGGGGTTCTTGGAACGGACTTTGGATAGA GGGAAAGTGATTGGATGGGCACCACAAGTGGCGGTACTAGAGAAGCCGGCGATAGGAGGG TTTGTCACTCACTGCGGTTGGAACTCTATTTTAGAGAGCTTGTGGTTCGGTGTTCCAATG GTGACGTGGCCGCTATACGCGGAACAGAAGGTTAACGCGTTTGAGATGGTTGAGGAGCTG GGTTTGGCGGTGGAGATACGGAAGTACTTAAAAGGAGATTTGTTCGCCGGAGAGATGGAG ACGGTTACCGCGGAGGATATAGAGAGAGCCATTAGGCGTGTGATGGAGCAAGACAGTGAC GTTAGGAACAACGTGAAAGAGATGGCGGAGAAGTGCCACTTCGCGTTAATGGACGGTGGA TCTTCGAAGGCGGCTTTGGAAAAGTTTATTCAAGACGTGATAGAGAATATGGATTAA
>UGT71B6
ATGAAAATAGAGCTAGTATTCATTCCCTCTCCGGCAATTAGTCATCTCATGGCGACGGTA GAGATGGCGGAGCAACTAGTTGATAAAAACGACAACCTCTCTATCACCGTAATCATCATA TCTTTTAGTTCTAAAAATACATCCATGATCACCTCTCTTACATCCAACAACCGCCTCCGG TACGAAATAATCTCCGGAGGAGATCAACAACCAACGGAGCTCAAAGCAACTGATTCCCAC ATCCAAAGTCTAAAGCCACTGGTGAGAGACGCGGTTGCTAAACTCGTAGATTCCACTCTA CCAGACGCGCCTCGTCTTGCGGGATTCGTTGTTGACATGTACTGCACGTCGATGATCGAT GTCGCTAACGAATTTGGCGTCCCTAGTTACTTGTTTTACACCTCTAACGCTGGATTTCTT GGACTTTTGCTTCACATTCAGTTCATGTACGATGCAGAGGATATCTATGACATGAGCGAA TTAGAAGACTCTGACGTAGAGTTGGTGGTTCCGAGTTTGACTAGTCCTTATCCGTTGAAA TGTCTTCCTTACATTTTCAAATCAAAAGAGTGGCTCACTTTTTTTGTAACTCAAGCGAGA AGATTCAGAGAAACTAAGGGCATTTTGGTAAACACGGTTCCTGACTTGGAACCTCAAGCG TTGACGTTTCTTTCCAATGGTAACATTCCACGTGCTTACCCAGTAGGACCATTGTTGCAT CTCAAAAACGTAAATTGTGATTACGTGGACAAGAAGCAATCGGAGATTTTACGGTGGCTA GACGAGCAACCGCCAAGATCTGTAGTGTTCCTCTGTTTCGGGAGCATGGGAGGGTTCAGT GAGGAACAAGTGAGAGAAACCGCATTAGCTCTCGATCGAAGCGGCCACCGGTTTCTTTGG TCTCTCCGTCGTGCATCTCCGAATATATTGAGAGAGCCTCCCGGAGAATTCACAAACCTA GAGGAGATTCTCCCAGAAGGGTTTTTCGATCGGACGGCTAACAGAGGAAAGGTTATCGGA TGGGCTGAACAGGTGGCCATATTGGCGAAGCCGGCGATCGGAGGTTTTGTTTCTCACGGC GGATGGAATTCGACGTTGGAGAGTTTGTGGTTTGGTGTTCCGATGGCGATTTGGCCGCTT TACGCTGAACAGAAGTTTAACGCTTTCGAGATGGTGGAAGAGCTTGGTTTGGCTGTGGAG ATCAAGAAGCATTGGCGAGGAGATCTTTTGTTGGGGAGGTCGGAGATTGTGACGGCGGAG GAGATTGAGAAAGGAATCATATGTTTGATGGAGCAAGACAGTGACGTCAGGAAGAGAGTG AATGAGATCAGCGAGAAGTGCCACGTGGCTTTAATGGACGGTGGATCGTCAGAAACTGCT TTGAAAAGATTTATTCAAGACGTAACGGAGAATATTGCTTGGTCGGAAACTGAAAGCTAG
>UGT71B7
ATGAAATTTGAGCTTGTTTTCATCCCCTATCCCGGAATCGGTCATCTCCGATCAACGGTA GAAATGGCAAAGCTACTAGTGGACCGTGAAACTCGTCTCTCTATCTCCGTTATCATCCTT
CCTTTCATTTCCGAAGGCGAAGTCGGTGCTTCCGATTACATCGCAGCCCTCTCCGCCTCA TCCAACAACCGCCTCCGCTACGAAGTTATCTCCGCCGTAGATCAACCAACCATCGAGATG ACGACAATTGAAATCCATATGAAGAACCAAGAACCAAAGGTGAGAAGCACCGTTGCAAAA CTCCTTGAAGACTATTCGTCTAAACCGGACTCGCCGAAGATCGCTGGCTTTGTTCTAGAC ATGTTTTGCACTTCGATGGTAGATGTAGCGAACGAGTTTGGTTTCCCGAGTTATATGTTT TACACCTCCAGTGCCGGGATTCTCTCAGTTACATATCATGTTCAAATGTTGTGCGATGAG AACAAGTACGATGTTAGTGAAAATGATTATGCAGACTCGGAAGCTGTGTTGAACTTTCCG AGTTTGAGTCGTCCTTATCCGGTGAAGTGTCTTCCTCACGCTCTGGCAGCTAATATGTGG CTCCCGGTGTTTGTAAACCAAGCGAGAAAGTTTAGGGAGATGAAAGGTATTTTGGTAAAT ACTGTTGCTGAGCTTGAACCTTATGTGTTAAAGTTTCTTTCTAGTAGTGATACTCCTCCT GTTTATCCTGTTGGACCATTGTTGCATCTTGAGAACCAACGTGATGATTCTAAGGACGAG AAACGGTTGGAGATTATACGGTGGTTGGATCAGCAACCACCAAGTTCGGTTGTGTTTCTC TGCTTTGGGAGCATGGGAGGCTTCGGTGAGGAACAAGTAAGAGAGATCGCAATCGCGTTA GAGCGAAGTGGGCACCGGTTTCTCTGGTCTCTTCGTCGCGCATCTCCGAATATATTCAAA GAACTTCCAGGAGAGTTTACTAATCTAGAGGAAGTTCTCCCGGAAGGATTCTTTGATCGA ACGAAAGATATAGGTAAAGTGATTGGATGGGCTCCACAAGTAGCCGTTCTTGCGAATCCG GCTATAGGAGGTTTCGTAACTCATTGCGGGTGGAATTCTACGCTAGAGAGTCTTTGGTTT GGTGTTCCAACAGCTGCATGGCCGTTATACGCAGAGCAGAAGTTCAATGCTTTCTTAATG GTGGAGGAGCTTGGATTGGCGGTGGAGATAAGGAAGTATTGGCGAGGTGAACATTTGGCG GGATTACCGACGGCTACTGTGACAGCGGAGGAGATAGAGAAAGCAATCATGTGTCTAATG GAACAAGATAGTGACGTGAGGAAAAGAGTGAAGGATATGAGCGAGAAATGCCATGTGGCT TTAATGGATGGTGGATCGTCGCGTACTGCGTTGCAAAAGTTTATTGAAGAGGTTGCGAAG AATATAGTTTCACTAGATAAGGAATTTGAGCATGTAGCTCTTAAATGA
>UGT71B8
ATGAACAAATTTGCGCTTGTCTTCGTACCATTTCCTATACTTGGTCATCTCAAATCAACC GCCGAGATGGCTAAGCTACTAGTGGAGCAAGAAACTCGCCTCTCTATCTCCATTATCATC CTTCCTCTTCTTTCCGGAGACGACGTCAGTGCTTCCGCTTATATCTCAGCTCTTTCCGCC GCATCCAACGACCGCCTTCACTATGAAGTGATCTCGGACGGAGATCAACCAACCGTCGGG TTACATGTCGATAACCACATCCCGATGGTGAAACGTACCGTTGCAAAACTCGTTGATGAC TACTCAAGGCGGCCGGACTCGCCGAGGCTCGCTGGTTTAGTTGTTGACATGTTTTGTATC TCGGTGATAGACGTGGCTAATGAGGTTAGTGTTCCGTGTTACTTGTTTTACACGTCAAAC GTTGGGATTCTTGCTCTTGGGTTACATATTCAGATGTTGTTTGATAAGAAGGAGTACAGT GTCAGTGAAACTGATTTTGAAGACTCGGAAGTTGTGTTGGATGTTCCGAGTTTGACTTGT CCTTATCCGGTGAAGTGTCTTCCTTATGGTTTGGCAACGAAAGAGTGGCTTCCTATGTAT CTAAATCAAGGTAGAAGATTCAGAGAGATGAAAGGTATTTTGGTAAATACTTTTGCTGAG CTTGAACCTTATGCGTTGGAGTCTCTTCACTCTAGTGGTGATACTCCTCGTGCTTATCCA GTGGGACCATTGTTGCATCTCGAGAACCATGTTGACGGTTCTAAAGACGAGAAGGGTTCG GACATTTTACGGTGGTTAGATGAACAACCACCTAAATCGGTAGTGTTCCTCTGCTTTGGA AGCATAGGAGGCTTTAACGAGGAACAAGCAAGAGAAATGGCCATTGCACTTGAGAGAAGT GGTCACCGCTTCTTGTGGTCTCTTCGCCGTGCATCTCGAGATATAGATAAGGAACTTCCC GGAGAATTCAAGAATCTTGAAGAAATTCTCCCGGAAGGATTCTTTGATCGGACAAAGGAT AAAGGAAAGGTGATCGGATGGGCTCCACAAGTAGCCGTGCTGGCTAAGCCAGCAATCGGA GGTTTTGTTACTCATTGCGGGTGGAACTCGATACTCGAGAGTCTTTGGTTCGGTGTTCCT ATAGCGCCATGGCCGTTATACGCTGAGCAGAAGTTTAATGCTTTCGTGATGGTGGAGGAG CTTGGTTTGGCAGTGAAGATAAGAAAGTATTGGCGAGGCGATCAGTTGGTGGGAACGGCG ACGGTCATAGTGACGGCAGAGGAGATAGAGAGAGGAATCAGATGTTTGATGGAGCAAGAT AGTGACGTGAGGAATAGAGTGAAGGAGATGAGTAAGAAATGTCACATGGCTTTAAAGGAT GGTGGCTCGTCTCAATCTGCTTTGAAATTATTTATTCAAGACGTTACGAAGTATATTGCT TGA
>UGT71C1
ATGGGGAAGCAAGAAGATGCAGAGCTCGTCATCATACCTTTCCCTTTCTCCGGACACATT CTCGCAACAATCGAACTCGCCAAACGTCTCATAAGTCAAGACAATCCTCGGATCCACACC
ATCACCATCCTCTATTGGGGATTACCTTTTATTCCTCAAGCTGACACAATCGCTTTCCTC CGATCCCTAGTCAAAAATGAGCCTCGTATCCGTCTCGTTACGTTGCCCGAAGTCCAAGAC CCTCCACCAATGGAACTCTTTGTGGAATTTGCCGAATCTTACATTCTTGAATACGTCAAG AAAATGGTTCCCATCATCAGAGAAGCTCTCTCCACTCTCTTGTCTTCCCGCGATGAATCG GGTTCAGTTCGTGTGGCTGGATTGGTTCTTGACTTCTTCTGCGTCCCTATGATCGATGTA GGAAACGAGTTTAATCTCCCTTCTTACATTTTCTTGACGTGTAGCGCAGGGTTCTTGGGT ATGATGAAGTATCTTCCAGAGAGACACCGCGAAATCAAATCGGAATTCAACCGGAGCTTC AACGAGGAGTTGAATCTCATTCCTGGTTATGTCAACTCTGTTCCTACTAAGGTTTTGCCG TCAGGTCTATTCATGAAAGAGACCTACGAGCCTTGGGTCGAACTAGCAGAGAGGTTTCCT GAAGCTAAGGGTATTTTGGTTAATTCATACACAGCTCTCGAGCCAAACGGTTTTAAATAT TTCGATCGTTGTCCGGATAACTACCCAACCATTTACCCAATCGGGCCGATATTATGCTCC AACGACCGTCCGAATTTGGACTCATCGGAACGAGATCGGATCATAACTTGGCTAGATGAC CAACCCGAGTCATCGGTCGTGTTCCTCTGTTTCGGGAGCTTGAAGAATCTCAGCGCTACT CAGATCAACGAGATAGCTCAAGCCTTAGAGATCGTTGACTGCAAATTCATCTGGTCGTTT CGAACCAACCCGAAGGAGTACGCGAGCCCTTACGAGGCTCTACCACACGGGTTCATGGAC CGGGTCATGGATCAAGGCATTGTTTGTGGTTGGGCTCCTCAAGTTGAAATCCTAGCCCAT AAAGCTGTGGGAGGATTCGTATCTCATTGTGGTTGGAACTCGATATTGGAGAGTTTGGGT TTCGGCGTTCCAATCGCCACGTGGCCGATGTACGCGGAACAACAACTAAACGCGTTCACG ATGGTGAAGGAGCTTGGTTTAGCCTTGGAGATGCGGTTGGATTACGTGTCGGAAGATGGA GATATAGTGAAAGCTGATGAGATCGCAGGAACCGTTAGATCTTTAATGGACGGTGTGGAT GTGCCGAAGAGTAAAGTGAAGGAGATTGCTGAGGCGGGAAAAGAAGCTGTGGACGGTGGA TCTTCGTTTCTTGCGGTTAAAAGATTCATCGGTGACTTGATCGACGGCGTTTCTATAAGT AAGTAG
>UGT71C2
ATGGCGAAGCAGCAAGAAGCAGAGCTCATCTTCATCCCATTTCCAATCCCCGGACACATT CTCGCCACAATCGAACTCGCGAAACGTCTCATCAGTCACCAACCTAGTCGGATCCACACC ATCACCATCCTCCATTGGAGCTTACCTTTTCTTCCTCAATCTGACACTATCGCCTTCCTC AAATCCCTAATCGAAACAGAGTCTCGTATCCGTCTCATTACCTTACCCGATGTCCAAAAC CCTCCACCAATGGAGCTATTTGTGAAAGCTTCCGAATCTTACATTCTTGAATACGTCAAG AAAATGGTTCCTTTGGTCAGAAACGCTCTCTCCACTCTCTTGTCTTCTCGTGATGAATCG GATTCAGTTCATGTCGCCGGATTAGTTCTTGATTTCTTCTGTGTCCCTTTGATCGATGTC GGAAACGAGTTTAATCTCCCTTCTTACATCTTCTTGACGTGTAGCGCAAGTTTCTTGGGT ATGATGAAGTATCTTCTGGAGAGAAACCGCGAAACCAAACCGGAACTTAACCGGAGCTCT GACGAGGAAACAATATCAGTTCCTGGTTTTGTTAACTCCGTTCCGGTTAAAGTTTTGCCA CCGGGTTTGTTCACGACTGAGTCTTACGAAGCTTGGGTCGAAATGGCGGAAAGGTTCCCT GAAGCCAAGGGTATTTTGGTCAATTCATTTGAATCTCTAGAACGTAACGCTTTTGATTAT TTCGATCGTCGTCCGGATAATTACCCACCCGTTTACCCAATCGGGCCAATTCTATGCTCC AACGATCGTCCGAATTTGGATTTATCGGAACGAGACCGGATCTTGAAATGGCTCGATGAC CAACCCGAGTCATCTGTTGTGTTTCTCTGCTTCGGGAGCTTGAAGAGTCTCGCTGCGTCT CAGATTAAAGAGATCGCTCAAGCCTTAGAGCTCGTCGGAATCAGATTCGTCTGGTCGATT CGAACGGACCCGAAGGAGTACGCGAGCCCGAACGAGATTTTACCGGACGGGTTTATGAAC CGAGTCATGGGTTTGGGCCTTGTTTGTGGTTGGGCTCCTCAAGTTGAAATTCTGGCCCAT AAAGCAATTGGAGGGTTCGTGTCACACTGCGGTTGGAACTCGATATTGGAGAGTTTGCGT TTCGGAGTTCCAATTGCCACGTGGCCAATGTACGCGGAACAACAACTAAACGCGTTCACG ATTGTGAAGGAGCTTGGTTTGGCGTTGGAGATGCGGTTGGATTACGTGTCGGAATATGGA GAAATCGTGAAAGCTGATGAAATCGCAGGAGCCGTACGATCTTTGATGGACGGTGAGGAT GTGCCGAGGAGGAAACTGAAGGAGATTGCGGAGGCGGGAAAAGAGGCTGTGATGGACGGT GGATCTTCGTTTGTTGCGGTTAAAAGATTCATAGATGGGCTTTGA
>UGT71C3
ATGAAAGCAGAAGCAGAGATCATCTTCGTTACATATCCATCCCCTGGTCATCTTCTTGTC TCCATTGAATTCGCTAAATCTCTCATCAAACGTGATGATCGCATCCACACCATCACCATC CTCTACTGGGCTTTACCTCTCGCTCCTCAAGCCCACCTTTTCGCTAAGTCCCTCGTTGCT
TCACAGCCTCGAATCCGTCTCCTTGCGTTGCCTGATGTTCAAAACCCTCCACCATTGGAA CTCTTCTTTAAAGCTCCCGAAGCTTATATTCTTGAGTCCACCAAGAAAACAGTTCCTTTA GTCAGAGACGCTCTCTCCACTCTAGTTTCTTCACGTAAAGAATCCGGTTCGGTTCGTGTA GTCGGTTTGGTTATCGATTTTTTTTGTGTTCCAATGATCGAAGTGGCAAACGAGCTTAAC CTTCCTTCTTACATCTTCCTAACGTGTAACGCTGGGTTTTTAAGTATGATGAAGTATCTC CCTGAGAGACATCGCATAACCACTTCTGAGCTAGATTTAAGCTCCGGCAACGTAGAACAT CCAATTCCTGGCTACGTCTGCTCCGTGCCGACGAAGGTTTTGCCTCCAGGTCTATTCGTG AGAGAGTCCTACGAGGCTTGGGTCGAGATTGCAGAGAAGTTCCCTGGAGCCAAGGGCATT TTGGTAAACTCAGTCACATGTCTTGAGCAGAATGCATTTGATTACTTCGCTCGTCTTGAT GAGAACTATCCTCCGGTTTACCCGGTCGGACCGGTTCTTAGTTTGAAGGATCGTCCGTCT CCAAATCTGGACGCATCGGACCGGGATCGGATCATGAGATGGCTCGAGGACCAGCCGGAG TCGTCAATTGTGTATATCTGCTTCGGAAGCCTCGGAATCATTGGCAAGCTGCAGATTGAA GAGATAGCTGAAGCCTTGGAACTCACCGGCCACAGGTTTCTTTGGTCAATACGTACAAAT CCGACGGAGAAAGCGAGCCCGTACGATCTGTTGCCGGAGGGATTTCTCGATCGGACGGCC AGTAAGGGATTGGTGTGTGATTGGGCCCCGCAAGTAGAAGTTCTGGCCCATAAAGCGCTC GGAGGATTCGTGTCTCACTGCGGTTGGAACTCTGTACTGGAGAGCTTATGGTTCGGTGTT CCGATCGCCACGTGGCCAATGTACGCTGAGCAACAGTTAAACGCATTCTCGATGGTGAAG GAGTTAGGGTTAGCCGTGGAGCTGCGTTTAGACTACGTTTCGGCGTACGGAGAGATAGTA AAAGCTGAGGAGATCGCGGGAGCCATACGATCATTGATGGACGGTGAGGATACGCCGAGG AAGAGAGTGAAGGAGATGGCGGAAGCGGCGAGGAATGCTTTGATGGACGGAGGATCTTCG TTTGTTGCGGTTAAACGATTTCTCGACGAGTTGATCGGCGGAGATGTTTAG
>UGT71C4
ATGGTGAAGGAAACAGAGCTAATCTTCATTCCAGTTCCATCCACAGGTCATATTCTCGTC CATATTGAATTCGCCAAGCGTCTCATCAATCTCGACCATCGGATCCACACCATCACTATT CTCAACTTATCCTCACCCTCTTCTCCTCACGCCTCCGTCTTCGCCAGATCTCTCATCGCT TCCCAGCCCAAAATCCGTCTCCACGACCTTCCCCCTAT.CCAAGATCCTCCTCCATTCGAT CTTTACCAAAGAGCTCCCGAAGCTTACATAGTAAAACTCATCAAGAAAAATACTCCTCTG ATAAAAGACGCCGTCTCCAGCATCGTCGCGTCGCGTCGTGGAGGCTCAGATTCGGTTCAA GTCGCCGGTTTGGTTCTCGATTTATTCTGCAATTCATTGGTAAAAGATGTTGGCAACGAG CTTAATCTTCCTTCTTACATATACCTTACGTGTAACGCTAGATACTTGGGGATGATGAAA TATATTCCGGATCGGCATCGGAAAATCGCATCTGAGTTCGATTTGAGCTCCGGCGATGAA GAATTGCCGGTTCCGGGATTCATAAACGCTATTCCGACGAAATTTATGCCGCCTGGATTG TTCAATAAGGAAGCTTACGAGGCTTACGTAGAGCTAGCGCCGAGATTCGCAGATGCGAAG GGTATTTTGGTTAATTCCTTCACGGAGCTTGAGCCGCACCCGTTTGACTATTTCTCTCAC CTGGAGAAATTCCCTCCGGTTTACCCGGTCGGACCGATTCTCAGCTTGAAAGATCGAGCG AGTCCGAACGAAGAAGCAGTCGATCGGGATCAGATCGTTGGGTGGCTCGATGATCAGCCG GAGTCATCGGTGGTGTTCCTCTGTTTCGGGAGCAGAGGAAGCGTTGATGAGCCGCAAGTG AAGGAGATAGCTCGAGCTTTGGAACTCGTCGGCTGCAGATTTCTTTGGTCAATTAGAACA AGCGGCGACGTCGAGACGAATCCTAACGATGTGTTGCCGGAGGGGTTCATGGGCCGAGTA GCAGGCCGAGGTTTGGTATGTGGTTGGGCTCCACAAGTGGAAGTGTTGGCCCATAAAGCA ATAGGAGGATTTGTGTCTCACTGTGGTTGGAACTCCACGCTTGAAAGCTTATGGTTCGGG GTTCCTGTCGCAACGTGGCCGATGTACGCAGAGCAACAGCTTAACGCCTTCACGCTGGTG AAAGAGCTTGGGCTTGCGGTGGACCTGCGGATGGATTACGTGTCGAGTCGTGGGGGTTTG GTGACTTGTGATGAGATAGCCAGAGCCGTACGATCTTTGATGGACGGTGGAGATGAGAAG AGAAAAAAGGTTAAGGAGATGGCTGATGCGGCAAGGAAGGCTTTGATGGATGGAGGATCG TCTTCTTTGGCAACTGCTCGATTCATCGCAGAATTGTTTGAAGATGGTTCGTCGTGCTAA
>UGT71C5
ATGAAGACAGCAGAGCTCATATTCGTTCCTCTGCCGGAGACCGGCCATCTCTTGTCAACG ATCGAGTTTGGAAAGCGTCTACTCAATCTAGACCGTCGGATTTCTATGATTACAATCCTC TCCATGAATCTTCCTTACGCTCCTCACGCCGACGCTTCTCTTGCTTCGCTAACAGCCTCC
GAGCCTGGTATCCGAATCATCAGTCTCCCGGAGATCCACGATCCACCTCCGATCAAGCTT CTTGACACTTCCTCCGAGACTTACATCCTCGATTTCATCCATAAAAACATACCTTGTCTC AGAAAAACCATCCAAGATTTAGTCTCATCATCATCATCTTCCGGAGGTGGTAGTAGTCAT GTCGCCGGCTTGATTCTTGATTTCTTCTGCGTTGGTTTGATCGACATCGGCCGTGAGGTA AACCTTCCTTCCTATATCTTCATGACTTCCAACTTTGGTTTCTTAGGGGTTCTACAGTAT CTCCCGGAACGACAACGTTTGACTCCGTCGGAGTTCGATGAGAGCTCCGGCGAGGAAGAG TTACATATTCCGGCGTTTGTGAACCGTGTTCCCGCCAAGGTTCTGCCGCCAGGTGTGTTC GATAAACTCTCTTACGGGTCTCTGGTCAAAATCGGCGAGCGATTACATGAAGCCAAGGGT ATTTTGGTTAATTCATTTACCCAAGTGGAGCCTTATGCTGCTGAACATTTTTCTCAAGGA CGAGATTACCCTCACGTGTATCCTGTTGGGCCGGTTCTCAACTTAACGGGCCGTACAAAT CCGGGTCTAGCTTCGGCCCAATATAAAGAGATGATGAAGTGGCTTGACGAGCAACCAGAC TCGTCGGTTTTGTTCCTGTGTTTCGGGAGCATGGGAGTCTTCCCTGCACCTCAGATCACA GAGATTGCTCACGCGCTCGAGCTTATCGGGTGCAGGTTCATCTGGGCGATCCGTACGAAC ATGGCGGGAGATGGCGATCCTCAGGAGCCGCTTCCAGAAGGATTTGTCGATCGAACAATG GGCCGTGGAATTGTGTGTAGTTGGGCTCCACAAGTGGATATCTTGGCCCACAAGGCAACA GGTGGATTCGTTTCTCACTGCGGGTGGAATTCCGTCCAAGAGAGTCTATGGTACGGTGTA CCTATTGCAACGTGGCCAATGTATGCGGAGCAACAACTGAACGCATTTGAGATGGTGAAG GAGTTGGGCTTAGCAGTGGAGATAAGGCTTGACTACGTGGCGGATGGTGATAGGGTTACT TTGGAGATCGTGTCAGCCGATGAAATAGCCACAGCCGTCCGATCATTGATGGATAGTGAT AACCCCGTGAGAAAGAAGGTTATAGAAAAATCTTCAGTGGCGAGGAAAGCTGTTGGTGAT GGTGGGTCTTCTACGGTGGCCACATGTAATTTTATCAAAGATATTCTTGGGGATCACTTT TGA
>UGT71D1
ATGCGGAATGTAGAGCTCATCTTCATCCCCACACCAACCGTTGGTCATCTTGTTCCGTTT CTTGAATTTGCTAGGCGTCTCATTGAGCAAGATGATAGGATCCGTATCACAATCCTCTTG ATGAAACTACAAGGTCAGTCTCATCTAGACACTTATGTTAAATCAATTGCCTCCTCTCAA CCGTTTGTTAGATTCATTGATGTCCCTGAGTTAGAGGAGAAACCTACACTTGGTAGTACA CAATCTGTGGAAGCTTATGTGTATGATGTTATTGAGAGAAATATCCCTCTTGTGAGGAAT ATAGTCATGGATATTTTAACTTCTCTTGCATTGGATGGAGTTAAGGTCAAGGGATTAGTT GTTGACTTTTTCTGTCTCCCTATGATTGACGTTGCTAAAGATATAAGTCTCCCTTTCTAT GTGTTCTTGACTACAAATTCCGGGTTCTTAGCTATGATGCAGTATCTAGCAGATCGACAT AGTAGAGATACATCGGTTTTTGTAAGAAACTCGGAAGAAATGTTGTCGATACCTGGATTT GTAAACCCTGTCCCAGCCAATGTTCTGCCGTCAGCTCTGTTTGTTGAAGATGGTTATGAT GCTTACGTTAAGCTGGCCATATTGTTTACAAAGGCCAATGGAATCCTAGTGAATAGCTCC TTTGATATTGAGCCTTACTCTGTGAATCATTTTCTTCAAGAACAGAATTATCCTTCTGTT TATGCTGTTGGCCCCATATTTGACTTGAAAGCCCAGCCTCATCCAGAGCAGGACCTAACC CGTCGTGACGAGTTGATGAAATGGCTTGATGATCAACCCGAGGCATCGGTTGTATTCCTT TGTTTTGGGAGTATGGCAAGGTTAAGAGGTTCTCTAGTGAAGGAAATAGCTCATGGACTT GAGCTATGTCAATATAGATTCCTCTGGTCACTCCGTAAAGAAGAGGTGACAAAGGATGAT TTGCCAGAGGGGTTCCTTGACCGTGTCGATGGACGTGGAATGATATGTGGTTGGTCTCCT CAGGTAGAAATACTGGCCCATAAGGCAGTGGGAGGCTTTGTTTCTCACTGTGGATGGAAC TCAATAGTAGAGAGTTTGTGGTTTGGCGTGCCAATTGTGACATGGCCAATGTATGCAGAG CAACAACTCAATGCGTTTCTGATGGTGAAGGAACTGAAGCTAGCTGTGGAGCTGAAGCTT GATTACAGGGTACATAGTGATGAGATAGTAAACGCAAACGAGATAGAGACCGCTATTCGT TATGTAATGGACACGGATAATAATGTTGTGAGGAAACGAGTGATGGATATCTCGCAGATG ATCCAGAGAGCTACGAAGAATGGTGGATCTTCGTTTGCCGCAATTGAGAAATTCATATAT GACGTGATAGGAATTAAGCCCTAG
>UGT71D2
ATGAGGAATGCAGAGCTCATCTTCATCCCAACACCAACTGTTGGTCATCTTGTTCCGTTT CTTGAATTTGCTAGGCGTCTCATTGAGCAGGATGATAGAATCCGTATCACCTTCCTCTTG
ATGAAGCAACAAGGTCAGTCTCATCTGGATTCCTATGTTAAGACAATTTCCTCGTCTCTG CCGTTTGTTAGATTTATTGATGTCCCTGAGTTAGAGGAGAAACCAACACTTGGTACACAG TCTGTGGAAGCCTATGTGTACGATTTTATTGAAACAAATGTCCCTCTTGTGCAAAATATA ATCATGGGTATCCTATCTTCTCCTGCATTTGATGGAGTTACGGTCAAGGGATTCGTTGCT GATTTTTTCTGTCTCCCGATGATTGATGTTGCAAAAGATGCAAGTCTTCCTTTTTATGTG TTCTTGACTTCAAATTCCGGATTCCTAGCTATGATGCAGTATCTGGCATATGGACATAAG AAAGATACCTCAGTTTTTGCAAGAAACTCTGAAGAAATGTTGTCAATTCCTGGATTTGTA AACCCTGTCCCAGCCAAAGTACTGCCGTCAGCTCTGTTTATTGAGGATGGTTATGATGCT GACGTTAAACTGGCTATATTGTTTACAAAGGCTAATGGAATCCTAGTGAATACCTCCTTT GATATTGAGCCTACCTCTCTGAATCATTTTCTTGGAGAAGAGAATTACCCTTCTGTTTAT GCTGTTGGCCCCATATTTAACCCGAAGGCCCATCCTCATCCAGATCAAGACCTCGCCTGT TGTGACGAGTCGATGAAATGGCTTGATGCTCAACCCGAGGCATCAGTTGTATTCCTTTGT TTTGGGAGTATGGGTAGCTTAAGAGGTCCTCTAGTGAAGGAAATAGCACATGGACTTGAG CTATGTCAGTATAGATTCCTCTGGTCACTCCGCACAGAAGAAGTGACAAATGATGATCTT TTGCCAGAGGGATTCATGGACCGTGTCAGTGGACGGGGAATGATATGCGGTTGGTCTCCT CAGGTGGAAATACTGGCCCATAAAGCAGTGGGAGGTTTTGTTTCTCATTGTGGATGGAAC TCAATAGTAGAGAGTTTATGGTTTGGTGTGCCAATTGTGACATGGCCAATGTATGCAGAG CAACAGCTCAATGCGTTTCTGATGGTGAAGGAACTGAAGCTCGCAGTGGAGCTGAAACTC GATTATAGTGTACATAGTGGTGAGATTGTAAGTGCAAACGAGATAGAGACAGCGATTTCT TGTGTAATGAACAAGGATAATAATGTTGTGAGGAAACGAGTGATGGATATCTCGCAGATG ATCCAGAGAGCTACGAAGAATGGTGGATCTTCGTTTGCCGCAATTGAGAAATTCATACAT GACGTGATAGGAACCAGGACTTAG
>UGT72B1
ATGGAGGAATCCAAAACACCTCACGTTGCGATCATACCAAGTCCGGGAATGGGTCATCTC ATACCACTCGTCGAGTTTGCTAAACGACTCGTCCATCTTCACGGCCTCACCGTTACCTTC GTCATCGCCGGCGAAGGTCCACCATCAAAAGCTCAGAGAACCGTCCTCGACTCTCTCCCT TCTTCAATCTCCTCCGTCTTTCTCCCTCCTGTTGATCTCACCGATCTCTCTTCGTCCACT CGCATCGAATCTCGGATCTCCCTCACCGTGACTCGTTCAAACCCGGAGCTCCGGAAAGTC TTCGACTCGTTCGTGGAGGGAGGTCGTTTGCCAACGGCGCTCGTCGTCGATCTCTTCGGT ACGGACGCTTTCGACGTGGCCGTAGAATTTCACGTGCCACCGTATATTTTCTACCCAACA ACGGCCAACGTCTTGTCGTTTTTTCTCCATTTGCCTAAACTAGACGAAACGGTGTCGTGT GAGTTCAGGGAATTAACCGAACCGCTTATGCTTCCTGGATGTGTACCGGTTGCCGGGAAA GATTTCCTTGACCCGGCCCAAGACCGGAAAGACGATGCATACAAATGGCTTCTCCATAAC ACCAAGAGGTACAAAGAAGCCGAAGGTATTCTTGTGAATACCTTCTTTGAGCTAGAGCCA AATGCTATAAAGGCCTTGCAAGAACCGGGTCTTGATAAACCACCGGTTTATCCGGTTGGA CCGTTGGTTAACATTGGTAAGCAAGAGGCTAAGCAAACCGAAGAGTCTGAATGTTTAAAG TGGTTGGATAACCAGCCGCTCGGTTCGGTTTTATATGTGTCCTTTGGTAGTGGCGGTACC CTCACATGTGAGCAGCTCAATGAGCTTGCTCTTGGTCTTGCAGATAGTGAGCAACGGTTT CTTTGGGTCATACGAAGTCCTAGTGGGATCGCTAATTCGTCGTATTTTGATTCACATAGC CAAACAGATCCATTGACATTTTTACCACCGGGATTTTTAGAGCGGACTAAAAAAAGAGGT TTTGTGATCCCTTTTTGGGCTCCACAAGCCCAAGTCTTGGCGCATCCATCCACGGGAGGA TTTTTAACTCATTGTGGATGGAATTCGACTCTAGAGAGTGTAGTAAGCGGTATTCCACTT ATAGCATGGCCATTATACGCAGAACAGAAGATGAATGCGGTTTTGTTGAGTGAAGATATT CGTGCGGCACTTAGGCCGCGTGCCGGGGACGATGGGTTAGTTAGAAGAGAAGAGGTGGCT AGAGTGGTAAAAGGATTGATGGAAGGTGAAGAAGGCAAAGGAGTGAGGAACAAGATGAAG GAGTTGAAGGAAGCAGCTTGTAGGGTGTTGAAGGATGATGGGACTTCGACAAAAGCACTT AGTCTTGTGGCCTTAAAGTGGAAAGCCCACAAAAAAGAGTTAGAGCAAAATGGCAACCAC TAA
•>UGT72B2
ATGCAAAAAATGGCAGATGGAAACACTCCACATGTAGCAATCATACCAAGTCCCGGTATA
GGTCACCTCATCCCACTCGTCGAGTTAGCAAAGCGACTCCTTGACAATCACGGTTTCACC GTCACTTTCATCATCCCCGGCGATTCTCCTCCGTCTAAGGCTCAAAGATCCGTTCTCAAC TCTCTCCCTTCCTCCATAGCCTCCGTCTTCCTCCCTCCCGCCGATCTTTCCGACGTTCCT TCGACAGCTCGAATCGAAACTCGGATATCGCTCACCGTGACTCGTTCCAACCCGGCGCTC CGGGAGCTTTTTGGCTCGTTATCGGCGGAGAAACGTCTCCCGGCGGTTCTCGTCGTCGAT CTATTTGGTACGGATGCGTTCGACGTGGCTGCTGAGTTCCACGTGTCGCCATACATTTTC TATGCATCAAATGCCAACGTCCTCACGTTTCTGCTTCACTTGCCGAAGCTAGACGAAACG GTGTCGTGTGAGTTTAGGGAATTAACCGAACCGGTTATTATTCCCGGTTGTGTCCCCATA ACCGGTAAGGATTTCGTCGATCCGTGTCAAGACCGAAAAGATGAATCATACAAATGGCTT CTACACAACGTCAAGAGATTCAAAGAAGCTGAAGGGATTCTAGTGAATTCCTTCGTCGAT TTAGAGCCAAACACTATAAAGATTGTACAAGAACCGGCTCCTGATAAACCACCGGTTTAC CTGATTGGGCCGTTGGTTAACTCGGGTTCACACGATGCTGACGTGAACGATGAGTACAAA TGTTTAAATTGGCTAGACAACCAACCATTCGGGTCGGTTCTATACGTATCCTTTGGAAGC GGCGGAACACTCACGTTTGAGCAGTTCATTGAGCTGGCTCTTGGCCTAGCGGAGAGTGGA AAACGGTTTCTTTGGGTCATACGAAGTCCGAGTGGGATAGCTAGTTCATCGTATTTCAAT CCACAAAGCCGAAATGATCCATTTTCGTTTTTACCACAAGGCTTCTTAGACCGAACCAAA GAAAAAGGTCTAGTGGTTGGGTCATGGGCTCCACAGGCTCAAATTCTGACTCATACATCT ATAGGTGGATTTTTAACTCATTGTGGATGGAATTCGAGTCTAGAAAGTATTGTAAACGGT GTACCGCTCATAGCATGGCCGTTATACGCGGAGCAAAAGATGAACGCATTGCTACTCGTG GATGTTGGTGCGGCTCTAAGAGCACGACTGGGTGAAGACGGGGTCGTAGGAAGGGAAGAA GTGGCGAGAGTGGTAAAAGGATTGATAGAAGGAGAAGAAGGGAATGCGGTAAGGAAAAAA ATGAAAGAGTTGAAAGAAGGATCTGTTAGAGTCTTAAGGGACGATGGATTCTCTACCAAA TCGCTTAATGAAGTTTCGTTGAAGTGGAAAGCCCACCAACGAAAGATCGACCAAGAACAG GAATCATTTCTATGA
>UGT72B3
ATGAGCATAGATATTTTTCAAGAAATAAGAATAAAGAAAATTCTACTCTTAATGGCGGAA GCAAACACTCCACACATAGCAATCATGCCGAGTCCCGGTATGGGTCACCTTATCCCATTC GTCGAGTTAGCAAAGCGACTCGTTCAGCACGACTGTTTCACCGTCACAATGATCATCTCC GGTGAAACTTCGCCGTCTAAGGCACAAAGATCCGTTCTCAACTCTCTCCCTTCCTCCATA GCCTCCGTATTTCTCCCTCCCGCCGATCTTTCCGATGTTCCCTCCACAGCGCGAATCGAA ACTCGGGCCATGCTCACCATGACTCGTTCCAATCCGGCGCTCCGGGAGCTTTTTGGCTCT TTATCAACGAAGAAAAGTCTCCCGGCGGTTCTCGTCGTCGATATGTTTGGTGCGGATGCG TTCGACGTGGCCGTTGACTTCCACGTGTCACCATACATTTTCTATGCATCCAATGCAAAC GTCTTGTCGTTTTTTCTTCACTTGCCGAAACTAGACAAAACGGTGTCGTGTGAGTTTAGG TACTTAACCGAACCGCTTAAGATTCCCGGCTGTGTCCCGATAACCGGTAAGGACTTTCTT GATACGGTTCAAGACCGAAACGACGACGCATACAAATTGCTTCTCCATAACACCAAGAGG TACAAAGAAGCTAAAGGGATTCTAGTGAATTCCTTCGTTGATTTAGAGTCGAATGCAATA AAGGCCTTACAAGAACCGGCTCCTGATAAACCAACGGTATACCCGATTGGGCCGCTGGTT AACACAAGTTCATCTAATGTTAACTTGGAAGACAAGTTCGGATGTTTAAGTTGGCTAGAC AACCAACCATTCGGCTCGGTTCTATACATATCATTTGGAAGCGGCGGAACACTTACATGT GAGCAGTTTAATGAGCTTGCTATTGGTCTTGCGGAGAGCGGAAAACGGTTTATTTGGGTC ATACGAAGTCCAAGCGAGATAGTTAGTTCGTCGTATTTCAATCCACACAGCGAGACAGAC CCCTTTTCGTTTTTACCAATTGGGTTCTTAGACCGAACCAAAGAGAAAGGTTTGGTGGTT CCATCATGGGCTCCACAGGTTCAAATCCTGGCTCATCCATCCACATGCGGGTTTTTAACA CACTGTGGATGGAATTCGACCTTAGAAAGCATTGTAAACGGTGTACCACTCATAGCGTGG CCTTTATTCGCGGAGCAAAAGATGAATACATTGCTACTCGTGGAGGATGTTGGAGCGGCT CTAAGAATCCATGCGGGTGAAGATGGGATTGTACGGAGGGAAGAAGTGGTGAGAGTGGTG AAGGCACTGATGGAAGGTGAAGAGGGAAAAGCCATAGGAAATAAAGTGAAGGAGTTGAAA GAAGGAGTTGTTAGAGTCTTGGGTGACGATGGATTGTCCAGCAAGTCATTTGGTGAAGTT TTGTTAAAGTGGAAAACGCACCAGCGAGATATCAACCAAGAGACGTCCCACTAA
>UGT72C1
ATGGAACTTCACGGAGCTCTAGTGGCTAGTCCGGGCATGGGACATGCCGTACCCATCTTA GAACTCGGTAAACATCTCCTGAACCACCACGGGTTCGACCGTGTCACTGTCTTCCTAGTC ACAGACGATGTCTCACGTTCGAAATCCCTAATTGGAAAAACGTTGATGGAAGAAGATCCA AAATTTGTGATCAGGTTTATTCCACTCGATGTTTCGGGTCAAGATCTGAGTGGTTCACTA TTGACTAAACTAGCAGAGATGATGAGGAAGGCATTACCAGAGATCAAGTCTTCAGTCATG GAGTTAGAACCGCGGCCTAGGGTTTTCGTAGTTGACTTGTTGGGCACGGAAGCTTTAGAG GTGGCTAAGGAGCTTGGGATCATGAGAAAACATGTTCTGGTTACTACCAGTGCTTGGTTT CTAGCTTTTACGGTTTATATGGCGAGTCTTGACAAACAGGAGTTGTATAAGCAGTTGAGT AGCATAGGAGCATTGCTTATACCCGGATGCAGCCCGGTTAAGTTTGAGCGGGCTCAAGAT CCGAGAAAATATATTCGGGAACTCGCTGAGTCTCAGCGTATTGGGGATGAGGTGATAACC GCAGATGGGGTGTTTGTGAATACGTGGCACAGTCTGGAGCAAGTGACCATCGGGTCTTTC TTGGATCCAGAGAATCTCGGTCGGGTTATGAGAGGAGTGCCGGTTTATCCTGTTGGACCG CTGGTTAGACCAGCAGAACCAGGTTTGAAACATGGCGTGCTGGACTGGCTTGACTTACAA CCCAAAGAGTCAGTGGTTTATGTTCTTTTGGGAGTGGTGGGGGCACTAACCTTCGAGCAG ACAAACGAGCTGGCTTACGGTTTGGAGCTGACTGGCCACAGATTTGTTTGGGTAGTCAGA CCACCGGCTGAAGACGACCCATCGGCATCAATGTTCGACAAGACCAAGAATGAGACAGAA CCTCTCGATTTCTTACCCAACGGGTTTCTAGACCGAACCAAAGACATCGGTTTGGTGGTC CGTACATGGGCACCACAAGAAGAGATTCTGGCACACAAGTCAACAGGAGGGTTTGTGACT CACTGCGGATGGAACTCAGTTTTGGAGAGTATTGTGAATGGTGTGCCAATGGTAGCTTGG CCGTTGTACTCAGAGCAGAAGATGAACGCGAGGATGGTTTCTGGGGAGCTAAAGATTGCG TTGCAGATTAATGTTGCAGATGGGATTGTAAAGAAGGAGGTGATAGCTGAAATGGTGAAG AGAGTGATGGATGAAGAAGAAGGAAAAGAGATGAGAAAGAATGTTAAGGAACTGAAGAAG ACAGCAGAAGAAGCTCTCAACATGACTCACATTCCATCTGCTTACTTCACCTAA
>UGT72D1
ATGGACCAGCCTCACGCGCTTCTAGTGGCTAGCCCTGGCTTGGGTCACCTCATCCCTATC CTGGAGCTCGGCAACCGTCTCTCCTCCGTCCTAAACATCCACGTCACCATTCTCGCGGTC ACCTCCGGCTCCTCTTCACCGACAGAAACCGAAGCCATACATGCAGCCGCGGCTAGAACA ATCTGTCAAATTACGGAAATTCCCTCGGTGGATGTAGACAACCTCGTGGAGCCAGATGCT ACAATTTTCACTAAGATGGTGGTGAAGATGCGAGCCATGAAGCCCGCGGTACGAGATGCC GTGAAATTAATGAAACGAAAACCAACGGTCATGATTGTTGACTTTTTGGGTACGGAACTG ATGTCCGTAGCCGATGACGTAGGCATGACGGCTAAATACGTTTACGTTCCAACTCATGCG TGGTTCTTGGCAGTCATGGTGTACTTGCCGGTGTTAGATACGGTAGTGGAAGGTGAGTAT GTTGATATTAAGGAGCCTTTGAAGATACCGGGTTGTAAACCGGTCGGACCGAAGGAGCTG ATGGAAACGATGTTAGACCGGTCGGGCCAGCAATATAAAGAGTGTGTACGAGCTGGCTTA GAGGTACCTATGAGCGATGGTGTTTTGGTAAATACTTGGGAGGAGTTACAAGGAAACACT CTCGCTGCGCTTAGAGAGGACGAAGAATTGAGCCGGGTCATGAAAGTACCGGTTTATCCT ATTGGGCCAATTGTTAGGACTAACCAGCATGTAGACAAACCCAATAGTATATTCGAGTGG CTAGACGAGCAACGGGAAAGGTCAGTGGTGTTTGTGTGTTTAGGGAGCGGTGGAACGTTG ACGTTTGAGCAAACAGTGGAACTCGCTTTGGGTTTAGAGTTAAGTGGTCAAAGGTTCGTT TGGGTTCTACGTAGGCCCGCTTCATATCTCGGGGCGATCTCCAGCGATGATGAACAGGTA AGTGCCAGTCTACCTGAAGGTTTCTTGGACCGCACGCGTGGTGTGGGGATTGTGGTTACG CAATGGGCACCACAAGTTGAGATCTTGAGCCATAGATCGATCGGTGGGTTCTTGTCTCAC TGCGGTTGGAGTTCGGCTTTGGAAAGTTTGACTAAAGGAGTTCCGATCATCGCTTGGCCT CTTTATGCGGAGCAGTGGATGAATGCCACGTTATTGACTGAGGAGATCGGTGTGGCCGTT CGTACATCGGAGTTACCGTCGGAGAGAGTCATCGGAAGGGAAGAAGTGGCATCTCTGGTG AGAAAGATTATGGCGGAAGAGGATGAAGAAGGACAGAAAATTAGGGCTAAAGCTGAGGAG GTGAGGGTTAGCTCCGAACGAGCTTGGAGTAAAGACGGGTCATCTTATAATTCTCTATTC GAATGGGCAAAACGATGTTATCTTGTACCGTGA
>UGT72E1
ATGAAGATTACAAAACCACATGTGGCCATGTTCGCTAGCCCCGGAATGGGCCACATCATC CCGGTGATCGAGCTCGGAAAACGCTTAGCTGGTTCCCACGGCTTCGATGTCACCATTTTC GTCCTTGAAACCGACGCAGCCTCAGCTCAATCTCAATTCCTTAACTCACCAGGCTGCGAC GCGGCCCTTGTTGATATCGTTGGCCTCCCAACGCCCGATATCTCCGGTTTAGTCGACCCA TCAGCCTTTTTTGGGATCAAGCTCTTGGTCATGATGCGTGAGACCATTCCTACCATCCGG TCAAAGATAGAGGAGATGCAACACAAACCAACGGCTCTGATCGTAGACTTGTTTGGTTTG GACGCGATACCGCTCGGTGGTGAGTTCAACATGTTGACTTATATCTTCATCGCTTCAAAC GCACGTTTTCTCGCGGTGGCTTTGTTTTTCCCAACGTTGGACAAAGACATGGAAGAAGAG CACATAATCAAGAAGCAACCTATGGTTATGCCTGGATGTGAACCGGTTCGGTTTGAAGAT ACACTTGAAACATTCCTTGACCCAAACAGCCAACTCTACCGGGAATTTGTTCCTTTCGGT TCGGTTTTCCCAACGTGTGATGGTATTATTGTGAATACATGGGATGATATGGAGCCCAAA ACTTTGAAATCTCTTCAAGACCCAAAGCTCTTGGGTCGAATTGCTGGTGTACCGGTTTAT CCAATTGGTCCTTTGTCTAGACCGGTTGATCCATCTAAAACTAATCATCCGGTTTTGGAT TGGTTAAACAAACAGCCGGACGAGTCGGTACTTTACATTTCATTTGGAAGCGGTGGCTCT CTCTCGGCTAAACAACTAACCGAATTGGCTTGGGGACTTGAGATGAGTCAGCAACGGTTC GTTTGGGTGGTTCGACCCCCGGTGGACGGTTCAGCTTGCAGTGCATATTTATCCGCTAAC AGTGGTAAAATACGAGACGGTACACCTGATTATCTCCCGGAAGGTTTTGTTAGCCGGACT CATGAGAGAGGCTTTATGGTCTCTTCTTGGGCTCCCCAAGCGGAGATCTTGGCCCACCAA GCCGTAGGTGGGTTTCTAACTCACTGCGGTTGGAATTCGATTCTCGAGAGCGTCGTTGGT GGCGTTCCGATGATCGCGTGGCCACTTTTTGCGGAGCAGATGATGAACGCGACACTCCTC AACGAAGAGCTTGGCGTTGCCGTCCGCTCTAAGAAACTACCGTCGGAGGGAGTGATTACG AGGGCGGAGATCGAGGCGTTGGTGAGAAAGATCATGGTGGAGGAGGAAGGTGCTGAGATG AGAAAGAAGATAAAGAAGCTGAAAGAGACCGCTGCCGAATCGCTGAGTTGCGACGGTGGA GTGGCGCATGAATCGTTGTCAAGAATCGCCGACGAGAGCGAGCATCTTTTGGAGCGTGTC AGGTGCATGGCACGTGGTGCCTAG
>UGT72E2
ATGCATATCACAAAACCACACGCCGCCATGTTTTCCAGTCCCGGAATGGGCCATGTCATC CCGGTGATCGAGCTTGGAAAGCGTCTCTCCGCTAACAACGGCTTCCACGTCACCGTCTTC GTCCTCGAAACCGACGCAGCCTCCGCTCAATCCAAGTTCCTAAACTCAACCGGCGTCGAC ATCGTCAAACTTCCATCGCCGGACATTTATGGTTTAGTGGACCCCGACGACCATGTAGTG ACCAAGATCGGAGTCATTATGCGTGCAGCAGTTCCAGCCCTCCGATCCAAGATCGCTGCC ATGCATCAAAAGCCAACGGCTCTGATCGTTGACTTGTTTGGCACAGATGCGTTATGTCTC GCAAAGGAATTTAACATGTTGAGTTATGTGTTTATCCCTACCAACGCACGTTTTCTCGGA GTTTCGATTTATTATCCAAATTTGGACAAAGATATCAAGGAAGAGCACACAGTGCAAAGA AACCCACTCGCTATACCGGGGTGTGAACCGGTTAGGTTCGAAGATACTCTGGATGCATAT CTGGTTCCCGACGAACCGGTGTACCGGGATTTTGTTCGTCATGGTCTGGCTTACCCAAAA GCCGATGGAATTTTGGTAAATACATGGGAAGAGATGGAGCCCAAATCATTGAAGTCCCTT CTAAACCCAAAGCTCTTGGGCCGGGTTGCTCGTGTACCGGTCTATCCAATCGGTCCCTTA TGCAGACCGATACAATCATCCGAAACCGATCACCCGGTTTTGGATTGGTTAAACGAACAA CCGAACGAGTCGGTTCTCTATATCTCCTTCGGGAGTGGTGGTTGTCTATCGGCGAAACAG TTAACTGAATTGGCGTGGGGACTCGAGCAGAGCCAGCAACGGTTCGTATGGGTGGTTCGA CCACCGGTCGACGGTTCGTGTTGTAGCGAGTATGTCTCGGCTAACGGTGGTGGAACCGAA GACAACACGCCAGAGTATCTACCGGAAGGGTTCGTGAGTCGTACTAGTGATAGAGGTTTC GTGGTCCCCTCATGGGCCCCACAAGCTGAAATCCTGTCCCATCGGGCCGTTGGTGGGTTT TTGACCCATTGCGGTTGGAGCTCGACGTTGGAAAGCGTCGTTGGCGGCGTTCCGATGATC GCATGGCCACTTTTTGCCGAGCAGAATATGAATGCGGCGTTGCTCAGCGACGAACTGGGA ATCGCAGTCAGATTGGATGATCCAAAGGAGGATATTTCTAGGTGGAAGATTGAGGCGTTG GTGAGGAAGGTTATGACTGAGAAGGAAGGTGAAGCGATGAGAAGGAAAGTGAAGAAGTTG AGAGACTCGGCGGAGATGTCACTGAGCATTGACGGTGGTGGTTTGGCGCACGAGTCGCTT TGCAGAGTCACCAAGGAGTGTCAACGGTTTTTGGAACGTGTCGTGGACTTGTCACGTGGT GCTTAG
>UGT72E3
ATGCATATCACAAAACCACACGCCGCCATGTTTTCCAGTCCCGGAATGGGCCATGTCCTC CCGGTGATCGAGCTAGCTAAGCGTCTCTCCGCTAACCACGGCTTCCACGTCACCGTCTTC GTCCTTGAAACTGACGCAGCCTCCGTTCAGTCCAAGCTCCTTAACTCAACCGGTGTTGAC ATCGTCAACCTTCCATCGCCCGACATTTCTGGCTTGGTAGACCCCAACGCCCATGTGGTG ACCAAGATCGGAGTCATTATGCGTGAAGCTGTTCCAACCCTCCGATCCAAGATCGTTGCC ATGCATCAAAACCCAACGGCTCTGATCATTGACTTGTTTGGCACAGATGCGTTATGTCTT GCAGCGGAGTTAAACATGTTGACTTATGTCTTTATCGCTTCCAACGCGCGTTATCTCGGA GTTTCGATATATTATCCAACTTTGGACGAAGTTATCAAAGAAGAGCACACAGTGCAACGA AAACCGCTCACTATACCGGGGTGTGAACCGGTTAGATTTGAAGATATTATGGATGCATAT CTGGTTCCGGACGAACCGGTGTACCACGATTTGGTTCGTCACTGTCTGGCCTACCCAAAA GCGGATGGAATCTTGGTGAATACATGGGAAGAGATGGAGCCCAAATCATTAAAGTCCCTT CAAGACCCGAAACTTTTGGGCCGGGTCGCTCGTGTACCGGTTTATCCGGTTGGTCCGTTA TGCAGACCGATACAATCATCCACGACCGATCACCCGGTTTTTGATTGGTTAAACAAACAA CCAAACGAGTCGGTTCTCTACATTTCCTTCGGGAGTGGTGGTTCTCTAACGGCTCAACAG TTAACCGAATTGGCGTGGGGGCTCGAGGAGAGCCAGCAACGGTTTATATGGGTGGTTCGA CCGCCCGTTGACGGCTCGTCTTGCAGTGATTATTTCTCGGCTAAAGGCGGTGTAACCAAA GACAACACGCCAGAGTATCTACCAGAAGGGTTCGTGACTCGTACTTGCGATAGAGGTTTC ATGATCCCATCATGGGCACCGCAAGCTGAAATCCTAGCCCATCAGGCCGTTGGTGGGTTT TTAACACATTGTGGTTGGAGCTCGACGTTGGAAAGCGTCCTTTGCGGCGTTCCAATGATA GCGTGGCCGCTTTTCGCCGAGCAGAATATGAACGCGGCGTTGCTTAGCGATGAACTGGGA ATCTCTGTTAGAGTGGATGATCCAAAGGAGGCGATTTCTAGGTCGAAGATTGAGGCGATG GTGAGGAAGGTTATGGCTGAGGACGAAGGTGAAGAGATGAGAAGGAAAGTGAAGAAGTTG AGAGACACGGCGGAGATGTCACTTAGTATTCACGGTGGTGGTTCGGCGCATGAGTCGCTT TGCAGAGTCACGAAGGAGTGTCAACGGTTTTTGGAATGTGTCGGGGACTTGGGACGTGGT GCTTAG
>UGT73B1
ATGGGAACTCCTGTCGAAGTCTCTAAGCTCCATTTCTTGCTCTTCCCTTTCATGGCTCAT GGCCATATGATACCAACTCTAGACATGGCTAAGCTCTTTGCCACCAAAGGAGCTAAATCC ACTATCCTCACTACACCTCTCAATGCCAAGCTCTTCTTCGAGAAACCCATCAAATCATTC AACCAAGACAACCCGGGACTCGAAGACATCACCATCCAGATCCTTAATTTCCCTTGCACA GAGCTTGGTTTGCCTGATGGCTGTGAGAATACTGATTTCATCTTCTCCACACCTGACCTA AACGTAGGTGACTTGAGTCAAAAGTTTTTACTCGCAATGAAATATTTCGAAGAGCCACTA GAGGAGCTCCTCGTGACAATGAGACCAGACTGTCTTGTCGGTAACATGTTCTTCCCTTGG TCCACTAAAGTTGCTGAGAAGTTCGGAGTACCGAGACTTGTGTTCCACGGCACAGGCTAC TTCTCTTTATGTGCTTCTCATTGCATAAGGCTCCCTAAGAATGTGGCAACAAGTTCTGAG CCCTTTGTGATTCCTGATCTCCCGGGAGACATTTTGATTACAGAGGAACAGGTCATGGAG ACAGAAGAAGAGTCTGTAATGGGGAGGTTTATGAAGGCAATAAGAGACTCAGAGAGAGAT AGCTTTGGCGTGTTGGTGAACAGCTTCTACGAGCTTGAACAGGCTTACTCAGATTATTTC AAGAGCTTTGTGGCGAAAAGAGCGTGGCATATCGGTCCGCTTTCCTTAGGAAATAGAAAG TTCGAGGAGAAAGCAGAAAGAGGCAAAAAGGCAAGCATTGATGAGCATGAATGTTTGAAA TGGCTCGACTCCAAGAAATGTGATTCAGTGATTTACATGGCCTTTGGAACCATGTCTAGC TTTAAAAACGAGCAGCTGATAGAGATTGCAGCTGGTTTAGATATGTCAGGACATGATTTT GTCTGGGTGGTTAACAGAAAAGGCAGCCAAGTTGAGAAGGAAGATTGGTTACCAGAGGGG TTTGAAGAGAAGACCAAGGGAAAAGGATTGATAATCCGAGGGTGGGCGCCACAAGTGCTG ATACTTGAGCACAAAGCAATTGGCGGATTTTTGACGCATTGTGGATGGAACTCGTTATTA GAAGGGGTGGCAGCGGGCCTGCCAATGGTGACATGGCCCGTGGGAGCCGAGCAGTTCTAC AACGAGAAATTGGTGACACAAGTGTTGAAAACAGGAGTGAGTGTGGGAGTGAAGAAGATG ATGCAAGTAGTTGGAGACTTCATTAGCAGAGAGAAAGTGGAGGGAGCGGTGAGGGAAGTG ATGGTTGGAGAAGAGAGGAGGAAACGGGCCAAGGAGTTAGCAGAAATGGCGAAAAATGCG
GTGAAAGAAGGAGGATCTTCAGATCTAGAGGTAGATAGGTTGATGGAAGAGCTTACGTTA GTTAAACTGCAAAAAGAGAAGGTATAA
>UGT73B2
ATGGGTAGTGATCATCATCATCGAAAGCTCCACGTTATGTTCTTCCCTTTCATGGCTTAT GGTCACATGATACCAACTCTAGACATGGCTAAGCTTTTCTCTAGCAGAGGAGCCAAATCC ACAATCCTCACCACATCTCTCAACTCCAAGATCCTCCAAAAACCCATCGACACATTCAAG AATCTGAATCCGGGTCTCGAAATCGACATCCAGATCTTCAATTTCCCTTGCGTGGAGCTG GGGTTACCAGAAGGATGTGAAAACGTTGATTTCTTCACTTCAAACAACAATGATGATAAA AACGAGATGATCGTGAAATTCTTTTTCTCGACAAGGTTTTTCAAAGACCAGCTTGAGAAA CTCCTCGGGACAACGAGACCAGACTGTCTTATCGCCGACATGTTCTTCCCCTGGGCTACT GAAGCTGCTGGGAAGTTCAATGTGCCAAGACTTGTGTTCCACGGCACTGGCTACTTCTCT TTATGCGCTGGTTATTGCATCGGAGTGCATAAACCACAGAAGAGAGTGGCTTCAAGCTCT GAGCCATTTGTGATTCCCGAGCTCCCTGGGAACATTGTGATAACTGAAGAACAGATCATA GATGGCGATGGAGAATCCGACATGGGAAAGTTTATGACTGAAGTTAGGGAATCGGAAGTG AAGAGCTCAGGAGTTGTTTTGAATAGTTTCTACGAGCTAGAACATGATTACGCCGATTTT TACAAAAGTTGTGTACAAAAGAGAGCGTGGCATATCGGTCCGCTATCGGTTTACAACAGG GGATTTGAGGAGAAGGCTGAGAGAGGAAAGAAAGCGAACATTGATGAGGCTGAATGCCTC AAATGGCTTGACTCCAAGAAACCAAATTCAGTCATTTATGTTTCCTTTGGGAGCGTGGCT TTCTTCAAGAATGAACAGTTATTCGAGATCGCTGCAGGGTTAGAAGCTTCCGGTACAAGT TTCATTTGGGTTGTTAGGAAAACCAAAGTGATAGAGAAGAATGGTTACCAGAAGGGTTCG AAGAGAGGGTGAAAGGGAAAGGTATGATAATAAGAGGATGGGCAGCACAGGTGCTGATAC TTGACCACCAAGCAACCGGTGGGTTTGTGACCCATTGCGGCTGGAACTCGCTTCTTGAAG GAGTGGCTGCAGGGCTACCAATGGTGACATGGCCTGTAGGAGCGGAGCAATTCTACAATG AGAAATTGGTTACGCAAGTGCTCAGAACAGGAGTGAGCGTGGGAGCGAGCAAGCATATGA AAGTTATGATGGGAGATTTCATTAGCAGAGAGAAAGTGGATAAAGCGGTGAGGGAGGTTT TGGCTGGGGAAGCAGCAGAGGAGAGGCGGAGACGGGCAAAGAAGCTAGCGGCGATGGCTA AAGCTGCCGTGGAAGAAGGAGGGTCTTCCTTCAACGATCTAAACAGCTTCATGGAAGAGT TTAGTTCATAA
>UGT73B3
ATGAGTAGTGATCCTCATCGTAAGCTCCATGTTGTGTTCTTCCCTTTCATGGCTTATGGT CACATGATACCAACTCTAGACATGGCTAAGCTTTTCTCTAGCAGAGGAGCCAAATCTACA ATCCTCACCACACCTCTCAACTCCAAGATCTTCCAAAAACCCATCGAAAGATTCAAGAAC CTGAATCCGAGTTTCGAAATCGACATCCAGATCTTCGATTTCCCTTGCGTGGATCTCGGG TTACCAGAAGGATGCGAAAACGTCGATTTCTTCACCTCAAACAACAATGATGATAGACAG TATCTGACCTTGAAGTTCTTTAAGTCGACAAGGTTTTTCAAAGATCAGCTTGAGAAGCTC CTCGAGACAACGAGACCAGACTGTCTTATCGCCGACATGTTCTTCCCCTGGGCTACGGAA GCTGCTGAGAAGTTCAATGTGCCAAGACTTGTGTTCCACGGTACTGGCTACTTTTCTTTA TGCTCTGAATATTGCATCAGAGTGCATAACCCACAAAACATAGTAGCTTCAAGGTACGAG CCATTTGTGATTCCTGATCTCCCGGGGAACATAGTGATAACTCAAGAACAGATAGCAGAC CGTGACGAAGAAAGCGAGATGGGGAAGTTTATGATTGAGGTCAAAGAATCTGATGTGAAG AGCTCAGGTGTTATTGTAAACAGCTTCTACGAGCTTGAACCTGATTACGCCGACTTTTAC AAGAGTGTTGTACTGAAGAGAGCGTGGCATATCGGTCCGCTTTCGGTTTACAACAGAGGA TTTGAGGAGAAGGCTGAGAGAGGAAAGAAAGCAAGCATTAATGAGGTTGAATGCCTCAAA TGGCTTGACTCCAAGAAACCAGATTCAGTCATTTACATTTCTTTTGGGAGCGTGGCTTGC TTCAAGAACGAGCAGCTATTCGAGATCGCTGCAGGATTAGAAACTTCTGGAGCAAATTTC ATCTGGGTTGTTAGGAAAAACATAGGTATTGAAAAAGAAGAATGGTTACCAGAAGGGTTC GAAGAGAGGGTGAAAGGAAAAGGGATGATTATAAGAGGATGGGCACCACAGGTGCTCATA CTTGATCATCAAGCAACTTGTGGGTTTGTGACCCATTGCGGCTGGAACTCGCTTCTGGAA GGAGTGGCTGCAGGGCTACCAATGGTGACATGGCCTGTAGCAGCGGAGCAATTCTACAAT GAGAAATTGGTTACGCAAGTGCTCAGAACAGGAGTGAGCGTGGGAGCGAAAAAGAATGTA |
AGAACTACGGGAGATTTCATTAGCAGAGAGAAAGTGGTTAAAGCGGTGAGGGAGGTGTTG GTTGGGGAAGAGGCGGATGAGAGGCGGGAGAGGGCAAAGAAGTTGGCAGAGATGGCTAAA GCTGCCGTGGAAGGAGGGTCTTCTTTCAACGATCTAAACAGCTTCATAGAAGAGTTTACC TCGTAA
>UGT73B4
ATGAACAGAGAGCAAATTCATATTTTGTTCTTCCCCTTCATGGCTCATGGCCACATGATT CCACTCTTAGACATGGCCAAGCTTTTCGCTAGAAGAGGAGCCAAATCAACTCTCCTCACA ACCCCAATAAATGCTAAGATCTTGGAGAAACCCATTGAAGCATTCAAAGTTCAAAATCCT GATCTCGAAATCGGAATCAAGATCCTCAATTTCCCTTGTGTAGAGCTTGGATTGCCAGAA GGATGCGAGAACCGTGACTTCATTAACTCATACCAAAAATCTGACTCATTTGACTTGTTC TTGAAGTTTCTTTTCTCTACCAAGTATATGAAACAGCAGTTGGAGAGTTTCATTGAAACA ACCAAACCGAGTGCTCTTGTAGCCGATATGTTCTTCCCTTGGGCAACAGAATCCGCGGAG AAGATCGGTGTTCCAAGACTTGTGTTCCACGGCACATCATCCTTTGCCTTGTGTTGTTCG TATAACATGAGGATTCATAAGCCACACAAGAAAGTCGCTTCGAGTTCTACTCCATTTGTA ATCCCTGGTCTCCCTGGAGACATAGTTATTACAGAAGACCAAGCCAATGTCACCAACGAA GAAACTCCATTCGGAAAGTTTTGGAAAGAAGTCAGGGAATCAGAGACCAGTAGCTTTGGT GTTTTGGTGAATAGCTTCTACGAGCTGGAATCATCTTATGCTGATTTTTACCGTAGTTTT GTGGCGAAAAAAGCGTGGCATATAGGTCCACTTTCACTATCCAACAGAGGGATTGCAGAG AAAGCCGGAAGAGGGAAAAAGGCAAACATTGATGAGCAAGAATGCCTCAAATGGCTTGAC TCTAAGACACCTGGCTCAGTAGTTTACTTGTCCTTTGGTAGCGGAACCGGCTTACCCAAC GAACAGCTGTTAGAGATTGCTTTCGGCCTTGAAGGCTCTGGACAAAATTTCATTTGGGTG GTTAGCAAAAATGAAAACCAAGGTGAAAATGAAGATTGGTTGCCTAAAGGGTTTGAAGAG AGGAATAAAGGAAAAGGGCTGATAATACGCGGATGGGCCCCGCAAGTGCTGATACTTGAC CACAAAGCAATCGGAGGATTTGTGACGCATTGCGGATGGAACTCGACTTTGGAGGGCATT GCCGCAGGGCTGCCTATGGTGACTTGGCCGATGGGGGCAGAACAGTTCTACAACGAGAAG TTATTGACAAAAGTGTTGAGAATAGGAGTGAACGTTGGAGCTACCGAGTTGGTGAAAAAA GGAAAGTTGATTAGTAGAGCACAAGTGGAGAAGGCAGTAAGGGAAGTGATTGGTGGTGAG AAGGCAGAGGAAAGGCGGCTAAGGGCTAAGGAGCTGGGCGAGATGGCTAAAGCCGCTGTG GAAGAAGGAGGGTCTTCTTATAATGATGTGAACAAGTTTATGGAAGAGCTGAATGGTAGA AAGTAG
>UGT73B5
ATGAACAGAGAAGTCTCTGAGAGAATTCATATTTTGTTCTTCCCCTTCATGGCTCAAGGC CACATGATTCCAATTTTGGACATGGCCAAGCTTTTCTCGAGGAGAGGAGCCAAGTCAACC CTTCTCACAACCCCAATCAACGCTAAGATCTTCGAGAAACCTATTGAAGCATTCAAAAAT CAAAACCCTGATCTCGAAATCGGAATCAAGATCTTCAATTTCCCTTGTGTAGAGCTTGGA TTGCCTGAAGGATGCGAGAACGCTGACTTTATCAACTCATACCAAAAATCTGACTCAGGT GACTTGTTCTTGAAGTTTCTTTTCTCTACCAAGTATATGAAACAACAGTTGGAGAGTTTC ATTGAAACAACCAAACCAAGTGCTCTTGTTGCCGATATGTTCTTCCCTTGGGCGACAGAA TCTGCTGAGAAGCTCGGTGTACCAAGACTTGTGTTCCACGGTACATCTTTCTTTTCTTTG TGTTGTTCGTATAACATGAGGATTCATAAGCCACACAAGAAAGTCGCTACGAGTTCTACT CCTTTTGTAATCCCTGGTCTCCCAGGAGACATAGTTATTACAGAAGACCAAGCCAATGTT GCCAAAGAAGAAACGCCAATGGGAAAGTTTATGAAAGAGGTTAGGGAATCAGAGACCAAT AGCTTTGGTGTATTGGTTAATAGCTTCTACGAGCTGGAATCAGCTTATGCTGATTTTTAT CGTAGTTTTGTGGCGAAAAGAGCTTGGCATATCGGTCCGCTTTCGCTATCTAACAGAGAG TTAGGAGAGAAAGCCAGAAGAGGGAAAAAGGCTAACATTGATGAGCAAGAATGCCTAAAA TGGCTGGACTCTAAGACACCTGGTTCAGTAGTTTACTTGTCCTTTGGGAGCGGAACTAAT TTCACCAACGACCAGCTGTTAGAGATCGCTTTTGGTCTTGAAGGTTCTGGACAAAGTTTC ATCTGGGTGGTTAGGAAAAATGAAAACCAAGGTGACAATGAAGAGTGGTTGCCTGAAGGG TTTAAAGAGAGGACAACAGGGAAAGGGCTAATAATACCTGGATGGGCGCCGCAAGTGCTG ATACTTGACCATAAAGCAATTGGAGGATTTGTGACTCATTGCGGATGGAACTCGGCTATA
GAGGGCATTGCCGCGGGGCTGCCTATGGTAACATGGCCAATGGGGGCAGAACAGTTCTAC AATGAGAAGCTATTGACAAAAGTGTTGAGAATAGGAGTGAACGTTGGAGCTACCGAGTTG GTGAAAAAAGGAAAGTTGATTAGTAGAGCACAAGTGGAGAAGGCAGTAAGGGAAGTGATT GGTGGTGAGAAGGCAGAGGAAAGGCGGCTATGGGCTAAGAAGCTGGGCGAGATGGCTAAA GCCGCTGTGGAAGAAGGAGGGTCCTCTTATAATGATGTGAACAAGTTTATGGAAGAGCTG AATGGTAGAAAGTAG
>UGT73C1
ATGGCATCGGAATTTCGTCCTCCTCTTCATTTTGTTCTCTTCCCTTTCATGGCTCAAGGC CACATGATCCCAATGGTAGATATTGCAAGGCTCCTGGCTCAGCGCGGGGTGACTATAACC ATTGTCACTACACCTCAAAACGCAGGCCGGTTCAAGAACGTTCTTAGCCGGGCTATCCAA TCCGGCTTGCCCATCAATCTCGTGCAAGTAAAGTTTCCATCTCAAGAATCGGGTTCACCG GAAGGACAGGAGAATTTGGACTTGCTCGATTCATTGGGGGCTTCATTAACCTTCTTCAAA GCATTTAGCCTGCTCGAGGAACCAGTCGAGAAGCTCTTGAAAGAGATTCAACCTAGGCCA AACTGCATAATCGCTGACATGTGTTTGCCTTATACAAACAGAATTGCCAAGAATCTTGGT ATACCAAAAATCATCTTTCATGGCATGTGTTGCTTCAATCTTCTTTGTACGCACATAATG CACCAAAACCACGAGTTCTTGGAAACTATAGAGTCTGACAAGGAATACTTCCCCATTCCT AATTTCCCTGACAGAGTTGAGTTCACAAAATCTCAGCTTCCAATGGTATTAGTTGCTGGA GATTGGAAAGACTTCCTTGACGGAATGACAGAAGGGGATAACACTTCTTATGGTGTGATT GTTAACACGTTTGAAGAGCTCGAGCCAGCTTATGTTAGAGACTACAAGAAGGTTAAAGCG GGTAAGATATGGAGCATCGGACCGGTTTCCTTGTGCAACAAGTTAGGAGAAGACCAAGCT GAGAGGGGAAACAAGGCGGACATTGATCAAGACGAGTGTATTAAATGGCTTGATTCTAAA GAAGAAGGGTCGGTGCTATATGTTTGCCTTGGAAGTATATGCAATCTTCCTCTGTCTCAG CTCAAAGAGCTCGGCTTAGGCCTCGAGGAATCCCAAAGACCTTTCATTTGGGTCATAAGA GGTTGGGAGAAGTATAACGAGTTACTTGAATGGATCTCAGAGAGCGGTTATAAGGAAAGA ATCAAAGAAAGAGGCCTTCTCATAACAGGATGGTCGCCTCAAATGCTTATCCTTACACAT CCTGCCGTTGGAGGATTCTTGACACATTGTGGATGGAACTCTACTCTTGAAGGAATCACT TCAGGCGTTCCATTACTCACGTGGCCACTGTTTGGAGACCAATTCTGCAATGAGAAATTG GCGGTGCAGATACTAAAAGCCGGTGTGAGAGCTGGGGTTGAAGAGTCCATGAGATGGGGA GAAGAGGAGAAAATAGGAGTACTGGTGGATAAAGAAGGAGTAAAGAAGGCAGTGGAGGAA TTGATGGGTGATAGTAATGATGCTAAGGAGAGAAGAAAAAGAGTGAAAGAGCTTGGAGAA TTAGCTCACAAGGCTGTGGAAGAAGGAGGCTCTTCTCATTCCAACATCACATTCTTGCTA CAAGACATAATGCAATTAGAACAACCCAAGAAATGA
>UGT73C2
ATGGCTTTCGAGAAGACCCGCCAATTTCTTCCTCGGCTTCACTTTGTTCTCTTCCCTTTC ATGGCTCAAGGCCACATGATCCCCATGGTGGATATTGCAAGGATCTTGGCTCAGCGCGGG GTGACTATTACCATTGTCACGACGCCTCACAACGCAGCCAGGTTCAAAGATGTCCTAAAC CGGGCCATCCAGTCAGGCTTGCACATTAGGGTTGAGCATGTGAAGTTTCCTTTTCAAGAA GCTGGTTTGCAAGAAGGACAAGAGAATGTTGATTTTCTTGACTCAATGGAGTTAATGGTA CATTTCTTTAAAGCGGTTAACATGCTTGAAAATCCGGTCATGAAGCTCATGGAAGAGATG AAACCTAAACCAAGCTGCCTAATTTCTGATTTTTGTTTGCCTTATACAAGCAAAATCGCT AAGAGGTTCAATATCCCAAAGATCGTTTTCCATGGCGTGTCTTGCTTTTGTCTTTTGAGT ATGCATATTCTACACCGAAACCACAATATCTTACATGGTTTAAAGTCGGACAAAGAGTAT TTCTTGGTTCCTAGTTTTCCAGATAGAGTTGAATTTACAAAGCTTCAAGTTACTGTGAAA ACAAACTTTAGTGGAGATTGGAAAGAGATCATGGACGAACAGGTGGATGCTGATGACACG TCCTATGGTGTAATTGTCAACACATTTCAGGATTTGGAGTCTGCCTATGTGAAAAACTAC ACGGAGGCTAGGGCTGGTAAAGTATGGAGCATCGGTCCGGTTTCCTTGTGCAACAAGGTA GGAGAAGACAAAGCTGAGAGGGGAAACAAGGCAGCCATTGATCAAGACGAGTGTATTAAA TGGCTTGATTCTAAAGATGTAGAGTCGGTGCTGTATGTTTGCCTTGGAAGTATATGCAAT CTTCCTCTGGCTCAGCTTAGAGAGCTCGGGCTAGGCCTCGAGGCAACTAAAAGACCATTC ATTTGGGTCATAAGAGGTGGGGGAAAGTATCATGAACTAGCTGAGTGGATCTTAGAGAGC
GGTTTTGAAGAAAGAACCAAAGAGAGAAGCCTTCTCATAAAAGGATGGTCGCCTCAAATG CTTATCCTTTCACACCCTGCCGTTGGAGGATTCCTGACACATTGTGGATGGAACTCAACT TTAGAAGGAATCACCTCAGGGGTTCCATTGATCACTTGGCCATTATTTGGAGACCAATTC TGCAACCAGAAACTGATCGTGCAGGTGCTAAAAGCAGGTGTAAGTGTTGGGGTTGAAGAG GTCATGAAATGGGGAGAAGAGGAGAGTATTGGAGTGTTAGTGGATAAAGAAGGAGTGAAG AAGGCAGTGGACGAAATAATGGGCGAGAGTGATGAAGCAAAAGAGAGAAGAAAAAGAGTC AGAGAGCTTGGAGAATTAGCTCACAAGGCTGTGGAAGAAGGAGGCTCTTCTCATTCTAAT ATCATATTTTTGCTACAAGATATAATGCAACAAGTAGAATCCAAGAGTTGA
>UGT73C3
ATGGCTACGGAAAAAACCCACCAATTTCATCCTTCTCTTCACTTTGTCCTCTTCCCTTTC ATGGCTCAAGGCCACATGATTCCCATGATTGATATTGCAAGACTCTTGGCTCAGCGTGGT GTGACCATAACAATTGTCACGACACCTCACAACGCAGCAAGGTTTAAGAATGTCCTAAAC CGAGCGATCGAGTCTGGCTTGGCCATCAACATACTGCATGTGAAGTTTCCATATCAAGAG TTTGGTTTGCCAGAAGGAAAAGAGAATATAGATTCGTTAGACTCAACGGAGTTGATGGTA CCTTTCTTCAAAGCGGTGAACTTGCTTGAAGATCCGGTCATGAAGCTCATGGAAGAGATG AAACCTAGACCTAGCTGTCTAATTTCTGATTGGTGTTTGCCTTATACAAGCATAATCGCC AAGAACTTCAATATACCAAAGATAGTTTTCCACGGCATGGGTTGCTTTAATCTTTTGTGT ATGCATGTTCTACGCAGAAACTTAGAGATCCTAGAGAATGTAAAGTCGGATGAAGAGTAT TTCTTGGTTCCTAGTTTTCCTGATAGAGTTGAATTTACAAAGCTTCAACTTCCTGTGAAA GCAAATGCAAGTGGAGATTGGAAAGAGATAATGGATGAAATGGTAAAAGCAGAATACACA TCCTATGGTGTGATCGTCAACACATTTCAGGAGTTGGAGCCACCTTATGTCAAAGACTAC AAAGAGGCAATGGATGGAAAAGTATGGTCCATTGGACCCGTTTCCTTGTGTAACAAGGCA GGTGCAGACAAAGCTGAGAGGGGAAGCAAGGCCGCCATTGATCAAGATGAGTGTCTTCAA TGGCTTGATTCTAAAGAAGAAGGTTCGGTGCTCTATGTTTGCCTTGGAAGTATATGTAAT CTTCCTTTGTCTCAGCTCAAGGAGCTGGGGCTAGGCCTTGAGGAATCTCGAAGATCTTTT ATTTGGGTCATAAGAGGTTCGGAAAAGTATAAAGAACTATTTGAGTGGATGTTGGAGAGC GGTTTTGAAGAAAGAATCAAAGAGAGAGGACTTCTCATTAAAGGGTGGGCACCTCAAGTC CTTATCCTTTCACATCCTTCCGTTGGAGGATTCCTGACACACTGTGGATGGAACTCGACT CTCGAAGGAATCACCTCAGGCATTCCACTGATCACTTGGCCGCTGTTTGGAGACCAATTC TGCAACCAAAAACTGGTCGTTCAAGTACTAAAAGCCGGTGTAAGTGCCGGGGTTGAAGAA GTCATGAAATGGGGAGAAGAAGATAAAATAGGAGTGTTAGTGGATAAAGAAGGAGTGAAA AAGGCTGTGGAAGAATTGATGGGTGATAGTGATGATGCAAAAGAGAGGAGAAGAAGAGTC AAAGAGCTTGGAGAATTAGCTCACAAAGCTGTGGAAAAAGGAGGCTCTTCTCATTCTAAC ATCACACTCTTGCTACAAGACATAATGCAACTAGCACAATTCAAGAATTGA
>UGT73C4
ATGGCTTCCGAAAAATCCCACAAAGTTCATCCTCCTCTTCACTTTATTCTTTTCCCTTTC ATGGCTCAGGGCCACATGATTCCCATGATTGATATAGCAAGGCTCTTGGCTCAGCGCGGT GCGACAGTAACTATTGTCACGACACGTTATAATGCAGGGAGGTTCGAGAATGTCTTAAGT CGTGCCATGGAGTCTGGTTTACCCATCAACATAGTGCATGTGAATTTTCCATATCAAGAA TTTGGTTTGCCAGAAGGAAAAGAGAATATAGATTCGTATGACTCAATGGAGCTGATGGTA CCTTTCTTTCAAGCAGTTAACATGCTCGAAGATCCGGTCATGAAGCTCATGGAAGAGATG AAACCTAGACCTAGCTGTATTATTTCTGATTTGCTCTTGCCTTATACAAGCAAAATCGCA AGGAAATTCAGTATACCAAAGATAGTTTTCCACGGCACGGGTTGCTTTAATCTTTTGTGT ATGCATGTTCTACGCAGAAACCTCGAGATCTTGAAGAACTTAAAGTCGGATAAAGATTAT TTCCTGGTTCCTAGTTTTCCTGATAGAGTTGAATTTACAAAGCCTCAAGTTCCAGTGGAA ACAACTGCAAGTGGAGATTGGAAAGCGTTCTTGGACGAAATGGTAGAAGCAGAATACACA TCCTATGGTGTGATCGTCAACACATTTCAGGAGTTGGAGCCTGCTTATGTCAAAGACTAC ACGAAGGCTAGGGCTGGAAAAGTATGGTCCATTGGACCTGTTTCCTTGTGCAACAAGGCA GGTGCTGATAAAGCTGAGAGGGGAAACCAGGCCGCCATTGATCAAGATGAGTGTCTTCAA TGGCTTGATTCTAAAGAAGATGGTTCGGTGTTATATGTTTGCCTTGGAAGTATCTGTAAT
CTACCTTTGTCTCAGCTCAAGGAGCTGGGGCTAGGCCTTGAAAAATCCCAAAGATCTTTT ATTTGGGTCATAAGAGGTTGGGAAAAGTATAATGAACTATATGAGTGGATGATGGAGAGC GGTTTTGAAGAAAGAATCAAAGAGAGAGGACTTCTTATTAAAGGGTGGTCACCTCAAGTC CTTATCCTTTCACATCCTTCCGTTGGAGGATTCCTGACACACTGTGGATGGAACTCGACT CTCGAAGGAATCACCTCAGGCATTCCACTGATCACTTGGCCGCTGTTTGGAGACCAATTC TGCAACCAAAAACTGGTCGTTCAAGTACTAAAAGCCGGTGTAAGTGCCGGGGTTGAAGAA GTCATGAAATGGGGAGAAGAGGAGAAAATAGGAGTGTTAGTGGATAAAGAAGGAGTAAAG AAGGCAGTGGAAGAGTTAATGGGTGCGAGTGATGATGCAAAAGAGAGGAGAAGAAGAGTC AAAGAGCTTGGAGAATCAGCTCACAAGGCTGTGGAAGAAGGAGGCTCTTCTCATTCTAAC ATCACATACTTGCTACAAGACATAATGCAACAAGTGAAATCCAAGAACTGA
>UGT73C5
ATGGTTTCCGAAACAACCAAATCTTCTCCACTTCACTTTGTTCTCTTCCCTTTCATGGCT CAAGGCCACATGATTCCCATGGTTGATATTGCAAGGCTCTTGGCTCAGCGTGGTGTGATC ATAACAATTGTCACGACGCCTCACAATGCAGCGAGGTTCAAGAATGTCCTAAACCGTGCC ATTGAGTCTGGCTTGCCCATCAACTTAGTGCAAGTCAAGTTTCCATATCTAGAAGCTGGT TTGCAAGAAGGACAAGAGAATATCGATTCTCTTGACACAATGGAGCGGATGATACCTTTC TTTAAAGCGGTTAACTTTCTCGAAGAACCAGTCCAGAAGCTCATTGAAGAGATGAACCCT CGACCAAGCTGTCTAATTTCTGATTTTTGTTTGCCTTATACAAGCAAAATCGCCAAGAAG TTCAATATCCCAAAGATCCTCTTCCATGGCATGGGTTGCTTTTGTCTTCTGTGTATGCAT GTTTTACGCAAGAACCGTGAGATCTTGGACAATTTAAAGTCAGATAAGGAGCTTTTCACT GTTCCTGATTTTCCTGATAGAGTTGAATTCACAAGAACGCAAGTTCCGGTAGAAACATAT GTTCCAGCTGGAGACTGGAAAGATATCTTTGATGGTATGGTAGAAGCGAATGAGACATCT TATGGTGTGATCGTCAACTCATTTCAAGAGCTCGAGCCTGCTTATGCCAAAGACTACAAG GAGGTAAGGTCCGGTAAAGCATGGACCATTGGACCCGTTTCCTTGTGCAACAAGGTAGGA GCCGACAAAGCAGAGAGGGGAAACAAATCAGACATTGATCAAGATGAGTGCCTTAAATGG CTCGATTCTAAGAAACATGGCTCGGTGCTTTACGTTTGTCTTGGAAGTATCTGTAATCTT CCTTTGTCTCAACTCAAGGAGCTGGGACTAGGCCTAGAGGAATCCCAAAGACCTTTCATT TGGGTCATAAGAGGTTGGGAGAAGTACAAAGAGTTAGTTGAGTGGTTCTCGGAAAGCGGC TTTGAAGATAGAATCCAAGATAGAGGACTTCTCATCAAAGGATGGTCCCCTCAAATGCTT ATCCTTTCACATCCATCAGTTGGAGGGTTCCTAACACACTGTGGTTGGAACTCGACTCTT GAGGGGATAACTGCTGGTCTACCGCTACTTACATGGCCGCTATTCGCAGACCAATTCTGC AATGAGAAATTGGTCGTTGAGGTACTAAAAGCCGGTGTAAGATCCGGGGTTGAACAGCCT ATGAAATGGGGAGAAGAGGAGAAAATAGGAGTGTTGGTGGATAAAGAAGGAGTGAAGAAG GCAGTGGAAGAATTAATGGGTGAGAGTGATGATGCAAAAGAGAGAAGAAGAAGAGCCAAA GAGCTTGGAGATTCAGCTCACAAGGCTGTGGAAGAAGGAGGCTCTTCTCATTCTAACATC TCTTTCTTGCTACAAGACATAATGGAACTGGCAGAACCCAATAATTGA
>UGT73C6
ATGGCTTTCGAAAAAAACAACGAACCTTTTCCTCTTCACTTTGTTCTCTTCCCTTTCATG GCTCAAGGCCACATGATTCCCATGGTTGATATTGCAAGGCTCTTGGCTCAGCGAGGTGTG CTTATAACAATTGTCACGACGCCTCACAATGCAGCAAGGTTCAAGAATGTCCTAAACCGT GCCATTGAGTCTGGTTTGCCCATCAACCTAGTGCAAGTCAAGTTTCCATATCAAGAAGCT GGTCTGCAAGAAGGACAAGAAAATATGGATTTGCTTACCACGATGGAGCAGATAACATCT TTCTTTAAAGCGGTTAACTTACTCAAAGAACCAGTCCAGAACCTTATTGAAGAGATGAGC CCGCGACCAAGCTGTCTAATCTCTGATATGTGTTTGTCGTATACAAGCGAAATCGCCAAG AAGTTCAAAATACCAAAGATCCTCTTGCATGGCATGGGTTGCTTTTGTCTTCTGTGTGTT AACGTTCTGCGCAAGAACCGTGAGATCTTGGACAATTTAAAGTCTGATAAGGAGTACTTC ATTGTTCCTTATTTTCCTGATAGAGTTGAATTCACAAGACCTCAAGTTCCGGTGGAAACA TATGTTCCTGCAGGCTGGAAAGAGATCTTGGAGGATATGGTAGAAGCGGATAAGACATCT TATGGTGTTATAGTCAACTCATTTCAAGAGCTCGAACCTGCGTATGCCAAAGACTTCAAG GAGGCAAGGTCTGGTAAAGCATGGACCATTGGACCTGTTTCCTTGTGCAACAAGGTAGGA
GTAGACAAAGCAGAGAGGGGAAACAAATCAGATATTGATCAAGATGAGTGCCTTGAATGG CTCGATTCTAAGGAACCGGGATCTGTGCTCTACGTTTGCCTTGGAAGTATTTGTAATCTT CCTCTGTCTCAGCTCCTTGAGCTGGGACTAGGCCTAGAGGAATCCCAAAGACCTTTCATC TGGGTCATAAGAGGTTGGGAGAAATACAAAGAGTTAGTTGAGTGGTTCTCGGAAAGCGGC TTTGAAGATAGAATCCAAGATAGAGGACTTCTCATCAAAGGATGGTCCCCTCAAATGCTT ATCCTTTCACATCCTTCTGTTGGAGGGTTCTTAACGCACTGCGGATGGAACTCGACTCTT GAGGGGATAACTGCTGGTCTACCAATGCTTACATGGCCACTATTTGCAGACCAATTCTGC AACGAGAAACTGGTCGTACAAATACTAAAAGTCGGTGTAAGTGCCGAGGTTAAAGAGGTC ATGAAATGGGGAGAAGAAGAGAAGATAGGAGTGTTGGTGGATAAAGAAGGAGTGAAGAAG GCAGTGGAAGAACTAATGGGTGAGAGTGATGATGCAAAAGAGAGAAGAAGAAGAGCCAAA GAGCTTGGAGAATCAGCTCACAAGGCTGTGGAAGAAGGAGGCTCCTCTCATTCTAATATC ACTTTCTTGCTACAAGACATAATGCAACTAGCACAGTCCAATAATTGA
>UGT73C7
ATGTGTTCTCATGATCCTCTTCACTTCGTCGTAATACCCTTTATGGCCCAAGGCCATATG ATCCCATTGGTCGACATCTCTAGGCTCTTGTCCCAGCGCCAAGGCGTGACTGTCTGCATC ATCACAACTACTCAAAATGTAGCCAAGATCAAGACTTCACTCTCATTTTCCTCTTTGTTT GCGACTATCAACATCGTTGAAGTTAAGTTTCTGTCTCAACAAACGGGTTTGCCAGAAGGG TGCGAGAGTTTAGATATGTTGGCTTCAATGGGCGATATGGTGAAGTTCTTTGATGCTGCC AACTCACTTGAGGAGCAAGTTGAGAAAGCTATGGAAGAGATGGTTCAGCCGCGGCCAAGC TGCATCATTGGAGACATGAGCCTTCCTTTCACTTCAAGACTTGCCAAGAAATTCAAGATC CCCAAACTTATCTTCCATGGGTTTTCTTGTTTCAGCCTCATGTCTATACAAGTGGTTCGA GAAAGCGGGATCTTGAAAATGATAGAATCAAACGACGAGTATTTTGATTTGCCCGGCTTG CCTGACAAAGTTGAGTTCACGAAACCTCAGGTCTCTGTGTTGCAACCTGTTGAAGGAAAT ATGAAAGAGAGTACGGCCAAGATTATTGAAGCTGATAATGACTCTTATGGTGTTATTGTG AACACTTTTGAAGAGTTAGAGGTTGATTATGCAAGAGAATATAGGAAAGCAAGGGCTGGA AAAGTTTGGTGCGTTGGACCTGTTTCCTTGTGCAATAGGTTAGGGTTAGACAAAGCTAAA AGAGGAGATAAGGCTTCTATTGGTCAAGACCAATGTCTTCAATGGCTTGACTCTCAAGAA ACTGGTTCAGTGCTCTACGTTTGCCTTGGAAGTCTATGTAATCTTCCCTTGGCTCAGCTC AAAGAGCTGGGACTAGGCCTTGAGGCATCTAATAAACCTTTCATATGGGTTATAAGAGAA TGGGGAAAATATGGAGATTTAGCAAATTGGATGCAACAAAGCGGATTTGAAGAGCGGATC AAAGATAGAGGACTGGTGATCAAAGGTTGGGCGCCGCAAGTTTTCATCCTCTCACACGCA TCCATTGGAGGGTTTTTGACTCACTGTGGATGGAACTCGACACTAGAAGGAATTACTGCA GGAGTTCCATTATTGACATGGCCTTTGTTTGCTGAACAATTCTTGAATGAGAAGTTAGTT GTGCAGATACTAAAAGCAGGGTTAAAGATAGGAGTAGAGAAATTGATGAAATATGGAAAA GAAGAGGAGATAGGAGCGATGGTGAGCAGAGAATGTGTGAGAAAAGCTGTGGATGAGCTA ATGGGTGATAGTGAAGAAGCAGAAGAGAGAAGAAGAAAAGTTACAGAACTTAGTGACTTG GCAAATAAGGCTTTGGAAAAAGGAGGATCTTCAGATTCTAATATCACATTGCTCATTCAA GATATTATGGAGCAATCACAAAATCAATTTTAA
>UGT73D1
ATGGAATCAAAAATAGTTTCAAAAGCCAAAAGACTTCACTTTGTTTTGATCCCTCTCATG GCTCAAGGGCATCTGATCCCCATGGTCGACATCTCCAAGATTCTTGCACGACAAGGCAAC ATCGTTACCATAGTTACAACCCCTCAAAATGCTTCTAGGTTTGCGAAGACAGTTGACCGA GCAAGATTAGAGTCGGGTCTCGAAATCAATGTCGTTAAATTTCCAATTCCTTACAAAGAA TTCGGTCTTCCCAAAGATTGTGAGACTCTGGACACTTTGCCCTCCAAAGACCTCCTACGA AGATTCTATGACGCTGTGGATAAACTCCAAGAGCCCATGGAACGGTTTGTTGAGCAACAA GATATCCCTCCAAGTTGCATAATCTCCGATAAATGCCTTTTTTGGACGTCAAGAACCGCA AAGAGGTTCAAAATCCCGAGGATCGTGTTCCATGGAATGTGTTGCTTCTCTCTTTTGAGT TCGCACAATATCCATCTTCATAGCCCGCACCTCTCGGTTTCTTCGGCCGTAGAGCCATTC CCTATACCAGGAATGCCACATAGGATTGAGATAGCTAGAGCTCAGTTACCTGGTGCTTTT GAGAAGTTAGCAAATATGGATGACGTTCGCGAGAAGATGCGTGAATCTGAATCAGAAGCC
TTTGGGGTTATTGTTAATAGCTTCCAGGAATTGGAGCCTGGCTATGCAGAGGCCTACGCT GAGGCCATCAATAAGAAGGTATGGTTCGTTGGACCCGTTTCTTTATGCAACGACCGTATG GCTGACCTATTCGATAGAGGAAGTAATGGTAACATCGCAATAAGCGAGACCGAATGCTTG CAGTTTCTTGACTCGATGAGACCAAGGTCAGTCTTATATGTTTCTCTTGGTAGCCTCTGT CGACTAATACCTAATCAATTGATAGAACTAGGTTTAGGGTTAGAAGAATCGGGAAAACCC TTTATTTGGGTGATAAAGACCGAGGAAAAACACATGATTGAGCTAGACGAATGGCTAAAA CGCGAAAATTTTGAAGAGCGAGTTAGAGGAAGAGGGATAGTAATAAAGGGTTGGAGTCCT CAGGCTATGATACTCTCACATGGTTCAACCGGCGGGTTCTTGACTCATTGCGGTTGGAAT TCTACAATAGAAGCGATATGTTTTGGTGTACCAATGATCACATGGCCGTTGTTCGCTGAA CAATTTCTCAATGAGAAACTCATCGTGGAGGTTTTGAACATCGGGGTTAGGGTTGGGGTG GAGATTCCGGTGAGATGGGGAGACGAGGAGAGACTTGGAGTGTTGGTCAAGAAACCGAGT GTTGTGAAAGCTATAAAGCTTTTGATGGACCAAGATTGTCAACGTGTAGACGAAAATGAT GATGATAATGAATTCGTGAGACGAAGGAGACGTATTCAAGAACTTGCAGTAATGGCGAAA AAGGCTGTGGAAGAAAAGGGATCTTCGAGTATTAACGTTTCAATTTTAATCCAAGATGTT TTGGAGCAATTGAGTCTCGTGTAG
>UGT74B1
ATGGCGGAAACAACTCCCAAAGTGAAAGGCCACGTCGTAATCTTACCATACCCAGTTCAA GGCCACCTAAACCCAATGGTTCAATTCGCTAAACGTCTAGTCTCCAAAAACGTCAAAGTC ACAATCGCCACCACTACCTACACCGCCTCCTCAATCACAACACCATCACTCTCCGTCGAA CCAATCTCCGATGGATTCGATTTCATCCCCATAGGTATCCCCGGTTTCAGCGTCGATACT TACTCAGAATCCTTCAAGCTCAACGGATCCGAAACCCTAACTCTCCTAATCGAGAAATTC AAATCCACAGATTCACCAATCGATTGCTTAATCTACGATTCGTTTCTTCCTTGGGGACTT GAAGTTGCTAGATCTATGGAACTTTCAGCTGCTTCTTTCTTCACTAATAATCTCACTGTT TGTTCTGTGTTGCGTAAATTCTCTAACGGTGACTTTCCTCTTCCCGCTGATCGTAATTCG GCGCCGTTTCGTATCCGTGGCTTACCGTCTTTGAGCTACGATGAGTTACCTTCGTTTGTG GGACGTCATTGGTTGACTCATCCTGAGCATGGCAGAGTTCTTCTGAATCAGTTTCCTAAC CATGAAAATGCTGATTGGTTATTCGTTAATGGCTTTGAAGGGTTAGAAGAAACACAAGAT TGTGAAAATGGTGAGTCTGATGCAATGAAGGCGACGTTGATCGGACCGATGATTCCATCG GCTTATCTTGATGATCGGATGGAAGATGATAAAGACTATGGTGCGAGTCTGTTGAAACCG ATATCGAAGGAGTGTATGGAGTGGCTTGAGACTAAGCAGGCTCAGTCAGTAGCATTTGTT TCGTTTGGTTCGTTTGGGATTCTCTTTGAGAAGCAACTTGCAGAGGTAGCTATTGCGCTA CAAGAATCGGATTTGAACTTCTTGTGGGTGATTAAAGAAGCTCATATAGCGAAATTGCCT GAAGGGTTTGTGGAATCGACTAAAGATAGAGCCTTGTTGGTTTCTTGGTGTAACCAGCTT GAGGTTTTAGCTCATGAATCGATAGGTTGCTTTTTGACTCATTGTGGTTGGAACTCTACG TTGGAAGGGTTGAGTTTGGGAGTTCCGATGGTTGGTGTGCCTCAGTGGAGTGATCAGATG AATGATGCTAAGTTTGTGGAGGAAGTTTGGAAAGTTGGGTATAGAGCGAAAGAGGAAGCT GGGGAAGTAATCGTGAAGAGTGAAGAATTGGTGAGGTGTTTGAAAGGAGTGATGGAAGGA GAGAGTAGTGTGAAGATTAGAGAGAGTTCGAAGAAGTGGAAAGATTTGGCTGTGAAGGCA ATGAGTGAAGGAGGAAGCTCTGATCGAAGCATTAACGAGTTTATAGAGAGTTTAGGGAAG TAA
>UGT74C1
ATGAGTGAAGCAAAGAAGGGTCACGTACTGTTTTTTCCATATCCATTACAAGGCCACATT AACCCAATGATCCAACTCGCTAAACGCTTATCCAAAAAGGGCATCACCAGCACACTCATC ATCGCCTCCAAAGACCACCGTGAACCTTACACCTCCGACGACTACTCCATCACCGTCCAC ACCATCCACGACGGTTTCTTTCCACATGAACACCCTCACGCCAAGTTCGTAGATCTTGAC CGTTTCCACAACTCTACTTCTCGAAGCCTGACCGATTTCATCTCTAGTGCGAAGTTGTCG GACAATCCTCCAAAAGCTCTGATCTATGATCCATTTATGCCCTTTGCATTGGACATAGCC AAGGACTTGGATCTATACGTAGTGGCATATTTCACTCAACCATGGTTGGCTAGTCTTGTT TACTACCATATCAACGAAGGCACCTACGATGTTCCCGTTGATAGACACGAGAACCCAACA CTTGCATCGTTTCCTGGTTTCCCATTGTTAAGCCAAGATGATCTGCCTTCGTTCGCCTGC
GAAAAAGGGTCGTACCCTCTTCTACACGAGTTTGTGGTTAGGCAATTCTCTAATTTATTG CAAGCTGATTGCATTCTCTGCAACACTTTTGATCAACTTGAACCAAAGGTAGTGAAATGG ATGAATGATCAATGGCCGGTGAAGAACATTGGACCGGTGGTTCCATCGAAGTTCTTGGAT AACCGGTTGCCAGAAGACAAAGATTACGAACTCGAGAACTCCAAGACAGAGCCAGACGAG TCTGTTTTGAAGTGGTTGGGAAACAGGCCGGCGAAGTCGGTGGTTTACGTGGCGTTTGGG ACATTGGTGGCTTTGAGCGAAAAACAGATGAAGGAAATTGCAATGGCGATTAGCCAAACC GGATATCACTTCTTGTGGTCTGTTAGAGAATCCGAGAGAAGCAAACTACCCTCTGGTTTT ATCGAAGAGGCAGAGGAGAAAGACTCTGGACTTGTGGCTAAGTGGGTTCCTCAGCTAGAG GTTTTAGCACATGAATCAATCGGGTGTTTCGTGTCACACTGTGGATGGAACTCGACATTG GAGGCACTATGCTTAGGGGTTCCAATGGTGGGCGTGCCTCAGTGGACTGATCAGCCCACA AATGCTAAGTTTATAGAGGATGTGTGGAAGATTGGGGTTAGAGTGAGGACCGATGGAGAA GGGCTTTCGAGTAAAGAAGAGATTGCGAGATGCATTGTTGAGGTCATGGAAGGAGAGAGA GGGAAAGAGATAAGGAAGAATGTTGAGAAGCTTAAGGTGTTGGCTCGCGAAGCTATCTCT GAAGGAGGTAGTTCCGACAAGAAGATTGATGAGTTTGTTGCTCTTTTGACTTAA
>UGT74D1
ATGGGAGAGAAAGCGAAAGCAAATGTGTTAGTCTTCTCATTTCCGATACAAGGTGACATA AACCCTCTCCTCCAATTCTCAAAACGCCTACTCTCTAAAAACGTCAACGTCACATTCCTC ACCACTTCCTCCACCCACAACTCCATCCTCCGCCGTGCCATCACCGGCGGAGCCACTGCT CTTCCTCTCTCTTTTGTCCCCATTGACGATGGATTCGAGGAAGATCACCCATCTACGGAC ACATCTCCCGACTACTTCGCAAAGTTCCAAGAAAACGTATCTCGAAGCCTCTCAGAGCTT ATCTCCTCGATGGACCCAAAACCAAACGCCGTCGTTTACGACTCGTGCCTGCCTTATGTC CTCGACGTTTGCCGGAAACATCCTGGCGTTGCTGCGGCGTCGTTTTTCACTCAGTCCTCC ACCGTGAACGCGACCTATATTCATTTCTTGCGTGGAGAGTTTAAGGAGTTTCAAAATGAT GTCGTTTTGCCTGCAATGCCTCCGCTGAAGGGTAATGACTTACCGGTGTTTCTGTACGAT AACAATCTCTGCCGGCCGTTGTTTGAGCTCATTAGTAGCCAGTTGGTGAATGTTGACGAC ATTGACTTCTTCTTGGTTAACTCTTTCGACGAACTCGAAGTCGAGGTGCTACAATGGATG AAAAACCAATGGCCGGTCAAGAACATAGGACCGATGATTCCATCAATGTACTTAGACAAA CGATTAGCAGGTGACAAAGACTACGGAATCAACCTCTTCAATGCCCAAGTCAACGAATGC CTTGATTGGCTTGACTCAAAACCGCCCGGTTCAGTGATCTACGTGTCTTTTGGAAGCTTG GCCGTCTTAAAAGACGATCAAATGATAGAAGTCGCGGCTGGTCTAAAACAAACTGGCCAT AACTTCTTATGGGTTGTTAGAGAAACTGAAACAAAGAAGCTTCCAAGCAATTACATAGAG GACATTTGTGACAAGGGATTGATAGTGAATTGGAGTCCTCAATTACAAGTTCTTGCACAT AAATCAATCGGTTGTTTCATGACTCATTGCGGGTGGAATTCGACTTTAGAGGCATTGAGC TTAGGAGTTGCTTTGATAGGAATGCCGGCTTATAGCGACCAGCCGAGTAATGCTAAGTTT ATTGAAGATGTGTGGAAGGTTGGGGTTAGGGTTAAGGCAGATCAAAATGGGTTTGTTCCG AAGGAAGAGATTGTGAGATGTGTTGGAGAAGTTATGGAAGATATGTCGGAGAAAGGGAAG GAGATTAGAAAAAATGCTCGGAGGTTGATGGAGTTTGCAAGGGAAGCTTTGTCTGATGGA GGAAATTCTGATAAGAATATTGATGAGTTTGTTGCTAAAATTGTGAGGTAA
>UGT74E1
ATGAGAGAAGGATCTCATGTTATTGTTTTGCCTTTCCCAGCACAAGGCCACATAACTCCA ATGTCCCAATTCTGTAAACGCTTAGCCTCAAAAAGTCTTAAGATCACTCTTGTCCTCGTC TCCGACAAGCCCTCTCCGCCGTACAAAACAGAGCACGACACAATCACTGTCGTCCCCATC TCCAATGGTTTCCAAGAAGGCCAGGAACGATCAGAAGACCTAGATGAGTACATGGAAAGA GTAGAATCCAGCATCAAAAACCGCTTACCGAAGTTGATAGAAGACATGAAACTATCGGGA AATCCTCCTAGGGCTCTTGTGTACGACTCCACCATGCCGTGGCTTCTGGATGTAGCTCAT AGTTATGGTTTGAGCGGTGCCGTGTTTTTCACGCAGCCTTGGCTTGTCTCAGCTATTTAC TATCATGTATTCAAGGGCTCGTTCTCTGTACCGTCTACAAAGTATGGTCACTCGACGTTA GCATCTTTCCCTTCGTTACCGATTCTGAATGCGAATGATTTGCCGTCTTTCCTCTGTGAA TCTTCCTCTTACCCATATATTCTAAGGACTGTGATCGATCAGCTCTCAAACATTGATCGA GTTGATATAGTTTTGTGCAACACTTTCGATAAATTGGAAGAAAAGTTGCTGAAATGGATT
AAAAGCGTGTGGCCTGTCCTGAACATAGGACCAACTGTTCCATCAATGTATTTAGATAAG CGACTGGCTGAAGACAAAAACTACGGATTCAGCCTCTTCGGTGCGAAAATCGCTGAATGC ATGGAGTGGCTCAACTCAAAGCAGCCTAGTTCAGTTGTTTATGTATCATTTGGGAGCTTG GTGGTTCTAAAAAAAGATCAACTGATAGAACTAGCGGCGGGTCTGAAACAGAGCGGACAT TTCTTTTTGTGGGTTGTGAGAGAGACGGAGAGAAGAAAACTTCCAGAAAACTATATAGAG GAAATTGGTGAGAAAGGACTGACCGTGAGCTGGAGTCCACAACTTGAAGTTCTTACACAT AAATCGATCGGTTGTTTCGTGACACATTGTGGATGGAACTCGACGTTAGAGGGATTGAGT TTGGGAGTTCCAATGATTGGTATGCCTCATTGGGCAGATCAGCCTACAAATGCTAAGTTC ATGGAGGATGTGTGGAAAGTTGGAGTTAGGGTTAAAGCAGACAGTGATGGGTTCGTGAGA AGAGAAGAGTTTGTGAGACGTGTGGAAGAAGTTATGGAGGCAGAGCAAGGTAAAGAGATT AGAAAGAATGCTGAGAAATGGAAAGTGTTGGCTCAAGAGGCTGTTTCTGAAGGAGGTAGT TCTGATAAGAACATCAATGAGTTTGTTTCTATGTTTTGTTGA
>UGT74E2
ATGAGAGAAGGATCTCATCTTATCGTCTTGCCTTTCCCAGGACAAGGCCACATAACTCCA ATGTCCCAGTTCTGCAAACGCTTAGCCTCAAAAGGTCTTAAGCTCACTCTGGTCCTCGTC TCCGACAAACCCTCTCCTCCATACAAAACAGAGCACGACTCAATCACTGTCTTCCCCATC TCCAACGGCTTCCAAGAAGGCGAGGAACCATTACAAGACCTCGATGATTACATGGAAAGA GTAGAAACCAGCATCAAAAACACCTTACCGAAGTTGGTTGAAGACATGAAACTGTCGGGA AATCCACCTAGGGCTATCGTGTACGACTCCACCATGCCATGGCTTCTTGATGTAGCTCAT AGTTATGGATTGAGCGGTGCCGTGTTTTTCACGCAACCTTGGCTTGTCACAGCTATTTAC TACCATGTTTTCAAGGGTTCGTTCTCTGTACCGTCTACAAAGTACGGTCACTCGACATTA GCATCTTTCCCTTCGTTCCCGATGCTGACTGCAAATGATTTGCCGTCTTTCCTCTGCGAA TCGTCCTCATACCCGAATATACTGAGGATTGTGGTGGATCAGCTCTCAAACATTGATCGA GTCGACATAGTGTTGTGCAACACTTTCGATAAATTGGAGGAAAAGTTGTTGAAATGGGTC CAAAGCTTGTGGCCAGTCTTGAATATTGGACCAACGGTTCCATCGATGTATTTAGACAAA CGACTGTCTGAAGACAAGAACTACGGTTTTAGCCTCTTCAATGCGAAAGTCGCTGAATGC ATGGAGTGGCTAAACTCAAAGGAGCCTAATTCTGTTGTCTATTTATCATTCGGAAGTTTG GTGATTCTAAAAGAAGATCAAATGTTGGAACTCGCTGCGGGTCTGAAACAGAGCGGACGT TTCTTTCTGTGGGTTGTGAGAGAGACAGAGACACACAAACTTCCAAGAAACTATGTCGAG GAAATCGGTGAAAAAGGACTTATTGTAAGCTGGAGTCCTCAGCTTGACGTACTTGCACAT AAATCAATCGGTTGTTTCTTGACACACTGTGGATGGAACTCGACGTTAGAGGGATTGAGT TTGGGAGTTCCAATGATTGGTATGCCACACTGGACTGATCAGCCCACGAATGCTAAGTTC ATGCAGGATGTGTGGAAGGTTGGGGTAAGGGTTAAGGCAGAAGGTGATGGGTTTGTGAGA AGAGAAGAGATTATGAGAAGTGTGGAAGAAGTTATGGAGGGAGAGAAAGGGAAAGAGATT AGAAAGAATGCTGAGAAATGGAAAGTGTTGGCTCAAGAGGCAGTTTCTGAAGGAGGTAGC TCTGATAAGAGCATCAATGAGTTTGTTTCTATGTTTTGTTGA
>UGT74F1
ATGGAGAAGATGAGAGGACATGTATTAGCAGTGCCATTTCCAAGCCAAGGACACATCACC CCGATTCGCCAATTCTGCAAACGACTTCACTCCAAAGGTTTCAAAACCACTCACACTCTC ACCACTTTTATCTTCAACACAATCCACCTCGACCCATCTAGTCCTATCTCCATAGCCACA ATCTCCGATGGCTATGACCAGGGAGGGTTCTCATCAGCCGGTTCTGTCCCGGAGTACCTA CAAAACTTCAAAACCTTCGGCTCCAAAACCGTCGCTGATATCATCCGCAAACACCAGAGT ACTGATAACCCTATTACTTGTATCGTCTATGATTCTTTCATGCCTTGGGCGCTTGACCTT GCAATGGATTTTGGTCTAGCTGCGGCTCCTTTCTTCACGCAGTCTTGCGCCGTTAACTAT ATCAATTATCTTTCTTACATAAACAATGGTAGCTTGACACTTCCCATCAAGGATTTGCCT CTTCTTGAGCTCCAAGATTTGCCTACTTTCGTCACTCCTACTGGTTCACACCTTGCTTAC TTTGAGATGGTGCTTCAACAGTTCACCAACTTCGACAAAGCTGATTTCGTACTCGTTAAT TCCTTCCATGACCTCGACCTTCATGAAGAGGAGTTGTTGTCGAAAGTATGTCCTGTGTTG ACAATTGGTCCAACTGTTCCATCAATGTACTTAGACCAACAGATCAAATCAGACAACGAC TATGATCTGAACCTCTTTGACTTAAAAGAAGCTGCCTTATGCACTGACTGGCTAGACAAG
AGGCCAGAAGGATCGGTAGTATATATAGCTTTTGGGAGCATGGCTAAACTGAGTAGTGAG CAGATGGAAGAGATTGCTTCGGCGATAAGCAACTTCAGCTACCTCTGGGTTGTCAGAGCT TCAGAGGAGTCAAAGCTCCCACCAGGGTTTCTTGAAACAGTGGATAAAGACAAGAGCTTG GTCTTGAAGTGGAGTCCTCAGCTTCAAGTTCTGTCAAACAAAGCCATCGGTTGTTTCATG ACTCACTGTGGCTGGAACTCAACCATGGAGGGTTTGAGTTTAGGGGTTCCCATGGTGGCT ATGCCTCAATGGACTGATCAACCAATGAATGCAAAGTATATACAAGATGTATGGAAGGTT GGGGTTCGTGTGAAAGCAGAGAAAGAAAGTGGCATTTGCAAAAGAGAGGAGATTGAGTTT AGCATCAAGGAAGTGATGGAAGGAGAGAAGAGCAAAGAGATGAAAGAGAATGCGGGAAAA TGGAGAGACTTGGCTGTGAAGTCACTCAGTGAAGGAGGTTCTACAGATATCAACATTAAC GAATTTGTATCAAAAATTCAAATCAAATAA
>UGT74F2
ATGGAGCATAAGAGAGGACATGTATTAGCAGTGCCGTACCCAACGCAAGGACACATCACA CCATTCCGCCAATTCTGCAAACGACTTCACTTCAAAGGTCTCAAAACCACTCTCGCTCTC ACCACTTTCGTCTTCAACTCCATCAATCCTGACCTATCCGGTCCAATCTCCATAGCCACC ATCTCCGATGGCTATGACCATGGGGGTTTCGAGACAGCTGACTCCATCGACGACTACCTC AAAGACTTTAAAACTTCCGGCTCGAAAACCATTGCAGACATCATCCAAAAACACCAGACT AGTGATAACCCCATCACTTGTATCGTCTATGATGCTTTCCTGCCTTGGGCACTTGACGTT GCTAGAGAGTTTGGTTTAGTTGCGACTCCTTTCTTTACGCAGCCTTGTGCTGTTAACTAT GTTTATTATCTTTCTTACATAAACAATGGAAGCTTGCAACTTCCCATTGAGGAATTGCCT TTTCTTGAGCTCCAAGATTTGCCTTCTTTCTTCTCTGTTTCTGGCTCTTATCCTGCTTAC TTTGAGATGGTGCTTCAACAGTTCATAAATTTCGAAAAAGCTGATTTQGTTCTCGTTAAT AGCTTCCAAGAGTTGGAACTGCATGAGAATGAATTGTGGTCGAAAGCTTGTCCTGTGTTG ACAATTGGTCCAACTATTCCATCAATTTACTTAGACCAACGTATCAAATCAGACACCGGC TATGATCTTAATCTCTTTGAATCGAAAGATGATTCCTTCTGCATTAACTGGCTCGACACA AGGCCACAAGGGTCGGTGGTGTACGTAGCATTCGGAAGCATGGCTCAGCTGACTAATGTG CAGATGGAGGAGCTTGCTTCAGCAGTAAGCAACTTCAGCTTCCTGTGGGTGGTCAGATCT TCAGAGGAGGAAAAACTCCCATCAGGGTTTCTTGAGACAGTGAATAAAGAAAAGAGCTTG GTCTTGAAATGGAGTCCTCAGCTTCAAGTTCTGTCAAACAAAGCCATCGGTTGTTTCTTG ACTCACTGTGGCTGGAACTCAACCATGGAGGCTTTGACCTTCGGGGTTCCCATGGTGGCA ATGCCCCAATGGACTGATCAACCGATGAACGCAAAGTACATACAAGATGTGTGGAAGGCT GGAGTTCGTGTGAAGACAGAGAAGGAGAGTGGGATTGCCAAGAGAGAGGAGATTGAGTTT AGCATTAAGGAAGTGATGGAAGGAGAGAGGAGCAAAGAGATGAAGAAGAACGTGAAGAAA TGGAGAGACTTGGCTGTCAAGTCACTCAATGAAGGAGGTTCTACGGATACTAACATTGAT ACATTTGTATCAAGGGTTCAGAGCAAATAG
>UGT75B1
ATGGCGCCACCGCATTTTCTACTGGTAACGTTTCCGGCGCAAGGTCACGTGAACCCATCT CTCCGTTTTGCTCGTCGGCTCATCAAAAGAACCGGCGCACGTGTCACTTTCGTCACTTGT GTCTCCGTCTTCCACAACTCCATGATCGCAAACCACAACAAAGTCGAAAATCTCTCTTTC CTTACTTTCTCCGACGGTTTCGACGATGGAGGCATTTCCACCTACGAAGACCGTCAGAAA AGGTCGGTGAATCTCAAGGTTAACGGCGATAAGGCACTATCGGATTTCATCGAAGCTACT AAGAATGGTGACTCTCCCGTGACTTGCTTGATCTACACGATTCTTCTCAATTGGGCTCCA AAAGTAGCACGTAGATTTCAACTTCCCTCCGCTCTTCTCTGGATCCAACCGGCTTTGGTT TTCAACATCTATTACACTCATTTCATGGGAAACAAGTCCGTTTTCGAGTTACCTAATCTG TCTTCTCTGGAAATCAGAGATCTTCCATCTTTCCTCACACCTTCCAACACAAACAAAGGC GCATACGATGCGTTTCAAGAAATGATGGAGTTTCTCATAAAAGAAACCAAACCGAAAATT CTCATCAACACTTTCGATTCGCTGGAACCAGAGGCCTTAACGGCTTTCCCGAATATCGAT ATGGTGGCGGTTGGTCCTTTACTTCCCACGGAGATTTTCTCAGGAAGCACCAACAAATCA GTTAAAGATCAAAGTAGTAGTTATACACTTTGGCTAGACTCGAAAACAGAGTCCTCTGTT ATTTACGTTTCCTTTGGAACAATGGTTGAGTTGTCCAAGAAACAGATAGAGGAACTAGCG AGAGCACTCATAGAAGGGAAACGACCGTTTTTGTGGGTTATAACTGATAAATCCAACAGA
GAAACGAAAACAGAAGGAGAAGAAGAGACAGAGATTGAGAAGATAGCTGGATTCAGACAC GAGCTTGAAGAGGTTGGGATGATTGTGTCGTGGTGTTCGCAGATAGAGGTTTTAAGTCAC CGAGCCGTAGGTTGTTTTGTGACTCATTGTGGGTGGAGCTCGACGCTGGAGAGTTTGGTT CTTGGCGTTCCGGTTGTGGCGTTTCCGATGTGGTCGGATCAACCGACGAACGCGAAGCTA CTGGAAGAAAGTTGGAAGACTGGTGTGAGGGTAAGAGAGAACAAGGATGGTTTGGTGGAG AGAGGAGAGATCAGGAGGTGTTTGGAAGCCGTGATGGAGGAGAAGTCGGTGGAGTTGAGG GAAAACGCAAAGAAATGGAAGCGTTTAGCGATGGAAGCGGGTAGAGAAGGAGGATCTTCG GATAAGAACATGGAGGCTTTTGTGGAGGATATTTGTGGAGAATCTCTTATTCAAAACTTG TGTGAAGCAGAGGAGGTAAAAGTAAAGTAA
>UGT75B2
ATGGCGCAACCGCATTTTCTACTGGTAACGTTTCCGGCGCAAGGTCACGTGAACCCATCT CTCCGTTTTGCTCGTCGGCTCATCAAAACAACTGGCGCACGTGTAACTTTCGCCACGTGT CTCTCTGTCATTCACCGCTCTATGATCCCAAACCACAACAACGTCGAAAATCTCTCTTTC CTTACTTTCTCCGACGGATTCGACGACGGAGTCATCTCCAACACCGACGACGTCCAAAAC CGGTTGGTACACTTCGAACGTAATGGCGATAAAGCTCTATCGGATTTCATCGAAGCTAAT CAGAATGGTGACTCTCCCGTAAGTTGCTTGATCTACACGATTCTTCCCAACTGGGTTCCA AAAGTGGCGCGTAGATTTCATCTTCCCTCTGTTCATCTCTGGATCCAACCAGCCTTCGCT TTCGACATTTATTACAATTACTCTACAGGAAACAACTCCGTTTTCGAGTTCCCGAATCTA CCTTCTCTCGAAATCCGCGATCTGCCTTCTTTCCTCTCACCTTCCAACACGAACAAAGCC GCACAAGCAGTATATCAAGAACTGATGGATTTTCTCAAAGAAGAATCTAACCCGAAAATT CTCGTCAACACATTCGATTCGCTGGAGCCAGAGTTCTTAACAGCTATTCCGAATATAGAA ATGGTGGCAGTTGGTCCTTTACTTCCTGCGGAGATTTTCACTGGAAGCGAATCAGGTAAA GATTTATCAAGAGATCATCAAAGTAGTAGTTATACACTTTGGTTAGACTCGAAAACAGAG TCCTCTGTTATTTATGTTTCTTTTGGAACAATGGTTGAGTTGTCGAAGAAACAGATAGAG GAACTAGCGAGAGCACTCATAGAAGGGGGAAGACCGTTCTTGTGGGTTATAACTGATAAA CTCAACAGAGAAGCGAAAATAGAAGGAGAAGAAGAGACAGAGATTGAGAAGATAGCTGGT TTTAGACACGAGCTTGAAGAGGTTGGGATGATTGTCTCGTGGTGTTCGCAGATAGAGGTT TTGAGACACCGAGCCATAGGTTGTTTTTTGACTCATTGTGGGTGGAGCTCATCACTGGAG AGTTTGGTTCTCGGCGTTCCAGTGGTGGCGTTTCCGATGTGGTCGGATCAGCCAGCAAAT GCGAAGCTTTTGGAAGAAATATGGAAGACAGGTGTGAGGGTGAGAGAGAACTCGGAAGGT TTAGTAGAGAGAGGAGAGATAATGCGGTGTTTGGAAGCAGTGATGGAGGCGAAATCGGTG GAGCTGAGGGAAAACGCAGAGAAATGGAAGCGTTTAGCGACTGAAGCGGGTAGAGAAGGA GGATCTTCGGACAAGAATGTGGAAGCTTTTGTGAAGAGTCTGTTTTGA
>UGT75C1
ATGGCCACTTCCGTCAATGGTTCCCATCGTCGTCCACATTACTTGCTTGTAACATTCCCA GCGCAAGGTCACATCAACCCGGCGCTTCAACTAGCCAACCGCCTCATCCACCACGGTGCA ACCGTCACATACTCCACCGCAGTCTCTGCTCACCGACGTATGGGCGAGCCACCTTCCACA AAAGGTCTATCCTTCGCTTGGTTCACCGATGGATTCGACGACGGTCTCAAGTCATTCGAA GACCAGAAAATCTACATGTCCGAACTCAAACGATGTGGTTCAAACGCCCTGAGAGACATC ATCAAAGCCAATCTTGACGCCACCACCGAAACAGAGCCTATCACCGGGGTAATCTACTCT GTTCTCGTCCCGTGGGTTTCTACGGTAGCGCGTGAGTTTCACCTCCCAACTACACTTCTC TGGATTGAACCAGCTACTGTACTAGACATCTACTACTACTACTTCAACACCTCTTACAAA CATCTCTTCGACGTTGAACCGATTAAATTACCGAAACTGCCACTGATCACCACCGGTGAC CTCCCGTCGTTTCTTCAACCTTCGAAGGCATTACCGTCAGCTCTTGTGACTCTAAGAGAA CATATCGAAGCTCTCGAAACGGAATCAAACCCTAAGATTCTTGTTAACACATTCTCTGCT TTGGAACACGATGCTTTAACCTCTGTTGAGAAACTCAAGATGATCCCAATCGGACCGTTG GTTTCTTCCTCCGAGGGTAAAACCGATCTTTTCAAATCTTCCGACGAGGATTACACGAAA TGGTTAGACTCGAAGCTCGAGAGATCAGTGATTTACATTTCCTTAGGCACACACGCCGAT GATTTACCAGAGAAACACATGGAAGCGCTTACTCACGGCGTGTTAGCTACAAACAGACCG TTTTTATGGATCGTGAGGGAGAAAAATCCAGAAGAGAAGAAGAAGAATCGGTTTCTTGAA
TTGATCAGAGGAAGTGATCGAGGATTGGTGGTGGGATGGTGTTCTCAGACAGCTGTTTTG GCGCATTGTGCTGTGGGATGTTTTGTGACTCATTGTGGTTGGAATTCGACGTTGGAGAGT TTAGAGAGTGGTGTTCCGGTGGTTGCGTTTCCGCAGTTTGCTGATCAGTGTACAACGGCG AAGCTTGTGGAGGATACGTGGAGGATTGGAGTGAAGGTGAAGGTTGGGGAGGAAGGAGAT GTGGATGGGGAGGAGATTAGAAGGTGTTTGGAGAAGGTGATGAGTGGTGGAGAAGAGGCG GAGGAGATGAGAGAGAATGCAGAGAAGTGGAAGGCGATGGCTGTTGATGCGGCAGCGGAA GGTGGACCGTCGGATTTGAATCTTAAAGGTTTTGTGGACGAGGATGAGTAG
>UGT75D1
ATGGCCAACAACAATTCCAACTCTCCCACCGGTCCACACTTTCTATTCGTAACATTTCCA GCCCAAGGTCACATCAACCCATCTCTCGAGCTAGCCAAACGCCTCGCCGGAACAATCTCT GGTGCTCGAGTCACCTTCGCCGCCTCAATCTCTGCCTACAACCGCCGCATGTTCTCTACA GAAAACGTCCCCGAAACCCTAATCTTCGCTACCTACTCCGATGGCCACGACGACGGTTTC AAATCCTCTGCTTACTCCGACAAATCTCGTCAAGACGCCACTGGAAACTTCATGTCTGAG ATGAGACGACGTGGCAAAGAGACACTAACCGAACTAATCGAAGATAACCGGAAACAAAAC AGGCCTTTTACTTGCGTGGTTTACACGATTCTCCTCACTTGGGTCGCTGAGCTAGCGCGT GAGTTTCATCTTCCTTCTGCTCTTCTTTGGGTCCAACCAGTAACAGTCTTCTCCATTTTT TACCATTACTTCAATGGCTACGAAGATGCAATCTCAGAGATGGCTAATACCCCCTCTAGT TCTATTAAATTACCTTCTCTGCCACTGCTTACTGTCCGTGATATTCCTTCTTTCATTGTC TCTTCCAATGTCTACGCGTTTCTTCTACCCGCGTTTCGAGAACAGATTGATTCACTGAAG GAAGAAATAAACCCTAAGATCCTCATCAACACTTTCCAAGAGCTTGAGCCAGAAGCCATG AGCTCGGTTCCAGATAATTTCAAGATTGTCCCTGTCGGTCCGTTACTAACGTTGAGAACG GATTTTTCGAGTCGCGGTGAATACATAGAGTGGTTGGATACTAAAGCGGATTCGTCTGTG CTTTATGTTTCGTTCGGGACGCTTGCCGTGTTGAGCAAGAAACAGCTTGTGGAGCTTTGT AAAGCGTTGATACAAAGTCGGAGACCATTCTTGTGGGTGATTACGGATAAGTCGTACAGA AATAAAGAAGATGAGCAAGAGAAGGAAGAAGATTGCATAAGTAGTTTCAGAGAAGAGCTC GATGAGATAGGAATGGTGGTTTCATGGTGTGATCAGTTTAGGGTTTTGAATCATAGATCG ATAGGTTGTTTCGTGACGCATTGCGGGTGGAACTCTACGCTGGAGAGCTTGGTTTCAGGA GTTCCGGTGGTGGCGTTTCCGCAATGGAATGATCAGATGATGAACGCGAAGCTTTTAGAA GATTGTTGGAAAACAGGTGTAAGAGTGATGGAGAAGAAGGAAGAAGAAGGAGTTGTGGTG GTGGATAGTGAGGAGATACGGCGGTGCATTGAGGAAGTTATGGAAGACAAGGCGGAGGAG TTTAGAGGAAATGCCACGAGGTGGAAGGATTTAGCGGCGGAGGCTGTGAGAGAAGGAGGC TCTTCCTTTAATCATCTCAAAGCTTTTGTCGATGAGCACATGTGA
>UGT76B1
ATGGAGACTAGAGAAACAAAACCAGTGATCTTTCTCTTCCCTTTCCCTTTACAAGGTCAC TTAAACCCAATGTTTCAGCTCGCCAACATCTTCTTCAACAGAGGCTTCTCCATCACTGTG ATCCACACTGAGTTCAACTCTCCAAACTCTTCCAATTTCCCTCATTTCACTTTCGTATCC ATCCCCGATAGCTTGTCTGAACCTGAATCCTATCCCGATGTCATCGAGATTCTCCATGAC CTCAATTCCAAGTGTGTTGCTCCTTTTGGTGATTGCTTAAAGAAGCTTATATCTGAAGAA CCAACAGCAGCTTGTGTGATTGTTGACGCTCTTTGGTACTTCACTCACGATTTAACCGAG AAATTCAATTTCCCGAGGATTGTTCTCCGAACCGTTAACCTCTCAGCTTTCGTCGCTTTC TCAAAGTTTCATGTTTTACGAGAGAAAGGGTATCTTTCTTTACAAGAGACTAAGGCAGAC TCACCGGTTCCGGAGCTTCCGTATCTTAGAATGAAGGATCTTCCATGGTTCCAGACAGAA GATCCAAGATCAGGGGATAAGTTACAGATAGGTGTGATGAAGTCACTAAAGTCTTCCTCA GGAATCATATTCAACGCCATTGAAGATCTTGAAACAGATCAGCTTGATGAAGCCCGCATA GAATTCCCAGTTCCACTCTTCTGTATTGGACCCTTTCACAGGTACGTTTCAGCTTCATCC AGTAGCTTACTTGCACACGACATGACTTGTCTCTCCTGGTTAGACAAGCAAGCAACAAAT TCCGTAATCTACGCAAGTCTTGGAAGCATTGCTTCGATCGATGAATCTGAATTCTTGGAG ATTGCTTGGGGTCTAAGAAACAGCAACCAACCTTTTCTATGGGTGGTTAGACCCGGTTTA ATCCACGGGAAAGAATGGATCGAGATTCTGCCTAAAGGGTTCATCGAAAATCTCGAGGGC CGGGGTAAAATAGTGAAATGGGCACCTCAGCCTGAAGTTTTAGCTCACCGTGCAACAGGC
GGATTCTTAACACATTGTGGATGGAACTCAACACTTGAGGGCATATGTGAAGCTATACCA ATGATATGCAGACCATCTTTTGGGGACCAGAGGGTGAATGCTAGATACATTAACGATGTT TGGAAGATCGGATTGCATTTGGAAAACAAGGTAGAGAGACTAGTGATCGAAAACGCGGTT AGAACACTAATGACGAGCTCGGAAGGGGAAGAGATCCGCAAGAGGATTATGCCCATGAAG GAAACTGTTGAACAATGCCTTAAGCTTGGAGGTTCATCATTTCGGAATCTCGAAAACTTA ATTGCTTATATATTGTCTTTCTAA
>UGT76C1
ATGGAGAAGAGAAACGAGAGACAAGTGATTCTTTTTCCTCTACCATTACAAGGTTGCATA AACCCTATGCTTCAGCTAGCAAAGATCCTTTACTCAAGAGGTTTTTCGATCACCATCATC CACACGCGCTTCAACGCGCCCAAATCTTCAGACCATCCTCTCTTCACTTTCTTACAAATC CGCGACGGCTTGTCTGAATCTCAGACTCAATCTCGTGATCTTTTGCTTCAACTCACGCTT CTCAACAACAATTGTCAGATCCCATTTCGAGAGTGTTTGGCTAAACTCATTAAACCTAGT TCAGATTCAGGAACAGAGGATAGGAAAATTAGCTGTGTGATCGATGATTCCGGTTGGGTT TTCACACAATCCGTGGCGGAGAGTTTTAATCTTCCTCGATTTGTCCTCTGTGCTTATAAG TTCTCTTTCTTTCTCGGACATTTTCTTGTTCCTCAGATTCGTCGTGAAGGGTTTCTTCCA GTACCAGATTCGGAGGCAGATGATCTAGTTCCTGAGTTTCCACCGCTTCGAAAGAAAGAT CTTTCGAGAATTATGGGAACCAGCGCTCAGAGTAAGCCTCTAGATGCTTACTTGCTTAAG ATACTCGACGCGACGAAGCCAGCTTCAGGGATTATAGTTATGTCCTGCAAAGAGCTTGAC CATGATTCACTTGCTGAGTCCAACAAAGTTTTCAGCATTCCGATATTTCCCATTGGCCCT TTTCACATTCATGACGTCCCAGCCTCGTCTAGCAGCTTGTTAGAACCGGACCAGAGTTGC ATTCCATGGTTAGATATGCGTGAAACGAGATCAGTAGTCTACGTGAGCTTAGGGAGCATT GCGAGTCTTAACGAGTCTGACTTCTTGGAGATTGCTTGTGGACTAAGAAACACCAACCAA TCCTTCTTGTGGGTTGTCCGGCCTGGTTCAGTCCATGGCAGAGATTGGATCGAATCATTA CCTTCAGGGTTCATGGAAAGTCTCGATGGTAAAGGAAAGATAGTGAGATGGGCACCGCAG CTAGACGTTCTTGCGCATAGAGCCACGGGAGGGTTTTTGACTCATAATGGATGGAACTCG ACATTAGAGAGTATATGCGAAGGAGTACCTATGATCTGCTTGCCTTGTAAGTGGGACCAA TTTGTAAACGCGAGATTCATAAGCGAAGTTTGGAGGGTTGGGATTCACTTGGAAGGTCGG ATAGAGCGAAGAGAAATCGAGAGAGCTGTTATAAGACTAATGGTTGAGTCGAAAGGAGAA GAGATTCGAGGTAGAATCAAAGTCTTGCGAGACGAAGTAAGAAGGTCAGTTAAACAAGGA GGTTCGTCATATCGATCTTTAGATGAGTTGGTTGATCGTATATCAATCATCATCGAGCCA CTAGTGCCTACGTGA
>UGT76C2
ATGGAGGAGAAGAGAAATGGTCTGCGTGTGATTCTCTTCCCTCTTCCATTACAAGGTTGC ATCAACCCTATGCTTCAGCTCGCCAACATCCTTCACGTAAGAGGCTTCTCCATTACCGTG ATCCACACGCGCTTCAACGCGCCAAAAGCTTCAAGCCATCCTCTCTTCACTTTCTTACAG ATTCCTGATGGTTTGTCTGAAACGGAGATTCAAGATGGTGTTATGTCTTTGCTCGCGCAA ATCAACCTTAACGCTGAGTCTCCGTTTCGTGATTGCTTGCGTAAAGTGTTGCTGGAATCA AAAGAGTCAGAGAGGGTTACTTGTTTGATCGATGACTGTGGATGGCTCTTCACACAATCT GTTTCAGAGAGTTTGAAGCTTCCGAGGCTCGTTCTCTGTACTTTTAAAGCCACTTTCTTC AATGCTTATCCGAGTCTTCCACTTATCCGAACCAAGGGATATCTTCCAGTTTCAGAATCG GAAGCAGAGGACTCTGTTCCTGAGTTCCCGCCGCTTCAAAAGAGAGATCTTTCAAAGGTT TTCGGGGAGTTCGGAGAGAAACTCGATCCGTTCTTACATGC'TGTAGTCGAAACGACAATA AGATCTTCAGGGTTAATATACATGTCCTGCGAAGAGCTTGAGAAAGATTCGTTGACTCTT TCTAACGAAATTTTTAAAGTTCCGGTTTTTGCAATTGGTCCGTTTCACAGCTACTTCTCT GCTTCGTCAAGCAGCTTGTTCACACAAGACGAGACTTGCATTCTGTGGTTAGATGATCAA GAAGATAAATCTGTGATCTACGTTAGTCTAGGAAGCGTTGTGAACATAACGGAAACAGAG TTCTTGGAGATTGCGTGTGGTTTAAGCAATAGCAAACAGCCTTTCTTGTGGGTAGTACGA CCCGGTTCAGTACTCGGCGCGAAATGGATCGAACCGCTCTCTGAAGGGCTGGTTAGTAGC CTTGAAGAGAAAGGAAAGATTGTGAAATGGGCACCACAACAGGAGGTTCTTGCGCATCGT GCCACAGGAGGGTTTTTGACACACAATGGTTGGAACTCAACGCTAGAGAGTATATGCGAA
GGGGTTCCTATGATCTGCCTACCAGGAGGTTGGGATCAAATGCTGAATTCAAGATTTGTT AGCGATATTTGGAAGATTGGAATTCACTTGGAAGGTCGGATTGAAAAAAAGGAGATTGAG AAAGCTGTGAGGGTGTTAATGGAGGAAAGTGAAGGAAATAAGATTCGTGAGAGAATGAAA GTTCTGAAAGATGAGGTCGAGAAATCGGTCAAACAAGGAGGCTCATCTTTTCAATCTATT GAGACTCTAGCTAATCATATACTATTGTTGTAA
>UGT76C3
ATGGATAAGAGTAATGGCCTACGAGTGATTCTGTTTCCACTTCCATTACAAGGATGCATC AACCCCATGATTCAGCTAGCGAAGATCCTCCACTCAAGAGGTTTCTCCATCACTGTGATC CACACGCGCTTCAATGCGCCAAAAGCTTCAAACCACCCTCTGTTCACCTTCTTACAGATC CCAGATGGCTTGTCTGAAACAGAGACAAGAACTCACGATATCACACTTCTCCTAACGCTT CTCAACCGAAGCTGTGAGTCTCCATTTCGTGAATGTTTGACTAAACTTTTGCAGTCTGCA GATTCAGAAACAGGGGAAGAGAAACAGAGGATTAGCTGTTTGATCGATGATTCTGGATGG ATATTCACACAGCCCGTTGCTCAGAGTTTCAATCTCCCGAGATTGGTCCTTAACACCTAC AAAGTCTCCTTCTTTCGGGACCATTTTGTTCTTCCTCAACTCCGTCGTGAAATGTATCTT CCATTACAAGATTCAGAACAAGGTGATGATCCAGTTGAGGAGTTTCCACCCCTTCGAAAG AAAGATCTTTTACAAATTCTTGATCAAGAATCGGAGCAACTAGACTCGTACTCCAATATG ATTTTGGAAACAACAAAAGCGTCTTCAGGTCTTATATTTGTATCCACATGTGAAGAGTTG GACCAAGACTCACTGAGTCAAGCACGTGAAGATTATCAAGTCCCAATCTTTACGATAGGA CCTTCTCATAGCTACTTCCCAGGCTCATCTAGTAGCTTGTTCACAGTGGACGAGACTTGC ATTCCATGGTTAGACAAGCAAGAAGACAAATCCGTGATTTACGTGAGTTTTGGGAGCATC TCGACCATTGGCGAAGCAGAATTCATGGAGATTGCTTGGGCTCTAAGAAACAGCGACCAA CCGTTCTTGTGGGTCGTACGGGGTGGTTCGGTAGTCCATGGTGCAGAATGGATCGAACAG CTTCATGAGAAAGGAAAGATAGTGAATTGGGCCCCACAACAAGAGGTTCTAAAGCATCAA GCCATTGGAGGATTCTTGACACACAATGGTTGGAACTCGACGGTTGAGAGTGTTTTTGAA GGCGTCCCTATGATATGTATGCCTTTTGTATGGGACCAATTGCTTAATGCAAGATTTGTT AGTGATGTATGGATGGTTGGGCTGCATCTAGAGGGTCGGATTGAGAGGAATGTGATTGAG GGAATGATAAGAAGATTATTTTCGGAAACTGAAGGAAAAGCGATCCGAGAGAGGATGGAA ATTCTTAAGGAGAATGTAGGAAGATCCGTTAAACCAAAAGGTTCGGCGTATCGATCGTTA CAACATTTGATTGATTATATAACATATTTCTAG
>UGT76C4
ATGGAGAAGAGTAATGGCCTGCGAGTGATTCTGTTTCCACTTCCATTACAAGGCTGCATC AACCCTATGATTCAGCTCGCCAAGATCCTCCACTCAAGAGGTTTTTCAATCACTGTGATC CACACTTGCTTCAACGCGCCAAAAGCTTCAAGCCATCCACTCTTCACCTTCATACAGATC CAAGATGGCTTGTCTGAAACAGAGACAAGAACTCGCGACGTCAAACTTCTCATAACACTT CTCAACCAAAATTGCGAGTCTCCGGTTCGTGAATGTTTGCGTAAACTGTTGCAATCTGCC AAGGAAGAGAAACAGAGGATTAGCTGTTTGATCAATGATTCTGGTTGGATCTTCACTCAA CACTTAGCCAAGAGTTTGAATCTCATGAGATTGGCCTTTAATACCTATAAGATCTCCTTC TTTCGAAGCCATTTTGTTCTTCCTCAGCTCCGGCGTGAAATGTTTCTTCCATTACAAGAT TCAGAACAAGATGATCCAGTTGAGAAGTTTCCACCGCTTAGAAAGAAAGATCTTTTACGG ATTCTTGAAGCAGATTCGGTGCAGGGAGACTCGTACTCGGATATGATTTTGGAAAAGACA AAGGCGTCTTCAGGTCTTATATTCATGTCCTGTGAAGAGTTGGACCAAGACTCACTGAGT CAATCACGTGAAGATTTTAAGGTTCCGATATTTGCGATAGGACCTTCTCATAGCCATTTT CCTGCTTCTTCTAGTAGCTTGTTCACACCGGACGAGACTTGCATCCCATGGTTAGACAGA CAAGAAGACAAATCCGTAATATACGTGAGTATTGGGAGCCTCGTGACCATCAACGAAACA GAGCTAATGGAGATTGCTTGGGGTCTAAGTAACAGCGACCAACCATTTTTATGGGTCGTC CGGGTTGGTTCAGTCAATGGCACGGAATGGATTGAAGCAATCCCGGAATATTTCATCAAA AGGCTTAATGAGAAGGGAAAGATAGTGAAATGGGCTCCACAACAAGAGGTTCTAAAGCAT CGAGCTATTGGAGGTTTCTTGACACATAATGGTTGGAACTCGACGGTTGAGAGTGTTTGT GAAGGCGTCCCTATGATCTGTTTGCCTTTTCGTTGGGACCAATTGTTAAATGCAAGATTT GTTAGTGATGTATGGATGGTTGGGATACATCTCGAGGGTCGGATTGAGAGGGATGAGATC
GAGAGAGCGATAAGGAGATTATTGTTGGAAACTGAAGGAGAAGCCATCCGAGAGAGGATA CAACTTCTTAAGGAAAAAGTAGGAAGATCAGTTAAACAAAACGGTTCGGCATATCAATCT CTACAAAATTTGATTAATTATATATCATCTTTCTAG
>UGT76C5
ATGGAGAAGAGTAATGGCCTTCGAGTGATTCTGTTTCCACTTCCATTACAAGGCTGCATC AACCCCATGATTCAGCTCGCCAAGATCCTCCACTCAAGAGGTTTCTCCATCACTGTGATC CACACGTGCTTCAACGCGCCAAAAGCTTCAAGCCATCCTCTCTTCACCTTCTTAGAGATC CCAGATGGCTTGTCCGAAACAGAGAAAAGAACTAACAATACCAAACTTCTCCTAACGCTT CTCAACCGGAACTGTGAGTCTCCGTTTCGTGAATGTTTGAGTAAACTGTTGCAGTCTGCA GATTCAGAAACAGGGGAAGAGAAACAGAGGATTAGCTGTTTGATCGCTGATTCTGGATGG ATGTTCACACAACCCATTGCTCAGAGTTTGAAACTCCCAATATTGGTCCTCAGTGTGTTT ACAGTCTCCTTCTTTCGCTGCCAATTTGTTCTTCCTAAGCTTCGGCGTGAAGTGTATCTT CCACTTCAAGATTCAGAACAGGAGGATCTAGTTCAAGAGTTTCCGCCGCTTCGAAAGAAG GATATTGTACGTATTCTTGATGTAGAAACAGATATACTAGATCCATTCTTGGACAAAGTT CTACAAATGACAAAGGCGTCTTCAGGTCTTATATTCATGTCATGTGAAGAGTTGGACCAC GACTCAGTGAGTCAGGCACGTGAAGATTTCAAAATTCCTATCTTTGGGATTGGACCATCT CACAGCCACTTTCCAGCTACCTCTAGTAGCTTGTCCACACCCGACGAGACTTGCATTCCA TGGTTAGACAAACAAGAAGACAAATCCGTGATTTACGTCAGTTACGGGAGCATCGTGACC ATCAGCGAATCAGATTTAATAGAGATTGCTTGGGGTCTAAGAAACAGCGACCAACCCTTC TTGTTGGTCGTACGGGTTGGTTCAGTCCGTGGCAGAGAATGGATCGAGACAATCCCGGAA GAGATCATGGAAAAGCTTAATGAGAAGGGAAAGATAGTGAAATGGGCTCCGCAACAAGAC GTTCTAAAGCATCGAGCCATTGGGGGATTCCTGACACATAATGGTTGGAGCTCGACTGTT GAGAGTGTTTGTGAAGCAGTCCCTATGATCTGTTTGCCTTTTCGTTGGGACCAAATGCTA AATGCAAGATTTGTTAGCGATGTATGGATGGTCGGGATAAACCTAGAGGATCGGGTTGAA AGGAATGAGATCGAGGGAGCGATAAGGAGATTATTGGTGGAACCTGAAGGAGAAGCCATC CGAGAGAGGATAGAACATCTTAAGGAGAAAGTAGGACGATCGTTTCAACAAAACGGTTCC GCATATCAATCGTTACAAAATTTGATTGATTATATATCATCTTTTTAG
>UGT76D1
ATGGCAGAGATTCGCCAGAGAAGAGTGTTGATGGTCCCAGCACCGTTCCAAGGCCATTTA CCTTCGATGATGAATCTAGCGTCCTACCTTTCTTCCCAAGGCTTTTCAATCACAATCGTT AGAAACGAATTCAATTTCAAAGATATCTCCCATAATTTCCCTGGTATAAAATTCTTCACC ATCAAGGACGGCTTGTCAGAATCTGACGTGAAGTCTCTGGGTCTCCTTGAATTTGTCCTG GAGCTTAACTCTGTCTGTGAACCCCTATTGAAAGAGTTTCTAACCAACCATGATGATGTT GTTGACTTTATCATTTATGATGAATTTGTTTACTTCCCTCGACGTGTTGCGGAAGATATG AATCTGCCAAAGATGGTCTTTAGCCCTTCTTCCGCCGCTACCTCGATCAGCCGGTGTGTG CTTATGGAGAACCAATCAAATGGGTTACTTCCTCCACAAGACGCAAGATCTCAACTAGAA GAAACGGTGCCAGAGTTTCATCCCTTTCGTTTCAAAGATCTGCCTTTTACAGCTTATGGA TCTATGGAGAGATTAATGATACTTTACGAGAATGTAAGCAATAGAGCCTCATCTTCTGGC ATAATACACAACTCTTCGGATTGCTTAGAGAACTCATTCATAACAACTGCACAAGAGAAA TGGGGAGTTCCGGTATACCCGGTTGGTCCACTCCATATGACCAATTCCGCAATGTCATGT CCAAGTTTATTTGAAGAAGAAAGAAACTGTCTTGAATGGCTTGAGAAGCAAGAAACAAGC TCAGTGATCTACATAAGCATGGGGAGCTTGGCGATGACACAAGATATAGAGGCTGTGGAG ATGGCCATGGGATTTGTCCAGAGTAATCAACCCTTCTTGTGGGTGATCCGACCAGGCTCT ATAAACGGACAAGAATCTTTAGACTTCTTACCGGAACAGTTCAACCAAACGGTGACCGAT GGAAGAGGTTTTGTTGTGAAATGGGCCCCACAAAAAGAGGTATTAAGGCATAGAGCAGTG GGAGGGTTTTGGAACCATGGTGGATGGAACTCGTGCTTGGAGAGCATAAGCAGTGGTGTA CCAATGATTTGTAGGCCGTATTCTGGTGATCAGAGGGTGAATACTCGACTTATGTCACAT GTTTGGCAAACCGCGTATGAGATCGAAGGTGAATTGGAAAGAGGAGCTGTTGAGATGGCC GTGAGGAGGCTCATTGTGGATCAAGAAGGTCAGGAGATGAGAATGAGAGCCACCATATTG AAGGAAGAGGTTGAAGCCTCTGTCACAACCGAAGGCTCTTCTCACAATTCTTTAAACAAT
TTGGTCCATGCAATAATGATGCAAATTGACGAACAATGA
>UGT76E1
ATGGAAGAACTAGGAGTGAAGAGAAGGATAGTATTGGTTCCAGTTCCAGCACAAGGTCAT GTAACTCCGATTATGCAACTCGGGAAGGCTCTTTACTCCAAGGGCTTCTCCATCACTGTT GTTCTCACACAGTATAATCGAGTTAGCTCATCCAAGGACTTCTCTGATTTTCATTTCCTC ACCATCCCAGGCAGCTTGACCGAGTCTGATCTCAAAAACCTTGGACCATTCAAGTTTCTC TTCAAGCTCAATCAAATTTGCGAGGCAAGCTTCAAGCAATGTATTGGTCAACTATTGCAG GAGCAAGGTAATGATATCGCTTGTGTCGTCTACGATGAGTACATGTACTTCTCCCAAGCT GCAGTTAAAGAGTTTCAACTTCCTAGCGTCCTCTTCAGCACGACAAGTGCTACTGCCTTT GTCTGTCGCTCTGTTTTGTCTAGAGTCAACGCAGAGTCATTCTTGCTTGACATGAAAGAT CCCAAAGTGTCAGACAAGGAATTTCCAGGGTTGCATCCGCTAAGGTACAAGGACCTGCCA ACTTCAGCATTTGGGCCATTAGAGAGTATACTCAAGGTTTACAGTGAGACTGTCAACATT CGAACAGCTTCGGCAGTTATCATCAACTCAACAAGCTGTCTAGAGAGCTCATCTTTGGCA TGGTTACAAAAACAACTGCAAGTTCCAGTGTATCCTATAGGCCCACTTCACATTGCAGCT TCAGCGCCTTCTAGTTTACTTGAAGAGGACAGGAGTTGCCTTGAGTGGTTGAACAAGCAA AAAATAGGCTCAGTGATTTACATAAGTTTGGGAAGCTTGGCTCTAATGGAAACTAAAGAC ATGTTGGAGATGGCTTGGGGTTTACGTAATAGCAACCAACCTTTCTTATGGGTGATCCGA CCGGGTTCTATTCCCGGCTCGGAATGGACAGAGTCTTTACCGGAGGAATTCAGTAGGTTG GTTTCAGAAAGAGGTTACATTGTGAAATGGGCACCACAGATAGAAGTTCTCAGACATCCT GCAGTGGGAGGGTTTTGGAGTCACTGCGGATGGAACTCGACCCTAGAGAGCATCGGGGAA GGAGTTCCGATGATCTGTAGGCCTTTTACGGGAGATCAGAAAGTCAATGCGAGGTACTTA GAGAGAGTTTGGAGAATTGGGGT'TCAATTGGAAGGAGAGCTGGATAAAGGAACAGTGGAG AGAGCTGTAGAGAGATTGATTATGGATGAAGAAGGAGCAGAAATGAGGAAGAGAGTTATC AACTTGAAAGAGAAGCTTCAAGCCTCTGTCAAGAGTAGAGGTTCCTCATTCAGCTCATTA GACAACTTTGTCAATTCCTTAAAAATGATGAATTTCATGTAG
>UGT76E11
ATGGAGGAAAAGCCGGCGGGCAGAAGAGTAGTGTTGGTTGCAGTTCCAGCTCAAGGACAT ATCTCTCCAATAATGCAACTTGCAAAAACACTTCACTTGAAGGGTTTCTCAATCACAATC GCTCAGACAAAGTTCAATTACTTTAGCCCTTCAGATGACTTCACTGATTTTCAGTTTGTC ACCATTCCAGAAAGCTTACCAGAGTCTGATTTTGAGGATCTCGGGCCAATAGAGTTTCTG CATAAGCTCAACAAAGAGTGTCAGGTGAGCTTCAAAGACTGTTTGGGTCAGTTGTTGCTG CAACAAGGTAATGAGATAGCCTGTGTTGTCTACGACGAGTTCATGTACTTTGCTGAAGCT GCAGCCAAAGAGTTTAAGCTTCCAAACGTCATTTTCAGCACCACAAGTGCCACGGCTTTT GTTTGCCGCTCTGCATTCGACAAACTTTATGCAAACAGTATCCTGACTCCCTTGAAAGAA CCCAAAGGACAACAAAACGAGCTAGTGCCAGAGTTTCATCCCCTGAGATGCAAAGACTTT CCGGTTTCACATTGGGCATCATTAGAAAGCATGATGGAGCTGTATAGGAATACAGTTGAC AAACGGACAGCTTCCTCGGTGATAATCAACACAGCGAGCTGTCTAGAGAGCTCATCTCTG TCTCGTCTGCAGCAACAGCTACAAATTCCAGTTTATCCTATAGGCCCTCTTCACCTGGTG GCATCAGCTTCTACGAGTCTTCTTGAAGAGAACAAGAGCTGTATTGAATGGTTGAACAAA CAAAAGAAAAACTCTGTGATATTCGTAAGCTTGGGAAGCTTAGCTTTGATGGAAATCAAT GAGGTGATAGAAACTGCTTTGGGATTGGATAGTAGCAAGCAACAGTTCTTGTGGGTCATT CGGCCAGGGTCAGTACGTGGTTCGGAATGGATAGAGAACTTGCCTAAGGAGTTTAGTAAG ATAATTTCGGGTCGAGGTTACATTGTGAAATGGGCTCCACAGAAGGAAGTACTTTCTCAT CCTGCAGTAGGAGGATTTTGGAGCCATTGCGGATGGAACTCGACACTAGAGAGCATCGGG GAAGGAGTTCCAATGATTTGCAAGCCGTTTTCCAGTGATCAAATGGTGAATGCGAGATAC TTGGAGTGTGTATGGAAAATTGGGATTCAAGTTGAGGGTGATCTAGACAGAGGAGCGGTC GAGAGAGCTGTGAGGAGGTTAATGGTGGAGGAAGAAGGGGAGGGGATGAGGAAGAGAGCT ATCAGTTTGAAAGAGCAACTTAGAGCCTCTGTTATAAGTGGAGGTTCTTCACACAACTCG CTAGAGGAGTTTGTACACTACATGAGGACTCTATGA
>UGT76E12
ATGGAGGAAAAGCCTGCAAGGAGAAGCGTAGTGTTGGTTCCATTTCCAGCACAAGGACAT ATATCTCCAATGATGCAACTTGCCAAAACCCTTCACTTAAAGGGTTTCTCGATCACAGTT GTTCAGACTAAGTTCAATTACTTTAGCCCTTCAGATGACTTCACTCATGATTTTCAGTTC GTCACCATTCCAGAAAGCTTACCAGAGTCTGATTTCAAGAATCTCGGACCAATACAGTTT CTGTTTAAGCTCAACAAAGAGTGTAAGGTGAGCTTCAAGGACTGTTTGGGTCAGTTGGTG CTGCAACAAAGTAATGAGATCTCATGTGTCATCTACGATGAGTTCATGTACTTTGCTGAA GCTGCAGCCAAAGAGTGTAAGCTTCCAAACATCATTTTCAGCACAACAAGTGCCACGGCT TTCGCTTGCCGCTCTGTATTTGACAAACTATATGCAAACAATGTCCAAGCTCCCTTGAAA GAAACTAAAGGACAACAAGAAGAGCTAGTTCCGGAGTTTTATCCCTTGAGATATAAAGAC TTTCCAGTTTCACGGTTTGCATCATTAGAGAGCATAATGGAGGTGTATAGGAATACAGTT GACAAACGGACAGCTTCCTCGGTGATAATCAACACTGCGAGCTGTCTAGAGAGCTCATCT CTGTCTTTTCTGCAACAACAACAGCTACAAATTCCAGTGTATCCTATAGGCCCTCTTCAC ATGGTGGCCTCAGCTCCTACAAGTCTGCTTGAAGAGAACAAGAGCTGCATCGAATGGTTG AACAAACAAAAGGTAAACTCGGTGATATACATAAGCATGGGAAGCATAGCTTTAATGGAA ATCAACGAGATAATGGAAGTCGCGTCAGGATTGGCTGCTAGCAACCAACACTTCTTATGG GTGATCCGACCAGGGTCAATACCTGGTTCCGAGTGGATAGAGTCCATGCCTGAAGAGTTT AGTAAGATGGTTTTGGACCGAGGTTACATTGTGAAATGGGCTCCACAGAAGGAAGTACTT TCTCATCCTGCAGTAGGAGGGTTTTGGAGCCATTGTGGATGGAACTCGACACTAGAAAGC ATCGGCCAAGGAGTTCCAATGATCTGCAGGCCATTTTCGGGTGATCAAAAGGTGAACGCT AGATACTTGGAGTGTGTATGGAAAATTGGGATTCAAGTGGAGGGTGAGCTAGACAGAGGA GTGGTCGAGAGAGCTGTGAAGAGGTTAATGGTTGACGAAGAAGGAGAGGAGATGAGGAAG AGAGCTTTCAGTTTAAAAGAGCAACTTAGAGCCTCTGTTAAAAGTGGAGGCTCTTCACAC AACTCGCTAGAAGAGTTTGTACACTTCATAAGGACTCTATGA
>UGT76E2
ATGGAGGAAAAGCAAGTGAAGGAGACAAGGATAGTGTTGGTTCCAGTTCCAGCTCAAGGT CATGTAACTCCGATGATGCAACTAGGAAAAGCTCTTCACTCAAAGGGTTTCTCCATCACT GTTGTTCTGACACAGTCTAATCGAGTTAGCTCTTCCAAAGACTTCTCTGATTTCCATTTC CTCACCATCCCAGGCAGCTTAACTGAGTCTGATCTCCAAAACCTAGGACCACAAAAGTTT GTGCTCAAGCTCAATCAAATTTGTGAGGCAAGCTTCAAGCAGTGTATAGGTCAACTATTG CATGAACAATGTAATAATGATATTGCTTGTGTCGTCTACGATGAGTACATGTACTTCTCT CATGCTGCAGTAAAAGAGTTTCAACTTCCTAGTGTCGTCTTTAGCACGACAAGTGCTACT GCTTTTGTCTGTCGCTCTGTTTTGTCTAGAGTCAACGCAGAGTCGTTCTTGATCGACATG AAAGATCCTGAAACACAAGACAAAGTATTTCCAGGGTTGCATCCTCTGAGGTACAAGGAT CTACCAACTTCAGTATTTGGGCCAATAGAGAGTACGCTCAAGGTTTACAGTGAGACTGTG AACACTCGAACAGCTTCCGCTGTTATCATCAACTCAGCAAGCTGTTTAGAGAGCTCATCT TTGGCAAGGTTGCAACAACAACTGCAAGTTCCGGTGTATCCTATAGGCCCACTTCATATT ACAGCTTCAGCGCCTTCTAGTTTACTAGAAGAAGACAGGAGTTGCGTTGAGTGGTTGAAC AAGCAAAAATCAAATTCAGTTATTTACATAAGCTTGGGAAGCTTGGCTCTAATGGACACC AAAGACATGTTGGAGATGGCTTGGGGATTAAGTAATAGCAACCAACCTTTCTTATGGGTG GTCAGACCGGGCTCTATTCCGGGGTCAGAATGGACAGAGTCCTTACCAGAGGAATTCAAT AGGTTGGTTTCAGAAAGAGGTTACATTGTGAAATGGGCTCCGCAGATGGAAGTTCTCAGA CATCdTGCAGTAGGAGGGTTTTGGAGTCACTGTGGATGGAACTCAACAGTAGAGAGCATC GGGGAAGGAGTTCCGATGATATGTAGGCCTTTCACCGGGGATCAGAAAGTCAATGCGAGG TACTTAGAGAGAGTTTGGAGAATTGGGGTTCAATTGGAGGGAGATCTGGATAAAGAAACT GTGGAGAGAGCTGTAGAGTGGTTGCTTGTGGATGAAGAAGGAGCAGAAATGAGGAAGAGA GCCATTGACTTGAAAGAAAAGATTGAAACCTCTGTTAGAAGTGGAGGTTCCTCATGCAGC TCACTAGACGACTTTGTTAATTCCATGTGA
>UGT76E3
ATGGAGAAAAGAGTAGAGAAGAGAAGGATAGTGTTGGTTCCACTTCCATTACTAGGACAT TTCACTCCGATGATGCAACTCGGCCAAGCCCTTATCTTGAAGGGATTCTCAATTATAGTT CCTCAGGGAGAATTCAATCGAGTAAACTCTTCGCAGAAGTTCCCTGGTTTTCAATTTATC ACCATACCAGATTCTGAACTCGAGGCAAATGGACCAGTCGGGTCTCTAACACAGCTCAAC AAAATTATGGAGGCAAGCTTCAAGGACTGTATAAGGCAGTTGTTGAAACAACAAGGCAAT GATATTGCATGTATCATCTACGACGAGTTCATGTATTTTTGTGGAGCCGTAGCTGAGGAG TTGAAGCTTCCCAATTTCATCTTCAGTACTCAAACTGCTACACATAAAGTTTGCTGCAAT GTTTTAAGCAAACTTAATGCCAAGAAGTACTTGATCGACATGGAAGAGCATGACGTGCAA AACAAGGTAGTGGAAAATATGCATCCATTAAGATACAAAGACTTACCAACTGCAACATTT GGAGAACTAGAACCTTTTTTGGAGCTCTGTAGAGATGTAGTCAACAAAAGAACAGCCTCT GCTGTTATCATCAACACCGTGACCTGTCTAGAGAGCTCGTCTCTCACAAGGCTGCAACAA GAACTCCAAATTCCGGTGTATCCATTAGGCCCTCTTCACATTACAGATTCATCGACAGGA TTTACTGTGCTGCAAGAGGATAGGAGCTGCGTTGAATGGCTGAACAAGCAGAAACCAAGG TCTGTCATATACATAAGTTTAGGAAGCATGGTTCTCATGGAAACCAAGGAGATGTTAGAG ATGGCTTGGGGAATGTTGAATAGCAACCAACCTTTCTTATGGGTCATCCGACCTGGATCT GTCTCAGGCTCCGAGGGGATAGAGTCATTGCCAGAGGAAGTCAGTAAGATGGTTTTAGAG AAAGGATACATTGTGAAATGGGCACCACAAATAGAAGTACTAGGACATCCCTCAGTGGGA GGCTTTTGGAGCCACTGTGGATGGAACTCAACACTCGAGAGCATTGTGGAAGGAGTTCCA ATGATTTGCAGGCCTTATCAAGGCGAGCAGATGTTAAATGCAATATATCTAGAGAGTGTA TGGAGAATAGGGATTCAGGTAGGAGGTGAACTGGAAAGAGGAGCCGTCGAGAGAGCTGTG AAGAGGTTGATTGTGGATAAAGAAGGTGCAAGCATGAGGGAGAGAACCCTTGTTTTAAAA GAGAAGCTCAAAGCCTCTATTAGAGGTGGAGGCTCCTCATGCAATGCATTAGATGAGCTT GTCAAGCACTTGAAGACAGAGTGA
>UGT76E4
ATGGAGAAAAGGGTAGAGAAGAGAAGGATTGTGTTAGTTCCGGTTGCTGCACAAGGACAT GTAACCCCAATGATGCAGCTTGGGAAAGCCCTTCAATCAAAGGGCTTCTTAATTACTGTT GCTCAGAGACAGTTCAATCAAATAGGCTCATCATTGCAACACTTTCCTGGTTTTGACTTT GTCACCATACCAGAAAGCTTACCTCAGTCTGAATCTAAGAAACTAGGACCAGCTGAGTAT CTTATGAATCTCAACAAAACAAGCGAGGCAAGCTTCAAGGAGTGTATAAGTCAGTTATCG ATGCAACAAGGCAATGATATAGCATGTATCATCTATGACAAGCTTATGTACTTCTGTGAA GCAGCAGCTAAGGAGTTTAAGATTCCTAGTGTTATCTTCAGCACTAGCAGTGCTACAATT CAAGTTTGCTACTGTGTTTTAAGTGAACTCAGTGCCGAGAAGTTCTTGATCGACATGAAA GATCCTGAAAAGCAAGATAAGGTGTTGGAAGGTTTGCATCCTTTAAGGTACAAAGACCTA CCAACTTCAGGATTTGGACCATTAGAGCCACTTTTGGAGATGTGTAGGGAAGTAGTTAAC AAAAGAACAGCTTCCGCTGTTATCATCAACACGGCGAGCTGTCTAGAGAGCTTGTCTCTG TCATGGCTGCAACAAGAACTTGGAATTCCAGTGTATCCATTAGGCCCTCTTCACATTACA GCTTCATCGCCGGGACCTAGTTTACTGCAAGAGGACATGAGCTGCATTGAATGGCTGAAC AAGCAGAAACCAAGGTCAGTCATATACATAAGCTTGGGAACCAAAGCTCACATGGAGACC AAGGAGATGTTAGAGATGGCCTGGGGATTGTTGAATAGCAACCAACCTTTCTTATGGGTC ATCCGACCTGGCTCTGTTGCAGGCTTCGAGTGGATAGAGTTATTACCAGAGGAAGTCATT AAGATGGTAACAGAAAGAGGATACATAGCGAAATGGGCACCGCAGATAGAAGTACTTGGA CATCCTGCAGTGGGAGGATTCTGGAGCCACTGTGGATGGAACTCAACACTCGAGAGTATT GTGGAAGGAGTCCCAATGATTTGCAGGCCTTTACAAGGCGAACAAAAGTTAAATGCGATG TATATAGAAAGTGTTTGGAAAATAGGGATTCAACTTGAAGGTGAAGTGGAAAGGGAAGGT GTAGAGAGAGCTGTGAAGAGGTTGATCATAGATGAAGAAGGTGCAGCCATGAGGGAGAGG GCTCTTGATTTAAAAGAGAAGCTCAATGCCTCGGTAAGAAGTGGAGGCTCCTCATACAAC GCACTGGATGAGCTTGTCAAGTTCTTGAATACAGAGTGA
>UGT76E5
ATGGAGAAAAATGCAGAGAAGAAAAGAATAGTGTTGGTTCCATTTCCATTACAAGGACAT ATCACTCCAATGATGCAACTTGGTCAAGCACTTAACCTGAAAGGCTTCTCGATTACCGTT GCTCTTGGAGATTCCAATCGAGTAAGTTCTACGCAACACTTCCCTGGTTTTCAATTTGTC ACAATACCTGAAACCATACCACTATCTCAACACGAGGCACTCGGAGTTGTCGAGTTTGTG GTTACGCTCAACAAAACAAGCGAGACAAGTTTCAAGGACTGTATAGCTCATTTGTTGCTG CAACATGGAAATGATATTGCTTGTATCATTTACGACGAGCTCATGTACTTCTCTGAAGCT ACAGCTAAGGATTTAAGGATTCCTAGTGTCATATTCACCACTGGTAGTGCTACAAATCAT GTTTGTTCTTGTATTTTAAGCAAACTCAACGCCGAGAAGTTCTTGATCGACATGAAAGAT CCTGAAGTGCAAAACATGGTGGTGGAAAATTTACATCCACTAAAATACAAAGACTTACCA ACTTCAGGAATGGGGCCGCTAGAGCGATTTTTGGAGATTTGTGCCGAAGTTGTCAACAAA AGAACAGCTTCCGCTGTTATAATCAATACGTCAAGTTGTCTAGAGAGCTCGTCTCTGTCA TGGCTGAAACAAGAACTCAGTATTCCAGTGTATCCATTAGGCCCTCTTCACATTACAACT TCAGCAAATTTTAGTTTACTTGAAGAGGACAGGAGCTGCATTGAATGGCTGAACAAGCAG AAACTGAGGTCAGTTATATACATAAGCGTAGGAAGCATAGCTCACATGGAAACCAAGGAA GTATTGGAGATGGCTTGGGGATTGTATAATAGCAACCAACCTTTTCTATGGGTAATCCGA CCCGGTACAGAGTCAATGCCAGTGGAAGTCAGTAAGATTGTCTCGGAAAGAGGATGCATT GTGAAATGGGCGCCACAGAATGAAGTACTTGTGCATCCTGCAGTGGGAGGTTTCTGGAGC CACTGTGGATGGAACTCAACACTCGAGAGTATTGTGGAAGGAGTTCCAATGATTTGCAGA CCGTTTAACGGTGAGCAGAAGTTAAACGCGATGTATATAGAAAGTGTTTGGAGAGTAGGG GTTCTGCTTCAAGGAGAAGTGGAGAGAGGATGTGTAGAGAGAGCTGTGAAGAGGTTGATT GTGGATGATGAAGGTGTAGGAATGAGGGAGAGAGCCCTTGTTTTAAAAGAGAAGCTCAAT GCCTCTGTAAGAAGTGGAGGCTCTTCATACAATGCATTGGATGAGCTCGTCCATTACTTG GAGGCAGAGTATAGAAATACTTGA
>UGT76E6
ATGGAGAAAATGGAAGAGAAGAAAAGGATAGTGTTAGTTCCGGTTCCAGCACAAAGACAT GTAACTCCAATGATGCAGCTTGGCACAGCCCTAAACATGAAGGGCTTCTCTATTACTGTT GTTGAAGGACAGTTCAATAAAGTAAGCTCATCTCAAAACTTTCCTGGTTTTCAATTTGTA ACCATACCAGATACAGAGAGCTTGCCAGAGTCTGTGCTCGAGAGACTCGGACCGGTCGAG TTTTTATTCGAGATCAACAAAACCAGTGAGGCAAGCTTCAAGGACTGTATAAGGCAGTCG TTGCTGCAACAAGGCAATGATATAGCATGTATCATCTACGACGAGTATATGTACTTCTGT GGAGCTGCAGCTAAGGAGTTCAACCTTCCTAGTGTAATATTCAGCACACAAAGTGCTACT AATCAAGTTTCCCGTTGCGTTTTAAGAAAACTCAGTGCCGAGAAGTTCTTGGTGGACATG GAAGGTATCCTGAAGTGCAGGAAACGTTGGTGGAAAATTTGCATCCATTAAGATACAAAG ACCTACCAACTTCAGGAGTTGGGCCACTAGATCGATTATTTGAGCTCTGTAGGGAAATAG TCAACAAAAGAACAGCTTCCGCTGTTATCATCAACACAGTGAGATGTCTAGAGAGCTCGT CTCTGAAACGTCTGCAACATGAACTCGGGATTCCGGTGTACGCATTAGGCCCTCTTCACA TTACAGTTTCAGCAGCTTCTAGTTTACTGGAAGAGGACAGGAGCTGCGTTGAATGGTTGA ACAAGCAAAAACCGAGGTCAGTCGTTTACATAAGCTTGGGGAGCGTAGTTCAAATGGAAA CCAAAGAAGTGTTAGAGATGGCTCGGGGTTTATTTAATAGCAACCAGCCTTTCTTATGGG TCATTCGGCCTGGCTCTATCGCAGGCTCCGAATGGATAGAGTCACTGCCAGAGGAAGTCA TTAAGATGGTCTCCGAAAGAGGGTATATTGTGAAATGGGCACCACAGATAGAAGTACTTG GACATCCTGCAGTGGGAGGATTCTGGAGCCACTGTGGATGGAACTCAACGCTTGAAAGCA TTGTGGAAGGAGTTCCAATGATATGCAGGCCCTTTCATGGCGAGCAAAAGTTAAACGCAC TGTGTTTAGAGAGTATTTGGAGAATAGGGTTTCAGGTGCAAGGTAAGGTAGAGAGGGGAG GGGTCGAGAGAGCTGTGAAGAGGTTGATAGTGGATGAAGAAGGTGCAGACATGAGAGAGA GAGCCCTTGTTTTAAAAGAGAATCTCAAAGCCTCTGTAAGAAATGGAGGCTCCTCATACA ACGCATTGGAGGAGATCGTTAACCTCATGTAG
>UGT76E7
ATGGAGGAGAAGCTCTCGAGGAGAAGAAGAGTAGTGTTGGTTCCAGTTCCAGCTCAAGGA CATATAACTCCAATGATACAACTTGCAAAAGCACTTCACTCAAAAGGCTTCTCTATTACA
GTTGTTCAAACCAAGTTCAACTACTTAAACCCTTCAAATGATTTGTCTGATTTTCAGTTT GTAACCATCCCAGAGAACTTACCAGTGTCTGATCTTAAGAATCTAGGACCAGGACGGTTT CTGATTAAGCTAGCTAATGAGTGTTATGTTAGCTTTAAGGATTTGTTAGGTCAGTTGTTG GTTAATGAAGAAGAAGAGATCGCTTGTGTTATCTACGACGAGTTCATGTACTTTGTTGAA GTAGCAGTTAAAGAGTTTAAGCTTCGTAATGTTATTTTAAGTACTACAAGTGCAACGGCT TTTGTTTGTCGCTTTGTTATGTGTGAACTCTATGCTAAAGATGGTTTGGCTCAACTTAAA GAAGGCGGTGAGCGAGAAGTGGAGTTAGTACCGGAGTTGTATCCTATACGGTACAAAGAT TTACCAAGTTCGGTATTTGCATCTGTAGAATCTTCAGTGGAGTTGTTTAAGAATACATGT TATAAAGGGACAGCTTCCTCTGTGATAATCAACACAGTGAGGTGTCTAGAGATGTCATCT TTGGAGTGGCTTCAACAAGAACTTGAAATCCCGGTGTATTCTATAGGCCCGCTTCATATG GTGGTGTCAGCTCCTCCTACGAGTCTTTTAGAAGAGAACGAGAGCTGTATAGAATGGTTG AACAAACAAAAGCCGAGCTCGGTGATATACATAAGCTTGGGAAGTTTTACTTTGATGGAA ACTAAAGAAATGTTGGAGATGGCTTATGGGTTTGTTAGTAGTAACCAACACTTCTTGTGG GTGATTCGACCGGGATCTATATGTGGTTCTGAAATCTCTGAGGAAGAGTTGTTGAAGAAG ATGGTAATTACGGATCGAGGTTACATTGTGAAATGGGCGCCGCAAAAACAAGTGCTTGCA CATTCTGCGGTTGGAGCGTTCTGGAGTCATTGTGGATGGAACTCGACTTTAGAAAGTCTT GGTGAAGGAGTTCCATTGATATGTAGGCCTTTTACTACTGATCAAAAGGGGAATGCAAGG TACTTGGAGTGTGTGTGGAAAGTAGGAATTCAAGTGGAGGGTGAGCTAGAGAGAGGCGCA ATCGAGAGAGCTGTGAAGAGGTTAATGGTGGATGAAGAAGGAGAAGAGATGAAGAGAAGA GCTCTAAGTTTAAAAGAGAAACTCAAAGCCTCTGTTTTAGCTCAAGGTTCTTCACATAAA TCACTAGATGACTTCATCAAGACTCTGTGA
>UGT76E9
ATGGAGGAAAAGCAAGAGAGGAGGAGAAGGATCGTGTTGATTCCCGCTCCAGCACAAGGA CACATATCTCCGATGATGCAACTTGCAAGAGCCCTTCACTTAAAGGGCTTCTCCATTACA GTTGCTCAAACCAAGTTCAATTACTTGAAGCCTTCAAAAGACTTAGCTGATTTTCAGTTT ATCACCATCCCAGAGAGCTTACCAGCCTCGGATCTTAAGAATCTAGGACCAGTTTGGTTT CTTCTTAAACTCAATAAAGAGTGTGAGTTTAGCTTCAAGGAGTGTTTAGGTCAATTGTTG CTGCAAAAACAACTTATACCGGAAGAAGAGATCGCTTQTGTCATCTACGACGAGTTCATG TACTTTGCTGAAGCTGCAGCCAAAGAGTTTAACCTTCC'CAAAGTTATTTTCAGTACCGAA AATGCGACGGCTTTTGCTTGTCGCTCTGCCATGTGCAAACTCTATGCAAAAGATGGTTTG GCTCCCCTTAAAGAAGGATGTGGGCGAGAAGAGGAGCTAGTGCCAAAGTTGCATCCCCTT AGATACAAAGACCTACCAACTTCAGCATTTGCACCAGTAGAAGCCTCAGTGGAAGTGTTT AAAAGTTCATGTGATAAAGGGACAGCTTCCGCTATGATAATCAACACAGTGAGGTGTCTA GAGATATCATCCTTGGAGTGGCTTCAACAAGAACTTAAGATTCCGATATATCCTATAGGC CCTCTTCACATGGTTTCTTCAGCTCCTCCTACGAGTCTACTAGACGAGAATGAGAGTTGC ATTGATTGGCTGAACAAACAAAAGCCGAGCTCGGTGATTTACATAAGTTTGGGAAGCTTT ACTTTGTTGGAAACTAAAGAAGTGTTGGAAATGGCTTCGGGCTTGGTTAGTAGTAACCAA CACTTCTTGTGGGTGATTCGACCCGGGTCCATACTTGGTTCTGAATTGACTAATGAGGAA TTATTGAGTATGATGGAAATACCGGATCGAGGCTACATTGTGAAATGGGCTCCACAAAAG CAAGTGCTTGCACATTCTGCGGTTGGAGCATTTTGGAGTCATTGTGGATGGAACTCGACT CTAGAGAGCATGGGTGAAGGAGTTCCGATGATTTGTAGGCCTTTTACTACTGATCAAAAG GTAAATGCGCGGTATGTGGAGTGTGTCTGGAGAGTTGGGGTTCAAGTGGAGGGTGAACTA AAGAGAGGAGTAGTCGAGAGAGCTGTGAAGAGGTTACTGGTGGATGAAGAAGGAGAAGAG ATGAAGTTGAGAGCTCTCAGTTTGAAAGAGAAACTCAAAGTTTCTGTTCTACCGGGAGGT TCTTCACACAGTTCACTAGATGACTTAATCAAGACTCTATGA
>UGT76F1-
ATGGAAGAGAGAAAAGTGAAGAGAATTATCATGTTCCCTCTACCGTTTACAGGACACTTC AACCCTATGATCGAGCTTGCTGGAATATTCCACAACCGTGGCTTCTCCGTCACGATACTC CACACTTCTTTCAACTTCCCGGATCCTTCTCGCCATCCACAGTTTACTTTTCGAACTATC ACTCACAAAAACGAAGGAGAAGAAGACCCTCTCTCTCAATCAGAAACTTCTTCGGGTAAG
GACCTCGTCGTCCTTATTAGTCTGCTGAAACAATACTACACCGAGCCGTCTCTTGCAGAG GAAGTAGGCGAAGGAGGGACGGTGTGTTGTTTGGTCTCCGACGCTCTATGGGGGAGGAAC ACGGAGATTGTAGCGAAAGAGATTGGAGTGTGTACAATGGTGATGAGGACTAGTGGTGCG GCAACGTTTTGTGCTTATACAGCTTTCCCTCTCCTTATAGATAAGGGTTACCTTCCTATA CAAGGTTCTAGATTAGATGAGCTAGTGACAGAGCTTCCACCTTTGAAAGTGAAGGATCTT CCTGTAATAAAAACGAAAGAGCCTGAGGGACTAAACCGAATACTTAACGACATGGTGGAA GGAGCCAAGTTATCTTCCGGAGTCGTATGGAACACATTTGAAGATCTTGAAAGACATTCA CTCATGGATTGTCGCAGCAAGTTACAAGTTCCGTTGTTCCCAATCGGACCGTTTCACAAA CATAGAACCGATCTTCCACCGAAGCCAAAGAACAAGGACAAGGACGATGATGAAATATTA ACCGATTGGCTTAACAAGCAAGCTCCGCAGTCTGTGGTCTATGTGAGTTTTGGAAGCCTT GCAGCTATAGAAGAGAATGAGTTTTTCGAAATTGCTTGGGGTCTAAGAAACAGCGAACTA CCATTCTTGTGGGTGGTTAGGCCCGGGATGGTCCGGGGAACCGAGTGGCTTGAGTCATTG CCTTGTGGGTTTTTGGAAAATATTGGTCATCAGGGAAAAATTGTGAAATGGGTGAATCAA CTAGAGACATTGGCCCATCCTGCGGTTGGAGCGTTTTGGACGCACTGTGGATGGAACTCA ACAATAGAGAGCATATGTGAAGGTGTTCCAATGATATGTACGCCGTGTTTCTCGGACCAG CATGTGAACGCGAGGTACATCGTTGATGTATGGCGAGTCGGGATGATGTTAGAGAGATGT AAGATGGAAAGGACGGAGATTGAGAAGGTAGTAACAAGTGTAATGATGGAGAATGGAGCT GGATTGACAGAGATGTGTTTGGAGTTGAAAGAGAAAGCTAATGTTTGCTTAAGTGAAGAT GGGTCTTCTTCCAAGTATCTAGACAAACTTGTCAGTCATGTCCTGTCTTTTGATTCCTCG GCTTTTGCAAGTTAA
>UGT76F2
ATGGAAGAGAGAAAAGGGAGGAGAATAATCATGTTCCCTCTTCCAT TCCAGGGCACTTC AACCCCATGATCGAGCTCGCTGGAATATTCCACCACCGTGGCTTCTCCGTGACGATCCTC CACACTTCCTACAACTTCCCCGATCCTTCTCGCCACCCACACTTCACTTTTCGAACCATC TCTCACAACAAAGAAGGAGAAGAAGATCCTCTGTCTCAGTCAGAAACTTCGAGTATGGAC CTAATCGTTCTCGTTCGTCGGCTGAAACAACGCTACGCCGAACCGTTTCGTAAGTCTGTG GCGGCGGAAGTAGGTGGAGGAGAGACGGTGTGTTGTTTGGTCTCCGACGCTATATGGGGG AAGAACACGGAGGTTGTAGCGGAAGAGATTGGAGTTCGTAGGGTGGTGTTGAGGACAGGT GGTGCGTCGTCGTTTTGTGCTTTTGCCGCTTTCCCTCTCCTTAGGGATAAGGGTTACCTC CCTATACAAGATTCTAGATTAGATGAGCCAGTGACAGAGCTTCCACCTTTGAAAGTGAAG GATCTTCCGGTAATGGAAACGAATGAGCCGGAGGAACTTTACCGGGTAGTTAACGACATG GTGGAAGGAGCCAAGTCTTCTTCAGGAGTCATATGGAACACATTTGAAGATCTTGAAAGA CTATCACTTATGAATTGTAGCAGCAAATTACAAGTTCCATTTTTCCCGATCGGACCGTTT CACAAATATAGCGAAGATCCTACACCGAAGACAGAGAACAAGGAAGATACCGATTGGCTC GACAAGCAAGACCCACAGTCGGTGGTCTATGCGAGTTTCGGAAGCCTTGCAGCTATAGAA GAGAAGGAGTTTCTCGAGATTGCTTGGGGTCTAAGAAACAGTGAACGACCGTTTTTGTGG GTGGTTAGGCCGGGGTCTGTCAGGGGGACCGAGTGGCTCGAGTCATTGCCTTTAGGGTTT ATGGAAAACATTGGAGATAAGGGAAAAATCGTGAAATGGGCGAATCAGTTAGAGGTATTG GCGCATCCTGCCATTGGAGCGTTTTGGACACATTGTGGATGGAACTCGACACTAGAGAGC ATATGTGAAGGTGTTCCTATGATATGTACGTCATGTTTCACGGACCAGCATGTGAACGCG AGATACATCGTTGATGTATGGCGAGTCGGGATGTTGTTAGAGAGAAGTAAGATGGAAAAG AAGGAGATTGAAAAGGTGCTAAGAAGTGTAATGATGGAGAAGGGAGATGGATTGAGGGAA AGGAGTTTGAAGTTGAAAGAGAGAGCTGATTTTTGCTTAAGTAAAGATGGGTCTTCTTCC AAGTATTTAGACAAACTTGTGAGTCATGTCCTGTCTTTTGATTCTTATGCTTTTGCAAGT TAA
>UGT78D1
ATGACCAAATTCTCCGAGCCAATCAGAGACTCCCACGTGGCAGTTCTCGCGTTTTTCCCC GTTGGCGCTCATGCCGGTCCTCTCTTAGCCGTCACTCGCCGTCTCGCCGCCGCTTCTCCC TCCACCATCTTTTCTTTCTTCAACACCGCAAGATCAAACGCGTCGTTGTTCTCCTCTGAT CATCCCGAGAACATCAAGGTCCACGACGTCTCTGACGGTGTTCCGGAGGGAACCATGCTC
GGGAATCCACTGGAGATGGTCGAGCTGTTTCTCGAAGCGGCTCCACGTATTTTCCGGAGC GAAATCGCGGCGGCAGAGATAGAAGTTGGAAAGAAAGTGACATGCATGCTAACAGATGCC TTCTTCTGGTTCGCAGCGGACATAGCGGCTGAGCTGAACGCGACTTGGGTTGCCTTCTGG GCCGGCGGAGCAAACTCACTCTGTGCTCATCTCTACACTGATCTCATCAGAGAAACCATC GGTCTCAAAGATGTGAGTATGGAAGAGACATTAGGGTTTATACCAGGAATGGAGAATTAC AGAGTTAAAGATATACCAGAGGAAGTTGTATTTGAAGATTTGGACTCTGTTTTCCCAAAG GCTTTATACCAAATGAGTCTTGCTTTACCTCGTGCCTCTGCTGTTTTCATCAGTTCCTTT GAAGAGTTAGAACCTACATTGAACTATAACCTAAGATCCAAACTTAAACGTTTCTTGAAC ATCGCCCCTCTCACGTTATTATCTTCTACATCGGAGAAAGAGATGCGTGATCCTCATGGC TGCTTTGCTTGGATGGGGAAGAGATCAGCTGCTTCTGTAGCGTACATTAGCTTCGGCACC GTCATGGAACCTCCTCCTGAAGAGCTTGTGGCGATAGCACAAGGGTTGGAATCAAGCAAA GTGCCGTTTGTTTGGTCGCTGAAGGAGAAGAACATGGTTCATCTACCAAAAGGGTTTTTG GATCGGACAAGAGAGCAAGGGATAGTGGTTCCTTGGGCTCCACAAGTGGAACTGCTGAAA CACGAGGCAATGGGTGTGAATGTGACACATTGTGGATGGAACTCAGTGTTGGAGAGTGTG TCGGCAGGTGTACCGATGATCGGCAGACCGATTTTGGCGGATAATAGGCTCAACGGAAGA GCAGTGGAGGTTGTGTGGAAGGTTGGAGTGATGATGGATAATGGAGTCTTCACGAAAGAA GGATTTGAGAAGTGTTTGAATGATGTTTTTGTTCATGATGATGGTAAGACGATGAAGGCT AATGCCAAGAAGCTTAAAGAAAAACTCCAAGAAGATTTCTCCATGAAAGGAAGCTCTTTA GAGAATTTCAAAATATTGTTGGACGAAATTGTGAAAGTTTAG
>UGT78D2
ATGACCAAACCCTCCGACCCAACCAGAGACTCCCACGTGGCAGTTCTCGCTTTTCCTTTC GGCACTCATGCAGCTCCTCTCCTCACCGTCACGCGCCGCCTCGCCTCCGCCTCTCCTTCC ACCGTCTTCTCTTTCTTCAACACCGCACAATCCAACTCTTCGTTATTTTCCTCCGGTGAC GAAGCAGATCGTCCGGCGAACATCAGAGTATACGATATTGCCGACGGTGTTCCGGAGGGA TACGTGTTTAGCGGGAGACCACAGGAGGCGATCGAGCTGTTTCTTCAAGCTGCGCCGGAG AATTTCCGGAGAGAAATCGCGAAGGCGGAGACGGAGGTTGGTACGGAAGTGAAATGTTTG ATGACTGATGCGTTCTTCTGGTTCGCGGCTGATATGGCGACGGAGATAAATGCGTCGTGG ATTGCGTTTTGGACCGCCGGAGCAAACTCACTCTCTGCTCATCTCTACACAGATCTCATC AGAGAAACCATCGGTGTCAAAGAAGTAGGTGAGCGTATGGAGGAGACAATAGGGGTTATC TCAGGAATGGAGAAGATCAGAGTCAAAGATACACCAGAAGGAGTTGTGTTTGGGAATTTA GACTCTGTTTTCTCAAAGATGCTTCATCAAATGGGTCTTGCTTTGCCTCGTGCCACTGCT GTTTTCATCAATTCTTTTGAAGATTTGGATCCTACATTGACGAATAACCTCAGATCGAGA TTTAAACGATATCTGAACATCGGTCCTCTCGGGTTATTATCTTCTACATTGCAACAACTA GTGCAAGATCCTCACGGTTGTTTGGCTTGGATGGAGAAGAGATCTTCTGGTTCTGTGGCG TACATTAGCTTTGGTACGGTCATGACACCGCCTCCTGGAGAGCTTGCGGCGATAGCAGAA GGGTTGGAATCGAGTAAAGTGCCGTTTGTTTGGTCGCTTAAGGAGAAGAGCTTGGTTCAG TTACCAAAAGGGTTTTTGGATAGGACAAGAGAGCAAGGGATAGTGGTTCCATGGGCACCG CAAGTGGAACTGCTGAAACACGAAGCAACGGGTGTGTTTGTGACGCATTGTGGATGGAAC TCGGTGTTGGAGAGTGTATCGGGTGGTGTACCGATGATTTGCAGGCCATTTTTTGGGGAT CAGAGATTGAACGGAAGAGCGGTGGAGGTTGTGTGGGAGATTGGAATGACGATTATCAAT GGAGTCTTCACGAAAGATGGGTTTGAGAAGTGTTTGGATAAAGTTTTAGTTCAAGATGAT GGTAAGAAGATGAAATGTAATGCTAAGAAACTTAAAGAACTAGCTTACGAAGCTGTCTCT TCTAAAGGAAGGTCCTCTGAGAATTTCAGAGGATTGTTGGATGGAGTTGTAAACATTATT TGA
>UGT78D3
ATGGCCAAACCCTCGCAGCCAACGCGAGACTCCCACGTGGCAGTTCTCGTTTTCCCCTTC GGCACTCATGCAGCTCCTCTCCTCGCCGTCACGTGCCGTCTCGCCACCGCTGCTCCCTCC ACCGTCTTCTCCTTCTTCAGCACCGCACGATCCAACTCGTCGTTACTCTCCTCCGATATC CCCACAAACATTCGTGTCCACAACGTCGATGACGGTGTTCCTGAGGGATTCGTGTTGACG GGGAATCCAGAGCACGCTGTTGAGCTGTTTCTTGAAGCGGCGCCAGAGATTTTCCGAAGA
GAAATCAAGGCGGCCGAGACCGAAGTTGGTAGGAAGTTCAAGTGCATCCTTACGGATGCG TTCCTCTGGTTAGCAGCGGAGACGGCGGCTGCGGAGATGAAAGCGTCGTGGGTTGCGTAC TATGGAGGCGGAGCAACCTCGCTCACTGCTCATCTCTACACAGATGCCATCAGAGAAAAC GTCGGTGTCAAAAGTAGGTGAGCGTATGGAGGAGACAATAGGGTTTATCTCAGGAATGGA GAAGATCAGAGTCAAAGACACACAAGAAGGCGTTGTGTTTGGGAACTTAGACTCTGTTTT CTCTAAAACGTTGCACCAAATGGGTCTTGCTTTACCTCGTGCCACTGCTGTTTTCATCAA TTCCTTTGAAGAATTGGATCCTACGTTTACAAATGATTTCAGATCGGAATTCAAACGTTA CCTAAACATCGGTCCTCTCGCTTTATTATCTTCTCCATCGCAAACATCAACGCTAGTGCA CGATCCTCACGGTTGCTTGGCTTGGATCGAGAAGCGGTCCACTGCTTCTGTAGCGTACAT TGCCTTTGGTAGAGTCGCGACACCGCCTCCTGTAGAGCTTGTGGCGATAGCACAAGGATT GGAATCGAGTAAAGTGCCTTTTGTTTGGTCGCTACAAGAGATGAAAATGACTCATTTACC AGAAGGCTTTTTGGATCGGACCAGAGAGCAAGGGATGGTGGTTCCATGGGCACCACAAGT GGAGCTGCTAAACCATGAAGCAATGGGTGTGTTTGTTTCGCATGGTGGGTGGAACTCAGT GTTGGAGAGTGTGTCGGCAGGTGTACCGATGATTTGTAGACCGATTTTCGGGGATCATGC AATCAATGCAAGATCTGTGGAAGCTGTGTGGGAGATCGGAGTGACGATTAGTAGTGGAGT CTTCACGAAGGATGGATTTGAGGAGAGTTTGGATCGGGTTTTGGTTCAAGATGATGGCAA GAAGATGAAGGTTAATGCTAAAAAGCTTGAAGAACTAGCACAAGAAGCTGTCTCTACCAA AGGAAGCTCCTTTGAGAATTTTGGAGGATTGTTGGACGAAGTTGTGAACTTTGGATAA
>UGT79B1
ATGGGTGTTTTTGGATCGAATGAATCGTCAAGCATGAGTATTGTGATGTATCCGTGGTTA GCCTTTGGTCACATGACTCCTTTTCTTCACCTATCCAACAAGCTCGCAGAGAAAGGTCAC AAGATTGTTTTCTTGCTTCCCAAGAAAGCACTAAACCAGCTTGAACCTCTTAATCTCTAC CCAAATCTCATCACTTTCCACACCATCTCTATCCCTCAGGTCAAAGGGCTCCCTCCGGGT GCGGAGACAAACTCCGACGTCCCTTTCTTCTTGACACATTTGCTTGCAGTTGCAATGGAC CAAACCCGGCCAGAGGTCGAGACCATTTTCCGTACAATCAAACCGGACTTGGTTTTCTAT GATTCTGCCCATTGGATACCGGAAATTGCTAAACCGATCGGTGCTAAAACCGTTTGCTTC AACATCGTTAGCGCTGCGTCAATCGCACTGTCTCTTGTCCCTTCTGCGGAGAGAGAGGTC ATTGATGGCAAGGAAATGTCAGGGGAGGAGTTAGCTAAGACGCCTCTAGGTTACCCATCT TCGAAAGTAGTCTTACGTCCGCACGAAGCAAAATCCCTGAGTTTCGTGTGGAGGAAGCAC GAGGCGATTGGCTCTTTCTTTGATGGGAAAGTTACCGCGATGAGAAACTGCGACGCAATC GCTATAAGGACTTGCCGTGAGACAGAAGGCAAATTCTGCGATTACATAAGTAGGCAGTAC AGTAAACCGGTTTACCTAACAGGACCGGTTCTCCCTGGATCCCAACCTAATCAGCCCTCC TTAGATCCTCAATGGGCGGAGTGGCTAGCCAAATTCAACCACGGTTCGGTTGTGTTCTGC GCTTTCGGTAGCCAACCCGTTGTAAACAAGATAGATCAGTTTCAAGAACTCTGTTTAGGT CTAGAATCAACTGGTTTTCCGTTTCTGGTTGCCATTAAGCCTCCTTCGGGTGTATCAACC GTCGAGGAAGCCTTACCGGAAGGATTCAAAGAGAGGGTTCAAGGACGTGGCGTTGTGTTT GGAGGTTGGATTCAGCAACCGTTGGTGTTGAACCATCCTTCAGTGGGTTGTTTTGTTAGC CATTGCGGGTTTGGGTCGATGTGGGAGTCGTTGATGAGTGATTGTCAGATCGTTTTGGTT CCGCAGCACGGAGAACAGATTTTGAACGCAAGGCTGATGACGGAGGAGATGGAGGTGGCG GTTGAAGTGGAGAGGGAAAAGAAAGGGTGGTTCTCGCGGCAAAGCTTGGAGAATGCTGTG AAGAGTGTGATGGAGGAAGGTAGTGAGATCGGTGAGAAAGTGAGGAAGAATCATGACAAG TGGAGATGTGTTTTGACTGACTCTGGTTTTTCAGATGGTTATATTGATAAGTTTGAACAA AATTTAATTGAACTTGTGAAGTCATGA
>UGT79B10
ATGGGCCAAACGTTTCACGCCTTTATGTTCCCATGGTTCGCTTTTGGTCATATGACTCCA TACTTGCATTTAGCCAACAAGTTAGCTGAGAGAGGTCACAGAATCACTTTCTTGATCCCC AAGAAAGCTCAGAAGCAGCTTGAACATCTCAATCTGTTTCCAGACAGCATCGTCTTTCAC TCTCTTACTATTCCTCATGTTGATGGTCTCCCCGCTGGAGCCGAGACTTTCTCGGATATC CCTATGCCATTGTGGAAGTTCTTGCCCCCAGCTATAGATCTCACACGCGATCAAGTTGAA GCAGCGGTTAGTGCCTTGAGTCCGGACCTGATCTTGTTCGATATTGCTTCATGGGTTCCA
GAAGTGGCTAAAGAGTATAGAGTCAAGAGTATGTTGTACAACATCATATCAGCTACTTCT ATAGCTCATGACTTTGTCCCAGGTGGTGAACTTGGAGTTCCTCCACCTGGTTATCCTTCC TCAAAGTTGTTGTACCGCAAACACGATGCTCACGCCTTGTTGTCCTTCTCCGTCTACTAC AAGAGGTTTTCTCATCGGCTCATCACAGGTCTTATGAATTGTGATTTCATTTCGATAAGG ACATGCAAAGAAATCGAGGGTAAATTCTGCGAGTATCTTGAGCGTCAATACCATAAAAAG GTTTTCTTGACGGGTCCAATGCTTCCTGAGCCAAACAAAGGTAAACCACTGGAAGATCGA TGGAGTCATTGGCTGAACGGGTTTGAACAAGGCTCTGTAGTGTTCTGTGCATTGGGAAGT CAAGTCACTCTAGAGAAGGACCAGTTCCAAGAACTTTGTTTAGGAATAGAGCTTACAGGT TTACCGTTTTTTGTAGCTGTAACACCACCAAAAGGCGCAAAGACGATTCAAGATGCGTTA CCAGAAGGGTTCGAGGAGAGGGTGAAAGATCGTGGAGTGGTTTTGGGAGAATGGGTGCAA CAACCGTTATTATTGGCTCATCCATCAGTAGGCTGCTTCTTGAGTCATTGCGGATTCGGG TCAATGTGGGAATCTATAATGAGTGATTGCCAAATAGTTTTGCTTCCATTTTTGGCTGAT CAAGTTCTCAACACAAGATTGATGACCGAAGAACTCAAGGTTTCGGTTGAAGTGCAAAGA GAAGAAACAGGATGGTTCTCGAAGGAGAGCTTGAGTGTTGCTATCACATCTGTGATGGAC CAAGCTAGTGAGATCGGGAATCTGGTGAGAAGGAACCATTCCAAATTGAAGGAGGTTTTG GTTAGTGATGGATTATTAACCGGTTACACCGATAAATTTGTTGACACTTTGGAGAATCTT GTCAGCGAGACAAAGCGTGAATGA
>UGT79B11
ATGGGCCAAAAGATTCACGCTTTTATGTTCCCCTGGTTTGCTTTTGGTCATATGACTCCG TACTTGCATCTAGGCAACAAGTTAGCCGAGAAAGGTCATAGGGTTACTTTCTTGCTACCT AAGAAAGCTCAGAAACAATTGGAACATCAGAATCTATTTCCACACGGTATCGTCTTTCAT CCTCTTGTTATTCCTCATGTTGATGGCCTCCCTGCTGGTGCCGAGACAGCCTCGGATATC CCCATCTCGTTGGTGAAGTTCTTGTCTATAGCCATGGATCTTACACGCGATCAGATCGAA GCCGCGATTGGTGCCTTGAGACCGGACCTAATCTTGTTCGATTTAGCTCACTGGGTTCCG GAAATGGCTAAAGCGCTTAAAGTCAAGAGTATGTTGTATAACGTGATGTCAGCTACCTCT ATAGCTCACGACCTTGTCCCAGGTGGTGAACTTGGAGTTGCTCCACCTGGTTATCCTTCA TCAAAGGCGTTGTACCGCGAACACGATGCTCACGCCTTGTTAACCTTCTCCGGCTTCTAC AAGAGGTTTTATCACCGGTTCACCACAGGTCTTATGAATTGCGATTTCATTTCGATTCGG ACATGTGAAGAAATCGAAGGTAAATTTTGTGACTATATTGAGAGTCAATACAAGAAGAAG GTTCTTTTAACCGGTCCAATGCTTCCCGAGCCTGACAAGAGTAAACCACTTGAAGATCAA TGGAGTCATTGGCTGAGTGGGTTTGGACAAGGCTCTGTAGTGTTCTGTGCATTGGGAAGT CAAACCATTCTAGAGAAAAACCAATTCCAAGAACTCTGTTTAGGAATAGAGCTTACGGGT TTACCATTTCTTGTCGCGGTTAAGCCACCAAAAGGCGCAAACACAATTCATGAAGCGTTA CCAGAAGGGTTCGAGGAAAGGGTGAAGGGTCGTGGAATAGTTTGGGGAGAATGGGTGCAG CAACCATCCTGGCAACCATTGATATTGGCTCATCCATCAGTAGGTTGCTTTGTGAGCCAT TGCGGATTCGGGTCAATGTGGGAATCTTTAATGAGTGATTGTCAAATAGTCTTTATTCCA GTTTTGAATGATCAAGTTCTCACCACGAGAGTAATGACGGAGGAACTCGAGGTCTCCGTT GAGGTACAGAGAGAAGAAACAGGATGGTTCTCAAAAGAAAACTTGAGTGGTGCAATCATG TCTTTGATGGACCAAGACAGCGAGATAGGGAACCAAGTGAGGAGGAACCATTCTAAATTG AAGGAGACTTTGGCTAGTCCTGGATTATTAACCGGTTACACCGATAAATTTGTTGACACT TTGGAGAATCTAGTCAACGAACAAGGATACATATCTTGA
>UGT79B2
ATGGGTGGTTTGAAGTTTCATGTACTTATGTATCCATGGTTCGCAACAGGCCATATGACC CCGTTCCTTTTTCTTGCCAACAAATTGGCTGAGAAAGGTCATACGGTCACTTTTTTGATT CCCAAGAAAGCTCTGAAACAGTTGGAAAATCTCAATCTGTTTCCACACAACATTGTCTTT CGCTCTGTCACCGTCCCTCATGTGGATGGTCTCCCCGTTGGCACAGAGACAGTCTCTGAG ATCCCCGTGACATCAGCTGATCTCTTGATGTCTGCTATGGATCTCACACGTGATCAAGTT GAAGGTGTGGTCCGAGCCGTGGAACCGGACCTGATCTTCTTTGACTTCGCTCATTGGATT CCAGAGGTAGCTAGAGACTTTGGCCTTAAGACTGTAAAGTACGTCGTGGTATCTGCATCG ACTATAGCTAGTATGCTTGTTCCAGGTGGTGAGTTAGGTGTTCCTCCGCCGGGATATCCT
TCATCGAAGGTGCTGCTTCGTAAACAAGATGCTTACACCATGAAGAATCTGGAGTCTACA AATACAATCAATGTCGGACCAAACTTATTGGAAAGAGTCACTACAAGTCTTATGAACTCT GATGTCATTGCGATAAGGACAGCCAGAGAAATCGAAGGAAACTTTTGCGACTATATCGAA AAACATTGCAGGAAAAAGGTTCTCTTGACAGGTCCGGTGTTCCCTGAGCCAGACAAGACT AGAGAGCTAGAGGAACGATGGGTTAAGTGGCTAAGTGGGTATGAACCAGACTCAGTGGTG TTTTGTGCGTTGGGCTCACAAGTCATTTTAGAGAAAGATCAATTCCAAGAACTCTGCTTA GGAATGGAGCTAACAGGTTCACCGTTTCTTGTAGCGGTTAAGCCACCTAGAGGCTCATCA ACGATTCAAGAAGCACTTCCTGAAGGATTCGAGGAGAGGGTTAAAGGAAGAGGAGTTGTT TGGGGAGAATGGGTTCAACAACCATTGCTATTGTCTCATCCATCAGTCGGGTGCTTTGTG AGCCATTGTGGGTTTGGATCAATGTGGGAGTCTTTGCTGAGTGATTGTCAGATAGTCTTG GTACCACAGTTGGGTGATCAGGTCCTCAACACAAGATTGCTGAGTGACGAACTCAAGGTT TCGGTTGAAGTGGCAAGAGAGGAAACAGGATGGTTCTCGAAAGAGAGCTTGTTCGATGCT ATCAATAGTGTGATGAAAAGGGACAGTGAGATCGGGAATCTGGTGAAGAAGAATCACACC AAGTGGAGGGAGACACTAACTAGTCCTGGACTTGTGACCGGTTATGTCGATAATTTCATA GAGTCATTGCAGGATCTTGTCTCTGGGACCAACCATGTTTCGAAGTAG
>UGT79B3
ATGGGTGGTTTGAAGTTTCATGTACTTATGTATCCATGGTTCGCAACAGGCCATATGACC CCGTTCCTTTTTCTTGCCAACAAATTGGCTGAGAAAGGTCATACGGTCACTTTCTTGCTT CCCAAGAAATCTCTGAAACAGTTGGAACATTTCAATCTGTTTCCACACAACATTGTCTTT CGCTCTGTCACCGTCCCTCATGTGGATGGTCTCCCCGTTGGCACAGAGACAGCCTCTGAG ATCCCTGTGACATCAACTGATCTCTTGATGTCTGCTATGGATCTCACACGTGATCAAGTT GAAGCTGTGGTCCGAGCCGTTGAACCGGACCTGATCTTCTTTGACTTTGCTCATTGGATT CCAGAAGTAGCTAGGGACTTCGGCCTTAAGACTGTAAAGTACGTCGTGGTGTCTGCATCG ACTATAGCTAGTATGCTTGTCCCAGGTGGTGAGTTAGGTGTTCCTCCACCGGGATATCCA TCATCAAAGGTGCTGCTTCGTAAACAAGATGCTTACACTATGAAGAAACTGGAGCCTACA AATACAATCGATGTCGGACCAAACCTCTTGGAACGAGTCACTACAAGTCTTATGAACTCT GATGTCATTGCGATAAGGACAGCCAGAGAAATCGAAGGAAACTTTTGCGACTATATAGAA AAACATTGCAGGAAAAAGGTTCTCTTGACAGGTCCGGTGTTCCCTGAGCCAGACAAGACT AGAGAGCTAGAGGAACGATGGGTTAAGTGGCTAAGTGGGTATGAACCAGACTCAGTGGTG TTTTGTGCACTGGGCTCACAAGTCATTTTAGAGAAAGATCAATTCCAAGAACTCTGCTTA GGAATGGAGCTAACAGGTTCACCGTTTCTTGTAGCGGTTAAGCCCCCTAGAGGCTCATCA ACGATTCAAGAAGCACTTCCTGAAGGATTCGAAGAGCGGGTTAAAGGAAGAGGCCTTGTT TGGGGAGGATGGGTTCAACAACCATTGATATTGTCTCATCCATCAGTCGGGTGCTTTGTG AGCCATTGTGGGTTTGGATCAATGTGGGAGTCTTTGCTGAGTGATTGTCAGATAGTCTTA GTACCACAGTTGGGTGATCAAGTCCTGAACACAAGATTGCTGAGTGACGAACTCAAGGTT TCGGTTGAAGTGGCAAGAGAGGAAACAGGATGGTTCTCGAAAGAGAGCTTGTGCGATGCT GTCAATAGTGTGATGAAAAGGGACAGCGAGCTCGGGAACCTGGTGAGGAAGAATCACACC AAGTGGAGGGAGACAGTAGCTAGTCCTGGACTAATGACTGGTTATGTCGATGCTTTCGTA GAGTCATTGCAGGATCTTGTCTCTGGGACCACCCATGACTGA
>UGT79B4
ATGGGGTCAAAGTTTCATGCTTTTCTTTATCCATGGTTTGGTTTTGGTCATATGATTCCG TATCTTCATCTAGCTAACAAATTAGCTGAAAAAGGTCATAGGGTTACTTTCTTGGCTCCC AAGAAAGCTCAGAAACAACTCGAACCTCTCAACTTGTTCCCAAACAGCATTCACTTCGAG AATGTTACTCTTCCTCATGTTGATGGTCTCCCTGTTGGCGCAGAGACAACCGCGGATCTC CGGAACTCATCTAAGAGAGTCCTCGCTGATGCCATGGATCTTCTACGCGAACAGATTGAA GTTAAGATTCGTTCTTTGAAACCTGACCTAATTTTCTTCGATTTTGTTGATTGGATTCCA CAAATGGCAAAAGAATTAGGAATCAAAAGTGTAAGTTACCAGATCATATCGGCAGCTTTT ATAGCTATGTTTTTCGCTCCTCGTGCTGAATTAGGTTCTCCTCCACCTGGGTTTCCTTCA TCAAAAGTAGCATTACGTGGACATGACGCTAACATCTATTCACTCTTCGCAAACACCCGC AAATTTCTCTTTGATCGAGTCACCACAGGCCTTAAGAACTGCGACGTCATTGCCATAAGG
ACATGTGCAGAAATCGAAGGTAACTTATGTGATTTCATCGAAAGACAATGTCAGAGAAAA GTTCTCTTAACCGGTCCAATGTTCCTTGATCCACAAGGGAAGAGTGGTAAGCCGCTAGAA GATCGATGGAATAATTGGTTAAACGGATTTGAACCAAGCTCGGTAGTGTACTGTGCGTTT GGCACCCATTTCTTTTTCGAGATAGATCAATTTCAAGAACTCTGTTTAGGAATGGAGCTC ACGGGTCTACCTTTTTTGGTAGCGGTTATGCCACCGAGAGGGTCTTCAACGATTCAAGAA GCATTACCAGAAGGGTTCGAAGAACGGATTAAAGGGCGTGGAATTGTTTGGGGAGGATGG GTGGAACAACCTTTGATATTGTCTCATCCATCAATAGGTTGCTTTGTGAACCATTGCGGG TTCGGTTCAATGTGGGAGTCTTTGGTTAGTGATTGCCAGATTGTGTTTATTCCACAATTG GTTGATCAAGTTCTCACAACGAGATTGTTGACCGAAGAACTCGAGGTCTCCGTGAAAGTA AAGAGAGATGAAATTACTGGTTGGTTTTCGAAGGAGAGCTTGAGGGATACGGTCAAATCT GTGATGGATAAAAATAGTGAGATTGGGAATCTAGTGAGGAGGAATCATAAGAAACTGAAG GAAACTTTGGTTAGTCCTGGATTGTTGAGTAGTTATGCTGATAAGTTTGTTGACGAATTA GAGAATCATATCCACAGTAAGAATTGA
>UGT79B5
ATGGGATCAAAATTTCATGCTTTTATGTATCCATGGTTTGGTTTTGGTCATATGATTCCA TATCTTCATTTAGCCAACAAACTAGCTGAGAAAGGTCATAGGGTCACTTTCTTCCTCCCC AAGAAAGCTCATAAGCAGCTCCAACCTCTCAATCTGTTCCCAGACAGCATTGTCTTTGAG CCTCTTACTCTCCCTCCTGTCGATGGTCTCCCTTTTGGCGCCGAGACAGCCTCGGATCTC CCAAACTCAACTAAGAAACCCATATTCGTTGCCATGGATCTCTTACGCGATCAGATCGAA GCAAAGGTCCGTGCTTTGAAACCAGATCTAATCTTTTTCGATTTTGTTCATTGGGTTCCA GAAATGGCAGAAGAGTTTGGAATAAAGAGTGTCAATTACCAGATCATATCGGCAGCTTGT GTAGCTATGGTTCTTGCACCTAGGGCTGAATTAGGGTTTCCTCCGCCGGATTATCCTTTA TCCAAAGTGGCGTTACGTGGACATGAAGCTAACGTCTGTTCTCTCTTTGCGAATTCCCAT GAGCTTTTCGGTCTGATCACCAAAGGCCTTAAGAACTGTGACGTCGTTTCCATAAGGACC TGCGTGGAACTTGAAGGTAAGCTATGCGGTTTCATCGAAAAAGAATGTCAAAAGAAACTT CTCTTAACCGGTCCAATGCTCCCTGAACCGCAAAATAAGAGTGGTAAATTTCTAGAAGAC CGATGGAATCACTGGTTAAACGGATTTGAACCAGGGTCGGTAGTGTTTTGTGCGTTTGGC ACTCAATTCTTTTTCGAGAAGGATCAATTTCAAGAATTCTGTTTAGGAATGGAGCTAATG GGTCTACCGTTTTTAATATCGGTTATGCCGCCAAAAGGCTCACCAACGGTTCAAGAAGCG TTACCAAAAGGATTCGAAGAACGGGTTAAAAAGCATGGAATCGTTTGGGAAGGATGGTTG GAACAACCTTTGATATTGTCTCATCCATCAGTAGGTTGCTTTGTGAACCATTGTGGCTTT GGTTCAATGTGGGAGTCTTTGGTTAGTGATTGTCAGATTGTGTTTATTCCACAATTGGCA GATCAAGTTCTCATCACAAGATTGTTGACTGAAGAACTCGAAGTCTCTGTGAAAGTGCAG AGAGAAGATTCCGGATGGTTCTCGAAAGAGGACTTGAGAGATACTGTTAAATCTGTGATG GATATAGATAGTGAGATTGGGAACTTAGTGAAGAGGAATCATAAGAAATTGAAAGAGACT TTAGTTAGTCCTGGATTGTTAAGTGGTTATGCTGATAAGTTTGTAGAAGCATTGGAGATT GAAGTCAACAACACCAAATTTTCTTGA
>UGT79B6
ATGGGGTCAAAGTTTCATGCTTTTATGTTCCCATGGTTTGGTTTTGGTCACATGACTGCA TTTTTGCATCTGGCTAACAAACTAGCGGAGAAAGACCACAAAATAACTTTCTTGCTCCCC AAGAAAGCTCGAAAGCAACTTGAATCTCTCAATCTCTTCCCAGACTGCATTGTCTTTCAG ACTCTTACCATCCCATCTGTAGATGGCCTCCCTGATGGTGCTGAGACAACCTCGGATATC CCGATCTCGTTAGGCAGTTTTCTCGCCTCGGCTATGGATCGGACACGCATTCAGGTCAAA GAAGCAGTTTCTGTTGGTAAACCGGATCTGATTTTCTTCGATTTTGCTCACTGGATTCCG GAAATAGCTAGAGAGTATGGAGTCAAGAGTGTCAATTTCATAACGATTTCTGCAGCATGT GTAGCTATTTCGTTCGTCCCTGGTCGTAGTCAAGATGACTTGGGTAGTACTCCACCGGGA TACCCTTCCTCCAAGGTGTTGCTTCGGGGACACGAAACCAACAGTTTGTCGTTCCTCTCC TATCCGTTTGGAGATGGAACTAGTTTTTACGAACGGATCATGATAGGACTTAAGAACTGC GATGTCATTTCGATAAGGACATGCCAAGAAATGGAAGGAAAGTTCTGCGATTTCATCGAA AACCAATTTCAAAGAAAAGTTCTCTTGACAGGTCCAATGCTTCCTGAGCCGGACAATAGC
AAACCGCTAGAAGATCAATGGCGTCAGTGGCTTAGCAAGTTCGATCCGGGATCAGTAATA TATTGTGCATTGGGCAGCCAAATCATTCTTGAAAAGGATCAATTCCAAGAACTCTGTTTA GGAATGGAGCTGACAGGTTTACCATTTCTTGTAGCGGTAAAGCCACCAAAAGGTTCATCG ACAATCCAAGAAGCCTTACCAAAAGGGTTTGAAGAGAGGGTTAAAGCACGTGGAGTGGTT TGGGGAGGATGGGTGCAGCAACCATTGATATTAGCTCATCCATCAATAGGCTGCTTTGTG AGCCATTGTGGTTTCGGGTCAATGTGGGAGGCTCTAGTGAATGACTGCCAAATAGTGTTT ATTCCACATTTGGGTGAGCAAATATTGAACACAAGACTGATGAGCGAGGAACTCAAGGTC TCGGTAGAGGTGAAAAGAGAGGAAACGGGATGGTTTTCGAAGGAGAGCTTGAGCGGTGCG GTCAGGTCTGTGATGGACAGAGATAGCGAGCTCGGGAATTGGGCGAGGAGGAACCACGTA AAGTGGAAGGAGTCTCTGCTTCGTCATGGACTAATGAGTGGTTATCTTAATAAGTTCGTA GAAGCATTGGAGAAACTAGTCCAAAATATAAATCTTGAATGA
>UGT79B7
ATGGAGCCAAAGTTTCATGCTTTTATGTTTCCATGGTTTGCTTTTGGTCATATGATTCCA TTTCTACATCTTGCAAACAAACTAGCTGAAAAAGGTCACCGAGTTACTTTCTTGCTACCT AAGAAAGCACAAAAACAGTTGGAACATCACAACTTGTTCCCAGACAGTATTGTCTTTCAC CCTCTCACAGTTCCTCCTGTCAATGGCCTCCCTGCTGGTGCCGAGACAACCTCGGATATC CCCATCTCGTTGGACAACCTCTTGTCCAAAGCCTTGGATCTCACTCGCGATCAGGTTGAA GCTGCGGTTCGTGCTTTGAGACCTGACTTGATCTTTTTCGATTTTGCTCAATGGATTCCA GATATGGCTAAAGAACATATGATCAAGAGTGTGAGTTACATCATTGTATCTGCGACAACA ATAGCTCATACACATGTCCCTGGAGGTAAATTAGGTGTTCGCCCACCGGGTTATCCGTCA TCAAAGGTGATGTTCCGTGAAAACGATGTTCATGCCTTAGCAACCTTATCGATATTTTAC AAGAGACTGTATCATCAGATCACTACAGGTCTTAAGAGCTGTGATGTCATTGCATTGAGG ACTTGCAAAGAAGTCGAAGGTATGTTCTGCGACTTTATATCGCGTCAATACCATAAGAAG GTTCTCTTGACTGGTCCAATGTTCCCTGAGCCAGACACAAGTAAACCACTAGAAGAACGC TGGAATCATTTTCTAAGCGGGTTCGCGCCGAAGTCAGTAGTGTTTTGTTCACGTGGCAGC CAAGTAATTCTTGAGAAAGATCAATTCCAAGAACTCTGTTTAGGGATGGAGCTAACAGGT TTACCATTTCTTTTAGCGGTAAAGCCACCAAGAGGATCATCAACGGTCCAAGAAGGGTTA CCAGAAGGGTTCGAGGAGCGGGTGAAAGATCGTGGTGTTGTTTGGGGAGGATGGGTGCAA CAACCTTTGATATTGGCTCATCCATCAATAGGTTGCTTTGTGAACCATTGTGGTCCCGGA ACAATATGGGAGTCTTTGGTGAGTGATTGCCAAATGGTTTTGATTCCATTTTTAAGTGAT CAAGTTCTCTTCACAAGATTGATGACCGAGGAATTCGAGGTCTCTGTAGAAGTGCCGAGG GAAAAAACAGGATGGTTTTCAAAGGAGAGCTTGAGCAATGCTATCAAATCTGTGATGGAT AAAGACAGTGACATTGGGAAGTTAGTGAGGAGTAACCACACCAAATTGAAGGAGATTTTA GTTAGTCCTGGATTATTGACTGGTTACGTTGATCACTTTGTAGAGGGATTGCAAGAGAAT TTGATTTGA
>UGT79B8
ATGGAGCCAACGTTCCATGCTTTTATGTTTCCCTGGTTTGCTTTTGGTCATATGATTCCT TTTCTACATCTTGCAAACAAACTAGCTGAGAAAGGTCATCAAATCACTTTCTTGCTACCT AAGAAAGCCCAAAAACAGTTGGAACATCACAATCTGTTCCCAGACAGTATTGTCTTTCAC CCTCTCACAATCCCTCATGTCAATGGCCTCCCTGCTGGTGCTGAGACAACCTCGGATATC TCAATCTCGATGGACAACTTACTGTCGGAAGCCTTGGATCTCACTCGCGATCAGGTTGAA GCTGCGGTTCGTGCTCTGAGACCGGACTTGATCTTTTTTGATTTTGCTCATTGGATTCCA GAAATTGCCAAAGAGCATATGATCAAGAGTGTGAGTTACATGATAGTATCTGCAACAACA ATAGCTTATACATTTGCCCCTGGTGGTGTATTAGGTGTTCCCCCACCAGGTTATCCTTCA TCAAAGGTGTTGTACCGTGAAAACGATGCTCATGCCTTAGCAACCTTATCTATCTTCTAC AAGAGACTTTATCATCAGATCACTACAGGTTTTAAGAGCTGTGACATCATTGCATTGAGG ACATGTAATGAAATCGAAGGTAAATTCTGCGACTATATATCAAGTCAATACCATAAGAAG GTTCTCTTGACTGGTCCAATGCTCCCTGAGCAAGACACAAGTAAACCACTAGAAGAACAG TTGAGTCATTTTCTGAGCAGGTTCCCACCGAGGTCAGTGGTGTTTTGTGCACTTGGTAGC CAGATCGTTCTTGAAAAGGATCAATTCCAAGAACTCTGCTTAGGGATGGAGCTGACAGGT
TTACCGTTTCTTATAGCGGTAAAGCCACCGAGAGGATCATCGACGGTCGAAGAAGGGTTA CCAGAAGGGTTCCAGGAGCGGGTGAAAGGGCGTGGTGTGGTTTGGGGAGGATGGGTGCAA CAACCATTGATATTGGATCATCCGTCAATAGGCTGCTTTGTGAACCATTGTGGTCCGGGA ACAATATGGGAGTGTCTTATGACTGATTGTCAAATGGTTTTGCTTCCATTTTTAGGTGAT CAAGTTCTCTTCACAAGATTGATGACCGAGGAATTCAAGGTGTCTGTAGAAGTGTCGAGA GAAAAAACAGGATGGTTTTCAAAGGAGAGCTTGAGCGATGCGATCAAGTCTGTGATGGAT AAAGATAGCGACCTCGGAAAGCTAGTGAGGAGTAACCACGCCAAATTGAAGGAGACTCTT GGTAGTCATGGATTATTAACTGGTTACGTGGATAAATTTGTAGAGGAATTGCAAGAGTAT TTGATTTGA
>UGT79B9
ATGGGCCAAAATTTTCACGCTTTTATGTTCCCATGGTTCGCTTTTGGTCATATGACTCCA TACTTGCATCTAGCCAACAAGCTAGCTGCTAAAGGTCATAGGGTTACTTTCTTGCTGCCT AAGAAAGCTCAAAAACAGTTGGAACATCACAATCTGTTTCCAGACAGGATCATCTTTCAT TCTCTTACTATTCCCCATGTTGATGGCCTACCTGCTGGCGCGGAGACCGCCTCGGACATC CCCATCTCGTTGGGGAAGTTTCTTACCGCAGCCATGGATCTCACTCGCGATCAGGTCGAA GCCGCGGTTCGTGCTTTGAGACCAGACCTGATCTTTTTCGATACTGCTTATTGGGTTCCG GAAATGGCGAAAGAACACAGAGTCAAGAGTGTGATATACTTTGTGATATCAGCTAACTCC ATAGCTCATGAACTTGTACCAGGTGGTGAATTAGGAGTTCCTCCACCTGGCTATCCTTCG TCAAAAGTGTTGTACCGTGGACACGATGCTCACGCTTTGTTGACTTTTTCCATCTTCTAC GAGAGGCTTCATTACCGGATAACAACAGGTCTAAAGAATTGTGATTTTATCTCAATTAGG ACTTGTAAAGAAATCGAAGGTAAATTCTGCGACTATATAGAGCGTCAATACCAGAGGAAG GTTCTTTTGACAGGTCCAATGCTTCCAGAGCCAGATAACAGTAGACCACTCGAAGATCGA TGGAATCACTGGCTGAATCAGTTCAAACCCGGCTCGGTAATATATTGTGCATTGGGAAGT CAAATCACTCTAGAGAAGGATCAATTCCAAGAACTCTGTTTAGGAATGGAGCTCACTGGT TTACCGTTTCTCGTAGCGGTAAAACCACCAAAAGGCGCAAAGACGATCCAAGAAGCGTTG CCAGAAGGGTTTGAGGAGAGGGTGAAGAATCATGGAGTAGTTTGGGGAGAATGGGTGCAG CAACCATTGATATTGGCTCATCCATCAGTAGGCTGCTTTGTGACCCATTGTGGGTTTGGA TCAATGTGGGAGTCTCTAGTGAGTGATTGTCAAATAGTCTTGCTTCCATATTTGTGTGAT CAAATTCTCAACACTAGATTGATGAGTGAGGAACTCGAGGTTTCGGTGGAAGTGAAAAGA GAAGAAACAGGATGGTTCTCGAAAGAGAGCTTAAGTGTTGCGATCACCTCGGTGATGGAC AAAGATAGTGAGTTAGGGAATCTGGTGAGGAGGAACCACGCTAAATTAAAGGAGGTTTTG GTTAGTCCTGGATTATTAACCGGTTACACCGATGAATTTGTTGAAACTTTGCAGAATATA GTCAACGATACAAATCTTGAATGA
>UGT82A1
ATGAAAGTAACACAAAAGCCAAAGATAATATTCATCCCTTATCCGGCGCAAGGCCACGTC ACTCCGATGCTTCACCTTGCATCGGCTTTCCTCAGCCGTGGATTCTCCCCTGTCGTTATG ACTCCCGAGTCTATCCACCGTAGGATCTCGGCTACTAACGAGGATCTTGGGATCACGTTC TTGGCCTTATCTGACGGTCAAGATCGTCCGGACGCACCTCCCTCGGACTTCTTCTCGATA GAGAACTCAATGGAGAACATCATGCCACCACAGCTCGAACGGCTCCTACTAGAAGAAGAC TTGGATGTGGCTTGTGTTGTGGTTGATTTGCTGGCTTCGTGGGCTATAGGAGTGGCTGAT CGGTGTGGAGTTCCGGTCGCCGGATTCTGGCCGGTGATGTTCGCTGCTTACCGTTTGATC CAAGCAATACCGGAGCTAGTCCGAACAGGCTTAGTTTCCCAAAAAGGTTGTCCTCGTCAA CTAGAAAAAACAATAGTCCAGCCAGAGCAACCGCTCCTATCCGCAGAAGATCTACCGTGG CTGATCGGAACTCCCAAAGCTCAGAAAAAACGATTCAAGTTCTGGCAAAGAACTCTAGAA CGAACAAAAAGTCTCCGTTGGATCTTGACAAGCTCCTTTAAAGATGAATATGAAGATGTC GACAACCACAAAGCATCCTACAAAAAATCTAACGATTTAAACAAAGAAAACAATGGTCAA AACCCTCAAATCCTTCATTTAGGTCCATTGCATAACCAAGAAGCAACAAATAATATAACT ATAACCAAGACTAGTTTTTGGGAAGAAGACATGTCTTGTCTAGGTTGGCTTCAAGAACAA AACCCGAACTCAGTCATTTATATCTCATTTGGAAGTTGGGTTTCTCCTATAGGAGAATCA AATATTCAAACGTTGGCATTGGCGTTGGAAGCGTCAGGGAGACCTTTCCTTTGGGCGTTA
AACCGAGTGTGGCAAGAGGGACTACCACCAGGTTTTGTGCATAGAGTCACAATTACCAAA AACCAAGGAAGGATCGTCTCATGGGCTCCGCAACTTGAAGTTCTTAGAAACGATTCTGTG GGATGTTACGTGACTCATTGTGGCTGGAACTCGACTATGGAGGCAGTGGCAAGTTCCCGG AGGCTACTATGTTATCCGGTGGCCGGAGACCAGTTTGTTAACTGTAAATACATCGTGGAC GTTTGGAAGATTGGAGTGAGATTGAGCGGGTTTGGAGAGAAGGAGGTTGAAGATGGACTA AGGAAAGTAATGGAGGATCAAGATATGGGTGAGAGATTGAGGAAGTTAAGAGACAGAGCA ATGGGGAATGAAGCTCGTTTGAGTTCGGAAATGAATTTTACATTTTTAAAAAACGAGCTT AATTAG
>UGT83A1
ATGGATAATAACTCAAATAAAAGAATGGGAAGGCCACATGTTGTGGTCATACCTTACCCT GCACAAGGTCATGTTCTTCCTCTAATAAGTTTCTCACGTTACCTTGCGAAACAAGGAATC CAAATTACATTCATAAACACCGAGTTTAACCATAACCGCATCATCAGTTCCTTACCCAAT TCACCTCATGAAGATTATGTTGGGGATCAGATCAATCTTGTTTCAATCCCTGACGGTTTA GAAGATTCACCAGAAGAGAGGAACATTCCAGGGAAGTTGTCGGAGTCTGTTTTGCGTTTT ATGCCTAAAAAAGTAGAGGAATTGATCGAGAGGATGATGGCAGAAACTAGCGGTGGTACG ATCATTAGCTGCGTTGTAGCGGATCAGAGCTTGGGATGGGCAATTGAAGTTGCAGCTAAG TTTGGGATCAGACGCACCGCGTTTTGTCCTGCTGCAGCTGCGTCTATGGTTCTTGGATTT AGTATTCAAAAACTTATCGATGATGGTCTCATAGATTCTGATGGGACTGTGAGAGTAAAT AAGACAATTCAACTATCTCCCGGGATGCCAAAGATGGAAACAGACAAGTTTGTGTGGGTT TGTCTGAAGAACAAAGAATCTCAGAAAAACATATTCCAACTTATGCTTCAAAACAATAAC TCGATCGAGTCAACGGATTGGTTGTTGTGTAACTCTGTCCATGAACTTGAAACTGCAGCA TTTGGATTGGGCCCGAATATAGTACCAATTGGGCCCATTGGTTGGGCTCATAGTCTTGAA GAGGGATCCACGTCACTAGGAAGCTTTTTACCTCATGACCGGGATTGTCTAGATTGGTTG GACCGGCAGATTCCCGGTTCGGTTATATATGTTGCCTTTGGGAGTTTTGGGGTCATGGGC AACCCTCAGTTAGAAGAGCTAGCAATTGGTCTAGAGCTTACCAAGAGGCCAGTTTTGTGG GTCACTGGTGATCAACAACCAATCAAACTTGGGTCGGATCGAGTCAAAGTGGTGAGATGG GCTCCACAACGGGAGGTCCTTTCTTCTGGAGCCATTGGGTGTTTTGTGAGCCATTGTGGA TGGAATTCAACTCTGGAAGGAGCCCAAAATGGCATACCATTTCTATGCATCCCTTATTTT GCAGACCAATTTATCAACAAAGCATATATATGCGATGTGTGGAAGATTGGATTAGGACTT GAAAGAGACGCACGAGGAGTGGTTCCGAGGTTAGAGGTTAAGAAGAAGATCGATGAGATC ATGAGAGACGGTGGAGAGTATGAAGAACGAGCTATGAAGGTTAAAGAGATTGTGATGAAA AGTGTTGCAAAAGATGGAATATCTTGTGAGAATCTTAATAAATTTGTCAACTGGATCAAA TCACAAGTGAATTGA
>UGT84A1
ATGGTGTTCGAAACTTGTCCATCTCCAAACCCAATTCATGTAATGCTCGTCTCGTTTCAA GGACAAGGCCACGTCAACCCTCTTCTTCGTCTCGGCAAGTTAATTGCTTCAAAGGGTTTA CTCGTTACCTTCGTTACAACGGAGCTTTGGGGCAAGAAAATGAGACAAGCCAACAAAATC GTTGACGGTGAACTTAAACCGGTTGGTTCCGGTTCAATCCGGTTTGAGTTCTTTGATGAA GAATGGGCAGAGGATGATGACCGGAGAGCTGATTTCTCTTTGTACATTGCTCACCTAGAG AGCGTTGGGATACGAGAAGTGTCTAAGCTTGTGAGAAGATACGAGGAAGCGAACGAGCCT GTCTCGTGTCTTATCAATAACCCGTTTATCCCATGGGTCTGCCACGTGGCGGAAGAGTTC AACATTCCTTGTGCGGTTCTCTGGGTTCAGTCTTGTGCTTGTTTCTCTGCTTATTACCAT TACCAAGATGGCTCTGTTTCATTCCCTACGGAAACAGAGCCTGAGCTCGATGTGAAGCTT CCTTGTGTTCCTGTCTTGAAGAACGACGAGATTCCTAGCTTTCTCCATCCTTCTTCTAGG TTCACGGGTTTTCGACAAGCGATTCTTGGGCAATTCAAGAATCTGAGCAAGTCCTTCTGT GTTCTAATCGATTCTTTTGACTCATTGGAACAAGAAGTTATCGATTACATGTCAAGTCTT TGTCCGGTTAAAACCGTTGGACCGCTTTTCAAAGTTGCTAGGACAGTTACTTCTGACGTA AGCGGTGACATTTGCAAATCAACAGATAAATGCCTCGAGTGGTTAGACTCGAGGCCTAAA TCGTCAGTTGTCTACATTTCGTTCGGGACAGTTGCATATTTGAAGCAAGAACAGATCGAA GAGATCGCTCACGGAGTTTTGAAGTCGGGTTTATCGTTCTTGTGGGTGATTAGACCTCCA
CCACACGATCTGAAGGTCGAGACACATGTCTTGCCTCAAGAACTTAAAGAGAGTAGTGCT AAAGGTAAAGGGATGATTGTGGATTGGTGCCCACAAGAGCAAGTCTTGTCTCATCCTTCA GTGGCATGCTTCGTGACTCATTGTGGATGGAACTCGACAATGGAATCTTTGTCTTCAGGT GTTCCGGTGGTTTGTTGTCCGCAATGGGGAGATCAAGTGACTGATGCAGTGTATTTGATC GATGTTTTCAAGACCGGGGTTAGACTAGGCCGTGGAGCGACCGAGGAGAGGGTAGTGCCA AGGGAGGAAGTGGCGGAGAAGCTTTTGGAAGCGACAGTTGGGGAGAAGGCAGAGGAGTTG AGAAAGAACGCTTTGAAATGGAAGGCGGAGGCGGAAGCAGCGGTGGCTCCAGGAGGTTCG TCGGATAAGAATTTTAGGGAGTTTGTGGAGAAGTTAGGTGCGGGAGTAACGAAGACTAAA GATAATGGATACTAG
>UGT84A2
ATGGAGCTAGAATCTTCTCCTCCTCTACCTCCTCATGTGATGCTCGTATCTTTTCCAGGG CAAGGCCACGTTAATCCACTTCTTCGTCTTGGTAAGCTCTTAGCTTCAAAGGGTTTGCTC ATAACCTTCGTCACCACTGAGTCATGGGGCAAAAAGATGCGAATCTCCAACAAAATCCAA GACCGTGTCCTCAAACCGGTTGGTAAAGGCTATCTCCGGTATGATTTCTTCGACGACGGG CTTCCTGAAGACGACGAAGCTAGCAGAACCAACTTAACCATCCTCCGACCACATCTAGAG CTGGTCGGCAAAAGAGAGATCAAGAΆCCTTGTGAAΆCGTTACAAGGAΆGTAACGAAΆCAG CCCGTGACATGTCTTATCAACAACCCTTTCGTCTCTTGGGTCTGTGACGTGGCAGAAGAT CTTCAAATCCCTTGTGCTGTTCTTTGGGTTCAATCTTGTGCCTGCTTAGCTGCTTATTAC TATTACCACCACAACCTAGTTGACTTCCCGACCAAAACAGAACCCGAGATCGATGTCCAA ATCTCTGGCATGCCTCTCTTGAAACATGACGAGATCCCTTCTTTCATTCACCCTTCAAGT CCTCACTCCGCTTTGCGAGAAGTGATCATAGATCAGATTAAACGGCTTCACAAGACTTTC TCCATTTTCATCGACACTTTCAACTCATTGGAGAAAGACATCATTGACCACATGTCGACG CTCTCTCTCCCCGGTGTTATCAGACCGCTAGGACCACTCTACAAAATGGCTAAAACCGTA GCTTATGATGTCGTTAAAGTAAACATCTCTGAGCCAACGGATCCTTGCATGGAGTGGTTA GACTCGCAGCCAGTTTCCTCCGTTGTTTACATCTCATTCGGGACCGTTGCTTACTTGAAA CAAGAACAAATAGACGAGATCGCTTACGGTGTGTTAAACGCCGACGTTACGTTCTTGTGG GTGATTAGACAACAAGAGTTAGGTTTCAACAAAGAGAAACATGTTTTGCCGGAAGAAGTT AAAGGGAAAGGGAAGATCGTTGAATGGTGTTCACAAGAGAAAGTATTATCTCATCCTTCA GTGGCATGTTTCGTGACTCACTGTGGATGGAACTCAACGATGGAAGCTGTGTCTTCCGGA GTCCCGACGGTTTGTTTTCCTCAATGGGGAGATCAAGTCACGGACGCCGTTTACATGATC GATGTTTGGAAGACGGGAGTGAGGCTAAGCCGTGGAGAGGCGGAGGAGAGGTTAGTGCCG AGGGAGGAAGTTGCGGAGAGGTTGAGAGAGGTTACTAAAGGAGAGAAAGCGATCGAGTTG AAAAAGAATGCTTTGAAGTGGAAGGAAGAGGCGGAGGCGGCGGTTGCTCGCGGTGGTTCG TCGGATAGGAATCTTGAAAAGTTTGTGGAGAAGTTGGGTGCCAAACCTGTGGGGAAAGTA CAAAACGGGAGTCATAATCATGTCTTGGCTGGATCAATCAAAAGCTTTTAA
>UGT8 A3
ATGGACCCGTCTCGTCATACTCATGTGATGCTCGTATCTTTCCCCGGCCAAGGTCACGTA AACCCTCTACTTCGTCTCGGAAAGCTCATAGCCTCTAAAGGCTTACTCGTCACCTTTGTC ACCACAGAGAAGCCATGGGGCAAGAAGATGCGTCAAGCCAACAAGATTCAAGACGGTGTG CTCAAACCGGTCGGTCTAGGTTTCATCCGGTTTGAGTTCTTCTCTGACGGCTTCGCCGAC GACGATGAAAAAAGATTCGACTTCGATGCCTTCCGACCACACCTTGAAGCTGTCGGAAAA CAAGAGATCAAGAATCTCGTTAAGAGATATAACAAGGAGCCGGTGACGTGTCTCATAAAC AACGCTTTTGTCCCATGGGTATGTGATGTCGCCGAGGAGCTTCACATCCCTTCGGCTGTT CTATGGGTCCAGTCTTGTGCTTGTCTCACGGCTTATTACTATTACCACCACCGGTTAGTT AAGTTCCCGACCAAAACCGAGCCGGACATCAGCGTTGAAATCCCTTGCTTGCCATTGTTA AAGCATGACGAGATCCCAAGCTTTCTTCACCCTTCGTCTCCGTATACAGCTTTTGGAGAT ATCATTTTAGACCAGTTAAAGAGATTCGAAAACCACAAGTCTTTCTATCTTTTCATCGAC ACTTTTCGCGAACTAGAAAAAGACATCATGGACCACATGTCACAACTTTGTCCTCAAGCC ATCATCAGTCCTGTCGGTCCGCTCTTCAAGATGGCTCAAACCTTGAGTTCTGACGTTAAG GGAGATATATCCGAGCCAGCGAGTGACTGCATGGAATGGCTTGACTCAAGAGAACCATCC
TCAGTCGTTTACATCTCCTTTGGGACTATAGCCAACTTGAAGCAAGAGCAGATGGAGGAG ATCGCTCATGGCGTTTTGAGCTCTGGCTTGTCGGTCTTATGGGTGGTTCGGCCTCCCATG GAAGGGACATTTGTAGAACCACATGTTTTGCCTCGAGAGCTCGAAGAAAAGGGTAAAATC GTGGAATGGTGTCCCCAAGAGAGAGTCTTGGCTCATCCTGCGATTGCTTGTTTCTTAAGT CACTGCGGATGGAACTCGACAATGGAGGCTTTAACTGCCGGAGTCCCCGTTGTTTGTTTT CCGCAATGGGGAGATCAAGTGACTGATGCGGTGTACTTGGCTGATGTTTTCAAGACAGGA GTGAGACTAGGCCGCGGAGCCGCTGAGGAGATGATTGTTTCGAGGGAGGTTGTAGCAGAG AAGCTGCTTGAGGCCACAGTTGGGGAAAAGGCGGTGGAGCTGAGAGAAAACGCTCGGAGG TGGAAGGCGGAGGCCGAGGCCGCCGTGGCGGACGGTGGATCATCTGATATGAACTTTAAA GAGTTTGTGGACAAGTTGGTTACGAAACATGTGACGAGAGAAGACAACGGAGAACACTAG
>UGT84A4
ATGGAGATGGAATCGTCGTTACCTCATGTGATGCTCGTATCATTCCCAGGGCAAGGTCAC ATAAGCCCTCTTCTTCGTCTCGGAAAGATCATTGCCTCTAAAGGCTTAATCGTCACCTTT GTAACCACAGAGGAACCATTGGGCAAGAAGATGCGTCAAGCCAACAATATTCAAGACGGT GTGCTCAAACCGGTCGGGCTAGGTTTTCTCCGGTTCGAGTTCTTCGAGGATGGATTTGTC TACAAAGAAGACTTTGATTTGTTACAAAAATCACTTGAAGTTTCCGGAAAACGAGAGATC AAGAATCTTGTCAAGAAATATGAGAAGCAACCAGTGAGATGTCTCATAAATAATGCCTTT GTTCCATGGGTTTGTGACATAGCCGAGGAGCTTCAAATCCCATCAGCTGTTCTTTGGGTC CAGTCTTGTGCTTGCCTCGCCGCTTATTACTATTACCACCACCAGTTAGTTAAGTTTCCG ACCGAAACCGAGCCGGAAATAACCGTTGACGTCCCTTTCAAGCCATTAACATTGAAGCAT GACGAGATCCCTAGCTTTCTTCACCCTTCCTCTCCGCTGTCCTCTATAGGAGGTACCATT TTAGAGCAGATCAAGCGACTTCACAAGCCTTTCTCTGTTCTCATCGAAACTTTTCAAGAA CTTGAAAAAGATACCATTGACCACATGTCCCAGCTCTGCCCTCAAGTCAACTTCAACCCC ATCGGTCCGCTTTTTACTATGGCTAAAACCATAAGGTCTGACATCAAGGGAGACATCTCC AAGCCAGATAGTGACTGCATAGAGTGGCTTGACTCGAGAGAACCATCCTCCGTTGTTTAC ATCTCTTTTGGGACTTTGGCTTTCTTGAAGCAAAACCAGATCGACGAGATTGCTCACGGC ATTCTCAACTCCGGGTTGTCCTGCTTATGGGTTTTGCGGCCTCCCTTAGAAGGCTTAGCC ATAGAACCGCATGTCTTGCCTCTAGAGCTTGAAGAGAAAGGGAAGATTGTGGAATGGTGT CAACAAGAGAAAGTTTTGGCTCATCCTGCGGTTGCTTGCTTCTTAAGTCACTGTGGATGG AACTCAACCATGGAGGCTTTAACTTCAGGAGTTCCCGTTATTTGTTTCCCGCAGTGGGGA GATCAGGTGACAAATGCGGTGTACATGATTGATGTTTTCAAGACAGGATTGAGACTCAGC CGTGGAGCTTCCGATGAGAGGATTGTTCCAAGGGAGGAGGTTGCTGAGCGACTGCTTGAG GCCACCGTTGGAGAGAAGGCGGTGGAGCTGAGAGAAAACGCTCGGAGGTGGAAGGAGGAG GCGGAGTCTGCCGTGGCTTACGGTGGAACATCGGAAAGGAATTTTCAAGAGTTTGTTGAC AAGTTGGTTGATGTCAAGACAATGACAAACATTAATAATGTCGTGTAA
>UGT84B1
ATGGGCAGTAGTGAGGGTCAAGAAACACATGTCCTAATGGTAACACTACCATTCCAAGGT CACATCAATCCAATGCTCAAACTCGCAAAACATCTCTCGTTATCATCAAAGAACCTACAC ATCAATCTCGCCACTATTGAGTCAGCCCGTGATCTCCTCTCCACCGTAGAAAAACCTCGT TATCCGGTGGACCTCGTGTTCTTCTCCGATGGTCTACCTAAAGAAGATCCAAAGGCCCCT GAAACTCTTTTGAAGTCATTGAATAAAGTCGGAGCCATGAACTTGTCTAAAATCATCGAA GAAAAGAGATACTCTTGTATCATCTCTTCGCCTTTTACTCCATGGGTTCCAGCTGTTGCA GCCTCTCATAACATCTCTTGTGCAATACTTTGGATCCAAGCTTGTGGAGCTTACTCGGTT TATTACCGTTACTACATGAAGACAAACTCTTTCCCTGATCTTGAAGATCTGAATCAAACG GTGGAGTTACCAGCTTTACCATTGTTGGAAGTTCGAGATCTTCCATCGTTTATGTTACCT TCTGGTGGTGCTCACTTCTATAATCTAATGGCGGAATTTGCAGATTGTTTGAGGTATGTG AAATGGGTTTTGGTTAATTCATTCTATGAACTCGAATCAGAGATAATCGAATCGATGGCT GATTTAAAACCTGTAATTCCAATTGGTCCTCTGGTTTCTCCATTTCTGTTGGGCGATGGT GAGGAGGAAACCCTAGACGGTAAAAACCTAGATTTTTGTAAATCTGATGATTGTTGTATG GAGTGGCTTGACAAGCAAGCTAGGTCTTCTGTTGTGTACATATCTTTCGGAAGTATGCTC
GAAACATTGGAGAATCAGGTCGAGACCATAGCGAAGGCGCTGAAGAACAGAGGACTTCCA TTTCTTTGGGTGATAAGGCCAAAGGAGAAAGCCCAAAACGTTGCTGTTTTGCAGGAGATG GTGAAAGAAGGACAAGGGGTTGTTCTCGAGTGGAGTCCACAAGAGAAGATTTTGAGCCAC GAGGCAATCTCTTGTTTTGTCACGCATTGCGGCTGGAACTCGACTATGGAGACGGTGGTG GCTGGTGTTCCTGTGGTAGCGTACCCTAGCTGGACGGATCAGCCCATTGACGCGCGGTTG CTTGTTGATGTGTTTGGAATCGGAGTAAGGATGAGGAATGACAGTGTCGATGGCGAGCTT AAGGTCGAAGAAGTAGAAAGATGCATTGAGGCCGTGACGGAGGGACCCGCTGCCGTGGAT ATAAGAAGGAGAGCGGCGGAGCTAAAGCGCGTGGCGAGATTGGCGTTGGCACCTGGTGGA TCTTCGACACGGAATTTAGACTTGTTCATTAGTGATATCACAATCGCCTAA
>UGT84B2
ATGGGAAGTAATGAGGGTCAAGAAACACATGTCCTAATGGTAGCATTAGCATTCCAAGGT CATCTCAATCCAATGCTCAAATTCGCAAAACATCTCGCACGAACCAATCTACACTTCACT CTCGCCACCACTGAGCAAGCCCGTGACCTCCTCTCTTCCACCGCTGACGAACCTCATAGA CCGGTGGACCTCGCTTTCTTCTCAGACGGTCTACCTAAAGACGATCCAAGAGATCCCGAC ACTCTCGCAAAGTCATTGAAAAAAGATGGAGCCAAGAACTTGTCAAAAATCATCGAAGAA AAGAGATTTGATTGCATCATCTCTGTGCCTTTTACTCCCTGGGTTCCAGCTGTTGCAGCT GCACATAACATTCCTTGTGCAATCCTCTGGATCCAAGCTTGTGGAGCTTTTTCTGTTTAT TACCGTTATTACATGAAGACAAATCCTTTCCCCGACCTTGAAGATCTGAATCAAACAGTG GAGTTACCAGCTTTACCATTGTTGGAAGTCCGAGATCTCCCGTCATTGATGTTACCTTCT CAAGGAGCTAATGTCAATACCCTAATGGCGGAATTTGCAGATTGTTTGAAAGATGTGAAA TGGGTTTTGGTTAACTCGTTTTACGAACTCGAATCAGAGATCATCGAGTCTATGTCTGAT TTAAAACCTATAATCCCAATTGGTCCTCTTGTTTCTCCATTCCTGTTGGGAAATGATGAA GAAAAAACCCTAGATATGTGGAAAGTTGATGATTATTGTATGGAGTGGCTTGACAAGCAA GCTAGGTCTTCAGTTGTTTACATATCTTTCGGAAGCATACTCAAATCATTGGAGAATCAA GTTGAGACCATAGCAACGGCATTAAAAAACAGAGGAGTTCCATTTCTTTGGGTGATACGG CCGAAGGAGAAAGGCGAAAACGTCCAGGTTTTGCAGGAGATGGTTAAAGAAGGTAAAGGG GTTGTAACTGAATGGGGTCAACAAGAAAAGATATTGAGCCACATGGCGATTTCTTGCTTC ATCACGCATTGTGGATGGAACTCGACGATCGAGACGGTGGTGACTGGTGTTCCCGTGGTG GCGTATCCGACTTGGATAGATCAGCCGCTTGATGCGAGACTGCTTGTGGATGTGTTTGGA ATCGGAGTAAGGATGAAGAACGACGCTATCGATGGAGAGCTTAAGGTTGCAGAGGTGGAG AGATGCATTGAGGCCGTGACAGAGGGACCTGCCGCCGCGGATATGAGGAGGAGAGCGACG GAGCTGAAGCACGCCGCAAGATCGGCGATGTCACCTGGTGGATCTTCCGCTCAGAATTTA GACTCGTTCATTAGTGATATCCCAATCACTTGA
>UGT85A1
ATGGGATCTCAGATCATTCATAACTCACAAAAACCACATGTAGTTTGTGTTCCATATCCG GCTCAAGGCCACATCAACCCTATGATGAGAGTGGCTAAACTCCTCCACGCCAGAGGCTTC TACGTCACCTTCGTCAACACCGTCTACAACCACAATCGTTTCCTTCGTTCTCGTGGGTCC AATGCCCTAGATGGACTTCCTTCGTTCCGATTTGAGTCCATTGCTGACGGTCTACCAGAG ACAGACATGGATGCCACGCAGGACATCACAGCTCTTTGCGAGTCCACCATGAAGAACTGT CTCGCTCCGTTCAGAGAGCTTCTCCAGCGGATCAACGCTGGAGATAATGTTCCTCCGGTA AGCTGTATTGTATCTGACGGTTGTATGAGCTTTACTCTTGATGTTGCGGAGGAGCTTGGA GTCCCGGAGGTTCTTTTTTGGACAACCAGTGGCTGTGCGTTCCTGGCTTATCTACACTTT TATCTCTTCATCGAGAAGGGCTTATGTCCGCTAAAAGATGAGAGTTACTTGACGAAGGAG TACTTAGAAGACACGGTTATAGATTTTATACCAACCATGAAGAATGTGAAACTAAAGGAT ATTCCTAGCTTCATACGTACCACTAATCCTGATGATGTTATGATTAGTTTCGCCCTCCGC GAGACCGAGCGAGCCAAACGTGCTTCTGCTATCATTCTAAACACATTTGATGACCTTGAG CATGATGTTGTTCATGCTATGCAATCTATCTTACCTCCGGTTTATTCAGTTGGACCGCTT CATCTCTTAGCAAACCGGGAGATTGAAGAAGGTAGTGAGATTGGAATGATGAGTTCGAAT TTATGGAAAGAGGAGATGGAGTGTTTGGATTGGCTTGATACTAAGACTCAAAATAGTGTC ATTTATATCAACTTTGGGAGCATAACGGTTTTGAGTGTGAAGCAGCTTGTGGAGTTTGCT
TGGGGTTTGGCGGGAAGTGGGAAAGAGTTTTTATGGGTGATCCGGCCAGATTTAGTAGCG GGAGAGGAGGCTATGGTTCCGCCGGACTTTTTAATGGAGACTAAAGACCGCAGTATGCTA GCGAGTTGGTGTCCTCAAGAGAAAGTACTTTCTCATCCTGCTATTGGAGGGTTTTTGACG CATTGCGGGTGGAACTCGATATTGGAAAGTCTTTCGTGTGGAGTTCCGATGGTGTGTTGG CCATTTTTTGCTGACCAGCAAATGAATTGTAAGTTTTGTTGTGACGAGTGGGATGTTGGG ATTGAGATAGGTGGAGATGTGAAGAGAGAGGAAGTTGAGGCGGTGGTTAGAGAGCTCATG GATGGAGAGAAGGGAAAGAAAATGAGAGAAAAGGCGGTAGAGTGGCAGCGCTTAGCCGAG AAAGCGACGGAACATAAACTTGGTTCTTCCGTTATGAATTTTGAGACGGTTGTTAGCAAG TTTCTTTTGGGACAAAAATCACAGGATTAA
>UGT85A2
ATGGGATCTCATGTCGCACAAAAACAACACGTAGTTTGCGTTCCTTATCCGGCTCAAGGC CACATCAACCCAATGATGAAAGTGGCTAAACTCCTTTACGCCAAAGGCTTCCATATTACC TTCGTCAACACCGTCTACAACCACAACCGTCTCCTCCGGTCCCGTGGGCCTAACGCCGTT GACGGGCTTCCTTCTTTCCGGTTTGAGTCCATCCCTGACGGTCTACCCGAGACTGACGTG GACGTCACTCAGGACATCCCTACTCTTTGCGAGTCCACAATGAAGCACTGTCTCGCTCCA TTCAAGGAGCTTCTCCGGCAGATCAACGCAAGGGATGATGTTCCTCCTGTGAGCTGTATC GTATCCGACGGTTGTATGAGCTTCACACTTGATGCTGCGGAGGAGCTCGGTGTCCCGGAG GTTCTTTTTTGGACAACTAGTGCTTGTGGCTTCTTGGCTTACCTTTACTACTATCGCTTC ATCGAGAAGGGATTATCACCAATAAAAGATGAGAGTTACTTAACCAAGGAACACTTGGAC ACAAAAATAGACTGGATACCATCGATGAAGAACCTAAGACTAAAAGACATCCCTAGCTTC ATCCGAACGACTAATCCTGACGACATCATGCTCAACTTTATCATCCGTGAGGCTGACCGA GCCAAACGCGCTTCAGCTATCATTCTCAACACGTTTGATGATCTCGAACACGACGTTATC CAATCTATGAAATCCATTGTACCTCCGGTTTATTCTATTGGACCGTTACATTTACTAGAG AAACAAGAGAGCGGCGAGTATAGTGAAATCGGACGGACAGGATCGAATCTTTGGAGAGAG GAGACTGAGTGTCTGGACTGGCTAAACACGAAAGCTAGAAACAGTGTTGTGTACGTTAAC TTCGGGAGTATAACTGTTTTGAGCGCAAAACAGCTTGTGGAGTTTGCATGGGGTTTGGCT GCAACGGGGAAAGAGTTTTTGTGGGTGATCCGGCCGGATTTAGTAGCCGGGGATGAGGCA ATGGTTCCACCGGAGTTTTTAACGGCTACGGCGGACCGGAGGATGTTGGCAAGTTGGTGT CCTCAAGAGAAAGTCCTTTCTCATCCGGCCATTGGAGGGTTCTTGACGCATTGCGGGTGG AACTCGACGTTGGAAAGTCTATGCGGTGGAGTTCCAATGGTGTGTTGGCCGTTTTTTGCA GAGCAACAAACTAATTGTAAGTTTTCTCGTGACGAATGGGAGGTTGGGATTGAGATTGGT GGAGATGTGAAGAGAGAAGAGGTTGAGGCGGTGGTTAGGGAGTTGATGGATGAAGAGAAG GGAAAGAATATGAGAGAGAAGGCGGAAGAGTGGCGGCGCTTGGCGAATGAAGCGACGGAG CATAAGCATGGTTCTTCTAAATTGAACTTTGAGATGCTCGTTAATAAGGTTCTTTTAGGG GAGTAG
>UGT85A3
ATGGGATCCCGTTTTGTTTCTAACGAACAAAAACCACACGTAGTTTGCGTGCCTTACCCA GCTCAAGGCCACATTAACCCTATGATGAAAGTGGCTAAACTCCTCCACGTCAAAGGCTTC CACGTCACCTTCGTCAACACCGTCTACAACCACAACCGTCTACTCCGATCCCGTGGGGCC AACGCACTCGATGGACTTCCTTCCTTCCAGTTCGAGTCAATACCTGACGGTCTTCCGGAG ACTGGCGTGGACGCCACGCAGGACATCCCTGCCCTTTCCGAGTCCACAACGAAAAACTGT CTCGTTCCGTTCAAGAAGCTTCTCCAGCGGATTGTCACGAGAGAGGATGTCCCTCCGGTG AGCTGTATTGTATCAGATGGTTCGATGAGCTTTACTCTTGACGTAGCGGAAGAGCTTGGT GTTCCGGAGATTCATTTTTGGACCACTAGTGCTTGTGGCTTCATGGCTTATCTACACTTT TATCTCTTCATCGAGAAGGGTTTATGTCCAGTAAAAGATGCGAGTTGCTTGACGAAGGAA TACTTGGACACAGTTATAGATTGGATACCGTCAATGAACAATGTAAAACTAAAAGACATT CCTAGTTTTATACGTACCACTAATCCTAACGACATAATGCTCAACTTCGTTGTCCGTGAG GCATGTCGAACCAAACGTGCCTCTGCTATCATTCTGAACACGTTTGATGACCTTGAACAT GACATAATCCAGTCTATGCAATCCATTTTACCACCGGTTTATCCAATCGGACCGCTTCAT CTCTTAGTAAACAGGGAGATTGAAGAAGATAGTGAGATTGGAAGGATGGGATCAAATCTA
TGGAAAGAGGAGACTGAGTGCTTGGGATGGCTTAATACTAAGTCTCGAAATAGCGTTGTT TATGTTAACTTTGGGAGCATAACAATAATGACCACGGCACAGCTTTTGGAGTTTGCTTGG GGTTTGGCGGCAACGGGAAAGGAGTTTCTATGGGTGATGCGGCCGGATTCAGTAGCCGGA GAGGAGGCAGTGATTCCAAAAGAGTTTTTAGCGGAGACAGCTGATCGAAGAATGCTGACA AGTTGGTGTCCTCAGGAGAAAGTTCTTTCTCATCCGGCGGTCGGAGGGTTCTTGACCCAT TGCGGGTGGAATTCGACGTTAGAAAGTCTTTCATGCGGAGTTCCAATGGTATGTTGGCCA TTTTTTGCTGAGCAACAAACAAATTGTAAGTTTTCTTGTGATGAATGGGAGGTTGGTATT GAGATCGGTGGAGATGTCAAGAGGGGAGAGGTTGAGGCGGTGGTTAGAGAGCTCATGGAT GGAGAGAAAGGAAAGAAAATGAGAGAGAAGGCTGTAGAGTGGCGGCGCTTGGCCGAGAAA GCTACAAAGCTTCCGTGTGGTTCGTCGGTGATAAATTTTGAGACGATTGTCAACAAGGTT CTCTTGGGAAAGATCCCTAACACGTAA
>UGT85A4
ATGGAACAACATGGCGGTTCTAGCTCACAGAAACCTCACGCAATGTGCATACCTTATCCA GCACAAGGCCACATCAACCCAATGCTGAAACTAGCCAAGCTCCTCCACGCTAGAGGCTTC CACGTCACTTTCGTCAACACCGACTACAACCACCGCCGTATCCTCCAATCACGTGGCCCT CACGCTCTCAACGGTCTCCCCTCGTTTCGCTTCGAGACTATCCCCGACGGTCTTCCTTGG ACAGACGTCGACGCTAAGCAAGACATGCTCAAGCTTATTGACTCCACAATAAACAACTGT TTAGCTCCATTCAAAGACCTCATCCTCCGGTTAAACTCCGGTTCTGATATACCACCGGTT AGCTGTATCATCTCCGACGCTTCAATGAGCTTCACAATTGACGCAGCGGAGGAGCTTAAA ATTCCGGTAGTTCTCCTCTGGACCAACAGTGCTACTGCTTTAATCTTGTATCTCCATTAC CAAAAACTCATCGAGAAAGAGATAATTCCCCTCAAAGATTCGAGTGACTTGAAGAAGCAT TTAGAGACGGAGATTGATTGGATACCGTCGATGAAGAAGATTAAGCTTAAGGATTTTCCA GATTTCGTCACCACGACGAATCCTCAAGATCCGATGATTAGTTTCATCCTTCATGTAACC GGAAGAATCAAAAGAGCTTCTGCGATCTTCATCAACACTTTCGAAAAACTCGAGCATAAC GTTCTCTTATCTCTGCGATCTCTTCTCCCTCAGATCTACTCCGTTGGACCGTTCCAGATT CTGGAGAATCGCGAAATCGATAAGAACAGCGAAATCAGAAAGCTAGGATTGAATCTCTGG GAAGAAGAGACGGAGTCTTTGGATTGGCTAGATACTAAAGCTGAGAAAGCTGTGATTTAC GTCAACTTCGGGAGTCTAACGGTTTTGACTAGTGAGCAGATCTTAGAGTTCGCTTGGGGT TTAGCGAGGAGCGGGAAAGAGTTTCTCTGGGTGGTGAGATCTGGTATGGTCGACGGAGAT GATTCGATTCTTCCGGCGGAGTTTTTATCGGAGACGAAGAATCGAGGAATGTTAATTAAA GGATGGTGTTCTCAGGAGAAGGTACTTTCGCATCCGGCGATTGGAGGATTTTTGACTCAC TGTGGATGGAATTCGACGTTGGAGAGTTTGTACGCCGGTGTTCCGATGATCTGTTGGCCA TTTTTTGCTGATCAGTTGACGAATCGAAAGTTCTGTTGCGAGGATTGGGGGATTGGGATG GAGATCGGCGAGGAGGTGAAGAGGGAGAGAGTGGAGACGGTGGTTAAAGAGCTCATGGAC GGAGAGAAGGGAAAGAGGTTAAGAGAGAAGGTGGTGGAGTGGCGGCGCTTGGCGGAAGAA GCTTCGGCGCCACCGTTGGGATCATCGTACGTGAATTTTGAAACGGTGGTTAATAAAGTC CTTACATGTCACACGATTAGATCGACCTAA
>UGT85A5
ATGGCGTCTCATGCTGTTACAAGCGGACAAAAACCACACGTAGTTTGCATACCTTTCCCG GCTCAAGGCCACATCAATCCGATGCTCAAAGTGGCTAAACTCCTCTATGCCAGAGGCTTC CATGTTACCTTCGTCAACACTAACTACAACCATAACCGTCTCATCCGGTCACGTGGTCCC AACTCCCTTGATGGGCTTCCTTCTTTTCGGTTCGAGTCCATCCCTGACGGTCTACCGGAG GAAAACAAGGACGTCATGCAGGATGTCCCTACCCTTTGTGAGTCCACCATGAAAAACTGT CTAGCTCCTTTCAAGGAGCTTCTCCGGCGGATCAACACCACAAAGGATGTTCCTCCGGTA AGCTGTATTGTATCCGACGGTGTGATGAGCTTTACTCTTGATGCTGCAGAGGAGCTTGGA GTCCCGGATGTTCTTTTTTGGACACCAAGTGCTTGTGGCTTCTTGGCTTATCTACACTTC TATCGCTTCATCGAGAAGGGGTTATCACCAATAAAAGATGAAAGTTCTTTGGACACAAAA ATAAATTGGATACCATCGATGAAAAACCTAGGACTTAAAGACATCCCAAGCTTTATCCGT GCAACTAATACTGAAGACATAATGCTTAACTTTTTTGTCCATGAGGCTGACCGAGCCAAA CGCGCTTCCGCTATCATTGTCAACACATTCGATAGTCTTGAGCATGATGTCGTCCGTTCT
ATTCAATCTATCATACCTCAAGTGTACACTATTGGACCGCTTCATCTATTTGTGAATCGG GATATCGACGAGGAAAGTGACATCGGACAGATAGGAACGAATATGTGGAGAGAGGAGATG GAGTGTTTGGATTGGCTTGATACTAAGTCTCCAAACAGTGTCGTTTATGTTAATTTCGGT AGCATAACAGTGATGAGTGCGAAACAACTCGTGGAGTTTGCTTGGGGTTTAGCAGCGACC AAAAAAGATTTTTTGTGGGTGATTAGGCCGGATTTAGTAGCCGGTGATGTGCCAATGCTT CCGCCGGACTTTCTAATAGAGACGGCTAACCGAAGGATGCTAGCGAGTTGGTGTCCTCAA GAAAAAGTTCTTTCTCATCCGGCAGTTGGAGGGTTCTTAACGCATAGTGGATGGAATTCG ACTTTGGAGAGTCTCTCCGGTGGAGTTCCAATGGTGTGTTGGCCGTTCTTTGCGGAACAG CAAACAAATTGTAAATATTGTTGTGATGAATGGGAAGTGGGGATGGAGATCGGTGGAGAT GTGAGGAGGGAGGAGGTTGAGGAGTTGGTTAGAGAACTCATGGACGGAGACAAAGGAAAG AAAATGAGGCAAAAGGCCGAAGAGTGGCAGCGCTTGGCTGAGGAAGCGACGAAGCCTATT TATGGTTCGTCGGAACTAAATTTTCAGATGGTCGTTGACAAGGTTCTTTTAGGGGAGTAG
>UGT85A7
ATGGAATCTCATGTTGTTCATAACGCACAAAAGCCACACGTAGTTTGCGTGCCTTACCCG GCTCAAGGCCACATCAATCCGATGCTGAAAGTGGCTAAACTCCTCTACGCTAAAGGCTTT CACGTCACCTTCGTTAACACTCTCTACAACCACAACCGTCTCCTCCGGTCCCGTGGTCCC AACGCGCTCGACGGGTTTCCTTCATTCCGGTTCGAGTCCATCCCTGACGGTCTACCGGAG ACTGATGGCGATAGGACGCAGCATACTCCTACCGTTTGCATGTCCATTGAGAAAAACTGT CTCGCTCCATTCAAAGAGATTCTGCGCCGGATCAACGATAAAGATGATGTTCCTCCAGTG AGTTGTATTGTATCGGACGGTGTGATGAGTTTTACTCTTGACGCAGCCGAGGAACTAGGT GTCCCAGAGGTTATTTTTTGGACCAATAGTGCTTGTGGTTTCATGACTATTCTACACTTT TATCTTTTCATCGAGAAGGGTCTATCTCCTTTTAAAGACGAAAGTTACATGTCAAAGGAG CATCTAGACACAGTTATAGATTGGATACCATCAATGAAGAATCTTAGGTTAAAGGACATC CCTAGCTATATACGTACCACAAATCCTGACAACATAATGCTTAATTTCCTCATTCGAGAA GTTGAGCGATCTAAACGCGCTAGTGCTATCATTCTCAACACGTTTGATGAACTCGAGCAT GATGTTATCCAGTCTATGCAATCTATTTTACCTCCGGTTTATTCTATTGGGCCACTCCAT CTCCTTGTGAAGGAAGAAATAAACGAGGCTAGTGAAATAGGACAGATGGGATTAAATTTG TGGAGAGAGGAGATGGAATGTTTGGATTGGCTCGATACAAAAACTCCAAACAGTGTTCTT TTTGTTAACTTTGGATGCATAACGGTGATGAGTGCAAAACAGCTTGAAGAATTTGCTTGG GGTTTGGCGGCAAGTAGGAAAGAGTTTTTATGGGTGATCCGTCCTAATTTAGTGGTGGGA GAGGCGATGGTGGTTCTTCCACAAGAGTTTTTAGCGGAGACGATAGACCGGAGAATGTTA GCTAGTTGGTGTCCTCAGGAGAAAGTTCTTTCTCATCCCGCGATAGGAGGGTTCTTGACG CATTGCGGGTGGAACTCAACATTGGAGAGTCTCGCTGGTGGTGTTCCGATGATATGTTGG CCATGTTTTTCGGAGCAACCGACGAATTGTAAGTTTTGTTGTGATGAGTGGGGAGTGGGT ATAGAGATTGGTAAAGATGTGAAGAGAGAGGAGGTCGAGACGGTGGTTAGAGAACTTATG GATGGAGAAAAGGGGAAAAAGCTGAGAGAAAAGGCGGAAGAGTGGCGGCGGTTGGCCGAG GAAGCGACGAGGTATAAACATGGTTCGTCGGTCATGAATCTTGAGACGCTTATACATAAA GTTTTCTTAGAAAATCTTAGATGA
>UGT86A1
ATGGAGAGAGCAAAGTCGAGGAAGCCTCATATCATGATGATACCATACCCACTTCAAGGT CACGTTATCCCTTTTGTCCACTTAGCCATCAAACTTGCTTCTCATGGCTTCACCATCACT TTCGTCAACACCGACTCCATCCACCACCACATCTCCACCGCTCACCAAGATGACGCCGGT GACATCTTCTCCGCCGCTCGCAGCTCCGGCCAGCACGACATACGTTACACCACCGTGAGC GACGGCTTCCCTTTAGACTTTGACCGGTCACTGAACCATGACCAGTTTTTCGAAGGCATT CTCCACGTCTTCTCTGCCCACGTGGATGATCTCATCGCCAAACTCTCCCGCCGTGATGAT CCTCCCGTGACTTGCTTGATCGCCGACACGTTTTATGTTTGGTCATCTATGATTTGCGAC AAGCACAACCTTGTAAATGTCTCGTTTTGGACCGAACCTGCCTTGGTCCTCAATCTCTAT TATCACATGGATCTCCTCATATCTAACGGTCATTTCAAATCTCTTGATAATCGTAAAGAC GTGATCGATTACGTACCAGGGGTTAAAGCAATAGAACCAAAGGACTTGATGTCATATCTT CAAGTAAGCGACAAAGACGTAGACACAAATACAGTAGTATACAGAATATTATTCAAGGCC
TTTAAAGACGTCAAGAGAGCCGACTTCGTCGTATGCAACACGGTGCAAGAGCTCGAACCA GACTCTCTCTCGGCTCTACAAGCCAAACAACCGGTTTACGCTATCGGTCCGGTTTTCTCA ACTGATTCGGTAGTTCCCACAAGCTTATGGGCCGAGTCAGACTGTACCGAGTGGCTTAAG GGCCGGCCCACTGGGTCAGTTCTCTACGTCTCGTTTGGTAGCTATGCACATGTTGGTAAG AAGGAGATTGTTGAGATAGCTCATGGGCTTTTGCTTAGTGGGATTAGTTTCATTTGGGTT TTACGTCCGGATATAGTTGGATCCAACGTACCAGATTTTCTTCCAGCCGGGTTTGTGGAC CAAGCCCAAGATCGAGGTCTTGTGGTCCAATGGTGCTGCCAGATGGAAGTTATTTCAAAT CCGGCCGTGGGAGGGTTTTTCACACATTGTGGATGGAATTCAATTCTAGAGAGCGTTTGG TGTGGTTTGCCTTTGTTGTGTTATCCACTTTTGACAGATCAGTTCACGAATAGGAAGCTT GTGGTCGATGATTGGTGCATTGGGATTAATCTTTGTGAGAAGAAGACAATCACAAGGGAC CAAGTCTCAGCGAATGTTAAAAGATTGATGAATGGAGAAACTTCAAGTGAGCTAAGAAAC AATGTTGAAAAGGTTAAACGTCATCTCAAAGATGCGGTTACAACCGTTGGATCTTCGGAG ACGAATTTTAACTTGTTTGTTAGTGAGGTCCGAAATAGAATAGAAACTAAATTGTGTAAT GTAAATGGACTAGAAATAAGTCCATCAAACTAA
>UGT86A2
ATGGCGGACGTTAGAAACCCTACAAAAAATCATCATGGTCATCATCATCTTCATGCTCTC TTGATCCCATATCCATTTCAAGGGCATGTAAACCCATTTGTACACTTAGCCATCAAGCTC GCGTCACAGGGGATCACCGTCACTTTCGTCAACACTCATTACATCCACCACCAGATCACA AACGGCTCCGATGGAGATATTTTCGCTGGAGTTAGGTCAGAGTCTGGCCTTGACATAAGG TACGCGACGGTTTCCGATGGTTTACCGGTCGGATTTGACCGGTCGTTGAACCATGACACG TACCAATCGTCGCTGTTGCACGTGTTCTATGCGCATGTGGAAGAGCTTGTGGCGAGTCTT GTTGGAGGAGACGGCGGTGTGAATGTGATGATCGCCGACACATTCTTTGTTTGGCCGTCT GTGGTGGCTAGGAAGTTTGGTTTGGTTTGTGTCTCGTTTTGGACCGAAGCTGCTTTAGTA TTTTCACTTTATTACCATATGGATCTGCTTCGGATTCATGGCCATTTTGGTGCTCAAGAA ACCCGCAGCGATCTAATCGACTACATTCCCGGAGTCGCCGCAATTAACCCAAAAGACACG GCGTCGTATCTTCAAGAAACCGACACGTCATCAGTAGTTCATCAAATCATCTTCAAAGCA TTCGAAGACGTGAAAAAAGTCGATTTTGTACTCTGCAACACAATTCAGCAATTCGAAGAC AAAACAATCAAAGCCCTAAACACAAAAATCCCATTTTACGCAATCGGACCAATCATACCA TTCAATAACCAAACCGGTTCAGTCACAACCTCACTCTGGTCTGAATCAGATTGTACACAA TGGCTCAACACTAAACCAAAAAGCTCCGTACTTTATATCTCCTTTGGTAGTTACGCTCAT GTCACAAAGAAGGATCTTGTTGAGATAGCTCACGGGATTTTGTTGAGTAAAGTTAATTTC GTTTGGGTGGTGAGACCAGACATTGTTAGTTCAGACGAAACCAATCCATTACCAGAAGGG TTTGAAACAGAAGCTGGAGATCGTGGGATTGTAATACCATGGTGTTGTCAAATGACGGTT TTGTCACATGAGAGTGTTGGTGGGTTTTTGACACATTGTGGTTGGAACTCGATATTGGAG ACGATTTGGTGTGAGGTTCCTGTGTTGTGTTTTCCATTGTTGAGTGATCAGGTTACGAAT AGGAAGCTTGTGGTTGATGATTGGGAGATTGGGATTAATCTTTGTGAAGATAAGAGTGAT TTTGGTAGAGATGAAGTTGGGAGGAATATTAACCGTTTGATGTGTGGTGTTTCGAAAGAG AAGATCGGACGGGTTAAAATGAGTTTGGAAGGTGCGGTGAGAAACAGTGGATCTTCTTCG GAGATGAATTTAGGTTTGTTTATTGATGGACTTTTGTCTAAGGTTGGTTTATCTAATGGG AAAGCTTAA
>UGT87A1
ATGAATCCAATCAAACCTCAGCCACTCGGAGTCCGCCACGTGGTGGCCATGCCTTGGCCA GGAAGAGGCCACATCAACCCAATGTTAAACCTCTGCAAAAGCCTCGTCCGGCGAGACCCA AACCTCACCGTCACATTCGTCGTCACCGAAGAATGGCTCGGGTTCATCGGGTCCGACCCG AAACCTAACCGGATCCATTTCGCCACTCTCCCCAACATCATTCCCTCCGAGCTCGTCCGA GCCAACGACTTCATCGCCTTCATCGACGCCGTCCTCACCAGATTAGAAGAGCCGTTCGAA CAGCTACTTGACCGTCTAAACTCTCCTCCCACCGCAATCATCGCCGATACTTACATCATT TGGGCAGTACGTGTAGGCACAAAAAGGAATATTCCGGTGGCTTCTTTCTGGACTACGTCA GCCACGATTCTCTCCCTCTTCATTAACTCCGATCTTCTCGCAAGTCACGGCCATTTTCCG ATCGAACCATCAGAATCAAAACTAGACGAGATTGTTGATTACATCCCCGGTTTATCTCCG
ACAAGACTCAGTGACTTACAGATCTTACACGGCTATAGTCATCAAGTCTTCAATATATTC AAAAAGTCTTTCGGTGAGCTTTATAAAGCTAAGTATCTTCTCTTCCCTTCTGCTTATGAG CTCGAACCAAAAGCCATTGACTTTTTCACTTCCAAGTTTGATTTCCCGGTTTACTCCACT GGTCCGTTAATACCCTTGGAAGAACTATCCGTTGGAAATGAGAATAGAGAACTTGATTAC TTTAAGTGGCTTGATGAGCAACCTGAAAGCTCTGTTCTTTACATATCTCAAGGGAGTTTT CTTTCAGTCTCCGAAGCTCAGATGGAGGAGATTGTTGTAGGAGTTAGAGAGGCTGGAGTT AAGTTCTTTTGGGTGGCTCGTGGGGGTGAGTTAAAGCTTAAGGAGGCTCTTGAAGGTAGC TTGGGTGTTGTGGTGAGCTGGTGTGATCAGCTACGTGTTTTGTGTCATGCGGCTATAGGC GGGTTTTGGACGCATTGCGGGTATAACTCGACATTGGAAGGGATATGTTCGGGAGTACCG TTGCTTACATTTCCTGTTTTTTGGGATCAGTTTCTGAATGCTAAGATGATTGTTGAGGAG TGGAGAGTTGGAATGGGGATCGAGAGGAAGAAGCAGATGGAGTTGTTGATAGTGAGTGAT GAGATCAAGGAATTGGTAAAAAGGTTTATGGATGGAGAGAGTGAAGAAGGGAAAGAGATG AGAAGAAGGACTTGTGATCTCAGTGAGATATGTCGTGGAGCGGTTGCGAAAGGTGGTTCT TCTGATGCTAACATCGATGCTTTCATTAAAGATATTACTAAGATCGTGTGA
>UGT87A2
ATGGATCCAAATGAATCTCCACCAAACCAATTTCGCCACGTGGTGGCCATGCCTTATCCA GGTCGAGGACACATCAACCCTATGATGAACCTCTGCAAACGCCTTGTCCGTCGATACCCT AACCTTCACGTCACCTTCGTCGTCACAGAAGAATGGCTCGGGTTTATTGGACCCGACCCG AAACCCGACCGGATCCATTTCTCCACTCTCCCTAATCTCATCCCTTCCGAGCTTGTCAGG GCCAAAGACTTCATAGGCTTCATTGATGCCGTCTACACAAGATTGGAAGAACCATTCGAG AAGCTTCTTGACAGCCTCAATTCACCACCTCCGAGTGTAATATTCGCCGACACTTACGTC ATTTGGGCTGTGCGAGTCGGCAGAAAAAGGAATATTCCGGTGGTTTCTCTCTGGACCATG TCAGCCACGATTCTCTCCTTCTTCCTCCACTCTGATCTACTCATAAGTCATGGCCATGCT CTGTTCGAACCATCAGAAGAAGAGGTTGTTGATTACGTCCCCGGTTTATCTCCGACGAAA CTCCGAGATTTGCCGCCGATATTTGACGGTTACAGCGACCGAGTCTTCAAGACAGCTAAG TTGTGTTTCGATGAACTACCAGGAGCTAGGTCTTTACTCTTCACCACCGCCTATGAGCTT GAACACAAAGCTATTGACGCTTTCACCTCCAAGCTCGATATCCCGGTCTACGCTATTGGT CCTTTAATACCTTTTGAAGAACTTTCTGTTCAAAATGATAACAAGGAACCTAATTACATC CAGTGGCTTGAGGAACAACCGGAAGGCTCTGTTCTTTACATATCTCAGGGAAGTTTTCTT TCGGTCTCGGAAGCTCAGATGGAGGAAATAGTGAAAGGACTGAGAGAAAGTGGAGTCCGG TTTCTTTGGGTGGCTCGTGGGGGCGAGTTAAAGCTTAAGGAGGCTCTTGAAGGTAGCTTA GGTGTAGTGGTGAGCTGGTGTGATCAGCTTCGGGTGCTGTGTCACAAAGCTGTAGGCGGG TTTTGGACTCATTGCGGGTTTAACTCGACATTGGAAGGGATATATTCAGGAGTACCAATG CTAGCGTTTCCGTTGTTTTGGGATCAGATTCTGAACGCTAAGATGATTGTTGAGGACTGG AGAGTCGGAATGAGGATCGAGAGGACGAAAAAGAATGAGTTGTTGATAGGGAGAGAGGAG ATCAAGGAAGTAGTGAAGAGGTTTATGGATAGAGAGAGTGAAGAAGGGAAAGAGATGAGA AGAAGGGCTTGTGACCTTAGTGAAATCAGTCGAGGAGCTGTTGCGAAAAGCGGTTCGTCT AATGTAAACATCGATGAGTTCGTTCGGCATATTACCAATACAAATTAA
>UGT88A1
ATGGGTGAAGAAGCTATAGTTCTGTATCCTGCACCACCAATAGGTCACTTAGTGTCCATG GTTGAGTTAGGTAAAACCATCCTCTCCAAAAACCCATCTCTCTCCATCCACATTATCTTA GTTCCACCGCCTTATCAGCCGGAATCAACCGCCACTTACATCTCCTCCGTCTCCTCCTCC TTCCCTTCAATAACCTTCCACCATCTTCCCGCCGTCACACCGTACTCCTCCTCCTCCACC TCTCGCCACCACCACGAATCTCTCCTCCTAGAGATCCTCTGTTTTAGCAACCCAAGTGTC CACCGAACTCTTTTCTCACTCTCTCGGAATTTCAATGTCCGAGCAATGATCATCGATTTC TTCTGCACCGCCGTTTTAGACATCACCGCTGACTTCACGTTCCCGGTTTACTTCTTCTAC ACCTCTGGAGCCGCATGTCTCGCCTTTTCCTTCTATCTCCCGACCATCGACGAAACAACC CCCGGAAAAAACCTCAAAGACATTCCTACAGTTCATATCCCCGGCGTTCCTCCGATGAAG GGCTCCGATATGCCTAAGGCGGTGCTCGAACGAGACGATGAGGTCTACGATGTTTTTATA ATGTTCGGTAAACAGCTCTCGAAGTCGTCAGGGATTATTATCAATACGTTTGATGCTTTA
GAAAACAGAGCCATCAAGGCCATAACAGAGGAGCTCTGTTTTCGCAATATTTATCCAATT GGACCGCTCATTGTAAACGGAAGAATCGAAGATAGAAACGACAACAAGGCAGTTTCTTGT CTCAATTGGCTGGATTCGCAGCCGGAAAAGAGTGTTGTGTTTCTCTGTTTTGGAAGCTTA GGTTTGTTCTCAAAAGAACAGGTGATAGAGATTGCTGTTGGTTTAGAGAAAAGTGGGCAG AGATTCTTGTGGGTGGTCCGTAATCCACCCGAGTTAGAAAAGACAGAACTGGATTTGAAA TCACTCTTACCAGAAGGATTCTTAAGCCGAACCGAAGACAAAGGGATGGTCGTGAAATCA TGGGCTCCGCAAGTTCCGGTTCTGAATCATAAAGCAGTCGGGGGATTCGTCACTCATTGC GGTTGGAATTCAATTCTTGAAGCTGTTTGTGCTGGTGTGCCGATGGTGGCTTGGCCGTTG TACGCTGAGCAGAGGTTTAATAGAGTGATGATTGTGGATGAGATCAAGATTGCGATTTCG ATGAATGAATCAGAGACGGGTTTCGTGAGCTCTACAGAGGTGGAGAAACGAGTCCAAGAG ATAATTGGGGAGTGTCCGGTTAGGGAGCGAACCATGGCTATGAAGAACGCAGCCGAATTA GCCTTGACAGAAACTGGTTCGTCTCATACCGCATTAACTACTTTACTCCAGTCGTGGAGC CCAAAGTGA
>UGT89A2
ATGACGGAAGTGTTATTGTTGCCGGGAACTAAATCGGAGAATTCAAAACCACCGCACATA GTGGTGTTTCCATTCCCAGCACAAGGCCACTTACTTCCTCTACTTGACTTAACTCACCAA CTCTGCCTCCGTGGATTCAACGTCTCCGTCATCGTTACTCCCGGTAACCTTACTTACCTC TCTCCTCTTCTCTCCGCTCATCCCTCCTCCGTCACCTCCGTCGTTTTCCCTTTCCCTCCT CATCCTTCACTCTCTCCCGGCGTCGAAAACGTTAAAGACGTCGGAAATTCAGGAAATCTC CCGATCATGGCTTCTCTTCGTCAGCTACGAGAACCAATCATCAACTGGTTCCAATCTCAT CCGAATCCGCCTATCGCTCTCATCTCCGATTTCTTCCTCGGATGGACTCACGATCTCTGC AATCAAATCGGTATCCCCAGATTCGCTTTCTTCTCCATCAGCTTCTTCTTAGTTTCCGTT CTTCAATTTTGCTTCGAGAACATCGATCTAATCAAATCAACGGATCCGATTCATCTCCTT GATCTTCCTCGCGCTCCGATTTTCAAAGAAGAGCATCTTCCGTCTATAGTCCGACGAAGT CTCCAΆACTCCGTCACCGGATCTCGAATCAATCAAAGATTTCTCCATGAΆTTTGTTGAGC TACGGATCTGTTTTCAATTCTTCTGAGATTCTGGAAGATGATTATCTTCAGTACGTGAAA CAGAGGATGGGTCATGATCGGGTTTATGTTATTGGCCCGCTTTGTTCAATCGGGTCGGGT CTTAAATCGAATTCGGGTTCTGTAGACCCGAGTTTGCTGAGTTGGTTAGACGGATCCCCA AACGGGTCAGTTCTATACGTTTGTTTCGGAAGTCAAAAGGCGTTGACTAAAGACCAGTGT GATGCTTTGGCTCTAGGCTTAGAGAAAAGCATGACCCGGTTTGTTTGGGTGGTTAAGAAA GATCCGATACCCGACGGGTTTGAGGATCGGGTTTCCGGAAGGGGATTGGTGGTAAGAGGA TGGGTCTCCCAGCTGGCGGTGTTGCGACACGTGGCGGTTGGTGGATTTTTGAGCCATTGT GGATGGAACTCAGTGCTTGAAGGGATAACGAGTGGGGCTGTGATCTTGGGCTGGCCCATG GAGGCGGACCAGTTTGTGAACGCGAGGTTGCTTGTGGAGCATTTGGGTGTTGCGGTTAGG GTTTGCGAAGGTGGTGAAACTGTGCCTGACTCGGATGAGTTGGGTCGGGTCATAGCGGAA ACGATGGGTGAGGGAGGACGCGAGGTGGCTGCTCGGGCTGAGGAGATACGGCGGAAGACC GAGGCTGCCGTGACGGAGGCAAATGGAAGCTCCGTTGAAAATGTACAAAGACTTGTCAAA GAATTTGAAAAAGTCTAA
>UGT89B1
ATGAAAGTGAACGAGGAAAACAACAAGCCGACAAAGACCCATGTCTTAATCTTCCCATTT CCGGCGCAAGGTCACATGATTCCCCTCCTCGACTTCACCCACCGCCTTGCTCTCCGCGGC GGCGCCGCCTTAAAAATAACCGTCCTAGTCACTCCAAAAAACCTTCCTTTTCTCTCTCCG CTTCTCTCCGCCGTAGTTAACATCGAACCACTTATCCTCCCTTTTCCCTCCCACCCTTCA ATCCCCTCCGGCGTCGAAAACGTCCAAGACTTACCTCCTTCAGGCTTCCCTTTAATGATC CACGCGCTTGGTAATCTCCACGCGCCGCTTATCTCTTGGATTACTTCTCACCCTTCTCCT CCAGTAGCCATCGTATCTGATTTCTTCCTTGGTTGGACCAAAAACCTCGGAATCCCTCGT TTCGATTTCTCTCCCTCCGCTGCTATCACTTGCTGCATACTCAATACTCTCTGGATCGAA ATGCCCACCAAGATCAACGAAGATGACGATAACGAGATCCTCCACTTTCCCAAGATCCCG AATTGTCCAAAATACCGTTTTGATCAGATCTCCTCTCTTTACAGAAGTTACGTTCACGGA GATCCAGCTTGGGAGTTCATAAGAGACTCCTTTAGAGATAACGTGGCGAGTTGGGGACTC
GTCGTGAACTCGTTCACCGCCATGGAAGGTGTTTATCTCGAACATCTTAAGCGAGAGATG GGCCATGATCGTGTATGGGCTGTAGGCCCAATTATTCCGTTATCTGGGGATAACCGTGGT GGCCCGACTTCTGTTTCTGTTGATCACGTGATGTCGTGGCTTGACGCACGTGAGGATAAC CACGTGGTGTACGTGTGCTTTGGAAGTCAAGTAGTTTTGACTAAAGAGCAGACTCTTGCA CTCGCCTCTGGGCTTGAGAAAAGCGGCGTCCATTTCATATGGGCCGTAAAGGAGCCCGTT GAGAAAGACTCAACACGTGGCAACATCCTGGACGGTTTCGACGATCGCGTGGCTGGGAGA GGTCTGGTGATCAGAGGATGGGCTCCACAAGTAGCTGTGCTACGTCACCGAGCCGTTGGC GCGTTTTTAACGCACTGTGGTTGGAACTCTGTGGTGGAGGCGGTTGTCGCCGGCGTTTTG ATGCTGACGTGGCCGATGAGAGCTGACCAGTACACTGACGCGTCTCTGGTGGTTGATGAG TTGAAAGTAGGTGTGCGTGCTTGCGAAGGACCTGACACGGTGCCTGACCCGGACGAGTTA GCTCGAGTTTTCGCTGATTCCGTGACCGGAAATCAAACGGAGAGGATCAAAGCCGTGGAG CTGAGGAAAGCAGCGTTGGATGCGATTCAAGAACGTGGGAGCTCAGTGAATGATTTAGAT GGATTTATCCAACATGTCGTTAGTTTAGGACTAAACAAATGA
>UGT89C1
ATGACAACAACAACAACGAAGAAGCCGCACGTTCTGGTGATACCGTTTCCACAATCCGGT CACATGGTTCCACATCTTGACCTCACGCATCAGATTCTTCTCCGTGGAGCCACCGTCACT GTCCTCGTCACACCCAAAAACTCTTCCTATCTCGATGCTCTCCGTTCTCTTCACTCCCCG GAACACTTCAAAACCCTAATCCTTCCTTTTCCTTCTCACCCTTGTATACCTTCCGGTGTC GAATCTCTCCAGCAACTTCCTCTCGAAGCTATAGTTCACATGTTTGATGCTCTCTCTCGT CTCCACGACCCTCTCGTTGACTTTCTCAGCCGTCAACCACCGTCGGATCTCCCCGACGCC ATCCTAGGAAGCTCATTTCTCAGCCCTTGGATTAACAAAGTAGCTGATGCTTTCTCTATT AAGTCCATTAGTTTCTTACCCATCAATGCTCATTCGATCTCCGTCATGTGGGCTCAAGAA GATAGAAGCTTCTTCAACGATCTCGAGACTGCCACAACGGAAAGCTACGGGCTCGTCATC AACAGTTTCTACGACCTCGAGCCTGAGTTTGTAGAAACTGTTAAAACACGTTTCCTGAAT CACCACCGTATATGGACCGTCGGACCGTTGCTCCCCTTTAAAGCTGGCGTTGACCGTGGC GGACAAAGCTCAATCCCGCCGGCGAAAGTCTCGGCTTGGTTAGATTCGTGCCCCGAGGAT AACTCCGTCGTATACGTCGGTTTTGGAAGCCAGATCCGGCTCACGGCGGAGCAAACAGCT GCTTTAGCGGCGGCGTTGGAGAAAAGCAGTGTGCGTTTCATATGGGCGGTGAGAGACGCA GCTAAGAAGGTGAACTCCAGCGATAACTCCGTTGAGGAAGATGTGATCCCGGCGGGATTT GAAGAGAGAGTGAAGGAGAAAGGACTCGTGATAAGAGGATGGGCCCCACAAACTATGATT CTTGAGCATCGAGCCGTTGGATCTTACCTAACTCATTTGGGTTGGGGTTCGGTTCTGGAA GGAATGGTCGGAGGAGTTATGTTGCTAGCGTGGCCGATGCAAGCAGACCATTTCTTTAAC ACGACGCTCATCGTTGATAAACTAAGAGCCGCAGTGCGAGTTGGAGAGAACAGAGACTCG GTTCCTGACTCGGACAAGCTCGCTAGGATTTTGGCTGAGTCGGCGAGAGAGGACTTGCCG GAGAGAGTTACGTTGATGAAGCTGAGGGAGAAAGCTATGGAGGCCATTAAAGAAGGTGGG AGCTCTTACAAGAACTTGGATGAGCTCGTTGCAGAGATGTGTTTGTAA
>UGT90A1
ATGTCCGTTTCAACACATCACCACCACGTGGTCCTCTTCCCTTTCATGTCAAAAGGCCAC ATCATCCCTCTCCTCCAATTCGGTCGTCTCCTCCTCCGTCACCACCGCAAAGAACCAACC ATCACCGTCACCGTTTTCACCACTCCCAAGAACCAACCTTTCATCTCAGACTTCCTCTCG GATACGCCGGAGATCAAAGTCATCTCTCTCCCTTTCCCGGAAAACATCACCGGAATCCCT CCCGGCGTCGAGAACACCGAAAAGCTCCCATCCATGTCACTTTTCGTCCCCTTCACACGC GCCACGAAGCTTCTCCAACCTTTCTTCGAAGAAACACTCAAGACTCTTCCAAAAGTTTCG TTCATGGTCTCTGATGGATTCCTCTGGTGGACATCGGAGTCTGCAGCTAAGTTCAACATT CCAAGATTTGTCTCCTACGGCATGAACTCTTACTCCGCCGCTGTCTCCATCTCTGTTTTC AAACACGAACTCTTTACCGAACCGGAAAGTAAATCTGATACCGAACCGGTCACTGTACCA GACTTTCCATGGATCAAGGTCAAGAAGTGTGATTTCGACCATGGCACTACCGAGCCGGAA GAATCAGGTGCAGCCCTCGAACTATCTATGGACCAAATCAAGTCGACCACCACAAGCCAT GGGTTTTTAGTCAATAGCTTCTACGAGCTCGAGTCAGCATTTGTTGATTACAACAACAAC TCTGGTGATAAACCAAAGTCGTGGTGTGTTGGGCCACTGTGTTTGACAGATCCTCCTAAA
CAGGGGAGTGCTAAACCGGCTTGGATTCATTGGTTGGATCAGAAGCGAGAGGAAGGGCGT CCGGTTTTGTACGTGGCGTTTGGAACGCAGGCAGAGATATCGAACAAGCAGCTTATGGAA CTAGCTTTCGGCTTGGAAGATTCAAAGGTGAACTTTCTGTGGGTCACAAGAAAAGATGTG GAGGAGATTATTGGAGAAGGATTCAACGATAGAATAAGAGAGAGTGGGATGATAGTGAGA GATTGGGTGGACCAATGGGAGATATTGTCACATGAAAGTGTCAAAGGATTTTTGAGCCAT TGTGGGTGGAACTCAGCACAAGAGAGCATATGTGTCGGGGTCCCATTGTTGGCTTGGCCG ATGATGGCCGAGCAACCGCTCAATGCGAAGATGGTTGTGGAGGAGATAAAGGTGGGAGTA AGAGTTGAAACGGAAGATGGGAGTGTAAAAGGTTTTGTGACAAGAGAAGAACTAAGTGGA AAGATTAAAGAACTGATGGAAGGAGAAACGGGGAAAACCGCAAGAAAGAATGTAAAAGAA TATTCGAAAATGGCGAAAGCGGCTTTGGTCGAAGGGACTGGTTCGTCATGGAAGAATTTA GATATGATTCTTAAGGAGTTATGTAAGAGTAGAGATTCAAACGGTGCTAGTGAGTAG
>UGT90A2
ATGGAGTTAGAAAAAGTTCACGTGGTTTTGTTCCCATACTTGTCCAAAGGGCACATGATT CCTATGCTCCAATTAGCTCGTCTCCTCTTATCCCACTCCTTCGCCGGAGACATCTCCGTC ACCGTCTTCACCACTCCTTTGAACCGTCCTTTCATCGTTGACTCACTCTCCGGCACCAAA GCGACCATCGTCGACGTACCTTTCCCTGATAACGTCCCGGAGATCCCACCCGGCGTCGAG TGCACTGACAAACTCCCTGCTTTGTCGTCCTCCCTCTTCGTTCCTTTCACAAGAGCCACC AAGTCAATGCAGGCAGACTTTGAGCGAGAGCTCATGTCACTGCCACGTGTCAGTTTCATG GTCTCAGACGGTTTCTTGTGGTGGACGCAAGAGTCAGCTCGAAAGCTAGGGTTTCCTCGG CTTGTTTTCTTTGGTATGAATTGCGCTTCCACCGTTATATGTGACAGTGTTTTTCAAAAC CAGCTTCTATCTAATGTTAAGTCCGAGACGGAGCCAGTTTCTGTACCGGAGTTTCCGTGG ATTAAGGTTAGGAAATGTGATTTCGTTAAAGATATGTTTGATCCAAAAACCACCACAGAT CCTGGATTCAAGCTTATCCTAGATCAAGTCACGTCTATGAATCAAAGCCAAGGTATCATA TTCAATACATTTGACGACCTTGAACCCGTGTTTATTGATTTCTACAAGCGTAAACGCAAA CTCAAGCTTTGGGCAGTTGGACCGCTTTGTTACGTAAATAACTTGGCTTGGATGATGAAG TAGAAGAGAAGGTCAAACCTAGTTGGATGAAATGGCTAGATGAAAAGCGAGACAAGGGAT GCAATGTTCTGTATGTGGCTTTCGGGTCACAAGCCGAGATCTCGAGAGAACAACTAGAGG AGATTGCGTTAGGGTTGGAAGAATCGAAGGTGAACTTCTTGTGGGTGGTCAAAGGAAATG AAATAGGAAAAGGGTTTGAAGAGAGAGTGGGAGAAAGAGGAATGATGGTGAGAGATGAAT GGGTTGATCAGAGGAAGATATTAGAGCACGAGAGTGTTAGAGGGTTCTTGAGCCATTGTG GGTGGAATTCTCTGACGGAGAGCATTTGCTCGGAGGTTCCAATCTTGGCGTTTCCTTTAG CAGCGGAGCAACCTCTGAATGCGATTTTGGTGGTGGAAGAGCTGAGAGTGGCGGAGAGAG TGGTGGCGGCGAGTGAAGGGGTTGTGAGAAGAGAAGAGATTGCAGAGAAAGTGAAGGAGT TGATGGAGGGAGAGAAAGGGAAAGAGCTGAGGAGGAATGTCGAGGCATATGGTAAGATGG
CGAAGAΆGGCTTTGGAGGAAGGTATTGGTTCGTCTAGGAAGAATTTAGACAACCTTATCA ACGAGTTTTGTAACAATGGAACATGA
>UGT90A4
ATGGCCGTTTCATCGTCGCATCATGCGGTTCTCTTCCCTTACATGTCAAAAGGCCACACG ATTCCTCTCCTCCAATTCGCCCGTCTCCTCCTCCGTCACCGCCGTATCGTCTCCGTAGAC GACGAAGAACCAACCATTTCCGTCACCGTCTTCACCACCCCAAAAAACCAACCATTCGTC TCAAACTTCCTCTCTGACGTCGCATCATCTATCAAAGTAATCTCCCTCCCTTTCCCTGAA AACATCGCCGGAATCCCTCCCGGCGTCGAGAGCACCGACATGCTCCCTTCCATATCACTT TACGTGCCCTTCACGCGCGCAACCAAATCTCTCCAGCCTTTCTTCGAAGCAGAACTCAAG AATCTTGAGAAAGTTTCTTTCATGGTCTCCGATGGATTCTTATGGTGGACATCGGAATCC GCCGCTAAATTTGAGATCCCGAGACTTGCCTTCTACGGCATGAACTCCTACGCATCGGCT ATGTGCTCCGCCATTTCGGTACACGAGCTCTTTACCAAACCGGAAAGTGTTAAATCTGAT ACTGAACCGGTTACTGTACCGGATTTTCCATGGATATGTGTTAAGAAGTGTGAGTTCGAT CCGGTTTTGACCGAACCGGATCAATCGGATCCAGCGTTCGAGCTACTCATTGACCATCTT ATGTCCACCAAGAAAAGCCGTGGAGTTATAGTGAACAGCTTTTACGAGCTCGAGTCAACG TTCGTTGACTACCGGCTCCGTGATAACGATGAACCAAAACCGTGGTGTGTTGGGCCTTTG
TGTTTGGTAAATCCTCCAAAACCGGAGAGTGATAAACCGGATTGGATTCATTGGTTGGAC CGGAAACTAGAGGAAAGATGTCCGGTTATGTATGTGGCGTTTGGAACGCAGGCTGAGATA TCGAACGAGCAGCTCAAGGAAATAGCATTAGGGTTGGAAGATTCCAAGGTCAATTTCTTG TGGGTCACGAGAAAGGACTTGGAAGAAGTAACTGGAGGATTAGGGTTCGAAAAGAGAGTG AAAGAGCATGGGATGATTGTGAGAGATTGGGTAGACCAATGGGAGATATTGTCACATAAA AGTGTCAAAGGGTTTTTGAGTCATTGTGGATGGAACTCGGCGCAAGAGAGTATTTGCGCT GGGGTTCCACTACTCGCTTGGCCAATGATGGCAGAGCAGCCACTCAATGCGAAGTTGGTA GTGGAGGAGCTAAAGATCGGAGTAAGAΆTCGAAACAGAAGATGTAAGTGTGAAΆGGATTC GTGACAAGAGAAGAACTTAGTCGAAAGGTTAAACAATTGATGGAGGGAGAGATGGGGAAG ACAACGATGAAGAATGTAAAAGAGTATGCGAAAATGGCGAAAAAAGCTATGGCTCAAGGG ACTGGTTCGTCTTGGAAGAGTTTGGATTCGCTTCTGGAAGAGCTTTGTAAGAGTAGAGAG CCAGACGGTGTTAATAAGTTGTCAAGTTCTGATGCTTAG
>UGT91A1
ATGACAAACTTCAAAGACAACGATGGAGATGGAACCAAACTCCACGTGGTAATGTTTCCA TGGTTAGCCTTTGGTCACATGGTTCCATACTTGGAGCTCTCTAAACTCATAGCTCAAAAG GGTCACAAAGTCTCTTTCATTTCCACTCCACGTAACATCGACCGTCTCCTCCCATGGTTA CCGGAAAATCTCTCCTCCGTCATTAACTTCGTCAAGCTATCACTTCCCGTCGGCGACAAC AAACTCCCGGAAGACGGTGAAGCTACCACAGACGTCCCTTTCGAACTCATACCTTACTTA AAAATCGCTTACGACGGGTTAAAAGTTCCGGTGACGGAGTTTCTTGAATCTTCGAAACCC GATTGGGTTCTTCAAGATTTCGCGGGGTTTTGGCTTCCTCCAATCTCTCGTCGTCTCGGA ATCAAAACCGGATTCTTTAGCGCTTTCAACGGCGCGACGCTCGGTATTCTTAAACCGCCG GGGTTCGAAGAGTACCGTACTTCGCCGGCGGATTTTATGAAGCCGCCTAAGTGGGTTCCG TTTGAAACTTCGGTAGCTTTCAAGTTATTTGAATGCAGGTTCATTTTCAAAGGATTTATG GCGGAAACCACCGAAGGGAATGTTCCCGACATCCACCGTGTCGGCGGCGTAATTGACGGC TGTGACGTCATCTTCGTACGGAGCTGTTACGAGTATGAAGCGGAGTGGTTAGGACTTACA CAAGAACTTCACCGGAAACCGGTTATACCGGTCGGAGTTTTGCCTCCAAAACCGGACGAA AAGTTTGAAGATACCGACACGTGGCTGTCTGTTAAAAAATGGTTGGACTCACGGAAAAGT AAGTCCATTGTCTACGTAGCTTTTGGTTCAGAAGCTAAACCGAGTCAAACGGAGCTAAAT GAGATCGCTCTCGGTTTAGAGCTTTCTGGTTTACCTTTCTTTTGGGTGTTAAAGACTCGT CGTGGTCCGTGGGATACCGAACCGGTCGAGCTTCCGGAAGGATTCGAAGAGCGTACAGCG GATAGAGGGATGGTGTGGAGAGGTTGGGTTGAGCAATTGCGTACATTGAGCCATGACTCG ATCGGTTTGGTTCTGACTCATCCCGGTTGGGGAACGATAATTGAAGCTATCCGGTTTGCT AAACCGATGGCAATGCTGGTTTTTGTGTATGACCAAGGATTGAATGCGAGAGTCATTGAA GAGAAGAAAATTGGGTATATGATCCCTCGAGACGAGACAGAAGGTTTCTTTACTAAAGAA AGTGTTGCGAATTCGCTAAGATTGGTAATGGTGGAAGAAGAAGGAAAGGTTTATAGAGAG AATGTGAAGGAGATGAAAGGAGTGTTTGGAGATATGGATAGACAAGATCGTTATGTGGAT TCATTCTTGGAATATCTTGTTACTAATCGTTAA
>UGT91B1
ATGGCCGAGCCAAAACCGAAGCTTCATGTTGCAGTGTTCCCATGGTTAGCTTTAGGTCAC ATGATTCCTTACTTGCAACTCTCAAAGCTCATAGCAAGGAAAGGCCATACTGTGTCCTTC ATCTCCACAGCTCGTAACATTTCACGTCTTCCCAATATATCCTCCGACCTTTCCGTGAAT TTCGTTTCTTTGCCGTTAAGTCAAACCGTCGACCATCTCCCAGAGAACGCTGAGGCCACC ACTGATGTCCCGGAGACTCACATAGCTTATCTGAAGAAAGCATTTGATGGGCTTTCTGAA GCTTTCACAGAGTTTTTAGAAGCTTCCAAACCAAACTGGATAGTGTATGATATCTTGCAC CATTGGGTCCCGCCTATCGCTGAGAAGCTCGGCGTGAGACGAGCCATCTTCTGCACGTTC AACGCAGCTTCCATCATCATCATCGGTGGGCCAGCATCAGTCATGATTCAAGGTCATGAC CCTCGAAAGACTGCTGAAGATCTTATCGTGCCTCCACCATGGGTCCCGTTTGAGACCAAC ATAGTTTACCGTCTCTTTGAAGCTAAGAGGATCATGGAGTATCCCACGGCAGGTGTAACT GGAGTTGAATTGAACGACAACTGTAGATTGGGTTTGGCTTACGTTGGCTCTGAGGTTATT GTGATTAGATCATGTATGGAACTCGAACCTGAGTGGATTCAATTGCTCAGTAAACTCCAA
GGAAAGCCTGTGATTCCAATTGGTTTACTCCCGGCTACACCAATGGATGATGCAGATGAC GAGGGAACATGGTTAGACATCAGAGAATGGCTAGACAGACATCAAGCAAAGTCTGTGGTT TATGTAGCCTTAGGAACTGAAGTGACAATTAGTAACGAAGAGATTCAAGGTTTAGCTCAT GGGTTGGAGCTTTGCAGGTTACCTTTCTTTTGGACGCTAAGGAAGAGGACTAGAGCTTCT ATGCTACTACCTGATGGGTTCAAAGAGAGAGTCAAAGAGCGTGGAGTCATTTGGACCGAG TGGGTACCTCAGACCAAGATACTGAGCCATGGTTCAGTTGGTGGGTTTGTTACTCATTGT GGTTGGGGATCAGCTGTGGAAGGGCTTAGCTTTGGTGTCCCTTTGATCATGTTTCCATGT AACCTAGACCAGCCGCTAGTGGCTAGGTTGCTCAGTGGGATGAATATAGGCTTGGAGATT CCAAGGAATGAGCGAGACGGGCTGTTCACGAGTGCTTCTGTTGCAGAGACAATCAGACAT GTTGTTGTGGAAGAAGAAGGAAAGATCTACAGGAACAATGCTGCATCTCAGCAAAAGAAA ATATTCGGGAACAAGAGATTGCAAGATCAGTATGCGGATGGTTTTATCGAGTTTCTGGAG AATCCTATAGCAGGAGTGTAG
>UGT91C1
ATGGTCGACAAGAGAGAAGAAGTTATGCACGTAGCCATGTTTCCATGGCTAGCTATGGGT CATCTCCTTCCTTTTCTTCGTCTCTCCAAGTTACTAGCTCAAAAGGGTCACAAGATCTCT TTCATATCAACACCAAGAAACATCGAAAGACTTCCTAAATTACAATCAAACCTCGCCTCC TCCATCACCTTCGTCTCTTTCCCTCTCCCTCCCATCTCAGGCTTGCCTCCTTCTTCAGAA TCATCCATGGACGTTCCTTACAACAAGCAACAGTCTCTTAAAGCCGCTTTTGATCTTCTT CAGCCACCGTTGAAAGAGTTTCTCCGACGGTCTTCTCCGGATTGGATCATATACGACTAT GCTTCTCACTGGCTTCCTTCTATTGCGGCCGAGCTTGGAATCTCTAAGGCTTTCTTTAGT CTCTTTAACGCAGCTACTCTCTGTTTCATGGGACCGTCTTCGTCTTTGATTGAAGAAATT AGATCAACGCCGGAAGATTTCACGGTGGTGCCACCGTGGGTCCCGTTCAAGTCAAACATC GTGTTTCGTTATCATGAAGTTACTAGATACGTTGAGAAGACAGAGGAAGATGTAACCGGA GTCTCTGACTCAGTTCGGTTTGGTTACTCGATTGACGAAAGCGATGCGGTTTTTGTCCGT AGCTGTCCGGAGTTTGAACCGGAATGGTTTGGTTTACTAAAAGACCTGTACCGTAAACCG GTATTTCCAATCGGGTTTTTGCCTCCGGTTATTGAAGACGACGATGCCGTTGATACTACA TGGGTTCGTATAAAGAAGTGGCTCGACAAGCAACGGCTTAATTCAGTTGTTTACGTGTCA CTTGGCACCGAAGCGAGTCTTCGTCATGAGGAAGTAACTGAGCTAGCTCTTGGGTTAGAG AAGTCAGAGACACCGTTCTTTTGGGTCCTAAGGAACGAGCCAAAGATTCCAGATGGGTTC AAAACACGAGTCAAGGGACGTGGAATGGTTCATGTTGGTTGGGTTCCACAAGTGAAAATA CTTAGTCACGAGTCAGTAGGAGGGTTCTTGACACATTGTGGTTGGAACTCAGTGGTGGAA GGGTTAGGGTTTGGTAAAGTTCCAATCTTTTTTCCGGTGTTGAATGAGCAAGGACTTAAT ACGAGGTTGTTGCATGGGAAAGGACTTGGTGTTGAGGTTTCAAGAGATGAGAGAGATGGG TCGTTTGATTCTGACTCGGTCGCTGACTCGATTAGGTTGGTGATGATTGATGATGCTGGC GAGGAGATAAGGGCTAAGGCTAAAGTGATGAAGGATTTGTTTGGGAACATGGATGAGAAT ATTCGTTATGTTGACGAACTTGTTAGGTTTATGAGAAGTAAAGGATCATCATCATCATCA TGA
>UGT92A1
ATGGCGGAAGCTAAACCCAGAAATCTGAGAATCGTGATGTTCCCTTTCATGGGACAAGGC CATATCATCCCGTTTGTAGCTTTAGCCCTTCGTTTAGAGAAGATTATGATTATGAACAGA GCCAACAAAACCACCATCTCTATGATCAATACTCCTTCGAACATCCCCAAAATACGCTCC AATCTTCCACCTGAATCCTCCATAAGTCTCATAGAGTTACCTTTCAACAGCTCTGATCAT GGCCTTCCTCACGACGGCGAGAATTTCGATTCTCTTCCTTACTCTCTCGTCATCAGCCTT CTTGAAGCTTCTAGGTCGCTTCGTGAGCCCTTTCGAGACTTCATGACGAAGATCTTGAAG GAAGAAGGGCAGAGCTCGGTTATAGTGATCGGTGATTTCTTCTTGGGTTGGATCGGTAAG GTTTGCAAAGAGGTTGGTGTTTATTCAGTGATCTTTAGTGCTTCTGGTGCTTTTGGTTTA GGTTGTTATAGATCCATATGGTTAAACTTGCCACATAAAGAAACCAAACAAGATCAGTTT CTCTTAGATGATTTCCCTGAAGCAGGGGAGATTGAGAAAACTCAGTTGAATTCTTTCATG TTAGAAGCTGATGGAACCGATGATTGGTCTGTTTTCATGAAGAAGATTATACCTGGATGG TCTGACTTCGATGGATTCTTGTTCAACACGGTTGCTGAAATCGATCAGATGGGATTATCC
TACTTCCGTAGAATAACCGGTGTTCCGGTTTGGCCAGTTGGGCCGGTTTTGAAGTCTCCG GATAAGAAGGTGGGATCGAGGTCGACAGAGGAAGCAGTGAAGTCATGGCTTGACTCAAAA CCGGACCATTCGGTTGTGTACGTATGTTTCGGTTCAATGAACTCGATTTTGCAAACGCAT ATGTTAGAATTGGCTATGGCATTAGAGAGTAGCGAGAAGAACTTCATATGGGTGGTGAGG CCGCCCATAGGTGTGGAGGTGAAGAGTGAGTTTGATGTGAAAGGGTATCTACCGGAAGGA TTTGAGGAAAGAATAACAAGATCGGAAAGAGGGTTACTTGTGAAGAAATGGGCACCACAA GTTGATATATTGTCACACAAGGCAACATGTGTGTTTTTGAGTCATTGCGGATGGAACTCG ATACTCGAATCACTTAGCCACGGTGTGCCACTGCTCGGATGGCCCATGGCAGCCGAGCAG TTCTTCAATTCCATATTGATGGAGAAACATATTGGGGTATCGGTTGAGGTGGCGCGTGGG AAGAGATGTGAGATCAAATGTGATGACATTGTTTCTAAGATCAAACTGGTGATGGAGGAG ACTGAAGTAGGGAAAGAGATTAGGAAGAAGGCTAGAGAGGTGAAGGAGTTAGTGAGGAGA GCAATGGTAGATGGAGTTAAAGGTTCCTCCGTCATTGGTTTGGAAGAGTTTCTTGACCAA GCAATGGTCAAGAAAGTGGAGAATTGA
Claims
1. A nucleic acid molecule that encodes a chimeric polypeptide wherein said nucleic acid molecule comprises a first part selected from the group consisting of:
i) a nucleic acid molecule consisting of a nucleic acid sequence which encodes an amino acid sequence as represented in Figure 3; ii) a nucleic acid molecule which hybridises to the nucleic acid molecule in (i) and which specifically targets said chimeric polypeptide to an oil body; and iii) a nucleic acid molecule which differs from the nucleic acid molecules of (i) and (ii) due to the degeneracy in the genetic code; and a second part that encodes an heterologous polypeptide.
2. A nucleic acid molecule according to Claim 1 wherein said second part encodes an enzyme.
3. A nucleic acid according to Claim 1 wherein said second part encodes a pharmaceutical polypeptide.
4. A nucleic acid molecule according to Claim 1 wherein said second part encodes an antigenic polypeptide.
5. A nucleic acid molecule according to Claim 1 wherein said second part encodes a mussel adhesive polypeptide encoded by a nucleic acid molecule selected from the group consisting of:
i) a nucleic acid molecule which encodes a polypeptide comprising the amino acid motif
X-K-X-X-Y-P-X-X-Y-K wherein X is any amino acid residue; ii) a nucleic acid molecule which encodes a polypeptide wherein at least one of the residues in said motif is modified by hydroxylation; and iii) a nucleic acid molecule which encodes a polypeptide which has adhesive properties.
6. A nucleic acid molecule according to Claim 1 wherein said second part encodes a spider silk polypeptide comprising a nucleic acid sequence selected from the group consisting of: i) a nucleic acid molecule consisting of the nucleic acid sequence as represented in Figure 4a or 4b; ii) a nucleic acid molecule which encodes a polypeptide domain comprising the amino acid sequence AGRGQGGYGQGAGG and at least two motifs rich in polyalanine wherein said polyalanine motifs comprise at least 6 alanine amino acid residues; iii) a nucleic acid molecule which hybridises to the sequence presented in
(i) and (ii) above and which encodes a silk polypeptide; and iv) a nucleic acid sequences that are degenerate as a result of the genetic code to the nucleic acid sequence defined in (i) - (iii) above.
7. A nucleic acid molecule according to Claim 1 wherein said second part encodes a polypeptide with glucosylfransferase activity and is encoded by a nucleic acid molecule selected from the group consisting of: i) a nucleic acid molecule selected from the group consisting of nucleic acid sequences as represented in Table 1; ii) a nucleic acid molecule which hybridise to the sequences represented in (ii) above; and which have glucosyltransferase activity; and iii) nucleic acid molecules consisting of nucleic acid sequences which are degenerate as a result of the genetic code to the sequences defined in (i) and (ii) above.
8. A nucleic acid molecule according to any of Claims 1-7 wherein said molecule encodes a cleavage site to facilitate the purification of said second part from said first part.
9. A nucleic acid molecule according to Claim 8 wherein said cleavage site is a proteolytic cleavage site.
10. A chimeric polypeptide encoded by a nucleic acid molecule according to any of Claims 1-9.
11. A vector comprising a nucleic acid molecule that encodes a chimeric polypeptide according to Claim 10.
12. A cell transfected or transformed with a nucleic acid molecule according to any of Claims 1-9 or a vector according to Claim 11.
13. A cell according to Claim 12 wherein said cell is a plant cell.
14. A method to produce a chimeric polypeptide according to Claim 10 comprising the steps of:
i) providing a cell according to Claim 12 or 13 and growth conditions conducive to the production of a chimeric polypeptide according to the invention; and optionally
ii) purifying said chimeric polypeptide from said cell or growth media.
15. A plant comprising a cell according to Claim 13.
16. A seed comprising a cell according to Claim 13.
17. A vessel comprising a cell according to Claim 12 or 13.
16. A vessel comprising a polypeptide according to Claim 10.
17. A vessel according to Claim 16 wherein said vessel is a bioreactor.
18. A vessel according to Claim 17 wherein said bioreactor is a fermentor.
19. A vaccine preparation comprising a cell, or a fraction of a cell, according to Claim 12 or 13 or a plant according to Claim 15.
20. A preparation according to Claim 19 wherein said preparation is an oil body containing fraction.
21. A preparation according to Claim 19 or 20 wherein said preparation includes at least one carrier.
22. A preparation according to Claim 20 or 21 wherein said preparation includes at least one adjuvant.
23. A method to immunise an animal to at least the second part of the chimeric polypeptide according to Claim 10, comprising administering an effective amount of a chimeric polypeptide or preparation according to any of Claims 19-22 sufficient to stimulate an immune response to said second part.
24. A method according to Claim 23 wherein said animal is human.
25. An antibody obtainable by the method according to the invention.
26. An antibody according to Claim 25 wherein said antibody is a monoclonal antibody or binding fragment thereof.
27. A method for preparing a hybridoma cell-line producing monoclonal antibodies according to Claim 26 comprising the steps of: i) immunising an immunocompetent mammal with a chimeric polypeptide or preparation according to any previous Claim; ii) fusing lymphocytes of the immunised immunocompetent mammal with myeloma cells to form hybridoma cells; iii) screening monoclonal antibodies produced by the hybridoma cells of step (ii) for binding activity to polypeptide of the invention; iv) culturing the hybridoma cells to proliferate and/or to secrete said monoclonal antibody; and v) recovering the monoclonal antibody from the culture supernatant.
28. A hybridoma cell-line obtainable by the method according to Claim 27.
29. A method to prepare a vaccine preparation according to any of Claims 19-22 comprising; i) forming a cell extract from a plant cell culture or plant according to the invention; and iii) separating said extract into an oil body rich fraction.
30. A method according to Claim 29 wherein said extract is incubated with a protease that cleaves said second part from said first part.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB0314102A GB0314102D0 (en) | 2003-06-18 | 2003-06-18 | Expression of heterologous protein |
GB0314102.5 | 2003-06-18 | ||
GB0402293.5 | 2004-02-03 | ||
GB0402293A GB0402293D0 (en) | 2004-02-03 | 2004-02-03 | Plant derived vaccine |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2004113540A1 true WO2004113540A1 (en) | 2004-12-29 |
Family
ID=33542669
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/GB2004/002432 WO2004113540A1 (en) | 2003-06-18 | 2004-06-09 | Expression of heterologous protein |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2004113540A1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005040363A1 (en) * | 2003-10-13 | 2005-05-06 | The University Of York | Glucosyltransferases |
WO2006134320A3 (en) * | 2005-06-16 | 2007-02-22 | Univ York | Glycosyltransferase activity |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1993021320A1 (en) * | 1991-02-22 | 1993-10-28 | University Technologies International, Inc. | Oil-body proteins as carriers of high-value peptides in plants |
WO1998027115A1 (en) * | 1996-12-16 | 1998-06-25 | Sembiosys Genetics Inc. | Oil bodies and associated proteins as affinity matrices |
-
2004
- 2004-06-09 WO PCT/GB2004/002432 patent/WO2004113540A1/en active Application Filing
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1993021320A1 (en) * | 1991-02-22 | 1993-10-28 | University Technologies International, Inc. | Oil-body proteins as carriers of high-value peptides in plants |
WO1998027115A1 (en) * | 1996-12-16 | 1998-06-25 | Sembiosys Genetics Inc. | Oil bodies and associated proteins as affinity matrices |
Non-Patent Citations (4)
Title |
---|
ABELL BEN M ET AL: "Role of the proline knot motif in oleosin endoplasmic reticulum topology and oil body targeting", PLANT CELL, vol. 9, no. 8, 1997, pages 1481 - 1493, XP002303933, ISSN: 1040-4651 * |
ALTAF A ET AL: "ACID LIPASE OF CASTOR BEAN LIPID BODIES: ISOLATION AND CHARACTERISATION", JOURNAL OF PLANT BIOCHEMISTRY AND BIOTECHNOLOGY, SOCIETY FOR PLANT BIOCHEMISTRY AND BIOTECHNOLOGY, NEW DEHLI, IN, vol. 6, no. 1, January 1997 (1997-01-01), pages 13 - 18, XP009037294, ISSN: 0971-7811 * |
EASTMOND PETER J: "Cloning and characterization of the Acid lipase from castor beans.", THE JOURNAL OF BIOLOGICAL CHEMISTRY. 29 OCT 2004, vol. 279, no. 44, 29 October 2004 (2004-10-29), pages 45540 - 45545, XP002303934, ISSN: 0021-9258 * |
FUCHS C ET AL: "PURIFICATION AND CHARACTERIZATION OF THE ACID LIPASE FROM THE ENDOSPERM OF CASTOR OIL SEEDS", JOURNAL OF PLANT PHYSIOLOGY, FISCHER, STUTTGART, DE, vol. 149, 1996, pages 23 - 29, XP001026860, ISSN: 0176-1617 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005040363A1 (en) * | 2003-10-13 | 2005-05-06 | The University Of York | Glucosyltransferases |
WO2006134320A3 (en) * | 2005-06-16 | 2007-02-22 | Univ York | Glycosyltransferase activity |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4610334B2 (en) | Production of peptides and proteins by accumulating in protein granules from the endoplasmic reticulum of plants | |
Daniell et al. | Optimization of codon composition and regulatory elements for expression of human insulin like growth factor-1 in transgenic chloroplasts and evaluation of structural identity and function | |
JP4864701B2 (en) | Method for producing insulin in plants | |
KR102444024B1 (en) | Recombinant vector for antigen production for diagnosis of African swine fever and use thereof | |
EP2367941B1 (en) | Method for the transformation of plant cell mitochondria | |
US20150157731A1 (en) | Transgenic plants expressing cobalamin binding proteins | |
WO2001042441A2 (en) | Plastid transformation | |
WO2004113540A1 (en) | Expression of heterologous protein | |
CA3064801A1 (en) | Porcine epidemic diarrhea virus-like particles | |
BG64320B1 (en) | RECOMBINANT PREVODENEOUS LIQUIDS AND POLYPEPTIDE DERIVATIVES, PRODUCED BY PLANTS, METHOD FOR RECEIVING AND USING THE | |
WO2006117927A1 (en) | Transformed plant capable of producing polyglutamic acid | |
Siegert et al. | Expression of the major mugwort pollen allergen Art v 1 in tobacco plants and cell cultures: problems and perspectives for allergen production in plants | |
CN107384948B (en) | Method for highly expressing target protein in plant and method for producing composition for oral administration of medical protein for expressing plant | |
Ghorbaniparsa et al. | Comparing Patatin Class I and Camv 35s Promoters in Expression of Human Calcitonin Gene in Potato (Solanum Tuberosum Cvs. Kardal And Marfona) | |
KR102726134B1 (en) | Vaccinia virus recombinant A27L protein and kit for diagnosing orthopoxvirus antibody comprising the same | |
KR102726138B1 (en) | Vaccinia virus recombinant A33R protein and kit for diagnosing orthopoxvirus antibody comprising the same | |
US9441232B2 (en) | Pericarp tissue preferred regulatory region and method of using same | |
CN116457008A (en) | Modified coronavirus structural proteins | |
de Oliveira | Heterologous production of oligopeptides with antihypertensive effects in Lactuca sativa and Medicago truncatula | |
WO2005045046A1 (en) | Expression of plastid-targeted polypeptides in plants | |
Class et al. | Patent application title: METHODS OF DELIVERY OF MOLECULES TO CELLS USING A RICIN SUBUNIT AND COMPOSITIONS RELATING TO SAME Inventors: Carole L. Cramer (Jonesboro, AR, US) Maureen C. Dolan (Jonesboro, AR, US) Michael J. Reidy (Rockville, MD, US) Assignees: ARKANSAS STATE UNIVERSITY |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
122 | Ep: pct application non-entry in european phase |